On this page
Introduction
Developer Playground
Deployment Options
Benefits
Application and Agent Samples
Blog
Videos/Tutorials
Community
Get the Qualcomm® newsletter straight to your inbox.
A comprehensive set of ready-to-use AI applications, agents, tools, and libraries for developing and deploying AI inference on premises or via cloud deployments.
The Qualcomm® AI Inference Suite comprises a Python SDK and OpenAI-compatible APIs. These interfaces simplify deployment of AI applications and agents powered by Qualcomm® Cloud AI inference accelerator cards to achieve industry-leading performance per watt at a low total-cost-of-ownership (TCO).
Use pre-trained generative AI models for chat, generative multimedia, and retrieval-augmented generation (RAG).


