Delivers the performance and power efficiency needed to deploy and accelerate AI inference at scale
The Qualcomm Cloud AI 100 Ultra, the newest member of our portfolio of cloud artificial intelligence (AI) inference cards, is a performance- and cost-optimized AI inference solution designed for generative AI and large language models (LLMs). With up to 576 MB of on-die SRAM and 64 AI cores per card, plus programmability for a wide range of workloads and acceleration techniques, Qualcomm Cloud AI 100 solutions address the unique requirements for scaling classic and generative AI workloads, ranging from computer vision and natural language processing to transformer-based LLMs.
Qualcomm Cloud AI is a product of Qualcomm Technologies, Inc., and/or its subsidiaries.
- 9 MB of on-die SRAM per AI core


