Cloud AI 100 Ultra
Contact sales
100 UltraCloud AI

Delivers the performance and power efficiency necessary to deploy and accelerate AI inference at-scale

The Qualcomm Cloud AI 100 Ultra, the newest member of our portfolio of cloud artificial intelligence (AI) inference cards, is a performance- and cost-optimized AI inference solution, designed for Generative AI and large language models (LLMs). With up to 576 MB of on-die SRAM and 64 AI cores per card - and programmability for a wide range of workloads and acceleration techniques - Qualcomm Cloud AI 100 solutions address the unique requirements for scaling classic and generative AI workloads, ranging from computer vision and natural language processing to transformer-based LLMs.

Qualcomm Cloud AI is a product of Qualcomm Technologies, Inc., and/or its subsidiaries.

Product license agreement

Qualcomm® AI Inference Suite for Cloud and On-Prem

The Qualcomm AI Inference Suite is a comprehensive set of software and services for Qualcomm Cloud AI accelerators spanning across on-premises solutions and cloud deployments.

It includes ready-to-use AI applications and agents, tools, and libraries for operationalizing generative AI at scale. For enterprises developing chatbots, co-pilots, and AI agents, the AI Inference Suite features a rich set of OpenAI-compatible APIs including user management and administration capabilities. It supports chat, image generation, multi-modal AI capabilities, and retrieval-augmented generation (RAG) functionalities.

The suite also supports integration with popular generative AI models, frameworks, cloud services, and is deployed using Kubernetes or bare-metal containers. Customers can build gen AI applications and agents using open-source or proprietary models, scale and optimize workflows, and run enterprise-grade solutions.

Cloud AI 100 Ultra

01:26
Cloud AI 100 Ultra

1:26

Video Player is loading.
Current Time 0:00
Duration 1:26
Loaded: 6.95%
Stream Type LIVE
Remaining Time 1:26
 
1x
  • Chapters
  • descriptions off, selected
  • captions off, selected
  • en (Main), selected

Specifications

SKU Form factor Power (TDP) Peak Integer Ops (INT8) Peak FP Ops (FP16) SRAM DRAM (w/ ECC) DRAM BW Host Interface
AI 100 Ultra PCIe FH3/4L (111.2mm x 237.9mm) 150W Up to 870 TOPS Up to 288 TFLOPS 576 MB 128 GB LPR4x 548 GB/s PCIe Gen4, 16 lanes
AI 80 Ultra PCIe FH3/4L (111.2mm x 237.9mm) 150W Up to 618 TOPS (155 TOPs/SoC) Up to 222 TFLOPS (56 TFLOPS/SoC) 576 MB 128 GB LPR4x 548 GB/s PCIe Gen4, 16 lanes
AI 100 Pro PCIe HHHL (68.9mm x 169.5mm) 75W Up to 400 TOPS Up to 200 TFLOPS 144 MB 32 GM LPR4x 137 GB/s PCIe Gen4, 8 lanes
AI 100 Standard PCIe HHHL (68.9mm x 169.5mm) 75W Up to 350 TOPS Up to 175 TFLOPS 126 MB 16 GB LPR4x 137 GB/s PCIe Gen4, 8 lanes
AI 80 Standard PCIe HHHL (68.9mm x 169.5mm) 75W Up to 190 TOPS Up to 86 TFLOPS 126 MB 16 GB LPR4x 137 GB/s PCIe Gen4, 8 lanes
Specifications
Qualcomm AI Cores
AI Number of Cores
Up to 16
Memory
AI SRAM Density
144 MB1
Data Types
Data Types
FP16, INT16, INT8, FP32
Process Node and Technology
Process Node
7 nm
Card
Raw Tera Operations Per Second (TOPS)
Dual M.2 (edge): 70 TOPS@15W TDP, PCIe: 400 TOPS@ 75W TDP, Dual M.2: 200 TOPS@ 25W TDP, PCIe: 350 TOPS @ 75W
Type
PCIe (HHHL), Dual M.2, Dual M.2 (edge)
PCIe Interface
8 lane Gen3/4 (PCIe), 4 lane Gen3/4 (Dual M.2)
DRAM Speed
2.1 GHz
DRAM Type
LPDDR4x
DRAM Density
Up to 32 GB
DRAM Bit Width
64-bit
DRAM Number of Channels
4
  1. 9 MB each AI core
Access more.
To access more Cloud AI 100 Ultra resources, you need to be a member of a verified company. Company verification is initiated by filling out our access request form.
Already have a Qualcomm account?
Qualcomm relentlessly innovates to deliver intelligent computing everywhere, helping the world tackle some of its most important challenges. Our leading-edge AI, high performance, low-power computing, and unrivaled connectivity deliver proven solutions that transform major industries. At Qualcomm, we are engineering human progress.

Stay connected

Get the latest Qualcomm and industry information delivered to your inbox.

Subscribe
Manage your subscription

© Qualcomm Technologies, Inc. and/or its affiliated companies.

Snapdragon and Qualcomm branded products are products of Qualcomm Technologies, Inc. and/or its subsidiaries. Qualcomm patented technologies are licensed by Qualcomm Incorporated.

Note: Certain services and materials may require you to accept additional terms and conditions before accessing or using those items.

References to "Qualcomm" may mean Qualcomm Incorporated, or subsidiaries or business units within the Qualcomm corporate structure, as applicable.

Qualcomm Incorporated includes our licensing business, QTL, and the vast majority of our patent portfolio. Qualcomm Technologies, Inc., a subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of our engineering, research and development functions, and substantially all of our products and services businesses, including our QCT semiconductor business.

Materials that are as of a specific date, including but not limited to press releases, presentations, blog posts and webcasts, may have been superseded by subsequent events or disclosures.

Nothing in these materials is an offer to sell or license any of the services or materials referenced herein.