What is Cloud AI SDK

Cloud AI SDKs enable developers to optimize trained deep learning models for high-performance inference on Qualcomm® Cloud AI 100 accelerators.

The Cloud AI SDKs provide workflows to optimize models for best performance, a runtime for execution, and integration with ONNX Runtime and Triton Inference Server for deployment.


Our trusted partners

Cloud Service Providers


 



Get started with Amazon EC2 DL2q instances powered by Qualcomm® AI 100 Standard accelerators.

Software architecture

Application (Apps) SDK

The Apps SDK is used to convert models and prepare runtime binaries for Cloud AI platforms. It contains model development tools, a sophisticated parallelizing graph compiler, performance and integration tools, and code samples. Apps SDK is supported on x86-64 Linux-based systems.

Platform SDK

The Platform SDK provides driver support for Cloud AI accelerators, APIs and tools for executing and debugging model binaries, and tools for card health, monitoring, and telemetry. The Platform SDK consists of a kernel driver, a userspace runtime with APIs and language bindings, and card firmware. It is supported on x86-64 and ARM64 hosts.

Use cases


Build your models to support any AI use case

Generative AI/LLMs

Natural Language Processing

Object Detection

Semantic Segmentation

Traffic Detection

Quality Control

Character Recognition

Search Engines

Benefits


Flexible toolchain to meet the growing inference needs of cloud data centers.

Running multiple complex AI models

High-performance inference for Generative AI, Natural Language Processing, and Computer Vision models.

Optimizing performance

Optimizing performance of the models per application requirements (throughput, accuracy and latency) through various quantization techniques.
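As a rough illustration of the accuracy side of that trade-off, the sketch below applies symmetric per-tensor INT8 quantization to a handful of values and measures the round-trip error. It is a generic, textbook-style example in plain Python, not the Cloud AI SDK's quantizer; the function names and sample weights are made up for this illustration.

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization: map floats to integer codes in [-127, 127]."""
    max_abs = max(abs(v) for v in values)
    # One scale for the whole tensor; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate float values."""
    return [c * scale for c in codes]


def max_abs_error(values):
    """Largest absolute difference between the original and round-tripped values."""
    codes, scale = quantize_int8(values)
    restored = dequantize(codes, scale)
    return max(abs(a - b) for a, b in zip(values, restored))


# Hypothetical weights, chosen only for illustration.
weights = [0.02, -1.27, 0.635, 0.4, -0.9]
codes, scale = quantize_int8(weights)
print("codes:", codes)
print("scale:", scale)
print("max round-trip error:", max_abs_error(weights))
```

With this scheme the round-trip error of any value is bounded by half the scale, so tensors with a wide value range lose more precision. That is one reason a toolchain may offer several quantization techniques rather than a single one.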

Cross-platform app development

Development of inference applications through support for multiple operating systems and Docker containers.

Deployment at scale

Deployment of inference applications at scale with support for Triton Inference Server.
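For a sense of what a client of such a deployment sends, the sketch below builds an inference request in the JSON format of Triton's HTTP/REST API (the KServe v2 inference protocol). It only constructs the URL and body and does not contact a server. The server address, model name, and input tensor name are hypothetical placeholders, and nothing here is specific to Cloud AI 100.

```python
import json


def build_infer_request(base_url, model_name, input_name, shape, data, datatype="FP32"):
    """Return (url, body_bytes) for a KServe v2 style inference request."""
    # The flattened data must contain exactly as many elements as the shape implies.
    expected = 1
    for dim in shape:
        expected *= dim
    if expected != len(data):
        raise ValueError(f"shape {shape} needs {expected} values, got {len(data)}")

    url = f"{base_url.rstrip('/')}/v2/models/{model_name}/infer"
    body = {
        "inputs": [
            {
                "name": input_name,
                "shape": list(shape),
                "datatype": datatype,
                "data": list(data),
            }
        ]
    }
    return url, json.dumps(body).encode("utf-8")


# Hypothetical server address, model name, and input name.
url, body = build_infer_request(
    "http://localhost:8000", "my_model", "input", [1, 4], [1.0, 2.0, 3.0, 4.0]
)
print(url)
print(body.decode("utf-8"))
```

The body would be sent as an HTTP POST with a JSON content type. In practice the tensor names, shapes, and datatypes must match the configuration of the deployed model.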

Power efficiency

Sets a higher bar for power-efficient inference processing, delivering the highest performance per watt.

Resources


Accelerate your work with project cloning, AI models and tutorials.




Tutorials
 

Stable Diffusion Notebook (8:08)

Stable Diffusion Demo (1:09)

CodeGen Notebook (6:54)

Learn more

Cloud AI SDK platform in the cloud.

Cloud AI SDK on-premises


© Qualcomm Technologies, Inc. and/or its affiliated companies.
