
Introducing Qualcomm Cloud AI 100 Ultra

High performance and cost-optimized cloud AI inferencing for generative AI

Today we announce the Qualcomm Cloud AI 100 Ultra, a new member of our portfolio of cloud artificial intelligence (AI) inference cards, purpose-built for generative AI and large language models (LLMs).

The Qualcomm Cloud AI 100 Ultra uses Qualcomm’s industry-leading AI cores to deliver up to four times the performance of the previous generation, enabling best-in-class performance per total cost of ownership (TCO) dollar. It can support 100-billion-parameter models on a single 150-watt card, 175-billion-parameter models with two cards, and even larger models across multiple Qualcomm Cloud AI 100 Ultra cards using our Qualcomm AI Stack and Cloud AI SDK.
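
As a rough illustration of the sizing above, the sketch below estimates how many cards a model's weights alone would occupy. It is a back-of-the-envelope calculation, not a description of how the Qualcomm AI Stack partitions models. The function names are hypothetical, the 128 GB per-card memory figure and the one-byte-per-parameter (8-bit) weight format are assumptions that this post does not state, and memory for activations and the KV cache is ignored.

```python
import math

# Assumptions for illustration only; neither value is stated in this post.
ASSUMED_CARD_MEMORY_GB = 128.0   # hypothetical per-card memory capacity
ASSUMED_BYTES_PER_PARAM = 1.0    # hypothetical 8-bit weight format


def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory for model weights alone, in GB (10^9 bytes)."""
    return num_params * bytes_per_param / 1e9


def cards_needed(num_params: float,
                 bytes_per_param: float = ASSUMED_BYTES_PER_PARAM,
                 card_memory_gb: float = ASSUMED_CARD_MEMORY_GB) -> int:
    """Smallest number of cards whose combined memory holds the weights."""
    return math.ceil(weight_memory_gb(num_params, bytes_per_param) / card_memory_gb)


if __name__ == "__main__":
    for params in (100e9, 175e9):
        print(f"{params / 1e9:.0f}B parameters: "
              f"{weight_memory_gb(params, ASSUMED_BYTES_PER_PARAM):.0f} GB of weights, "
              f"{cards_needed(params)} card(s)")
```

Under these assumptions the estimate is consistent with the one-card and two-card figures quoted above. A wider weight format, such as two bytes per parameter, would raise the card count accordingly.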

The Qualcomm Cloud AI 100 Ultra is a programmable AI accelerator that can support recent advances in AI techniques and data formats. It leverages the Qualcomm AI Stack, which allows customers to ‘Train anywhere and Infer on Qualcomm Cloud AI 100 Ultra’ and supports them in porting and optimizing their models.

Hewlett Packard Enterprise (HPE) will support the Qualcomm Cloud AI 100 Ultra with the HPE ProLiant DL380a Gen 11 server designed for accelerator-optimized generative AI workloads, including natural language processing (NLP).

Justin Hotard, executive vice president and general manager of high-performance computing, AI and labs at HPE, said:

To unlock value from generative AI, enterprises will require an AI-native architecture that is purpose-built to support any part of their journey, including inferencing. In collaboration with Qualcomm, we look forward to offering our customers a compute solution that is optimized for inference and provides the performance and power efficiency necessary to deploy and accelerate AI inference at-scale.

Andrew Feldman, CEO of Cerebras, adds:

The Qualcomm Cloud AI 100 Ultra is groundbreaking: with the ability to deploy a staggering 100 billion parameter model on a single-width, energy-efficient 150W card, it sets a new standard in the industry. By harnessing the power of Cerebras’ leading training technology combined with the Qualcomm Cloud AI 100 Ultra, the solution strikes the perfect balance between lightning-fast training and unparalleled performance per TCO dollar for inference deployment. In a rapidly evolving market, forward-thinking customers will naturally gravitate towards this high-performance solution, encompassed within an intelligent and robust ecosystem.

Figure: Qualcomm Cloud AI 100 Ultra by the numbers (a graphic showing the card's numerical specifications).

In both cloud and enterprise use cases, the Qualcomm Cloud AI 100 Ultra delivers two to five times the performance per TCO dollar of competing offerings across generative AI workloads, including LLMs, NLP and computer vision, enabling our customers to maximize their return on investment. This blend of price-performance, power efficiency, scalability, and security makes it the ideal choice for organizations looking to embrace cutting-edge AI and transform their operations, all while supporting sustainability targets.
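
To make the "performance per TCO dollar" metric concrete, here is a minimal sketch of one way it could be computed. This is not Qualcomm's methodology. It assumes TCO is simply purchase price plus electricity over the deployment period, and every input other than the 150-watt card power (purchase price, throughput, electricity price, lifetime) is a hypothetical placeholder rather than a measured or published figure.

```python
def energy_cost(power_watts: float, hours: float, price_per_kwh: float) -> float:
    """Electricity cost of running a device at constant power for `hours`."""
    return power_watts / 1000.0 * hours * price_per_kwh


def perf_per_tco_dollar(throughput: float, purchase_price: float,
                        power_watts: float, hours: float,
                        price_per_kwh: float) -> float:
    """Throughput divided by a simplified TCO (purchase price + electricity).

    Real TCO models also include hosting, cooling, and maintenance costs,
    which this sketch deliberately leaves out.
    """
    tco = purchase_price + energy_cost(power_watts, hours, price_per_kwh)
    return throughput / tco


if __name__ == "__main__":
    three_years_hours = 3 * 365 * 24   # hypothetical deployment period
    ratio = perf_per_tco_dollar(
        throughput=1000.0,             # hypothetical, e.g. tokens per second
        purchase_price=5000.0,         # hypothetical price in dollars
        power_watts=150.0,             # card power quoted in this post
        hours=three_years_hours,
        price_per_kwh=0.10,            # hypothetical electricity price
    )
    print(f"performance per TCO dollar: {ratio:.4f}")
```

Comparing two systems then comes down to the ratio of their two results, which is the form the "two to five times" claim above takes.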

Talal Al Kaissi, executive vice president, chief product and global partnerships officer at Core42, said:

Core42 [a subsidiary of G42] delivers national-scale enterprise cloud and AI solutions by partnering with the global technology leaders who are bringing impactful solutions to market. Qualcomm’s Cloud AI 100 Ultra promises to deliver exceptional performance at a fraction of power and cost, enabling scalable and sustainable AI inference solutions. We look forward to strengthening our collaboration with Qualcomm.

With over a decade of AI leadership, Qualcomm delivers transformative performance and efficiency with the Qualcomm Cloud AI 100 Ultra. Combining our industry-leading AI cores and expertise in low-power design, this solution enables enterprises to harness the power of AI to drive business outcomes. Embrace the future of generative AI with the Qualcomm Cloud AI 100 Ultra, where innovation meets efficiency.

Opinions expressed in the content posted here are the personal opinions of the original authors, and do not necessarily reflect those of Qualcomm Incorporated or its subsidiaries ("Qualcomm"). The content is provided for informational purposes only and is not meant to be an endorsement or representation by Qualcomm or any other party. This site may also provide links or references to non-Qualcomm sites and resources. Qualcomm makes no representations, warranties, or other commitments whatsoever about any non-Qualcomm sites or third-party resources that may be referenced, accessible from, or linked to this site.

Qualcomm branded products are products of Qualcomm Technologies, Inc. and/or its subsidiaries.

About the Author
Rashid Attar, VP, Engineering, Qualcomm Technologies, Inc.