Back to All
OnQ Blog

Qualcomm Cloud AI 100, AMD EPYC 7003 Series Processor, and Gigabyte server solutions breaks the Peta Operations Per Second barrier for AI Inferencing [video]

Qualcomm & Gigabyte Cloud AI 100

Mar 10, 2021 | 0:28

Video Player is loading.
Current Time 0:00
Duration 0:27
Loaded: 35.78%
Stream Type LIVE
Remaining Time 0:27
 
1x
  • Chapters
  • descriptions off, selected
  • captions off, selected
  • en (Main), selected

There is no doubt that AI is the driving force for next-generation consumer experiences. Virtually every experience on a mobile device somehow involves AI, whether it’s scrolling through your favorite social apps or online shopping that offers recommendations based on tens of thousands of AI inferences on its own. Now what happens when these platforms are serving millions of users on a given day? That really requires racks upon racks of powerful servers that can deliver the AI inferencing performance required to keep these platforms humming along.

Today, Qualcomm Technologies is enabling a powerful server rack that can meet these high-performance requirements by pairing with the latest AMD EPYC 7003 Series processors and Gigabyte’s latest G292-Z43 server solutions. This amalgamation of hardware expertise offers incredible performance and raises the bar for the modern data center. Qualcomm Technologies’ cutting-edge Qualcomm Cloud AI 100 fits perfectly into Gigabyte’s server system and is capable of driving the incredible AI use cases in the field of high-speed data analysis, personalized recommendations, smart cities, 5G communications, and more.

The Gigabyte G292-Z43 server supports two AMD EPYC 7003 Series processors for its processing power along with multiple Qualcomm Cloud AI 100 cards for computationally intensive applications supporting AI inferencing workloads. The Qualcomm Cloud AI 100 Inference Accelerator boasts up to 400 TOPS with breakthrough performance/Watt and that’s just with one single Qualcomm Cloud AI 100 card. Now imagine a Gigabyte server can host up to 16 Qualcomm Cloud AI 100 inferencing cards per server that, cumulatively, can deliver up to 6.4 Peta OPS (400 TOPS x 16, one Peta OPS is 1000 TOPS). This marks the first time a Qualcomm Technologies AI-based solution breaking the PetaOPs barrier. And it gets even better:  A server rack can host 19 or more of these server units, which easily exceeds 100 Peta OPS. That is a lot of Qualcomm Technologies AI muscle. See the infographic below on how it is being configured.

3

Qualcomm Cloud AI 100

Purpose-built for high performance, low-power AI processing in the cloud.
Qualcomm-image

To put this into a bit more context, a single 400TOPS HHHL Qualcomm Cloud AI 100 inference card can drive around 19,000 Resnet50 images/sec. That translates to more than 6M images per second on one server rack. This kind of AI performance can surely enhance, extend, and scale AI experiences to the world. We want to thank AMD and Gigabyte for this amazing achievement.

Check out these photos of the Qualcomm Cloud AI 100 cards with the AMD EPYC 7003 Series processor-powered Gigabyte servers ready to rock and roll.

Qualcomm-image
Qualcomm-image

Qualcomm Cloud AI is a product of Qualcomm Technologies, Inc. and/or its subsidiaries. AMD, the AMD Arrow logo, EPYC, and combinations thereof are trademarks of Advanced Micro Devices, Inc.

Opinions expressed in the content posted here are the personal opinions of the original authors, and do not necessarily reflect those of Qualcomm Incorporated or its subsidiaries ("Qualcomm"). The content is provided for informational purposes only and is not meant to be an endorsement or representation by Qualcomm or any other party. This site may also provide links or references to non-Qualcomm sites and resources. Qualcomm makes no representations, warranties, or other commitments whatsoever about any non-Qualcomm sites or third-party resources that may be referenced, accessible from, or linked to this site.

About the Author
Mike VildibillVice President, Product Management, Qualcomm Technologies
Qualcomm relentlessly innovates to deliver intelligent computing everywhere, helping the world tackle some of its most important challenges. Our leading-edge AI, high performance, low-power computing, and unrivaled connectivity deliver proven solutions that transform major industries. At Qualcomm, we are engineering human progress.

Stay connected

Get the latest Qualcomm and industry information delivered to your inbox.

Subscribe
Manage your subscription

© Qualcomm Technologies, Inc. and/or its affiliated companies.

Snapdragon and Qualcomm branded products are products of Qualcomm Technologies, Inc. and/or its subsidiaries. Qualcomm patented technologies are licensed by Qualcomm Incorporated.

Note: Certain services and materials may require you to accept additional terms and conditions before accessing or using those items.

References to "Qualcomm" may mean Qualcomm Incorporated, or subsidiaries or business units within the Qualcomm corporate structure, as applicable.

Qualcomm Incorporated includes our licensing business, QTL, and the vast majority of our patent portfolio. Qualcomm Technologies, Inc., a subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of our engineering, research and development functions, and substantially all of our products and services businesses, including our QCT semiconductor business.

Materials that are as of a specific date, including but not limited to press releases, presentations, blog posts and webcasts, may have been superseded by subsequent events or disclosures.

Nothing in these materials is an offer to sell or license any of the services or materials referenced herein.