Back to All
OnQ Blog

AWS introduces new EC2 instance powered by the Qualcomm Cloud AI 100

Unlocking new possibilities and reducing costs, AWS collaborates with Qualcomm to introduce an EC2 instance that harnesses the power of the Qualcomm Cloud AI 100, enabling enhanced aptitude in deploying AI to meet evolving market demands
Qualcomm-image

Building on our technology collaboration with AWS, the Qualcomm Cloud AI 100 launch marked the first major milestone in the company’s joint efforts with the general availability of new Amazon Elastic Compute Cloud (Amazon EC2) DL2q instances. The Amazon EC2 DL2q instances serve as the first instances to bring the Qualcomm artificial intelligence (AI) solution to the cloud.

With its flexible and scalable multi-core architecture, the Cloud AI 100 accelerator supports a wide range of use-cases spanning:

  • Generative AI and Large Language Models (LLMs): Covering productivity and creativity use cases with support to models with up to 16B parameters on single card and 8x that in one DL2q instance, and
  • Classic AI: Including natural language processing and Computer vision.

 

At this year’s AWS re:Invent 2023 we recently demonstrated diverse applications employing AWS EC DL2q powered by Cloud AI 100:

 

To get started using DL2q instances, visit the AWS Management Console, AWS Command Line Interface (CLI), and AWS SDKs.

Qualcomm Cloud AI 100

Dec 4, 2023 | 1:38

Video Player is loading.
Current Time 0:00
Duration 1:38
Loaded: 6.06%
Stream Type LIVE
Remaining Time 1:38
 
1x
  • Chapters
  • descriptions off, selected
  • captions off, selected
  • en (Main), selected

“Working with AWS is empowering us to build on our established industry leadership in high-performance, low-power deep learning inference acceleration technology,” said Nakul Duggal, SVP & GM, Automotive & Cloud Computing at Qualcomm Technologies, Inc. “Our work to date demonstrates the strong potential in integrating cloud technologies into software development and deployment cycles.

We look forward to continuing our work with AWS to unlock new possibilities across all industries with solutions that will be designed to significantly reduce costs, while also enabling enhanced aptitude in deploying AI to respond to ever-evolving market demands.

 

A cost-effective AI revolution

The Amazon EC2 DL2q instance enables EC2 customers to run inference on a broad range of models with best-in-class performance-per-total cost of ownership (TCO). For example:

  • Up to 50% better price-performance for (DL inference models — compared to current-generation graphics processing unit (GPU)-based Amazon EC2 instances.
  • More than three times reduction in Inference cards with CV-based security, leading to much lower-cost system solution.
  • Enabling two and a half smaller models such Deci.ai models optimized on Cloud AI100.

The DL2q instance features the Qualcomm AI Stack which delivers a consistent developer experience across Qualcomm AI in the cloud and other Qualcomm products.

The same Qualcomm AI Stack and base AI technology runs on the DL2q instances and Qualcomm edge devices, enabling customers to enjoy a consistent developer experience, with a unified application programming interface (API) across their:

  • cloud,
  • automotive,
  • PC,
  • extended reality, and
  • smartphone development environments.

Customers can use the AWS Deep Learning AMI (DLAMI), which comes prepackaged with Qualcomm’s Software Development Kits (SDK) and popular machine learning frameworks, such as PyTorch and TensorFlow.

For more information, please visit Qualcomm Cloud AI100.
 

To get started using DL2q instances, visit:

Opinions expressed in the content posted here are the personal opinions of the original authors, and do not necessarily reflect those of Qualcomm Incorporated or its subsidiaries ("Qualcomm"). The content is provided for informational purposes only and is not meant to be an endorsement or representation by Qualcomm or any other party. This site may also provide links or references to non-Qualcomm sites and resources. Qualcomm makes no representations, warranties, or other commitments whatsoever about any non-Qualcomm sites or third-party resources that may be referenced, accessible from, or linked to this site.

 

The Qualcomm AI Cloud 100 and Qualcomm AI Stack are products of Qualcomm Technologies, Inc.

About the Authors
Rashid Attar
Rashid AttarVP, Engineering, Qualcomm Technologies, Inc.
Mohammed Al Khairy
Mohammed Al KhairyDirector, Product Marketing, Qualcomm Technologies
Qualcomm relentlessly innovates to deliver intelligent computing everywhere, helping the world tackle some of its most important challenges. Our leading-edge AI, high performance, low-power computing, and unrivaled connectivity deliver proven solutions that transform major industries. At Qualcomm, we are engineering human progress.

Stay connected

Get the latest Qualcomm and industry information delivered to your inbox.

Subscribe
Manage your subscription

© Qualcomm Technologies, Inc. and/or its affiliated companies.

Snapdragon and Qualcomm branded products are products of Qualcomm Technologies, Inc. and/or its subsidiaries. Qualcomm patented technologies are licensed by Qualcomm Incorporated.

Note: Certain services and materials may require you to accept additional terms and conditions before accessing or using those items.

References to "Qualcomm" may mean Qualcomm Incorporated, or subsidiaries or business units within the Qualcomm corporate structure, as applicable.

Qualcomm Incorporated includes our licensing business, QTL, and the vast majority of our patent portfolio. Qualcomm Technologies, Inc., a subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of our engineering, research and development functions, and substantially all of our products and services businesses, including our QCT semiconductor business.

Materials that are as of a specific date, including but not limited to press releases, presentations, blog posts and webcasts, may have been superseded by subsequent events or disclosures.

Nothing in these materials is an offer to sell or license any of the services or materials referenced herein.