On this Page
Introduction
Diagram
Community
Get the Qualcomm® newsletter straight to your inbox.
Qualcomm Gen AI Inference Extensions is a robust software library designed to simplify on-device Gen AI execution.
Generative AI models such as large language models (LLMs) and large vision models (LVMs) contain multiple binaries after optimization due to their complexity and larger size. These binaries require a specific order of execution to utilize the processing power of a Neural Processing Unit (NPU). With Gen AI Inference Extensions (GENIE), you can streamline on-device Gen AI models in a single execution job.
GENIE simplifies Gen AI inferencing at the edge by providing software library commands and resources integrated with Qualcomm® AI Engine direct.
Qualcomm AI Hub supports GENIE, offering optimized models and sample apps on how to use Gen AI Inference Extensions (GENIE) to execute LLMs.
A single SDK download includes the Qualcomm Neural Processing SDK, Qualcomm AI Engine direct SDK, and GENIE with the corresponding APIs, providing developers with a complete package you need to build your next project.
