Back to All
Developer Blog

Edge AI developer hackathon in Bengaluru: and the winners are...

Sign up for Developer monthly newsletter-image

Sign up for Developer monthly newsletter

Join thousands of developers around the globe who receive latest news and updates from our monthly curated newsletter.

Sign up
Come for support, stay for the community-image

Come for support, stay for the community

Get support from experts, connect with like-minded developers, and access exclusive virtual events.

Join Developer Discord
 

 

TLDR: 

  • Edge AI Developer Hackathon in Bengaluru – 50 teams build edge apps for laptops powered by Snapdragon X Series with LLMs from Qualcomm AI Hub.
  • Grand prize goes to GameSense – Intelligent narration of a VR cricket game through a small language model that runs entirely on the device.
  • People’s Choice award goes to EdgeFit Coach – Personalized coaching and correction through an LLM so users can improve their posture while working.

For a developer, the only thing more compelling than working with AI is working with AI entirely on the local device. That’s the essence of edge AI: running chatbots, generative AI and inference workloads without having to shuttle data between device and cloud. So, an edge AI developer hackathon – like the ones we’ve been hosting – is your chance to build applications on affordable hardware optimized for AI.

At our Edge AI Developer Hackathon in Bengaluru (Bangalore), 50 teams spent most of their weekend hacking on-device AI apps. Working on Copilot+ PCs with the Snapdragon X processor or Snapdragon X Elite processor, they applied AI to wildly diverse spaces including gameplay commentary, ergonomics, dyslexia, sign language, companionship and methane tracking.

To whet your appetite for our upcoming Edge AI Developer Hackathons, here’s an under-the-hood look at the two prize winners from the Bengaluru hackathon.

Grand Prize: GameSense – Real-time commentary that adapts to live gameplay

Game engines, especially in virtual/augmented reality (VR/AR), offer deep immersion to users – except in narration. Even the best-built game lacks commentary, or non-player character (NPC) voice lines, that can narrate spontaneously and follow gameplay as human commentators do.

GameSense solves that by providing intelligent narration of a VR cricket game through a small language model (SLM) that uses the  Hexagon NPU and runs entirely on the device. It’s an ideal use case for edge AI because gaming requires real-time engagement, players won’t tolerate latency in commentary. Besides, for games with a huge player base, the demand for centralized computing resources to keep thousands of threads running would be prohibitive.

The GameSense team consisted of Hemang Mohan, Vivek Kumar Yadav, Archana M and Ponathipan Jawahar, all from Tech Mahindra.

Overview

The team developed GameSense for the Copilot+ PC, powered by a Snapdragon X Series processor. They ran the SLM on the Hexagon NPU and used the Qualcomm AI Stack combined with WebSockets for nearly real-time performance.

The NPU efficiently handled all the AI commentary generation, which meant the CPU and GPU could fully focus on running the game itself. That led to shorter response times, smoother gameplay and significantly lower power consumption than in cloud-based or CPU-only setups.

GameSense delivered real-time, on-device AI performance without draining the battery, making it ideal for extended sessions in VR and mobile gaming on portable or battery-powered devices.

All core workloads in GameSense run entirely on the edge device – here, the Copilot+ PC. No cloud connection is required for real-time gameplay or commentary generation.

Software and hardware

Languages:

·         C++ – Core application logic and integration with Qualcomm Gen AI Inference Extensions (GENIE)

·         Python – Middleware for interpreting game events and running audio engine

·         C# – For building the Unity-based VR cricket game

Frameworks and tools:

·         Unity – VR game development (Meta Quest-compatible)

·         Flask – Python middleware to handle game events and serve AI-generated audio

·         Visual Studio 2022 – For building the C++ app with GENIE integration and WebSockets

·         OpenVR/Oculus SDK – VR integration in Unity

AI runtime:

·         Qualcomm AI Runtime SDK – For running SLM inference on the NPU

Additional hardware:

·         Meta Quest VR Headset – Used to build and test the immersive cricket game experience

Models implemented

The developers used the LLaMa 3.2 3b model. They did not fine-tune the model for the initial version of GameSense; however, they do have a pipeline to fine-tune the model, quantize it and generate its NPU binaries. They discovered that that pipeline yields more-realistic commentary personalized by language, favorite commentator voices or pattern.

Additionally, they found the Qualcomm AI Hub a useful, one-stop shop for models ranging from LLMs to image generators and automatic speech recognition (ASR) models. They enjoyed quick access to models without much hassle.

Compute cores used

They designed GameSense to take full advantage of heterogeneous computing, distributing workloads across the CPU, GPU and NPU.

·         CPU – Handled the game logic, API calls, audio engine and communication between Unity, the Flask middleware and the backend AI module. It managed JSON parsing, socket streaming and game event updates.

·         GPU – Used by Unity to render the 3D VR cricket environment, handle physics-based animations and process input from the VR controllers to simulate batting mechanics.

·         NPU – Ran the entire core of the application – real-time AI commentary generation – using GENIE to execute the language model. That allowed them to run the LLaMA 3.2 3B model with low latency and low power consumption, freeing up the CPU and GPU for other tasks.

Development flow

GameSense data flow
Figure 1: GameSense data flow

Every minute counts in a 24-hour hackathon. The GameSense team points to a half-dozen well thought-out steps in their development process:

  1. Designing the flow and architecture – They sketched out a system architecture and planned the data flow: a VR cricket game triggers events → events are processed → an LLM generates live commentary → output is converted to speech/audio.
  2. Selecting the AI model and text-to-speech (TTS) engine – They turned to the subject matter experts from Qualcomm Technologies to learn which models in the Qualcomm AI Hub can run on the Hexagon NPU. They also had questions about generating NPU-compatible versions of the binaries for the models. Finally, they selected LLaMa 3.2 3B for the SLM and PyTTS for the audio generation.
  3. Building the data flow pipeline – To ensure the proper connectivity and data flow among components, they built a simple app in VR without graphics and used it to communicate via the Flask middleware. That passed the data to the AI backend, which generated the audio and handed it off via Flask to the VR.
  4. Developing the VR front end in Unity – Once the entire data pipeline was ready, the team started developing the core VR game, the essence of the immersive experience.
  5. Incorporating real-time streaming – In parallel, they optimized the data flow to stream data at all layers and ensure nearly real-time performance.
  6. Integrating and testing – They integrated all the different components, then tested it extensively and fixed bugs.

Easy parts and hard parts

In their front-end development, the team found that the easiest part was developing the core game mechanics: setting up the entire scene in Unity3D, integrating all components and deploying to VR. Unity made it straightforward to build the environment, connect input controls and handle interactions like bat movement and game event triggers. In back-end development, the easiest part was setting up the Flask middleware.

The part that took the most time and work on the front end was simulating real-world physics, especially getting the bat-ball interaction – force, angle, timing – right. Integrating the gameplay with the scoring system required fine-tuning Unity’s physics engine and adjusting multiple parameters to mimic real cricket kinematics.

On the back end, it took a lot of work to achieve a streaming response with the C++ chat application using the Qualcomm GENIE model. But the judges were impressed, noting that the team used the flow to great effect.

People’s Choice Award: EdgeFit Coach – “We won’t have your data, but we will have your back.”

Remote work, online classes and extended computer usage have led to widespread poor posture, a leading cause of chronic back/neck pain, fatigue and musculoskeletal disorders. But most people lack real-time feedback or awareness when they begin to slouch or sit incorrectly. Even with tools like apps and wearables, it’s hard to muster the consistency and motivation to improve habits.

EdgeFit Coach provides personalized coaching and correction so users can improve their posture while working. It uses edge AI to ensure real-time feedback without delays caused by round trips to/from the cloud. Its cloud-independence means that users benefit even in low- and no-internet environments (e.g., in rural or mobile settings) without the need for specialized hardware. The app selectively uses LLMs for personalized coaching (motivational quotes, analysis), executing the compute-intensive operations on the device.

Most of all, EdgeFit Coach processes all sensitive visuals like posture data and health analytics locally; hence the motto, “We won’t have your data, but we will have your back.”

Team members include Akshat Gosain, Kalash Poddar, Ayush Pareek and Parv Jain from the K.K. Birla Goa Campus of BITS Pilani.

Overview

The Copilot+ PC powered by the Snapdragon X Series processor is ideal for this project. Most prominently, EdgeFit Coach runs its ONNX-based model for pose estimation on the processor’s Hexagon NPU, resulting in 2 to 3x faster inference compared to CPU. The lower power requirement from running on NPU means the app can deliver non-intrusive, persistent posture correction in the background without degrading battery life unduly.

The processor’s image signal processor (ISP) and hardware-accelerated computer vision resulted in faster frame acquisition for quicker posture recognition.

For landmark detection, the hackathon team based the data pipeline on the MediaPipe BlazePose model, converted to ONNX format and optimized for edge inference. That gave them real-time video analysis at more than 13 frames per second with response latency of less than 100 ms — entirely on the device.

They built the coaching with a lightweight, FastAPI back end and LLM integration for surprisingly context-aware motivational quotes, analysis of six useful posture metrics and interactive chat served over WebSockets.

Software and hardware

Language:

  • Python – Core application logic, API back end, pose processing, analytics and LLM integration

Environments and frameworks:

  • FastAPI + Uvicorn – High-performance REST APIs with async support for back end
  • Streamlit – Front-end dashboard and user interaction
  • WebSockets library – Real-time motivational quote streaming
  • OpenCV + MediaPipe – Real-time pose tracking and image processing
  • ONNX runtime – Optimized model inference
  • Custom llm_handler.py – Using HTTP requests to local LLM
  • Streamlit charts and JSON-based analytics pipeline – For data presentation
  • Win10toast – Desktop alerts

Development tools:

  • Visual Studio Code – Primary IDE
  • Git + GitHub – Version control
  • Edge AI Dev Terminal – On the Copilot+ PC for testing NPU inference
  • ·         ONNX Optimizer – For converting and simplifying the BlazePose model

Additional hardware:

None. The entire system ran on standard, webcam-equipped laptops, proving that posture monitoring can be done using just software, with inference on CPU/NPU.

Models implemented

For pose estimation, the team used MediaPipe BlazePose (converted to ONNX) with optimizations like operator fusion, float16 quantization and graph simplification. To estimate the distance from user to camera, they used MiDaS-v2.

For context-aware motivational feedback and query response, they integrated an LLM as AI coach and chatbot based on Meta LLaMA 3.2B with a Sarvam-M local back end. The LLM required no fine-tuning, since prompt engineering and context chaining ensured relevance.

They used Qualcomm AI Hub as their source for the BlazePose and MiDaS-v2 models, which integrated easily with their FastAPI + ONNX setup and ran natively on the Hexagon NPU.

Choosing models from the Qualcomm AI Hub helped shorten development and deployment time, and the models imposed little load on the CPU, even with continuous, real-time posture detection and analysis.

Compute cores used

EdgeFit Coach distributed workloads between CPU and NPU.

The CPU ran the FastAPI back end, LLM coaching/feedback and WebSocket server for streaming. It classified and analyzed posture data, served the Streamlit dashboard and handled system-level tasks and desktop notifications.

The NPU performed pose estimation inference using the ONNX-optimized BlazePose model from the Qualcomm AI Hub. It handled all real-time landmark detection from webcam frames at 13 FPS and up.

EdgeFit Coach - Compute core flow
Figure 2: EdgeFit Coach - Compute core flow

EdgeFit Coach ran entirely on the edge device, ensuring privacy, low latency and power-efficiency.

Development flow

The team worked out the following steps during the hackathon:

  1. Define the problem and design the user flow – Once they had identified posture monitoring as their prime use case, they mapped out the user flow of webcam capture → pose detection → feedback → analytics.
  2. Select and optimize models – They chose MediaPipe BlazePose for body landmark detection – specifically, the model in the Qualcomm AI Hub that was quantized and ONNX-optimized for the Hexagon NPU.
  3. Develop an edge-centric pipeline – Using Python, they built a pipeline to capture webcam frames (OpenCV), run pose inference (NPU), classify posture and exercises, and stream real-time data (WebSocket) and dashboard metrics (FastAPI). They designed all modules to run on the device, without relying on external resources.
  4. Integrate LLM for coaching – They created a modular LLM handler (ask_llm) to power AI-based posture analysis and motivational chat. The handler had the flexibility to use small models locally or route selectively to lightweight APIs when needed.
  5. Create front-end and analytics interface – They built a Streamlit dashboard to present metrics like posture trends, slouch frequency and consistency score, and to deliver real-time feedback with motivational quotes and posture reports.
  6. Test on Copilot+ PC – Finally, they ran EdgeFit Coach on a Copilot+ PC powered by the Snapdragon X Series processor. Tests showed end-to-end performance of pose detection at more than 13 FPS, response times of less than 100 ms, smooth UI and minimal CPU usage.

Qualcomm Technologies’ experts helped most when it came to model optimization and deployment on NPU. The experts assisted the team in selecting the right ONNX format, enabling NPU acceleration, benchmarking performance and debugging inference behavior.

Easy parts and hard parts

The team found it relatively easy to build the application logic and API system. Using Python, FastAPI and Streamlit, they were able to quickly spin up a back end, create real-time APIs for posture data and chat and put together a working front-end dashboard. Also, once they had the optimized model from Qualcomm AI Hub, they found it easy to plug it into the ONNX runtime with no additional reconfiguration.

The most difficult part lay in optimizing for heterogeneous computing (CPU vs. NPU). The difficulty wasn’t in writing code but in understanding how to make full use of the Snapdragon architecture. The team put considerable effort into making sure the model was actually using the NPU (not CPU fallback), profiling performance bottlenecks, and balancing async processing, UI refresh and inference threading.

They also found it tricky to coordinate the LLM chat with the real-time UI; for example, to keep the AI chat responsive without blocking pose detection or dashboard updates. It became a balancing act among the asynchronous WebSocket updates, the chat history management and the rendering of the front end.

Next steps

Those are the two winners from our Bengaluru Edge AI Developer Hackathon. Of course, we judges voted for GameSense because of the remarkable pipeline from the gameplay all the way to the VR headset without leaving the Copilot+ PC. (And because we like cricket.) Then, we think EdgeFit Coach won People’s Choice because most of us developers know that our posture could use some help.

We want you to bring your developer friends to one of our upcoming Developer events and show us what you’ve got. Read through our FAQ, put together your team and sign up for the next hackathon near you.

Meanwhile, look through the Awesome Qualcomm Developer Projects repo to see how you can build edge AI into your own products. You’ll find projects like GameSense and EdgeFit Coach, and we’ll be posting results and projects from more hackathons there soon.

Connect with us on Discord and be sure to sign up for the Qualcomm Developer newsletter to be notified of our upcoming developer events across the Unites States, Asia and Europe.

Read about Edge AI Developer Hackathon in Paris for more inspiring stories.

Opinions expressed in the content posted here are the personal opinions of the original authors, and do not necessarily reflect those of Qualcomm Incorporated or its subsidiaries ("Qualcomm"). The content is provided for informational purposes only and is not meant to be an endorsement or representation by Qualcomm or any other party. This site may also provide links or references to non-Qualcomm sites and resources. Qualcomm makes no representations, warranties, or other commitments whatsoever about any non-Qualcomm sites or third-party resources that may be referenced, accessible from, or linked to this site.

Snapdragon and Qualcomm branded products are products of Qualcomm Technologies, Inc. and/or its subsidiaries.

About the Authors
Nick Debeurre
Nick DebeurreSenior Product Manager and AI Developer Advocate
Lauren Lunde
Lauren LundeSenior Product Manager
Qualcomm relentlessly innovates to deliver intelligent computing everywhere, helping the world tackle some of its most important challenges. Our leading-edge AI, high performance, low-power computing, and unrivaled connectivity deliver proven solutions that transform major industries. At Qualcomm, we are engineering human progress.

Stay connected

Get the latest Qualcomm and industry information delivered to your inbox.

Subscribe
Manage your subscription

© Qualcomm Technologies, Inc. and/or its affiliated companies.

Snapdragon and Qualcomm branded products are products of Qualcomm Technologies, Inc. and/or its subsidiaries. Qualcomm patented technologies are licensed by Qualcomm Incorporated.

Note: Certain services and materials may require you to accept additional terms and conditions before accessing or using those items.

References to "Qualcomm" may mean Qualcomm Incorporated, or subsidiaries or business units within the Qualcomm corporate structure, as applicable.

Qualcomm Incorporated includes our licensing business, QTL, and the vast majority of our patent portfolio. Qualcomm Technologies, Inc., a subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of our engineering, research and development functions, and substantially all of our products and services businesses, including our QCT semiconductor business.

Materials that are as of a specific date, including but not limited to press releases, presentations, blog posts and webcasts, may have been superseded by subsequent events or disclosures.

Nothing in these materials is an offer to sell or license any of the services or materials referenced herein.