Back to All
OnQ Blog

Using the Qualcomm Hexagon SDK for on-device speech processing

Ever since Qualcomm Technologies first announced always-on device activation in early 2013, we have seen an industry-wide adoption of always-on voice services. These services rely on waking up the device using a preset phrase such as “Hey, Snapdragon,” and then sending raw speech to the cloud, where actual recognition is done and the response formulated.

This raises a number of legitimate privacy questions:

  1. How long is the voice recording stored in the cloud?
  2. Is the recorded voice available to a human listener, such as a software developer, who needs to use it for legitimate work on the application?
  3. Are there sufficient safeguards to prevent somebody from tracing the recorded voice back to a specific user?
  4. Apart from the user saying the wake-up phrase, is any background audio also stored in the cloud and how is such data used?
  5. Is the voice data encrypted in the cloud?

Several companies have been trying to respond to these and similar questions of privacy. I believe that performing speech recognition and response formation on the device instead of running them in the cloud is an option toward mitigating privacy and security concerns.

The Qualcomm Hexagon DSP SDK is designed to allow OEMs and ISVs to write fully capable, low-power speech recognition and synthesis applications that obviate the need to send voice samples to the cloud. Registered developers for the Hexagon SDK also get access to a text-to-speech framework to accelerate application development. This opportunity provides OEMs and ISVs with the option to perform some speech recognition on the device, and not need the cloud at all.

Some of the most common queries can be answered by information available on the device itself, without the need for network connectivity. For instance, queries about personal schedule can be answered by accessing the on-device calendar. The Hexagon SDK also provides capabilities to change many spoken queries into text before sending them over the network. This provides a privacy benefit of not collecting a recorded voice in the cloud.

Thoughts? Let me know what you think in the comments below.

Qualcomm Hexagon is a product of Qualcomm Technologies, Inc.

Opinions expressed in the content posted here are the personal opinions of the original authors, and do not necessarily reflect those of Qualcomm Incorporated or its subsidiaries ("Qualcomm"). The content is provided for informational purposes only and is not meant to be an endorsement or representation by Qualcomm or any other party. This site may also provide links or references to non-Qualcomm sites and resources. Qualcomm makes no representations, warranties, or other commitments whatsoever about any non-Qualcomm sites or third-party resources that may be referenced, accessible from, or linked to this site.

Qualcomm relentlessly innovates to deliver intelligent computing everywhere, helping the world tackle some of its most important challenges. Our leading-edge AI, high performance, low-power computing, and unrivaled connectivity deliver proven solutions that transform major industries. At Qualcomm, we are engineering human progress.

Stay connected

Get the latest Qualcomm and industry information delivered to your inbox.

Subscribe
Manage your subscription

© Qualcomm Technologies, Inc. and/or its affiliated companies.

Snapdragon and Qualcomm branded products are products of Qualcomm Technologies, Inc. and/or its subsidiaries. Qualcomm patented technologies are licensed by Qualcomm Incorporated.

Note: Certain services and materials may require you to accept additional terms and conditions before accessing or using those items.

References to "Qualcomm" may mean Qualcomm Incorporated, or subsidiaries or business units within the Qualcomm corporate structure, as applicable.

Qualcomm Incorporated includes our licensing business, QTL, and the vast majority of our patent portfolio. Qualcomm Technologies, Inc., a subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of our engineering, research and development functions, and substantially all of our products and services businesses, including our QCT semiconductor business.

Materials that are as of a specific date, including but not limited to press releases, presentations, blog posts and webcasts, may have been superseded by subsequent events or disclosures.

Nothing in these materials is an offer to sell or license any of the services or materials referenced herein.