Back to All
Developer Blog

How to run DeepSeek models on Windows on Snapdragon – LM Studio tutorial

Sign up for Developer monthly newsletter-image

Sign up for Developer monthly newsletter

Join thousands of developers around the globe who receive latest news and updates from our monthly curated newsletter.

Sign up
Come for support, stay for the community-image

Come for support, stay for the community

Get support from experts, connect with like-minded developers, and access exclusive virtual events.

Join Developer Discord

Co-written with Li He, Srinivasa Deevi, Hongqiang Wang, Sai Gayatri Gampa.

DeepSeek-R1 is an open-source reasoning model developed by DeepSeek to handle tasks requiring logical inference, mathematical problem-solving and real-time decision making. One of its standout features is the ability to trace its logic, which makes it easier to understand and, if necessary, challenge its output.

This transparency is particularly valuable in fields where explainable outcomes are crucial, such as research and complex decision making.

AI distillation is a process that creates smaller, more efficient models from larger ones, retaining much of their reasoning power while reducing computational demands. DeepSeek has applied this technique to develop a suite of distilled models from R1, using Qwen and Llama architectures. That allows users to take advantage of the capabilities of DeepSeek-R1 on standard laptops.

Developers have a few options to run their AI models on Windows on Snapdragon. One of the most popular options is to leverage LLM platforms like LM Studio. LM Studio stands out due to its user-friendly interface, robust performance, and seamless integration with popular AI models. It supports a wide range of models making it highly versatile.

Additionally, LM Studio allows developers to run models locally, ensuring data privacy and reducing dependency on internet connectivity. Its ability to work offline and its compatibility makes it an ideal choice for developers looking to experiment with AI models efficiently.

The platform has also received positive feedback from prominent figures in the tech community, further solidifying its reputation as a reliable and effective tool for AI development.

This tutorial shows you how to run DeepSeek-R1 models on Windows on Snapdragon CPU using LM Studio. You can run the steps below on Snapdragon X Series laptops.

Running on CPU – LM Studio how to guide

1. Visit the LM Studio website. Open your browser and go to https://lmstudio.ai

LM Studio, Windows on Snapdragon option
Figure 1: LM Studio, Windows on Snapdragon option

2. Download and install. Download the LM Studio for Windows (arm64) installer. Follow the on-screen instructions to complete the installation and launch the application.

LM Studio installer(1)
Figure 2: LM Studio installer step 1.
LM Studio installer(2)
Figure 3: LM Studio installer Step 2.
LM Studio installer(3)
Figuer 4: LM Studio installer Step 3.

3. Initialize your model. From the onboarding screen, select your LLM model and wait for it to download:

LM Studio model selection
Figure 5: LM Studio model selection.
LM Studio model download(1)
Figure 6: LM Studio model download Step 1.
LM Studio model download(2)
Figure 7: LM Studio model download Step 2.

4. Start using the application. Once the download is complete, you can load the model and begin using the chat application based on the selected LLM model:

Start using LM Studio(1)
Figure 8: Start using LM Studio Step 1.
Start using LM Studio(2)
Figure 9: Start using LM Studio Step 2.
Start using LM Studio(3)
Figure 10: Start using LM Studio Step 3.

5. Switch models. You can select a different model from the dropdown menu at the top; for example, DeepSeek R1 Distilled (Qwen 7B), as shown below:

LM Studio, selecting DeepSeek-R1
Figure 11: LM Studio, selecting DeepSeek-R1

6. Monitor performance. Open the Windows Task Manager to see the model running locally (using CPU) on your device:

LM Studio, monitoring model performance on CPU in Windows Task Manager
Figure 12: LM Studio, monitoring model performance on CPU in Windows Task Manager.

Next steps

We’ll have more details shortly about running on NPU.

Meanwhile, Microsoft is bringing NPU-optimized versions of DeepSeek-R1 directly to Copilot+ PCs, starting with Qualcomm Snapdragon X Series devices. The company also announced that the distilled DeepSeek R1 models, optimized using ONNX, are now available on Snapdragon-powered Copilot+ PCs. These models offer a time to first token of less than 70 ms for short prompts (<64 tokens) and a throughput rate of 25-40 tokens/s, with longer responses achieving higher throughput. Get started today by downloading the AI Toolkit extension in VS Code.

Want to find out more about DeepSeek on Windows on Snapdragon? Join our Developer Discord for more insights and real-time conversations with fellow developers and our technical experts.

Opinions expressed in the content posted here are the personal opinions of the original authors, and do not necessarily reflect those of Qualcomm Incorporated or its subsidiaries ("Qualcomm"). The content is provided for informational purposes only and is not meant to be an endorsement or representation by Qualcomm or any other party. This site may also provide links or references to non-Qualcomm sites and resources. Qualcomm makes no representations, warranties, or other commitments whatsoever about any non-Qualcomm sites or third-party resources that may be referenced, accessible from, or linked to this site.

Qualcomm-branded products are products of Qualcomm Technologies, Inc. and/or its subsidiaries.

About the Authors
Devang Aggarwal
Devang AggarwalProduct Manager, Senior
Dileep Karpur
Dileep Karpur
Qualcomm relentlessly innovates to deliver intelligent computing everywhere, helping the world tackle some of its most important challenges. Our leading-edge AI, high performance, low-power computing, and unrivaled connectivity deliver proven solutions that transform major industries. At Qualcomm, we are engineering human progress.

Stay connected

Get the latest Qualcomm and industry information delivered to your inbox.

Subscribe
Manage your subscription

© Qualcomm Technologies, Inc. and/or its affiliated companies.

Snapdragon and Qualcomm branded products are products of Qualcomm Technologies, Inc. and/or its subsidiaries. Qualcomm patented technologies are licensed by Qualcomm Incorporated.

Note: Certain services and materials may require you to accept additional terms and conditions before accessing or using those items.

References to "Qualcomm" may mean Qualcomm Incorporated, or subsidiaries or business units within the Qualcomm corporate structure, as applicable.

Qualcomm Incorporated includes our licensing business, QTL, and the vast majority of our patent portfolio. Qualcomm Technologies, Inc., a subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of our engineering, research and development functions, and substantially all of our products and services businesses, including our QCT semiconductor business.

Materials that are as of a specific date, including but not limited to press releases, presentations, blog posts and webcasts, may have been superseded by subsequent events or disclosures.

Nothing in these materials is an offer to sell or license any of the services or materials referenced herein.