PastaInc logo
Pasta Inc
ai

Run DeepSeek-R1 Locally in Your Browser with WebGPU Acceleration

Run DeepSeek-R1 Locally in Your Browser with WebGPU Acceleration
0 views
3 min read
#ai

Run DeepSeek-R1 Locally in Your Browser with WebGPU Acceleration

Overview: DeepSeek-R1 WebGPU, by Hugging Face, is a next-generation reasoning model that runs locally in your browser with WebGPU acceleration

If you're looking for a free, open-source, local, and fully private model to use, look no further! Hugging Face is breaking down the barriers to development and research for so many humans, in a world where OpenAI and Anthropic are significantly increasing costs and significantly cutting performance.

What is WebGPU, anyway? WebGPU is a modern graphics API framework, or middleware, which is designed to provide high-performance graphics and computation capabilities directly in web browsers. It is the successor to WebGL and aims to offer more advanced features and better performance by leveraging a combination of the capabilities of modern GPUs and the web's infrastructure.

Key Features of WebGPU:

  • Functionality: WebGPU acts as a wrapper for WebGL, the browser's built-in graphics API. It uses the GPU to process and render graphics tasks, making it faster and more efficient than regular WebGL.
  • Optimization: WebGPU provides low-level access to the GPU, allowing for more efficient rendering and computation. Unlike WebGL, WebGPU supports compute shaders, enabling general-purpose GPU computing.
  • Applications: It's used in web applications for rendering graphics, handling animations, and processing images. It supports features like 3D models, videos, and real-time graphics.

In other words, think of WebGPU as a special camera in a room that can take pictures, or renderings, without needing software. It's connected to the room, or the browser, through a network, processing those pictures, or renderings, and displaying them.



How-To Guide:

Run DeepSeek-R1 Locally in Your Browser with WebGPU Acceleration

Prerequisites for the How-To Guide:

These instructions are for a Windows machine!

  • You must have node.js installed
  • You must have Google Chrome installed

Setup Instructions:

  1. Start by creating a local directory on your computer. In your favorite IDE, navigate to that directory.

  2. Run the following command:

git clone https://github.com/huggingface/transformers.js-examples.git
  1. Navigate to the deepseek-r1-webgpu folder with the following command:
cd transformers.js-examples/deepseek-r1-webgpu
  1. Install all of the necessary project dependencies with the following command:
npm i
  1. Run the development server with the following command:
npm run dev
  1. In your IDE, you should see a link to the name of [your-machine:the-port] the program is running on, typically "localhost:5173" (it is not an issue if the port is a different number)
localhost:5173

Open this address in Google Chrome.

  1. When you first navigate to this address, you will see a "Load model" button; click it.

When the model is loaded, you are ready to chat! 🎉

Resources

  1. Hugging Face DeepSeek-R1 WebGPU GitHub
  2. Why OpenAI Needs So Much Money
  3. ChatGPT's Recent Performance Issues: A Deep Dive into Degradation, Discrimination, and the Importance of User Evaluation