DeepSeek-OCR Client

A real-time Electron-based desktop GUI for DeepSeek-OCR

Unaffiliated with DeepSeek

Features

Drag-and-drop image upload
Real-time OCR processing

Click regions to copy
Export results as ZIP with markdown images
GPU acceleration (CUDA)

Requirements

Windows 10/11, other OS are experimental
Node.js 18+ (download)
Python 3.12+ (download)
NVIDIA GPU with CUDA

在 PowerShell 中

conda create -yn ocr python=3.12 conda activate ocr pip install -r requirements.txt #重要： $env:ELECTRON_MIRROR="https://npmmirror.com/mirrors/electron/" npm install npm install --registry=https://registry.npmjs.org/

https://registry.npmmirror.com

You are using a model of type deepseek_v1_v2 to instantiate a model of type DeepseekOCR. This is not supported for all configurations of models and can yield errors.

你正在使用 deepseek_v1_v2 类型的模型来实例化 DeepseekOCR 类型的模型。这在某些配置下不被支持，可能会产生错误。

原因这是 Hugging Face Transformers 库的兼容性警告：

DeepSeek-OCR 模型的配置文件中定义的模型类型是 deepseek_v1_v2 但实际加载时使用的是 DeepseekOCR 类 Transformers 检测到类型不完全匹配，发出警告这是官网 https://deepseekocr.com/zh-CN#app

Quick Start (Windows)

Extract the ZIP file
Run start-client.bat
- First run will automatically install dependencies.
- Subsequent runs will start quicker.
Load Model - Click the "Load Model" button in the app, this will download or load the model.
- If this is the first run, this might take some time.
Drop an image or click the drop zone to select one.
Run OCR - Click "Run OCR" to process.

Note: if you have issues processing images but the model loads properly, please close and re-open the app and try with the default resolution for "base" and "size". This is a known issue, if you can help to fix it I would appreciate it!

Linux/macOS

Note: Linux and macOS have not been tested yet. Use start-client.sh instead of start-client.bat.

PRs welcome! If you test on Linux/macOS and encounter issues, please open a pull request with fixes.

Links

Future goals (PRs welcome!)

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
backend		backend
docs/images		docs/images
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
index.html		index.html
main.js		main.js
package-lock.json		package-lock.json
package.json		package.json
renderer.js		renderer.js
requirements.txt		requirements.txt
start-client.bat		start-client.bat
start-client.sh		start-client.sh
start.py		start.py
styles.css		styles.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DeepSeek-OCR Client

Features

Requirements

在 PowerShell 中

Quick Start (Windows)

Linux/macOS

Links

Future goals (PRs welcome!)

License

About

Uh oh!

Releases

Packages

Languages

License

tommycom-sudo/deepseek-ocr-client

Folders and files

Latest commit

History

Repository files navigation

DeepSeek-OCR Client

Features

Requirements

在 PowerShell 中

Quick Start (Windows)

Linux/macOS

Links

Future goals (PRs welcome!)

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages