DeepSeek-OCR

DeepSeek-OCR is an open-weight, multilingual vision-language OCR model from DeepSeek that converts images/documents to text (e.g., Markdown) using a context-aware optical compression approach. It runs via Transformers and vLLM, supports FlashAttention, and is provided as BF16 safetensors (~3B params) with example inference code and broad community adoption.

Key Information

  • Category: Vision Models
  • Source: Huggingface
  • Tags: image-text-to-text
  • Last updated: January 17, 2026

Structured Metrics

No structured metrics captured yet.

Links

Canonical source: https://huggingface.co/deepseek-ai/DeepSeek-OCR