olmOCR-7B-0225-preview
A preview release of AllenAI's olmOCR model, fine-tuned from Qwen2-VL-7B-Instruct using the olmOCR-mix-0225 dataset. It is designed for document OCR and recognition, processing PDF images by extracting text and metadata. The model is intended to be used in conjunction with the olmOCR toolkit for efficient, large-scale document processing.
Key Information
- Category: Vision Models
- Source: Huggingface
- Tags: image-to-text
- Last updated: January 09, 2026
Structured Metrics
No structured metrics captured yet.
Links
Canonical source: https://huggingface.co/allenai/olmOCR-7B-0225-preview