Tesseract OCR

Tesseract OCR is an open-source optical character recognition engine that can recognize text from images. It supports over 100 languages, multiple image formats (PNG, JPEG, TIFF), and offers both an LSTM-based OCR engine and a legacy mode for character pattern recognition.

Key Information

  • Category: Developer Tools
  • Source: Github
  • Last updated: January 09, 2026

Structured Metrics

No structured metrics captured yet.

Links

Canonical source: https://github.com/tesseract-ocr/tesseract