Pinned

  1. tesseract Public

    Tesseract Open Source OCR Engine (main repository)

    C++ · 66.2k stars · 9.8k forks

  2. tessdata_best Public

    Best (most accurate) trained LSTM models.

    1.3k stars · 400 forks

  3. tessdata Public

    Trained models with fast variant of the "best" LSTM models + legacy models

    6.8k stars · 2.3k forks

  4. tessdata_fast Public

    Fast integer versions of trained LSTM models

    529 stars · 151 forks

Repositories

Showing 10 of 14 repositories
  • langdata Public

    Source training data for Tesseract for lots of languages

    853 stars · Apache-2.0 · 883 forks · 45 issues (1 needs help) · 9 pull requests · Updated Apr 1, 2025
  • tesseract Public

    Tesseract Open Source OCR Engine (main repository)

    C++ · 66,158 stars · Apache-2.0 · 9,826 forks · 416 issues (7 need help) · 26 pull requests · Updated Mar 27, 2025
  • tessdoc Public

    Tesseract documentation

    HTML · 2,004 stars · 380 forks · 18 issues · 5 pull requests · Updated Feb 5, 2025
  • tessdata_contrib Public

    User contributed (non Google) OCR models for Tesseract

    25 stars · Apache-2.0 · 24 forks · 0 issues · 4 pull requests · Updated Oct 22, 2024
  • tessdata_fast Public

    Fast integer versions of trained LSTM models

    529 stars · Apache-2.0 · 151 forks · 3 issues · 0 pull requests · Updated Aug 1, 2024
  • test Public

    Repository for tesseract testing

    Shell · 31 stars · Apache-2.0 · 31 forks · 1 issue · 0 pull requests · Updated Jun 10, 2024
  • tesstrain Public

    Train Tesseract LSTM with make

    Python · 666 stars · Apache-2.0 · 203 forks · 63 issues · 2 pull requests · Updated Jun 4, 2024
  • tessdata_best Public

    Best (most accurate) trained LSTM models.

    1,328 stars · Apache-2.0 · 400 forks · 22 issues · 1 pull request · Updated Mar 9, 2024
  • tessdata Public

    Trained models with fast variant of the "best" LSTM models + legacy models

    6,847 stars · Apache-2.0 · 2,308 forks · 51 issues (2 need help) · 2 pull requests · Updated Mar 9, 2024
  • langdata_lstm Public

    Data used for LSTM model training

    117 stars · Apache-2.0 · 155 forks · 24 issues (1 needs help) · 5 pull requests · Updated Mar 9, 2024