tesseract-ocr

Tesseract OCR

2.1k followers
https://github.com/tesseract-ocr/

Overview
Repositories 14
Projects
Packages
People 1

Pinned

tesseract Public

Tesseract Open Source OCR Engine (main repository)

C++ · 66.4k stars · 9.8k forks
tessdata_best Public

Best (most accurate) trained LSTM models.

1.3k stars · 400 forks
tessdata Public

Trained models with fast variant of the "best" LSTM models + legacy models

6.9k stars · 2.3k forks
tessdata_fast Public

Fast integer versions of trained LSTM models

532 stars · 150 forks

Repositories

Showing 10 of 14 repositories

tesstrain Public
Train Tesseract LSTM with make

Python · 672 stars · Apache-2.0 · 204 forks · 62 issues · 2 pull requests · Updated Apr 18, 2025
tessdata_contrib Public
User contributed (non Google) OCR models for Tesseract

26 stars · Apache-2.0 · 24 forks · 0 issues · 3 pull requests · Updated Apr 18, 2025
langdata Public
Source training data for Tesseract for lots of languages

854 stars · Apache-2.0 · 883 forks · 45 issues (1 issue needs help) · 9 pull requests · Updated Apr 1, 2025
tesseract Public
Tesseract Open Source OCR Engine (main repository)

C++ · 66,382 stars · Apache-2.0 · 9,841 forks · 417 issues (7 issues need help) · 26 pull requests · Updated Mar 27, 2025
tessdoc Public
Tesseract documentation

HTML · 2,011 stars · 382 forks · 19 issues · 5 pull requests · Updated Feb 5, 2025
tessdata_fast Public
Fast integer versions of trained LSTM models

532 stars · Apache-2.0 · 150 forks · 3 issues · 0 pull requests · Updated Aug 1, 2024
test Public
Repository for tesseract testing

Shell · 31 stars · Apache-2.0 · 31 forks · 1 issue · 0 pull requests · Updated Jun 9, 2024
tessdata_best Public
Best (most accurate) trained LSTM models.

1,332 stars · Apache-2.0 · 400 forks · 22 issues · 1 pull request · Updated Mar 9, 2024
tessdata Public
Trained models with fast variant of the "best" LSTM models + legacy models

6,872 stars · Apache-2.0 · 2,314 forks · 51 issues (2 issues need help) · 2 pull requests · Updated Mar 9, 2024
langdata_lstm Public
Data used for LSTM model training

117 stars · Apache-2.0 · 156 forks · 24 issues (1 issue needs help) · 7 pull requests · Updated Mar 9, 2024

Top languages

HTML Shell Ruby Python Makefile

Most used topics

ocr tesseract hacktoberfest tesseract-ocr
