ocropus
The OCRopus(tm) open source document analysis and OCR system
    
Featured Downloads:
ocropus-0.1.1.tar.gz

OCRopus(tm) is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities.

Documentation and Resources

Please have a look at the FrequentlyAskedQuestions

Installation

You can install the 0.1.1 (alpha) release: GettingStartedWithAlpha

You can also install the Subversion release: GettingStartedWithBleedingEdge

Reporting Bugs

When submitting bug reports, please keep the following in mind:

Background

The OCRopus engine is based on two research projects: a high-performance handwriting recognizer developed in the mid-1990s and deployed by the U.S. Census Bureau, and novel high-performance layout analysis methods.

OCRopus development is sponsored by Google and is initially intended for high-throughput, high-volume document conversion efforts. We expect that it will also be an excellent OCR system for many other applications.

Related Standards and Projects

Acknowledgements

The system combines the work of many contributors and previous projects. The core developers work at the IUPR research group at the DFKI and gratefully acknowledge funding from Google.