My favorites | Sign in
Logo
                
Show all Featured downloads:
ocropus-0.4.tar.gz

About

OCRopus(tm) is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities.

The OCRopus engine is based on two research projects: a high-performance handwriting recognizer developed in the mid-90's and deployed by the US Census bureau, and novel high-performance layout analysis methods.

OCRopus is development is sponsored by Google and is initially intended for high-throughput, high-volume document conversion efforts. We expect that it will also be an excellent OCR system for many other applications.

Resources

Related Projects

Documentation

The following is the most important documentation:

If you want to contribute to the primary documentation, please check out hg clone https://wiki.ocropus.googlecode.com/hg and submit patches against the documentation.

Additional links you may find useful are here:

Bugs and Contributions

Please use the "Issues" tab above to submit bugs, feature requests, etc.

When submitting bug reports, please keep the following in mind:

If you have patches or other contributions, please supply them as a Mercurial bundle (preferred) or patch. Please tag with FixBundle or FixPatch, respectively.

Acknowledgements

The system is combining the work of many contributors and previous projects. The core developers work at the IUPR research group at the DFKI and gratefully acknowledge funding by Google.









Hosted by Google Code