ocrivist

A program for digital archiving of printed documents

Introduction

Ocrivist makes it easy to scan and OCR books and other printed documents and then export them to PDF or Djvu format. Ocrivist can be used on Linux or Windows and uses the Leptonica and Tesseract libraries.

The program uses freepascal bindings for libtesseract and the Leptonica library. These can be found in their own folders in the base folder of the SVN repository.

Future releases of Ocrivist can be found at ocrivist.com

Ocrivist version 0.8

Release date: 5 June 2015

Downloads for Linux

Ocrivist 0.8 for Linux MD5 checksum: 0acd66571e4aad19fa272c8be7024a97

Installation (Linux)

Ocrivist can be installed on Linux using the installation script install.sh. On running Ocrivist for the first time it will suggest that you download OCR data for one or more languages from the tesseract-ocr repository on googlecode. Languages can also be installed from the Preferences dialog in the Edit menu.

To benefit from all the functionality of Ocrivist you should also install DjVuLibre and GNU Aspell, with appropriate language and dictionary packs. These can normally be installed using the package manager of your Linux distribution.

Downloads for Windows

Ocrivist 0.8 for Windows MD5 checksum: 712f70603f49a49652db7215644c2eed

Installation (Windows)

Ocrivist can be installed on Windows 32 and 64 bit versions by running the downloaded setup program. To benefit from all the functionality of Ocrivist you should also install DjVuLibre and GNU Aspell, with appropriate language and dictionary packs. Scanning images has not been tested on Windows 8.

Project Information

The project was created on Apr 23, 2011.

License: GNU GPL v3
10 stars
svn-based source control

Labels:
Linux Pascal scanning OCR tesseract leptonica DjVu Windows

Code

Archive