|
Project Information
Featured
Downloads
|
OverviewA Java implementation of the Structured Prediction Cascades framework as proposed by Weiss & Taskar (2010). This project contains an updated version of the code used to generate the handwriting recognition/OCR results in the paper, as well as a properly formatted version of the OCR dataset. This project is published under the MIT license, which allows free usage and redistribution of the code for any purpose so long as the license and copyright statement remains in the code. An early version of this code was used to generate the OCR results in the original paper. The results have improved further, yielding <7% error. This is less than half the error of other state-of-the-art methods, when trained using the 5500/600 train/test example split on this dataset.
The OCR dataset preformatted for use with this code is available for download from this website. References:
ContributorsThe authors of this package are David Weiss and Kuzman Ganchev, with additional help from Joao Graca. Installation (binary)
java -cp antlr-runtime.jar:trove-2.1.0.jar:structured-cascades.jar
cascades.programs.TrainTagger <args>To install from source code, check out the code from the repository and follow the README instructions. The repository can be checked out directly from Eclipse if you install the Mercurial Eclipse plugin from the Eclipse Marketplace. UsageSee the wiki documentation for usage instructions:
|