Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ocropus-hocr should use property image instead of file #112

Closed
zuphilip opened this issue Oct 4, 2016 · 0 comments
Closed

ocropus-hocr should use property image instead of file #112

zuphilip opened this issue Oct 4, 2016 · 0 comments
Labels

Comments

@zuphilip
Copy link
Collaborator

zuphilip commented Oct 4, 2016

In the hocr fileformat it is important to link to the image from each ocr_page, see http://kba.github.io/hocr-spec/1.2/#image, and this should look like

<div class='ocr_page' title='image 433934212_0017.bin.png'>

where currently ocropus produces the nonstandard

<div class='ocr_page' title='file 433934212_0017.bin.png'>

I guess it is enough to change this line https://github.com/tmbdev/ocropy/blob/master/ocropus-hocr#L66

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant