| |
ID |
Type |
Status |
Priority |
Milestone |
Owner |
|
Summary + Labels |
... |
| |
4 |
Defect
|
Accepted
|
Low
|
----
|
theraysmith
|
|
[ 1589334 ] segfault for filenames without a dot
|
|
| |
15 |
Enhancement
|
Accepted
|
Medium
|
----
|
theraysmith
|
|
add unit tests
|
|
| |
35 |
Defect
|
Accepted
|
Medium
|
----
|
theraysmith
|
|
Windows Compile with libtiff support
|
|
| |
40 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
Result Not The Same
|
|
| |
41 |
Port
|
Accepted
|
Medium
|
----
|
theraysmith
|
|
Straw poll - VC++6 vs VC++ Express
|
|
| |
42 |
Defect
|
New
|
Medium
|
----
|
----
|
|
How to port Tesseract engine into vb6 project?
|
|
| |
43 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
Tesseract.exe 2.00 Vista 64bit does not run
|
|
| |
44 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
In-complete OCR result
|
|
| |
50 |
Defect
|
Started
|
Medium
|
----
|
----
|
|
tesserat-ocr russian
|
|
| |
52 |
Enhancement
|
Accepted
|
Low
|
----
|
----
|
|
should produce something sensible with --version or -v, etc.
|
|
| |
56 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
WeOCR server/Tesserac works better than Tesseract 2.00 standalone version
|
|
| |
59 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
add option to include position information in text output
|
|
| |
64 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
Tesseract crashes when it tries to process the attached tif file
|
|
| |
66 |
Defect
|
New
|
Medium
|
----
|
----
|
|
Portuguese language it's not visible on the selection list
|
|
| |
68 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
Error: trying to read a DAWG kan(240 lines).freq-dawg that contains 1714 edges while the maximum is 1500."
|
|
| |
69 |
Defect
|
Started
|
Medium
|
----
|
----
|
|
Tesseract should recognize common ligatures for improved recognition rates
|
|
| |
70 |
Defect
|
New
|
Medium
|
----
|
----
|
|
specifiying explicitly binary/text flag in fopen() calls would be better for OS/2 port
|
|
| |
75 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
TessAPI problem. Finding tessdata directory when used in scripting languages.
|
|
| |
77 |
Defect
|
New
|
Medium
|
----
|
----
|
|
patch of SWIG wrapper for python
|
|
| |
78 |
Defect
|
New
|
Medium
|
----
|
----
|
|
Attached picture makes Tesseract freeze, then crash
|
|
| |
85 |
Defect
|
New
|
Medium
|
----
|
----
|
|
TIFF file causes DLLTest.exe to hang
|
|
| |
88 |
Defect
|
New
|
Medium
|
----
|
----
|
|
Delphi wrapper for tessdll.dll ??
|
|
| |
89 |
Enhancement
|
New
|
Medium
|
----
|
----
|
|
Feature request: option to list available languages
|
|
| |
90 |
Defect
|
New
|
Medium
|
----
|
----
|
|
Fails to build with gcc-4.3
|
|
| |
92 |
Defect
|
New
|
Medium
|
----
|
----
|
|
AccessViolation fixed in tessdll.dll
|
|
| |
93 |
Defect
|
New
|
Medium
|
----
|
----
|
|
TessDLL wraper for java
|
|
| |
98 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
leptonica_pageseg.cpp fails to compile in 2.02
|
|
| |
99 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
Dollar symbol recognised as digit 8 if struck through
|
|
| |
101 |
Defect
|
New
|
Medium
|
----
|
----
|
|
tesseract should accept stdin instead of image file so it can be a filter.
|
|
| |
103 |
Defect
|
New
|
Medium
|
----
|
----
|
|
Compiler Error on Leopard 10.5.2 (MacBook)
|
|
| |
107 |
Defect
|
New
|
Medium
|
----
|
----
|
|
make fails due to ./makedummies fra not found as a command
|
|
| |
109 |
Defect
|
New
|
Medium
|
----
|
----
|
|
robustness patches
|
|
| |
114 |
Defect
|
New
|
Medium
|
----
|
----
|
|
the scanned tif document is not directly recognizing by tesseract ocr
|
|
| |
119 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
More intelligence in the OCR, eg, for numbers, addresses, phones
|
|
| |
120 |
Defect
|
New
|
Medium
|
----
|
----
|
|
Spell check requested
|
|
| |
122 |
Defect
|
New
|
Medium
|
----
|
----
|
|
compile with mingw
|
|
| |
123 |
Defect
|
New
|
Medium
|
----
|
----
|
|
user-words file specified by command line
|
|
| |
124 |
Defect
|
New
|
Medium
|
----
|
----
|
|
Tesseract dll can't be instantiate two time
|
|
| |
127 |
Defect
|
New
|
Medium
|
----
|
----
|
|
Line with bolded numbers cause 'o' to be recognized as '0'
|
|
| |
132 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
./configure --prefix=$HOME doesn't work
|
|
| |
136 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
Suggestions
|
|
| |
137 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
Error: Size of unicharset of mftraining is greater than MAX_NUM_CLASSES for "mftraining", fatal
|
|
| |
138 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
Need a way to merge different versions of inttemp, normproto, pffmtable, unicharset
|
|
| |
139 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
wordlist2dawg segfaults on dictionaries larger than 1000000 million words
|
|
| |
141 |
Defect
|
New
|
Medium
|
----
|
----
|
|
unresolved references to semaphores
|
|
| |
144 |
Defect
|
Accepted
|
Low
|
----
|
----
|
|
macro EXIT conflicts with EXIT in OpenCV
|
|
| |
148 |
Defect
|
New
|
Medium
|
----
|
----
|
|
mftraining problem -Kannada -20 tr files
|
|
| |
149 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
Failed to create/build freq-dawg and word-dawg for Slovak language.
|
|
| |
155 |
Defect
|
Started
|
Medium
|
----
|
----
|
|
Error: 6804672 classes in inttemp while unicharset contains 112 unichars.
|
|
| |
156 |
Defect
|
Accepted
|
Medium
|
----
|
----
|
|
2.03 - make fails on older Puppy/Grafpup Linux
|
|
| |
157 |
Defect
|
Accepted
|
Low
|
----
|
soonhui.ngu
|
|
Optional files that are not presented in the tessdata folder will crash the program when running the recognition
|
|
| |
159 |
Defect
|
Accepted
|
Medium
|
----
|
soonhui.ngu
|
|
Cannot Train chinese font file
|
|
| |
161 |
----
|
Accepted
|
----
|
----
|
----
|
|
common OCR errors for numeric characters
|
|
| |
163 |
----
|
Accepted
|
Low
|
----
|
----
|
|
does not recognize .tiff extension
|
|
| |
164 |
----
|
Started
|
----
|
----
|
----
|
|
Tesseract OCR doesn't recognize digits?
|
|
| |
165 |
----
|
Started
|
----
|
----
|
----
|
|
Failed to genarate dawg files for malayalam
|
|
| |
173 |
----
|
New
|
----
|
----
|
----
|
|
compiling errors on OS/2
|
|
| |
174 |
----
|
New
|
----
|
----
|
----
|
|
tesseract ported to use libtool, shared libraries on many platforms, solution to paths problem
|
|
| |
176 |
----
|
Started
|
----
|
----
|
----
|
|
Memory leak. When using tesseract api's
|
|
| |
178 |
----
|
New
|
----
|
----
|
----
|
|
How to compile it on windows ?
|
|
| |
181 |
----
|
Started
|
----
|
----
|
----
|
|
Support for multiple dictionaries
|
|
| |
183 |
----
|
Started
|
----
|
----
|
----
|
|
Well recognized text by tesseract-1.01 is no more recognized by tesseract-2.03
|
|
| |
186 |
----
|
Accepted
|
----
|
----
|
----
|
|
4-byte boundary not respected
|
|
| |
189 |
----
|
Started
|
----
|
----
|
----
|
|
About support in Chinese !
|
|
| |
191 |
----
|
Accepted
|
----
|
----
|
----
|
|
Doesn't recognize small text: Recognize "*354110-53153" instead of "96-020-53753", very clear image, why ?
|
|
| |
194 |
----
|
Started
|
----
|
----
|
----
|
|
Windows Binaries for 2.03 Not Available?
|
|
| |
196 |
----
|
New
|
----
|
----
|
----
|
|
nobatch digits - tessedit_char_whitelist ignored
|
|
| |
197 |
----
|
Accepted
|
----
|
----
|
----
|
|
Memory lack with windows (mouse-click) confirmation
|
|
| |
203 |
----
|
New
|
----
|
----
|
----
|
|
Segmentation fault
|
|
| |
207 |
----
|
New
|
----
|
----
|
----
|
|
tesseract exit when treating multiple images with the baseapi
|
|
| |
213 |
----
|
New
|
----
|
----
|
----
|
|
build error in Debian Linux
|
|
| |
214 |
----
|
New
|
----
|
----
|
----
|
|
Capitol file extension segfault.
|
|
| |
216 |
----
|
New
|
----
|
----
|
----
|
|
uable to locate box files -kannada
|
|
| |
217 |
----
|
New
|
----
|
----
|
----
|
|
APPLY_BOXES: boxfile ...: FAILURE! box overlaps no blobs or blobs in multiple rows - kannada
|
|
| |
218 |
----
|
New
|
----
|
----
|
----
|
|
Revision 282: SVN Build, Header Missing: viewer/svutil.cpp
|
|
| |
219 |
----
|
New
|
----
|
----
|
----
|
|
I think the revision 236 does not work properly. You can never know the directory where is running the dll because the code never reaches the basedir.cpp's line 101
|
|
| |
220 |
----
|
New
|
----
|
----
|
----
|
|
program too big -cmd error
|
|
| |
221 |
----
|
New
|
----
|
----
|
----
|
|
tesseract hOCR output with page, line and word classes, so it can be converted to djvu-hidden-text structure
|
|
| |
222 |
----
|
New
|
----
|
----
|
----
|
|
Please supply dictionary files in language source data
|
|
| |
223 |
----
|
New
|
----
|
----
|
----
|
|
Boxes too low, missing the characters
|
|
| |
224 |
----
|
New
|
----
|
----
|
----
|
|
Upgrading wipes out language-packs
|
|
| |
227 |
----
|
New
|
----
|
----
|
----
|
|
Is there a minimum number of characters required for results? Varying results on 2.04, tessdll, and svn version from 7.30.09
|
|
| |
233 |
----
|
Accepted
|
----
|
----
|
----
|
|
different output looping on the same bitmap
|
|
| |
235 |
----
|
New
|
----
|
----
|
----
|
|
failed to generate mntraining as well as tesseract.log file
|
|
| |
236 |
----
|
New
|
----
|
----
|
----
|
|
tesseract uses locale-aware fscanf to parse fixed-locale files
|
|
| |
238 |
----
|
New
|
----
|
----
|
----
|
|
Leptonica 1.62 release of 7/26/2009 breaks tesseract 2.04 build if HAVE_LEPTLIB is defined
|
|
| |
239 |
----
|
New
|
----
|
----
|
----
|
|
correct output.txt does not produced
|
|
| |
240 |
----
|
New
|
----
|
----
|
----
|
|
Ubuntu9.04>g++4.3.3>ld2.19.1>compile error with ld
|
|
| |
241 |
----
|
New
|
----
|
----
|
----
|
|
Could tesseract support chinese?
|
|
| |
242 |
----
|
New
|
----
|
----
|
----
|
|
Segmentation fault
|
|
| |
243 |
----
|
New
|
----
|
----
|
----
|
|
Crashes on PowerPC in 3.0
|
|
| |
244 |
----
|
New
|
----
|
----
|
----
|
|
Crashes in 3.0 when scanning text with long words (or long lines)
|
|
| |
245 |
----
|
New
|
----
|
----
|
----
|
|
unable to work with new tessdata
|
|
| |
246 |
----
|
New
|
----
|
----
|
----
|
|
Circular dependency with inttemp and .tr files?
|
|
| |
247 |
----
|
New
|
----
|
----
|
----
|
|
Add Bash Completion Script
|
|
| |
248 |
----
|
New
|
----
|
----
|
----
|
|
new[]/delete mismatch in image/imgs.cpp
|
|
| |
249 |
----
|
New
|
----
|
----
|
----
|
|
tessbaseAPI: leptonica's ¿warnings?
|
|
| |
250 |
----
|
New
|
----
|
----
|
----
|
|
Error: Illegal min or max specification occur after start the programm
|
|
| |
251 |
----
|
New
|
----
|
----
|
----
|
|
Centos 5.3, 32 bit, build gets warnings, program produces no output
|
|
| |
252 |
----
|
New
|
----
|
----
|
----
|
|
make fails on Mac OS10.6 snow leopard
|
|