"min_characters_to_try" parameter does not work
frank-pian opened this issue
Current Behavior
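Passing -c min_characters_to_try=1 has no effect with --psm 12: Tesseract still prints "Too few characters. Skipping this page" and returns no text.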
tesseract "C:\Users\bianyongfang\Downloads\ocr-training-20240305-184120.png" - -l eng --psm 12 -c min_characters_to_try=1
Too few characters. Skipping this page
OSD: Weak margin (0.00) for 5 blob text block, but using orientation anyway: 0
Empty page!!
Too few characters. Skipping this page
OSD: Weak margin (0.00) for 5 blob text block, but using orientation anyway: 0
Empty page!!
But the same image produces output with --psm 6:
tesseract "C:\Users\bianyongfang\Downloads\ocr-training-20240305-184120.png" - -l eng --psm 6
1 6
2 5
3 4
Expected Behavior
No response
Suggested Fix
No response
tesseract -v
tesseract v5.3.3.20231005
leptonica-1.83.1
libgif 5.2.1 : libjpeg 8d (libjpeg-turbo 2.1.4) : libpng 1.6.40 : libtiff 4.6.0 : zlib 1.2.13 : libwebp 1.3.2 : libopenjp2 2.5.0
Found AVX2
Found AVX
Found FMA
Found SSE4.1
Found libarchive 3.7.2 zlib/1.3 liblzma/5.4.4 bz2lib/1.0.8 liblz4/1.9.4 libzstd/1.5.5
Found libcurl/8.3.0 Schannel zlib/1.3 brotli/1.1.0 zstd/1.5.5 libidn2/2.3.4 libpsl/0.21.2 (+libidn2/2.3.3) libssh2/1.11.0
Operating System
Windows 10
Other Operating System
No response
uname -a
No response
Compiler
No response
CPU
No response
Virtualization / Containers
No response
Other Information
No response
Do psm 0 and 1 work here?
No, neither works:
tesseract "C:\Users\bianyongfang\Downloads\ocr-training-20240305-184120.png" - -l eng --psm 0 -c min_characters_to_try=1
Warning, detects only orientation with -l eng
Error, OSD requires a model for the legacy engine
tesseract "C:\Users\bianyongfang\Downloads\ocr-training-20240305-184120.png" - -l eng --psm 1 -c min_characters_to_try=1
Too few characters. Skipping this page
OSD: Weak margin (0.00) for 5 blob text block, but using orientation anyway: 0
Empty page!!
Too few characters. Skipping this page
OSD: Weak margin (0.00) for 5 blob text block, but using orientation anyway: 0
Empty page!!
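To compare page segmentation modes side by side, the invocations above could be scripted. The following is only a minimal sketch: the helper names build_command and run_modes and the file name page.png are placeholders invented for illustration, and the only flags used are the ones already shown in this thread (-l, --psm, -c min_characters_to_try=N).

```python
import subprocess


def build_command(image, psm, lang="eng", min_chars=None):
    """Build the tesseract argument list for one page segmentation mode.

    Output goes to stdout ("-"), matching the commands in this thread.
    """
    cmd = ["tesseract", image, "-", "-l", lang, "--psm", str(psm)]
    if min_chars is not None:
        # Same override as used in the report: -c min_characters_to_try=N
        cmd += ["-c", f"min_characters_to_try={min_chars}"]
    return cmd


def run_modes(image, modes, min_chars=None):
    """Run tesseract once per mode; return {psm: (returncode, stdout, stderr)}."""
    results = {}
    for psm in modes:
        proc = subprocess.run(
            build_command(image, psm, min_chars=min_chars),
            capture_output=True,
            text=True,
        )
        results[psm] = (proc.returncode, proc.stdout, proc.stderr)
    return results


if __name__ == "__main__":
    # "page.png" is a placeholder path; substitute the image from the report.
    for psm in (0, 1, 6, 12):
        print(" ".join(build_command("page.png", psm, min_chars=1)))
```

Run as is, the script only prints the four command lines. Calling run_modes("page.png", (0, 1, 6, 12), min_chars=1) would actually execute them, which requires tesseract on the PATH, and collects stdout and stderr per mode so the diagnostics can be compared directly.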