kermitt2/pdfalto
PDF to XML ALTO file converter
Stargazers: 201
Watchers: 15
Issues: 138
Forks: 66
kermitt2/pdfalto Issues
No rule to make target `libs/image/png/mac/arm64/libpng.a'
Updated
11 days ago
Error case: double column, and line numbers
Updated
21 days ago
macOS build error - fontconfig.h file not found
Updated
2 months ago
Comments count
8
Error case, missing digits
Updated
2 months ago
Comments count
4
ARM binaries for the Apple M1
Closed
3 months ago
Comments count
3
Compilation error on arch linux
Updated
5 months ago
Comments count
1
[Suggestion] Reporting the byte location of images
Updated
7 months ago
Comments count
2
Wrong characters / difference between extraction and display
Updated
7 months ago
Comments count
1
Building for Apple Silicon failed due to missing directories (with manual fix)
Updated
9 months ago
Building on arm64 Ubuntu Server 22.04 fails
Updated
10 months ago
PDF cause a crash with annotation option
Updated
a year ago
Soft hyphens omitted
Updated
a year ago
Comments count
3
Cannot run pdfalto
Closed
a year ago
Comments count
5
xpdf version 4.04
Updated
2 years ago
PDF to XML conversion time out for some files in server mode but run the pdfalto_server cmd in shell is fast and returns ok.
Closed
2 years ago
Comments count
1
[Question] Is it possible to configure GROBID to ignore invisible/hidden text somehow?
Closed
4 years ago
Comments count
1
Segmentation fault with pdf with comments
Updated
2 years ago
Error case with invalid characters mapping
Updated
2 years ago
compile error on RHEL 8.6 (Ootpa): /usr/bin/ld: cannot find -lstdc++
Updated
2 years ago
Comments count
1
heap-buffer-overflow found?
Updated
2 years ago
empty image / svg
Updated
2 years ago
Is there an option to output ALTO XML to STDOUT?
Updated
2 years ago
Comments count
3
XML to PDF
Updated
2 years ago
Comments count
1
support discarding diagonal text like pdftotext(xpdf version)
Updated
2 years ago
Comments count
1
export `ROTATION` attribute for TextBlock.
Updated
2 years ago
Comments count
1
Error case character composition
Updated
2 years ago
Question about <MeasurementUnit>
Updated
2 years ago
Certain characters, such as the grave accent, tilde and degree sign causing string construction discrepancies
Updated
2 years ago
Comments count
5
Invalid alto xml
Closed
2 years ago
Comments count
4
"Broken" links, or links with no destination in annotations file
Updated
2 years ago
File name encoding Windows
Updated
2 years ago
Blank image overlay
Updated
3 years ago
Comments count
2
Can't compile pdfalto: error: ‘for’ loop initial declarations are only allowed in C99 mode
Closed
3 years ago
Comments count
4
can't clone repository xpdf. using ssh instead of https
Closed
3 years ago
Comments count
4
heap-buffer-overflow (/lib/x86_64-linux-gnu/libasan.so.5+0xc717c) in strncat
Updated
3 years ago
Bold check doesn't support fonts named "fontname-heavy"
Updated
3 years ago
Option for generating extracted svg graphics only
Updated
3 years ago
Syntax Warning: Invalid entry in bfchar block in ToUnicode CMap
Updated
3 years ago
One PDF where random strings are dropped (depending on filename length?)
Updated
3 years ago
Comments count
1
multiple runs of pdfalto on macOs returning different results
Updated
3 years ago
Missing String elements from large PDF
Updated
3 years ago
Comments count
2
combining chars eat spaces
Closed
3 years ago
Comments count
2
Missing numbers
Closed
3 years ago
Comments count
4
The extracted coordinates of different tokens are pointing to the same positions
Updated
3 years ago
Comments count
1
Feature Request: Additional Styles
Updated
3 years ago
Comments count
3
No font in show
Updated
3 years ago
Comments count
2
Suggestion: Unit tests
Updated
3 years ago
Add rotation attribute in ALTO output
Updated
3 years ago
Randomly omitted characters
Updated
3 years ago
Comments count
2
Space between most of the character for some documents
Closed
3 years ago
Comments count
6