Function to extract bold or italicised text
sanjmeh opened this issue · comments
Sanjay Mehrotra commented
I was redirected to this package from the SO question https://stackoverflow.com/q/53398611/1972786.
I see only 4 functions in pdfbox R package 0.2.0
extract_text, extract_uris, image_count, pdf_info
I tried all 4, None of these can be used for extracting bold or italicised words from the pdf doc.
Please can you throw some light on this and also is there any hidden way to get the meta data of the pdf text extracted from the pdf files?
boB Rudis commented
It + the dependent pkg provides the JARs which can let R use the Java functions. As noted in the "answer" on the site, thinking out of the box is likely a better option.