All Questions
4
questions
0
votes
0
answers
277
views
Extract text from PDF section keeping strings in one line
I have a bunch of PDF files and I need to extract some information from them. The "section" have the text "Referências" and looks like the picture below:
I tried a lot of text extractor tools to ...
0
votes
1
answer
1k
views
Issue parsing PDF with Apache Nutch - extractor plugin
I am trying to index web pages AND pdf documents from a website. I am using Nutch 1.9.
I downloade the nutch-custom-search plugin from https://github.com/BayanGroup/nutch-custom-search. The plugin is ...
0
votes
0
answers
145
views
PDF text extractor
I need to upload into a database the information stored in advertising leaflets from the major retail chains (eg Carrefour, Lidl etc).
The quickest solution I've found is:
I download the PDF from ...
-1
votes
1
answer
257
views
Extract a specific data from pdf files
Can someone give some solution, idea or just an indication about how I can extract some specific data from a pdf file, I used pdfbox with PDfTextStripper to extract all the text in the pdf file, it's ...