All Questions
3 questions
0 votes, 1 answer, 126 views
Using each plugin in Nutch separately
I'm using the extractor plugin with Nutch 1.15. The plugin makes use of parsed data.
The plugin works fine when used as a whole. The problem arises when a few changes are made to the custom-extractos....
0 votes, 1 answer, 90 views
How to access inner HTML content with the CSS engine in the extractor plugin for filtering
I have configured Apache Nutch and Solr with the extractor plugin to filter HTML content. How can I access the content of an inner div using the CSS engine or the XPath engine?
Thanks in ...
0 votes, 1 answer, 1k views
Issue parsing PDF with Apache Nutch - extractor plugin
I am trying to index web pages AND pdf documents from a website. I am using Nutch 1.9.
I downloaded the nutch-custom-search plugin from https://github.com/BayanGroup/nutch-custom-search. The plugin is ...