All Questions

0 votes
1 answer
126 views

Using each plugin in Nutch separately

I'm using the extractor plugin with Nutch 1.15. The plugin makes use of parsed data and works fine when used as a whole. The problem arises when a few changes are made to the custom-extractos....
Sudha Sompura
0 votes
1 answer
90 views

How to access inner HTML content with the CSS engine in the extractor plugin for filtering

I have configured Apache Nutch and Solr with the extractor plugin to filter HTML content. How can I access the inner div content using the CSS engine or the XPath engine? Thanks in ...
A.J.K
  • 1
0 votes
1 answer
1k views

Issue parsing PDF with Apache Nutch - extractor plugin

I am trying to index web pages and PDF documents from a website. I am using Nutch 1.9. I downloaded the nutch-custom-search plugin from https://github.com/BayanGroup/nutch-custom-search. The plugin is ...
cgoasduff
  • 181