Lucid Imagination

Secondary links

  • Contact Us
  • Sign Up or Login
  • Downloads
  • Solutions
    • Partners |
    • Blog |
    • Software |
    • Services |
    • Training |
    • Case Studies |
    • Webcasts |
  • Developers
    • Blog |
    • Tech Articles |
    • Community |
    • Docs |
    • Downloads |
    • Whitepapers |
    • Podcasts |
  • About
    • Market Overview |
    • Management |
    • Company News |
    • In the Media |
    • Contact |

beta

Start new search

Back to search results

  1. FromDate
  2. Ahmad Al-Amri2010-02-16 06:47
  3. xiao yang2010-02-24 08:57

[nutch-user] Inject and index single url

Subject:
Re: Inject and index single url
From:
xiao yang <yangxiao9901@...>
Date:
2010-02-24 08:57
There's no good way to do this.
I'm waiting for Hbase integration with Nutch, which will make this
operation much easier. The data store structure nutch is using now is
not suitable for adding a single url to the index as I know.

Thanks!
Xiao

On Tue, Feb 16, 2010 at 7:47 PM, Ahmad Al-Amri <amri_jo@yahoo.com> wrote:
Hello; I want to inject a single url which is given as a string, I am thinking an add a method in the Injector;; something like this: injector.injectUrl(crawlDb, "http://example.com"); instead of the current inject method, which I guess uses hadoop FileInputFormat to get the urls and inject them into the crawldb... after that, I need to index it only; I guess just use the current generator and other stuff with depth equals to one doing it. what I supposed to use for doing this; and any other missing information I should know ?!! and is building a plug-in is more suitable for doing this. thank you .

Solr Powered

Give us your feedback

  • Lucene
  • Solr
  • Nutch
  • Tika
  • Mahout
  • Droids
  • PyLucene
  • Lucene.Net
  • Lucy
  • Lucene4c
  • Open Relevance Project
  • How We Can Help:
    • Getting Started |
    • Support Subscriptions |
    • White Papers |
    • Training |
    • Consulting |
    • Contact Us |
  • Developers:
    • Blog |
    • Documentation |
    • Tech Articles |
    • Podcasts and Videos |
    • Community |
  • Downloads:
    • LucidWorks for Solr |
    • LucidWorks for Lucene |
    • LucidGaze for Solr |
    • LucidGaze for Lucene |
  • Products:
  • Services:

Contact | Privacy Policy | Legal Terms of Use | Copyrights and Disclaimers | Admin

Apache Solr, Apache Lucene, ApacheCon and their logos are trademarks of the Apache Software Foundation.

© 2010 Lucid Imagination. All Right reserved.