Enterprise Search support for Apache Lucene and Solr by Lucid Imagination

Search Results for

Found 29,154 results in 0.09 seconds. Displaying page 1 of 2,916, sorted by

  1. [nutch-user] encoding detector

    Sent 2010-02-08 by Ted Yu <yuzhihong@...>

    Hi, I was reading http://wiki.apache.org/nutch/LanguageIdentifier and tried to access the EncodingDetectorPlugin link but the page isn't there: http://wiki.apache.org/nutch/EncodingDetectorPlugin Can someone provide more information? Thanks

  2. [nutch-user] Re: Nutch + Solr: filtering URL while indexing

    Sent 2010-02-08 by Julien Nioche <lists.digitalpebble@...>

    Hi, You'd need to filter the URLs from the segments as well before you index. Removing the entries from the linkDB will just prevent them from getting anchor fields - they'll still be added to the index. Look at the class IndexerMapReduce for more details. An option would be to add support for ...

  3. [nutch-user] Re: Nutch + Solr: filtering URL while indexing

    Sent 2010-02-08 by Stefano Cherchi <stefanocherchi@...>

    Is there nobody out there who can provide some kind of hint? I'm really stuck with this problem and I cannot figure out what else I can do. Thanks S ----- Original message ----- > From: Stefano Cherchi > To: nutch-user@lucene.apache.org > Sent: Thu 4 Feb...

  4. [nutch-dev] example for crawl a url

    Sent 2010-02-08 by Esteve Schouten <eschouten@...>

    Hi, I need help with Nutch. I have a Lucene indexer in my project and I need to add documents with the content of the URLs crawled with Nutch to my Lucene index. How can I do it? Steve -- ----------------------------------------------------------------------- Esteve Schouten Ginard Àrea d'...

  5. [nutch-user] Re: About HBase Integration

    Sent 2010-02-08 by Ryan Smith <ryan.justin.smith@...>

    FWIW, there is a plugin for heritrix to write to hbase as a back end store. Maybe it will help for making a nutch plugin? http://code.google.com/p/hbase-writer -Ryan On Mon, Feb 8, 2010 at 4:32 AM, Hua Su wrote: > Hi all, > > Any recent progress on HBase integration? There...

  6. [nutch-user] About HBase Integration

    Sent 2010-02-08 by Hua Su <huas.su@...>

    Hi all, Any recent progress on HBase integration? There is a filed issue, NUTCH-650. I really love the idea of using HBase as the Nutch storage backend. It not only simplifies Nutch storage, but also makes much URL/page processing work more efficient ...

  7. [nutch-dev] Hudson build is back to normal : Nutch-trunk #1062

    Sent 2010-02-08 by Apache Hudson Server <hudson@...>

    See

  8. [nutch-dev] plugin dev trouble

    Sent 2010-02-07 by Sahil Shah <sahilshah2650@...>

    Hey Everyone, I want to write a plugin that generates snippets/summaries based on the query using an index-based approach. I have read the wiki but I am still not clear on how to understand the source code. The API collection is also huge... There are so many interfaces and classes. Where to s...

  9. [nutch-user] Nutch + Solr: filtering URL while indexing

    Sent 2010-02-04 by Stefano Cherchi <stefanocherchi@...>

    Hi everybody. I've been struggling for three days now with a quite trivial problem, without solution. I need to index a few web sites with the following structure: Page type 1: List of posts (http://www.website.com/list.html?page=XXx) where XXx is a progressive number from 00 to 999. Each page...

  10. [nutch-user] Re: PDF Parsing

    Sent 2010-02-04 by Alexander Aristov <alexander.aristov@...>

    Your problem has nothing to do with PDFs. Do you have messages/exceptions where you are merging indexes? Best Regards Alexander Aristov On 4 February 2010 12:58, Withanage, Dulip < withanage@asia-europe.uni-heidelberg.de> wrote: > Thanks for the initial ideas. > >>do they really corrupt or th...

Apache Solr, Apache Lucene, ApacheCon and their logos are trademarks of the Apache Software Foundation.

© 2009 Lucid Imagination. All rights reserved.