Enterprise Search support for Apache Lucene and Solr by Lucid Imagination

Secondary links

  • Contact Us
  • Log in
  • Downloads
  • Solutions
    • Software |
    • Services |
    • Training |
    • White Papers & Case Studies |
    • Webinars & Events |
  • Developers
    • Blog |
    • Tech Articles |
    • Community |
    • Documentation |
    • Downloads |
    • Webcasts & Podcasts |
  • About
    • Market Overview |
    • Management |
    • Company News |
    • In the Media |
    • Contact |

beta

Start new search

Options

  • results per page

Clear all facets

  • Project clear projects

  • Source clear sources

  • Author clear authors

Search Results for

Results loading...

Found 29,416 results in 0.024 seconds. Displaying page 3 of 2,942, sorted by

  1. [nutch-user] Re: Stemming issues

    Sent 2010-03-10 by kanimesh <kanimesh@...>

    hey! Were you able to nail this? Can you share your findings / code? Best, Animesh David Jashi wrote: > > By the way, Otis, and what should one do to make found words highlight in > search results? > If the found word is not in the form that search criteria is, its not > highlighted. > > O...

  2. [nutch-user] use different confs for different crawls

    Sent 2010-03-10 by Claudio Martella <claudio.martella@...>

    Hi, I'm using nutch to crawl different intranet sites. The idea is to use the craw-urlfilter to tell the crawler to "stay" inside the seeded domain. I don't want it to follow links all around my intranet and crawl the same sites twice. This ideally means i'd have to rewrite the nutch-site.xml ea...

  3. [nutch-dev] Re: 1.1 release?

    Sent 2010-03-09 by Andrzej Bialecki <ab@...>

    On 2010-03-09 18:17, Julien Nioche wrote: > Hi Chris, > > Excellent idea! There have been quite a few changes since 1.0 and it's > probably the right time to have a new release. +1. Let's just check JIRA and make sure we didn't forget anything important ... > Not really a blocker but https://...

  4. [nutch-user] Re: Abt: Detect slow and timeout servers and drop their URLs

    Sent 2010-03-09 by Julien Nioche <lists.digitalpebble@...>

    Bonjour Yves, Did you see https://issues.apache.org/jira/browse/NUTCH-770? It has been committed to the trunk back in December. HTH Julien -- DigitalPebble Ltd http://www.digitalpebble.com On 9 March 2010 17:26, Yves Petinot wrote: > I was wondering if the current release ...

  5. [nutch-user] Abt: Detect slow and timeout servers and drop their URLs

    Sent 2010-03-09 by Yves Petinot <yves@...>

    I was wondering if the current release of Nutch provides any support for slow servers ? The issue has been previously described in the following JIRA entry: https://issues.apache.org/jira/browse/NUTCH-629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12...

  6. [nutch-dev] Re: 1.1 release?

    Sent 2010-03-09 by Julien Nioche <lists.digitalpebble@...>

    Hi Chris, Excellent idea! There have been quite a few changes since 1.0 and it's probably the right time to have a new release. Not really a blocker but https://issues.apache.org/jira/browse/NUTCH-762would be nice to have in 1.1, just needs a bit of reviewing / testing I suppose. Otherwise this ...

  7. [nutch-user] Re: Two Nutch parallel crawl with two conf folder.

    Sent 2010-03-09 by eks dev <eksdev@...>

    sorry for the noise.. I've mixed up Emails ----- Original Message ---- > From: eks dev > To: nutch-user@lucene.apache.org > Sent: Tue, 9 March, 2010 18:07:47 > Subject: Re: Two Nutch parallel crawl with two conf folder. > > coool answer > > > > ----- Original Message ...

  8. [nutch-dev] 1.1 release?

    Sent 2010-03-09 by "Mattmann, Chris A (388J)" <chris.a.mattmann@...>

    Hey Guys, I have some extra time this weekend and early next week. Want me to be the RM and push out a 1.1 release? Any blockers? I'm happy to do it just let me know. Cheers, Chris ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Senior Computer Scienti...

  9. [nutch-user] Re: Two Nutch parallel crawl with two conf folder.

    Sent 2010-03-09 by eks dev <eksdev@...>

    coool answer ----- Original Message ---- > From: MilleBii > To: nutch-user@lucene.apache.org > Sent: Tue, 9 March, 2010 8:35:42 > Subject: Re: Two Nutch parallel crawl with two conf folder. > > Yes it should work, I personnaly run some tests crawl on the same > hardware, ...

  10. [nutch-user] RE: Content of redirected urls empty

    Sent 2010-03-09 by BELLINI ADAM <mbellil@...>

    hi, i dont know if you did find few minutes to see my problem :) but i want to explain it again, mabe it wasnt clear : i have HTTP pages redirected to HTTPS (but it's the same URL): HTTP://page1.com redirrected to HTTPS://page1.com the content of my page HTTP is empty. the content of m...

  1. <<
  2. 1
  3. 2
  4. 3
  5. 4
  6. 5
  7. 6
  8. 7
  9. 8
  10. 9
  11. 10
  12. >>

Solr Powered

Give us your feedback

  • Lucene
  • Solr
  • Nutch
  • Tika
  • Mahout
  • Droids
  • PyLucene
  • Lucene.Net
  • Lucy
  • Lucene4c
  • Open Relevance Project
  • How We Can Help:
    • Getting Started |
    • Support Subscriptions |
    • White Papers |
    • Training |
    • Consulting |
    • Contact Us |
  • Developers:
    • Blog |
    • Documentation |
    • Tech Articles |
    • Podcasts and Videos |
    • Community |
  • Downloads:
    • LucidWorks for Solr |
    • LucidWorks for Lucene |
    • LucidGaze for Solr |
    • LucidGaze for Lucene |
  • Products:
  • Services:

Contact | Privacy Policy | Legal Terms of Use | Copyrights and Disclaimers | Admin

Apache Solr, Apache Lucene, ApacheCon and their logos are trademarks of the Apache Software Foundation.

© 2010 Lucid Imagination. All Right reserved.