Enterprise Search support for Apache Lucene and Solr by Lucid Imagination

Secondary links

  • Contact Us
  • Log in
  • Downloads
  • Solutions
    • Software |
    • Services |
    • Training |
    • White Papers & Case Studies |
    • Webinars & Events |
  • Developers
    • Blog |
    • Tech Articles |
    • Community |
    • Documentation |
    • Downloads |
    • Webcasts & Podcasts |
  • About
    • Market Overview |
    • Management |
    • Company News |
    • In the Media |
    • Contact |

beta

Start new search

Options

  • results per page

Clear all facets

  • Project clear projects

  • Source clear sources

  • Author clear authors

Search Results for

Results loading...

Found 29,421 results in 0.1 seconds. Displaying page 9 of 2,943, sorted by

  1. [nutch-user] Text.encode failing during de-duplication

    Sent 2010-02-25 by Eddie Drapkin <oorza2k5@...>

    Hello, I'm trying to upgrade from Nutch 0.9 to Nutch 1.0 and I've solved all of the issues that I seem be having, except for one. When I run a web crawl, everything fetches fine until it gets to dedup, in which case, I get this stack trace: 2010-02-25 14:31:46,592 WARN mapred.LocalJobRunner ...

  2. [nutch-user] Re: String "menu"

    Sent 2010-02-25 by reinhard schwab <reinhard.schwab@...>

    crawl-urlfilter.txt and regex-urlfilter.txt take regular expressions as input. if you want filter out urls, which contain "menu", then just add -.*menu this rule will filter out any urls which contain "menu". note that the first matching rule from top wins. if there is a rule before this rule m...

  3. [nutch-user] Re: String "menu"

    Sent 2010-02-25 by QueroVc <yuri.gopfert@...>

    But the crawl-urlfilter.txt not accept only characters instead of strings? If accepted, as I write? # Skip URLs containing certain characters as probable queries, etc.. -[?*!@=] Could be? # Skip URLs containing certain characters as probable queries, etc.. - [ "menu"] Thanks QueroVc wrote:...

  4. [nutch-user] HTTP ERROR: 404 missing core name in path after integrating nutch

    Sent 2010-02-25 by "Ian M. Evans" <ianevans@...>

    Hi everyone, Last night I was able to get solr up and running. Ran and was able to access: http://www.digitalhit.com:8983/solr/admin This morning, I started on the nutch crawling instructions over at: http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/ After adding the following to ...

  5. [nutch-user] Re: Nutch v0.4

    Sent 2010-02-25 by Ashley Sterritt <ashley.sterritt@...>

    Great, thanks! 2010/2/25 Pedro Bezunartea López : > I was curious about this, and after a little browsing through sourceforge, I > found the CVS link: > > http://nutch.cvs.sourceforge.net/viewvc/nutch/nutch/?pathrev=nutch_0_4 > > HTH, > > Pedro. > > > 2010/2/25 Andrzej Bia...

  6. [nutch-user] Re: Nutch v0.4

    Sent 2010-02-25 by Pedro Bezunartea López <pedro@...>

    I was curious about this, and after a little browsing through sourceforge, I found the CVS link: http://nutch.cvs.sourceforge.net/viewvc/nutch/nutch/?pathrev=nutch_0_4 HTH, Pedro. 2010/2/25 Andrzej Bialecki > On 2010-02-24 17:34, Pedro Bezunartea López wrote: > >> Hi Ashl...

  7. [nutch-user] Re: Nutch v0.4

    Sent 2010-02-25 by Andrzej Bialecki <ab@...>

    On 2010-02-24 17:34, Pedro Bezunartea López wrote: > Hi Ashley, > > Hi, >> I'm looking to reproduce program analysis results based on Nutch v0.4. I >> realize this is a very old release, but is it possible to obtain the source >> from somewhere? I see some of the classes I'm looking for in v0.7,...

  8. [nutch-user] Re: regex-urlfilter.txt and paging variables

    Sent 2010-02-25 by "Andreas P. Koenzen" <akoenzen@...>

    Replace it with this: -[@!*] That's it... Best regards, --- Andreas P. Koenzen On 25/02/2010, at 03:06 a.m., Ian M. Evans wrote: > I suck at regex and in keeping with the Olympic spirit, I probably > suck > at giant slalom too. > > In the regex-urlfilter.txt there's the suggested probable ...

  9. [nutch-user] Re: regex-urlfilter.txt and paging variables

    Sent 2010-02-25 by MilleBii <millebii@...>

    You can add a specific rule before that exclusion rule Something like : +.*/?page=.* 2010/2/25, Ian M. Evans : > I suck at regex and in keeping with the Olympic spirit, I probably suck > at giant slalom too. > > In the regex-urlfilter.txt there's the suggested probable q...

  10. [nutch-user] Re: Seattle Hadoop/Scalability/NoSQL Meetup Tonight!

    Sent 2010-02-25 by Bradford Stephens <bradfordstephens@...>

    Thanks for coming, everyone! We had around 25 people. A *huge* success, for Seattle. And a big thanks to 10gen for sending Richard. Can't wait to see you all next month. On Wed, Feb 24, 2010 at 2:15 PM, Bradford Stephens wrote: > The Seattle Hadoop/Scalability/NoSQL...

  1. <<
  2. 4
  3. 5
  4. 6
  5. 7
  6. 8
  7. 9
  8. 10
  9. 11
  10. 12
  11. 13
  12. >>

Solr Powered

Give us your feedback

  • Lucene
  • Solr
  • Nutch
  • Tika
  • Mahout
  • Droids
  • PyLucene
  • Lucene.Net
  • Lucy
  • Lucene4c
  • Open Relevance Project
  • How We Can Help:
    • Getting Started |
    • Support Subscriptions |
    • White Papers |
    • Training |
    • Consulting |
    • Contact Us |
  • Developers:
    • Blog |
    • Documentation |
    • Tech Articles |
    • Podcasts and Videos |
    • Community |
  • Downloads:
    • LucidWorks for Solr |
    • LucidWorks for Lucene |
    • LucidGaze for Solr |
    • LucidGaze for Lucene |
  • Products:
  • Services:

Contact | Privacy Policy | Legal Terms of Use | Copyrights and Disclaimers | Admin

Apache Solr, Apache Lucene, ApacheCon and their logos are trademarks of the Apache Software Foundation.

© 2010 Lucid Imagination. All Right reserved.