Found 29,416 results in 0.024 seconds. Displaying page 3 of 2,942, sorted by
Sent 2010-03-10 by kanimesh <kanimesh@...>
hey!
Were you able to nail this? Can you share your findings / code?
Best,
Animesh
David Jashi wrote:
>
> By the way, Otis, and what should one do to make found words highlight in
> search results?
> If the found word is not in the form that search criteria is, its not
> highlighted.
>
> O...
Sent 2010-03-10 by Claudio Martella <claudio.martella@...>
Hi,
I'm using nutch to crawl different intranet sites. The idea is to use
the craw-urlfilter to tell the crawler to "stay" inside the seeded
domain. I don't want it to follow links all around my intranet and crawl
the same sites twice. This ideally means i'd have to rewrite the
nutch-site.xml ea...
Sent 2010-03-09 by Andrzej Bialecki <ab@...>
On 2010-03-09 18:17, Julien Nioche wrote:
> Hi Chris,
>
> Excellent idea! There have been quite a few changes since 1.0 and it's
> probably the right time to have a new release.
+1. Let's just check JIRA and make sure we didn't forget anything
important ...
> Not really a blocker but https://...
Sent 2010-03-09 by Julien Nioche <lists.digitalpebble@...>
Bonjour Yves,
Did you see https://issues.apache.org/jira/browse/NUTCH-770? It has been
committed to the trunk back in December.
HTH
Julien
--
DigitalPebble Ltd
http://www.digitalpebble.com
On 9 March 2010 17:26, Yves Petinot wrote:
> I was wondering if the current release ...
Sent 2010-03-09 by Yves Petinot <yves@...>
I was wondering if the current release of Nutch provides any support for
slow servers ? The issue has been previously described in the following
JIRA entry:
https://issues.apache.org/jira/browse/NUTCH-629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12...
Sent 2010-03-09 by Julien Nioche <lists.digitalpebble@...>
Hi Chris,
Excellent idea! There have been quite a few changes since 1.0 and it's
probably the right time to have a new release.
Not really a blocker but
https://issues.apache.org/jira/browse/NUTCH-762would be nice to have
in 1.1, just needs a bit of reviewing / testing I
suppose. Otherwise this ...
Sent 2010-03-09 by eks dev <eksdev@...>
sorry for the noise.. I've mixed up Emails
----- Original Message ----
> From: eks dev
> To: nutch-user@lucene.apache.org
> Sent: Tue, 9 March, 2010 18:07:47
> Subject: Re: Two Nutch parallel crawl with two conf folder.
>
> coool answer
>
>
>
> ----- Original Message ...
Sent 2010-03-09 by "Mattmann, Chris A (388J)" <chris.a.mattmann@...>
Hey Guys,
I have some extra time this weekend and early next week. Want me to be the
RM and push out a 1.1 release? Any blockers? I'm happy to do it just let me
know.
Cheers,
Chris
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scienti...
Sent 2010-03-09 by eks dev <eksdev@...>
coool answer
----- Original Message ----
> From: MilleBii
> To: nutch-user@lucene.apache.org
> Sent: Tue, 9 March, 2010 8:35:42
> Subject: Re: Two Nutch parallel crawl with two conf folder.
>
> Yes it should work, I personnaly run some tests crawl on the same
> hardware, ...
Sent 2010-03-09 by BELLINI ADAM <mbellil@...>
hi,
i dont know if you did find few minutes to see my problem :)
but i want to explain it again, mabe it wasnt clear :
i have HTTP pages redirected to HTTPS (but it's the same URL):
HTTP://page1.com redirrected to HTTPS://page1.com
the content of my page HTTP is empty.
the content of m...