FWIW, there is a plugin for heritrix to write to hbase as a back end store.
Maybe it will help for making a nutch plugin?
http://code.google.com/p/hbase-writer
-Ryan
On Mon, Feb 8, 2010 at 4:32 AM, Hua Su <huas.su@gmail.com> wrote:
Hi all,
Any recent progress on HBase integration? There is a filed issue
NUTCH-650<http://issues.apache.org/jira/browse/NUTCH-650>
.
I really love the idea of using HBase as nutch storage backend. It not only
simplifies nutch storage, but also makes much url/page processing work more
efficient due to the features of HBase: HBase data is mutable and indexed
by keys/columns/timestamps.
The issue has been open for a long time (about 17 months). Is there any
plan
to close this issue and release a nutch version with hbase enabled?
Best,
Hua