nutch存储数据文件sequencefile mapfile对应keyValue
[code="java"]
crawldb
(org.apache.hadoop.io.Text,org.apache.nutch.crawl.CrawlDatum)
segments/content
(org.apache.hadoop.io.Text,org.apache.nutch.protocol.Content)
segments/crawl_fetch
(org.apach...
原创
2013-10-04 10:17:03 ·
107 阅读 ·
0 评论