heritrix一个不错的blog:http://guoyunsky.iteye.com/blog/613249
heritrix定制:http://jyjsjd.iteye.com/blog/1547207
heritrix架构:http://www.oschina.net/p/heritrix/
heritrix速度优化:http://guoyunsky.iteye.com/blog/629891#6680
heritrix 3.1版本比较全的代码分析:http://www.cnblogs.com/chenying99/category/468890.html
Heritrix增量爬取思想:http://blog.youkuaiyun.com/historyasamirror/article/details/6706174
Heritrix增爬:http://blog.sina.com.cn/s/articlelist_1823802015_0_1.html
Heritrix按模块介绍:http://caixinbao1.blog.163.com/blog/static/161494162009730115520760/
heritrix源码分析之URL:http://wliufu.iteye.com/blog/1872446
heritrix order.xml配置http://www.cnblogs.com/loveyakamoz/archive/2011/11/26/2264526.html
new heritrix:http://blog.sina.com.cn/s/blog_6cc084c90100nf39.html