
Lucene
molian123456
lucene:IndexSearch
IndexSearcher is = new IndexSearcher("C:\\luceneIndex");
// Tokenize the "path" field; otherwise a document is found only when its full name is entered
QueryParser qp = new QueryParser("path", new StandardAnalyzer());
Query q = qp.parse("安徽");
Hit...
Original · 2012-04-01 08:54:47 · 128 views · 0 comments
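The comment in the excerpt above says that an untokenized field only matches when the full value is entered. The toy sketch below illustrates that idea in plain Java. It does not use Lucene's API: the class `TokenizedFieldDemo`, its methods, and the split-on-punctuation rule are all assumptions made for illustration, standing in for what an analyzer does.

```java
import java.util.Collections;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Toy illustration only; hypothetical names, not Lucene classes.
final class TokenizedFieldDemo {

    // Untokenized field: the whole value is stored as one single term.
    static Set<String> indexUntokenized(String value) {
        return Collections.singleton(value);
    }

    // Tokenized field: the value is split into lower-cased terms.
    // Splitting on non-letter, non-digit characters is an assumption
    // that stands in for a real analyzer.
    static Set<String> indexTokenized(String value) {
        Set<String> terms = new HashSet<>();
        for (String t : value.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    // A single-term query matches only if that exact term was indexed.
    static boolean matches(Set<String> indexedTerms, String queryTerm) {
        return indexedTerms.contains(queryTerm);
    }

    public static void main(String[] args) {
        String path = "C:\\docs\\anhui-report.txt"; // hypothetical sample value
        System.out.println(matches(indexTokenized(path), "anhui"));
        System.out.println(matches(indexUntokenized(path), "anhui"));
        System.out.println(matches(indexUntokenized(path), path));
    }
}
```

With the sample path, the partial term is found only in the tokenized index, while the untokenized index matches only the complete path.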
Access restriction: The type FileURLConnection is not accessible due to restrict
Access restriction: The type FileURLConnection is not accessible due to restriction on required library C:\Program Files\Java\jre6\lib\rt.jar. Solution: this is an Eclipse settings issue. By default Eclipse reports use of these access-restricted APIs as an ERROR; just change Windows-P...
Original · 2012-04-03 09:45:29 · 108 views · 0 comments
Heritrix TransformerConfigurationException
Fatal error: "Could not compile stylesheet" 2010-07-16 21:35:03.359 SEVERE thread-12 org.archive.crawler.framework.WriterPoolProcessor.getFirstrecordBody() Failed transform javax.xml.transform.TransformerConfigurationException: 无...
Original · 2012-04-03 10:45:38 · 117 views · 0 comments
unacceptable 'user-agent' or 'from' (correct your configuration).
Level: SEVERE Message: On crawl: NewsSohu You must set the User-Agent and From HTTP header values to acceptable strings. User-Agent: [software-name](+[info-url])[misc] From: [e...
Original · 2012-04-06 08:45:34 · 178 views · 0 comments
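The error message above quotes the template `[software-name](+[info-url])[misc]` for the User-Agent value. As a rough sketch, a value can be checked against that shape before the crawl is started. The class `UserAgentTemplateCheck` and its regular expression are assumptions written for illustration; they approximate the quoted template and are not taken from Heritrix. The From template is truncated in the excerpt, so it is not covered here.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of Heritrix.
final class UserAgentTemplateCheck {

    // Assumed approximation of "[software-name](+[info-url])[misc]":
    // some name text, then parentheses that contain "+http://" or
    // "+https://" followed by a host with a dot, then optional extra text.
    private static final Pattern USER_AGENT =
            Pattern.compile("\\S+.*\\(.*\\+https?://\\S+\\.\\S+.*\\).*");

    static boolean looksAcceptable(String userAgent) {
        return userAgent != null && USER_AGENT.matcher(userAgent).matches();
    }

    public static void main(String[] args) {
        // Sample values are made up for the demonstration.
        System.out.println(looksAcceptable(
                "Mozilla/5.0 (compatible; mycrawler/1.0 +http://example.com/bot)"));
        System.out.println(looksAcceptable("mycrawler/1.0"));
    }
}
```

The first sample carries a name and a parenthesized `+http://` URL and passes the check; the second has no URL part and is rejected.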
Java JE MMAnalyzer
Requires the je-analysis-1.4.0.jar library.
package com.test;
import java.io.IOException;
import java.io.StringReader;
import jeasy.analysis.MMAnalyzer;
public class testJE {
    public static void mai...
Original · 2012-04-12 12:58:21 · 332 views · 1 comment
Lucene Filter&Sort SortField
String Index_Store_Path = "index";
IndexWriter iw = null;
try {
    iw = new IndexWriter(Index_Store_Path, new StandardAnalyzer(), true);
    iw.setUseCompoundFile(false);
    String[] con...
Original · 2012-04-13 09:42:51 · 131 views · 0 comments
Lucene Filter CachingWrapperFilter
IndexSearcher is = new IndexSearcher(Index_Store_Path);
TermQuery tq = new TermQuery(new Term("bk", "bc"));
QueryFilter qf = new QueryFilter(tq);
QueryParser qp = new QueryParser("bk", new...
Original · 2012-04-13 17:15:28 · 208 views · 0 comments