
postagger
iteye_10365
Articles in this column
penn tree bank 1/n
Building a Large Annotated Corpus of English: The Penn Treebank. Mitchell P. Marcus (University of Pennsylvania), Beatrice Santorini (Northwestern University), Mary Ann Marcinkiewicz (University of P...
Original · 2010-08-15 23:24:16 · 211 views · 0 comments
lucene analyzer pos
Parsing? Tokenization? Analysis! Lucene, an indexing and search library, accepts only plain text input. Parsing: Applications that build their search capabilities upon Lucene may support document...
Original · 2010-08-20 07:16:14 · 163 views · 0 comments
university 3/n
Chapter 1. Introduction. ...diagnostic tools for the evaluation of exam prompts, using the corpus analyses as a baseline. In the first stage of the project, we constructed the T2K-SWAL Corpus, which was d...
Original · 2010-08-24 07:55:49 · 258 views · 0 comments
university 4/n (45)
Chapter ?: Vocabulary use in classroom teaching and textbooks. ?.? Introduction. The description of vocabulary use in university contexts is an essential prerequisite to the development of effective...
Original · 2010-08-24 07:57:03 · 295 views · 0 comments
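Lucene 3.0 Word Segmentation with IKAnalyzer
Lucene has recently been updated to version 3.0, and some 2.x usages are no longer supported at all. The example below mainly covers the Chinese tokenizer IKAnalyzer and Lucene's result highlighting. Some 2.x methods have been removed entirely in Lucene 3.x; the 3.x usage is touched on in passing, although the version I use here is still 2.x. Lucene's built-in analyzers handle Chinese word segmentation very poorly, to the point of being painful to look at, so here I recommend using I...
Original · 2010-09-02 12:46:44 · 151 views · 0 comments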
stanford postagger tagsets
Stanford POS tagger tagsets. The part-of-speech tagset used by Stanford's POS tagger is exactly the Penn Treebank tagset. The Penn Treebank Tag Set: The tagset used in tagging the demo corpus available here is the Penn Treebank ...
2010-03-20 10:12:47 · 193 views · 0 comments
hack the stanford postagger demo
By default, the Stanford POS tagger demo outputs slash tags. How can the code be changed so that it outputs XML instead? In MaxentTagger.java, modify the public String apply(String o) method: public String apply(String o) { StringBuilder taggedSentence = new Str...
2010-03-20 10:18:33 · 111 views · 0 comments
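To make the difference between the two output formats concrete, here is a minimal Java sketch that converts an already tagged word/TAG string into XML. It is not the modified MaxentTagger.apply code from the post, which is truncated in the excerpt. The class name SlashTagsToXml, the method names, and the sentence/word/pos element layout are all assumptions chosen for illustration.

```java
// Hypothetical helper: turns slash-tagged text such as "The/DT dog/NN"
// into a simple XML string. Element and attribute names are assumptions.
class SlashTagsToXml {

    // Escape the five characters that are special in XML text and attributes.
    static String escapeXml(String s) {
        return s.replace("&", "&amp;")
                .replace("<", "&lt;")
                .replace(">", "&gt;")
                .replace("\"", "&quot;")
                .replace("'", "&apos;");
    }

    // Split on whitespace, then split each token at its LAST slash so that
    // words containing a slash (for example "1/2/CD") keep the slash in the word.
    static String convert(String tagged) {
        StringBuilder xml = new StringBuilder("<sentence>");
        for (String token : tagged.trim().split("\\s+")) {
            if (token.isEmpty()) {
                continue; // empty input yields an empty sentence element
            }
            int slash = token.lastIndexOf('/');
            String word = slash < 0 ? token : token.substring(0, slash);
            String tag = slash < 0 ? "" : token.substring(slash + 1);
            xml.append("<word pos=\"").append(escapeXml(tag)).append("\">")
               .append(escapeXml(word))
               .append("</word>");
        }
        return xml.append("</sentence>").toString();
    }

    public static void main(String[] args) {
        System.out.println(convert("The/DT dog/NN barks/VBZ ./."));
    }
}
```

Post-processing the slash-tagged string this way leaves the tagger source untouched, which may be simpler than editing MaxentTagger.java when only the output format needs to change.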
[Repost] Function Words and Content Words
Words are divided into two categories: function words and content words. Function words are closed-class words (only about 300 in English), while content words are open-class words (new words are being...
2010-03-21 22:21:12 · 1221 views · 0 comments
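Since this column is about POS tagging, one rough way to approximate the content/function split is by Penn Treebank tag prefix. The sketch below is a heuristic, not something taken from the reposted article: the class name WordClass and the choice of the NN, VB, JJ, and RB prefixes are assumptions. It will misclassify some words, for example auxiliary verbs, which carry VB* tags but are usually counted as function words.

```java
// Hypothetical heuristic: treat nouns, verbs, adjectives, and adverbs
// (Penn Treebank tags starting with NN, VB, JJ, RB) as content words,
// and everything else as function words.
class WordClass {

    static boolean isContentTag(String tag) {
        return tag.startsWith("NN")   // NN, NNS, NNP, NNPS
            || tag.startsWith("VB")   // VB, VBD, VBG, VBN, VBP, VBZ
            || tag.startsWith("JJ")   // JJ, JJR, JJS
            || tag.startsWith("RB");  // RB, RBR, RBS
    }

    public static void main(String[] args) {
        String[] tags = {"DT", "NN", "VBZ", "IN", "JJ"};
        for (String tag : tags) {
            System.out.println(tag + " -> " + (isContentTag(tag) ? "content" : "function"));
        }
    }
}
```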