Chapter 1:
Easy, it mostly just uses a few APIs.
Important API: FreqDist().
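A minimal sketch of FreqDist(), assuming NLTK is installed. The token list is made up for illustration, so no corpus download is needed:

```python
from nltk import FreqDist

# A tiny hand-made token list (illustrative only, not from any corpus).
tokens = "the cat sat on the mat and the dog sat too".split()

# FreqDist counts how often each token occurs.
fdist = FreqDist(tokens)

print(fdist["the"])          # count of a single token
print(fdist.most_common(2))  # the two most frequent tokens with their counts
print(fdist.N())             # total number of tokens counted
print(fdist.freq("sat"))     # relative frequency of "sat"
```

FreqDist behaves like a dictionary of counts, so the usual lookups and iteration work on it as well.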
Some natural language understanding technologies: word sense disambiguation, pronoun resolution.
Chapter 2:
Learn to access text corpora and lexical resources.
Important APIs: ConditionalFreqDist(), words(), sents(), raw().
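A small sketch of ConditionalFreqDist(), assuming NLTK is installed. The (condition, word) pairs are invented for illustration; with a real corpus the condition would typically be a category such as a genre:

```python
from nltk import ConditionalFreqDist

# Hypothetical (condition, sample) pairs: here the condition is a genre label.
pairs = [
    ("news", "the"), ("news", "government"), ("news", "the"),
    ("romance", "love"), ("romance", "the"), ("romance", "love"),
]

# One frequency distribution is kept per condition.
cfd = ConditionalFreqDist(pairs)

print(sorted(cfd.conditions()))        # the conditions seen in the data
print(cfd["news"]["the"])              # count of "the" under "news"
print(cfd["romance"].most_common(1))   # most frequent word under "romance"
```

Each cfd[condition] is an ordinary FreqDist, so everything that works on FreqDist works here too.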
Corpora:
To get lexical resources:
WordNet (semantically oriented, with a large amount of information about words)
WordList
StopWords
To get a collection of texts (some with categories):
Gutenberg
Brown
Web and Chat Text
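The resources listed above can be tried out roughly like this. This is only a sketch: it assumes NLTK is installed and that the data packages can be downloaded, and content_fraction() and demo() are helper names made up for these notes. Depending on the NLTK version, WordNet may need an extra data package.

```python
import nltk
from nltk.corpus import brown, gutenberg, stopwords, wordnet


def content_fraction(words, stop_list):
    """Fraction of tokens that are NOT in the given stopword list."""
    if not words:
        return 0.0
    stop_set = {w.lower() for w in stop_list}
    content = [w for w in words if w.lower() not in stop_set]
    return len(content) / len(words)


def demo():
    """Touch each resource once; needs the NLTK data packages."""
    # One-time downloads of the data used below.
    for package in ("gutenberg", "brown", "stopwords", "wordnet"):
        nltk.download(package, quiet=True)

    # Gutenberg: plain texts, accessed by file id.
    emma = "austen-emma.txt"
    print(len(gutenberg.raw(emma)))      # raw() gives the text as one string
    print(gutenberg.words(emma)[:5])     # words() gives a list of tokens

    # Brown: texts grouped into categories.
    print(brown.categories())
    print(brown.sents(categories="news")[0])  # sents() gives lists of tokens

    # StopWords: high-frequency words that are often filtered out.
    english_stops = stopwords.words("english")
    print(content_fraction(brown.words(categories="news"), english_stops))

    # WordNet: look up the senses (synsets) of a word.
    for synset in wordnet.synsets("car")[:2]:
        print(synset.name(), synset.lemma_names(), synset.definition())
```

Calling demo() runs the whole tour. content_fraction() itself only needs two plain lists, so it can also be used without any downloads.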
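This article introduces the basic concepts and techniques of natural language processing (NLP), including key techniques such as frequency distributions, word sense disambiguation, and pronoun resolution, and discusses how to obtain text corpora and lexical resources such as WordNet.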