WordNet介绍和使用

最新推荐文章于 2025-04-20 08:30:00 发布

ICTExtr9

最新推荐文章于 2025-04-20 08:30:00 发布

阅读量4.6w

点赞数 10

分类专栏： e 其他文章标签： distance 自然语言处理 path books python 工具

本文链接：https://blog.youkuaiyun.com/ictextr9/article/details/4008703

版权

e 其他专栏收录该内容

16 篇文章

订阅专栏

Wordnet是一个词典。每个词语(word)可能有多个不同的语义，对应不同的sense。而每个不同的语义（sense）又可能对应多个词，如topic和subject在某些情况下是同义的，一个sense中的多个消除了多义性的词语叫做lemma。例如，“publish”是一个word，它可能有多个sense：

1. (39) print, publish -- (put into print; "The newspaper published the news of the royal couple's divorce"; "These news should not be printed")

2. (14) publish, bring out, put out, issue, release -- (prepare and issue for public distribution or sale; "publish a magazine or newspaper")

3. (4) publish, write -- (have (one's written work) issued for publication; "How many books did Georges Simenon write?"; "She published 25 books during her long career")

在第一个sense中，print和publish都是lemma。Sense 1括号内的数字39表示publish以sense 1在某外部语料中出现的次数。显然，publish大多数时候以sense 1出现，很少以sense 3出现。