jieba库是一个非常优秀的中文分词软件,其中一个应用就是来完成一个词云图的设计
首先你需要准备这些库
import jieba
import matplotlib.pyplot as plt
import matplotlib
from wordcloud import WordCloud
一般来讲jieba库和matplotlib库安装没太有问题,wordcloud第三方库会较为繁琐,可以采用先安装wheel再使用pip安装下载到本地的wordcloud
然后完成读取文本的函数
def read_txt():
txt = open("三体节选.txt", "r", encoding="utf-8").read()
wordlist = jieba.lcut(txt)
wordcount = {}
for word in wordlist:
if len(word) == 1:
continue
if word not in wordcount.keys():
wordcount[word] = 0
wordcount[word] += 1
return wordcount, wordlist
最后就可以直接创建词云图了
def create_word_cloud(wordcount):
matplotlib.rcParams['figure.figsize'] = (12.0, 12.0)
word_cloud = WordCloud(font_path='origi