Problems encountered:
1. Importing the scipy library failed: module not found
The first time, I installed it from a downloaded .whl file:
pip install absolute-path\.... .whl
- Fix:
pip install scipy
Orz...
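A quick way to confirm that the reinstall worked is to ask the interpreter whether it can locate the module at all. This is only a minimal sketch: `is_installed` is a helper name made up for these notes, and it is meant for top-level package names only.

```python
import importlib.util

def is_installed(module_name):
    """Return True if the interpreter can locate a top-level module."""
    # find_spec returns None when no such top-level module can be found
    return importlib.util.find_spec(module_name) is not None

# Report the status of the packages these notes rely on
for name in ("scipy", "numpy", "wordcloud"):
    print(name, "OK" if is_installed(name) else "missing")
```

If a package shows up as missing right after a pip install, pip and the script are probably using different Python environments.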
2. English word cloud (the teacher's in-class demo)
import re
import numpy as np
import matplotlib.pyplot as plt
from wordcloud import WordCloud
from PIL import Image
# Read the source text and normalise it to lower case
with open("wcs.txt", 'r') as file1:
    txt = file1.read().lower()
# Replace punctuation with spaces; re.sub returns a new string, so keep the result
txt = re.sub(r'[,."/%?:]', ' ', txt)
words = txt.split()
words_index = set(words)
# Map each distinct word to its number of occurrences
dic = {i: words.count(i) for i in words_index}
print(dic)
# Use the image as the mask that shapes the cloud
image1 = Image.open("hreat.jpg")
graph = np.array(image1)
wc = WordCloud(background_color='white', mask=graph)
wc.generate_from_frequencies(dic)
plt.imshow(wc)  # draw the word cloud
plt.axis("off")
plt.show()  # show the picture
Open questions:
- Filter out meaningless words: and, the ……
- What stopwords do
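On the stopword question: stopwords are very common words ("and", "the", ...) that carry little meaning and would otherwise dominate the cloud. As far as I know, WordCloud accepts a `stopwords` argument and the package ships a `STOPWORDS` set, but that argument is ignored when the cloud is built with generate_from_frequencies, as in the demo above, so the filtering has to happen before counting. A minimal sketch, assuming a tiny hand-written stopword list (`STOP` and `count_words` are names made up for this example):

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list; a real one would be much longer
STOP = {"and", "the", "a", "of", "to", "in", "is"}

def count_words(text, stopwords=STOP):
    """Lower-case the text, strip punctuation, and count the non-stopword tokens."""
    text = re.sub(r'[,."/%?:]', ' ', text.lower())
    return Counter(w for w in text.split() if w not in stopwords)

freq = count_words("The cat and the hat: a cat in the hat.")
print(freq.most_common(2))
```

The resulting Counter is a dict subclass, so it should be usable in place of `dic` in the call to generate_from_frequencies.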
3. Chinese word cloud
from os import path
import jieba
import numpy as np
from wordcloud import WordCloud, ImageColorGenerator
import matplotlib.pyplot as plt
from PIL import Image
d = path.dirname(__file__)
with open('quanzhi.txt', encoding='gbk') as f:
    txt1 = f.read()
txt1_list = txt1.split(',')  # list of text fragments
# Join the segmented words with spaces so WordCloud can tell them apart
word_list = [" ".join(jieba.cut(sentences)) for sentences in txt1_list]
new_text = ' '.join(word_list)
# Load the background image used as the mask
# (scipy.misc.imread is gone from recent SciPy releases, so read the image with PIL instead)
alice_coloring = np.array(Image.open(path.join(d, "yexiu.jpg")))
fontname = path.join(d, "simfang.ttf")
wordcloud = WordCloud(
    background_color='white',
    mask=alice_coloring,  # background image as the mask
    font_path=fontname,  # a font path must be specified for Chinese text
    max_words=2000
).generate(new_text)
plt.imshow(wordcloud)  # word cloud with random colours
plt.axis("off")
#plt.show()
# Recolour the word cloud with the colours of the image
image_color = ImageColorGenerator(alice_coloring)
plt.imshow(wordcloud.recolor(color_func=image_color))
plt.axis("off")
plt.show()
wordcloud.to_file('wc_c.jpg')  # save the word cloud as an image file
Open questions:
- Word frequencies
- How to render the picture better…… presumably it gets clearer as more and more words are included……
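For the word-frequency question, one possible approach is to count the segmented tokens directly and look at the most frequent ones. The sketch below assumes the tokens come from a segmenter such as `jieba.lcut(text)`; the sample list is made up, and `top_words` with its `min_len` cut-off is a hypothetical helper, not part of jieba or wordcloud.

```python
from collections import Counter

def top_words(tokens, n=10, min_len=2):
    """Return the n most frequent tokens, skipping whitespace and very short tokens."""
    cleaned = (t.strip() for t in tokens)
    # Single characters are mostly particles and punctuation, so drop them
    return Counter(t for t in cleaned if len(t) >= min_len).most_common(n)

# Tokens as a segmenter might return them (made-up sample)
sample = ["叶修", "的", "荣耀", ",", "叶修", "战队", "荣耀", "叶修"]
print(top_words(sample, n=2))
```

Passing `dict(top_words(tokens, n=200))` to generate_from_frequencies would be one way to control exactly which words reach the cloud.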