运用jieba库统计词频及制作词云
一、对新时代中国特色社会主义做词频统计
import jieba txt = open("新时代中国特色社会主义.txt","r",encoding="utf-8").read() words = jieba.lcut(txt) counts = {} for word in words: if len(word) == 1: continue else: counts[word] = counts.get(word,0)+1 items = list(counts.items()) items.sort(key=lambda x:x[1], reverse=True) for i in range(20): word, count = items[i] print("{0:<10}{1:>5}".format(word, count))
二、根据词频制作词云
#GovRptWordCloudv2.py import jieba import wordcloud from imageio import imread mask = imread("dd.png") f = open("新时代中国特色社会主义.txt","r",encoding="utf-8") t = f.read() f.close() ls = jieba.lcut(t) txt = " ".join(ls) w = wordcloud.WordCloud(font_path = "simkai.ttf",mask = mask,width = 1000,height = 700,background_color = "black",max_words = 20) w.generate(txt) w.to_file("grwordcloud.png")
相关推荐
kikaylee 2020-07-05
zooozx 2020-06-27
xiaocao0 2020-06-25
pySVNA 2020-06-14
fkyyly 2020-05-31
ustbclearwang 2020-05-09
cqulun 2020-04-19
chongtianfeiyu 2020-04-10
xiaocao0 2020-04-09
fkyyly 2020-04-07
chouliqingke 2020-04-07
fkyyly 2020-03-28
cqulun 2020-02-13
cqulun 2020-02-10
laityc 2020-02-10
wordmhg 2020-02-09
小发猫 2020-02-02
fkyyly 2020-01-28