jieba库应用 python
应用实例:
准备一个txt文件
import jieba txt = open("三国演义.txt","r", encoding = ‘gbk‘,errors=‘ignore‘).read() #读取已存好的txt文档 words = jieba.lcut(txt) #进行分词 counts = {} for word in words: if len(word)== 1: #去掉标点字符和其它单字符 continue else: counts[word] = counts.get(word, 0) + 1 #计数 items = list(counts.items()) #把对象对象转化为列表形式,利于下面操作 #sort() 函数用于对原列表进行排序,如果指定参数,则使用比较函数指定的比较函数 #reverse 排序规则,reverse = True 降序, reverse = False 升序(默认) #key 是用来比较的参数 items.sort(key=lambda x: x[1], reverse = True) for i in range(1000): word, count= items[i] print("{0:<10}{1:>5}".format(word, count))
结果
相关推荐
kikaylee 2020-07-05
zooozx 2020-06-27
xiaocao0 2020-06-25
pySVNA 2020-06-14
fkyyly 2020-05-31
ustbclearwang 2020-05-09
cqulun 2020-04-19
chongtianfeiyu 2020-04-10
xiaocao0 2020-04-09
fkyyly 2020-04-07
chouliqingke 2020-04-07
fkyyly 2020-03-28
cqulun 2020-02-13
cqulun 2020-02-10
laityc 2020-02-10
小发猫 2020-02-02
fkyyly 2020-01-28