赞
踩
本文主要介绍通过python实现数据聚类、脚本开发、办公自动化。读取voc数据,聚类voc数据。
python learning.py
# VOC数据聚类 import pandas as pd import jieba import jieba.posseg as pseg from collections import Counter fileName = "100034532823" # sku # 加载数据 df = pd.read_excel('clean/cleaned_voc'+fileName+'.xlsx') # 创建一个 Counter 对象来存储词频统计结果 counter = Counter() # 加载停用词 with open('stopwordsfull', 'r', encoding='utf-8',errors='replace') as f: stopwords = [line.strip() for line in f.readlines()] # print(stopwords
Copyright © 2003-2013 www.wpsshop.cn 版权所有,并保留所有权利。