赞
踩
本文主要介绍通过python实现数据清洗、脚本开发、办公自动化。读取voc数据,存储新清洗后的voc数据数据。
python clean.py
# voc数据清洗 import pandas as pd import jieba import jieba.posseg as pseg from collections import Counter import re fileName = "100070291457" # sku # 加载停用词 with open('stopwordsfull', 'r', encoding='utf-8',errors='replace') as f: stopwords = [line.strip() for line in f.readlines()] # 加载自定义词典 jieba.load_userdict("luyouqi.txt") # luyouqi.txt is your custom dictionary # 读取Excel文件中的所有表格 xls = pd.ExcelFile('fil
Copyright © 2003-2013 www.wpsshop.cn 版权所有,并保留所有权利。