import jieba.analyse
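Extraction example:
- import jieba.analyse as analyse
- import pandas as pd
- # Load the technology news corpus and drop rows with missing values
- df = pd.read_csv('./origin_data/technology_news.csv')
- df = df.dropna()
- # Concatenate every article body into a single string
- lines = df.content.values.tolist()
- content = "".join(lines)
- # Print the top 30 keywords (extract_tags ranks words by TF-IDF)
- print(" ".join(analyse.extract_tags(content, topK=30, withWeight=False, allowPOS=())))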
Algorithm paper: http://web.eecs.umich.edu/~mihalcea/papers/mihalcea.emnlp04.pdf
Basic idea:
The core of TextRank is PageRank. For an introduction to PageRank, see: https://www.jianshu.com/p/f6d66ab97332
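To make the PageRank connection concrete, here is a minimal pure-Python sketch of the idea, not jieba's actual implementation. It assumes the text is already tokenized, links words that co-occur within a sliding window, and then runs a PageRank-style iteration over that graph. The function name `textrank_keywords` and the defaults `window=5`, `damping=0.85`, and `iterations=50` are illustrative assumptions.

```python
from collections import defaultdict


def textrank_keywords(words, window=5, damping=0.85, iterations=50, top_k=5):
    """Rank words with a PageRank-style iteration on a co-occurrence graph.

    Hypothetical helper for illustration; parameter defaults are assumptions.
    """
    # Build an undirected weighted graph: two different words are linked
    # whenever they appear within `window` positions of each other.
    weights = defaultdict(lambda: defaultdict(float))
    for i, w in enumerate(words):
        for j in range(i + 1, min(i + window, len(words))):
            v = words[j]
            if v == w:
                continue
            weights[w][v] += 1.0
            weights[v][w] += 1.0

    nodes = list(weights)
    if not nodes:
        return []

    scores = {n: 1.0 for n in nodes}
    # Total edge weight leaving each node, used to normalize its votes.
    out_sum = {n: sum(weights[n].values()) for n in nodes}

    for _ in range(iterations):
        new_scores = {}
        for n in nodes:
            # Each neighbor m passes on a share of its score proportional
            # to the weight of the edge m-n.
            rank = sum(weights[m][n] / out_sum[m] * scores[m] for m in weights[n])
            new_scores[n] = (1 - damping) + damping * rank
        scores = new_scores

    return sorted(scores, key=scores.get, reverse=True)[:top_k]


tokens = ["a", "hub", "b", "hub", "c", "hub", "d", "hub"]
print(textrank_keywords(tokens, window=2, top_k=1))
```

In this toy input every other token is linked to "hub", so it collects the most votes and ranks first. In practice the token list would come from a segmenter, with stop words and unwanted parts of speech filtered out beforehand.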
Extraction example:
- import jieba.analyse as analyse
- import pandas as pd
- # Load the military news corpus and drop rows with missing values
- df = pd.read_csv("./origin_data/military_news.csv", encoding='utf-8')
- df = df.dropna()
- lines = df.content.values.tolist()
- content = "".join(lines)
-
- # Top 20 keywords, keeping place names (ns), nouns (n), verbal nouns (vn) and verbs (v)
- print(" ".join(analyse.textrank(content, topK=20, withWeight=False, allowPOS=('ns', 'n', 'vn', 'v'))))
- print("---------------------separator----------------")
- # Same ranking restricted to place names and nouns
- print(" ".join(analyse.textrank(content, topK=20, withWeight=False, allowPOS=('ns', 'n'))))