自然语言识别python_一个快速从自然语言文本中提取和识别关键短语的工具

作者：小桥流水78 | 2024-08-06 09:18:40

踩

如何通过自然语言识别文字中的物品

chinese_keyphrase_extractor (CKPE)

一个从中文自然语言文本中抽取关键短语的工具，只消耗 35M 内存

A tool for automatic keyphrase extraction from Chinese text.

本项目即将迁移至 jionlp 工具包，性能更好，速度更快哦~~~

应用场景 Application scenario

1.抽取关键短语

在很多关键词提取任务中，使用tfidf、textrank等方法提取得到的仅仅是若干零碎词汇。

这样的零碎词汇无法真正的表达文章的原本含义，我们并不想要它。

In many keyword extraction tasks, only a few fragmentary words are extracted when using tfidf, textrank and other methods.

Such fragmentary words cannot really express the original meaning of the article. We do not want it.

例如：

For example:

>>> text = '朝鲜确认金正恩出访俄罗斯将与普京举行会谈...'

>>> keywords = ['俄罗斯', '朝鲜', '普京', '金正恩', '俄方']

我们往往需要更细化的短语描述，来作为文本的关键信息展示。这样的需求在生成词云、提供摘要阅读、关键信息检索等任务中都非常重要。

We often need more detailed phrase descriptions to display the key information of the text. Such requirements, namely keyphrases extraction, are very important in generating word cloud, providing abstract reading, key information retrieval and other tasks.

例如： For example:

>>> phrases = ['俄罗斯克里姆林宫', '邀请金正恩访俄', '最高司令官金正恩',

'朝方转交普京', '举行会谈']

2.扩展相关短语词汇

有时产

声明：本文内容由网友自发贡献，不代表【wpsshop博客】立场，版权归原作者所有，本站不承担相应法律责任。如您发现有侵权的内容，请联系我们。转载请注明出处：https://www.wpsshop.cn/w/小桥流水78/article/detail/936821