当前位置:   article > 正文

Java回炉重造(三)使用Apache Commons Text库计算文本相似性:jaccard相似系数、余弦相似度_map leftvector

map leftvector

Java回炉重造(三)使用Apache Commons Text库计算文本相似性:jaccard相似系数、余弦相似度

运行结果

这里写图片描述

代码图片

这里写图片描述

code

https://code.csdn.net/u012995856/apache-commons-learn/tree/master

maven依赖

        <dependency>
            <groupId>org.apache.commons</groupId>
            <artifactId>commons-text</artifactId>
            <version>1.1</version>
        </dependency>
  • 1
  • 2
  • 3
  • 4
  • 5

代码

TextSimilaryTest.java

package cn.pangpython.acl.text;

import java.util.HashMap;
import java.util.Map;

import org.apache.commons.text.similarity.CosineSimilarity;
import org.apache.commons.text.similarity.JaccardSimilarity;

/**
 * @Project ApacheCommonsLearn
 * @Package cn.pangpython.acl.text
 * @Author pangPython
 * @Time 下午10:53:59
 */
public class TextSimilaryTest {
    public static void main(String[] args) {
        //计算jaccard相似系数
        JaccardSimilarity jaccardSimilarity = new JaccardSimilarity();
        double jcdsimilary1 = jaccardSimilarity.apply("hello", "hell");
        System.out.println("jcdsimilary1:"+jcdsimilary1);
        double jcdsimilary2 = jaccardSimilarity.apply("this is an apple", "this is an app");
        System.out.println("jcdsimilary2:"+jcdsimilary2);
        //计算余弦相似度
        CosineSimilarity cosineSimilarity = new CosineSimilarity();
        Map<CharSequence, Integer> leftVector = new HashMap<>();
        Map<CharSequence, Integer> rightVector = new HashMap<>();
        leftVector.put("a", 1);
        leftVector.put("b", 0);
        leftVector.put("c", 1);
        rightVector.put("a", 1);
        rightVector.put("b", 1);
        rightVector.put("c", 0);
        double cosSimilary = cosineSimilarity.cosineSimilarity(leftVector, rightVector);
        System.out.println("cosSimilary:"+cosSimilary);
    }
}
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
声明:本文内容由网友自发贡献,不代表【wpsshop博客】立场,版权归原作者所有,本站不承担相应法律责任。如您发现有侵权的内容,请联系我们。转载请注明出处:https://www.wpsshop.cn/w/小舞很执着/article/detail/1012887
推荐阅读
相关标签
  

闽ICP备14008679号