Rank-BM25: A two line search engine
A collection of algorithms for querying a set of documents and returning the ones most relevant to the query. The most common use case for these algorithms is, as you might have guessed, to create search engines.
So far the algorithms that have been implemented are:
Okapi BM25
BM25L
BM25+
BM25-Adpt
BM25T
These algorithms were taken from this paper, which gives a nice overview of each method and benchmarks them against each other. It also compares different kinds of preprocessing, such as stemming vs. no stemming and stopword removal vs. none. It's a great read if you're new to the topic.
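To give a feel for what the first algorithm in the list computes, here is a minimal sketch of Okapi BM25 scoring in plain Python. It is not this package's implementation and does not use its API. The function name `bm25_scores`, the defaults `k1=1.5` and `b=0.75`, the smoothed IDF variant, and the lowercase whitespace tokenization are all assumptions made for illustration.

```python
import math
from collections import Counter


def bm25_scores(query, corpus, k1=1.5, b=0.75):
    """Score every tokenized document in `corpus` against a tokenized `query`.

    Hypothetical helper for illustration only; k1 and b are assumed defaults.
    """
    n_docs = len(corpus)
    avgdl = sum(len(doc) for doc in corpus) / n_docs
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in corpus for term in set(doc))

    scores = []
    for doc in corpus:
        tf = Counter(doc)
        # Length normalization: longer-than-average documents are penalized.
        norm = k1 * (1 - b + b * len(doc) / avgdl)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            # Smoothed, always-positive IDF (an assumption of this sketch).
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            score += idf * tf[term] * (k1 + 1) / (tf[term] + norm)
        scores.append(score)
    return scores


# Toy corpus, tokenized by lowercasing and splitting on whitespace.
corpus = [
    "Hello there good man!",
    "It is quite windy in London",
    "How is the weather today?",
]
tokenized_corpus = [doc.lower().split() for doc in corpus]
query = "windy london".split()

scores = bm25_scores(query, tokenized_corpus)
best = corpus[max(range(len(corpus)), key=scores.__getitem__)]
print(best)  # It is quite windy in London
```

Only the second document contains either query term, so it is the only one with a non-zero score. The other variants in the list adjust the term-frequency or length-normalization part of this formula in different ways; see the paper for the details.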
Installation
The easiest way to install this package is through pip, using
pip install rank_bm25
If you want to be sure you're getti