赞
踩
翻译自: https://www.zhihu.com/question/304798594
https://www.jianshu.com/p/60deff0f64e1 这篇文章总结得也很好
These metrics are all used to evaluate the quality of text generation under supervision. The general approach is to compare the similarity of a candidate text (usually generated by a machine) and several other reference texts (usually marked by humans). However, the applicable scenarios are slightly different. BLEU, METEOR, ROUGE are generally used in translation, and CIDEr is generally used in image captioning.
Copyright © 2003-2013 www.wpsshop.cn 版权所有,并保留所有权利。