Machine learning algorithms prefer well-defined, fixed-length inputs and outputs
Feature engineering (FE) was the key to ML methods before deep learning (DL)
DL trains deep neural networks to extract features automatically, whereas many classical ML methods rely on FE to extract them
int/float: use directly, or bin into n unique int values (numeric transformation)
Categorical data: one-hot encoding
Date-time: expand into a list of features (time transformation)
Feature combination: Cartesian product of two feature groups (feature crossing)
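The four tabular transformations above can be sketched in plain Python. This is a minimal illustration, not a reference implementation: the function names, the equal-width binning rule, the particular date-time fields, and the `_x_` separator for crossed features are all assumptions made for the example.

```python
from datetime import datetime
from itertools import product


def bin_value(x, low, high, n_bins):
    """Map a number to an equal-width bin index in 0..n_bins-1 (assumed scheme)."""
    if high <= low:
        raise ValueError("high must be greater than low")
    idx = int((x - low) / (high - low) * n_bins)
    return min(max(idx, 0), n_bins - 1)  # clamp values outside [low, high]


def one_hot(value, categories):
    """Return a 0/1 vector with a 1 at the position of `value` (all zeros if unseen)."""
    return [1 if value == c else 0 for c in categories]


def datetime_features(ts):
    """Expand a datetime into numeric features; the chosen fields are illustrative."""
    return [ts.year, ts.month, ts.day, ts.weekday(), ts.hour]


def cross_features(group_a, group_b):
    """Cartesian product of two categorical groups, joined into combined names."""
    return [f"{a}_x_{b}" for a, b in product(group_a, group_b)]


age_bin = bin_value(37, low=0, high=100, n_bins=10)
color_vec = one_hot("green", ["red", "green", "blue"])
time_vec = datetime_features(datetime(2021, 10, 1, 9, 30))
crossed = cross_features(["cat", "dog"], ["male", "female"])
```

Each helper returns a plain int or list, so the outputs can be concatenated into one fixed-length feature vector per row. The crossed names would typically be one-hot encoded afterwards.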
Represent text as token features (convert the text into tokens)
Bag-of-words (BoW) model
Word embeddings (e.g. Word2vec)
Pre-trained language models (e.g. BERT, GPT-3): use a pre-trained deep neural network to extract features
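Of the text representations above, bag-of-words is the simplest to show end to end. The sketch below is a hedged illustration only: the whitespace tokenizer, the sorted vocabulary order, and the function names are assumptions, and real pipelines usually add punctuation handling and vocabulary pruning.

```python
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def build_vocab(docs):
    """Assign each distinct token an index, in sorted order for determinism."""
    tokens = sorted({tok for doc in docs for tok in tokenize(doc)})
    return {tok: i for i, tok in enumerate(tokens)}


def bow_vector(text, vocab):
    """Count occurrences of each vocabulary token; unknown tokens are ignored."""
    counts = Counter(tokenize(text))
    return [counts.get(tok, 0) for tok in vocab]


docs = ["the cat sat", "the dog sat on the cat"]
vocab = build_vocab(docs)
vectors = [bow_vector(doc, vocab) for doc in docs]
```

Every document maps to a vector whose length equals the vocabulary size, which gives the fixed-length input that ML algorithms prefer. Word order is discarded, which is the main limitation that embeddings and pre-trained language models address.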