A feed-forward layer means a linear layer or a single-layer MLP.
Put plainly, it's just a fully connected (FC) layer.
This comes from the Oxford paper "Do You Even Need Attention? A Stack of Feed-Forward Layers Does Surprisingly Well on ImageNet".
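
To make the two readings concrete, here is a rough pure-Python sketch of what each one computes. The function names, toy weights, and the ReLU activation are my own choices for illustration, not taken from the paper, and I'm assuming "single-layer MLP" means one hidden layer (linear, nonlinearity, linear).

```python
# Illustrative sketch only: hypothetical helper names and toy weights,
# not the paper's code.

def linear(x, weight, bias):
    """Fully connected (FC) layer: y[j] = sum_i weight[j][i] * x[i] + bias[j]."""
    return [
        sum(xi * wji for xi, wji in zip(x, row)) + b
        for row, b in zip(weight, bias)
    ]


def relu(v):
    """Elementwise nonlinearity (ReLU is an assumption made here)."""
    return [max(0.0, a) for a in v]


def single_layer_mlp(x, w1, b1, w2, b2):
    """One hidden layer: linear -> nonlinearity -> linear."""
    return linear(relu(linear(x, w1, b1)), w2, b2)


x = [1.0, 2.0]                               # input with 2 features
w1 = [[1.0, 0.0], [0.0, -1.0], [1.0, 1.0]]   # 2 -> 3 features
b1 = [0.0, 0.0, -1.0]
w2 = [[1.0, 1.0, 1.0]]                       # 3 -> 1 feature
b2 = [0.5]

print(linear(x, w1, b1))                     # [1.0, -2.0, 2.0]
print(single_layer_mlp(x, w1, b1, w2, b2))   # [3.5]
```

In a real framework the `linear` function here would just be the stock fully connected layer, which is why "feed-forward layer" and "FC layer" end up meaning the same thing in practice.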