AlphaFold2: Highly accurate protein structure prediction with AlphaFold笔记

作者：一键难忘520 | 2024-06-25 22:42:21

踩

highly accurate protein structure prediction with alphafold

AlphaFold2: Highly accurate protein structure prediction with AlphaFold

笔记：https://www.bilibili.com/video/BV1oR4y1K7Xr/?spm_id_from=333.788

DeepMind 2021年7月的发表在nature上的工作。同一天，University of Washington 发表了RoseTTAFold 并登在了8月的science的封面。

摘要

AlphaFold1不能在原子精度给出蛋白质结构。AlphaFold2在CASP14库结果较好。

整体架构

图1（1）：CASP14各队的成绩比较，AlphFold2的误差为第一条竖线，为1埃原子精度
图1（2）：绿色是实验室结果，蓝色为预测结果，黑色为碳原子
图2为PDB数据集结果

模型整体架构图

编码器

编码器架构

transoformer的编码器和解码器

第一个自注意力实现细节：MSA row-wise gated self-attention with pair bias. Dimensions: s: sequences, r:
residues, c: channels, h: heads.
第一个自注意力的伪代码

第二个自注意力实现细节，MSA column-wise gated self-attention. Dimensions: s: sequences, r: residues,c: channels, h: heads.

MLP层结构

信息融合模块：Outer product mean. Dimensions: s: sequences, r: residues, c: channels.

第三个自注意力机制与第一个注意力机制非常相似，Triangular self-attention around starting node. Dimensions: r: residues, c: channels, h: heads

第三个自注意力机制的伪代码

第四个自注意力机制的伪代码

中间模块，用于减少计算开销和信息传递，Triangular multiplicative update using “outgoing” edges
信息向外传递和信息向内传递
信息传递