当前位置:   article > 正文

万字长文解读图像超分辨率 Real-ESRGAN 论文笔记+代码阅读

real-esrgan

目录

一、介绍

二、重点创新

1.ESRGAN 

2.Real-ESRGAN

三、生成器结构

1.整体结构

2.RRDB结构

四、判别器结构

五、高阶退化模型

六、损失函数

1.生成模型损失函数

2.判别模型损失函数


 

一、介绍

        超分辨率(Super-Resolution)指通过硬件或软件的方法提高原有图像的分辨率,通过一系列低分辨率的图像来得到一幅高分辨率的图像过程。通俗的说就是在保持原图像清晰度不变的前提下,将图像放大。使用深度学习模型进行超分已经是比较常用的手段,而且深度学习模型又一个好处,可以在数据增强的时候对数据进行退化处理,在超分的时候也可以做去模糊、去噪、去划痕等操作。        

        深度学习超分模型有几个里程碑:SRCNN > SRGAN > ESRGAN > Real-ESRGAN,SRCNN 和SRGAN 有些古老了,现在基本用不上, Real-ESRGAN是在ESRGAN的基础上做的升级,于是我们主要介绍Real-ESRGAN,用ESRGAN作为补充

        ESRGAN 论文地址:http://arxiv.org/abs/1609.04802

        Real-ESRGAN论文地址:  https://arxiv.org/abs/2107.10833v2

        代码地址:GitHub - oaifaye/dcm-denoise-SR

二、重点创新

1.ESRGAN 

        (1)提出新的backbone:RRDB(Residual in Residual Dense Block)。这里的Dense指的不是全连接而是卷积层中有着密集的残差链接,这样做的好处是可以获得更深入、更复杂的结构,网络容量也变得更高。

        (2)删除BN层。作者发现,BN 层在网络比较深,而且在 GAN 框架下进行训练的时候,更会产生伪影降低了训练的稳定性和一致性。此外,去掉 BN 层也能提高模型的泛化能力,减少计算复杂度和内存占用。

        (3)网络插值(Network Interpolation),或者叫残差缩放。即将残差信息乘以一个 0 到 1 之间的数(通过实验最终确定0.2),这样可以使训练更稳定,在保持纹理的同时的减少伪影。

        (4)使用相对论RaGAN改进了判别器,它学习判断“一幅图像是否比另一幅图像更真实”,而不是“一幅图像是真实的还是假的”。论文给出的图很形象了。backbone用的VGG,这一点在Real-ESRGAN中被替换。而且在Real-ESRGAN中并没有使用RaGAN的判别器...

5ea2d208a6b0494f9441a6c9bd4169b4.png

2.Real-ESRGAN

        Real-ESRGAN的优化是在ESRGAN的基础上做的,主要内容如下:

        (1)给出了一个数据高阶退化过程。即拼接几个典型退化过程来建模(其中还包括sinc filter),从而获得更加接近现实的低质图像。最终作者采用了一个二阶退化过程,以求在简单性和有效性之间取得良好的平衡。这很重要,我们后面重点介绍。

        (2)判别器用U-Net代替VGG。Real-ESRGAN中的鉴别器对复杂的训练输出需要更大的鉴别能力,它还需要为局部纹理产生精确的梯度反馈,而不是只区分全局样式。因此使用更加强大的U-Net作为判别器。输出每个像素的真实度值,并可以向生成器提供详细的每像素反馈,增强了图像对细节上的对抗学习。判别器我们下面也会重点介绍。

        (3)引入谱归一化(Spectral Normalization)以稳定由于复杂数据集和U-Net判别器带来的训练不稳定情况。

三、生成器结构

1.整体结构

        我们以batch_size=1,输入64x64的4x超分为例,生成器整体结构如下:

b543792056aa42c4932437ba11ffa6c5.png

        可以看到整体模型结构并不复杂,大体是一个序贯的结构,数据经过了23个RRDB模块,每个RDDB块由3个ResidualDenseBlock组成,输入和输出形状一样;然后进行两次Unsample,Unsample采用nearest插值,每次Unsample之后会有卷积层来细化插值细节;最后通道数变成3输出。

        其实生成器的大体机构和SRGAN是一致的,但是将Unsample前的16个残差块换成了23个RRDB模块,这极大的提升了特征提取能力,这也是为什么SRGAN能很好的还原图片细节的原因。每个RDDB块由3个ResidualDenseBlock组成,在底部做Add之前,使用了前面提到的网络插值,即输出乘以0.2再和输出相加,这提高了训练的稳定性。

代码实现:

  1. # 位置 basicsr/archs/rrdbnet_arch.py
  2. class RRDBNet(nn.Module):
  3. """Networks consisting of Residual in Residual Dense Block, which is used
  4. in ESRGAN.
  5. ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks.
  6. We extend ESRGAN for scale x2 and scale x1.
  7. Note: This is one option for scale 1, scale 2 in RRDBNet.
  8. We first employ the pixel-unshuffle (an inverse operation of pixelshuffle to reduce the spatial size
  9. and enlarge the channel size before feeding inputs into the main ESRGAN architecture.
  10. Args:
  11. num_in_ch (int): Channel number of inputs.
  12. num_out_ch (int): Channel number of outputs.
  13. num_feat (int): Channel number of intermediate features.
  14. Default: 64
  15. num_block (int): Block number in the trunk network. Defaults: 23
  16. num_grow_ch (int): Channels for each growth. Default: 32.
  17. """
  18. def __init__(self, num_in_ch, num_out_ch, scale=4, num_feat=64, num_block=23, num_grow_ch=32):
  19. super(RRDBNet, self).__init__()
  20. self.scale = scale
  21. if scale == 2:
  22. num_in_ch = num_in_ch * 4
  23. elif scale == 1:
  24. num_in_ch = num_in_ch * 16
  25. self.conv_first = nn.Conv2d(num_in_ch, num_feat, 3, 1, 1)
  26. self.body = make_layer(RRDB, num_block, num_feat=num_feat, num_grow_ch=num_grow_ch)
  27. self.conv_body = nn.Conv2d(num_feat, num_feat, 3, 1, 1)
  28. # upsample
  29. self.conv_up1 = nn.Conv2d(num_feat, num_feat, 3, 1, 1)
  30. self.conv_up2 = nn.Conv2d(num_feat, num_feat, 3, 1, 1)
  31. self.conv_hr = nn.Conv2d(num_feat, num_feat, 3, 1, 1)
  32. self.conv_last = nn.Conv2d(num_feat, num_out_ch, 3, 1, 1)
  33. self.lrelu = nn.LeakyReLU(negative_slope=0.2, inplace=True)
  34. def forward(self, x):
  35. if self.scale == 2:
  36. feat = pixel_unshuffle(x, scale=2)
  37. elif self.scale == 1:
  38. feat = pixel_unshuffle(x, scale=4)
  39. else:
  40. feat = x
  41. feat = self.conv_first(feat)
  42. # 23个RRDB
  43. body_feat = self.conv_body(self.body(feat))
  44. feat = feat + body_feat
  45. # upsample
  46. feat = self.lrelu(self.conv_up1(F.interpolate(feat, scale_factor=2, mode='nearest')))
  47. feat = self.lrelu(self.conv_up2(F.interpolate(feat, scale_factor=2, mode='nearest')))
  48. out = self.conv_last(self.lrelu(self.conv_hr(feat)))
  49. return out

2.RRDB结构

        Real-ESRGAN核心是RRDB,特点是密集的残差链接,同时残差边两端以Concat的方式相连,结构图如下:

e3c277ec03304ebe828085c57fae97a4.png

         是不是很dense,看上去热闹,其实是有规律,也就是每个卷积激活层的输出会作为下面所有节点的输入。因为有4个concat操作,每个Concat节点的度(出度+入度)都是4。

代码实现:

  1. # 位置 basicsr/archs/rrdbnet_arch.py
  2. class ResidualDenseBlock(nn.Module):
  3. """Residual Dense Block.
  4. Used in RRDB block in ESRGAN.
  5. Args:
  6. num_feat (int): Channel number of intermediate features.
  7. num_grow_ch (int): Channels for each growth.
  8. """
  9. def __init__(self, num_feat=64, num_grow_ch=32):
  10. super(ResidualDenseBlock, self).__init__()
  11. self.conv1 = nn.Conv2d(num_feat, num_grow_ch, 3, 1, 1)
  12. self.conv2 = nn.Conv2d(num_feat + num_grow_ch, num_grow_ch, 3, 1, 1)
  13. self.conv3 = nn.Conv2d(num_feat + 2 * num_grow_ch, num_grow_ch, 3, 1, 1)
  14. self.conv4 = nn.Conv2d(num_feat + 3 * num_grow_ch, num_grow_ch, 3, 1, 1)
  15. self.conv5 = nn.Conv2d(num_feat + 4 * num_grow_ch, num_feat, 3, 1, 1)
  16. self.lrelu = nn.LeakyReLU(negative_slope=0.2, inplace=True)
  17. # initialization
  18. default_init_weights([self.conv1, self.conv2, self.conv3, self.conv4, self.conv5], 0.1)
  19. def forward(self, x):
  20. x1 = self.lrelu(self.conv1(x))
  21. x2 = self.lrelu(self.conv2(torch.cat((x, x1), 1)))
  22. x3 = self.lrelu(self.conv3(torch.cat((x, x1, x2), 1)))
  23. x4 = self.lrelu(self.conv4(torch.cat((x, x1, x2, x3), 1)))
  24. x5 = self.conv5(torch.cat((x, x1, x2, x3, x4), 1))
  25. # Empirically, we use 0.2 to scale the residual for better performance
  26. return x5 * 0.2 + x
  27. class RRDB(nn.Module):
  28. """Residual in Residual Dense Block.
  29. Used in RRDB-Net in ESRGAN.
  30. Args:
  31. num_feat (int): Channel number of intermediate features.
  32. num_grow_ch (int): Channels for each growth.
  33. """
  34. def __init__(self, num_feat, num_grow_ch=32):
  35. super(RRDB, self).__init__()
  36. self.rdb1 = ResidualDenseBlock(num_feat, num_grow_ch)
  37. self.rdb2 = ResidualDenseBlock(num_feat, num_grow_ch)
  38. self.rdb3 = ResidualDenseBlock(num_feat, num_grow_ch)
  39. def forward(self, x):
  40. out = self.rdb1(x)
  41. out = self.rdb2(out)
  42. out = self.rdb3(out)
  43. # Empirically, we use 0.2 to scale the residual for better performance
  44. return out * 0.2 + x

四、判别器结构

        判别器使用带有谱归一化的U-Net,结构如下:

5c40a2662df0421093570f1df161a4d6.png

         判别器分三部分:

        Downsample:三层带有谱归一化的卷积层,每层通道翻倍,宽高减半。

        Unsample:使用nearest插值做上采样,三层带有谱归一化的卷积层,每层通道减半,宽高翻倍,同时与Downsample有残差边相连。

         输出层:两层有谱归一化的卷积、一层卷积输出层。

五、高阶退化模型

        高阶退化模型(High-order Degradation Model)是Real-ESRGAN最重要的创新点。经典的退化模型不能模拟一些复杂的退化问题,特别是未知的噪声和复杂的伪影,这是因为合成的低分辨率图像与现实的退化图像仍然有很大的差距。因此,Real-ESRGAN将经典的退化模型扩展到高阶过程,以模拟更实际的退化。

        所谓高阶退化模型通俗的说就是将经典退化算法排列组合,本文将退化算法分为Blur、Resize、Noise、JPEG Compression四类,如下图:

58ace54bf078404d82621d16d17727ce.png

        从代码中可以看出,整个退化模型循环两遍上面四种退化过程,每个过程随机选一种算法,步骤如下: 

        1.1 Blur:概率选择使用sinc filter还是其他模糊算法(iso/aniso/generalized_iso/generalized_aniso/plateau_iso/plateau_aniso),sinc filter概率默认10%。sinc filter是为了模拟振铃伪影(ring artifacts)和过冲伪影(overshoot artifacts),两种伪影长这个样子:

613ce3c8433940099ddb14db7a779818.png

        1.2 Resize:随机放大或缩小,插值方式area/bilinear/bicubic选一个;

        1.3 Noise:噪声分布随机选择gaussian/poisson;噪声形式随机选择color/gray,color噪声就是三通道数值不一样(默认概率60%),gray噪声三通道数值一样(默认概率40%);

        1.4 JPEG compression:JPEG压缩,默认质量30-950;

        2.1 Blur:默认80%概率执行,同1.1;

        2.2 Resize:同1.2;

        2.3 Noise:同1.3;

        2.4 JPEG compression:这一步比较特殊,有两个组合可选[resize back + sinc filter] + JPEG compression  / 
JPEG compression + [resize back + sinc filter], 其中resize back是吧突变resize成gt_size

       随机各种退化核的代码在realesrgan_dataset.py中,代码如下:

  1. # 位置 realesrgan/data/realesrgan_dataset.py
  2. ......
  3. # ------------------------ 随机生成第一步的各种退化核 ------------------------ #
  4. kernel_size = random.choice(self.kernel_range)
  5. # 概率选择使用sinc filter还是其他模糊算法,sinc filter概率默认10%
  6. if np.random.uniform() < self.opt['sinc_prob']:
  7. # this sinc filter setting is for kernels ranging from [7, 21]
  8. if kernel_size < 13:
  9. omega_c = np.random.uniform(np.pi / 3, np.pi)
  10. else:
  11. omega_c = np.random.uniform(np.pi / 5, np.pi)
  12. kernel = circular_lowpass_kernel(omega_c, kernel_size, pad_to=False)
  13. else:
  14. # blur
  15. kernel = random_mixed_kernels(
  16. self.kernel_list,
  17. self.kernel_prob,
  18. kernel_size,
  19. self.blur_sigma,
  20. self.blur_sigma, [-math.pi, math.pi],
  21. self.betag_range,
  22. self.betap_range,
  23. noise_range=None)
  24. # pad kernel
  25. pad_size = (21 - kernel_size) // 2
  26. kernel = np.pad(kernel, ((pad_size, pad_size), (pad_size, pad_size)))
  27. # ------------------------ 随机生成第一步的各种退化核 ------------------------ #
  28. kernel_size = random.choice(self.kernel_range)
  29. if np.random.uniform() < self.opt['sinc_prob2']:
  30. if kernel_size < 13:
  31. omega_c = np.random.uniform(np.pi / 3, np.pi)
  32. else:
  33. omega_c = np.random.uniform(np.pi / 5, np.pi)
  34. kernel2 = circular_lowpass_kernel(omega_c, kernel_size, pad_to=False)
  35. else:
  36. kernel2 = random_mixed_kernels(
  37. self.kernel_list2,
  38. self.kernel_prob2,
  39. kernel_size,
  40. self.blur_sigma2,
  41. self.blur_sigma2, [-math.pi, math.pi],
  42. self.betag_range2,
  43. self.betap_range2,
  44. noise_range=None)
  45. # pad kernel
  46. pad_size = (21 - kernel_size) // 2
  47. kernel2 = np.pad(kernel2, ((pad_size, pad_size), (pad_size, pad_size)))
  48. # ------------------------------------- 随机最后一部中的 sinc kernel ------------------------------------- #
  49. if np.random.uniform() < self.opt['final_sinc_prob']:
  50. kernel_size = random.choice(self.kernel_range)
  51. omega_c = np.random.uniform(np.pi / 3, np.pi)
  52. sinc_kernel = circular_lowpass_kernel(omega_c, kernel_size, pad_to=21)
  53. sinc_kernel = torch.FloatTensor(sinc_kernel)
  54. else:
  55. sinc_kernel = self.pulse_tensor
  56. ......

        执行退化流程大代码:

  1. # realesrgan/models/realesrgan_model.py
  2. ......
  3. # ----------------------- The first degradation process ----------------------- #
  4. # 1.1 执行blur
  5. out = filter2D(self.gt_usm, self.kernel1)
  6. # 1.2 执行random resize
  7. updown_type = random.choices(['up', 'down', 'keep'], self.opt['resize_prob'])[0]
  8. if updown_type == 'up':
  9. scale = np.random.uniform(1, self.opt['resize_range'][1])
  10. elif updown_type == 'down':
  11. scale = np.random.uniform(self.opt['resize_range'][0], 1)
  12. else:
  13. scale = 1
  14. mode = random.choice(['area', 'bilinear', 'bicubic'])
  15. out = F.interpolate(out, scale_factor=scale, mode=mode)
  16. # 1.3 执行add noise
  17. gray_noise_prob = self.opt['gray_noise_prob']
  18. if np.random.uniform() < self.opt['gaussian_noise_prob']:
  19. out = random_add_gaussian_noise_pt(
  20. out, sigma_range=self.opt['noise_range'], clip=True, rounds=False, gray_prob=gray_noise_prob)
  21. else:
  22. out = random_add_poisson_noise_pt(
  23. out,
  24. scale_range=self.opt['poisson_scale_range'],
  25. gray_prob=gray_noise_prob,
  26. clip=True,
  27. rounds=False)
  28. # 1.4 执行JPEG compression
  29. jpeg_p = out.new_zeros(out.size(0)).uniform_(*self.opt['jpeg_range'])
  30. out = torch.clamp(out, 0, 1) # clamp to [0, 1], otherwise JPEGer will result in unpleasant artifacts
  31. out = self.jpeger(out, quality=jpeg_p)
  32. # ----------------------- The second degradation process ----------------------- #
  33. # 2.1 blur
  34. if np.random.uniform() < self.opt['second_blur_prob']:
  35. out = filter2D(out, self.kernel2)
  36. # 2.2 random resize
  37. updown_type = random.choices(['up', 'down', 'keep'], self.opt['resize_prob2'])[0]
  38. if updown_type == 'up':
  39. scale = np.random.uniform(1, self.opt['resize_range2'][1])
  40. elif updown_type == 'down':
  41. scale = np.random.uniform(self.opt['resize_range2'][0], 1)
  42. else:
  43. scale = 1
  44. mode = random.choice(['area', 'bilinear', 'bicubic'])
  45. out = F.interpolate(
  46. out, size=(int(ori_h / self.opt['scale'] * scale), int(ori_w / self.opt['scale'] * scale)), mode=mode)
  47. # 2.3 add noise
  48. gray_noise_prob = self.opt['gray_noise_prob2']
  49. if np.random.uniform() < self.opt['gaussian_noise_prob2']:
  50. out = random_add_gaussian_noise_pt(
  51. out, sigma_range=self.opt['noise_range2'], clip=True, rounds=False, gray_prob=gray_noise_prob)
  52. else:
  53. out = random_add_poisson_noise_pt(
  54. out,
  55. scale_range=self.opt['poisson_scale_range2'],
  56. gray_prob=gray_noise_prob,
  57. clip=True,
  58. rounds=False)
  59. # 2.4 执行JPEG compression和收尾操作
  60. # 我们还需要将图像调整到所需的大小。我们将[size back + sinc filter]组合在一起操作。
  61. # 有两个选项可选:
  62. # 1. [resize back + sinc filter] + JPEG compression
  63. # 2. JPEG compression + [resize back + sinc filter]
  64. # 根据经验,我们发现组合(sinc + JPEG + Resize)会引入扭曲的线条。
  65. if np.random.uniform() < 0.5:
  66. # resize back + the final sinc filter
  67. mode = random.choice(['area', 'bilinear', 'bicubic'])
  68. out = F.interpolate(out, size=(ori_h // self.opt['scale'], ori_w // self.opt['scale']), mode=mode)
  69. out = filter2D(out, self.sinc_kernel)
  70. # JPEG compression
  71. jpeg_p = out.new_zeros(out.size(0)).uniform_(*self.opt['jpeg_range2'])
  72. out = torch.clamp(out, 0, 1)
  73. out = self.jpeger(out, quality=jpeg_p)
  74. else:
  75. # JPEG compression
  76. jpeg_p = out.new_zeros(out.size(0)).uniform_(*self.opt['jpeg_range2'])
  77. out = torch.clamp(out, 0, 1)
  78. out = self.jpeger(out, quality=jpeg_p)
  79. # resize back + the final sinc filter
  80. mode = random.choice(['area', 'bilinear', 'bicubic'])
  81. out = F.interpolate(out, size=(ori_h // self.opt['scale'], ori_w // self.opt['scale']), mode=mode)
  82. out = filter2D(out, self.sinc_kernel)
  83. ......

六、损失函数

        先说明一下数学符号:

        eq?x:输入

        eq?%5Cphi:VGG19模型

        eq?y:ground truth

        eq?G生成模型

        eq?D:判别模型

        eq?y%5E%7Br%7D:真实的label,就是一个全是1的矩阵

        eq?y%5E%7Bf%7D:G模型生成的假的label,就是一个全是0的矩阵

1.生成模型损失函数

        生成模型损失函数:

eq?L_%7BG%7D%3DL_%7Bpercep%7D&plus;%5Clambda%20L_%7BG%7D&plus;%5Ceta%20L_%7B1%7D

        eq?%5Clambda默认0.1,eq?%5Ceta默认1

        eq?L_%7Bpercep%7D:感知损失,将gt和生成模型的输出分别送入预训练VGG19,取conv1_2(bx64x256x256)、conv2_2(bx128,128x128)、conv3_4(bx256x64x64)、conv4_4(bx512x32x32)、conv5_4(bx512x16x16)层的数据,然后计算L1loss,公式如下:

eq?L_%7Bpercep%7D%3D%5Cleft%20%5C%7C%20%5Cphi%20%28x_%7Bi%7D%29-%20%5Cphi%20%28y_%7Bi%7D%29%5Cright%20%5C%7C_%7B1%7D

        eq?L_%7BG%7D:GANLoss,将生成模型的输出送入判别模型(U-Net),将结果(bx1x256x256)和babel(全是1)计算二进制交叉熵损失(BCELoss),公式如下:

eq?L_%7BG%7D%3D-%28y_%7Bi%7D%5E%7Br%7DlogD%28x_%7Bi%7D%29%29%20-%20%281-y_%7Bi%7D%5E%7Br%7D%29log%281-D%28x_%7Bi%7D%29%29%3D-%28y_%7Bi%7D%5E%7Br%7DlogD%28x_%7Bi%7D%29%29

        eq?L_%7B1%7D:gt和生成模型的输出直接计算L1loss,公式如下:

eq?L_%7B1%7D%3Dmean%5Cleft%20%5C%7C%20G%28x_%7Bi%7D%29-y_%7Bi%7D%20%5Cright%20%5C%7C_%7B1%7D

        代码实现:

  1. # 位置 realesrgan/models/realesrgan_model.py
  2. # pixel loss
  3. if self.cri_pix:
  4. l_g_pix = self.cri_pix(self.output, l1_gt)
  5. l_g_total += l_g_pix
  6. loss_dict['l_g_pix'] = l_g_pix
  7. # perceptual loss
  8. if self.cri_perceptual:
  9. l_g_percep, l_g_style = self.cri_perceptual(self.output, percep_gt)
  10. if l_g_percep is not None:
  11. l_g_total += l_g_percep
  12. loss_dict['l_g_percep'] = l_g_percep
  13. if l_g_style is not None:
  14. l_g_total += l_g_style
  15. loss_dict['l_g_style'] = l_g_style
  16. # gan loss
  17. fake_g_pred = self.net_d(self.output)
  18. l_g_gan = self.cri_gan(fake_g_pred, True, is_disc=False)
  19. l_g_total += l_g_gan
  20. loss_dict['l_g_gan'] = l_g_gan
  21. l_g_total.backward()
  22. self.optimizer_g.step()

2.判别模型损失函数

        Real-ESRGAN的判别模型优化分两步:

        (1)优化判别真的能力,即构造一个全是1的eq?y%5E%7Br%7D,然后计算eq?D%28y_%7Bi%7D%29eq?y%5E%7Br%7D的BECLoss,公式如下:

eq?L_%7BD%7D%5E%7Br%7D%3D-%28y_%7Bi%7D%5E%7Br%7DlogD%28y_%7Bi%7D%29%29%20-%20%281-y_%7Bi%7D%29log%281-D%28y_%7Bi%7D%29%29%3D-%28y_%7Bi%7D%5E%7Br%7DlogD%28y_%7Bi%7D%29%29

        (2)优化判别假的能力,即构造一个全是0的eq?y%5E%7Bf%7D,然后计算eq?D%28y_%7Bi%7D%29eq?y%5E%7Bf%7D的BECLoss,公式如下:

eq?L_%7BD%7D%5E%7Bf%7D%3D-%28y_%7Bi%7D%5E%7Bf%7DlogD%28y_%7Bi%7D%29%29%20-%20%281-y_%7Bi%7D%5E%7Bf%7D%29log%281-D%28y_%7Bi%7D%29%29%3D%28y_%7Bi%7D%5E%7Bf%7D-1%29log%281-D%28y_%7Bi%7D%29%29

        代码实现:

  1. # 位置 realesrgan/models/realesrgan_model.py
  2. self.optimizer_d.zero_grad()
  3. # real
  4. real_d_pred = self.net_d(gan_gt)
  5. l_d_real = self.cri_gan(real_d_pred, True, is_disc=True)
  6. loss_dict['l_d_real'] = l_d_real
  7. loss_dict['out_d_real'] = torch.mean(real_d_pred.detach())
  8. l_d_real.backward()
  9. # fake
  10. fake_d_pred = self.net_d(self.output.detach().clone()) # clone for pt1.9
  11. l_d_fake = self.cri_gan(fake_d_pred, False, is_disc=True)
  12. loss_dict['l_d_fake'] = l_d_fake
  13. loss_dict['out_d_fake'] = torch.mean(fake_d_pred.detach())
  14. l_d_fake.backward()
  15. self.optimizer_d.step()

 

        Real-ESRGAN就介绍到这里,还有很多关于Real-ESRGAN实现的细节,很快会再更一期,关注不迷路!!!

关注订阅号了解更多精品文章

c168ea67aa0d45e4a8db954fd4eaa145.jpeg
交流探讨、商务合作请加微信

ddac0a92f7884455ae19fa73f5bf6dbd.jpeg

 

声明:本文内容由网友自发贡献,不代表【wpsshop博客】立场,版权归原作者所有,本站不承担相应法律责任。如您发现有侵权的内容,请联系我们。转载请注明出处:https://www.wpsshop.cn/w/不正经/article/detail/442228
推荐阅读
相关标签
  

闽ICP备14008679号