当前位置:   article > 正文

Windows 系统上安装 CUDA Toolkit 10.1

cuda toolkit 10.1

1. NVCUDA.DLL - NVIDIA CUDA 10.1.135 driver - NVIDIA 驱动程序版本

NVIDIA 控制面板 -> 帮助 -> 系统信息 -> 组件

在这里插入图片描述

2. CUDA Toolkit

CUDA Toolkit
https://developer.nvidia.com/cuda-toolkit

CUDA Toolkit Download
https://developer.nvidia.com/cuda-downloads

CUDA Toolkit Archive
https://developer.nvidia.com/cuda-toolkit-archive

3. CUDA Toolkit 10.1 original Archive

CUDA Toolkit 10.1 original Archive
https://developer.nvidia.com/cuda-10.1-download-archive-base

CUDA Toolkit 10.1 (Feb 2019), Online Documentation
https://docs.nvidia.com/cuda/archive/10.1/

The checksums for the installer and patches can be found in Installer Checksums.
https://developer.download.nvidia.cn/compute/cuda/10.1/Prod/docs/sidebar/md5sum.txt

For further information, see the Installation Guide for Microsoft Windows and the CUDA Quick Start Guide.
https://docs.nvidia.com/cuda/cuda-installation-guide-microsoft-windows/index.html
https://docs.nvidia.com/cuda/cuda-quick-start-guide/index.html

在这里插入图片描述

4. cuda_10.1.105_418.96_win10.exe

1. Double click cuda_10.1.105_418.96_win10.exe

C:\Users\cheng\AppData\Local\Temp\CUDA

在这里插入图片描述

2. NVIDIA 软件许可协议

在这里插入图片描述

在这里插入图片描述

3. 自定义

在这里插入图片描述

在这里插入图片描述

在这里插入图片描述

4. 选择安装位置

在这里插入图片描述

5. NVIDIA 安装程序失败

在这里插入图片描述

在这里插入图片描述

出现 “NVIDIA 安装程序失败” 的提示,关闭安装界面。重新安装,在 “自定义安装选项” 页面,取消勾选 Visual Studio Integration,即可正常安装。

6. 自定义安装选项 - 取消勾选 Visual Studio Integration

在这里插入图片描述

在这里插入图片描述

在这里插入图片描述

7. CUDA Samples

C:\ProgramData\NVIDIA Corporation\CUDA Samples\v10.1

在这里插入图片描述

5. nvcc -V

Microsoft Windows [版本 10.0.17763.1158]
(c) 2018 Microsoft Corporation。保留所有权利。

C:\Users\cheng>nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Fri_Feb__8_19:08:26_Pacific_Standard_Time_2019
Cuda compilation tools, release 10.1, V10.1.105

C:\Users\cheng>
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10

在这里插入图片描述

6. C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1

在这里插入图片描述

7. 环境变量 -> 系统变量

CUDA_PATH
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1

CUDA_PATH_V10_1
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1

NVCUDASAMPLES_ROOT
C:\ProgramData\NVIDIA Corporation\CUDA Samples\v10.1

NVCUDASAMPLES10_1_ROOT
C:\ProgramData\NVIDIA Corporation\CUDA Samples\v10.1

在这里插入图片描述

8. demo_suite

C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite

在这里插入图片描述

Shift + 鼠标右键 -> 在此处打开 Powershell 窗口

在这里插入图片描述

PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite> dir


    目录: C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite


Mode                LastWriteTime         Length Name
----                -------------         ------ ----
d-----         2020/5/4     21:16                nbody_data_files
d-----         2020/5/4     21:16                oceanFFT_data_files
d-----         2020/5/4     21:16                randomFog_data_files
-a----         2019/2/9     13:57         250880 bandwidthTest.exe
-a----         2019/2/9     13:57         279552 busGrind.exe
-a----         2019/2/9     13:57         200192 deviceQuery.exe
-a----         2019/2/9     13:57         311296 freeglut.dll
-a----         2019/2/9     13:57         229376 glew64.dll
-a----         2019/2/9     13:57        1265664 nbody.exe
-a----         2019/2/9     13:57         552448 oceanFFT.exe
-a----         2019/2/9     13:57         409088 randomFog.exe
-a----         2019/2/9     13:57         224256 vectorAdd.exe


PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite>
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23

.\bandwidthTest.exe

PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite> .\bandwidthTest.exe
[CUDA Bandwidth Test] - Starting...
Running on...

 Device 0: GeForce GTX 1060 with Max-Q Design
 Quick Mode

 Host to Device Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)        Bandwidth(MB/s)
   33554432                     6343.7

 Device to Host Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)        Bandwidth(MB/s)
   33554432                     6409.8

 Device to Device Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)        Bandwidth(MB/s)
   33554432                     146651.3

Result = PASS

NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.
PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite>
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26

.\deviceQuery.exe

PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite> .\deviceQuery.exe
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite\deviceQuery.exe Starting...

 CUDA Device Query (Runtime API) version (CUDART static linking)

Detected 1 CUDA Capable device(s)

Device 0: "GeForce GTX 1060 with Max-Q Design"
  CUDA Driver Version / Runtime Version          10.1 / 10.1
  CUDA Capability Major/Minor version number:    6.1
  Total amount of global memory:                 6144 MBytes (6442450944 bytes)
  (10) Multiprocessors, (128) CUDA Cores/MP:     1280 CUDA Cores
  GPU Max Clock rate:                            1342 MHz (1.34 GHz)
  Memory Clock rate:                             4004 Mhz
  Memory Bus Width:                              192-bit
  L2 Cache Size:                                 1572864 bytes
  Maximum Texture Dimension Size (x,y,z)         1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)
  Maximum Layered 1D Texture Size, (num) layers  1D=(32768), 2048 layers
  Maximum Layered 2D Texture Size, (num) layers  2D=(32768, 32768), 2048 layers
  Total amount of constant memory:               zu bytes
  Total amount of shared memory per block:       zu bytes
  Total number of registers available per block: 65536
  Warp size:                                     32
  Maximum number of threads per multiprocessor:  2048
  Maximum number of threads per block:           1024
  Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
  Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)
  Maximum memory pitch:                          zu bytes
  Texture alignment:                             zu bytes
  Concurrent copy and kernel execution:          Yes with 5 copy engine(s)
  Run time limit on kernels:                     Yes
  Integrated GPU sharing Host Memory:            No
  Support host page-locked memory mapping:       Yes
  Alignment requirement for Surfaces:            Yes
  Device has ECC support:                        Disabled
  CUDA Device Driver Mode (TCC or WDDM):         WDDM (Windows Display Driver Model)
  Device supports Unified Addressing (UVA):      Yes
  Device supports Compute Preemption:            Yes
  Supports Cooperative Kernel Launch:            No
  Supports MultiDevice Co-op Kernel Launch:      No
  Device PCI Domain ID / Bus ID / location ID:   0 / 1 / 0
  Compute Mode:
     < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 10.1, CUDA Runtime Version = 10.1, NumDevs = 1, Device0 = GeForce GTX 1060 with Max-Q Design
Result = PASS
PS C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.1\extras\demo_suite>
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47

References

[1] Yongqiang Cheng, https://yongqiang.blog.csdn.net/
[2] CUDA Toolkit, https://developer.nvidia.com/cuda-toolkit
[3] GPU-Accelerated Libraries for AI and HPC, https://developer.nvidia.com/gpu-accelerated-libraries
[4] NVIDIA Developer Tools Overview, https://developer.nvidia.com/tools-overview
[5] CUDA Toolkit Documentation, https://docs.nvidia.com/cuda/
[6] CUDA Installation Guide for Microsoft Windows, https://docs.nvidia.com/cuda/cuda-installation-guide-microsoft-windows/index.html
[7] CUDA Quick Start Guide, https://docs.nvidia.com/cuda/cuda-quick-start-guide/index.html
[8] CUDA Code Samples, https://developer.nvidia.com/cuda-code-samples
[9] An Even Easier Introduction to CUDA, https://devblogs.nvidia.com/even-easier-introduction-cuda/
[10] NVIDIA Developer Blog, https://devblogs.nvidia.com/
[11] Accelerated Computing - Training, https://developer.nvidia.com/accelerated-computing-training
[12] NVIDIA Developer Forums, https://forums.developer.nvidia.com/
[13] Display Driver Uninstaller (DDU), https://www.wagnardsoft.com/

声明:本文内容由网友自发贡献,不代表【wpsshop博客】立场,版权归原作者所有,本站不承担相应法律责任。如您发现有侵权的内容,请联系我们。转载请注明出处:https://www.wpsshop.cn/w/Monodyee/article/detail/696658
推荐阅读
相关标签
  

闽ICP备14008679号