未完成
パッケージ管理機構を使って nvidia ドライバーを入れてみる。
RHEL9系なら
dnf -y config-manager --add-repo=https://developer.download.nvidia.com/compute/cuda/repos/rhel9/x86_64/cuda-rhel9.repo
RHEL8系なら
dnf -y config-manager --add-repo=https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/cuda-rhel8.repo
としてリポジトリを登録します。
っでその中身ですが、
[root@rockylinux ~]# cat /etc/yum.repos.d/cuda-rhel8.repo
[cuda-rhel8-x86_64]
name=cuda-rhel8-x86_64
baseurl=https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64
enabled=1
gpgcheck=1
gpgkey=https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64/D42D0685.pub
[root@rockylinux ~]#
となってます。「enabled=1」はあまり好きではないので「enabled=0」とかにします。
このリポジトリで提供されるものは
[root@rockylinux ~]# dnf repolist -v cuda-rhel8-x86_64
Loaded plugins: builddep, changelog, config-manager, copr, debug, debuginfo-install, download, generate_completion_cache, groups-manager, kpatch, needs-restarting, nvidia, nvidia-plugin, playground, repoclosure, repodiff, repograph, repomanage, reposync, system-upgrade
DNF version: 4.7.0
cachedir: /var/cache/dnf
Last metadata expiration check: 0:09:50 ago on Fri 21 Jun 2024 02:10:41 PM JST.
NVIDIA driver: filtering kernel 4.18.0-553.5.1.el8_10, no precompiled modules available for version 3:550.90.07
NVIDIA driver: filtering kernel 4.18.0-553.el8_10, no precompiled modules available for version 3:550.90.07
Repo-id : cuda-rhel8-x86_64
Repo-name : cuda-rhel8-x86_64
Repo-status : disabled
Repo-baseurl : https://developer.download.nvidia.com/compute/cuda/repos/rhel8/x86_64
Repo-expire : 172,800 second(s) (last: unknown)
Repo-filename : /etc/yum.repos.d/cuda-rhel8.repo
Total packages: 0
[root@rockylinux ~]# dnf list available --disablerepo=* --enablerepo=cuda-rhel8-x86_64
Last metadata expiration check: 0:10:05 ago on Fri 21 Jun 2024 02:11:15 PM JST.
Available Packages
:
cuda-12-3.x86_64 12.3.2-1 cuda-rhel8-x86_64
cuda-12-4.x86_64 12.4.1-1 cuda-rhel8-x86_64
cuda-12-5.x86_64 12.5.0-1 cuda-rhel8-x86_64
:
[root@rockylinux ~]#
と一見存在しないように見えるのですが、
[root@rockylinux ~]# dnf module list --disablerepo=* --enablerepo=cuda-rhel8-x86_64
:
cuda-rhel8-x86_64
Name Stream Profiles Summary
nvidia-driver latest default [d], fm, ks, src Nvidia driver for latest branch
nvidia-driver latest-dkms [d] default [d], fm, ks Nvidia driver for latest-dkms branch
nvidia-driver open-dkms default [d], fm, ks, src Nvidia driver for open-dkms branch
nvidia-driver 418 default [d], fm, ks, src Nvidia driver for 418 branch
nvidia-driver 418-dkms default [d], fm, ks Nvidia driver for 418-dkms branch
:
nvidia-driver 555 default [d], fm, ks, src Nvidia driver for 555 branch
nvidia-driver 555-dkms default [d], fm, ks Nvidia driver for 555-dkms branch
nvidia-driver 555-open default [d], fm, ks, src Nvidia driver for 555-open branch
:
[root@rockylinux ~]#
(あるいは)
[root@rockylinux ~]# dnf --enablerepo=cuda-rhel8-x86_64 module list nvidia-driver
Last metadata expiration check: 0:00:51 ago on Fri 21 Jun 2024 02:33:00 PM JST.
cuda-rhel8-x86_64
Name Stream Profiles Summary
nvidia-driver latest default [d], fm, ks, src Nvidia driver for latest branch
nvidia-driver latest-dkms [d] default [d], fm, ks Nvidia driver for latest-dkms branch
nvidia-driver open-dkms default [d], fm, ks, src Nvidia driver for open-dkms branch
nvidia-driver 418 default [d], fm, ks, src Nvidia driver for 418 branch
nvidia-driver 418-dkms default [d], fm, ks Nvidia driver for 418-dkms branch
nvidia-driver 440 default [d], fm, ks, src Nvidia driver for 440 branch
nvidia-driver 440-dkms default [d], fm, ks Nvidia driver for 440-dkms branch
nvidia-driver 450 default [d], fm, ks, src Nvidia driver for 450 branch
nvidia-driver 450-dkms default [d], fm, ks Nvidia driver for 450-dkms branch
nvidia-driver 455 default [d], fm, ks, src Nvidia driver for 455 branch
nvidia-driver 455-dkms default [d], fm, ks Nvidia driver for 455-dkms branch
nvidia-driver 460 default [d], fm, ks, src Nvidia driver for 460 branch
nvidia-driver 460-dkms default [d], fm, ks Nvidia driver for 460-dkms branch
nvidia-driver 465 default [d], fm, ks, src Nvidia driver for 465 branch
nvidia-driver 465-dkms default [d], fm, ks Nvidia driver for 465-dkms branch
nvidia-driver 470 default [d], fm, ks, src Nvidia driver for 470 branch
nvidia-driver 470-dkms default [d], fm, ks Nvidia driver for 470-dkms branch
nvidia-driver 495 default [d], fm, ks, src Nvidia driver for 495 branch
nvidia-driver 495-dkms default [d], fm, ks Nvidia driver for 495-dkms branch
nvidia-driver 510 default [d], fm, ks, src Nvidia driver for 510 branch
nvidia-driver 510-dkms default [d], fm, ks Nvidia driver for 510-dkms branch
nvidia-driver 515 default [d], fm, ks, src Nvidia driver for 515 branch
nvidia-driver 515-dkms default [d], fm, ks Nvidia driver for 515-dkms branch
nvidia-driver 515-open default [d], fm, ks, src Nvidia driver for 515-open branch
nvidia-driver 520 default [d], fm, ks, src Nvidia driver for 520 branch
nvidia-driver 520-dkms default [d], fm, ks Nvidia driver for 520-dkms branch
nvidia-driver 520-open default [d], fm, ks, src Nvidia driver for 520-open branch
nvidia-driver 525 default [d], fm, ks, src Nvidia driver for 525 branch
nvidia-driver 525-dkms default [d], fm, ks Nvidia driver for 525-dkms branch
nvidia-driver 525-open default [d], fm, ks, src Nvidia driver for 525-open branch
nvidia-driver 530 default [d], fm, ks, src Nvidia driver for 530 branch
nvidia-driver 530-dkms default [d], fm, ks Nvidia driver for 530-dkms branch
nvidia-driver 530-open default [d], fm, ks, src Nvidia driver for 530-open branch
nvidia-driver 535 default [d], fm, ks, src Nvidia driver for 535 branch
nvidia-driver 535-dkms default [d], fm, ks Nvidia driver for 535-dkms branch
nvidia-driver 535-open default [d], fm, ks, src Nvidia driver for 535-open branch
nvidia-driver 545 default [d], fm, ks, src Nvidia driver for 545 branch
nvidia-driver 545-dkms default [d], fm, ks Nvidia driver for 545-dkms branch
nvidia-driver 545-open default [d], fm, ks, src Nvidia driver for 545-open branch
nvidia-driver 550 default [d], fm, ks, src Nvidia driver for 550 branch
nvidia-driver 550-dkms default [d], fm, ks Nvidia driver for 550-dkms branch
nvidia-driver 550-open default [d], fm, ks, src Nvidia driver for 550-open branch
nvidia-driver 555 default [d], fm, ks, src Nvidia driver for 555 branch
nvidia-driver 555-dkms default [d], fm, ks Nvidia driver for 555-dkms branch
nvidia-driver 555-open default [d], fm, ks, src Nvidia driver for 555-open branch
Hint: [d]efault, [e]nabled, [x]disabled, [i]nstalled
[root@rockylinux ~]#
と選べる。ここでは 550 を入れてみたいので
[root@rockylinux ~]# dnf remove nvidia-driver
[root@rockylinux ~]# dnf module reset nvidia-driver
[root@rockylinux ~]# dnf --enablerepo=cuda-rhel8-x86_64 module install nvidia-driver:535
とすればドライバーがインストールされます。
あとは再起動して
[root@rockylinux ~]# nvidia-smi
Fri Jun 21 14:50:18 2024
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.183.01 Driver Version: 535.183.01 CUDA Version: 12.2 |
|-----------------------------------------+----------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+======================+======================|
| 0 NVIDIA RTX A2000 Off | 00000000:13:00.0 Off | Off |
| 30% 60C P0 26W / 70W | 0MiB / 6138MiB | 0% Default |
| | | N/A |
+-----------------------------------------+----------------------+----------------------+
+---------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=======================================================================================|
| No running processes found |
+---------------------------------------------------------------------------------------+
[root@rockylinux ~]#
となる。
っが、550以上のドライバを入れると上手く機能しない。esxiの問題かも