NVIDIA drivers
Amazon Linux 2023 provides NVIDIA GPU drivers and CUDA toolkit packages through a dedicated
repository. This repository is maintained by AWS and provides security advisories
through the Amazon Linux Security Center (ALAS)
Topics
About the NVIDIA repository
The AL2023 NVIDIA repository mirrors packages from the official NVIDIA CUDA
repository for AL2023
The repository is available in all AWS Commercial Regions, including the AWS GovCloud (US) Regions and AWS China Regions.
The repository provides NVIDIA Tesla (data center compute) and graphics drivers for x86_64 architectures. GRID drivers, used for virtual display and remote workstation capabilities, are not included. For GRID driver installation, see Install NVIDIA drivers in the EC2 User Guide.
Enabling the NVIDIA repository
To enable the NVIDIA repository on your AL2023 instance, install the
nvidia-release package. This adds the repository configuration
and GPG keys to your system.
[ec2-user ~]$sudo dnf install nvidia-release -y
Verify the repository was added:
[ec2-user ~]$dnf repolist
You should see the amazonlinux-nvidia repository in the list.
repo id repo name status
amazonlinux Amazon Linux 2023 repository enabled
amazonlinux-nvidia Amazon Linux 2023 NVIDIA repository enabled
Installing NVIDIA drivers
After enabling the repository, you can install NVIDIA driver packages using
dnf.
-
Install the kernel headers and development packages for your running kernel:
[ec2-user ~]$sudo dnf install kernel-devel-$(uname -r) kernel-headers-$(uname -r) -y -
Install the NVIDIA driver:
[ec2-user ~]$sudo dnf install nvidia-driver-cuda -y -
Reboot the instance:
[ec2-user ~]$sudo reboot -
After rebooting, verify the driver is loaded:
[ec2-user ~]$nvidia-smi
Installing the CUDA toolkit
After installing the NVIDIA driver, you can install the CUDA toolkit:
[ec2-user ~]$sudo dnf install cuda-toolkit -y
Note
For GPU instances that require NVIDIA Fabric Manager (such as P4d, P5, and P6 instance types), install and enable the additional packages:
[ec2-user ~]$DRV_BRANCH="$(modinfo nvidia | grep "^version:" | tr -s ' ' | cut -d ' ' -f 2)"[ec2-user ~]$sudo dnf install nvidia-fabricmanager-${DRV_BRANCH} -y[ec2-user ~]$sudo systemctl enable --now nvidia-fabricmanager[ec2-user ~]$sudo systemctl enable --now nvidia-persistenced
Verify that Fabric Manager is running and the GPUs are connected through NVSwitch:
[ec2-user ~]$sudo systemctl status nvidia-fabricmanager[ec2-user ~]$nvidia-smi topo -m
In the topology matrix, connections between GPUs should show
NV links, indicating active NVSwitch
connectivity.
For detailed instructions on installing NVIDIA drivers on EC2 GPU instances, including instance type-specific requirements, see Install NVIDIA public drivers in the EC2 User Guide.
Removing the NVIDIA repository
To remove the NVIDIA repository configuration from your system:
[ec2-user ~]$sudo dnf remove nvidia-release -y
Important
Removing the repository configuration does not remove any NVIDIA packages already installed on the system.