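Locating a GPU's physical position on the motherboard
On a server with multiple GPUs, how do you find the physical position of a particular card?
Run the following command and note the Bus-Id of the card you are looking for: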
nvidia-smi
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 570.124.06 Driver Version: 570.124.06 CUDA Version: 12.8 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce RTX 2080 Ti Off | 00000000:1A:00.0 Off | N/A |
| 29% 36C P8 18W / 250W | 10525MiB / 22528MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 1 NVIDIA GeForce RTX 2080 Ti Off | 00000000:1B:00.0 Off | N/A |
| 30% 37C P8 24W / 250W | 10445MiB / 22528MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 2 NVIDIA GeForce RTX 2080 Ti Off | 00000000:1C:00.0 Off | N/A |
| 25% 34C P8 22W / 250W | 1MiB / 22528MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 3 NVIDIA GeForce RTX 2080 Ti Off | 00000000:1D:00.0 Off | N/A |
| 30% 40C P2 49W / 250W | 9019MiB / 22528MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 4 NVIDIA GeForce RTX 2080 Ti Off | 00000000:1E:00.0 Off | N/A |
| 28% 36C P8 1W / 250W | 10549MiB / 22528MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 5 NVIDIA GeForce RTX 2080 Ti Off | 00000000:3D:00.0 Off | N/A |
| 27% 34C P8 28W / 250W | 10489MiB / 22528MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 6 NVIDIA GeForce RTX 2080 Ti Off | 00000000:3E:00.0 Off | N/A |
| 29% 36C P8 30W / 250W | 10507MiB / 22528MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 7 NVIDIA GeForce RTX 2080 Ti Off | 00000000:3F:00.0 Off | N/A |
| 24% 32C P8 2W / 250W | 10525MiB / 22528MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 8 NVIDIA GeForce RTX 2080 Ti Off | 00000000:40:00.0 Off | N/A |
| 27% 35C P8 23W / 250W | 10505MiB / 22528MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
| 9 NVIDIA GeForce RTX 2080 Ti Off | 00000000:41:00.0 Off | N/A |
| 26% 37C P2 75W / 250W | 10487MiB / 22528MiB | 0% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+
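For scripting, the Bus-Id can also be read without scanning the table by eye. The following is a minimal sketch, assuming that `nvidia-smi --query-gpu=index,pci.bus_id --format=csv,noheader` prints one `index, bus_id` pair per line separated by a comma and a space. The helper name `gpu_bus_id` is hypothetical, and the sample lines are copied from the table above so the snippet runs without a GPU.

```shell
# Hypothetical helper: print the Bus-Id for the GPU index given as $1.
# Reads "index, bus_id" lines from standard input (assumed CSV layout).
gpu_bus_id() {
    awk -F', ' -v idx="$1" '$1 == idx { print $2 }'
}

# On a real server the input would come from nvidia-smi:
#   nvidia-smi --query-gpu=index,pci.bus_id --format=csv,noheader | gpu_bus_id 2
# Here a few lines from the table above stand in for that output.
gpu_bus_id 2 <<'EOF'
0, 00000000:1A:00.0
1, 00000000:1B:00.0
2, 00000000:1C:00.0
3, 00000000:1D:00.0
EOF
```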
Suppose we want to find where GPU 2 physically sits on the motherboard.
From the output above, the Bus-Id of GPU 2 is 00000000:1C:00.0.
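nvidia-smi prints the Bus-Id with an eight-digit PCI domain and upper-case hex digits, while dmidecode prints a four-digit domain in lower case. A small conversion makes the two directly comparable. This is only a sketch: the function name `normalize_bus_id` is hypothetical, and it assumes the upper four digits of the domain are zero, as they are in the output above.

```shell
# Hypothetical helper: convert an nvidia-smi Bus-Id (8-digit domain,
# upper-case hex) into the form dmidecode prints (4-digit domain, lower case).
# Assumption: the first four digits of the domain are zero and can be dropped.
normalize_bus_id() {
    printf '%s\n' "$1" \
        | sed -E 's/^[0-9A-Fa-f]{4}([0-9A-Fa-f]{4}:)/\1/' \
        | tr 'A-F' 'a-f'
}

normalize_bus_id "00000000:1C:00.0"
```

An address that already has a four-digit domain passes through with only the case changed.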
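Next, run the following command to list the PCIe slot information for the whole server (dmidecode usually needs root privileges):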
dmidecode -t slot
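Then find the entry whose Bus Address is 0000:1c:00.0, shown below. This is the same address as the Bus-Id from nvidia-smi, only written with a four-digit PCI domain and lower-case hex digits. Slot9 is the physical location of this card: the PCIe slot labeled Slot9 on the motherboard is where the card you are looking for is installed.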
Handle 0x0071, DMI type 9, 17 bytes
System Slot Information
Designation: CPU1 Slot9 PCI-E 3.0 X16
Type: x16 PCI Express 3 x16
Current Usage: In Use
Length: Long
ID: 9
Characteristics:
3.3 V is provided
PME signal is supported
SMBus signal is supported
Async/surprise removal is supported
Bus Address: 0000:1c:00.0
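On a server with many slots, the lookup can be automated instead of reading every record. The sketch below assumes each slot record lists its `Designation:` line before its `Bus Address:` line, as in the output above. The function name `find_slot` is hypothetical, and the sample input is an abridged copy of that record so the snippet runs without dmidecode.

```shell
# Hypothetical helper: print the Designation of the slot whose Bus Address
# matches $1 (case-insensitive). Reads `dmidecode -t slot` output from stdin.
find_slot() {
    awk -v want="$1" '
        # Remember the most recent slot name, stripped of its label.
        /Designation:/ { sub(/^[ \t]*Designation:[ \t]*/, ""); slot = $0 }
        # When the address matches, report the remembered slot name.
        /Bus Address:/ { if (tolower($NF) == tolower(want)) print slot }
    '
}

# On a real server:
#   sudo dmidecode -t slot | find_slot "0000:1c:00.0"
# Here an abridged copy of the record above is used as input.
find_slot "0000:1c:00.0" <<'EOF'
Handle 0x0071, DMI type 9, 17 bytes
System Slot Information
Designation: CPU1 Slot9 PCI-E 3.0 X16
Type: x16 PCI Express 3 x16
Current Usage: In Use
ID: 9
Bus Address: 0000:1c:00.0
EOF
```

For the sample record this prints the Designation line's value, which names Slot9.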
Q.E.D.


