gpu-group-list not updated with new GPU


Robin Tan

Question

Hi, I'm adding and removing GPUs (for testing), and I notice that "xe gpu-group-list" does not show the new GPU.

 

The doc says the groups are updated by Xen automatically, but I have removed the T4 and M60 and put in the RTX, and the list has not been updated.

 

[root@localhost ~]# nvidia-smi
Tue Feb 25 16:23:17 2020
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 440.53       Driver Version: 440.53       CUDA Version: N/A      |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Quadro RTX 6000     On   | 00000000:86:00.0 Off |                  Off |
| 33%   55C    P8    33W / 260W |    176MiB / 24575MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

[root@localhost ~]# xe gpu-group-list
uuid ( RO) : e062fc86-82d6-e7fc-969e-9c2837698e0f
name-label ( RW): Group of NVIDIA Corporation Device 1e30 GPUs
name-description ( RW):

uuid ( RO) : 313004be-b34a-115c-d6eb-d7d8431232ac
name-label ( RW): Group of NVIDIA Corporation TU104GL [Tesla T4] GPUs
name-description ( RW):

uuid ( RO) : 6afa24fa-4b19-4174-82e4-93b8687271d6
name-label ( RW): Group of NVIDIA Corporation GM204GL [Tesla M60] GPUs
name-description ( RW):

uuid ( RO) : 60b2c8b3-ed41-3a02-b334-133e3bd78204
name-label ( RW): Group of Matrox Electronics Systems Ltd. Device 0538 GPUs
name-description ( RW):

20 answers to this question

For the RTX, I don't think there is a graphics/compute mode anymore. On the M60, yes, it's in graphics mode. The lspci output looks fine (I have switched the cards to an RTX6000 and an RTX8000):

lspci | grep NVIDIA
37:00.0 VGA compatible controller: NVIDIA Corporation TU102GL [Quadro RTX 6000/8000] (rev a1)
37:00.1 Audio device: NVIDIA Corporation Device 10f7 (rev a1)
37:00.2 USB controller: NVIDIA Corporation Device 1ad6 (rev a1)
37:00.3 Serial bus controller [0c80]: NVIDIA Corporation Device 1ad7 (rev a1)
86:00.0 VGA compatible controller: NVIDIA Corporation TU102GL [Quadro RTX 6000/8000] (rev a1)
86:00.1 Audio device: NVIDIA Corporation Device 10f7 (rev a1)
86:00.2 USB controller: NVIDIA Corporation Device 1ad6 (rev a1)
86:00.3 Serial bus controller [0c80]: NVIDIA Corporation Device 1ad7 (rev a1)


I had configured them but not powered them on. I have now powered them on, but it makes no difference. The GPU in the VM works fine.

 

I don't see an "xe gpu-group-delete" command or anything similar, since the doc says the groups should be managed automatically by Xen. Without a correct gpu-group entry in the CLI, I can't script vgpu-create, as it requires a gpu-group-uuid.
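
Since vgpu-create needs a gpu-group-uuid, one way to script the lookup is to pull the uuid out of the gpu-group-list output by matching on the name-label. The following is only a minimal sketch, assuming bash and awk are available in dom0. The function name gpu_group_uuid_by_label is made up for illustration, and it does nothing more than parse text laid out like the listings in this thread:

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the uuid of each GPU group whose name-label
# contains the text given as $1. Reads `xe gpu-group-list` style text on stdin.
gpu_group_uuid_by_label() {
    local needle="$1"
    awk -v needle="$needle" '
        # Each record begins with a line like "uuid ( RO) : <value>".
        $1 == "uuid" { uuid = $NF }
        # The name-label line of the same record follows; match by substring.
        $1 == "name-label" && index($0, needle) > 0 { print uuid }
    '
}

# Intended use on the host (not executed here):
#   xe gpu-group-list | gpu_group_uuid_by_label 'Device 1e30'
```

Against the listing in this thread, searching for 'Device 1e30' would print e062fc86-82d6-e7fc-969e-9c2837698e0f. If several groups match, one uuid is printed per line, so a script should check that it got exactly one result before passing it to vgpu-create.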

 

nvidia-smi
Thu Feb 27 12:45:38 2020
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 440.53       Driver Version: 440.53       CUDA Version: N/A      |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Quadro RTX 8000     On   | 00000000:37:00.0 Off |                  Off |
| 33%   58C    P0    83W / 260W |  49011MiB / 49151MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
|   1  Quadro RTX 6000     On   | 00000000:86:00.0 Off |                  Off |
| 33%   38C    P8    30W / 260W |  24441MiB / 24575MiB |      5%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|    0     31207    C+G   /usr/lib64/xen/bin/vgpu                    16230MiB |
|    0     31272    C+G   /usr/lib64/xen/bin/vgpu                    16230MiB |
|    0     31565    C+G   /usr/lib64/xen/bin/vgpu                    16230MiB |
|    1      2075    C+G   /usr/lib64/xen/bin/vgpu                     6066MiB |
|    1      2426    C+G   /usr/lib64/xen/bin/vgpu                     6066MiB |
|    1      2600    C+G   /usr/lib64/xen/bin/vgpu                     6066MiB |
|    1      2996    C+G   /usr/lib64/xen/bin/vgpu                     6066MiB |
+-----------------------------------------------------------------------------+

[root@localhost ~]# xe gpu-group-list
uuid ( RO) : e062fc86-82d6-e7fc-969e-9c2837698e0f
name-label ( RW): Group of NVIDIA Corporation Device 1e30 GPUs
name-description ( RW):

uuid ( RO) : 313004be-b34a-115c-d6eb-d7d8431232ac
name-label ( RW): Group of NVIDIA Corporation TU104GL [Tesla T4] GPUs
name-description ( RW):

uuid ( RO) : 6afa24fa-4b19-4174-82e4-93b8687271d6
name-label ( RW): Group of NVIDIA Corporation GM204GL [Tesla M60] GPUs
name-description ( RW):

uuid ( RO) : 60b2c8b3-ed41-3a02-b334-133e3bd78204
name-label ( RW): Group of Matrox Electronics Systems Ltd. Device 0538 GPUs
name-description ( RW):


Xen 7.1.2. I cannot upgrade, as this is a supported release for when it goes into production.

It is recognizing the RTX8000/6000 as M60/1e30. As for the T4, nobody is using it.

 

xe gpu-group-list
uuid ( RO) : e062fc86-82d6-e7fc-969e-9c2837698e0f
name-label ( RW): Group of NVIDIA Corporation Device 1e30 GPUs
name-description ( RW):

uuid ( RO) : 313004be-b34a-115c-d6eb-d7d8431232ac
name-label ( RW): Group of NVIDIA Corporation TU104GL [Tesla T4] GPUs
name-description ( RW):

uuid ( RO) : 6afa24fa-4b19-4174-82e4-93b8687271d6
name-label ( RW): Group of NVIDIA Corporation GM204GL [Tesla M60] GPUs
name-description ( RW):

uuid ( RO) : 60b2c8b3-ed41-3a02-b334-133e3bd78204
name-label ( RW): Group of Matrox Electronics Systems Ltd. Device 0538 GPUs
name-description ( RW):

 

xe vgpu-list
uuid ( RO) : ef073272-df46-43ed-ba35-247af30e1f01
vm-uuid ( RO): d0c4560f-9348-881e-d6fc-69428494adef
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f

uuid ( RO) : 4838ed84-7705-3897-e208-dad12ae9c746
vm-uuid ( RO): d848814b-5dca-c9fc-5610-d7aba861f500
gpu-group-uuid ( RO): 6afa24fa-4b19-4174-82e4-93b8687271d6

uuid ( RO) : b7651d2f-a75f-a722-f0c8-456c1791fa35
vm-uuid ( RO): 91b6332e-7596-558c-5f1c-17fcef319d06
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f

uuid ( RO) : 672368d3-4659-86ce-67b5-ddeb63a8863a
vm-uuid ( RO): 86e0bdc9-4b42-9659-3cd0-c76a7dcd5a5c
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f

uuid ( RO) : 1528d525-1f0d-2310-1914-9384ccf23dba
vm-uuid ( RO): 290fe529-ff86-5625-a64f-9f5a18c1f331
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f

uuid ( RO) : af166f52-d709-a635-b25a-77738ee5e853
vm-uuid ( RO): dab1558b-36d7-086c-4419-be36f6bdc2fc
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f

uuid ( RO) : bbfac1c8-d7ef-3fc8-34b5-7167b00aa917
vm-uuid ( RO): 24f9ede1-371f-12f9-ba93-184306bd3381
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f

uuid ( RO) : 357ef9e4-d829-4bed-d267-1918265d65a7
vm-uuid ( RO): b9d6f5ce-cd60-1bf6-c66c-2c27b5e8b36c
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f

uuid ( RO) : 52b393d2-5072-88bb-c987-fc918f35dd21
vm-uuid ( RO): 8f7ffdc2-1934-695b-3ca9-4ece67bc0fd5
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f

uuid ( RO) : 0d1186b7-6b46-13a9-db19-8c9365f2dd28
vm-uuid ( RO): 80ba560f-591c-5082-51f3-59fdde4dad55
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f

uuid ( RO) : c2b1fa28-449b-1053-a621-375994a5f2d3
vm-uuid ( RO): 8909e6c7-e995-6083-61d3-6535c440dde7
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f

uuid ( RO) : 8ecfcfc9-459e-fb7f-3a00-2e02a137343f
vm-uuid ( RO): b54d661e-1b37-3df8-2c26-c0887603cfbb
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f

uuid ( RO) : 791203cd-4048-aec8-9fd7-5a492c7efc09
vm-uuid ( RO): 25805446-b040-8717-4a32-76c3db343bfb
gpu-group-uuid ( RO): e062fc86-82d6-e7fc-969e-9c2837698e0f
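
To see at a glance which groups the existing vGPUs belong to, the vgpu-list output can be tallied per gpu-group-uuid. This is a rough sketch, assuming bash, awk and sort are available in dom0. The helper name count_vgpus_by_group is hypothetical, and it only reads text in the layout shown above:

```shell
#!/usr/bin/env bash
# Hypothetical helper: count vGPU records per gpu-group-uuid.
# Reads `xe vgpu-list` style text on stdin and prints "<count> <group-uuid>",
# with the most heavily used group first.
count_vgpus_by_group() {
    awk '
        # Every vGPU record carries one "gpu-group-uuid ( RO): <value>" line.
        $1 == "gpu-group-uuid" { count[$NF]++ }
        END { for (g in count) print count[g], g }
    ' | sort -rn
}

# Intended use on the host (not executed here):
#   xe vgpu-list | count_vgpus_by_group
```

Applied to the listing above, this gives 12 vGPUs in e062fc86-82d6-e7fc-969e-9c2837698e0f (the "Device 1e30" group) and 1 in 6afa24fa-4b19-4174-82e4-93b8687271d6 (the Tesla M60 group). The Tesla T4 group has none, which matches the note that nobody is using it.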
