Go to All Forums

Monitor the utilization of your GPU servers

Dear Team,

We are looking for monitoring  the GPU server utilization metrics.

Kindly advice us for the required plugins.

 

Thanks in advance

 

 

Like (2) Reply
Replies (3)

Hi,
 
Please find attached a plugin script for monitoring your GPU servers. You can monitor performance metrics including the server's memory, temperature, and CPU. 
 
Prerequisite: 
Ensure you have the gpustat python module installed. To install the module, execute the command:
pip install gpustat
 
Installation:
  1. Unzip the attachment and execute the plugin using the command: python gpumonitoring/gpumonitoring.py
  2. Move the downloaded folder to Site24x7's Linux server monitoring agent plugins directory - /opt/site24x7/monagent/plugins/
  3. Log in to Site24x7 and go to Server > Plugins > click on the plugin monitor to view the performance metrics.
Hope this helps. Let us know for further queries if any.
 
Happy Monitoring!

 

 

 

 

Attachments
gpu_monitoring.zip
Size: 1.53 KB
Like (0) Reply

when i am execute python gpumonitoring/gpumonitoring.py, i got this error


```python /home/nodeflux/gpu_monitoring/gpu_monitoring.py
Traceback (most recent call last):
File "/home/nodeflux/gpu_monitoring/gpu_monitoring.py", line 24, in <module>
p = subprocess.Popen(cmd, stdout=subprocess.PIPE, shell=True)
NameError: name 'subprocess' is not defined `

Like (0) Reply

ok already solve , i am adding subprocess library

Like (0) Reply

Was this post helpful?