w-e-w / stable-diffusion-webui-GPU-temperature-protection

Pause image generation when GPU temperature exceeds threshold

Home Page:https://github.com/AUTOMATIC1111/stable-diffusion-webui

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GPU temperature protection

Pause image generation when GPU temperature exceeds threshold

this extension uses nvidia-smi to monitor GPU temperature at the end of each step, if temperature exceeds threshold pause image generation until criteria are met.

Support

  • Nvidia GPU on Windows and Linux
  • AMD GPU on Linux with ROCm and Windows with OpenHardwareMonitor

Installation

  • method 1: Install using extensions tab Available extensions
  • method 2: Install using extensions tab Install from URL by manually copying the repository URL
https://github.com/w-e-w/stable-diffusion-webui-GPU-temperature-protection.git

Setting

Settings can be found at Setting > GPU Temperature

  • GPU temperature monitor minimum interval
    • checking temperature too often will reduce image generation performance
    • set to 0 well effectively disable this extension
    • to completely disable extension disable the file extension tab
  • GPU sleep temperature
    • generation will pause if GPU core temperature exceeds this temperature
  • GPU wake temperature
    • generation will continue to pause until temperature has drops below this temperature
    • setting a higher value than GPU sleep temperaturewill effectively disable this
  • Sleep Time
    • seconds to sleep before checking temperature again
  • Max sleep Time
    • max number of seconds that it's allowed to pause
    • generation will continue disregarding GPU wake temperature after the allotted time has passed
    • set to 0 disable this limit allowing it to pause indefinitely
  • Print GPU Core temperature while sleeping in terminal
    • print the GPU core temperature reading from nvidia-smi to console when generation is paused
    • providing information
  • GPU device index
    • selecting the correct temperature reading for multi GPU systems
    • in most cases and for single GPU system this value should be 0

Notes

  • Temperature unit Celsius, Time unit seconds
  • To be honest I don't recommend anyone using this extension, if your GPU is having temperature issues and don't have the means to improve the cooling, then recommend using software like MSI afterburner to undervolt or further power limit or thermal limit the GPU.

About

Pause image generation when GPU temperature exceeds threshold

https://github.com/AUTOMATIC1111/stable-diffusion-webui

License:GNU Affero General Public License v3.0


Languages

Language:Python 100.0%