microsoft / pai

Resource scheduling and cluster management for AI

Home Page:https://openpai.readthedocs.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Uninstall Pai service but interrupted by 'nvidia-device-plugin-daemonset'

18AlexHua18 opened this issue · comments

When I use Paictl.py service delete but interrupted by Trying to stop nvidia-device-plugin-daemonset
截屏2022-05-21 下午9 47 24

OpenPAI Environment:

  • OpenPAI version: v1.8.0
  • OS (e.g. from /etc/os-release):VERSION="18.04.6 LTS (Bionic Beaver)"
  • Kernel (e.g. uname -a): x86_64 GNU/Linu

You can use kubectl describe ds nvidia-device-plugin -n kube-system to find the reason. The message means it is stuck when executing kubectl command