microsoft / pai

Resource scheduling and cluster management for AI

Home Page:https://openpai.readthedocs.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

NoJobError happens when use rest-api to submit a job

HaoLiuHust opened this issue · comments

Organization Name:

Short summary about the issue/question:
I am trying to use rest-api to submit a job, the api is :
/api/v2/jobs
sometimes, it return state code 202, which means succeed, but sometimes it return 404, NoJobError, which means failed, but I can find the submited job on web ui

Brief what process you are following:

How to reproduce it:

OpenPAI Environment:

  • OpenPAI version: 1.8.0
  • Cloud provider or hardware configuration:
  • OS (e.g. from /etc/os-release):
  • Kernel (e.g. uname -a):
  • Hardware (e.g. core number, memory size, storage size, GPU type etc.):
  • Others:

Anything else we need to know: