seqeralabs / nf-tower

Nextflow Tower system

Home Page: https://tower.nf

Zombie runs, redux: Jobs still shown as running

lina-kim opened this issue

Hi all,

First, thanks for creating and maintaining Nextflow Tower! I've been using it since it was in beta, and am a big fan.

I recently had an issue identical to #313: A number of previously killed jobs (15 in all) are still marked active on my dashboard, some apparently active for three weeks. All were launched and killed from the command line. Active tasks are labeled either aborted or running, with no exit codes.

[Image: Screenshot 2022-05-18 at 10 36 00]

This isn't something I've seen in previous releases, so figured I'd bring this up. Thanks!

I have the same issue with "zombie" runs. Some of them are runs that didn't even start (e.g. they failed to locate input reads), but they all appear on the dashboard and cannot be deleted because they are marked as ongoing.

Can you please copy and paste the workflow IDs here?

Thanks! Is there a good way of avoiding these in the future, other than not killing local jobs from the command line?

Workflow IDs for my zombie runs:

  • 5zoFIXe3O0q0Vl
  • 382BrwC5rQE634
  • 155XhuX860zL9I
  • 5FmXukqvWvLqID
  • 2f4fghu0QzQnF5
  • 5UTCc6l7ZsbUqJ
  • 3wvkc8sEPaV6fi
  • 20DoBLIUOqVjqe
  • NtatYzN1Z9TYS
  • 1ZjLbkjarb0js7
  • 401GOTT16Sy64p
  • 5MJ0lL4GDwNlfg
  • 4PPOt6qRjWBCmA
  • 3aYnsyI0pC4zvF
  • 2KiNKtqg2PzXHG
  • 38UcPAyEE7Xn9u

Recent zombie jobs:

  • 1W9Oc0ejPJb9B5
  • 4zQKz80vo8aYdV
  • 1mnnAOHOUmLewE
  • 5nCuAURTjAVubW
  • 8XrlaVnZ8y4WF
  • KR4FyTBNa0y8J
  • 5FgYlmRMwbVynU
  • 1nASdBfSvMsUwH
  • yJ9hQmYn9z5zp
  • 2KhaCMuan3yPSs
  • oQJFDAGmzf7l
  • 3885wmDxFLfU94
  • 2vAFVFCMCLn9dB

Older zombie jobs still "running":

  • 3Srtcvg2ADmKKv (Jun 12)
  • 5z92833zr2m4QK (Apr 30)
  • 4Sfxx5Fi88xB4R (Apr 30)
  • 43Szq0sEimReFv (Apr 8)

Hope I got them all right...
Thanks for checking, @pditommaso!

Thanks for reporting this problem. We have indeed found an issue related to this. Some of the above workflows should be reported as terminated now.

Most are now terminated - thanks @pditommaso!