a deep learning-driven scheduler for elastic training in deep learning clusters
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
vihowe opened this issue 2 years ago · comments
I am studying your paper and reading the code. Could you please tell me what the central agent is in train.py is? Thanks a lot.
@vihowe It works like a parameter server to sync parameters among workers.