Repository from GitHub: https://github.com/cometyang/distributed-training-and-deepspeed
Hostfile: the master machine needs to be listed in the first row.
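As a rough illustration of the ordering rule above, the sketch below generates hostfile text with the master on the first row. It assumes the `hostname slots=N` line format used by DeepSpeed hostfiles; the host names, the slot count, and the helper `build_hostfile` are hypothetical and not taken from the repository.

```python
def build_hostfile(master, workers, slots=8):
    """Return hostfile text with the master host on the first row.

    Assumes one "<hostname> slots=<gpu count>" entry per line.
    """
    # Put the master first, then the workers in their given order.
    ordered = [master] + list(workers)
    seen = set()
    lines = []
    for host in ordered:
        # Skip repeats so the master is not listed twice if it also
        # appears in the worker list.
        if host in seen:
            continue
        seen.add(host)
        lines.append(f"{host} slots={slots}")
    return "\n".join(lines) + "\n"


# Hypothetical host names, for illustration only.
hostfile_text = build_hostfile(
    "node-master",
    ["node-worker-1", "node-master", "node-worker-2"],
    slots=4,
)
print(hostfile_text, end="")
```

The resulting text can be saved to a file and passed to the launcher; the only point of the helper is that the master entry always ends up on the first line, whatever order the hosts were collected in.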