Notes and summaries of papers I've read
- Large Scale Distributed Deep Networks [notes][paper]
- Deep Gradient Compression: Reducing the communication bandwidth for distributed training [notes][paper]
MIT © Manraj Singh
Notes and summaries of papers I've read