VITA-Group/BERT-Tickets Issues
Rewinding doesn't work
ClosedTransformer Vesion
UpdatedRuntimeError: CUDA out of memory
Closed 3Duplicated code ¿?
Closed 1
[NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Zhangyang Wang, Michael Carbin