lucidrains/electra-pytorch Issues
Disallow Sampling From Correct
Updated 1Custom Dataset
UpdatedElectra-small performance
Updated 1nan loss during pretraining
Updated
A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch