where did the self.bias get defined in the casual attention class
nebyu08 opened this issue · comments
nebiyu youhannes commented
Chase Lambert commented
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
nebyu08 opened this issue · comments