bclarkson-code / Tricycle

Deep learning framework built completely from scratch in Python + NumPy

Get GPT loss to decrease to 0 for single batch

bclarkson-code opened this issue

To make sure everything is working, the model should be able to overfit a single batch and drive the loss down to 0. If it can't, there are bugs that need fixing.
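A minimal sketch of the check, assuming plain NumPy and a toy stand-in for the GPT (a bigram lookup table where each row holds the logits for the next token). It does not use Tricycle's API, and the names `softmax_cross_entropy` and `overfit_single_batch` are hypothetical. The idea is just to train on one fixed batch and watch the loss head towards 0. With softmax cross-entropy and plain gradient descent the loss approaches 0 but never reaches it exactly, so in practice the check is "below a small threshold".

```python
import numpy as np


def softmax_cross_entropy(logits, targets):
    """Return the mean cross-entropy loss and its gradient w.r.t. the logits."""
    # Subtract the row-wise max for numerical stability.
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    n = logits.shape[0]
    loss = -log_probs[np.arange(n), targets].mean()
    # d(loss)/d(logits) = (softmax - one_hot(targets)) / n
    grad = np.exp(log_probs)
    grad[np.arange(n), targets] -= 1.0
    return loss, grad / n


def overfit_single_batch(vocab_size=8, steps=5000, lr=1.0, seed=0):
    """Train a toy bigram table on one fixed batch (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    # One fixed batch: every token appears once as input, so each input
    # has a single target and the batch can be memorised.
    inputs = np.arange(vocab_size)
    targets = rng.integers(0, vocab_size, size=vocab_size)
    # Toy model standing in for the GPT: row i holds the logits after token i.
    table = np.zeros((vocab_size, vocab_size))
    losses = []
    for _ in range(steps):
        logits = table[inputs]
        loss, grad_logits = softmax_cross_entropy(logits, targets)
        losses.append(loss)
        # Plain gradient descent; ufunc.at also handles repeated indices.
        np.subtract.at(table, inputs, lr * grad_logits)
    return table, inputs, targets, losses


if __name__ == "__main__":
    table, inputs, targets, losses = overfit_single_batch()
    print(f"first loss: {losses[0]:.4f}, last loss: {losses[-1]:.6f}")
```

For the real model, the same loop would wrap the GPT forward and backward pass on one fixed batch. If the loss plateaus well above 0, that suggests a bug somewhere in the gradients, the optimiser, or the data pipeline.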