Build layer objects
bclarkson-code opened this issue · comments
Some wrappers should be made around tensors and their multiplication to abstract them as layers instead of separate parameters and operations
Autograd to GPT-2 completely from scratch
bclarkson-code opened this issue · comments
Some wrappers should be made around tensors and their multiplication to abstract them as layers instead of separate parameters and operations