Compute-optimal Perceiver AR models
krasserm opened this issue · comments
Martin Krasser commented
Application of the Chinchilla paper on small scale.
A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training
krasserm opened this issue · comments
Application of the Chinchilla paper on small scale.