Rotary Position Embeddings
bclarkson-code opened this issue · comments
Modern LLMs basically all use rotary position embeddings. these should be added to tricycle
Deep learning framework completely from scratch in python + numpy
bclarkson-code opened this issue · comments
Modern LLMs basically all use rotary position embeddings. these should be added to tricycle