bclarkson-code / Tricycle

Deep learning framework completely from scratch in python + numpy

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Rotary Position Embeddings

bclarkson-code opened this issue · comments

Modern LLMs basically all use rotary position embeddings. these should be added to tricycle