kaiokendev / cutoff-len-is-context-len

Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

kaiokendev/cutoff-len-is-context-len Issues