Implement SuperHOT/interpolated RoPE support

Question

philpax opened this issue a year ago

Another llama.cpp feature that seems to have shrunk the paper-to-implementation pipeline to less than one week!

This allows for a much longer context (assuming you have the (V)RAM for it).

We can probably close out #77 if this is done.

Lukas Kreussel · Answer 1 · Wed Jul 19 2023 15:47:18 GMT+0800 (China Standard Time)

To do this, we only need a new rope_scaling model parameter. Or am I missing something?
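
For illustration, here is a minimal Python sketch of what such a parameter could do, assuming the linear position-interpolation variant: the token position is multiplied by the scale before the rotary angles are computed, and nothing else changes. The function names, the interleaved pairing of dimensions, the default base of 10000, and the 0.25 scale (a 2048-token model stretched to 8192) are assumptions made for the example, not code taken from llama.cpp or from this project.

```python
import math

def rope_angles(position, head_dim, rope_scaling=1.0, base=10000.0):
    """Rotation angles for one position, one angle per pair of dimensions.

    rope_scaling is a hypothetical parameter: 1.0 gives plain RoPE, while a
    value below 1.0 compresses positions (linear interpolation).
    """
    scaled_pos = position * rope_scaling
    return [scaled_pos * base ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]

def apply_rope(vec, position, rope_scaling=1.0, base=10000.0):
    """Rotate each (2i, 2i+1) pair of vec by the angle for that pair."""
    angles = rope_angles(position, len(vec), rope_scaling, base)
    out = list(vec)
    for i, theta in enumerate(angles):
        x0, x1 = vec[2 * i], vec[2 * i + 1]
        c, s = math.cos(theta), math.sin(theta)
        out[2 * i] = x0 * c - x1 * s
        out[2 * i + 1] = x0 * s + x1 * c
    return out

# With a scale of 0.25, position 8000 is rotated the same way that
# position 2000 is in the unscaled model.
query = [1.0, 0.0, 0.5, -0.5]
scaled = apply_rope(query, 8000, rope_scaling=0.25)
unscaled = apply_rope(query, 2000)
```

If this reading is right, the parameter only has to reach the place where the rotary angles are computed. The attention code itself stays the same.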