[Feature]: rope_scaling for qwen2
HappyLynn opened this issue
Lynn commented
The feature, motivation and pitch
We found that the qwen2 model code, for example Qwen2Attention, does not accept a rope_scaling argument. However, we need to use the YaRN/NTK RoPE scaling features with this model. Could you add support for that?
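To make the request concrete, here is a rough sketch of the kind of behaviour we have in mind. It is not the project's code: the function name rope_inv_freq, the dict keys "type" and "factor", and the type value "ntk" are all assumptions made for illustration. It only covers the commonly used NTK-aware base adjustment; YaRN is not shown.

```python
def rope_inv_freq(head_dim, base=10000.0, rope_scaling=None):
    """Return the RoPE inverse frequencies for one attention head.

    rope_scaling is a hypothetical dict such as {"type": "ntk", "factor": 2.0}.
    When it is None, plain (unscaled) RoPE frequencies are returned.
    """
    if rope_scaling is not None:
        kind = rope_scaling.get("type")
        factor = float(rope_scaling["factor"])
        if kind == "ntk":
            # NTK-aware scaling: enlarge the rotary base so that the lowest
            # frequency is divided by `factor`, while the highest frequency
            # (index 0) stays unchanged.
            base = base * factor ** (head_dim / (head_dim - 2))
        else:
            # Other schemes (e.g. YaRN) are intentionally left out of this sketch.
            raise ValueError(f"unsupported rope_scaling type: {kind!r}")
    # One frequency per pair of dimensions: base^(-2i / head_dim).
    return [1.0 / base ** (2 * i / head_dim) for i in range(head_dim // 2)]


plain = rope_inv_freq(128)
scaled = rope_inv_freq(128, rope_scaling={"type": "ntk", "factor": 2.0})
```

With this formulation the first frequency stays the same and the last one is divided by the factor, which is what lets the model cover a longer context. Ideally Qwen2Attention would take a rope_scaling value like this and pass it on to its rotary embedding.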
Alternatives
No response
Additional context
No response