[Feature]: rope_scaling for qwen2

Question

HappyLynn opened this issue 17 days ago · comments

We found that qwen2 such as Qwen2Attention does not accept rope_scaling. However, we need to use yarn/ntk feature. Could you support that?

No response

No response