What are the prompt and response lengths in the paper?
JadeRay opened this issue · comments
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Repository from GitHub: https://github.com/deepseek-ai/DeepSeek-V2