deepseek-ai / DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Why DeepSeek-Coder-v2 236B is not trained with FIM objective?

wasiahmad opened this issue · comments

The paper mentions that DeepSeek-Coder-v2 236B is trained by only utilizing the Next-Token-Prediction objective. No FIM objective is used. Is there any reason not to use FIM?

The deepseek-coder-v2 236B model was not intended for code completion, so FIM (Fill-in-the-Middle) was not used.

Is it intended to use as instruction following LLM?

yes.