DeepSeek-Coder-V2-Lite model GPU/RAM requirement
HashedViking opened this issue
Sergey Zhdanov commented
Hi, thank you for the amazing work! In the readme you say "DeepSeek-Coder-V2 in BF16 format for inference, 80GB*8 GPUs are required".
How much GPU/RAM is needed for inference for DeepSeek-Coder-V2-Lite model?
Daya Guo commented
One 40GB GPU is required for inference in BF16 format.
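A rough back-of-envelope check suggests why ~40GB is the single-GPU floor for BF16. This sketch assumes roughly 16B total parameters for the Lite model (it is an MoE model, so all expert weights must be resident even though only a fraction are active per token); the exact count is an assumption, not from this thread.

```python
# Back-of-envelope GPU memory estimate for BF16 inference.
# Assumption: ~16e9 total parameters for DeepSeek-Coder-V2-Lite (MoE,
# so the full expert set must fit in VRAM, not just the active subset).

def weights_gib(n_params: float, bytes_per_param: int = 2) -> float:
    """Memory needed for the model weights alone, in GiB."""
    return n_params * bytes_per_param / 2**30

w = weights_gib(16e9)  # BF16 = 2 bytes per parameter
print(f"weights alone: {w:.1f} GiB")  # ~29.8 GiB before KV cache/activations
```

Weights alone come to roughly 30 GiB, so with KV cache and activations on top, a 40GB card is a comfortable fit while a single 24GB card is not.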
orderer0001 commented
> 40GB *1 GPUs in BF16 format

Won't 3 × 24GB 4090 GPUs work?
Daya Guo commented
> 40GB *1 GPUs in BF16 format
>
> 3*24G 4090 GPU doesn’t work?

With 3 × 24GB 4090 GPUs, you need to enable tensor parallelism (TP 2) or pipeline parallelism (PP 2) so the weights are sharded across cards. Alternatively, you can use a quantized version of the model on Ollama.
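As a concrete sketch of the two suggestions above, assuming vLLM as the serving framework (the thread does not name one) and the Hugging Face model ID `deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct`:

```shell
# Shard the model across 2 of the 3 GPUs with tensor parallelism
# (flags and model ID are illustrative, assuming vLLM).
vllm serve deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct \
    --tensor-parallel-size 2

# Or use pipeline parallelism instead:
vllm serve deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct \
    --pipeline-parallel-size 2

# Alternatively, run a quantized build via Ollama, which fits
# in far less VRAM than BF16:
ollama run deepseek-coder-v2
```

TP 2 splits each weight matrix across two GPUs, roughly halving per-GPU memory, which is why two 24GB cards can hold what one cannot.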