deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

About datasets

ftgreat opened this issue · comments

Hi, thank you for your great work!

Could you provide more details about the pretrain dataset?
How has the pretrain dataset been optimized in DeepSeek-V2 compared to the previous version, DeepSeek?

Thank you.