InternLM / InternLM

Official release of InternLM2 7B and 20B base and chat models. 200K context support

Home Page: https://internlm.intern-ai.org.cn/

A one-person petition: please explain how to clean data!

Jianfeng777 opened this issue

Describe the feature

I hope you can explain the methods used to clean high-quality datasets!

Would you like to implement this feature yourself?

  • I would like to implement this feature myself and contribute the code to InternLM!

I humbly second this!

We just saw the news at https://mp.weixin.qq.com/s/Pt02LXlh2Uu_hgM0ZL5GGg
and are very eager to learn about data cleaning!

Seconded!

Please refer to the technical report of WanJuan-CC: https://arxiv.org/abs/2402.19282