togethercomputer / RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

togethercomputer/RedPajama-Data Issues