Fernando Carranza's starred repositories

License:NOASSERTIONStargazers:1Issues:0Issues:0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4479Issues:0Issues:0

UD_Spanish-AnCora

Spanish data from the AnCora corpus.

License:NOASSERTIONStargazers:28Issues:0Issues:0
Language:JavaScriptStargazers:3Issues:0Issues:0