Fernando Carranza's starred repositories
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
UD_Spanish-AnCora
Spanish data from the AnCora corpus.
noamilei.github.io
Website
Buenos Aires, Argentina
https://sites.google.com/view/fernando-carranza/p%C3%A1gina-principal