datta-TG's starred repositories
GrammarGPT
The code and data for GrammarGPT.
OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
lm-spanish
Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).
C4_200M-synthetic-dataset-for-grammatical-error-correction
This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 using a tagged corruption model. The approach and the dataset are described in more detail by Stahlberg and Kumar (2021) (https://www.aclweb.org/anthology/2021.bea-1.4/)
data-science-on-aws
AI and Machine Learning with Kubeflow, Amazon EKS, and SageMaker