YenTing (Adam) Lin's starred repositories
ai-workshop-code
Code I wrote for my AI & LLM workshops
awesome-synthetic-datasets
awesome synthetic (text) datasets
cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
text-dedup
All-in-one text de-duplication
ml-engineering
Machine Learning Engineering Open Book
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
DL-Art-School
TorToiSe fine-tuning with DLAS
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
insanely-fast-whisper
Incredibly fast Whisper-large-v3
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.