Yen-Ting Lin's starred repositories
Open-Reasoning-Tasks
A comprehensive repository of reasoning tasks for LLMs (and beyond)
text-clustering
Easily embed, cluster and semantically label text datasets
flash-attention
Fast and memory-efficient exact attention
zh-tw-embedding-model-benchmark
使用繁體中文資料集做的 Embedding 模型評測
TWLLM-Tutor
Taiwan-LLM Tutor: Large Language Models for Taiwanese Secondary Education
DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
ai-workshop-code
Code I wrote for my AI & LLM workshops
awesome-synthetic-datasets
awesome synthetic (text) datasets
cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
text-dedup
All-in-one text de-duplication
ml-engineering
Machine Learning Engineering Open Book
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)