Sen's starred repositories
LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
TranslucentTB
A lightweight utility that makes the Windows taskbar translucent/transparent.
llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Triton-Puzzles
Puzzles for learning Triton
BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
openlogprobs
Extract full next-token probabilities via language model APIs
Model-Editing-Hurt
Model Editing Can Hurt General Abilities of Large Language Models