sungjun lee's starred repositories
llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
whisper-vits-svc
Core Engine of Singing Voice Conversion & Singing Voice Clone
LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
resource-stream
CUDA related news and material links
zsh-in-docker
Install Zsh, Oh My Zsh and plugins inside a Docker container with one line!
KICE_slayer_AI_Korean
수능 국어 1등급에 도전하는 AI
newspaper4k
📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
text-clustering
Easily embed, cluster and semantically label text datasets
wikipedia-markdown-generator
A simple python script to convert any Wikipedia article to Markdown.
text-anonymization
A guide to anonymize text effortlessly using Presidio, an open-source library developed by Microsoft.