Jungseob Lee's starred repositories
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
faster-whisper
Faster Whisper transcription with CTranslate2
LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
alignment-handbook
Robust recipes to align language models with human and AI preferences
CTranslate2
Fast inference engine for Transformer models
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
open-korean-instructions
언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.
LLM-Factuality-Survey
The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>
MSMARCO-Question-Answering
MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension and question answering
batched-chatgpt
Auto-retrying, Customizable, Easy-calling batched chatgpt library for researchers
internal_know_conf
Internal state know knowledge confliction