geronimi73's repositories
3090_shorts
minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
SD-minimal
my SD playground
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
deep_4_all
Courses a codes that I use to teach deeplearing
EQ-Bench
A benchmark for emotional intelligence in large language models
fsdp_qlora
Training LLMs with QLoRA + FSDP
geronimi73
Config files for my GitHub profile.
Latte
The official implementation of Latte: Latent Diffusion Transformer for Video Generation.
llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
trl
Train transformer language models with reinforcement learning.