mst272's starred repositories
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains large medical language models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Chatterbox
Chinese large language model
torchkeras
Pytorch❤️ Keras 😋😋
generative-models
Generative Models by Stability AI
how-to-train-tokenizer
How to train an LLM tokenizer
lilianweng.github.io
My personal page
OpenPrompt
An Open-Source Framework for Prompt-Learning.
annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
nlp-tutorial
NLP tutorial for beginners
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
fast-autoaugment
Official Implementation of 'Fast AutoAugment' in PyTorch.
Auto-Augment
Reproduction of paper: AutoAugment: Learning Augmentation Strategies from Data