zhouhai88's starred repositories
annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
flash-attention
Fast and memory-efficient exact attention
Awesome-Multimodal-Large-Language-Models
✨✨ Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
Megatron-LM
Ongoing research training transformer models at scale
improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
awesome_LLMs_interview_notes
LLMs interview notes and answers: this repository mainly collects interview questions and reference answers for large language model (LLM) algorithm engineers
PythonAiRoad
source code of some articles
lm-human-preference-details
RLHF implementation details of OAI's 2019 codebase
cuda_learning
learning how CUDA works