Gyeongchan-Yun's starred repositories

Awesome_LLM_System-PaperList

Since the emergence of ChatGPT in 2022, accelerating large language models has become increasingly important. Here is a list of papers on LLM acceleration, currently focused mainly on inference; related work will be added over time. Contributions are welcome!

Stars: 117 · Issues: 0

Megatron-Kwai

Ongoing research training transformer models at scale

Language: Python · License: NOASSERTION · Stars: 14 · Issues: 0

AMP

(NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.

Language: Python · Stars: 32 · Issues: 0

alpa

Training and serving large-scale neural networks with auto parallelization.

Language: Python · License: Apache-2.0 · Stars: 3013 · Issues: 0
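For flavor, a minimal sketch of how Alpa is typically used, based on its documented `@alpa.parallelize` decorator; the toy linear model and SGD update are placeholders of mine, and a real run needs Alpa installed and its devices initialized:

```python
import jax
import jax.numpy as jnp
import alpa

# alpa.parallelize traces the step function and searches for a combined
# intra-operator (sharding) and inter-operator (pipeline) plan.
@alpa.parallelize
def train_step(params, batch):
    def loss_fn(p):
        pred = jnp.dot(batch["x"], p["w"]) + p["b"]   # toy linear model
        return jnp.mean((pred - batch["y"]) ** 2)
    grads = jax.grad(loss_fn)(params)
    # plain SGD update, just to keep the sketch self-contained
    return jax.tree_util.tree_map(lambda p, g: p - 0.1 * g, params, grads)
```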

Megatron-LM

Artifact for DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines

Language: Python · License: NOASSERTION · Stars: 1 · Issues: 0

python-patterns

A collection of design patterns/idioms in Python

Language: Python · Stars: 39910 · Issues: 0

awesome-AI-system

Papers and their accompanying code for AI systems

Stars: 187 · Issues: 0

zero-bubble-pipeline-parallelism

Zero Bubble Pipeline Parallelism

Language: Python · License: NOASSERTION · Stars: 228 · Issues: 0
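The enabling observation, sketched below in plain PyTorch: a layer's backward pass can be split into B (input gradient, needed immediately by the upstream stage) and W (weight gradient, deferrable to fill bubbles). This only illustrates the split, not the repo's scheduler:

```python
import torch
import torch.nn.functional as F

x = torch.randn(8, 16, requires_grad=True)   # stage input (activation)
w = torch.randn(4, 16, requires_grad=True)   # stage weight
loss = F.linear(x, w).sum()

# B step: input gradient only, so the previous stage can start its backward.
grad_x, = torch.autograd.grad(loss, x, retain_graph=True)

# W step: weight gradient, computed later at a time the scheduler picks
# to fill what would otherwise be a pipeline bubble.
grad_w, = torch.autograd.grad(loss, w)
```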

optimizing-multitask-training-through-dynamic-pipelines

Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines

Language: Python · License: Apache-2.0 · Stars: 11 · Issues: 0

Optimus-CC

[ASPLOS'23] Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression

License: MIT · Stars: 3 · Issues: 0
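As a rough illustration of the communication-compression idea, a toy top-k sparsifier (standing in for the paper's compressors; all names here are mine):

```python
import torch

def topk_compress(grad, ratio=0.01):
    """Keep only the largest-magnitude 'ratio' fraction of gradient entries;
    the (indices, values) pair is what would actually be communicated."""
    k = max(1, int(grad.numel() * ratio))
    flat = grad.flatten()
    idx = flat.abs().topk(k).indices
    return idx, flat[idx]

def topk_decompress(idx, vals, shape):
    """Scatter the received values back into a dense zero tensor."""
    out = torch.zeros(shape)
    out.view(-1)[idx] = vals
    return out
```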

Hetu-Galvatron

Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you are interested, please visit, star, or fork https://github.com/PKU-DAIR/Hetu-Galvatron

Language: Python · Stars: 9 · Issues: 0

HexGen

Serving LLMs on heterogeneous decentralized clusters.

Language: Python · License: Apache-2.0 · Stars: 11 · Issues: 0

any-precision-llm

[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

Language: Python · License: MIT · Stars: 33 · Issues: 0
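The core trick, sketched with NumPy (illustrative only): a single quantized parent model serves every lower precision by dropping least-significant bits, so no per-bit-width checkpoints are needed.

```python
import numpy as np

def child_model(parent_q8, target_bits):
    """Derive a lower-precision model from 8-bit parent weights by
    truncating least-significant bits (keeping the top target_bits)."""
    return (parent_q8 >> (8 - target_bits)).astype(np.uint8)

parent = np.random.randint(0, 256, size=(4, 4), dtype=np.uint8)  # toy weights
q4 = child_model(parent, 4)   # 4-bit variant
q3 = child_model(parent, 3)   # 3-bit variant, same stored parent
```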

AdaQP

Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training

Language: Python · License: MIT · Stars: 18 · Issues: 0
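A toy sketch of the message-quantization half of the idea, uniform quantization of boundary-node features before cross-partition communication (the adaptive bit assignment and parallelization are not shown):

```python
import torch

def quantize_message(x, n_bits=4):
    """Uniformly quantize a feature tensor to n_bits (<= 8) integers."""
    lo, hi = x.min(), x.max()
    scale = (hi - lo).clamp(min=1e-8) / (2 ** n_bits - 1)
    q = torch.round((x - lo) / scale).to(torch.uint8)
    return q, lo, scale          # q plus two scalars cross the wire

def dequantize_message(q, lo, scale):
    return q.float() * scale + lo
```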

H2O

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Language: Python · Stars: 322 · Issues: 0
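For intuition, a minimal sketch of heavy-hitter KV-cache eviction (tensor shapes and names are illustrative, not the repo's API): keep the most recent tokens, then spend the rest of the budget on tokens that have accumulated the most attention.

```python
import torch

def h2o_style_evict(keys, values, acc_attn, budget=256, recent=32):
    """keys/values: (seq_len, heads, dim) cache; acc_attn: (seq_len,)
    attention mass each cached token has received so far."""
    seq_len = keys.shape[0]
    if seq_len <= budget:
        return keys, values, acc_attn
    keep = torch.zeros(seq_len, dtype=torch.bool)
    keep[-recent:] = True                          # always keep recent tokens
    scores = acc_attn.masked_fill(keep, float("-inf"))
    heavy = scores.topk(budget - recent).indices   # heavy hitters fill the rest
    keep[heavy] = True
    return keys[keep], values[keep], acc_attn[keep]
```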

ML-Papers-Explained

Explanations of key concepts in ML

Stars: 6794 · Issues: 0

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

Stars: 9418 · Issues: 0

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language: Python · License: MIT · Stars: 2142 · Issues: 0
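The gist, as a toy fake-quantization sketch (per-channel calibration statistics are assumed given; this is not the repo's kernel path): salient input channels, identified by activation magnitude, are scaled up before quantization so rounding error hurts them less.

```python
import torch

def awq_style_fake_quant(weight, act_scale, n_bits=4, alpha=0.5):
    """weight: (out, in) linear weight; act_scale: (in,) mean |activation|
    per input channel from calibration data. Returns a dequantized weight."""
    s = act_scale.clamp(min=1e-5) ** alpha        # protect salient channels
    w = weight * s
    qmax = 2 ** (n_bits - 1) - 1
    step = w.abs().amax(dim=1, keepdim=True) / qmax
    q = torch.clamp(torch.round(w / step), -qmax - 1, qmax)
    return (q * step) / s                          # fold the scale back out
```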

Temporal_Fusion_Transform

PyTorch implementation of Google's TFT (Temporal Fusion Transformer)

Language: Jupyter Notebook · Stars: 234 · Issues: 0

tft-pytorch

PyTorch Temporal Fusion Transformers

Language: Python · Stars: 3 · Issues: 0

tft-torch

A Python library that implements "Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting"

Language: Python · License: MIT · Stars: 101 · Issues: 0
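Common to all three TFT implementations above is the gated residual network (GRN); a minimal PyTorch sketch of that block, simplified to omit the optional context input and dimension projection:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GatedResidualNetwork(nn.Module):
    """ELU MLP whose output passes through a GLU gate, then is added back
    to the input with LayerNorm, letting the model learn to skip the block."""
    def __init__(self, d_model, d_hidden):
        super().__init__()
        self.fc1 = nn.Linear(d_model, d_hidden)
        self.fc2 = nn.Linear(d_hidden, d_model)
        self.gate = nn.Linear(d_model, 2 * d_model)  # GLU halves width again
        self.norm = nn.LayerNorm(d_model)

    def forward(self, x):
        h = self.fc2(F.elu(self.fc1(x)))
        h = F.glu(self.gate(h), dim=-1)
        return self.norm(x + h)
```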

QSync

Official repository for the IPDPS '24 paper "QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".

Language: C++ · License: MIT · Stars: 19 · Issues: 0

llm-papers

List of Large Language Model Papers

License: Apache-2.0 · Stars: 50 · Issues: 0

Megatron-LLaMA

Best practice for training LLaMA models in Megatron-LM

Language: Python · License: NOASSERTION · Stars: 570 · Issues: 0
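For context, a single-process sketch of the Megatron-style tensor parallelism these best practices build on: the output dimension of a linear layer is split across ranks, each holding one weight shard (the cross-rank communication is left as a comment):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ColumnParallelLinear(nn.Module):
    """Toy single-process sketch of a column-parallel linear layer."""
    def __init__(self, in_features, out_features, world_size):
        super().__init__()
        assert out_features % world_size == 0
        self.weight = nn.Parameter(
            torch.empty(out_features // world_size, in_features).normal_(std=0.02))

    def forward(self, x):
        # Real Megatron would all-gather the per-rank outputs (or feed them
        # into a row-parallel layer); here we just return this rank's shard.
        return F.linear(x, self.weight)
```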