ymjiang

followers

following

stars

AML@ByteDance

Organizations

bytedance

dmlc

Yimin Jiang's starred repositories

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language:PythonApache-2.0170900

MS-AMP

Microsoft Automatic Mixed Precision Library

Language:PythonMIT51000

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookApache-2.0221600

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause1350400

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonMIT445100

Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

Language:MDXMIT4789300

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonMIT3632700

NM-sparsity

Language:Python21200

PiPPy

Pipeline Parallelism for PyTorch

Language:PythonBSD-3-Clause71200

veGiantModel

Language:PythonApache-2.020500

pcm

Intel® Performance Counter Monitor (Intel® PCM)

Language:C++BSD-3-Clause274500

fedlearner

A multi-party collaborative machine learning framework

Language:PythonApache-2.089200

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonNOASSERTION1006400

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.03484300

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language:PythonMIT877000

incubator-mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Language:C++Apache-2.0400

byteps

A high performance and generic framework for distributed DNN training

Language:PythonNOASSERTION361800

ps-lite

A lightweight parameter server interface

Language:C++Apache-2.07200

horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Language:PythonNOASSERTION1416900