Yimin Jiang's starred repositories
flash-attention
Fast and memory-efficient exact attention
Prompt-Engineering-Guide
š Guides, papers, lecture, notebooks and resources for prompt engineering
fedlearner
A multi-party collaborative machine learning framework
Megatron-LM
Ongoing research training transformer models at scale
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more