powergiant's starred repositories

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python · License: Apache-2.0 · Stars: 34,965
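
A minimal sketch of how a training script hands a PyTorch model to DeepSpeed; the toy model and config values are illustrative assumptions, not recommendations. The engine returned by `deepspeed.initialize` takes over data parallelism, mixed precision, and ZeRO sharding, and scripts like this are normally launched with the `deepspeed` CLI so each rank's process group is set up.

```python
# Illustrative sketch, not a recommended config: hand a plain PyTorch model
# to DeepSpeed and let the returned engine drive training.
import torch
import deepspeed

model = torch.nn.Linear(512, 512)  # stand-in for a real network

ds_config = {
    "train_micro_batch_size_per_gpu": 4,
    "fp16": {"enabled": True},           # mixed precision (needs a GPU)
    "zero_optimization": {"stage": 2},   # ZeRO-2: shard optimizer state + grads
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}

model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)

x = torch.randn(4, 512).to(model_engine.device)
loss = model_engine(x).pow(2).mean()
model_engine.backward(loss)  # engine-managed backward (handles loss scaling)
model_engine.step()          # engine-managed optimizer step
```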

Megatron-LM

Ongoing research on training transformer models at scale.

Language: Python · License: NOASSERTION · Stars: 10,164
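
The core idea Megatron-LM is built around, tensor parallelism, is easy to see in miniature. This is not Megatron code: it simulates two "ranks" in one process, splitting the first weight matrix by columns and the second by rows so that a single all-reduce (a plain sum here) recovers the unsharded result.

```python
# Single-process illustration of Megatron-style tensor parallelism
# (not the library's code): column-parallel then row-parallel linears.
import torch

x = torch.randn(4, 8)     # activations, replicated on every rank
w1 = torch.randn(8, 16)   # first weight: split by columns
w2 = torch.randn(16, 8)   # second weight: split by rows

w1_shards = w1.chunk(2, dim=1)
w2_shards = w2.chunk(2, dim=0)

# Each "rank" computes its partial product independently; in the real
# library an element-wise nonlinearity can sit between the two matmuls
# without any extra communication.
partials = [x @ a @ b for a, b in zip(w1_shards, w2_shards)]

# One all-reduce (here a plain sum) restores the full computation.
y = sum(partials)
assert torch.allclose(y, x @ w1 @ w2, atol=1e-5)
```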

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Language: Python · License: Apache-2.0 · Stars: 6,856

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python · License: MIT · Stars: 36,499
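
Much of nanoGPT's appeal is that its training loop fits on one screen. Here is a toy sketch of that pattern, not the repo's actual code (the two-layer model stands in for its GPT class): sample random contiguous blocks from a token array and train on the shifted-by-one next-token objective.

```python
# Sketch of the nanoGPT-style training loop pattern (not the repo's code).
import torch

block_size, batch_size, vocab = 64, 16, 100
data = torch.randint(0, vocab, (10_000,))  # stand-in for an encoded corpus

def get_batch():
    ix = torch.randint(len(data) - block_size - 1, (batch_size,))
    x = torch.stack([data[i : i + block_size] for i in ix])
    y = torch.stack([data[i + 1 : i + 1 + block_size] for i in ix])  # shift by one
    return x, y

model = torch.nn.Sequential(            # stand-in for the repo's GPT class
    torch.nn.Embedding(vocab, 128),
    torch.nn.Linear(128, vocab),
)
opt = torch.optim.AdamW(model.parameters(), lr=3e-4)

for step in range(100):
    x, y = get_batch()
    logits = model(x)                   # (batch, block, vocab)
    loss = torch.nn.functional.cross_entropy(
        logits.view(-1, logits.size(-1)), y.view(-1)
    )
    opt.zero_grad(); loss.backward(); opt.step()
```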

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

License: Apache-2.0 · Stars: 7,355
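
OpenLLaMA ships its weights in the standard Hugging Face format, so loading is a few lines of `transformers`. A hedged sketch: the hub id below is the one the project published for the 7B model (treat it as an assumption if it has moved), and the slow tokenizer is used because the auto-converted fast one reportedly mis-tokenizes.

```python
# Sketch: loading OpenLLaMA 7B via transformers. Hub id assumed to be the
# project's published "openlm-research/open_llama_7b".
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "openlm-research/open_llama_7b"
tokenizer = AutoTokenizer.from_pretrained(name, use_fast=False)  # slow tokenizer per project notes
model = AutoModelForCausalLM.from_pretrained(name)

inputs = tokenizer("The RedPajama dataset is", return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```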

EasyLM

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.

Language: Python · License: Apache-2.0 · Stars: 2,377
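
This is not EasyLM's own API, but a generic JAX/Flax training step in the style such libraries are built on: pure functions, explicit parameter and optimizer state, and `jax.jit` over the whole step. The model, sizes, and hyperparameters are toy assumptions.

```python
# Generic JAX/Flax next-token training step (illustrative, not EasyLM's API).
import jax, jax.numpy as jnp
import flax.linen as nn
import optax

class TinyLM(nn.Module):
    vocab: int = 100
    @nn.compact
    def __call__(self, tokens):
        h = nn.Embed(self.vocab, 64)(tokens)
        return nn.Dense(self.vocab)(h)  # next-token logits

model = TinyLM()
rng = jax.random.PRNGKey(0)
tokens = jax.random.randint(rng, (8, 16), 0, 100)
params = model.init(rng, tokens)
tx = optax.adamw(3e-4)
opt_state = tx.init(params)

@jax.jit
def train_step(params, opt_state, tokens):
    def loss_fn(p):
        logits = model.apply(p, tokens[:, :-1])
        labels = tokens[:, 1:]
        return optax.softmax_cross_entropy_with_integer_labels(
            logits, labels
        ).mean()
    loss, grads = jax.value_and_grad(loss_fn)(params)
    updates, opt_state = tx.update(grads, opt_state, params)
    return optax.apply_updates(params, updates), opt_state, loss

params, opt_state, loss = train_step(params, opt_state, tokens)
```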

information-bottleneck

A demonstration of the information bottleneck theory for deep learning.

Language: Jupyter Notebook · Stars: 58
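
Demonstrations in this line of work track the mutual information between a layer's activations T and the labels Y over training. A self-contained sketch of the standard binning estimator (not this repo's code): discretize activations, treat each sample's binned vector as one symbol, and read off entropies; for a deterministic network, I(T;X) reduces to H(T).

```python
# Binning estimator for information-plane quantities (sketch, not repo code).
import numpy as np

def _entropy_bits(symbols):
    _, counts = np.unique(symbols, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log2(p)).sum())

def information_plane(t, y, n_bins=30):
    """Estimate (I(T;X), I(T;Y)) in bits.

    t: (n_samples, n_units) activations of one layer; y: integer labels.
    For a deterministic network T = f(X), so I(T;X) = H(T) after binning.
    """
    edges = np.linspace(t.min(), t.max(), n_bins + 1)
    t_disc = np.digitize(t, edges[1:-1])                   # bin index per unit
    t_sym = np.array([hash(r.tobytes()) for r in t_disc])  # one symbol per sample

    h_t = _entropy_bits(t_sym)                             # = I(T;X)
    ty_sym = np.array([hash((int(a), int(b))) for a, b in zip(t_sym, y)])
    i_ty = h_t + _entropy_bits(y) - _entropy_bits(ty_sym)  # H(T)+H(Y)-H(T,Y)
    return h_t, i_ty

# Usage with random stand-in data:
print(information_plane(np.tanh(np.random.randn(1000, 10)),
                        np.random.randint(0, 2, 1000)))
```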

mean-field-theory-deep-learning

Paper lists and resources on the mean-field theory of deep learning.

License: MIT · Stars: 75

Compositional_Deep_Learning

Deep learning via category theory and functional programming

Language: Haskell · License: MIT · Stars: 138

RL-Theory-book

A reinforcement learning theory book on the foundations of deep RL algorithms, with proofs.

Language: TeX · Stars: 269

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools so that you can focus on what matters.

Language: Python · License: MIT · Stars: 167,210