Zhenghui Jin's repositories
incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with a Dynamic, Mutation-aware Dataflow Dependency Scheduler; for Python, R, Julia, Scala, Go, JavaScript, and more
alpa
Auto parallelization for large-scale neural networks
amazon-eks-ami
Packer configuration for building a custom EKS AMI
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
array-api-tests
Test suite for the PyData Array API standard
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
aws-efa-eks
Deploying P4d instances in EKS utilizing GPUDirect RDMA over EFA
aws-efa-nccl-baseami-pipeline
Packer and CodeBuild/CodePipeline files for building EFA/NCCL base AMIs, plus base Docker build files to enable EFA/NCCL in containers
d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 175 universities.
dcgm-exporter
NVIDIA GPU metrics exporter for Prometheus leveraging DCGM
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
elastic
PyTorch elastic training
gluon-nlp
NLP made easy with MXNet Gluon
horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
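As a minimal, hedged sketch of how Horovod is typically wired into an existing PyTorch training setup (the hvd.* calls are Horovod's documented API; the toy model and optimizer around them are placeholders):

    import torch
    import horovod.torch as hvd

    hvd.init()  # one process per GPU; rank/size come from the launcher (horovodrun/mpirun)
    if torch.cuda.is_available():
        torch.cuda.set_device(hvd.local_rank())

    model = torch.nn.Linear(10, 1)
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01 * hvd.size())

    # Average gradients across workers and start all ranks from identical weights.
    optimizer = hvd.DistributedOptimizer(optimizer, named_parameters=model.named_parameters())
    hvd.broadcast_parameters(model.state_dict(), root_rank=0)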
jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
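The transformations named above (differentiate, vectorize, JIT) compose on ordinary NumPy-style functions via jax.grad, jax.vmap, and jax.jit; a small illustrative sketch:

    import jax
    import jax.numpy as jnp

    def loss(w, x):
        # plain Python + jnp code, no framework-specific layers
        return jnp.sum((x @ w) ** 2)

    grad_loss = jax.grad(loss)                        # differentiate w.r.t. w
    fast_grad = jax.jit(grad_loss)                    # compile via XLA for CPU/GPU/TPU
    batched_loss = jax.vmap(loss, in_axes=(None, 0))  # vectorize over a batch of x

    w = jnp.ones(3)
    xs = jnp.ones((4, 3))
    print(fast_grad(w, xs[0]))   # gradient, shape (3,)
    print(batched_loss(w, xs))   # per-example losses, shape (4,)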
kaldi
Speech recognition toolkit; kaldi-asr/kaldi is the official location of the Kaldi project.
mxnet_recipes
MXNet conda recipes
nccl
Optimized primitives for collective multi-GPU communication
NeMo
NeMo: a toolkit for conversational AI
python-etcd
A Python client for etcd
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
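A minimal sketch of the tensor-plus-autograd workflow that one-liner refers to (GPU use is optional here and only illustrative):

    import torch

    device = "cuda" if torch.cuda.is_available() else "cpu"

    # Dynamic graph: operations are recorded as they run.
    x = torch.randn(8, 3, device=device)
    w = torch.randn(3, 1, device=device, requires_grad=True)

    loss = ((x @ w) ** 2).mean()
    loss.backward()              # gradients accumulate in w.grad
    print(w.grad.shape)          # torch.Size([3, 1])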
pytorch-lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
ray_lightning
PyTorch Lightning distributed accelerators using Ray
staged-recipes
A place to submit conda recipes before they become fully fledged conda-forge feedstocks
torchx
TorchX is a library containing standard DSLs for authoring and running PyTorch-related components as part of an end-to-end production ML pipeline
training-operator
Kubernetes operators for running distributed ML training jobs.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
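For a sense of the library's high-level API, a hedged sketch using the pipeline helper (which downloads a default pretrained model on first use; the printed output is illustrative):

    from transformers import pipeline

    classifier = pipeline("sentiment-analysis")
    print(classifier("Distributed training finally works end to end."))
    # e.g. [{'label': 'POSITIVE', 'score': 0.99...}]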
xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
xla
Enabling PyTorch on XLA Devices (e.g. Google TPU)