Deepak Narayanan's repositories
tensorflow-benchmarks
Benchmark code
wiki_search
Wikipedia search engine
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
covid19
Some scripts to analyze COVID-19 data
DeepLearningExamples
Deep Learning Examples
Halide
a language for fast, portable data-parallel computation
mxnet-resnet
Reproduce ResNet-v2(Identity Mappings in Deep Residual Networks) with MXNet
pytorch-cifar
95.16% on CIFAR10 with PyTorch
tensorflow
Computation using data flow graphs for scalable machine learning
tensorflow-models
Models built with TensorFlow
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.