Siddharth Singh's repositories
Masters-Project-Thesis
Title of thesis - PySchedCL: A Framework for Automatically Exploiting Concurrency in Heterogeneous Data-Parallel Applications
wait-free-backprop
Course Project of CMSC818X : Introduction to Parallel Computing at University of Maryland, College Park
axonn
A parallel framework for training deep neural networks
CMSC764-UMD
Advanced numerical optimization course assignments
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
DeepSpeedExamples
Example models using DeepSpeed
examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
fast_adversarial
[ICLR 2020] A repository for extremely fast adversarial training using FGSM
FastRFS
Fast Robinson Foulds Supertrees
hatchet
Graph-indexed Pandas DataFrames for analyzing hierarchical performance data
hipBLAS
ROCm BLAS marshalling library
join-order-benchmark
Join Order Benchmark (JOB)
Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
nccl4py
Python Extensions for Nvidia's Collective Communication Library (NCCL)
phylokit
C++ library for high performance phylogenetics
phylonaut
Dynamic programming for phylogenetics applications
siddharth9820.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
stylegan-xl
[SIGGRAPH'22] StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets
torch_sputnik
PyTorch interface for Sputnik, A Library for Sparse Matrix Multiplication on GPUs
vision
Datasets, Transforms and Models specific to Computer Vision