Ryan Spring (rdspring1)

Company: Rice University; @RUSH-LAB; @Nvidia

Location: Santa Clara

Home Page: https://www.linkedin.com/in/rdspring1

Twitter: @ryanspring13


Organizations
RUSH-LAB

Ryan Spring's repositories

PyTorch_GBW_LM

PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset

Language: Python | License: Apache-2.0 | Stargazers: 122 | Issues: 6 | Issues: 15

LSH_DeepLearning

Scalable and Sustainable Deep Learning via Randomized Hashing

Language: Java | License: Apache-2.0 | Stargazers: 91 | Issues: 13 | Issues: 3

Count-Sketch-Optimizers

A compressed adaptive optimizer for training large-scale deep learning models using PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 26 | Issues: 4 | Issues: 0

MISSION

MISSION: Ultra Large-Scale Feature Selection using Count-Sketches

Language: C++ | License: Apache-2.0 | Stargazers: 13 | Issues: 6 | Issues: 7

LSH-Mutual-Information

Use LSH Sampling for Mutual Information Estimation

Language: Python | License: Apache-2.0 | Stargazers: 5 | Issues: 2 | Issues: 0

comp450-Reachability-Guided-RRT

Use dynamic constraints to sample plausible states for the RRT algorithm, improving robot motion planning

comp450-planning_under_uncertainty

Motion planning for a steerable needle under action uncertainty

Language: C++ | Stargazers: 2 | Issues: 2 | Issues: 0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 2 | Issues: 0

RzLinear

A compressed alternative to matrix multiplication using the state-of-the-art compression technique ROBE-Z

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

lightning-thunder

Source-to-source compiler for PyTorch. It makes PyTorch programs faster both on single accelerators and in distributed settings.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

atari-representation-learning

Code for "Unsupervised State Representation Learning in Atari"

Language: Python | License: MIT | Stargazers: 0 | Issues: 2 | Issues: 0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Autopilot-TensorFlow

A TensorFlow implementation, with some changes, of this Nvidia paper: https://arxiv.org/pdf/1604.07316.pdf

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 3 | Issues: 0

cs231n

Solutions to Stanford CS231n Spring 2018 Course Assignments.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

cuda-training-series

Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)

Language: Cuda | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

mongoose

A Learnable LSH Framework for Efficient NN Training

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

NvFuser

A Fusion Code Generator for NVIDIA GPUs

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

nvprims-torchdynamo

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

Optimizing-DGEMM-on-Intel-CPUs-with-AVX512F

Stepwise optimizations of DGEMM on CPU, eventually reaching performance faster than Intel MKL, even under multithreading.

Language: C | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

Optimizing-DGEMV-on-Intel-CPUs

Highly optimized DGEMV on CPU with both serial and parallel performance better than MKL and OpenBLAS.

Language: C | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

Optimizing-SGEMM-on-NVIDIA-Turing-GPUs

Optimizing SGEMM kernel functions on NVIDIA GPUs to close-to-cuBLAS performance.

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

twitter-algorithm-ml

Source code for Twitter's Recommendation Algorithm

Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

vector-search-class-notes

Class notes for the course "Long Term Memory in AI - Vector Search and Databases" COS 495 @ Princeton Fall 2023

Language: TeX | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

xla

Enabling PyTorch on Google TPU

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0