shreyansh26

Shreyansh Singh's repositories

Annotated-ML-Papers

Annotations of the interesting ML papers I read

FlashAttention-PyTorch

Implementation of FlashAttention in PyTorch

Language:PythonMIT86 20

Speculative-Sampling

Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by Deepmind

Language:PythonMIT54 20

Red-Teaming-Language-Models-with-Language-Models

A re-implementation of the "Red Teaming Language Models with Language Models" paper by Perez et al., 2022

Language:Python19 20

DBMS-Project

DBMS Project (CS361) for Somnath Sevashram

Language:Java6 40

shreyansh26.github.io

My personal website

Language:HTML5 30

LLM-Activation-Steering-Experiments

Some experiments with activation steering in LLMs

Language:Python2 30

ML-Paper-Implementations

Implementations of some interesting ML research papers and projects.

2 30

RAG-ML-Engg-Open-Book

Query the ML Engineering Open Book using RAG

Language:Python200

shreyansh26

2 20

Tensor-Puzzles-Solutions

Solutions to Tensor puzzles by Sasha Rush - https://github.com/srush/Tensor-Puzzles

Language:Jupyter NotebookMIT2 20

VAE-Implementation

A simple implementation of Autoencoder and Variational Autoencoder

Language:Jupyter Notebook2 30

Weekend-Projects

Small and interesting projects done in my free time.

Language:PythonMIT2 30

CS344-Parallel-Programming-Solutions

Language:C++1 30

CTF-Writeups

CTF (Capture The Flag) writeups, code snippets, notes, scripts

Language:Java1 20

Experiments-with-the-Neural-Tangent-Kernel

Learning more about the NTK through existing paper implementations

Language:Jupyter Notebook1 30

Gradient-Descent-on-Neural-Networks-Typically-Occurs-at-the-Edge-of-Stability

A re-implementation of the paper "Gradient Descent on Neural Networks Typically Occurs at the Edge of Stability" by Cohen et al.

Language:Jupyter Notebook1 40

hydragen-attention

An implementation of the core attention algorithm in the paper "Hydragen: High-Throughput LLM Inference with Shared Prefixes".

Language:Python1 30

Long-Context-Biencoder

A bi-encoder (sentence/paragraph embedding) model which can work with sequence lengths upto 1024 tokens.

Language:Python1 20

Lottery-Ticket-Hypothesis

A re-implementation of the paper "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks" by Jonathan Frankle and Michael Carbin

Language:Jupyter Notebook1 40

Personal-Website

Language:HTMLMIT1 30

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

Language:Python010

LLM-Training-Puzzles-Solutions

The LLM Training Puzzles by Sasha Rush

Language:Jupyter NotebookMIT020

MLSys-Experiments

A collection of scripts on experimenting and implementing MLSys-related stuff

Language:Jupyter Notebook020

nanoGPT-kvcache

A fork of nanoGPT which uses KV Cache to do faster inference

Language:PythonMIT010

resource-stream

CUDA related news and material links

MIT000

SocketIO_Chat

Language:HTML020

Solving-Substitution-Ciphers-using-MCMC

Solving substitution ciphers using Markov Chain Monte Carlo (MCMC)

Language:Python030

Transformer-Puzzles-Solutions

Solutions to the Transformer Puzzles by Sasha Rush - https://github.com/srush/Transformer-Puzzles

Language:Jupyter NotebookMIT020

Wargames

Wargames exploit or writeups

Language:Python030