Peter Kim's repositories
flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
insta-chat
DIY Instagram Chat Automation with Google Sheets
mixed-precision-from-scratch
Mixed precision training from scratch with Tensors and CUDA
paged-attention-minimal
a minimal cache manager for PagedAttention, on top of llama3.
attentive-reader
TensorFlow-based implementation of Stanford's Attentive Reader for the CNN/DailyMail datatset
lucene-wikipedia
Lucene-based Multi-sentence context retriever for open-domain QA setting
tweet-emoji-predictor
Tweet Emoji Predictor - SemEval 2018 Task 2
carla-driver
Autonomous Driving Agent for CARLA
Hemorrhage-Predictor-from-MRIs
Accurate Hemorrhage Predictor using MRIs
PDB-Evaluator
A Python-based inference system for Probabilistic Databases
positioning
RNN based prediction of wearer's position given sensor data from Android Wear
style-transferer
Lightweight style transferer built with SqueezeNet
incubator-eagle
Mirror of Apache Eagle (Incubating)
ml-algorithms
from the scratch
policy-gradient
vanilla policy gradient on pong and maybe more
tspeterkim.github.io
thanks for the free compute, github
uvicorn-gunicorn-fastapi-docker
Docker image with Uvicorn managed by Gunicorn for high-performance FastAPI web applications in Python 3.6 and above with performance auto-tuning. Optionally with Alpine Linux.