Suyeon Lee's starred repositories
linux-network-performance-parameters
Learn where some of the network sysctl variables fit into the Linux/Kernel network flow. Translations: 🇷🇺
Awesome-LLM-Inference
📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
ebpf-beginners
The beginner's guide to eBPF
ramulator
A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, WIOx, HBMx, and various academic proposals. Described in the IEEE CAL 2015 paper by Kim et al. at http://users.ece.cmu.edu/~omutlu/pub/ramulator_dram_simulator-ieee-cal15.pdf
LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
TheArtofHPC_pdfs
All pdfs of Victor Eijkhout's Art of HPC books and courses
serverless-store-demo
A web e-commerce demo app showcasing serverless capabilities of Google Cloud Platform.
awesome-vector-database
A curated list of awesome works related to high dimensional structure/vector search & database
llm-scheduling-artifact
Artifact of OSDI '24 paper, “Llumnix: Dynamic Scheduling for Large Language Model Serving”
traindb-model
ML model library for TrainDB
quantumComputing
An up-to-date, comprehensive repository of quantum computing resources. It contains all the material I use for my research on quantum computing, covering both theory and code. I update it regularly.
cache-coherence-protocol-bench
Benchmarking code for evaluating the cost of cache coherence protocols implemented on different platforms