Chen Gong's starred repositories
flash-attention
Fast and memory-efficient exact attention
publications
Publications from Trail of Bits
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
include-what-you-use
A tool for use with clang to analyze #includes in C and C++ source files
awesome-machine-learning-in-compilers
Must-read research papers and links to tools and datasets related to using machine learning for compilers and systems optimisation
segment-anything-fast
A batched, offline-inference-oriented version of segment-anything
paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
Learn-LLVM-17
Learn LLVM 17, published by Packt
DirectXShaderCompiler
This repo hosts the source for the DirectX Shader Compiler which is based on LLVM/Clang.
CppCon2023
Slides and other materials from CppCon 2023
ML-YouTube-Courses
📺 Discover the latest machine learning / AI courses on YouTube.
annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks