Chen Gong's starred repositories

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:12704Issues:0Issues:0

breakpad

Mirror of Google Breakpad project

Language:C++License:NOASSERTIONStargazers:2584Issues:0Issues:0

remill

Library for lifting machine code to LLVM bitcode

Language:C++License:Apache-2.0Stargazers:1238Issues:0Issues:0

publications

Publications from Trail of Bits

Language:PythonLicense:CC-BY-SA-4.0Stargazers:1382Issues:0Issues:0

HVM

A massively parallel, optimal functional runtime in Rust

Language:CudaLicense:Apache-2.0Stargazers:10339Issues:0Issues:0

ir

Lightweight JIT Compilation Framework

Language:CLicense:MITStargazers:334Issues:0Issues:0

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonLicense:MITStargazers:5841Issues:0Issues:0

xed

The X86 Encoder Decoder (XED), is a software library for encoding and decoding X86 (IA32 and Intel64) instructions

Language:PythonLicense:Apache-2.0Stargazers:1383Issues:0Issues:0

zydis

Fast and lightweight x86/x86-64 disassembler and code generation library

Language:CLicense:MITStargazers:3323Issues:0Issues:0

abi-aa

Application Binary Interface for the Armยฎ Architecture

Language:HTMLLicense:NOASSERTIONStargazers:883Issues:0Issues:0

include-what-you-use

A tool for use with clang to analyze #includes in C and C++ source files

Language:C++License:NOASSERTIONStargazers:4018Issues:0Issues:0

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++License:Apache-2.0Stargazers:5839Issues:0Issues:0

flax

Flax is a neural network library for JAX that is designed for flexibility.

Language:PythonLicense:Apache-2.0Stargazers:5852Issues:0Issues:0

awesome-machine-learning-in-compilers

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

License:CC0-1.0Stargazers:1405Issues:0Issues:0

segment-anything-fast

A batched offline inference oriented version of segment-anything

Language:PythonLicense:Apache-2.0Stargazers:1157Issues:0Issues:0

paper-reading

ๆทฑๅบฆๅญฆไน ็ปๅ…ธใ€ๆ–ฐ่ฎบๆ–‡้€ๆฎต็ฒพ่ฏป

License:Apache-2.0Stargazers:25372Issues:0Issues:0

Learn-LLVM-17

Learn LLVM 17, published by Packt

Language:C++License:MITStargazers:97Issues:0Issues:0

DirectXShaderCompiler

This repo hosts the source for the DirectX Shader Compiler which is based on LLVM/Clang.

Language:C++License:NOASSERTIONStargazers:3005Issues:0Issues:0

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:13504Issues:0Issues:0

Fuser

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

Language:C++License:NOASSERTIONStargazers:239Issues:0Issues:0

CppCon2023

Slides and other materials from CppCon 2023

Stargazers:257Issues:0Issues:0

ML-YouTube-Courses

๐Ÿ“บ Discover the latest machine learning / AI courses on YouTube.

License:CC0-1.0Stargazers:14598Issues:0Issues:0

annotated_deep_learning_paper_implementations

๐Ÿง‘โ€๐Ÿซ 60 Implementations/tutorials of deep learning papers with side-by-side notes ๐Ÿ“; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ๐ŸŽฎ reinforcement learning (ppo, dqn), capsnet, distillation, ... ๐Ÿง 

Language:PythonLicense:MITStargazers:52414Issues:0Issues:0

TTS

๐Ÿธ๐Ÿ’ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32412Issues:0Issues:0

leptonai

A Pythonic framework to simplify AI service building

Language:PythonLicense:Apache-2.0Stargazers:2603Issues:0Issues:0

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:6400Issues:0Issues:0

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language:PythonLicense:Apache-2.0Stargazers:1656Issues:0Issues:0

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonLicense:Apache-2.0Stargazers:2145Issues:0Issues:0

ggml

Tensor library for machine learning

Language:C++License:MITStargazers:10479Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54814Issues:0Issues:0