tengdecheng's starred repositories
flash-attention
Fast and memory-efficient exact attention
cs231n.github.io
Public facing notes page
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
riscv-isa-sim
Spike, a RISC-V ISA Simulator
SparseConvNet
Submanifold sparse convolutional networks
TensorComprehensions
A domain specific language to express machine learning workloads.
huggingface_hub
The official Python client for the Huggingface Hub.
torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
Triton-Puzzles
Puzzles for learning Triton
mlir-tutorial
MLIR For Beginners tutorial
llvm-project
PLCT实验室的 RISC-V V Spec 实现,基于llvm/llvm-project,rkruppe/rvv-llvm 和 https://repo.hca.bsc.es/gitlab/rferrer/llvm-epi-0.8
tvm_gpu_gemm
play gemm with tvm
SHARK-Turbine
Unified compiler/runtime for interfacing with PyTorch Dynamo.
conv3x3_m1
This is a demo how to write a high performance convolution run on apple silicon
rvv-benchmark
PLCT实验室 rvv-llvm 实现配套的 benchmark / testcases
countdownlatchcpp
CountDownLatch in C++
fixed_point_math
a templated header-only fixed point math library for C++