zwshan's repositories
SYsU-lang-doc
Documentation for the Spring 2024 Compiler Principles lab course at Sun Yat-sen University
libtorch_with_cuda_kernel
LibTorch with a custom CUDA kernel
2023-Project-117
Project for the course "My First Scientific Paper" (M1P), task 117: Search for dependencies in biomechanical systems
bitsandbytes
8-bit CUDA functions for PyTorch
bonito
A PyTorch Basecaller for Oxford Nanopore Reads
brocolli
Torch FX PyTorch Model Converter
buddy-benchmark
Benchmark Framework for Buddy Projects
ChatPaper
Use ChatGPT to summarize the arXiv papers.
ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
cudnn-frontend
cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples on how to use it
cutlass-learning
Code written while learning CUTLASS
cutlass_quant
Playing with quantization
dm-ticket
Automated ticket purchasing for Damai (大麦网), with one-click Docker deployment. https://t.me/+2EELgNTYiMYxMTFl
HAWQ
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
HIPIFY
HIPIFY: Convert CUDA to Portable C++ Code
ont_fast5_api
Oxford Nanopore Technologies fast5 API software
parallel-decoding
Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding"
ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
SYsU-lang2
Sun Yat-sen University Compiler Principles course labs (fully refactored version)
TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
tickets
A ticket-grabbing app for Damai (大麦), built with Tauri + Rust + Vue.
tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
tvm_gpu_gemm
Experiments with GEMM in TVM
zwshan.github.io
Stores my resume