zwshan

zwshan's repositories

SYsU-lang-doc

Documentation for the Spring 2024 Compilers lab course at Sun Yat-sen University

200

libtorch_with_cuda_kernel

LibTorch with a custom CUDA kernel

Language: CMake · 100

2023-Project-117

Project for the course "My First Scientific Paper" (M1P), task 117: Search for dependencies in biomechanical systems

Language: Jupyter Notebook · License: MIT · 000

basecalling_architectures

Language: Python · License: Unlicense · 000

bitsandbytes

8-bit CUDA functions for PyTorch

Language: Python · License: MIT · 000

bonito

A PyTorch Basecaller for Oxford Nanopore Reads

Language: Python · License: NOASSERTION · 000

brocolli

Torch FX PyTorch Model Converter

Language: Python · License: MIT · 000

buddy-benchmark

Benchmark Framework for Buddy Projects

Language: MLIR · License: Apache-2.0 · 000

ChatPaper

Use ChatGPT to summarize the arXiv papers.

Language: Python · License: NOASSERTION · 000

ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

License: Apache-2.0 · 000

composable_kernel

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

License: NOASSERTION · 000

cudnn-frontend

cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples showing how to use it

Language: C++ · License: MIT · 000

cutlass-learning

Code written while learning CUTLASS

000

cutlass_quant

Playing with quantization

License: Apache-2.0 · 000

dm-ticket

Automated ticket purchasing for Damai (大麦网), with one-click Docker deployment. https://t.me/+2EELgNTYiMYxMTFl

License: MIT · 000

gcc

License: GPL-2.0 · 000

golsm

Language: Go · 000

HAWQ

Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.

License: MIT · 000

HIPIFY

HIPIFY: Convert CUDA to Portable C++ Code

License: MIT · 000

MSRCall

000

nanopore_benchmark

License: Unlicense · 000

ont_fast5_api

Oxford Nanopore Technologies fast5 API software

License: NOASSERTION · 000

parallel-decoding

Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding"

Language: Python · License: MIT · 000

ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

License: Apache-2.0 · 000

SYsU-lang2

Sun Yat-sen University Compilers course labs (fully refactored version)

License: GPL-3.0 · 000

TensorRT

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

License: BSD-3-Clause · 000

tickets

A ticket-grabbing application for Damai (大麦), built with Tauri + Rust + Vue.

License: MIT · 000

tvm

Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators

License: Apache-2.0 · 000

tvm_gpu_gemm

Playing with GEMM in TVM

000

zwshan.github.io

Stores my resume

Language: HTML · License: Apache-2.0 · 000