Lei Wang (LeiWang1999)

LeiWang1999

Geek Repo

Company:Institute of Computing Technology, UCAS

Location:Peking

Home Page:https://leiblog.wang

Twitter:@Lei_Wang_1999

Github PK Tool:Github PK Tool


Organizations
microsoft

Lei Wang's repositories

ZYNQ-NVDLA

NVDLA (An Opensource DL Accelerator Framework) implementation on FPGA.

tvm_gpu_gemm

play gemm with tvm

AutoGPTQ.tvm

GPTQ inference TVM kernel

VehicleFlowDetection

Implement of vehicle flow statistics based on tensorflow and yolo3 with pyqt5 GUI.

leiblog.wang

My New Blog Powered by HEXO http://leiblog.wang

Language:HTMLStargazers:5Issues:2Issues:0
Language:PythonLicense:MITStargazers:4Issues:0Issues:0
Language:PythonStargazers:4Issues:0Issues:0

cv

resume.

Language:TeXStargazers:3Issues:2Issues:0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:3Issues:1Issues:0
Language:C++License:NOASSERTIONStargazers:2Issues:2Issues:0

Ladder

@DataStructures_Cbased I'm Coming!

Language:PythonLicense:MITStargazers:2Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

Roller

Build and Train AlexNet with PyTorch and Predict with TVM and Pytorch, compare the performance between them

Language:PythonStargazers:2Issues:2Issues:0

_cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:1Issues:1Issues:0

MSBitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

Language:C++License:MITStargazers:1Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

gptq_faster

Faster 3bit CUDA Kernel for gptq.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:1Issues:0

nni

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

vllm-bitblas

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Welder_artifacts

OSDI 2023 WElder artifacts

Language:PythonStargazers:0Issues:1Issues:0