Lei Wang (LeiWang1999)

LeiWang1999

Geek Repo

Company:Institute of Computing Technology, UCAS

Location:Peking

Home Page:https://leiblog.wang

Twitter:@Lei_Wang_1999

Github PK Tool:Github PK Tool


Organizations
microsoft

Lei Wang's repositories

FPGA

帮助大家进行FPGA的入门,分享FPGA相关的优秀文章,优秀项目

ZYNQ-NVDLA

NVDLA (An Opensource DL Accelerator Framework) implementation on FPGA.

AICS-Course

《智能计算系统 AI Computing Systems》习题答案、实验答案、课程笔记

tvm_gpu_gemm

play gemm with tvm

AutoGPTQ.tvm

GPTQ inference TVM kernel

VehicleFlowDetection

Implement of vehicle flow statistics based on tensorflow and yolo3 with pyqt5 GUI.

nvdla-parser

A NVDLA Loadable Parser.

Language:CStargazers:11Issues:2Issues:0
Language:MakefileStargazers:5Issues:2Issues:0

leiblog.wang

My New Blog Powered by HEXO http://leiblog.wang

Language:HTMLStargazers:5Issues:2Issues:0

LeiBlog

用Vuetify.js+Vue.js+Node.js(KOA 自己撸一个博客。http://leiblog.wang

Language:CSSStargazers:4Issues:2Issues:0

cv

resume.

Language:TeXStargazers:3Issues:2Issues:0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:3Issues:1Issues:0

compiler-and-arch

A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture

_cutlass

CUDA Templates for Linear Algebra Subroutines

License:NOASSERTIONStargazers:1Issues:0Issues:0

nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

Language:C++License:MITStargazers:1Issues:1Issues:0

antares

Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12, GraphCore, SYCL for CPU/GPU, OpenCL for AMD/NVIDIA, Android CPU/GPU backends.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

ComputeShaderPlayground

Compute Shader Playground with DirectX12

Language:C++Stargazers:0Issues:2Issues:0

gptq_faster

Faster 3bit CUDA Kernel for gptq.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:1Issues:0

nni

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ucas-covid19

ucas疫情防控每日填报助手

Language:PythonStargazers:0Issues:1Issues:0

Welder_artifacts

OSDI 2023 WElder artifacts

Stargazers:0Issues:0Issues:0