Wei Tao's repositories
Efficient-LLM-Inferencing-on-GPUs
Penn CIS 5650 (GPU Programming and Architecture) Final Project
Language:C++MIT000
mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Language:PythonApache-2.0000
optimum-benchmark
A unified multi-backend utility for benchmarking Transformers, Timm, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
Language:PythonApache-2.0000
oss101
开源软件通识三部曲
000
solecnugit.github.io
Source code for SOLE website
Language:HTMLBSD-3-Clause000
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language:PythonApache-2.0000