Shizhi Tang's repositories

FreeTensor

A language and compiler for irregular tensor programs.

Language:C++License:Apache-2.0Stargazers:132Issues:7Issues:35

YAUJ

Yet Another Universal Judge

vfk_uoj_sandbox

vfk's sandbox for uoj

Language:CLicense:MITStargazers:6Issues:5Issues:1

FreeTensor_experiments

Experiments on FreeTensor

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4Issues:2Issues:1

ADBench

Benchmarking various AD tools.

Language:C++License:MITStargazers:1Issues:1Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

License:Apache-2.0Stargazers:1Issues:0Issues:0

async-syscall-app

Userspace for roastduck/linux:async. Working in progress.

Language:C++Stargazers:0Issues:2Issues:0

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

License:MITStargazers:0Issues:0Issues:0

benchmark

A microbenchmark support library

License:Apache-2.0Stargazers:0Issues:0Issues:0

capsule

Capsule network implemented with TVM

Language:PythonStargazers:0Issues:0Issues:0

carbox2d

Simulator car evolution like http://boxcar2d.com

Language:C++Stargazers:0Issues:2Issues:0

checkout

Action for checking out a repo

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

EETQ

Easy and Efficient Quantization for Transformers

License:Apache-2.0Stargazers:0Issues:0Issues:0

Enzyme

High-performance automatic differentiation of LLVM and MLIR.

License:NOASSERTIONStargazers:0Issues:0Issues:0

fastmoe

A fast MoE impl for PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

googletest

GoogleTest - Google Testing and Mocking Framework

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

incubator-tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

linux

Linux kernel source tree

Language:CLicense:NOASSERTIONStargazers:0Issues:2Issues:0

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

License:MITStargazers:0Issues:0Issues:0

longformer

Longformer: The Long-Document Transformer

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

onnx

Open standard for machine learning interoperability

License:Apache-2.0Stargazers:0Issues:0Issues:0

pybind11

Seamless operability between C++11 and Python

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

pytorch-benchmark

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

rfs

A simple FS as a practice of Rust

Language:RustStargazers:0Issues:2Issues:0

spdlog

Fast C++ logging library.

License:NOASSERTIONStargazers:0Issues:0Issues:0

taco

The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

taskflow

A General-purpose Task-parallel Programming System using Modern C++

License:NOASSERTIONStargazers:0Issues:0Issues:0

transformers

🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0