Yang's repositories
How-to-Optim-Algo-with-Triton
Optimize the important algorithm with Triton
AutoBench4TensorComputation
automatic benchmark of tensor program generation and optimization
gemm-benchmark
Python based gemm benchmark for tensor computation on NV & AMD GPUs
efficient-lora
research work about parameter efficient fine-tuning
triton-mlir
Development repository for the Triton language and compiler
programmable-accelerator-design
Research papers related to accelerator design and tensor program optimization with compiler.
RISC-V-Custom-Extension
RISC-V Extension with MLIR Dialect
awesome-machine-learning-in-compilers
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
Awesome-Pruning
A curated list of neural network pruning resources.
buddy-mlir-1
An MLIR-Based Ideas Landing Project
cutlass_performance_profiling
Exploration of GEMM Performance Improvement with CUTLASS
hidet-artifacts
This repository is the artifact of paper "Hidet: Task Mapping Programming Paradigm for Deep Learning Tensor Programs".
lanyon
A content-first, sliding sidebar theme for Jekyll.
llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
nlp-llm-compiler-paper
This is a repo for the paper related to NLP & Compiler & LLM.
one-yolov5
A more efficient yolov5 with oneflow backend πππ
onnx-simplifier
Simplify your onnx model
Tech_Blog
This is a personal technical blog to descripe how to become a full-stack hacker with PyTorch, MLIR, RISC-V and Spatial Accelerators.
ToMe
A method to increase the speed and lower the memory footprint of existing vision transformers.
triton-dev
Development repository for the Triton language and compiler
triton-shared
Shared Middle-Layer for Triton Compilation
yolov5
YOLOv5 π in PyTorch > ONNX > CoreML > TFLite