wang-y-z's repositories
leetcode-algorithm
hand over it
cutlass
CUDA Templates for Linear Algebra Subroutines
ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
sparsegpt
Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
SparseTIR
SparseTIR: Sparse Tensor Compiler for Deep Learning
triton
Development repository for the Triton language and compiler
architect-awesome
后端架构师技术图谱
chatgpt-demo
A demo repo based on OpenAI API (gpt-3.5-turbo)
dgl
Python package built to ease deep learning on graph, on top of existing DL frameworks.
FlashAttention20
Get down and dirty with FlashAttention2.0 in pytorch, plug in and play no complex CUDA kernels
heterogeneity-aware-lowering-and-optimization
heterogeneity-aware-lowering-and-optimization
iree
👻
lianjia-beike-spider
链家网和贝壳网房价爬虫,采集北京上海广州深圳等21个**主要城市的房价数据(小区,二手房,出租房,新房),稳定可靠快速!支持csv,MySQL, MongoDB,Excel, json存储,支持Python2和3,图表展示数据,注释丰富 ,点星支持,仅供学习参考,请勿用于商业用途,后果自负。
onnx-mlir
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
RWKV-CUDA
The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )
sparsetir-artifact
Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
vim
Personal Vim Profile
vim-fast
Vim快速配置
weibo-crawler
新浪微博爬虫,用python爬取新浪微博数据,并下载微博图片和微博视频
xla
Enabling PyTorch on Google TPU
xv6-riscv
Xv6 for RISC-V