Amanda-Barbara's repositories
AI_compiler_development_guide
Free resource for the book AI Compiler Development Guide
AITemplate
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
chip-spv
CHIP-SPV is a backend infrastructure for HIP running on SPIR-V
CLBlast
Tuned OpenCL BLAS
DeepLearningSystem
An introduction to the core principles of deep learning systems.
dlpack
common in-memory tensor structure
doxygen
Official doxygen git repository
intel-llvm
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
iree
Intermediate Representation Execution Environment
lbd
LLVM backend document
linux-kernel-lkmpg
The Linux Kernel Module Programming Guide (updated for 5.x kernels)
linux-kernel-runninglinuxkernel_5.0
Lab platform for the book 奔跑吧Linux内核, 2nd edition (Volume 1, Volume 2, and the introductory volume)
maplab
A Modular and Multi-Modal Mapping Framework
MIOpen
AMD's Machine Intelligence Library
MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
oneDNN
oneAPI Deep Neural Network Library (oneDNN)
open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
Paddle3D
A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D object detection models.
PTXprofiler
A simple profiler that counts NVIDIA PTX assembly instructions in OpenCL/SYCL/CUDA kernels for roofline model analysis.
torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
tpu-mlir
Machine learning compiler based on MLIR for Sophgo TPU.
triton
Development repository for the Triton language and compiler
tvm_walk_through
Code reading for TVM
VeriGPU
OpenSource GPU, in Verilog, loosely based on RISC-V ISA
workflow
Parallel Computing and Asynchronous Networking Engine ⭐️⭐️⭐️
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on a single machine, Hadoop, Spark, Dask, Flink and DataFlow
xla
A community-driven and modular open source compiler for ML.