Amanda-Barbara's repositories

AI_compiler_development_guide

Free resource for the book AI Compiler Development Guide

Language:LLVMStargazers:0Issues:0Issues:0

AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

chip-spv

CHIP-SPV is a backend infrastructure for HIP running on SPIR-V

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

CLBlast

Tuned OpenCL BLAS

License:Apache-2.0Stargazers:0Issues:0Issues:0

DeepLearningSystem

Deep Learning System core principles introduction.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dlpack

common in-memory tensor structure

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

doxygen

Official doxygen git repository

Language:C++License:GPL-2.0Stargazers:0Issues:0Issues:0

intel-llvm

Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.

License:NOASSERTIONStargazers:0Issues:0Issues:0

iree

Intermediate Representation Execution Environment

License:Apache-2.0Stargazers:0Issues:0Issues:0

lbd

llvm backend document

Language:C++Stargazers:0Issues:0Issues:0

linux-kernel-lkmpg

The Linux Kernel Module Programming Guide (updated for 5.x kernels)

Language:TeXLicense:OSL-3.0Stargazers:0Issues:0Issues:0

linux-kernel-runninglinuxkernel_5.0

奔跑吧linux内核第二版(卷1,卷2,入门篇) 实验平台

License:NOASSERTIONStargazers:0Issues:0Issues:0

maplab

A Modular and Multi-Modal Mapping Framework

License:Apache-2.0Stargazers:0Issues:0Issues:0

MIOpen

AMD's Machine Intelligence Library

License:MITStargazers:0Issues:0Issues:0

MNN

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba

Language:C++Stargazers:0Issues:0Issues:0

oneDNN

oneAPI Deep Neural Network Library (oneDNN)

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Paddle3D

A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D object detection models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PTXprofiler

A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.

License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

torch-mlir

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:CudaStargazers:0Issues:0Issues:0

tpu-mlir

Machine learning compiler based on MLIR for Sophgo TPU.

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

triton

Development repository for the Triton language and compiler

License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tvm_walk_through

code reading for tvm

Language:PythonStargazers:0Issues:0Issues:0

VeriGPU

OpenSource GPU, in Verilog, loosely based on RISC-V ISA

Language:SystemVerilogLicense:MITStargazers:0Issues:0Issues:0

workflow

Parallel Computing and Asynchronous Networking Engine ⭐️⭐️⭐️

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

License:Apache-2.0Stargazers:0Issues:0Issues:0

xla

A community-driven and modular open source compiler for ML.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0