Amanda-Barbara's repositories

AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

annotated_deep_learning_paper_implementations

🧑‍🏫 59 implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

chip-spv

CHIP-SPV is a backend infrastructure for HIP running on SPIR-V

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0

CTC-loss-introduction

An introduction to the principles of the CTC algorithm, with a simple NumPy implementation

Language: Python | Stargazers: 0 | Issues: 0

doxygen

Official doxygen git repository

Language: C++ | License: GPL-2.0 | Stargazers: 0 | Issues: 0

FluidX3D

The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL.

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0

gpt4all

gpt4all: an ecosystem of open-source chatbots trained on a massive collection of clean assistant data, including code, stories, and dialogue

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0

intel-llvm

Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.

License: NOASSERTION | Stargazers: 0 | Issues: 0

linux-kernel-lkmpg

The Linux Kernel Module Programming Guide (updated for 5.x kernels)

Language: TeX | License: OSL-3.0 | Stargazers: 0 | Issues: 0

MIOpen

AMD's Machine Intelligence Library

License: MIT | Stargazers: 0 | Issues: 0

MNN

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba

Language: C++ | Stargazers: 0 | Issues: 0

oneDNN

oneAPI Deep Neural Network Library (oneDNN)

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0

OpenCL-Benchmark

A small OpenCL benchmark program to measure peak GPU/CPU performance.

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0

opencl-intercept-layer

Intercept Layer for Debugging and Analyzing OpenCL Applications

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0

OpenCL-Wrapper

OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0

openmlsys-zh

《Machine Learning Systems: Design and Implementation》- Chinese Version

Language: TeX | Stargazers: 0 | Issues: 0

pocl

pocl - Portable Computing Language

Language: C | License: MIT | Stargazers: 0 | Issues: 0

Python

The most conscientious Python tutorial:

Language: Python | Stargazers: 0 | Issues: 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 0

torch-mlir

The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0

Language: Cuda | Stargazers: 0 | Issues: 0

triton

Development repository for the Triton language and compiler

License: MIT | Stargazers: 0 | Issues: 0

tvm

Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

tvm_walk_through

Code reading notes for TVM

Language: Python | Stargazers: 0 | Issues: 0

ultralytics

YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite

Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 0

VeriGPU

Open-source GPU, in Verilog, loosely based on the RISC-V ISA

Language: SystemVerilog | License: MIT | Stargazers: 0 | Issues: 0

xla

A community-driven and modular open source compiler for ML.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0