There are 10 repositories under the tvm topic.
Bringing Stable Diffusion models to web browsers. Everything runs inside the browser with no server support.
AutoKernel is a simple, easy-to-use, low-barrier automatic operator optimization tool that improves the deployment efficiency of deep learning algorithms.
yolort is a runtime stack for YOLOv5 on specialized accelerators such as TensorRT, LibTorch, ONNX Runtime, TVM, and NCNN.
🗣️ Chat with LLMs like Vicuna entirely in your browser with WebGPU, safely, privately, and with no server. Powered by WebLLM.
Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.
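As an illustration of low-precision quantization in PyTorch, the following minimal sketch uses the stock torch.ao.quantization FX graph-mode API rather than the listed library's own interface (which is not shown here); the model and calibration data are hypothetical placeholders.

```python
# Minimal sketch: post-training int8 quantization with stock PyTorch
# (torch.ao.quantization, FX graph mode). Not the listed library's API;
# model and calibration tensors are hypothetical placeholders.
import torch
import torch.nn as nn
from torch.ao.quantization import get_default_qconfig_mapping
from torch.ao.quantization.quantize_fx import prepare_fx, convert_fx

model = nn.Sequential(
    nn.Conv2d(3, 8, 3), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(8, 10),
).eval()

example = torch.randn(1, 3, 32, 32)
qconfig_mapping = get_default_qconfig_mapping("fbgemm")  # int8 config for x86

prepared = prepare_fx(model, qconfig_mapping, example)   # insert observers
with torch.no_grad():                                    # calibration pass
    for _ in range(8):
        prepared(torch.randn(1, 3, 32, 32))

int8_model = convert_fx(prepared)                        # materialize int8 ops
print(int8_model(example).shape)
```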
TON Foundation invites talent to imagine and realize projects that have the potential to integrate with the daily lives of users.
🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.
Optimizing Mobile Deep Learning on ARM GPU with TVM
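For context, compiling a model for a Mali-class ARM GPU with TVM's Relay pipeline typically looks like the minimal sketch below; the ONNX model path and the host target triple are placeholders, not taken from the repository above.

```python
# Minimal sketch: compiling an ONNX model for an ARM Mali GPU with TVM Relay.
# "model.onnx" is a hypothetical model file; the host triple assumes an
# AArch64 Linux board driving the Mali device.
import onnx
import tvm
from tvm import relay

onnx_model = onnx.load("model.onnx")
mod, params = relay.frontend.from_onnx(onnx_model)

# OpenCL target for the Mali GPU, with an ARM CPU host for the rest of the graph.
target = tvm.target.Target(
    "opencl -device=mali",
    host="llvm -mtriple=aarch64-linux-gnu",
)

with tvm.transform.PassContext(opt_level=3):
    lib = relay.build(mod, target=target, params=params)

# Export a deployable shared library (cross-compilation toolchain setup omitted).
lib.export_library("model_mali.so")
```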
Solidity compiler for TVM
TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together
A hands-on tutorial for learning the core principles of TVM.
A curated list of awesome inference deployment framework of artificial intelligence (AI) models. OpenVINO, TensorRT, MediaPipe, TensorFlow Lite, TensorFlow Serving, ONNX Runtime, LibTorch, NCNN, TNN, MNN, TVM, MACE, Paddle Lite, MegEngine Lite, OpenPPL, Bolt, ExecuTorch.
Real-time face detector for large input sizes, written in C++. It also supports face verification using MobileFaceNet+ArcFace with real-time inference. Over 30 FPS at 480P on CPU.
Machine Learning Compiler Road Map
⛏ Boilerplate for mining your very first NFT and becoming a TVM Developer.
Streamline Ethereum, Solana, Aptos, Sui and Tron operations. Effortlessly create transactions, interact with smart contracts, sign, and send transactions for a seamless blockchain experience.
This project contains a code generator that produces static C NN inference deployment code targeting tiny microcontrollers (TinyML), as a replacement for other µTVM runtimes. The tool generates a runtime that statically executes the compiled model, reducing code size and execution-time overhead compared to a dynamic on-device runtime.
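For comparison, stock TVM can also emit static C sources through its ahead-of-time executor and minimal C runtime; the sketch below shows that standard flow (not this project's generator), with a hypothetical ONNX model path.

```python
# Minimal sketch of stock TVM's AOT flow emitting static C sources with the
# minimal C runtime (CRT). "model.onnx" is a hypothetical placeholder; this is
# not the code generator provided by the project above.
import onnx
import tvm
from tvm import relay, micro
from tvm.relay.backend import Executor, Runtime

mod, params = relay.frontend.from_onnx(onnx.load("model.onnx"))

with tvm.transform.PassContext(opt_level=3):
    lib = relay.build(
        mod,
        target="c",                 # emit plain C sources instead of machine code
        executor=Executor("aot"),   # ahead-of-time executor, no graph interpreter
        runtime=Runtime("crt"),     # minimal C runtime suitable for bare metal
        params=params,
    )

# Package the generated C sources plus metadata as a Model Library Format archive.
micro.export_model_library_format(lib, "model_mlf.tar")
```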