jiweibo

Wilber's repositories

onnx_bench

onnx benchmark and tools

Language:C++1 10

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.

Language:C++000

onnx-mlir

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

Language:C++Apache-2.0000

Paddle

PArallel Distributed Deep LEarning （『飞桨』核心框架，高性能单机、分布式训练和跨平台部署）

Language:PythonApache-2.0010

Paddle-Lite

Multi-platform high performance deep learning inference engine (『飞桨』多平台高性能深度学习预测引擎）

Language:C++Apache-2.0000

triton

Development repository for the Triton language and compiler

Language:C++MIT000

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonApache-2.0000

CINN

Compiler Infrastructure for Neural Networks

Apache-2.0000

CS-Notes

:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计

Language:Java000

env

Software Development Environment

000

FluidDoc

Documentations for PaddlePaddle

Language:ShellApache-2.0010

Halide

a language for fast, portable data-parallel computation

NOASSERTION000

mlir-mma

Optimize gpu mma based on mlir.

MIT000

MMA

Matrix Multiplication Addition

Language:CudaMIT000

mobile-aloha

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

MIT000

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Language:C++MIT000

Paddle-Inference-Demo

Language:C++Apache-2.0010

paddlepaddle_backend

Language:C++BSD-3-Clause010

PaddleTest

PaddlePaddle TestSuite

000

ProjectTest

some project demo

Language:C++BSD-3-Clause020

Scripts

Commonly used scripts or simple and useful programs

Language:PythonBSD-3-Clause010

StarInf

A useless trash.

Apache-2.0000

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.0000

jiweibo

Wilber's repositories

jiweibo.github.io

onnx_bench

iree

llvm-project

onnx-mlir

Paddle

Paddle-Lite

triton

tvm

CINN

CS-Notes

env

FluidDoc

Halide

mlir-mma

MMA

mobile-aloha

onnxruntime

Paddle-Inference-Demo

paddlepaddle_backend

PaddleTest

ProjectTest

Scripts

StarInf

TensorRT-LLM

test-paddle

tf-serving

transformers

triton_server

WeChatMsg