Xu Zhang's repositories
llama.cpp
LLM inference in C/C++
llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
note
Personal learning notes
python_backend
Triton backend that enables pre-processing, post-processing, and other logic to be implemented in Python.
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
triton
Development repository for the Triton language and compiler
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs