Beast code in Giters

Zakor Gyula's repositories

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

NOASSERTION000

AMD's graph optimization engine.

MIT000

Fast and memory-efficient exact attention

BSD-3-Clause000

A high-throughput and memory-efficient inference and serving engine for LLMs

Apache-2.0000

Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.

BSD-3-Clause000

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

BSD-3-Clause000

Examples for using ONNX Runtime for machine learning inferencing.

MIT000

Third-party source packages that are modified for use in Triton.

BSD-3-Clause000

The core library and APIs implementing the Triton Inference Server.

BSD-3-Clause000

The Triton backend for the ONNX Runtime.

BSD-3-Clause000

Common source, scripts and utilities for creating Triton backends.

BSD-3-Clause000

A collection of pre-trained, state-of-the-art models in the ONNX format

Apache-2.0000