Zakor Gyula (gyulaz-htec)

gyulaz-htec

Geek Repo

Github PK Tool:Github PK Tool

Zakor Gyula's repositories

composable_kernel

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

License:NOASSERTIONStargazers:0Issues:0Issues:0

AMDMIGraphX

AMD's graph optimization engine.

License:MITStargazers:0Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

License:Apache-2.0Stargazers:0Issues:0Issues:0

client

Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

onnxruntime-inference-examples

Examples for using ONNX Runtime for machine learning inferencing.

License:MITStargazers:0Issues:0Issues:0

third_party

Third-party source packages that are modified for use in Triton.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

core

The core library and APIs implementing the Triton Inference Server.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

onnxruntime_backend

The Triton backend for the ONNX Runtime.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

backend

Common source, scripts and utilities for creating Triton backends.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

models

A collection of pre-trained, state-of-the-art models in the ONNX format

License:Apache-2.0Stargazers:0Issues:0Issues:0