RunningLeon's repositories
mmdetection
OpenMMLab Detection Toolbox and Benchmark
Efficient-AI-Backbones
Efficient AI backbones including GhostNet, TNT, and MLP, developed by Huawei Noah's Ark Lab.
InternLM
InternLM has open-sourced a 7-billion-parameter base model, a chat model tailored for practical scenarios, and the training system.
llama.cpp
LLM inference in C/C++
llamafile
Distribute and run LLMs with a single file.
mmclassification
OpenMMLab Image Classification Toolbox and Benchmark
MobileSAM
Official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!
Multimodal-GPT
MyST-Parser
An extended commonmark compliant parser, with bridges to docutils/sphinx
onnx-simplifier
Simplify your ONNX model
opencompass
OpenCompass is an LLM evaluation platform supporting a wide range of models (LLaMA, LLaMA 2, ChatGLM2, ChatGPT, Claude, etc.) on 50+ datasets.
TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.