Beast code in Giters

zhangxs's starred repositories

alpha-free-matting

800

onnxsim_large_model

simplify >2GB large onnx model

Language:PythonMIT4100

OutfitAnyone

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

547500

SimD

Language:PythonApache-2.01400

LeYOLO

Language:PythonAGPL-3.015200

LW-DETR

This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".

Language:PythonApache-2.018000

dash-infer

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.

Language:C++Apache-2.012700

rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Language:C++Apache-2.049300

xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters

Language:PythonApache-2.040000

KVQuant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language:Python27400

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonApache-2.0194300

inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

Language:C++MIT23200

ppl.nn

A primitive library for neural network

Language:C++Apache-2.0126700

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

GPL-3.0232300

LookaheadDecoding

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Language:PythonApache-2.0107700

InferLLM

a lightweight LLM model inference framework

Language:C++Apache-2.067100

Hetu-Galvatron

Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).

Language:Python1100

Samba

Language:PythonApache-2.010200

ICELUT

[ECCV 2024] Taming Lookup Tables for Efficient Image Retouching

Language:Python2300

tvm_mlir_learn

compiler learning resources collect.

Language:Python202100

ViT-CoMer

Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.

Language:PythonApache-2.018500