zhangxs (janicevidal)

janicevidal

Geek Repo

Github PK Tool:Github PK Tool

zhangxs's starred repositories

onnxsim_large_model

simplify >2GB large onnx model

Language:PythonLicense:MITStargazers:41Issues:0Issues:0

OutfitAnyone

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

Stargazers:5475Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:14Issues:0Issues:0
Language:PythonLicense:AGPL-3.0Stargazers:152Issues:0Issues:0

LW-DETR

This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".

Language:PythonLicense:Apache-2.0Stargazers:180Issues:0Issues:0

dash-infer

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.

Language:C++License:Apache-2.0Stargazers:127Issues:0Issues:0

rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Language:C++License:Apache-2.0Stargazers:493Issues:0Issues:0

xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters

Language:PythonLicense:Apache-2.0Stargazers:400Issues:0Issues:0

KVQuant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language:PythonStargazers:274Issues:0Issues:0

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonLicense:Apache-2.0Stargazers:1943Issues:0Issues:0

inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

Language:C++License:MITStargazers:232Issues:0Issues:0

ppl.nn

A primitive library for neural network

Language:C++License:Apache-2.0Stargazers:1267Issues:0Issues:0

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

License:GPL-3.0Stargazers:2323Issues:0Issues:0

LookaheadDecoding

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Language:PythonLicense:Apache-2.0Stargazers:1077Issues:0Issues:0

InferLLM

a lightweight LLM model inference framework

Language:C++License:Apache-2.0Stargazers:671Issues:0Issues:0

Hetu-Galvatron

Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).

Language:PythonStargazers:11Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:102Issues:0Issues:0

ICELUT

[ECCV 2024] Taming Lookup Tables for Efficient Image Retouching

Language:PythonStargazers:23Issues:0Issues:0

tvm_mlir_learn

compiler learning resources collect.

Language:PythonStargazers:2021Issues:0Issues:0

ViT-CoMer

Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.

Language:PythonLicense:Apache-2.0Stargazers:185Issues:0Issues:0

CAMixerSR

CAMixerSR: Only Details Need More “Attention” (CVPR 2024)

Language:PythonLicense:Apache-2.0Stargazers:190Issues:0Issues:0

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language:PythonLicense:MITStargazers:543Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:64209Issues:0Issues:0

Effective-Fusion-Factor

Effective Fusion Factor in FPN for Tiny Object Detection(WACV2021)

Language:PythonLicense:MITStargazers:58Issues:0Issues:0

HorNet

[NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions

Language:PythonLicense:MITStargazers:314Issues:0Issues:0

Conformer

Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:529Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:707Issues:0Issues:0

llama2.c

Inference Llama 2 in one file of pure C

Language:CLicense:MITStargazers:17080Issues:0Issues:0

TinySAM

Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"

Language:PythonLicense:Apache-2.0Stargazers:386Issues:0Issues:0