zhangxs (janicevidal)

janicevidal

Geek Repo

Github PK Tool:Github PK Tool

zhangxs's starred repositories

llama.cpp

LLM inference in C/C++

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:22193Issues:200Issues:3300

llama2.c

Inference Llama 2 in one file of pure C

tvm_mlir_learn

compiler learning resources collect.

Awesome-LLM-Inference

đź“–A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonLicense:Apache-2.0Stargazers:1868Issues:6Issues:240

ppl.nn

A primitive library for neural network

Language:C++License:Apache-2.0Stargazers:1239Issues:36Issues:113
Language:PythonLicense:GPL-3.0Stargazers:702Issues:7Issues:68

InferLLM

a lightweight LLM model inference framework

Language:C++License:Apache-2.0Stargazers:656Issues:10Issues:54

Conformer

Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:520Issues:6Issues:38

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language:PythonLicense:MITStargazers:490Issues:8Issues:16

rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Language:C++License:Apache-2.0Stargazers:434Issues:11Issues:70

TinySAM

Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"

Language:PythonLicense:Apache-2.0Stargazers:372Issues:13Issues:24

HorNet

[NeurIPS 2022] HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions

Language:PythonLicense:MITStargazers:312Issues:5Issues:37

SlimSAM

SlimSAM: 0.1% Data Makes Segment Anything Slim

Language:PythonLicense:Apache-2.0Stargazers:248Issues:7Issues:19

KVQuant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

Language:C++License:MITStargazers:230Issues:7Issues:16

ViT-CoMer

Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.

Language:PythonLicense:Apache-2.0Stargazers:169Issues:4Issues:18

CAMixerSR

CAMixerSR: Only Details Need More “Attention” (CVPR 2024)

Language:PythonLicense:Apache-2.0Stargazers:157Issues:4Issues:23

dash-infer

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.

Language:C++License:Apache-2.0Stargazers:102Issues:4Issues:16

PipeFusion

A Suite of Parallel Approaches for Inference of Diffusion Transformer Models on GPU Clusters

Language:PythonLicense:Apache-2.0Stargazers:93Issues:1Issues:16
Language:PythonLicense:Apache-2.0Stargazers:85Issues:2Issues:6

LW-DETR

This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".

Effective-Fusion-Factor

Effective Fusion Factor in FPN for Tiny Object Detection(WACV2021)

Language:PythonLicense:MITStargazers:59Issues:2Issues:3

u-mixformer

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:53Issues:0Issues:0

ITER

PyTorch codes for "Iterative Token Evaluation and Refinement for Real-World Super-Resolution", AAAI 2024

Language:PythonLicense:NOASSERTIONStargazers:46Issues:4Issues:1

ICELUT

Taming Lookup Tables for Efficient Image Retouching

Language:PythonStargazers:13Issues:0Issues:0

Hetu-Galvatron

Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).

DHU-MMCT

Towards Effective Multi-Moving Camera Tracking: A New Dataset and Lightweight Link Model

Language:PythonLicense:NOASSERTIONStargazers:6Issues:1Issues:1