quinwu

followers

following

stars

ShangHai

Kuan Wu's starred repositories

llama.cpp

LLM inference in C/C++

Language:C++MIT59109 505 3079

llama

Inference code for LLaMA models

Language:PythonNOASSERTION50895 499 872

faiss

A library for efficient similarity search and clustering of dense vectors.

Language:C++MIT28693 475 2366

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.020136 194 2848

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.017071 153 1324

detr

End-to-End Object Detection with Transformers

Language:PythonApache-2.012969 149 526

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause11316 104 815

llama-gpt

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

Language:TypeScriptMIT10416 79 124

bisheng

Bisheng is an open LLM devops platform for next generation AI applications.

Language:PythonApache-2.07275 641 114

FastSAM

Fast Segment Anything

Language:PythonAGPL-3.06963 56 182

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Language:PythonNOASSERTION6854 57 183

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language:PythonApache-2.05641 67 127

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonApache-2.05236 35 273

Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Language:PythonApache-2.04097 55 133

mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Language:PythonApache-2.03221 30 752

transformer-deploy

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

Language:PythonApache-2.01625 27 121

WebGLM

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

Language:PythonApache-2.01522 25 69

lang-segment-anything

SAM with text prompt

Language:Jupyter NotebookApache-2.01250 9 41

segment-anything-fast

A batched offline inference oriented version of segment-anything

Language:PythonApache-2.01127 11 40

SoM

Set-of-Mark Prompting for LMMs

Language:PythonMIT976 21 28

GPT4RoI

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Language:PythonNOASSERTION460 8 42

llm-search

Querying local documents, powered by LLM

Language:Jupyter NotebookMIT397 11 42

FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

Language:PythonApache-2.0375 5 16

sft_datasets

开源SFT数据集整理,随时补充

TinySAM

Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"

Language:PythonApache-2.0365 12 23

chat2plot

chat to visualization with LLM

Language:PythonMIT158 5 9

segment-anything-tensorrt

Language:Jupyter NotebookMIT73 3 16

segment_anything_tensorrt

Accelerate segment anything model inference using Tensorrt 8.6.1.6

Language:Python71 1 6

BYZER-RETRIEVAL

Byzer-retrieval is a distributed retrieval system which designed as a backend for LLM RAG (Retrieval Augmented Generation). The system supports both BM25 retrieval algorithm and vector retrieval algorithm.

Language:Java39 20

triton-server

triton server with segment anything(SAM)

Language:PythonBSD-3-Clause300