Kuan Wu (quinwu)

quinwu

Geek Repo

Location:ShangHai

Github PK Tool:Github PK Tool

Kuan Wu's starred repositories

llama.cpp

LLM inference in C/C++

llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:50895Issues:499Issues:872

faiss

A library for efficient similarity search and clustering of dense vectors.

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:20136Issues:194Issues:2848

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17071Issues:153Issues:1324

detr

End-to-End Object Detection with Transformers

Language:PythonLicense:Apache-2.0Stargazers:12969Issues:149Issues:526

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:11316Issues:104Issues:815

llama-gpt

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

Language:TypeScriptLicense:MITStargazers:10416Issues:79Issues:124

bisheng

Bisheng is an open LLM devops platform for next generation AI applications.

Language:PythonLicense:Apache-2.0Stargazers:7275Issues:641Issues:114

FastSAM

Fast Segment Anything

Language:PythonLicense:AGPL-3.0Stargazers:6963Issues:56Issues:182

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Language:PythonLicense:NOASSERTIONStargazers:6854Issues:57Issues:183

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language:PythonLicense:Apache-2.0Stargazers:5641Issues:67Issues:127

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:5236Issues:35Issues:273

Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Language:PythonLicense:Apache-2.0Stargazers:4097Issues:55Issues:133

mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:3221Issues:30Issues:752

transformer-deploy

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

Language:PythonLicense:Apache-2.0Stargazers:1625Issues:27Issues:121

WebGLM

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

Language:PythonLicense:Apache-2.0Stargazers:1522Issues:25Issues:69

lang-segment-anything

SAM with text prompt

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1250Issues:9Issues:41

segment-anything-fast

A batched offline inference oriented version of segment-anything

Language:PythonLicense:Apache-2.0Stargazers:1127Issues:11Issues:40

SoM

Set-of-Mark Prompting for LMMs

Language:PythonLicense:MITStargazers:976Issues:21Issues:28

GPT4RoI

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Language:PythonLicense:NOASSERTIONStargazers:460Issues:8Issues:42

llm-search

Querying local documents, powered by LLM

Language:Jupyter NotebookLicense:MITStargazers:397Issues:11Issues:42

FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

Language:PythonLicense:Apache-2.0Stargazers:375Issues:5Issues:16

sft_datasets

开源SFT数据集整理,随时补充

TinySAM

Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"

Language:PythonLicense:Apache-2.0Stargazers:365Issues:12Issues:23

chat2plot

chat to visualization with LLM

Language:PythonLicense:MITStargazers:158Issues:5Issues:9

segment_anything_tensorrt

Accelerate segment anything model inference using Tensorrt 8.6.1.6

BYZER-RETRIEVAL

Byzer-retrieval is a distributed retrieval system which designed as a backend for LLM RAG (Retrieval Augmented Generation). The system supports both BM25 retrieval algorithm and vector retrieval algorithm.

Language:JavaStargazers:39Issues:2Issues:0

triton-server

triton server with segment anything(SAM)

Language:PythonLicense:BSD-3-ClauseStargazers:3Issues:0Issues:0