Yanwei (yanwei-li)

Company: The Chinese University of Hong Kong

Location: Hong Kong, China

Home Page: yanwei-li.com


Organizations
dvlab-research

Yanwei's starred repositories

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49015 | Issues: 557 | Issues: 197

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 23309 | Issues: 195 | Issues: 3648

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 21929 | Issues: 175 | Issues: 170

mistral-inference

Official inference library for Mistral models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9008 | Issues: 117 | Issues: 122

ImageBind

ImageBind: One Embedding Space to Bind Them All

Language: Python | License: NOASSERTION | Stargazers: 7992 | Issues: 100 | Issues: 83

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | Stargazers: 6053 | Issues: 46 | Issues: 169

CogVLM

A state-of-the-art-level open visual language model | Multimodal pre-trained model

Language: Python | License: Apache-2.0 | Stargazers: 5463 | Issues: 65 | Issues: 390

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 3047 | Issues: 25 | Issues: 121

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python | License: BSD-3-Clause | Stargazers: 2986 | Issues: 60 | Issues: 87

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language: Python | License: Apache-2.0 | Stargazers: 2511 | Issues: 13 | Issues: 168

Vary

Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language: Python | License: Apache-2.0 | Stargazers: 1541 | Issues: 10 | Issues: 126

Emu

Emu Series: Generative Multimodal Models from BAAI

Language: Python | License: Apache-2.0 | Stargazers: 1530 | Issues: 21 | Issues: 84

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language: Python | License: NOASSERTION | Stargazers: 952 | Issues: 3 | Issues: 69

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language: Python | License: Apache-2.0 | Stargazers: 858 | Issues: 11 | Issues: 26

3D-LLM

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Language: Python | License: MIT | Stargazers: 820 | Issues: 16 | Issues: 55

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 715 | Issues: 16 | Issues: 22

DriveLM

DriveLM: Driving with Graph Visual Question Answering

Language: HTML | License: Apache-2.0 | Stargazers: 671 | Issues: 20 | Issues: 64

LLaMA-VID

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Language: Python | License: Apache-2.0 | Stargazers: 609 | Issues: 11 | Issues: 94

Woodpecker

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.

APE

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Language: Python | License: Apache-2.0 | Stargazers: 446 | Issues: 6 | Issues: 49

LLMGA

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant'

Language: Python | License: Apache-2.0 | Stargazers: 261 | Issues: 4 | Issues: 5

LLaVA-RLHF

Aligning LMMs with Factually Augmented RLHF

Language: Python | License: GPL-3.0 | Stargazers: 254 | Issues: 8 | Issues: 30

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language: Python | License: Apache-2.0 | Stargazers: 196 | Issues: 11 | Issues: 11

RIVAL

[NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain

Language: Python | License: Apache-2.0 | Stargazers: 138 | Issues: 17 | Issues: 8

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 122 | Issues: 3 | Issues: 14

Prompt-Highlighter

[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

Language: Python | License: MIT | Stargazers: 104 | Issues: 2 | Issues: 2

X-Ray

The official source code for "X-Ray: A Sequential 3D Representation for Generation".

Language: Python | Stargazers: 81 | Issues: 0 | Issues: 0