Xiangyu Zhao's repositories

MG-LLaVA

Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).

Language:PythonLicense:Apache-2.0Stargazers:142Issues:2Issues:7

CVPR2024-MLLM-Abstract

Abstracts of papers from CVPR 2024 related to MLLM (Multimodal Large Language Models).

Stargazers:0Issues:0Issues:0

mmdetection

OpenMMLab Detection Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mmsegmentation

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Open-GroundingDino

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0

VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0