duanduanduanyuchen

BLINK_Benchmark

This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive" [ECCV 2024]. https://arxiv.org/abs/2404.12390

Language: Python | License: Apache-2.0 | 9600

LLMTest_NeedleInAHaystack

Simple retrieval from LLMs at various context lengths to measure accuracy

Language: Jupyter Notebook | License: NOASSERTION | 139500

VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 30+ benchmarks

Language: Python | License: Apache-2.0 | 86300

CoMat

Official code for 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching

Language: Python | 11600

PlainMamba

[BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition

Language: Python | License: Apache-2.0 | 5800

DDPS

Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling"

Language: Python | 6400

Vision-RWKV

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Language: Python | License: Apache-2.0 | 31100

pytorch-image-models

The largest collection of PyTorch image encoders / backbones, including train, eval, inference, and export scripts plus pretrained weights: ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | License: Apache-2.0 | 3115300

InternGPT

InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

Language: Python | License: Apache-2.0 | 317800

Yuchen Duan's starred repositories

LongLoRA

ToMe

Awesome-Multimodal-Large-Language-Models

LlamaGen

MM-NIAH

Amazing-Python-Scripts

BLINK_Benchmark

LLMTest_NeedleInAHaystack

Find-Needle-In-Sea

VLMEvalKit

CoMat

PlainMamba

DDPS

Vision-RWKV

pytorch-image-models

InternGPT

InternImage

TsinghuaBookCrawler

Uni-Perceiver

ViT-Adapter

PVT

YOLOX

duoyun

CVPR20_CLVision_challenge

autojs

FBDQA-2020S

Python-100-Days