linzhiqiu's starred repositories

sentence-transformers

State-of-the-Art Text Embeddings

Language:PythonLicense:Apache-2.0Stargazers:14894Issues:139Issues:2134

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14811Issues:114Issues:385

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:9857Issues:77Issues:470

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8842Issues:95Issues:392

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:6344Issues:41Issues:296

threestudio

A unified framework for 3D content generation.

Language:PythonLicense:Apache-2.0Stargazers:6150Issues:80Issues:329

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5891Issues:65Issues:421

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language:PythonLicense:Apache-2.0Stargazers:5209Issues:32Issues:52

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1610Issues:21Issues:86

ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Language:PythonLicense:Apache-2.0Stargazers:1110Issues:14Issues:85

MM-REACT

Official repo for MM-REACT

Language:PythonLicense:MITStargazers:929Issues:19Issues:10

MIC

MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU

VIRL

(ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life

GPTEval3D

[ CVPR 2024 ] Implementation for "GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation"

tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Language:PythonLicense:Apache-2.0Stargazers:133Issues:3Issues:9

X2-VLM

All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)

Language:PythonLicense:BSD-3-ClauseStargazers:132Issues:5Issues:20

LLMScore

LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation

Cola

[NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:99Issues:3Issues:2

mt-metrics-eval

Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.

Language:PythonLicense:Apache-2.0Stargazers:85Issues:5Issues:13

DSG

Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)

Language:Jupyter NotebookStargazers:74Issues:3Issues:7

sugar-crepe

[NeurIPS 2023] A faithful benchmark for vision-language compositionality

Language:PythonLicense:MITStargazers:66Issues:10Issues:8

VPEval

VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)

Language:PythonLicense:MITStargazers:42Issues:2Issues:4

GLA

[NeurIPS 2023] Generalized Logit Adjustment

COLA

COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!

Language:PythonLicense:MITStargazers:21Issues:3Issues:2

synth-set-annotation-ui

streamlit annotation ui for annotation of synthetic image

Language:PythonStargazers:1Issues:1Issues:0