Pefect96's starred repositories

Language:PythonLicense:CC-BY-SA-4.0Stargazers:1Issues:0Issues:0

AltaCV

Yet another alternative curriculum vitae/résumé class with LaTeX

Language:TeXLicense:NOASSERTIONStargazers:1266Issues:0Issues:0

moderncv

A modern curriculum vitae class for LaTeX

Language:TeXStargazers:1798Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:429Issues:0Issues:0

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9674Issues:0Issues:0

DCI

Densely Captioned Images (DCI) dataset repository.

Language:PythonLicense:NOASSERTIONStargazers:153Issues:0Issues:0

Awesome-Composed-Image-Retrieval

Collection of Composed Image Retrieval (CIR) papers.

Stargazers:50Issues:0Issues:0

nsfc

nsfc - 国家自然科学基金项目LaTeX模版(面青地)

Language:TeXStargazers:168Issues:0Issues:0

JiuTian-LION

[CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge

Language:Jupyter NotebookLicense:MITStargazers:112Issues:0Issues:0

EVCap

[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension

Language:PythonStargazers:26Issues:0Issues:0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonLicense:MITStargazers:1142Issues:0Issues:0

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonLicense:MITStargazers:6510Issues:0Issues:0

WCA

[ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"

Language:PythonLicense:MITStargazers:31Issues:0Issues:0

PlugIR

Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)

Language:PythonLicense:MITStargazers:15Issues:0Issues:0

VALSE

Data repository for the VALSE benchmark.

Language:PythonLicense:MITStargazers:33Issues:0Issues:0

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1766Issues:0Issues:0

Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

License:Apache-2.0Stargazers:208Issues:0Issues:0

MemVP

[ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning

Language:PythonStargazers:41Issues:0Issues:0

DenseConnector

Dense Connector for MLLMs

Language:PythonLicense:Apache-2.0Stargazers:91Issues:0Issues:0

oven_eval

ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities

Language:PythonLicense:MITStargazers:30Issues:0Issues:0

Open-LLaVA-NeXT

An open-source implementation for training LLaVA-NeXT.

Language:PythonStargazers:227Issues:0Issues:0

VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks

Language:PythonLicense:Apache-2.0Stargazers:874Issues:0Issues:0

EfficientTrain

1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.

Language:PythonLicense:MITStargazers:194Issues:0Issues:0

VDG

Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024

Language:PythonStargazers:9Issues:0Issues:0

diht

Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training

Language:PythonLicense:NOASSERTIONStargazers:127Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:106Issues:0Issues:0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language:PythonLicense:MITStargazers:19694Issues:0Issues:0

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonLicense:Apache-2.0Stargazers:1923Issues:0Issues:0

efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Language:PythonLicense:MITStargazers:3739Issues:0Issues:0