Zhixing Sun's repositories
Vitron
A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, and Editing
Monkey
[CVPR 2024 Highlight] Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
IELT
Source code of the paper "Fine-Grained Visual Classification via Internal Ensemble Learning Transformer"
ovsam
[arXiv preprint] The official code of paper "Open-Vocabulary SAM".
Agent-Attention
Official repository of Agent Attention
FGVP
Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023
2024-AAAI-HPT
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)
LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2
FLatten-Transformer
Official repository of FLatten Transformer (ICCV2023)
RevisitingCIL
The code repository for "Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need" in PyTorch.
SHIP
Official code for ICCV 2023 paper, "Improving Zero-Shot Generalization for CLIP with Synthesized Prompts"
multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
recognize-anything
Code for the Recognize Anything Model (RAM) and Tag2Text Model
AttriCLIP
[CVPR 2023] AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning
code-samples
Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.
sunhongbo.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
BiDistFSCIL
Official implementation of CVPR 2023 paper Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation.
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
RPF
Implementation of our SIGIR'23 full paper "From Region to Patch: Attribute-Aware Foreground-Background Contrastive Learning for Fine-Grained Fashion Retrieval".
opencon
Code for TMLR 2023 paper "OpenCon: Open-world Contrastive Learning"
Gard
Code for Graph-based High-Order Relation Discovery for Fine-grained Recognition in CVPR 2021
APE
[ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"
CLIP_Surgery
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks