hongbo-sun

Zhixing Sun's repositories

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Language:PythonApache-2.0000

An-Erudite-FGVC-Model

Code release for Your “An Erudite Fine-Grained Visual Classification Model (CVPR 2023)

Language:PythonMIT000

CP-CNN

Official PyTorch Implementation of CP-CNN (TIP'22)

000

cross_modal_adaptation

Cross-modal few-shot adaptation with CLIP

MIT000

DCCL

000

DUET

Code for the paper: DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning [AAAI 2023 Oral]

MIT000

ema-pytorch

A simple way to keep track of an Exponential Moving Average (EMA) version of your pytorch model

MIT000

FKD

Official code for our ECCV'22 paper "A Fast Knowledge Distillation Framework for Visual Recognition"

MIT000

generalized-category-discovery

Code for our CVPR 2022 paper 'Generalized Category Discovery'. Project page: https://www.robots.ox.ac.uk/~vgg/research/gcd/

MIT000

Hawkeye

Open source deep learning based fine-grained image recognition toolbox built on PyTorch🔥

MIT000

InternImage

[CVPR 2023] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Language:PythonMIT000

ISL

[ECCV 2022] Balancing between Forgetting and Acquisition in Incremental Subpopulation Learning

MIT000

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

BSD-3-Clause000

LOUPE

000

MKT

Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".

MIT000

On-the-fly-Category-Discovery

Code release for Your “On-the-fly Category Discovery (CVPR 2023)”

MIT000

Painter

Painter & SegGPT Series: Vision Foundation Models from BAAI

MIT000

Partial_Distance_Correlation

This is the official GitHub for paper: On the Versatile Uses of Partial Distance Correlation in Deep Learning, in ECCV 2022

Language:PythonMIT000

Pix2NeRF

000

ptp

[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》

Apache-2.0000

ReAttentionTransformer

TRT for WSOL

Language:Jupyter Notebook010

SAVC

[CVPR 2023] Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning

MIT000

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Apache-2.0000

SIM-Trans_ACMMM2022

000

some_useful_python_program

some useful python program

Language:Python000

SuS-X

Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models"

000

TPT

Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022))

MIT000

VLE

VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)

Apache-2.0000

VNext

Next-generation Video instance recognition framework on top of Detectron2 which supports SeqFormer(ECCV Oral) and IDOL(ECCV Oral))

Apache-2.0000

vpt

❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

NOASSERTION000