Zhixing Sun's repositories
An-Erudite-FGVC-Model
Code release for “An Erudite Fine-Grained Visual Classification Model” (CVPR 2023)
DUET
Code for the paper: DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning [AAAI 2023 Oral]
On-the-fly-Category-Discovery
Code release for “On-the-fly Category Discovery” (CVPR 2023)
Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
SAVC
[CVPR 2023] Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning
ptp
[CVPR 2023] Code for "Position-guided Text Prompt for Vision-Language Pre-training"
InternImage
[CVPR 2023] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
cross_modal_adaptation
Cross-modal few-shot adaptation with CLIP
VLE
VLE: Vision-Language Encoder (a vision-language multimodal pre-trained model)
scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
MKT
Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".
Partial_Distance_Correlation
Official GitHub repository for the paper "On the Versatile Uses of Partial Distance Correlation in Deep Learning" (ECCV 2022)
generalized-category-discovery
Code for our CVPR 2022 paper 'Generalized Category Discovery'. Project page: https://www.robots.ox.ac.uk/~vgg/research/gcd/
CP-CNN
Official PyTorch Implementation of CP-CNN (TIP'22)
SuS-X
Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models"
ISL
[ECCV 2022] Balancing between Forgetting and Acquisition in Incremental Subpopulation Learning
Hawkeye
Open source deep learning based fine-grained image recognition toolbox built on PyTorch🔥
some_useful_python_program
Some useful Python programs
TPT
Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022)
AliceMind
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
FKD
Official code for our ECCV'22 paper "A Fast Knowledge Distillation Framework for Visual Recognition"
ReAttentionTransformer
TRT for weakly supervised object localization (WSOL)
VNext
Next-generation video instance recognition framework on top of Detectron2, supporting SeqFormer (ECCV Oral) and IDOL (ECCV Oral)
ema-pytorch
A simple way to keep track of an Exponential Moving Average (EMA) version of your PyTorch model
vpt
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119