Zhixing Sun's repositories
AliceMind
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
An-Erudite-FGVC-Model
Code release for Your “An Erudite Fine-Grained Visual Classification Model (CVPR 2023)
CP-CNN
Official PyTorch Implementation of CP-CNN (TIP'22)
cross_modal_adaptation
Cross-modal few-shot adaptation with CLIP
DUET
Code for the paper: DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning [AAAI 2023 Oral]
ema-pytorch
A simple way to keep track of an Exponential Moving Average (EMA) version of your pytorch model
FKD
Official code for our ECCV'22 paper "A Fast Knowledge Distillation Framework for Visual Recognition"
generalized-category-discovery
Code for our CVPR 2022 paper 'Generalized Category Discovery'. Project page: https://www.robots.ox.ac.uk/~vgg/research/gcd/
Hawkeye
Open source deep learning based fine-grained image recognition toolbox built on PyTorch🔥
InternImage
[CVPR 2023] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
ISL
[ECCV 2022] Balancing between Forgetting and Acquisition in Incremental Subpopulation Learning
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
MKT
Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".
On-the-fly-Category-Discovery
Code release for Your “On-the-fly Category Discovery (CVPR 2023)”
Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
Partial_Distance_Correlation
This is the official GitHub for paper: On the Versatile Uses of Partial Distance Correlation in Deep Learning, in ECCV 2022
ptp
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
ReAttentionTransformer
TRT for WSOL
SAVC
[CVPR 2023] Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning
scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
some_useful_python_program
some useful python program
SuS-X
Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models"
TPT
Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022))
VLE
VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)
VNext
Next-generation Video instance recognition framework on top of Detectron2 which supports SeqFormer(ECCV Oral) and IDOL(ECCV Oral))
vpt
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119