Zhixing Sun's repositories

An-Erudite-FGVC-Model

Code release for Your “An Erudite Fine-Grained Visual Classification Model (CVPR 2023)

License:MITStargazers:0Issues:0Issues:0

DUET

Code for the paper: DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning [AAAI 2023 Oral]

License:MITStargazers:0Issues:0Issues:0

On-the-fly-Category-Discovery

Code release for Your “On-the-fly Category Discovery (CVPR 2023)”

License:MITStargazers:0Issues:0Issues:0

Painter

Painter & SegGPT Series: Vision Foundation Models from BAAI

License:MITStargazers:0Issues:0Issues:0

SAVC

[CVPR 2023] Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning

License:MITStargazers:0Issues:0Issues:0

ptp

[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》

License:Apache-2.0Stargazers:0Issues:0Issues:0

InternImage

[CVPR 2023] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

cross_modal_adaptation

Cross-modal few-shot adaptation with CLIP

License:MITStargazers:0Issues:0Issues:0

VLE

VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)

License:Apache-2.0Stargazers:0Issues:0Issues:0

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

License:Apache-2.0Stargazers:0Issues:0Issues:0

MKT

Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Partial_Distance_Correlation

This is the official GitHub for paper: On the Versatile Uses of Partial Distance Correlation in Deep Learning, in ECCV 2022

License:MITStargazers:0Issues:0Issues:0

generalized-category-discovery

Code for our CVPR 2022 paper 'Generalized Category Discovery'. Project page: https://www.robots.ox.ac.uk/~vgg/research/gcd/

License:MITStargazers:0Issues:0Issues:0

CP-CNN

Official PyTorch Implementation of CP-CNN (TIP'22)

Stargazers:0Issues:0Issues:0

SuS-X

Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models"

Stargazers:0Issues:0Issues:0

ISL

[ECCV 2022] Balancing between Forgetting and Acquisition in Incremental Subpopulation Learning

License:MITStargazers:0Issues:0Issues:0

Hawkeye

Open source deep learning based fine-grained image recognition toolbox built on PyTorch🔥

License:MITStargazers:0Issues:0Issues:0

some_useful_python_program

some useful python program

Language:PythonStargazers:0Issues:0Issues:0

TPT

Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022))

License:MITStargazers:0Issues:0Issues:0

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

License:Apache-2.0Stargazers:0Issues:0Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

License:BSD-3-ClauseStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

FKD

Official code for our ECCV'22 paper "A Fast Knowledge Distillation Framework for Visual Recognition"

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

VNext

Next-generation Video instance recognition framework on top of Detectron2 which supports SeqFormer(ECCV Oral) and IDOL(ECCV Oral))

License:Apache-2.0Stargazers:0Issues:0Issues:0

ema-pytorch

A simple way to keep track of an Exponential Moving Average (EMA) version of your pytorch model

License:MITStargazers:0Issues:0Issues:0

vpt

❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

License:NOASSERTIONStargazers:0Issues:0Issues:0