Zhixing Sun's repositories
convit
Code for the Convolutional Vision Transformer (ConViT)
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Awesome-Visual-Transformer
Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)
External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
PCL
PyTorch code for "Prototypical Contrastive Learning of Unsupervised Representations"
Deformable-ConvNets
Deformable Convolutional Networks
CoOp
Learning to Prompt for Vision-Language Models.
WSNFGVC
Web-Supervised Network for Fine-Grained Visual Classification
BriVL
Bridging Vision and Language Model
YOLOX
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
big_transfer
Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.
CAL
[ICCV 2021] Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification
ViT-pytorch-1
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
SPOL
Codes for CVPR2021 paper "Shallow Feature Matters for Weakly Supervised Object Localization"
HLA-Face-Code
Code for HLA-Face: Joint High-Low Adaptation for Low Light Face Detection (CVPR21)
tps_stn_pytorch
PyTorch implementation of Spatial Transformer Network (STN) with Thin Plate Spline (TPS)
volo
VOLO: Vision Outlooker for Visual Recognition
VisionPermutator
MLP-Like Vision Permutator for Visual Recognition (PyTorch)
BBN
The official PyTorch implementation of paper BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition
C2-Matching
Code for C2-Matching (CVPR2021). Paper: Robust Reference-based Super-Resolution via C2-Matching.
EANet
External Attention Network
CLIP
Contrastive Language-Image Pretraining
inat_comp
iNaturalist competition details
DASR
[CVPR 2021] Unsupervised Degradation Representation Learning for Blind Super-Resolution
InvDN
Implementation for the paper: Invertible Denoising Network: A Light Solution for Real Noise Removal (CVPR2021).