There are 2 repositories under vision-transformer-models topic.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Multi-label classification based on timm.
Multi-label classification based on timm, and add SimCLR to timm.
Solution for NeurIPS 2023 - MedFM Challenge
Code for the base version of the the model vision transformer in pytorch.
This project focuses on evaluating Convolutional Neural Networks (CNN) and Vision Transformers (ViT) for image classification tasks, specifically distinguishing between Asian elephants and African elephants.