There are 6 repositories under the vision-transformers topic.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Recent Transformer-based CV and related works.
A collection of resources on applications of Transformers in Medical Imaging.
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
Official implementation for the paper "Deep ViT Features as Dense Visual Descriptors".
🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web.
[NIVT Workshop @ ICCV 2023] SeMask: Semantically Masked Transformers for Semantic Segmentation
Implementation of MetNet-3, SOTA neural weather model out of Google DeepMind, in PyTorch
SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)
Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.
Monocular depth estimation for in-the-wild autofocus applications.
An unofficial implementation of ViTPose [Y. Xu et al., 2022]
Determine whether a given video sequence has been manipulated or synthetically generated
Few-Shot Diffusion Models
[NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang
Library - Vanilla, ViT, DeiT, BERT, GPT
Code for the paper "A Light Recipe to Train Robust Vision Transformers" [SaTML 2023]
Recent Advances on Efficient Vision Transformers
[ICCV'21] [TensorFlow] Semi-supervised Retinal Image Synthesis and Disease Prediction using Vision Transformers
(Unofficial) PyTorch implementation of Training Vision Transformers for Image Retrieval (El-Nouby, Alaaeldin, et al. 2021).
This repository hosts code for converting the original Vision Transformer models (JAX) to TensorFlow.
Implementation of the Vision Transformer. ⭐⭐⭐
This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.
iSegFormer: Interactive Image/Volume Segmentation using Vision Transformers (MICCAI 2022)