Contrastive Multiview Coding (self-supervised learning from multiple sensors/views/modalities)
Official DeiT repository
End-to-End Object Detection with Transformers
A fast, high-compression read-only file system
The project is about predicting sets (of classes) from images.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
PyTorch implementation of the paper "Paint Transformer: Feed Forward Neural Painting with Stroke Prediction", ICCV 2021.
A utility to read and write PDFs with Python
PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXt, DPN, EfficientNet, MixNet, MobileNet-V3/V2, MNASNet, Single-Path NAS, FBNet, and more
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
Toolbox of models, callbacks, and datasets for AI/ML researchers.
Reformer, the efficient Transformer, in PyTorch
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
A PyTorch-based library for all things neural differential equations
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
[CVPR 2020] The official PyTorch implementation of "Visual Commonsense R-CNN"
Datasets, Transforms and Models specific to Computer Vision
PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
You Only Look at One Sequence (https://arxiv.org/abs/2106.00666)