Claire TAN's starred repositories
ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
ViViT-pytorch
Implementation of ViViT: A Video Vision Transformer
msasl-video-downloader
MS-ASL Video Downloader is a tool for easily downloading videos from the MS-ASL datasets.
VideoTransformer-pytorch
PyTorch implementation of a collections of scalable Video Transformer Benchmarks.