Swall0w's repositories
SparseR-CNN
End-to-End Object Detection with Learnable Proposal, CVPR2021
alpaca_ja
alpacaデータセットを日本語化したものです
CMaskTrack-RCNN
Code for CMaskTrack R-CNN (proposed in Occluded Video Instance Segmentation)
ConformerViT
ViT + Conformer = ( ͡❛ ͜ʖ ͡❛)👌
cycle-diffusion
Code for "Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance"
deep-text-recognition-benchmark
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
DPText-DETR
[AAAI 2023 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
IFC
Video Instance Segmentation using Inter-Frame Communication Transformers (NeurIPS 2021)
norfair
Lightweight Python library for adding real-time object tracking to any detector.
pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
TeViT
Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022 Oral
TextZoom
A super-resolution dataset of paired LR-HR scene text images
TransVTSpotter
A new video text spotting framework with Transformer
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch