Yutong Lin's repositories
Plain-DETR
[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design
pytorch-fid
Compute FID scores with PyTorch.
Swin-Transformer-Object-Detection
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
CLIP1
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter NotebookMIT000
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Language:PythonMIT000
k-vim
vim配置
Language:Vim Script000
taming-transformers-1
Taming Transformers for High-Resolution Image Synthesis
Language:Jupyter NotebookMIT000