A Collection of the Popular Vision Transformer Structure with Support for Downstream Tasks.(Mainly for the Object Detection/Instance Segmentation Tasks)
- Based on the Easy-to-use cvpods
- Support for the state-of-the-art models
- Swin-T/S/B/L
- DeiT
- PVT
- Swin-T
- cvpods
- detectron2
- mmdetection
- swin-transformer