zhang-tao-whu / DVIS_Plus

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Tao Zhang, XingYe Tian, Yikang Zhou, ShunPing Ji, Xuebo Wang, Xin Tao,

Yuan Zhang, Pengfei Wan, Zhongyuan Wang and Yu Wu

PWC PWC PWC PWC PWC PWC

News

  • DVIS and DVIS++ achieved 1st place in the VPS Track of the PVUW challenge at CVPR 2023. 2023.5.25
  • DVIS and DVIS++ achieved 1st place in the VIS Track of the 5th LSVOS challenge at ICCV 2023. 2023.8.15

Features

  • DVIS++ is a universal video segmentation framework that supports VIS, VPS and VSS.
  • DVIS++ can run in both online and offline modes.
  • DVIS++ achieved SOTA performance on YTVIS 2019&2021&2022, OVIS, VIPSeg and VSPW datasets.
  • OV-DVIS++ is the first open-vocabulary video universal segmentation framework with powerful zero-shot segmentation capability.

Demos

VIS

VSS

VPS

Open-vocabulary demos

Installation

See Installation Instructions.

Getting Started

See Preparing Datasets for DVIS++.

See Getting Started with DVIS++.

Model Zoo

Trained models are available for download in the DVIS++ Model Zoo.

Citing DVIS and DVIS++

@article{zhang2023dvis,
  title={DVIS: Decoupled Video Instance Segmentation Framework},
  author={Zhang, Tao and Tian, Xingye and Wu, Yu and Ji, Shunping and Wang, Xuebo and Zhang, Yuan and Wan, Pengfei},
  journal={arXiv preprint arXiv:2306.03413},
  year={2023}
}

@article{zhang2023dvisplus,
  title={DVIS++: Improved Decoupled Framework for Universal Video Segmentation}, 
  author={Tao Zhang and Xingye Tian and Yikang Zhou and Shunping Ji and Xuebo Wang and Xin Tao and Yuan Zhang and Pengfei Wan and Zhongyuan Wang and Yu Wu},
  journal={arXiv preprint arXiv:2312.13305},
  year={2023},
}

Acknowledgement

This repo is largely based on Mask2Former, MinVIS, VITA, CTVIS, FC-CLIP and DVIS. Thanks for their excellent works.

About

License:MIT License


Languages

Language:Python 95.4%Language:Cuda 4.1%Language:C++ 0.5%Language:Shell 0.0%