sergregory / cvat

Powerful and efficient Computer Vision Annotation Tool (CVAT)

Home Page:

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Computer Vision Annotation Tool (CVAT)

CI Codacy Badge Gitter chat Coverage Status DOI

CVAT is free, online, interactive video and image annotation tool for computer vision. It is being used by our team to annotate million of objects with different properties. Many UI and UX decisions are based on feedbacks from professional data annotation team. Try it online

CVAT screenshot



Supported annotation formats

Format selection is possible after clicking on the Upload annotation and Dump annotation buttons. Datumaro dataset framework allows additional dataset transformations via its command line tool and Python library.

For more information about supported formats look at the documentation.

Annotation format Import Export
CVAT for images X X
CVAT for a video X X
Datumaro X
Segmentation masks from PASCAL VOC X X
MS COCO Object Detection X X
TFrecord X X
LabelMe 3.0 X X
ImageNet X X
CamVid X X

Deep learning models for automatic labeling

Name Type Framework CPU GPU
Deep Extreme Cut interactor OpenVINO X
Faster RCNN detector TensorFlow X X
Mask RCNN detector OpenVINO X
YOLO v3 detector OpenVINO X
Text detection v4 detector OpenVINO X
Semantic segmentation for ADAS detector OpenVINO X
Mask RCNN detector TensorFlow X
Object reidentification reid OpenVINO X

Online demo:

This is an online demo with the latest version of the annotation tool. Try it online without local installation. Only own or assigned tasks are visible to users.

Disabled features:


  • No more than 10 tasks per user
  • Uploaded data is limited to 500Mb


Automatically generated Swagger documentation for Django REST API is available on <cvat_origin>/api/swagger (default: localhost:8080/api/swagger).

Swagger documentation is visiable on allowed hostes, Update environement variable in docker-compose.yml file with cvat hosted machine IP or domain name. Example - ALLOWED_HOSTS: 'localhost,')


Code released under the MIT License.


CVAT usage related questions or unclear concepts can be posted in our Gitter chat for quick replies from contributors and other users.

However, if you have a feature request or a bug report that can reproduced, feel free to open an issue (with steps to reproduce the bug if it's a bug report) on GitHub* issues.

If you are not sure or just want to browse other users common questions, Gitter chat is the way to go.

Other ways to ask questions and get our support:


Projects using CVAT

  • Onepanel - Onepanel is an open source vision AI platform that fully integrates CVAT with scalable data processing and parallelized training pipelines.


Powerful and efficient Computer Vision Annotation Tool (CVAT)

License:MIT License


Language:TypeScript 41.1%Language:JavaScript 30.6%Language:Python 26.2%Language:SCSS 1.6%Language:Shell 0.2%Language:Dockerfile 0.2%Language:HTML 0.1%