Data Labeling Platforms Comparison
Data labeling is a hot topic as a recent report from AI research and advisory firm Cognilytica found that over 80% of the time enterprises spend on AI projects goes toward preparing, cleaning and labeling data.
A new category of tools has emerged, called data labeling platforms, training data management platforms. Three years ago, this market did not exist. Today, the offer is plentiful and it is sometimes complicated to find one's way.
A good annotation tool must meet three needs:
- annotate quickly (productivity of annotators)
- annotate well (control the quality / consistency of annotations)
- annotate the right data (source diversity in the dataset)
Here is an attempt to do a fair comparison of data labeling / training data platforms.
I tried to be as accurate as possible, based on my experience with those solutions and their documentation. If you see any mistake (this is likely), please create an issue / push a PR.
Table of Content
- Tasks / Tools by Asset Type
- Productivity Management Features
- Quality Management Features
- Dataset Management Features
- API
- Deployment
- Automation
- Workforce
- Security
Tasks / Tools by Asset Type
Image
The criteria are as follows:
- Classification
- Hierarchical classification
- Bounding Box
- Bounding Box
- Polygon
- Polyline
- Pose estimation
- Point
- Segmentation
- Vector Segmentation
- Image transcription / OCR
- Object relation
- Satellite imagery support
- High-resolution support
- DICOM support
Classif. | Hierarchical classif. | Bounding Box | Polygon | Polyline | Pose estimation | Point | |
---|---|---|---|---|---|---|---|
Appen | ✔ | ✔ | ✔ | ✔ | ✔ | ||
ClarifAI | ✔ | ||||||
CVAT | ✔ | ✔ | ✔ | ✔ | ✔ | ||
Dataloop | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | |
Hasty | ✔ | ✔ | ✔ | ||||
Hive | ✔ | ✔ | ✔ | ✔ | |||
Innotescus | ✔ | ✔ | |||||
Kili | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
Labelbox | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | |
LabelStudio | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | |
Playment | ✔ | ✔ | ✔ | ✔ | ✔ | ||
Roboflow | ✔ | ✔ | |||||
Scale | ✔ | ✔ | ✔ | ✔ | |||
Snorkel | |||||||
SuperAnnotate | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | |
Supervisely | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
V7 | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
Seg. | Vector Seg. | Image transcription / OCR | Object relation | Satellite | High-resolution support | DICOM support | |
---|---|---|---|---|---|---|---|
Appen | ✔ | ||||||
ClarifAI | ✔ | ✔ | |||||
CVAT | |||||||
Dataloop | ✔ | ✔ | |||||
Hasty | ✔ | ||||||
Hive | ✔ | ✔ | |||||
Innotescus | ✔ | ||||||
Kili | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | |
Labelbox | ✔ | ✔ | ✔ | ||||
LabelStudio | ✔ | ✔ | ✔ | ✔ | ✔ | ||
Playment | ✔ | ||||||
Roboflow | |||||||
Scale | ✔ | ||||||
Snorkel | |||||||
SuperAnnotate | ✔ | ✔ | |||||
Supervisely | ✔ | ||||||
V7 | ✔ | ✔ | ✔ | ✔ | ✔ |
Text
The criteria are as follows:
- Classification
- Hierarchical classification
- Named entities recognition
- Relations extraction
Classification | Hierarchical classification | Named entities recognition | Relations extraction | |
---|---|---|---|---|
Appen | ✔ | ✔ | ||
ClarifAI | ✔ | ✔ | ||
CVAT | ||||
Dataloop | ✔ | ✔ | ✔ | |
Hasty | ||||
Hive | ✔ | ✔ | ||
Innotescus | ||||
Kili | ✔ | ✔ | ✔ | ✔ |
Labelbox | ✔ | ✔ | ✔ | ✔ |
LabelStudio | ✔ | ✔ | ✔ | ✔ |
Playment | ||||
Roboflow | ||||
Scale | ||||
Snorkel | ✔ | ✔ | ✔ | |
SuperAnnotate | ✔ | ✔ | ||
Supervisely | ||||
V7 | ✔ | ✔ | ✔ |
The criteria are as follows:
- PDF document processing
- Bounding Box on PDF
- Object relation
PDF document processing | Bounding Box on PDF | Object relation | |
---|---|---|---|
Appen | |||
ClarifAI | |||
CVAT | |||
Dataloop | |||
Hasty | |||
Hive | ✔ | ✔ | |
Innotescus | |||
Kili | ✔ | ✔ | ✔ |
Labelbox | ✔ | ||
LabelStudio | ✔ | ||
Playment | |||
Roboflow | |||
Scale | ✔ | ||
Snorkel | ✔ | ||
SuperAnnotate | |||
Supervisely | |||
V7 | ✔ |
Video
The criteria are as follows:
- Classification
- Hierarchical classification
- Native Video/Frame
- Classification
- Object detection
- Transcription
Classif. | Hierarchical classif. | Native Video/Frame | Classification | Object detection | Transcription | |
---|---|---|---|---|---|---|
Appen | ✔ | ✔ | ✔ | ✔ | ||
ClarifAI | ✔ | ✔ | ||||
CVAT | ✔ | |||||
Dataloop | ✔ | ✔ | ✔ | |||
Hasty | ||||||
Hive | ✔ | ✔ | ✔ | ✔ | ||
Innotescus | ✔ | ✔ | ✔ | |||
Kili | ✔ | ✔ | ✔ | ✔ | ||
Labelbox | ✔ | ✔ | ✔ | ✔ | ||
LabelStudio | ✔ | ✔ | ✔ | ✔ | ✔ | |
Playment | ✔ | ✔ | ||||
Roboflow | ||||||
Scale | ✔ | |||||
Snorkel | ||||||
SuperAnnotate | ✔ | ✔ | ✔ | ✔ | ||
Supervisely | ✔ | ✔ | ✔ | |||
V7 | ✔ | ✔ | ✔ | ✔ |
Audio
The criteria are as follows:
- Voice transcription / Speech to text
- Diarization
- Ultrasound support
Voice transcription / Speech to text | Diarization | Ultrasound support | |
---|---|---|---|
Appen | ✔ | ||
ClarifAI | |||
CVAT | |||
Dataloop | |||
Hasty | |||
Hive | ✔ | ✔ | |
Innotescus | |||
Kili | ✔ | ✔ | |
Labelbox | ✔ | ||
LabelStudio | ✔ | ||
Playment | |||
Roboflow | |||
Scale | |||
Snorkel | |||
SuperAnnotate | |||
Supervisely | |||
V7 |
Timeserie
The criteria are as follows:
- Classification
- Range detection
Classification | Range detection | |
---|---|---|
Appen | ||
ClarifAI | ||
CVAT | ||
Dataloop | ||
Hasty | ||
Hive | ||
Innotescus | ||
Kili | ✔ | ✔ |
Labelbox | ||
LabelStudio | ✔ | ✔ |
Playment | ||
Roboflow | ||
Scale | ||
Snorkel | ✔ | ✔ |
SuperAnnotate | ||
Supervisely | ||
V7 |
Productivity Management Features
The criteria are as follows:
- Interface Customization
- Asset preloading
- Keyboard shortcuts / Hotkeys
- Workload distribution
- Autosave
- Labeler productivity dashboard
Interface Customization | Asset preloading | Keyboard shortcuts / Hotkeys | Workload distribution | Autosave | Labeler productivity dashboard | |
---|---|---|---|---|---|---|
Appen | ||||||
ClarifAI | ✔ | |||||
CVAT | ✔ | ✔ | ✔ | |||
Dataloop | ✔ | ✔ | ✔ | |||
Hasty | ✔ | |||||
Hive | ✔ | ✔ | ||||
Innotescus | ✔ | ✔ | ||||
Kili | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
Labelbox | ✔ | ✔ | ✔ | ✔ | ||
LabelStudio | ✔ | ✔ | ✔ | ✔ | ||
Playment | ✔ | ✔ | ||||
Roboflow | ✔ | ✔ | ||||
Scale | ||||||
Snorkel | ✔ | |||||
SuperAnnotate | ✔ | ✔ | ✔ | ✔ | ||
Supervisely | ✔ | ✔ | ✔ | ✔ | ||
V7 | ✔ | ✔ | ✔ | ✔ |
Quality Management Features
The criteria are as follows:
- Issue management
- Consensus Image
- Consensus Video
- Consensus Text
- Consensus Audio
Issue management | Consensus Image | Consensus Video | Consensus Text | Consensus Audio | |
---|---|---|---|---|---|
Appen | |||||
ClarifAI | ✔ | ||||
CVAT | |||||
Dataloop | ✔ | ✔ | |||
Hasty | ✔ | ||||
Hive | |||||
Innotescus | ✔ | ||||
Kili | ✔ | ✔ | ✔ | ✔ | |
Labelbox | ✔ | ✔ | ✔ | ||
LabelStudio | ✔ | ✔ | ✔ | ✔ | |
Playment | |||||
Roboflow | ✔ | ✔ | |||
Scale | |||||
Snorkel | |||||
SuperAnnotate | ✔ | ✔ | |||
Supervisely | ✔ | ✔ | |||
V7 | ✔ |
Dataset Management Features
The criteria are as follows:
- Data hub
- Version control
- Public project
- ML export formats
Data hub | Version control | Public project | ML export formats | |
---|---|---|---|---|
Appen | ✔ | |||
ClarifAI | ||||
CVAT | ✔ | |||
Dataloop | ✔ | ✔ | ✔ | |
Hasty | ✔ | |||
Hive | ||||
Innotescus | ||||
Kili | ✔ | ✔ | ✔ | ✔ |
Labelbox | ||||
LabelStudio | ✔ | |||
Playment | ✔ | |||
Roboflow | ✔ | ✔ | ✔ | |
Scale | ✔ | |||
Snorkel | ||||
SuperAnnotate | ✔ | |||
Supervisely | ✔ | |||
V7 | ✔ | ✔ | ✔ |
API
How to programmatically interact with the tool. The criteria are as follows:
- GraphQL API
- Messaging bus API
- CLI SDK
- Python SDK
GraphQL API | Messaging bus API | CLI SDK | Python SDK | |
---|---|---|---|---|
Appen | ||||
ClarifAI | ✔ | |||
CVAT | ||||
Dataloop | ✔ | |||
Hasty | ||||
Hive | ✔ | |||
Innotescus | ||||
Kili | ✔ | ✔ | ✔ | |
Labelbox | ✔ | ✔ | ||
LabelStudio | ✔ | |||
Playment | ✔ | |||
Roboflow | ✔ | |||
Scale | ✔ | |||
Snorkel | ||||
SuperAnnotate | ✔ | |||
Supervisely | ✔ | |||
V7 | ✔ | ✔ |
Deployment
The different options to deploy the tool. The criteria are as follows:
- On-premise data
- On-premise deployment
- Air gap deployment
On-premise data | On-premise deployment | Air gap deployment | |
---|---|---|---|
Appen | |||
ClarifAI | ✔ | ✔ | |
CVAT | ✔ | ✔ | |
Dataloop | ✔ | ||
Hasty | |||
Hive | |||
Innotescus | |||
Kili | ✔ | ✔ | ✔ |
Labelbox | ✔ | ✔ | |
LabelStudio | ✔ | ✔ | |
Playment | |||
Roboflow | ✔ | ||
Scale | |||
Snorkel | ✔ | ||
SuperAnnotate | |||
Supervisely | ✔ | ||
V7 |
Automation
The offering in terms of labeling automation. The criteria are as follows:
- Model-assisted labeling
- Model based preannotation
- Queue prioritisation
- Model Training
- Model Inference
Model-assisted labeling | Model based preannotation | Queue prioritisation | Model Training | Model Inference | |
---|---|---|---|---|---|
Appen | ✔ | ✔ | ✔ | ||
ClarifAI | ✔ | ✔ | ✔ | ||
CVAT | |||||
Dataloop | ✔ | ✔ | |||
Hasty | ✔ | ✔ | ✔ | ✔ | |
Hive | |||||
Innotescus | ✔ | ||||
Kili | ✔ | ✔ | ✔ | ✔ | |
Labelbox | ✔ | ✔ | ✔ | ||
LabelStudio | ✔ | ||||
Playment | ✔ | ||||
Roboflow | ✔ | ✔ | ✔ | ||
Scale | ✔ | ✔ | |||
Snorkel | ✔ | ✔ | ✔ | ||
SuperAnnotate | ✔ | ✔ | ✔ | ||
Supervisely | ✔ | ✔ | |||
V7 | ✔ | ✔ | ✔ |
Workforce
Availability of workforce to help labeling. The criteria are as follows:
- Human in the loop
- External workforce
- On-premise external workforce
Human in the loop | External workforce | On-premise external workforce | |
---|---|---|---|
Appen | ✔ | ||
ClarifAI | |||
CVAT | |||
Dataloop | |||
Hasty | |||
Hive | |||
Innotescus | |||
Kili | ✔ | ✔ | ✔ |
Labelbox | ✔ | ||
LabelStudio | |||
Playment | |||
Roboflow | |||
Scale | ✔ | ||
Snorkel | |||
SuperAnnotate | |||
Supervisely | |||
V7 |
Security
Security certifications. The criteria are as follows:
- HIPAA Compliance
- SOC 2 Compliance
HIPAA Compliance | SOC 2 Compliance | |
---|---|---|
Appen | ✔ | ✔ |
ClarifAI | ✔ | ✔ |
CVAT | ||
Dataloop | ✔ | |
Hasty | ||
Hive | ||
Innotescus | ||
Kili | ✔ | |
Labelbox | ✔ | ✔ |
LabelStudio | ||
Playment | ✔ | |
Roboflow | ✔ | ✔ |
Scale | ✔ | ✔ |
Snorkel | ||
SuperAnnotate | ||
Supervisely | ||
V7 | ✔ | ✔ |