edarchimbaud / data-labeling-platforms-comparison

Feature benchmarks of main data labeling platforms

Data Labeling Platforms Comparison

Data labeling is a hot topic: a recent report from the AI research and advisory firm Cognilytica found that over 80% of the time enterprises spend on AI projects goes toward preparing, cleaning, and labeling data.

A new category of tools has emerged, known as data labeling platforms or training data management platforms. Three years ago, this market did not exist. Today, the options are plentiful, and it can be hard to find one's way among them.

A good annotation tool must meet three needs:

  • annotate quickly (productivity of annotators)
  • annotate well (control the quality / consistency of annotations)
  • annotate the right data (source diversity in the dataset)

This is an attempt at a fair comparison of data labeling / training data platforms.

I tried to be as accurate as possible, based on my experience with these solutions and their documentation. If you spot a mistake (which is likely), please open an issue or submit a PR.

Table of Contents

Tasks / Tools by Asset Type

Image

The criteria are as follows:

  • Classification
  • Hierarchical classification
  • Bounding Box
  • Polygon
  • Polyline
  • Pose estimation
  • Point
  • Segmentation
  • Vector Segmentation
  • Image transcription / OCR
  • Object relation
  • Satellite imagery support
  • High-resolution support
  • DICOM support
Classif. | Hierarchical classif. | Bounding Box | Polygon | Polyline | Pose estimation | Point
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7
Seg. | Vector Seg. | Image transcription / OCR | Object relation | Satellite | High-resolution support | DICOM support
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Text

The criteria are as follows:

  • Classification
  • Hierarchical classification
  • Named entities recognition
  • Relations extraction
Classification | Hierarchical classification | Named entities recognition | Relations extraction
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

PDF

The criteria are as follows:

  • PDF document processing
  • Bounding Box on PDF
  • Object relation
PDF document processing | Bounding Box on PDF | Object relation
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Video

The criteria are as follows:

  • Classification
  • Hierarchical classification
  • Native Video/Frame
  • Classification
  • Object detection
  • Transcription
Classif. | Hierarchical classif. | Native Video/Frame | Classification | Object detection | Transcription
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Audio

The criteria are as follows:

  • Voice transcription / Speech to text
  • Diarization
  • Ultrasound support
Voice transcription / Speech to text | Diarization | Ultrasound support
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Time series

The criteria are as follows:

  • Classification
  • Range detection
Classification | Range detection
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Productivity Management Features

The criteria are as follows:

  • Interface Customization
  • Asset preloading
  • Keyboard shortcuts / Hotkeys
  • Workload distribution
  • Autosave
  • Labeler productivity dashboard
Interface Customization | Asset preloading | Keyboard shortcuts / Hotkeys | Workload distribution | Autosave | Labeler productivity dashboard
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Quality Management Features

The criteria are as follows:

  • Issue management
  • Consensus Image
  • Consensus Video
  • Consensus Text
  • Consensus Audio
Issue management | Consensus Image | Consensus Video | Consensus Text | Consensus Audio
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7
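
The README does not define how "consensus" is measured, and each platform uses its own formula. As a rough illustration only, the sketch below shows two common ways to score agreement between annotators: intersection over union (IoU) for two bounding boxes, and the share of annotators who picked the majority label for a classification task. The function names `iou` and `label_agreement` and the corner-based box layout are assumptions made for this example, not any vendor's API.

```python
from collections import Counter
from typing import Sequence, Tuple

# Assumed box layout for this sketch: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes (0.0 to 1.0)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def label_agreement(labels: Sequence[str]) -> float:
    """Fraction of annotators who chose the most common label."""
    if not labels:
        return 0.0
    most_common_count = Counter(labels).most_common(1)[0][1]
    return most_common_count / len(labels)
```

For instance, `iou((0, 0, 2, 2), (1, 1, 3, 3))` compares two annotators' boxes on the same object, and `label_agreement(["cat", "cat", "dog"])` scores three annotators' answers on the same image. A real consensus feature would also need to match objects between annotators before comparing them, which this sketch leaves out.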

Dataset Management Features

The criteria are as follows:

  • Data hub
  • Version control
  • Public project
  • ML export formats
Data hub | Version control | Public project | ML export formats
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7
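
To make the "ML export formats" criterion more concrete, here is a small sketch of what an export step typically has to do for bounding boxes. It converts a corner-based box into two widely used layouts: the COCO convention (`[x_min, y_min, width, height]` in pixels) and the YOLO convention (center and size normalized by the image dimensions). The README does not say which formats each tool supports, so COCO and YOLO are used here purely as examples, and the helper names `to_coco` and `to_yolo` are hypothetical.

```python
from typing import List, Tuple

# Assumed input layout for this sketch: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


def to_coco(box: Box) -> List[float]:
    """Return [x_min, y_min, width, height] in pixels (COCO bbox layout)."""
    x_min, y_min, x_max, y_max = box
    return [x_min, y_min, x_max - x_min, y_max - y_min]


def to_yolo(box: Box, img_w: float, img_h: float) -> Tuple[float, float, float, float]:
    """Return (x_center, y_center, width, height), each normalized to 0..1."""
    x_min, y_min, x_max, y_max = box
    return (
        (x_min + x_max) / 2 / img_w,
        (y_min + y_max) / 2 / img_h,
        (x_max - x_min) / img_w,
        (y_max - y_min) / img_h,
    )
```

Calling `to_coco((10, 20, 50, 80))` and `to_yolo((10, 20, 50, 80), img_w=100, img_h=200)` on the same box shows how the two layouts differ. Full exporters also write class ids, image metadata, and file structure, which are omitted here.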

API

How to interact with the tool programmatically. The criteria are as follows:

  • GraphQL API
  • Messaging bus API
  • CLI SDK
  • Python SDK
GraphQL API | Messaging bus API | CLI SDK | Python SDK
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Deployment

The options for deploying the tool. The criteria are as follows:

  • On-premise data
  • On-premise deployment
  • Air gap deployment
On-premise data | On-premise deployment | Air gap deployment
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Automation

What each tool offers for labeling automation. The criteria are as follows:

  • Model-assisted labeling
  • Model-based pre-annotation
  • Queue prioritisation
  • Model Training
  • Model Inference
Model-assisted labeling | Model-based pre-annotation | Queue prioritisation | Model Training | Model Inference
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7
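
The README does not specify how queue prioritisation works on any platform. One common strategy, shown here as a minimal sketch under that assumption, is uncertainty sampling: assets whose model predictions are least confident are sent to annotators first. The function names `entropy` and `prioritise` and the input shape (asset id mapped to class probabilities) are hypothetical.

```python
import math
from typing import Dict, List


def entropy(probs: List[float]) -> float:
    """Shannon entropy (in nats) of a class-probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def prioritise(predictions: Dict[str, List[float]]) -> List[str]:
    """Order asset ids from most to least uncertain model prediction."""
    return sorted(predictions, key=lambda k: entropy(predictions[k]), reverse=True)


# Example input: asset id -> predicted class probabilities (made-up values).
queue = prioritise({
    "img_a": [0.98, 0.02],
    "img_b": [0.50, 0.50],
    "img_c": [0.80, 0.20],
})
```

Here `img_b` comes first because the model is split evenly between the two classes, followed by `img_c` and then `img_a`. Other criteria, such as sampling for source diversity, could replace the entropy score without changing the overall structure.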

Workforce

Availability of a workforce to help with labeling. The criteria are as follows:

  • Human in the loop
  • External workforce
  • On-premise external workforce
Human in the loop | External workforce | On-premise external workforce
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7

Security

Security certifications. The criteria are as follows:

  • HIPAA Compliance
  • SOC 2 Compliance
HIPAA Compliance | SOC 2 Compliance
Appen
ClarifAI
CVAT
Dataloop
Hasty
Hive
Innotescus
Kili
Labelbox
LabelStudio
Playment
Roboflow
Scale
Snorkel
SuperAnnotate
Supervisely
V7
