Nexdata-AI / 225-Hours-Swedish-Spontaneous-Speech-Data

Swedish Spontaneous Speech Data

https://www.nexdata.ai/datasets/1249?source=Github

asr audio speaker-recognition speech-recognition swedish voiceprint

225-Hours-Swedish-Spontaneous-Speech-Data

Description

The 225 Hours - Swedish Spontaneous Speech Data, the content covering multiple topics. All the speech audio was manually transcribed into text; speaker identity, gender, and other attribution are also annotated. This dataset can be used for voiceprint recognition model training, corpus construction for machine translation, and algorithm research introduction

For more details, please refer to the link: https://www.nexdata.ai/datasets/1249?source=Github

Specifications

Format

16kHz, 16bit, mono channel;

Content category

including self-meida,interview, etc.

Language

Swedish

Annotation

annotation for the transcription text, speaker identification, gender;

Application scenarios

speech recognitio, video caption generation and video content review;

Accuracy

at a Word Accuracy Rate (WAR) of being no less than 95%.

Licensing Information

Commercial License

About

Swedish Spontaneous Speech Data

https://www.nexdata.ai/datasets/1249?source=Github

asr audio speaker-recognition speech-recognition swedish voiceprint