asr asr-model audio dataset deep-learning speech speech-recognition speech-to-text

762-Hours-Non-Hispanic-Spanish-Speech-Data-by-Mobile-Phone

Description

1,630 non-Spanish nationality native Spanish speakers such as Mexicans and Colombians participated in the recording with authentic accent. The recorded script is designed by linguists and cover a wide range of topics including generic, interactive, in-vehicle and home. The text is manually proofread with high accuracy. It matches with mainstream Android and Apple system phones.

For more details, please refer to the link: https://www.nexdata.ai/datasets/970?source=Github

Format

Mobile phone, 16kHz, 16bit, uncompressed wav, Mono channel

Recording environment

quiet indoor environment, low background noise, without echo

Recording content (read speech)

oral category; human-machine interaction category; smart home command and in-car command category; numbers; news category

Demographics

1,630 speakers totally, with 48% male and 52% female; and 55% speakers of all are in the age group of 16-25,40% speakers of all are in the age group of 26-45, 5% speakers of all are in the age group of 46-69;

Device

iPhone, Android mobile phone

Application scene

speech recognition; voiceprint recognition

Licensing Information

Commercial License

About

Non Hispanic Spanish Speech Dataset

https://www.nexdata.ai/datasets/970?source=Github

asr asr-model audio dataset deep-learning speech speech-recognition speech-to-text