Nexdata-AI / 209-Hours-Portuguese-Speaking-English-Speech-Data-by-Mobile-Phone

Portuguese English Speech Dataset

Home Page:https://www.nexdata.ai/datasets/1023?source=Github

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

209-Hours-Portuguese-Speaking-English-Speech-Data-by-Mobile-Phone

Description

532 Portuguese recorded in a relatively quiet environment in authentic English. The recorded script is designed by linguists and covers a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Android and Apple system phones.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1023?source=Github

Format

Mobile Phone: 16kHz, 16bit, uncompressed wav, mono channel

Recording environment

quiet indoor environment, low background noise, without echo

Recording content (read speech)

generic category; human-machine interaction category; smart home command and control category; in-car command and control category; numbers

Demographics

532 speakers totally, with 47% males and 53% females; and 59% speakers of all are in the age group of 18-25,39% speakers of all are in the age group of 26-45, 2% speakers of all are in the age group of 46-60.

Device

Android mobile phone, iPhone

Language

English

Application Scenarios

speech recognition; voiceprint recognition

Licensing Information

Commercial License