iammartian0 / Audio_Tasks

Different Task Guides for Audio Data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Audio_Tasks

Different Task Guides for Audio Data

Audio Classification

The goal of this task is to categorize audio input into different types such as Music, Speech, or sounds from Nature.

Automatic Speech Recognition(Speech to text)

This task involves coverting spoken words(Speech) into text. The use cases are Communicating with computer-machines, Voice activated commands, Live-transcription, Live-Translation etc..

Text to Speech

This task is opposite of ASR i.e. converting input text to synthetic speech. Different use cases are helping visually impaired people, Live communication in foreign language etc..

others