audio audio-classification audio-data audio-processing automatic-speech-recognition huggingface-transformers speecht5 text-to-speech transcribe translation whisper deep-learning

Audio_Tasks

Different Task Guides for Audio Data

Audio Classification

The goal of this task is to categorize audio input into different types such as Music, Speech, or sounds from Nature.

Automatic Speech Recognition(Speech to text)

This task involves coverting spoken words(Speech) into text. The use cases are Communicating with computer-machines, Voice activated commands, Live-transcription, Live-Translation etc..

Text to Speech

This task is opposite of ASR i.e. converting input text to synthetic speech. Different use cases are helping visually impaired people, Live communication in foreign language etc..

others

About

Different Task Guides for Audio Data

audio audio-classification audio-data audio-processing automatic-speech-recognition huggingface-transformers speecht5 text-to-speech transcribe translation whisper deep-learning

MIT License

Languages

Language:Jupyter Notebook 100.0%