abdouaziz / asr-wolof

This repository is about automatic speech recognition in Wolof

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

🌟 Automatique Speech Recognition in Wolof 🌟

Automatic Speech Recognition or ASR is one of the tasks in NLP which consists of transcribing the corresponding text onto an audio clip.

With the advent of deep learning, significant advances have been made in terms of speech recognition.

In this repository, we will implement the models that have allowed this advance in the Wolof language.

Objective

To achieve our goals in this project we will implement two models related to the paper:

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

install the dependencies for this project by running the following commands in your terminal:

 pip install -r requirements.txt

run the deepspeech2 model by running the following command in your terminal:

python deep-speech2/src/train.py --train_file="./input/Train.csv" \
                        --dev_file="./input/Test.csv" \
                        --audio_dir="./input/clips" \
                        --n_filters=256 \
                        --conv_stide=2 \
                        --conv_border='valid' \
                        --n_lstm_units=256 \
                        --n_dense_units=42 \
                        --epochs=10 \
                        --batch_size=32 \
                        --output_dir="./output" \

Ressources

Here are some useful papers for automatique speech recognition :

About

This repository is about automatic speech recognition in Wolof


Languages

Language:Python 100.0%