tarun360 / LanguageIDORL

Language ID

This repository contains code for language identification from speech utterances. It uses the s3prl library to load upstream models such as wav2vec2, CPC, and TERA.
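As a rough illustration of how an upstream model can be loaded through s3prl, here is a minimal sketch. It is not taken from this repository: the helper names `load_upstream` and `extract_hidden_states` are hypothetical, and the sketch assumes s3prl's hub interface, where each upstream is exposed as an attribute of `s3prl.hub` and a forward pass on a list of 16 kHz waveforms returns a dictionary with a `"hidden_states"` entry. The exact hub name for a given model (for example CPC) may differ from the name used in prose, so check the s3prl documentation.

```python
def load_upstream(name="wav2vec2"):
    """Return a pretrained s3prl upstream by hub name (hypothetical helper)."""
    # Imported lazily so this file can be read without s3prl installed.
    import s3prl.hub as hub

    if not hasattr(hub, name):
        raise ValueError(f"unknown s3prl upstream: {name!r}")
    # Each hub entry is a constructor; calling it builds the model
    # (and downloads pretrained weights on first use).
    return getattr(hub, name)()


def extract_hidden_states(upstream, wavs):
    """Run an upstream on a list of 1-D waveforms and return its hidden states.

    `wavs` is assumed to be a list of torch.FloatTensor sampled at 16 kHz.
    """
    upstream.eval()  # switch off dropout and similar training-only behaviour
    # For pure feature extraction, wrap this call in torch.no_grad().
    return upstream(wavs)["hidden_states"]
```

A language classifier would then typically pool these frame-level representations over time and feed them to a small classification head; how this repository does that is defined in its own training code, not in this sketch.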

Installation

Use pip to install the packages required for preparing the dataset and for training and testing the model.

pip install -r requirements.txt

Usage

Update Config and Logger

Edit config.py to set the upstream model, batch_size, gpus, lr, etc., and select the preferred logger in the train.py files.
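To make the kind of settings involved concrete, here is a hypothetical sketch of such a configuration. The names `batch_size`, `gpus`, and `lr` come from the description above; the class name `TrainConfig`, the field name `upstream_model`, and all default values are assumptions for illustration only, and the actual layout of config.py in this repository may differ.

```python
from dataclasses import dataclass, asdict


@dataclass
class TrainConfig:
    """Illustrative training settings; not the repository's actual config."""
    upstream_model: str = "wav2vec2"  # assumed field name for the s3prl upstream
    batch_size: int = 16              # illustrative default
    gpus: int = 1                     # number of GPUs to train on
    lr: float = 1e-4                  # learning rate, illustrative default


# Default settings.
config = TrainConfig()

# Switching the upstream and learning rate for another experiment.
override = TrainConfig(upstream_model="tera", lr=5e-5)
print(asdict(override))
```

Keeping these values in one place means a new experiment only requires changing the configuration, not the training script.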

Training

./run.sh

Testing

./run_test.sh

License

MIT

Reference

  • [1] A. T. Liu and S.-w. Yang. S3PRL: The self-supervised speech pre-training and representation learning toolkit.
