language-identification language-recognition speech-processing self-supervised-learning

JSTSP

Efficient Self-supervised Learning Representations for Spoken Language Identification.

Before running the training scripts,

Download asv-subtools and move the folder "subtool" to your kaldi/esg/
Download s3prl and run the pre_processing.py in s3prl to extract the features.

An example:

python pre_processing.py --json xsa_config.json
python train_xsa.py --json xsa_config.json

Before running, pls revise the json configuration file according to your own root.
Mainly the "Input" part, you can keep others since they are the parameters I am using :)

About

Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification

language-identification language-recognition speech-processing self-supervised-learning

Languages

Language:Python 100.0%