Easily reproducible baselines for automatic speech recognition using semi-supervised contrastive learning.
- Download CommonVoice English Dataset
- Setup
config.toml
to use the paths where data was downloaded. - Install requirements using
pip3 install -r requirements.txt
- Prepare data using
python3 -m dataset.prepare
- Train using
python3 -m train
- Start tensorboard using
tensorboard --logdir training_artifacts/tb_logs
- supervised training and dataset
- Check online evaluator piece from Pybolts Simclr
Add more logs.streaming convnets modelsave and load projection weighs for trainingCheck if anything is missing from Athena Simclr