SpecAugment.py

A PyTorch implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

SpecAugment is a data augmentation method for speech recognition that achieved state-of-the-art results. I could not find any code published by the paper's authors, and their implementation was in TensorFlow.

To use:

Run install.sh (I recommend using a dedicated conda env for the project).
Open SpecAugment.ipynb (a Jupyter notebook) to see the functions.

Augmentations

Time Warp (DONE!)
Time Mask (DONE!)
Frequency Mask (DONE!)
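
As a rough illustration of the two masking augmentations listed above, here is a minimal sketch in PyTorch. It is not the code from SpecAugment.ipynb: the function names (`mask_along_axis`, `freq_mask`, `time_mask`), the (freq_bins, time_steps) tensor layout, and the mask widths in the usage lines are all assumptions made for this example. Time warp is left out because it needs an interpolation step that does not fit in a short snippet.

```python
import torch


def mask_along_axis(spec, max_width, num_masks=1, axis=0, mask_value=0.0):
    """Return a copy of `spec` with random contiguous bands set to `mask_value`.

    spec: 2-D tensor, assumed here to be shaped (freq_bins, time_steps).
    axis=0 masks frequency bins, axis=1 masks time steps.
    """
    out = spec.clone()  # leave the caller's tensor untouched
    size = out.shape[axis]
    for _ in range(num_masks):
        # Band width drawn uniformly from [0, max_width], capped at the axis size.
        width = int(torch.randint(0, min(max_width, size) + 1, (1,)))
        # Start index drawn so that the whole band fits inside the axis.
        start = int(torch.randint(0, size - width + 1, (1,)))
        if axis == 0:
            out[start:start + width, :] = mask_value
        else:
            out[:, start:start + width] = mask_value
    return out


def freq_mask(spec, max_width, num_masks=1):
    """Mask up to `max_width` consecutive frequency bins (hypothetical helper)."""
    return mask_along_axis(spec, max_width, num_masks, axis=0)


def time_mask(spec, max_width, num_masks=1):
    """Mask up to `max_width` consecutive time steps (hypothetical helper)."""
    return mask_along_axis(spec, max_width, num_masks, axis=1)


# Usage on a dummy spectrogram: 80 mel bins by 300 frames (illustrative sizes).
spec = torch.randn(80, 300)
augmented = time_mask(freq_mask(spec, max_width=27), max_width=100)
```

Masking with a constant keeps the tensor shape unchanged, so the augmented spectrogram can be fed to a model exactly like the original.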

Let's be friends!

https://arxiv.org/abs/1904.08779
