MaisyZhang / spec_augment

🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Home Page:https://arxiv.org/abs/1904.08779

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

SpecAugment.py

A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

SpecAugment is a SOTA-achieving data augmentation approach on speech recognition. The paper's authors did not publish code that I could find and their implementation was in TensorFlow.

To use:

  1. run install.sh (I recommend to use a unique conda env for the project)
  2. Check out SpecAugment.ipynb (a Jupyter notebook) for the functions.

Augmentations

  1. Time Warp (DONE!)

  2. Time Mask (DONE!)

  3. Frequency Mask (DONE!)

Let's be friends!

About

🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

https://arxiv.org/abs/1904.08779


Languages

Language:Jupyter Notebook 99.8%Language:Python 0.2%Language:Shell 0.0%