CalvinMT / neural-acoustic-word-embeddings

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Neural Acoustic Word Embeddings for Speech Commands v0.02 & DyLNet

Overview:

This is a recipe for learning neural acoustic word embeddings for a subset of Speech Commands v0.02 & DyLNet. The models are explained in greater detail in Settle & Livescu, 2016 as well as Settle et al., 2017:

  • S. Settle and K. Livescu, "Discriminative Acoustic Word Embeddings: Recurrent Neural Network-Based Approaches," in Proc. SLT, 2016.
  • S. Settle, K. Levin, H. Kamper, and K. Livescu, "Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings," in Proc. Interspeech, 2017.

Contents:

code/

  • python code to create, run, and save the model

Steps:

  1. Ensure access to installed dependencies.

  2. Clone repo.

  3. Download Speech Commands v0.02 corpus

  4. Navigate to code directory and run "python main.py -t=0.05 -l=sc2 <corpus_dir>". This will train, evaluate, and save the model named "sc2" on 5% of the corpus.

About


Languages

Language:Shell 73.7%Language:Python 20.3%Language:Perl 5.9%