ivaste / KeyWordSpotting

Detect a set of predefined keywords in an audioclip.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

End-to-End Framework for Key-Word Spotting

Detect a set of predefined keywords in an audioclip.

KeyWordSpotting

๐Ÿ“ˆ Download Presentation PDF

๐Ÿ“„ Download Paper PDF

๐Ÿ•น๏ธ Try Live Demo

๐Ÿ“บ Watch Presentation

๐Ÿ“บ Watch Live Demo

Table of Contents

Overview

The End-to-End framework for the KeyWordSpotting task is made of a sliding window of 1 second, a Voice Activity Detection module or a Silence Filter that select onfly the frames containig human voice, from those frames a feature extraction module will extract the Mel Spectogram or the Mel Cepstral Coefficients, this will be the input of the model. Finally a fusion rule aggregates all frames pedictions in a single one.

Usage

.......

Download best model

If you just want to download and use the best model in your application you need to...

import librosa
import numpy as np
import Models #Our models
import LoadAndPreprocessDataset
from tensorflow.keras.models import load_model
.....

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Team

Stefano Ivancich Luca Masiero
Stefano Ivancich Luca Masiero
github.com/ivaste github.com/TyllanDrake

License

License

About

Detect a set of predefined keywords in an audioclip.


Languages

Language:Jupyter Notebook 84.2%Language:Python 9.7%Language:TeX 6.2%