shekofteh

Shekofteh's repositories

allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

GPL-3.0100

asr_assignment

Code for the first assignment of the ASR course for 2020

100

Bachelors-Project-Allosaurus

extra files used for bachelor's project

100

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

MPL-2.0100

denoising-wavenet-small

100

E2PCast

E2PCast: An English to Persian Voice Casting Dataset

100

IIRI-Net

The code of the paper: "IIRI-Net: An interpretable convolutional front-end inspired by IIR filters for speaker identification".

100

InterpretableCNN

An extended version of SincNet in which some general auditory filter models are added for the Speaker Identification task

100

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

NOASSERTION100

MineSweeper-Matlab

Matlab Project 99

100

NF_Prj_MIMII_Dataset

A machine learning approach to machine anomaly detection on the MIMII dataset.

MIT100

PAVID-CVs

Persian Audio-Visual Database

100

SampleDataWakeWordDetection

100

SGR_AFM

The code of the paper: "Exploiting auditory filter models as interpretable convolutional frontends to obtain optimal architectures for speaker gender recognition".

100