Shekofteh's repositories

allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

License: GPL-3.0 | Stargazers: 1 | Issues: 0

asr_assignment

Code for the first assignment of the ASR course for 2020

Stargazers: 1 | Issues: 0

Bachelors-Project-Allosaurus

Extra files used for the bachelor's project

Stargazers: 1 | Issues: 0

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

License: MPL-2.0 | Stargazers: 1 | Issues: 0

E2PCast

E2PCast: An English to Persian Voice Casting Dataset

Stargazers: 1 | Issues: 0

IIRI-Net

The code of the paper: "IIRI-Net: An interpretable convolutional front-end inspired by IIR filters for speaker identification".

Stargazers: 1 | Issues: 0

InterpretableCNN

An extended version of SincNet in which some general auditory filter models are added for the Speaker Identification task

Stargazers: 1 | Issues: 0

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

License: NOASSERTION | Stargazers: 1 | Issues: 0

MineSweeper-Matlab

Matlab Project 99

Stargazers: 1 | Issues: 0

NF_Prj_MIMII_Dataset

A machine learning approach to machine anomaly detection on the MIMII dataset.

License: MIT | Stargazers: 1 | Issues: 0

PAVID-CVs

Persian Audio-Visual Database

Stargazers: 1 | Issues: 0

SGR_AFM

The code of the paper: "Exploiting auditory filter models as interpretable convolutional frontends to obtain optimal architectures for speaker gender recognition".

Stargazers: 1 | Issues: 0

ShEMO-Modification

A modification of the ShEMO database

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0

speech-denoising-wavenet

A neural network for end-to-end speech denoising

License: MIT | Stargazers: 1 | Issues: 0

Audio-Classification

Code for YouTube series: Deep Learning for Audio Classification

License: MIT | Stargazers: 0 | Issues: 0

AudioSignalProcessingForML

Code and slides of my YouTube series called "Audio Signal Processing for Machine Learning"

License: MIT | Stargazers: 0 | Issues: 0

Classification-of-Heart-Sound-Signal-Using-Multiple-Features-

Data plus code for Classification of Heart Sound Signal Using Multiple Features

Stargazers: 0 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License: MIT | Stargazers: 0 | Issues: 0

math-tools-nyu

DS-GA 1013 Mathematical Tools for Data Science

Stargazers: 0 | Issues: 0

MOSNet

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

License: NOASSERTION | Stargazers: 0 | Issues: 0

nn-zero-to-hero

Neural Networks: Zero to Hero

License: MIT | Stargazers: 0 | Issues: 0

opensmile

The Munich Open-Source Large-Scale Multimedia Feature Extractor

License: NOASSERTION | Stargazers: 0 | Issues: 0

parstwiner

Named Entity Recognition (NER) on the Persian Twitter dataset.

License: MIT | Stargazers: 0 | Issues: 0

similarity_scoring_system

A Siamese neural network-based voice similarity scoring system

Stargazers: 0 | Issues: 0

SpeechTransProgress

Tracking the progress in end-to-end speech translation

License: CC0-1.0 | Stargazers: 0 | Issues: 0

x-vector-pytorch

Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch

Stargazers: 0 | Issues: 0