Shekofteh's repositories
itsp
Introduction to Speech Processing
E2PCast-Final
A Dataset for English to Persian Voice Casting
Bachelors-Project-Allosaurus
Extra files used for the bachelor's project
Audio-Classification
Code for YouTube series: Deep Learning for Audio Classification
InterpretableCNN
An extended version of SincNet in which some general auditory filter models are added for the Speaker Identification task
nn-zero-to-hero
Neural Networks: Zero to Hero
ShEMO-Modification
A modification of the ShEMO database
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
SpeechTransProgress
Tracking the progress in end-to-end speech translation
MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
parstwiner
Named Entity Recognition (NER) on the Persian Twitter dataset.
Classification-of-Heart-Sound-Signal-Using-Multiple-Features-
Data plus code for Classification of Heart Sound Signal Using Multiple Features
math-tools-nyu
DS-GA 1013 Mathematical Tools for Data Science
allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
asr_assignment
Code for the first assignment of the ASR course for 2020
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
MineSweeper-Matlab
Matlab Project 99
AudioSignalProcessingForML
Code and slides of my YouTube series called "Audio Signal Processing for Machine Learning"
opensmile
The Munich Open-Source Large-Scale Multimedia Feature Extractor
x-vector-pytorch
Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch
NF_Prj_MIMII_Dataset
A machine learning approach to machine anomaly detection on the MIMII dataset.