Repositories under the mfcc-features topic:
Voice Activity Detection based on Deep Learning & TensorFlow
Audio feature extraction and classification
Repository for CIKM 2020 resource track paper: MAEC: A Multimodal Aligned Earnings Conference Call Dataset for Financial Risk Prediction
Using a Raspberry Pi, we listen to the coffee machine and count how many coffees are made
Tiny Machine Learning Snoring Detection Model for Embedded devices
A RESTful API implementation of an authentication system using voice fingerprints
Multi-class audio classification with MFCC features using CNN
stm32-speech-recognition-and-traduction is a project developed for the Advances in Operating Systems exam at the University of Milan (academic year 2020-2021). It implements a speech recognition and speech-to-text translation system using a pre-trained machine learning model running on the stm32f407vg microcontroller.
MFCC features + SVM for speech emotion classification
A Python implementation of STFT and MFCC audio features from scratch
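As a rough illustration of what such a from-scratch implementation involves (this is a minimal NumPy sketch with assumed default parameters, not the repository's actual code), the standard MFCC pipeline is: frame and window the signal, take the FFT magnitude, pool the power spectrum through a triangular mel filterbank, take the log, and decorrelate with a DCT-II:

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def stft_mag(signal, frame_len=400, hop=160, n_fft=512):
    # Frame the signal, apply a Hann window, take the real-FFT magnitude.
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, n_fft))

def mel_filterbank(n_filters=26, n_fft=512, sr=16000):
    # Triangular filters spaced evenly on the mel scale.
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        lo, c, hi = bins[i - 1], bins[i], bins[i + 1]
        fb[i - 1, lo:c] = (np.arange(lo, c) - lo) / max(c - lo, 1)
        fb[i - 1, c:hi] = (hi - np.arange(c, hi)) / max(hi - c, 1)
    return fb

def mfcc(signal, sr=16000, n_coeffs=13):
    power = stft_mag(signal) ** 2
    log_mel = np.log(power @ mel_filterbank(sr=sr).T + 1e-10)
    # DCT-II over the filter axis decorrelates the log-mel energies.
    n_filters = log_mel.shape[1]
    n = np.arange(n_filters)
    basis = np.cos(np.pi * (n[None, :] + 0.5)
                   * np.arange(n_coeffs)[:, None] / n_filters)
    return log_mel @ basis.T  # shape: (n_frames, n_coeffs)
```

For one second of 16 kHz audio with these frame settings, this yields a (98, 13) coefficient matrix; real implementations typically add pre-emphasis and liftering on top.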
A corpus that can be used to train English-to-Italian End-to-End Speech-to-Text Machine Translation models
Java Implementation of the Sonopy Audio Feature Extraction Library by MycroftAI
Voice Activity Detector based on MFCC features and DNN model
An automatic speaker recognition system built from digital signal processing tools, Vector Quantization and LBG algorithm
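The LBG (Linde-Buzo-Gray) algorithm mentioned above builds a speaker's codebook by repeatedly splitting centroids and refining them with k-means-style updates. A minimal sketch, assuming the codebook size is a power of two (parameter names are illustrative, not the repository's API):

```python
import numpy as np

def lbg_codebook(features, n_codewords=8, eps=0.01, n_iter=20):
    # Linde-Buzo-Gray vector quantization: start from the global mean,
    # double the codebook by perturbed splitting, then refine.
    # Assumes n_codewords is a power of two.
    codebook = features.mean(axis=0, keepdims=True)
    while len(codebook) < n_codewords:
        codebook = np.vstack([codebook * (1 + eps), codebook * (1 - eps)])
        for _ in range(n_iter):
            # Assign each feature vector to its nearest codeword.
            dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :],
                                   axis=2)
            labels = dists.argmin(axis=1)
            for k in range(len(codebook)):
                members = features[labels == k]
                if len(members):
                    codebook[k] = members.mean(axis=0)
    return codebook
```

For speaker recognition, one codebook is trained per enrolled speaker on their MFCC vectors; a test utterance is attributed to the speaker whose codebook gives the lowest average quantization distortion.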
Deep learning-based audio spoofing attack detection experiments for speaker verification.
Audio classification using a simple SVM classifier making use of MFCC and Spectrogram features coded from scratch
A repository for the USTH Digital Signal Processing 2020 Group 3 project. The title says it all.
In this challenge, the goal is to learn to recognize which of several English words is pronounced in an audio recording. This is a multiclass classification task.
Signal Processing Course project
Audio command recognition by DTW and classification
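The core of DTW-based command recognition is a dynamic-programming alignment cost between two feature sequences of different lengths; a test utterance is labeled with the reference template that yields the lowest cost. A minimal sketch (the function name and signature are illustrative, not this repository's code):

```python
import numpy as np

def dtw_distance(a, b):
    # a, b: (n_frames, n_features) feature sequences, e.g. MFCCs.
    # Classic O(n*m) dynamic program over the frame-to-frame cost grid.
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])
            # Best of: insertion, deletion, or diagonal match.
            cost[i, j] = d + min(cost[i - 1, j],
                                 cost[i, j - 1],
                                 cost[i - 1, j - 1])
    return cost[n, m]
```

Because DTW allows one frame to align with several frames of the other sequence, two utterances of the same command spoken at different speeds can still match with near-zero cost.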
Classify and recognize emotions through voice signal in a foreign language
A project for classifying COVID and non-COVID patients from cough sounds, using a CRNN-Attention model with the audio converted into image data
Classification of urban sounds such as air conditioner, jackhammer, drilling, siren, street music, engine idling, and children playing, using Mel-frequency cepstral coefficients (MFCCs) as audio features and a CNN.
Classify music into two categories, progressive rock and non-progressive rock, using MFCC features, an MLP, and a CNN.
This project was my final Bachelor's degree thesis, in which I combined my passion, music, with the subject I liked most in my degree: deep learning.
Implementation of Mel-Frequency Cepstral Coefficients (MFCC) extraction
RespireNet is an innovative web-based application that harnesses the capabilities of deep learning and Mel-frequency cepstral coefficients (MFCC) as a feature extraction technique for accurate respiratory disease prediction. The primary objective of this user-friendly web application is to facilitate early detection.
Implementation of Persian Isolated-Digits Recognition with Matlab
Bali has a diversity of arts recognized worldwide; one of the most famous is the Karawitan art, especially the Kendang Tunggal instrument. Notation documentation, more commonly known as music transcription, makes learning a song easier, and in this research it makes it easier to learn to play the Kendang Tunggal. The first step in documenting a kendang tunggal song is onset detection: an onset occurs when the signal enters its attack period, which helps segment the sound colors (timbres) of the drum. The segmented timbres are classified with the Backpropagation algorithm, using several frequency-domain and time-domain features as timbre characteristics. The kendang tunggal song is then resynthesized with a Mel Log Spectrum Approximation filter. The research found the optimal parameters for onset-based segmentation to be a hop size of 110 with normalization of the onset detection function's features. The best Backpropagation architecture (learning rate 0.9, 10 neurons, 2000 epochs) reached an accuracy of 60.85%, and the Mel Log Spectrum Approximation synthesis produced sounds similar to kendang songs with an accuracy of 83.33%.
⚙ Development of an emotion recognition model ⚙
Development of a Voice Activity Detector and a Speaker Recognition System. Feature extraction in the time and frequency domains; classification among ten individual speakers.
Recognizing spoken Bangla numbers using MFCCs and a CNN.
👉 This repository contains basic audio 🔊 processing code with feature extraction explained. 🎶 🎶 🎶