speech-classification

There are 1 repository under speech-classification topic.

YuanGongND / ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
pytorch audio-classification deep-learning audio representation-learning keyword-spotting speech-commands speech-classification
Language:Jupyter Notebook 1077
YuanGongND / ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
audio audio-processing audio-classification speech-classification self-supervised-learning
Language:Python 358
m3hrdadfi / soxan
Wav2Vec for speech recognition, classification, and audio classification
speech-emotion-recognition emotion-recognition automatic-speech-recognition speech-recognition speech-classification
Language:Jupyter Notebook 235
kaistmm / Audio-Mamba-AuM
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
audio audio-classification deep-learning mamba pytorch representation-learning speaker-identification speech-classification state-space-model audio-mamba
Language:Python 66
felixchenfy / Speech-Commands-Classification-by-LSTM-PyTorch
Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augmentation.
machine-learning speech-recognition speech-classification lstm audio-processing
Language:Jupyter Notebook 36
HoseinAzad / Transformer-based-SER
Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch
emotion-recognition speech-classification speech-emotion-classification speech-emotion-recognition transformer-pytorch speech-python speech-classification-python
Language:Python 29
anik8gupta / Toxic_Speech_Classification
It is a full-fetched web application.Based on sentiment classification, by using nltk library it predicts that a speech is how much toxic, sever toxic, insult, obscene, threat.
sentiment-analysis machine-learning machine-learning-projects nltk speech-classification
Language:Python 15
Sreyan88 / Toxicity-Detection-in-Spoken-Utterances
This repository contains the code for the paper: "DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utterances"
speech speech-classification toxicity-classification wav2vec2
Language:Jupyter Notebook 10
Jason-Oleana / speech-classification
In this challenge, the goal is to learn to recognize which of several English words is pronounced in an audio recording. This is a multiclass classification task.
speech-classification convolutional-neural-network mfcc-features
Language:Jupyter Notebook 8
deep-spin / speech-continuous-attention
Speech Classification using Continuous Attention Mechanisms
speech-classification continuous-attention continuous-softmax continuous-sparsemax
Language:Python 3
EmanuelAlogna / Gender-Classification-using-ML
Gender Classification with different Machine Learning models, using the LibriSpeech ASR dataset.
speech-recognition speech-classification machine-learning machine-learning-algorithms logistic-regression naive-bayes perceptron svm mlp deep-learning convolutional-neural-networks librispeech librispeech-dataset k-nearest-neighbors
Language:Jupyter Notebook 3
Mubarekethio / Voice-Recognition-Qafaraf-and-Amharic
Qafar-af and Amharic voice Command Recognition project to control the movement of wheelchair
amharic-words keyword-spotting kws voice-commands voice-control voice-recognition qafaraf-voice afar-language amharic audio-classification speech-classification speech-recognition afaraf qafaraf
Language:Jupyter Notebook 2
Chris-Winnard / Speech-Gender-Classifier
A convolutional neural network for gender classification, which achieved an F1-score of 94.3% when tested on the RAVDESS dataset. Created as postgraduate coursework, the report is included. The report also discusses Sodiq Adebiy's CNN, which I'd recommend looking at to anyone interested in emotion classification.
audio-analysis convolutional-neural-networks deep-learning deep-neural-networks gender-classification gender-recognition machine-learning speech-classification
Language:Jupyter Notebook 1
sarthak268 / Multimedia-Computing-and-Applications
This repository contains code for all assignments in the Multimedia Computing and Applications (CSE563) course.
multimedia multimedia-computing speech-classification text-representation text-retrieval
Language:Python 1
KrajShuffle / Classifying_SpeechAudio_CNN
CNN Based Approach for Audio File Classification. Contains Notebooks Illustrating Data Preprocessing, Feature Extraction, Model Training, & Model Inference Workflows & Overall Pipeline
convolutional-neural-networks data-preprocessing feature-engineering feature-extraction model-inference model-training-and-evaluation speech-classification metrics-visualization
Language:Jupyter Notebook 0
MilanaShhanukova / uni-research-dementia-detection
This project represents my research on dementia classification using audio data.
attention-mechanism deep-learning dementia-detection speech-classification
Language:Jupyter Notebook 0
ryanquinnnelson / CMU-11685-Utterance-to-Phoneme-Mapping
Fall 2021 Introduction to Deep Learning - Homework 3 Part 2 (RNN-based phoneme recognition)
torch rnn ctc-loss melspectrogram lstm cnn ctcdecode speech-classification octopus
Language:Python 0
Amir-Hofo / Speech-commands-Classification
In this notebook, we aim to recognize speech commands using classification. For this purpose, we used the SPEECHCOMMANDS dataset and the deep convolutional model M5. The code is written in Python and designed for the PyTorch platform.
ai artificial-intelligence audio-classification cnn convolutional-neural-networks deep-learning machine-learning pytorch speech-classification speech-recognition speech-to-text torchaudio
Language:Jupyter Notebook
manashpratim / Frame-Level-Classification-of-Speech
speech-classification deep-learning python google-colaboratory google-colab jupyter-notebook pytorch mlp-classifier
Language:Jupyter Notebook
vishaal27 / IFN-Python
A Python implementation of the Iterative Feature Normalization algorithm
speech-classification feature-normalization feature-extraction machine-learning
Language:Jupyter Notebook

speech-classification

YuanGongND / ast

YuanGongND / ssast

m3hrdadfi / soxan

kaistmm / Audio-Mamba-AuM

felixchenfy / Speech-Commands-Classification-by-LSTM-PyTorch

HoseinAzad / Transformer-based-SER

anik8gupta / Toxic_Speech_Classification

Sreyan88 / Toxicity-Detection-in-Spoken-Utterances

Jason-Oleana / speech-classification

deep-spin / speech-continuous-attention

EmanuelAlogna / Gender-Classification-using-ML

Mubarekethio / Voice-Recognition-Qafaraf-and-Amharic

Chris-Winnard / Speech-Gender-Classifier

sarthak268 / Multimedia-Computing-and-Applications

KrajShuffle / Classifying_SpeechAudio_CNN

MilanaShhanukova / uni-research-dementia-detection

ryanquinnnelson / CMU-11685-Utterance-to-Phoneme-Mapping

Amir-Hofo / Speech-commands-Classification

manashpratim / Frame-Level-Classification-of-Speech

vishaal27 / IFN-Python