There are 1 repository under audio-data topic.
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
A list of publically available audio data that anyone can download for ASR or other speech activities
Audio feature extraction and classification
Visualizers made entirely from DOM elements and CSS3 Animations and Transforms.
doing audio digital signal processing in tensorflow to try to recreate digital audio effects
An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP
A JS micro library for just the getSpectrum and getWaveform methods from Dancer.js, using Web Audio API.
HMS audio android sample code encapsulates APIs of the HUAWEI Audio Kit, which focuses on audio playback, audio effects and audio data.
A corpus that can be used to train English-to-Italian End-to-End Speech-to-Text Machine Translation models
Basic Implimentation of a Schroeder All-Pass Filter
This repository contains data preprocessing and analysis techniques for audio data.
Rock or rap? Machine Learning methods in Python to classify songs into genres.
Classifying daily sounds
This is a set of scripts to collect audio data from field sites and transfer them to host periodically.
Applications of machine learning methods in Python to classify songs into genres.
A Collection of Data Science and Machine Learning Projects Utilizing Scikit-Learn, TensorFlow, and R for Predictive Modeling, Time Series Analysis, and Statistical Methods.
Hindi(India) Spontaneous Dialogue Smartphone speech dataset
A machine learning project using Convolutional Neural Networks trained through various data augmentation and feature extraction techniques to detect emotions of a given audio file with 85% accuracy.
Course project for APS360 (Artificial Intelligence Fundamentals) at U of T
Different Task Guides for Audio Data
Remote Asynchronous Peripheral Access
Using Hidden Markov modelling to establish emotional states in survey research
CLARK (Comprehensive Live Audio Rendering Kit) is an audio visualizer capable of visualizing the audio playing on your computer in real time.
Converting text to audio and applying audio augmentation