There are 5 repositories under audio-embedding topic.
Pytorch port of Google Research's VGGish model used for extracting audio features.
Audio search using Azure Cognitive Search
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
Extract audio embeddings from an audio file using Python
Directly from voice, recognise speaker emotion, intensity, & sentiment in speaker utterances.
Generate audio embedding out of pruned L3
Visualizations of music semantics calculus using Spotify and deep embeddings.
Audio Deep Learning Project in Java
Re-Implementation of Google Research's VGGish model used for extracting audio features using Pytorch with GPU support.
Audio Embeddings using VGGish
Generate realistic, synthetic call center conversations