There are 1 repository under audio-retrieval topic.
Reading list for research topics in Sound AI
Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".
Tracking states of the arts and recent results (bibliography) on sound tasks.
Implementation of "Audio Retrieval with Natural Language Queries", INTERSPEECH 2021, PyTorch
This is the official codebase used for obtaining the results in the ICASSP 2024 paper: A SOUND APPROACH: Using Large Language Models to generate audio descriptions for egocentric text-audio retrieval
During the project for the DIGITAL SIGNAL IMAGE MANAGEMENT course I learned how to manage and process audio and image files. The aim of the project was the classification, through machine learning and deep learning models, of musical genres by extracting specific audio features from the "gtzan dataset" dataset files with which to train the models (SVM, Linear Regression, Decision tree , Random Forest, Neural Network). Mel spectograms were also extracted to train convolutional neural network models. In addition, the extracted audio features have been used to develop a model of music retrieval which given an audio track in input produces as output 5 audio tracks recommended meiante the use of cousine similarity.
code release for "AudioNet: Supervised Deep Hashing for Retrieval of Similar Audio Events"