mamezy's starred repositories
LivePortrait
Bring portraits to life!
ChromaTone
Color to Sound
audio_embeddings
Audio search using Azure Cognitive Search
MusiCNN-embeddings
This project evaluates music similarity and builds a genre classifier using song embeddings extracted from the GTZAN dataset with Essentia’s MSD-MusiCNN model.
spotifytrack
A personal homepage showing users' top songs and artists, providing a shareable link that they can use to show it off to friends.
vocalsound
Dataset and baseline code for the VocalSound dataset (ICASSP2022).
fingerprintjs
Browser fingerprinting library. Accuracy of this version is 40-60%; accuracy of the commercial Fingerprint Identification is 99.5%. V4 of this library is BSL licensed.
coversong_identification
Cover song identification using 2DFT sequences
deep_hashing_coverSongDetection
Cover Song Detection System
CoverHunter
Official PyTorch implementation of CoverHunter
song-recognition
Full-stack song recognition application with audio fingerprinting and hum to search (QbSH) modules
cross_version_learning
Code for the paper "A Cross-Version Approach to Audio Representation Learning for Orchestral Music", ISMIR 2023
Hum-to-song
An API that takes vocal input, identifies the most probable song match, and returns a list of matching songs. Helps users identify an unknown song title from just a hummed or sung melody.
inaSpeechSegmenter
CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
NeuralNote
Audio Plugin for Audio to MIDI transcription using deep learning.