Alef Iury's repositories
SE-R-2022-SER-Track
Code for the winning solution in the SE&R 2022 Challenge - SER track.
multilingual_kws_pytorch
Unofficial PyTorch implementation of "Few-Shot Keyword Spotting in Any Language": a model for few-shot keyword spotting in any language, trained with the Multilingual Spoken Words Corpus.
Speech-Synthesis-Evaluation-System
A web application built with Flask for evaluating the quality of synthesized speech.
Audio-Tagging-Single-Attention-CNN
A PyTorch implementation of a Single Attention Convolutional Neural Network model for audio tagging and sound event detection.
Automatic-Gender-Classification
PyTorch implementation of deep learning models for Automatic Gender Recognition (AGR), accompanying the paper "A Comparison of Deep Learning Architectures for Automatic Gender Recognition from Audio Signals".
SE-R_2022_Challenge_Wav2vec2
Code for the paper "Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge"
audioset-download
A package that simplifies downloading the AudioSet dataset.
bees-tomato
This repository contains all the code used in the work: classification of bees
ConfidenceIntervals
Confidence interval computation for evaluation in machine learning using the bootstrapping approach
DDSP-SVC-Dynamic-Loading
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
Fast-Audioset-Download
Download AudioSet data quickly with youtube-dl, ffmpeg, and Python multiprocessing
FT-w2v2-ser
Official implementation for the paper "Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition"
ser-with-w2v2
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
voicebox-pytorch
Implementation of Voicebox, a new SOTA text-to-speech network from MetaAI, in PyTorch