ahmeftah's repositories
TensorFlowTTS
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
-Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-Models
Evaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models
ALGAN-VC-Generated-Audio-Samples
Audio samples generated by the ALGAN-VC model are available in the folder
audio_course_project
Project done as part of the Audio Processing course at Tampere University. The topic was the separation of harmonic and percussive elements, following the paper "Separation of a Monaural Audio Signal into Harmonic/Percussive Components by Complementary Diffusion on Spectrogram" by Nobutaka Ono, Kenichi Miyamoto, Jonathan Le Roux, Hirokazu Kameoka, and Shigeki Sagayama.
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Coursera-Deep-Learning
My notes and work on deep learning from Coursera
DeepLearningExamples
Deep Learning Examples
DYGANVC
Demo page: https://MingjieChen.github.io/dygan-vc
ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
emo-stargan
Implementation of Emo-StarGAN
Emovox
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
espnet
End-to-End Speech Processing Toolkit
evotorch
EvoTorch is an advanced evolutionary computation library built directly on top of PyTorch, created at NNAISENSE.
examples
TensorFlow examples
FastSpeech2-1
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
hifi-gan-bwe
Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.
keras-io
Keras documentation, hosted live at keras.io
MelGAN-VC
MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms
MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
pyACA
Python scripts accompanying the book "An Introduction to Audio Content Analysis" (www.AudioContentAnalysis.org)
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
spectrogram-inversion
Spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io
tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
wavenet_autoencoders
WaveNet autoencoders for the ZeroSpeech Challenge 2020