Michael Kuhlmann's repositories
phoneme_rate_conversion
Supplementary code for the publication: "Investigation into Target Speaking Rate Adaptation for Voice Conversion" (INTERSPEECH 2022).
cpython
The Python programming language
espnet
End-to-End Speech Processing Toolkit
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
lazy_dataset
lazy_dataset: Process large datasets as if it was an iterable.
micropython
MicroPython - a lean and efficient Python implementation for microcontrollers and constrained systems
nachrichtentechnik
Jupyter noteboooks for the lecture "Nachrichtentechnik" (communications engineering) with explanations in german.
paderbox
Paderbox: A collection of utilities for audio / speech processing
padertorch
A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an emphasis on speech processing.
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
TriAAN-VC
TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion
UnsupSeg
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
wavenet_vocoder
WaveNet vocoder