Songxiang Liu's repositories
Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
efficient_tts
Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"
BNE-Seq2SeqMoL-VC
Demo for "Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling"
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
liusongxiang.github.io
Personal homepage:
aishell-3-baseline-fc
The code for aishell-3 baseline acoustic model
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
ForwardTacotron
⏩ Generating speech in a single forward pass without any attention!
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Parselmouth
Praat in Python, the Pythonic way
phonemizer
Simple text to phones converter for multiple languages
rayeren.github.io
My personal homepage
WavAugment
A library for speech data augmentation in time-domain