xzm2004's repositories
Speech-Editing-Toolkit
It's a repository for implementations of neural speech editing algorithms.
awesome-music-informatics
A curated list of awesome article, tutorial, library, webpage, etc.
muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
Waveformer
An efficient architecture for real-time target sound extraction.
pop2piano
Official Repo of the paper "Pop2Piano : Pop Audio-based Piano Cover Generation"
DualCycleGAN
Official implementation of DualCycleGAN for nonparallel audio super resolution
onoma-to-wave_transformer
Unofficial implementations of environmental sound synthesis system with Transformer
VI-SVS
Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.
Muskits
An opensource music processing toolkit
Mixed_Emotions
This is the code for "Speech Synthesis with Mixed Emotions".
ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
Text-to-sound-Synthesis
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
study-music
A survey of books, resources and courses to study everything about music and sound in the broadest sense
DeepAFx-ST
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models and Score-based Models, a darkhorse in the field of Generative Models
voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
ai-audio-startups
Community list of startups working with AI in audio and music technology
wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
genmusic_demo_list
a list of demo websites for automatic music generation research
Unet-TTS
One-shot TTS with Improved Unseen Speaker and Style Transfer
opentts
Open Text to Speech Server
midi-ddsp
Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)
LINNE
(Beta) LInear-predictive Neural Net Encoder -- A lossless audio codec
course
高性能并行编程与优化 - 课件
survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf