There are 34 repositories under the singing-voice-synthesis topic.
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
A paper and project list covering cutting-edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR, etc.).
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
PyTorch Implementation of StyleSinger (AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
Dataset and code of GTSinger (NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
PyTorch Implementation of TCSinger (EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
An open-source music processing toolkit
Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
PyTorch Implementation of Multi-Singer (ACM-MM'21)
Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis (IEEE MLSP 2021)
Robust Singing Voice Transcription and MIDI Extraction
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
A universal converter for singing voice projects which is cross-platform and multi-lingual
A GUI for the Neutrino neural singing synthesizer
A guide to grapheme-to-phoneme conversion and the phoneme list for the ace singing voice synthesis engine
Multispeaker Community Vocoder Model for DiffSinger
Toolkit to convert MusicXML files into Blob Opera scores with real lyrics.
Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.19180)
Korean language support for NNSVS/ENUNU
The Original Support for English NNSVS Dataset Creation
Vocal Synthesis Through MIDI and Vocal Transformation Using RVC (KO, EN, JA, ZH)
A convenient third-party tool for editing the symbols of a DeepVocal voice database while the database is being built
🎤 Revocalize AI API: Sing like your favorite artist with our powerful AI voice synthesizer. Real-time auto-tuning 🎵, emotional range capture 🎭, and voice modulation 🎛️ – all in one place!
BEGANSing - Korean SVS + SVC + AudioSR