xzm2004260

xzm2004's repositories

Speech-Editing-Toolkit

It's a repository for implementations of neural speech editing algorithms.

000

awesome-music-informatics

A curated list of awesome article, tutorial, library, webpage, etc.

100

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

MIT000

diffwave-sr

MIT000

Waveformer

An efficient architecture for real-time target sound extraction.

MIT000

pop2piano

Official Repo of the paper "Pop2Piano : Pop Audio-based Piano Cover Generation"

000

DualCycleGAN

Official implementation of DualCycleGAN for nonparallel audio super resolution

Apache-2.0000

onoma-to-wave_transformer

Unofficial implementations of environmental sound synthesis system with Transformer

MIT000

deepaudio-tts

000

VI-SVS

Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.

Apache-2.0000

Muskits

An opensource music processing toolkit

Apache-2.0000

Mixed_Emotions

This is the code for "Speech Synthesis with Mixed Emotions".

000

ddsp-singing-vocoders

Official implementation of SawSing (ISMIR'22)

AGPL-3.0000

Text-to-sound-Synthesis

The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"

000

study-music

A survey of books, resources and courses to study everything about music and sound in the broadest sense

000

DeepAFx-ST

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

NOASSERTION000

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models and Score-based Models, a darkhorse in the field of Generative Models

MIT000

voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Apache-2.0000

ai-audio-startups

Community list of startups working with AI in audio and music technology

Apache-2.0000

wetts

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Apache-2.0000

StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

MIT000

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

NOASSERTION000

genmusic_demo_list

a list of demo websites for automatic music generation research

000

Unet-TTS

One-shot TTS with Improved Unseen Speaker and Style Transfer

000

opentts

Open Text to Speech Server

MIT000

midi-ddsp

Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)

Apache-2.0000

LINNE

(Beta) LInear-predictive Neural Net Encoder -- A lossless audio codec

MIT000

course

高性能并行编程与优化 - 课件

000

survey

A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf

000

ai-research-code

Apache-2.0000