MMMMichaelzhang

MMMMichaelzhang's repositories

gulaerchen.github.io

Language: JavaScript

StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Language: Python, License: MIT

assem-vc

Official Code for Assem-VC @ICASSP2022

Language: Jupyter Notebook, License: BSD-3-Clause

asteroid

The PyTorch-based audio source separation toolkit for researchers || Pretrained models available

Language: Python, License: MIT

AudioMass

Free full-featured web-based audio & waveform editing tool

Language: JavaScript

AuxiliaryASR

Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

Language: Python, License: MIT

bark

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook, License: MIT

CMGAN

Conformer-based Metric GAN for speech enhancement

Language: Python, License: MIT

Cross-Lingual-Voice-Cloning

Tacotron 2 - PyTorch implementation with faster-than-realtime inference, modified to enable cross-lingual voice cloning.

Language: Jupyter Notebook, License: BSD-3-Clause

DeepFilterNet

Noise suppression using deep filtering

License: NOASSERTION

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

License: MIT

FACIAL

FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.

License: AGPL-3.0

fastVC

A simple voice conversion tool

FullSubNet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

License: Apache-2.0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

License: MIT

matchering

🎚️ Open Source Audio Matching and Mastering

License: GPL-3.0

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

License: BSD-3-Clause

mir-svc

Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach

License: NOASSERTION

MockingBird

🚀 AI voice cloning: clone your voice within 5 seconds and generate arbitrary speech content in real time

License: NOASSERTION

NeuralSVB

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

Language: Python, License: GPL-3.0

nonparaSeq2seqVC_code

Implementation code of non-parallel sequence-to-sequence VC

License: MIT

NVC-Net

License: Apache-2.0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch

License: MIT

PitchExtractor

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Language: Python, License: MIT

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

License: MIT

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

License: NOASSERTION

ssr_eval

Evaluation and Benchmarking of Speech Super-resolution Methods

StyleTTS

Official Implementation of StyleTTS

License: MIT

svoice

We provide a PyTorch implementation of the paper "Voice Separation with an Unknown Number of Multiple Speakers", in which we present a new method for separating a mixed audio sequence in which multiple voices speak simultaneously. The method employs gated neural networks that are trained to separate the voices at multiple processing steps while keeping the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is used to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.

License: NOASSERTION

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

License: MIT