Charlottecuc

Xiaomin Tang's repositories

cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

Language: Python | License: MIT | 1 | 0 | 0

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Language: Python | License: MIT | 0 | 0 | 0

CycleGAN-VC2

Voice Conversion by CycleGAN (voice cloning/voice conversion): CycleGAN-VC2

Language: Python | License: MIT | 0 | 1 | 0

dpss-exp3-VC-PPG

Voice conversion experiments for the THUHCSI course "Digital Processing of Speech Signals"

Language: Python | 0 | 1 | 0

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Language: Python | License: NOASSERTION | 0 | 0 | 0

efficient_tts

PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"

Language: Python | 0 | 0 | 0

isobar

A Python library for creating and manipulating musical patterns, designed for use in algorithmic composition, generative music, and sonification. Can be used to generate MIDI events, MIDI files, OSC messages, or custom events.

Language: Python | License: MIT | 0 | 0 | 0

malaya-speech

Speech toolkit for Bahasa Malaysia, https://malaya-speech.readthedocs.io/

Language: Jupyter Notebook | License: MIT | 0 | 0 | 0

Meta-TTS

Language: Python | 0 | 0 | 0

MockingBird

🚀 AI voice imitation: clone your voice within 5 seconds and generate arbitrary speech content in real time

Language: JavaScript | License: NOASSERTION | 0 | 1 | 0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language: Python | License: MIT | 0 | 0 | 0

nnsvs

Neural network-based singing voice synthesis library for research

Language: Python | License: MIT | 0 | 1 | 0

Notes

Some Markdown Notes...

Language: Jupyter Notebook | License: GPL-3.0 | 0 | 1 | 0

OMGD

Online Multi-Granularity Distillation for GAN Compression (ICCV2021)

Language: Python | 0 | 0 | 0

OSM-one-shot-multispeaker

Framework for one-shot multispeaker system based on Deep Learning

Language: Python | License: MIT | 0 | 0 | 0

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Language: Python | License: MIT | 0 | 1 | 0

project-NN-Pytorch-scripts

Language: Jupyter Notebook | License: BSD-3-Clause | 0 | 1 | 0

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Language: Python | 0 | 1 | 0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | 0 | 0 | 0

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Language: Python | License: MIT | 0 | 0 | 0

stargan

StarGAN - Official PyTorch Implementation (CVPR 2018)

Language: Python | License: MIT | 0 | 0 | 0

StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Language: Python | License: MIT | 0 | 0 | 0

StreamingCNN

Training deep convolutional neural networks requires keeping the input data and the activations in memory. Because current GPUs have limited memory, this caps the maximum dimensions of the input data. Here we demonstrate a method to train convolutional neural networks while holding only parts of the image in memory.

Language: Jupyter Notebook | License: MIT | 0 | 1 | 0

ai-research-code

cargan

Cross-Speaker-Emotion-Transfer

CycleGAN-VC2

dpss-exp3-VC-PPG

editts

efficient_tts

isobar

malaya-speech

Meta-TTS

MockingBird

Montreal-Forced-Aligner

nnsvs

Notes

OMGD

OSM-one-shot-multispeaker

Parallel-Tacotron2

project-NN-Pytorch-scripts

pytorch-kaldi

Real-Time-Voice-Cloning

reinforcement-learning-an-introduction

stargan

StarGANv2-VC

StreamingCNN

TTS

tuna

VAENAR-TTS

vits

VQMIVC

wenet