Xiaomin Tang (Charlottecuc)



Company: University of Edinburgh

Location: UK


Xiaomin Tang's repositories

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Cross-Speaker-Emotion-Transfer

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

CycleGAN-VC2

Voice Conversion by CycleGAN (voice cloning / voice conversion): CycleGAN-VC2

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

dpss-exp3-VC-PPG

Voice Conversion Experiments for the THUHCSI Course "Digital Processing of Speech Signals"

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

efficient_tts

PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture"

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

isobar

A Python library for creating and manipulating musical patterns, designed for use in algorithmic composition, generative music and sonification. Can be used to generate MIDI events, MIDI files, OSC messages, or custom events.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

malaya-speech

Speech Toolkit for Bahasa Malaysia, https://malaya-speech.readthedocs.io/

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: JavaScript | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

nnsvs

Neural network-based singing voice synthesis library for research

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Notes

Some Markdown Notes...

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

OMGD

Online Multi-Granularity Distillation for GAN Compression (ICCV2021)

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

OSM-one-shot-multispeaker

Framework for one-shot multispeaker system based on Deep Learning

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

stargan

StarGAN - Official PyTorch Implementation (CVPR 2018)

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

StarGANv2-VC

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

StreamingCNN

To train deep convolutional neural networks, the input data and the activations need to be kept in memory. Given the limited memory available in current GPUs, this limits the maximum dimensions of the input data. Here we demonstrate a method to train convolutional neural networks while holding only parts of the image in memory.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Jupyter Notebook | License: MPL-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

tuna

An audio effects library for the Web Audio API.

Language: JavaScript | Stargazers: 0 | Issues: 1 | Issues: 0

VAENAR-TTS

The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

VQMIVC

Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0