
Shaojin Ding's repositories

Adversarial-Many-to-Many-VC

[Interspeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna

Language: Python | License: NOASSERTION | 39 2 3

GroupLatentEmbedding

PyTorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Interspeech 2019]

Language: Python | License: MIT | 28 10

fac-via-ppg

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

Language: Python | License: Apache-2.0 | 2 10

Golden-Speaker-Builder

Language: HTML | 2 3 2

PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Language: Python | License: BSD-3-Clause | 2 10

bdd-data

Scripts for working with the BDD dataset

Language: Python | License: BSD-3-Clause | 010

caffe

Caffe: a fast open framework for deep learning.

Language: C++ | License: NOASSERTION | 010

DANet

Dual Attention Network for Scene Segmentation

Language: Python | License: NOASSERTION | 010

darts.pytorch1.1

Implementation of multi-GPU DARTS with the latest PyTorch (v1.1): https://arxiv.org/abs/1806.09055

Language: Python | 010

DeepSpeaker-pytorch

Speaker embedding (verification and recognition) using PyTorch

Language: Python | License: MIT | 010

Detectron.pytorch

A PyTorch implementation of Detectron. Both training from scratch and inference directly from pretrained Detectron weights are available.

Language: Python | License: MIT | 000

dragonmapper

Identification and conversion functions for Chinese text processing

Language: Python | License: MIT | 000

examples

A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.

Language: Python | License: BSD-3-Clause | 010

faster-rcnn.pytorch

A faster PyTorch implementation of Faster R-CNN

Language: Python | License: MIT | 000

Listen-Attend-and-Spell-Pytorch

Listen, Attend and Spell (LAS) implemented in PyTorch

Language: Jupyter Notebook | License: MIT | 000

merlin

This is now the official location of the Merlin project.

Language: Python | License: Apache-2.0 | 010

Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Language: Python | License: MIT | 000

Python-Wrapper-for-World-Vocoder

A Python wrapper for the high-quality vocoder "World"

Language: Python | License: MIT | 010

pytorch-vq-vae

PyTorch implementation of VQ-VAE by Aäron van den Oord et al.

Language: Jupyter Notebook | License: MIT | 000

rasta_py

RASTA-PLP and MFCC tool based on rasta-mat

Language: Python | 000

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | 010

shaojinding.github.io

000

speaker-id

This repository contains audio samples and supplementary materials accompanying publications related to the speaker-id team at Google.

Language: HTML | License: NOASSERTION | 010

Speech-Accent-Recognition

000

Speech_Recognition_with_Tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Language: Jupyter Notebook | License: MIT | 010

tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Language: Jupyter Notebook | License: BSD-3-Clause | 010

VAE-GMVAE

This repository contains implementations of the VAE and the Gaussian Mixture VAE using TensorFlow and several network architectures

Language: Python | License: Apache-2.0 | 000

VQ-VAE-Speech

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

License: MIT | 000

warp-ctc

Language: Cuda | License: Apache-2.0 | 020

wavenet_vocoder

WaveNet vocoder

Language: Python | License: NOASSERTION | 000