Songxiang Liu (liusongxiang)

liusongxiang

Geek Repo

Company:miHoYo

Location:Shenzhen, China

Home Page:http://liusongxiang.github.io

Github PK Tool:Github PK Tool

Songxiang Liu's repositories

Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

ppg-vc

PPG-Based Voice Conversion

Language:PythonLicense:Apache-2.0Stargazers:309Issues:10Issues:31

efficient_tts

Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"

Language:PythonLicense:MITStargazers:114Issues:12Issues:13

diffsvc

DiffSVC demo page

BNE-Seq2SeqMoL-VC

Demo for "Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling"

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language:PythonStargazers:2Issues:1Issues:0

bigvsan

Pytorch implementation of BigVSAN

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

WaveGrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:1Issues:2Issues:0

liusongxiang.github.io

Personal homepage:

Language:SCSSLicense:MITStargazers:0Issues:2Issues:0

aishell-3-baseline-fc

The code for aishell-3 baseline acoustic model

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

cceyda

Short profile with some stats and keywords

Stargazers:0Issues:2Issues:0

CPC_audio

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

ForwardTacotron

⏩ Generating speech in a single forward pass without any attention!

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Language:HTMLStargazers:0Issues:2Issues:0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

Parselmouth

Praat in Python, the Pythonic way

Language:C++License:GPL-3.0Stargazers:0Issues:2Issues:0

phonemizer

Simple text to phones converter for multiple languages

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

rayeren.github.io

My personal homepage

Language:SCSSLicense:MITStargazers:0Issues:2Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Language:HTMLStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:2Issues:0

VQMIVC

Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

WavAugment

A library for speech data augmentation in time-domain

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Stargazers:0Issues:2Issues:0