WhiteFu's repositories

audio-dataset

Audio Dataset for training CLAP and other models

Language:PythonStargazers:0Issues:0Issues:0

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audio-preprocessing-scripts

数据集制作-从录播到伴奏分离到切片脚本

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

chinese-dialect-lexicons

Grapheme-to-Phoneme lexicons for Chinese dialects

Stargazers:0Issues:0Issues:0

control-vc

This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

espnet_onnx

Onnx wrapper for espnet infrernce model

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

fluenttts

FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS

Language:PythonStargazers:0Issues:0Issues:0

FreeVC

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

g2pE_mobile

g2p for english tts

Language:PythonStargazers:0Issues:0Issues:0

GenerSpeech

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

hello-algo

《Hello 算法》一本动画图解、能运行、可提问的数据结构与算法入门书

Language:JavaLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

IMS-Toucan

IMS-Toucan is a toolkit to train state-of-the-art Speech Synthesis models. Everything is pure Python and PyTorch based to keep it as simple and beginner-friendly, yet powerful as possible.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

larynx2

A fast, local neural text to speech system

Language:C++License:MITStargazers:0Issues:0Issues:0

musika

Fast Infinite Waveform Music Generation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SiFiGAN

Official implementation of the source-filter HiFiGAN vocoder

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

so-vits-svc-toolkit

A toolkit and documentation version of so-vits-svc.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

StyleTTS

Official Implementation of StyleTTS

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

T2A

Project page for "T2A: Robust Text-to-Animation" for ICASSP2023

Language:PythonStargazers:0Issues:0Issues:0

torch-nansypp

NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E, WIP

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

vlabeler

Open source voice labeling application

Language:KotlinLicense:Apache-2.0Stargazers:0Issues:0Issues:0

zac2022-lyric-alignment

Solution for Zalo AI Challenge 2022 - Lyrics Alignment

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0