yingfenging

followers

following

stars

yingfenging's repositories

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

000

cantonese-books-data

粵音資料集叢：典籍資料

000

CharsiuG2P

Multilingual G2P in over 100 languages

Language:Jupyter NotebookMIT000

chinese_speech_pretrain

chinese speech pretrained models

Language:Shell000

ChineseBert

Language:PythonMIT000

FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

Language:Python000

FastGithub

github加速神器，解决github打不开、用户头像无法加载、releases无法上传下载、git-clone、git-pull、git-push失败等问题

Language:C#MIT000

g2pW

Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音

Language:PythonApache-2.0000

GraphemeBERT

This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models

MIT000

LPCNet

Efficient neural speech synthesis

Language:CBSD-3-Clause000

Meta-TTS

Official repository of https://arxiv.org/abs/2111.04040v1

Language:Python000

NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

Language:PythonMIT000

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language:C++NOASSERTION000

NeMo

NeMo: a toolkit for conversational AI

Language:Jupyter NotebookApache-2.0000

NeuralSpeech

Language:Python000

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

MIT000

paper2gui

Convert AI papers to GUI，Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术

Language:Jupyter NotebookMIT000

Parselmouth

Praat in Python, the Pythonic way

Language:C++GPL-3.0000

PitchExtractor

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Language:PythonMIT000

pycantonese

Cantonese Linguistics and NLP in Python

MIT000

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

MIT000

so-vits-svc

基于vits与softvc的歌声音色转换模型

Language:PythonMIT000

STYLER

STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

MIT000

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Language:PythonMIT000

TTS-Objective-Metrics

Objective metrics used in several text-to-speech (TTS) papers.

Language:PythonGPL-3.0000

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

MIT000

vits_chinese

vits chinese, tts chinese, tts mandarin 史上训练最简单，音质最好的语音合成系统，兼容性非常好的合成框架

Language:Python000

voicefixer_main

General Speech Restoration

000

VQMIVC

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!

Language:Jupyter NotebookMIT000

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

Language:Python000