yingfenging's repositories

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Stargazers: 0 | Issues: 0

cantonese-books-data

Cantonese pronunciation dataset collection: classical text data

Stargazers: 0 | Issues: 0

CharsiuG2P

Multilingual G2P in over 100 languages

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

chinese_speech_pretrain

Chinese speech pretrained models

Language: Shell | Stargazers: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0

FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

Language: Python | Stargazers: 0 | Issues: 0

FastGithub

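GitHub accelerator that fixes problems such as GitHub failing to open, user avatars not loading, releases failing to upload or download, and git-clone, git-pull, and git-push failures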

Language: C# | License: MIT | Stargazers: 0 | Issues: 0

g2pW

Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin (Bopomofo) or Pinyin

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

GraphemeBERT

This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models"

License: MIT | Stargazers: 0 | Issues: 0

LPCNet

Efficient neural speech synthesis

Language: C | License: BSD-3-Clause | Stargazers: 0 | Issues: 0

Meta-TTS

Official repository of https://arxiv.org/abs/2111.04040v1

Language: Python | Stargazers: 0 | Issues: 0

NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0

NeMo

NeMo: a toolkit for conversational AI

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

License: MIT | Stargazers: 0 | Issues: 0

paper2gui

Convert AI papers to GUIs, making it easy and convenient for everyone to use cutting-edge artificial intelligence technology

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

Parselmouth

Praat in Python, the Pythonic way

Language: C++ | License: GPL-3.0 | Stargazers: 0 | Issues: 0

PitchExtractor

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

pycantonese

Cantonese Linguistics and NLP in Python

License: MIT | Stargazers: 0 | Issues: 0

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

License: MIT | Stargazers: 0 | Issues: 0

so-vits-svc

Singing voice timbre conversion model based on VITS and SoftVC

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

STYLER

STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

License: MIT | Stargazers: 0 | Issues: 0

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

TTS-Objective-Metrics

Objective metrics used in several text-to-speech (TTS) papers.

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

License: MIT | Stargazers: 0 | Issues: 0

vits_chinese

vits chinese, tts chinese, tts mandarin: the easiest-to-train, best-sounding speech synthesis system to date, and a highly compatible synthesis framework

Language: Python | Stargazers: 0 | Issues: 0

voicefixer_main

General Speech Restoration

Stargazers: 0 | Issues: 0

VQMIVC

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

Wav2Lip

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

Language: Python | Stargazers: 0 | Issues: 0