hcwu1993's repositories

NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

License:MITStargazers:1Issues:0Issues:0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

License:MITStargazers:0Issues:0Issues:0

awesome-knowledge-distillation

Awesome Knowledge Distillation

Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

License:Apache-2.0Stargazers:0Issues:0Issues:0

DCGAN-LSGAN-WGAN-GP-DRAGAN-Tensorflow-2

DCGAN LSGAN WGAN-GP DRAGAN Tensorflow 2

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

deepvoice3_pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

License:MITStargazers:0Issues:0Issues:0

forced-alignment-tools

A collection of links and notes on forced alignment tools

License:NOASSERTIONStargazers:0Issues:0Issues:0

GenshinAudio

All audio extracted from Genshin Impact, music, voicelines and everything else

Stargazers:0Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

hello-world

Begining of github

Stargazers:0Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

License:MITStargazers:0Issues:0Issues:0

llama2.c

Inference Llama 2 in one file of pure C

License:MITStargazers:0Issues:0Issues:0

merlin

This is now the official location of the Merlin project.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

License:MITStargazers:0Issues:0Issues:0

pandas-cookbook

Recipes for using Python's pandas library

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

parler-tts

Inference and training library for high-quality TTS models.

License:Apache-2.0Stargazers:0Issues:0Issues:0

parrot

RNN-based generative models for speech.

Language:PythonStargazers:0Issues:0Issues:0

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

License:MITStargazers:0Issues:0Issues:0

tensorflow

Computation using data flow graphs for scalable machine learning

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

TensorFlow-Examples

TensorFlow Tutorial and Examples for beginners

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

tensorflow-wavenet

A TensorFlow implementation of DeepMind's WaveNet paper

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License:MPL-2.0Stargazers:0Issues:0Issues:0

video-subtitle-extractor

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

License:Apache-2.0Stargazers:0Issues:0Issues:0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

License:MITStargazers:0Issues:0Issues:0

waveglow

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

wavenet_vocoder

WaveNet vocoder

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

wechat_jump_game

python 微信《跳一跳》辅助

Language:PythonLicense:MITStargazers:0Issues:0Issues:0