Beast code in Giters

Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)

Language:Python000

onnx-simplifier

Simplify your onnx model

Language:C++Apache-2.0000

pflow-encodec

Implementation of TTS model based on NVIDIA P-Flow TTS Paper

Language:Python000

pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

Language:PythonMIT000

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Language:Python000

RepCodec

Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization

Language:PythonNOASSERTION000

snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Language:PythonMIT000

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Language:PythonMIT000

TiCodec

Language:Python000

TTS-arxiv-daily

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Language:PythonApache-2.0000

UniCATS-CTX-vec2wav

Code for CTX-vec2wav in UniCATS

Language:Python000

VoiceFlow-TTS

This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language:Python000

wavenext_pytorch

Unofficial implementation of wavenext vocoder

Language:PythonMIT000

wavmark

AI-based Audio Watermarking Tool

Language:PythonMIT000

X-E-Speech-code

X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion

Language:PythonMIT000

ZEST

Zero-Shot Emotion Style Transfer

Language:Python000

ZMM-TTS

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

Language:CBSD-3-Clause000