Beast code in Giters

Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)

000

onnx-simplifier

Simplify your onnx model

Apache-2.0000

pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

MIT000

QuickVC-VoiceConversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

MIT000

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

000

RepCodec

Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization

NOASSERTION000

snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Language:PythonMIT000

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

MIT000

TiCodec

000

UniCATS-CTX-vec2wav

Code for CTX-vec2wav in UniCATS

Language:Python000

VoiceFlow-TTS

This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language:Python000

wavmark

AI-based Audio Watermarking Tool

MIT000

X-E-Speech-code

X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion

MIT000

yaml-ui-editor

YAML UI editor application with Git repository storage

Apache-2.0000

ZEST

Zero-Shot Emotion Style Transfer

000

ZMM-TTS

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

BSD-3-Clause000