Aby Louw's repositories

APNet2

Source code of APNet2, a vocoder

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audiowmark

Audio Watermarking

Language:C++License:GPL-3.0Stargazers:0Issues:0Issues:0

ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

convnext_tts

Unofficial implementation of ConvNeXt-TTS powered by lightning and Rye

Stargazers:0Issues:0Issues:0

dectalk

Modern builds for the 90s/00s DECtalk text-to-speech application.

License:NOASSERTIONStargazers:0Issues:0Issues:0

descript-audio-vae

VAE GAN modified from Descript Audio Codec, which replaces the RVQ with VAE

License:MITStargazers:0Issues:0Issues:0

DiscreteSpeechMetrics

Reference-aware automatic speech evaluation toolkit

License:MITStargazers:0Issues:0Issues:0

flet

Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

istftnet

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

License:Apache-2.0Stargazers:0Issues:0Issues:0

LipSick

🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮

Language:PythonStargazers:0Issues:0Issues:0

Matcha-TTS

🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

License:MITStargazers:0Issues:0Issues:0

MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

License:MITStargazers:0Issues:0Issues:0

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

License:NOASSERTIONStargazers:0Issues:0Issues:0

Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction

Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)

Stargazers:0Issues:0Issues:0

onnx-simplifier

Simplify your onnx model

License:Apache-2.0Stargazers:0Issues:0Issues:0

pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

License:MITStargazers:0Issues:0Issues:0

QuickVC-VoiceConversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

License:MITStargazers:0Issues:0Issues:0

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Stargazers:0Issues:0Issues:0

RepCodec

Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization

License:NOASSERTIONStargazers:0Issues:0Issues:0

snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

UniCATS-CTX-vec2wav

Code for CTX-vec2wav in UniCATS

Language:PythonStargazers:0Issues:0Issues:0

VoiceFlow-TTS

This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language:PythonStargazers:0Issues:0Issues:0

wavmark

AI-based Audio Watermarking Tool

License:MITStargazers:0Issues:0Issues:0

X-E-Speech-code

X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion

License:MITStargazers:0Issues:0Issues:0

yaml-ui-editor

YAML UI editor application with Git repository storage

License:Apache-2.0Stargazers:0Issues:0Issues:0

ZEST

Zero-Shot Emotion Style Transfer

Stargazers:0Issues:0Issues:0

ZMM-TTS

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

License:BSD-3-ClauseStargazers:0Issues:0Issues:0