Aby Louw's repositories

APNet2

Source code of APNet2, a vocoder

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

audioseal

Localized watermarking for AI-generated speech audio, with state-of-the-art robustness and a very fast detector

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

audiowmark

Audio Watermarking

Language: C++ | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ConsistencyVC-voive-conversion

Uses a jointly trained speaker encoder with consistency loss to achieve cross-lingual and expressive voice conversion

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

DeepPhonemizer

Grapheme to phoneme conversion with deep learning.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

dhasa2023_styleguide

Style guide for the Digital Humanities Association of Southern Africa (DHASA) fourth conference, 2023.

License: CC0-1.0 | Stargazers: 0 | Issues: 1 | Issues: 0

dectalk

Modern builds for the 90s/00s DECtalk text-to-speech application.

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

descript-audio-vae

VAE-GAN modified from Descript Audio Codec, replacing the RVQ with a VAE

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

DiscreteSpeechMetrics

Reference-aware automatic speech evaluation toolkit

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

flet

Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

hubert

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

istftnet

iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

licensecc

Software licensing, copy protection in C++. It has few dependencies and it's cross-platform.

License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

Matcha-TTS

🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

MockingBird

🚀 AI voice cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

onnx-simplifier

Simplify your ONNX model

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

phonepiece

phone inventory library

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-fid

Compute FID scores with PyTorch.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

QuickVC-VoiceConversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Stargazers: 0 | Issues: 0 | Issues: 0

Sunsynk-Home-Assistant-Power-Flow-Card

A simple card that emulates the Sunsynk power flow shown on the inverter

Language: JavaScript | Stargazers: 0 | Issues: 0 | Issues: 0

UniCATS-CTX-vec2wav

Code for CTX-vec2wav in UniCATS

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

VoiceFlow-TTS

This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

wavmark

AI-based Audio Watermarking Tool

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

yaml-ui-editor

YAML UI editor application with Git repository storage

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ZEST

Zero-Shot Emotion Style Transfer

Stargazers: 0 | Issues: 0 | Issues: 0