Ewald Enzinger (entn-at)

entn-at

User data from Github https://github.com/entn-at

Location:Portland, Oregon

Home Page:https://entn.at/

GitHub:@entn-at

Twitter:@entn_at

Ewald Enzinger's repositories

AudioDiffuser

Companion codebase for the paper "A Review on Score-based Generative Models for Audio Applications" (https://arxiv.org/abs/2506.08457)

License:MITStargazers:0Issues:0Issues:0

bournemouth-forced-aligner

Extract phoneme-level timestamps from speeh audio. MFA alternative. work in progress

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

CapSpeech

CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech

License:NOASSERTIONStargazers:0Issues:0Issues:0

chatterbox

SoTA open-source TTS

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

contexless-phonemes-CUPE

pytorch model for contexless-phoneme prediction from speech audio

License:GPL-3.0Stargazers:0Issues:0Issues:0

delayed-streams-modeling

Delayed Streams Modeling (DSM) is a flexible formulation for streaming, multimodal sequence-to-sequence learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Diffusion-Speech-Tokenizer

This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling" https://hecheng0625.github.io/assets/pdf/Arxiv_TaDiCodec.pdf

Stargazers:0Issues:0Issues:0

DiFlow-TTS

DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-to-Speech

Stargazers:0Issues:0Issues:0

EZ-VC

Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion [EMNLP 2025 Findings]

License:MITStargazers:0Issues:0Issues:0

Flamed-TTS

This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in speech synthesis.

Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

HH-Codec

[ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling

License:Apache-2.0Stargazers:0Issues:0Issues:0

hnet

H-Net: Hierarchical Network with Dynamic Chunking

License:MITStargazers:0Issues:0Issues:0

index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

InfiniteTalk

​​Unlimited-length talking video generation​​ that supports image-to-video and video-to-video generation

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

KittenTTS

State-of-the-art TTS model under 25MB 😻

License:Apache-2.0Stargazers:0Issues:0Issues:0

learnable-speech

This repo is text to speech with learnable audio encoder without alignment with transcript reference

Stargazers:0Issues:0Issues:0

Marco-Voice

A Unified Framework for Expressive Speech Synthesis with Voice Cloning

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

OpenReader-WebUI

Web EPUB and PDF text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks. Use your own Kokoro TTS API or Open AI API endpoint.

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

rwkv-tts-rs

RWKV-based Text-to-Speech implementation in Rust

Stargazers:0Issues:0Issues:0

S3Tokenizer

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

senko

Very fast speaker diarization

License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tts

Inworld TTS

License:MITStargazers:0Issues:0Issues:0

UniAudio2

The open-source code of UniAudio2.0

Stargazers:0Issues:0Issues:0

unmute

Make text LLMs listen and speak

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

zipa

A family of efficient speech models for multilingual phone recognition

Language:PythonLicense:MITStargazers:0Issues:0Issues:0