entn-at

followers

following

stars

Portland, Oregon

https://entn.at/

Ewald Enzinger's repositories

Aligners

HMM, CTC, RNN-Transducer, forward-backward algorithm

Language:Jupyter Notebook000

ARMHuBERT

PyTorch Implementation of ARMHuBERT (INTERSPEECH 2023)

Language:PythonApache-2.0000

bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

Language:PythonMIT000

bert-ns2

Language:Python000

bigvsan

Pytorch implementation of BigVSAN

MIT000

ddc_onset

Music onset detector from Dance Dance Convolution packaged as a lightweight PyTorch module

MIT000

dover-lap

Method for combining overlap-aware diarization system outputs.

Language:PythonMIT010

emospeech

Apache-2.0000

kaldi-decoder

Decoders from Kaldi using OpenFst

Apache-2.0000

kaldialign

Python wrappers for Kaldi Levenshtein's distance and alignment code.

Language:CMakeApache-2.0000

knn-vc

Voice Conversion With Just Nearest Neighbors

Language:PythonNOASSERTION000

libriheavy

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Apache-2.0000

lvc-vc

End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions

Language:PythonMIT000

Matcha-TTS

🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Language:Jupyter NotebookMIT000

MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

Language:PythonMIT000

miipher

Unofficial implementation of miipher

000

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

NOASSERTION000

multipa

Universal multilingual automatic speech transcription into IPA

Language:Python000

naturalspeech

A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)

Language:Python000

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonMIT000

NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion

Language:Jupyter Notebook000

sherpa

Streaming and non-streaming ASR server in Python

Language:C++Apache-2.0000

SoundStorm

The reproduced code for Google's SoundStorm

Language:Python000

spear-tts-pytorch

An unofficial PyTorch implementation of SPEAR-TTS.

Language:Jupyter NotebookMIT000

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Apache-2.0000

text_search

Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup

Language:Python000

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookApache-2.0000

tract

Tiny, no-nonsense, self contained, Tensorflow and ONNX inference

Language:RustNOASSERTION000

vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Language:Jupyter NotebookMIT000

whisper.tflite

Language:C++MIT000