Ewald Enzinger (entn-at)

entn-at

Geek Repo

Location:Portland, Oregon

Home Page:https://entn.at/

Twitter:@entn_at

Github PK Tool:Github PK Tool

Ewald Enzinger's repositories

Aligners

HMM, CTC, RNN-Transducer, forward-backward algorithm

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

ARMHuBERT

PyTorch Implementation of ARMHuBERT (INTERSPEECH 2023)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

bigvsan

Pytorch implementation of BigVSAN

License:MITStargazers:0Issues:0Issues:0

ddc_onset

Music onset detector from Dance Dance Convolution packaged as a lightweight PyTorch module

License:MITStargazers:0Issues:0Issues:0

dover-lap

Method for combining overlap-aware diarization system outputs.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

kaldi-decoder

Decoders from Kaldi using OpenFst

License:Apache-2.0Stargazers:0Issues:0Issues:0

kaldialign

Python wrappers for Kaldi Levenshtein's distance and alignment code.

Language:CMakeLicense:Apache-2.0Stargazers:0Issues:0Issues:0

knn-vc

Voice Conversion With Just Nearest Neighbors

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

libriheavy

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

License:Apache-2.0Stargazers:0Issues:0Issues:0

lvc-vc

End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Matcha-TTS

🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

miipher

Unofficial implementation of miipher

Stargazers:0Issues:0Issues:0

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

License:NOASSERTIONStargazers:0Issues:0Issues:0

multipa

Universal multilingual automatic speech transcription into IPA

Language:PythonStargazers:0Issues:0Issues:0

naturalspeech

A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)

Language:PythonStargazers:0Issues:0Issues:0

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

sherpa

Streaming and non-streaming ASR server in Python

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

SoundStorm

The reproduced code for Google's SoundStorm

Language:PythonStargazers:0Issues:0Issues:0

spear-tts-pytorch

An unofficial PyTorch implementation of SPEAR-TTS.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

License:Apache-2.0Stargazers:0Issues:0Issues:0

text_search

Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup

Language:PythonStargazers:0Issues:0Issues:0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tract

Tiny, no-nonsense, self contained, Tensorflow and ONNX inference

Language:RustLicense:NOASSERTIONStargazers:0Issues:0Issues:0

vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:C++License:MITStargazers:0Issues:0Issues:0