Ewald Enzinger (entn-at)

entn-at

Geek Repo

Location:Portland, Oregon

Home Page:https://entn.at/

Twitter:@entn_at

Github PK Tool:Github PK Tool

Ewald Enzinger's repositories

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

lyra

A Very Low-Bitrate Codec for Speech Compression

Language:C++License:Apache-2.0Stargazers:1Issues:2Issues:0

AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Auto_Tuning_Zeroshot_TTS_and_VC

Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis", Interspeech 2023

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

BetaVAE_VC

Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

DPHuBERT

INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

License:MITStargazers:0Issues:0Issues:0

fad_pytorch

Frechet Audio Distance evaluation in PyTorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

fstalign

An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

oobleck

open soundstream-ish VAE codecs for downstream neural audio synthesis

License:MITStargazers:0Issues:0Issues:0

pyctcdecode

A fast and lightweight python-based CTC beam search decoder for speech recognition.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

riva-asrlib-decoder

Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva

Language:PythonStargazers:0Issues:0Issues:0

SAT

Streaming Audiotransformers for online Audio tagging

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

sequence_align

Efficient implementations of Needleman-Wunsch and other sequence alignment algorithms written in Rust with Python bindings via PyO3.

License:Apache-2.0Stargazers:0Issues:0Issues:0

SnakeGAN

Please visit https://thuhcsi.github.io/SnakeGAN/

License:CC0-1.0Stargazers:0Issues:0Issues:0

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SV_eval_protocols_for_SD

Speaker verification evaluation protocols simulating speaker diarisation

License:MITStargazers:0Issues:0Issues:0

tango

Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

TTS-Cube

End-2-end speech synthesis with recurrent neural networks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:3Issues:0
Stargazers:0Issues:0Issues:0

Waveformer

An efficient architecture for real-time target sound extraction.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

whisper-finetuning

[WIP] Scripts for fine-tuning Whisper

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

whisper-punctuator

Zero-shot Punctuation Insertion using Whisper

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

zm-text-tts

Learning to Speak from Text for Low-Resource TTS

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0