zyser's repositories
spear-tts-pytorch
An unofficial PyTorch implementation of SPEAR-TTS.
alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
Bert-VITS2
vits2 backbone with bert
agent-attention-pytorch
Implementation of Agent Attention in Pytorch
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
BigVGAN-NVIDIA
Official implementation of BigVGAN in PyTorch
diffsptk
A differential version of SPTK
e2-tts-pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
fairseq_meta_fork
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
gmm-torch
Gaussian mixture models in PyTorch.
golf
A DDSP-based neural vocoder.
HierSpeechpp_zero_shot_vc
The official implementation of HierSpeech++
local-attention
An implementation of local windowed attention for language modeling
metavoice-src
AI for human-level speech intelligence
phonemizer
Simple text to phones converter for multiple languages
ring-attention-pytorch
Explorations into Ring Attention, from Liu et al. at Berkeley AI
SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
soundata
Python library for downloading, loading & working with sound datasets
SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
supervoice-vall-e-2
VALL-E 2 reproduction
torchlpc
LPC with Pytoch
vector-quantize-pytorch
Vector Quantization, in Pytorch
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
xlstm
Official repository of the xLSTM.