zyser's repositories

spear-tts-pytorch

An unofficial PyTorch implementation of SPEAR-TTS.

Language: Jupyter Notebook, License: MIT, Stargazers: 1, Issues: 0, Issues: 0

alltalk_tts

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but it supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, a narrator, model finetuning, custom models, and wav file maintenance. It can also be used with third-party software via JSON calls.

Language: HTML, License: AGPL-3.0, Stargazers: 0, Issues: 0, Issues: 0

Bert-VITS2

VITS2 backbone with BERT

Language: Python, License: AGPL-3.0, Stargazers: 0, Issues: 0, Issues: 0

agent-attention-pytorch

Implementation of Agent Attention in Pytorch

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

BigVGAN-NVIDIA

Official implementation of BigVGAN in PyTorch

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

diffsptk

A differentiable version of SPTK

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

fairseq_meta_fork

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

gmm-torch

Gaussian mixture models in PyTorch.

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

golf

A DDSP-based neural vocoder.

Language: Jupyter Notebook, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

HierSpeechpp_zero_shot_vc

The official implementation of HierSpeech++

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

local-attention

An implementation of local windowed attention for language modeling

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

LPCNet

Efficient neural speech synthesis

Language: C, License: BSD-3-Clause, Stargazers: 0, Issues: 3, Issues: 0

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

metavoice-src

AI for human-level speech intelligence

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

phonemizer

Simple text to phones converter for multiple languages

Language: Python, License: GPL-3.0, Stargazers: 0, Issues: 0, Issues: 0

ring-attention-pytorch

Explorations into Ring Attention, from Liu et al. at Berkeley AI

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

soundata

Python library for downloading, loading & working with sound datasets

Language: Python, License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

supervoice-vall-e-2

VALL-E 2 reproduction

Stargazers: 0, Issues: 0, Issues: 0

torchlpc

LPC with PyTorch

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

vector-quantize-pytorch

Vector Quantization, in Pytorch

Language: Python, License: MIT, Stargazers: 0, Issues: 2, Issues: 0

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

xlstm

Official repository of the xLSTM.

Language: Python, License: AGPL-3.0, Stargazers: 0, Issues: 0, Issues: 0