Beast code in Giters

This repository provides a multi-mode, multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, and Non-attentive Tacotron, with GST, VAE, GMVAE, and X-vectors for building a prosody encoder.

Language: Python | 7400

VocGAN

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Language: Python | License: MIT | 31700

SQLCipher-Password-Cracker-OpenCL

Password cracker for SQLCipher v2 using OpenCL

Language: C | License: MIT | 10700

sqlcipher

SQLCipher is a standalone fork of SQLite that adds 256-bit AES encryption of database files and other security features.

Language: C | License: NOASSERTION | 606900

translatotron

Language: Python | 4100

Non-Attentive-Tacotron

This is a PyTorch implementation of Google's Non-Attentive Tacotron.

Language: Jupyter Notebook | License: Apache-2.0 | 5700

Parallel-Tacotron2

PyTorch implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Language: Python | License: MIT | 18700

Voice_Activity_Detector

A statistical model-based voice activity detector

Language: Jupyter Notebook | 18700

Voice-Activity-Detection

Efficient voice activity detection algorithms using long-term speech information in C++

Language: C++ | License: MIT | 9300

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language: Python | License: NOASSERTION | 813300

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language: Python | License: Apache-2.0 | 909600

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language: C++ | License: MIT | 3343300

causal-transformer-decoder

Language: Python | License: MIT | 7000

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language: Python | License: MIT | 111800

Autoformer

Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008

License: MIT | 100

bigbird

Transformers for Longer Sequences

Language: Python | License: Apache-2.0 | 55900

longformer

Longformer: The Long-Document Transformer

Language: Python | License: Apache-2.0 | 201000

performer-pytorch

An implementation of Performer, a linear attention-based transformer, in PyTorch

Language: Python | License: MIT | 107300

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python | License: MIT | 185500

TransTacoS-RetuneGAN

A toy-like text-to-speech system for Chinese/Mandarin synthesis, inspired by Tacotron, FastSpeech2, and RefineGAN.

Language: Python | License: MIT | 1500

ensky0

ensky0's starred repositories

iCanHazShortcut

poetry-cython-example

covost

silero-vad

lhotse

sacrebleu

audiotools

AcademiCodec

descript-audio-codec

seamless_communication

ExpressiveTacotron