ensky0

ensky0

Geek Repo

Github PK Tool:Github PK Tool

ensky0's starred repositories

iCanHazShortcut

simple shortcut manager for macOS

Language:PureBasicLicense:UnlicenseStargazers:384Issues:0Issues:0
Language:PythonStargazers:22Issues:0Issues:0

covost

CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)

Language:PythonLicense:NOASSERTIONStargazers:334Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:3525Issues:0Issues:0

lhotse

Tools for handling speech data in machine learning projects.

Language:PythonLicense:Apache-2.0Stargazers:913Issues:0Issues:0

sacrebleu

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

Language:PythonLicense:Apache-2.0Stargazers:1022Issues:0Issues:0

audiotools

Object-oriented handling of audio data, with GPU-powered augmentations, and more.

Language:PythonLicense:MITStargazers:205Issues:0Issues:0

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language:PythonStargazers:544Issues:0Issues:0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonLicense:MITStargazers:1049Issues:0Issues:0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10603Issues:0Issues:0

ExpressiveTacotron

This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for building prosody encoder.

Language:PythonStargazers:74Issues:0Issues:0

VocGAN

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Language:PythonLicense:MITStargazers:317Issues:0Issues:0

SQLCipher-Password-Cracker-OpenCL

Password cracker for SQLCipher v2 using OpenCL

Language:CLicense:MITStargazers:107Issues:0Issues:0

sqlcipher

SQLCipher is a standalone fork of SQLite that adds 256 bit AES encryption of database files and other security features.

Language:CLicense:NOASSERTIONStargazers:6069Issues:0Issues:0
Language:PythonStargazers:41Issues:0Issues:0

Non-Attentive-Tacotron

This is Pytorch Implementation of Google's Non-attentive Tacotron.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:57Issues:0Issues:0

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Language:PythonLicense:MITStargazers:187Issues:0Issues:0

Voice_Activity_Detector

A statistical model-based Voice Activity Detection

Language:Jupyter NotebookStargazers:187Issues:0Issues:0

Voice-Activity-Detection

Efficient voice activity detection algorithms using long-term speech information in C++

Language:C++License:MITStargazers:93Issues:0Issues:0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:8133Issues:0Issues:0

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language:PythonLicense:Apache-2.0Stargazers:9096Issues:0Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++License:MITStargazers:33433Issues:0Issues:0
Language:PythonLicense:MITStargazers:70Issues:0Issues:0

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language:PythonLicense:MITStargazers:1118Issues:0Issues:0

Autoformer

About Code release for "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008

License:MITStargazers:1Issues:0Issues:0

bigbird

Transformers for Longer Sequences

Language:PythonLicense:Apache-2.0Stargazers:559Issues:0Issues:0

longformer

Longformer: The Long-Document Transformer

Language:PythonLicense:Apache-2.0Stargazers:2010Issues:0Issues:0

performer-pytorch

An implementation of Performer, a linear attention-based transformer, in Pytorch

Language:PythonLicense:MITStargazers:1073Issues:0Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:1855Issues:0Issues:0

TransTacoS-RetuneGAN

A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.

Language:PythonLicense:MITStargazers:15Issues:0Issues:0