xinkez's repositories

alexa-end-to-end-slu

This setup allows to train end-to-end neural models for spoken language understanding (SLU).

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

apam

APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative training criterions.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

audio_source_separation

An implementation of audio source separation tools.

Language:PythonStargazers:0Issues:1Issues:0

charsiu

Charsiu: A neural phonetic aligner.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

CharsiuG2P

Multilingual G2P in over 100 languages

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

EfficientConformer

Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

GigaSpeech

Large, modern dataset for speech recognition

Language:ShellLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

kaldi-serve

Server framework for Kaldi ASR Toolkit

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

Lattice-ELMo

Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"

Language:PythonStargazers:0Issues:1Issues:0

norbert

Painless Wiener filters for audio separation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

OMLSA-IMCRA

Python implementation of OMLSA+IMCRA algorithm for speech enhancement.

Language:PythonStargazers:0Issues:1Issues:0

OpenTransformer

A No-Recurrence Sequence-to-Sequence Model for Speech Recognition

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

PercepNet

(Under construct) Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

Language:C++License:BSD-3-ClauseStargazers:0Issues:2Issues:0

pika

a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi

License:Apache-2.0Stargazers:0Issues:0Issues:0

pygsound

Impulse response generation based on state-of-the-art geometric sound propagation engine.

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity and Number Detector

License:MITStargazers:0Issues:0Issues:0

simpletransformers

Transformers for Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

supervoice-gpt

GPT-style network for phonemization with durations of text

Stargazers:0Issues:0Issues:0

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

torchaudacity

PyTorch wrappers for using your model in audacity!

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

torchprof

PyTorch layer-by-layer model profiler

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

torchsubband

Pytorch implementation of subband decomposition

Language:HTMLLicense:MITStargazers:0Issues:1Issues:0

Transformer-Transducer

A pytorch implementation of Transformer Transducer(T-T)

Language:PythonStargazers:0Issues:1Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

voice-activity-detection-2

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

voicefixer_main

General Speech Restoration

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

wikipron

Massively multilingual pronunciation mining

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0