A.HEBA's repositories
arabic-tacotron-tts
End-to-end Arabic TTS system based on Tacotron
speechbrain-aheba-contribs
A PyTorch-based Speech Toolkit
DtmfDetection
C# implementation of the Goertzel algorithm for DTMF tone (a.k.a. Touch-Tone) detection and localization in audio data. Includes wrappers and extensions for NAudio.
End-to-end-ASR-Pytorch
This is an open-source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with PyTorch, the well-known deep learning toolkit.
face-parsing.PyTorch
Using modified BiSeNet for face parsing in PyTorch
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
fastText
Library for fast text representation and classification.
incbin
Include binary files in C/C++
iptv
Collection of publicly available IPTV channels from all over the world
Kaldi-for-ASR-of-Swiss-German
A Kaldi ASR recipe adapted for the Swiss German data from the ArchiMob spoken corpus.
Litmus
AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems
mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
neural_sp
End-to-end ASR/LM implementation with PyTorch
plato-research-dialogue-system
This is the Plato Research Dialogue System, a flexible platform for developing conversational AI agents.
pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
pytorch-pretrained-BERT
A PyTorch implementation of Google AI's BERT model provided with Google's pre-trained models, examples and utilities.
rpunct-cpu
📝 An easy-to-use package to restore punctuation in text.
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
shazam-demo
Audio search and analyzer application (like Shazam)
Speaker-Recognition
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Speech_Emotion_Recognition_DNN-ELM
Implementation of Speech Emotion Recognition using DNN-ELM
Telephone
SIP softphone for Mac
Voice-Privacy-Challenge-2020
Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf
vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
warp-transducer
A fast parallel implementation of RNN Transducer.
x-vector-kaldi-tf
TensorFlow implementation of the x-vector topology on top of a Kaldi recipe