Amir Hussein's repositories
Simplified-DSN
Simplified implementation for Domain Seperation Networks
Applied-Deep-Learning
Applied Deep Learning Course
allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
code-switching-papers
A curated list of research papers and resources on code-switching
BLEU
Implement the BLEU metric of machine translation.
DisVoice
feature extraction from speech signals
espnet-ml
End-to-End Speech Processing Toolkit
From-0-to-Research-Scientist-resources-guide
Detailed and tailored guide for undergraduate students or anybody want to dig deep into the field of AI with solid foundation.
kaldi
This is the official location of the Kaldi project.
lhotse
Tools for handling speech data in machine learning projects.
ML-YouTube-Courses
📺 A place to discover the latest machine learning courses on YouTube.
NeMo-text-processing
NeMo text processing for ASR and TTS
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
segan_pytorch
Speech Enhancement Generative Adversarial Network in PyTorch
sherpa-installation
K2/icefall/sherpa installation
shinjiwlab.github.io
wav lab
slurm_tutorial
Collection of slurm scripts examples
SpeechMix
Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together
SpeechTransProgress
Tracking the progress in end-to-end speech translation
Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition"
VBx
Variational Bayes HMM over x-vectors diarization