Ajili M's repositories
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
cnn-audio-denoiser
Tensorflow 2.0 implementation of the paper: A Fully Convolutional Neural Network for Speech Enhancement
DRENet
The official implementation of DRENet (Degraded Reconstruction Enhancement Network) for tiny ship detection in remote sensing Images
easy-kaldi
Use your data to create a speech recognition system in Kaldi. Fast.
espnet
End-to-End Speech Processing Toolkit
faiss
A library for efficient similarity search and clustering of dense vectors.
kaldi-model-server
Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone
LipNet-PyTorch
"LipNet: End-to-End Sentence-level Lipreading" in PyTorch
Machine-Learning-Web-Apps
Building and Embedding Machine Learning Model into a Web App(With Flask,Streamlit,etc)
MKCF
Multiple Kernelized Correlation Filters (MKCF) for Extended Object Tracking Using X-band Marine Radar Data. [Keywords: Object tracking, Visual tracking, Radar data, EOT, ETT]
multispectral-object-detection
Multispectral Object Detection with Yolov5 and Transformer
NeMo
NeMo: a toolkit for conversational AI
open_stt
Open STT - amazing resources
openai-whisper-cpu
Improving transcription performance of OpenAI Whisper for CPU based deployment
pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Satellite-Imagery-Datasets-Containing-Ships
A list of radar and optical satellite datasets for ship detection, classification, semantic segmentation and instance segmentation tasks.
Ship-Detection-from-Satellite-Images-using-YOLOV4
Ship detection from remote sensing imagery is a crucial application for maritime security which includes among others traffic surveillance, protection against illegal fisheries, oil discharge control and sea pollution monitoring. This is typically done through the use of an Automated Identification System (AIS), which uses VHF radio frequencies to
Speaker-Embeddings
PyTorch implementation of a self-attentive speaker embedding
speaker-id
This repository contains audio samples and supplementary materials accompanying publications related to the speaker-id team at Google.
TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
the-incredible-pytorch
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
voice-activity-detection
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
voxceleb_trainer
In defence of metric learning for speaker recognition
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Writing
📚📝 Notes on the journey
YOLOv5-ODConvNeXt
YOLOv5-ODConvNeXt is an improved version of YOLOv5 for ship detection on drone-captured images.
zamia-speech
Open tools and data for cloudless automatic speech recognition