Ajili M (ajilim)

ajilim

Geek Repo

Location:France

Github PK Tool:Github PK Tool

Ajili M's repositories

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

License:Apache-2.0Stargazers:0Issues:0Issues:0

cnn-audio-denoiser

Tensorflow 2.0 implementation of the paper: A Fully Convolutional Neural Network for Speech Enhancement

Stargazers:0Issues:0Issues:0

DRENet

The official implementation of DRENet (Degraded Reconstruction Enhancement Network) for tiny ship detection in remote sensing Images

License:GPL-3.0Stargazers:0Issues:0Issues:0

easy-kaldi

Use your data to create a speech recognition system in Kaldi. Fast.

License:Apache-2.0Stargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

faiss

A library for efficient similarity search and clustering of dense vectors.

License:MITStargazers:0Issues:0Issues:0

kaldi-model-server

Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone

License:Apache-2.0Stargazers:0Issues:0Issues:0

LipNet-PyTorch

"LipNet: End-to-End Sentence-level Lipreading" in PyTorch

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

Machine-Learning-Web-Apps

Building and Embedding Machine Learning Model into a Web App(With Flask,Streamlit,etc)

Stargazers:0Issues:0Issues:0

MKCF

Multiple Kernelized Correlation Filters (MKCF) for Extended Object Tracking Using X-band Marine Radar Data. [Keywords: Object tracking, Visual tracking, Radar data, EOT, ETT]

License:GPL-3.0Stargazers:0Issues:0Issues:0

multispectral-object-detection

Multispectral Object Detection with Yolov5 and Transformer

License:AGPL-3.0Stargazers:0Issues:0Issues:0

NeMo

NeMo: a toolkit for conversational AI

License:Apache-2.0Stargazers:0Issues:0Issues:0

open_stt

Open STT - amazing resources

License:NOASSERTIONStargazers:0Issues:0Issues:0

openai-whisper-cpu

Improving transcription performance of OpenAI Whisper for CPU based deployment

License:MITStargazers:0Issues:0Issues:0

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

License:MITStargazers:0Issues:0Issues:0

Satellite-Imagery-Datasets-Containing-Ships

A list of radar and optical satellite datasets for ship detection, classification, semantic segmentation and instance segmentation tasks.

License:MITStargazers:0Issues:0Issues:0

Ship-Detection-from-Satellite-Images-using-YOLOV4

Ship detection from remote sensing imagery is a crucial application for maritime security which includes among others traffic surveillance, protection against illegal fisheries, oil discharge control and sea pollution monitoring. This is typically done through the use of an Automated Identification System (AIS), which uses VHF radio frequencies to

Stargazers:0Issues:0Issues:0

Speaker-Embeddings

PyTorch implementation of a self-attentive speaker embedding

License:MITStargazers:0Issues:0Issues:0

speaker-id

This repository contains audio samples and supplementary materials accompanying publications related to the speaker-id team at Google.

License:NOASSERTIONStargazers:0Issues:0Issues:0

TensorFlow-Examples

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

License:NOASSERTIONStargazers:0Issues:0Issues:0

the-incredible-pytorch

The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.

License:MITStargazers:0Issues:0Issues:0

voice-activity-detection

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

License:MITStargazers:0Issues:0Issues:0

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Stargazers:0Issues:0Issues:0

voxceleb_trainer

In defence of metric learning for speaker recognition

License:MITStargazers:0Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

License:MITStargazers:0Issues:0Issues:0

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

License:AGPL-3.0Stargazers:0Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

License:BSD-4-ClauseStargazers:0Issues:0Issues:0

Writing

📚📝 Notes on the journey

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

YOLOv5-ODConvNeXt

YOLOv5-ODConvNeXt is an improved version of YOLOv5 for ship detection on drone-captured images.

Stargazers:0Issues:0Issues:0

zamia-speech

Open tools and data for cloudless automatic speech recognition

License:LGPL-3.0Stargazers:0Issues:0Issues:0