AigizK's repositories
silero-models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
transport-tycoon
Transport Tycoon Exercises for DDD
cassandra-lucene-index
Lucene based secondary indexes for Cassandra
csharp-driver
DataStax .NET Driver for Apache Cassandra
DeOldify
A Deep Learning based project for colorizing and restoring old images (and video!)
face_recognition
The world's simplest facial recognition api for Python and the command line
go-http-auth
Basic and Digest HTTP Authentication for golang http
groove2groove
Code for "Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data"
LocalSTT
Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech
messageVault
Publish-subscribe with replay and batching for Windows Azure.
ML-Resources
Books and courses on machine learning
mmtracking
OpenMMLab Video Perception Toolbox. It supports Single Object Tracking (SOT), Multiple Object Tracking (MOT), Video Object Detection (VID) with a unified framework.
musicinformationretrieval.com
Instructional notebooks on music information retrieval.
nlp_tasks
Natural Language Processing Tasks and References
oisin
Oisín: Wave Function Collapse for poetry
prolog-poetry
Poetry generator in Prolog
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
super-convergence
Files to create the figures in the paper "Super-Convergence: Very Fast Training of Residual Networks Using Large Learning Rates"
Talking-Face_PC-AVS
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
wordvectors
Pre-trained word vectors of 30+ languages