Maxim Markitantov's starred repositories
PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
Recorderjs
A plugin for recording/exporting the output of Web Audio API nodes
deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
Resemblyzer
A python package to analyze and compare voices with deep learning
ultimate-fastapi-tutorial
The Ultimate FastAPI Tutorial
Multimodal-Transformer
[ACL'19] [PyTorch] Multimodal Transformer
Dedicated_Valheim_Server_Script
Valheim Server Manager . Supports: ValheimPlus, Bepinex, Multi-world, Multi-Lang, Update, Backup, Restore and more: Built for Linux
w2v2-how-to
How to use our public wav2vec2 dimensional emotion model
OptimizedHTML-4
OptimizedHTML 4: Startup HTML template based on Gulp & Bootstrap 5
torchvggish
Pytorch port of Google Research's VGGish model used for extracting audio features.
voice-vector
Deep neural networks for getting text-independent speaker embedding written in TensorFlow
torchaudio-augmentations
Audio transformations library for PyTorch
torch-scan
Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.)
cv-dataset
Metadata and versioning details for the Common Voice dataset
multimodal-emotion-recognition
This repository provides implementation for the paper "Self-attention fusion for audiovisual emotion recognition with incomplete data".
MOSEI_UMONS
A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
FootAndBall
FootAndBall: Integrated player and ball detector
TUT-live-age-estimator
Python implementation of a live deep learning based age/gender/expression recognizer
Agendernet
Age and Gender Prediction
voxceleb_enrichment_age_gender
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021
SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
dialectID_e2e
End to End Dialect Identification using Convolutional Neural Network
w2v2-age-gender-how-to
How to use our public wav2vec2 age and gender model
realtime_YAMNET
Simple real-time Sound Event Detector based on YAMNet and pyaudio.