markitantov

User data from GitHub https://github.com/markitantov

SPC RAS

Saint Petersburg

http://hci.nw.ru/en/employees/10

Maxim Markitantov's starred repositories

PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Language: Python | License: MIT | 16840 221 158

Recorderjs

A plugin for recording/exporting the output of Web Audio API nodes

Language: JavaScript | 4211 184 157

deep-voice-conversion

Deep neural networks for voice conversion (voice style transfer) in TensorFlow

Language: Python | License: MIT | 3928 161 128

Resemblyzer

A Python package to analyze and compare voices with deep learning

Language: Python | License: Apache-2.0 | 2878 73 83

torch-cam

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

Language: Python | License: Apache-2.0 | 2135 11 60

ultimate-fastapi-tutorial

The Ultimate FastAPI Tutorial

Language: Python | 1123 16 15

Multimodal-Transformer

[ACL'19] [PyTorch] Multimodal Transformer

Language: Python | License: MIT | 867 13 49

Dedicated_Valheim_Server_Script

Valheim Server Manager. Supports ValheimPlus, Bepinex, Multi-world, Multi-Lang, Update, Backup, Restore, and more. Built for Linux.

Language: Shell | License: AGPL-3.0 | 708 26 142

w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

Language: Jupyter Notebook | License: MIT | 489 9 16

OptimizedHTML-4

OptimizedHTML 4: Startup HTML template based on Gulp & Bootstrap 5

Language: SCSS | 467 73 41

torchvggish

PyTorch port of Google Research's VGGish model, used for extracting audio features.

Language: Python | License: Apache-2.0 | 384 8 23

voice-vector

Deep neural networks for extracting text-independent speaker embeddings, written in TensorFlow

Language: Python | License: MIT | 309 26 10

timit

The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.

torchaudio-augmentations

Audio transformations library for PyTorch

Language: Python | License: MIT | 230 1 10

torch-scan

Seamless analysis of your PyTorch models (RAM usage, FLOPs, MACs, receptive field, etc.)

Language: Python | License: Apache-2.0 | 218 7 17

panns_inference

Language: Python | License: MIT | 213 4 15

TDNN

Time delay neural network (TDNN) implementation in PyTorch using the unfold method

Language: Python | 201 6 3

cv-dataset

Metadata and versioning details for the Common Voice dataset

Language: JavaScript | License: MPL-2.0 | 146 17 27

multimodal-emotion-recognition

This repository provides an implementation of the paper "Self-attention fusion for audiovisual emotion recognition with incomplete data".

Language: Python | License: MIT | 124 3 22

MOSEI_UMONS

A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis

Language: Python | License: MIT | 121 8 22

FootAndBall

FootAndBall: Integrated player and ball detector

Language: Python | License: MIT | 116 6 9

TUT-live-age-estimator

Python implementation of a live, deep-learning-based age/gender/expression recognizer

Language: Python | License: MIT | 83 4 12

Agendernet

Age and Gender Prediction

Language: Python | 70 3 6

voxceleb_enrichment_age_gender

Code and data repository for the paper "VoxCeleb enrichment for Age and Gender recognition", submitted to ASRU 2021

Language: Jupyter Notebook | License: MIT | 67 4 6

SpeakerProfiling

Estimating the age, height, and gender of a speaker from their speech signal. https://arxiv.org/pdf/2110.13653.pdf

Language: Python | License: MIT | 66 3 9

emotion

Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,

Language: Jupyter Notebook | License: MIT | 57 3 5

dialectID_e2e

End-to-end dialect identification using a convolutional neural network

Language: Python | 52 40

w2v2-age-gender-how-to

How to use our public wav2vec2 age and gender model

Language: Jupyter Notebook | License: MIT | 39 4 5

OCEANAI

Algorithms for intelligent assessment of human personality traits from multimodal data, used to rank potential candidates for professional responsibilities

Language: Python | License: BSD-3-Clause | 32 4 3

realtime_YAMNET

Simple real-time Sound Event Detector based on YAMNet and pyaudio.

Language: Python | License: MIT | 22 10