Salvador Medina's repositories
SpeechDrivenTongueAnimation
ML-driven tongue animation (CVPR'22)
ContinuousTongueMotionAnalysis
Site for the Continuous Tongue Motion Analysis Project
academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
av_hubert
A self-supervised learning framework for audio-visual speech
deepspeech.pytorch
Speech Recognition using DeepSpeech2.
email_verifier
Verifies from a list of emails which domains are valid
foma
Automatically exported from code.google.com/p/foma
head-pose-estimation
Head pose estimation by TensorFlow and OpenCV
hopfield-layers
Hopfield Networks is All You Need
Lipreading_using_Temporal_Convolutional_Networks
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
math_workspace
Study scripts and notebooks
pase
Problem Agnostic Speech Encoder
PhISANet
Repository for the PhISANet model
PlotNeuralNet
Latex code for making neural networks diagrams
pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
PytorchLightningSample
Sample code for learning Pytorch Lightning and integration with wandb
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
salmedina.github.io
Personal webpage
SGConv
Sandbox for Structured Global Convolution
soundnet_pytorch
SoundNet was intialliy implemented in torch, popularized through TF. This is an attempt to make a solid usable repo with a PyTorch port from other repos.
SuperGluePretrainedNetwork
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
whisper
Robust Speech Recognition via Large-Scale Weak Supervision