salmedina

Salvador Medina's repositories

SpeechDrivenTongueAnimation

ML-driven tongue animation (CVPR'22)

Language: Python | MIT | 41 4 3

pdf2thumb

This small program generates a thumbnail of a given PDF for quick previewing. It is built on ImageMagick, which provides all the required functionality.

Language: Python | 17 20

ContinuousTongueMotionAnalysis

Site for the Continuous Tongue Motion Analysis Project

Language: SCSS | MIT | 1 20

CriticalTweetsClassifier

Language: Jupyter Notebook | 1 30

FomaSREvaluation

Language: Python | MIT | 1 20

academicpages.github.io

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript | MIT | 010

av_hubert

A self-supervised learning framework for audio-visual speech

Language: Python | NOASSERTION | 000

deepspeech.pytorch

Speech Recognition using DeepSpeech2.

Language: Python | MIT | 010

email_verifier

Verifies from a list of emails which domains are valid

Language: Python | 020

foma

Automatically exported from code.google.com/p/foma

000

Gravity

Minimal is the new cool.

Language: SCSS | MIT | 010

head-pose-estimation

Head pose estimation by TensorFlow and OpenCV

Language: Python | MIT | 010

hopfield-layers

Hopfield Networks is All You Need

Language: Python | NOASSERTION | 010

leaf-audio

Language: Python | Apache-2.0 | 010

Lipreading_using_Temporal_Convolutional_Networks

ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Language: Python | NOASSERTION | 000

math_workspace

Study scripts and notebooks

Language: Jupyter Notebook | 020

p2fa_py3

Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3

Language: Python | 010

pase

Problem Agnostic Speech Encoder

Language: Python | MIT | 000

PhISANet

Repository for the PhISANet model

000

places365

The Places365-CNNs for Scene Classification

Language: Python | MIT | 010

PlotNeuralNet

LaTeX code for drawing neural network diagrams

MIT | 000

pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Language: Python | NOASSERTION | 010

PytorchLightningSample

Sample code for learning PyTorch Lightning and its integration with wandb

Language: Python | MIT | 000

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | NOASSERTION | 000

salmedina.github.io

Personal webpage

Language: HTML | NOASSERTION | 020

SGConv

Sandbox for Structured Global Convolution

000

soundnet_pytorch

SoundNet was initially implemented in Torch and popularized through TensorFlow. This is an attempt to build a solid, usable repository with a PyTorch port based on other repos.

Language: Python | MIT | 020

SuperGluePretrainedNetwork

SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)

Language: Python | NOASSERTION | 020

VIBE

Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"

Language: Python | NOASSERTION | 020

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

MIT | 000