Salvador Medina (salmedina)

salmedina

Geek Repo

Company:Carnegie Mellon University

Location:Pittsburgh

Github PK Tool:Github PK Tool

Salvador Medina's repositories

SpeechDrivenTongueAnimation

ML-driven tongue animation (CVPR'22)

Language:PythonLicense:MITStargazers:41Issues:4Issues:3

pdf2thumb

This little program generates a thumbnail of a certain pdf for quick visualization. It is based on ImageMagick as it has all the functionality required.

Language:PythonStargazers:17Issues:2Issues:0

ContinuousTongueMotionAnalysis

Site for the Continuous Tongue Motion Analysis Project

Language:SCSSLicense:MITStargazers:1Issues:2Issues:0
Language:Jupyter NotebookStargazers:1Issues:3Issues:0
Language:PythonLicense:MITStargazers:1Issues:2Issues:0

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language:JavaScriptLicense:MITStargazers:0Issues:1Issues:0

av_hubert

A self-supervised learning framework for audio-visual speech

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

deepspeech.pytorch

Speech Recognition using DeepSpeech2.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

email_verifier

Verifies from a list of emails which domains are valid

Language:PythonStargazers:0Issues:2Issues:0

foma

Automatically exported from code.google.com/p/foma

Stargazers:0Issues:0Issues:0

Gravity

Minimal is the new cool.

Language:SCSSLicense:MITStargazers:0Issues:1Issues:0

head-pose-estimation

Head pose estimation by TensorFlow and OpenCV

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

hopfield-layers

Hopfield Networks is All You Need

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Lipreading_using_Temporal_Convolutional_Networks

ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

math_workspace

Study scripts and notebooks

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

p2fa_py3

Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3

Language:PythonStargazers:0Issues:1Issues:0

pase

Problem Agnostic Speech Encoder

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

PhISANet

Repository for the PhISANet model

Stargazers:0Issues:0Issues:0

places365

The Places365-CNNs for Scene Classification

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

PlotNeuralNet

Latex code for making neural networks diagrams

License:MITStargazers:0Issues:0Issues:0

pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

PytorchLightningSample

Sample code for learning Pytorch Lightning and integration with wandb

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

salmedina.github.io

Personal webpage

Language:HTMLLicense:NOASSERTIONStargazers:0Issues:2Issues:0

SGConv

Sandbox for Structured Global Convolution

Stargazers:0Issues:0Issues:0

soundnet_pytorch

SoundNet was intialliy implemented in torch, popularized through TF. This is an attempt to make a solid usable repo with a PyTorch port from other repos.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

SuperGluePretrainedNetwork

SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

VIBE

Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

License:MITStargazers:0Issues:0Issues:0