Praveen Narayanan's repositories
pyramidal_rnns
Experiments to hack together a pyramidal bilstm from the listen, attend and spell paper
HighwayLayerTest
Scratch notebook to use Highway layers
SuperResGANUnet
Attempt at creating larger images with the StackGAN concept
vae_ebgan_mnist
VAE EBGAN
wasserstein_autoencoders
Implementation of Wasserstein Autoencoders
BEGAN_MNIST
BEGAN experiments
bidirectional-rnn
Design a simple bi-RNN by hand
deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
ebgan_mnist
EBGAN/VAE experiments with mnist
librispeech-alignments
Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset
MNIST_svhn_dataloader
General utils for dataloader and vis
numpy-100
100 numpy exercises (with solutions)
PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
vaegan_lsun
VAEGAN experiments with patchgan strategy
VideoPose3D
Efficient 3D human pose estimation in video using 2D keypoint trajectories
VisionMamba
Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory when performing batch inference to extract features on high-res images
voice_conversion
Some code from my voice conversion paper
wgan_mnist
WGAN wiith conv layers