Sravani Dandu's repositories
DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
pytorch_geometric
Geometric Deep Learning Extension Library for PyTorch
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
awesome-object-detection
Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html
coursera-gan-specialization
Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
awesome-speech-enhancement
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
ba
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
computer-science
:mortar_board: Path to a free self-taught education in Computer Science!
deepvoice3_pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
go-figure-kubernetes
Kubernetes environment for running go figure apps
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
machine-learning-interview
Machine Learning Interviews from FAAG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc.
ml-system-design-pattern
System design patterns for machine learning
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
overview
Description-FAQ of the process
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
pytorch-dc-tts
Text to Speech with PyTorch (English and Mongolian)
stylegan2-training
Materials for StyleGAN2 Training class
TTS-Style-Transfer
Official PyTorch implementation of TTS Style Transfer
WaveRNN
WaveRNN Vocoder + TTS