Sravani Dandu (sravanidn)

User data from GitHub: https://github.com/sravanidn

Company: ML Researcher at Comcast Labs

Location: San Francisco, California

GitHub: @sravanidn

Sravani Dandu's repositories

DeepFaceLab

DeepFaceLab is the leading software for creating deepfakes.

Language: Python | License: GPL-3.0 | Stargazers: 1 | Issues: 1 | Issues: 0

FARM

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

pytorch_geometric

Geometric Deep Learning Extension Library for PyTorch

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 1 | Issues: 0

audacity

Audio Editor

Language: C | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

awesome-object-detection

Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html

Stargazers: 0 | Issues: 0 | Issues: 0

coursera-gan-specialization

Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language: C++ | License: MPL-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and its generalization abilities.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: R | Stargazers: 0 | Issues: 2 | Issues: 0

Tacotron-2

DeepMind's Tacotron-2 TensorFlow implementation

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

awesome-speech-enhancement

A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.

License: CC0-1.0 | Stargazers: 0 | Issues: 1 | Issues: 0

ba

Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

computer-science

:mortar_board: Path to a free self-taught education in Computer Science!

License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

deepvoice3_pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Visual Basic | Stargazers: 0 | Issues: 2 | Issues: 0

go-figure-kubernetes

Kubernetes environment for running go figure apps

Language: Shell | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

machine-learning-interview

Machine Learning Interviews from FAAG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc.

Stargazers: 0 | Issues: 1 | Issues: 0

ml-system-design-pattern

System design patterns for machine learning

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

overview

Description-FAQ of the process

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

pytorch-dc-tts

Text to Speech with PyTorch (English and Mongolian)

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

stylegan2-training

Materials for StyleGAN2 Training class

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0
Stargazers: 0 | Issues: 1 | Issues: 0

TTS-Style-Transfer

Official PyTorch implementation of TTS Style Transfer

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

WaveRNN

WaveRNN Vocoder + TTS

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0