Rishikesh (ऋषिकेश) (rishikksh20)

rishikksh20

Geek Repo

Company:Dubpro.ai

Location:New Delhi, India

Home Page:https://www.dubpro.ai/

Twitter:@ai_rishikesh

Github PK Tool:Github PK Tool


Organizations
coala
EpicGames

Rishikesh (ऋषिकेश)'s repositories

ViViT-pytorch

Implementation of ViViT: A Video Vision Transformer

Language:PythonLicense:MITStargazers:459Issues:8Issues:9

VocGAN

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Language:PythonLicense:MITStargazers:318Issues:13Issues:18

FastSpeech2

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:212Issues:10Issues:12

iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Language:PythonLicense:Apache-2.0Stargazers:208Issues:10Issues:15

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:155Issues:7Issues:11

HiFiplusplus-pytorch

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Language:PythonLicense:MITStargazers:143Issues:12Issues:6

SoundStorm-pytorch

Google's SoundStorm: Efficient Parallel Audio Generation

Language:PythonLicense:MITStargazers:115Issues:17Issues:5

Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Language:PythonLicense:MITStargazers:114Issues:15Issues:4

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Language:PythonLicense:MITStargazers:99Issues:7Issues:9

vae_tacotron2

VAE Tacotron 2, an alternative of GST Tacotron

Language:PythonLicense:MITStargazers:85Issues:7Issues:9

HiFi-GAN

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:78Issues:7Issues:7

LightSpeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Language:PythonLicense:Apache-2.0Stargazers:77Issues:9Issues:5

AdaSpeech2

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

Language:Jupyter NotebookLicense:MITStargazers:69Issues:9Issues:0

UnivNet-pytorch

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

Language:PythonLicense:MITStargazers:67Issues:6Issues:4
Language:PythonLicense:MITStargazers:66Issues:13Issues:0

AudioMAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders that Listen

Language:PythonLicense:MITStargazers:60Issues:4Issues:2

Liveness-Detection

Liveness Detection for human face

Language:PythonStargazers:52Issues:4Issues:0

gmvae_tacotron

Gaussian Mixture VAE Tacotron

Language:PythonLicense:MITStargazers:51Issues:6Issues:3

iSTFT-Avocodo-pytorch

Ultrafast GAN based Vocoder for Text to Speech

Language:PythonLicense:MITStargazers:51Issues:6Issues:2

Phone-Level-Mixture-Density-Network-for-TTS

Rich Prosody Diversity Modelling with Phone-level Mixture Density Network

Language:Jupyter NotebookLicense:MITStargazers:45Issues:5Issues:1

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Language:PythonLicense:MITStargazers:34Issues:4Issues:2

NU-Wave2-pytorch

NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]

Language:PythonLicense:MITStargazers:24Issues:6Issues:0

Bidirectional-LEM-pytorch

Pytorch Implementation of Bidirectional Long Expressive Memory

Language:PythonLicense:MITStargazers:9Issues:2Issues:0

WaveFlow

WaveFlow : A Compact Flow-based Model for Raw Audio

Language:PythonStargazers:4Issues:2Issues:0

ai-audio-startups

Community list of startups working with AI in audio and music technology

License:Apache-2.0Stargazers:3Issues:1Issues:0

Inception-Transformer-pytorch

iFormer: Inception Transformer

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

PL-BERT

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Language:PythonLicense:MITStargazers:0Issues:0Issues:0