Rishikesh (ऋषिकेश) (rishikksh20)

Company: Open Source

Location: New Delhi, India

Home Page: https://deepsync.co/

Twitter: @ai_rishikesh


Organizations
coala
EpicGames

Rishikesh (ऋषिकेश)'s repositories

VocGAN

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Language: Python | License: MIT | Stargazers: 275 | Issues: 13 | Issues: 18

ViViT-pytorch

Implementation of ViViT: A Video Vision Transformer

Language: Python | License: MIT | Stargazers: 249 | Issues: 7 | Issues: 5

FNet-pytorch

Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms

Language: Python | License: MIT | Stargazers: 205 | Issues: 4 | Issues: 5

convolution-vision-transformers

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

Language: Python | License: MIT | Stargazers: 181 | Issues: 6 | Issues: 6

FastSpeech2

PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 164 | Issues: 10 | Issues: 11

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 107 | Issues: 5 | Issues: 9

HiFiplusplus-pytorch

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Language: Python | License: MIT | Stargazers: 97 | Issues: 11 | Issues: 3

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Language: Python | License: MIT | Stargazers: 85 | Issues: 6 | Issues: 9

iSTFTNet-pytorch

iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Language: Python | License: Apache-2.0 | Stargazers: 82 | Issues: 6 | Issues: 8

vae_tacotron2

VAE Tacotron 2, an alternative to GST Tacotron

Language: Python | License: MIT | Stargazers: 82 | Issues: 6 | Issues: 9

TalkNet2-pytorch

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.

Language: Python | License: MIT | Stargazers: 64 | Issues: 8 | Issues: 3

Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Language: Python | License: MIT | Stargazers: 63 | Issues: 12 | Issues: 1

HiFi-GAN

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python | License: MIT | Stargazers: 61 | Issues: 6 | Issues: 6

AdaSpeech2

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

Language: Jupyter Notebook | License: MIT | Stargazers: 55 | Issues: 9 | Issues: 0

gmvae_tacotron

Gaussian Mixture VAE Tacotron

Language: Python | License: MIT | Stargazers: 49 | Issues: 5 | Issues: 3

LightSpeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Language: Python | License: Apache-2.0 | Stargazers: 48 | Issues: 8 | Issues: 5

Liveness-Detection

Liveness detection for human faces

Language: Python | Stargazers: 48 | Issues: 3 | Issues: 0

UnivNet-pytorch

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

Language: Python | License: MIT | Stargazers: 48 | Issues: 5 | Issues: 4

Phone-Level-Mixture-Density-Network-for-TTS

Rich Prosody Diversity Modelling with Phone-level Mixture Density Network

Language: Jupyter Notebook | License: MIT | Stargazers: 37 | Issues: 2 | Issues: 0

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Language: Python | License: MIT | Stargazers: 32 | Issues: 3 | Issues: 2

iSTFT-Avocodo-pytorch

Ultrafast GAN-based Vocoder for Text to Speech

Language: Python | License: MIT | Stargazers: 30 | Issues: 4 | Issues: 1

AudioMAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders that Listen

Language: Python | License: MIT | Stargazers: 29 | Issues: 1 | Issues: 1

NU-Wave-pytorch

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling

Language: Python | License: MIT | Stargazers: 29 | Issues: 2 | Issues: 1

NU-Wave2-pytorch

NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]

Language: Python | License: MIT | Stargazers: 22 | Issues: 0 | Issues: 0

ResMLP-pytorch

ResMLP: Feedforward networks for image classification with data-efficient training

Language: Python | License: MIT | Stargazers: 18 | Issues: 1 | Issues: 0

Bidirectional-LEM-pytorch

PyTorch Implementation of Bidirectional Long Expressive Memory

Language: Python | License: MIT | Stargazers: 5 | Issues: 0 | Issues: 0

WaveFlow

WaveFlow: A Compact Flow-based Model for Raw Audio

Language: Python | Stargazers: 4 | Issues: 1 | Issues: 0

Inception-Transformer-pytorch

iFormer: Inception Transformer

License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

ai-audio-startups

Community list of startups working with AI in audio and music technology

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0