rishikksh20

Rishikesh (ऋषिकेश)'s repositories

ViViT-pytorch

Implementation of ViViT: A Video Vision Transformer

Language:PythonMIT525 7 9

ResUnet

Pytorch implementation of ResUnet and ResUnet ++

Language:Python487 2 10

VocGAN

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Language:PythonMIT319 10 18

iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Language:PythonApache-2.0242 7 19

FastSpeech2

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

Language:Jupyter NotebookApache-2.0231 10 12

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Language:Jupyter NotebookApache-2.0157 6 11

HiFiplusplus-pytorch

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Language:PythonMIT155 11 6

SoundStorm-pytorch

Google's SoundStorm: Efficient Parallel Audio Generation

Language:PythonMIT131 17 5

Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Language:PythonMIT117 13 4

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Language:PythonMIT103 6 9

vae_tacotron2

VAE Tacotron 2, an alternative of GST Tacotron

Language:PythonMIT88 6 9

LightSpeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Language:PythonApache-2.085 7 5

HiFi-GAN

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT82 7 7

UnivNet-pytorch

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

Language:PythonMIT74 6 4

AdaSpeech2

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

Language:Jupyter NotebookMIT70 90

NaturalSpeech2

Language:PythonMIT69 12 1

AudioMAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders that Listen

Language:PythonMIT66 2 3

gmvae_tacotron

Gaussian Mixture VAE Tacotron

Language:PythonMIT53 6 3

Liveness-Detection

Liveness Detection for human face

Language:Python52 40

iSTFT-Avocodo-pytorch

Ultrafast GAN based Vocoder for Text to Speech

Language:PythonMIT50 5 2

Phone-Level-Mixture-Density-Network-for-TTS

Rich Prosody Diversity Modelling with Phone-level Mixture Density Network

Language:Jupyter NotebookMIT45 5 1

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Language:PythonMIT34 4 2

NU-Wave2-pytorch

NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]

Language:PythonMIT24 60

Bidirectional-LEM-pytorch

Pytorch Implementation of Bidirectional Long Expressive Memory

Language:PythonMIT9 20

ai-audio-startups

Community list of startups working with AI in audio and music technology

Apache-2.03 10

Inception-Transformer-pytorch

iFormer: Inception Transformer

MIT1 1 1

rishikksh20

03 7

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language:Python000

ahmetfurkaann

010

PL-BERT

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Language:PythonMIT000