rishikksh20

Rishikesh (ऋषिकेश)'s repositories

ViViT-pytorch

Implementation of ViViT: A Video Vision Transformer

Language: Python | License: MIT | 459 8 9

VocGAN

VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network

Language: Python | License: MIT | 318 13 18

FastSpeech2

PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech

Language: Jupyter Notebook | License: Apache-2.0 | 212 10 12

iSTFTNet-pytorch

iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Language: Python | License: Apache-2.0 | 208 10 15

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Language: Jupyter Notebook | License: Apache-2.0 | 155 7 11

HiFiplusplus-pytorch

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Language: Python | License: MIT | 143 12 6

SoundStorm-pytorch

Google's SoundStorm: Efficient Parallel Audio Generation

Language: Python | License: MIT | 115 17 5

Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Language: Python | License: MIT | 114 15 4

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Language: Python | License: MIT | 99 7 9

vae_tacotron2

VAE Tacotron 2, an alternative to GST Tacotron

Language: Python | License: MIT | 85 7 9

HiFi-GAN

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python | License: MIT | 78 7 7

LightSpeech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Language: Python | License: Apache-2.0 | 77 9 5

AdaSpeech2

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

Language: Jupyter Notebook | License: MIT | 69 90

UnivNet-pytorch

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

Language: Python | License: MIT | 67 6 4

NaturalSpeech2

Language: Python | License: MIT | 66 130

AudioMAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders that Listen

Language: Python | License: MIT | 60 4 2

Liveness-Detection

Liveness Detection for human faces

Language: Python | 52 40

gmvae_tacotron

Gaussian Mixture VAE Tacotron

Language: Python | License: MIT | 51 6 3

iSTFT-Avocodo-pytorch

Ultrafast GAN-based Vocoder for Text to Speech

Language: Python | License: MIT | 51 6 2

Phone-Level-Mixture-Density-Network-for-TTS

Rich Prosody Diversity Modelling with Phone-level Mixture Density Network

Language: Jupyter Notebook | License: MIT | 45 5 1

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Language: Python | License: MIT | 34 4 2

NU-Wave2-pytorch

NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]

Language: Python | License: MIT | 24 60

Bidirectional-LEM-pytorch

PyTorch Implementation of Bidirectional Long Expressive Memory

Language: Python | License: MIT | 9 20

WaveFlow

WaveFlow: A Compact Flow-based Model for Raw Audio

Language: Python | 4 20

ai-audio-startups

Community list of startups working with AI in audio and music technology

License: Apache-2.0 | 3 10

Inception-Transformer-pytorch

iFormer: Inception Transformer

License: MIT | 1 2 1

rishikksh20

03 6

AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Language: Python | 000

ahmetfurkaann

010

PL-BERT

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Language: Python | License: MIT | 000