Rishikesh (ऋषिकेश)'s repositories
ViViT-pytorch
Implementation of ViViT: A Video Vision Transformer
FNet-pytorch
Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms
convolution-vision-transformers
PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers
FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
HiFiplusplus-pytorch
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
Avocodo-pytorch
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
vae_tacotron2
VAE Tacotron 2, an alternative of GST Tacotron
TalkNet2-pytorch
TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.
AdaSpeech2
AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data
UnivNet-pytorch
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
LightSpeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
AudioMAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders that Listen
iSTFT-Avocodo-pytorch
Ultrafast GAN based Vocoder for Text to Speech
gmvae_tacotron
Gaussian Mixture VAE Tacotron
Liveness-Detection
Liveness Detection for human face
Phone-Level-Mixture-Density-Network-for-TTS
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
NU-Wave-pytorch
NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling
Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
NU-Wave2-pytorch
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
Bidirectional-LEM-pytorch
Pytorch Implementation of Bidirectional Long Expressive Memory
ai-audio-startups
Community list of startups working with AI in audio and music technology
Inception-Transformer-pytorch
iFormer: Inception Transformer