Shivam Mehta's repositories
Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
Matcha-TTS-checkpoints
Repository specific for hosting Matcha-TTS's checkpoints in its release. Mitigation due to the bug in gdown
lightning-tutorials
Collection of Pytorch lightning tutorial form as rich scripts automatically transformed to ipython notebooks.
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Bayesian-Flow-Networks
A simple implimentation of Bayesian Flow Networks (BFN)
conditional-flow-matching
Conditional Flow Matching: Simulation-Free Dynamic Optimal Transport
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Fun-Coding
I will be saving and committing everyday, Something or update Study progress or Notes.
Grad-TTS_Repo
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Nvidia-DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
pytorch-lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
wasp_SE_course
Resources and student assignments for the WASP Software Engineering course
WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.