Shivam Mehta (shivammehta25)

shivammehta25

Geek Repo

Company:@KTH

Location:Stockholm, Sweden

Home Page:http://www.shivammehta.me

Twitter:@shivammehta007

Github PK Tool:Github PK Tool

Shivam Mehta's repositories

Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Language:Jupyter NotebookLicense:MITStargazers:410Issues:13Issues:41

Neural-HMM

Neural HMMs are all you need (for high-quality attention-free TTS)

Language:Jupyter NotebookLicense:MITStargazers:149Issues:7Issues:14

OverFlow

Putting flows on top of neural transducers for better TTS

Language:Jupyter NotebookLicense:MITStargazers:61Issues:6Issues:2

Diff-TTSG

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis

Language:PythonStargazers:38Issues:4Issues:0

Matcha-TTS-checkpoints

Repository specific for hosting Matcha-TTS's checkpoints in its release. Mitigation due to the bug in gdown

Stargazers:3Issues:0Issues:0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:2Issues:1Issues:0

lightning-tutorials

Collection of Pytorch lightning tutorial form as rich scripts automatically transformed to ipython notebooks.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

shivammehta25.github.io

Migrating my old webpage from old shivammehta.me (Wordpress) to GitHub.

Language:RubyLicense:MITStargazers:1Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

License:MITStargazers:0Issues:1Issues:0

Bayesian-Flow-Networks

A simple implimentation of Bayesian Flow Networks (BFN)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

CLAP

Contrastive Language-Audio Pretraining

Language:PythonLicense:CC0-1.0Stargazers:0Issues:1Issues:0

conditional-flow-matching

Conditional Flow Matching: Simulation-Free Dynamic Optimal Transport

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

License:Apache-2.0Stargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Fun-Coding

I will be saving and committing everyday, Something or update Study progress or Notes.

Language:PythonStargazers:0Issues:2Issues:0

Grad-TTS_Repo

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

NeMo

NeMo: a toolkit for conversational AI

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Nvidia-DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

pytorch-lightning

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

rustlings

:crab: Small exercises to get you used to reading and writing Rust code!

Language:RustLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

License:MITStargazers:0Issues:0Issues:0

wasp_SE_course

Resources and student assignments for the WASP Software Engineering course

Language:TeXStargazers:0Issues:1Issues:0

WhisperFusion

WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.

Stargazers:0Issues:0Issues:0