Rishikesh (ऋषिकेश) (rishikksh20)

rishikksh20

Geek Repo

Company:Dubpro.ai

Location:New Delhi, India

Home Page:https://www.dubpro.ai/

Twitter:@ai_rishikesh

Github PK Tool:Github PK Tool


Organizations
coala
EpicGames

Rishikesh (ऋषिकेश)'s starred repositories

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:NOASSERTIONStargazers:15597Issues:142Issues:126

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:6136Issues:77Issues:1121

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:3964Issues:66Issues:84

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:3625Issues:52Issues:76

WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Language:PythonLicense:MITStargazers:1040Issues:24Issues:105

resemble-enhance

AI powered speech denoising and enhancement

Language:PythonLicense:MITStargazers:843Issues:12Issues:18

fish-speech

Brand new TTS solution

Language:PythonLicense:BSD-3-ClauseStargazers:780Issues:24Issues:70

SEINE

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Language:PythonLicense:Apache-2.0Stargazers:774Issues:24Issues:24

Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language:PythonLicense:Apache-2.0Stargazers:662Issues:8Issues:18
Language:PythonLicense:Apache-2.0Stargazers:476Issues:16Issues:14

M2UGen

This is the official repository for M2UGen

Language:Jupyter NotebookLicense:MITStargazers:359Issues:11Issues:4

emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Language:PythonLicense:Apache-2.0Stargazers:232Issues:12Issues:15

tamil-llama

A New Tamil Large Language Model (LLM) Based on Llama 2

Language:PythonLicense:GPL-3.0Stargazers:205Issues:7Issues:6

megatts2

Unoffical implementation of Megatts2

Language:PythonLicense:MITStargazers:183Issues:19Issues:11

FaceTalk

[CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models

Language:ShellLicense:NOASSERTIONStargazers:125Issues:21Issues:2

wav2vec

a simplified version of wav2vec(1.0, vq, 2.0) in fairseq

Auffusion

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:103Issues:0Issues:0

whisper-punctuator

Zero-shot multimodal punctuation insertion and truecasing using Whisper

Language:PythonLicense:MITStargazers:85Issues:5Issues:4
Language:PythonLicense:NOASSERTIONStargazers:81Issues:8Issues:15

unipaint

Code Implementation of "Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model"

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:80Issues:14Issues:4

StoryTTS

[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations

CMG

The official implementation of Achieving Cross Modal Generalization with Multimodal Unified Representation (NeurIPS '23)

Automatic-Prosody-Annotator-with-SSWP-CLAP

An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).

Language:PythonLicense:Apache-2.0Stargazers:39Issues:0Issues:0
Language:PythonLicense:MITStargazers:26Issues:1Issues:1

NEUTART

PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.

Language:PythonLicense:NOASSERTIONStargazers:19Issues:0Issues:0
Language:PythonStargazers:11Issues:2Issues:0