Pranjalya Tiwari's repositories
tts-tortoise-gradio
A Gradio setup for Tortoise TTS.
google-calendar-telegram-bot
A telegram bot for sending daily notifications of the events coming in upcoming days.
AudioLDM2
Text-to-Audio/Music Generation
BasicSR
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.
cinematic-sound-demixing
Cinematic Sound Demixing model and inference serving
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
HierSpeechpp
The official implementation of HierSpeech++
NS2VC
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech (v4)
RepCodec
Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization
rpunct
📝An easy-to-use package to restore punctuation of the text.
SadTalker
(CVPR 2023)SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
sdxl-shopify-monorepo
A monorepo of Shopify's SDXL model
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
segment-anything-video
MetaSeg: Packaged version of the Segment Anything repository
transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
triton-inference-docker
Docker image to run a Sentence Transformer / Transformer model in NVIDIA Triton Inference.
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
UnitSpeech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Enginer
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.