Pranjalya Tiwari's repositories
google-calendar-telegram-bot
A telegram bot for sending daily notifications of the events coming in upcoming days.
Auto1111SDK
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
BasicSR
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
ffmpeg-gpu-runpod-template
GPU enabled FFMPEG template for Runpod
ffmpeg_cuda
CUDA compiled ffmpeg for CUDA enabled GPU-utilizing encoding.
Fooocus
Focus on prompting and generating
generative-ai-python
The Gemini API Python SDK enables developers to use Google's state-of-the-art generative AI models to build AI-powered features and applications.
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
HierSpeechpp
The official implementation of HierSpeech++
NS2VC
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech (v4)
pranjalya.github.io
Portfolio website
RepCodec
Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization
SadTalker
(CVPR 2023)SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
sd-webui-controlnet
WebUI extension for ControlNet
sdxl-shopify-monorepo
A monorepo of Shopify's SDXL model
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
segment-anything-video
MetaSeg: Packaged version of the Segment Anything repository
stable-diffusion-webui
Stable Diffusion web UI
triton-inference-docker
Docker image to run a Sentence Transformer / Transformer model in NVIDIA Triton Inference.
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Enginer
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.