Saurabh Saxena's starred repositories
Awesome-Talking-Face
📖 A curated list of resources dedicated to talking face.
Transcrypt
Python 3.9 to JavaScript compiler - Lean, fast, open!
transcrypt
transparently encrypt files within a git repository
ViewCrafter
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
CogVideoX-Fun
📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.
Arazzo-Specification
The Arazzo Specification - A Tapestry for Deterministic API Workflows
metavoice-src
Foundational model for human-like, expressive TTS
css-demystified-course-material
Course material for CSS Demystified
Prompt-Generator-for-AI-Text-to-Image-Models
A simple, modular, customizable app to help you generate prompts quickly and easily for Stable Diffusion, Midjourney, and Dall-E 2.
meetingsdk-headless-linux-sample
A demo on creating a headless meeting bot using the Zoom Meeting SDK for Linux and Docker
OneTrainer
OneTrainer is a one-stop solution for all your stable diffusion training needs.
tts-generation-webui
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
comfyui-colab
comfyui colabs templates new nodes
SimpleTransfromerTTS
Simple transformer tts
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
PhotoMaker
PhotoMaker [CVPR 2024]
Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model