sausax

followers

following

stars

Saurabh Saxena's starred repositories

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Language:PythonApache-2.0255000

Awesome-Talking-Face

📖 A curated list of resources dedicated to talking face.

MIT128800

nvkind

Language:Go4500

kr8s

A batteries-included Python client library for Kubernetes that feels familiar for folks who already know how to use kubectl

Language:PythonBSD-3-Clause80300

Transcrypt

Python 3.9 to JavaScript compiler - Lean, fast, open!

Language:PythonApache-2.0284900

transcrypt

transparently encrypt files within a git repository

Language:ShellMIT146100

cogstudio

Language:Python20900

ViewCrafter

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Language:PythonApache-2.075200

CogVideoX-Fun

📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.

Language:PythonApache-2.030200

void

Language:TypeScriptMIT723400

Arazzo-Specification

The Arazzo Specification - A Tapestry for Deterministic API Workflows

Language:JavaScriptApache-2.019600

metavoice-src

Foundational model for human-like, expressive TTS

Language:PythonApache-2.0377200

htpy

Generate HTML in Python

Language:PythonMIT23100

launchr

Launchr is an open source SaaS starter kit, based on Django.

Language:HTMLMIT23400

css-demystified-course-material

Course material for CSS Demystified

Language:HTML7900

Prompt-Generator-for-AI-Text-to-Image-Models

A simple, modular, customizable app to help you generate prompts quickly and easily for Stable Diffusion, Midjourney, and Dall-E 2.

Language:JavaScriptGPL-3.04100

meetingsdk-headless-linux-sample

A demo on creating a headless meeting bot using the Zoom Meeting SDK for Linux and Docker

Language:C++MIT2800

OneTrainer

OneTrainer is a one-stop solution for all your stable diffusion training needs.

Language:PythonAGPL-3.0167200

tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

Language:TypeScriptMIT168300

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookApache-2.03771600

comfyui-colab

comfyui colabs templates new nodes

Language:Jupyter NotebookUnlicense36900

SimpleTransfromerTTS

Simple transformer tts

Language:Python2900

Omost

Your image is almost there!

Language:PythonApache-2.0724600

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookNOASSERTION751800

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonApache-2.0454300

OMG

[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models

Language:Python62200

PhotoMaker

PhotoMaker [CVPR 2024]

Language:Jupyter NotebookNOASSERTION939600

SuGaR

[CVPR 2024] Official PyTorch implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering

Language:C++NOASSERTION216700

Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Language:PythonApache-2.0310400

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonBSD-3-Clause1039700