bwhwang's starred repositories

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it. Completely separate team and codebase from the AI Web App builder https://gptengineer.app

Language:PythonLicense:MITStargazers:51626Issues:501Issues:472

DeepFaceLive

Real-time face swap for PC streaming or video calls

Language:PythonLicense:GPL-3.0Stargazers:25484Issues:357Issues:144

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonLicense:MITStargazers:22151Issues:165Issues:1557

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21241Issues:178Issues:440

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language:PythonLicense:MITStargazers:18123Issues:208Issues:379

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonLicense:Apache-2.0Stargazers:4417Issues:62Issues:177

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language:PythonLicense:MITStargazers:3539Issues:175Issues:109

motion-diffusion-model

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Language:PythonLicense:MITStargazers:3019Issues:68Issues:201

MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Language:PythonLicense:NOASSERTIONStargazers:2024Issues:41Issues:56

pytorch-cpp

C++ Implementation of PyTorch Tutorials for Everyone

Language:C++License:MITStargazers:1919Issues:51Issues:65

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language:Jupyter NotebookStargazers:1536Issues:21Issues:36

ConsistentID

Customized ID Consistent for human

Language:PythonLicense:MITStargazers:825Issues:32Issues:44

HiDiffusion

[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:718Issues:6Issues:30
Language:PythonLicense:Apache-2.0Stargazers:646Issues:33Issues:22

UniAnimate

Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".

Language:PythonLicense:AGPL-3.0Stargazers:481Issues:17Issues:36

photoswap

Official implementation of the NeurIPS 2023 paper "Photoswap: Personalized Subject Swapping in Images"

Language:Jupyter NotebookLicense:MITStargazers:338Issues:25Issues:14

ComfyUI-IC-Light-Wrapper

Wraps the IC-Light Diffuser demo to a ComfyUI node

Language:PythonLicense:Apache-2.0Stargazers:323Issues:6Issues:12

MiraData

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Language:PythonLicense:GPL-3.0Stargazers:323Issues:14Issues:14

DesignEdit

Code for DesignEdit

Language:PythonLicense:MITStargazers:294Issues:9Issues:6

ComfyUI_VisualStylePrompting

ComfyUI Version of "Visual Style Prompting with Swapping Self-Attention"

Language:PythonLicense:Apache-2.0Stargazers:269Issues:8Issues:22

Perturbed-Attention-Guidance

Official implementation of "Perturbed-Attention Guidance"

Language:Jupyter NotebookLicense:MITStargazers:235Issues:2Issues:0

swap-anything

"SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing"

InTeX

Interactive Text-to-Texture Synthesis via Unified Depth-aware Inpainting.

MultiPly

MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild (CVPR2024 Oral)

ASH

The Training and Demo code for: Ash: Animatable gaussian splats for efficient and photoreal human rendering (CVPR 2024)

MotionChain

MotionChain: Conversational Motion Controllers via Multimodal Prompts

Geo-SRF

Geometry Transfer for Stylizing Radiance Fields

nvidia-human-ai-lipsync

This project is a digital human that can talk to you and is animated based on your questions. It uses the Nvidia API endpoint Meta llama3-70b to generate responses, Eleven Labs to generate voice and Rhubarb Lip Sync to generate the lip sync.

Language:JavaScriptLicense:MITStargazers:23Issues:1Issues:0