sce285's starred repositories

Language:PythonStargazers:14Issues:0Issues:0
Stargazers:34Issues:0Issues:0

FreeTalker

Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness (ICASSP 2024)

Language:PythonStargazers:61Issues:0Issues:0

FaceTalk

[CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models

Language:ShellLicense:NOASSERTIONStargazers:196Issues:0Issues:0

DreamScene

DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling

Language:PythonStargazers:69Issues:0Issues:0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:6430Issues:0Issues:0

4DGen

"4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency", Yuyang Yin*, Dejia Xu*, Zhangyang Wang, Yao Zhao, Yunchao Wei

Language:PythonStargazers:213Issues:0Issues:0

ScoreHMR

ScoreHMR: Score-Guided Diffusion for 3D Human Recovery (CVPR 2024)

Language:PythonLicense:MITStargazers:383Issues:0Issues:0
Language:CStargazers:308Issues:0Issues:0

AnimatableGaussians

Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"

Language:PythonLicense:NOASSERTIONStargazers:895Issues:0Issues:0

MonoGS

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Language:PythonLicense:NOASSERTIONStargazers:1282Issues:0Issues:0

CoRA

[CVPR 2024] High-Quality Facial Geometry and Appearance Capture at Home.

Language:PythonStargazers:143Issues:0Issues:0

SLD

🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)

Language:PythonLicense:MITStargazers:147Issues:0Issues:0

GraphDreamer

[CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.

Language:PythonLicense:MITStargazers:160Issues:0Issues:0

Dataset

News: the 10k dataset is ready for download.

Language:HTMLLicense:NOASSERTIONStargazers:279Issues:0Issues:0

GaussianDreamer

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)

Language:PythonLicense:Apache-2.0Stargazers:646Issues:0Issues:0
Language:Jupyter NotebookStargazers:56Issues:0Issues:0

PAIR-Diffusion

[CVPR 2024] PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor

Language:PythonLicense:MITStargazers:498Issues:0Issues:0

Peekaboo

Interactive Video Generation via Masked-Diffusion

Language:PythonLicense:MITStargazers:64Issues:0Issues:0

GaussianEditor

[CVPR 2024] GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting

Language:C++License:NOASSERTIONStargazers:1072Issues:0Issues:0
Language:PythonStargazers:5Issues:0Issues:0

4DGaussians

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2110Issues:0Issues:0
Language:PythonStargazers:69Issues:0Issues:0

ubisoft-laforge-FFHQ-UV-Intrinsics

FFHQ-UV-Intrinstics: A dataset containing intrinsic face decomposition for 10k subjects of FFHQ-UV

License:NOASSERTIONStargazers:33Issues:0Issues:0

PHC

Official Implementation of the ICCV 2023 paper: Perpetual Humanoid Control for Real-time Simulated Avatars

Language:PythonLicense:NOASSERTIONStargazers:437Issues:0Issues:0

audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Language:PythonLicense:MITStargazers:419Issues:0Issues:0

ChatRTX

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

Language:TypeScriptLicense:NOASSERTIONStargazers:2672Issues:0Issues:0

MoneyPrinter

Automate Creation of YouTube Shorts using MoviePy.

Language:PythonLicense:MITStargazers:10120Issues:0Issues:0

weblinx

WebLINX is a benchmark for building web navigation agents with conversational capabilities

Language:PythonLicense:Apache-2.0Stargazers:112Issues:0Issues:0

ComfyUI-3D-Pack

An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)

Language:PythonLicense:MITStargazers:2207Issues:0Issues:0