wgqtmac

"Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, Yunchao Wei

Language:Python19900

AVDC

Official repository of Learning to Act from Actionless Videos through Dense Correspondences.

Language:PythonMIT15300

General-World-Models-Survey

MIT22500

Embodied_AI_Paper_List

[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI

35900

mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Language:PythonMIT60100

DSCL

AAAI-24 Decoupled Contrastive Learning for Long-Tailed Recognition

Language:Python1400

Awesome-World-Model

Collect some World Models for Autonomous Driving papers.

36000

diff-sampler

An open-source toolbox for fast sampling of diffusion models. Official implementations for our [CVPR-2024, ICML-2024] papers

Language:Jupyter NotebookApache-2.015300

autoregressive-diffusion-pytorch

Implementation of Autoregressive Diffusion in Pytorch

Language:PythonMIT23800

sam-hq

Segment Anything in High Quality [NeurIPS 2023]

Language:PythonApache-2.0361900

Efficient4D

Language:Python7700

FollowYourEmoji

[ArXiv 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"

Language:Python27800

Generalizable-BEV

Language:Python14700

dino-tracker

Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”

Language:PythonMIT34900

ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language:PythonMIT93500

4Diffusion

Official code for 4Diffusion: Multi-view Video Diffusion Model for 4D Generation.

Language:PythonApache-2.07100

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language:PythonApache-2.0166600

mvsplat

🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images

Language:PythonMIT66500

ControlNeXt

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Language:PythonApache-2.099000

LiDAR4D

💫 [CVPR 2024] LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis

Language:PythonApache-2.013100