Xiaodong Wang's starred repositories
waymo-open-dataset
Waymo Open Dataset
ShareGPT4Video
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
NVS_Solver
Source code for the paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"
OmniTokenizer
OmniTokenizer: one model and one weight for image-video joint tokenization.
EthicalTrajectoryPlanning
An Ethical Trajectory Planning Algorithm for Autonomous Vehicles
Diffusion4D
"Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, Yunchao Wei
titok-pytorch
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"
Recap-DataComp-1B
This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"
richhf-18k
The RichHF-18K dataset contains the rich human feedback labels we collected for our CVPR'24 paper (https://arxiv.org/pdf/2312.10240), along with the file names of the associated labeled images (no URLs or images are included in this dataset).
Learning-Naturalistic-Driving-Environment
This repo contains the code for the paper "Learning naturalistic driving environment with statistical realism"