wgqtmac

wgqtmac

Geek Repo

Company:Shanghai Jiao Tong University

Location:Shanghai

Github PK Tool:Github PK Tool

wgqtmac's starred repositories

ShapeSplat-Gaussian_MAE

Offical implementation of work: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining

Stargazers:33Issues:0Issues:0

ESAM

EmbodiedSAM: Online Segment Any 3D Thing in Real Time

Language:PythonStargazers:56Issues:0Issues:0

Vista

A Generalizable World Model for Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:440Issues:0Issues:0

co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2637Issues:0Issues:0

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonLicense:MITStargazers:1666Issues:0Issues:0

Lumina-mGPT

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Language:PythonStargazers:414Issues:0Issues:0

AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5984Issues:0Issues:0

LimSim

LimSim & LimSim++: Integrated traffic and autonomous driving simulators with (M)LLM support

Language:PythonLicense:GPL-3.0Stargazers:368Issues:0Issues:0

nano-llama31

nanoGPT style version of Llama 3.1

Language:PythonStargazers:1068Issues:0Issues:0

CogVideo

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language:PythonLicense:Apache-2.0Stargazers:5814Issues:0Issues:0

Diffusion4D

"Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, Yunchao Wei

Language:PythonStargazers:199Issues:0Issues:0

AVDC

Official repository of Learning to Act from Actionless Videos through Dense Correspondences.

Language:PythonLicense:MITStargazers:153Issues:0Issues:0

Embodied_AI_Paper_List

[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI

Stargazers:359Issues:0Issues:0

mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Language:PythonLicense:MITStargazers:601Issues:0Issues:0

DSCL

AAAI-24 Decoupled Contrastive Learning for Long-Tailed Recognition

Language:PythonStargazers:14Issues:0Issues:0

Awesome-World-Model

Collect some World Models for Autonomous Driving papers.

Stargazers:360Issues:0Issues:0

diff-sampler

An open-source toolbox for fast sampling of diffusion models. Official implementations for our [CVPR-2024, ICML-2024] papers

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:153Issues:0Issues:0

autoregressive-diffusion-pytorch

Implementation of Autoregressive Diffusion in Pytorch

Language:PythonLicense:MITStargazers:238Issues:0Issues:0

sam-hq

Segment Anything in High Quality [NeurIPS 2023]

Language:PythonLicense:Apache-2.0Stargazers:3619Issues:0Issues:0
Language:PythonStargazers:77Issues:0Issues:0

FollowYourEmoji

[ArXiv 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"

Language:PythonStargazers:278Issues:0Issues:0
Language:PythonStargazers:147Issues:0Issues:0

dino-tracker

Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”

Language:PythonLicense:MITStargazers:349Issues:0Issues:0

ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language:PythonLicense:MITStargazers:935Issues:0Issues:0

4Diffusion

Official code for 4Diffusion: Multi-view Video Diffusion Model for 4D Generation.

Language:PythonLicense:Apache-2.0Stargazers:71Issues:0Issues:0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language:PythonLicense:Apache-2.0Stargazers:1666Issues:0Issues:0

mvsplat

🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images

Language:PythonLicense:MITStargazers:665Issues:0Issues:0

ControlNeXt

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Language:PythonLicense:Apache-2.0Stargazers:990Issues:0Issues:0

LiDAR4D

💫 [CVPR 2024] LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis

Language:PythonLicense:Apache-2.0Stargazers:131Issues:0Issues:0