zachary (zachytong)

Location: Shanghai

zachary's starred repositories

diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Language: Python, License: NOASSERTION, Stargazers: 520, Issues: 0

ml-4m

4M: Massively Multimodal Masked Modeling

Language: Python, License: Apache-2.0, Stargazers: 1568, Issues: 0

Forge_VFM4AD

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

Stargazers: 222, Issues: 0

LCSim

LCSim: A Large-Scale Controllable Traffic Simulator

Language: Jupyter Notebook, License: MIT, Stargazers: 46, Issues: 0

CAT-DM

CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model

Language: Python, Stargazers: 110, Issues: 0

MOFA-Video

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Language: Python, License: NOASSERTION, Stargazers: 597, Issues: 0
Language: HTML, License: MIT, Stargazers: 281, Issues: 0

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language: Python, License: Apache-2.0, Stargazers: 6403, Issues: 0

awesome-end-to-end-autonomous-driving

A curated list of awesome End-to-End Autonomous Driving resources (continually updated)

License: Apache-2.0, Stargazers: 379, Issues: 0

Decouple-Traj

Official Code for "Fully Decoupling Trajectory and Scene Encoding for Lightweight Heatmap-oriented Trajectory Prediction"

License: MIT, Stargazers: 2, Issues: 0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers: 29142, Issues: 0

plant

[CoRL'22] PlanT: Explainable Planning Transformers via Object-Level Representations

Language: Python, License: MIT, Stargazers: 221, Issues: 0

SMART

[NeurIPS 2024] SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction

Language: Python, License: Apache-2.0, Stargazers: 42, Issues: 0

ml-stable-diffusion

Stable Diffusion with Core ML on Apple Silicon

Language: Python, License: MIT, Stargazers: 16735, Issues: 0

map4d

Photo-realistic mapping of dynamic urban areas

Language: Python, License: Apache-2.0, Stargazers: 205, Issues: 0
Stargazers: 252, Issues: 0

ctrl-sim

Official repository for "CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning"

License: MIT, Stargazers: 20, Issues: 0

SimGen

Simulator-conditioned Driving Scene Generation

Stargazers: 47, Issues: 0

TrafficBotsV1.5

TrafficBots V1.5: TrafficBots + HPTR. 3rd place solution for Waymo Open Sim Agent Challenge 2024.

Language: Python, License: NOASSERTION, Stargazers: 18, Issues: 0

Hydra-MDP

The official repository of Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

Stargazers: 25, Issues: 0

LAW

Enhancing End-to-End Autonomous Driving with Latent World Model

License: MIT, Stargazers: 76, Issues: 0

StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Language: Python, License: MIT, Stargazers: 885, Issues: 0

scepter

SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.

Language: Python, License: Apache-2.0, Stargazers: 369, Issues: 0

SparseDrive

SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation

Language: Python, License: MIT, Stargazers: 318, Issues: 0

RAG-Driver

A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-to-end driving

Language: Python, License: Apache-2.0, Stargazers: 67, Issues: 0
Stargazers: 12, Issues: 0

streamv2v

Official PyTorch implementation of StreamV2V.

Language: Python, License: NOASSERTION, Stargazers: 435, Issues: 0

P-MapNet

Accepted by RA-L

Language: Python, License: GPL-3.0, Stargazers: 169, Issues: 0
Language: Python, Stargazers: 6, Issues: 0

CASD

Official Implementation for "Cross Attention Based Style Distribution for Controllable Person Image Synthesis" (ECCV 2022)

Language: Python, Stargazers: 63, Issues: 0