zachytong

zachary's starred repositories

diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Language: Python · NOASSERTION · 52000

ml-4m

4M: Massively Multimodal Masked Modeling

Language: Python · Apache-2.0 · 156800

Forge_VFM4AD

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

22200

LCSim

LCSim: A Large-Scale Controllable Traffic Simulator

Language: Jupyter Notebook · MIT · 4600

CAT-DM

CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model

Language: Python · 11000

MOFA-Video

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Language: Python · NOASSERTION · 59700

The_Prompt_Report

Language: HTML · MIT · 28100

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language: Python · Apache-2.0 · 640300

awesome-end-to-end-autonomous-driving

A curated list of awesome End-to-End Autonomous Driving resources (continually updated)

Apache-2.0 · 37900

Decouple-Traj

Official Code for "Fully Decoupling Trajectory and Scene Encoding for Lightweight Heatmap-oriented Trajectory Prediction"

MIT · 200

LLM101n

LLM101n: Let's build a Storyteller

2914200

plant

[CoRL'22] PlanT: Explainable Planning Transformers via Object-Level Representations

Language: Python · MIT · 22100

SMART

[NeurIPS 2024] SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction

Language: Python · Apache-2.0 · 4200

ml-stable-diffusion

Stable Diffusion with Core ML on Apple Silicon

Language: Python · MIT · 1673500

map4d

Photo-realistic mapping of dynamic urban areas

Language: Python · Apache-2.0 · 20500

Hydra-MDP

25200

ctrl-sim

Official repository for "CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning"

MIT · 2000

SimGen

Simulator-conditioned Driving Scene Generation

4700

TrafficBotsV1.5

TrafficBots V1.5: TrafficBots + HPTR. 3rd place solution for Waymo Open Sim Agent Challenge 2024.

Language: Python · NOASSERTION · 1800

Hydra-MDP

The official repository of Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

2500

LAW

Enhancing End-to-End Autonomous Driving with Latent World Model

MIT · 7600

StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Language: Python · MIT · 88500

scepter

SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.

Language: Python · Apache-2.0 · 36900

SparseDrive

SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation

Language: Python · MIT · 31800

RAG-Driver

A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-to-end driving

Language: Python · Apache-2.0 · 6700

AD-H

1200

streamv2v

Official PyTorch implementation of StreamV2V.

Language: Python · NOASSERTION · 43500

P-MapNet

Received by RAL

Language: Python · GPL-3.0 · 16900

3D-GCL

Language: Python · 600

CASD

Official Implementation for "Cross Attention Based Style Distribution for Controllable Person Image Synthesis" (ECCV 2022)

Language: Python · 6300