zachary's starred repositories
diffusion-forcing
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Forge_VFM4AD
A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.
MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
DiffSynth-Studio
Enjoy the magic of Diffusion models!
awesome-end-to-end-autonomous-driving
A curated list of awesome End-to-End Autonomous Driving resources (continually updated)
Decouple-Traj
Official Code for "Fully Decoupling Trajectory and Scene Encoding for Lightweight Heatmap-oriented Trajectory Prediction"
ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
TrafficBotsV1.5
TrafficBots V1.5: TrafficBots + HPTR. 3rd place solution for Waymo Open Sim Agent Challenge 2024.
StreamSpeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
SparseDrive
SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
RAG-Driver
A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-to-end driving