Show Lab's repositories
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, and various other applications.
computer_use_ootb
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
Awesome-GUI-Agent
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
Awesome-MLLM-Hallucination
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
Awesome-Unified-Multimodal-Models
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
PhotoDoodle
Code Implementation of "PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data"
MakeAnything
Official code of "MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation"
MovieAgent
MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning
Awesome-Robotics-Diffusion
(In progress) A curated list of recent robot learning papers incorporating diffusion models for robotics tasks.
GUI-Thinker
Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.
MovieBench
[CVPR 2025] A Hierarchical Movie Level Dataset for Long Video Generation