Xinchen Zhang's repositories
BoxDiff-XL
Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)
Long_Video_Generation
A pipeline to generate long videos according to text prompt
Deepfake_and_Anti-Deepfake
This is my final project for the Cognitive Computing course.
Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Language:HTML000
000
Language:TeXMIT000
minisora
The Mini Sora project aims to explore the implementation path and future development direction of Sora.
Language:PythonApache-2.0000
RealCompo
RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models
Language:Python000
Language:JavaScript000