Cominclip

Xinchen Zhang's repositories

Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)

Language:Python1200

Official code for "Recurrent Progressive Fusion-based Learning for Multi-source Remote Sensing Image Classification"

Language:PythonMIT8 20

A pipeline to generate long videos according to text prompt

Language:Python5 10

Language:Python1 10

This is my final project for the Cognitive Computing course.

Language:Python100

Language:C++010

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

Language:HTML000

000

Language:TeXMIT000

The Mini Sora project aims to explore the implementation path and future development direction of Sora.

Language:PythonApache-2.0000

RealCompo: Dynamic Equilibrium between Realism and Compositionality Improves Text-to-Image Diffusion Models

Language:Python000

Language:JavaScript000