Xiangtai Li's starred repositories
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Lumina-T2X
Lumina-T2X is a unified framework for text-to-any-modality generation
Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
Awesome-Segmentation-With-Transformer
(arXiv, April 2023) Transformer-Based Visual Segmentation: A Survey
Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLM-based multimodal generation (image, video, 3D, and audio).
betrayed-by-captions
(ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
PointCloudMamba
Point Cloud Mamba: Point Cloud Learning via State Space Model