felixfuu's starred repositories
infinite-zoom-automatic1111-webui
infinite zoom effect extension for AUTOMATIC1111's webui - stable diffusion
MS-Diffusion
Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
ReMoDiffuse
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model
Awesome-Open-Vocabulary-Detection-and-Segmentation
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Prompt-Diffusion
Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
cv-arxiv-daily
🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)
mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
llmblueprint
[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"
scaling_on_scales
When do we not need larger vision models?
InstructCV
[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"