jjdbear's starred repositories
RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Fewshot_Detection
Few-shot Object Detection via Feature Reweighting
VideoMamba
VideoMamba: State Space Model for Efficient Video Understanding
daclip-uir
[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.
Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
ControlNet
Let us control diffusion models!
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
stable-diffusion-videos
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
vizwiz-fewshot
Convenience API for the VizWiz-FewShot dataset
Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
awesome-detection-transformer
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
recognize-anything
Open-source and strong foundation image recognition models.
repaint123
Official implementation of Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting
Open-AnimateAnyone
Unofficial Implementation of Animate Anyone