Beast code in Giters

[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.

Language:PythonMIT58900

CoDet

(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

Language:Python10200

Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language:PythonApache-2.072600

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

Apache-2.0190000

ControlNet

Let us control diffusion models!

Language:PythonApache-2.02912800

OWLVIT-ONNX-AX650-CPP

Language:C++1700

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonGPL-3.0399200

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonApache-2.0151500

stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

Language:PythonApache-2.0435300

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonBSD-3-Clause1016900

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION571000

vizwiz-fewshot

Convenience API for the VizWiz-FewShot dataset

Language:PythonMIT300

Inpaint-Anything

Inpaint anything using Segment Anything and inpainting models.

Language:Jupyter NotebookApache-2.0592800

awesome-detection-transformer

Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)

122400

recognize-anything

Open-source and strong foundation image recognition models.

Language:Jupyter NotebookApache-2.0260700

Telechat

Language:Python170100