Beast code in Giters

Show Lab's repositories

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

2800 124 17

MotionDirector

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Language:PythonApache-2.0744 33 32

X-Adapter

[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

Language:PythonApache-2.0699 44 28

DragAnything

[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation

Language:Python368 17 20

VideoSwap

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

321 31 2

UniVTG

[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding

Language:PythonMIT299 5 45

Awesome-MLLM-Hallucination

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

257 5 7

BoxDiff

[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

Language:Python227 4 14

EgoVLP

[NeurIPS2022] Egocentric Video-Language Pretraining

Language:Python218 3 27

VisorGPT

[NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT

Language:PythonMIT128 2 7

videollm-online

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Language:PythonApache-2.08500

T2VScore

T2VScore: Towards A Better Metric for Text-to-Video Generation

73 8 2

cosmo

Language:Python69 4 2

sparseformer

(ICLR 2024, CVPR 2024) SparseFormer

Language:PythonMIT62 9 3

CLVQA

[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)

Language:Python34 4 4

Long-form-Video-Prior

Language:Python22 3 1

assistgui

Language:JavaScript18 11 1

Efficient-CLS

[ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video

Language:Python16 5 5

videogui

official repo of "VideoGUI: A Benchmark for GUI Automation from Instructional Videos"

Language:JavaScript1600

BYOC

[IEEE-VR 2024] Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters

Language:C#1400

LOVA3

The official repo of "Learning to Visual Question Answering, Asking and Assessment"

Language:Python900

VisInContext

Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

Language:Python6 3 1

RingID

5 20

Tune-An-Ellipse

[CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want

400

DynVideo-E

This is the project page for DynVideo-E.

Language:JavaScript3 40

magicanimate

Language:JavaScript2 20

cvpr2024-tutorial-video-diffusion-models

Language:HTMLMIT100

AssistGaze

Language:Python000

GUI-Narrator

Repository of GUI Action Narrator

Language:JavaScript000

Moonshot

Language:JavaScript030