Show Lab (showlab)

Show Lab

showlab

Geek Repo

Home Page:https://sites.google.com/view/showlab

Github PK Tool:Github PK Tool

Show Lab's repositories

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

MotionDirector

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Language:PythonLicense:Apache-2.0Stargazers:744Issues:33Issues:32

X-Adapter

[CVPR 2024] X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

Language:PythonLicense:Apache-2.0Stargazers:699Issues:44Issues:28

DragAnything

[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation

VideoSwap

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

UniVTG

[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding

Language:PythonLicense:MITStargazers:299Issues:5Issues:45

Awesome-MLLM-Hallucination

đź“– A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

BoxDiff

[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

EgoVLP

[NeurIPS2022] Egocentric Video-Language Pretraining

VisorGPT

[NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT

Language:PythonLicense:MITStargazers:128Issues:2Issues:7

videollm-online

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Language:PythonLicense:Apache-2.0Stargazers:85Issues:0Issues:0

T2VScore

T2VScore: Towards A Better Metric for Text-to-Video Generation

sparseformer

(ICLR 2024, CVPR 2024) SparseFormer

Language:PythonLicense:MITStargazers:62Issues:9Issues:3

CLVQA

[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)

Efficient-CLS

[ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video

videogui

official repo of "VideoGUI: A Benchmark for GUI Automation from Instructional Videos"

Language:JavaScriptStargazers:16Issues:0Issues:0

BYOC

[IEEE-VR 2024] Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters

Language:C#Stargazers:14Issues:0Issues:0

LOVA3

The official repo of "Learning to Visual Question Answering, Asking and Assessment"

Language:PythonStargazers:9Issues:0Issues:0

VisInContext

Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

Tune-An-Ellipse

[CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want

Stargazers:4Issues:0Issues:0

DynVideo-E

This is the project page for DynVideo-E.

Language:JavaScriptStargazers:3Issues:4Issues:0
Language:JavaScriptStargazers:2Issues:2Issues:0
Language:HTMLLicense:MITStargazers:1Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

GUI-Narrator

Repository of GUI Action Narrator

Language:JavaScriptStargazers:0Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:3Issues:0