ZhaoQiiii's starred repositories
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
SyncDreamer
[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
OpenLane-V2
[NeurIPS 2023 Track Datasets and Benchmarks] OpenLane-V2: The First Perception and Reasoning Benchmark for Road Driving
LLMRiddles
Open-Source Reproduction/Demo of the LLM Riddles Game
InterpAny-Clearer
[ECCV2024 Oral] Clearer anytime frame interpolation & Manipulated interpolation of anything
Video-Bench
A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!
CodeMorpheus
CodeMorpheus: Generate code self-portraits with one click(一键生成代码自画像,决策型 AI + 生成式 AI)