Qiuyue Wang's starred repositories

RingAttention

Transformers with Arbitrarily Large Context

Language:PythonLicense:Apache-2.0Stargazers:545Issues:0Issues:0

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonLicense:NOASSERTIONStargazers:2567Issues:0Issues:0

CVQ-VAE

[ICCV 2023] Online Clustered Codebook

Language:PythonLicense:MITStargazers:114Issues:0Issues:0

MaskGIT-pytorch

Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)

Language:PythonLicense:MITStargazers:387Issues:0Issues:0

VQGAN-pytorch

Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)

Language:PythonLicense:MITStargazers:381Issues:0Issues:0

tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Language:PythonLicense:Apache-2.0Stargazers:114Issues:0Issues:0

TATS

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Language:PythonLicense:MITStargazers:246Issues:0Issues:0

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10139Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:621Issues:0Issues:0

SciGraphQA

SciGraphQA

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:34Issues:0Issues:0

MotionCtrl

MotionCtrl: A Unified and Flexible Motion Controller for Video Generation

Language:PythonLicense:Apache-2.0Stargazers:1104Issues:0Issues:0

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonLicense:MITStargazers:440Issues:0Issues:0

particle-sfm

ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild. ECCV 2022.

Language:C++License:GPL-3.0Stargazers:237Issues:0Issues:0

zero123

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Language:PythonLicense:MITStargazers:2525Issues:0Issues:0

One-2-3-45

[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization"

Language:PythonLicense:Apache-2.0Stargazers:1461Issues:0Issues:0

improved-aesthetic-predictor

CLIP+MLP Aesthetic Score Predictor

Language:PythonLicense:Apache-2.0Stargazers:732Issues:0Issues:0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:22777Issues:0Issues:0

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:22752Issues:0Issues:0

TokenFlow

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)

Language:PythonLicense:MITStargazers:1485Issues:0Issues:0

Fooocus

Focus on prompting and generating

Language:PythonLicense:GPL-3.0Stargazers:36635Issues:0Issues:0

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:PythonLicense:MITStargazers:36764Issues:0Issues:0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:2838Issues:0Issues:0

gill

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:388Issues:0Issues:0

guided-inpainting

Towards Unified Keyframe Propagation Models

Language:PythonLicense:MITStargazers:231Issues:0Issues:0

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:10794Issues:0Issues:0

MotionGPT

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

Language:PythonLicense:MITStargazers:1278Issues:0Issues:0

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Language:Jupyter NotebookLicense:MITStargazers:7482Issues:0Issues:0

video-question-answering

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Language:PythonLicense:MITStargazers:138Issues:0Issues:0

HomE

HomE: Homography-Equivariant Video Representation Learning

Stargazers:5Issues:0Issues:0

Fantasia3D

(ICCV2023) official repository for "Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation"

Language:PythonLicense:Apache-2.0Stargazers:678Issues:0Issues:0