OliviaWang123456

Qiuyue Wang's starred repositories

RingAttention

Transformers with Arbitrarily Large Context

Language: Python | License: Apache-2.0 | 54500

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python | License: NOASSERTION | 256700

CVQ-VAE

[ICCV 2023] Online Clustered Codebook

Language: Python | License: MIT | 11400

MaskGIT-pytorch

PyTorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)

Language: Python | License: MIT | 38700

VQGAN-pytorch

PyTorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)

Language: Python | License: MIT | 38100

tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Language: Python | License: Apache-2.0 | 11400

TATS

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Language: Python | License: MIT | 24600

InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Language: Python | License: Apache-2.0 | 1013900

GPT4V-Image-Captioner

Language: Python | License: GPL-3.0 | 62100

SciGraphQA

SciGraphQA

Language: Jupyter Notebook | License: Apache-2.0 | 3400

MotionCtrl

MotionCtrl: A Unified and Flexible Motion Controller for Video Generation

Language: Python | License: Apache-2.0 | 110400

magvit2-pytorch

Implementation of MagViT2 Tokenizer in PyTorch

Language: Python | License: MIT | 44000

particle-sfm

ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild. ECCV 2022.

Language: C++ | License: GPL-3.0 | 23700

zero123

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Language: Python | License: MIT | 252500

One-2-3-45

[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization"

Language: Python | License: Apache-2.0 | 146100

improved-aesthetic-predictor

CLIP+MLP Aesthetic Score Predictor

Language: Python | License: Apache-2.0 | 73200

CLIP

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | 2277700

generative-models

Generative Models by Stability AI

Language: Python | License: MIT | 2275200

TokenFlow

Official PyTorch implementation of "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" (ICLR 2024)

Language: Python | License: MIT | 148500

Fooocus

Focus on prompting and generating

Language: Python | License: GPL-3.0 | 3663500

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Python | License: MIT | 3676400

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python | License: Apache-2.0 | 283800

gill

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Language: Jupyter Notebook | License: Apache-2.0 | 38800

guided-inpainting

Towards Unified Keyframe Propagation Models

Language: Python | License: MIT | 23100

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Jupyter Notebook | License: MIT | 1079400

MotionGPT

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

Language: Python | License: MIT | 127800

Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Language: Jupyter Notebook | License: MIT | 748200

video-question-answering

Video Question Answering via Gradually Refined Attention over Appearance and Motion

Language: Python | License: MIT | 13800

HomE

HomE: Homography-Equivariant Video Representation Learning

500

Fantasia3D

(ICCV 2023) Official repository for "Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation"

Language: Python | License: Apache-2.0 | 67800