wileewang

followers

following

stars

Hong Kong University of Science and Technology (Guangzhou)

wileewang.github.io

Luozhou Wang's starred repositories

Awesome-Video-Datasets

Video datasets

diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Language:PythonNOASSERTION40000

Defect_Spectrum

Defect Spectrum: A Granular Look of Large-Scale Defect Datasets with Rich Semantics [ECCV2024]

Language:PythonApache-2.02300

SEED-Story

SEED-Story: Multimodal Long Story Generation with Large Language Model

Language:PythonNOASSERTION62500

FilmRemoval

[CVPR 2024] Official Implementation of Learning to Remove Wrinkled Transparent Film with Polarized Prior

Language:PythonMIT2500

via-video

2200

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonNOASSERTION297400

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonApache-2.0311100

mvdream_diffusers

A unified diffusers implementation for MVDream and ImageDream

Language:Python8200

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonMIT1106800

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonMIT389800

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.02103400

diffusion-motion-transfer

Official Pytorch Implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer""

Language:Python13400

MotionDirector

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Language:PythonApache-2.077000

DDSM

Denoising Diffusion Step-aware Models (ICLR2024)

Language:PythonMIT5000

MotionInversion

Language:Python6700

Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Language:PythonApache-2.0300100

HumanML3D

HumanML3D: A large and diverse 3d human motion-language dataset.

Language:PythonMIT69800

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonMIT50800

Video-Swin-Transformer

This is an official implementation for "Video Swin Transformers".

Language:PythonApache-2.0138300

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonApache-2.01531900

MotionCtrl

Official Code for MotionCtrl [SIGGRAPH 2024]

Language:PythonApache-2.0120700

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonNOASSERTION264600

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonBSD-3-Clause1027400

Decompose-and-Realign

Language:Python2100

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonMIT109500

LucidDreamer

Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"

Language:PythonMIT72200

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:PythonGPL-3.05952300

langchain-gpt4free

LangChain x gpt4free

Language:PythonMIT16800

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models