HuaZheLei's starred repositories
PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
aesthetic-predictor
A linear estimator on top of clip to predict the aesthetic quality of pictures
Youku-mPLUG
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
DriveDreamer
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
WorldDreamer
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
TransCore-M
Large Multimodal Model