Bo Pan's starred repositories
stable-diffusion
A latent text-to-image diffusion model
generative-models
Generative Models by Stability AI
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
learning_research
本人的科研经验
frame-interpolation
FILM: Frame Interpolation for Large Motion, In ECCV 2022.
Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
RAG-Survey
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
llm-paper-daily
Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个
blended-latent-diffusion
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
DragAnything
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
TaleCrafter
[SIGGRAPH Asia 2023] An interactive story visualization tool that support multiple characters
paper_downloader
Download papers and supplemental materials from open-access paper website, such as AAAI, AISTATS, COLT, CORL, CVPR, ECCV, ICCV, ICLR, ICML, IJCAI, JMLR, NIPS, RSS, WACV.
NVS_Solver
Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"
Diffusion4D
"Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, Yunchao Wei
MotionMaster
[ACM MM24] MotionMaster: Training-free Camera Motion Transfer For Video Generation
RealEstate10K_Downloader
These scripts are used to download RealEstate10K dataset.