anthonyyuan's repositories
Ctrl-Adapter
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
ChronoDepth
ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors
ComfyUI-MimicMotion
a comfyui custom node for MimicMotion
ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, LoRA
DepthFlow
🌊 Image to → 2.5D Parallax Effect Video. High quality, user first
DiffSynth-Studio
Enjoy the magic of Diffusion models!
EditWorld
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
EvTexture
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
flowsam
Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman
Gaussian-Wild
Official implementation of the paper "Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections"
Glyph-ByT5
This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering"
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Inf-DiT
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
Kolors
Kolors Team
lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
MaxKB
💬 基于 LLM 大语言模型的知识库问答系统。开箱即用,支持快速嵌入到第三方业务系统,1Panel 官方出品。
MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
SIGNeRF
SIGNeRF: Scene Integrated Generation for Neural Radiance Fields
t2v-turbo
Code repository for T2V-Turbo
VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
VideoTetris
VideoTetris: Towards Compositional Text-To-Video Generation
zest_code
This is the official implementation of ZeST