anthonyyuan

followers

following

stars

anthonyyuan's repositories

Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Language:PythonApache-2.0100

phidata

Add memory, knowledge and tools to LLMs

Language:PythonMPL-2.0100

camp_zipnerf

Apache-2.0000

ChronoDepth

ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors

MIT000

ComfyUI-MimicMotion

a comfyui custom node for MimicMotion

000

ControlNeXt

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, LoRA

Apache-2.0000

DepthFlow

🌊 Image to → 2.5D Parallax Effect Video. High quality, user first

AGPL-3.0000

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Apache-2.0000

EditWorld

EditWorld: Simulating World Dynamics for Instruction-Following Image Editing

000

EvTexture

[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Apache-2.0000

Face-Adapter

000

FlashFace

MIT000

flowsam

Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman

Apache-2.0000

Gaussian-Wild

Official implementation of the paper "Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections"

000

Glyph-ByT5

This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering"

000

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

MIT000

ID-Animator

000

Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Apache-2.0000

InstantID-IPAdapter-ControlNet-jupyter

000

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

000

Kolors

Kolors Team

Apache-2.0000

lama

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Apache-2.0000

MaxKB

💬 基于 LLM 大语言模型的知识库问答系统。开箱即用，支持快速嵌入到第三方业务系统，1Panel 官方出品。

GPL-3.0000

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Apache-2.0000

RPG-DiffusionMaster

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

000

SIGNeRF

SIGNeRF: Scene Integrated Generation for Neural Radiance Fields

000

t2v-turbo

Code repository for T2V-Turbo

000

VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

000

VideoTetris

VideoTetris: Towards Compositional Text-To-Video Generation

000

zest_code

This is the official implementation of ZeST

000