anthonyyuan's repositories

Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

phidata

Add memory, knowledge and tools to LLMs

Language:PythonLicense:MPL-2.0Stargazers:1Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

ChronoDepth

ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors

License:MITStargazers:0Issues:0Issues:0

ComfyUI-MimicMotion

a comfyui custom node for MimicMotion

Stargazers:0Issues:0Issues:0

ControlNeXt

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, LoRA

License:Apache-2.0Stargazers:0Issues:0Issues:0

DepthFlow

🌊 Image to → 2.5D Parallax Effect Video. High quality, user first

License:AGPL-3.0Stargazers:0Issues:0Issues:0

DiffSynth-Studio

Enjoy the magic of Diffusion models!

License:Apache-2.0Stargazers:0Issues:0Issues:0

EditWorld

EditWorld: Simulating World Dynamics for Instruction-Following Image Editing

Stargazers:0Issues:0Issues:0

EvTexture

[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

flowsam

Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman

License:Apache-2.0Stargazers:0Issues:0Issues:0

Gaussian-Wild

Official implementation of the paper "Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections"

Stargazers:0Issues:0Issues:0

Glyph-ByT5

This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering"

Stargazers:0Issues:0Issues:0

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Stargazers:0Issues:0Issues:0

Kolors

Kolors Team

License:Apache-2.0Stargazers:0Issues:0Issues:0

lama

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

License:Apache-2.0Stargazers:0Issues:0Issues:0

MaxKB

💬 基于 LLM 大语言模型的知识库问答系统。开箱即用,支持快速嵌入到第三方业务系统,1Panel 官方出品。

License:GPL-3.0Stargazers:0Issues:0Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

License:Apache-2.0Stargazers:0Issues:0Issues:0

RPG-DiffusionMaster

Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Stargazers:0Issues:0Issues:0

SIGNeRF

SIGNeRF: Scene Integrated Generation for Neural Radiance Fields

Stargazers:0Issues:0Issues:0

t2v-turbo

Code repository for T2V-Turbo

Stargazers:0Issues:0Issues:0

VADER

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.

Stargazers:0Issues:0Issues:0

VideoTetris

VideoTetris: Towards Compositional Text-To-Video Generation

Stargazers:0Issues:0Issues:0

zest_code

This is the official implementation of ZeST

Stargazers:0Issues:0Issues:0