xyxxmb's starred repositories
aesthetic-predictor-v2-5
SigLIP-based Aesthetic Score Predictor
HiDiffusion
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
Glyph-ByT5
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering""
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
PuLID_ComfyUI
PuLID native implementation for ComfyUI
MagicDance
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
StoryDiffusion
Create Magic Story!
comfyui-portrait-master-zh-cn
肖像大师 中文版 comfyui-portrait-master
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
ComfyUI-YoloWorld-EfficientSAM
Unofficial implementation of YOLO-World + EfficientSAM for ComfyUI
Autonomous-Agents
Autonomous Agents (LLMs) research papers. Updated Daily.
Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything