Beast code in Giters

felixfuu's starred repositories

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonApache-2.031245 199 4849

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonApache-2.012016 101 523

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonApache-2.010315 103 350

Omost

Your image is almost there!

Language:PythonApache-2.07229 44 77

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookApache-2.05030 61 376

ebooks

收藏的一些经典的历史、政治、心理、哲学、数学、计算机方面电子书(约10万本）

Language:JavaScript3975 57 13

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language:PythonNOASSERTION1766 26 46

sd-webui-regional-prompter

set prompt to divided region

Language:PythonAGPL-3.01551 17 233

ControlNeXt

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Language:PythonApache-2.01275 21 58

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonMIT1205 21 54

style-aligned

Official code for "Style Aligned Image Generation via Shared Attention"

Language:PythonApache-2.01197 22 26

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language:PythonApache-2.01185 17 85

MimicBrush

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Language:PythonApache-2.01075 14 20

ovsam

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Language:PythonNOASSERTION908 13 43

MAP-NEO

Language:Python839 10 34

megactor

Language:PythonApache-2.0735 37 23

infinite-zoom-automatic1111-webui

infinite zoom effect extension for AUTOMATIC1111's webui - stable diffusion

Language:PythonMIT654 9 62

X-Pose

[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"

Language:PythonNOASSERTION434 21 28

Awesome-CVPR2024-ECCV2024-AIGC

A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC

413 7 1

Prompt-Diffusion

Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"

Language:PythonApache-2.0371 7 13

Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Language:PythonApache-2.0365 21 27

ReMoDiffuse

ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model

Language:PythonNOASSERTION315 15 21

MoMA

MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation

Language:Jupyter Notebook181 3 10

PSALM

[ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"

Language:PythonApache-2.0174 7 17

UniIR

Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)

Language:PythonMIT93 3 14

Awesome-Open-Vocabulary-Detection-and-Segmentation

Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future

91 20

HQ-Edit

HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing

Language:PythonNOASSERTION70 6 6

ControlCap

[ECCV 2024] ControlCap: Controllable Region-level Captioning

Language:Python49 5 5

Switch-DiT

[ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"

Language:PythonMIT31 3 1

lvlm-interpret

Language:PythonApache-2.030 1 5