Hay Kim's repositories
flux
Official inference repo for FLUX.1 models
MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
CogVideo
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
AnyV2V
A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
ControlNeXt
Controllable video and image generation: SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
GOT-OCR2.0-
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Cinemo
Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models
FollowYourEmoji
[SIGGRAPH Asia 2024] Official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Monkey
[CVPR 2024] Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
manga-image-translator
Translate manga/images (one-click translation of text in all kinds of images) https://cotrans.touhou.ai/
EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
UniAnimate
Code for the paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation"
mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
UniPortrait
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalizations
Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image (uncensored)
LivePortrait
Make one portrait alive!
ComfyUI-RefUNet
A set of nodes to use Reference UNets
SimpleTuner
A general fine-tuning kit geared toward Stable Diffusion 2.1, Stable Diffusion 3, DeepFloyd, and SDXL.
MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level MLLM on Your Phone
OpenDiT
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
MotionBooth
The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"