Hay Kim's repositories
AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.
AnimateDiff
Official implementation of AnimateDiff.
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
AnyV2V
A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
BrushNet
The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
ControlNet_Plus_Plus
Inference code for: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ctrl-Adapter
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
DragNUWA
图像编辑
FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Dough
Dough is a open source tool for steering AI animations with precision.
facefusion
Next generation face swapper and enhancer
img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
MiniCPM
MiniCPM-2B: An end-side LLM outperforms Llama2-13B.
MiniGemini
Official implementation for Mini-Gemini
MoneyPrinterTurbo
利用大模型,一键生成短视频
Monkey
【CVPR 2024】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
TableStructureRec
整理目前开源的表格识别模型,完善前后处理,模型转换为ONNX
VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information