Hay Kim (TongHengcheng)

TongHengcheng

Geek Repo

Company:Aire

Github PK Tool:Github PK Tool

Hay Kim's repositories

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Alibaba DAMO Academy.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

AnyV2V

A Plug-and-Play Framework For Any Video-to-Video Editing Tasks

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

BrushNet

The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ConsistI2V

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ControlNet_Plus_Plus

Inference code for: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DragNUWA

图像编辑

License:MITStargazers:0Issues:0Issues:0

FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Dough

Dough is a open source tool for steering AI animations with precision.

License:NOASSERTIONStargazers:0Issues:0Issues:0

facefusion

Next generation face swapper and enhancer

License:NOASSERTIONStargazers:0Issues:0Issues:0

img2img-turbo

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

License:MITStargazers:0Issues:0Issues:0

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Stargazers:0Issues:0Issues:0

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

License:Apache-2.0Stargazers:0Issues:0Issues:0

MiniCPM

MiniCPM-2B: An end-side LLM outperforms Llama2-13B.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MiniGemini

Official implementation for Mini-Gemini

License:Apache-2.0Stargazers:0Issues:0Issues:0

MoneyPrinterTurbo

利用大模型,一键生成短视频

License:MITStargazers:0Issues:0Issues:0

Monkey

【CVPR 2024】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

License:MITStargazers:0Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.

License:MITStargazers:0Issues:0Issues:0

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

License:MITStargazers:0Issues:0Issues:0

StreamingT2V

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Stargazers:0Issues:0Issues:0

TableStructureRec

整理目前开源的表格识别模型,完善前后处理,模型转换为ONNX

License:Apache-2.0Stargazers:0Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"

License:MITStargazers:0Issues:0Issues:0

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0