신동협(Donghyeop Shin) (donghyeops)

Company: ABLY Corp.

Location: Seoul

신동협(Donghyeop Shin)'s starred repositories

sketchdeco-code

Official implementation of "SketchDeco: Decorating B&W Sketches with Colour"

Language: Python | License: MIT | Stargazers: 46 | Issues: 0

MS-Diffusion

Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

Language: Python | License: MIT | Stargazers: 90 | Issues: 0

MG-LLaVA

Official repository for the paper "MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning" (https://arxiv.org/abs/2406.17770).

Language: Python | License: Apache-2.0 | Stargazers: 61 | Issues: 0

ml-4m

4M: Massively Multimodal Masked Modeling

Language: Python | License: Apache-2.0 | Stargazers: 1141 | Issues: 0

CoLLaVO

Official PyTorch implementation of CoLLaVO: Crayon Large Language and Vision mOdel, which aims to improve zero-shot vision-language performance (ACL 2024 Findings)

Language: Python | License: MIT | Stargazers: 82 | Issues: 0

flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Language: Python | License: NOASSERTION | Stargazers: 313 | Issues: 0

Language: Python | Stargazers: 1010 | Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 20028 | Issues: 0

retinaface

RetinaFace: Deep Face Detection Library for Python

Language: Python | License: MIT | Stargazers: 1031 | Issues: 0

StyleFeatureEditor

Official Implementation for "The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing"

Language: Jupyter Notebook | License: MIT | Stargazers: 73 | Issues: 0

TroL

Official PyTorch implementation of Traversal of Layers (TroL), which presents a new propagation operation aimed at improving vision-language performance. (Under Review)

Language: Python | Stargazers: 65 | Issues: 0

Awesome-Image-Editing

A Survey of Image Editing

License: MIT | Stargazers: 117 | Issues: 0

Depth-Anything-V2

Depth Anything V2: A More Capable Foundation Model for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | Stargazers: 2091 | Issues: 0

AsyncDiff

Official implementation of "AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising"

Language: Python | License: Apache-2.0 | Stargazers: 110 | Issues: 0

BIRD

Official implementation of "Blind Image Restoration via Fast Diffusion Inversion"

Language: Python | Stargazers: 211 | Issues: 0

sscd-copy-detection

Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).

Language: Python | License: MIT | Stargazers: 228 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 515 | Issues: 0

StablePose

Official PyTorch implementation of the paper "Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation"

Language: Python | License: GPL-3.0 | Stargazers: 80 | Issues: 0

ReNO

ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization

Language: Python | License: MIT | Stargazers: 50 | Issues: 0

stable-audio-tools

Generative models for conditional audio generation

Language: Python | License: MIT | Stargazers: 2231 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 656 | Issues: 0

ddpm-torch

Unofficial PyTorch Implementation of Denoising Diffusion Probabilistic Models (DDPM)

Language: Python | License: MIT | Stargazers: 151 | Issues: 0

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language: Python | License: Apache-2.0 | Stargazers: 641 | Issues: 0

FinRobot

FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1126 | Issues: 0

omniglue

Code release for CVPR'24 submission 'OmniGlue'

Language: Python | License: Apache-2.0 | Stargazers: 435 | Issues: 0

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language: Python | License: AGPL-3.0 | Stargazers: 7958 | Issues: 0

diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.

Language: Python | License: MIT | Stargazers: 186 | Issues: 0

diffusion

Denoising Diffusion Probabilistic Models

Language: Python | Stargazers: 3437 | Issues: 0

ToonCrafter

A research paper for generative cartoon interpolation

Language: Python | License: Apache-2.0 | Stargazers: 4667 | Issues: 0

MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Language: Python | License: NOASSERTION | Stargazers: 1814 | Issues: 0