신동협(Donghyeop Shin)'s starred repositories

sketchdeco-code

Official implementation of "SketchDeco: Decorating B&W Sketches with Colour"

Language: Python | License: MIT | 4600

MS-Diffusion

Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

Language: Python | License: MIT | 9000

MG-LLaVA

Official repository for the paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning (https://arxiv.org/abs/2406.17770).

Language: Python | License: Apache-2.0 | 6100

ml-4m

4M: Massively Multimodal Masked Modeling

Language: Python | License: Apache-2.0 | 114100

CoLLaVO

Official PyTorch Implementation code for realizing the technical part of CoLLaVO: Crayon Large Language and Vision mOdel to significantly improve zero-shot vision language performances (ACL 2024 Findings)

Language: Python | License: MIT | 8200

flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Language: Python | License: NOASSERTION | 31300

LLaVA-NeXT

Language: Python | 101000

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | 2002800

retinaface

RetinaFace: Deep Face Detection Library for Python

Language: Python | License: MIT | 103100

StyleFeatureEditor

Official Implementation for "The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing"

Language: Jupyter Notebook | License: MIT | 7300

TroL

Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagation operation to get super vision language performances. (Under Review)

Language: Python | 6500

Awesome-Image-Editing

A Survey of Image Editing

License: MIT | 11700

Depth-Anything-V2

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | 209100

AsyncDiff

Official implementation of "AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising"

Language: Python | License: Apache-2.0 | 11000

BIRD

This is the official implementation of "Blind Image Restoration via Fast Diffusion Inversion"

Language: Python | 21100

sscd-copy-detection

Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).

Language: Python | License: MIT | 22800

megactor

Language: Python | License: Apache-2.0 | 51500

StablePose

Official PyTorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation

Language: Python | License: GPL-3.0 | 8000

ReNO

ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization

Language: Python | License: MIT | 5000

stable-audio-tools

Generative models for conditional audio generation

Language: Python | License: MIT | 223100

HiDiffusion

Language: Jupyter Notebook | License: Apache-2.0 | 65600

ddpm-torch

Unofficial PyTorch Implementation of Denoising Diffusion Probabilistic Models (DDPM)

Language: Python | License: MIT | 15100

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language: Python | License: Apache-2.0 | 64100

FinRobot

FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀

Language: Jupyter Notebook | License: Apache-2.0 | 112600

omniglue

Code release for CVPR'24 submission 'OmniGlue'

Language: Python | License: Apache-2.0 | 43500

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language: Python | License: AGPL-3.0 | 795800

diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model.

Language: Python | License: MIT | 18600

diffusion

Denoising Diffusion Probabilistic Models

Language: Python | 343700

ToonCrafter

A research paper for generative cartoon interpolation

Language: Python | License: Apache-2.0 | 466700

MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Language: Python | License: NOASSERTION | 181400