신동협(Donghyeop Shin) (donghyeops)

Company: ABLY Corp.

Location: Seoul

신동협(Donghyeop Shin)'s starred repositories

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: MIT | Stargazers: 11153 | Issues: 0

rho

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

License: MIT | Stargazers: 283 | Issues: 0

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Language: Python | License: Apache-2.0 | Stargazers: 1255 | Issues: 0

moondream

tiny vision language model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4778 | Issues: 0

cog-face-to-many

Turn any face into a video game character, pixel art, claymation, 3D or toy

Language: Python | License: NOASSERTION | Stargazers: 1219 | Issues: 0

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language: Jupyter Notebook | Stargazers: 1549 | Issues: 0

unitable

UniTable: Towards a Unified Table Foundation Model

Language: Jupyter Notebook | License: MIT | Stargazers: 329 | Issues: 0

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language: Python | License: MIT | Stargazers: 3550 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 390 | Issues: 0

FouriScale

Official implementation of FouriScale (ECCV 2024)

Language: Python | License: Apache-2.0 | Stargazers: 128 | Issues: 0

PSALM

[ECCV 2024] Official implementation of "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"

Language: Python | License: Apache-2.0 | Stargazers: 167 | Issues: 0

SSR_Encoder

PyTorch implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation" (CVPR 2024)

Language: Python | Stargazers: 82 | Issues: 0

Awesome-SSLRec-Papers

A Comprehensive Survey of Self-Supervised Learning for Recommendation

Stargazers: 89 | Issues: 0

DEADiff

[CVPR 2024] Official implementation of "DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations"

Language: Python | License: Apache-2.0 | Stargazers: 207 | Issues: 0

T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language: Python | License: MIT | Stargazers: 327 | Issues: 0

open-parse

Improved file parsing for LLMs

Language: Python | License: MIT | Stargazers: 2283 | Issues: 0

HairFastGAN

Official Implementation for "HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach"

Language: Python | License: MIT | Stargazers: 402 | Issues: 0

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Language: Python | License: Apache-2.0 | Stargazers: 1207 | Issues: 0

AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Language: C++ | License: Apache-2.0 | Stargazers: 1279 | Issues: 0

deep-text-recognition-benchmark

Text recognition (optical character recognition) with deep learning methods, ICCV 2019

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3705 | Issues: 0

BasicPBC

Official Implementation of "Learning Inclusion Matching for Animation Paint Bucket Colorization"

Language: Python | License: NOASSERTION | Stargazers: 225 | Issues: 0

Perturbed-Attention-Guidance

Official implementation of "Perturbed-Attention Guidance"

Language: Jupyter Notebook | License: MIT | Stargazers: 242 | Issues: 0

attention-interpolation-diffusion

Interpolation Between Text-to-Image Generation!

Language: Python | Stargazers: 77 | Issues: 0

Arc2Face

[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model of Human Faces

Language: Python | License: MIT | Stargazers: 536 | Issues: 0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 3145 | Issues: 0

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language: Python | License: Apache-2.0 | Stargazers: 4438 | Issues: 0

AdaIR

AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation

Language: Python | License: MIT | Stargazers: 86 | Issues: 0

sdxs

Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"

Language: Python | License: Apache-2.0 | Stargazers: 579 | Issues: 0

L2CS-Net

The official PyTorch implementation of L2CS-Net for gaze estimation and tracking

Language: Python | License: MIT | Stargazers: 312 | Issues: 0

jupyterlab-favorites

Add the ability to save favorite folders to JupyterLab for quicker browsing

Language: TypeScript | License: BSD-3-Clause | Stargazers: 14 | Issues: 0