Hyun-Bin Oh's starred repositories

MMM

Official repository for "MMM: Generative Masked Motion Model"

Language:Jupyter NotebookStargazers:54Issues:0Issues:0

VDT

[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:200Issues:0Issues:0

av2av

[CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation

Language:PythonLicense:MITStargazers:16Issues:0Issues:0

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

paig

Code for the paper Physics-as-Inverse-Graphics: Joint Unsupervised Learning of Objects and Physics from Video

Language:PythonLicense:MITStargazers:38Issues:0Issues:0

kubric

A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2257Issues:0Issues:0

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

Stargazers:488Issues:0Issues:0

SAiD

SAiD: Blendshape-based Audio-Driven Speech Animation with Diffusion

Language:PythonLicense:Apache-2.0Stargazers:69Issues:0Issues:0

Awesome-Talking-Face

đź“– A curated list of resources dedicated to talking face.

License:MITStargazers:1218Issues:0Issues:0

Korean-FastSpeech2-Pytorch

Implementation of Korean FastSpeech2

Language:PythonLicense:MITStargazers:206Issues:0Issues:0

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:46377Issues:0Issues:0

ml-papers

My collection of machine learning papers

License:MITStargazers:258Issues:0Issues:0

FLAME-Universe

Summary of publicly available ressources such as code, datasets, and scientific papers for the FLAME 3D head model

Stargazers:367Issues:0Issues:0

awesome-audiovisual-learning

A curated list of audio-visual learning methods and datasets.

Stargazers:215Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:44Issues:0Issues:0

One-sentence_Diffusion_summary

The repo for studying and sharing diffusion models.

Stargazers:378Issues:0Issues:0

3DOI

[ICCV 2023] Understanding 3D Object Interaction from a Single Image

Language:PythonStargazers:36Issues:0Issues:0

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language:PythonLicense:MITStargazers:6359Issues:0Issues:0

sam-hq

Segment Anything in High Quality [NeurIPS 2023]

Language:PythonLicense:Apache-2.0Stargazers:3602Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:1100Issues:0Issues:0

Scene-Text-Recognition-Recommendations

Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining

Language:PythonLicense:MITStargazers:313Issues:0Issues:0

playground

A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.

Language:PythonLicense:Apache-2.0Stargazers:1077Issues:0Issues:0

OCR-SAM

Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting

Language:PythonStargazers:508Issues:0Issues:0

pbdl-book

Welcome to the Physics-based Deep Learning Book (v0.2)

Language:Jupyter NotebookStargazers:956Issues:0Issues:0

Clothes-3D

clothes research in 3D

License:MITStargazers:157Issues:0Issues:0

TailorNet_dataset

[CVPR 2020] Dataset of "TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style"

Language:PythonLicense:NOASSERTIONStargazers:141Issues:0Issues:0

awesome-fashion-ai

A repository to curate and summarise research papers related to fashion and e-commerce

Stargazers:1160Issues:0Issues:0

Physics-Based-Deep-Learning

Links to works on deep learning algorithms for physics problems, TUM-I15 and beyond

Stargazers:1684Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:13Issues:0Issues:0