Shoukang Hu (skhu101)

skhu101

Geek Repo

Company:Nanyang Technological University Singapore

Home Page:https://skhu101.github.io

Github PK Tool:Github PK Tool

Shoukang Hu's starred repositories

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:29610Issues:389Issues:3481

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:23981Issues:316Issues:388

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language:PythonLicense:NOASSERTIONStargazers:13024Issues:112Issues:853

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Language:PythonLicense:MITStargazers:4396Issues:76Issues:169

pytorch-openpose

pytorch implementation of openpose including Hand and Body Pose Estimation.

Language:Jupyter NotebookStargazers:2040Issues:25Issues:78

4K4D

[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution

Language:PythonLicense:NOASSERTIONStargazers:1512Issues:88Issues:43

AvatarCLIP

[SIGGRAPH 2022 Journal Track] AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars

Language:PythonLicense:NOASSERTIONStargazers:1056Issues:20Issues:20

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:859Issues:16Issues:25

Awesome-Segmentation-With-Transformer

[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey

3DTopia

Text-to-3D Generation within 5 Minutes

Language:PythonLicense:Apache-2.0Stargazers:587Issues:12Issues:12

EVA3D

[ICLR 2023 Spotlight] EVA3D: Compositional 3D Human Generation from 2D Image Collections

Language:PythonLicense:NOASSERTIONStargazers:577Issues:33Issues:32

RelateAnything

Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.

Language:PythonLicense:Apache-2.0Stargazers:438Issues:10Issues:12

CIHP_PGN

Code repository for Part Grouping Network, ECCV 2018

Language:PythonLicense:MITStargazers:427Issues:18Issues:74

MetaMath

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:360Issues:7Issues:27

FreeNoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

Language:PythonLicense:Apache-2.0Stargazers:353Issues:6Issues:13

MultiModal-DeepFake

[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond

Language:PythonLicense:NOASSERTIONStargazers:316Issues:3Issues:38

GauHuman

Code for our CVPR'2024 paper "GauHuman: Articulated Gaussian Splatting from Monocular Human Videos"

Language:PythonLicense:NOASSERTIONStargazers:306Issues:12Issues:39

SHERF

Code for our ICCV'2023 paper "SHERF: Generalizable Human NeRF from a Single Image"

Language:PythonLicense:NOASSERTIONStargazers:297Issues:34Issues:40

TADA

[3DV 2024] Official Repository for "TADA! Text to Animatable Digital Avatars".

Language:PythonLicense:MITStargazers:264Issues:15Issues:18

SparseNeRF

[ICCV 2023] SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis

Language:PythonLicense:NOASSERTIONStargazers:262Issues:12Issues:31

DS-Net

[CVPR 2021/TPAMI 2023] Rank 1st in the public leaderboard of SemanticKITTI Panoptic Segmentation (2020-11-16)

Language:PythonLicense:MITStargazers:240Issues:10Issues:20

MU-LLaMA

MU-LLaMA: Music Understanding Large Language Model

Language:PythonLicense:GPL-3.0Stargazers:136Issues:7Issues:15

HCMoCo

[CVPR 2022 Oral] Versatile Multi-Modal Pre-Training for Human-Centric Perception

Language:PythonLicense:MITStargazers:117Issues:9Issues:4

SAM-Graph

Code for "SAM-guided Graph Cut for 3D Instance Segmentation" ECCV 2024

ConsistentNeRF

ConsistentNeRF Enhances Neural Radiance Fields with 3D Consistency for Sparse View Synthesis

MPS-NeRF

[TPAMI' 2022' MPS-NeRF]

HumanLiff

HumanLiff learns layer-wise 3D human with a unified diffusion process.

Language:PythonLicense:NOASSERTIONStargazers:45Issues:5Issues:1
Language:PythonLicense:NOASSERTIONStargazers:28Issues:1Issues:0

kaldi_bayes_adapt

This is a modified version of Kaldi speech recognition toolkit with the codes of standard and Bayesian adaptation approaches, e.g., LHUC, LHN, PAct, etc..

Language:ShellLicense:NOASSERTIONStargazers:2Issues:1Issues:0