Beast code in Giters

Shoukang Hu's starred repositories

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language:PythonApache-2.029610 389 3481

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookMIT23981 316 388

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language:PythonNOASSERTION13024 112 853

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Language:PythonMIT4396 76 169

pytorch-openpose

pytorch implementation of openpose including Hand and Body Pose Estimation.

Language:Jupyter Notebook2040 25 78

4K4D

[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution

Language:PythonNOASSERTION1512 88 43

AvatarCLIP

[SIGGRAPH 2022 Journal Track] AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars

Language:PythonNOASSERTION1056 20 20

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language:Jupyter NotebookNOASSERTION859 16 25

Awesome-Segmentation-With-Transformer

[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey

633 10 5

3DTopia

Text-to-3D Generation within 5 Minutes

Language:PythonApache-2.0587 12 12

EVA3D

[ICLR 2023 Spotlight] EVA3D: Compositional 3D Human Generation from 2D Image Collections

Language:PythonNOASSERTION577 33 32

RelateAnything

Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.

Language:PythonApache-2.0438 10 12

CIHP_PGN

Code repository for Part Grouping Network, ECCV 2018

Language:PythonMIT427 18 74

MetaMath

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Language:PythonApache-2.0360 7 27

FreeNoise

[ICLR 2024] Code for FreeNoise based on VideoCrafter

Language:PythonApache-2.0353 6 13

MultiModal-DeepFake

[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond

Language:PythonNOASSERTION316 3 38

GauHuman

Code for our CVPR'2024 paper "GauHuman: Articulated Gaussian Splatting from Monocular Human Videos"

Language:PythonNOASSERTION306 12 39

diff-gaussian-rasterization

Language:CudaNOASSERTION304 7 21

SHERF

Code for our ICCV'2023 paper "SHERF: Generalizable Human NeRF from a Single Image"

Language:PythonNOASSERTION297 34 40

TADA

[3DV 2024] Official Repository for "TADA! Text to Animatable Digital Avatars".

Language:PythonMIT264 15 18

SparseNeRF

[ICCV 2023] SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis

Language:PythonNOASSERTION262 12 31

DS-Net

[CVPR 2021/TPAMI 2023] Rank 1st in the public leaderboard of SemanticKITTI Panoptic Segmentation (2020-11-16)

Language:PythonMIT240 10 20

MU-LLaMA

MU-LLaMA: Music Understanding Large Language Model

Language:PythonGPL-3.0136 7 15

HCMoCo

[CVPR 2022 Oral] Versatile Multi-Modal Pre-Training for Human-Centric Perception

Language:PythonMIT117 9 4

SAM-Graph

Code for "SAM-guided Graph Cut for 3D Instance Segmentation" ECCV 2024

77 16 4

ConsistentNeRF

ConsistentNeRF Enhances Neural Radiance Fields with 3D Consistency for Sparse View Synthesis

Language:Python69 8 3

MPS-NeRF

[TPAMI' 2022' MPS-NeRF]

Language:Python47 4 11

HumanLiff

HumanLiff learns layer-wise 3D human with a unified diffusion process.

Language:PythonNOASSERTION45 5 1

Correlational-Image-Modeling

Language:PythonNOASSERTION28 10

kaldi_bayes_adapt

This is a modified version of Kaldi speech recognition toolkit with the codes of standard and Bayesian adaptation approaches, e.g., LHUC, LHN, PAct, etc..

Language:ShellNOASSERTION2 10