SUN, Pengzhan (pengzhansun)

pengzhansun

Geek Repo

Company:National University of Singapore

Location:Singapore

Home Page:https://pengzhansun.github.io/

Twitter:@pengzhan_sun

Github PK Tool:Github PK Tool

SUN, Pengzhan's starred repositories

Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14140Issues:116Issues:373

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language:PythonLicense:NOASSERTIONStargazers:12586Issues:110Issues:816

dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Language:PythonLicense:Apache-2.0Stargazers:6071Issues:68Issues:245

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:5635Issues:36Issues:283

sort

Simple, online, and realtime tracking of multiple objects in a video sequence.

Language:PythonLicense:GPL-3.0Stargazers:3822Issues:73Issues:156

Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Language:PythonLicense:Apache-2.0Stargazers:2928Issues:35Issues:140

co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2528Issues:25Issues:75

Multimodal-GPT

Multimodal-GPT

Language:PythonLicense:Apache-2.0Stargazers:1444Issues:12Issues:16

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language:PythonLicense:MITStargazers:818Issues:12Issues:109
Language:PythonLicense:NOASSERTIONStargazers:710Issues:8Issues:62

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonLicense:Apache-2.0Stargazers:606Issues:11Issues:28

ChatCaptioner

Official Repository of ChatCaptioner

Language:Jupyter NotebookLicense:MITStargazers:447Issues:4Issues:7

Pandora

Pandora: Towards General World Model with Natural Language Actions and Video States

Uni3D

[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI

Language:PythonLicense:MITStargazers:429Issues:12Issues:21

unmasked_teacher

[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Language:PythonLicense:MITStargazers:267Issues:14Issues:39

paco

This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts, and attributes prediction models, query evaluation scripts, and visualization notebooks.

Language:PythonLicense:MITStargazers:259Issues:19Issues:8

EgoVLP

[NeurIPS2022] Egocentric Video-Language Pretraining

IART

[CVPR 2024 Highlight] Enhancing Video Super-Resolution via Implicit Resampling-based Alignment.

Language:PythonLicense:MITStargazers:95Issues:4Issues:11

EgoVLPv2

Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]

Language:PythonLicense:MITStargazers:82Issues:5Issues:10

attention-interpolation-diffusion

Interpolation Between Text-to-Image Generation!

Language:PythonLicense:MITStargazers:67Issues:4Issues:7

InstructHumans

Editing Animated 3D Human Textures with Instructions

Language:PythonStargazers:51Issues:2Issues:0

hoi-forecast

[CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos

Language:PythonLicense:MITStargazers:49Issues:5Issues:13

MRFA

[NeurIPS 2023] Learning Motion Refinement for Unsupervised Face Animation

IVG

This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions", which is accepted by ACL 2024 (Findings).

License:Apache-2.0Stargazers:15Issues:0Issues:0