yoyoshuang's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:44613Issues:294Issues:640

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonLicense:Apache-2.0Stargazers:28282Issues:323Issues:5193

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language:PythonLicense:NOASSERTIONStargazers:11832Issues:105Issues:769

DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Language:PythonLicense:MITStargazers:10896Issues:122Issues:207

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language:PythonLicense:Apache-2.0Stargazers:6320Issues:94Issues:663

shapely

Manipulation and analysis of geometric objects

Language:PythonLicense:BSD-3-ClauseStargazers:3696Issues:88Issues:1166

unrealcv

UnrealCV: Connecting Computer Vision to Unreal Engine

Language:C++License:MITStargazers:1838Issues:96Issues:205

AB3DMOT

(IROS 2020, ECCVW 2020) Official Python Implementation for "3D Multi-Object Tracking: A Baseline and New Evaluation Metrics"

Language:PythonLicense:NOASSERTIONStargazers:1637Issues:48Issues:103

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonLicense:MITStargazers:1445Issues:19Issues:85

VideoX

VideoX: a collection of video cross-modal models

Language:PythonLicense:NOASSERTIONStargazers:935Issues:22Issues:109

MotionBERT

[ICCV 2023] PyTorch Implementation of "MotionBERT: A Unified Perspective on Learning Human Motion Representations"

Language:PythonLicense:Apache-2.0Stargazers:883Issues:21Issues:129

SUSTechPOINTS

3D Point Cloud Annotation Platform for Autonomous Driving

Language:JavaScriptLicense:GPL-3.0Stargazers:769Issues:20Issues:174

splatter-image

Official implementation of `Splatter Image: Ultra-Fast Single-View 3D Reconstruction' CVPR 2024

Language:PythonLicense:BSD-3-ClauseStargazers:696Issues:22Issues:47

TransFusion

[PyTorch] Official implementation of CVPR2022 paper "TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers". https://arxiv.org/abs/2203.11496

Language:PythonLicense:Apache-2.0Stargazers:587Issues:13Issues:105

ActionCLIP

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Language:PythonLicense:MITStargazers:468Issues:4Issues:50

Uni3D

[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI

Language:PythonLicense:MITStargazers:405Issues:12Issues:19

concept-graphs

Official code release for ConceptGraphs

Language:PythonLicense:MITStargazers:281Issues:10Issues:36

embodied-generalist

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Language:PythonLicense:MITStargazers:235Issues:14Issues:18

TriDet

[CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling

Language:PythonLicense:MITStargazers:150Issues:3Issues:36

pytorch-i3d-feature-extraction

Code for I3D Feature Extraction

Language:PythonLicense:Apache-2.0Stargazers:131Issues:4Issues:0

SSR-code

Official implementation of 3DV24 paper "Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture"

SSTAP

Code for our CVPR 2021 Paper "Self-Supervised Learning for Semi-Supervised Temporal Action Proposal".

Language:Jupyter NotebookLicense:MITStargazers:43Issues:2Issues:8
Language:PythonStargazers:42Issues:0Issues:0

tridetplus

Code for the paper, Temporal Action Localization with Enhanced Instant Discriminability

Language:PythonLicense:MITStargazers:18Issues:2Issues:8

IntentQA

Official repository for "IntentQA: Context-aware Video Intent Reasoning" from ICCV 2023.

Stargazers:5Issues:0Issues:0