tomztyang

Company: The Chinese University of Hong Kong

tomztyang's starred repositories

AnyControl

[ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals and generates natural, harmonious results from multiple controls.

Language: Python | License: MIT | Stargazers: 78 | Issues: 0

StyleShot

StyleShot: A SnapShot on Any Style. A model that transfers any style to any content and generates high-quality, personalized stylized images without per-image fine-tuning.

Language: Python | License: MIT | Stargazers: 127 | Issues: 0

MPI

[RSS 2024] Learning Manipulation by Predicting Interaction

Language: Python | License: MIT | Stargazers: 65 | Issues: 0

Vista

A Generalizable World Model for Autonomous Driving

Language: Python | License: Apache-2.0 | Stargazers: 407 | Issues: 0

Language: Python | Stargazers: 320 | Issues: 0

TopoNet

Topology Reasoning for Scene Perception in Autonomous Driving

Language: Python | License: Apache-2.0 | Stargazers: 259 | Issues: 0

ViDAR

[CVPR 2024 Highlight] Visual Point Cloud Forecasting

Language: Python | License: Apache-2.0 | Stargazers: 241 | Issues: 0

street_gaussians

[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

Stargazers: 652 | Issues: 0

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language: Python | License: NOASSERTION | Stargazers: 12871 | Issues: 0

4d-occ-forecasting

CVPR 2023: Official code for "Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting"

Language: Python | License: MIT | Stargazers: 206 | Issues: 0

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language: Python | License: Apache-2.0 | Stargazers: 1664 | Issues: 0

VoxelNeXt

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)

Language: Python | License: Apache-2.0 | Stargazers: 685 | Issues: 0

SPS-Conv

(NeurIPS 2022) Spatial Pruned Sparse Convolution for Efficient 3D Object Detection

Language: Python | License: Apache-2.0 | Stargazers: 62 | Issues: 0

UnboundedNeRFPytorch

State-of-the-art, simple, fast unbounded / large-scale NeRFs.

Language: Python | License: MIT | Stargazers: 1327 | Issues: 0

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | License: Apache-2.0 | Stargazers: 30860 | Issues: 0

DeepVision3D

DeepVision3D is an open source toolbox for point-cloud understanding.

Language: Python | Stargazers: 119 | Issues: 0

FocalsConv

Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)

Language: Python | License: Apache-2.0 | Stargazers: 364 | Issues: 0

Entity

EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 683 | Issues: 0

SupContrast

PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)

Language: Python | License: BSD-2-Clause | Stargazers: 2995 | Issues: 0

lingvo

Lingvo

Language: Python | License: Apache-2.0 | Stargazers: 2802 | Issues: 0

WS_DAN

The official TensorFlow implementation of WS-DAN.

Language: Python | Stargazers: 111 | Issues: 0

DSGN

DSGN: Deep Stereo Geometry Network for 3D Object Detection (CVPR 2020)

Language: Python | License: MIT | Stargazers: 324 | Issues: 0

3DSSD

3DSSD: Point-based 3D Single Stage Object Detector (CVPR 2020)

Language: Python | License: MIT | Stargazers: 375 | Issues: 0