wenguanwang

followers

following

stars

Zhejiang University

China

https://sites.google.com/view/wenguanwang

Wenguan Wang's repositories

AGS

Learning Unsupervised Video Object Segmentation through Visual Attention (CVPR19, PAMI20)

Language:Python209 4 7

DHF1K

Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)

Language:MATLAB135 11 19

VOS_Correspondence

Official code for CVPR2023 Boosting Video Object Segmentation via Space-time Correspondence Learning

Language:Python16 3 3

HA_deblur

Human-Aware Motion Deblurring (ICCV2019)

300

Stereoscopic-thumbnail-creation-via-efficient-stereo-saliency-detection

Stereoscopic Thumbnail Creation via Efficient Stereo Saliency Detection (TVCG16)

Language:C++3 10

SupertrajectorySeg

Semi-Supervised Video Object Segmentation with Super-Trajectories (ICCV2017, PAMI2018)

Language:MATLAB3 2 1

ContrastiveSeg

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Language:PythonMIT200

visdial-gnn

(CVPR19Oral) Reasoning Visual Dialogs with Structural and Partial Observations

Language:PythonMIT2 20

Active_VLN

The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`

Language:PythonMIT1 10

LOAF

100

Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

Language:Jupyter NotebookAGPL-3.0100

AGNN

Zero-shot Video Object Segmentation via Attentive Graph Neural Networks (ICCV2019 Oral)

Language:Python000

C-HOI

Cascaded Human-Object Interaction Recognition (CVPR2020)

Language:Python000

CompositionalHumanParsing

(ICCV2019) Learning Compositional Neural Infomation Fusion for Human Parsing

Language:Python000

DNC

Official Pytorch implementation of 'Visual Recognition with Deep Nearest Centroids'. (ICLR2023 Spotlight)

Language:PythonMIT000

DoraemonGPT

Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models

BSD-3-Clause000

ETPNav

Official Implementation of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"

000

GMMSeg

[NeurIPS 2022 Spotlight] GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models

Language:PythonApache-2.0000

GraphMemVOS

Code for ECCV 2020 paper: Video Object Segmentation with Episodic Graph Memory Networks

Language:Python000

HieraSeg

000

HSSN_pytorch

Language:Python000

Human-Gaze-Communication

Language:Python000

LANA-VLN

Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"

MIT000

ProtoSeg

CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View

Language:PythonMIT000

SSM-VLN

Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"

Language:C++MIT000

TD-STP

Code for MM 22 "Target-Driven Structured Transformer Planner for Vision-Language Navigation"

Language:Python000

UMA-MOT

A Unified Object Motion and Affinity Model for Online Multi-Object Tracking (CVPR2020)

Language:PythonMIT010

VAR

[CVPR 2022] Visual Abductive Reasoning

Language:PythonMIT000

VS-Survey

000

WS3D

Official version of 'Weakly Supervised 3D object detection from Lidar Point Cloud'(ECCV2020)

Language:PythonMIT010