yinhefeng / ICCV2021-Papers-with-Code

ICCV 2021 论文和开源项目合集

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ICCV2021-Papers-with-Code

ICCV 2021 论文和开源项目合集(papers with code)!

1617 papers accepted - 25.9% acceptance rate

ICCV 2021 收录论文IDs:https://docs.google.com/spreadsheets/u/1/d/e/2PACX-1vRfaTmsNweuaA0Gjyu58H_Cx56pGwFhcTYII0u1pg0U7MbhlgY0R6Y-BbK3xFhAiwGZ26u3TAtN5MnS/pubhtml

注1:欢迎各位大佬提交issue,分享ICCV 2021论文和开源项目!

注2:关于往年CV顶会论文以及其他优质CV论文和大盘点,详见: https://github.com/amusi/daily-paper-computer-vision

【ICCV 2021 论文和开源目录】

Backbone

Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions

AutoFormer: Searching Transformers for Visual Recognition

Bias Loss for Mobile Neural Networks

Visual Transformer

An Empirical Study of Training Self-Supervised Vision Transformers

Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions

Spatial-Temporal Transformer for Dynamic Scene Graph Generation

GAN

Labels4Free: Unsupervised Segmentation using StyleGAN

GNeRF: GAN-based Neural Radiance Field without Posed Camera

EigenGAN: Layer-Wise Eigen-Learning for GANs

NAS

AutoFormer: Searching Transformers for Visual Recognition

NeRF

GNeRF: GAN-based Neural Radiance Field without Posed Camera

KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

In-Place Scene Labelling and Understanding with Implicit Scene Representation

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis

Loss

Rank & Sort Loss for Object Detection and Instance Segmentation

Bias Loss for Mobile Neural Networks

长尾(Long-tailed)

Parametric Contrastive Learning

无监督/自监督(Un/Self-Supervised)

An Empirical Study of Training Self-Supervised Vision Transformers

DetCo: Unsupervised Contrastive Learning for Object Detection

2D目标检测(Object Detection)

DetCo: Unsupervised Contrastive Learning for Object Detection

Detecting Invisible People

Active Learning for Deep Object Detection via Probabilistic Modeling

Conditional Variational Capsule Network for Open Set Recognition

MDETR : Modulated Detection for End-to-End Multi-Modal Understanding

Rank & Sort Loss for Object Detection and Instance Segmentation

SimROD: A Simple Adaptation Method for Robust Object Detection

语义分割(Semantic Segmentation)

半监督语义分割(Semi-supervised Semantic Segmentation)

Leveraging Auxiliary Tasks with Affinity Learning for Weakly Supervised Semantic Segmentation

Re-distributing Biased Pseudo Labels for Semi-supervised Semantic Segmentation: A Baseline Investigation

无监督分割(Unsupervised Segmentation)

Labels4Free: Unsupervised Segmentation using StyleGAN

实例分割(Instance Segmentation)

Instances as Queries

Crossover Learning for Fast Online Video Instance Segmentation

Rank & Sort Loss for Object Detection and Instance Segmentation

Few-shot Segmentation

Mining Latent Classes for Few-shot Segmentation

目标跟踪(Object Tracking)

Learning to Adversarially Blur Visual Object Tracking

3D Point Cloud

Unsupervised Point Cloud Pre-Training via View-Point Occlusion, Completion

Point Cloud Semantic Segmentation(点云语义分割)

ReDAL: Region-based and Diversity-aware Active Learning for Point Cloud Semantic Segmentation

Point Cloud Denoising(点云去噪)

Score-Based Point Cloud Denoising

Point Cloud Registration(点云配准)

HRegNet: A Hierarchical Network for Large-scale Outdoor LiDAR Point Cloud Registration

超分辨率(Super-Resolution)

Learning for Scale-Arbitrary Super-Resolution from Scale-Specific Networks

行人重识别(Person Re-identification)

TransReID: Transformer-based Object Re-Identification

2D/3D人体姿态估计(2D/3D Human Pose Estimation)

2D 人体姿态估计

Human Pose Regression with Residual Log-likelihood Estimation

3D人头重建(3D Head Reconstruction)

H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction

行为识别(Action Recognition)

MGSampler: An Explainable Sampling Strategy for Video Action Recognition

文本检测(Text Detection)

Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection

文本识别(Text Recognition)

Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition

深度估计(Depth Estimation)

单目深度估计

MonoIndoor: Towards Good Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments

人群计数(Crowd Counting)

Rethinking Counting and Localization in Crowds:A Purely Point-Based Framework

异常检测(Anomaly Detection)

Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning

场景图生成(Scene Graph Generation)

Spatial-Temporal Transformer for Dynamic Scene Graph Generation

数据集(Datasets)

H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction

其他(Others)

Hand-Object Contact Consistency Reasoning for Human Grasps Generation

Equivariant Imaging: Learning Beyond the Range Space

Just Ask: Learning to Answer Questions from Millions of Narrated Videos

About

ICCV 2021 论文和开源项目合集