AlvaHan / CVPR2022-Papers-with-Code

CVPR 2022 论文和开源项目合集

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CVPR 2022 论文和开源项目合集(Papers with Code)

CVPR 2022 论文和开源项目合集(papers with code)!

CVPR 2022 收录列表ID:https://drive.google.com/file/d/15JFhfPboKdUcIH9LdbCMUFmGq_JhaxhC/view

注1:欢迎各位大佬提交issue,分享CVPR 2022论文和开源项目!

注2:关于往年CV顶会论文以及其他优质CV论文和大盘点,详见: https://github.com/amusi/daily-paper-computer-vision

如果你想了解最新最优质的的CV论文、开源项目和学习资料,欢迎扫码加入【CVer学术交流群】!互相学习,一起进步~

【CVPR 2022 论文开源目录】

Backbone

A ConvNet for the 2020s

MPViT : Multi-Path Vision Transformer for Dense Prediction

CLIP

HairCLIP: Design Your Hair by Text and Reference Image

PointCLIP: Point Cloud Understanding by CLIP

Blended Diffusion for Text-driven Editing of Natural Images

NAS

ISNAS-DIP: Image-Specific Neural Architecture Search for Deep Image Prior

NeRF

Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields

Point-NeRF: Point-based Neural Radiance Fields

NeRF in the Dark: High Dynamic Range View Synthesis from Noisy Raw Images

Visual Transformer

Backbone

MPViT : Multi-Path Vision Transformer for Dense Prediction

应用

Language-based Video Editing via Multi-Modal Multi-Level Transformer

MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video

Embracing Single Stride 3D Object Detector with Sparse Transformer

Multi-class Token Transformer for Weakly Supervised Semantic Segmentation

自监督学习(Self-supervised Learning)

Crafting Better Contrastive Views for Siamese Representation Learning

数据增强(Data Augmentation)

TeachAugment: Data Augmentation Optimization Using Teacher Knowledge

AlignMix: Improving representation by interpolating aligned features

目标检测(Object Detection)

DN-DETR: Accelerate DETR Training by Introducing Query DeNoising

Localization Distillation for Dense Object Detection

Focal and Global Knowledge Distillation for Detectors

目标跟踪(Visual Tracking)

Correlation-Aware Deep Tracking

TCTrack: Temporal Contexts for Aerial Tracking

语义分割(Semantic Segmentation)

弱监督语义分割

Class Re-Activation Maps for Weakly-Supervised Semantic Segmentation

Multi-class Token Transformer for Weakly Supervised Semantic Segmentation

半监督语义分割

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

实例分割(Instance Segmentation)

E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation

自监督实例分割

FreeSOLO: Learning to Segment Objects without Annotations

视频实例分割

Efficient Video Instance Segmentation via Tracklet Query and Proposal

图像编辑(Image Editing)

Blended Diffusion for Text-driven Editing of Natural Images

Low-level Vision

ISNAS-DIP: Image-Specific Neural Architecture Search for Deep Image Prior

超分辨率(Super-Resolution)

视频超分辨率

BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment

3D点云(3D Point Cloud)

A Unified Query-based Paradigm for Point Cloud Understanding

CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding

PointCLIP: Point Cloud Understanding by CLIP

3D目标检测(3D Object Detection)

Embracing Single Stride 3D Object Detector with Sparse Transformer

Canonical Voting: Towards Robust Oriented Bounding Box Detection in 3D Scenes

3D目标跟踪(3D Object Tracking)

Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds

3D人体姿态估计(3D Human Pose Estimation)

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video

3D语义场景补全(3D Semantic Scene Completion)

MonoScene: Monocular 3D Semantic Scene Completion

3D重建(3D Reconstruction)

BANMo: Building Animatable 3D Neural Models from Many Casual Videos

深度估计(Depth Estimation)

单目深度估计

NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation

OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion

Toward Practical Self-Supervised Monocular Indoor Depth Estimation

车道线检测(Lane Detection)

Rethinking Efficient Lane Detection via Curve Modeling

图像修复(Image Inpainting)

Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding

人群计数(Crowd Counting)

Leveraging Self-Supervision for Cross-Domain Crowd Counting

医学图像(Medical Image)

BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation

场景图生成(Scene Graph Generation)

SGTR: End-to-end Scene Graph Generation with Transformer

风格迁移(Style Transfer)

StyleMesh: Style Transfer for Indoor 3D Scene Reconstructions

水印(Watermarking)

Deep 3D-to-2D Watermarking: Embedding Messages in 3D Meshes and Extracting Them from 2D Renderings

数据集(Datasets)

It's About Time: Analog Clock Reading in the Wild

Toward Practical Self-Supervised Monocular Indoor Depth Estimation

Kubric: A scalable dataset generator

新任务(New Task)

Language-based Video Editing via Multi-Modal Multi-Level Transformer

It's About Time: Analog Clock Reading in the Wild

其他(Others)

Kubric: A scalable dataset generator

About

CVPR 2022 论文和开源项目合集