ECCV 2020 Overview

Useful Links

All papers

Overview of Topics

Interesting Papers
Image/Video Inpainting
Generative Models
Detection

ECCV Dailies

Interesting papers:

Towards Streaming Perception (Best paper honorable mention)
- Proposes a simple quantitative metric for streaming data (e.g. video stream from autonomous systems) that works with several computer vision tasks. Examples for object detection and instance segmentation are shown.
Findings: (1) the tradeoff between accuracy and latency can now be measured quantitatively, and there exists an optimal "sweet spot" that maximizes streaming accuracy; (2) asynchronous tracking and future forecasting naturally emerge as internal representations that enable streaming image understanding; and (3) dynamic scheduling can be used to overcome temporal aliasing, yielding the paradoxical result that latency is sometimes minimized by sitting idle and "doing nothing".
- [ECCV Link] [Project Page] [Paper]
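
The pairing rule behind a streaming metric of this kind can be sketched as follows: each ground-truth frame is scored against the most recent prediction that had finished computing by that frame's timestamp, so a slow model is evaluated on stale output. This is a minimal sketch under assumptions: the function name `pair_with_latest_prediction`, the millisecond timestamps, and the 50 ms detector are all hypothetical, and the actual task metric (for example AP) would still have to be computed on the returned pairs.

```python
from bisect import bisect_right

def pair_with_latest_prediction(frame_times, finish_times):
    """For each ground-truth timestamp, return the index of the most recent
    prediction that had finished by then (None if nothing was ready yet).

    finish_times must be sorted in ascending order.
    """
    pairs = []
    for t in frame_times:
        idx = bisect_right(finish_times, t) - 1  # last finish time <= t
        pairs.append(idx if idx >= 0 else None)
    return pairs

# Hypothetical example: frames arrive roughly every 33 ms,
# and a detector emits one result every 50 ms.
frame_times = [0, 33, 67, 100, 133, 167]   # milliseconds
finish_times = [50, 100, 150]              # milliseconds
pairs = pair_with_latest_prediction(frame_times, finish_times)
```

In this toy run the first two frames have no prediction available yet, and later frames are matched to whichever result was ready most recently, which is how latency ends up penalizing accuracy.
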
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis (Best paper honorable mention)
- We present a method that achieves state-of-the-art results for synthesizing novel views of complex scenes by optimizing an underlying continuous volumetric scene function using a sparse set of input views
- Proposes a simple fully-connected network that maps a 5D coordinate (spatial location x, y, z plus viewing direction $\theta$, $\phi$) to volume density and view-dependent color. Does not use voxel grids to represent 3D data.
- [ECCV Link] [Project Page] [Paper]
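
To turn per-point density and color into a pixel, the samples along a camera ray are alpha-composited front to back. This rendering step is not described in the notes above; it follows the standard volume rendering quadrature. The sketch below assumes hand-picked densities and colors in place of network outputs, and the name `composite_ray` is hypothetical.

```python
import math

def composite_ray(densities, colors, deltas):
    """Alpha-composite samples along one ray, front to back.

    densities: volume density sigma_i at each sample
    colors:    (r, g, b) tuple at each sample
    deltas:    distance covered by each sample segment
    """
    transmittance = 1.0          # fraction of light not yet absorbed
    pixel = [0.0, 0.0, 0.0]
    for sigma, rgb, delta in zip(densities, colors, deltas):
        alpha = 1.0 - math.exp(-sigma * delta)   # opacity of this segment
        weight = transmittance * alpha
        for c in range(3):
            pixel[c] += weight * rgb[c]
        transmittance *= 1.0 - alpha
    return tuple(pixel)

# Empty space (density 0) in front of a nearly opaque red surface.
pixel = composite_ray(
    densities=[0.0, 1000.0],
    colors=[(0.0, 0.0, 1.0), (1.0, 0.0, 0.0)],
    deltas=[0.1, 0.1],
)
```

The empty first sample contributes nothing regardless of its color, so the pixel takes on the color of the dense sample behind it.
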
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow (Best paper)
- [ECCV Link] [Project Page] [Paper]
A Generalization of Otsu's Method and Minimum Error Thresholding
- A generalized histogram thresholding (GHT) algorithm that includes Otsu's method, Minimum Error Thresholding (MET), and weighted percentile thresholding as special cases.
GHT thereby enables continuous interpolation between those three algorithms, which allows thresholding accuracy to be improved significantly. GHT works by performing approximate maximum a posteriori estimation of a mixture of Gaussians with appropriate priors.
- [ECCV Link] [Code] [Paper] [Video]
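
For reference, here is a minimal sketch of classic Otsu thresholding, one of the special cases mentioned above. It is not GHT itself: it only picks the threshold that maximizes the between-class variance of a histogram. The function name `otsu_threshold` and the toy histogram are assumptions for illustration.

```python
def otsu_threshold(hist):
    """Return the threshold t maximizing between-class variance.

    hist[i] is the number of pixels with intensity i; pixels with
    intensity <= t form class 0, the rest form class 1.
    """
    total = sum(hist)
    total_sum = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0      # pixel count of class 0
    sum0 = 0    # intensity sum of class 0
    for t, h in enumerate(hist):
        w0 += h
        sum0 += t * h
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue  # one class is empty, variance undefined
        mu0 = sum0 / w0
        mu1 = (total_sum - sum0) / w1
        # Between-class variance, up to a constant factor 1 / total**2.
        var_between = w0 * w1 * (mu0 - mu1) ** 2
        if var_between > best_var:
            best_t, best_var = t, var_between
    return best_t

# Toy bimodal histogram with 8 intensity levels.
threshold = otsu_threshold([5, 5, 0, 0, 0, 0, 5, 5])
```

On this histogram the chosen threshold separates the two dark bins from the two bright bins; ties across the empty middle bins resolve to the lowest such threshold.
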
A Metric Learning Reality Check (poster)
- Deep metric learning papers from the past four years have consistently claimed great advances in accuracy, often more than doubling the performance of decade-old methods. In this paper, we take a closer look at the field to see if this is actually true. We find flaws in the experimental methodology of numerous metric learning papers, and show that the actual improvements over time have been marginal at best.
- [ECCV Link] [Project Page] [Paper] [Blog]

Image/Video Inpainting

Rethinking image inpainting via a mutual encoder-decoder with feature equalization (oral)
- Assumes that image inpainting consists of structure and texture generation, and proposes a two-branch module.
- [ECCV Link] [Project Page] [Paper]
Hallucinating Visual Instances in Total Absentia
- Unlike the conventional image inpainting task, which works on images with only part of a visual instance missing, HVITA concerns scenarios where an object is completely absent from the scene.
- [ECCV Link] [Paper]
High-Resolution Image Inpainting with Iterative Confidence Feedback and Guided Upsampling
- Propose predicting a confidence value for each pixel; with basic thresholding on that confidence, corrupted regions of the image can then be re-inpainted.
- [ECCV Link] [Paper]
DVI: Depth Guided Video Inpainting for Autonomous Driving
- Propose a method to remove moving objects. They do not consider the case when the vehicle is static?
- [ECCV Link] [Paper]
Guidance and Evaluation: Semantic-Aware Image Inpainting for Mixed Scenes
- Propose different approaches to incorporate semantic segmentation for image inpainting.
- [ECCV Link] [Paper]
Proposal based Video Completion
- We use 3D convolutions to obtain an initial inpainting estimate, which is subsequently refined by fusing a generated set of proposals.
- [ECCV Link] [Paper]

Generative Models

Rewriting a Deep Generative Model (oral)
- Builds upon the assumption that a convolutional layer is an associative memory (key -> value mapping). With this, they can "rewrite" the result of the generative model (e.g. make a rule that a horse always wears a hat).
To address the problem, we propose a formulation in which the desired rule is changed by manipulating a layer of a deep network as a linear associative memory.
- [ECCV Link] [Project Page] [Paper] [Short Video]
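
The associative-memory view can be illustrated with a plain matrix: a layer `W` stores key -> value pairs as `W @ key ≈ value`, and a rule is rewritten by a rank-one update that forces one key to map to a new value. This is a simplified sketch, not the paper's exact update rule: it assumes keys are uncorrelated (identity key covariance), and the helper names `matvec` and `rewrite_memory` are hypothetical.

```python
def matvec(W, k):
    """Multiply matrix W (list of rows) by vector k."""
    return [sum(w * x for w, x in zip(row, k)) for row in W]

def rewrite_memory(W, key, new_value):
    """Rank-one update so that the returned matrix maps `key` to `new_value`,
    leaving outputs for keys orthogonal to `key` unchanged."""
    residual = [v - y for v, y in zip(new_value, matvec(W, key))]
    norm_sq = sum(x * x for x in key)
    return [[w + r * x / norm_sq for w, x in zip(row, key)]
            for row, r in zip(W, residual)]

# Start from an identity memory and redirect the key [1, 0] to the value [0, 2].
W = [[1.0, 0.0], [0.0, 1.0]]
W_new = rewrite_memory(W, key=[1.0, 0.0], new_value=[0.0, 2.0])
```

After the update, the chosen key retrieves the new value while the orthogonal key `[0, 1]` still retrieves its old value, which is the property that makes a single rule editable without retraining everything else.
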
Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation
- [ECCV Link] [Paper] [code]
GAN Slimming: All-in-One GAN Compression by A Unified Optimization Framework
- [ECCV Link] [Paper]
Learning to Factorize and Relight a City
- Using Google Street View, they synthesize images under different lighting conditions.
- Learns disentanglement between static objects over time (buildings) vs dynamic objects.
- [ECCV Link] [Paper]
ForkGAN: Seeing into the Rainy Night (oral)
- Propose an image translation network to translate images between weather conditions.
Our innovation is a fork-shape generator with one encoder and two decoders that disentangles the domain-specific and domain-invariant information.
- [ECCV Link] [Project Page] [Paper]
High-Fidelity Synthesis with Disentangled Representation
- Uses a VAE for its strong disentanglement and a GAN-based generator for high-fidelity results.
- [ECCV Link] [Paper]
SRFlow: Learning the Super-Resolution Space with Normalizing Flow
- [ECCV Link] [Paper]
What makes fake images detectable? Understanding properties that generalize
- We seek to understand what properties of fake images make them detectable and identify what generalizes across different model architectures, datasets, and variations in training.
- [ECCV Link] [Paper]

Detection

End-to-End Object Detection with Transformers (oral)
- Propose viewing object detection as a direct set prediction problem, solved with transformers. This removes the need for hand-crafted components such as non-maximum suppression and anchor generation.
- [ECCV Link] [Project Page] [Paper] [Code]
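
Set prediction needs a one-to-one assignment between predictions and ground-truth objects before a loss can be computed. DETR uses the Hungarian algorithm for this; the sketch below substitutes brute force over permutations, which is only viable for toy sizes. The cost values and the name `match_predictions` are made up for illustration.

```python
from itertools import permutations

def match_predictions(cost):
    """Return the one-to-one assignment of ground-truth objects to predictions
    with minimal total cost, as a list of (gt_index, pred_index) pairs.

    cost[g][p] is the cost of pairing ground truth g with prediction p;
    requires len(cost) <= number of predictions.
    Brute force over permutations: fine for a toy example only.
    """
    num_gt, num_pred = len(cost), len(cost[0])
    best = min(permutations(range(num_pred), num_gt),
               key=lambda perm: sum(cost[g][p] for g, p in enumerate(perm)))
    return list(enumerate(best))

# Two ground-truth objects, three predictions (hypothetical costs).
cost = [
    [0.9, 0.1, 0.8],
    [0.2, 0.7, 0.6],
]
matches = match_predictions(cost)
```

Any prediction left unmatched (here the third one) would be trained toward the "no object" class, which is why duplicate suppression such as NMS is no longer needed.
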
Learning What to Learn for Video Object Segmentation (oral)
- Propose a model for tracking in video segmentation where the target object is given in the first frame
Our learner is designed to predict a powerful parametric model of the target by minimizing a segmentation error in the first frame. We further go beyond the standard few-shot learning paradigm by learning what our target model should learn in order to maximize segmentation accuracy.
- [ECCV Link] [Project Page] [Paper]
Du$^2$Net: Learning Depth Estimation from Dual-Cameras and Dual-Pixels (oral)
- We present a novel approach based on neural networks for depth estimation that combines stereo from dual cameras with stereo from a dual-pixel sensor, which is increasingly common on consumer cameras
- [ECCV Link] [Paper]
Conditional Convolutions for Instance Segmentation (oral)
- Propose replacing the RoIAlign/Crop in Mask R-CNN with dynamic, instance-conditioned convolutions for instance segmentation.
- [ECCV Link] [Paper]
Corner Proposal Network for Anchor-free, Two-stage Object Detection (spotlight)
- Propose a two-stage detector without anchors: the first stage detects corners, and the second stage classifies the candidate boxes formed from all possible corner combinations.
- [ECCV Link] [Paper]
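
The "all possible combinations" step can be sketched as pairing every detected top-left corner with every bottom-right corner that forms a geometrically valid box. This is a simplified illustration: it ignores corner classes and scores, and the name `corner_proposals` and the coordinates are hypothetical.

```python
def corner_proposals(top_lefts, bottom_rights):
    """Pair every top-left corner with every bottom-right corner that lies
    strictly to its right and below it (image coordinates, y grows downward).
    Returns boxes as (x1, y1, x2, y2)."""
    return [(x1, y1, x2, y2)
            for (x1, y1) in top_lefts
            for (x2, y2) in bottom_rights
            if x2 > x1 and y2 > y1]

top_lefts = [(10, 10), (50, 40)]
bottom_rights = [(40, 30), (90, 80)]
proposals = corner_proposals(top_lefts, bottom_rights)
```

Of the four possible pairs, one is rejected because its bottom-right corner lies above and to the left of the top-left corner; the remaining boxes would go to the second stage for classification.
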
SimPose: Effectively Learning DensePose and Surface Normal of People from Simulated Data
- [https://papers.eccv2020.eu/paper/6637/]

Other

Learning Object Placement by Inpainting for Compositional Data Augmentation
- We propose a self-learning framework that automatically generates the necessary training data without any manual labeling by detecting, cutting, and inpainting objects from an image
- [ECCV Link] [Paper]

hukkelas / ECCV2020
