CV-Archive

My personal paper/code/dataset archives about computer vision

#	Paper	Published in	Summary	Misc
1	Learning Priors for Semantic 3D Reconstruction	ECCV 2018		也适用于室内;暂时未开源
2	RayNet: Learning Volumetric 3D Reconstruction with Ray Potentials	CVPR 2018	基于multi-view的考虑，在learning-based重建的基础上加入了几何约束	非语义重建；已开源
3	Robust Dense Mapping for Large-Scale Dynamic Environments	ICRA 2018	SLAM + semantic segmentation + scene flow	双目;已开源
4	Semantic Multi-view Stereo: Jointly Estimating Objects and Voxels	CVPR 2017	引入shape prior解决遮挡，进行probabilistic重建
5	Semantically Informed Multiview Surface Refinement	ICCV 2017	3D网格几何表面和语义标签的联合优化	已开源
6	Semantic 3D reconstruction with continuous regularization and ray potentials using a visibility consistency constraint	CVPR 2016
7	Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction	CVPR 2015	直接优化三维模型的投影和图像的误差
8	Joint 3D Scene Reconstruction and Class Segmentation	CVPR 2013	提出了一个分割和重建联合优化解决的框架
9	An overview of recent progress in volumetric semantic 3D reconstruction	ICPR 2016	语义三维重建的综述
10	Dense Semantic 3D Reconstruction	TPAMI 2017	同8
11	Large-Scale Outdoor 3D Reconstruction on a Mobile Device	CVIU 2017	移动设备（Google Project Tango)上的大规模重建	基本real-time；没有结合语义
12	Incremental Dense Semantic Stereo Fusion for Large-Scale Semantic Scene Reconstruction	ICRA 2015	第一个实时的大范围室外场景稠密语义重建	总结并比较了了大规模语义重建的相关论文
13	Large-Scale Semantic 3D Reconstruction: An Adaptive Multi-Resolution Model	CVPR 2016	提出了一种自适应的multi-resolution的框架	离线处理
14	Joint Semantic Segmentation and 3D Reconstruction from Monocular Video	ECCV 2014	构建体素块的离散图重建街景	单目，不需要估计深度
15	The Semantic Paintbrush: Interactive 3D Mapping and Recognition in Large Outdoor Spaces	CHI 2015	眼镜 + 双目RGB-红外 + 手持激光笔重建室外场景	实时
16	Incremental Dense Multi-modal 3D Scene Reconstruction	IROS 2015	stereo + lidar	实时
17	Urban 3D Semantic Modelling Using Stereo Vision	ICRA 2013
18	Mesh Based Semantic Modelling for Indoor and Outdoor Scenes	CVPR 2013	提出了三维物体（网格模型）标签的生成方法
19	Efﬁcient 3-d scene analysis from streaming data	ICRA 2013	提出了一种场景表达方式保证速度和精度	实时
20	Multi-Label Semantic 3D Reconstruction using Voxel Blocks	3DV 2016	利用voxel blocks解决了多标签内存占用过多的问题
21	Semantic 3D Reconstruction with Finite Element Bases	arXiv 2017	其实没看懂。。。
22	Joint Optimization for Object Class Segmentation and Dense Stereo Reconstruction	IJCV 2012	双目重建+分割联合优化框架
23	Extracting 3D Scene-Consistent Object Proposals and Depth from Stereo Images	ECCV 2012	引入了物体和场景的约束
24	Semantic Structure From Motion with Points, Regions, and Objects	CVPR 2012	SfM中引入语义信息
25	MVSNet: Depth Inference for Unstructured Multi-view Stereo	ECCV 2018	从多视角图像估计深度	已开源
26	3D modeling on the go: Interactive 3d reconstruction of large-scale scenes on mobile devices	3DV 2015

可以关注一下几个组和个人的主页

ETH Computer Vision and Geometry group
Max Planck Research Group for Autonomous Vision
Autonomous Vision Group(和上面的是同一个组)
Christian Häne
Nikolay Savinov
Patrick Pérez

室内语义重建

#	Paper	Published in	Summary	Misc
1	3D Semantic Parsing of Large-Scale Indoor Spaces	CVPR 2016	大规模室内场景3D语义解析	提供了数据集
2	Dense 3D Semantic Mapping of Indoor Scenes from RGB-D Images	ICRA 2014	基于贝叶斯更新和三维CRF的2D到3D标签迁移	实时
3	A fully end-to-end deep learning approach for real-time simultaneous 3D reconstruction and material recognition	ICRA 2017	一个端到端的三维重建+材质识别系统	基本实时
4	Database-Assisted Object Retrieval for Real-Time 3D Reconstruction	Computer Graphics Forum 2015
5	ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans	CVPR 2018	输入不完整场景预测完整模型并进行语义分割	已开源
6	SemanticFusion: Dense 3D semantic mapping with convolutional neural networks	ICRA 2017	SLAM提供匹配点，CNN进行语义预测并融合

重点关注

Angela Dai

Object语义重建

#	Paper	Published in	Summary	Misc
1	Dense Object Reconstruction with Semantic Priors	CVPR 2013	通过目标检测和形状先验引入语义信息，克服了传统MVS的缺点	有多视角图片训练出物体的语义先验
2	Segment Based 3D Object Shape Priors	CVPR 2015	提出了新的shape prior formulation，将物体分成多个凸块,重建问题可以看作容积式的多标签分割
3	Class Specific 3D Object Shape Priors Using Surface Normal	CVPR 2014	利用物体表面的法向信息作为shape prior
4	Learning a Multi-view Stereo Machine	NIPS 2017	一个端到端的网络，输入多视角图片，输出Voxel Occupancy Grid或者深度图	已开源
5	Semantic Object Reconstruction via Casual Handheld Scanning	SIGGRAPH ASIA 2018	利用语义标签提升重建质量，并提出了一种主动式的自学习框架

（语义）SLAM

#	Paper	Published in	Summary	Misc
1	Multi-view 3D Entangled Forest For Semantic Segmentation and Mapping	ICRA 2018	在语义建图中引入3DEF分类器，提出了新的多视角融合方法改善了分割效果
2	CNN-SLAM: Real-time dense monocular SLAM with learned depth prediction	CVPR 2017	CNN预测单目深度并进行语义分割
3	REMODE: Probabilistic, Monocular Dense Reconstruction in Real Time	ICRA 2014	单目图片流中估计深度概率图
4	Quadtree-accelerated Real-time Monocular Dense Mapping	IROS 2018	单目实时估计深度+稠密建图	已开源
5	Multi-level mapping: Real-time dense monocular SLAM	ICRA 2016	四叉树增强局部纹理
6	REMODE: Probabilistic, Monocular Dense Reconstruction in Real Time	ICRA 2014		已开源

重点关注

HKUST Aerial Robotics Group

传统重建

#	Paper	Published in	Summary	Misc
1	Building Rome in a Day	ICCV 2009	large scale
2	Real-time 3D Reconstruction at Scale using Voxel Hashing	TOG 2013	利用voxel hashing节省GPU资源	RGB-D;已开源
3	Scalable Real-time Volumetric Surface Reconstruction	TOG 2013	octree
4	A Volumetric Method for Building Complex Models from Range Images	SIGGRAPH 1996	提出了volumetric式重建

Stereo

#	Paper	Published in	Summary	Misc
1	Direction Matters: Depth Estimation with a Surface Normal Classifier	CVPR 2015	在depth estimation中引入了表面法向量分类
2	Efﬁcient Large-Scale Stereo Matching	ACCV 2010	寻找可以可靠匹配的特征点并以此为支撑做三角变换,对视差进行插值计算	效果一般，勉强实时；已开源（CPU）
3	Massively Parallel Multiview Stereopsis by Surface Normal Diffusion	ICCV 2015		已开源

Camera pose estimation

#	Paper	Published in	Summary	Misc
1	Visual Odometry and Mapping for Autonomous Flight Using an RGB-D Camera	ISRR 2011

3D object retrieval

#	Paper	Published in	Summary
1	On Visual Similarity Based 3D Model Retrieval	2003	比较silhouette的相似度
2	DeepCCFV: Camera Constraint-Free Multi-View Convolutional Neural Network for 3D Object Retrieval	AAAI 2019
3	Multi-view Convolutional Neural Networks for 3D Shape Recognition	ICCV 2015
4	Camera Constraint-Free View-Based 3-D Object Retrieval	TIP 2012
5	A new 3D model retrieval approach based on the elevation descriptor

Dataset

3D数据集

#	Dataset	Misc
1	3D Warehouse	有一些建筑模型,格式为skp，同时关注下SketchUp
2	ShapeNet	有一些建筑（8000+）和vegetation的模型，格式为skp（数据集来源于1。。。）
3	GrabCAD	建筑、制造、机械
4

2D数据集

Semantic segmentation

Dataset	Hyperlink	Paper	Comment
NYU Depth Dataset V2	https://cs.nyu.edu/~silberman/datasets/nyu_depth_v2.html		RGB + D + class labels
Semantic3D	http://www.semantic3d.net/	Semantic3D: A new Large-scale Point Cloud Classification Benchmark	大规模点云分类的benchmark
Cityscapes Dataset	https://www.cityscapes-dataset.com/		城市街道的语义理解

zekeingithub / CV-Archive

CV-Archive

目录

Paper

室外(语义)重建

室内语义重建

Object语义重建

（语义）SLAM

传统重建

Stereo

Camera pose estimation

3D object retrieval

Dataset

3D数据集

2D数据集

Semantic segmentation

About