huangqx / image_shape_align

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% License and attribution:
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

This package provides software modules for image and shape analysis. The code is released under the MIT license and can be used for any purpose with proper attribution. The code accompanies the following paper, which should be cited in publications that use the provided modules:

Single-View Reconstruction via Joint Analysis of Image and Shape Collections
Qixing Huang, Hai Wang, and Vladlen Koltun
ACM Transactions on Graphics 34(4), 2015

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Loading the parameters
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  load('data\parameters.mat');

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Attention: The rendering setting in Para (i.e., Para.rHeights,
% Para.rWidhts) is tested on Windows machines. If you are using
% Linux or Mac, please call the following function to update
% these two vectors:
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  Para = update_para(Para);

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Loading the images and shapes
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  load('data\car.mat');
  Note that the shapes are assumed to be consistently oriented in
  a world coordinate system

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module I: Pose estimation
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  The pose estimation module predicts the camera pose for the underlying object with respect to the world coordinate system of the shapes. This is done by sampling the camera poses, rendering the shapes from the camera poses, and comparing real images to rendered images. The most time-consuming part is image rendering, which is done by precomputation.

  Cameras = cam_camera_sampling(Para); % camera simulation
  hogRender = cam_shape_hog_dess(...   %
                Shapes,...  % The aligned input shapes
                Cameras,... % The sampled camera poses
                Para,...    % Please refer to the function body
                verbose);   % verbose = 1 if you want to print it
                              out


  Camera estimation can be done in two ways: for a single image
  in isolation or for a collection of images

  Camera_init = cam_pose_est_single(...
                  Image,...     % The input image object
                  Shapes,...    % The aligned input shapes
                  Cameras,...   % The sampled camera poses
                  hogRender,... % The rendered hog descriptors
                  Para,...  % Please refer to the function body
                  verbose); % verbose = 1 if you want to print it
                              out

  Cameras_init = cam_pose_est_joint(...
                  inputImages,...  % The input images
                  Shapes,...       % The aligned input shapes
                  cameraSamples,...% The sampled camera poses
                  hogRender,...    % The rendered hog descriptors
                  Para,...   % Please refer to the function body
                  verbose)   % verbose = 1 if you want to print
                               it out

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module II: Pairwise image-shape alignment
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  This module performs image-shape alignment for one image. It selects the closest shape and deforms it to fit the image object. Note that the pose estimation module is packed in the following main function:

  [Camera_opt, Shape_opt] = demo_i2s_align(...
    Image,...  % Input image
    Shapes,... % Input shapes
    Para);     % Parameters used in alignment. Please refer to the
                function body for details

 Output argments:
  'Shape_opt' : The optimized shape, which aligns with the input
                image object
  'Camera_opt': The associated optimized camera configuration

Demo:
   >load('data\chair.mat');
   >load('data\parameters.mat');
   >[Camera_opt, Shape_opt] = demo_i2s_align(Images{1}, Shapes, Para);


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module III: Assembly-based reconstruction
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  Assembly-based reconstruction aims to recover the underlying 3D model of an image object by assembling parts from a small collection of relevant shapes. The input consists of one image and a few shapes, each of which comprises a number of components. We assume that the shapes are consistently aligned in a world coordinate system. We assume that the pose of the image object is given. The main function is

  [Shape_opt, Camera_opt] = demo_ibm_mix_and_match(...
    Image,...             % The input image
    Camera_init,...       % The initial camera configuration for
                            all input shapes
    Shapes,...            % The input shapes
    Para);                % Please refer to the functional body
                            for details

  Output argments:
    'Shape_opt' : The reconstructed shape in the world coordinate
                  system
    'Camera_opt': The associated optimized camera configuration

Demo:
   >load('data\mix_and_match.mat');
   >load('data\parameters.mat');
   >[Shape_opt, Camera_opt] = demo_ibm_mix_and_match(Image, Camera_init, Shapes, Para);

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Module IV: Joint image-shape correspondence
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  Joint image-shape correspondence module estimates dense pixelwise correspondences among a collection of images and a collection of shapes. We assume that (i) the camera poses of the images are pre-computed (e.g., using Module I), and (ii) the shapes are consistently aligned in a world coordinate system. The main function is

  PairMatches = demo_joint_i2s_corres_main(...
    Images,...       % Input images
    ImageCameras,... % The camera configurations associated with
                       each input image
    Shapes,...       % The input shapes (aligned in a
                       world coordinate system)
    Top,...          % (#Images x #Shapes) is a sparse matrix
                       that specifies image-shape pairs
                       for matching.
    Para);           %

  Warning: If you use >100 images and/or >100 shapes,
           the computation may take a very long time.

  Demo:
    >load('data\joint_matching.mat');
    >load('data\parameters.mat');
    >Corres_rs = demo_joint_i2s_corres_main(Images(1:10), ImageCameras(1:10), Shapes(1:5), ones(10,5), Para);


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% External libraries
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
  The software package uses the following external libraries:

  Piotr's Image & Video Matlab Toolbox:
  http://vision.ucsd.edu/~pdollar/toolbox/doc/
  http://research.microsoft.com/en-us/downloads/389109f6-b4e8-404c-84bf-239f7cbf4e3d/

  The SIFT flow package:
  http://people.csail.mit.edu/celiu/SIFTflow/

  The OPENGM2 structure predication package:
  http://hci.iwr.uni-heidelberg.de/opengm2/?l0=library

  HOG descriptor: functions 'imResample', 'hog'

  Edge map: function 'edgesDetect'

  OPENGM2: the trws algorithm

References:

Navneet Dalal, Bill Triggs: Histograms of Oriented Gradients for Human Detection. CVPR 2005

Piotr Dollár, C. Lawrence Zitnick: Fast Edge Detection Using Structured Forests. IEEE Trans. Pattern Anal. Mach. Intell. 37(8): 1558-1570 (2015)

Ce Liu, Jenny Yuen, Antonio Torralba: SIFT Flow: Dense Correspondence across Scenes and Its Applications. IEEE Trans. Pattern Anal. Mach. Intell. 33(5): 978-994 (2011)

Andres, B. and Beier T. and Kappes, J.H. : OpenGM: A C++ Library for Discrete Graphical Models. http://arxiv.org/abs/1206.0111

About


Languages

Language:MATLAB 75.2%Language:C++ 23.7%Language:C 0.7%Language:M 0.4%