dazinovic / neural-rgbd-surface-reconstruction

Official implementation of the CVPR 2022 Paper "Neural RGB-D Surface Reconstruction"

Home Page: https://dazinovic.github.io/neural-rgbd-surface-reconstruction/

confusion on the translation = [-4.44, 0, 2.31]

chensjtu opened this issue

I'm a little confused about the translation in the config files. For instance, configs/scene0050_00.txt contains a parameter named translation, and each config contains a different value. What is the effect of this translation?

dazinovic commented

I translate and scale each scene so it roughly lies in a [-1, 1] cube. I did this manually by opening each scene in Meshlab and looking at the center of the bounding box, but you could also do it automatically by computing the bounding box for all the cameras. I haven't tested if this actually improves the results, but at least it's a bit more convenient to evaluate the SDF after training.
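Not code from the repo, but a minimal sketch of the automatic variant, assuming the camera-to-world poses are available as an (N, 4, 4) NumPy array (the camera bounding box only roughly approximates the scene bounding box):

import numpy as np

def compute_translation_and_scale(poses):
    # Camera centers are the translation column of each camera-to-world pose.
    centers = poses[:, :3, 3]
    bbox_min, bbox_max = centers.min(axis=0), centers.max(axis=0)
    translation = -(bbox_min + bbox_max) / 2.0  # move bbox center to the origin
    scale = 2.0 / (bbox_max - bbox_min).max()   # fit the longest side into [-1, 1]
    return translation, scale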

OK, thanks for your kind reply!

Hey, dazinovic. I'm new to 3D vision. When I use the poses in trainval_poses to run TSDF fusion, I get terrible reconstruction results. Is there anything I should watch out for? Maybe I should convert the OpenGL poses to OpenCV poses? If so, please tell me how. Many thanks!

[screenshot of the reconstruction]

This is the result for the "breakfast room" scene.

dazinovic commented

You definitely need to transform the poses into whichever coordinate system your TSDF fusion code uses. Can you try pre-multiplying by:

[[1, 0,  0, 0],
 [0, 0, -1, 0],
 [0, 1,  0, 0],
 [0, 0,  0, 1]] ?

If it still doesn't work, try swapping some columns of the pose matrix or changing the sign of some columns. In OpenCV, the y-axis points downward and the z-axis forward, so you can try flipping (changing the sign of) the 2nd and 3rd columns. A sketch of both options follows below.
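For illustration only (not code from the repo), a sketch of both options with NumPy, assuming 4x4 camera-to-world pose matrices:

import numpy as np

pose = np.eye(4, dtype=np.float32)  # placeholder for one camera-to-world pose

# Option 1: pre-multiply to change the world coordinate frame (swap y/z).
world_fix = np.array([[1, 0,  0, 0],
                      [0, 0, -1, 0],
                      [0, 1,  0, 0],
                      [0, 0,  0, 1]], dtype=np.float32)
pose_fixed = world_fix @ pose

# Option 2: flip the sign of the 2nd and 3rd columns (the camera's y and z
# axes), which converts between the OpenGL and OpenCV camera conventions.
pose_opencv = pose @ np.diag([1.0, -1.0, -1.0, 1.0]).astype(np.float32)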

Thanks for your kind reply! But I still have trouble with the pose conversion. Can you share the Blender rendering code so that I can understand these tricky coordinate operations? BTW, what is the difference between poses.txt and trainval_poses.txt? I used pose 0 in Blender, but the rendered image does not match image0.png. For instance,

1 0 0 0
0 0 -1 0
0 1 0 0
0 0 0 1

is the first pose in the breakfast room, but a camera located at (0, 0, 0) is not reasonable!

[screenshot of the rendered image]

I just used the code from https://github.com/andyzeng/tsdf-fusion-python, with small modifications to the file paths and the pose loader.

dazinovic commented

To go from OpenGL to Blender coordinates, you can use the following code:

import sys
import numpy as np

def load_poses(posefile):
    # Each pose is a 4x4 matrix stored as four consecutive lines of four floats.
    with open(posefile, "r") as file:
        pose_floats = [[float(x) for x in line.split()] for line in file]

    lines_per_matrix = 4
    all_poses = [pose_floats[lines_per_matrix * idx : lines_per_matrix * (idx + 1)]
                 for idx in range(len(pose_floats) // lines_per_matrix)]
    return all_poses


if __name__ == '__main__':
    posefile = sys.argv[1]

    poses = load_poses(posefile)
    poses = np.array(poses).astype(np.float32)

    for pose in poses:
        # Swap the y and z axes (rows 1 and 2)
        pose[[1, 2], :] = pose[[2, 1], :]

        # Invert the y-axis
        pose[1, :] *= -1

    poses = np.reshape(poses, [-1, 16])

    dst_file = sys.argv[2]
    np.savetxt(dst_file, poses, fmt='%.6f')
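Assuming the script is saved as convert_poses.py (a hypothetical name), you would run it as:

python convert_poses.py poses.txt poses_blender.txt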

Some of the Blender scenes had an unreasonable size, so I scaled them down first (IIRC I scaled the breakfast room by a factor of 0.35).
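If you scale a scene like that, the camera translations have to be scaled by the same factor; a minimal sketch, assuming an (N, 4, 4) camera-to-world pose array:

import numpy as np

poses = np.load("poses.npy")  # hypothetical input: (N, 4, 4) camera-to-world poses
# A uniform scene scale only affects the translation column of each pose.
poses[:, :3, 3] *= 0.35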

poses.txt contains the ground truth poses in the OpenGL coordinate system (the poses I used to render the scenes). trainval_poses.txt contains BundleFusion's estimated poses that I use as the initial poses in my method.

dazinovic commented

blender_poses.zip

These are the poses I used for Blender.

So the pose array in trainval_poses.txt is the final pose used in all experiments?
The optimized poses are pretty far from the poses provided in poses.txt.
Can you tell me how you computed Table 2 in your paper?

[screenshot of Table 2]

I get the optimized poses with:

[screenshot of the optimized pose]

while in poses.txt it is:

[screenshot of the pose from poses.txt]

I guess you use relative pose error estimation?

So if I want to run TSDF fusion on your dataset, I can use the poses in poses.txt and convert them to OpenCV's convention. Then I should get what I want?

dazinovic commented

So the pose array in trainval_poses.txt is the final pose used in all experiments?

Kind of. They are not in the same space, though. You can check extract_optimized_poses.py to see how the conversion works.

The optimized poses are pretty far from the poses provided in poses.txt.
Can you tell me how you computed Table 2 in your paper?

You need to align the two trajectories. I did that by aligning the first camera of both trajectories. A better way might be to actually solve an optimization problem that finds the best alignment, but I don't believe it would change the reported numbers significantly.
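A minimal sketch of the first-camera alignment (my own illustration, not code from the repo), assuming both trajectories are (N, 4, 4) camera-to-world arrays:

import numpy as np

def align_by_first_camera(poses_ref, poses_est):
    # Rigid transform that maps the first estimated camera onto the first
    # reference camera; applying it to every pose aligns the two trajectories.
    correction = poses_ref[0] @ np.linalg.inv(poses_est[0])
    return np.stack([correction @ p for p in poses_est])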

So if I want to run TSDF fusion on your dataset, I can use the poses in poses.txt and convert them to OpenCV's convention. Then I should get what I want?

You need to convert them into whatever coordinate system tsdf_fusion uses.

You need to align the two trajectories. I did that by aligning the first camera of both trajectories. A better way might be to actually solve an optimization problem that finds the best alignment, but I don't believe it would change the reported numbers significantly.

Thanks for the detailed response. I now understand how to process the trajectories.

You need to convert them into whatever coordinate system tsdf_fusion uses.

I've tried different combinations of poses and finally got what I wanted. Many thanks again for your kindness and fantastic work.

@chensjtu Hi, how did you solve the alignment between the different pose files? I was trying to align the recovered point clouds from each frame to get a complete scene, but have failed so far (the breakfast_room scene, for instance). Which pose file should I use to just put the point clouds together? Any suggestions? @dazinovic

Can you guys please help me run this code on ScanNet? I've been struggling for the last two weeks.
