mks0601 / I2L-MeshNet_RELEASE

Official PyTorch implementation of "I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image", ECCV 2020


Have you used a 3D loss for the SMPLify-X fitting?

zhLawliet opened this issue · comments

commented

Thanks a lot for sharing the SMPLify-X fits for H36M. Did you use a 3D loss when fitting? I found that the side view is slanted, which suggests that the depth is incorrect.
[screenshot: the slanted side view]

The fits are in the world coordinate system. You should apply the camera extrinsics to render other views.
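For concreteness, a minimal sketch of what "apply camera extrinsics" means here, assuming mesh_world holds the fitted vertices in the world coordinate system and R, t are the rotation and translation of the target camera (the names follow the H36M camera annotation format; this is an illustration, not code from the repo):

import numpy as np

def world_to_camera(mesh_world, R, t):
    # X_cam = R @ X_world + t, applied row-wise to an (N, 3) vertex array
    return np.dot(mesh_world, R.T) + t.reshape(1, 3)

# hypothetical usage with the extrinsics of the desired (e.g. side) camera:
# R = np.array(cam_param['R'], dtype=np.float32).reshape(3, 3)
# t = np.array(cam_param['t'], dtype=np.float32).reshape(3)
# mesh_cam = world_to_camera(mesh_world, R, t)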

commented

@mks0601
Thanks. I see that you merge the root pose and the camera rotation:
# merge root pose and camera rotation
root_pose = smpl_pose[self.root_joint_idx,:].numpy()
root_pose, _ = cv2.Rodrigues(root_pose)
root_pose, _ = cv2.Rodrigues(np.dot(R,root_pose))
smpl_pose[self.root_joint_idx] = torch.from_numpy(root_pose).view(3)

I am trying to understand. Do you mean that R corresponds to the x-y view and is not suitable for the z-y view? If I want to show the z-y view, do I need a different R?
The code is:

smpl_mesh_coord, smpl_joint_coord = self.smpl.layer['neutral'](smpl_pose, smpl_shape)
smpl_mesh_coord = smpl_mesh_coord.numpy().astype(np.float32).reshape(-1,3)
fit_mesh_coord_cam = smpl_mesh_coord[...,[2,1,0]]  # xyz -> zyx
fit_mesh_coord_cam = (fit_mesh_coord_cam + 1)/2 * 255
vis(fit_mesh_coord_cam)

I can't understand your question. R is just a rotation matrix, included in the camera extrinsic parameters.

commented

Yes, the camera extrinsic parameters include R and t. I think fit_mesh_coord_cam already has the camera extrinsics applied via the "merge root pose and camera rotation" step, but the side view is still slanted.

How did you visualize your results?

commented

The code for the side view is:
pose, shape, trans = smpl_param['pose'], smpl_param['shape'], smpl_param['trans']
smpl_pose = torch.FloatTensor(pose).view(-1,3); smpl_shape = torch.FloatTensor(shape).view(1,-1)  # smpl parameters (pose: 72 dimension, shape: 10 dimension)
R, t = np.array(cam_param['R'], dtype=np.float32).reshape(3,3), np.array(cam_param['t'], dtype=np.float32).reshape(3)  # camera rotation and translation
# merge root pose and camera rotation
root_pose = smpl_pose[self.root_joint_idx,:].numpy()
root_pose, _ = cv2.Rodrigues(root_pose)
root_pose, _ = cv2.Rodrigues(np.dot(R, root_pose))
smpl_pose[self.root_joint_idx] = torch.from_numpy(root_pose).view(3)
smpl_mesh_coord, smpl_joint_coord = self.smpl.layer['neutral'](smpl_pose, smpl_shape)
smpl_mesh_coord = smpl_mesh_coord.numpy().astype(np.float32).reshape(-1,3)
fit_mesh_coord_cam = smpl_mesh_coord[...,[2,1,0]]  # xyz -> zyx
fit_mesh_coord_cam = (fit_mesh_coord_cam + 1)/2 * 255
fit_mesh_coord_cam = vis_mesh(img.copy(), fit_mesh_coord_cam, radius=1, color=(0,0,255), IS_cmap=False)

what is this line?

fit_mesh_coord_cam = smpl_mesh_coord[...,[2,1,0]] #xyz->zyx

And why don't you apply the extrinsic translation?

Could you follow my code in Human36M/Human36M.py?

commented

Yes, I followed your code in Human36M/Human36M.py. I can get the correct result for the front view, which has the extrinsics (R, t) and the internal parameters (cam_param['focal'], cam_param['princpt']) applied:
[screenshots: the correct front-view result]

The original coordinate system is x, y, z, and this line converts the coordinates xyz -> zyx for the side view. I think smpl_mesh_coord already has the camera extrinsics applied by the "merge root pose and camera rotation" step, and there are no internal parameters for the zyx view, so I just want to visualize the overall orientation of the side view.

I don't understand what 'internal parameters' means. You can just apply the extrinsics, without an axis transpose like xyz -> zyx.

commented

Thanks. The 'internal parameters' are cam_param['focal'] and cam_param['princpt']. There is only one set of extrinsics, for the front view, and now I want to visualize the overall orientation from a side view. My unclear description may have confused you.
Maybe I need to change the question: how can I get the correct side view?

The extrinsics are defined for all camera viewpoints.
You can apply extrinsics of the side viewpoint.
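For reference, a minimal sketch of rendering another view this way, assuming mesh_world holds the fitted vertices in the world coordinate system and cam_param is the side camera's annotation with 'R', 't', 'focal', and 'princpt' entries (an illustration under those assumptions, not the repo's code):

import numpy as np

def project_to_view(mesh_world, cam_param):
    # world -> camera coordinates of the chosen viewpoint (extrinsics)
    R = np.array(cam_param['R'], dtype=np.float32).reshape(3, 3)
    t = np.array(cam_param['t'], dtype=np.float32).reshape(3)
    mesh_cam = np.dot(mesh_world, R.T) + t[None, :]
    # camera -> pixel coordinates (intrinsics: focal length and principal point)
    f, c = cam_param['focal'], cam_param['princpt']
    x = mesh_cam[:, 0] / mesh_cam[:, 2] * f[0] + c[0]
    y = mesh_cam[:, 1] / mesh_cam[:, 2] * f[1] + c[1]
    return np.stack([x, y], axis=1)

The same function gives the front view when called with the front camera's cam_param; no xyz -> zyx axis swap is needed.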

commented

Thanks for your patient reply, I will try it.

commented

@mks0601 Can you provide the benchmark code for the 3DPW challenge? How can I reproduce the competition performance?
[screenshot]

Most of the code of the winning entry of the 3DPW challenge is based on this repo. The tracking code was newly added, though.

commented

Thank you for your reply. Your I2L-MeshNet won first and second place in the 3DPW challenge on the unknown-association track, which does not allow using ground-truth data in any form, so how can you pick the right person when there are multiple people? Another question: "bbox_root_pw3d_output.json" only covers 3DPW_test.json, but the 3DPW challenge above uses the entire dataset, including its train, validation, and test splits, for evaluation. It would be great if you could release the code for the ECCV 2020 3DPW challenge entry.

Q. How can you get the right person in multi-person scenes? -> I used a YOLOv5 human detector.
Q. (about bbox_root_pw3d_output.json) I used the param stage of I2L-MeshNet, so the RootNet output is not required.
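For context, getting person boxes from an off-the-shelf YOLOv5 model can be as simple as the sketch below (it uses the public ultralytics/yolov5 torch.hub entry point; the exact detector setup of the challenge entry is an assumption, not something documented in this repo):

import torch

# load a pretrained YOLOv5 model from the public hub
model = torch.hub.load('ultralytics/yolov5', 'yolov5s', pretrained=True)

results = model('frame_0001.jpg')          # hypothetical frame path
det = results.xyxy[0].cpu().numpy()        # columns: x1, y1, x2, y2, conf, class
person_boxes = det[det[:, 5] == 0][:, :5]  # keep only the COCO 'person' class (id 0)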

commented

Thank you, I understand. Can you release the part of the code that submits the results for the ECCV 2020 3DPW challenge?

Sorry, I don't have the code for the 3DPW challenge. But there is no big change from this repo.

commented

Thanks, I'll try it.

commented

@mks0601 Can you share all of your YOLOv4 results for 3DPW that were used for the 3DPW challenge? There is only YOLO.json for the test split: "data/PW3D/Human_detection_result/YOLO.json". I tried to get the bounding boxes with YOLOv4 myself, but they don't match yours well. Thanks.

Sorry, we don't have them. What problem are you having?

commented

This would be a tracking issue. I want to reproduce your competition performance, which won first and second place in the 3DPW challenge on the unknown-association track. There are multiple candidate boxes in each frame from YOLOv4. How do you choose the best-matching box, especially for multiple people and scenes with overlapping people? For example:
[example frames with multiple, overlapping people]

Most of the code of the winning entry of the 3DPW challenge is based on this repo. The tracking code was newly added, though.

We added human tracking code, as mentioned above.
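The tracking code itself is not in this repo, but a minimal IoU-based association sketch (my own illustration of the idea, not the challenge code) would greedily match every detected box to the most overlapping box from the previous frame:

import numpy as np

def iou(a, b):
    # intersection-over-union of two boxes given as (x1, y1, x2, y2)
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter + 1e-8)

def match_boxes(prev_boxes, cur_boxes, thr=0.3):
    # greedy matching: returns {prev_idx: cur_idx} for pairs whose IoU exceeds thr
    matches, used = {}, set()
    for i, pb in enumerate(prev_boxes):
        best_j, best_iou = -1, thr
        for j, cb in enumerate(cur_boxes):
            if j in used:
                continue
            o = iou(pb, cb)
            if o > best_iou:
                best_j, best_iou = j, o
        if best_j >= 0:
            matches[i] = best_j
            used.add(best_j)
    return matches

With heavily overlapping people, a common refinement is to also compare appearance or keypoint similarity rather than relying on IoU alone.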

commented

ok, thanks