dvlab-research / SphereFormer

The official implementation for "Spherical Transformer for LiDAR-based 3D Recognition" (CVPR 2023).

How to train SphereFormer in DDP mode?

kingbackyang opened this issue · comments

Current implementation is already based on DDP.

Is it enough, for DDP mode, to add "CUDA_VISIBLE_DEVICES=0,1,2,3***" before "python train.py --config config/semantic_kitti/semantic_kitti_unet32_spherical_transformer.yaml"? DDP mode usually needs a launch command, such as python -m torch****. I have just come across this repo and have not run it yet.

Some questions about the performance gap between the validation set and the test set on the SemanticKITTI benchmark.
I have read your paper, which is excellent work. The performance of SphereFormer reported in this repo is 67.8 on the SemanticKITTI validation set, while its performance on the test set is 74.8. For my model, however, the val and test performances are 71.4 and 72.3, respectively. May I ask how you achieve such a large gain on the test set? Thank you.

Firstly, you can run in DDP mode without a DDP launcher, as is done in the released train.py file.
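For concreteness, here is a minimal sketch of the launcher-free pattern, assuming train.py follows the common torch.multiprocessing.spawn approach (the worker function and its arguments below are illustrative, not the repo's actual code):

```python
import os
import torch
import torch.distributed as dist
import torch.multiprocessing as mp
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP

def main_worker(rank, world_size):
    # Each spawned process sets up its own process group, so no external
    # launcher (torchrun / torch.distributed.launch) is needed.
    os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
    os.environ.setdefault("MASTER_PORT", "29500")
    dist.init_process_group("nccl", rank=rank, world_size=world_size)
    torch.cuda.set_device(rank)

    model = nn.Linear(16, 4).cuda(rank)   # stand-in for the real network
    model = DDP(model, device_ids=[rank])

    # ... training loop with a DistributedSampler would go here ...
    dist.destroy_process_group()

if __name__ == "__main__":
    # World size is taken from the visible device count, so prefixing the
    # command with CUDA_VISIBLE_DEVICES=0,1,2,3 is enough to select GPUs.
    world_size = torch.cuda.device_count()
    mp.spawn(main_worker, args=(world_size,), nprocs=world_size, join=True)
```

With this pattern, each spawned process binds to one visible GPU, which is why the plain "python train.py --config ..." command works without a distributed launcher.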
Secondly, to reach better performance on the test benchmark, many works use tricks such as GT sampling and model ensembling, which you can try. The reported validation results do not use these tricks.

I am a PhD student at Zhejiang University. Could I have your contact information? My email address is jr.young@zju.edu.cn.

Does model ensembling mean ensembling across different epochs, or ensembling across different semantic segmentation models such as SPVCNN, MinkUNet, and Cylinder3D? And does GT sampling mean the data augmentation method used in the training stage of the SECOND paper?

Model ensembling means ensembling the same model trained with different hyperparameters (e.g., batch size, weight decay, learning rate).
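For illustration, a minimal sketch of this kind of test-time ensembling, where per-point class probabilities are averaged across checkpoints (the model builder and checkpoint paths are hypothetical, not from this repo):

```python
import torch

@torch.no_grad()
def ensemble_predict(models, points):
    """Average softmax probabilities over an ensemble, then take argmax."""
    probs = None
    for model in models:
        model.eval()
        p = torch.softmax(model(points), dim=-1)  # (num_points, num_classes)
        probs = p if probs is None else probs + p
    return (probs / len(models)).argmax(dim=-1)

# Usage (hypothetical checkpoints from runs with different batch size / lr / wd):
# models = [build_model() for _ in checkpoint_paths]
# for model, path in zip(models, checkpoint_paths):
#     model.load_state_dict(torch.load(path)["state_dict"])
# labels = ensemble_predict(models, points)
```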
For GT sampling, please refer to Sec. 3.4 of the RPVNet paper.
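As a rough sketch of the idea, GT sampling pastes points of instances cropped from other training scans into the current scan to rebalance rare classes. The database format and function below are assumptions for illustration, not RPVNet's or this repo's actual code:

```python
import numpy as np

def gt_sample(points, labels, gt_database, num_paste=5, rng=None):
    """points: (N, 4) xyz + intensity; labels: (N,) per-point classes;
    gt_database: list of (instance_points, instance_labels) tuples cropped
    from other training scans."""
    rng = rng or np.random.default_rng()
    picks = rng.choice(len(gt_database),
                       size=min(num_paste, len(gt_database)), replace=False)
    extra_pts, extra_lbl = [], []
    for i in picks:
        inst_pts, inst_lbl = gt_database[i]
        # Random yaw rotation so pasted instances vary in orientation.
        # Real implementations also check for collisions with existing objects.
        theta = rng.uniform(0.0, 2.0 * np.pi)
        c, s = np.cos(theta), np.sin(theta)
        rot = np.array([[c, -s], [s, c]])
        inst_pts = inst_pts.copy()
        inst_pts[:, :2] = inst_pts[:, :2] @ rot.T
        extra_pts.append(inst_pts)
        extra_lbl.append(inst_lbl)
    points = np.concatenate([points] + extra_pts, axis=0)
    labels = np.concatenate([labels] + extra_lbl, axis=0)
    return points, labels
```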

Thank you!