lfranke / TRIPS

Home page: https://lfranke.github.io/trips/

Training Time and Dense Point Cloud Quality

jonstephens85 opened this issue · comments

Great project everyone!

I have two questions regarding training and the dense point cloud.

Training Time
I noticed that after 100 epochs, training is going at about 25% of the speed it was before: I was seeing around 60 e/s and now it's around 15 e/s. I also noticed my VRAM usage went from 17 GB to 24 GB at that point. Should it be maxing out my VRAM?

Also, this is an outdoor scene, should I have used --PipelineParams.enable_environment_map true? Not sure what "extensive" means when referring to a scene.

Point Cloud Quality
The dense point cloud I got from COLMAP is 9.7 million points. There isn't too much noise in the output; however, it projected a lot of points below ground. For reference, I filmed three loops around a statue, and there are many points under the statue that sit below the ground surface. Would it speed up processing if I cleaned up the noisy data below the surface?

Update: I got to epoch 263 and ran out of VRAM. I am running an RTX 3090 Ti.

I took a look at the sample datasets and noticed the images were 1920x1080 JPGs. I used images of the same dimensions, but mine were PNGs with roughly 5x larger file sizes. Not sure if that had something to do with it.
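For what it's worth, the file format mostly affects disk space and decode time; once decoded, an image's memory footprint depends only on its resolution and channel count. A rough back-of-the-envelope check (hypothetical helper, not part of TRIPS):

```python
def decoded_size_mib(width, height, channels=3, bytes_per_value=4):
    """Memory of one decoded float32 image tensor, in MiB.

    The on-disk PNG/JPG size is irrelevant once the image is decoded.
    """
    return width * height * channels * bytes_per_value / 2**20

# One 1920x1080 float32 RGB image, whether it came from a PNG or a JPG:
print(round(decoded_size_mib(1920, 1080), 1))  # -> 23.7
```

So the PNGs themselves shouldn't inflate VRAM; resolution and batch size are what drive memory use.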

> I noticed that after 100 epochs, training is going at about 25% of the speed it was before: I was seeing around 60 e/s and now it's around 15 e/s. I also noticed my VRAM usage went from 17 GB to 24 GB at that point. Should it be maxing out my VRAM?

I can confirm that it's exactly the same for me. I ran a small training yesterday evening with 14 images; it took about 40 minutes on my 3080.
Here, too, VRAM usage climbed to 24 GB after the first few epochs, just like yours.
The images I tested with were 2155x1094 JPGs, each about 1.3 MB.

You have probably already tried a smaller dataset. Try JPGs as well; it won't hurt, and it could free up some crucial resources.

> Also, this is an outdoor scene, should I have used --PipelineParams.enable_environment_map true? Not sure what "extensive" means when referring to a scene.

Unfortunately, I can't say anything about that option itself.


I'm now installing TRIPS on a second PC with 2x 4090 GPUs and will then start my 400-image set there. I hope I don't run into the same problems. I'll keep you updated.

Hi, thanks!

> I noticed that after 100 epochs, training is going at about 25% of the speed it was before: I was seeing around 60 e/s and now it's around 15 e/s. I also noticed my VRAM usage went from 17 GB to 24 GB at that point. Should it be maxing out my VRAM?

This is by design. After 100 epochs, we add the VGG loss to the mix. This loss is quite computationally expensive and requires significant VRAM. We achieved the best results with it; however, you can change this by passing --TrainParams.only_start_vgg_after_epochs with an epoch later or earlier than 100.
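Schematically, that schedule looks something like the following sketch (plain Python with placeholder loss functions and a made-up weight; this is not the actual TRIPS training loop):

```python
import numpy as np

VGG_WEIGHT = 0.1  # placeholder value, not the weight TRIPS actually uses

def l1_loss(pred, target):
    return float(np.abs(pred - target).mean())

def vgg_loss(pred, target):
    # Stand-in for the real perceptual loss. Running both images through
    # a VGG network is what costs the extra time and VRAM after epoch 100.
    return float(np.abs(pred - target).mean())

def total_loss(pred, target, epoch, vgg_start_epoch=100):
    # vgg_start_epoch corresponds to --TrainParams.only_start_vgg_after_epochs
    loss = l1_loss(pred, target)
    if epoch >= vgg_start_epoch:
        loss += VGG_WEIGHT * vgg_loss(pred, target)
    return loss
```

Raising `vgg_start_epoch` delays the slowdown and the VRAM jump; lowering it trades speed for the quality benefit of the perceptual loss earlier in training.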

> The dense point cloud I got from COLMAP is 9.7 million points. There isn't too much noise in the output; however, it projected a lot of points below ground. For reference, I filmed three loops around a statue, and there are many points under the statue that sit below the ground surface. Would it speed up processing if I cleaned up the noisy data below the surface?

The point cloud size is similar to our examples, and as long as no large areas are missing it should be fine. Outlier points are automatically made transparent, so that should not be a big issue. As for removing the underground points: I would expect it to speed up training and rendering, but most likely not by much, so I'm not sure it is worth the effort.
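If you do want to prune the underground points anyway, a simple height cutoff goes a long way (a hypothetical NumPy sketch; with COLMAP output you would first have to estimate where the ground plane sits, since the reconstruction's axes are arbitrary):

```python
import numpy as np

def remove_below_ground(points, ground_z):
    """Keep only points at or above an estimated ground height (z)."""
    return points[points[:, 2] >= ground_z]

cloud = np.array([
    [0.0, 0.0,  1.5],   # on the statue
    [1.0, 0.5,  0.0],   # ground-level point
    [0.3, 0.2, -2.0],   # noise projected below the ground
])
cleaned = remove_below_ground(cloud, ground_z=0.0)
print(len(cleaned))  # -> 2
```

Tools like CloudCompare or Open3D offer more robust outlier removal, but for a flat scene a plane threshold like this is usually enough.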

> Also, this is an outdoor scene, should I have used --PipelineParams.enable_environment_map true? Not sure what "extensive" means when referring to a scene.

In most cases, COLMAP point clouds do very well in outdoor scenes: some points are created even for far-away objects, and TRIPS can use them. I would start off by not using the environment map and add it only if the rendered background looks bad. In general, background reconstruction in our method is not perfect, and the environment map is more of a band-aid fix.

@lfranke thank you for the update and confirming that I didn't run into a bug!