rabbityl / lepard

[CVPR 2022, Oral] Learning Partial point cloud matching in Rigid and Deformable scenes

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

dataloader memory leak issue

wuqianliang opened this issue · comments

Hi, thank you for your excellent work.
When I run your code, it seems always to have the data loader worker killed after running for a while. Is there a possible memory leak?
I have set the load workers to 4 and batch size to 2 and have a TITAN card and 72G memory.
image

I have never seen this on my machine.
It's possibly caused by memory size.
Maybe you can try to use a smaller num_worker, say 1, 2 or even 0.
also see this: pytorch/pytorch#8976 (comment)

commented

Hi, I meet the same problem, did you slove it?

I find that the problem is caused by AverageMeter.update() in a training step. I solve the problem by detach the input tensor during the accumulation in AverageMeter.update().