BiomedSciAI / histocartography

A standardized Python API with necessary preprocessing, machine learning and explainability tools to facilitate graph-analytics in computational pathology.

Memory requirements

luiscarm9 opened this issue

Thanks for the awesome repository!
I am trying to run the cell graph generation example, but I get CUDA out-of-memory errors. I am using a GPU with 8.5 GB of memory, with nothing else running on it and not shared in any way.
Is there a minimum GPU memory requirement for graph representation inference?
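
For reference, a quick way to check how much GPU memory PyTorch sees before running the example (plain PyTorch calls only, nothing histocartography-specific):

```python
import torch

# Report the capacity of GPU 0 and what PyTorch has already allocated/reserved on it.
if torch.cuda.is_available():
    props = torch.cuda.get_device_properties(0)
    print(f"{props.name}: {props.total_memory / 1024**3:.2f} GiB total")
    print(f"allocated by PyTorch: {torch.cuda.memory_allocated(0) / 1024**3:.2f} GiB")
    print(f"reserved by PyTorch:  {torch.cuda.memory_reserved(0) / 1024**3:.2f} GiB")
else:
    print("CUDA is not available")
```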

The examples normally run fine on a GPU. Can you try modifying the batch_size?
In any case, could you post your error log so we can have a look at it?

But as far as I understand, the batch_size in the example script is 1, isn't it?

I tried the code on a bigger GPU and I still get the same out-of-memory error:

Using backend: pytorch

Patch-level nuclei detection:   0%|          | 0/2 [00:00<?, ?it/s]

Traceback (most recent call last):
  File "cell_graph_generation.py", line 91, in <module>
    generate_cell_graph(image_path=os.path.join('output', 'images'))
  File "cell_graph_generation.py", line 64, in generate_cell_graph
    nuclei_map, _ = nuclei_detector.process(image)
  File "/home/rivera/.local/lib/python3.8/site-packages/histocartography/pipeline.py", line 138, in process
    return self._process(*args, **kwargs)
  File "/home/rivera/.local/lib/python3.8/site-packages/histocartography/preprocessing/nuclei_extraction.py", line 97, in _process
    return self._extract_nuclei(input_image, tissue_mask)
  File "/home/rivera/.local/lib/python3.8/site-packages/histocartography/preprocessing/nuclei_extraction.py", line 139, in _extract_nuclei
    out = self.model(image_batch).cpu()
  File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/rivera/.local/lib/python3.8/site-packages/histocartography/ml/models/hovernet.py", line 44, in forward
    d = self.encode(images)
  File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/rivera/.local/lib/python3.8/site-packages/histocartography/ml/models/hovernet.py", line 105, in forward
    x2 = self.group0(x1)
  File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/rivera/.local/lib/python3.8/site-packages/histocartography/ml/models/hovernet.py", line 201, in forward
    shortcut = getattr(self, 'block0_convshortcut')(in_feats)
  File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/home/rivera/.local/lib/python3.8/site-packages/histocartography/ml/models/hovernet.py", line 385, in forward
    x = self.conv(x)
  File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/conv.py", line 423, in forward
    return self._conv_forward(input, self.weight)
  File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/conv.py", line 419, in _conv_forward
    return F.conv2d(input, weight, self.bias, self.stride,
RuntimeError: CUDA out of memory. Tried to allocate 2.00 GiB (GPU 0; 15.75 GiB total capacity; 767.27 MiB already allocated; 1.60 GiB free; 786.00 MiB reserved in total by PyTorch)

It seems to be an issue in the nuclei detector.

  1. Could you tell me if you are running on your own images or on the set of dummy images?
  2. If on your own images, can you make sure that the input you provide to the nuclei extraction module is a numpy array?
  3. Antonio was referring to the batch size of the patches when processing the image. As the images are larger than what an ANN can handle, we have to break them down into patches. By default, the batch size is set to 32 when CUDA is available (and 2 on CPU). Can you try reducing it, e.g., to 8, when declaring the NucleiExtractor, i.e., NucleiExtractor(batch_size=8)? See the sketch below.

Hope this will solve the issue :)
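
For reference, a minimal sketch of the suggested usage. The file name is a placeholder, the import path assumes NucleiExtractor is re-exported from histocartography.preprocessing (it lives in preprocessing/nuclei_extraction.py per the traceback), and, as the follow-up below notes, the batch_size keyword only takes effect once the constructor actually exposes it:

```python
import numpy as np
from PIL import Image
from histocartography.preprocessing import NucleiExtractor

# Point 2 above: make sure the nuclei extraction module receives a numpy array.
image = np.array(Image.open("output/images/your_image.png").convert("RGB"))

# Point 3 above: reduce the patch batch size when declaring the extractor,
# so the HoverNet forward pass fits on an ~8 GB GPU.
nuclei_detector = NucleiExtractor(batch_size=8)
nuclei_map, _ = nuclei_detector.process(image)
```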

  1. I am trying both, but for now I just want to make the example script work.
  2. OK, I will keep that in mind.
  3. OK, but I think the current implementation does not allow passing batch_size as a parameter of the NucleiExtractor. When I try, I get 'NucleiExtractor' object has no attribute 'batch_size'.
    However, if I go to the implementation of the extractor (nuclei_extraction.py) and change the default value from 32 to 8, it works.

So I think there is a small bug there: the parameter cannot actually be changed from the constructor. Maybe the default value should also be smaller, so the example runs on most GPUs.
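
For illustration, a minimal, self-contained sketch of what exposing batch_size buys: chunked, no-grad inference over the patches bounds peak GPU memory. The class and names below are illustrative only, not the histocartography internals or the actual fix:

```python
import torch


class BatchedPatchRunner:
    """Run a model over a stack of patches in small chunks to bound peak GPU memory.

    Illustrative sketch only; this is not the histocartography implementation.
    """

    def __init__(self, model, batch_size: int = 8, device: str = "cuda"):
        self.model = model.to(device).eval()
        self.batch_size = batch_size  # smaller default than 32 so ~8 GB GPUs cope
        self.device = device

    def run(self, patches: torch.Tensor) -> torch.Tensor:
        # patches: (N, C, H, W) tensor, processed in chunks of `batch_size`.
        outputs = []
        with torch.no_grad():
            for start in range(0, patches.shape[0], self.batch_size):
                batch = patches[start:start + self.batch_size].to(self.device)
                outputs.append(self.model(batch).cpu())
        return torch.cat(outputs, dim=0)
```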

Yes, you're right, thanks for reporting the issue - will make a change.

Addressed in #9