Only using 65% of GPU memory

Question

Only using 65% of GPU memory

tannercollin opened this issue 2 years ago · comments

While running some benchmarks I noticed min-dalle was only using 65% of my 3070 Ti GPU's memory:

Here's the function I'm calling:

def run_dalle():
    generate_image(
        is_mega=True,
        text='rich ducks playing poker',
        seed=0,
        grid_size=3,
        top_k=256,
        image_path='generated',
        models_root='pretrained',
        fp16=True,
    )

After running it 50 times, each image takes 54.5 seconds on average to generate. It's running baremetal on 32x E5-2630 v3 threads and 64 GB RAM.

Is there a way to make it use more of the GPU? or am I reading this wrong? Thanks!

Tanner · Answer 1 · Thu Jul 21 2022 11:07:06 GMT+0800 (China Standard Time)

Also the GPU-Util field fluctuates between 0% and ~40%.

78Alpha · Answer 2 · Wed Jul 27 2022 10:15:44 GMT+0800 (China Standard Time)

Doesn't seem too off. I use the bfloat16 and get a usage of 41% with 7.7 GB/ 8 GB VRAM. That's with a 3070 (non-Ti)

EDIT:

The time, however seems a little strange, as I am at 11 seconds per image

Tanner · Answer 3 · Sun Jul 31 2022 12:07:30 GMT+0800 (China Standard Time)

@78Alpha are you using the Mega model?

78Alpha · Answer 4 · Sun Jul 31 2022 16:51:19 GMT+0800 (China Standard Time)

@78Alpha are you using the Mega model?

The default for the pip package

EDIT:

Checking back, it is the Mega Version. Will also try non-mega...

Non-mega went to about 9 seconds per image

Tanner · Answer 5 · Tue Aug 02 2022 07:40:10 GMT+0800 (China Standard Time)

I'm surprised there's only two seconds difference. Note that mega defaults as off, so you have to pass --mega into the command line.

I was also getting around 10 seconds in the non-mega version.

78Alpha · Answer 6 · Wed Aug 03 2022 03:47:17 GMT+0800 (China Standard Time)

I had been using my own utility script for batch generation here

I have it set to is_mega=True