Vchitect / Latte

Latte: Latent Diffusion Transformer for Video Generation.

What is the minimum VRAM requirement for training and inference?

olliacc opened this issue · comments

What is the minimum amount of GPU video memory (VRAM) needed to run Latte video generation effectively, for both training and inference?

Hi, thanks for your interest. Inference of one video on an A100 requires 20916 MiB of GPU memory under fp16 precision. As for the GPU memory required for training, it depends on your batch size.
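For a quick sanity check against that figure, a minimal sketch (pure Python; the function name and the card sizes are illustrative, only the 20916 MiB number comes from the reply above) of whether a given card's VRAM fits:

```python
def fits_in_vram(required_mib: int, card_gib: float) -> bool:
    """Check whether a card with `card_gib` GiB of VRAM can hold
    `required_mib` MiB (the unit nvidia-smi usually reports)."""
    card_mib = card_gib * 1024  # 1 GiB = 1024 MiB
    return card_mib >= required_mib

LATTE_FP16_INFERENCE_MIB = 20916  # figure quoted above for one video on an A100

print(fits_in_vram(LATTE_FP16_INFERENCE_MIB, 24))  # 24 GiB card -> True
print(fits_in_vram(LATTE_FP16_INFERENCE_MIB, 16))  # 16 GiB card -> False
```

So a 24 GiB consumer card would fit the quoted fp16 inference footprint, while a 16 GiB card would not, leaving aside any extra overhead.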

@maxin-cn
May I set the local batch size to 1 for training Latte on my own dataset? I've heard that a sufficiently large batch size seems to be key when training diffusion models.

Hi, you can set the batch size to 1, but I'm not sure whether this will hurt performance. You can try it first. Looking forward to your feedback later~
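If memory forces a per-GPU batch size of 1, gradient accumulation can recover a larger effective batch by summing gradients over several micro-batches before each optimizer step. A dependency-free sketch (the 1-D linear model and all names here are toy illustrations of the pattern, not Latte's actual training loop):

```python
def grad_step(w, samples, lr=0.1, accum_steps=4):
    """One pass of SGD for y = w*x with squared loss, accumulating
    gradients over `accum_steps` micro-batches of size 1.  Each update
    is equivalent to a single step with batch size `accum_steps`."""
    grad = 0.0
    for i, (x, y) in enumerate(samples, start=1):
        # d/dw (w*x - y)^2 = 2*(w*x - y)*x, averaged over the effective batch
        grad += 2 * (w * x - y) * x / accum_steps
        if i % accum_steps == 0:
            w -= lr * grad  # step only after a full "effective batch"
            grad = 0.0
    return w

data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]  # samples of y = 2x
w = grad_step(0.0, data, lr=0.01, accum_steps=4)
print(w)  # 0.3 — identical to one step with batch size 4
```

In a PyTorch loop the same idea is usually expressed by dividing the loss by the accumulation count and calling the optimizer step only every `accum_steps` iterations.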