CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Training speed.

andreYoo opened this issue

I am using two RTX A6000 GPUs.

When I train the autoencoder models on the ImageNet dataset, it shows 2.14s/it.

Is that normal?

As I remember, "s/it" stands for seconds per iteration, where one iteration processes one batch, so it depends heavily on your batch size. With a standard batch size, yes, this duration is typical. It also depends heavily on the number of workers in your dataloader.
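To compare runs with different batch sizes, it can help to convert the "s/it" reading into samples per second. Here is a rough sketch of that arithmetic. The 2.14 s/it figure is from the question; the helper name `samples_per_second` and the batch sizes are made up for illustration, and it assumes data-parallel training where each GPU processes one batch per iteration.

```python
def samples_per_second(sec_per_it: float, batch_size_per_gpu: int, num_gpus: int = 1) -> float:
    """Convert a progress-bar "s/it" reading into samples processed per second.

    Assumes data-parallel training: every iteration, each GPU handles
    one batch of `batch_size_per_gpu` samples.
    """
    if sec_per_it <= 0:
        raise ValueError("sec_per_it must be positive")
    return batch_size_per_gpu * num_gpus / sec_per_it


if __name__ == "__main__":
    # 2.14 s/it and 2 GPUs come from the question; batch sizes are hypothetical.
    for bs in (4, 12, 32):
        rate = samples_per_second(2.14, bs, num_gpus=2)
        print(f"batch size {bs:>2} per GPU: {rate:.1f} samples/s")
```

If the samples-per-second figure stays roughly flat as you change the batch size, the GPUs are probably the bottleneck. If it improves when you raise the dataloader's worker count, data loading was likely holding things back.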