Free tensors from RAM if they are offloaded to an Accelerator

Question

LLukas22 opened this issue a year ago

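Right now, a tensor's data stays allocated in RAM even after the tensor has been offloaded to a GPU, so each offloaded tensor takes up memory twice. We should free the RAM copy once the offload completes, so that users can run bigger models that are split between CPU and GPU.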
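
To make the idea concrete, here is a minimal sketch in Python of the behaviour being asked for. It is not this project's code: `FakeAccelerator`, `Tensor`, `offload`, and `host_bytes` are hypothetical names invented for illustration, and the "device" is just a dictionary standing in for GPU memory. The only point is the ordering: upload first, then drop the host buffer.

```python
from typing import Dict, Optional


class FakeAccelerator:
    """Hypothetical stand-in for a GPU backend; it only stores copies of uploaded buffers."""

    def __init__(self) -> None:
        self._buffers: Dict[int, bytes] = {}
        self._next_handle = 0

    def upload(self, data: bytearray) -> int:
        # Copy the host buffer into "device memory" and return a handle to it.
        handle = self._next_handle
        self._next_handle += 1
        self._buffers[handle] = bytes(data)
        return handle

    def download(self, handle: int) -> bytes:
        return self._buffers[handle]


class Tensor:
    """Hypothetical tensor that lives either in RAM or on the accelerator, not both."""

    def __init__(self, name: str, data: bytearray) -> None:
        self.name = name
        self.host_data: Optional[bytearray] = data
        self.device_handle: Optional[int] = None

    def offload(self, device: FakeAccelerator) -> None:
        if self.host_data is None:
            return  # already offloaded, nothing left to free
        self.device_handle = device.upload(self.host_data)
        # Drop the only reference to the host buffer so the RAM can be reclaimed.
        self.host_data = None

    def host_bytes(self) -> int:
        return 0 if self.host_data is None else len(self.host_data)


# Split a toy "model" between CPU and the fake accelerator.
gpu = FakeAccelerator()
layers = [Tensor(f"layer{i}", bytearray(1024)) for i in range(4)]
for tensor in layers[:2]:
    tensor.offload(gpu)

# Only the two layers that stayed on the CPU still occupy RAM.
resident = sum(tensor.host_bytes() for tensor in layers)
print(resident)
```

In a real implementation the free would have to wait until the device copy has actually finished, and anything that still reads the tensor from the CPU side would need to fetch it back from the accelerator first.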