OpenNMT / CTranslate2

Fast inference engine for Transformer models

Home Page: https://opennmt.net/CTranslate2

`unload_model` support for `Generator`

NeonBohdan opened this issue · comments

`unload_model` is a unique feature of CTranslate2, but it is currently supported only for `Translator`.

Could it also be supported for `Generator` models? It would greatly improve memory management for them.

+1, could this support `Whisper` models as well?

Hello, we can of course support it for `Generator` and `Whisper`. I will add it when I have time.

In the meantime, hit me up if you want some quick code snippets on how to delete the model object, free VRAM via `torch.cuda`, run garbage collection, etc., which is what I've resorted to...
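The interim workaround described above can be sketched roughly as follows. This is only an illustration, not CTranslate2 API: `ModelHolder` and `unload` are hypothetical names, and the real model object would be e.g. a `ctranslate2.Generator` instance.

```python
# A sketch of the interim workaround: drop the Python reference to the
# model, force garbage collection, and (if PyTorch is installed) release
# cached CUDA memory. `ModelHolder` and `unload` are illustrative names,
# not part of the CTranslate2 API.
import gc


class ModelHolder:
    """Wraps a loaded model (e.g. a ctranslate2.Generator) so it can be dropped."""

    def __init__(self, model=None):
        self.model = model  # would hold the real Generator instance


def unload(holder):
    """Release the model's memory by dropping its last reference."""
    holder.model = None  # drop the reference to the model object
    gc.collect()  # collect any reference cycles keeping it alive
    try:
        import torch

        if torch.cuda.is_available():
            torch.cuda.empty_cache()  # return cached VRAM blocks to the driver
    except ImportError:
        pass  # torch not installed; nothing extra to free
```

Unlike `Translator.unload_model`, this destroys the model entirely, so you pay the full construction cost (e.g. `ctranslate2.Generator(...)`) the next time you need it.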