rti / gbnc

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Embedding generation runs on CPU only

rti opened this issue · comments

commented

Terms

Behavior

When generating embeddings, only CPU is used, no GPU acceleration is leveraged.
This makes embedding generation for our full example data requiring 18h on 16 cores.
Typically, GPU acceleration can be activated by providing a device="cuda" parameter. This should speed up the embedding generation.

Operating System

linux, our container on runpod.io host with nvidia 3090

commented

fixed in 9ee8a32

commented

part of #23