ml-jku / clamp

Code for the paper Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language

Home Page: https://arxiv.org/abs/2303.03363


Pretraining and computational resources?


Hi, thank you for sharing your great work!
I am interested in the concept of your paper and would like to try the pretraining described in it.
How can I pretrain using this repository?
My other question is about computational resources.
In your paper, the experiments took a total of 170 days across 800 runs. Does the pretraining alone require that much compute?
Is it possible to pretrain using a single GPU?

Thank you in advance:)

Hi concon23,
pretraining on the full PubChem18 dataset should take around 2-5 days on a modest consumer GPU once the data is preprocessed.
You can follow the instructions in the reproduce section of the readme.
Hope you manage; otherwise I'm happy to help.
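
For reference, a pretraining run would look roughly like the command below; the flags are taken from the command discussed later in this thread, and the dataset path ./data/pubchem18 is only a placeholder for wherever your preprocessed PubChem data lives, so check the readme's reproduce section for the exact invocation.

python clamp/train.py --dataset=./data/pubchem18 --assay_mode=clip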

Hi @phseidl
Thank you for your kind reply.
Understood, thank you.
That is great for users with modest computational resources!

Sincerely:)

Hi @phseidl
Sorry for asking another question.
python clamp/train.py --dataset=./data/fsmol --assay_mode=clip --split=FSMOL_split
Does the above command run the pretraining?
Or does it run few-shot training or something else?

Thank you in advance:)

Hi @concon23,
this performs pretraining and then evaluates the model in the zero-shot setting.
To run few-shot evaluation, you can add --support_set_size=k, where k is the number of support samples you want.
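
For example, a few-shot run with eight support samples per task (the value 8 is only illustrative) would be:

python clamp/train.py --dataset=./data/fsmol --assay_mode=clip --split=FSMOL_split --support_set_size=8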
Best, Philipp