Question about the training for stage 3
QizhiPei opened this issue
Thanks for the interesting work that bridges 3D molecule and text. I'm a little confused about the training for stage 3.
- Is the model jointly trained on the PubChem and PubChemQC datasets at the same time? `bash ./scripts/stage3_train.sh` seems to train the model on only one of them. Do I need to manually merge the two datasets provided on Hugging Face?
- `bash ./scripts/stage3_train.sh` seems to use the `pretrain` subset of the PubChem and PubChemQC datasets, so I'm a little confused about when their `train` subsets are used. I know that the `pretrain` subset of PubChem is used for stage 1 and stage 2 pre-training, and the `train` subset of PubChem is used for stage 1 and stage 2 fine-tuning, but for stage 3 I'm confused.
Any help you might provide is appreciated and thanks for your time and attention.
Thanks for your interest in our work.
(1) Yes, it is jointly trained on both PubChem and PubChemQC. Please check the data provider functions for details (`instruct_dataset.py` and `balance_dataset.py` in the `data_provider` folder).
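For readers wondering how two corpora can be trained on jointly without manually merging the files: a common pattern is a wrapper dataset that interleaves samples from both sources, oversampling the smaller one so each epoch covers both. The sketch below is a hypothetical stand-in for what `balance_dataset.py` might do (the class name, toy data, and sampling scheme are assumptions, not the repo's actual code):

```python
class BalancedConcatDataset:
    """Minimal sketch of joint sampling from two datasets.

    Hypothetical illustration only: alternates between PubChem and
    PubChemQC, wrapping indices so the smaller corpus is oversampled
    and both are seen throughout training.
    """

    def __init__(self, pubchem, pubchemqc):
        self.datasets = [pubchem, pubchemqc]
        self.max_len = max(len(d) for d in self.datasets)

    def __len__(self):
        # One "epoch" draws max_len samples from each source.
        return self.max_len * len(self.datasets)

    def __getitem__(self, idx):
        # Even indices come from pubchem, odd from pubchemqc.
        ds = self.datasets[idx % len(self.datasets)]
        # Wrap the per-source index so the shorter dataset repeats.
        return ds[(idx // len(self.datasets)) % len(ds)]


# Toy usage with placeholder (molecule, text) pairs.
pubchem = [("mol_a", "text_a"), ("mol_b", "text_b"), ("mol_c", "text_c")]
pubchemqc = [("mol_x", "qc_x")]
mixed = BalancedConcatDataset(pubchem, pubchemqc)
print(len(mixed))  # 6: three draws from each source
print(mixed[1])    # ('mol_x', 'qc_x') — the small corpus is revisited
```

In practice the same idea is what PyTorch's `ConcatDataset` plus a weighted sampler achieves; the point is that no manual file merge is needed, since the data provider handles the mixing.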
(2) Sorry for the confusion in our code. In stage 3, only the `train` subsets of PubChem and PubChemQC are used. We have revised the mode name (`pretrain` -> `train`) in stage 3 to avoid this confusion.
Thanks for your response. Does that mean the `pretrain` subset of PubChemQC is not used for 3D-MoLM training?
Yes
Thanks for your quick reply~