lsj2408 / Transformer-M

[ICLR 2023] One Transformer Can Understand Both 2D & 3D Molecular Data (official implementation)

Home Page:https://arxiv.org/abs/2210.01765

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to encode proteins in the PDBbind task?

fanxiaoyu0 opened this issue · comments

Very enlightening work. Congratulations on your great achievements in the OGB Challenge! In addition, I noticed that you have made fine-tuning on the PDBbind dataset. How should you encode the protein information? Because proteins usually contain more heavy atoms, do you directly use Transformer-M to encode proteins?

Also looking forward to the answer to this question...

It is a good catch! For the PDBBind task, each data instance is a protein-ligand complex. Since the size of a protein is much larger
than that of a ligand, each protein-ligand complex only contains the protein atoms around the ligand. Thus, the model receives the complex as input and predicts its binding affinity.