whitead / dmol-book

Deep learning for molecules and materials book

Home Page:https://dmol.pub

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Protein Solubility Dataset Vocab (Chapter 7)

frenio opened this issue · comments

Thank you so much for this amazing resource! I would like to experiment a little more with the protein solubility dataset from Section 7.4 and would be interested in the actual amino acid sequences in the dataset. Is there an easy way to provide the vocabulary for the numericalized dataset in 'solubility.npz'?

I found something close to what I was looking for in the data repository (here) of the DeepSol paper.