argonne-lcf / molan

Data analytics for molecular solids melting points

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

MOLAN, A Machine Learning Workflow for MolecularAnalysis: Application to Melting Points

If you are using the MOLAN workflow in your research paper, please cite us as

@article{sivaramandiversified,
  title={A diversified machine learning strategy for predicting and understanding molecular melting points},
  author={Sivaraman, Ganesh and Jackson, Nicholas E and Sanchez-Lengeling, Benjamin and V{\'a}zquez-Mayagoitia, {\'A}lvaro and Aspuru-Guzik, Al{\'a}n and Vishwanath, Venkatram and de Pablo, Juan J}
}

Intro statement

How do I run this?

There are two options:

  • Locally

This requires a standard scientific Python 3 environment with rdkit and tensorflow+pytorch and a cloned github. A simple way of getting that is installing Anaconda.

First to clone the github and then replicate a new anaconda environment using the environment.yml file:

git clone https://github.com/argonne-lcf/molan
cd molan
conda env create -f environment.yml
  • Remotely via Google Colab

Visit google colab (requires a gdrive account) and open a colab notebook via github:

What's inside?

  • data
    • 47K: Folder of json, each containing information for one molecule.
    • *csv: csv files.
  • notebooks: Jupyter notebooks (run these!)
    • Exploratory_Data_Analysis.ipynb
    • semisupervised_VAE.ipynb
    • Graph_Neural_Networks.ipynb
    • Gaussian_Processes.ipynb
  • code: Repo specific modules for training and creating the models.
  • results: Figures and weights for models.
  • media: Assorted images.

Acknowledgements

This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC02-06CH11357. Argonne National Laboratory’s work was supported by the U.S. Department of Energy, Office of Science, under contract DE-AC02-06CH11357.

Code contributors

  • Benjamin Sanchez-Lengeling beangoben::AT:::gmail.com
  • Ganesh Sivaraman gsivaraman:AT::anl.gov
  • Nicholas Jackson jackson.nick.e::aT:gmail.com
  • Alvaro Vazquez-Mayagoitia alvaro::At:anl.gov
  • Alan Aspuru-Guzik aspuru::at:utoronto.ca

About

Data analytics for molecular solids melting points

License:MIT License


Languages

Language:Jupyter Notebook 98.6%Language:Python 1.3%Language:Shell 0.2%Language:TeX 0.0%