calvinmccarter / building-better-benchmarks-blog

Code accompanying our blog post on building better benchmarks for ADME optimization

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Building Better Benchmarks Blog Post Code

The code for the analyses in our blog post Get with the Program: Building ADME benchmark datasets that drive impact. This post:

  1. Shows how in ML for small molecule drug discovery, one reason that research advances don't always translate to practical impact is because existing benchmark datasets don't capture realistic components of drug programs.
  2. Provides a path forward for constructing better benchmarks from existing public data by carefully selecting data sets, setting constraints on allowable training data, and using appropriate evaluation metrics.

To run the code, install dependencies from requirements.txt into a virtual environment:

pip install --upgrade pip && pip install -r requirements.txt

About

Code accompanying our blog post on building better benchmarks for ADME optimization

License:MIT License


Languages

Language:Jupyter Notebook 100.0%