HINT: Hierarchical Interaction Network for Clinical Trial Outcome Prediction
This repository hosts HINT, a deep learning based method for clinical trial outcome prediction. The repository can be mainly divided into three parts:
benchmark
describes the process of curating benchmark dataset named Trial Outcome Prediction (TOP) for clinical trial outcome prediction.HINT
is the Hierarchical Interaction Network, a deep learning based method.data
stores processed data.
The following figure illustrates the pipeline of HINT.
Table Of Contents
- Installation
- Benchmark
- HINT: Learn and Inference
- Tutorial (Jupyter Notebook)
- Contact
Installation
We build conda environment and uses conda
or pip
to install the required packages. See conda.yml
for all the packages.
conda create -n predict_drug_clinical_trial python==3.7
conda activate predict_drug_clinical_trial
conda install -c rdkit rdkit
pip install tqdm scikit-learn
pip install torch
pip install seaborn
pip install icd10-cm
We use following command to activate conda environment.
conda activate predict_drug_clinical_trial
Benchmark
To standardize the clinical trial outcome prediction, we create a benchmark dataset for Trial Outcome Prediction named TOP, which incorporate rich data components about clinical trials, including drug, disease and protocol (eligibility criteria).
All the scripts are in the folder benchmark
.
Please see benchmark/README.md
for details.
HINT: Learn and Inference
After processing the data, we learn the Hierarchical Interaction Network (HINT) on the following four tasks. The following figure illustrates the pipeline of HINT. All the scripts are available in the folder HINT
.
Please see HINT/README.md
for details.
Prediction results
We add the prediction results in ./results
for all the three phases.
Trained model
The trained HINT models for all the three phases are available in ./save_model
.
Tutorial (jupyter notebook)
benchmark
:tutorial_benchmark.ipynb
describes some key components of the data curation process.HINT
:tutorial_HINT.ipynb
is a tutorial to learn and evaluate HINT step by step.
Contact
Please contact futianfan@gmail.com for help or submit an issue. This is a joint work with Kexin Huang, Cao(Danica) Xiao, Lucas M. Glass and Jimeng Sun.
Benchmark Usage Agreement
The benchmark dataset and code (including data collection and preprocessing, model construction, learning process, evaluation), referred as the Works, are publicly available for Non-Commercial Use only at https://github.com/futianfan/clinical-trial-outcome-prediction. Non-Commercial Use is defined as for academic research or other non-profit educational use which is: (1) not-for-profit; (2) not conducted or funded (unless such funding confers no commercial rights to the funding entity) by an entity engaged in the commercial use, application or exploitation of works similar to the Works; and (3) not intended to produce works for commercial use.