imzdu / apply-ds-to-improve-addiction-treatment

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Apply Data Science To Improve Addiction Treatment

Motivation

Every day, more than 130 people in the United States die after overdosing on opioids^[National Institute on Drug Abuse - https://www.drugabuse.gov/drugs-abuse/opioids/opioid-overdose-crisis]. Addiction to and misuse of opioids, such as heroin, prescription pain relievers, and synthetic opioids such as fentanyl, has reached alarming levels. This issue has destroyed countless families, and has placed a heavy burden on the overall economy, including increased costs of healthcare, addiction treatment, criminal justice involvement, and lost productivity, etc.

Therefore, it has become ever more critical for healthcare providers to extract insights and better understand the addiction population for providing effective treatment.

Data science is one of the ways to help achieve this.

As our organization continuously strives to improve treatment quality, I've recently worked on a data science project to apply machine learning techniques to understand key common attributes among patients, and discover any unique clusters. Continuity of care is crucial for treating substance use disorder. Our goal is to discover key causes for patients to quit recovery programs, and identify areas for improvement, and design more effective intervention.

The analysis presented in this report is only one piece of a working-in-progress effort towards the goal. Upon approval by our organization, I'd like to share some of the key findings from this project, hoping to inspire meaningful discussions about how data science and technologies can transform addiction treatment.

Summary

In this report, I will go through step by step on how I used unsupervised machine learning to extract clinical insights and conduct patients segmentation analysis from survey results.

The survey used in this project is called Brief Addiction Monitor (BAM). The complete survey with scoring & clinical guidelines are publicly available, which can be found following this link.

This report follows the BSPF project framework developed by Matt Dancho, an enhanced CRISP-DM Methodology designed specifically for solving business problems using data science.

Here is the outline of this report:

  1. Problem Understanding
  2. Data Understanding
  3. Data Preparation
  4. Modeling
  5. Validation
  6. Impact
  7. What's Next

Files and Dataset

The RMarkdown file can be found here.

The full report of this analysis can be found here.

De-identified dataset for reproducing the report can be found here

Please make sure to set working directory as inside "imzdu/apply-ds-to-improve-addiction-treatment".

Disclaimer

The findings extracted from this analysis is entired based on data sets from within our organization. Although we'd like to think insights drawn from this analysis could represent a larger scope (given the sample size), you might find otherwise. To reproduce the analysis, the dataset used in this report can be found here. Please note, to completely ensure patients' privacy, the data has been de-identified to stay HIPAA compliant. Column "display_id" contains an id arbitrarily assigned to each patient for analysis purposes only.

About


Languages

Language:R 100.0%