RoryChenXY / DataRepExp_public

A shiny app that was developed to improve the FAIR of research data hosted on data repository.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Welcome to DataRepExp

The Data Repository Explorer, DataRepExp, is an open-source R Shiny application developed to improve the findability, accessibility, interoperability, and reusability (FAIR) of research data held in a data repository.

The application displays standardised metadata across multiple studies including data availability by categories (such as demographics, medical history, imaging data and genomic data) to allow high-level comparison. It enables users to explore and run preliminary analysis from participants that match certain criteria. In addition, it provides features to export reports and aggregated results for data access application purposes.

This demo application was created using simulated health-related data for demonstration purposes. It can be modified and utilized by other data repositories by adopting the discipline-specific metadata schema and common variables.

Deployment

To run the demo app locally, please use the following syntax:


devtools::install_github("RoryChenXY/DataRepExp_public", subdir="datarepexp")

library(datarepexp)

datarepexp::repexp_app()

How to navigate the application

The side menu allows you to switch between tabs, and the menu icon on the top allows you to collapse and expand the menu.

Side Menu

The Summary Tables Tab includes metadata of studies, listed in three tables, and provides a high-level comparison. All tables allow you to search, sort, and filter, and the "CLEAR" button resets the table to its original status.

Summary Tables Tab

The Filters Tab includes filters at both the study level and participant level, so that you can adjust and apply filters to identify studies and participants that match the selected criteria. The "Clear" buttons reset the filters. The Filters Tab also allows you to review and download the filters you have applied, and the study/studies you have identified in the Filters Report page.

Filters Tab1 Filters Tab2

The Visualisation Tab generates results based on the filters you have applied. All plots are organised into sub-tabs by domain.

Visualisation Tab

The Preliminary Analysis Tab generates results based on the filters you have applied. The variable options are separated by categorical and quantitative variables.

Preliminary Analysi Tab

All plots can be downloaded. They have interactive features such as zoom, select, adjust axis, hover for information, reset, etc.

Deployment Notes

To make modifications to the applications for your purposes, R and Shiny programming skill is required.

  1. The application requires both study-level metadata, and particicipant-level data, and a variable information data frame that contains all variable information. Deatail Notes and scripts used to generate data is included in the repository: Data Preparation Notes.
  2. DataRepExp was built in Shiny modules. Modularity makes the app easy to test, maintain, and deploy. The features can be easily further expanded with loose coupling module design: Application Development Notes.
  3. I have created a copy of this application that was modified to add two more variables just as a demonstration to set up your own variables: How to add new variables.
  4. Considering some repositories may hold highly sensitive data, or individual-level data may not be not available, a metadata-only version DataRepExp has also been developed, and relevant code is included here.
  5. The Data Repository Explorer, DataRepExp, is hosted through easy-to-use shinyapps.io, while the DPAU version is hosted on AWS environment using Shiny Server for high availability, scalability, security, and compliance. The detailed deployment instruction can be found here

Acknowledgements

This application was inspired by the visualization tool developed by Dementias Platform UK(DPUK) using PowerBI, then developed for the Dementias Platform Australia (DPAU) in R-Shiny. We acknowledge the generous sharing of best practices and knowledge from DPUK.

Funding

This work is supported by grants from the National Institute on Aging/ National Institute of Health (NIA/NIH) [1RF1AG057531-01] and the Medical Research Council [MRC/T0333771].

Availability and Community Guidelines

The application and associated documentation are open source (MIT License), but we ask that you kindly acknowledge our work. Users and contributors are welcome to contribute, request features, and report bugs through the GitHub repository.

About

A shiny app that was developed to improve the FAIR of research data hosted on data repository.

License:MIT License


Languages

Language:R 96.9%Language:TeX 2.4%Language:CSS 0.7%