Ellie190 / Student-Segmentation

Web-based Clustering Application Using a Shiny Framework for Determining Student Engagement Levels in Virtual Learning Environments

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Student-Segmentation

Web-based Clustering Application Using a Shiny Framework for Determining Student Engagement Levels in Virtual Learning Environments

Application Access

https://ds-analytics.shinyapps.io/Student-Segmentation/

Project Overview

  • Explored two clustering algorithms - Gaussian Mixture Model and K-means
  • Compared and evaluated the clutering algorithms using the following metrics:
    • Clustering Time: How long it takes the algorithm to fit the data
    • Silhoutte Coefficient (SC): Calculates the mean distance between the data points to find the better defined clusters - SC scikit-learn
    • Calinski-Harabasz Index (CHI): Calculates the ratio of the sum of within and between cluster dispersion for all clusters used to find well-defined clusters - CHI scikit-learn
    • Davies-Bouldin Index (DBI): Calculates the average similarity between each cluster and its most similar one, used to indicate well-partitioned clusters - DBI scikit-learn
  • More information on clustering metrics: https://scikit-learn.org/stable/modules/clustering.html
  • Built a R Shiny dashboard with three main functionalities: GMM cluster Analysis, User Guide, Descriptive Analysis

Methodology

Knowledge Discovery in Database (KDD) Methodology
KDD Methodology

Context

  • Higher Educational Institutions are greatly relying on online learning platforms, such as Moodle, Blackboard, Canvas and etc to assess, communicate and share resources with students.
  • These platforms capture and store each user's online interaction data in the system (both students and educators) by generating large and varied data sets providing the opportunity to discover valuable information
  • Opportunity - using clustering algorithms:
    • To determine low engaged students online
    • To understand the different students engaging with online learning platforms
    • To perform student segmentation
    • To provide personalised support to students engaging with the platforms differently

Resources

RStudio Version: 2022.07.1 Build 554
Libraries: tidyverse, mclust, highcharter, shiny, bs4Dash, waiter, shinycssloaders, DT, callr, data.table, rintrojs, readxl, lubridate, oulad
Open University Learning Analytics Dataset (OULAD): https://analyse.kmi.open.ac.uk/open_dataset

Dashboard Snippets

User Modal

User Guide

GMM Data Analysis Tab

Instructional Methods Tab

Student Characteristics Tab

About

Web-based Clustering Application Using a Shiny Framework for Determining Student Engagement Levels in Virtual Learning Environments

License:MIT License


Languages

Language:Jupyter Notebook 90.5%Language:R 9.5%