Arijeet04 / Suicide_Detection_Project_Using_ML

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Suicide_Detection_Project_Using_ML

Introduction

In recent times, the increasing number of student suicides worldwide has highlighted a critical problem that demands our attention. This project, "Suicide Detection for Students using Student Forums," aims to leverage technology to address this pressing issue. By harnessing the power of natural language processing (NLP) and sentiment analysis, we seek to identify potential signs of emotional distress and suicidal tendencies within student forum discussions.

Problem Statement

Student forums serve as platforms where students express their thoughts and challenges. Detecting signs of distress and potential suicide risk within these conversations is a complex task, often overwhelmed by the sheer volume and subtlety of cues.

Solution

Our solution employs advanced NLP techniques to analyze the language used in student forum posts. This approach allows us to identify patterns indicating emotional turmoil, depression, or thoughts of self-harm, enabling early intervention and support.

Objectives

  1. Early Detection: Promptly identify potential distress signals to enable timely intervention.
  2. Privacy Focused: Our system analyzes language patterns while respecting user privacy.
  3. Counselor Notification: Notify school counselors discreetly when concerning language is detected.
  4. Support Ecosystem: Foster effective communication between students and counselors, creating a holistic support network.

Impact

We believe that by combining technology with empathy, we can make a significant impact on students' lives. Early intervention and emotional support are crucial factors in preventing tragic outcomes. Our system is designed to enhance educators' and counselors' ability to reach out when students need it most.

Team Members

🀺 Arijeet Das Chowdhury

Project Overview

Using the power of OneAPI and Machine Learning, in our project we showcase how we can use student forums and conversation done on any microphone to detect if a student is prone to have intriguing thoughts of commiting suicide which is based on analysis of data recorded for past few days. We have also used Natural Language Processing (NLP) to add convenince to recording of data for analysis.

Dependencies Used

β€’ sklearn

β€’ python

β€’ keras

β€’ pandas

β€’ numpy

β€’ nltk

β€’ bilstm

β€’ nodejs

Methodology for Building

πŸ”§ We chose a categorical dataset comsisting of reddit platform's data comprising of 2 columns and 2,33,338 rows available at https://www.kaggle.com/datasets/nikhileswarkomati/suicide-watch. πŸ”§ Then we performed the following on the dataset:

      β€’ PreProcessing of text:
                     1. Punctuation removal
                     2. Stop word removal
                     3. Tokenization
                     4. Lemmatization
                     5. Construction of word vector
                     6. Train BiLSTM model
                     7. Prediction and accuracy

      β€’ Comment text collection

      β€’ Construction of weighted word vector

      β€’ Obtaining comment text representation

πŸ”§ After this we created an online forum portal for students using nodejs, html and css where the students can register, login and use the forum to discuss things openly.

πŸ”§ In the portal we have also integrated functionality of Natural Language Processing which allows the facility of recording voice which might be preferred which then converts to text and the same functionality is implemented.

Implementation

βœ… Student goes and initially registers on a student forum portal after which he/she can login and access the existing posts on the portal.

βœ… The student can post their posts on the forum as well.

βœ… The posts can be made either by typing the text via keyboard or by using a mic.

βœ… If the microphone is used the voice is then converted to text.

βœ… Either ways the text is then stored in the database based on the data for each user individually.

βœ… For each individual the data is then processed by summarising their data of past few days and then this data is sent to the model for processing.

βœ… The model then analyses if the student is having thoughts relating to commiting suicide and if so, the counsellor is informed about the same via the stored contact information.

Challenges Faced

βœ— To decipher a system which is convenient to register data for model prediction.

βœ— To avoid false positives and false negatives.

βœ— Cultural and Contextual variables.

βœ— Model Bias and Fairness.

βœ— Algorithm Complexity.

βœ— Improving the model.

βœ— Integration and intercompatibility of diverse applications.

βœ— Limited datasets available.

βœ— User Acceptance and Compliances.

Innovations & Unique Features

Integrating suicide detection algorithm to a student forum and a microphone helps to record the data in a covenient manner rather than recording data explicitly such like a form (most of the users are reluctant about).

Future Enhancements

The solution when integrated with NLP can be integrated in a mobile application for continous data recording and processing so that there will be continous detection. This methodology is already used by platforms like INSTAGRAM and FACEBOOK for target/niche advertising.

About


Languages

Language:Jupyter Notebook 79.8%Language:JavaScript 7.4%Language:CSS 6.1%Language:EJS 5.8%Language:Python 0.9%