jjeongin / DrJarvis

πŸ₯ Dr.Jarvis is a medical transcript classifier that helps patients to get their symptoms diagnosed in real-time on a Streamlit-powered web app. Trained by SVM, KNN, and Random Forest models of sklearn.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Dr. Jarvis πŸ’‰

In this project, we trained/finetuned a text classification model to predict the medical specialties based on the transcription text entered by users. The application is deployed on Streamlit.

You can find the Video Demonstration and the Presentation here

Installations

  1. Clone repository and go into the directory.
    git clone https://github.com/Fennec2000GH/StarHack-medical-classification.git

  2. Install required dependencies in Python.
    pip install -r requirements.txt

  3. Download required classifier models as Pickle (.pkl) files. The links to each are given below.

  • svm.pkl

  • knn.pkl

  • rfc.pkl

    Make sure the three (3) Pickle files are directly in the cloned repository root (StarHack-medical-classification/).

  1. Run the app.
    streamlit run app.py

The steps we took to build the app:

  • Downloaded the Medical Transcriptions Dataset from Kaggle
  • Split the data into random train and test subsets
  • Preprocessed the transcription column of text with tokenization
  • Trained by SVM, KNN, Random Forest models from scikit-learn
  • Created the application using Streamlit framework and deployed it

Output Examples (Prediction + Word Cloud)

Important Links

About

πŸ₯ Dr.Jarvis is a medical transcript classifier that helps patients to get their symptoms diagnosed in real-time on a Streamlit-powered web app. Trained by SVM, KNN, and Random Forest models of sklearn.


Languages

Language:Jupyter Notebook 99.5%Language:Python 0.5%