tianyuan09 / ai4ph2022

Welcome to the Artificial Intelligence for Public Health (AI4PH) event in 2022.

Tutorial on Text Analytics Using R (AI4PH 2022)

The tutorial and data challenge materials can be found at: https://bookdown.org/tianyuan09/ai4ph2022/.

This online tutorial will accompany two sessions:

  • Tutorial on text analytics with R
    • data pre-processing
      • regular expressions
      • tokenization
      • stopwords
      • stemming
      • exploratory data analysis
    • supervised learning (classification models)
    • unsupervised learning (topic modelling)
  • Data Challenge using the N2C2 NLP Research Datasets
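The pre-processing steps listed above can be sketched in a few lines of R. This is a minimal illustration, not the tutorial's own code: the example tweets and the short stopword list are made up, and the stemming step assumes the SnowballC package is installed (it is skipped otherwise).

```r
# Toy example text (made up for illustration; not taken from the tutorial data).
tweets <- c(
  "Getting my flu shot today! https://example.org #vaccines",
  "@someone Vaccines are working, cases are dropping."
)

# 1. Regular expressions: lower-case, then strip URLs, @mentions,
#    and anything that is not a letter or whitespace.
clean <- tolower(tweets)
clean <- gsub("https?://\\S+", " ", clean, perl = TRUE)
clean <- gsub("@\\w+", " ", clean, perl = TRUE)
clean <- gsub("[^a-z\\s]", " ", clean, perl = TRUE)

# 2. Tokenization: split each tweet on runs of whitespace.
tokens <- strsplit(trimws(clean), "\\s+", perl = TRUE)

# 3. Stopwords: a tiny hand-written list (an assumption for this sketch;
#    real analyses usually rely on a curated list).
stopwords <- c("my", "are", "the", "a", "is", "and")
tokens <- lapply(tokens, function(x) x[!x %in% stopwords])

# 4. Stemming: Porter stemmer from SnowballC, if that package is available.
if (requireNamespace("SnowballC", quietly = TRUE)) {
  stems <- lapply(tokens, SnowballC::wordStem, language = "porter")
} else {
  stems <- tokens  # fall back to unstemmed tokens
}

print(tokens)
print(stems)
```

Base R is used for the first three steps so the sketch runs without extra packages; the tutorial itself may use different tooling.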

The Twitter Datasets

This repository contains the two Twitter datasets you will use for the tutorial session:

  • TwitterDataforClassification.csv
  • TwitterDataforTopicModelling.csv
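A quick way to take a first look at these files in R might be the following. It is only a sketch: it assumes the CSV files sit in the current working directory, and `read_if_present` is a hypothetical helper introduced here. No column names are assumed, since `str()` simply reports whatever each file contains.

```r
# File names come from the list above; their location in the working
# directory is an assumption.
files <- c("TwitterDataforClassification.csv",
           "TwitterDataforTopicModelling.csv")

# Hypothetical helper: read a CSV if it exists, otherwise return NULL.
read_if_present <- function(path) {
  if (!file.exists(path)) {
    message("File not found: ", path)
    return(NULL)
  }
  read.csv(path, stringsAsFactors = FALSE)
}

datasets <- lapply(files, read_if_present)
names(datasets) <- files

# Print the structure (columns and types) of each dataset that was loaded.
for (name in names(datasets)) {
  if (!is.null(datasets[[name]])) {
    cat("\n", name, "\n")
    str(datasets[[name]])
  }
}
```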

The tutorial site was created with R Markdown and bookdown (https://github.com/rstudio/bookdown).
