alblaine / short-course-2017

Data Matters Short Course 2017: Collecting, Classifying, and Analyzing Textual Data Using R

Collecting, Classifying, and Analyzing Textual Data Using R

Data Matters Short Course

August 10-11, 2017 | NC State University
Alison Blaine and Markus Wust

This repository contains the course materials, which cover topics such as:

  • web scraping
  • using an API
  • sentiment analysis
  • tokenization
  • word frequencies
  • document-term matrix
  • tf-idf
  • visualization
  • part-of-speech tagging
  • topic modeling

Day 1 files: Web Scraping and API Harvesting

slides

in-class web scraping activity
web scraping practice exercise
web scraping practice instructions

API activity
API practice exercise
API practice instructions

Day 2 files: Text Analysis

sentiment analysis R script
Twitter data file

sentiment analysis practice R script
practice Twitter data

literary analysis R script
literary analysis practice R script

Languages

Language: R 100.0%