Collecting, Classifying, and Analyzing Textual Data Using R
Data Matters Short Course
August 10-11, 2017 | NC State University
Alison Blaine and Markus Wust
This repository contains materials for the course, covering such topics as:
- web scraping
- using an API
- sentiment analysis
- tokenization
- word frequencies
- document-term matrix
- tf-idf
- visualization
- part-of-speech tagging
- topic modeling
Day 1 files: Web Scraping and API Harvesting
in-class web scraping activity
web scraping practice exercise
web scraping practice instructions
API activity
API practice exercise
API practice instructions
Day 2 files: Text Analysis
sentiment analysis R script
Twitter data file
sentiment analysis practice R script
practice Twitter data
literary analysis R script
literary analysis practice R script