kj2013 / CACTUS_fall2021

CACTUS: Computational Analysis for Communication on Text, User-generated content, and Social networks. Coming up in Winter 2021 at the National University of Singapore.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CACTUS: Computational Analysis for Communication on Text, User-generated content, and Social networks

Taught in Winter 2021 at the National University of Singapore

Professor: Dr. Kokil Jaidka Email: jaidka@nus.edu.sg

Course Description, Goals and Objectives

In this hands-on course students will learn how to manage large textual datasets (e.g. Twitter, YouTube, news stories) to investigate research questions. They will work through a series of steps to collect, organize, analyze and present textual data by using automated tools toward a final project of relevant interest. The course will cover linguistic theory and techniques that can be applied to textual data (particularly from the fields of corpus linguistics and natural language processing).

No prior programming experience is required. Through this course students will gain skills writing Python programs to handle large amounts of textual data and become familiar with one of the key techniques used by data scientists, which is currently one of the most in-demand jobs.

  • This course will be project-based and will provide an introduction to Python programming for collecting, preparing and analyzing text data from various sources including social media (e.g. Twitter), weblogs, online news media and various publicly available archives (e.g. presidential speech archive, congressional sessions).

  • Each week, there will be one tutorial over zoom in which students will report their progress as they work through programming exercises using Jupyter notebooks [A web-based programming environment well suited for data science and class-based assignments].

  • By completing this course students will:

    • gain an understanding of relevant linguistic concepts for the analysis of text
      • understand the field of corpus linguistics and how its concepts and tools can be applied to text analysis questions of relevance to Communication
      • be exposed to a range of techniques from natural language processing and understand how they can be used to improve content analyses
      • gain a basic level of programming proficiency in the Python programming language and have completed a number of programming exercises to build, clean and analysis corpora of text

Forked from COMM313 Spring 2018 Annenberg School for Communication, University of Pennsylvania

Big props to:

Professor: Dr. Matt O'Donnell

Email: mbod@asc.upenn.edu

About

CACTUS: Computational Analysis for Communication on Text, User-generated content, and Social networks. Coming up in Winter 2021 at the National University of Singapore.


Languages

Language:Jupyter Notebook 89.0%Language:HTML 11.0%