Data-Science-for-Linguists-2023 / TED-Talk-Rating-Analysis

This is Soobin's term project repository

TED-Talk-Rating-Analysis

Soobin Choi | soc69@pitt.edu | 04/25/2023

Brief description of the project

This project examines the relationship between a talk's popularity and its textual features. It asks whether popularity depends solely on a talk's content or whether other elements, such as the speaker's delivery strategies, tone, and word choice, also help determine it.

Directory

Final Report is the final write-up of the project.

Ted_talk_DataCleaning is where I cleaned the data and extracted the information I needed.

Ted_talk_Analysis is where I tested the hypotheses and carried out further analysis of the data sets.

data_sample contains a snippet of the full data to help readers grasp the data structure.

LICENSE.md contains license information for both the data sets and my Jupyter notebook files.

project_plan.md contains the initial purpose and starting point of the project.

project_progress.md describes the three milestones and the progress I made at each one.

Visitors log

This is the link to my guestbook


License:Other


Languages

Language:Jupyter Notebook 100.0%