Python,Data Analysis
In this project, we'll compare two different types of posts from Hacker News, a popular site where technology related stories (or 'posts') are voted and commented upon. The two types of posts we'll explore begin with either Ask HN or Show HN.
Users submit Ask HN posts to ask the Hacker News community a specific question, such as "What is the best online course you've ever taken?" Likewise, users submit Show HN posts to show the Hacker News community a project, product, or just generally something interesting.
We'll specifically compare these two types of posts to determine the following:
- Do Ask HN or Show HN receive more comments on average?
- Do posts created at a certain time receive more comments on average?
- python
It should be noted that the data set we're working with was reduced from almost 300,000 rows to approximately 20,000 rows by removing all submissions that did not receive any comments, and then randomly sampling from the remaining submissions.
Files in this repository:
File / Folder | Description |
---|---|
Hacker News.png | Hacker News Site Image |
hacker_news.csv | Containing data about approximately twenty thousand rows by removing all submissions that did not receive any comments, and then randomly sampling from the remaining submissions |
Exploring Hackers News Posts.ipynb | Analysis of Hacker News Data |
README | Readme file |
Clone the repository into a local machine using
git clone https://github.com/AvinashBolleddula/Exploring-Hackers-News-Posts
These are the prerequisites to run the program.
- python 3.8.3
Avinash B - avinash.bolleddula@gmail.com
Project Link: https://github.com/AvinashBolleddula/Exploring-Hackers-News-Posts