tenniskit / Project-1-DS-Blog-Post

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Table of Contents

  1. Installation
  2. Project Motivation
  3. File Descriptions
  4. Results
  5. Licensing, Authors, and Acknowledgements

Installation

The code should run with no issues using Python versions 3.6.3

Libraries embedded in Anaconda distribution used in the project:

  1. sys
  2. numpy
  3. pandas
  4. matplotlib.pyplot
  5. seaborn
  6. sklearn.cluster
  7. sklearn.preprocessing

External function library not embedded in Anaconda distribution used in the project called "proj1_func_library", which included in:

  1. proj1_func_library.py

Project Motivation

For this project, I was interestested in using Stack Overflow data from 2020 to better understand three questions:

  1. What are top FIVE hottest programming languages in 2020?
  2. What kind of people would like to keep studying new technologies?
  3. How could Stack Overflow segment their visitors according their behaviours on the platform?

File Descriptions

There are 3 jupyter notebook files (*.ipynb) available here to showcase the works for three questions mentioned above respectively with CRISP-PM process:

  1. project_1_question_1.ipynb
  2. project_1_question_2.ipynb
  3. project_1_question_3.ipynb

Those 3 jupyter notebooks required a external library call "proj1_func_library", which included in "proj1_func_library.py".

  1. proj1_func_library.py

Moreover, for easily reading, there are 3 HTML files generated by those 3 jupyter notebook files above. They are:

  1. project_1_question_1.html
  2. project_1_question_2.html
  3. project_1_question_3.html

Results

The main findings of the analysis can be found at the post available on my Medium Medium post here.

Licensing, Authors, Acknowledgements

Must give credit to Stack Overflow for the data. You can find the Licensing for the data and other descriptive information at the Kaggle link available here. Otherwise, feel free to use the code here as you would like!

(version 1.2 in friends_group)

About


Languages

Language:HTML 71.5%Language:Jupyter Notebook 28.4%Language:Python 0.1%