frason88 / StackOverflow_Tag_Pred

StackOverflow_Tag_Pred

1. Description:

Stack Overflow is the largest, most trusted online community for developers to learn, share their programming knowledge, and build their careers.

Nearly every programmer uses Stack Overflow in one way or another. Each month, over 50 million developers visit the site, which features questions and answers on a wide range of topics in computer programming. Users ask and answer questions and, through membership and active participation, vote questions and answers up or down and edit them in a fashion similar to a wiki or Digg. As of April 2014, Stack Overflow had over 4,000,000 registered users, and it exceeded 10,000,000 questions in late August 2015. Based on the tags assigned to questions, the eight most discussed topics on the site are Java, JavaScript, C#, PHP, Android, jQuery, Python, and HTML.

2. Problem Statement

Suggest tags for a question based on the content of the question as posted on Stack Overflow.
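Because a question can carry several tags at once, this is naturally framed as a multi-label classification problem. The sketch below shows one way to turn tag strings into binary indicator vectors, which is the usual target format for such models. It assumes tags arrive as space-separated strings; the function names and the sample data are hypothetical and are not taken from this repository's notebook.

```python
def build_tag_vocabulary(tag_strings):
    """Return the sorted list of distinct tags seen across all questions."""
    return sorted({tag for s in tag_strings for tag in s.split()})


def binarize_tags(tag_strings, vocab):
    """Encode each question's tags as a 0/1 row over the tag vocabulary."""
    index = {tag: i for i, tag in enumerate(vocab)}
    rows = []
    for s in tag_strings:
        row = [0] * len(vocab)
        for tag in s.split():
            row[index[tag]] = 1
        rows.append(row)
    return rows


# Hypothetical sample: one space-separated tag string per question.
question_tags = ["python pandas", "java", "python html"]

vocab = build_tag_vocabulary(question_tags)
label_matrix = binarize_tags(question_tags, vocab)

for tags, row in zip(question_tags, label_matrix):
    print(f"{tags!r:20} -> {row}")
```

Each column of `label_matrix` corresponds to one tag in `vocab`, so a model can learn one binary decision per tag from the question text.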

4. Source/Useful Links:

Data Source: https://www.kaggle.com/c/facebook-recruiting-iii-keyword-extraction/data
YouTube: https://youtu.be/nNDqbUhtIRg
Research paper: https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/tagging-1.pdf
Research paper: https://dl.acm.org/citation.cfm?id=2660970&dl=ACM&coll=DL

5. Real World / Business Objectives and Constraints:

  1. Predict as many tags as possible while keeping both precision and recall high.
  2. Incorrect tags could hurt the user experience on Stack Overflow.
  3. There are no strict latency constraints.
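To make the precision and recall objective concrete, the sketch below computes micro-averaged precision, recall, and F1 over predicted tag sets. Micro-averaging is one common choice for multi-label problems with many rare tags; it is used here as an assumption, not as the metric this repository necessarily reports. The function name and the sample tag sets are hypothetical.

```python
def micro_precision_recall_f1(true_tags, predicted_tags):
    """Pool true/false positives and false negatives over all questions."""
    tp = fp = fn = 0
    for truth, pred in zip(true_tags, predicted_tags):
        truth, pred = set(truth), set(pred)
        tp += len(truth & pred)   # tags predicted and correct
        fp += len(pred - truth)   # tags predicted but wrong
        fn += len(truth - pred)   # correct tags that were missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


# Hypothetical ground truth and predictions for three questions.
true_tags = [{"python", "pandas"}, {"java"}, {"html", "css"}]
predicted_tags = [{"python"}, {"java", "android"}, {"html", "css"}]

precision, recall, f1 = micro_precision_recall_f1(true_tags, predicted_tags)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

In this sample, four of the five predicted tags are correct and four of the five true tags are recovered, so precision, recall, and F1 all come out at about 0.8. Predicting more tags tends to raise recall at the cost of precision, which is the trade-off the first objective asks the model to balance.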

About

License:MIT License
