Arpan-Mishra / reddit-flair-predictor

Created a Reddit flair detector

Home Page: https://flair-prediction-app.herokuapp.com/

Reddit Flair Prediction App: An End-to-End Machine Learning Project

Background

What are Reddit Flairs?

A flair is a 'tag' that can be added to threads posted within a subreddit on Reddit. Flairs tell users which category a post belongs to and let readers filter for specific kinds of posts based on their preferences.

This is a web app that I created to predict which flair should be allotted to a post. Paste the link to the Reddit post into the search bar to get a prediction. The web app is hosted at https://flair-prediction-app.herokuapp.com/.
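Since the app takes a post link as input, one of the first steps is likely to be pulling the post ID out of that link. The app's actual parsing code is not shown here; the following is only a minimal sketch, assuming the standard Reddit URL layout `/r/<subreddit>/comments/<id>/<slug>/`. The function name `extract_submission_id` is hypothetical.

```python
from typing import Optional
from urllib.parse import urlparse


def extract_submission_id(url: str) -> Optional[str]:
    """Return the post ID from a Reddit post URL, or None if it has no /comments/<id> part."""
    # Split the URL path into its non-empty segments.
    parts = [p for p in urlparse(url.strip()).path.split("/") if p]
    # Assumed layout: r/<subreddit>/comments/<id>/<slug>
    if "comments" in parts:
        idx = parts.index("comments")
        if idx + 1 < len(parts):
            return parts[idx + 1]
    return None


if __name__ == "__main__":
    # Illustrative URL, not a real post.
    print(extract_submission_id("https://www.reddit.com/r/india/comments/abc123/example_post/"))
```

Returning `None` for links without a post ID lets the caller show an error message instead of passing a bad link on to the model.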

Automated Testing

The automated testing file can be found here. Upload a .txt file, and the app returns a .json file with the predictions, which you can then download.
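The exact file formats are not described above, so the following is only a sketch of how such a batch step might look. It assumes the .txt file holds one post link per line and that the .json output maps each link to its predicted flair. Both `predict_batch` and `placeholder_predict` are hypothetical names, and the placeholder stands in for the real model.

```python
import json
from typing import Callable, Dict, Iterable


def predict_batch(lines: Iterable[str], predict: Callable[[str], str]) -> Dict[str, str]:
    """Map each non-empty line (assumed to be a post URL) to a predicted flair."""
    predictions = {}
    for line in lines:
        url = line.strip()
        if url:  # skip blank lines
            predictions[url] = predict(url)
    return predictions


def placeholder_predict(url: str) -> str:
    """Hypothetical stand-in for the real model; always returns the same label."""
    return "placeholder-flair"


if __name__ == "__main__":
    # Illustrative input, as it might be read from an uploaded .txt file.
    sample_lines = [
        "https://www.reddit.com/r/india/comments/abc123/example_post/\n",
        "\n",
    ]
    print(json.dumps(predict_batch(sample_lines, placeholder_predict), indent=2))
```

Passing the predictor in as an argument keeps the file handling separate from the model, so the same loop can be checked with a dummy predictor.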

Directory Structure

I built a Flask app, which is hosted on Heroku. The structure of the directory can be found here.

  • Notebooks:
      1. Collecting 'India' subreddit data.ipynb: Data collection notebook
      2. Data Analysis.ipynb: Different data analysis tasks
      3. Flair_Prediction.ipynb: Model development

  • requirements.txt: Contains the requirements needed to run this project.

  • app.py: Contains the Flask app.

  • inference.py: Inference engine that runs the model and returns the predictions.
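To illustrate how a Flask app and an inference module can fit together, here is a minimal sketch. It is not the code from app.py: the `/predict` route, the `url` form field, and the `predict_flair` function are all assumptions made for illustration, and `predict_flair` is a stand-in for whatever inference.py actually exposes.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)


def predict_flair(url: str) -> str:
    """Hypothetical stand-in for the call into inference.py."""
    return "placeholder-flair"


@app.route("/predict", methods=["POST"])  # route name is an assumption
def predict():
    # The form field name "url" is an assumption, not taken from the project.
    url = request.form.get("url", "").strip()
    if not url:
        return jsonify({"error": "missing 'url' form field"}), 400
    return jsonify({"url": url, "flair": predict_flair(url)})


if __name__ == "__main__":
    app.run(debug=True)
```

Keeping the model call behind a single function means the web layer only handles request parsing and response formatting.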

References

https://towardsdatascience.com/scraping-reddit-data-1c0af3040768

Languages

Python 95.5%, Jupyter Notebook 1.9%, C 1.3%, C++ 0.3%, XSLT 0.3%, HTML 0.2%, JavaScript 0.2%, CSS 0.1%, TeX 0.1%, Objective-C 0.0%, Fortran 0.0%, Shell 0.0%, MATLAB 0.0%, Smarty 0.0%, PowerShell 0.0%, Makefile 0.0%