PuneethaPai / RPPP

RPPP – Reddit Post Popularity Predictor. A project with two goals: 1. Given a Reddit post, predict how popular it is going to be (what its score will be). 2. Showcase a remote working file system with DVC.

RPPP - Reddit Post Popularity Predictor

This project attempts to predict whether a Reddit submission will be popular, based on its features.

We currently provide models for r/MachineLearning only, based on the submission title and body.
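
To make the idea of predicting popularity from the title and body concrete, here is a minimal sketch that uses only the Python standard library. It is not the model shipped in this repository: the naive Bayes classifier, the `label_popular` helper, the score threshold of 50, and the toy posts are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(title, body):
    """Join title and body, lowercase them, and split into word tokens."""
    return re.findall(r"[a-z0-9]+", f"{title} {body}".lower())


def label_popular(score, threshold=50):
    """Binary target: popular if the score reaches the (assumed) threshold."""
    return score >= threshold


class NaiveBayes:
    """Tiny multinomial naive Bayes with add-one smoothing (illustrative only)."""

    def fit(self, docs, labels):
        self.word_counts = {True: Counter(), False: Counter()}
        self.class_counts = Counter(labels)
        for tokens, label in zip(docs, labels):
            self.word_counts[label].update(tokens)
        self.vocab = set(self.word_counts[True]) | set(self.word_counts[False])
        return self

    def log_score(self, tokens, label):
        total = sum(self.word_counts[label].values())
        prior = math.log(self.class_counts[label] / sum(self.class_counts.values()))
        # Words never seen in training are skipped rather than penalized.
        return prior + sum(
            math.log((self.word_counts[label][t] + 1) / (total + len(self.vocab)))
            for t in tokens
            if t in self.vocab
        )

    def predict(self, tokens):
        return self.log_score(tokens, True) > self.log_score(tokens, False)


# Hypothetical toy data: (title, body, score). Not taken from the real dataset.
posts = [
    ("[R] New transformer architecture", "We release code and results", 320),
    ("[P] Open source library release", "Code and benchmarks available", 150),
    ("Help with homework", "How do I install python", 2),
    ("Question about homework", "Why is my loss nan", 5),
]

docs = [tokenize(title, body) for title, body, _ in posts]
labels = [label_popular(score) for _, _, score in posts]
model = NaiveBayes().fit(docs, labels)

prediction = model.predict(tokenize("[R] We release code", "New results"))
print("Predicted popular:", prediction)
```

With real data, the toy list would be replaced by r/MachineLearning submissions, and the threshold would need to be chosen from the actual score distribution.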

DVC Remote Working File System

This project is also an exploration of the DVC remote working file system (WFS) workflow. To set up your remote WFS, read: Remote WFS Setup

Contributing

Contributions Are Very Welcome!

Read the Contribution Guide for more information.

Ideas to work on:

  • Combine the textual and numerical classifiers into one model!
  • Add UI to test if your post is going to be successful!
  • Add MOAR data! (other subreddits, more from r/ML)
  • Improve model performance (there is a lotttt to improve)!
  • Add memes: Add MOAR MEMES


Languages

Python 98.6%, Dockerfile 1.4%