Given the URL of a Reddit post, predict its flair.
https://reddit-flair-detector-aman.herokuapp.com/index
https://reddit-flair-detector-aman.herokuapp.com/automated_testing
- Scrape data from Reddit using PRAW (the Python Reddit API Wrapper)
- Perform EDA on it to understand the data
- Clean the data
- Vectorize the data using Bag of Words and TF-IDF
- Train ML models on the vectorized data
- Save the fitted vectorizer and the trained model
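
The vectorize, train, and save steps can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: it assumes scikit-learn, uses made-up toy titles and flairs, picks `LogisticRegression` only as an example classifier, and the file names `vectorizer.pkl` and `model.pkl` are hypothetical.

```python
import os
import pickle
import tempfile

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression

# Made-up toy data standing in for the scraped and cleaned posts.
texts = [
    "government announces new budget policy",
    "parliament debates election policy reform",
    "cricket team wins the final match",
    "football league match ends in a draw",
]
flairs = ["Politics", "Politics", "Sports", "Sports"]

# TF-IDF features; CountVectorizer would give plain Bag of Words counts instead.
vectorizer = TfidfVectorizer(stop_words="english")
X = vectorizer.fit_transform(texts)

# Example classifier only (an assumption); any scikit-learn classifier fits here.
model = LogisticRegression(max_iter=1000)
model.fit(X, flairs)

# Save both objects, then reload them the way a serving script would.
with tempfile.TemporaryDirectory() as out_dir:
    vec_path = os.path.join(out_dir, "vectorizer.pkl")  # hypothetical name
    model_path = os.path.join(out_dir, "model.pkl")     # hypothetical name
    with open(vec_path, "wb") as f:
        pickle.dump(vectorizer, f)
    with open(model_path, "wb") as f:
        pickle.dump(model, f)
    with open(vec_path, "rb") as f:
        loaded_vectorizer = pickle.load(f)
    with open(model_path, "rb") as f:
        loaded_model = pickle.load(f)

# New text must go through the same fitted vectorizer before prediction.
new_features = loaded_vectorizer.transform(["election policy debate"])
prediction = loaded_model.predict(new_features)[0]
print(prediction)
```

Saving the vectorizer together with the model matters because the model only understands the feature columns that this particular fitted vectorizer produces.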
- Run the Download+EDA.ipynb notebook
- It will download the data and write it to a .csv file
- After that, run the Models.ipynb notebook
- It will dump the vectorizer and the model into .pkl files
- If you want to make it retrainable, download the notebook as a .py file
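
As a rough sketch of the download-to-CSV step, the helper below writes one row per post. It is not the notebook's code: the function names and the chosen columns are assumptions, although the attribute names (`id`, `title`, `selftext`, `url`, `link_flair_text`) match those on PRAW `Submission` objects.

```python
import csv

# Columns to keep; these mirror attribute names on PRAW Submission objects.
FIELDS = ["id", "title", "selftext", "url", "link_flair_text"]


def submission_to_row(submission):
    """Pick the fields of interest from a submission-like object."""
    # Missing or None values (for example, a post without a flair) become "".
    return {field: getattr(submission, field, "") or "" for field in FIELDS}


def write_submissions(submissions, path):
    """Write one CSV row per submission and return the number of rows."""
    count = 0
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        writer.writeheader()
        for submission in submissions:
            writer.writerow(submission_to_row(submission))
            count += 1
    return count
```

With PRAW, `submissions` could be any listing iterator, such as the result of `reddit.subreddit(name).hot(limit=n)` on an authenticated `praw.Reddit` instance.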
- Models.ipynb dumps two pickle files: the vectorizer and the model
- app.py loads these two .pkl files to predict flairs for new posts
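
The load-and-predict flow can be sketched as below. The helper names `load_pickle` and `predict_flair` are hypothetical and not taken from app.py; the sketch only assumes that the vectorizer exposes `transform` and the model exposes `predict`, as scikit-learn objects do.

```python
import pickle


def load_pickle(path):
    """Load a pickled object; only unpickle files you created or trust."""
    with open(path, "rb") as f:
        return pickle.load(f)


def predict_flair(text, vectorizer, model):
    """Vectorize one piece of text and return the predicted flair."""
    # transform (not fit_transform) reuses the vocabulary learned at training time.
    features = vectorizer.transform([text])
    return model.predict(features)[0]


# Example usage with hypothetical file names:
# vectorizer = load_pickle("vectorizer.pkl")
# model = load_pickle("model.pkl")
# flair = predict_flair("title and body of a new post", vectorizer, model)
```

Loading both files once at startup, rather than on every request, keeps each prediction fast.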
- To learn how to productionize the model, use this link: https://www.youtube.com/watch?v=1umQhC2iWdY&t=536s
https://flask.palletsprojects.com/en/1.1.x/patterns/fileuploads/
https://www.tutorialspoint.com/flask/flask_file_uploading.htm
https://stackoverflow.com/questions/52033852/python-flask-how-to-convert-a-dictionary-object-to-json