ashenjy / spark-submit-airflow-aws

Python, Spark Submit, Airflow, AWS S3, AWS EMR

spark-submit-airflow-aws

Tech:

Python, PySpark, Airflow, AWS S3, AWS EMR
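
One common way these pieces fit together is that the PySpark script lives on S3 and Airflow asks EMR to run it through `spark-submit` as a cluster step. The sketch below only builds that step definition as a plain Python dict. It is an illustration under assumptions, not the repository's actual DAG: the bucket, the script name, and the `--input` / `--output` script arguments are hypothetical placeholders.

```python
def build_spark_submit_step(script_uri, input_uri, output_uri,
                            name="classify_movie_reviews"):
    """Return one EMR step that runs spark-submit via command-runner.jar."""
    return {
        "Name": name,
        # Keep the cluster alive if this step fails (one of EMR's allowed values).
        "ActionOnFailure": "CONTINUE",
        "HadoopJarStep": {
            "Jar": "command-runner.jar",
            "Args": [
                "spark-submit",
                "--deploy-mode", "client",
                script_uri,
                # Hypothetical arguments parsed by the script itself.
                "--input", input_uri,
                "--output", output_uri,
            ],
        },
    }


# Hypothetical S3 and HDFS locations, for illustration only.
SPARK_STEPS = [
    build_spark_submit_step(
        script_uri="s3://example-bucket/scripts/review_classifier.py",
        input_uri="s3://example-bucket/data/movie_review.csv",
        output_uri="hdfs:///output/predictions",
    )
]
```

A list of this shape is what the `Steps` parameter of boto3's EMR `add_job_flow_steps` call expects, and it is also what Airflow's `EmrAddStepsOperator` takes as its `steps` argument.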

Movie review classifier

  1. Clean the input data.
  2. Use a pre-trained model to make predictions.
  3. Write the predictions to an HDFS output location.
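
The three steps above could be sketched as a single PySpark job along the following lines. This is a minimal sketch under stated assumptions, not the repository's own script: it assumes the input is a CSV file with a header and a `review` column, that the pre-trained model was saved as a Spark ML `PipelineModel` that reads a `review_clean` column and emits the default `prediction` column, and that Parquet is an acceptable output format. The function names, column names, and paths are hypothetical.

```python
import re


def clean_review(text):
    """Strip HTML tags and punctuation, lowercase, and collapse whitespace."""
    text = re.sub(r"<[^>]+>", " ", text or "")        # drop tags such as <br />
    text = re.sub(r"[^a-z0-9\s]", " ", text.lower())  # keep letters and digits
    return " ".join(text.split())


def run_job(input_path, model_path, output_path):
    """Clean reviews, score them with a saved model, and write the result."""
    # Imported here so clean_review stays usable on machines without Spark.
    from pyspark.ml import PipelineModel
    from pyspark.sql import SparkSession
    from pyspark.sql.functions import col, udf
    from pyspark.sql.types import StringType

    spark = SparkSession.builder.appName("movie-review-classifier").getOrCreate()

    # 1. Clean the input data (assumed: CSV with a `review` column).
    reviews = spark.read.csv(input_path, header=True)
    clean_udf = udf(clean_review, StringType())
    cleaned = reviews.withColumn("review_clean", clean_udf(col("review")))

    # 2. Use the pre-trained model (assumed: a saved PipelineModel).
    model = PipelineModel.load(model_path)
    predictions = model.transform(cleaned)

    # 3. Write the predictions to the HDFS output location.
    (predictions.select("review", "prediction")
        .write.mode("overwrite")
        .parquet(output_path))

    spark.stop()
```

A small entry point that calls `run_job` with the three paths would let the file be launched with `spark-submit`. Keeping the cleaning logic in a plain function also means it can be unit-tested without a Spark session.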

License:Apache License 2.0


Languages

Language:Python 100.0%