This repository contains programs from the course "Big Data for Internet Applications" at Politecnico di Torino (academic year 2022-2023).
The course covered processing Big Data with the Map-Reduce paradigm, using Hadoop and Apache Spark (specifically PySpark).
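
As a rough illustration of the Map-Reduce paradigm mentioned above, here is a minimal pure-Python word count sketch. It is not taken from the course material or from this repository: the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are hypothetical, and everything runs on a single machine without Hadoop or Spark, which would distribute the same phases across a cluster.

```python
from collections import defaultdict
from functools import reduce


def map_phase(line):
    """Map: emit one (word, 1) pair per word in the input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: combine the values of each key into a single count."""
    return {key: reduce(lambda a, b: a + b, values)
            for key, values in groups.items()}


def word_count(lines):
    """Run the three phases in sequence (hypothetical helper, single process)."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return reduce_phase(shuffle_phase(pairs))


if __name__ == "__main__":
    print(word_count(["big data", "Big Spark"]))
```

In a real framework the shuffle step is handled by the runtime, so a job typically only defines the map and reduce functions.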