Michail Paraskevopoulos's repositories
covid19-airtraffic-spark-bigquery
✈ A Spark-based ETL Pipeline for the OpenSky and OpenFlights Datasets
onfleet-datastudio-connector
♾️ A connector to integrate Onfleet analytics with Google Data Studio
the-met-collection-hadoop
🖼️ An Apache Hadoop job that counts the unique objects in each curatorial department of The Met Collection
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language: Python · License: Apache-2.0
Language: Python
heathrow-flights-apache-beam
Sample code for Apache Beam to perform ETL from a stream-processing service (Pub/Sub) to BigQuery using Dataflow as the runner
NCBI-blast-PyAPI
A Python 3 script to interface with the NCBI BLAST protein database.
Language: Python · License: Apache-2.0