Shahryar Siri's repositories
the-goings-on
A python script that fetches your last 20 Bandcamp collection / wishlist items and generates a music blog (automatic daily data refresh with Github Actions).
Twitter-Streamer-GCP
A streaming pipeline in GCP using the Twitter API, GCE, Pub / Sub, Dataflow, and BigQuery.
Dataverse
Tools and samples to help reporting from Dataverse. Primarily focused on Data Lake based reporting.
Language:TSQLMIT000
Faker-DB-Data
Generate Fake Data and Load Into Database
Language:Python000
Scala-Spark-EMR-MWAA
An example pipeline for running Scala Spark using AWS EMR and MWAA