Joshua-omolewa

Joshua Omolewa's repositories

Stock_streaming_pipeline_project

Built a real-time streaming pipeline to extract stock data using Apache NiFi, Debezium, Kafka, and Spark Streaming. The transformed data is loaded into an AWS Glue database and served through real-time dashboards built in Power BI and Tableau on top of Athena. The pipeline is orchestrated with Airflow.

Language: Python | 17 40

Retailstore_ETL_pipeline_project

Built a data pipeline for a retail store using AWS services. It collects data from the store's transactional database (OLTP) in Snowflake, transforms the raw data (ETL process) with Apache Spark to meet business requirements, and enables data analysts to create data visualizations in Superset. Airflow orchestrates the pipeline.

Language: Python | 5 30

edmonton_weather_aws_serverless_project

A serverless AWS data engineering project that tracks Edmonton weather in near real time using services such as Kinesis Data Firehose, S3, AWS Lambda, AWS Glue, Athena, and IAM.

Language: Python | 200

Job_API_ETL_datapipeline_project

An ETL pipeline built on AWS services that extracts data from a job API, transforms it to meet business requirements, and loads it into an S3 bucket.

Language: Python | 100

take_home_assement

Language: Python | 100

30-Days-Of-Python

The 30 Days of Python programming challenge is a step-by-step guide to learning the Python programming language in 30 days. The challenge may take more than 100 days, so follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw

Language: Python | 000

awesome-interview-questions

:octocat: A curated awesome list of lists of interview questions. Feel free to contribute! :mortar_board:

000

awesome-job-boards

000

ci-cd-project-1

Practicing CI/CD using GitHub Actions

Language: Dockerfile | 000

container-images

Docker images for Debezium. Please log issues in our JIRA at https://issues.redhat.com/projects/DBZ/summary

License: MIT | 000

Covid-19-analysis

COVID-19 Canada data analysis

Language: HTML | License: MIT | 000

curl_commands

000

data-engineer-handbook

This is a repo with links to everything you'd ever want to learn about data engineering

000

Data-Engineering-learning

My data engineering practice

Language: Shell | 000

data-engineering-practice

Data Engineering Practice Problems

000

debezium-examples

Examples for running Debezium (Configuration, Docker Compose files etc.)

License: Apache-2.0 | 000

devops-exercises

Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions

License: NOASSERTION | 000

devops-resources

DevOps resources - Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP

000

docker_ETL_pipeline_project

An ETL project that runs a Python script inside a Docker container. The script extracts CSV data, transforms it by combining the input files into a single file, and loads the result into an output folder. The output CSV file remains available even after the container is shut down.

Language: Python | 000
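The extract, combine, and load steps described for docker_ETL_pipeline_project can be sketched roughly as follows. This is a minimal illustration and not the repository's actual script: the function name `combine_csv_files`, the directory layout, and the assumption that every input file shares the same header row are all hypothetical.

```python
import csv
from pathlib import Path


def combine_csv_files(input_dir: Path, output_file: Path) -> int:
    """Concatenate every CSV in input_dir into output_file.

    Hypothetical helper: assumes all inputs share one header row.
    Returns the number of data rows written (header excluded).
    """
    output_file.parent.mkdir(parents=True, exist_ok=True)
    rows_written = 0
    header = None
    with output_file.open("w", newline="") as out:
        writer = csv.writer(out)
        for path in sorted(input_dir.glob("*.csv")):
            # Never read the file we are currently writing.
            if path.resolve() == output_file.resolve():
                continue
            with path.open(newline="") as src:
                reader = csv.reader(src)
                file_header = next(reader, None)
                if file_header is None:
                    continue  # empty file, nothing to copy
                if header is None:
                    header = file_header
                    writer.writerow(header)  # write the header once
                elif file_header != header:
                    raise ValueError(f"{path.name} has a different header")
                for row in reader:
                    writer.writerow(row)
                    rows_written += 1
    return rows_written
```

One common way to keep the combined file after the container stops is to write it to a directory that is bind-mounted from the host (the `-v` option of `docker run`). Whether the project uses a bind mount or a named volume is an assumption here.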

flink

Git_practice

joshua-omolewa

markdown-here

Miscellaneous

spark-syntax

sqlfluff

system-design-notebook

tech-interview-handbook

Toronto_Climate_API_ETL_project

trino