C. Giray's repositories
CDC_stream_data_simulation
Dataflow task simulation with combination of CDC formation and streaming data read/writes
Data-Engineering-on-GCP
A repo containing auto-triggered Airflow ETL activities for datasets located on GCP storage that flattens and creates analytical views on Big Query
data_analysis_tool
A superb tool to analyze data in any given dataset as structured or semi-structured format that located in cloud storage or any sql rdbms. Analysis results can be gathered in seconds
data_engineering_databricks_pyspark
This Repo contains activities related to ETL, data warehouse creation and advanced analytics
etl-s3-airflow-snowflake-powerbi-marketing-data
A end to end data analytics work that consumes data from AWS S3, executing ETL process and creating datamart on Snowflake following to that a PowerBI report created by using datamart
airflow-elt-blueprint
A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.
de_task
data engineering task solution
drawio-github
Drawio GitHub Integration
personal_files
Holding presentation of my career highlights
snowflake_fivetran_vhol
Sample dbt project for the Snowflake + Fivetran Virtual Hands-On Lab
sports_app_data_analysis_tool
A repo that has a python class structure that facilitates analysis of semi-structured (JSON) football dataset.