Ken's repositories
Docker-Spark-Setup
Setting up a Spark cluster in a Docker environment for improved repeatability and reliability. This project includes a simple transformation on a dataset containing approximately 31 million rows.
Spark-Standalone-Cluster-Setup
To facilitate the initial setup of Apache Spark, this repository provides a beginner-friendly, step-by-step guide on setting up a master node and two worker nodes.
PyODBC-Data-Import-for-SSMS-AlexTheAnalyst-Ref-
Using Python's pyodbc module to connect to Microsoft SQL Server and import data into SSMS.
Language:Jupyter Notebook000
Real-Time-BTC-USD-Airflow-DAG-Extract-In-Excel
Using yfinance, we grab minute-by-minute BTC-USD data, dump it into PostgreSQL, and link Excel via ODBC for quick analysis!
Language:Python000
SSMS-SQL-PowerQueryM-Functions
Using Power Query M to extract values from Excel workbooks for dynamic insertion into SQL code.