Pavan Patel's repositories
AdventureWorks-Analytics
Analyzing Manufacturing and Inventory Operations of the AdventureWorks Database using Power BI
beginner_de_project
Beginner data engineering project - batch edition
beginner_de_project_stream
Simple stream processing pipeline
bitcoinMonitor
Near real time ETL to populate a dashboard.
change_data_capture
Repo for CDC with debezium blog post
cost_effective_data_pipelines
Cost Efficient Data Pipelines with DuckDB
data_helper
Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/
e2e_datapipeline_test
Example repo to create end to end tests for data pipeline.
efficient_data_processing_spark
Code for "Efficient Data Processing in Spark" Course
generative-ai-for-beginners
12 Lessons, Get Started Building with Generative AI đź”— https://microsoft.github.io/generative-ai-for-beginners/
simple_dbt_project
Code for dbt tutorial
SparkLearning
A comprehensive Spark guide collated from multiple sources that can be referred to learn more about Spark or as an interview refresher.
sql-server-samples
Azure Data SQL Samples - Official Microsoft GitHub Repository containing code samples for SQL Server, Azure SQL, Azure Synapse, and Azure SQL Edge