There are 4 repositories under the azure-synapse-analytics topic.
Solution Accelerator to help build Purview custom connectors
Solution accelerator to help build Machine Learning Lineage
Guide to data platforms and tools
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
End-to-end ETL pipeline in the Microsoft Azure cloud - (Jun '24 - Jul '24)
Samples for Industrial IoT Design Patterns
A comprehensive guide to understanding and implementing data management and analytics solutions in the Azure ecosystem using Azure Data Fundamentals.
This project builds an End-to-End Azure Data Engineering Pipeline, performing ETL and Analytics Reporting on the AdventureWorks2017LT Database.
Template to perform CI/CD for Azure Synapse dedicated SQL Pools using Azure DevOps
In this repository, you will find various demos and presentations I have delivered throughout the year. This includes links to the videos, the source code and the data files.
Reusable Python classes that extend open source PySpark capabilities. Example implementations are available in the notebooks of the repo https://github.com/bennyaustin/synapse-dataplatform
Open Log Analytics queries and samples on querying different Azure resources and services. Includes sample Power BI reports
This is a solution accelerator for creating personalized content recommendations based on user activity.
Annotated Microsoft Azure documentation links used throughout day-to-day technical conversations.
Optimize the performance of a data warehouse solution using Azure Synapse Analytics
A modern data platform implemented on Azure Synapse Analytics using ELT Framework - https://github.com/bennyaustin/elt-framework. Data platform infrastructure provisioned using https://github.com/bennyaustin/iac-synapse-dataplatform
An E2E solution of the Data Resources on Azure using the Snapshot Serengeti dataset. This E2E solution focuses on Azure Synapse Analytics, Power BI and Azure Data Factory.
This repo focuses on building a framework that simulates load with multiple queries to answer questions about performance, load testing, resource utilization, etc.
In this project we create an end-to-end data platform, from data ingestion through data transformation and data loading to reporting.
Cross-platform tool for running test cases serially or concurrently and logging the results to a set of tables for review.
Template to perform CI/CD for Azure Synapse serverless SQL Pools using GitHub Actions
This repository includes the demos and code I use to play around with Azure Synapse Analytics
Template to perform CI/CD for Azure Synapse dedicated SQL Pools using GitHub Actions
This sample shows how to leverage Azure OpenAI in Azure Synapse Analytics
Getting started with Azure Synapse and Azure Data Explorer
An end-to-end data engineering project using Azure Synapse Analytics to analyze and transform NYC taxi data.
Homemade database project for Azure Synapse serverless SQL Pool. For use with Azure DevOps.
A complete CI/CD solution for Azure Synapse Link for SQL Server 2022 using GitHub Actions
This is a content and schema crawler tool that receives, updates and imports various kinds of data into an on-premises or cloud-based SQL Server or Azure Synapse Analytics (Azure Data Warehouse SQL Server). As sources it supports SQL Server tables, OData endpoints, CSV files and Excel files. For multiple sources it can run in parallel mode, creating a thread for each connection. The crawler's distinguishing feature is that it creates the target tables itself using the additional information in source.json. For Azure Synapse Analytics it estimates the distribution type and keys. Syncing works entirely without SQL transactions, using a consistency correction algorithm for very frequently updated fact tables. There are 5 syncing algorithms (see Manual/Insert) to choose from, as well as one update algorithm.
Near Real Time Analytics with Azure Synapse Link for Azure Cosmos DB
CI/CD for Azure Synapse Link for SQL Server 2022
This project builds an End-to-End Azure Data Engineering Pipeline, performing ETL and Analytics Reporting on the AdventureWorks2022LT Database.
Python script that cleans the JSON document used by an Azure Data Factory or Azure Synapse Copy Activity to specify the source-sink mapping.
Tokyo-olympic-azure-data-engineering-end-to-end-project