There are 0 repository under azure-data-lake-gen2 topic.
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
This project builds an End-to-End Azure Data Engineering Pipeline, performing ETL and Analytics Reporting on the AdventureWorks2017LT Database.
A comprehensive ETL pipeline and sales analysis project leveraging Microsoft Azure and PySpark, designed to optimize e-commerce sales by providing actionable insights through detailed data analysis.
Formula 1 race data engineering project which utilises azure services and databricks to ingest and analyse the data.
This project builds an End-to-End Azure Data Engineering Pipeline, performing ETL and Analytics Reporting on the AdventureWorks2022LT Database.
Tokyo-olympic-azure-data-engineering-end-to-end-project
Azure pipeline for data analytics on Tokyo Olympics data
Development of a Data Pipeline using Azure Synapse
Conducted comprehensive data analysis on Retail Sales data leveraging Azure Core Services, Azure Databricks and Power BI
This project demonstrates an ETL pipeline using Microsoft Azure for IMDb Movie Rating Dataset analysis. It covers data extraction from Azure Blob Storage, transformation with Azure Databricks, and loading into Azure SQL using Azure Data Factory. The pipeline automates insights generation and is a practical example of cloud-based data engineering.
Data Engineering Project - Python, PySpark & SQL - Azure Data Factory (ADF), DataBricks, Synapse Analytics, Azure Data Lake Storage (ADLS) Gen2, Power BI, Tableau and Looker Studio
Data Engineering Project using Tokyo Olympic Data
An end-to-end data engineering pipeline that fetches data from the BingAPI, cleans and transforms it with Azure Databricks.Sentiment Analysis is performed in AzureML and the data is visualized using Tableau.
This project builds an End-to-End Azure Data Engineering Pipeline, performing ETL and Analytics Reporting on the AdventureWorks2022LT Database.
A lightweight toolkit for Azure Data Lake Storage Gen2 operations, featuring AzCopy commands and Databricks integration examples. Includes sample data and notebooks for quick experimentation with data lake architectures.
This repository contains all scripts and notebooks created in the Azure Synapse Analytics course.course.
Real-Time Stock-Market Data Streaming Using Kafka
A cutting-edge data project leverages Azure's suite of services to seamlessly transform raw data from GitHub into actionable insights. Using Azure Data Factory for data ingestion, Databricks for PySpark transformations, Synapse Analytics for advanced analysis, and Power BI for intuitive visualization, this project navigates complex data workflows..