There are 1 repository under azure-data-lake topic.
A polycloud .NET cloud storage abstraction layer. Provides Blob storage (AWS S3, GCP, FTP, SFTP, Azure Blob/File/Event Hub/Data Lake) and Messaging (AWS SQS, Azure Queue/ServiceBus). Supports .NET 5+ and .NET Standard 2.0+. Pure C#.
Platform Extension Framework: Federated Query Engine
Repository with Sample threat hunting notebooks on Security Event Log Data Sources
Simple cloud only DWH solution architecture.
Collection of Databricks and Jupyter Notebooks
Example of a single node Presto with Azure Data Lake Store (ADLS) and Azure Storage Blob (WASB) access via Hive metastore
Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution
A comprehensive guide to understanding and implementing data management and analytics solutions in the Azure ecosystem using Azure Data Fundamentals.
This demo describes the basic integration between S/4HANA and the Microsoft Common Data Model (Model)
Microsoft Azure Data Lake Store Library for Delphi
Streaming order book data using TD Ameritrade API
A collection of Azure Function to make building Azure Data Factory pipeline simpler and easier.
A ready to use architecture for processing data and performing machine learning in Azure
An E2E solution of the Data Resources on Azure using the Snapshot Serengeti dataset. This E2E solution focuses Azure Synapse Analytics, Power Bi & the Azure Data Factory.
Kafka Connect Connector for ADLS(Azure Data Lake Store)
Fluentd output plugin for Azure Datalake Storage Gen2 (append support)
Scenarios on how you can be GDPR compliant by using Azure services
Azure ARM template to deploy Kafka and Spark clusters in same VNet with ADLS
A collection of utilities for working with Azure Batch, Azure Data Factory, Azure Table Storage and Azure Blob Storage.
Shows how to use an External Hive (SQL Server) along with ADLS Gen 1 as part of a Databricks initialization script that runs when the cluster is created.
Workshop for integrating Dynamics 365 Customer Insights and Azure Data Services
Creates a HDInsight cluster then runs distcp remotely to copy data between blob and/or data lake (ADLS)
Creates an HDInsight cluster that has an external Hive metastore and access to Azure Data Lake Store
Content-addressable Azure Data Lake block store
Azure Data Lake Gen2 storage connectors for Data Culpa - monitor data quality automatically with Data Culpa Validator
Bulk image streaming and upload using Flink (+ Kubernetes), Kafka, Data Lake, and SQL (Provided with React UI and Node server for Demo).
building a real-world data pipeline in Azure Data Factory (ADF) dataset provided by https://www.ecdc.europa.eu/ ingesting data from sources such as HTTP and Azure Blob Storage into Azure Data Lake Gen2 using ADF. transformed data and loaded transformed data using Databricks Notebook Activity in Azure Data Factory (ADF) and load into Azure Data Lake Storage Gen2.
ETL motor racing data project using Azure Databricks, Pyspark and Azure Date Lakes
POC projects working on Cloud Platforms
We have dataset of IPL from 2008 to 2020 and we have to visualize analytics on Power BI dashboard. We have to upload that dataset into data lake. After that we have to process that data through pipeline and produce modeled data in warehouse. So, that we will be able to analyze the data in Power BI through pre-defined dashboards.