There are 1 repository under data-lakes topic.
Projects done in the Data Engineer Nanodegree Program by Udacity.com
Documentation for Getting Up and Running w/ indexed.xyz Data
This is a repository to hold the files and notebooks produced throughout my Udacity's Nanodegree Data Engineering program.
A Semantic Data Reservoir for Heterogeneous Datasets
Discussion of DTF software architecture Repository
Udacity Data Engineering Nanodegree - Project #4
Follow along with materials in the book "Modern Data Architectures with Python: A practical guide to building and deploying data pipelines, data warehouses and data lakes" (Lipp, 2023)
A Search Join is a join operation which extends a user-provided table with additional attributes based on a large corpus of heterogeneous data originating from the Web or corporate intranets.