There are 42 repositories under the data-engineer topic.
The best place to learn data engineering. Built and maintained by the data engineering community.
One framework to develop, deploy and operate data workflows with Python and SQL.
Content for architecting a data science platform for products using Luigi, Spark & Flask.
A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.
:sunglasses: A curated list of awesome DataOps tools
The Data Engineering Book - a data engineering book by Thai people, for Thai people
Projects done in the Data Engineer Nanodegree Program by Udacity.com
Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution
Readme for my :octocat: Profile
DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.
Crawls sites to find new content and scrape it
Data Engineering Digest
Data Quest - Data Engineer Learning and Projects
For the Coursera specialization https://www.coursera.org/specializations/gcp-data-machine-learning
Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution
Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA modeling project, I document my steps using PostgreSQL, Postico, and the Command Line to get our DataQuest exercises running out of a Jupyter Notebook.
DataCamp Data Engineer with Python course. 73 hours / 19 courses / 2 skill assessments
Code, Examples, Templates and Scripts for DataWorksSummit 2017 Sydney Talk
An ETL application on AWS using open sales and customer data, available here: https://github.com/camposvinicius/data/blob/main/AdventureWorks.zip. It is a zipped file containing several .csv files, to which we apply transformations.
Learnings from multiple companies in Silicon Valley: Netflix, Facebook, Google, startups
Google Cloud Platform Professional Data Engineer Certification resources ☁️☁️☁️
An ETL pipeline on GCP using open airport code data, available here: https://datahub.io/core/airport-codes/r/airport-codes_zip.zip. It is a zipped .json file, to which we apply transformations.
Huemul BigDataGovernance is a framework that runs on Spark, Hive and HDFS. It supports a corporate single-source-of-data strategy based on data governance best practices. Through the library, tables can be implemented with Primary Key and Foreign Key control on insert and update, along with validation of nulls, text lengths, maximum/minimum values for numbers and dates, unique values and default values. It also allows fields to be classified by the applicability of ARCO rights, to ease compliance with GDPR-style data protection laws, and to record security levels and whether any kind of encryption is applied. Additionally, more complex validation rules can be added on the same table.
Wraps the DB by exposing a REST API for storing and retrieving document info and recommendations
A data engineering platform for maintaining a data ecosystem to support self-driving cars research.
Information relating to topics on Data Engineering, Data Infrastructure, Data Storing, Data Warehouses and Business Analysis. For those interested in both conceptual theory and use case examples for database design and development.
Data Engineering Project with Hadoop HDFS and Kafka
End-to-end data engineering processes for the Nigeria Health Facility Registry (HFR). The project leveraged Selenium, Pandas, PySpark, PostgreSQL and Airflow.
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Data Engineering 🛠️ is like the backbone of data processing 📊, managing data pipelines 🚀, warehouses 🏢, and lakes 🌊. It's the bridge 🌉 between raw data and actionable insights, powering businesses 🚀 with efficient data management and analytics 📈.
Docker-powered starter for geospatial analysis of atmospheric lightning data.