DataLearns's repositories
OpenRefine
OpenRefine is a free, open-source power tool for working with messy data and improving it
mage-ai
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Flask-MonitoringDashboard
Automatically monitor the evolving performance of Flask/Python web services.
datamodel-code-generator
Pydantic model and dataclasses.dataclass generator for easy conversion of JSON, OpenAPI, JSON Schema, and YAML data sources.
Building-ETL-Pipelines-with-Python
Building ETL Pipelines with Python
datacrafter
NoSQL extract, transform, load (ETL) toolkit with Python
DataSphereStudio
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
dx_Python
Python support package for DEX visualization
yet_another_math_for_DS
Math for data science [Russian]
powerbi-macguyver-toolbox
Power BI report .pbip templates and patterns to create special visuals, address specific problems, and have adventures.
ydata-quality
Data Quality assessment with one line of code
Udacity-Data-Engineer-nanodegree
Classwork projects and homework completed for the Udacity Data Engineering Nanodegree
FabricAdventureWorksLakehouse
This repo shows how to build a Microsoft Fabric Lakehouse Data platform, a unified data management and analytics solution using Azure services. The platform supports data ingestion, storage, processing, analysis, governance, security, and compliance. The repo provides best practices and patterns for the platform.
rudder-airflow-provider
RudderStack provider for Apache Airflow
awesome-opendata-software
Awesome list of software tools related to open data: data catalogs, ingestion tools, data prep tools, and so on
Sales-Performance-Dashboard
Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.
textual
Textual is a Rapid Application Development framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and (coming soon) a web browser!
project-based-learning
Curated list of project-based tutorials
OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Rath
Next-generation platform for automated exploratory data analysis and visualization.
data-product-workshop
This repository contains materials for a workshop on discovering and defining data products.
datacube-core
Open Data Cube analyses continental scale Earth Observation data through time
Python-for-Engineers-Course
A course on using Python for engineering calculations.
The-Python-Graph-Gallery
A website displaying hundreds of charts made with Python
LakehouseToPowerBI
Architectural design for incorporating a Data Lakehouse architecture with an Enterprise Power BI Deployment
json-toolkit
"the best opensource converter I've found across the Internet" -- dene14
sqlmesh
SQLMesh is a DataOps framework that brings the benefits of DevOps to data teams. It enables data scientists, analysts, and engineers to efficiently run and deploy data transformations written in SQL or Python.
data-engineering-wiki
The best place to learn data engineering. Built and maintained by the data engineering community.