DataEngineerOne's starred repositories

pandoc

Universal markup converter

Language:HaskellLicense:NOASSERTIONStargazers:34212Issues:516Issues:7634

kedro

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.

Language:PythonLicense:Apache-2.0Stargazers:9884Issues:108Issues:1967

causalnex

A Python library that helps data scientists to infer causation rather than observing correlation.

Language:PythonLicense:NOASSERTIONStargazers:2224Issues:49Issues:139

klio

Smarter data pipelines for audio.

Language:PythonLicense:Apache-2.0Stargazers:839Issues:20Issues:6

clean-code-ml

:bathtub: Clean Code concepts adapted for machine learning and data science. Now a free video series 😎 https://bit.ly/2yGDyqT

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:712Issues:17Issues:0

qbstyles

QuantumBlack Matplotlib styles

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:358Issues:11Issues:6

kedro-wings

Kedro Wings automatically creates catalog entries to simplify Kedro pipeline writing. See the video here: https://www.youtube.com/watch?v=p4ELo1tqbYY

Language:PythonLicense:MITStargazers:22Issues:2Issues:5

find-kedro

kedro plugin to automatically construct pipelines using pytest style pattern matching

Language:PythonLicense:MITStargazers:21Issues:4Issues:7

steel-toes

a kedro hook to protect against breaking changes to data

Language:PythonLicense:MITStargazers:9Issues:3Issues:4