Kutay Ata Şen (kutayatasen)



Company: Meditopia

Location: Istanbul


Kutay Ata Şen's starred repositories

design-patterns-for-humans

An ultra-simplified explanation to design patterns

Stargazers: 44160 | Issues: 0

amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9857 | Issues: 0

re-data

re_data - fix data issues before your users & CEO would discover them 😊

Language: HTML | License: NOASSERTION | Stargazers: 1532 | Issues: 0

awesome-pipeline

A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin

Stargazers: 6071 | Issues: 0

great_expectations

Always know what to expect from your data.

Language: Python | License: Apache-2.0 | Stargazers: 9711 | Issues: 0

ci-cd-for-data-processing-workflow

Cloud Build for Deploying Datapipelines with Composer, Dataflow and BigQuery

Language: Go | License: Apache-2.0 | Stargazers: 63 | Issues: 0

data-engineering-book

Accumulated knowledge and experience in the field of Data Engineering

Stargazers: 836 | Issues: 0

pyjanitor

Clean APIs for data cleaning. Python implementation of R package Janitor

Language: Python | License: MIT | Stargazers: 1334 | Issues: 0

goodreads_etl_pipeline

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

Language: Python | License: MIT | Stargazers: 1260 | Issues: 0

Data-Engineering-HowTo

A list of useful resources to learn Data Engineering from scratch

Stargazers: 3390 | Issues: 0

dbt-core

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Language: Python | License: Apache-2.0 | Stargazers: 9403 | Issues: 0

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Language: Scala | License: Apache-2.0 | Stargazers: 3198 | Issues: 0

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

License: MIT | Stargazers: 26869 | Issues: 0

OpenLineage

An Open Standard for lineage metadata collection

Language: Java | License: Apache-2.0 | Stargazers: 1662 | Issues: 0

react-flask-docker-boilerplate

Boilerplate code for a web application running React and Flask with Docker Compose.

Language: JavaScript | License: Unlicense | Stargazers: 26 | Issues: 0

voila

Voilà turns Jupyter notebooks into standalone web applications

Language: Python | License: NOASSERTION | Stargazers: 5338 | Issues: 0

around-dataengineering

A Data Engineering & Machine Learning Knowledge Hub

Language: Python | Stargazers: 1099 | Issues: 0

mongoengine

A Python Object-Document-Mapper for working with MongoDB

Language: Python | License: MIT | Stargazers: 4200 | Issues: 0

Language: Java | License: Apache-2.0 | Stargazers: 1598 | Issues: 0

ydata-synthetic

Synthetic data generators for tabular and time-series data

Language: Jupyter Notebook | License: MIT | Stargazers: 1387 | Issues: 0

training-data-analyst

Labs and demos for courses for GCP Training (http://cloud.google.com/training).

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 7690 | Issues: 0

data-science-on-gcp

Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1308 | Issues: 0

Language: Java | License: Apache-2.0 | Stargazers: 63 | Issues: 0

Data-Pipeline

Data pipeline is a tool to run Data loading pipelines. It is an open sourced app engine app that users can extend to suit their own needs. Out of the box it will load files from a source, transform them and then output them (output might be writing to a file or loading them into a data analysis tool). It is designed to be modular and support various sources, transformation technologies and output types. The transformations can be chained together to form complex pipelines.

Language: Python | License: Apache-2.0 | Stargazers: 88 | Issues: 0

piranha.core

Piranha CMS is the friendly editor-focused CMS for .NET that can be used both as an integrated CMS or as a headless API.

Language: C# | License: MIT | Stargazers: 1938 | Issues: 0

Cookbook

The Data Engineering Cookbook

License: Apache-2.0 | Stargazers: 13409 | Issues: 0

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Language: Python | License: MIT | Stargazers: 74136 | Issues: 0

free-programming-books

📚 Freely available programming books

License: CC-BY-4.0 | Stargazers: 330264 | Issues: 0

nlp-datasets

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

Stargazers: 5697 | Issues: 0

saleor

Saleor Core: the high performance, composable, headless commerce API.

Language: Python | License: BSD-3-Clause | Stargazers: 20400 | Issues: 0