Kutay Ata Şen's starred repositories
design-patterns-for-humans
An ultra-simplified explanation of design patterns
amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
awesome-pipeline
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
great_expectations
Always know what to expect from your data.
ci-cd-for-data-processing-workflow
Cloud Build for deploying data pipelines with Composer, Dataflow, and BigQuery
data-engineering-book
Accumulated knowledge and experience in the field of Data Engineering
goodreads_etl_pipeline
An end-to-end GoodReads data pipeline for building a data lake, data warehouse, and analytics platform.
Data-Engineering-HowTo
A list of useful resources to learn Data Engineering from scratch
applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
OpenLineage
An Open Standard for lineage metadata collection
react-flask-docker-boilerplate
Boilerplate code for a web application running React and Flask with Docker Compose.
around-dataengineering
A Data Engineering & Machine Learning Knowledge Hub
mongoengine
A Python Object-Document-Mapper for working with MongoDB
ydata-synthetic
Synthetic data generators for tabular and time-series data
training-data-analyst
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
data-science-on-gcp
Source code accompanying the book Data Science on the Google Cloud Platform, by Valliappa Lakshmanan (O'Reilly, 2017)
Data-Pipeline
Data Pipeline is a tool for running data-loading pipelines. It is an open-source App Engine app that users can extend to suit their own needs. Out of the box, it loads files from a source, transforms them, and outputs them (for example, by writing to a file or loading them into a data analysis tool). It is designed to be modular and to support various sources, transformation technologies, and output types. Transformations can be chained together to form complex pipelines.
piranha.core
Piranha CMS is the friendly, editor-focused CMS for .NET that can be used either as an integrated CMS or as a headless API.
free-programming-books
📚 Freely available programming books
nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)