Georvic Tur's repositories
AdeIndexer
A command line tool that uses Lucene to build an inverted index on a folder with .txt files and allows for the execution of efficient searches on it.
airflow-GKE-k8sExecutor-helm
Quickly get a kubernetes executor airflow environment provisioned on GKE. Azure Kubernetes Service instructions included also as are instructions for docker-for-mac.
survey_frontend
An online questionnaire showcasing a simple react app
AcidOnSpark-ETL
Delta-Lake, ETL, Spark, Airflow
airflow-toolkit
Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines(DAGs) :desktop_computer: >> [ :rocket:, :ship: ]
CD4ML-Scenarios
Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshops
code-with-engineering-playbook
This is the playbook for "code-with" customer or partner engagements
data_exploration_spark
This repo is only used for learning Spark with Scala
dbt-metabase
Model synchronization from dbt to Metabase
DbtDagParser
Parses Dbt dags into graphs, and graphs into Airflow DAGs.
DO180-apps
DO180 Repository for Sample Applications
freeflow
Apache Airflow development and deployment template to make your development process hopefully simplified. Included ability to do config via file (encrypted!), operator testing, and more!
gitignore
A collection of useful .gitignore templates
k8s-UAP
⚙ Universal Analytics Platform: k8s-based Data-Driven Analytics/Data Science(ML/DeepML) PaaS/SaaS Platform for Data Analyst/Data Engineer/Data Scientist/DataOps/MLOps playground (R&D/MVP/POC/environmints)
microservices-demo
Sample cloud-native application with 10 microservices showcasing Kubernetes, Istio, gRPC and OpenCensus.
orchestra
Advertising Data Lakes and Workflow Automation
platys
A tool for generating docker-compose environments
platys-modern-data-platform
Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....
pyspark-example-project
Example project implementing best practices for PySpark ETL jobs and applications.
semaphore-demo-python-pants
Demo for building Python projects with The Pants Build System.
smart_scraper
Currently, this is only an example about how to use Sphinx
starthinker
Framework for building data workflows provided by Google.
superset
Apache Superset is a Data Visualization and Data Exploration Platform