datascience2014's repositories
bigquery-utils
Useful scripts, UDFs, views, and other utilities for migration and data warehouse operations in BigQuery.
airflow-testing-ci-workflow
(project & tutorial) DAG pipeline tests + CI/CD setup
terraform-docs
Generate documentation from Terraform modules in various output formats
cluster_terraform
Use these HashiCorp Terraform scripts to deploy a Looker cluster on the cloud environment of your choice.
docker-airflow
Docker Apache Airflow
dlp-dataflow-deidentification
Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP
python-basics-exercises
Python Basics: A Practical Introduction to Python 3
getting-started-python
Code samples for using Python on Google Cloud Platform
datacatalog-tag-history
Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quality and user behaviour. This solution creates Data Catalog Tags history in BigQuery since Data Catalog keeps only the latest version of metadata for fast searchability.
datacatalog-connectors-hive
Sample code with integration between Data Catalog and Hive data source.
df-ml-anomaly-detection
Streaming Anomaly Detection Solution by using Pub/Sub, Dataflow, BQML & Cloud DLP
google-cloud-python
Google Cloud Client Library for Python
google-cloud-4-words
The Google Cloud Developer's Cheat Sheet
datacatalog-util
A Python package to centralize some Google Cloud Data Catalog scripts. This repo contains commands, such as bulk CSV operations, that help leverage Data Catalog features.
pubsub
This repository contains open-source projects managed by the owners of Google Cloud Pub/Sub.
kafka-connect-mq-source
This repository contains a Kafka Connect source connector for copying data from IBM MQ into Apache Kafka.
hellonode
A Hello World HTTP server in Node, with a Dockerfile and a Jenkinsfile
benchmark
Benchmark data warehouses under Fivetran-like conditions
Useful-Website-Flask
A very useful website written with Flask
professional-services
Common solutions and tools developed by Google Cloud's Professional Services team
dlp-rdb-bq-import
Relational Database Import to BigQuery with Dataflow and DLP API
PublicWorkshops
Used to provide needed code or artifacts for public workshops
content-gc-essentials
Joseph's Google Cloud Essential Labs Repo
la-ace-find-seller
A demo application for the Linux Academy Google Cloud Certified Associate Cloud Engineer exam prep course