GoogleCloudPlatform / public-datasets-pipelines

Cloud-native, data onboarding architecture for Google Cloud Datasets

Home Page:https://cloud.google.com/solutions/datasets

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Google Cloud Datasets: Data Pipelines and Documentation Set

public-datasets-pipelines

This repository contains the following:

  • Cloud-native, data pipeline architecture for onboarding public datasets to Google Cloud Datasets.
  • Documentation set containing tutorials, samples, and other articles making use of the datasets hosted by the program.

For detailed documentation, please see the Wiki Pages.

Datasets

Here are some of the featured datasets onboarded using this repository/architecture.

About

Cloud-native, data onboarding architecture for Google Cloud Datasets

https://cloud.google.com/solutions/datasets

License:Apache License 2.0


Languages

Language:Python 77.5%Language:HCL 10.7%Language:Jupyter Notebook 10.0%Language:Dockerfile 1.7%Language:Jinja 0.2%