jklukas / gcp-ingestion

Documentation and implementation for google cloud ingestion pipeline

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Telemetry Ingestion on Google Cloud Platform

A monorepo for documentation and implementation of the Mozilla telemetry ingestion system deployed to Google Cloud Platform (GCP).

The overall architecture is described in docs/architecture along with commentary on design decisions. Individual components are specified under docs and implemented under the various ingestion-* service directories:

  • ingestion-edge: a simple Python service for accepting HTTP messages and delivering to Google Cloud Pub/Sub
  • ingestion-beam: a Java module defining Apache Beam jobs for streaming and batch transformations of ingested messages

About

Documentation and implementation for google cloud ingestion pipeline

License:Mozilla Public License 2.0


Languages

Language:Java 74.6%Language:Python 24.2%Language:Shell 1.0%Language:Dockerfile 0.2%