pablo rodriguez defino's repositories
content-dicovery-platform-gcp
A content discovery platform powered by LLMs
apache-beam-ptransforms
A collection of useful PTransforms, utils and classes shared by multiple other projects
apache-beam-streaming-tests
A testing suite for Dataflow streaming pipelines
dataproc-workflowtemplate-cloudfunction
Implements a work queue for Dataproc Worflow Template executions
haproxy-lua-cbr
A simple content based router for HAProxy based on Lua scripting
beam-pabs
Apache Beam is a unified programming model for Batch and Streaming data processing.
bq-throttled-extraction
BigQuery query results extractor, enables throttled propagation to an external location.
bq-utility-scripts
collection of BigQuery related scripts
dataflow-cassandra-to-bigquery
Captures data from a Cassandra instance and sends it to BigQuery
DataflowTemplates
Google-provided Cloud Dataflow template pipelines for solving simple in-Cloud data tasks
devenv-skydns-discovery
simple development environment based on docker containers (service discovery and registry based on etc, registration and skydns)
flink-bigquery-connector
BigQuery integration to Apache Flink's Table API
gprof2dot
Converts profiling output to a dot graph.
kubernetes-devenv
A simple set of scripts to setup and use a local kubernetes based development environment
mutagen-cassandra
Mutate your Cassandra schema for fun and profit.
simple-native-server
simple java spark server built on graalvm
spark-csv-db-loader
Simple Apache Spark job to load a CSV file to a JDBC available DB
terraform-google-managed-instance-group
Modular Google Compute Engine managed instance group for Terraform.
terraform-google-nat-gateway
Modular NAT Gateway on Google Compute Engine for Terraform.
vagrant-env-cassandra-cluster
A virtualized environment to run a cluster of dockerized Cassandra images.