pquinteroc's repositories

awesome-gcp-certifications

A curated list of resources for learning about Google Cloud Platform certifications and how to prepare for it.

License:NOASSERTIONStargazers:1Issues:0Issues:0

aws-glue-developer-guide

The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by making proposed changes & submitting a pull request.

License:NOASSERTIONStargazers:1Issues:0Issues:0

docker-images

Official source for Docker configurations, images, and examples of Dockerfiles for Oracle products and projects

Language:ShellLicense:UPL-1.0Stargazers:1Issues:0Issues:0

git-secrets

Prevents you from committing secrets and credentials into git repositories

Language:ShellLicense:Apache-2.0Stargazers:1Issues:0Issues:0

kafka-stack-docker-compose

docker compose files to create a fully working kafka stack

Language:ShellLicense:Apache-2.0Stargazers:1Issues:0Issues:0

kubernetes-kafka

Kafka cluster as Kubernetes StatefulSet, plain manifests and config

Language:ShellLicense:Apache-2.0Stargazers:1Issues:0Issues:0

spark-snowflake

Snowflake Data Source for Apache Spark.

License:Apache-2.0Stargazers:1Issues:0Issues:0

wrk

Modern HTTP benchmarking tool

Language:CLicense:NOASSERTIONStargazers:1Issues:0Issues:0

airflow-pagerduty-plugin

An Airflow operator for triggering PagerDuty incidents.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

amazon-redshift-utils

Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

boto3

AWS SDK for Python

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

confluent-kafka-python

Confluent's Apache Kafka Python client

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

License:Apache-2.0Stargazers:0Issues:0Issues:0

divolte-kafka-druid-superset

A proof of concept using Divolte, Kafka, Druid and Superset

Stargazers:0Issues:0Issues:0

docker-druid

Druid Docker

Stargazers:0Issues:0Issues:0

docker-kafka

Kafka (and Zookeeper) in Docker

Language:ShellLicense:Apache-2.0Stargazers:0Issues:0Issues:0

druid

Apache Druid: a high performance real-time analytics database.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

geoscan

Geospatial clustering at massive scale

License:NOASSERTIONStargazers:0Issues:0Issues:0

googleads-python-lib

The Python client library for Google's Ads APIs

License:Apache-2.0Stargazers:0Issues:0Issues:0

hbc

A Java HTTP client for consuming Twitter's realtime Streaming API

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

HiBench

HiBench is a big data benchmark suite.

Language:JavaLicense:NOASSERTIONStargazers:0Issues:1Issues:0

kafka-tutorials

Kafka Tutorials microsite

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ksql

The event streaming database purpose-built for stream processing applications

Language:JavaLicense:NOASSERTIONStargazers:0Issues:1Issues:0

md2googleslides

Generate Google Slides from markdown

License:Apache-2.0Stargazers:0Issues:0Issues:0

memray

Memray is a memory profiler for Python

License:Apache-2.0Stargazers:0Issues:0Issues:0

python-patterns

A collection of design patterns/idioms in Python

Language:PythonStargazers:0Issues:0Issues:0

rubix

Cache File System optimized for columnar formats and object stores

License:Apache-2.0Stargazers:0Issues:0Issues:0

spark-redshift

Redshift data source for Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

spring-cloud-dataflow

A microservices-based Streaming and Batch data processing in Cloud Foundry and Kubernetes

License:Apache-2.0Stargazers:0Issues:0Issues:0