rgs's repositories
airflow-backfill-plugin
A plugin for backfilling task's and dag's through the UI
aws-glue-data-catalog-client-for-apache-hive-metastore
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions
clir
R Package Installer for Command Line Interface
config-files
My collection of .dotfiles, settings and snippets.
docker-spark-k8s-aws
Docker image for running Spark 3 on Kubernetes on AWS
GetDFPData
Repository for package GetDFPData
GetHFData
Repository for CRAN package GetHFData
GetITRData
Development version of GetITRData
hadoop-nativelibs-docker
Build Hadoop native libraries
hive-metastore-docker
Example for article Running Spark 3 with standalone Hive Metastore 3.0
presto-chart
Highly configurable Helm Presto Chart
prestoeventlistener
Implementation to collect queryInfo in S3 using presto event listener
prometheus-kafka-adapter
Use Kafka as a remote storage database for Prometheus (remote write only)
spark-build
Used to build the mesosphere/spark docker image and the DC/OS Spark package
spark_hive_test
Example for article Running Spark 3 with standalone Hive Metastore 3.0