sri's starred repositories
kafka-connect-file-pulse
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
waggle-dance
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
livy-submit
Livy-Submit
metatron-discovery
A powerful and easy way to do big data discovery
tez-ats-import
Import Tez + Hive related entities from ATS
amazon-ebs-autoscale
Don't run out of disk space on your EC2 instance when generating or working with large files. Automatically add EBS volumes to a filesystem mount point in response to disk utilization.
boundary-layer
Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform.
ansible-hortonworks-with-extra-vars
Documentation on running ansible-hortonworks playbooks with extra-vars
iam-policy-json-to-terraform
Small tool to convert an IAM Policy in JSON format into a Terraform aws_iam_policy_document
hadoop-ansible
Install a Hadoop cluster with Ansible
terraform-aws-emr-cluster
Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS
trino-storage
Storage connector for Trino
datasketches-hive
Sketch adaptors for Hive.
hive-funnel-udf
Hive UDFs for funnel analysis
presto-gateway
A load balancer / proxy / gateway for prestodb
jvm-profiler
JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter