Vihag Gupta's repositories
druid-hacks
Collection of hacks for optimally deploying Druid in a cloud-native environment
qubole-aws-terraform-deployment
Terraform modules to integrate Qubole with your AWS Account
qubole-gcp-terraform-deployment
Terraform modules for integrating Qubole with your GCP project
docker-druid
Druid Docker
druid
Column-oriented distributed data store ideal for powering interactive applications
druid-spark-batch
Druid indexing plugin for using Spark in batch jobs
emrLogMiner
Project to perform 'yarn logs -applicationId' style log aggregation on terminated EMR clusters
giraph-yarn
Reference implementations for running Apache Giraph on YARN
incubator-airflow
Apache Airflow (Incubating)
kedro
A Python framework for creating reproducible, maintainable and modular data science code.
mlflow-example
An example MLflow project
MLOps_E2E
MLOps end-to-end examples integrated with GitHub Actions
presto-graphite-emitter
Graphite emitter/sink for Presto. It sends Presto coordinator, worker, and query metrics to Graphite in plaintext format.
sparklens
Qubole Sparklens tool for performance tuning Apache Spark
tpc-ds-dataset-generator
Generate big TPC-DS datasets with Databricks