Tilak Patidar's repositories
cdh5_hive_postgres
Hadoop 2.6.0 | Hive 1.1.0 | Postgres 9.5 docker image
pytest-snowflake_bdd
Setup test data and run tests on snowflake in BDD style!
wiki-search
Search engine based on wikipedia articles
airflow-gpg-plugin
Airflow plugin with hooks and operators to work with GPG encryption and decryption.
kafka-connect-transform
This project contains kafka-connect related custom transformation.
ambari-hue-service
Ambari stack service for easily installing and managing Hue on HDP cluster
dask-ingest
A basic JDBC to Parquet data pipeline using dask.
delta-merge
A pyspark tool to merge frequently ingested DELTA or FULL files into a SNAPSHOT.
exceptions_in_scala
Demo scripts for handling exceptions in Scala
hadoop-base-project
Base mvn project to write any hadoop application
incubator-gobblin
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin features integrations with Apache Hadoop, Apache Kafka, Salesforce, S3, MySQL, Google etc.
liquibase-snowflake-poc
Run migrations against snowflake using liquibase
parquet-mr
Mirror of Apache Parquet
rust-examples
Code snippets from https://doc.rust-lang.org/stable/book/
snowflake-dbt-poc
POC for using DBT against snowflake
snowflake-pytest-poc
Running BDD style functional tests against snowflake.
terraform-az-jenkins
Docker image with terraform v1.0.11 and az cli installed. Suitable for running terraform builds in jenkins.
ubuntu-latest-act
Docker image for `ubuntu-latest` runner in github actions for `act` tool