Atul Bhardwaj's repositories
elasticsearch-jdbc
JDBC importer for Elasticsearch
spark-finance
A library for financial and time series calculations on Apache Spark
sparkMeasure
This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark executor task metrics data.
AzureDatabricksBestPractices
Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs
banana
Banana for Solr - A Port of Kibana
crawler4j
Open Source Web Crawler for Java
data-algorithms-book
MapReduce and Spark Source Code and Scripts for Data Algorithms Book
elasticsearch-definitive-guide
The Definitive Guide to Elasticsearch
hdp21-twitter-demo
'Hello world' Storm topology to analyze financial tweets
hdp22-hive-streaming
'Interactive Query with Apache Hive' webinar materials
hdp22-twitter-demo
Monitor Twitter stream for S&P 500 companies to identify & act on unexpected increases in tweet volume
hive
Mirror of Apache Hive
JSON-java
A reference implementation of a JSON package in Java.
JSqlParser
JSqlParser parses an SQL statement and translate it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor Pattern
kafka
Mirror of Apache Kafka
kafka-connect-mq-source
This repository contains a Kafka Connect source connector for copying data from IBM MQ into Apache Kafka.
kafka-examples
Snippets and small examples demonstrating kafka features and configs
MCW-Migrate-EDW-to-Azure-SQL-Data-Warehouse
MCW Migrate EDW to Azure SQL Data Warehouse
presto
Distributed SQL query engine for running interactive analytic queries against big data sources.
spark-notebook
Interactive and Reactive Data Science using Scala and Spark.
spark-testing-base
Base classes to use when writing tests with Spark
Spark-The-Definitive-Guide
Spark: The Definitive Guide's Code Repository
storm
Mirror of Apache Storm
storm-solr
Storm / Solr Integration
twitter-databricks-analyzer-cicd
A generalized pipeline for extracting topics from tweets in an azure-databricks-eventhub pipeline to find trends accompanied by a travis-ci based ci/cd pipeline