Rakesh's repositories
alerting-kibana-plugin
Open Distro for Elasticsearch Kibana Alerting Plugin
apicurio-registry
An API/Schema registry - stores APIs and Schemas.
Azure_Synapse_Toolbox
Repository of tools/queries for managing and monitoring Azure Synapse.
cdap
An open source framework for building data analytic applications.
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
hadoop
Mirror of Apache Hadoop
hive
Mirror of Apache Hive
kafka
Mirror of Apache Kafka
helm-nifi
Helm Chart for Apache NiFi
hudi
Upserts, deletes, and incremental processing on big data.
kafka-connect-jdbc
Kafka Connect connector for JDBC-compatible databases
ksql
The database purpose-built for stream processing applications.
kylo
Kylo is a data lake management software platform and framework for enabling scalable, enterprise-class data lakes on Apache Hadoop and Spark. Kylo is licensed under Apache 2.0 and contributed by Think Big, a Teradata company.
mmlspark
Microsoft Machine Learning for Apache Spark
opendistro-build
Open Distro for Elasticsearch Build Scripts
OpenSearch
🔎 Open source distributed and RESTful search engine.
pdfbox
Mirror of Apache PDFBox
rapidminer-studio
Easy-to-use visual environment for predictive analytics. No programming required. RapidMiner is easily the most powerful and intuitive graphical user interface for the design of analysis processes. Forget sifting through code! You can also choose to run in batch mode. Whatever you prefer, RapidMiner has it all.
rapidprom-source
Current development of the RapidProM Extension
registry
Schema Registry
schema-registry
Confluent Schema Registry for Kafka
security
Open Distro for Elasticsearch Security plugin
security-advanced-modules
Advanced modules for the Open Distro for Elasticsearch security plugin
security-kibana-plugin
Open Distro for Elasticsearch Security Kibana Plugin
spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
strimzi-kafka-operator
Apache Kafka running on Kubernetes
timescaledb
An open-source time-series SQL database optimized for fast ingest and complex queries. Packaged as a PostgreSQL extension.
TransmogrifAI
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning.
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)