uhjish / spark-connect

A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC databases and other data sources from Apache Spark.

Common Access Layer for Apache Spark

Predictiveworks supports raw data retrieval from multiple NoSQL and JDBC data sources.

Read requests are supported for the following big data sources:

Cassandra
Elasticsearch
HBase
MongoDB
Parquet
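
For orientation, here is a minimal sketch of what a read request against two of these kinds of sources looks like when issued directly through Apache Spark's built-in `DataFrameReader`. It is not this project's API, which is not shown here. It assumes a recent Spark release with `spark-sql` on the classpath. The object name `ReadSketch`, the helper names, and the path `/tmp/events.parquet` are hypothetical.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object ReadSketch {

  // Read a Parquet dataset with Spark's built-in Parquet reader.
  def readParquet(spark: SparkSession, path: String): DataFrame =
    spark.read.parquet(path)

  // Read a single table over JDBC with Spark's built-in JDBC source.
  // The matching JDBC driver must be available on the classpath.
  def readJdbc(
      spark: SparkSession,
      url: String,
      table: String,
      user: String,
      password: String): DataFrame =
    spark.read
      .format("jdbc")
      .option("url", url)
      .option("dbtable", table)
      .option("user", user)
      .option("password", password)
      .load()

  def main(args: Array[String]): Unit = {
    // Local session for experimentation only.
    val spark = SparkSession.builder()
      .appName("read-sketch")
      .master("local[*]")
      .getOrCreate()
    try {
      // Hypothetical path; replace with an existing Parquet dataset.
      val events = readParquet(spark, "/tmp/events.parquet")
      events.printSchema()
    } finally {
      spark.stop()
    }
  }
}
```

Cassandra, Elasticsearch, HBase and MongoDB are not covered by Spark's built-in readers and each requires its own connector library, which is the gap a common access layer is meant to close.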

In addition, this project provides a growing number of connectors to data sources relevant for analytics:

Google Analytics v3
Shopify

Languages

Scala 100.0%