uhjish / spark-connect

A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other data sources from Apache Spark.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Common Access Layer for Apache Spark

Predictiveworks supports raw data retrieval from multiple NoSQL and JDBC data sources.

Read requests are supported for the following big data sources:

  • Cassandra
  • Elasticsearch
  • HBase
  • MongoDB
  • Parquet

In addition, this project also provides an increasing number of connector to data sources relevant for analytics:

  • Google Analytics v3
  • Shopify

About

A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other data sources from Apache Spark.


Languages

Language:Scala 100.0%