lenards / spark-cstar-canaries

Spark + C* Canaries

Simple Java applications that use the spark-cassandra-connector to verify deployment configuration.

Note: Provided as is. Intended only as a demonstration for verifying development configurations.

Build

The project uses Maven 3.* to package the jar.

$ mvn clean package

Canary: Simple Use Case

With a DataStax Enterprise cluster, you can submit the jar using dse spark-submit:

$ cd $REPO
$ dse spark-submit --class net.lenards.SparkCanaryCC \
  target/spark-cstar-canaries-0.0.1-SNAPSHOT.jar $(dsetool sparkmaster) 127.0.0.1

The expected output would be:

Count: 9
[CassandraRow{key: 7, value: Seven}, CassandraRow{key: 6, value: Six}, CassandraRow{key: 9, value: Nine}, CassandraRow{key: 12, value: Twelve}, CassandraRow{key: 5, value: Five}, CassandraRow{key: 10, value: Ten}, CassandraRow{key: 16, value: Sixteen}, CassandraRow{key: 1, value: One}, CassandraRow{key: 19, value: Nineteen}]
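The submit command above passes two positional arguments after the jar path. The sketch below shows one way such arguments could be mapped to configuration keys, assuming the first is the Spark master URL (the output of dsetool sparkmaster) and the second is a Cassandra contact host. The class CanaryArgs and the method toSparkSettings are hypothetical names used for illustration only and are not taken from this repository; spark.master and spark.cassandra.connection.host are the standard Spark and spark-cassandra-connector property names.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of this repository: it only illustrates how
// the two command-line arguments could become Spark configuration entries.
final class CanaryArgs {

    // Assumes args[0] is the Spark master URL and args[1] is a Cassandra host.
    static Map<String, String> toSparkSettings(String[] args) {
        if (args.length < 2) {
            throw new IllegalArgumentException(
                "usage: <spark-master-url> <cassandra-host>");
        }
        // LinkedHashMap keeps insertion order, so printing is deterministic.
        Map<String, String> settings = new LinkedHashMap<>();
        settings.put("spark.master", args[0]);
        settings.put("spark.cassandra.connection.host", args[1]);
        return settings;
    }

    public static void main(String[] args) {
        // Example values only; a real run would take them from the command line.
        String[] example = {"spark://127.0.0.1:7077", "127.0.0.1"};
        toSparkSettings(example)
            .forEach((key, value) -> System.out.println(key + "=" + value));
    }
}
```

In an actual Spark application, each entry would typically be applied to a SparkConf with its set(key, value) method before the context is created.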

Canary: Amazon Kinesis + DSE

With DataStax Enterprise 4.6.*, there is currently (as of May 11, 2015) a jar dependency conflict between the Amazon Kinesis Client and the Spark resources bundled with DataStax Enterprise. A step-by-step guide for working around it is provided in the wiki for this repository.

About

License: Apache License 2.0

Languages

Language: Java 100.0%