David Pavlov (big-datai)

Location: United States

Home Page: www.big-datAI.com

David Pavlov's starred repositories

HtmlEntityExtraction

A Spark machine learning project that extracts patterns from XML/HTML web files

Language: Scala | Stargazers: 2 | Issues: 0

restsparkserver

Spark REST server that accepts JSON requests, translates them into SQL queries, and executes them.

Language: Scala | Stargazers: 1 | Issues: 0

spark-rabbitmq

A fork of Stratio/spark-rabbitmq whose only difference is in acknowledgment

Language: Scala | License: Apache-2.0 | Stargazers: 1 | Issues: 0

ds-posts

Tutorials and posts

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0
Language: Scala | License: NOASSERTION | Stargazers: 1 | Issues: 0

scraper

Distributed web scraper built with Kafka, Spark, and HtmlUnit

Language: Java | License: NOASSERTION | Stargazers: 5 | Issues: 0

cassandra2grafana

Cassandra to Grafana connector

Language: Scala | License: NOASSERTION | Stargazers: 1 | Issues: 0

cassandra2InfluxDB

Spark Streaming job for time series analytics in Grafana through InfluxDB

Language: Scala | License: NOASSERTION | Stargazers: 1 | Issues: 0

sparkDocker

A standalone Spark Docker project for easily spinning up Spark

Language: Python | Stargazers: 1 | Issues: 0

spark-ec2

Scripts used to set up a Spark cluster on EC2

Language: Python | License: Apache-2.0 | Stargazers: 391 | Issues: 0

spark

Apache Spark - A unified analytics engine for large-scale data processing

Language: Scala | License: Apache-2.0 | Stargazers: 39018 | Issues: 0
Language: Scala | Stargazers: 1 | Issues: 0

spark4kube

A Docker container that makes it easy to update Spark and config versions for use on Kubernetes

Language: Python | License: NOASSERTION | Stargazers: 4 | Issues: 0

emr-bootstrap-actions

This repository holds the Amazon Elastic MapReduce sample bootstrap actions

Language: Shell | License: NOASSERTION | Stargazers: 614 | Issues: 0