bigdata-vandy

bigdata-vandy

Geek Repo

Github PK Tool:Github PK Tool

bigdata-vandy's repositories

bigdata-vandy.github.io

The blog home of bigdata-vandy

Language:HTMLLicense:GPL-3.0Stargazers:1Issues:0Issues:0

cm_csds

Cloudera Manager Custom Service Descriptors

Language:ShellLicense:GPL-3.0Stargazers:1Issues:3Issues:0

download-stack-dump

Python code to download archived Stack Exchange from https://archive.org/details/stackexchange

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:Jupyter NotebookStargazers:1Issues:0Issues:0

spark-corenlp-demo

A demonstration of the Spark CoreNLP library from databricks

Language:ScalaLicense:MITStargazers:1Issues:0Issues:0

spark-wordcount

A brief demonstration of Spark functionality.

Language:Jupyter NotebookStargazers:1Issues:3Issues:0

spark-xml-parse

Demonstration of XML parsing using the StackOverflow data dump.

Language:ScalaStargazers:1Issues:0Issues:0

data-getters

A collection of simple scripts for pulling data from various and sundry sources.

Language:ShellLicense:GPL-3.0Stargazers:0Issues:3Issues:0

HBase-Standalone

HBase Standalone Tutorial

Stargazers:0Issues:0Issues:0

hdfs

Simple HDFS Demos

Language:Jupyter NotebookStargazers:0Issues:3Issues:0

akka-demo

A basic demo of web-scraping using Akka (Scala-flavor!)

Language:ScalaLicense:MITStargazers:0Issues:0Issues:0
Language:ScalaStargazers:0Issues:0Issues:0

mapreduce-wc

Wordcount with MapReduce, written in native Java

Language:JavaLicense:GPL-3.0Stargazers:0Issues:0Issues:0

password-cracker

A demonstration of distributed computation in Spark.

Language:ScalaStargazers:0Issues:0Issues:0

pyspark_intro_vish

This is a brief Introduction to Pyspark

Language:PythonStargazers:0Issues:0Issues:0

scp-data-to-hdfs

Bash scripts for copying data to the Big Data cluster with SLURM

Language:ShellLicense:MITStargazers:0Issues:0Issues:0

spark-sem-classify

Classify SEM data using Spark-ML

Language:ScalaLicense:MITStargazers:0Issues:0Issues:0

spark-taxi

Analyze NYC-TLC taxi trip data

Language:ScalaLicense:MITStargazers:0Issues:0Issues:0
Language:ScalaLicense:MITStargazers:0Issues:0Issues:0

stack-ex

Parse Stack Exchange data dump

Language:Jupyter NotebookLicense:MITStargazers:0Issues:3Issues:0

tweet-count

Count batch of Tweet records using Java implementation of MapReduce.

Language:JavaLicense:GPL-3.0Stargazers:0Issues:0Issues:0