Repositories under the sparkr topic.
A curated list of awesome Apache Spark packages and resources.
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. :zap:
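A single-node setup like the one this repo describes can be sketched with a short compose file. This is illustrative only and assumes the community jupyter/all-spark-notebook image (which bundles Spark with PySpark, SparkR, and Scala kernels), not necessarily the image the repo itself uses:

```yaml
# Illustrative sketch: JupyterLab + Spark in one container (not the repo's own config)
services:
  spark-lab:
    image: jupyter/all-spark-notebook   # bundles Spark, PySpark, SparkR, Scala
    ports:
      - "8888:8888"                     # JupyterLab UI
      - "4040:4040"                     # Spark application UI
    environment:
      - JUPYTER_ENABLE_LAB=yes          # start JupyterLab instead of the classic notebook
```

Running `docker compose up` then exposes JupyterLab on port 8888 with Spark kernels preinstalled.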
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Real-world Spark pipelines examples
Azure Databricks - Advent of 2020 Blogposts
SparkR workshop for the Jornadas de Usuarios de R (Spanish R Users Conference)
Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston, and New York City using SparkR, SparkSQL, and Azure Databricks, with visualization via ggplot2 and leaflet. The focus is on descriptive analytics, visualization, clustering, time series forecasting, and anomaly detection.
Practice and workshop materials on Big Data and cloud computing using Docker containers and OpenNebula: HDFS, Hadoop, and Spark + R
Slides and lab material for the talk "R for HPC and Big Data" at http://rsummer.data-analysis.at
Fit a Cubist regression model on StackOverflow data and make predictions in a distributed manner with SparkR
Docker images for testing SparkR builds
A curated list of essential cheatsheets for data analysis, visualization and machine learning using R or Python
Big Data workshop with Apache Spark + R from the Databricks cloud
R workloads running at scale on Google Cloud
Course material for the "Encounters with Big Data" course delivered by the UK Data Service at the 2017 Big Data and Analytics Summer School.
This repository contains intermediate-level code for cleaning, exploratory analysis, handling missing data points, outlier detection, and various visualization techniques using the graphics, ggplot2, tidycharts, and ggExtra packages. One part of the script also introduces the SparkR package, which provides a light-weight frontend for using Apache Spark from R. Don't be shy to fork and contribute.
Multiple-Node Standalone Spark with R and Python
BI and Big Data analytics with SparkR, using supervised and unsupervised machine learning techniques. The project's aim is to apply a supervised and an unsupervised machine learning technique to a dataset, test different models/scenarios, interpret the results, make predictions for each model, and visualise the results.
A demonstration of using Spark to explore large datasets with PySpark and SparkR. The files cover loading data, data exploration, and clustering the words of Shakespeare's works.
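The repo's pipeline runs on PySpark/SparkR, but the word-frequency step such an exploration typically starts from can be sketched in plain Python. The function name and sample text below are illustrative, not taken from the repo:

```python
from collections import Counter

def word_frequencies(lines):
    """flatMap-style tokenization followed by a reduceByKey-style count,
    mirroring the classic Spark word-count pipeline in plain Python."""
    counts = Counter()
    for line in lines:
        counts.update(
            word.lower().strip(".,;:!?'\"") for word in line.split() if word
        )
    return counts

# Tiny illustrative corpus
sample = ["To be, or not to be:", "that is the question."]
freq = word_frequencies(sample)
print(freq["to"], freq["be"])  # 2 2
```

In Spark the same shape becomes `flatMap` over lines, `map` to `(word, 1)` pairs, and `reduceByKey` to sum counts across the cluster.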
Data analysis and Model building on large datasets using Hive and Spark
Extra docker images from rocker/tidyverse