There is 1 repository under the sparkmllib topic.
Spark Java_Examples for all modules including GraphX
Distributed Search and Recommendation with SpringBoot/ElasticSearch/Spark
A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.
EverAnalyzer is my thesis in the Department of Digital Systems of the University of Piraeus. EverAnalyzer is a platform for collecting, preprocessing, processing and analyzing Big Data from the Twitter platform.
Big Data Analytics Project using Apache Spark for Predicting Severity of Car Accidents in the USA
Big Data Project - SSML - Spark Streaming for Machine Learning
We generate potential customer leads for businesses on Yelp using big data and machine learning
Work-in-progress NBA Game Predictor using Spark
Spark Machine Learning Library - learning and developing Machine Learning algorithms
Wuzzuf data analysis in Java using (SparkSql-Spring-XChart-Spark-ML)
Introduction to Apache Spark.
Developed a recommender system for restaurants by analyzing preprocessed data from the Yelp Dataset. Used the Alternating Least Squares method with matrix factorization and a neighborhood model to train and build the recommender system. Tested the recommender system with multiple rounds of cross-validation and observed a 16% prediction error.
This is a repository I have created to put up some of the knowledge I have gained around Big Data technologies, especially Spark, GraphX, etc.
Utilized SparkML and Scikit-Learn to train several machine learning models for distinguishing fraudulent and legitimate transactions. The machine learning models are then utilized to make predictions on Kafka-generated real-time data streams. Built an interface for displaying these predictions in real time using the Streamlit framework.
This is an example of Linear Regression done in SparkML and using the class PolynomialExpansion.
Intra-course homework and final homework for a Big Data Engineering course. Includes KPMG Hackathon 'University Trends' documentation
Semisupervised classification methods (SSC) with Spark-ML, study and implementation
Predicting the Song Download number, given Artist name and Title of the Song
Created a SparkML RandomForest model to predict total employee compensation. Queried data with SparkSQL and ran PySpark scripts to perform EDA, pre-process data, and train a model achieving a 0.98 R2 score.
Scala Library for extracting useful information from trained Spark Model (DecisionTreeClassificationModel)
Spark MLlib ALS (written in Scala and Java) used in a commodity recommendation system
User, Event, and Predictive Metric Dashboard on 2GB/month of log files from Brackets IDE
Solving Kaggle Titanic with PySpark libraries
IoT anomaly detection using the Spark MLlib library
Introductory Big Data concepts using the Spark framework and different libraries
An implementation of the K-means algorithm using Spark MLlib and Scala
SparkMLlibDeepLearn Scala source algorithm analysis