This project aims to determine the result (approved or declined) for a loan application based on the information of the applicant.
The data is from the German Credit Data Set which classifies people described by a set of attributes about credit risks.
https://archive.ics.uci.edu/ml/datasets/Statlog+%28German+Credit+Data%29
The data has been downloaded and saved as credit_data.txt.
-
Download sbt.rar and unpack it into C:\project\
-
In the folder: C:\project\sbt, run: $ sbt assembly
-
Copy the executable JAR file from the folder C:\project\sbt\target\scala-2.11\ to the folder C:\project, copy the data file credit_data.txt to the folder C:\project\
-
Run the Spark program: $ spark-submit LoanFraudDetection-assembly-1.0.jar