billk97 / big-data-spark

big-data-spark

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Spark commands

text_file = sc.textFile("BORROWERS.TXT")

seperated_file = text_file.map(lambda line: line.split("|"))

seperated_file.first()

borrower_df = seperated_file.toDF()

borrower_df.show()
SELECT
    gender, department, sum(bid)
    FROM
    BORROWRES
    INNER JOIN LOANS ON borrower.bid, loan.bid
    Group by
    gender, department
    Group by
    gender
    Group by
    department
    Group by
    none

About

big-data-spark


Languages

Language:Python 100.0%