pradeepppc / PEFIM

implementation of EFIM algorithm in pyspark

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PEFIM

implementation of EFIM algorithm in pyspark

How to run

python3 main.py input.txt output.txt minUtil numPartitions

input.txt - huim dataset
output.txt - output file containing the generated itemsets
minUtil - user specified minimum utility value
numPartitions - number of partitions (parallel)

About

implementation of EFIM algorithm in pyspark

License:MIT License


Languages

Language:Python 60.7%Language:Jupyter Notebook 39.3%