Renien / spark-ml-document-classification

An example on using spark :sparkles: ML models to classify docuements

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

article
spark-ml-document-classification

example for document classification

License Travis Build

Summary

An example on using spark ML models to classify docuements

Sample DataSet

Sample News article data is from by www.theguardian.com.

This example uses a very small dataset extracted from few articles. Each new line contains article data consisting of following data format:

{ 
    "bodyText": "..", 
    "webPublicationDate": "25-01-1999", 
    "topics": ["media"] 
}

About

An example on using spark :sparkles: ML models to classify docuements

License:MIT License