bigmianhua / Twitter-clustering

This project collects tweets through the Twitter API and uses a sentiment analysis algorithm to divide users into two groups, positive and negative. Within each group, it then uses Spark to run k-means clustering based on the accounts each user follows. Finally, it computes the TF-IDF of each user's tweets and runs k-means clustering on the resulting TF-IDF vectors.
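The last step (TF-IDF followed by k-means) can be illustrated with a minimal sketch. This is not the repository's code: it uses only the Python standard library instead of Spark, and the sample tweets, the smoothed IDF formula, the function names (`tfidf_vectors`, `kmeans`), and the choice of initial centroids are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, one dense TF-IDF vector per document)."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(tok for toks in tokenized for tok in set(toks))
    # Smoothed IDF (an assumed variant), so common terms keep a nonzero weight.
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1.0 for t in vocab}
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        total = len(toks) or 1
        vectors.append([counts[t] / total * idf[t] for t in vocab])
    return vocab, vectors


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, initial_centroids, max_iter=20):
    """Plain Lloyd's algorithm; returns (labels, centroids)."""
    centroids = [list(c) for c in initial_centroids]
    labels = [-1] * len(points)
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(len(centroids)),
                key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # Assignments are stable, so the algorithm has converged.
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical sample data: one concatenated "document" of tweets per user.
tweets = [
    "spark cluster job failed again",
    "spark job finished on the cluster",
    "coffee and bagel for breakfast",
    "bagel with coffee this morning",
]

vocab, vectors = tfidf_vectors(tweets)
# Seed with the first and third documents to keep the run deterministic.
labels, centroids = kmeans(vectors, [vectors[0], vectors[2]])
print(labels)  # [0, 0, 1, 1]
```

In a Spark setting the same two stages would normally be handled by the library's own feature extraction and clustering components rather than hand-written loops; the sketch only shows the shape of the computation.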

