ELOHIMSUPREMES / hashtagcluster

hashtagcluster

To Run:

  1. Make sure that you have the required packages: Anaconda 2.7 and tweepy.

  2. Save the JSON auth file in the directory below this one.
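
The auth step can be sketched as follows. This is a minimal illustration, not the code used in etl.py: the key names in `REQUIRED_KEYS` and the helper names `load_credentials` and `build_auth` are assumptions, and the path to the auth file is left to the caller because this README does not name the file.

```python
import json

# Hypothetical key names; the real auth file may use different ones.
REQUIRED_KEYS = (
    "consumer_key",
    "consumer_secret",
    "access_token",
    "access_token_secret",
)


def load_credentials(path):
    """Read the JSON auth file and check that every expected key is present."""
    with open(path) as handle:
        creds = json.load(handle)
    missing = [key for key in REQUIRED_KEYS if key not in creds]
    if missing:
        raise KeyError("auth file is missing keys: " + ", ".join(missing))
    return creds


def build_auth(creds):
    """Turn the loaded credentials into a tweepy OAuth handler."""
    import tweepy  # imported lazily so load_credentials works without tweepy

    auth = tweepy.OAuthHandler(creds["consumer_key"], creds["consumer_secret"])
    auth.set_access_token(creds["access_token"], creds["access_token_secret"])
    return auth
```

Checking the keys up front means a malformed auth file fails with a clear message before any request is sent to Twitter.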

Live Clustering:

  3. Find something that is trending on Twitter.

  4. Change the variable hashtag on line 83 of etl.py.

  5. Start etl.py.

  6. Start cluster.py.

  7. Result files will show up in data/processed_data.
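
If you would rather start both scripts from one place, the live steps above could be wrapped in a small launcher. This is only a sketch under assumptions: `build_commands` and `start_pipeline` are hypothetical helpers that are not part of this repository, both scripts are assumed to take no command-line arguments, and they are assumed to run side by side as long-lived processes. The hashtag variable in etl.py still has to be edited by hand first.

```python
import subprocess
import sys


def build_commands(scripts=("etl.py", "cluster.py")):
    """Return one interpreter command per script, in start order."""
    return [[sys.executable, script] for script in scripts]


def start_pipeline(scripts=("etl.py", "cluster.py")):
    """Start each script as its own process and return the process handles."""
    return [subprocess.Popen(command) for command in build_commands(scripts)]
```

Calling `start_pipeline()` from the repository root would launch etl.py and then cluster.py with the same interpreter that runs the launcher. Each returned handle can be stopped with its `terminate()` method.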

Non-Live Clustering (tester.py):

  3. Change the base file directory to the location of the data set (line 32).

  4. Change the range of the for loop to the number of batch files (line 34).

  5. Result files will show up in the same folder as the batch files.
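
The two settings changed in tester.py (a base directory and a loop range) amount to iterating over numbered batch files. A minimal sketch of that pattern is below. The helper `batch_paths`, the `batch_{}.json` file name template, and numbering from 0 are all assumptions made for illustration; check tester.py for the naming it actually expects.

```python
import os


def batch_paths(base_dir, n_batches, name_template="batch_{}.json"):
    """Build the path of each batch file, numbered 0 to n_batches - 1."""
    return [
        os.path.join(base_dir, name_template.format(index))
        for index in range(n_batches)
    ]
```

For example, `batch_paths("my_data", 3)` yields three paths inside `my_data`, one per batch file, which a loop can then open in order.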

Languages

Python 84.4%, R 15.6%