- In this project I am testing whether the posts of a famous Chinese celebrity 'Cai' on his social media Weibo receive abnormally high amount of retweets and investigate the reason behind it.
- Programming Language: Python/Ipython notebook
- Statistics: A/B test(permutation test)
- Data Science: Webscrapping, Data visualization, and text-mining.
- Library used: datascience(a package like Pandas), BeautifulSoup, Numpy, Matplotlib, Jieba, Json, etc.
- Cai's Weibo posts indeed receive abnormally high amount reposts, and after analyzing the accounts that repost his posts, we found those accounts behaves highlly like bots.
- This project helps me get an A for my COGS9: Intro to Data Science class at UCSD
Go to the nbviewer for better rendering!