Schamnad / Data-Science

My foray into the world of Data Science

Data Science

My foray into the world of Data Science. I am following the curriculum provided in OSDSM and working through the courses one by one. I am currently completing Introduction to Data Science, the Coursera course taught by Bill Howe.

Project 1

Tweet Sentiment Analysis

Introduction

Python scripts for capturing tweets, building a sentiment dictionary, and processing and scoring tweet sentiment. They were written to support clustering tweets by length and sentiment score in this research paper.

Twitter app authentication credentials are required for use of twitterstream.py. Acquire these here.
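
The dictionary-based scoring mentioned in the introduction can be illustrated with a minimal sketch. This is not the code from the repository: the tab-separated `term<TAB>score` dictionary format, the function names `load_sentiment_dict` and `score_tweet`, and the sample data are all assumptions made for illustration. The idea is simply to sum the scores of the words in a tweet that appear in the dictionary, treating unknown words as 0.

```python
import json
import re


def load_sentiment_dict(lines):
    """Parse 'term<TAB>score' lines into a dict (this format is an assumption)."""
    scores = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue
        term, score = line.rsplit("\t", 1)
        scores[term.lower()] = int(score)
    return scores


def score_tweet(text, scores):
    """Sum the scores of known words; words missing from the dictionary count as 0."""
    words = re.findall(r"[a-z']+", text.lower())
    return sum(scores.get(word, 0) for word in words)


# Hypothetical sample data standing in for a dictionary file and captured tweets.
sample_dict = ["good\t3", "bad\t-3", "awesome\t4"]
sample_tweets = [
    '{"text": "What a good, awesome day"}',
    '{"text": "Bad traffic again"}',
    '{"delete": {"status": {"id": 1}}}',  # a record with no "text" field
]

scores = load_sentiment_dict(sample_dict)
results = []
for raw in sample_tweets:
    tweet = json.loads(raw)
    # Records without a "text" field are scored as an empty string, i.e. 0.
    results.append(score_tweet(tweet.get("text", ""), scores))

print(results)  # [7, -3, 0]
```

Scoring records that lack a `text` field as 0 keeps one output value per input line, which may make it easier to line scores up with the captured stream afterwards.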

Project 2

Data Manipulation at Scale: Systems and Algorithms

These are my assignments for Data Manipulation at Scale: Systems and Algorithms, taught by Bill Howe and offered on Coursera in Spring 2016.

Topics: Python NLP on Twitter API, Distributed Computing Paradigm, MapReduce/Hadoop & Pig Script, SQL/NoSQL, Relational Algebra, Experiment design, Statistics, Graphs, Amazon EC2, Visualization.

About

License: MIT License


Languages

Python 54.8%, HTML 35.5%, R 9.7%