drewconway / data_science_box

Simple scripts to set up a fresh data science box using an Ubuntu 12.04.* LTS 64-bit server running on EC2

Data Science Box

I often have to spin up EC2 instances to do various data science(y) things. These scripts are the result of having done that many times and needing many of the same tools on each box.

Running these scripts will turn an Ubuntu 12.04.* LTS 64-bit server (tested) running on EC2 into a fully functioning data science box. Along with several base development libraries, the scripts install and configure tools including RStudio Server, shiny-server, and the IPython notebook server.

Installation

Stick these two scripts in the same directory on your freshly deployed instance and type:

$ ./data_science_box.sh

Then follow the on-screen instructions to configure the software.
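If the script was copied to the instance without its execute bit, the command above fails with a permission error. The following is a minimal sketch of a wrapper that handles this. The function name `run_installer` is hypothetical and not part of the repository. It only covers the entry-point script, since the name of the second script is not given here.

```shell
# Hypothetical helper (not part of the repository): check that the
# installer exists, make it executable, then run it.
run_installer() {
  script="${1:-./data_science_box.sh}"
  if [ ! -f "$script" ]; then
    echo "missing installer: $script" >&2
    return 1
  fi
  chmod +x "$script"   # the execute bit is often lost when copying files over
  "$script"            # hand over to the installer's own on-screen prompts
}

# Usage, from the directory holding both scripts:
# run_installer ./data_science_box.sh
```

The call is left commented out so that the function can be sourced and inspected before anything is installed.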

Configuration

The script takes care of all the on-box configuration, but if you wish to access RStudio Server, shiny-server, and the IPython notebook server via a browser, you will need to make sure the Security Group for the server you deploy allows inbound traffic to the following ports (defaults):

  • RStudio: 8787
  • shiny-server: 3838
  • IPython notebook: 8888
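One way to open these ports is with the AWS CLI, which is an assumption here: the repository itself does not say how to manage Security Groups, and the console works just as well. The sketch below only prints the `aws ec2 authorize-security-group-ingress` commands so they can be reviewed first. The security group ID and the CIDR range are placeholder values that must be replaced with your own, and the function name `build_ingress_cmds` is hypothetical.

```shell
# Placeholder values (assumptions): substitute your own group ID and
# the IP range that should be allowed to reach the box.
SG_ID="${SG_ID:-sg-0123456789abcdef0}"
MY_CIDR="${MY_CIDR:-203.0.113.10/32}"

# Default ports listed above: RStudio, shiny-server, IPython notebook.
PORTS="8787 3838 8888"

# Print one AWS CLI command per port instead of running it (dry run).
build_ingress_cmds() {
  for port in $PORTS; do
    echo "aws ec2 authorize-security-group-ingress --group-id $SG_ID --protocol tcp --port $port --cidr $MY_CIDR"
  done
}

build_ingress_cmds
```

Once the printed commands look right, they can be run by piping the output to `sh`. Restricting the CIDR to your own address rather than `0.0.0.0/0` keeps the servers from being exposed to the whole internet.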

About

License: GNU General Public License v2.0


Languages

Shell 97.2%, Python 2.8%