mwhittaker / cluster_tools

Tools for running code on a cluster

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Cluster Tools

This repository contains a miscellany of command line tools that I find useful when running code on a cluster of machines. For example, imagine you just launched a bunch of EC2 instances. Here's what a workflow might look like.

$ ./ec2_ips.sh > hosts.txt                      # List EC2 IP addresses.
$ ./terminals.sh -u ec2-user -i key.pem         # SSH into all machines.
$ ./pssh.sh -u ec2-user -i key.pem provision.sh # Run script on all machines.
$ ./tmux_tail -w out out/*                      # Watch stdout of script.
$ ./tmux_tail -w err err/*                      # Watch stderr of script.

Getting Started

These scripts have a couple of dependencies you'll have to install. On Ubuntu, you can run the following:

$ sudo apt-get install aws-cli jq pssh

You'll also have to configure the aws command line tool. Go to the IAM page on your AWS console. Then, click on Users and create a user and an access key. Then, run aws configure and follow its instructions.

About

Tools for running code on a cluster


Languages

Language:Shell 100.0%