DmZ / component-hadoop

component-hadoop

Version 2.0-36p

Installs and configures Cloudera Hadoop

Features

  • Install and configure Cloudera Hadoop on multiple compute nodes

Configurations

  • Cloudera Hadoop 4.4.0, CentOS 6.4 (us-east-1/ami-ee698586, us-west-1/ami-0e073d4b), AWS EC2 m3.large, root
  • Cloudera Hadoop 5.1.3, CentOS 6.4 (us-east-1/ami-ee698586, us-west-1/ami-0e073d4b), AWS EC2 m3.large, root

Pre-requisites

  • A configured Cloud Account in the chosen environment
  • Either Chef already installed on the target compute, or launch as root
  • Internet access from target compute:
    • Cloudera CDH and CM distribution
    • S3 bucket with Chef recipes: qubell-starter-kit-artifacts
    • If Chef is not installed, install Chef 10.16.2 using the installer script at http://www.opscode.com/chef/install.sh:
      bash <($WGET -O - http://www.opscode.com/chef/install.sh) -v $CHEF_VERSION
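
The Chef pre-requisite can be checked before launch with a small helper. The following is only a sketch: the function names are hypothetical, it assumes that `$WGET` resolves to plain `wget` and `$CHEF_VERSION` to 10.16.2, and it assumes that finding `chef-client` on the PATH is enough to treat Chef as installed.

```python
import shutil

# Values taken from the pre-requisites above.
CHEF_VERSION = "10.16.2"
INSTALL_SCRIPT_URL = "http://www.opscode.com/chef/install.sh"


def chef_is_installed():
    """Return True if a chef-client executable is found on the PATH.

    Assumption: presence of the binary is treated as "Chef is installed";
    the installed version is not checked.
    """
    return shutil.which("chef-client") is not None


def chef_install_command(version=CHEF_VERSION, wget="wget"):
    """Build the install command from the README with its variables filled in.

    Assumption: $WGET is plain `wget`. The command uses bash process
    substitution, so it has to be run by bash, not by sh.
    """
    return "bash <({0} -O - {1}) -v {2}".format(wget, INSTALL_SCRIPT_URL, version)


if __name__ == "__main__":
    if chef_is_installed():
        print("chef-client found, nothing to install")
    else:
        print("Run as root:")
        print(chef_install_command())
```

The helper only prints the command rather than running it, so it can be used without root privileges to see what would be executed.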

Implementation notes

Configuration parameters

  • input.repository_url: URL of the Cloudera archive repository
  • input.cdh_ami: Amazon AMI ID
  • input.cookbooks_url: URL of the Chef cookbooks tarball
  • input.datanodes: Number of datanodes to launch
  • input.master_hardware: Amazon instance type for master node
  • input.datanode_hardware: Amazon instance type for data nodes
  • input.cloudera_hadoop_version: Hadoop version to install
  • input.cloudera_manager_version: Cloudera Manager version to install
  • input.cloudera_search_version: Cloudera Search version to install
  • input.cloudera_impala_version: Cloudera Impala version to install
  • input.metastore_root_password: Password for metastore
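
A set of inputs can be checked for mistyped parameter names before it is submitted. The sketch below is illustrative only: the parameter names come from the list above, the AMI ID, instance type and Hadoop version are copied from the Configurations section, and every other value (URLs, node count, component versions, password) is a made-up placeholder. The helper name `unknown_inputs` is hypothetical and is not part of this component.

```python
# Parameter names as listed in "Configuration parameters".
KNOWN_INPUTS = {
    "input.repository_url",
    "input.cdh_ami",
    "input.cookbooks_url",
    "input.datanodes",
    "input.master_hardware",
    "input.datanode_hardware",
    "input.cloudera_hadoop_version",
    "input.cloudera_manager_version",
    "input.cloudera_search_version",
    "input.cloudera_impala_version",
    "input.metastore_root_password",
}


def unknown_inputs(params):
    """Return, sorted, the keys of params that are not known parameter names."""
    return sorted(name for name in params if name not in KNOWN_INPUTS)


# Example inputs. Values marked "placeholder" are invented for illustration.
example_inputs = {
    "input.repository_url": "http://example.com/cloudera-repo",      # placeholder
    "input.cdh_ami": "us-east-1/ami-ee698586",                       # from Configurations
    "input.cookbooks_url": "http://example.com/cookbooks.tar.gz",    # placeholder
    "input.datanodes": 3,                                            # placeholder
    "input.master_hardware": "m3.large",                             # from Configurations
    "input.datanode_hardware": "m3.large",                           # from Configurations
    "input.cloudera_hadoop_version": "5.1.3",                        # from Configurations
    "input.cloudera_manager_version": "X.Y.Z",                       # placeholder
    "input.cloudera_search_version": "X.Y.Z",                        # placeholder
    "input.cloudera_impala_version": "X.Y.Z",                        # placeholder
    "input.metastore_root_password": "change-me",                    # placeholder
}

if __name__ == "__main__":
    bad = unknown_inputs(example_inputs)
    print("unknown parameters:", bad if bad else "none")
```

The check looks only at names. Whether a parameter is mandatory or has a default is not stated here, so the sketch does not enforce presence.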
