daviddesz / getdataProject

This repository contains my course project for the Coursera course Getting and Cleaning Data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Getting and Cleaning Data: Course Project

Introduction

This repository contains my course project for the Coursera course "Getting and Cleaning data".

Project Description

You should create one R script called run_analysis.R that does the following.

  1. Merges the training and the test sets to create one data set.
  2. Extracts only the measurements on the mean and standard deviation for each measurement.
  3. Uses descriptive activity names to name the activities in the data set
  4. Appropriately labels the data set with descriptive activity names.
  5. Creates a second, independent tidy data set with the average of each variable for each activity and each subject.

Steps to run the script

  1. Download the data source and extract it on your local drive. You'll have a UCI HAR Dataset folder.
  2. Put run_analysis.R in the parent folder of UCI HAR Dataset, then set it as your working directory using setwd() function.
  3. Run source("run_analysis.R"), then it will generate a new file tiny_dataset.txt in your working directory.

Dependencies

The run_analysis.R file depends on reshape2 and data.table packages and will install them automatically.

About

This repository contains my course project for the Coursera course Getting and Cleaning Data


Languages

Language:R 100.0%