This repository contains the course project for the Coursera course Getting and Cleaning Data. The project preprocesses data collected from the accelerometers of the Samsung Galaxy S smartphone.
In this repo, there are three files:
- README.md explains the project.
- CodeBook.md describes the variables, the data, and transformations or work performed to clean up the data.
- run_analysis.R is the R script for the project.

In accordance with the project requirements, the data is downloaded from: https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip

A full description is available at the site where the data was obtained: http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones

Unzipping the archive produces a folder named "UCI HAR Dataset". All processing is done within this folder.
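
The download and unzip step can be sketched in R as follows. This is a minimal sketch, not necessarily what run_analysis.R does: the helper name `fetch_dataset` and the local file name `dataset.zip` are assumptions made for illustration, while the URL and the folder name come from this README.

```r
# URL taken from this README.
data_url <- "https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip"

# Hypothetical helper: downloads and unzips the data only when needed,
# and returns the path to the "UCI HAR Dataset" folder.
fetch_dataset <- function(url = data_url, dest_dir = ".") {
  data_dir <- file.path(dest_dir, "UCI HAR Dataset")
  if (dir.exists(data_dir)) {
    return(data_dir)  # already unzipped, nothing to do
  }
  zip_path <- file.path(dest_dir, "dataset.zip")  # assumed local file name
  if (!file.exists(zip_path)) {
    # mode = "wb" keeps the binary zip intact on Windows
    download.file(url, destfile = zip_path, mode = "wb")
  }
  unzip(zip_path, exdir = dest_dir)
  data_dir
}
```

Calling `fetch_dataset()` from the working directory returns the folder path, which can then be passed to `setwd()` or used as a prefix when reading the data files.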

The R script run_analysis.R does the following:
1. Merges the training and the test sets to create one data set.
2. Extracts only the measurements on the mean and standard deviation for each measurement.
3. Uses descriptive activity names to name the activities in the data set.
4. Appropriately labels the data set with descriptive variable names.
5. From the data set in step 4, creates a second, independent tidy data set with the average of each variable for each activity and each subject.
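
The steps above can be illustrated on a tiny made-up data set using base R only. This is a sketch under stated assumptions, not the contents of run_analysis.R: the toy values, the helper `make_set`, and the renaming rules (dropping `()`, expanding the leading `t` to `time`, replacing `-` with `.`) are all invented for illustration. Only the feature-name pattern (`-mean()` and `-std()`) follows the dataset's naming.

```r
# Toy stand-ins for the dataset files (values are made up for illustration).
features <- c("tBodyAcc-mean()-X", "tBodyAcc-std()-X", "tBodyAcc-max()-X")
activity_labels <- data.frame(id = 1:2, activity = c("WALKING", "SITTING"))

# Hypothetical helper that builds one set: subject, activity id, measurements.
make_set <- function(subject, activity_id, x) {
  df <- data.frame(subject = subject, activity_id = activity_id, x)
  names(df) <- c("subject", "activity_id", features)
  df
}
train <- make_set(c(1, 1, 2), c(1, 1, 2),
                  matrix(c(1, 3, 5, 2, 4, 6, 9, 9, 9), nrow = 3))
test <- make_set(2, 2, matrix(c(7, 8, 9), nrow = 1))

# 1. Merge the training and test sets.
merged <- rbind(train, test)

# 2. Keep only the mean() and std() measurements.
wanted <- grep("-(mean|std)\\(\\)", features, value = TRUE)
selected <- merged[, c("subject", "activity_id", wanted)]

# 3. Replace activity ids with descriptive activity names.
selected$activity <- activity_labels$activity[
  match(selected$activity_id, activity_labels$id)]
selected$activity_id <- NULL

# 4. Give the variables descriptive, syntactic names (assumed naming scheme).
names(selected) <- gsub("\\(\\)", "", names(selected))
names(selected) <- sub("^t", "time", names(selected))
names(selected) <- gsub("-", ".", names(selected))

# 5. Average every variable for each subject and activity.
tidy <- aggregate(. ~ subject + activity, data = selected, FUN = mean)
print(tidy)
```

With the real data, `train` and `test` would instead be assembled from the files inside the "UCI HAR Dataset" folder, and the resulting `tidy` data frame could be saved with `write.table()`.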