Github repository for the submission of the "Getting and Cleaning Data" course by the Johns Hopkins Data Science Specialization
The purpose of this project is to demonstrate the ability to collect, work with, and clean a data set.
The data set used here concerns "Human Activity Recognition Using Smartphones Data Set" which can be found at http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones
The following are the project objectives:
You should create one R script called run_analysis.R that does the following.
- Merges the training and the test sets to create one data set.
- Extracts only the measurements on the mean and standard deviation for each measurement.
- Uses descriptive activity names to name the activities in the data set
- Appropriately labels the data set with descriptive variable names.
From the data set in step 4, creates a second, independent tidy data set with the average of each variable for each activity and each subject.
run_analysis.R contains the R script used to read, clean, transform and write the required data. It is to be used with RStudio or the R program itself. Place this R file in the root folder of the data set.
CodeBook.md contains variable description, data description as well as the transformations used to clean the data.