DataCleaning

For preparing the data, data cleaning has to be carried out where the messy data is cleaned to tidy data. According to Hadley Wickman, Tidy data is a standard way of mapping the meaning of s dataset to its structure. Tidy data is a way to describe data that’s organized with a particular structure – a rectangular structure, where each variable has its own column, and each observation has its own row (Wickham 2014).

Outgoing relations