EDA aims at summarizing the characteristics of a dataset with statistics (Correlation statistical methods are often used to explore the relationship between variables.) and visualization (like graphs, charts).
EDA helps to achieve the following,
▪Visualization helps to maximize insight into a data set and uncover underlying
structure
▪To get an overview of the data. For instance, answering the questions like How many samples (data points or
observations)? How many features (covariates or predictors or input variables or independent variables)? What are the features?
▪Orient further analysis by assisting you to choose correct methods/approaches
▪Help you to generate hypothesis
▪Spot problems in data
▪Understand properties of the variables (e.g., mean, variance
and outliers)
▪Understand relationships between variables