EDA

EDA aims at summarizing the characteristics of a dataset with statistics (Correlation statistical methods are often used to explore the relationship between variables.) and visualization (like graphs, charts).

Introduction

EDA helps to achieve the following, 

▪Visualization helps to maximize insight into a  data set and uncover underlying  
structure
▪To get an overview of the data. For instance, answering the questions like How many samples (data points or  
observations)? How many features (covariates or  predictors or input variables or  independent variables)? What are the features?
▪Orient further analysis by assisting you to choose correct methods/approaches 
▪Help you to generate hypothesis 
▪Spot problems in data 
▪Understand properties of the variables (e.g., mean, variance  
and outliers) 
▪Understand relationships between variables 

Outgoing relations

Incoming relations