Distance

(In data science) a numerical metric that assesses the dissimilarity or resemblance of two objects or data points.

Introduction

Distance is a numerical metric in data science that assesses the dissimilarity or similarity of two objects or data points. It allows you to compare the closeness or separation of items in a dataset.

There are at least three types of distance in this topic:

  • Euclidean distance is a measure of the straight-line or "as-the-crow-flies" distance between two points in Euclidean space
  • Manhattan distance also known as city block distance or L1 distance, is a measure of the distance between two points in a grid-like space
  • Minkowski distance is a generalized distance metric that includes both the Euclidean distance and the Manhattan distance as special cases.

Examples

Euclidean, manhattan, minkowski

Outgoing relations

  • Distance is a feature of Kmeans