RMSE

Introduction

Positional accuracy is normally measured as a root mean square error (RMSE). The RMSE is similar to, but not to be confused with, the standard deviation of a statistical sample. 

Explanation

The value of the RMSE is normally calculated from a set of check measurements (coordinate values from an independent source of higher accuracy for identical points). The differences at each point can be plotted as error vectors, as is done in the Figure 1 below for a single measurement. The error vector can be seen as having constituents in the x- and y- directions, which can be recombined by vector addition to give the error vector representing the locational error.

Figure 1: The positional error of a measurement can be expressed as a vector, which in turn can be viewed as the vector addition of its constituents in the x- and y-directions, respectively δx and δy.

For each checkpoint, the error vector has components δx and δy. The observed errors should be checked for a systematic error component, which may indicate a (possibly repairable) lapse in the measurement method. Systematic error has occurred when ∑δx≠0 or ∑δy≠0.

The systematic error δx in x is then defined as the average deviation from the true value:

%5Cdelta%5Coverline%7Bx%7D%20%3D%20%5Cfrac%7B1%7D%7Bn%7D%5Csum_%7Bi%3D1%7D%5En%20%5Cdelta%20x_i.
Equation 1

Analogously to the calculation of the variance and standard deviation of a statistical sample, the root mean square errors mx and my of a series of coordinate measurements are calculated as the square root of the average squared deviations:

m_x%20%3D%20%5Csqrt%7B%5Cfrac%7B1%7D%7Bn%7D%5Csum_%7Bi%3D1%7D%5En%20%5Cdelta%20x_i%5E2%7D~%5Cmbox%7B~and~%7D~%0A%20%20%20m_y%20%3D%20%5Csqrt%7B%5Cfrac%7B1%7D%7Bn%7D%5Csum_%7Bi%3D1%7D%5En%20%5Cdelta%20y_i%5E2%7D%2C
Equation 2

where δx2 stands for δx ⋅ δx. The total RMSE is obtained with the formula

m_%5Cmathrm%7Btotal%7D%20%3D%20%5Csqrt%7Bm_x%5E2%20%2B%20m_y%5E2%7D%2C
Equation 3

which, by the Pythagorean rule, is the length of the average (root squared) vector.

Accuracy tolerances

Many kinds of measurement can be naturally represented by a bell-shaped probability density function p, as depicted in Figure 2(a). This function is known as the normal (or Gaussian) distribution of a continuous, random variable, in the figure indicated as Y . It shape is determined by two parameters: μ, which is the mean expected value for Y , and σ, which is the standard deviation of Y . A small σ leads to a more attenuated bell-shaped function.

Figure 2: (a) Probability density function p of a variable Y , with its mean μ and standard deviation σ. (b) The probability that Y is in the range [μ - σ,μ + σ].

Any probability density function p has the characteristic that the area between its curve and the horizontal axis is equal to 1. Probabilities P can be inferred from p as the area under p’s curve. Figure 2(b), for instance, depicts P(μ-σ ≤ Y ≤ μ-σ), i.e. the probability that the value for Y is within distance σ from μ. In a normal distribution this specific probability for Y is always 0.6826.

The RMSE can be used to assess the probability that a particular set of measurements does not deviate too much from, i.e. is within a certain range of, the “true” value. In the case of coordinates, the probability density function is often considered to be that of a two-dimensional normally distributed variable (see Figure 3). The three standard probability values associated with this distribution are:

  • 0.50 for a circle with a radius of 1.1774 mx around the mean (known as the circular error probable, CEP);

  • 0.6321 for a circle with a radius of 1.412 mx around the mean (known as the root mean square error, RMSE);

  • 0.90 for a circle with a radius of 2.146 mx around the mean (known as the circular map accuracy standard, CMAS).

Figure 3: Probability density p of a normally distributed, two-dimensional variable (X, Y) (also known as a normal, bivariate distribution). In the ground plane, starting from the inside out, are the circles associated with CEP, RMSE and CMAS.

The RMSE provides an estimate of the spread of a series of measurements around their (assumed) “true” values. It is therefore commonly used to assess the quality of transformations such as the absolute orientation of photogrammetric models or the spatial referencing of satellite imagery. The RMSE also forms the basis of various statements for reporting and verifying compliance with defined map accuracy tolerances. An example is the American National Map Accuracy Standard, which states that:

“No more than 10% of well-defined points on maps of 1:20,000 scale or greater may be in error by more than 1/30 inch.”

Normally, compliance to this tolerance is based on at least 20 well-defined checkpoints. The Epsilon Band  is often used for confidence levels.

Learning outcomes

Outgoing relations

Incoming relations