4  Example 1 - estimating a constant

In this simple numerical example let us attempt to estimate a scalar random constant, a voltage for example. Let us assume that we can obtain measurements of the constant, but that the measurements are corrupted by a 0.1 volt RMS white measurement noise (e.g. our analog-to-digital converter is not very accurate).

Here, we will use data assimilation notation, where the superscript \(\mathrm{t}\) denotes the true state, \(\mathrm{f}\) the forecast (a priori) estimate, and \(\mathrm{a}\) the analysis (a posteriori) estimate.

In this scalar, 1D example, our process is governed by the state equation,

\[\begin{align*} x_{k} & = F x_{k-1}+w_{k}=x_{k-1}+w_{k} \end{align*}\]

and the measurement equation, \[\begin{align*} y_{k} & = H x_{k}+v_{k} = x_{k}^{\mathrm{t}} + v_{k}. \end{align*}\] The state, being constant, does not change from step to step, so \(F=1.\) Our noisy measurement is of the state directly, so \(H=1.\)

The time-update (forecast) equations are, \[\begin{align} x_{k+1}^{\mathrm{f}} & = x_{k}^{\mathrm{a}}\,,\\ P_{k+1}^{\mathrm{f}} & = P_{k}^{\mathrm{a}}+Q \end{align}\] and the measurement update (analysis) equations are \[\begin{align} K_{k+1} & = P_{k+1}^{\mathrm{f}}(P_{k+1}^{\mathrm{f}}+R)^{-1},\\ x_{k+1}^{\mathrm{a}} & = x_{k+1}^{\mathrm{f}} + K_{k+1}(y_{k+1}-x_{k+1}^{\mathrm{f}}),\\ P_{k+1}^{\mathrm{a}} & = (1-K_{k+1})P_{k+1}^{\mathrm{f}}. \end{align}\]

Initialization

Presuming a very small process variance, we let \(Q=10^{-6}\) (i.e., a process noise standard deviation \(\sigma_{w}=10^{-3}\), the default in the code below). We could certainly let \(Q=0\), but assuming a small non-zero value gives us more flexibility in tuning the filter, as we will demonstrate below. Let us assume that we know from experience that the true value of the random constant has a standard Gaussian probability distribution, so we will seed our filter with the guess that the constant is \(0.\) In other words, before starting we let \(x_{0}^{\mathrm{a}}=0.\) Similarly, we need to choose an initial value for \(P_{k}^{\mathrm{a}}\); call it \(P_{0}.\) If we were absolutely certain that our initial state estimate was correct, we would let \(P_{0}=0.\) However, given the uncertainty in our initial estimate \(x_{0}^{\mathrm{a}},\) choosing \(P_{0}=0\) would cause the filter to believe, initially and forever, that \(x_{k}^{\mathrm{a}}=0.\) As it turns out, the precise choice is not critical: we could choose almost any \(P_{0}\neq0\) and the filter would eventually converge. We will start our filter with \(P_{0}=1.\)
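
To see how these choices play out, we can work the first update by hand, using \(P_{0}=1\), \(Q=10^{-6}\), and the true measurement variance \(R=(0.1)^{2}=0.01\) from the first simulation below:

\[\begin{align*} P_{1}^{\mathrm{f}} & = P_{0}+Q = 1+10^{-6}\approx 1,\\ K_{1} & = P_{1}^{\mathrm{f}}(P_{1}^{\mathrm{f}}+R)^{-1}\approx 1/1.01\approx 0.99,\\ x_{1}^{\mathrm{a}} & = x_{1}^{\mathrm{f}}+K_{1}(y_{1}-x_{1}^{\mathrm{f}})\approx 0.99\,y_{1},\\ P_{1}^{\mathrm{a}} & = (1-K_{1})P_{1}^{\mathrm{f}}\approx 0.0099. \end{align*}\]

Because \(P_{0}\gg R\), the first gain is nearly \(1\): the filter essentially adopts the first measurement, and the error variance collapses from \(1\) to roughly \(R\).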

Simulations

To begin with, we randomly choose a scalar constant \(x=-0.37727\) volts. We then simulate \(50\) distinct measurements whose errors are normally distributed around zero with a standard deviation of \(0.1\) (remember, we supposed that the measurements are corrupted by a \(0.1\) volt RMS white measurement noise).

import numpy as np
import matplotlib.pyplot as plt
np.random.seed(1955)

#plt.rcParams['figure.figsize'] = (10, 6)

def KF_ct(n_iter=50, sig_w=0.001, sig_v=0.1):

    # initial parameters
    sz = (n_iter,) # size of array
    x = -0.37727 # truth value
    z = np.random.normal(x,0.1,size=sz) # observations: truth plus noise with true sigma = 0.1
                                        # (sig_v only sets the filter's assumed R below)
    
    Q = sig_w**2 # process variance (1e-6 with the default sig_w)
    
    # allocate space for arrays
    xhat      = np.zeros(sz) # a posteriori estimate of x
    P         = np.zeros(sz) # a posteriori error estimate
    xhatminus = np.zeros(sz) # a priori estimate of x
    Pminus    = np.zeros(sz) # a priori error estimate
    K         = np.zeros(sz) # gain or blending factor
    
    R = sig_v**2 # filter's assumed measurement variance (0.01 by default); change to see effect
    
    # initial guesses
    xhat[0] = 0.0
    P[0] = 1.0
    
    for k in range(1,n_iter):
        # time update
        xhatminus[k] = xhat[k-1]
        Pminus[k] = P[k-1] + Q
    
        # measurement update
        K[k] = Pminus[k]/( Pminus[k] + R )
        xhat[k] = xhatminus[k]+K[k]*(z[k]-xhatminus[k])
        P[k] = (1 - K[k])*Pminus[k]
    
    fig, (ax1, ax2) = plt.subplots(2,1)
    fig.suptitle(f'KF with R = {sig_v**2:g}')
    ax1.plot(z,'ro',label='noisy measurements')
    ax1.plot(xhat,'b-',label='KF posterior estimate')
    ax1.axhline(x,color='g',label='true value')
    ax1.legend()
    
    valid_iter = range(1,n_iter) # Pminus not valid at step 0
    ax2.semilogy(valid_iter,Pminus[valid_iter],label='a priori error estimate')
    ax2.grid()
    ax2.legend()
    ax2.set(xlabel='Iteration', ylabel='Voltage$^2$')
    plt.show()

# use all default values
KF_ct()

In this first simulation we fixed the measurement variance at \(R=(0.1)^{2}=0.01.\) Because this is the “true” measurement error variance, we would expect the “best” performance in terms of balancing responsiveness and estimate variance. This will become more evident in the second and third simulations.

The above figure depicts the results of this first simulation. The true value of the random constant \(x=-0.37727\) is given by the solid green line, the noisy measurements by the red dots, and the filter estimate by the blue curve.

Now, we will see what happens when the filter's assumed measurement error variance \(R\) is increased or decreased by a factor of \(100\), respectively (the simulated measurements themselves are still generated with \(\sigma=0.1\)).

# increase R to 1 (sig_v = 1)
KF_ct(sig_v=1)

# decrease R to 0.0001 (sig_v = 0.01)
KF_ct(sig_v=0.01)
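
Finally, the sig_w parameter gives the same kind of tuning handle on the assumed process variance \(Q\) mentioned in the initialization section. As an illustrative experiment (not part of the original set of simulations), increasing \(Q\) keeps the gain high, so the estimate responds faster but remains noisier:

# illustrative: increase the assumed process variance to Q = 0.01
KF_ct(sig_w=0.1)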

4.1 Conclusion

While the estimation of a constant is relatively straightforward, this example clearly demonstrates the workings of the Kalman filter. In the second figure (\(R=1\)) in particular, the smoothing effect of the filter is evident: because the filter trusts the measurements less, it responds slowly, and the estimate appears considerably smoother than the noisy measurements. Conversely, with \(R=10^{-4}\) the filter trusts each measurement almost completely, and the estimate closely tracks the noisy data. The bottom subplot of each figure shows how quickly the a priori error variance \(P_{k}^{\mathrm{f}}\) converges.
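
That convergence can be checked against the scalar steady state of the variance recursion: setting \(P_{k+1}^{\mathrm{f}}=P_{k}^{\mathrm{f}}=P\) in \(P_{k+1}^{\mathrm{f}}=(1-K_{k})P_{k}^{\mathrm{f}}+Q\) gives \(P^{2}-QP-QR=0\), whose positive root is the limiting a priori variance. A minimal sketch, assuming the default \(Q=10^{-6}\) and \(R=0.01\), compares the closed-form root with the iterated recursion:

import numpy as np

Q, R = 1e-6, 0.01 # default process and measurement variances

# closed form: positive root of P**2 - Q*P - Q*R = 0
P_ss = (Q + np.sqrt(Q**2 + 4*Q*R))/2

# iterate the a priori variance recursion from P0 = 1
P = 1.0
for _ in range(200):
    K = P/(P + R)       # Kalman gain
    P = (1 - K)*P + Q   # next a priori variance

print(P_ss, P) # both approach ~1.0e-4

With these values the limiting a priori standard deviation is about \(0.01\) volts, the same order as the measurement noise itself, which is the level the curve in the bottom subplot is settling toward.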