19  Example 1: One-dimensional EKI

We implement here the original iterative ensemble Kalman inversion (EKI) as formulated in (Iglesias, Law, and Stuart 2013).

Recall the inverse problem: recover the unknown \(u\) from noisy measurements \(y\) related by

\[ y = \mathcal{G}(u) + \eta, \] where the noise \(\eta \sim \mathcal{N}(0, \Gamma).\)

Start by introducing a pseudo-time step \(h = 1/N\) (in the experiments below, \(h\) is fixed independently of the number of iterations \(N\)), and then propagate an ensemble \(\{ u_n^{(j)}\}\) of \(J\) particles (ensemble members) from “time” \(nh\) to \((n+1)h\) according to

\[  u_{n+1}^{(j)} =  u_n^{(j)} + C^{up}(u_n) \left[ C^{pp}(u_n) + \frac{1}{h} \Gamma \right]^{-1} \left( y_{n+1}^{(j)} - \mathcal{G}(u_n^{(j)}) \right), \]

where \(y_{n+1}^{(j)}\) denotes the data presented to particle \(j\) (the implementation below simply takes \(y_{n+1}^{(j)} = y\), without artificial perturbation), and

\[\begin{align} C^{pp}(u) &= \frac{1}{J-1} \sum_{j=1}^{J} \left( \mathcal{G}(u^{(j)}) - \hat{\mathcal{G}} \right) \otimes \left( \mathcal{G}(u^{(j)}) - \hat{\mathcal{G}} \right) \\ C^{up}(u) &= \frac{1}{J-1} \sum_{j=1}^{J} \left( u^{(j)} - \hat{u} \right) \otimes \left( \mathcal{G}(u^{(j)}) - \hat{\mathcal{G}} \right) \\ \hat{u} &= \frac{1}{J} \sum_{j=1}^{J} u^{(j)}, \qquad \hat{\mathcal{G}} = \frac{1}{J} \sum_{j=1}^{J} \mathcal{G}(u^{(j)}) . \end{align}\]
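For the scalar linear case \(\mathcal{G}(u) = G u\) treated in Section 19.1 below, the empirical covariances reduce to \(C^{up}(u_n) = G\, C_n\) and \(C^{pp}(u_n) = G^2 C_n\), where \(C_n\) is the ensemble variance, so the update simplifies to

\[ u_{n+1}^{(j)} = u_n^{(j)} + \frac{h\, G\, C_n}{h\, G^2 C_n + \gamma}\left( y - G u_n^{(j)} \right). \]

The code in Section 19.1 implements a slightly simplified variant of this update: it drops the \(h G^2 C_n\) term from the denominator and inflates the ensemble variance by an additive offset \(\delta\) (the parameter delt).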

import numpy as np
import functools
import matplotlib.pyplot as plt
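The experiments below draw a random truth and random initial ensembles; to make the runs reproducible, one can optionally fix NumPy's seed (not done in the original):

np.random.seed(0)  # optional: fix the random seed for reproducible experiments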

19.1 Implement the one-dimensional EKI for a linear forward operator \(\mathcal{G}\)

def eki_one_dim_lin(m_0, C_0, N, G, gamma, y, delt, h, J):
    # Inputs:
    # -------
    # m_0, C_0: mean and variance of the initial ensemble
    # N:        number of iterations
    # G:        one-dimensional (scalar) linear forward operator
    # gamma:    variance of the noise in the data
    # y:        observed data
    # delt:     additive offset (inflation) of the ensemble variance
    # h:        discretization step
    # J:        number of particles (ensemble members)
    #
    # Outputs:
    # -------
    # U: (J x N) matrix with the computed particles for each iteration
    # m: vector of length N with the mean of the particles
    # C: vector of length N with the variance of the particles

    m = np.zeros(N)
    C = np.zeros(N)
    U = np.zeros((J, N))

    # Construct initial ensemble and its estimators
    U[:, 0] = np.random.normal(m_0, np.sqrt(C_0), J)
    m[0] = np.mean(U[:, 0])
    C[0] = np.var(U[:, 0], ddof=1)

    for n in range(1, N):
        # Push the last iterate through the forward operator:
        G_u = G * U[:, n-1]
        # Simplified scalar linear update with variance inflation delt:
        U[:, n] = U[:, n-1] + h * (C[n-1] + delt) * G / gamma * (y - G_u)

        m[n] = np.mean(U[:, n])
        C[n] = np.var(U[:, n], ddof=1)

    return U, m, C
# Set parameters
J = 10          # ensemble size
gamma = 1       # noise variance
m_0 = 0         # initial ensemble mean
C_0 = 9e-1      # initial ensemble variance
m_true = 0
c_true = C_0
G = 1.5         # linear forward operator
N = 10000       # number of iterations
h = 1/100       # step size
delt = 1        # variance inflation

# Construct data under the true parameter (noise-free observation)
u_true = np.random.normal(m_true, np.sqrt(c_true))
y = G * u_true

U, m, c = eki_one_dim_lin(m_0, C_0, N, G, gamma, y, delt, h, J)
it = N
iterations = list(range(1, it+1))
plt.xlabel('Iteration number n')
plt.ylabel('Relative error')
plt.loglog(iterations, np.abs(u_true - m) / np.abs(u_true), "r", label=r'$|u^\dagger - m_n|/|u^\dagger|$')
plt.legend(loc="upper right")
plt.show()

it = 300
iterations = list(range(1, it+1))
plt.xlabel('Iteration number n')
plt.ylabel('Covariance')
plt.plot(iterations, c[:it], "r", label=r'$c_n$')
plt.plot(iterations, np.cumsum(c[:it]) / iterations, "g", label=r'$n^{-1}\sum_{k=1}^{n} c_k$')
plt.legend(loc="upper right")
plt.show()
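The covariance plot suggests that the ensemble variance \(c_n\) fluctuates while its running average settles. A quick numerical check of the ensemble collapse (a sketch, reusing the U, m, c returned above):

# Sketch: the particle spread should shrink from the first to the last iteration.
print("initial spread:", np.std(U[:, 0], ddof=1))
print("final spread:  ", np.std(U[:, -1], ddof=1))
print("final mean error:", np.abs(u_true - m[-1]))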

19.2 Implement the one-dimensional EKI for an arbitrary forward operator \(\mathcal{G}\)

def eki_one_dim(m_0, C_0, N, G, gamma, y, h, J):
    # Inputs:
    # -------
    # m_0, C_0: mean and variance of the initial ensemble
    # N:        number of iterations
    # G:        one-dimensional forward operator (a callable)
    # gamma:    variance of the noise in the data
    # y:        observed data
    # h:        discretization step
    # J:        number of particles (ensemble members)
    #
    # Outputs:
    # -------
    # U: (J x N) matrix with the computed particles for each iteration
    # m: vector of length N with the mean of the particles
    # C: vector of length N with the variance of the particles

    m = np.zeros(N)
    C = np.zeros(N)
    U = np.zeros((J, N))

    # Construct initial ensemble and its estimators
    U[:, 0] = np.random.normal(m_0, np.sqrt(C_0), J)
    m[0] = np.mean(U[:, 0])
    C[0] = np.var(U[:, 0], ddof=1)

    for n in range(1, N):
        # Push the last iterate through the forward operator:
        G_u = G(U[:, n-1])
        uhat = np.mean(U[:, n-1])
        Ghat = np.mean(G_u)

        # Empirical covariances C^{up} and C^{pp}:
        cov_up = (U[:, n-1] - uhat) @ (G_u - Ghat) / (J-1)
        cov_pp = (G_u - Ghat) @ (G_u - Ghat) / (J-1)

        # EKI update: u + C^{up} [C^{pp} + gamma/h]^{-1} (y - G(u))
        U[:, n] = U[:, n-1] + cov_up * h / (h * cov_pp + gamma) * (y - G_u)

        m[n] = np.mean(U[:, n])
        C[n] = np.var(U[:, n], ddof=1)

    return U, m, C
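As a quick consistency check (a sketch with hypothetical values), the general routine can also be run with a linear forward map, for which the empirical covariances reduce to \(C^{up} = G C\) and \(C^{pp} = G^2 C\) as noted above:

# Sketch: run the general EKI with the linear map G(u) = 1.5*u.
G_lin = lambda u: 1.5 * u
u_dag = 0.5                  # hypothetical true parameter for this check
y_lin = G_lin(u_dag)         # noise-free datum, as in Section 19.1
U_lin, m_lin, C_lin = eki_one_dim(0, 0.9, 2000, G_lin, 1, y_lin, 1/100, 10)
print(m_lin[-1])             # the ensemble mean should move towards u_dag = 0.5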
# Set parameters
J = 10          # ensemble size
r = 10          # logistic growth rate
k = 10          # carrying capacity
gamma = 1       # noise variance
m_0 = 0         # initial ensemble mean
C_0 = 9e-2      # initial ensemble variance
m_true = 0
C_true = C_0
N = 10000       # number of iterations
h = 1/100       # step size

def forward_log(z, k, r, h):
    # Logistic-growth forward map: solution at time h of z' = r z (k - z),
    # started from z, with carrying capacity k and growth rate r.
    return k / (1 + np.exp(-r*k*h) * (k/z - 1))
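A small sanity check of the forward map with the parameter values above (a sketch): \(z = k\) is a fixed point of the logistic flow, and small positive states grow towards the carrying capacity \(k\).

# Sketch: z = k is a fixed point; small positive states grow towards k.
print(forward_log(10.0, k=10, r=10, h=1/100))  # stays at the carrying capacity 10
print(forward_log(0.1, k=10, r=10, h=1/100))   # grows from 0.1 towards 10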

# Construct data under the true parameter (noise-free observation)
u_true = np.random.normal(m_true, np.sqrt(C_true))
y = forward_log(u_true, k, r, h)

# Fix the model parameters of the forward map with functools.partial
partial_log = functools.partial(forward_log, k=k, r=r, h=h)

U, m, C = eki_one_dim(m_0, C_0, N, partial_log, gamma, y, h, J)
it = N
iterations = list(range(1, it+1))
plt.xlabel('Iteration number n')
plt.ylabel('Relative error')
plt.plot(iterations, np.abs(u_true - m) / np.abs(u_true), "r", label=r'$|u^\dagger - m_n|/|u^\dagger|$')
plt.legend(loc="upper right")
plt.show()

plt.xlabel('Iteration number n')
plt.ylabel('Parameter value')
plt.plot(iterations, u_true*np.ones(N), label="true")
plt.plot(iterations, m, label="inversion")
plt.legend()
plt.show()

it = 300
iterations = list(range(1, it+1))
plt.xlabel('Iteration number n')
plt.ylabel('Covariance')
plt.plot(iterations, C[:it], "r", label=r'$C_n$')
plt.plot(iterations, np.cumsum(C[:it]) / iterations, "g", label=r'$n^{-1}\sum_{k=1}^{n} C_k$')
plt.legend(loc="upper right")
plt.show()

19.3 Conclusions

The convergence is very slow, and the resulting estimate is not very accurate. Both issues will be remedied by the mean-field approach.