11 Ensemble Kalman Filter

The idea behind the EnKF is

permit fully nonlinear process and observation models;
avoid gradient calculations for linearization, as in the EKF,
by replacing mean and covariance with empirical, ensemble averaging.

The EnKF is obtained by - replacing the exact covariance \(P\) by the ensemble sample covariance, and - adding noise to the data in order to avoid a shrinking of the ensemble spread and to obtain the correct filtering covariance in the limit

Here are the steps for the so-called stochastic EnKF, where we add an artificial, random perturbation to the observations, and we assume we have a linear observation operator, \(H.\)

11.1 Stochastic EnKF - linear observation operator

11.1.1 Prediction/Forecast

\[\begin{align} \hat{v}_{k+1}^{n} &= \Psi ( {v}_{k}^{n} ) + \xi_{k}^{n} , \quad n=1, \ldots, N, \\ \hat{m}_{k+1} & = \frac{1}{N} \sum_{i=1}^{N} \hat{v}_{k+1}^{n} , \\ \hat{C}_{k+1} & = \frac{1}{N-1} \sum_{i=1}^{N} \left( \hat{v}_{k+1}^{n} - \hat{m}_{k+1} \right) \left( \hat{v}_{k+1}^{n} - \hat{m}_{k+1} \right)^{\mathrm{T}}. \end{align}\]

11.1.2 Correction/Analysis

\[\begin{align} {S}_{k+1} &= H \hat{C}_{k+1} H^{\mathrm{T}} + \Gamma ,\\ {K}_{k+1} &= \hat{C}_{k+1} H^{\mathrm{T}} {S}_{k+1}^{-1} , \\ {y}_{k+1}^{n} &= {y}_{k+1} + \eta^n_{k+1} , \quad n=1, \ldots, N, \\ {v}_{k+1}^{n} &= (I - K_{k+1}H )\hat{v}_{k+1}^{n} + K_{k+1} y^n_{k+1} , \quad n=1, \ldots, N. \end{align}\]

Alternatively, defining the innovation \(d = {y}_{k+1}^{n} - H \hat{v}_{k+1}^{n},\) we can write the state update more simply as

\[ {v}_{k+1}^{n} = \hat{v}_{k+1}^{n} + K_{k+1} d . \]

In words:

For a given \(N_e \in \mathbb{N}\) generate i.i.d. ensemble of states random variables from the distribution of \(X(0).\)
For \(t \in \mathbb{N}\) recursively repeat the following steps:

Advance each ensemble member in time, using the nonlinear state equation with independently generated random state noise
Compute the forecast sample mean and the forecast sample covariance
Compute the sample Kalman gain
Add additional perturbation to the observation vector \(Y\) using independently generated random variables \(\eta(t)\)
Update each forecast ensemble member

Burgers et al. [1998] shows that without the data perturbation, the covariance of the ensemble would go to the zero matrix as \(t\) goes to infinity. The data perturbation also guarantees that the relation between the forecast sample covariance and the analysis sample covariance is analogous to the relation between the forecast and analysis covariances in the standard KF.

11.2 Full nonlinear formulation of the ensemble Kalman filter

There are many ways to formulate the EnKF. Following Vetra-Carvalho, et al (Tellus A, 2018), we express the filter in terms of the anomalies of state and observations. This is indispensable for fully nonlinear state and measurement models,

\[\begin{align} x_{k+1}^{n} &= \Psi ( {x}_{k}^{n} ) + w_{k}^{n} , \quad n=1, \ldots, N_e, \\ {y}_{k+1} &= \mathcal{H} (x_{k+1}) + v_{k+1}. \end{align}\]

To fix notation:

state forecast \(X^{\mathrm{f}},\) dimension \((N_t \times N_x)\)
ensemble state forecast \(\mathbf{X}^{\mathrm{f}},\) dimension \((N_t \times N_x \times N_e)\)
observation, \(Y,\) dimension \((N_t \times N_y)\)
ensemble state anomaly, \[\mathbf{X}' = \frac{1}{\sqrt{N_e - 1}} \left(\mathbf{X} - \overline{{X}}\right),\] dimension \((N_t \times N_x \times N_e)\) with \(\overline{{X}} = (1/N_e) \sum_{e=1}^{N_e} {X}_e\)
ensemble observation anomaly,
\[\mathbf{Y}' = \frac{1}{\sqrt{N_e - 1}} \left( \mathcal{H}(\mathbf{X}) - \overline{ \mathcal{H}(\mathbf{X} ) } \right),\] dimension \((N_t \times N_y \times N_e)\) with \(\overline{ \mathcal{H}( \mathbf{X} )} = (1/N_e)\sum_{e=1}^{N_e} \mathcal{H}({X}_e).\)

Then, the Kalman analysis update is

\[ \mathbf{X}^\mathrm{a} = \mathbf{X}^\mathrm{f} + \mathbf{X}'(\mathbf{Y}')^\mathrm{T} S^{-1} D, \]

with

\[\begin{align} S &= \mathbf{Y}'(\mathbf{Y}')^\mathrm{T} + R & \quad \text{(observation covariance)}, \\ D &= ( \mathbf{Y} + \mathbf{y}) - \mathcal{H}(\mathbf{X}) & \quad \text{(innovation)}, \end{align}\]

where \(y \sim \mathcal{N}(0,R)\) is the stochastic perturbation, and \(R\) is the measurement noise covariance matrix.

Or, defining the Kalman gain matrix as

\[ K = \mathbf{X}'(\mathbf{Y}')^\mathrm{T} S^{-1}, \]

we obtain the classical KF update,

\[ \mathbf{X}^\mathrm{a} = \mathbf{X}^\mathrm{f} + K D. \]

11.3 Summary of EnKF properties

EnKF represents error statistics by ensembles of (nonlinear) model and (nonlinear) measurement realizations.
EnKF performs sequential DA that processes measurements recursively in time.
EnKF is suitable for weather-prediction and any other complex, chaotiic dynamic systems.
Error propagation is nonlinear (see point 1).
Filter update is linear and computed in the low rank, ensemble subspace.
EnKF does not require any gradients, adjoints, linearizations.

Asch, Mark. 2022. A Toolbox for Digital Twins: From Model-Based to Data-Driven. Philadelphia, PA: Society for Industrial; Applied Mathematics. https://doi.org/10.1137/1.9781611976977.

Asch, Mark, Marc Bocquet, and Maëlle Nodet. 2016. Data Assimilation: Methods, Algorithms, and Applications. Philadelphia, PA: Society for Industrial; Applied Mathematics. https://doi.org/10.1137/1.9781611974546.

Calvello, Edoardo, Sebastian Reich, and Andrew M. Stuart. 2022. “Ensemble Kalman Methods: A Mean Field Perspective.” arXiv (to appear in Acta Numerica 2025). http://arxiv.org/abs/2209.11371.

Carrillo, J. A., F. Hoffmann, A. M. Stuart, and U. Vaes. 2024a. “Statistical Accuracy of Approximate Filtering Methods.” https://arxiv.org/abs/2402.01593.

———. 2024b. “The Mean Field Ensemble Kalman Filter: Near-Gaussian Setting.” https://arxiv.org/abs/2212.13239.

Dashti, Masoumeh, and Andrew M. Stuart. 2015. “The Bayesian Approach to Inverse Problems.” In Handbook of Uncertainty Quantification, edited by Roger Ghanem, David Higdon, and Houman Owhadi, 1–118. Cham: Springer International Publishing. https://doi.org/10.1007/978-3-319-11259-6_7-1.

Huang, Daniel Zhengyu, Jiaoyang Huang, Sebastian Reich, and Andrew M Stuart. 2022. “Efficient Derivative-Free Bayesian Inference for Large-Scale Inverse Problems.” Inverse Problems 38 (12): 125006. https://doi.org/10.1088/1361-6420/ac99fa.

Iglesias, Marco A, Kody J H Law, and Andrew M Stuart. 2013. “Ensemble Kalman Methods for Inverse Problems.” Inverse Problems 29 (4): 045001. https://doi.org/10.1088/0266-5611/29/4/045001.

James, G., D. Witten, T. Hastie, and R. Tibshirani. 2021. An Introduction to Statistical Learning with Applications in R. Second Edition. Springer-Verlag New York. https://doi.org/10.1007/978-1-0716-1418-1.

Law, Kody, Andrew Stuart, and Konstantinos Zygalakis. 2015. Data Assimilation: A Mathematical Introduction. Vol. 62. Texts in Applied Mathematics. Cham: Springer International Publishing. https://doi.org/10.1007/978-3-319-20325-6.

Reich, Sebastian, and Colin Cotter. 2015. Probabilistic Forecasting and Bayesian Data Assimilation. Cambridge University Press.

Sanita Vetra-Carvalho, Lars Nerger, Peter Jan van Leeuwen, and Jean-Marie Beckers. 2018. “State-of-the-Art Stochastic Data Assimilation Methods for High-Dimensional Non-Gaussian Problems.” Tellus A: Dynamic Meteorology and Oceanography 70 (1): 1–43. https://doi.org/10.1080/16000870.2018.1445364.

Särkkä, S., and L. Svensson. 2023. Bayesian Filtering and Smoothing. 2nd ed. Institute of Mathematical Statistics Textbooks. Cambridge University Press. https://doi.org/10.1017/9781108917407.