Kalman filter
Conditional mean and variance of Gaussian
Let $X, Y$ be joint Gaussian, i.e. \begin{eqnarray}\left[\begin{array}{c} X \\ Y\end{array}\right] \,\sim \, {\cal{N}}\left(\left[\begin{array}{c}\mu_X \\ \mu_Y\end{array}\right],\,\left[\begin{array}{cc}\Sigma_{XX} & \Sigma_{XY} \\ \Sigma_{YX} & \Sigma_{YY}\end{array}\right]\right)\,,\end{eqnarray} what is $\mathbb{E}(X|Y)$ and $\text{Var}(X|Y)$?Kalman filter
We consider a simple linear model with white noises: \begin{eqnarray} x_t &=& A\, x_{t-1} + w_t\,,\tag{3}\\ y_t &=& B\, x_t + v_t\,, \tag{4} \end{eqnarray} in which $w_t \sim {\cal{N}}(0, W)$, $v_t \sim {\cal{N}}(0, V)$ and $x_0 \sim {\cal{N}}(\bar{x}_0, \Sigma)$.
Let $Y_t$ be the set of variables $(y_1, \cdots, y_t)$ and $x_t \left| Y_s \right.$ be Gaussian with mean and covariance denoted by $\hat{x}_{t|s}$ and $\Sigma_{t|s}$.
Prediction
From (3), we have $x_t \left| Y_{t-1} \right. = A\, x_{t-1} \left| Y_{t-1} \right. + w_t$ and thus \begin{eqnarray} \hat{x}_{t|t-1} &=& A\, \hat{x}_{t-1|t-1}\,,\nonumber \\ \Sigma_{t|t-1} &=& A\, \Sigma_{t|t-1}\, A^T + W\,.\nonumber \end{eqnarray}
Filtering
From (4), $y_t \left| Y_{t-1} \right. = B\, x_t \left| Y_{t-1} \right. + v_t$. As a result, $x_t \left| Y_{t-1} \right.$ and $y_t \left| Y_{t-1} \right.$ are joint Gaussian \begin{eqnarray} \left[\begin{array}{c} x_t \left| Y_{t-1} \right. \\ y_t \left| Y_{t-1} \right.\end{array}\right] \,\sim \, {\cal{N}}\left(\left[\begin{array}{c}\hat{x}_{t|t-1} \\ B\hat{x}_{t|t-1}\end{array}\right],\, \left[\begin{array}{cc}\Sigma_{t|t-1} & \Sigma_{t|t-1} B^T \\ B\Sigma_{t|t-1} & B\Sigma_{t|t-1}B^T + V\end{array}\right]\right)\,.\nonumber \end{eqnarray} Note that $x_t \left| Y_t \right. = \left(x_t \left| Y_{t-1} \right.\right) \left| \left(y_t \left| Y_{t-1}\right. \right) \right.$. Using (1) and (2), we have \begin{eqnarray} \hat{x}_{t|t} &=& \hat{x}_{t|t-1} + \Sigma_{t|t-1} B^T \left(B\Sigma_{t|t-1}B^T + V\right)^{-1}\left( y_t - B\hat{x}_{t|t-1}\right)\,, \nonumber\\ \Sigma_{t|t} &=& \Sigma_{t|t-1} - \Sigma_{t|t-1} B^T \left(B\Sigma_{t|t-1}B^T + V\right)^{-1}B\Sigma_{t|t-1}\,.\nonumber \end{eqnarray} This gives our updated estimate of $x_t$ based on the measurement $y_t$ becoming available.
Smoothing
From (1), $x_{t+1} \left| Y_{t} \right. = A\, x_{t} \left| Y_{t} \right. + w_{t+1}$. As a result, $x_t \left| Y_{t} \right.$ and $x_{t+1} \left| Y_t \right.$ are joint Gaussian \begin{eqnarray} \left[\begin{array}{c} x_t \left| Y_{t} \right. \\ x_{t+1} \left| Y_{t} \right.\end{array}\right] \,\sim \, {\cal{N}}\left(\left[\begin{array}{c}\hat{x}_{t|t} \\ \hat{x}_{t+1|t}\end{array}\right],\, \left[\begin{array}{cc}\Sigma_{t|t} & \Sigma_{t|t} A^T \\ A\Sigma_{t|t} & \Sigma_{t+1|t} \end{array}\right]\right)\,.\nonumber \end{eqnarray} As a result, $\left(x_t \left| Y_{t}\right.\right)\left| \left(x_{t+1} \left| Y_{t}\right.\right)\right.$ is Gaussian with mean and covariance \begin{eqnarray} \mathbb{E}\left[\left(x_t \left| Y_{t}\right.\right)\left| \left(x_{t+1} \left| Y_{t}\right.\right) = z \right.\right] &=& \hat{x}_{t|t} + \Sigma_{t|t} A^T \Sigma_{t+1|t}^{-1} \left( z - \hat{x}_{t+1|t}\right)\,,\nonumber\\ \text{Var}\left[\left(x_t \left| Y_{t}\right.\right)\left| \left(x_{t+1} \left| Y_{t}\right.\right) = z \right.\right] &=& \Sigma_{t|t} - \Sigma_{t|t} A^T \Sigma_{t+1|t}^{-1} A \Sigma_{t|t} \,.\nonumber \end{eqnarray} Using the law of total expectation/variance, as well as the Markov property $\left(x_t \left| Y_{T}\right.\right)\left| \left(x_{t+1} \left| Y_{T}\right.\right)\right.= \left(x_t \left| Y_{t}\right.\right)\left| \left(x_{t+1} \left| Y_{T}\right.\right)\right.$, we have \begin{eqnarray} \hat{x}_{t|T} &=& \mathbb{E}_{x_{t+1} \left| Y_{T}\right.}\left[\left(x_t \left| Y_{T}\right.\right)\left| \left(x_{t+1} \left| Y_{t}\right.\right) = \left(x_{t+1} \left| Y_{T}\right.\right) \right.\right]\nonumber\\ &=&\mathbb{E}_{x_{t+1} \left| Y_{T}\right.}\left[\left(x_t \left| Y_{t}\right.\right)\left| \left(x_{t+1} \left| Y_{t}\right.\right) = \left(x_{t+1} \left| Y_{T}\right.\right) \right.\right]\nonumber\\ &=&\hat{x}_{t|t} + \Sigma_{t|t} A^T \Sigma_{t+1|t}^{-1} \left( \hat{x}_{t+1|T} - \hat{x}_{t+1|t}\right)\,,\nonumber\\ \hat{\Sigma}_{t|T} &=& \mathbb{E}_{x_{t+1} \left| Y_{T}\right.}\left\{\text{Var}\left[\left(x_t \left| Y_{t}\right.\right)\left| \left(x_{t+1} \left| Y_{t}\right.\right) = \left(x_{t+1} \left| Y_{T}\right.\right) \right.\right]\right\}+\nonumber\\ &&\text{Var}_{x_{t+1} \left| Y_{T}\right.}\left\{\mathbb{E}\left[\left(x_t \left| Y_{t}\right.\right)\left| \left(x_{t+1} \left| Y_{t}\right.\right) = \left(x_{t+1} \left| Y_{T}\right.\right) \right.\right]\right\}\nonumber\\ &=& \left(\Sigma_{t|t} - \Sigma_{t|t} A^T \Sigma_{t+1|t}^{-1} A \Sigma_{t|t}\right)+ \Sigma_{t|t} A^T \Sigma_{t+1|t}^{-1} \Sigma_{t+1|T}\Sigma_{t+1|t}^{-1} A\Sigma_{t|t}\nonumber\\ &=&\Sigma_{t|t} + \Sigma_{t|t} A^T \Sigma_{t+1|t}^{-1} \left(\Sigma_{t+1|T}-\Sigma_{t+1|t}\right)\Sigma_{t+1|t}^{-1} A\Sigma_{t|t}\,.\nonumber \end{eqnarray}
Comments
Post a Comment