
Stability of Linear Set-Membership Filters With Respect to Initial Conditions: An Observation-Information Perspective

Yirui Cong [email protected], Xiangke Wang [email protected], Xiangyun Zhou [email protected]; College of Intelligence Science and Technology, National University of Defense Technology; School of Engineering, The Australian National University
Abstract

The issue of filter stability with respect to (w.r.t.) the initial condition refers to an unreliable filtering process caused by improper prior information on the initial state. This paper focuses on analyzing and resolving the stability issue w.r.t. the initial condition of the classical Set-Membership Filters (SMFs) for linear time-invariant systems, which has not yet been well understood in the literature. To this end, we propose a new concept, the Observation-Information Tower (OIT), which describes how the measurements affect the estimate in a set-intersection manner without relying on the initial condition. The proposed OIT enables a rigorous stability analysis, a new SMFing framework, as well as an efficient filtering algorithm. Specifically, based on the OIT, explicit necessary and sufficient conditions for stability w.r.t. the initial condition are provided for the classical SMFing framework. Furthermore, the OIT inspires a stability-guaranteed SMFing framework, which fully handles the stability issue w.r.t. the initial condition. Finally, with the OIT-inspired framework, we develop a fast and stable constrained zonotopic SMF, which significantly mitigates the wrapping effect.

keywords:
Set-membership filter, Stability w.r.t. initial condition, Observation-information tower, Constrained zonotope.
Corresponding author: X. Wang.


1 Introduction

Set-Membership Filter (SMF) is an important class of non-stochastic filters, which gives the optimal solution to filtering problems with bounded noises whose probability distributions are unknown. It is fundamentally equivalent to the Bayes filter [23] under the non-stochastic Markov condition (induced by unrelated noises) [10]. For linear systems, it can be regarded as a non-stochastic counterpart of the well-known Kalman filter. Therefore, the SMF has great potential in many important fields such as control systems, telecommunications, and navigation, as the Bayes filter does. However, linear SMFing has not drawn adequate attention to realize its full potential. One important issue is stability w.r.t. the initial condition, which characterizes whether the resulting estimate, as time elapses, remains reliable when the filter is improperly initialized [22, 26]. (Analogously to the true initial distribution/measure and the initial condition for stochastic filters [22, 26], in this article the term true initial set refers to the set of all possible initial states, objectively determined by the system, which could be a singleton when we focus on one trial; the term initial condition, for SMFs, represents the initial set subjectively chosen in the design of a filter.)

For Kalman filters, stability w.r.t. the initial condition is a central feature and well-established in an asymptotic manner: the posterior distribution, under some simple system assumptions, converges to the true conditional distribution of the system state (given all observed measurements) regardless of initial conditions [12, 13, 15]. This means measurements in stable filters can help correct/forget the information from the initial condition.

For linear SMFs, in contrast, there is very limited knowledge of stability w.r.t. the initial condition. The most related work is on the uniform boundedness of the estimate, which guarantees that the size of the estimate does not increase unboundedly with time. In the literature, uniform boundedness has been analyzed for two types of SMFs, i.e., ellipsoidal and polytopic SMFs. (The uniform boundedness of interval observers [18, 29] is not included since their basic structures are linear observers.) For ellipsoidal SMFs, [4] proposed an input-to-state stable algorithm that ensures the uniform boundedness of the estimate; since the input-to-state stability in [4] required solving a time-inefficient polynomial equation at each time step, [16] developed a more efficient SMF based on minimizing an important upper bound; then, [25] provided a parallelotope-bounding technique to reduce the complexity, which was further improved by [3]. For polytopic SMFs, [8] proposed a zonotopic Kalman filter, where a sufficient condition (called robust detectability) for the uniform boundedness of the estimate was given; in [28], a zonotopic SMF was designed for linear parameter-varying systems, and an upper bound on the radius of the estimate was derived by solving linear matrix inequalities (LMIs); building on [28], the article [11] proposed a zonotopic SMF for switched linear systems, where an LMI-based upper bound was obtained for the radius of the estimate.

It is important to note that the uniform boundedness of the estimate does not imply reliable estimates when the initial condition is improperly chosen. Therefore, existing linear SMFs still face the issue of stability w.r.t. the initial condition, reflected in two aspects. (Note that stability of SMFs w.r.t. their initial conditions is different from stability of control systems with estimation [5, 6, 19, 20]; even for an unstable system, an SMF that is stable w.r.t. the initial condition can still be designed.)

  • Ill-posedness: In the literature, the initial condition of an SMF should include the true initial set of the system state. But if the initial condition does not contain the true initial set, the resulting estimate at some time steps can be an empty set; in this case, we say the SMF is ill-posed or not well-posed, which is a key feature distinguishing it from the stability of Kalman filters. (In the SMF community, ill-posedness is a case of the falsification of an a priori assumption; to the best of our knowledge, this issue has not been studied in the literature.) Unfortunately, ill-posedness widely exists, which means the initial condition must be carefully chosen to guarantee non-empty estimates.

  • Unbounded estimation gap: Even if the estimate is non-empty, perturbations of the initial condition can gradually amplify the difference (called the estimation gap) between the estimate and the true set of system states. Consequently, the estimation gap can go unbounded with time: inherently, the classical SMFing framework cannot always ensure a bounded estimation gap (the boundedness condition is unknown); extrinsically, approximation/reduction methods cause the wrapping effect [14], so the estimation gap accumulates unboundedly.

To directly tackle the stability issue, we should analyze when the classical linear SMFing framework is stable w.r.t. the initial condition, i.e., well-posed and with a bounded estimation gap; then, it is necessary to propose a new SMFing framework with guaranteed stability.

In addition to the stability, a linear SMF should be fast and accurate, which is also a challenging issue arising from the computational intractability of the optimal estimate. In general, to improve the time efficiency of an existing SMF, one has to sacrifice the estimation accuracy, and vice versa.

A recent study in [2] showed that the constrained zonotopic SMF proposed by [24] can give tighter estimates than other types of modern SMFs (e.g., ellipsoidal SMFs [16, 17] and zonotopic SMFs [8, 27]), which sheds new light on balancing efficiency and accuracy. This is largely because constrained zonotopes are closed under Minkowski sums and set intersections in the prediction and update steps, respectively. Nevertheless, the constrained zonotopic SMF still has a relatively large computation time due to the limitations of existing approximation techniques. Specifically, the reduction relies on the geometric properties of the constrained zonotopes at each single time step (i.e., without considering the "time correlation"). As a result, overestimation accumulates over time, which means maintaining accuracy would lead to an unnecessary increase of complexity. The situation is even worse for high-dimensional systems.

Taking the above discussions into account, it is of great theoretical and practical interest to develop a fast constrained zonotopic SMF with guaranteed stability, which is also efficient for high-dimensional systems.

In this work, we focus on understanding and analyzing the stability w.r.t. the initial condition of SMFing for linear time-invariant systems, and establish a stability-guaranteed filtering framework to develop a new SMF. The main contribution is to put forward the concept of the Observation-Information Tower (OIT). It reflects the impact of the measurements on the estimate without any reliance on the initial condition. The OIT enables a rigorous stability analysis, resolves the ill-posedness issue, and supports the development of an efficient filtering algorithm. More specifically:

  • Applying the OIT to a projected system, we analyze the stability of the classical linear SMFing framework. An explicit stability criterion (sufficient condition) is given, which turns out to be surprisingly close to necessary w.r.t. the bounded estimation gap.

  • The OIT inspires a new SMFing framework, through a rigorously proven invariance property. This framework completely fixes the ill-posedness problem of existing linear SMFing, without relying on any information about the true initial set.

  • With this new framework, we design a stable and fast constrained zonotopic SMF with uniform boundedness; unlike existing reduction methods based on the geometric properties of constrained zonotopes, our method utilizes the properties of the system dynamics via the OIT, which yields high efficiency and significantly mitigates the wrapping effect.

In Section 2, the system model is given and three problems (on the stability of the classical linear SMFing framework, a new framework with guaranteed stability, and a stable and fast constrained zonotopic SMF) are described. We propose the OIT in Section 3, which is the key to solving those three problems, and the solutions are provided in Sections 4-6, respectively. Section 7 gives simulation examples to corroborate our theoretical results. Finally, concluding remarks are given in Section 8.

Notation: Throughout this paper, $\mathbb{R}$, $\mathbb{N}_0$, and $\mathbb{Z}_+$ denote the sets of real numbers, non-negative integers, and positive integers, respectively. $\mathbb{R}^n$ stands for the $n$-dimensional Euclidean space. For an uncertain variable $\mathbf{x}$ defined on a sample space $\Omega$, its range is $\llbracket\mathbf{x}\rrbracket=\{\mathbf{x}(\omega)\colon\omega\in\Omega\}$ and its realization is $x=\mathbf{x}(\omega)$ [21, 10]. For multiple uncertain variables with consecutive indices, we define $\mathbf{x}_{k_1:k_2}=\mathbf{x}_{k_1},\ldots,\mathbf{x}_{k_2}$ (with realizations $x_{k_1:k_2}=x_{k_1},\ldots,x_{k_2}$), where $k_2\geq k_1$. Given two sets $\mathcal{S}_1$ and $\mathcal{S}_2$ in a Euclidean space, $\oplus$ stands for their Minkowski sum, i.e., $\mathcal{S}_1\oplus\mathcal{S}_2=\{s_1+s_2\colon s_1\in\mathcal{S}_1,\,s_2\in\mathcal{S}_2\}$. The summation $\sum_{i=a}^{b}\mathcal{S}_i$ represents $\mathcal{S}_a\oplus\mathcal{S}_{a+1}\oplus\cdots\oplus\mathcal{S}_b$ for $a\leq b$, but returns $\{0\}$ for $a>b$ (this differs from the mathematical convention, but keeps many expressions in this paper compact). We use $\|\cdot\|$ to represent the Euclidean norm (of a vector) or the spectral norm (of a matrix). The set $\mathcal{S}_k$ is uniformly bounded (w.r.t. $k\in\mathcal{K}$) if there exists $\bar{d}>0$ such that $d(\mathcal{S}_k)\leq\bar{d}$ for all $k\in\mathcal{K}$, where $d(\mathcal{S}_k)=\sup_{s,s'\in\mathcal{S}_k}\|s-s'\|$ is the diameter of $\mathcal{S}_k$. (A mathematically rigorous statement would be "the indexed family $(\mathcal{S}_k)_{k\in\mathcal{K}}$ is uniformly bounded"; this minor abuse of notation considerably simplifies the presentation.)
The interval hull of a bounded set $\mathcal{S}\ni s=(s^{(1)},\ldots,s^{(n)})$ is $\overline{\mathrm{IH}}(\mathcal{S})=\prod_{i=1}^{n}[\underline{s}^{(i)},\overline{s}^{(i)}]$, where $\underline{s}^{(i)}=\inf_{s\in\mathcal{S}}s^{(i)}$ and $\overline{s}^{(i)}=\sup_{s\in\mathcal{S}}s^{(i)}$. The limit superior is denoted by $\varlimsup$. Given a matrix $M\in\mathbb{R}^{m\times n}$, its Moore-Penrose inverse is $M^{+}$; the range space and the kernel (null space) are denoted by $\mathrm{ran}(M)$ and $\ker(M)$, respectively. The notation $\circ$ stands for the composition of two maps.
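As a concrete reading of this notation, the following sketch computes the diameter $d(\mathcal{S})$ and the interval hull of a finite point set (finite sets stand in for the bounded sets above; the helper names are ours, not the paper's):

```python
# Illustrative helpers for the notation section: diameter d(S) and
# interval hull IH(S) of a finite point set S in R^n.
from itertools import combinations
from math import dist

def diameter(S):
    """d(S) = sup over point pairs of the Euclidean distance."""
    return max((dist(s, t) for s, t in combinations(S, 2)), default=0.0)

def interval_hull(S):
    """Componentwise [min, max] box containing S."""
    n = len(next(iter(S)))
    return [(min(s[i] for s in S), max(s[i] for s in S)) for i in range(n)]

S = {(0.0, 0.0), (3.0, 4.0), (1.0, 1.0)}
assert diameter(S) == 5.0                                # 3-4-5 triangle
assert interval_hull(S) == [(0.0, 3.0), (0.0, 4.0)]      # bounding box
```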

2 System Model and Problem Description

2.1 Linear SMF with Inaccurate Initial Condition

Consider the following discrete-time linear system described by uncertain variables:

\mathbf{x}_{k+1} = A\mathbf{x}_{k} + B\mathbf{w}_{k},   (1)
\mathbf{y}_{k} = C\mathbf{x}_{k} + \mathbf{v}_{k},   (2)

at time $k\in\mathbb{N}_0$, where (1) and (2) are called the state and measurement equations, respectively, with $A\in\mathbb{R}^{n\times n}$, $B\in\mathbb{R}^{n\times p}$, and $C\in\mathbb{R}^{m\times n}$. The state equation describes how the system state $\mathbf{x}_k$ (with realization $x_k\in\llbracket\mathbf{x}_k\rrbracket\subseteq\mathbb{R}^n$) changes over time, where the true initial set $\llbracket\mathbf{x}_0\rrbracket$ is non-empty and bounded; $\mathbf{w}_k$ is the process/dynamical noise (with realization $w_k\in\llbracket\mathbf{w}_k\rrbracket\subseteq\mathbb{R}^p$), and there exists a constant $d_w>0$ such that $d(\llbracket\mathbf{w}_k\rrbracket)\leq d_w$ for all $k\in\mathbb{N}_0$. The measurement equation gives how the system state is measured through the measurement $\mathbf{y}_k$ (with realization, called the observed measurement, $y_k\in\llbracket\mathbf{y}_k\rrbracket\subseteq\mathbb{R}^m$); $\mathbf{v}_k$ (with realization $v_k\in\llbracket\mathbf{v}_k\rrbracket\subseteq\mathbb{R}^m$) stands for the measurement noise, and there exists a constant $d_v>0$ such that $d(\llbracket\mathbf{v}_k\rrbracket)\leq d_v$ for all $k\in\mathbb{N}_0$. The process noises, the measurement noises, and the initial state are unrelated [21], as stated in Assumption 1. This makes the system a non-stochastic hidden Markov model, which guarantees the optimality of the classical SMFing framework [10].

Assumption 1.

(Unrelated Noises and Initial State) $\forall k\in\mathbb{N}_0$, the variables $\mathbf{w}_{0:k}$, $\mathbf{v}_{0:k}$, $\mathbf{x}_0$ are unrelated.

Unless otherwise stated, the results and discussions in the rest of this paper are under Assumption 1.

The classical linear SMFing framework [10] is given in Algorithm 1, where an initial condition $\llbracket\hat{\mathbf{x}}_0\rrbracket$ is required. The existing linear SMFs are based on this framework. We highlight the prediction and update steps in (3) and (4), respectively:

\llbracket\hat{\mathbf{x}}_{k}|y_{0:k-1}\rrbracket = A\llbracket\hat{\mathbf{x}}_{k-1}|y_{0:k-1}\rrbracket \oplus B\llbracket\mathbf{w}_{k-1}\rrbracket,   (3)
\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket = \mathcal{X}_{k}(C,y_{k},\llbracket\mathbf{v}_{k}\rrbracket) \cap \llbracket\hat{\mathbf{x}}_{k}|y_{0:k-1}\rrbracket,   (4)

where we define $\llbracket\hat{\mathbf{x}}_0\rrbracket =: \llbracket\hat{\mathbf{x}}_0|y_{0:-1}\rrbracket$ for consistency, and

\mathcal{X}_{k}(C,y_{k},\llbracket\mathbf{v}_{k}\rrbracket) = \{x_{k}\colon y_{k}=Cx_{k}+v_{k},\ v_{k}\in\llbracket\mathbf{v}_{k}\rrbracket\} = \ker(C) \oplus C^{+}\big(\{y_{k}\}\oplus\llbracket-\mathbf{v}_{k}\rrbracket\big).   (5)
Algorithm 1 Classical Linear SMFing Framework
1: Initialization: $\llbracket\hat{\mathbf{x}}_0\rrbracket\subset\mathbb{R}^n$;
2: if $k>0$ then
3:     Prediction: $\llbracket\hat{\mathbf{x}}_k|y_{0:k-1}\rrbracket\leftarrow$ (3); % Returns the prior set.
4: Update: $\llbracket\hat{\mathbf{x}}_k|y_{0:k}\rrbracket\leftarrow$ (4); % Returns the estimate/posterior set.

If the true initial set $\llbracket\mathbf{x}_0\rrbracket$ is perfectly known, i.e., $\llbracket\hat{\mathbf{x}}_0\rrbracket=\llbracket\mathbf{x}_0\rrbracket$, Algorithm 1 returns the true set $\llbracket\mathbf{x}_k|y_{0:k}\rrbracket$ as the estimate $\llbracket\hat{\mathbf{x}}_k|y_{0:k}\rrbracket$. In this case, Algorithm 1 is optimal, in the sense that no filter can give a smaller set containing all possible states [10]. Under this classical SMFing framework, existing SMFs employ different set-based descriptions to represent or outer bound $\llbracket\hat{\mathbf{x}}_k|y_{0:k}\rrbracket$, as pointed out in [10]; that means the estimate $\mathcal{Z}_k$ of a linear SMF, at $k\in\mathbb{N}_0$, satisfies $\mathcal{Z}_k\supseteq\llbracket\hat{\mathbf{x}}_k|y_{0:k}\rrbracket$.
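To make the prediction-update recursion (3)-(4) concrete, here is a minimal 1-D sketch in which every set is a closed interval; the scalar system ($A = a$, $B = C = 1$) and all numeric values are illustrative assumptions, and the loop applies one predict-then-update pass per measurement for $k \geq 1$:

```python
# 1-D sketch of Algorithm 1 with intervals (lo, hi) as the set description.
# Hypothetical scalar system: x_{k+1} = a*x_k + w_k, y_k = x_k + v_k.
def predict(est, a, W):
    """Prediction step (3): a*S (+) [[w]]."""
    lo, hi = sorted((a * est[0], a * est[1]))
    return (lo + W[0], hi + W[1])

def update(prior, y, V):
    """Update step (4): intersect the prior with X_k = {x : y - x in [[v]]}."""
    meas = (y - V[1], y - V[0])            # states consistent with y
    lo, hi = max(prior[0], meas[0]), min(prior[1], meas[1])
    return (lo, hi) if lo <= hi else None  # None encodes the empty set

est = (-2.0, 2.0)                          # initial condition [[x_0-hat]]
for y in [0.1, 0.0, -0.05]:                # measurements consistent with x=0
    est = update(predict(est, a=0.9, W=(-0.1, 0.1)), y, V=(-0.2, 0.2))
    assert est is not None                 # well-posed along this run
```

The intersection in `update` is what shrinks the estimate; `None` propagating out would correspond to the ill-posedness discussed in Section 1.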

However, the initial condition is usually not accurate in practice, i.e., $\llbracket\hat{\mathbf{x}}_0\rrbracket\neq\llbracket\mathbf{x}_0\rrbracket$, which can cause the stability issues stated in Section 1. To rigorously derive the theoretical results in the sequel, we introduce the filtering map as follows.

Definition 1.

(Filtering Map) At time $k\in\mathbb{Z}_+$, the prediction-update map under $\llbracket\mathbf{w}_{k-1}\rrbracket$ and $\llbracket\mathbf{v}_k\rrbracket$ is

f_{k}\colon \mathcal{S} \mapsto \mathcal{X}_{k}(C,y_{k},\llbracket\mathbf{v}_{k}\rrbracket) \cap \big(A\mathcal{S} \oplus B\llbracket\mathbf{w}_{k-1}\rrbracket\big),   (6)

where $\mathcal{S}\subseteq\mathbb{R}^n$. The filtering map from time $i\in\mathbb{N}_0$ to $k\geq i$ is $F_{k,i}$ with the following form (where $0\leq i\leq k$):

F_{k,i}(\mathcal{S}) = \begin{cases} f_{k}\circ\cdots\circ f_{i+1}\big(\mathcal{X}_{i}(C,y_{i},\llbracket\mathbf{v}_{i}\rrbracket)\cap\mathcal{S}\big), & k>i,\\ \mathcal{X}_{i}(C,y_{i},\llbracket\mathbf{v}_{i}\rrbracket)\cap\mathcal{S}, & k=i. \end{cases}   (7)

By Definition 1, Algorithm 1 can be described in a compact form by the filtering map:

\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket = F_{k,0}(\llbracket\hat{\mathbf{x}}_{0}\rrbracket), \quad k\in\mathbb{N}_0.   (8)
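The compositional form (7)-(8) can be sketched the same way: below, $F_{k,0}$ is literally a fold of the per-step maps $f_k$ over the observed measurements, again on 1-D intervals with hypothetical scalar parameters ($A=1$, $B=C=1$):

```python
# Sketch of Definition 1 and Eq. (8): F_{k,0} as a composition of the
# prediction-update maps f_k, on intervals; parameters are illustrative.
from functools import reduce

A, W, V = 1.0, (-0.1, 0.1), (-0.2, 0.2)

def X(y):                                  # X_k(C, y_k, [[v_k]]) with C = 1
    return (y - V[1], y - V[0])

def cap(S, T):                             # interval intersection
    lo, hi = max(S[0], T[0]), min(S[1], T[1])
    return (lo, hi) if lo <= hi else None

def f(S, y):                               # f_k in (6): predict, then intersect
    if S is None:                          # the empty set is absorbing
        return None
    return cap(X(y), (A * S[0] + W[0], A * S[1] + W[1]))

def F(S, ys):                              # F_{k,0}(S) in (7): y_0 update, then f's
    return reduce(f, ys[1:], cap(X(ys[0]), S))

assert F((-5.0, 5.0), [0.0, 0.1, 0.0]) is not None
```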

2.2 Problem Description: Analysis and Synthesis of Stability w.r.t. Initial Condition

In this work, we focus on understanding and handling the stability issue of linear SMFs, which includes three parts: (i) analyzing the stability of the classical linear SMFing framework (see Problem 1), (ii) establishing a new linear SMFing framework with guaranteed stability (see Problem 2), (iii) designing an efficient linear SMF under the proposed framework (see Problem 3).

To analyze the stability of existing linear SMFs, we define the stability in Definition 4, based on the well-posedness and the bounded estimation gap.

Definition 2.

(Well-Posedness) An SMF with the initial condition $\llbracket\hat{\mathbf{x}}_0\rrbracket$ is well-posed if $\mathcal{Z}_k\neq\emptyset$ for all $k\in\mathbb{N}_0$.

Definition 3.

(Bounded Estimation Gap) At $k\in\mathbb{N}_0$, the estimation gap is the Hausdorff distance between the estimate $\mathcal{Z}_k$ and the true set $\llbracket\mathbf{x}_k|y_{0:k}\rrbracket$:

d_{k}^{\mathrm{g}}(\mathcal{Z}_{k}) := d_{\mathrm{H}}\big(\mathcal{Z}_{k}, \llbracket\mathbf{x}_{k}|y_{0:k}\rrbracket\big),   (9)

where

d_{\mathrm{H}}(\mathcal{S},\mathcal{T}) = \max\Big\{\sup_{s\in\mathcal{S}}\inf_{t\in\mathcal{T}}\|s-t\|,\ \sup_{t\in\mathcal{T}}\inf_{s\in\mathcal{S}}\|s-t\|\Big\}.   (10)

The estimation gap is bounded if there exists a $\bar{d}>0$ such that $d_k^{\mathrm{g}}(\mathcal{Z}_k)\leq\bar{d}$ for all $k\in\mathbb{N}_0$.
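For finite point sets, the Hausdorff distance (10) can be transcribed directly (finite sets stand in for the compact sets used in the paper):

```python
# Direct transcription of (10) for finite point sets in R^n.
from math import dist

def hausdorff(S, T):
    d_st = max(min(dist(s, t) for t in T) for s in S)   # sup_s inf_t ||s - t||
    d_ts = max(min(dist(s, t) for s in S) for t in T)   # sup_t inf_s ||s - t||
    return max(d_st, d_ts)

S = [(0.0, 0.0), (1.0, 0.0)]
T = [(0.0, 0.0), (0.0, 3.0)]
assert hausdorff(S, T) == 3.0    # dominated by the point (0, 3) in T
```

Note the asymmetry of the two one-sided terms: here $\sup_{s}\inf_{t}\|s-t\|=1$ while $\sup_{t}\inf_{s}\|s-t\|=3$, so taking the maximum matters.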

Definition 4.

(Stability of SMF) An SMF is stable w.r.t. its initial condition if, for all bounded $\llbracket\hat{\mathbf{x}}_0\rrbracket\subset\mathbb{R}^n$, the SMF is well-posed and the estimation gap is bounded.

Remark 1.

The stability of an SMF reflects the insensitivity of $\mathcal{Z}_k$ to the initial condition. Note that well-posedness is a necessary condition for stability, which differs from the Kalman filter case.

With Definition 4, we are ready to study the stability of the classical linear SMFing framework (on which the existing linear SMFs are based). For the framework described in Algorithm 1, the initial condition $\llbracket\hat{\mathbf{x}}_0\rrbracket$ must be carefully chosen to avoid ill-posedness. For example, consider the linear system with parameters $A=1$, $B=1$, $C=1$, $\llbracket\mathbf{w}_k\rrbracket=[-1,1]$, and $\llbracket\mathbf{v}_k\rrbracket=[0,1]$. If $\llbracket\mathbf{x}_0\rrbracket=[-1,1]$ and $\llbracket\hat{\mathbf{x}}_0\rrbracket=[0,2]$, with (4) we have $\llbracket\mathbf{x}_0|y_0\rrbracket=\mathcal{X}_0(C,y_0,\llbracket\mathbf{v}_0\rrbracket)\cap\llbracket\mathbf{x}_0\rrbracket=[-2,-1]\cap[-1,1]=\{-1\}$, while $\llbracket\hat{\mathbf{x}}_0|y_0\rrbracket=\mathcal{X}_0(C,y_0,\llbracket\mathbf{v}_0\rrbracket)\cap\llbracket\hat{\mathbf{x}}_0\rrbracket=[-2,-1]\cap[0,2]=\emptyset$. From (3), $\llbracket\hat{\mathbf{x}}_1|y_0\rrbracket=\emptyset$, and thus (4) gives $\llbracket\hat{\mathbf{x}}_1|y_{0:1}\rrbracket=\emptyset$; proceeding forward, $\llbracket\hat{\mathbf{x}}_k|y_{0:k}\rrbracket=\emptyset$ for all $k\geq 0$. Furthermore, the estimation gap $d_k^{\mathrm{g}}(\llbracket\hat{\mathbf{x}}_k|y_{0:k}\rrbracket)$ can go unbounded as $k\to\infty$. Thus, it is necessary to analyze the stability condition of Algorithm 1; see Problem 1.
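The ill-posedness example above ($A=B=C=1$, $\llbracket\mathbf{w}_k\rrbracket=[-1,1]$, $\llbracket\mathbf{v}_k\rrbracket=[0,1]$, observed $y_0=-1$) can be checked mechanically; the sketch below reproduces it with intervals, where `None` encodes the empty set:

```python
# Reproducing the ill-posedness example: with y_0 = -1 the measurement set
# X_0 = [y_0 - 1, y_0] = [-2, -1] meets the true set [-1, 1] but misses
# the (wrongly chosen) initial condition [0, 2].
def meas_set(y, V):                     # X_0(C, y_0, [[v_0]]) with C = 1
    return (y - V[1], y - V[0])

def cap(S, T):                          # interval intersection
    lo, hi = max(S[0], T[0]), min(S[1], T[1])
    return (lo, hi) if lo <= hi else None

X0 = meas_set(-1.0, (0.0, 1.0))
assert X0 == (-2.0, -1.0)
assert cap(X0, (-1.0, 1.0)) == (-1.0, -1.0)   # true set: the singleton {-1}
assert cap(X0, (0.0, 2.0)) is None            # estimate: empty, ill-posed
```

Once the estimate is empty, both (3) and (4) keep it empty forever, matching the "proceeding forward" argument in the text.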

Problem 1.

(Stability of Classical Linear SMFing Framework) Under what condition is the framework in Algorithm 1 stable w.r.t. the initial condition?

Since the well-posedness is sensitive to initial conditions, the classical framework faces the ill-posedness issue. This motivates us to propose a new SMFing framework with guaranteed stability, regardless of perturbations of the initial condition; see Problem 2.

Problem 2 (Stability-Guaranteed Framework).

How to establish a new SMFing framework such that stability is guaranteed for any bounded $\llbracket\hat{\mathbf{x}}_0\rrbracket\subset\mathbb{R}^n$?

Under the new framework, one should develop a filtering algorithm with low complexity and high accuracy. Inspired by prior work (e.g., [24]), we consider a constrained zonotopic SMF for the filter design as follows.

Problem 3.

(Stable and Fast Constrained Zonotopic SMF) How to design a constrained zonotopic SMF satisfying the stability-guaranteed framework while the complexity and accuracy can be well handled?

To solve these three problems, we propose the concept of the Observation-Information Tower (OIT) in Section 3. It plays a pivotal role in understanding the stability of linear SMFing and in inspiring stability-guaranteed designs. Then, the solutions to Problems 1-3 are studied in Sections 4-6, respectively.

3 The Observation-Information Tower

Before defining the OIT, let us first analyze how a single observed measurement affects the estimate as time elapses (this is quite similar to analyzing the impulse response of a control system). At $k=0$, from the update step (4) in Algorithm 1, the estimate $\llbracket\hat{\mathbf{x}}_0|y_0\rrbracket$ is the intersection of $\mathcal{X}_0(C,y_0,\llbracket\mathbf{v}_0\rrbracket)=:\mathcal{O}_{0,0}$ and $\llbracket\hat{\mathbf{x}}_0\rrbracket=:\mathcal{E}_0$, which describes how the observed measurement $y_0$ affects the estimate at $k=0$. When we ignore all the successive observed measurements (as an impulse response does), only the prediction step in Algorithm 1 acts for $k\geq 1$. We rely on the following "set intersection under Minkowski sum" property (proved as follows: any $l\in(\bigcap_{i=1}^{I}\mathcal{S}_i)\oplus\mathcal{T}$ can be written as $l=s+t$ with $s\in\bigcap_{i=1}^{I}\mathcal{S}_i$ and $t\in\mathcal{T}$; since $s\in\mathcal{S}_i$ for every $i$, we have $l=s+t\in\mathcal{S}_i\oplus\mathcal{T}$ for every $i$, i.e., $l\in\bigcap_{i=1}^{I}(\mathcal{S}_i\oplus\mathcal{T})$): for sets $\mathcal{S}_1,\ldots,\mathcal{S}_I$ and $\mathcal{T}$, we have

\Big(\bigcap_{i=1}^{I}\mathcal{S}_{i}\Big) \oplus \mathcal{T} \subseteq \bigcap_{i=1}^{I}\big(\mathcal{S}_{i} \oplus \mathcal{T}\big),   (11)

and hence the estimate at k=1k=1 is outer bounded by the intersection of the following two sets:

A\mathcal{O}_{0,0} \oplus B\llbracket\mathbf{w}_{0}\rrbracket = A\mathcal{X}_{0}(C,y_{0},\llbracket\mathbf{v}_{0}\rrbracket) \oplus B\llbracket\mathbf{w}_{0}\rrbracket =: \mathcal{O}_{1,0},
A\mathcal{E}_{0} \oplus B\llbracket\mathbf{w}_{0}\rrbracket = A\llbracket\hat{\mathbf{x}}_{0}\rrbracket \oplus B\llbracket\mathbf{w}_{0}\rrbracket =: \mathcal{E}_{1},

i.e., $\llbracket\hat{\mathbf{x}}_1|y_0\rrbracket\subseteq\mathcal{O}_{1,0}\cap\mathcal{E}_1$, which indicates how $y_0$ affects the estimate at $k=1$. In the same manner, we can analyze the effect of $y_i$ on the estimate at time $k$. This motivates the observation-information and state-evolution sets defined as follows.
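The one-sided inclusion (11) used above is easy to check on finite sets; the example below (our own toy sets) also shows that the converse inclusion fails in general, which is exactly the source of conservativeness discussed in Remark 2:

```python
# Finite-set check of (11): the Minkowski sum distributes over an
# intersection only one way, (S1 n S2) (+) T  is a subset of  (S1 (+) T) n (S2 (+) T).
def mink(S, T):
    """Minkowski sum of two finite sets of numbers."""
    return {s + t for s in S for t in T}

S1, S2, T = {0, 1}, {1, 10}, {0, 10}
lhs = mink(S1 & S2, T)                 # ({1}) (+) T        = {1, 11}
rhs = mink(S1, T) & mink(S2, T)        # contains 10 = 0+10 = 10+0 as well
assert lhs <= rhs                      # inclusion (11) holds
assert lhs != rhs                      # and is strict here: converse fails
```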

Definition 5.

(Observation-Information and State-Evolution Sets) The observation-information set at time $k\geq i$ contributed by $y_i$ (i.e., the observed measurement at time $i$) is

\mathcal{O}_{k,i} := A^{k-i}\mathcal{X}_{i}(C,y_{i},\llbracket\mathbf{v}_{i}\rrbracket) \oplus \sum_{r=i}^{k-1} A^{k-1-r}B\llbracket\mathbf{w}_{r}\rrbracket.   (12)

The state-evolution set at time kk is

\mathcal{E}_{k} := A^{k}\llbracket\hat{\mathbf{x}}_{0}\rrbracket \oplus \sum_{r=0}^{k-1} A^{k-1-r}B\llbracket\mathbf{w}_{r}\rrbracket.   (13)
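For a scalar system ($A=a$, $B=C=1$) with interval noise bounds, the observation-information set (12) reduces to plain interval arithmetic; the following sketch evaluates it under illustrative parameter values:

```python
# 1-D evaluation of the observation-information set O_{k,i} in (12),
# for a hypothetical scalar system with A = a and B = C = 1.
def obs_info_set(a, y, V, W, k, i):
    lo, hi = y - V[1], y - V[0]                        # X_i(C, y_i, [[v_i]])
    lo, hi = sorted((a ** (k - i) * lo, a ** (k - i) * hi))  # A^{k-i} X_i
    for r in range(i, k):                              # (+) sum of A^{k-1-r} B [[w_r]]
        s = abs(a) ** (k - 1 - r)
        lo, hi = lo + s * W[0], hi + s * W[1]
    return (lo, hi)

# At k = i the sum in (12) is empty, so O_{i,i} is just X_i.
O00 = obs_info_set(a=2.0, y=1.0, V=(-0.1, 0.1), W=(-0.05, 0.05), k=0, i=0)
O20 = obs_info_set(a=2.0, y=1.0, V=(-0.1, 0.1), W=(-0.05, 0.05), k=2, i=0)
```

Note how $\mathcal{O}_{2,0}$ is wider than $A^2\mathcal{X}_0$ alone: the process-noise terms inflate the set as it is propagated forward, exactly as in (12).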
Remark 2.

If we consider all the observed measurements $y_{0:k}$, the intersection of the observation-information sets and the state-evolution set forms an outer bound on the estimate (see Proposition 1), where the conservativeness comes from the fact that the converse of (11) does not hold in general.

Proposition 1.

(Set-Intersection-Based Outer Bound) At $k\in\mathbb{N}_0$, an outer bound on $\llbracket\hat{\mathbf{x}}_k|y_{0:k}\rrbracket$ in Algorithm 1 is

\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket = F_{k,0}(\llbracket\hat{\mathbf{x}}_{0}\rrbracket) \subseteq \bigcap_{i=0}^{k}\mathcal{O}_{k,i} \cap \mathcal{E}_{k}, \quad \forall \llbracket\hat{\mathbf{x}}_{0}\rrbracket \subseteq \mathbb{R}^{n},   (14)

which means $\llbracket\hat{\mathbf{x}}_k|y_{0:k}\rrbracket\subseteq\bigcap_{i=0}^{k}\mathcal{O}_{k,i}$ and $\llbracket\hat{\mathbf{x}}_k|y_{0:k}\rrbracket\subseteq\mathcal{E}_k$.

Proof.

See Appendix A. ∎

A pictorial illustration of Proposition 1 is given in Fig. 1.

Figure 1: Illustration of the set-intersection-based outer bound. For $k=0$, the only observation-information set is $\mathcal{O}_{0,0}$; the state-evolution set is $\mathcal{E}_0$; their intersection gives the estimate $\llbracket\hat{\mathbf{x}}_0|y_0\rrbracket$. For $k=1$, $\mathcal{O}_{0,0}$ and $\mathcal{E}_0$ evolve into $\mathcal{O}_{1,0}$ and $\mathcal{E}_1$, respectively; the new observation-information set is $\mathcal{O}_{1,1}$; the intersection of $\mathcal{O}_{1,1}$, $\mathcal{O}_{1,0}$, and $\mathcal{E}_1$ forms an outer bound on the estimate $\llbracket\hat{\mathbf{x}}_1|y_{0:1}\rrbracket$, as presented in Proposition 1.

Noting that

\bigcap_{i=0}^{k}\mathcal{O}_{k,i} \subseteq \bigcap_{i=k-\delta}^{k}\mathcal{O}_{k,i}   (15)

holds for any integer $\delta\in[0,k]$ (in this paper, when an integer lies in an interval $[a,b]$, the interval denotes $\{i\in\mathbb{Z}\colon a\leq i\leq b\}$), we define the Right-Hand Side (RHS) of (15) as the OIT; see Definition 6.

Definition 6 (Observation-Information Tower).

The OIT at time $k\in\mathbb{N}_0$ is the intersection of the observation-information sets over $[k-\delta,k]$: $\bigcap_{i=k-\delta}^{k}\mathcal{O}_{k,i}$.

Now, we provide a sufficient condition for the uniform boundedness (see Section 1) of the OIT as follows, which is fundamental to the results derived in the rest of this paper.

Theorem 1 (Uniform Boundedness of OIT).

The OIT defined by Definition 6 is uniformly bounded w.r.t. $k\in\{k\geq\delta\colon\delta\geq\mu-1\}$ (or simply $k\geq\delta\geq\mu-1$), where $\mu$ is the observability index (see Chapter 6.3.1 in [5]), if the pair $(A,C)$ is observable and $A$ is non-singular. Furthermore, the diameter of the OIT is upper bounded by

d\Big(\bigcap_{i=k-\delta}^{k}\mathcal{O}_{k,i}\Big) \leq \frac{\sqrt{\sum_{j=0}^{\delta}\big[d_{v} + \sum_{l=1}^{\delta-j}\|CA^{-l}B\|\,d_{w}\big]^{2}}}{\sigma_{\min}(O_{\delta})} =: \bar{d}_{\delta}(A,B,C),   (16)

where $\sigma_{\min}(O_\delta)$ returns the smallest singular value of $O_\delta=[(A^{-\delta})^{\mathrm{T}}C^{\mathrm{T}}\ \cdots\ C^{\mathrm{T}}]^{\mathrm{T}}$.

Proof.

See Appendix B. ∎

Note that the uniform boundedness of the OIT indicates that the diameter of $\bigcap_{i=k-\delta}^{k}\mathcal{O}_{k,i}$ (for any $\delta\geq\mu-1$) cannot go unbounded as $k\to\infty$, being upper bounded by (16).
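For a scalar observable system ($n=m=p=1$, $C=1$), the bound (16) can be evaluated directly, since $O_\delta$ is then a column vector whose smallest singular value is simply its Euclidean norm; all parameter values below are illustrative assumptions:

```python
# Evaluating the OIT diameter bound (16) for a hypothetical scalar system.
from math import sqrt

def oit_bound(a, b, c, d_w, d_v, delta):
    # O_delta = [c*a^{-delta}, ..., c*a^{-1}, c]^T; sigma_min = its 2-norm.
    rows = [c * a ** (-i) for i in range(delta, -1, -1)]
    sigma_min = sqrt(sum(r * r for r in rows))
    # Numerator of (16): sqrt( sum_j [d_v + sum_l |C A^{-l} B| d_w]^2 ).
    num = sqrt(sum((d_v + sum(abs(c * a ** (-l) * b) * d_w
                              for l in range(1, delta - j + 1))) ** 2
                   for j in range(delta + 1)))
    return num / sigma_min

bound = oit_bound(a=2.0, b=1.0, c=1.0, d_w=0.2, d_v=0.1, delta=2)
assert bound > 0.0
```

Observe that for $\delta=0$ the inner sum is empty and the bound reduces to $d_v/|c|$, matching (16) term by term.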

4 Stability Analysis of Classical Linear Set-Membership Filtering Framework

In this section, we study the stability issue w.r.t. the initial condition of the classical linear SMFing framework to address Problem 1, based on the OIT.

Since the uniform boundedness of the OIT requires observability, for a general pair $(A,C)$ we introduce the observability decomposition: there exists a nonsingular $P\in\mathbb{R}^{n\times n}$ such that the equivalence transformation $\tilde{\mathbf{x}}_k=[(\tilde{\mathbf{x}}_k^{o})^{\mathrm{T}}\ (\tilde{\mathbf{x}}_k^{\bar{o}})^{\mathrm{T}}]^{\mathrm{T}}=P\mathbf{x}_k$ transforms (1) and (2) into (17). (The observability decomposition follows Theorem 6.O6 in [5]: $P=[P_o^{\mathrm{T}}\ P_{\bar{o}}^{\mathrm{T}}]^{\mathrm{T}}$ and $U=P^{-1}=[U_o\ U_{\bar{o}}]$, where $P_o\in\mathbb{R}^{n_o\times n}$, $P_{\bar{o}}\in\mathbb{R}^{n_{\bar{o}}\times n}$, $U_o\in\mathbb{R}^{n\times n_o}$, and $U_{\bar{o}}\in\mathbb{R}^{n\times n_{\bar{o}}}$, such that $\tilde{A}_o=P_oAU_o$, $\tilde{A}_{21}=P_{\bar{o}}AU_o$, $\tilde{A}_{\bar{o}}=P_{\bar{o}}AU_{\bar{o}}$, $\tilde{B}_o=P_oB$, $\tilde{B}_{\bar{o}}=P_{\bar{o}}B$, and $\tilde{C}_o=CU_o$. The eigenvalues of $\tilde{A}_{\bar{o}}$ depend only on $(A,C)$ and are independent of $P$.)

\begin{split}\begin{bmatrix}\tilde{\mathbf{x}}_{k+1}^{o}\\ \tilde{\mathbf{x}}_{k+1}^{\bar{o}}\end{bmatrix}&=\begin{bmatrix}\tilde{A}_{o}&0\\ \tilde{A}_{21}&\tilde{A}_{\bar{o}}\end{bmatrix}\begin{bmatrix}\tilde{\mathbf{x}}_{k}^{o}\\ \tilde{\mathbf{x}}_{k}^{\bar{o}}\end{bmatrix}+\begin{bmatrix}\tilde{B}_{o}\\ \tilde{B}_{\bar{o}}\end{bmatrix}\mathbf{w}_{k},\\ \mathbf{y}_{k}&=\tilde{C}_{o}\tilde{\mathbf{x}}_{k}^{o}+\mathbf{v}_{k},\end{split} (17)

where $\tilde{x}_{k}^{o}\in\mathbb{R}^{n_{o}}$ and $\tilde{x}_{k}^{\bar{o}}\in\mathbb{R}^{n_{\bar{o}}}$; the pair $(\tilde{A}_{o},\tilde{C}_{o})$ is observable with observability index $\mu_{o}$; it is well known that $(A,C)$ is detectable if and only if $\rho(\tilde{A}_{\bar{o}})<1$, where $\rho(\cdot)$ returns the spectral radius of a matrix.
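The observability decomposition can be computed numerically, e.g., via an SVD of the observability matrix, whose right singular vectors split $\mathbb{R}^{n}$ into observable and unobservable coordinates. This is one standard construction, not necessarily the one in [5]; the function name and tolerance below are ours.

```python
import numpy as np

def obsv_decomposition(A, C, rtol=1e-9):
    """Observability decomposition via SVD of the observability matrix.

    Returns an orthogonal P (so P^{-1} = P^T) with observable coordinates
    first, the transformed pair (P A P^T, C P^T), and n_o.
    """
    n = A.shape[0]
    O = np.vstack([C @ np.linalg.matrix_power(A, k) for k in range(n)])
    _, s, Vt = np.linalg.svd(O)
    n_o = int(np.sum(s > rtol * s.max()))
    # Rows 0..n_o-1 of Vt span the observable subspace, the rest its complement;
    # the unobservable subspace (null space of O) is A-invariant, which makes
    # the transformed A block lower triangular as in (17).
    P = Vt
    return P, P @ A @ P.T, C @ P.T, n_o
```

In the transformed pair, the top-right block of $PAP^{\mathrm{T}}$ and the unobservable columns of $CP^{\mathrm{T}}$ vanish, matching the structure of (17).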

Now, we propose a stability condition for the classical linear SMFing framework as follows, where we define $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket$ and $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket$ as the projections of $\llbracket\hat{\tilde{\mathbf{x}}}_{0}\rrbracket=\llbracket P\hat{\mathbf{x}}_{0}\rrbracket$ onto the subspaces w.r.t. $\tilde{x}_{0}^{o}$ and $\tilde{x}_{0}^{\bar{o}}$, respectively.

Theorem 2 (Stability Criterion).

The classical SMFing framework in Algorithm 1 is stable w.r.t. its initial condition if the following conditions hold:

  • (i)

    $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket$;

  • (ii)

    $\lim_{k\to\infty}\big(\tilde{A}_{\bar{o}}^{k}+\sum_{i=0}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}\tilde{A}_{21}\big)<\infty$.

Proof.

See Appendix C. ∎

Remark 3.

Theorem 2 shows that when conditions (i) and (ii) hold, the classical SMFing framework is stable, which solves Problem 1. More precisely, these two conditions ensure the well-posedness and the boundedness of the estimation gap, respectively. However, even under condition (i), we cannot always keep $\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket\supseteq\llbracket\mathbf{x}_{k}|y_{0:k}\rrbracket$ (different from Proposition 4 in Appendix C for observable systems), i.e., the outer boundedness can break.

As mentioned in Remark 3, condition (ii) in Theorem 2 is a sufficient condition for the boundedness of the estimation gap. To evaluate the gap between sufficiency and necessity, we provide a necessary condition for a bounded estimation gap in Proposition 2.

Proposition 2.

(Necessary Condition for Bounded Estimation Gap) For all bounded $\llbracket\hat{\mathbf{x}}_{0}\rrbracket\subset\mathbb{R}^{n}$ with $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket$, if the estimation gap $d_{k}^{\mathrm{g}}(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket)$ of Algorithm 1 is bounded, then $\tilde{A}_{\bar{o}}$ must be marginally stable (in this paper, this means $\tilde{A}_{\bar{o}}$ is the system matrix of a marginally stable discrete-time system, i.e., all eigenvalues of $\tilde{A}_{\bar{o}}$ have magnitudes less than or equal to one, and those equal to one are non-defective [5]).

Proof.

See Appendix D. ∎

Remark 4.

For all bounded $\llbracket\hat{\mathbf{x}}_{0}\rrbracket\subset\mathbb{R}^{n}$ with $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket$, the proposed sufficient condition for a bounded estimation gap [i.e., condition (ii) in Theorem 2] is very close to the necessary condition given in Proposition 2. When $\tilde{A}_{21}=0$, condition (ii) means $\tilde{A}_{\bar{o}}$ is marginally stable; in that case, it becomes a necessary and sufficient condition. When $\tilde{A}_{21}\neq 0$, we generally need $\rho(\tilde{A}_{\bar{o}})<1$ (which is close to marginal stability of $\tilde{A}_{\bar{o}}$), i.e., $(A,C)$ is detectable, so that bounded-input bounded-output stability guarantees a bounded estimation gap.
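The marginal-stability test on $\tilde{A}_{\bar{o}}$ (all eigenvalue magnitudes at most one, and unit-magnitude eigenvalues non-defective, in the sense of Proposition 2) can be sketched numerically as follows; the function name and tolerances are our own illustration.

```python
import numpy as np

def is_marginally_stable(A, tol=1e-6):
    """Check marginal stability of a discrete-time system matrix A:
    |eig| <= 1, and eigenvalues on the unit circle are non-defective."""
    lam = np.linalg.eigvals(A)
    n = A.shape[0]
    if np.any(np.abs(lam) > 1 + tol):
        return False
    for l in lam[np.abs(np.abs(lam) - 1) <= tol]:
        # non-defective: geometric multiplicity equals algebraic multiplicity
        alg = int(np.sum(np.abs(lam - l) <= tol))
        geo = n - np.linalg.matrix_rank(A - l * np.eye(n), tol=tol)
        if geo < alg:
            return False
    return True
```

For example, the identity matrix is marginally stable, while a Jordan block with eigenvalue 1 (a defective unit eigenvalue) is not.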

The following corollary gives a sufficient condition for simultaneously guaranteeing the stability w.r.t. the initial condition and the uniform boundedness of the estimate.

Corollary 1 (Egregium).

For bounded $\llbracket\hat{\mathbf{x}}_{0}\rrbracket\subset\mathbb{R}^{n}$ with $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket$, the classical SMFing framework is stable w.r.t. the initial condition and has a uniformly bounded estimate w.r.t. $k\in\mathbb{N}_{0}$ if $(A,C)$ is detectable.

Proof.

See Appendix E. ∎
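Detectability of $(A,C)$, the condition in Corollary 1, can be checked with the standard PBH test: $\mathrm{rank}\,[A-\lambda I;\,C]=n$ for every eigenvalue $\lambda$ of $A$ with $|\lambda|\geq 1$. A minimal sketch (function name and tolerances ours):

```python
import numpy as np

def is_detectable(A, C, tol=1e-6):
    """PBH detectability test: every unstable/marginal mode of A
    must be observable through C."""
    n = A.shape[0]
    for lam in np.linalg.eigvals(A):
        if abs(lam) >= 1 - tol:
            M = np.vstack([A - lam * np.eye(n), C])
            if np.linalg.matrix_rank(M, tol=1e-8) < n:
                return False
    return True
```

Stable modes may be unobservable without violating detectability, which is exactly the situation covered by condition (ii) with $\rho(\tilde{A}_{\bar{o}})<1$.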

Remark 5.

Corollary 1 provides an explicit condition for the stability and the uniform boundedness (even though detectability is usually assumed for uniform boundedness [7], to the best of our knowledge there are no existing results rigorously proving it). This result is consistent with the discrete-time Kalman filter, for which detectable $(A,C)$ implies a stable filter [15] (for Kalman filters, asymptotic stability w.r.t. the initial condition needs an additional condition on the reachability w.r.t. the process noises; for linear SMFs, even though the numerical results in Section 7.1 show asymptotic stability, proving it requires introducing probability measures, which is beyond the scope of this work). Thus, it builds an important bridge between stochastic and non-stochastic linear filters regarding their stability w.r.t. initial conditions. It should also be highlighted that our result is independent of the types of the noises, whereas for stochastic filtering very few results exist on the stability of discrete-time linear optimal filters with non-Gaussian noises.

Remark 6.

(Instability Caused by Improper Initial Conditions) Different from Kalman filtering, the classical linear SMFing framework is not stable for all bounded initial conditions. If the designer has little information on $\llbracket\mathbf{x}_{0}\rrbracket$, it is hard to choose a proper $\llbracket\hat{\mathbf{x}}_{0}\rrbracket$ that guarantees stability w.r.t. the initial condition (see Theorem 2). This motivates us to propose a stability-guaranteed filtering framework that does not use knowledge of $\llbracket\mathbf{x}_{0}\rrbracket$, presented in Section 5.

5 Stability-Guaranteed Filtering Framework

In this section, we establish a stability-guaranteed filtering framework, called OIT-inspired filtering (see Algorithm 2), to solve Problem 2.

From Theorem 2, we know that the initial condition $\llbracket\hat{\mathbf{x}}_{0}\rrbracket$ should satisfy $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket$ to ensure the stability of Algorithm 1 w.r.t. the initial condition. This motivates us to choose a sufficiently large $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket$ so that the ill-posedness can be fully handled. However, it is difficult to choose such an $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket$ due to the following two issues:

  • Since $\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket$ is unlikely to be known exactly, we can hardly guarantee $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket$ with certainty.

  • A larger $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket$ increases the possibility of the inclusion but brings more conservativeness.

In the existing SMFing framework, these two issues cannot be effectively resolved. Thus, we propose a new SMFing framework inspired by OIT, with the help of the following lemma.

Lemma 1 (OIT-Inspired Invariance).

For all $k\geq\max\{\mu_{o}-1+n_{\lambda_{0}^{o}},1\}=:k_{*}$, where $n_{\lambda_{0}^{o}}$ is the index of the eigenvalue 0 of $\tilde{A}_{o}$ (if $\tilde{A}_{o}$ has no zero eigenvalues, $n_{\lambda_{0}^{o}}=0$), define the family of nested sets $\{\mathcal{B}_{\theta_{k}}^{\infty}[\hat{c}_{0}^{o}]\}_{\theta_{k}\in[0,\infty)}$, in which $\mathcal{B}_{\theta_{k}}^{\infty}[\hat{c}_{0}^{o}]$ is a closed $n_{o}$-cube of edge length $2\theta_{k}$ centered at $\hat{c}_{0}^{o}\in\mathbb{R}^{n_{o}}$. Then, $\exists\bar{\theta}_{k}\geq 0$ s.t. $\forall\theta^{\prime}_{k}\leq\bar{\theta}_{k}\leq\theta^{\prime\prime}_{k}$, (18) holds, where $P_{o}$ is the submatrix formed by the first $n_{o}$ rows of $P$.

P_{o}F_{k,0}(P^{-1}(\mathcal{B}_{\theta^{\prime}_{k}}^{\infty}[\hat{c}_{0}^{o}]\times\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket))\subseteq P_{o}F_{k,0}(P^{-1}(\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}]\times\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket))=P_{o}F_{k,0}(P^{-1}(\mathcal{B}_{\theta^{\prime\prime}_{k}}^{\infty}[\hat{c}_{0}^{o}]\times\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket)). (18)

Proof.

See Appendix F, which is inspired by OIT. ∎

Remark 7.

From Lemma 1, we know that even without information on $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket$, we can enlarge $\theta_{k}$ until $P_{o}F_{k,0}(P^{-1}(\mathcal{B}_{\theta_{k}}^{\infty}[\hat{c}_{0}^{o}]\times\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket))$ becomes unchanged for $\theta_{k}\geq\bar{\theta}_{k}$. As a result,

P_{o}F_{k,0}(P^{-1}(\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket\times\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket))\subseteq P_{o}F_{k,0}(P^{-1}(\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}]\times\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket)),

which means $\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}]\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket$ always holds for $k\geq k_{*}$. Based on this inspiration, we propose the OIT-inspired filtering framework in Algorithm 2.

Algorithm 2 OIT-Inspired Filtering
1:Initialization: Bounded $\llbracket\hat{\mathbf{x}}_{0}\rrbracket\subset\mathbb{R}^{n}$;
2:if $k<\max\{\mu_{o}-1+n_{\lambda_{0}^{o}},1\}=k_{*}$ then
3:    $\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket\leftarrow$ Algorithm 1;
4:    if $\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket=\emptyset$ then
5:        Choose a $\hat{\mathcal{T}}_{0}^{o}$ such that $\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket=F_{k,0}(P^{-1}(\hat{\mathcal{T}}_{0}^{o}\times\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket))$ is non-empty and bounded;
6:else
7:    $\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket=F_{k,0}(P^{-1}(\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}]\times\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket))$; % Recursively

A line-by-line explanation of Algorithm 2 is as follows. Line 1 initializes the algorithm. Lines 2-7 give the filtering process at each $k\in\mathbb{N}_{0}$. For $k<k_{*}$, the estimate $\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket$ is identical to that of Algorithm 1 (see Line 3 of Algorithm 2) if it is not empty; otherwise, it is reset by Line 5. In Line 5, we can choose $\hat{\mathcal{T}}_{0}^{o}=\mathcal{B}_{\theta}^{\infty}[0_{n_{o}}]$ with a sufficiently large $\theta$, which is used in Algorithm 3. For $k\geq k_{*}$, the estimate $\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket$ is determined by Line 7. Note that Line 7 can be implemented in a recursive manner by its definition in (7). (Generally speaking, reduction methods are required to balance accuracy and complexity in Line 7 for specific filter designs; however, it is also possible to reduce the complexity without any accuracy loss, see the cases in Section 7.1.)
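One simple way to realize a "sufficiently large $\theta$" in Line 5 (and later for $\bar{\theta}_{k}$ in Algorithm 3) is a doubling search. The sketch below abstracts the stopping test (non-emptiness of the estimate, or the equality in (18)) into a black-box predicate; it is our own illustration, not code from the paper.

```python
def doubling_search(is_ok, theta0=1.0, max_doublings=60):
    """Double theta until the predicate holds, e.g., the posterior set is
    non-empty (Line 5) or the set-valued map has stabilized (cf. (18))."""
    theta = theta0
    for _ in range(max_doublings):
        if is_ok(theta):
            return theta
        theta *= 2.0
    raise RuntimeError("predicate never satisfied")
```

Since the predicate is monotone in $\theta$ (a larger cube can only enlarge the image set), the search terminates in finitely many steps whenever a feasible $\theta$ exists.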

Theorem 3 (Stability of OIT-Inspired Filtering).

If condition (ii) in Theorem 2 holds, the filtering framework in Algorithm 2 is stable w.r.t. the initial condition.

Proof.

See Appendix G. ∎

Theorem 3 indicates that the OIT-inspired filtering framework given in Algorithm 2 does not rely on any information about $\llbracket\mathbf{x}_{0}\rrbracket$ to guarantee the well-posedness.

For Algorithm 2, results similar to Proposition 2 and Corollary 1 can also be derived, where the condition $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket$ is no longer needed.

6 Stable and Fast Constrained Zonotopic SMF

In this section, we develop a constrained zonotopic SMF under the new framework described in Algorithm 2, where the OIT plays a pivotal role. This SMF not only has guaranteed stability but also high efficiency and good accuracy. We call it the OIT-inspired Constrained Zonotopic SMF (OIT-CZ SMF).

Before presenting the SMF, we introduce the constrained zonotope [24] in Definition 7 with a small extension.

Definition 7 (Extended Constrained Zonotope).

A set $\mathcal{Z}\subseteq\mathbb{R}^{n}$ is an (extended) constrained zonotope if there exists a quintuple $(\hat{G},\hat{c},\hat{A},\hat{b},\hat{h})\in\mathbb{R}^{n\times n_{g}}\times\mathbb{R}^{n}\times\mathbb{R}^{n_{c}\times n_{g}}\times\mathbb{R}^{n_{c}}\times[0,\infty]^{n_{g}}$ such that $\mathcal{Z}$ is expressed by

\bigg{\{}\hat{G}\xi+\hat{c}\colon\hat{A}\xi=\hat{b},~{}\xi\in\prod_{j=1}^{n_{g}}\big{[}-\hat{h}^{(j)},\hat{h}^{(j)}\big{]}\bigg{\}}=:Z(\hat{G},\hat{c},\hat{A},\hat{b},\hat{h}),

where $\hat{h}^{(j)}$ is the $j$th component of $\hat{h}$.

In Definition 7, we slightly generalize the constrained zonotope in [24] by replacing $\|\xi\|_{\infty}\leq 1$, i.e., $\xi\in[-1,1]^{n_{g}}$, with $\xi\in\prod_{j=1}^{n_{g}}[-\hat{h}^{(j)},\hat{h}^{(j)}]$. The benefit is twofold: (i) we allow $\hat{h}^{(j)}$ to be infinite so that the posterior sets induced by unbounded prior sets (which are required by the SMFing framework in Algorithm 2) can be fully described; (ii) the numerical stability of our proposed algorithm is improved.

Based on Definition 7, if $\llbracket\hat{\mathbf{x}}_{0}\rrbracket$, $\llbracket\mathbf{w}_{k}\rrbracket$, and $\llbracket\mathbf{v}_{k}\rrbracket$ are constrained zonotopes, the resulting $\llbracket\hat{\mathbf{x}}_{k}|y_{0:k-1}\rrbracket$ and $\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket$ in Algorithm 1 are also constrained zonotopes without any approximations. Specifically, by defining

\begin{split}\llbracket\hat{\mathbf{x}}_{k}|y_{0:k-1}\rrbracket&=Z(\hat{G}_{k}^{-},\hat{c}_{k}^{-},\hat{A}_{k}^{-},\hat{b}_{k}^{-},\hat{h}_{k}^{-}),\\ \llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket&=Z(\hat{G}_{k},\hat{c}_{k},\hat{A}_{k},\hat{b}_{k},\hat{h}_{k}),\\ \llbracket\mathbf{w}_{k}\rrbracket&=Z(\hat{G}_{\mathbf{w}_{k}},\hat{c}_{\mathbf{w}_{k}},\hat{A}_{\mathbf{w}_{k}},\hat{b}_{\mathbf{w}_{k}},\hat{h}_{\mathbf{w}_{k}}),\\ \llbracket\mathbf{v}_{k}\rrbracket&=Z(\hat{G}_{\mathbf{v}_{k}},\hat{c}_{\mathbf{v}_{k}},\hat{A}_{\mathbf{v}_{k}},\hat{b}_{\mathbf{v}_{k}},\hat{h}_{\mathbf{v}_{k}}),\end{split} (19)

the prediction step (3) gives the exact $\llbracket\hat{\mathbf{x}}_{k}|y_{0:k-1}\rrbracket$ with

\begin{split}\!\!\!\!\!\hat{G}_{k}^{-}&\!=\!\begin{bmatrix}A\hat{G}_{k-1}&B\hat{G}_{\mathbf{w}_{k-1}}\end{bmatrix},~{}\hat{c}_{k}^{-}=A\hat{c}_{k-1}+B\hat{c}_{\mathbf{w}_{k-1}},\\ \!\!\!\!\!\hat{A}_{k}^{-}&\!=\!\begin{bmatrix}\hat{A}_{k-1}&0\\ 0&\hat{A}_{\mathbf{w}_{k-1}}\end{bmatrix}\!,\hat{b}_{k}^{-}\!=\!\begin{bmatrix}\hat{b}_{k-1}\\ \hat{b}_{\mathbf{w}_{k-1}}\end{bmatrix}\!,\hat{h}_{k}^{-}\!=\!\begin{bmatrix}\hat{h}_{k-1}\\ \hat{h}_{\mathbf{w}_{k-1}}\end{bmatrix}\!,\end{split} (20)

and the update step (4) returns the exact $\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket$ with

\begin{split}\hat{G}_{k}&=\begin{bmatrix}\hat{G}_{k}^{-}&0\end{bmatrix},~{}\hat{c}_{k}=\hat{c}_{k}^{-},~{}\hat{h}_{k}=\begin{bmatrix}\hat{h}_{k}^{-}\\ \hat{h}_{\mathbf{v}_{k}}\end{bmatrix},\\ \hat{A}_{k}&=\begin{bmatrix}\hat{A}_{k}^{-}&0\\ 0&\hat{A}_{\mathbf{v}_{k}}\\ C\hat{G}_{k}^{-}&\hat{G}_{\mathbf{v}_{k}}\end{bmatrix},~{}\hat{b}_{k}=\begin{bmatrix}\hat{b}_{k}^{-}\\ \hat{b}_{\mathbf{v}_{k}}\\ y_{k}-\hat{c}_{\mathbf{v}_{k}}-C\hat{c}_{k}^{-}\end{bmatrix}.\end{split} (21)

The proof of (20) and (21) follows directly from [24], by rewriting (4) as $\llbracket\mathbf{x}_{k}|y_{0:k}\rrbracket=\{x_{k}\in\llbracket\mathbf{x}_{k}|y_{0:k-1}\rrbracket\colon Cx_{k}\in\{y_{k}\}+\llbracket-\mathbf{v}_{k}\rrbracket\}$.
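The parameter recursions (20) and (21) translate directly into code operating on the quintuples of Definition 7. The sketch below is our own (function and variable names are assumptions): the prediction step forms the Minkowski sum with $B\llbracket\mathbf{w}_{k-1}\rrbracket$, and the update step appends the measurement constraint $Cx_{k}\in\{y_{k}\}+\llbracket-\mathbf{v}_{k}\rrbracket$ as extra equality rows.

```python
import numpy as np

def cz_predict(G, c, Ac, b, h, A, B, Gw, cw, Aw, bw, hw):
    """Prediction step (20): exact image of Z(G,c,Ac,b,h) under x+ = A x + B w."""
    Gp = np.hstack([A @ G, B @ Gw])
    cp = A @ c + B @ cw
    Ap = np.block([[Ac, np.zeros((Ac.shape[0], Aw.shape[1]))],
                   [np.zeros((Aw.shape[0], Ac.shape[1])), Aw]])
    return Gp, cp, Ap, np.concatenate([b, bw]), np.concatenate([h, hw])

def cz_update(G, c, Ac, b, h, C, y, Gv, cv, Av, bv, hv):
    """Update step (21): intersect with {x : C x in {y} - v-set}."""
    Gu = np.hstack([G, np.zeros((G.shape[0], Gv.shape[1]))])
    Au = np.block([[Ac, np.zeros((Ac.shape[0], Av.shape[1]))],
                   [np.zeros((Av.shape[0], Ac.shape[1])), Av],
                   [C @ G, Gv]])
    bu = np.concatenate([b, bv, y - cv - C @ c])
    return Gu, c, Au, bu, np.concatenate([h, hv])
```

Note that the state dimension never changes; only the numbers of generators and constraints grow, which is why reduction (Section 6) eventually becomes necessary.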

Now, we are ready to design the OIT-CZ SMF (see Algorithm 3), where Proposition 3 plays an important role.

Proposition 3.

The image of a constrained zonotope $\hat{\mathcal{Z}}_{i}^{-}=Z(\hat{G}_{i}^{-},\hat{c}_{i}^{-},\hat{A}_{i}^{-},\hat{b}_{i}^{-},\hat{h}_{i}^{-})$ under the filtering map $F_{k,i}$ is

\mathcal{Z}_{k}=F_{k,i}(\hat{\mathcal{Z}}_{i}^{-})=Z(\hat{G}_{k},\hat{c}_{k},\hat{A}_{k},\hat{b}_{k},\hat{h}_{k}), (22)

where the parameters are given in (23) with $\hat{b}_{y_{l}}=y_{l}-\hat{c}_{\mathbf{v}_{l}}-C\hat{c}_{l}^{-}$ and $\hat{c}_{l}^{-}=A^{l-i}\hat{c}_{i}^{-}+\sum_{r=i}^{l-1}A^{l-1-r}B\hat{c}_{\mathbf{w}_{r}}$ for $i\leq l\leq k$.

\begin{split}\hat{G}_{k}&=\begin{bmatrix}A^{k-i}\hat{G}_{i}^{-}&0&A^{k-i-1}B\hat{G}_{\mathbf{w}_{i}}&0&\ldots&B\hat{G}_{\mathbf{w}_{k-1}}&0\end{bmatrix},\quad\hat{c}_{k}=A^{k-i}\hat{c}_{i}^{-}+\sum_{r=i}^{k-1}A^{k-1-r}B\hat{c}_{\mathbf{w}_{r}},\\ \hat{A}_{k}&=\begin{bmatrix}\hat{A}_{i}^{-}&0&0&0&\ldots&0&0\\ 0&\hat{A}_{\mathbf{v}_{i}}&0&0&\ldots&0&0\\ C\hat{G}_{i}^{-}&\hat{G}_{\mathbf{v}_{i}}&0&0&\ldots&0&0\\ 0&0&\hat{A}_{\mathbf{w}_{i}}&0&\ldots&0&0\\ 0&0&0&\hat{A}_{\mathbf{v}_{i+1}}&\ldots&0&0\\ CA\hat{G}_{i}^{-}&0&CB\hat{G}_{\mathbf{w}_{i}}&\hat{G}_{\mathbf{v}_{i+1}}&\ldots&0&0\\ \vdots&\vdots&\vdots&\vdots&&\vdots&\vdots\\ 0&0&0&0&\ldots&\hat{A}_{\mathbf{w}_{k-1}}&0\\ 0&0&0&0&\ldots&0&\hat{A}_{\mathbf{v}_{k}}\\ CA^{k-i}\hat{G}_{i}^{-}&0&CA^{k-i-1}B\hat{G}_{\mathbf{w}_{i}}&0&\ldots&CB\hat{G}_{\mathbf{w}_{k-1}}&\hat{G}_{\mathbf{v}_{k}}\end{bmatrix},\quad\hat{b}_{k}=\begin{bmatrix}\hat{b}_{i}^{-}\\ \hat{b}_{\mathbf{v}_{i}}\\ \hat{b}_{y_{i}}\\ \hat{b}_{\mathbf{w}_{i}}\\ \hat{b}_{\mathbf{v}_{i+1}}\\ \hat{b}_{y_{i+1}}\\ \vdots\\ \hat{b}_{\mathbf{w}_{k-1}}\\ \hat{b}_{\mathbf{v}_{k}}\\ \hat{b}_{y_{k}}\end{bmatrix},\quad\hat{h}_{k}=\begin{bmatrix}\hat{h}_{i}^{-}\\ \hat{h}_{\mathbf{v}_{i}}\\ \hat{h}_{\mathbf{w}_{i}}\\ \hat{h}_{\mathbf{v}_{i+1}}\\ \vdots\\ \hat{h}_{\mathbf{w}_{k-1}}\\ \hat{h}_{\mathbf{v}_{k}}\end{bmatrix}.\end{split} (23)
Proof.

By setting $\hat{\mathcal{Z}}_{i}^{-}=\llbracket\hat{\mathbf{x}}_{i}|y_{0:i-1}\rrbracket$ and recursively applying the prediction and update steps [i.e., (20) and (21)] according to (7), equation (22) can be derived. ∎

Algorithm 3 OIT-Inspired Constrained Zonotopic SMF (OIT-CZ SMF)
1:Initialization: Bounded constrained zonotope $\llbracket\hat{\mathbf{x}}_{0}\rrbracket=Z(\hat{G}_{0}^{-},\hat{c}_{0}^{-},\hat{A}_{0}^{-},\hat{b}_{0}^{-},\hat{h}_{0}^{-})\subset\mathbb{R}^{n}$, $\bar{\delta}\geq\max\{\mu_{o}-1+n_{\lambda_{0}^{o}},1\}$, $\varepsilon>0$, $\Upsilon_{\infty}=\inf_{\gamma\in(\rho(\tilde{A}_{\bar{o}}),1)}\frac{\max\{\gamma^{-k}\|\tilde{A}_{\bar{o}}^{k}\|_{\infty}\colon k\in\mathbb{N}_{0}\}}{1-\gamma}$;
2:if $k<\bar{\delta}$ then
3:    $\mathcal{Z}_{k}=\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket\leftarrow$ Algorithm 1 with (19)-(21);
4:    if $\mathcal{Z}_{k}=\emptyset$ then
5:        $\mathcal{Z}_{k}\leftarrow$ (22) with $i=0$ and $\mathcal{Z}_{0}^{-}=P^{-1}(\mathcal{B}_{\theta}^{\infty}[0_{n_{o}}]\times\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket)$, where $\theta>0$ is sufficiently large such that $\mathcal{Z}_{k}\neq\emptyset$;
6:    $\hat{\mathcal{T}}_{k}^{\bar{o}}\leftarrow Z(I_{n_{o}},\overline{c}_{k}^{\bar{o}},[~],[~],\overline{G}_{k}^{\bar{o}}1_{n_{o}})=\overline{\mathrm{IH}}(\widetilde{\mathcal{Z}}_{k}^{-,\bar{o}})$;
7:else
8:    $\mathcal{Z}_{k}\leftarrow$ (22) with $i=k-\bar{\delta}$ and $\hat{\mathcal{Z}}_{i}^{-}=P^{-1}(\hat{\mathcal{T}}_{k-\bar{\delta}}^{o}\times\hat{\mathcal{T}}_{k-\bar{\delta}}^{\bar{o}})$, where $\hat{\mathcal{T}}_{k-\bar{\delta}}^{o}=\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\mathrm{center}(P_{o}\overline{\mathrm{IH}}(\mathcal{Z}_{k-\bar{\delta}}^{-}))]$ for $k-\bar{\delta}<\bar{\delta}$ and $\hat{\mathcal{T}}_{k-\bar{\delta}}^{o}=P_{o}\overline{\mathrm{IH}}(\mathcal{Z}_{k-\bar{\delta}}^{-})$ for $k-\bar{\delta}\geq\bar{\delta}$;
9:    $Z(I_{n_{o}},\hat{c}_{k}^{\mathrm{in}},[~],[~],\hat{G}_{k}^{\mathrm{in}}1_{n_{o}})=\overline{\mathrm{IH}}(\tilde{A}_{21}\widetilde{\mathcal{Z}}_{k}^{o}\oplus\tilde{B}_{\bar{o}}\llbracket\mathbf{w}_{k}\rrbracket)$;
10:    $\ell_{k}=\max\{\|\hat{G}_{k}^{\mathrm{in}}\|_{\infty},\ell_{k-1}\}$, with $\ell_{\bar{\delta}-1}=0$;
11:    $\hat{c}_{k+1}^{\bar{o}}=\tilde{A}_{\bar{o}}\hat{c}_{k}^{\bar{o}}+\hat{c}_{k}^{\mathrm{in}}$, with $\hat{c}_{\bar{\delta}}^{\bar{o}}=\mathrm{center}(\overline{\mathrm{IH}}(\widetilde{\mathcal{Z}}_{\bar{\delta}}^{\bar{o}}))$;
12:    $\hat{\mathcal{T}}_{k}^{\bar{o}}=\mathcal{B}_{\alpha_{k}}^{\infty}[\hat{c}_{k}^{\bar{o}}]$ with $\alpha_{k}=\frac{1}{2}\|\tilde{A}_{\bar{o}}^{k-\bar{\delta}}\|_{\infty}d_{\infty}(\widetilde{\mathcal{Z}}_{\bar{\delta}}^{\bar{o}})+\Upsilon_{\infty}\ell_{k-1}+\varepsilon$;

Algorithm 3 is under the framework of Algorithm 2, and we provide the line-by-line explanation as follows.

  • Line 1 initializes Algorithm 3, with the additional parameters $\bar{\delta}\geq\max\{\mu_{o}-1+n_{\lambda_{0}^{o}},1\}$ and $\varepsilon>0$: a larger $\bar{\delta}$ leads to higher accuracy but increases the complexity (determined by Line 8); a smaller $\varepsilon$ makes the estimate corresponding to the unobservable subsystem more accurate but slows the convergence of (24) [indicated by Line 12 and (88)]. Line 1 also gives an important constant $\Upsilon_{\infty}$ used in estimating the unobservable state (see Line 8 with Line 12). Note that $\max\{\gamma^{-k}\|\tilde{A}_{\bar{o}}^{k}\|_{\infty}\colon k\in\mathbb{N}_{0}\}$ can be calculated within finitely many steps (implied by the proof of Lemma 4); thus, $\Upsilon_{\infty}$ can be computed by searching $\gamma$ over $(\rho(\tilde{A}_{\bar{o}}),1)$.
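The constant $\Upsilon_{\infty}$ in Line 1 can be approximated by the search over $\gamma$ described above. The sketch below is our own (the grid, the truncation horizon, and the function name are assumptions); it relies on $\gamma^{-k}\|\tilde{A}_{\bar{o}}^{k}\|_{\infty}$ attaining its maximum at a finite $k$ whenever $\rho(\tilde{A}_{\bar{o}})<\gamma$.

```python
import numpy as np

def upsilon_inf(A_obar, n_grid=100, k_max=200):
    """Grid-search approximation of Upsilon_inf over gamma in (rho, 1)."""
    rho = max(abs(np.linalg.eigvals(A_obar)))
    assert rho < 1, "A_obar must be Schur stable (detectable case)"
    best = np.inf
    for g in np.linspace(rho + 1e-3, 1 - 1e-3, n_grid):
        m = max(np.linalg.norm(np.linalg.matrix_power(A_obar, k), np.inf) / g**k
                for k in range(k_max + 1))
        best = min(best, m / (1 - g))
    return best
```

For a scalar $\tilde{A}_{\bar{o}}=0.5$, the inner maximum is attained at $k=0$ for every admissible $\gamma$, so the infimum approaches $1/(1-0.5)=2$ as $\gamma\downarrow 0.5$.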

  • Lines 3-6 are for $k<\bar{\delta}$. Similarly to Lines 3 and 5 of Algorithm 2, Lines 3 and 5 of Algorithm 3 give the estimate $\mathcal{Z}_{k}$: Line 3 gives the estimate returned by the constrained zonotopic version of Algorithm 1, i.e., with (19)-(21); in Line 5, $\mathcal{B}_{\theta}^{\infty}[0_{n_{o}}]$ can be expressed by $Z(I_{n_{o}},0_{n_{o}},[~],[~],\theta 1_{n_{o}})$ for improved numerical stability, where $I_{n_{o}}$ is the identity matrix of size $n_{o}$ and $1_{n_{o}}$ is the $n_{o}$-dimensional all-ones column vector; one can double $\theta$ from $\theta=1$ until $\mathcal{Z}_{k}\neq\emptyset$, which terminates in finitely many steps. Line 6 calculates the interval hull of $\widetilde{\mathcal{Z}}_{k}^{-,\bar{o}}$ by solving $2n_{o}$ linear programs, where $\overline{G}_{k}^{\bar{o}}$ and $\overline{c}_{k}^{\bar{o}}$ are the generator matrix and the center of the resulting interval hull, respectively, and $Z(I_{n_{o}},\overline{c}_{k}^{\bar{o}},[~],[~],\overline{G}_{k}^{\bar{o}}1_{n_{o}})$ is employed to improve the numerical stability; note that $\widetilde{\mathcal{Z}}_{k}^{-,\bar{o}}$ is the projection of $\widetilde{\mathcal{Z}}_{k}^{-}=P\mathcal{Z}_{k}^{-}$ onto the subspace w.r.t. $\tilde{x}_{k}^{\bar{o}}$, where $\mathcal{Z}_{k}^{-}$ is derived during the process of calculating $\mathcal{Z}_{k}$ in Line 3 or Line 5.

  • Lines 8-12 are for $k\geq\bar{\delta}$, where $\widetilde{\mathcal{Z}}_{k}^{o}$ and $\widetilde{\mathcal{Z}}_{k}^{\bar{o}}$ are the projections of $\widetilde{\mathcal{Z}}_{k}=P\mathcal{Z}_{k}$ onto the subspaces w.r.t. $\tilde{x}_{k}^{o}$ and $\tilde{x}_{k}^{\bar{o}}$, respectively. Line 8 gives the estimate $\mathcal{Z}_{k}$, which is a finite-horizon version of Line 7 of Algorithm 2 over the time window $[k-\bar{\delta},\,k]$. In Line 8, $\bar{\theta}_{k}$ is derived based on Lemma 1 (when applying Lemma 1, one should regard $k-\bar{\delta}$, $\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\mathrm{center}(P_{o}\overline{\mathrm{IH}}(\mathcal{Z}_{k-\bar{\delta}}^{-}))]$, and $\hat{\mathcal{T}}_{k-\bar{\delta}}^{\bar{o}}$ as $0$, $\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}]$, and $\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket$, respectively; for $\bar{\theta}_{k}$, one can double $\theta_{k}$ from $\theta_{k}=1$ until the equality in (18) holds, where the equality can be checked by calculating interval hulls), and $\hat{\mathcal{T}}_{k-\bar{\delta}}^{\bar{o}}$ is calculated by Line 6 (for $k-\bar{\delta}<\bar{\delta}$) and Lines 9-12 (for $k-\bar{\delta}\geq\bar{\delta}$). In Line 8, we can also replace $\mathcal{Z}_{k-\bar{\delta}}^{-}$ with $\mathcal{Z}_{k-\bar{\delta}}$ to improve the efficiency when $\overline{\mathrm{IH}}(\mathcal{Z}_{k-\bar{\delta}})$ is calculated (see Section 7.2). Line 9 gives the interval hull of the "input" of the unobservable subsystem, where $\hat{c}_{k}^{\mathrm{in}}$ is the center of the resulting interval hull and $\hat{G}_{k}^{\mathrm{in}}$ is diagonal and positive semi-definite. In Line 10, $\|\hat{G}_{k}^{\mathrm{in}}\|_{\infty}$ represents the maximum half-edge length of the interval hull derived by Line 9. Thus, $\ell_{k}$ records the greatest maximum half-edge length up to $k$, which determines the radius (in the sense of the $\infty$-norm) of $\hat{\mathcal{T}}_{k-\bar{\delta}}^{\bar{o}}$ [see Line 12, where $d_{\infty}(\mathcal{S}):=\sup_{s,s^{\prime}\in\mathcal{S}}\|s-s^{\prime}\|_{\infty}$]; the center of $\hat{\mathcal{T}}_{k-\bar{\delta}}^{\bar{o}}$ is calculated by Line 11.
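Regarding the interval hulls $\overline{\mathrm{IH}}(\cdot)$ used in Lines 6 and 9: when the argument is an unconstrained zonotope, the hull is available in closed form, $\hat{c}\pm|\hat{G}|\hat{h}$ per coordinate; with equality constraints present, each bound instead requires one linear program, as noted in the explanation of Line 6. A sketch of the closed-form case (our own illustration):

```python
import numpy as np

def interval_hull_zonotope(G, c, h):
    """Closed-form interval hull of the *unconstrained* zonotope
    {G xi + c : |xi_j| <= h_j}.  With equality constraints A xi = b,
    each coordinate bound requires one LP instead."""
    r = np.abs(G) @ h          # per-coordinate radius
    return c - r, c + r
```

The returned pair gives the componentwise lower and upper bounds of the set, i.e., the tightest axis-aligned box containing it.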

The following theorem describes not only the stability w.r.t. the initial condition but also two further important properties of Algorithm 3. More specifically, for detectable $(A,C)$: the finite-time inclusion property fixes the "outer boundedness breaking" problem in the classical linear SMFing framework (see Remark 3), and the uniform boundedness of $\mathcal{Z}_{k}$ is guaranteed, i.e., the estimate cannot grow unbounded as time elapses; to the best of our knowledge, this is the first constrained zonotopic SMF with rigorously proven uniform boundedness.

Theorem 4 (Properties of OIT-CZ SMF).

If $(A,C)$ is detectable, the OIT-CZ SMF in Algorithm 3 is stable w.r.t. the initial condition. Furthermore, there exists a $\underline{k}\geq 2\bar{\delta}$ such that

\mathcal{Z}_{k}\supseteq\llbracket\mathbf{x}_{k}|y_{0:k}\rrbracket,\quad k\geq\underline{k}, (24)

which is the finite-time inclusion property of Algorithm 3. Finally, $\mathcal{Z}_{k}$ is uniformly bounded w.r.t. $k\in\mathbb{N}_{0}$.

Proof.

See Appendix H. ∎

Furthermore, Algorithm 3 has a low computational complexity per step, especially when the system is observable: the averaged complexity as $k\to\infty$ is determined by the case with $k\geq\bar{\delta}$ (i.e., Lines 8-12 in Algorithm 3); for observable $(A,C)$, only Line 8 remains, and we can also set $\mathcal{Z}_{k-\bar{\delta}}^{-}=\mathbb{R}^{n}=\mathcal{B}_{\infty}^{\infty}[0_{n}]$ in (23). In Section 7.2.2, we show that Algorithm 3 has higher time efficiency than the two most efficient constrained zonotopic SMFs in the toolbox CORA 2024 [1].

In terms of accuracy, Algorithm 3 can be regarded as a reduction of the optimal estimate in Algorithm 1 with constrained zonotopic descriptions. Different from existing reduction methods (e.g., [24]) based on geometric properties of constrained zonotopes, Algorithm 3 utilizes the properties of the dynamical system. Thus, the reduction operates in a long-term manner instead of a greedy/instantaneous manner, which greatly overcomes the wrapping effect. Therefore, the proposed OIT-CZ SMF achieves the desired performance improvement in terms of both complexity and accuracy.

7 Numerical Examples

7.1 Classical and Stability-Guaranteed Frameworks

In this subsection, we consider observable and detectable dynamical systems as illustrative examples to validate the theoretical results in Section 4 and Section 5.

Figure 2: Illustrative example for observable $(A,C)$: (a) initial conditions and the true initial set at $k=0$; (b) posterior sets and the true set $\llbracket\mathbf{x}_{1}|y_{0:1}\rrbracket$ at $k=1$; (c) posterior sets, the true set $\llbracket\mathbf{x}_{6}|y_{0:6}\rrbracket$, and the OIT (see Definition 6) at $k=6$. Alice and Bob know that, by Proposition 4, the initial condition should include $\llbracket\mathbf{x}_{0}\rrbracket$ to safely make Algorithm 1 stable w.r.t. the initial condition: Alice has good knowledge of $\llbracket\mathbf{x}_{0}\rrbracket$ and chooses $\llbracket\hat{\mathbf{x}}_{0}^{\mathrm{a}}\rrbracket=[0,4]\times[0,4]$ as the initial condition; Bob does not know as much as Alice, but he can make sure that $\llbracket\mathbf{x}_{0}\rrbracket$ is within $[-4,4]\times[-4,4]=\llbracket\hat{\mathbf{x}}_{0}^{\mathrm{b}}\rrbracket$ [see (a)]. Carol knows that, by Theorem 3, Algorithm 2 is always stable w.r.t. the initial condition, and thus she chooses $\llbracket\hat{\mathbf{x}}_{0}^{\mathrm{c}}\rrbracket=[-1,1]\times[-1,1]$ as the initial condition [in fact, $\llbracket\hat{\mathbf{x}}_{0}^{\mathrm{c}}\rrbracket\not\supseteq\llbracket\mathbf{x}_{0}\rrbracket$, see (a)]. David chooses $\llbracket\hat{\mathbf{x}}_{0}^{\mathrm{d}}\rrbracket=[-2,2]\times[-2,2]$ as the initial condition while using Algorithm 1 [note that $\llbracket\hat{\mathbf{x}}_{0}^{\mathrm{d}}\rrbracket\not\supseteq\llbracket\mathbf{x}_{0}\rrbracket$, see (a)].
At $k=1$, the estimates given by Alice, Bob, and Carol contain the true set, with $\llbracket\hat{\mathbf{x}}_{1}^{\mathrm{c}}|y_{0:1}\rrbracket\supset\llbracket\hat{\mathbf{x}}_{1}^{\mathrm{b}}|y_{0:1}\rrbracket\supset\llbracket\hat{\mathbf{x}}_{1}^{\mathrm{a}}|y_{0:1}\rrbracket\supset\llbracket\mathbf{x}_{1}|y_{0:1}\rrbracket$; David provides the smallest estimate, but $\llbracket\hat{\mathbf{x}}_{1}^{\mathrm{d}}|y_{0:1}\rrbracket$ is inside $\llbracket\mathbf{x}_{1}|y_{0:1}\rrbracket$ (in fact, his estimate becomes empty at $k=2$, i.e., $\llbracket\hat{\mathbf{x}}_{2}^{\mathrm{d}}|y_{0:2}\rrbracket=\emptyset$). At $k=6$, except for $\llbracket\hat{\mathbf{x}}_{6}^{\mathrm{d}}|y_{0:6}\rrbracket=\emptyset$, all the posterior sets become identical (i.e., the estimation gap is 0) and equal the OIT with $\delta=2$, which gives a tight bound on the posterior sets.

Observable (A,C)(A,C): Consider a discretized second-order system described by (1) and (2), with parameters

A=[1101],B=[0.51],C=[10],𝐰k=[1,1],𝐯k=[1,1],A=\begin{bmatrix}1&1\\ 0&1\end{bmatrix},\quad B=\begin{bmatrix}0.5\\ 1\end{bmatrix},\quad C=\begin{bmatrix}1&0\end{bmatrix},\\ \llbracket\mathbf{w}_{k}\rrbracket=[-1,1],\quad\llbracket\mathbf{v}_{k}\rrbracket=[-1,1], (25)

which means AA is not Schur stable and (A,C)(A,C) is observable with μ=2\mu=2. The true initial set is 𝐱0=[1,3]×[1,3]\llbracket\mathbf{x}_{0}\rrbracket=[1,3]\times[1,3].202020The probability distributions of uncertain variables 𝐱0,𝐰0:k,𝐯0:k\mathbf{x}_{0},\mathbf{w}_{0:k},\mathbf{v}_{0:k} can be arbitrary for simulations. In Section 7, these uncertain variables are set to be uniformly distributed in their ranges. The Matlab codes for all results in this paper are provided at https://github.com/congyirui/Stability-of-Linear-SMF-2024.
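The claims about (25) can be checked numerically. Below is a minimal sketch in Python/NumPy (the paper's own codes are in MATLAB; the variable names here are ours): it confirms that AA is not Schur stable and computes the observability index μ\mu as the smallest kk for which the stacked observability matrix has full rank.

```python
import numpy as np

# Sketch: verify the claims about system (25) -- A is not Schur stable
# (spectral radius >= 1) and (A, C) is observable with index mu = 2.
A = np.array([[1.0, 1.0],
              [0.0, 1.0]])
C = np.array([[1.0, 0.0]])
n = A.shape[0]

rho = max(abs(np.linalg.eigvals(A)))  # spectral radius; >= 1 means not Schur

# mu is the smallest k with rank([C; CA; ...; CA^{k-1}]) = n.
blocks, mu = [C], None
for k in range(1, n + 1):
    if np.linalg.matrix_rank(np.vstack(blocks)) == n:
        mu = k
        break
    blocks.append(blocks[-1] @ A)
```

For (25), the loop terminates at k = 2, matching μ=2\mu=2.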

Assume that there are four designers, Alice, Bob, Carol, and David, who design SMFs for (25), and the true initial set 𝐱0\llbracket\mathbf{x}_{0}\rrbracket is unknown to them. Alice, Bob, and David employ the classical filtering framework in Algorithm 1, while Carol uses the OIT-inspired filtering framework in Algorithm 2, where P=I2P=I_{2}.212121Note that the exact solutions of Algorithm 1 and Algorithm 2 can be derived by employing a halfspace-representation-based method with acceptable computational complexity. From Fig. 2, we can see that the differences in the initial conditions chosen by the first three designers are corrected by the measurements y0:6y_{0:6}, and the estimates converge to 𝐱6|y0:6\llbracket\mathbf{x}_{6}|y_{0:6}\rrbracket; thus, the estimation gaps of these three filters are 0 for k6k\geq 6, which implies the estimation gaps are bounded for k0k\in\mathbb{N}_{0} and corroborates the results in Theorem 2 and Theorem 3. In contrast, the initial condition chosen by David does not satisfy condition (i) in Theorem 2, and the resulting estimate becomes empty as time elapses.

Detectable (A,C)(A,C): Consider the system with

A=[0.5101],B=[0.51],C=[01],𝐰k=[1,1],𝐯k=[1,1],A=\begin{bmatrix}0.5&1\\ 0&1\end{bmatrix},\quad B=\begin{bmatrix}0.5\\ 1\end{bmatrix},\quad C=\begin{bmatrix}0&1\end{bmatrix},\\ \llbracket\mathbf{w}_{k}\rrbracket=[-1,1],\quad\llbracket\mathbf{v}_{k}\rrbracket=[-1,1], (26)

which implies AA is not Schur stable and (A,C)(A,C) is detectable with μo=1\mu_{o}=1. The true initial set is 𝐱0=[1,3]×[1,3]\llbracket\mathbf{x}_{0}\rrbracket=[1,3]\times[1,3].
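Detectability of (26) can likewise be verified numerically via the PBH test: every eigenvalue λ\lambda of AA with |λ|1|\lambda|\geq 1 must satisfy rank([AλI;C])=n\mathrm{rank}([A-\lambda I;\,C])=n. A sketch in Python/NumPy (our helper code, not the paper's):

```python
import numpy as np

# Sketch: system (26) is unobservable yet detectable. The PBH test requires
# rank([A - lambda*I; C]) = n for every eigenvalue with |lambda| >= 1.
A = np.array([[0.5, 1.0],
              [0.0, 1.0]])
C = np.array([[0.0, 1.0]])
n = A.shape[0]

# Observability matrix [C; CA] has rank 1 < n, so (A, C) is unobservable.
unobservable = np.linalg.matrix_rank(np.vstack([C, C @ A])) < n

# The only marginally stable / unstable mode is lambda = 1; it passes PBH.
detectable = all(
    np.linalg.matrix_rank(np.vstack([A - lam * np.eye(n), C])) == n
    for lam in np.linalg.eigvals(A)
    if abs(lam) >= 1.0 - 1e-9
)
```

Here the unobservable mode is the stable eigenvalue 0.50.5, which is why (A,C)(A,C) is detectable even though it fails observability.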

The initial conditions chosen by Alice, Bob, Carol, and David are identical to those in Fig. 2, and Algorithm 2 uses P=[0110]P=\left[\begin{smallmatrix}0&1\\ -1&0\end{smallmatrix}\right]. Fig. 3 corroborates the theoretical results in Theorem 2 and Theorem 3: the estimation gaps corresponding to non-empty estimates are bounded; more specifically, they converge to 0 exponentially fast.

Refer to caption
Figure 3: Diameters and estimation gaps for detectable (A,C)(A,C) in one simulation run. For the first three designers, d(𝐱^ka|y0:k)d(\llbracket\hat{\mathbf{x}}_{k}^{\mathrm{a}}|y_{0:k}\rrbracket), d(𝐱^kb|y0:k)d(\llbracket\hat{\mathbf{x}}_{k}^{\mathrm{b}}|y_{0:k}\rrbracket), and d(𝐱^kc|y0:k)d(\llbracket\hat{\mathbf{x}}_{k}^{\mathrm{c}}|y_{0:k}\rrbracket) are bounded and converge to d(𝐱k|y0:k)d(\llbracket\mathbf{x}_{k}|y_{0:k}\rrbracket); the estimation gaps converge to 0 exponentially fast. For the last designer, d(𝐱^kd|y0:k)=0d(\llbracket\hat{\mathbf{x}}_{k}^{\mathrm{d}}|y_{0:k}\rrbracket)=0 from k=2k=2 (due to 𝐱^2d|y0:2=\llbracket\hat{\mathbf{x}}_{2}^{\mathrm{d}}|y_{0:2}\rrbracket=\emptyset).

7.2 The Stable and Fast Constrained Zonotopic SMF

Firstly, we use Monte Carlo simulation to test the OIT-CZ SMF in Section 7.2.1; then, we show the time efficiency of the OIT-CZ SMF in Section 7.2.2.

7.2.1 Interval hull of the estimate

In this part, we employ the OIT-CZ SMF to derive the interval hull of the estimate.

First, consider the randomly generated (A,B,C)(A,B,C) (by using the drss function in MATLAB) with observable (A,C)(A,C). The process and measurement noises satisfy 𝐰k=[1,1]p\llbracket\mathbf{w}_{k}\rrbracket=[-1,1]^{p} and 𝐯k=[1,1]m\llbracket\mathbf{v}_{k}\rrbracket=[-1,1]^{m}. The true initial set is 𝐱0=[10,10]n\llbracket\mathbf{x}_{0}\rrbracket=[-10,10]^{n}, and the initial condition is 𝐱^0=[10,10]n{c}\llbracket\hat{\mathbf{x}}_{0}\rrbracket=[-10,10]^{n}\oplus\{c\}, where cc is randomly generated in [1,1]n[-1,1]^{n} for testing Algorithm 3. In the simulations, we set n=10n=10, p=m{5,,10}p=m\in\{5,\ldots,10\}, and δ¯=nrank(C)+3>μ1\bar{\delta}=n-\mathrm{rank}(C)+3>\mu-1 (in Algorithm 3); for each p=mp=m, the simulations are conducted 10001000 times. The results are shown in Fig. 4, where one of the simulation runs is highlighted by the (yellow) dash-dotted lines; the (purple) stars mark the time steps at which 𝒵k=\mathcal{Z}_{k}=\emptyset holds in Line 4 of Algorithm 3 and Line 5 derives a non-empty 𝒵k\mathcal{Z}_{k} by resetting 𝒵0\mathcal{Z}_{0}^{-}. We can see that the proposed OIT-CZ SMF is stable and uniformly bounded, which corroborates the theoretical results in Theorem 4.

Refer to caption
Refer to caption
Figure 4: The diameter of the estimate 𝒵k\mathcal{Z}_{k} and a bound on the estimation gap dkg(𝒵k)d_{k}^{\mathrm{g}}(\mathcal{Z}_{k}) for OIT-CZ SMF, in the sense of \infty-norm. (a) observable (A,C)(A,C), (b) detectable (A,C)(A,C). The bound on dkg(𝒵k)d_{k}^{\mathrm{g}}(\mathcal{Z}_{k}) is dH(𝒵k,{xk}):=supx^k𝒵kx^kxkd_{\mathrm{H}}^{\infty}(\mathcal{Z}_{k},\{x_{k}\}):=\sup_{\hat{x}_{k}\in\mathcal{Z}_{k}}\|\hat{x}_{k}-x_{k}\|_{\infty}, which can be easily calculated by supx^kIH¯(𝒵k)x^kxk\sup_{\hat{x}_{k}\in\overline{\mathrm{IH}}(\mathcal{Z}_{k})}\|\hat{x}_{k}-x_{k}\|_{\infty}.
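The bound in the caption reduces to a coordinate-wise computation over the interval hull [lo,hi][\mathrm{lo},\mathrm{hi}]: the supremum of x^kxk\|\hat{x}_{k}-x_{k}\|_{\infty} over a box is attained, in each coordinate, at the interval endpoint farther from xkx_{k}. A sketch (the helper gap_bound_inf is ours, not from the paper's code):

```python
import numpy as np

# sup_{x_hat in [lo, hi]} ||x_hat - x_true||_inf over an axis-aligned box:
# per coordinate, the farther endpoint attains the supremum; take the max.
def gap_bound_inf(lo, hi, x_true):
    lo, hi, x_true = map(np.asarray, (lo, hi, x_true))
    return float(np.max(np.maximum(np.abs(lo - x_true), np.abs(hi - x_true))))
```

For example, the hull [1,3]×[1,2][-1,3]\times[-1,2] around the true state at the origin gives a bound of 3 in the \infty-norm.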

Second, consider the randomly generated (A,B,C)(A,B,C) (by using the drss function in MATLAB) with detectable (A,C)(A,C). For the unobservable subsystem, A~o¯\tilde{A}_{\bar{o}} is a randomly created matrix with ρ(A~o¯)0.5\rho(\tilde{A}_{\bar{o}})\leq 0.5; each element in A~21\tilde{A}_{21} and B~o¯\tilde{B}_{\bar{o}} is randomly selected from [0,1][0,~{}1]. The transformation matrix PP is a randomly derived orthogonal matrix such that, by (17), A=P1A~PA=P^{-1}\tilde{A}P, B=P1B~B=P^{-1}\tilde{B}, and C=[C~o0]PC=[\tilde{C}_{o}~{}0]P are obtained. The sets 𝐰k\llbracket\mathbf{w}_{k}\rrbracket, 𝐯k\llbracket\mathbf{v}_{k}\rrbracket, 𝐱0\llbracket\mathbf{x}_{0}\rrbracket, and 𝐱^0\llbracket\hat{\mathbf{x}}_{0}\rrbracket are the same as those in the simulations for observable (A,C)(A,C). Also, we set n=10n=10, no=p=m{7,8,9}n_{o}=p=m\in\{7,8,9\}, δ¯=norank(C~o)+3>μo1\bar{\delta}=n_{o}-\mathrm{rank}(\tilde{C}_{o})+3>\mu_{o}-1, and ε=0.001\varepsilon=0.001. For each no=p=mn_{o}=p=m, the simulations are conducted 10001000 times. The results are shown in Fig. 4, which validates the results in Theorem 4.

7.2.2 Time Efficiency

In this part, we set222222If 𝐱^0𝐱0\llbracket\hat{\mathbf{x}}_{0}\rrbracket\neq\llbracket\mathbf{x}_{0}\rrbracket, the classical algorithm can return an error due to the ill-posedness. 𝐱^0=𝐱0\llbracket\hat{\mathbf{x}}_{0}\rrbracket=\llbracket\mathbf{x}_{0}\rrbracket and compare the computation time (w.r.t. the constrained zonotopic description) of the proposed OIT-CZ SMF and two classical constrained zonotopic SMFs; the two classical SMFs use the reduction methods girard and combastel in CORA 2024 [1], respectively. Consider the randomly generated observable systems the same as those in Section 7.2.1, but with n=p=m{10,20,30}n=p=m\in\{10,20,30\}. We also set δ¯=nrank(C)+3\bar{\delta}=n-\mathrm{rank}(C)+3 in Algorithm 3. For the two classical SMFs, we set nc=1n_{c}=1 and the degrees-of-freedom order [24] od=(ngnc)/n=1o_{d}=(n_{g}-n_{c})/n=1. The simulations are conducted for 100100 runs over the time window [0,100][0,100] by using Matlab 2019b on a laptop with Intel Core [email protected] CPU, and the averaged computation time (per time step) is shown in Table 1. Note that the diameters in the sense of \infty-norm for SMF (girard), SMF (combastel), and OIT-CZ SMF are respectively: 14.890214.8902, 14.890214.8902, and 3.68783.6878 (when n=10n=10); 29.309829.3098, 29.137129.1371, and 4.75964.7596 (when n=20n=20); 37.230137.2301, 37.230137.2301, and 5.09115.0911 (when n=30n=30). These results show that the OIT-CZ SMF achieves significantly higher accuracy with several orders of magnitude reduction in computation time, as compared to the classical SMFs.

Table 1: Computation Time (per Time Step) for the Constrained Zonotopic Description
n=10n=10 n=20n=20 n=30n=30
SMF (girard) 0.58220.5822s 3.81033.8103s 11.970211.9702s
SMF (combastel) 0.58090.5809s 3.80143.8014s 11.954611.9546s
OIT-CZ SMF 0.00040.0004s 0.00060.0006s 0.00110.0011s

8 Conclusion

In this paper, the stability of SMFs w.r.t. the initial condition has been studied for linear time-invariant systems. Specifically, stability is characterized by well-posedness and a bounded estimation gap. Based on our proposed OIT, we have analyzed the stability of the classical linear SMFing framework, where an explicit sufficient condition has been given. Then, we have provided a necessary condition for the bounded estimation gap, which is very close to the sufficient condition. To avoid unstable filter designs resulting from improper initial conditions, the OIT-inspired filtering framework has been established to guarantee stability. Under this new framework, we have developed the OIT-CZ SMF with guaranteed stability, uniform boundedness, high efficiency, and good accuracy.

References

  • [1] Althoff, M., Kochdumper, N., & Wetzlinger, M. CORA 2024 manual. 2024.
  • [2] Althoff, M. & Rath, J.J. Comparison of guaranteed state estimators for linear time-invariant systems. Automatica, 130:109662, Aug. 2021.
  • [3] Becis-Aubry, Y. Ellipsoidal constrained state estimation in presence of bounded disturbances. arXiv:2012.03267, 2021.
  • [4] Becis-Aubry, Y., Boutayeb, M., & Darouach, M. State estimation in the presence of bounded disturbances. Automatica, 44(7):1867–1873, Jul. 2008.
  • [5] Chen, C.T. Linear System Theory and Design. New York, NY, USA: Oxford University Press, 3rd edition, 1999.
  • [6] Chen, J. & Gu, G. Control-oriented system identification: an HH_{\infty} approach. New York, NY, USA: John Wiley & Sons, 2000.
  • [7] Combastel, C. A state bounding observer based on zonotopes. In Proc. Eur. Control Conf. (ECC), pages 2589–2594, Sep. 2003.
  • [8] Combastel, C. Zonotopes and Kalman observers: Gain optimality under distinct uncertainty paradigms and robust convergence. Automatica, 55:265–273, May 2015.
  • [9] Cong, Y., Zhou, X., & Kennedy, R.A. Finite blocklength entropy-achieving coding for linear system stabilization. IEEE Trans. Autom. Control, 66:153–167, Jan. 2021.
  • [10] Cong, Y., Wang, X., & Zhou, X. Rethinking the mathematical framework and optimality of set-membership filtering. IEEE Trans. Autom. Control, 67(5):2544–2551, May 2022.
  • [11] Fei, Z., Yang, L., Sun, X., & Ren, S. Zonotopic set-membership state estimation for switched systems with restricted switching. IEEE Trans. Autom. Control, 67(11):6127–6134, Nov. 2021.
  • [12] Jazwinski, A.H. Stochastic Processes and Filtering Theory. New York, USA: Academic Press, New York, NY, USA, 1970.
  • [13] Kalman, R.E. & Bucy, R.S. New results in linear filtering and prediction theory. J. Basic Eng., 83(1):95–108, 1961.
  • [14] Kühn, W. Rigorously computed orbits of dynamical systems without the wrapping effect. Computing, 61(1):47–67, Mar. 1998.
  • [15] Lewis, F., Xie, L., & Popa, D. Optimal and robust estimation: with an introduction to stochastic control theory. CRC Press, Boca Raton, FL, USA, 2ed. edition, 2007.
  • [16] Liu, Y., Zhao, Y., & Wu, F. Ellipsoidal state-bounding-based set-membership estimation for linear system with unknown-but-bounded disturbances. IET Control Theory & Appl., 10(4):431–442, Feb. 2016.
  • [17] Loukkas, N., Martinez, J.J., & Meslem, N. Set-membership observer design based on ellipsoidal invariant sets. IFAC-PapersOnLine, 50(1):6471–6476, Jul. 2017.
  • [18] Mazenc, F. & Bernard, O. Interval observers for linear time-invariant systems with disturbances. Automatica, 47(1):140–147, Jan. 2011.
  • [19] Milanese, M., Norton, J., Piet-Lahanier, H., & Walter, É. Bounding Approaches to System Identification. New York, NY, USA: Plenum Press, 1996.
  • [20] Milanese, M., Tempo, R., & Vicino, A. Robustness in Identification and Control. New York, NY, USA: Plenum Press, 1989.
  • [21] Nair, G.N. A nonstochastic information theory for communication and state estimation. IEEE Trans. Autom. Control, 58(6):1497–1510, Jun. 2013.
  • [22] Ocone, D. & Pardoux, E. Asymptotic stability of the optimal filter with respect to its initial condition. SIAM J. Control Optim., 34(1):226–243, 1996.
  • [23] Särkkä, S. Bayesian filtering and smoothing. Cambridge University Press, New York, NY, USA, 2013.
  • [24] Scott, J.K., Raimondo, D.M., Marseglia, G.R., & Braatz, R.D. Constrained zonotopes: A new tool for set-based estimation and fault detection. Automatica, 69:126–136, Jul. 2016.
  • [25] Shen, Q., Liu, J., Zhou, X., Zhao, Q., & Wang, Q. Low-complexity ISS state estimation approach with bounded disturbances. Int. J. Adaptive Control and Signal Process., 32(10):1473–1488, Aug. 2018.
  • [26] van Handel, R. Filtering, stability, and robustness. PhD thesis, California Institute of Technology, 2006.
  • [27] Wang, Y., Puig, V., & Cembrano, G. Set-membership approach and Kalman observer based on zonotopes for discrete-time descriptor systems. Automatica, 93:435–443, Jul. 2018.
  • [28] Wang, Y., Wang, Z., Puig, V., & Cembrano, G. Zonotopic set-membership state estimation for discrete-time descriptor LPV systems. IEEE Trans. Autom. Control, 64(5):2092–2099, Aug. 2019.
  • [29] Xu, F., Tan, J., Raïssi, T., & Liang, B. Design of optimal interval observers using set-theoretic methods for robust state estimation. Int. J. Robust and Nonlinear Control, 30(9):3692–3705, Jun. 2020.

Appendix A Proof of Proposition 1

We prove Proposition 1 by induction.

Base case: For k=0k=0, it follows from (4), (12), and (13) that 𝐱^k|y0:k=𝒪0,00\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket=\mathcal{O}_{0,0}\bigcap\mathcal{E}_{0}, i.e., (14) holds for k=0k=0.

Inductive step: Assume (14) holds for any k=k0k=k^{\prime}\in\mathbb{N}_{0}. For k=k+1k=k^{\prime}+1, the prior set is derived by (3) that

𝐱^k+1|y0:k=A𝐱^k|y0:kB𝐰kA(i=0k𝒪k,ik)B𝐰k(a)i=0k(A𝒪k,iB𝐰k)(AkB𝐰k)=(b)i=0k𝒪k+1,ik+1,\begin{split}&\llbracket\hat{\mathbf{x}}_{k^{\prime}+1}|y_{0:k^{\prime}}\rrbracket=A\llbracket\hat{\mathbf{x}}_{k^{\prime}}|y_{0:k^{\prime}}\rrbracket\oplus B\llbracket\mathbf{w}_{k^{\prime}}\rrbracket\\ &\subseteq A\bigg{(}\bigcap_{i=0}^{k^{\prime}}\mathcal{O}_{k^{\prime},i}\bigcap\mathcal{E}_{k^{\prime}}\bigg{)}\oplus B\llbracket\mathbf{w}_{k^{\prime}}\rrbracket\\ &\stackrel{{\scriptstyle(a)}}{{\subseteq}}\bigcap_{i=0}^{k^{\prime}}\left(A\mathcal{O}_{k^{\prime},i}\oplus B\llbracket\mathbf{w}_{k^{\prime}}\rrbracket\right)\bigcap\left(A\mathcal{E}_{k^{\prime}}\oplus B\llbracket\mathbf{w}_{k^{\prime}}\rrbracket\right)\\ &\stackrel{{\scriptstyle(b)}}{{=}}\bigcap_{i=0}^{k^{\prime}}\mathcal{O}_{k^{\prime}+1,i}\bigcap\mathcal{E}_{k^{\prime}+1},\end{split} (27)

where (a)(a) is established by (11), and (b)(b) is from the following two equations:

A𝒪k,iB𝐰k\displaystyle A\mathcal{O}_{k^{\prime},i}\oplus B\llbracket\mathbf{w}_{k^{\prime}}\rrbracket =(c)𝒪k+1,i,\displaystyle\stackrel{{\scriptstyle(c)}}{{=}}\mathcal{O}_{k^{\prime}+1,i}, (28)
AkB𝐰k\displaystyle A\mathcal{E}_{k^{\prime}}\oplus B\llbracket\mathbf{w}_{k^{\prime}}\rrbracket =(d)k+1,\displaystyle\stackrel{{\scriptstyle(d)}}{{=}}\mathcal{E}_{k^{\prime}+1}, (29)

where (c)(c) and (d)(d) follow from h(i𝒮i)=ih(𝒮i)h^{*}\big{(}\sum_{i\in\mathcal{I}}\mathcal{S}_{i}\big{)}=\sum_{i\in\mathcal{I}}h^{*}(\mathcal{S}_{i}) for any linear map hh^{*}. By (27) and (4), the posterior set is outer bounded by

𝐱^k+1|y0:k+1=𝒳k+1(C,yk+1,𝐯k+1)𝐱^k+1|y0:k=𝒪k+1,k+1𝐱^k+1|y0:ki=0k+1𝒪k+1,ik+1,\begin{split}&\llbracket\hat{\mathbf{x}}_{k^{\prime}+1}|y_{0:k^{\prime}+1}\rrbracket=\mathcal{X}_{k^{\prime}+1}(C,y_{k^{\prime}+1},\llbracket\mathbf{v}_{k^{\prime}+1}\rrbracket)\bigcap\llbracket\hat{\mathbf{x}}_{k^{\prime}+1}|y_{0:k^{\prime}}\rrbracket\\ &=\mathcal{O}_{k^{\prime}+1,k^{\prime}+1}\bigcap\llbracket\hat{\mathbf{x}}_{k^{\prime}+1}|y_{0:k^{\prime}}\rrbracket\subseteq\bigcap_{i=0}^{k^{\prime}+1}\mathcal{O}_{k^{\prime}+1,i}\bigcap\mathcal{E}_{k^{\prime}+1},\end{split}

which implies (14) holds for k=k+1k=k^{\prime}+1. Thus, Proposition 1 is proven by induction. \square
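As a toy illustration of the prediction-update recursion in this proof, consider a scalar analogue of (3) and (4): xk+1=axk+wkx_{k+1}=ax_{k}+w_{k} with wk[1,1]w_{k}\in[-1,1] and yk=xk+vky_{k}=x_{k}+v_{k} with vk[1,1]v_{k}\in[-1,1]. Sets are then intervals, prediction is a scaling plus a Minkowski sum, and the update intersects the prior with the measurement-consistent strip. The functions below are our sketch, not the paper's algorithm:

```python
# Scalar interval analogue of the SMF recursion (3)-(4).
def predict(post, a=1.0, w=(-1.0, 1.0)):
    # Prediction (3): prior = a * posterior (+) [w_lo, w_hi] (Minkowski sum).
    lo, hi = (a * post[0], a * post[1]) if a >= 0 else (a * post[1], a * post[0])
    return (lo + w[0], hi + w[1])

def update(prior, y, v=(-1.0, 1.0)):
    # Update (4): intersect the prior with {x : y - x in [v_lo, v_hi]}.
    lo = max(prior[0], y - v[1])
    hi = min(prior[1], y - v[0])
    return (lo, hi) if lo <= hi else None  # None: empty set (ill-posedness)
```

For instance, a posterior [0,4][0,4] predicts to [1,5][-1,5], and a measurement y=1y=1 shrinks it to [0,2][0,2]; a prior disjoint from the measurement strip returns the empty set, mirroring David's ill-posed initialization in Fig. 2.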

Appendix B Proof of Theorem 1

To start with, we provide two lemmas as follows.

Lemma 2.

(Uniform Boundedness Invariance Under Linear Map) l\forall l\in\mathcal{I}, let the set 𝒯l,kn\mathcal{T}_{l,k}\subset\mathbb{R}^{n} be uniformly bounded w.r.t. k0k\in\mathbb{N}_{0}, and the linear map hlh_{l}^{*} be independent of kk. Then, lhl(𝒯l,k)\sum_{l\in\mathcal{I}}h_{l}^{*}(\mathcal{T}_{l,k}) is uniformly bounded w.r.t. k0k\in\mathbb{N}_{0}.

Proof.

Since l\forall l\in\mathcal{I}, 𝒯l,k\mathcal{T}_{l,k} is uniformly bounded w.r.t. k0k\in\mathbb{N}_{0}, there exists a d¯l\bar{d}_{l} such that d(𝒯l,k)d¯l,k0d(\mathcal{T}_{l,k})\leq\bar{d}_{l},\quad\forall k\in\mathbb{N}_{0}. Thus, l\forall l\in\mathcal{I}, the following holds for all k0k\in\mathbb{N}_{0}

d(hl(𝒯l,k))=supt,t𝒯l,khl(t)hl(t)hlsupt,t𝒯l,ktthld¯l,\begin{split}d(h_{l}^{*}(\mathcal{T}_{l,k}))&=\sup_{t,t^{\prime}\in\mathcal{T}_{l,k}}\|h_{l}^{*}(t)-h_{l}^{*}(t^{\prime})\|\\ &\leq\|h_{l}^{*}\|\sup_{t,t^{\prime}\in\mathcal{T}_{l,k}}\|t-t^{\prime}\|\leq\|h_{l}^{*}\|\bar{d}_{l},\end{split} (30)

where hl\|h_{l}^{*}\| is the operator norm of hlh_{l}^{*}. Now, for all k0k\in\mathbb{N}_{0}, we have

d(lhl(𝒯l,k))(e)ld(hl(𝒯l,k))lhld¯l,\!\!\!d\bigg{(}\sum_{l\in\mathcal{I}}h_{l}^{*}(\mathcal{T}_{l,k})\bigg{)}\stackrel{{\scriptstyle(e)}}{{\leq}}\sum_{l\in\mathcal{I}}d(h_{l}^{*}(\mathcal{T}_{l,k}))\leq\sum_{l\in\mathcal{I}}\|h_{l}^{*}\|\bar{d}_{l}, (31)

where inequality (e)(e) can be easily obtained by using the following triangle inequality: for the sets l\mathcal{R}_{l} (ll\in\mathcal{I}),

d(ll)=suprl,rll,ll(rlrl)lsuprl,rlrlrl=ld(l).d\bigg{(}\sum_{l\in\mathcal{I}}\mathcal{R}_{l}\bigg{)}=\sup_{r_{l},r^{\prime}_{l}\in\mathcal{R}_{l},l\in\mathcal{I}}\|\sum_{l\in\mathcal{I}}(r_{l}-r^{\prime}_{l})\|\\ \leq\sum_{l\in\mathcal{I}}\sup_{r_{l},r^{\prime}_{l}}\|r_{l}-r^{\prime}_{l}\|=\sum_{l\in\mathcal{I}}d(\mathcal{R}_{l}). (32)

By (31), lhl(𝒯l,k)\sum_{l\in\mathcal{I}}h_{l}^{*}(\mathcal{T}_{l,k}) is uniformly bounded. ∎
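The bound (31) can be spot-checked numerically on sampled point clouds (our sketch; diameters in the Euclidean norm, operator norms as spectral norms, and the Minkowski sum formed pairwise over the samples):

```python
import numpy as np

# Spot-check of (31): d(sum_l h_l(T_l)) <= sum_l ||h_l|| * d(T_l).
rng = np.random.default_rng(0)

def diameter(pts):
    # Euclidean diameter of a finite point cloud.
    diffs = pts[:, None, :] - pts[None, :, :]
    return float(np.max(np.linalg.norm(diffs, axis=-1)))

H = [rng.normal(size=(2, 2)) for _ in range(2)]           # linear maps h_l
T = [rng.uniform(-1, 1, size=(30, 2)) for _ in range(2)]  # bounded sets T_l

# Minkowski sum of the mapped sample clouds (all pairwise sums).
S = (T[0] @ H[0].T)[:, None, :] + (T[1] @ H[1].T)[None, :, :]
lhs = diameter(S.reshape(-1, 2))
rhs = sum(np.linalg.norm(h, 2) * diameter(t) for h, t in zip(H, T))
```

Since (31) holds for arbitrary bounded sets, it holds in particular for the sampled clouds, so lhs never exceeds rhs.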

Lemma 3 (Uniformly Bounded Intersection).

j{0,,δ}\forall j\in\{0,\ldots,\delta\} (δ0\delta\geq 0), let 𝒮j,kn\mathcal{S}_{j,k}\subset\mathbb{R}^{n} be any uniformly bounded set w.r.t. k0k\in\mathbb{N}_{0}. Then,

j=0δ[ran(D¯j)𝒮j,k]\bigcap_{j=0}^{\delta}\big{[}\mathrm{ran}(\bar{D}_{j})\oplus\mathcal{S}_{j,k}\big{]} (33)

is uniformly bounded w.r.t. k0k\in\mathbb{N}_{0} [recall that ran(D¯j)\mathrm{ran}(\bar{D}_{j}) returns the range space of matrix D¯jn×(nm)\bar{D}_{j}\in\mathbb{R}^{n\times(n-m)}], if

rank(DT)=n,\mathrm{rank}\left(D^{\mathrm{T}}\right)=n, (34)

where D=[D0Dδ]D=[D_{0}\ldots D_{\delta}], and Djn×mD_{j}\in\mathbb{R}^{n\times m} (j{0,,δ}j\in\{0,\ldots,\delta\}) satisfies ker(DjT)=ran(D¯j)\ker(D_{j}^{\mathrm{T}})=\mathrm{ran}(\bar{D}_{j}).

Proof.

This proof has three steps. In the first step, we prove the following equality holds:

j=0δ[ran(D¯j)𝒮j,k]=sk𝒮kj=0δ[ran(D¯j){sj,k}],\!\!\!\!\bigcap_{j=0}^{\delta}\big{[}\mathrm{ran}(\bar{D}_{j})\oplus\mathcal{S}_{j,k}\big{]}=\!\bigcup_{s_{k}\in\mathcal{S}_{k}}\bigcap_{j=0}^{\delta}\big{[}\mathrm{ran}(\bar{D}_{j})\oplus\{s_{j,k}\}\big{]}, (35)

where sk=[s0,kTsδ,kT]Ts_{k}=[s_{0,k}^{\mathrm{T}}\ldots s_{\delta,k}^{\mathrm{T}}]^{\mathrm{T}} and 𝒮k:=𝒮0,k××𝒮δ,k\mathcal{S}_{k}:=\mathcal{S}_{0,k}\times\cdots\times\mathcal{S}_{\delta,k} and ×\times stands for the Cartesian product. In the second step, we analyze j=0δ[ran(D¯j){sj,k}]\bigcap_{j=0}^{\delta}\big{[}\mathrm{ran}(\bar{D}_{j})\oplus\{s_{j,k}\}\big{]} from the perspective of solving linear equations. Afterwards, we complete this proof in the third step.

Step 1: Since ran(D¯j)𝒮j,k=sj,k𝒮j,k[ran(D¯j){sj,k}]=:sj,k𝒮j,k𝒯j,sj,k\mathrm{ran}(\bar{D}_{j})\oplus\mathcal{S}_{j,k}=\bigcup_{s_{j,k}\in\mathcal{S}_{j,k}}[\mathrm{ran}(\bar{D}_{j})\oplus\{s_{j,k}\}]=:\bigcup_{s_{j,k}\in\mathcal{S}_{j,k}}\mathcal{T}_{j,s_{j,k}}, we can readily use the distributive law of sets to get (35). Specifically, we have232323The readers can use mathematical induction to derive a rigorous proof. Due to the page limit, we omit it here.

j=0δsj,k𝒮j,k𝒯j,sj,k=(s0,k,,sδ,k)𝒮kj=0δ𝒯j,sj,k,\bigcap_{j=0}^{\delta}\bigcup_{s_{j,k}\in\mathcal{S}_{j,k}}\mathcal{T}_{j,s_{j,k}}=\bigcup_{(s_{0,k},\ldots,s_{\delta,k})\in\mathcal{S}_{k}}\bigcap_{j=0}^{\delta}\mathcal{T}_{j,s_{j,k}}, (36)

which means (35) holds.

Step 2: sk𝒮k\forall s_{k}\in\mathcal{S}_{k}, with ker(DjT)=ran(D¯j)\ker(D_{j}^{\mathrm{T}})=\mathrm{ran}(\bar{D}_{j}) we have

ran(D¯j){sj,k}={x:DjTx=DjTsj,k},j{0,,δ},\mathrm{ran}(\bar{D}_{j})\oplus\{s_{j,k}\}=\left\{x\colon D_{j}^{\mathrm{T}}x=D_{j}^{\mathrm{T}}s_{j,k}\right\},\quad j\in\{0,\ldots,{\delta}\},

which is the solution space of the linear equation DjTx=DjTsj,kD_{j}^{\mathrm{T}}x=D_{j}^{\mathrm{T}}s_{j,k}. Thus, we get

j=0δ[ran(D¯j){sj,k}]={x:DTx=s¯k},\bigcap_{j=0}^{\delta}\big{[}\mathrm{ran}(\bar{D}_{j})\oplus\{s_{j,k}\}\big{]}=\left\{x\colon D^{\mathrm{T}}x=\bar{s}_{k}\right\}, (37)

where s¯k=[s0,kTD0sδ,kTDδ]T\bar{s}_{k}=[s_{0,k}^{\mathrm{T}}D_{0}\ldots s_{\delta,k}^{\mathrm{T}}D_{\delta}]^{\mathrm{T}}. With (34), we know that DTx=s¯kD^{\mathrm{T}}x=\bar{s}_{k} has either a unique solution (DT)+s¯k(D^{\mathrm{T}})^{+}\bar{s}_{k} or no solution, i.e., j=0δ[ran(D¯j){sj,k}]\bigcap_{j=0}^{\delta}\big{[}\mathrm{ran}(\bar{D}_{j})\oplus\{s_{j,k}\}\big{]} at most contains one element. Hence, we have

j=0δ[ran(D¯j){sj,k}]={{(DT)+s¯k}sk𝒮k,otherwise,\bigcap_{j=0}^{\delta}\big{[}\mathrm{ran}(\bar{D}_{j})\oplus\{s_{j,k}\}\big{]}=\begin{cases}\left\{\left(D^{\mathrm{T}}\right)^{+}\bar{s}_{k}\right\}&s_{k}\in\mathcal{S}_{k}^{\neq\emptyset},\\ \emptyset&\mathrm{otherwise},\end{cases} (38)

where 𝒮k={sk𝒮k:{x:DTx=s¯k}}\mathcal{S}_{k}^{\neq\emptyset}=\left\{s_{k}\in\mathcal{S}_{k}\colon\left\{x\colon D^{\mathrm{T}}x=\bar{s}_{k}\right\}\neq\emptyset\right\}.

Step 3: With (38), we can rewrite (35) as follows

j=0δ[ran(D¯j)𝒮j,k]=sk𝒮k{(DT)+s¯k}=sk𝒮k{Hsk}=H𝒮k,\begin{split}\bigcap_{j=0}^{\delta}\big{[}\mathrm{ran}(\bar{D}_{j})\oplus\mathcal{S}_{j,k}\big{]}&=\bigcup_{s_{k}\in\mathcal{S}_{k}^{\neq\emptyset}}\left\{\left(D^{\mathrm{T}}\right)^{+}\bar{s}_{k}\right\}\\ &=\bigcup_{s_{k}\in\mathcal{S}_{k}^{\neq\emptyset}}\left\{Hs_{k}\right\}=H\mathcal{S}_{k}^{\neq\emptyset},\end{split} (39)

where H=(DT)+diag{D0T,,DδT}H=\left(D^{\mathrm{T}}\right)^{+}\mathrm{diag}\{D_{0}^{\mathrm{T}},\ldots,D_{\delta}^{\mathrm{T}}\}. As 𝒮k𝒮k\mathcal{S}_{k}^{\neq\emptyset}\subseteq\mathcal{S}_{k} is uniformly bounded242424This is because d(𝒮k)d(𝒮k)i=0δd2(𝒮i,k)d(\mathcal{S}_{k}^{\neq\emptyset})\leq d(\mathcal{S}_{k})\leq\sqrt{\sum_{i=0}^{\delta}d^{2}(\mathcal{S}_{i,k})}. and HH is a linear map independent of kk, j=0δ[ran(D¯j)𝒮j,k]\bigcap_{j=0}^{\delta}\big{[}\mathrm{ran}(\bar{D}_{j})\oplus\mathcal{S}_{j,k}\big{]} is uniformly bounded. ∎
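Steps 2-3 of Lemma 3 can be checked on a small instance: with n=2n=2, δ=1\delta=1, and full-row-rank DTD^{\mathrm{T}}, the intersection of the affine subspaces ran(D¯j){sj,k}\mathrm{ran}(\bar{D}_{j})\oplus\{s_{j,k}\} collapses to the single point (DT)+s¯k(D^{\mathrm{T}})^{+}\bar{s}_{k}. The matrices and offsets below are a toy example of ours:

```python
import numpy as np

# Toy check of Lemma 3, Steps 2-3: when rank(D^T) = n, the intersection of
# ran(Dbar_j) + {s_j} is at most the single point (D^T)^+ s_bar,
# with s_bar = [D_0^T s_0; D_1^T s_1].
D0t = np.array([[1.0, 0.0]])   # ker(D0^T) = ran(Dbar_0), Dbar_0 = [0, 1]^T
D1t = np.array([[1.0, 1.0]])   # ker(D1^T) = ran(Dbar_1), Dbar_1 = [1, -1]^T
s0, s1 = np.array([2.0, 5.0]), np.array([3.0, 1.0])

Dt = np.vstack([D0t, D1t])                 # D^T, full rank n = 2
s_bar = np.concatenate([D0t @ s0, D1t @ s1])
x = np.linalg.pinv(Dt) @ s_bar             # unique candidate intersection point
# x satisfies D_j^T x = D_j^T s_j, i.e., it lies on both affine subspaces.
```

Here xx solves both linear equations simultaneously, which is exactly the "unique solution or no solution" dichotomy in (38).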

Now, we prove Theorem 1. With (5), the observation-information set in (12) can be rewritten as

𝒪k,i=Akiker(C)AkiC+({yi}𝐯i)r=ik1Ak1rB𝐰r.\mathcal{O}_{k,i}=A^{k-i}\ker(C)\oplus A^{k-i}C^{+}(\{y_{i}\}\oplus\llbracket-\mathbf{v}_{i}\rrbracket)\\ \oplus\sum_{r=i}^{k-1}A^{k-1-r}B\llbracket\mathbf{w}_{r}\rrbracket. (40)

Thus, with (40), j=ik+δj=i-k+\delta, and l=ri+1l=r-i+1, we can rewrite the OIT in Definition 6 as

i=kδk𝒪k,i=j=0δ[Aδjker(C)𝒮j,k],\bigcap_{i=k-\delta}^{k}\mathcal{O}_{k,i}=\bigcap_{j=0}^{\delta}\left[A^{\delta-j}\ker(C)\oplus\mathcal{S}_{j,k}\right], (41)

where

𝒮j,k=AδjC+({ykδ+j}𝐯kδ+j)l=1δjAδjlB𝐰l+kδ+j1.\mathcal{S}_{j,k}=A^{\delta-j}C^{+}(\{y_{k-\delta+j}\}\oplus\llbracket-\mathbf{v}_{k-\delta+j}\rrbracket)\\ \oplus\sum_{l=1}^{\delta-j}A^{\delta-j-l}B\llbracket\mathbf{w}_{l+k-\delta+j-1}\rrbracket. (42)

Let C¯(nm)×n\bar{C}\in\mathbb{R}^{(n-m)\times n} such that ker(C)=ran(C¯T)\ker(C)=\mathrm{ran}(\bar{C}^{\mathrm{T}}). By Aδjker(C)=Aδjran(C¯T)=ran(AδjC¯T)=:ran(D¯j)A^{\delta-j}\ker(C)=A^{\delta-j}\mathrm{ran}(\bar{C}^{\mathrm{T}})=\mathrm{ran}(A^{\delta-j}\bar{C}^{\mathrm{T}})=:\mathrm{ran}(\bar{D}_{j}), (41) can be rewritten as

i=kδk𝒪k,i=j=0δ[ran(D¯j)𝒮j,k].\bigcap_{i=k-\delta}^{k}\mathcal{O}_{k,i}=\bigcap_{j=0}^{\delta}\left[\mathrm{ran}(\bar{D}_{j})\oplus\mathcal{S}_{j,k}\right]. (43)

Since d(𝐰k)dwd(\llbracket\mathbf{w}_{k}\rrbracket)\leq d_{w} and d(𝐯k)dvd(\llbracket\mathbf{v}_{k}\rrbracket)\leq d_{v}, we know that {ykδ+j}𝐯kδ+j\{y_{k-\delta+j}\}\oplus\llbracket-\mathbf{v}_{k-\delta+j}\rrbracket and 𝐰l+kδ+j1\llbracket\mathbf{w}_{l+k-\delta+j-1}\rrbracket (l{1,,δj}l\in\{1,\ldots,\delta-j\}) are uniformly bounded w.r.t. k0k\in\mathbb{N}_{0}. Hence, j{0,,δ}\forall j\in\{0,\ldots,\delta\}, by Lemma 2, 𝒮j,k\mathcal{S}_{j,k} is uniformly bounded w.r.t. k0k\in\mathbb{N}_{0}. Now, the precondition of Lemma 3 is satisfied for (43). To guarantee the uniform boundedness of (43), we only need to prove that condition (34) with ker(DjT)=ran(D¯j)\ker(D_{j}^{\mathrm{T}})=\mathrm{ran}(\bar{D}_{j}) in Lemma 3 holds, where DjT:=CAjδD_{j}^{\mathrm{T}}:=CA^{j-\delta}. The proof has two steps.

Step 1: Prove ker(DjT)=ran(D¯j)\ker(D_{j}^{\mathrm{T}})=\mathrm{ran}(\bar{D}_{j}). To be more specific, for non-singular AA, we show that

ker(DjT)=ker(CAjδ)=ran(AδjC¯T)=ran(D¯j),\ker(D_{j}^{\mathrm{T}})=\ker(CA^{j-\delta})=\mathrm{ran}(A^{\delta-j}\bar{C}^{\mathrm{T}})=\mathrm{ran}(\bar{D}_{j}), (44)

as follows: Firstly, we have

DjTD¯j=CAjδAδjC¯T=CC¯T=0,D_{j}^{\mathrm{T}}\bar{D}_{j}=CA^{j-\delta}A^{\delta-j}\bar{C}^{\mathrm{T}}=C\bar{C}^{\mathrm{T}}=0, (45)

which implies ran(D¯j)ker(DjT)\mathrm{ran}(\bar{D}_{j})\subseteq\ker(D_{j}^{\mathrm{T}}). Secondly, we check the rank of D¯j\bar{D}_{j} as follows

rank(D¯j)=rank(AδjC¯T)=rank(C¯T)=nrank(C)=nrank(CAjδ)=nrank(DjT),\mathrm{rank}(\bar{D}_{j})=\mathrm{rank}(A^{\delta-j}\bar{C}^{\mathrm{T}})=\mathrm{rank}(\bar{C}^{\mathrm{T}})=n-\mathrm{rank}(C)\\ =n-\mathrm{rank}(CA^{j-\delta})=n-\mathrm{rank}(D_{j}^{\mathrm{T}}), (46)

which combined with (45) gives (44).

Step 2: Prove (34). Since DjT=CAjδD_{j}^{\mathrm{T}}=CA^{j-\delta}, we have DT=OδD^{\mathrm{T}}=O_{\delta}, and the left-hand side of (34) equals

rank(Oδ)=rank([CCAδ]Aδ)=(f)n,\mathrm{rank}\left(O_{\delta}\right)=\mathrm{rank}\left(\begin{bmatrix}C\\ \vdots\\ CA^{\delta}\end{bmatrix}A^{-\delta}\right)\stackrel{{\scriptstyle(f)}}{{=}}n, (47)

where (f)(f) is established by observable (A,C)(A,C) and δμ1\delta\geq\mu-1 in Theorem 1.

Therefore, by Lemma 3, i=kδk𝒪k,i\bigcap_{i=k-\delta}^{k}\mathcal{O}_{k,i} in (43) is uniformly bounded w.r.t. kδμ1k\geq\delta\geq\mu-1.

Next, we prove equation (16). With (39) and 𝒮k𝒮k\mathcal{S}_{k}^{\neq\emptyset}\subseteq\mathcal{S}_{k}, we can rewrite (41) as

i=kδk𝒪k,iOδ+j=0δCAjδ𝒮j,k,\bigcap_{i=k-\delta}^{k}\mathcal{O}_{k,i}\subseteq O_{\delta}^{+}\prod_{j=0}^{\delta}CA^{j-\delta}\mathcal{S}_{j,k}, (48)

where \prod denotes the Cartesian product of the sets CAjδ𝒮j,kCA^{j-\delta}\mathcal{S}_{j,k}, each of the following form

CC+({ykδ+j}𝐯kδ+j)l=1δjCAlB𝐰l+kδ+j1.CC^{+}(\{y_{k-\delta+j}\}\oplus\llbracket-\mathbf{v}_{k-\delta+j}\rrbracket)\oplus\sum_{l=1}^{\delta-j}CA^{-l}B\llbracket\mathbf{w}_{l+k-\delta+j-1}\rrbracket.

Thus, noticing that CC+=1\|CC^{+}\|=1, Oδ+=1/σmin(Oδ)\|O_{\delta}^{+}\|=1/\sigma_{\min}(O_{\delta}), and d({ykδ+j})=0d(\{y_{k-\delta+j}\})=0, we can get (16) from (48). \square

Appendix C Proof of Theorem 2

Firstly, we consider the stability for observable (A,C)(A,C), and a sufficient condition is provided in Proposition 4.

Proposition 4 (Stability of Observable Systems).

For observable (A,C)(A,C) and bounded 𝐱^0𝐱0\llbracket\hat{\mathbf{x}}_{0}\rrbracket\supseteq\llbracket\mathbf{x}_{0}\rrbracket, the classical SMFing framework in Algorithm 1 is stable and 𝐱^k|y0:k\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket is uniformly bounded w.r.t. k0k\in\mathbb{N}_{0}.

Proof.

We prove the well-posedness, the uniform boundedness of the estimate, and the boundedness of the estimation gap as follows.

Well-posedness: Since 𝐱^0𝐱0\llbracket\hat{\mathbf{x}}_{0}\rrbracket\supseteq\llbracket\mathbf{x}_{0}\rrbracket, with (4), we have

𝐱^0|y0=𝒳0(C,y0,𝐯0)𝐱^0𝒳0(C,y0,𝐯0)𝐱0=𝐱0|y0(g),\llbracket\hat{\mathbf{x}}_{0}|y_{0}\rrbracket=\mathcal{X}_{0}(C,y_{0},\llbracket\mathbf{v}_{0}\rrbracket)\bigcap\llbracket\hat{\mathbf{x}}_{0}\rrbracket\\ \supseteq\mathcal{X}_{0}(C,y_{0},\llbracket\mathbf{v}_{0}\rrbracket)\bigcap\llbracket\mathbf{x}_{0}\rrbracket=\llbracket\mathbf{x}_{0}|y_{0}\rrbracket\stackrel{{\scriptstyle(g)}}{{\neq}}\emptyset, (49)

i.e., 𝐱^0|y0\llbracket\hat{\mathbf{x}}_{0}|y_{0}\rrbracket\neq\emptyset, where (g)(g) follows from the fact that y0y_{0} is generated by (2) with at least one possible x0𝐱0x_{0}\in\llbracket\mathbf{x}_{0}\rrbracket. With (3), we have 𝐱^1|y0𝐱1|y0\llbracket\hat{\mathbf{x}}_{1}|y_{0}\rrbracket\supseteq\llbracket\mathbf{x}_{1}|y_{0}\rrbracket, as 𝐱^0|y0𝐱0|y0\llbracket\hat{\mathbf{x}}_{0}|y_{0}\rrbracket\supseteq\llbracket\mathbf{x}_{0}|y_{0}\rrbracket. Similarly to (49), by (4), 𝐱^1|y0:1𝐱1|y0:1\llbracket\hat{\mathbf{x}}_{1}|y_{0:1}\rrbracket\supseteq\llbracket\mathbf{x}_{1}|y_{0:1}\rrbracket\neq\emptyset holds. Proceeding forward, we get 𝐱^k|y0:k𝐱k|y0:k\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket\supseteq\llbracket\mathbf{x}_{k}|y_{0:k}\rrbracket\neq\emptyset for k0k\in\mathbb{N}_{0}.

Uniformly bounded estimate: Let 𝐱¯k=[(𝐱¯ka)T(𝐱¯kb)T]T=Q𝐱k\bar{\mathbf{x}}_{k}=[(\bar{\mathbf{x}}_{k}^{\mathrm{a}})^{\mathrm{T}}(\bar{\mathbf{x}}_{k}^{\mathrm{b}})^{\mathrm{T}}]^{\mathrm{T}}=Q\mathbf{x}_{k} with a non-singular Qn×nQ\in\mathbb{R}^{n\times n} such that

[𝐱¯k+1a𝐱¯k+1b]=[A¯a00J0][𝐱¯ka𝐱¯kb]+[B¯aB¯b]𝐰k,𝐲k=C¯a𝐱¯ka+C¯b𝐱¯kb+𝐯k,\begin{split}\begin{bmatrix}\bar{\mathbf{x}}_{k+1}^{\mathrm{a}}\\ \bar{\mathbf{x}}_{k+1}^{\mathrm{b}}\end{bmatrix}&=\begin{bmatrix}\bar{A}_{\mathrm{a}}&0\\ 0&J_{0}\end{bmatrix}\begin{bmatrix}\bar{\mathbf{x}}_{k}^{\mathrm{a}}\\ \bar{\mathbf{x}}_{k}^{\mathrm{b}}\end{bmatrix}+\begin{bmatrix}\bar{B}_{\mathrm{a}}\\ \bar{B}_{\mathrm{b}}\end{bmatrix}\mathbf{w}_{k},\\ \mathbf{y}_{k}&=\bar{C}_{\mathrm{a}}\bar{\mathbf{x}}_{k}^{\mathrm{a}}+\bar{C}_{\mathrm{b}}\bar{\mathbf{x}}_{k}^{\mathrm{b}}+\mathbf{v}_{k},\end{split} (50)

where A¯a\bar{A}_{\mathrm{a}} is non-singular and J0J_{0} is a block diagonal matrix corresponding to all Jordan blocks associated with the eigenvalue 0. Thus, we have

d(𝐱^k|y0:k)supx¯^k,x¯^k𝐱¯^k|y0:kQ1x¯^kx¯^k(h)supx¯^ka,x¯^ka𝐱¯^ka|y0:kQ1x¯^kax¯^ka2+d¯b2,\begin{split}d&(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket)\leq\sup_{\hat{\bar{x}}_{k},\hat{\bar{x}}^{\prime}_{k}\in\llbracket\hat{\bar{\mathbf{x}}}_{k}|y_{0:k}\rrbracket}\|Q^{-1}\|\|\hat{\bar{x}}_{k}-\hat{\bar{x}}^{\prime}_{k}\|\\ \stackrel{{\scriptstyle(h)}}{{\leq}}&\sup_{\hat{\bar{x}}_{k}^{\mathrm{a}},\hat{\bar{x}}_{k}^{\prime\mathrm{a}}\in\llbracket\hat{\bar{\mathbf{x}}}_{k}^{\mathrm{a}}|y_{0:k}\rrbracket}\|Q^{-1}\|\sqrt{\|\hat{\bar{x}}_{k}^{\mathrm{a}}-\hat{\bar{x}}_{k}^{\prime\mathrm{a}}\|^{2}+\bar{d}_{\mathrm{b}}^{2}},\end{split} (51)

where (h)(h) follows from the uniform boundedness252525Noticing that J0J_{0} is a nilpotent matrix, we can derive the uniform boundedness of 𝐱¯^kb𝐱¯^kb|y0:k\llbracket\hat{\bar{\mathbf{x}}}_{k}^{\mathrm{b}}\rrbracket\supseteq\llbracket\hat{\bar{\mathbf{x}}}_{k}^{\mathrm{b}}|y_{0:k}\rrbracket from 𝐱¯kb=J0k𝐱¯0b+i=0k1J0k1iB¯b𝐰i\bar{\mathbf{x}}_{k}^{\mathrm{b}}=J_{0}^{k}\bar{\mathbf{x}}_{0}^{\mathrm{b}}+\sum_{i=0}^{k-1}J_{0}^{k-1-i}\bar{B}_{\mathrm{b}}\mathbf{w}_{i} and the boundedness of 𝐱^0n\llbracket\hat{\mathbf{x}}_{0}\rrbracket\subset\mathbb{R}^{n}. of 𝐱¯^kb|y0:k\llbracket\hat{\bar{\mathbf{x}}}_{k}^{\mathrm{b}}|y_{0:k}\rrbracket w.r.t. k0k\in\mathbb{N}_{0}, i.e., there exists a d¯b\bar{d}_{\mathrm{b}} such that x¯^kbx¯^kbd¯b\|\hat{\bar{x}}_{k}^{\mathrm{b}}-\hat{\bar{x}}_{k}^{\prime\mathrm{b}}\|\leq\bar{d}_{\mathrm{b}} for all x¯^kb,x¯^kb𝐱¯^kb|y0:k\hat{\bar{x}}_{k}^{\mathrm{b}},\hat{\bar{x}}_{k}^{\prime\mathrm{b}}\in\llbracket\hat{\bar{\mathbf{x}}}_{k}^{\mathrm{b}}|y_{0:k}\rrbracket. Then, we only need to focus on the first subsystem of (50), i.e.,

𝐱¯k+1a=A¯a𝐱¯ka+B¯a𝐰k,𝐲k=C¯a𝐱¯ka+𝐯¯k,\begin{split}\bar{\mathbf{x}}_{k+1}^{\mathrm{a}}&=\bar{A}_{\mathrm{a}}\bar{\mathbf{x}}_{k}^{\mathrm{a}}+\bar{B}_{\mathrm{a}}\mathbf{w}_{k},\\ \mathbf{y}_{k}&=\bar{C}_{\mathrm{a}}\bar{\mathbf{x}}_{k}^{\mathrm{a}}+\bar{\mathbf{v}}_{k},\end{split} (52)

where 𝐯¯k=C¯b𝐱¯kb+𝐯k\bar{\mathbf{v}}_{k}=\bar{C}_{\mathrm{b}}\bar{\mathbf{x}}_{k}^{\mathrm{b}}+\mathbf{v}_{k} is an equivalent measurement noise [which is related to 𝐰0:k1\mathbf{w}_{0:k-1} due to (50)] and 𝐯¯k\llbracket\bar{\mathbf{v}}_{k}\rrbracket is uniformly bounded w.r.t. k0k\in\mathbb{N}_{0}. Define 𝒪¯k,ia:=A¯aki𝒳¯ia(C¯a,yi,𝐯¯i)r=ik1A¯ak1rB¯a𝐰r\overline{\mathcal{O}}_{k,i}^{\mathrm{a}}:=\bar{A}_{\mathrm{a}}^{k-i}\overline{\mathcal{X}}_{i}^{\mathrm{a}}(\bar{C}_{\mathrm{a}},y_{i},\llbracket\bar{\mathbf{v}}_{i}\rrbracket)\oplus\sum_{r=i}^{k-1}\bar{A}_{\mathrm{a}}^{k-1-r}\bar{B}_{\mathrm{a}}\llbracket\mathbf{w}_{r}\rrbracket with

𝒳¯ia(C¯a,yi,𝐯¯i):=ker(C¯a)C¯a+({yi}𝐯¯i).\overline{\mathcal{X}}_{i}^{\mathrm{a}}(\bar{C}_{\mathrm{a}},y_{i},\llbracket\bar{\mathbf{v}}_{i}\rrbracket):=\ker(\bar{C}_{\mathrm{a}})\oplus\bar{C}_{\mathrm{a}}^{+}(\left\{y_{i}\right\}\oplus\llbracket-\bar{\mathbf{v}}_{i}\rrbracket).

Note that \overline{\mathcal{O}}_{k,i}^{\mathrm{a}} is an observation-information set in terms of (52). Hence, we can derive \llbracket\hat{\bar{\mathbf{x}}}_{k}^{\mathrm{a}}|y_{0:k}\rrbracket\subseteq\bigcap_{i=k-\delta}^{k}\overline{\mathcal{O}}_{k,i}^{\mathrm{a}}.\footnote{Even though \mathbf{w}_{0:k-1} and \bar{\mathbf{v}}_{k} are related, we can assume that unrelatedness holds and employ Theorem 3 in [10] together with (14) and (15) to obtain this result.} Thus, for k\geq\delta\geq\mu_{\mathrm{a}}-1, where \mu_{\mathrm{a}} is the observability index w.r.t. (\bar{A}_{\mathrm{a}},\bar{C}_{\mathrm{a}}), we get
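As a sanity check on the structure of \overline{\mathcal{X}}_{i}^{\mathrm{a}} above, the decomposition \ker(C)\oplus C^{+}(\{y\}\oplus\llbracket-\mathbf{v}\rrbracket) can be tested in a minimal hypothetical case (a 2-state example with C = [1, 0], full row rank, and an interval noise set; none of these numbers come from the paper), where membership reduces to y - Cx \in \llbracket\mathbf{v}\rrbracket:

```python
# Minimal hypothetical sketch of the observation set
#   X_i = ker(C) ⊕ C⁺({y_i} ⊕ ⟦-v_i⟧)  with  C = [1, 0]:
# for full-row-rank C, x ∈ X_i iff the residual y_i - C x lies in ⟦v_i⟧.

def in_observation_set(x, y, v_bound):
    """Membership test: y - Cx must lie in the noise set [-v_bound, v_bound]."""
    residual = y - x[0]          # C x = x[0] since C = [1, 0]
    return abs(residual) <= v_bound

y, v_bound = 3.0, 0.5
# Build a member explicitly: ker(C) part (0, t) plus C⁺(y - v) = (y - v, 0).
v, t = 0.2, 7.0                  # any v in ⟦v_i⟧ and any t give a member
x = (y - v, t)
assert in_observation_set(x, y, v_bound)                  # member by construction
assert not in_observation_set((y - 1.0, t), y, v_bound)   # residual 1.0 > 0.5
```

The second component t is unconstrained, mirroring the \ker(C) term: measurements say nothing about directions in the kernel of C.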

d(𝐱^k|y0:k)(16)Q1d¯δ(A¯a,B¯a,C¯a)2+d¯b2,d(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket)\stackrel{{\scriptstyle\eqref{eqn:An Upper Bound on the Diameter of the OIT}}}{{\leq}}\|Q^{-1}\|\sqrt{\bar{d}_{\delta}(\bar{A}_{\mathrm{a}},\bar{B}_{\mathrm{a}},\bar{C}_{\mathrm{a}})^{2}+\bar{d}_{\mathrm{b}}^{2}}, (53)

where \bar{d}_{\delta}(\cdot,\cdot,\cdot) is defined in (16) with d_{v} for the uniformly bounded equivalent noise \bar{\mathbf{v}}_{k}. For k<\delta, a bounded \llbracket\hat{\mathbf{x}}_{0}\rrbracket leads to a bounded \llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket (determined by Algorithm 1). Thus, \llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket is uniformly bounded w.r.t. k\in\mathbb{N}_{0}.
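The flavor of the bound (53) can be illustrated with a scalar sketch (all numbers hypothetical): intersecting a window of observation-information intervals caps the estimate's diameter by that of the newest interval, independently of how large the initial uncertainty was.

```python
# Scalar, hypothetical analogue of the diameter bound (53):
# intersecting O_{k,i} over a window i = k-delta..k yields an estimate whose
# diameter is at most that of O_{k,k} (= 2*v_bar), regardless of the initial set.
import random

a, w_bar, v_bar, delta, K = 1.2, 0.1, 0.5, 3, 20   # unstable a on purpose
random.seed(0)
x, xs, ys = 5.0, [], []
for _ in range(K + 1):
    xs.append(x)
    ys.append(x + random.uniform(-v_bar, v_bar))   # y_k = x_k + v_k
    x = a * x + random.uniform(-w_bar, w_bar)      # x_{k+1} = a x_k + w_k

def O(k, i):
    """Interval form of O_{k,i} = a^{k-i}*X_i(y_i) ⊕ Σ_r a^{k-1-r}*[-w_bar, w_bar]."""
    c = a**(k - i) * ys[i]
    r = a**(k - i) * v_bar + sum(a**(k - 1 - s) * w_bar for s in range(i, k))
    return c - r, c + r

for k in range(delta, K + 1):
    lo = max(O(k, i)[0] for i in range(k - delta, k + 1))
    hi = min(O(k, i)[1] for i in range(k - delta, k + 1))
    assert lo <= xs[k] <= hi               # the true state is never excluded
    assert hi - lo <= 2 * v_bar + 1e-9     # diameter capped by that of O(k, k)
```

Note the cap 2*v_bar holds even though a > 1, since the newest interval O(k, k) carries no accumulated process noise; this is the set-intersection mechanism of the OIT at work, not the paper's full algorithm.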

Bounded estimation gap: Due to the page limit,\footnote{We recommend that readers follow the ideas in (58)-(68) to derive a tighter bound. However, the upper bound given in (54) is good enough to establish the boundedness of the estimation gap for observable systems.} we provide the proof based on

dkg(𝐱^k|y0:k)supx^k𝐱^k|y0:kxk𝐱k|y0:kx^kxk,(j)supx^k,x^k𝐱^k|y0:kx^kx^k=d(𝐱^k|y0:k),\begin{split}d_{k}^{\mathrm{g}}&(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket)\leq\sup_{\hat{x}_{k}\in\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket\atop x_{k}\in\llbracket\mathbf{x}_{k}|y_{0:k}\rrbracket}\|\hat{x}_{k}-x_{k}\|,\\ &\stackrel{{\scriptstyle(j)}}{{\leq}}\sup_{\hat{x}_{k},\hat{x}^{\prime}_{k}\in\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket}\|\hat{x}_{k}-\hat{x}^{\prime}_{k}\|=d(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket),\end{split} (54)

where (j) follows from \llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket\supseteq\llbracket\mathbf{x}_{k}|y_{0:k}\rrbracket (see the proof of well-posedness). Therefore, for observable (A,C), the boundedness of the estimation gap is established. ∎
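The step (j) and the bound (54) rest on an elementary fact: for nested sets \mathcal{T}\subseteq\mathcal{S}, the Hausdorff distance is at most the diameter of the larger set. A toy finite-set check (hypothetical 1-D points, not from the paper):

```python
# For T ⊆ S, the Hausdorff distance d_H(S, T) is bounded by the diameter of S —
# the elementary fact behind step (j) and (54), checked on finite 1-D point sets.
from itertools import product

S = {0.0, 1.0, 4.0, 9.0}     # plays the role of the estimate set
T = {1.0, 4.0}               # T ⊆ S, playing the role of ⟦x_k | y_0:k⟧
d_H = max(max(min(abs(s - t) for t in T) for s in S),
          max(min(abs(s - t) for s in S) for t in T))   # second term is 0 (T ⊆ S)
diam = max(abs(a - b) for a, b in product(S, S))
assert T <= S and d_H <= diam
```

Here d_H = 5.0 (driven by the point 9.0 of S) while diam = 9.0, so the gap-by-diameter bound is loose but valid, exactly as (54) is for the proof's purposes.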

Now, we prove the stability of Algorithm 1 w.r.t. the initial condition under conditions (i) and (ii) of Theorem 2.

Well-posedness: Define 𝒪~k,io:=A~oki𝒳~io(C~o,yi,𝐯i)r=ik1A~ok1rB~o𝐰r\widetilde{\mathcal{O}}_{k,i}^{o}:=\tilde{A}_{o}^{k-i}\widetilde{\mathcal{X}}_{i}^{o}(\tilde{C}_{o},y_{i},\llbracket\mathbf{v}_{i}\rrbracket)\oplus\sum_{r=i}^{k-1}\tilde{A}_{o}^{k-1-r}\tilde{B}_{o}\llbracket\mathbf{w}_{r}\rrbracket with

𝒳~io(C~o,yi,𝐯i):={x~io:yi=C~ox~io+vi,vi𝐯i}=ker(C~o)C~o+({yi}𝐯i).\begin{split}\widetilde{\mathcal{X}}_{i}^{o}(\tilde{C}_{o},y_{i},\llbracket\mathbf{v}_{i}\rrbracket)&:=\left\{\tilde{x}_{i}^{o}\colon y_{i}=\tilde{C}_{o}\tilde{x}_{i}^{o}+v_{i},~{}v_{i}\in\llbracket\mathbf{v}_{i}\rrbracket\right\}\\ &=\ker(\tilde{C}_{o})\oplus\tilde{C}_{o}^{+}(\left\{y_{i}\right\}\oplus\llbracket-\mathbf{v}_{i}\rrbracket).\end{split}

Note that 𝒪~k,io\widetilde{\mathcal{O}}_{k,i}^{o} is an observation-information set in terms of the observable subsystem of (17). Then, we have

𝐱~^0|y0=P𝐱^0|y0=P(𝒪0,0𝐱^0)=(k)P𝒪0,0P𝐱^0=(l)(𝒪~0,0o×no¯)𝐱~^0,\begin{split}\llbracket\hat{\tilde{\mathbf{x}}}_{0}|y_{0}\rrbracket&=\llbracket P\hat{\mathbf{x}}_{0}|y_{0}\rrbracket=P\big{(}\mathcal{O}_{0,0}\bigcap\llbracket\hat{\mathbf{x}}_{0}\rrbracket\big{)}\\ &\stackrel{{\scriptstyle(k)}}{{=}}P\mathcal{O}_{0,0}\bigcap\llbracket P\hat{\mathbf{x}}_{0}\rrbracket\\ &\stackrel{{\scriptstyle(l)}}{{=}}\big{(}\widetilde{\mathcal{O}}_{0,0}^{o}\times\mathbb{R}^{n_{\bar{o}}}\big{)}\bigcap\llbracket\hat{\tilde{\mathbf{x}}}_{0}\rrbracket,\\ \end{split} (55)

where (k)(k) follows from the non-singularity of PP; (l)(l) is established by P𝒳0(C,y0,𝐯0)={x~0:y0=CP1x~0+v0,v0𝐯0}={x~0:y0=[C~o0]x~0+v0,v0𝐯0}={x~0o:y0=C~ox~0o+v0,v0𝐯0}×{x~0o¯no¯}P\mathcal{X}_{0}(C,y_{0},\llbracket\mathbf{v}_{0}\rrbracket)=\{\tilde{x}_{0}\colon y_{0}=CP^{-1}\tilde{x}_{0}+v_{0},~{}v_{0}\in\llbracket\mathbf{v}_{0}\rrbracket\}=\{\tilde{x}_{0}\colon y_{0}=[\tilde{C}_{o}~{}0]\tilde{x}_{0}+v_{0},~{}v_{0}\in\llbracket\mathbf{v}_{0}\rrbracket\}=\{\tilde{x}_{0}^{o}\colon y_{0}=\tilde{C}_{o}\tilde{x}_{0}^{o}+v_{0},~{}v_{0}\in\llbracket\mathbf{v}_{0}\rrbracket\}\times\{\tilde{x}_{0}^{\bar{o}}\in\mathbb{R}^{n_{\bar{o}}}\}. Substituting 𝐱~^0=𝐱~0\llbracket\hat{\tilde{\mathbf{x}}}_{0}\rrbracket=\llbracket\tilde{\mathbf{x}}_{0}\rrbracket into (55) and noticing that 𝐱~0|y0\llbracket\tilde{\mathbf{x}}_{0}|y_{0}\rrbracket\neq\emptyset, we have

𝐱~0|y0=(𝒪~0,0o×no¯)𝐱~0.\llbracket\tilde{\mathbf{x}}_{0}|y_{0}\rrbracket=\big{(}\widetilde{\mathcal{O}}_{0,0}^{o}\times\mathbb{R}^{n_{\bar{o}}}\big{)}\bigcap\llbracket\tilde{\mathbf{x}}_{0}\rrbracket\neq\emptyset. (56)

This means {x~0}={x~0o}×{x~0o¯}\exists\{\tilde{x}_{0}\}=\{\tilde{x}_{0}^{o}\}\times\{\tilde{x}_{0}^{\bar{o}}\} such that

(𝒪~0,0o×no¯)({x~0o}×{x~0o¯})=(𝒪~0,0o{x~0o})×{x~0o¯},\emptyset\neq\big{(}\widetilde{\mathcal{O}}_{0,0}^{o}\times\mathbb{R}^{n_{\bar{o}}}\big{)}\bigcap\big{(}\{\tilde{x}_{0}^{o}\}\times\{\tilde{x}_{0}^{\bar{o}}\}\big{)}=\big{(}\widetilde{\mathcal{O}}_{0,0}^{o}\bigcap\{\tilde{x}_{0}^{o}\}\big{)}\times\{\tilde{x}_{0}^{\bar{o}}\},

where x~0o𝐱~0o\tilde{x}_{0}^{o}\in\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket. Since 𝐱~^0o𝐱~0o\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket in condition (i) holds, we get x~0o𝐱~^0o\tilde{x}_{0}^{o}\in\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket. Thus, x~0o¯\exists\tilde{x}_{0}^{\prime\bar{o}} such that 𝐱~^0{x~0o}×{x~0o¯}\llbracket\hat{\tilde{\mathbf{x}}}_{0}\rrbracket\supseteq\{\tilde{x}_{0}^{o}\}\times\{\tilde{x}_{0}^{\prime\bar{o}}\}, which together with (55) gives

𝐱~^0|y0=(𝒪~0,0o×no¯)𝐱~^0(𝒪~0,0o{x~0o})×{x~0o¯}.\begin{split}\llbracket\hat{\tilde{\mathbf{x}}}_{0}|y_{0}\rrbracket&=\big{(}\widetilde{\mathcal{O}}_{0,0}^{o}\times\mathbb{R}^{n_{\bar{o}}}\big{)}\bigcap\llbracket\hat{\tilde{\mathbf{x}}}_{0}\rrbracket\\ &\supseteq\big{(}\widetilde{\mathcal{O}}_{0,0}^{o}\bigcap\{\tilde{x}_{0}^{o}\}\big{)}\times\{\tilde{x}_{0}^{\prime\bar{o}}\}\neq\emptyset.\end{split} (57)

From the first equalities of (56) and (57), we can derive 𝐱~^0o|y0𝐱~0o|y0\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}|y_{0}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}|y_{0}\rrbracket, which combined with the observable subsystem in (17) yields 𝐱~^1o|y0𝐱~1o|y0\llbracket\hat{\tilde{\mathbf{x}}}_{1}^{o}|y_{0}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{1}^{o}|y_{0}\rrbracket. Proceeding forward, we can obtain that 𝐱~^k|y0:k\llbracket\hat{\tilde{\mathbf{x}}}_{k}|y_{0:k}\rrbracket\neq\emptyset and 𝐱~^ko|y0:k𝐱~ko|y0:k\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{o}|y_{0:k}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{k}^{o}|y_{0:k}\rrbracket for k0k\in\mathbb{N}_{0}.

Boundedness of estimation gap: From Definition 3, the estimation gap dkg(𝐱^k|y0:k)=dH(𝐱^k|y0:k,𝐱k|y0:k)d_{k}^{\mathrm{g}}(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket)=d_{\mathrm{H}}(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket,\llbracket\mathbf{x}_{k}|y_{0:k}\rrbracket) can be upper bounded by

dkg(𝐱^k|y0:k)P1dH(𝐱~^k|y0:k,𝐱~k|y0:k).d_{k}^{\mathrm{g}}(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket)\leq\|P^{-1}\|d_{\mathrm{H}}(\llbracket\hat{\tilde{\mathbf{x}}}_{k}|y_{0:k}\rrbracket,\llbracket\tilde{\mathbf{x}}_{k}|y_{0:k}\rrbracket). (58)

Noticing the structure of d_{\mathrm{H}}(\mathcal{S},\mathcal{T}) in (10), we first prove the boundedness of

supx~^k𝐱~^k|y0:kinfx~k𝐱~k|y0:kx~^kx~k.\adjustlimits{\sup}_{\hat{\tilde{x}}_{k}\in\llbracket\hat{\tilde{\mathbf{x}}}_{k}|y_{0:k}\rrbracket}{\inf}_{\tilde{x}_{k}\in\llbracket\tilde{\mathbf{x}}_{k}|y_{0:k}\rrbracket}\|\hat{\tilde{x}}_{k}-\tilde{x}_{k}\|. (59)

Applying Proposition 4 to the observable subsystem of (17), we get that x~^ko𝐱~^ko|y0:k\forall\hat{\tilde{x}}_{k}^{o}\in\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{o}|y_{0:k}\rrbracket and x~ko𝐱~ko|y0:k\forall\tilde{x}_{k}^{o}\in\llbracket\tilde{\mathbf{x}}_{k}^{o}|y_{0:k}\rrbracket, there exists a d~o\tilde{d}_{o} such that x~^kox~kod~o\|\hat{\tilde{x}}_{k}^{o}-\tilde{x}_{k}^{o}\|\leq\tilde{d}_{o} for k0k\in\mathbb{N}_{0}. Thus, (59) can be upper bounded by

supx~^ko¯𝐱~^ko¯|y0:kinfx~ko¯𝐱~ko¯|y0:kd~o2+x~^ko¯x~ko¯2.\adjustlimits{\sup}_{\hat{\tilde{x}}_{k}^{\bar{o}}\in\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{\bar{o}}|y_{0:k}\rrbracket}{\inf}_{\tilde{x}_{k}^{\bar{o}}\in\llbracket\tilde{\mathbf{x}}_{k}^{\bar{o}}|y_{0:k}\rrbracket}\sqrt{\tilde{d}_{o}^{2}+\|\hat{\tilde{x}}_{k}^{\bar{o}}-\tilde{x}_{k}^{\bar{o}}\|^{2}}. (60)

From the unobservable subsystem of (17), we have

𝐱~^ko¯|y0:k=A~o¯k𝐱~^0o¯+i=0k1A~o¯k1i(A~21𝐱~^io+B~o¯𝐰i)|y0:k(m)A~o¯k𝐱~^0o¯i=0k1A~o¯k1i(A~21𝐱~^io|y0:iB~o¯𝐰i),\begin{split}\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{\bar{o}}&|y_{0:k}\rrbracket=\Big{\llbracket}\tilde{A}_{\bar{o}}^{k}\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}+\sum_{i=0}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}(\tilde{A}_{21}\hat{\tilde{\mathbf{x}}}_{i}^{o}+\tilde{B}_{\bar{o}}\mathbf{w}_{i})\Big{|}y_{0:k}\Big{\rrbracket}\\ &\stackrel{{\scriptstyle(m)}}{{\subseteq}}\tilde{A}_{\bar{o}}^{k}\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket\oplus\sum_{i=0}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}(\tilde{A}_{21}\llbracket\hat{\tilde{\mathbf{x}}}_{i}^{o}|y_{0:i}\rrbracket\oplus\tilde{B}_{\bar{o}}\llbracket\mathbf{w}_{i}\rrbracket),\end{split} (61)

where (m)(m) follows from 𝐚+𝐛𝐚𝐛\llbracket\mathbf{a}+\mathbf{b}\rrbracket\subseteq\llbracket\mathbf{a}\rrbracket\oplus\llbracket\mathbf{b}\rrbracket and 𝐚|b𝐚\llbracket\mathbf{a}|b\rrbracket\subseteq\llbracket\mathbf{a}\rrbracket. Hence, x~^ko¯𝐱~^ko¯|y0:k\forall\hat{\tilde{x}}_{k}^{\bar{o}}\in\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{\bar{o}}|y_{0:k}\rrbracket, x~^0o𝐱~^0o,x~^io𝐱~^io|y0:i,wi𝐰i(i=0,,k1)\exists\hat{\tilde{x}}_{0}^{o}\in\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket,~{}\hat{\tilde{x}}_{i}^{o}\in\llbracket\hat{\tilde{\mathbf{x}}}_{i}^{o}|y_{0:i}\rrbracket,~{}w_{i}\in\llbracket\mathbf{w}_{i}\rrbracket~{}(i=0,\ldots,k-1) such that

x~^ko¯=A~o¯kx~^0o¯+i=0k1A~o¯k1i(A~21x~^io+B~o¯wi).\hat{\tilde{x}}_{k}^{\bar{o}}=\tilde{A}_{\bar{o}}^{k}\hat{\tilde{x}}_{0}^{\bar{o}}+\sum_{i=0}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}(\tilde{A}_{21}\hat{\tilde{x}}_{i}^{o}+\tilde{B}_{\bar{o}}w_{i}). (62)

Likewise, x~ko¯𝐱~ko¯|y0:k,w0:k𝐱~ko¯|y0:k\forall\tilde{x}_{k}^{\bar{o}}\in\llbracket\tilde{\mathbf{x}}_{k}^{\bar{o}}|y_{0:k},w_{0:k}\rrbracket\subseteq\llbracket\tilde{\mathbf{x}}_{k}^{\bar{o}}|y_{0:k}\rrbracket, x~0o𝐱~0o,x~io𝐱~io|y0:i(i=0,,k1)\exists\tilde{x}_{0}^{o}\in\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket,~{}\tilde{x}_{i}^{o}\in\llbracket\tilde{\mathbf{x}}_{i}^{o}|y_{0:i}\rrbracket~{}(i=0,\ldots,k-1) such that

x~ko¯=A~o¯kx~0o¯+i=0k1A~o¯k1i(A~21x~io+B~o¯wi).\tilde{x}_{k}^{\bar{o}}=\tilde{A}_{\bar{o}}^{k}\tilde{x}_{0}^{\bar{o}}+\sum_{i=0}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}(\tilde{A}_{21}\tilde{x}_{i}^{o}+\tilde{B}_{\bar{o}}w_{i}). (63)

Therefore, (60) is upper bounded by

supx~^ko¯𝐱~^ko¯|y0:kinfx~ko¯𝐱~ko¯|y0:k,w0:kd~o2+x~^ko¯x~ko¯2,\adjustlimits{\sup}_{\hat{\tilde{x}}_{k}^{\bar{o}}\in\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{\bar{o}}|y_{0:k}\rrbracket}{\inf}_{\tilde{x}_{k}^{\bar{o}}\in\llbracket\tilde{\mathbf{x}}_{k}^{\bar{o}}|y_{0:k},w_{0:k}\rrbracket}\sqrt{\tilde{d}_{o}^{2}+\|\hat{\tilde{x}}_{k}^{\bar{o}}-\tilde{x}_{k}^{\bar{o}}\|^{2}}, (64)

where [based on (62) and (63)]

x~^ko¯x~ko¯=A~o¯k(x~^0o¯x~0o¯)+i=0k1A~21A~o¯k1i(x~^iox~io).\hat{\tilde{x}}_{k}^{\bar{o}}-\tilde{x}_{k}^{\bar{o}}=\tilde{A}_{\bar{o}}^{k}(\hat{\tilde{x}}_{0}^{\bar{o}}-\tilde{x}_{0}^{\bar{o}})+\sum_{i=0}^{k-1}\tilde{A}_{21}\tilde{A}_{\bar{o}}^{k-1-i}(\hat{\tilde{x}}_{i}^{o}-\tilde{x}_{i}^{o}). (65)

By condition (ii), (65), and the uniform boundedness of 𝐱~^ko|y0:k\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{o}|y_{0:k}\rrbracket and 𝐱~ko|y0:k\llbracket\tilde{\mathbf{x}}_{k}^{o}|y_{0:k}\rrbracket, there exists a d¯o¯\bar{d}_{\bar{o}} such that

x~^ko¯x~ko¯d¯o¯,\|\hat{\tilde{x}}_{k}^{\bar{o}}-\tilde{x}_{k}^{\bar{o}}\|\leq\bar{d}_{\bar{o}}, (66)

which yields the boundedness of (59).
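The mechanism behind (65)-(66) is a geometric series: a stable \tilde{A}_{\bar{o}} forgets the initial offset while the bounded per-step increments sum to a finite limit. A one-dimensional sketch (all constants are hypothetical stand-ins, not quantities from the paper):

```python
# 1-D hypothetical sketch of the mechanism behind (65)-(66): with a stable
# unobservable block (|a_obar| < 1), the offset radius obeys r_{k+1} = |a|r_k + c
# and stays uniformly bounded, converging to c / (1 - |a_obar|).

a_obar = 0.5          # stands in for a stable A_obar (assumption)
r0 = 1.0              # initial offset radius (assumption)
c = 0.3               # per-step radius of A_21*(x^o-set) ⊕ B_obar*(w-set) (assumption)

r, radii = r0, []
for _ in range(200):
    r = abs(a_obar) * r + c      # Minkowski sum of centered intervals
    radii.append(r)

limit = c / (1 - abs(a_obar))    # geometric-series limit, here 0.6
assert all(rk <= max(r0, limit) + 1e-12 for rk in radii)
assert abs(radii[-1] - limit) < 1e-9
```

The uniform bound max(r0, limit) plays the role of \bar{d}_{\bar{o}} in (66): it depends on the initial offset and the per-step increments, but not on k.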

Secondly, we analyze the boundedness of

supx~k𝐱~k|y0:kinfx~^k𝐱~^k|y0:kx~^kx~k\adjustlimits{\sup}_{\tilde{x}_{k}\in\llbracket\tilde{\mathbf{x}}_{k}|y_{0:k}\rrbracket}{\inf}_{\hat{\tilde{x}}_{k}\in\llbracket\hat{\tilde{\mathbf{x}}}_{k}|y_{0:k}\rrbracket}\|\hat{\tilde{x}}_{k}-\tilde{x}_{k}\| (67)

in dH(𝐱~^k|y0:k,𝐱~k|y0:k)d_{\mathrm{H}}(\llbracket\hat{\tilde{\mathbf{x}}}_{k}|y_{0:k}\rrbracket,\llbracket\tilde{\mathbf{x}}_{k}|y_{0:k}\rrbracket). From 𝐱~^ko|y0:k𝐱~ko|y0:k\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{o}|y_{0:k}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{k}^{o}|y_{0:k}\rrbracket [guaranteed by condition (i)], (67) can be rewritten as

supx~ko¯𝐱~ko¯|y0:kinfx~^ko¯𝐱~^ko¯|y0:kx~^ko¯x~ko¯(66)d¯o¯.\adjustlimits{\sup}_{\tilde{x}_{k}^{\bar{o}}\in\llbracket\tilde{\mathbf{x}}_{k}^{\bar{o}}|y_{0:k}\rrbracket}{\inf}_{\hat{\tilde{x}}_{k}^{\bar{o}}\in\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{\bar{o}}|y_{0:k}\rrbracket}\|\hat{\tilde{x}}_{k}^{\bar{o}}-\tilde{x}_{k}^{\bar{o}}\|\stackrel{{\scriptstyle\eqref{eqninpf:thm:Stability Criterion - Boundedness of Estimation Gap - Bound d_obar}}}{{\leq}}\bar{d}_{\bar{o}}. (68)

Therefore, (67) is bounded for k0k\in\mathbb{N}_{0}. To sum up, the estimation gap dkg(𝐱^k|y0:k)d_{k}^{\mathrm{g}}(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket) is bounded w.r.t. k0k\in\mathbb{N}_{0}. \square

Appendix D Proof of Proposition 2

The estimation gap can be lower bounded by

dkg(𝐱^k|y0:k)P1dH(𝐱~^k|y0:k,𝐱~k|y0:k)(10)supx~^k𝐱~^k|y0:kinfx~k𝐱~k|y0:kP1x~^kx~ksupx~^k𝐱~^k|y0:kinfx~k𝐱~k|y0:kP1x~^ko¯x~ko¯.\begin{split}d_{k}^{\mathrm{g}}&(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket)\geq\|P\|^{-1}d_{\mathrm{H}}(\llbracket\hat{\tilde{\mathbf{x}}}_{k}|y_{0:k}\rrbracket,\llbracket\tilde{\mathbf{x}}_{k}|y_{0:k}\rrbracket)\\ &\stackrel{{\scriptstyle\eqref{eqn:Hausdorff Distance}}}{{\geq}}\adjustlimits{\sup}_{\hat{\tilde{x}}_{k}\in\llbracket\hat{\tilde{\mathbf{x}}}_{k}|y_{0:k}\rrbracket}{\inf}_{\tilde{x}_{k}\in\llbracket\tilde{\mathbf{x}}_{k}|y_{0:k}\rrbracket}\|P\|^{-1}\|\hat{\tilde{x}}_{k}-\tilde{x}_{k}\|\\ &\geq\adjustlimits{\sup}_{\hat{\tilde{x}}_{k}\in\llbracket\hat{\tilde{\mathbf{x}}}_{k}|y_{0:k}\rrbracket}{\inf}_{\tilde{x}_{k}\in\llbracket\tilde{\mathbf{x}}_{k}|y_{0:k}\rrbracket}\|P\|^{-1}\|\hat{\tilde{x}}_{k}^{\bar{o}}-\tilde{x}_{k}^{\bar{o}}\|.\end{split} (69)

From (69) and (65), we know that if \tilde{A}_{\bar{o}} is not marginally stable, then there exists a bounded \llbracket\hat{\mathbf{x}}_{0}\rrbracket such that \|\hat{\tilde{x}}_{k}^{\bar{o}}-\tilde{x}_{k}^{\bar{o}}\| in (69) increases unboundedly with k. Thus, the boundedness of the estimation gap cannot be guaranteed for all bounded \llbracket\hat{\mathbf{x}}_{0}\rrbracket\subset\mathbb{R}^{n} when \tilde{A}_{\bar{o}} is not marginally stable. Therefore, \tilde{A}_{\bar{o}} must be marginally stable to ensure a bounded d_{k}^{\mathrm{g}}(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket). \square

Appendix E Proof of

By Theorem 2, the classical SMFing framework with bounded 𝐱^0\llbracket\hat{\mathbf{x}}_{0}\rrbracket and 𝐱~^0o𝐱~0o\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket is stable w.r.t. the initial condition if (A,C)(A,C) is detectable. Thus, we only focus on the uniform boundedness of the estimate.

From Proposition 4, d(𝐱~^ko|y0:k)d(\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{o}|y_{0:k}\rrbracket) is bounded. Thus, with

d(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket)\leq\|P^{-1}\|\sqrt{d(\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{o}|y_{0:k}\rrbracket)^{2}+d(\llbracket\hat{\tilde{\mathbf{x}}}_{k}^{\bar{o}}|y_{0:k}\rrbracket)^{2}},

and the non-singularity of P (which yields the reverse bound with \|P\|), we know that \llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket is uniformly bounded w.r.t. k\in\mathbb{N}_{0} if and only if \llbracket\hat{\tilde{\mathbf{x}}}_{k}^{\bar{o}}|y_{0:k}\rrbracket is uniformly bounded w.r.t. k\in\mathbb{N}_{0}. From (61) with \rho(\tilde{A}_{\bar{o}})<1 [as (A,C) is detectable], bounded \llbracket\hat{\mathbf{x}}_{0}\rrbracket, and uniformly bounded \llbracket\hat{\tilde{\mathbf{x}}}_{k}^{o}|y_{0:k}\rrbracket and \llbracket\mathbf{w}_{k}\rrbracket, we can derive the uniform boundedness of \llbracket\hat{\tilde{\mathbf{x}}}_{k}^{\bar{o}}|y_{0:k}\rrbracket. Therefore, \llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket is uniformly bounded w.r.t. k\in\mathbb{N}_{0} for detectable (A,C). \square

Appendix F Proof of Lemma 1

Since for any \mathcal{S}\subseteq\mathbb{R}^{n_{o}}, P_{o}F_{k,0}(P^{-1}(\mathcal{S}\times\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket)) in (18) corresponds to the observable subsystem and the resulting set is independent of \llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket, we can assume without loss of generality that the pair (A,C) is observable, i.e., A=\tilde{A}_{o}. In that case, (18) becomes

Fk,0(θk[c^0o])Fk,0(θ¯k[c^0o])=Fk,0(θk′′[c^0o]).F_{k,0}(\mathcal{B}_{\theta^{\prime}_{k}}^{\infty}[\hat{c}_{0}^{o}])\subseteq F_{k,0}(\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}])=F_{k,0}(\mathcal{B}_{\theta^{\prime\prime}_{k}}^{\infty}[\hat{c}_{0}^{o}]). (70)

Firstly, we prove (70) for non-singular AA. Because θk[c^0o]θ¯k[c^0o]θk′′[c^0o]\mathcal{B}_{\theta^{\prime}_{k}}^{\infty}[\hat{c}_{0}^{o}]\subseteq\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}]\subseteq\mathcal{B}_{\theta^{\prime\prime}_{k}}^{\infty}[\hat{c}_{0}^{o}] for all θkθ¯kθk′′\theta^{\prime}_{k}\leq\bar{\theta}_{k}\leq\theta^{\prime\prime}_{k}, from Definition 1 we have

Fk,0(θk[c^0o])Fk,0(θ¯k[c^0o])Fk,0(θk′′[c^0o]).F_{k,0}(\mathcal{B}_{\theta^{\prime}_{k}}^{\infty}[\hat{c}_{0}^{o}])\subseteq F_{k,0}(\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}])\subseteq F_{k,0}(\mathcal{B}_{\theta^{\prime\prime}_{k}}^{\infty}[\hat{c}_{0}^{o}]). (71)

By Theorem 1, kmax{μo1+nλ0o,1}\forall k\geq\max\{\mu_{o}-1+n_{\lambda_{0}^{o}},1\}, i=knλ0ok𝒪k,i\bigcap_{i=k-n_{\lambda_{0}^{o}}}^{k}\mathcal{O}_{k,i} is bounded, and θ¯k0\exists\bar{\theta}_{k}\geq 0 s.t. x0nθ¯k[c^0o]\forall x_{0}\in\mathbb{R}^{n}\setminus\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}]:

Akx0+i=0k1Ak1iBwi=xki=knλ0ok𝒪k,i,A^{k}x_{0}+\sum_{i=0}^{k-1}A^{k-1-i}Bw_{i}=x_{k}\notin\bigcap_{i=k-n_{\lambda_{0}^{o}}}^{k}\mathcal{O}_{k,i}, (72)

which together with (14) and (15) gives Fk,0(θk′′[c^0o]θ¯k[c^0o])=F_{k,0}(\mathcal{B}_{\theta^{\prime\prime}_{k}}^{\infty}[\hat{c}_{0}^{o}]\setminus\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}])=\emptyset for θk′′θ¯k\theta^{\prime\prime}_{k}\geq\bar{\theta}_{k}. Thus, Fk,0(θ¯k[c^0o])=Fk,0(θk′′[c^0o])F_{k,0}(\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}])=F_{k,0}(\mathcal{B}_{\theta^{\prime\prime}_{k}}^{\infty}[\hat{c}_{0}^{o}]), and with (71) we get (70).

When AA is singular, from (50) we have 𝐱¯kb=i=0k1J0k1iB¯b𝐰i\bar{\mathbf{x}}_{k}^{\mathrm{b}}=\sum_{i=0}^{k-1}J_{0}^{k-1-i}\bar{B}_{\mathrm{b}}\mathbf{w}_{i} for knλ0ok\geq n_{\lambda_{0}^{o}}, which is not affected by the choice of θk[c^0o]\mathcal{B}_{\theta_{k}}^{\infty}[\hat{c}_{0}^{o}]; for the subsystem w.r.t. 𝐱¯ka\bar{\mathbf{x}}_{k}^{\mathrm{a}}, the proof is the same as that of (70), where the measurement equation is in (52) with the equivalent noise 𝐯¯k=C¯bi=0k1J0k1iB¯b𝐰i+𝐯k\bar{\mathbf{v}}_{k}=\bar{C}_{\mathrm{b}}\sum_{i=0}^{k-1}J_{0}^{k-1-i}\bar{B}_{\mathrm{b}}\mathbf{w}_{i}+\mathbf{v}_{k} for knλ0ok\geq n_{\lambda_{0}^{o}}. \square

Appendix G Proof of Theorem 3

Well-posedness: For k<kk<k_{*}, Line 5 resets every empty 𝐱^k|y0:k\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket to a non-empty set. For kkk\geq k_{*}, the estimate 𝐱^k|y0:k\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket in Line 7 is non-empty, which is guaranteed by 𝐱~^0o=θ¯k[c^0o]𝐱~0o\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket=\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}]\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket (see Remark 7) and the well-posedness part [see (55)-(57)] in the proof of Theorem 2. Thus, 𝐱^k|y0:k\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket\neq\emptyset holds for k0k\in\mathbb{N}_{0}.

Boundedness of estimation gap: For k<k_{*}, the bounded \llbracket\mathbf{x}_{0}\rrbracket with Lines 3 and 5 implies d_{k}^{\mathrm{g}}(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket)<\infty. For k\geq k_{*}, using the same techniques as in (58)-(68) [note that \llbracket\hat{\tilde{\mathbf{x}}}_{0}^{o}\rrbracket=\mathcal{B}_{\bar{\theta}_{k}}^{\infty}[\hat{c}_{0}^{o}]\supseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}\rrbracket ensures the precondition \llbracket\hat{\mathbf{x}}_{0}\rrbracket\supseteq\llbracket\mathbf{x}_{0}\rrbracket in Proposition 4 when it is applied to the observable subsystem to derive (60)], we can obtain the boundedness of d_{k}^{\mathrm{g}}(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket). Therefore, the estimation gap d_{k}^{\mathrm{g}}(\llbracket\hat{\mathbf{x}}_{k}|y_{0:k}\rrbracket) is bounded for k\in\mathbb{N}_{0}. \square

Appendix H Proof of Theorem 4

Firstly, we provide Lemma 4, a variant of Lemma 3 in [9], for bounding the norm of a matrix power.

Lemma 4 (Bound on Norm of Matrix Power).

Given Fn×nF\in\mathbb{R}^{n\times n} with ρ(F)<1\rho(F)<1, for γ(ρ(F),1)\gamma\in(\rho(F),1), Fkβγ(F)γk\|F^{k}\|_{\infty}\leq\beta_{\gamma}(F)\gamma^{k} holds for all k0k\in\mathbb{N}_{0}, where βγ(F)=max{γkFk:k0}\beta_{\gamma}(F)=\max\{\gamma^{-k}\|F^{k}\|_{\infty}\colon k\in\mathbb{N}_{0}\}.

Proof.

γ(ρ(F),1)\forall\gamma\in(\rho(F),1), let F~=γ1F\tilde{F}=\gamma^{-1}F, and we have ρ(F~)=ρ(F)/γ<1\rho(\tilde{F})=\rho(F)/\gamma<1. Since limkF~k=0\lim_{k\to\infty}\|\tilde{F}^{k}\|_{\infty}=0, we can find a k¯0\underline{k}\in\mathbb{N}_{0} such that for all kk¯k\geq\underline{k}, the following holds

γkFk=γkFk=F~k<1,\displaystyle\gamma^{-k}\|F^{k}\|_{\infty}=\|\gamma^{-k}F^{k}\|_{\infty}=\|\tilde{F}^{k}\|_{\infty}<1, (73)

which implies \|F^{k}\|_{\infty}<\gamma^{k}. This, together with \beta_{\gamma}(F)=\max\{1,\gamma^{-k}\|F^{k}\|_{\infty}~(1\leq k\leq\underline{k})\}=\max\{\gamma^{-k}\|F^{k}\|_{\infty}\colon k\in\mathbb{N}_{0}\},\footnote{Note that \gamma^{-0}\|F^{0}\|_{\infty}=1 and \gamma^{-k}\|F^{k}\|_{\infty}<1 for k>\underline{k}.} gives \|F^{k}\|_{\infty}\leq\beta_{\gamma}(F)\gamma^{k} for all k\in\mathbb{N}_{0}. ∎
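Lemma 4 is easy to check numerically. The sketch below uses a hypothetical 2x2 upper-triangular F (so its spectral radius can be read off the diagonal), computes \beta_{\gamma}(F) over an initial window where the maximum of \gamma^{-k}\|F^{k}\|_{\infty} is attained, and verifies the bound over a longer horizon:

```python
# Numeric check of Lemma 4 on a hypothetical 2x2 matrix with rho(F) < 1:
# for gamma in (rho(F), 1), gamma^{-k}||F^k||_inf -> 0, its maximum
# beta_gamma(F) is attained at a finite k, and ||F^k||_inf <= beta*gamma^k.

def matmul(A, B):
    return [[sum(A[i][t] * B[t][j] for t in range(2)) for j in range(2)]
            for i in range(2)]

def norm_inf(A):
    # induced infinity norm = maximum absolute row sum
    return max(sum(abs(a) for a in row) for row in A)

F = [[0.5, 0.4],
     [0.0, 0.6]]          # upper triangular, so rho(F) = 0.6 (assumed example)
gamma = 0.8               # any gamma in (rho(F), 1)

powers, P = [], [[1.0, 0.0], [0.0, 1.0]]
for _ in range(60):
    powers.append(P)      # powers[k] = F^k
    P = matmul(P, F)

# The ratio gamma^{-k}||F^k|| decays, so its maximum sits in an early window.
beta = max(norm_inf(powers[k]) / gamma**k for k in range(10))
assert beta >= 1.0        # since gamma^0 ||F^0|| = 1
assert all(norm_inf(powers[k]) <= beta * gamma**k + 1e-12 for k in range(60))
```

For this F the maximum ratio occurs at k = 1, matching the lemma's point that \beta_{\gamma}(F) is determined by finitely many powers.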

When (A,C)(A,C) is detectable, the uniform boundedness of the estimate and the boundedness of the estimation gap can be readily derived based on the results in Section 5, when we replace Fk,0F_{k,0} and 𝐱~^0o¯\llbracket\hat{\tilde{\mathbf{x}}}_{0}^{\bar{o}}\rrbracket with Fk,kδ¯F_{k,k-\bar{\delta}} and 𝒯^kδ¯o¯\hat{\mathcal{T}}_{k-\bar{\delta}}^{\bar{o}}, respectively. Thus, we only focus on the well-posedness and finite-time inclusion property of Algorithm 3.

Well-posedness: Similarly to the well-posedness part in Appendix G (Lines 5 and 8 in Algorithm 3 play the same role as Lines 5 and 7 in Algorithm 2), we have 𝒵k\mathcal{Z}_{k}\neq\emptyset for k0k\in\mathbb{N}_{0} and

𝒵~ko𝐱~ko|y0:k,kδ¯.\widetilde{\mathcal{Z}}_{k}^{o}\supseteq\llbracket\tilde{\mathbf{x}}_{k}^{o}|y_{0:k}\rrbracket,\quad k\geq\bar{\delta}. (74)

The only part that needs highlighting is the following: for Line 8, when \bar{\delta}\leq k<2\bar{\delta}, Lemma 1 guarantees \hat{\mathcal{T}}_{k-\bar{\delta}}^{o}=\mathcal{B}_{\bar{\theta}_{k}}^{\infty}\big[\mathrm{center}\big(P_{o}\overline{\mathrm{IH}}(\mathcal{Z}_{k-\bar{\delta}}^{-})\big)\big]\supseteq\llbracket\tilde{\mathbf{x}}_{k-\bar{\delta}}^{o}|y_{0:k-\bar{\delta}-1}\rrbracket; when k\geq 2\bar{\delta}, \hat{\mathcal{T}}_{k-\bar{\delta}}^{o}=P_{o}\overline{\mathrm{IH}}(\mathcal{Z}_{k-\bar{\delta}}^{-})\supseteq\llbracket\tilde{\mathbf{x}}_{k-\bar{\delta}}^{o}|y_{0:k-\bar{\delta}-1}\rrbracket; thus,

𝒯^ko𝐱~ko|y0:k1𝐱~ko|y0:k,k0.\hat{\mathcal{T}}_{k}^{o}\supseteq\llbracket\tilde{\mathbf{x}}_{k}^{o}|y_{0:k-1}\rrbracket\supseteq\llbracket\tilde{\mathbf{x}}_{k}^{o}|y_{0:k}\rrbracket,\quad k\in\mathbb{N}_{0}. (75)

Boundedness of estimation gap: When (A,C)(A,C) is detectable, condition (ii) in Theorem 2 holds. Then, the proof is similar to that of Theorem 3.

Finite-time inclusion: The proof includes two steps. In the first step, we show that

𝐱~k|y0:k=P𝐱k|y0:k𝐱~ko|y0:k×𝒯ko¯\llbracket\tilde{\mathbf{x}}_{k}|y_{0:k}\rrbracket=\llbracket P\mathbf{x}_{k}|y_{0:k}\rrbracket\subseteq\llbracket\tilde{\mathbf{x}}_{k}^{o}|y_{0:k}\rrbracket\times\mathcal{T}_{k}^{\bar{o}} (76)

holds for k0k\in\mathbb{N}_{0}, where

𝒯ko¯:=A~o¯k𝐱~0o¯i=0k1A~o¯k1i(A~21𝐱~io|y0:iB~o¯𝐰i).\!\!\!\!\!\mathcal{T}_{k}^{\bar{o}}\!:=\!\tilde{A}_{\bar{o}}^{k}\llbracket\tilde{\mathbf{x}}_{0}^{\bar{o}}\rrbracket\oplus\sum_{i=0}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}(\tilde{A}_{21}\llbracket\tilde{\mathbf{x}}_{i}^{o}|y_{0:i}\rrbracket\oplus\tilde{B}_{\bar{o}}\llbracket\mathbf{w}_{i}\rrbracket). (77)

In the second step, the finite-time inclusion property (24) is proven based on (76).

Step 1: We prove (76) by mathematical induction.

Base case: For k=0k=0, we have 𝐱~0|y0𝐱~0o|y0×𝐱~0o¯|y0=𝐱~0o|y0×𝒯0o¯\llbracket\tilde{\mathbf{x}}_{0}|y_{0}\rrbracket\subseteq\llbracket\tilde{\mathbf{x}}_{0}^{o}|y_{0}\rrbracket\times\llbracket\tilde{\mathbf{x}}_{0}^{\bar{o}}|y_{0}\rrbracket=\llbracket\tilde{\mathbf{x}}_{0}^{o}|y_{0}\rrbracket\times\mathcal{T}_{0}^{\bar{o}}. Thus, (76) holds for k=0k=0.

Inductive step: Assume (76) holds for any k=l0k=l\in\mathbb{N}_{0}. For k=l+1k=l+1, with (17) we have

𝐱~l+1|y0:l+1(A~o𝐱~lo|y0:lB~o𝐰l)×(A~o¯𝒯lo¯A~21𝐱~lo|y0:lB~o¯𝐰l).\llbracket\tilde{\mathbf{x}}_{l+1}|y_{0:l+1}\rrbracket\subseteq(\tilde{A}_{o}\llbracket\tilde{\mathbf{x}}_{l}^{o}|y_{0:l}\rrbracket\oplus\tilde{B}_{o}\llbracket\mathbf{w}_{l}\rrbracket)\\ \times(\tilde{A}_{\bar{o}}\mathcal{T}_{l}^{\bar{o}}\oplus\tilde{A}_{21}\llbracket\tilde{\mathbf{x}}_{l}^{o}|y_{0:l}\rrbracket\oplus\tilde{B}_{\bar{o}}\llbracket\mathbf{w}_{l}\rrbracket). (78)

Noticing that 𝒯l+1o¯=A~o¯𝒯lo¯A~21𝐱~lo|y0:lB~o¯𝐰l\mathcal{T}_{l+1}^{\bar{o}}=\tilde{A}_{\bar{o}}\mathcal{T}_{l}^{\bar{o}}\oplus\tilde{A}_{21}\llbracket\tilde{\mathbf{x}}_{l}^{o}|y_{0:l}\rrbracket\oplus\tilde{B}_{\bar{o}}\llbracket\mathbf{w}_{l}\rrbracket and

𝐱~l+1|y0:l+1𝐱~l+1o|y0:l+1×𝐱~l+1o¯|y0:l+1,\llbracket\tilde{\mathbf{x}}_{l+1}|y_{0:l+1}\rrbracket\subseteq\llbracket\tilde{\mathbf{x}}_{l+1}^{o}|y_{0:l+1}\rrbracket\times\llbracket\tilde{\mathbf{x}}_{l+1}^{\bar{o}}|y_{0:l+1}\rrbracket, (79)

we have [as (𝒯1×𝒯2)(𝒯3×𝒯4)=(𝒯1𝒯3)×(𝒯2𝒯4)(\mathcal{T}_{1}\times\mathcal{T}_{2})\bigcap(\mathcal{T}_{3}\times\mathcal{T}_{4})=(\mathcal{T}_{1}\bigcap\mathcal{T}_{3})\times(\mathcal{T}_{2}\bigcap\mathcal{T}_{4})]

𝐱~l+1|y0:l+1[(A~o𝐱~lo|y0:lB~o𝐰l)𝐱~l+1o|y0:l+1]×(𝒯l+1o¯𝐱~l+1o¯|y0:l+1).\llbracket\tilde{\mathbf{x}}_{l+1}|y_{0:l+1}\rrbracket\subseteq\Big{[}(\tilde{A}_{o}\llbracket\tilde{\mathbf{x}}_{l}^{o}|y_{0:l}\rrbracket\oplus\tilde{B}_{o}\llbracket\mathbf{w}_{l}\rrbracket)\\ \bigcap\llbracket\tilde{\mathbf{x}}_{l+1}^{o}|y_{0:l+1}\rrbracket\Big{]}\times\big{(}\mathcal{T}_{l+1}^{\bar{o}}\bigcap\llbracket\tilde{\mathbf{x}}_{l+1}^{\bar{o}}|y_{0:l+1}\rrbracket\big{)}. (80)

It implies 𝐱~l+1|y0:l+1𝐱~l+1o|y0:l+1×𝒯l+1o¯\llbracket\tilde{\mathbf{x}}_{l+1}|y_{0:l+1}\rrbracket\subseteq\llbracket\tilde{\mathbf{x}}_{l+1}^{o}|y_{0:l+1}\rrbracket\times\mathcal{T}_{l+1}^{\bar{o}}, i.e., (76) holds for k=l+1k=l+1. Thus, (76) is proven by induction.
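The Cartesian-product identity invoked in (80), (\mathcal{T}_{1}\times\mathcal{T}_{2})\bigcap(\mathcal{T}_{3}\times\mathcal{T}_{4})=(\mathcal{T}_{1}\bigcap\mathcal{T}_{3})\times(\mathcal{T}_{2}\bigcap\mathcal{T}_{4}), can be sanity-checked on small finite sets (the sets below are arbitrary illustrations):

```python
# Finite-set check of the identity used in (80):
# (T1 x T2) ∩ (T3 x T4) = (T1 ∩ T3) x (T2 ∩ T4).
from itertools import product

T1, T2, T3, T4 = {1, 2}, {5, 6}, {2, 3}, {6, 7}
lhs = set(product(T1, T2)) & set(product(T3, T4))
rhs = set(product(T1 & T3, T2 & T4))
assert lhs == rhs == {(2, 6)}
```

Intersecting componentwise is exactly what lets the proof peel off the unobservable block \mathcal{T}_{l+1}^{\bar{o}} from the rest of the product.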

Step 2: We split the RHS of (77) as 𝒯ko¯=𝒯k,1o¯+𝒯k,2o¯\mathcal{T}_{k}^{\bar{o}}=\mathcal{T}_{k,1}^{\bar{o}}+\mathcal{T}_{k,2}^{\bar{o}}:

𝒯k,1o¯\displaystyle\!\!\!\!\!\mathcal{T}_{k,1}^{\bar{o}}\! =A~o¯k𝐱~0o¯i=0δ¯1A~o¯k1i(A~21𝐱~io|y0:iB~o¯𝐰i),\displaystyle=\!\tilde{A}_{\bar{o}}^{k}\llbracket\tilde{\mathbf{x}}_{0}^{\bar{o}}\rrbracket\!\oplus\!\!\sum_{i=0}^{\bar{\delta}-1}\!\tilde{A}_{\bar{o}}^{k-1-i}(\tilde{A}_{21}\llbracket\tilde{\mathbf{x}}_{i}^{o}|y_{0:i}\rrbracket\!\oplus\!\tilde{B}_{\bar{o}}\llbracket\mathbf{w}_{i}\rrbracket), (81)
𝒯k,2o¯\displaystyle\!\!\!\!\!\mathcal{T}_{k,2}^{\bar{o}}\! =i=δ¯k1A~o¯k1i(A~21𝐱~io|y0:iB~o¯𝐰i).\displaystyle=\!\sum_{i=\bar{\delta}}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}(\tilde{A}_{21}\llbracket\tilde{\mathbf{x}}_{i}^{o}|y_{0:i}\rrbracket\oplus\tilde{B}_{\bar{o}}\llbracket\mathbf{w}_{i}\rrbracket). (82)

Then, we also split \mathcal{B}_{\alpha_{k}}^{\infty}[\hat{c}_{k}^{\bar{o}}] in Line 12 into two parts:

𝒯^ko¯=αk[c^ko¯]=αk,1[c^k,1o¯]αk,2[c^k,2o¯],\hat{\mathcal{T}}_{k}^{\bar{o}}=\mathcal{B}_{\alpha_{k}}^{\infty}[\hat{c}_{k}^{\bar{o}}]=\mathcal{B}_{\alpha_{k,1}}^{\infty}[\hat{c}_{k,1}^{\bar{o}}]\oplus\mathcal{B}_{\alpha_{k,2}}^{\infty}[\hat{c}_{k,2}^{\bar{o}}], (83)

where αk,1=12Ao¯kδ¯d(𝐱~^δ¯o¯|y0:δ¯)+ε\alpha_{k,1}=\frac{1}{2}\|A_{\bar{o}}^{k-\bar{\delta}}\|_{\infty}d_{\infty}(\llbracket\hat{\tilde{\mathbf{x}}}_{\bar{\delta}}^{\bar{o}}|y_{0:\bar{\delta}}\rrbracket)+\varepsilon, αk,2=Υk1\alpha_{k,2}=\Upsilon_{\infty}\ell_{k-1}, c^k,1o¯=A~o¯kδ¯c^δ¯o¯\hat{c}_{k,1}^{\bar{o}}=\tilde{A}_{\bar{o}}^{k-\bar{\delta}}\hat{c}_{\bar{\delta}}^{\bar{o}}, and c^k,2o¯=i=δ¯k1A~o¯k1ic^iin\hat{c}_{k,2}^{\bar{o}}=\sum_{i=\bar{\delta}}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}\hat{c}_{i}^{\mathrm{in}} such that αk=αk,1+αk,2\alpha_{k}=\alpha_{k,1}+\alpha_{k,2} and c^ko¯=c^k,1o¯+c^k,2o¯\hat{c}_{k}^{\bar{o}}=\hat{c}_{k,1}^{\bar{o}}+\hat{c}_{k,2}^{\bar{o}}. Next, we prove

𝒯k,1o¯αk,1[c^k,1o¯],kk¯,\mathcal{T}_{k,1}^{\bar{o}}\subseteq\mathcal{B}_{\alpha_{k,1}}^{\infty}[\hat{c}_{k,1}^{\bar{o}}],\quad k\geq\underline{k}^{\prime}, (84)

for sufficiently large k¯δ¯\underline{k}^{\prime}\geq\bar{\delta}, and

𝒯k,2o¯αk,2[c^k,2o¯],kδ¯,\mathcal{T}_{k,2}^{\bar{o}}\subseteq\mathcal{B}_{\alpha_{k,2}}^{\infty}[\hat{c}_{k,2}^{\bar{o}}],\quad k\geq\bar{\delta}, (85)

such that

𝒯ko¯=𝒯k,1o¯𝒯k,2o¯αk[c^ko¯]=𝒯^ko¯,kk¯.\mathcal{T}_{k}^{\bar{o}}=\mathcal{T}_{k,1}^{\bar{o}}\oplus\mathcal{T}_{k,2}^{\bar{o}}\subseteq\mathcal{B}_{\alpha_{k}}^{\infty}[\hat{c}_{k}^{\bar{o}}]=\hat{\mathcal{T}}_{k}^{\bar{o}},\quad k\geq\underline{k}^{\prime}. (86)

For (84), tk𝒯k,1o¯\forall t_{k}\in\mathcal{T}_{k,1}^{\bar{o}}, its distance from the center of αk,1[c^k,1o¯]\mathcal{B}_{\alpha_{k,1}}^{\infty}[\hat{c}_{k,1}^{\bar{o}}] is upper bounded by

tkc^k,1o¯tk+A~o¯kδ¯c^δ¯o¯,kδ¯.\|t_{k}-\hat{c}_{k,1}^{\bar{o}}\|_{\infty}\leq\|t_{k}\|_{\infty}+\|\tilde{A}_{\bar{o}}^{k-\bar{\delta}}\hat{c}_{\bar{\delta}}^{\bar{o}}\|_{\infty},\quad k\geq\bar{\delta}. (87)

Since (81) implies limktk=0\lim_{k\to\infty}\|t_{k}\|_{\infty}=0, we can further derive that, for the ε>0\varepsilon>0 given in Line 1, k¯δ¯\exists\underline{k}^{\prime}\geq\bar{\delta} such that

tkc^k,1o¯εαk,1,kk¯,\|t_{k}-\hat{c}_{k,1}^{\bar{o}}\|_{\infty}\leq\varepsilon\leq\alpha_{k,1},\quad k\geq\underline{k}^{\prime}, (88)

which means tkαk,1[c^k,1o¯]t_{k}\in\mathcal{B}_{\alpha_{k,1}}^{\infty}[\hat{c}_{k,1}^{\bar{o}}], i.e., (84) holds.
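The limit \|t_{k}\|_{\infty}\to 0 invoked here reflects that the powers of a Schur-stable matrix annihilate any fixed vector, so the threshold \varepsilon is eventually met. A minimal numerical sketch with a stand-in Schur-stable matrix `A` (hypothetical entries, playing the role of \tilde{A}_{\bar{o}} under the Schur-stability assumption implicit in (81)):

```python
import numpy as np

# Stand-in Schur-stable matrix (spectral radius 0.6 < 1); hypothetical entries
A = np.array([[0.5, 0.4],
              [0.0, 0.6]])
assert max(abs(np.linalg.eigvals(A))) < 1  # Schur stability

t = np.array([10.0, -7.0])       # arbitrary vector propagated by the powers of A
norms = []
for _ in range(80):
    t = A @ t
    norms.append(np.linalg.norm(t, np.inf))

# The norm eventually drops below any prescribed epsilon and stays small,
# mirroring the existence of k' in (88)
eps = 1e-6
k_prime = next(k for k, n in enumerate(norms) if n <= eps)
```

For this `A`, `k_prime` is reached after a few dozen steps; the decay rate is governed by the spectral radius of `A`.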

For (85), as (74) holds, 𝒯k,2o¯\mathcal{T}_{k,2}^{\bar{o}} in (82) is bounded by

𝒯k,2o¯i=δ¯k1A~o¯k1i(A~21𝒵~ioB~o¯𝐰i)i=δ¯k1A~o¯k1iIH¯(A~21𝒵~ioB~o¯𝐰i)(n)i=δ¯k1A~o¯k1ii[c^iin](o)αk,2[c^k,2o¯],\begin{split}\mathcal{T}_{k,2}^{\bar{o}}&\subseteq\sum_{i=\bar{\delta}}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}(\tilde{A}_{21}\widetilde{\mathcal{Z}}_{i}^{o}\oplus\tilde{B}_{\bar{o}}\llbracket\mathbf{w}_{i}\rrbracket)\\ &\subseteq\sum_{i=\bar{\delta}}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}\overline{\mathrm{IH}}(\tilde{A}_{21}\widetilde{\mathcal{Z}}_{i}^{o}\oplus\tilde{B}_{\bar{o}}\llbracket\mathbf{w}_{i}\rrbracket)\\ &\stackrel{{\scriptstyle(n)}}{{\subseteq}}\sum_{i=\bar{\delta}}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}\mathcal{B}_{\ell_{i}}^{\infty}[\hat{c}_{i}^{\mathrm{in}}]\stackrel{{\scriptstyle(o)}}{{\subseteq}}\mathcal{B}_{\alpha_{k,2}}^{\infty}[\hat{c}_{k,2}^{\bar{o}}],\end{split} (89)

where (n)(n) follows from Lines 9-10 and the fact that 2G^iin2\|\hat{G}_{i}^{\mathrm{in}}\|_{\infty} gives the maximum edge length of the interval hull. Recall that αk,2=Υk1\alpha_{k,2}=\Upsilon_{\infty}\ell_{k-1} and c^k,2o¯=i=δ¯k1A~o¯k1ic^iin\hat{c}_{k,2}^{\bar{o}}=\sum_{i=\bar{\delta}}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}\hat{c}_{i}^{\mathrm{in}}, and observe that

i=δ¯k1A~o¯k1ii[c^iin]i=δ¯k1A~o¯k1ii[c^iin],\displaystyle\sum_{i=\bar{\delta}}^{k-1}\tilde{A}_{\bar{o}}^{k-1-i}\mathcal{B}_{\ell_{i}}^{\infty}[\hat{c}_{i}^{\mathrm{in}}]\subseteq\sum_{i=\bar{\delta}}^{k-1}\mathcal{B}_{\|\tilde{A}_{\bar{o}}^{k-1-i}\|_{\infty}\ell_{i}}^{\infty}[\hat{c}_{i}^{\mathrm{in}}],
i=δ¯k1A~o¯k1iLemma4infγ(ρ(A~o¯),1)βγ(A~o¯)1γ=Υ.\displaystyle\sum_{i=\bar{\delta}}^{k-1}\|\tilde{A}_{\bar{o}}^{k-1-i}\|_{\infty}\stackrel{{\scriptstyle\mathrm{Lemma~{}\ref{lem:Bound on Matrix Power Norm}}}}{{\leq}}\!\!\inf_{\gamma\in(\rho(\tilde{A}_{\bar{o}}),1)}\!\!\frac{\beta_{\gamma}(\tilde{A}_{\bar{o}})}{1-\gamma}=\Upsilon_{\infty}.

Combining these two relations yields (o) in (89), i.e., (85) holds.
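The constant \Upsilon_{\infty} bounds the whole series of matrix-power norms via a geometric series: assuming (consistently with Lemma's notation) that \beta_{\gamma}(A)=\sup_{j\geq 0}\|A^{j}\|_{\infty}/\gamma^{j}, every term satisfies \|A^{j}\|_{\infty}\leq\beta_{\gamma}(A)\gamma^{j}, so the partial sums never exceed \beta_{\gamma}(A)/(1-\gamma). A numerical check with a hypothetical Schur-stable `A` and \gamma=0.8:

```python
import numpy as np

# Hypothetical Schur-stable matrix standing in for the lemma's A
A = np.array([[0.5, 0.4],
              [0.0, 0.6]])

def beta(gamma, n_terms=200):
    # beta_gamma(A) = sup_{j>=0} ||A^j||_inf / gamma^j (finite when gamma > rho(A))
    M, b = np.eye(2), 0.0
    for j in range(n_terms):
        b = max(b, np.linalg.norm(M, np.inf) / gamma**j)
        M = M @ A
    return b

# Partial sum of ||A^j||_inf over the same range of powers
partial_sum, M = 0.0, np.eye(2)
for _ in range(200):
    partial_sum += np.linalg.norm(M, np.inf)
    M = M @ A

gamma = 0.8                        # any gamma in (rho(A), 1)
upsilon = beta(gamma) / (1 - gamma)  # geometric-series bound
```

As expected, `partial_sum` stays below `upsilon`; tightening the bound amounts to taking the infimum over admissible `gamma`, as in the display above.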

Thus, (84) and (85) give (86).

With (75), (76), and (86), we get 𝐱k|y0:kP1(𝒯^ko×𝒯^ko¯)\llbracket\mathbf{x}_{k}|y_{0:k}\rrbracket\subseteq P^{-1}\big{(}\hat{\mathcal{T}}_{k}^{o}\times\hat{\mathcal{T}}_{k}^{\bar{o}}\big{)} for kk¯k\geq\underline{k}^{\prime}. Setting k¯=k¯+δ¯\underline{k}=\underline{k}^{\prime}+\bar{\delta}, we can obtain for kk¯2δ¯k\geq\underline{k}\geq 2\bar{\delta},

𝐱kδ¯|y0:kδ¯P1(𝒯^kδ¯o×𝒯^kδ¯o¯).\llbracket\mathbf{x}_{k-\bar{\delta}}|y_{0:k-\bar{\delta}}\rrbracket\subseteq P^{-1}\big{(}\hat{\mathcal{T}}_{k-\bar{\delta}}^{o}\times\hat{\mathcal{T}}_{k-\bar{\delta}}^{\bar{o}}\big{)}. (90)

Therefore, we have

𝐱k|y0:k=(7)Fk,kδ¯(𝐱kδ¯|y0:kδ¯)(90)Fk,kδ¯(P1(𝒯^kδ¯o×𝒯^kδ¯o¯))=Line8𝒵k\llbracket\mathbf{x}_{k}|y_{0:k}\rrbracket\stackrel{{\scriptstyle\eqref{eqn:Filtering Map}}}{{=}}F_{k,k-\bar{\delta}}\big{(}\llbracket\mathbf{x}_{k-\bar{\delta}}|y_{0:k-\bar{\delta}}\rrbracket\big{)}\\ \stackrel{{\scriptstyle\eqref{eqninpf:thm:Stability of OIT-CZ SMF - Well-Posedness - Step 2 - An Outer Bound on Actual Range}}}{{\subseteq}}F_{k,k-\bar{\delta}}\big{(}P^{-1}\big{(}\hat{\mathcal{T}}_{k-\bar{\delta}}^{o}\times\hat{\mathcal{T}}_{k-\bar{\delta}}^{\bar{o}}\big{)}\big{)}\stackrel{{\scriptstyle\mathrm{Line}~{}\ref{line:OIT-CZ SMF - Estimate Inspired by OIT}}}{{=}}\mathcal{Z}_{k} (91)

for kk¯k\geq\underline{k}, i.e., (24) holds.

Uniformly bounded estimate: From Algorithm 3, the uniform boundedness of 𝒵k\mathcal{Z}_{k} only depends on kδ¯k\geq\bar{\delta}. By Line 8, 𝒵ko=PoFk,kδ¯(P1(𝒯^kδ¯o×𝒯^kδ¯o¯))\mathcal{Z}_{k}^{o}=P_{o}F_{k,k-\bar{\delta}}\big{(}P^{-1}\big{(}\hat{\mathcal{T}}_{k-\bar{\delta}}^{o}\times\hat{\mathcal{T}}_{k-\bar{\delta}}^{\bar{o}}\big{)}\big{)} holds for kδ¯k\geq\bar{\delta}, which combined with Proposition 4 and (75) guarantees the uniform boundedness of 𝒵ko\mathcal{Z}_{k}^{o} w.r.t. kδ¯k\geq\bar{\delta}. Then, by a similar argument, we obtain the uniform boundedness of 𝒵k\mathcal{Z}_{k} w.r.t. k0k\in\mathbb{N}_{0}. \square