A New Least Squares Parameter Estimator for Nonlinear Regression Equations with Relaxed Excitation Conditions and Forgetting Factor

Romeo Ortega, Jose Guadalupe Romero, Stanislav Aranovskiy

Departamento Académico de Sistemas Digitales, ITAM, Progreso Tizapán 1, Ciudad de México, 04100, México
IETR–CentraleSupélec, Avenue de la Boulaie, 35576 Cesson-Sévigné, France
Department of Control Systems and Robotics, ITMO University, Kronverkskiy av. 49, Saint Petersburg, 197101, Russia

Abstract

In this note a new high performance least squares parameter estimator is proposed. The main features of the estimator are: (i) global exponential convergence is guaranteed for all identifiable linear regression equations; (ii) it incorporates a forgetting factor allowing it to preserve alertness to time-varying parameters; (iii) thanks to the addition of a mixing step it relies on a set of scalar regression equations ensuring a superior transient performance; (iv) it is applicable to nonlinearly parameterized regressions verifying a monotonicity condition and to a class of systems with switched time-varying parameters; (v) it is shown that it is bounded-input-bounded-state stable with respect to additive disturbances; (vi) continuous and discrete-time versions of the estimator are given. The superior performance of the proposed estimator is illustrated with a series of examples reported in the literature.

keywords:

Parameter estimation, Least squares identification algorithm, Nonlinear regression model, Exponentially convergent identification.

1 Introduction

We have witnessed in the last few years an increasing interest in the analysis and design of new parameter estimators for linearly parameterized regression equations (LPRE) of the form $y(t)=\phi^{\top}(t)\theta$, with $y(t)\in\mathbb{R},\;\phi(t)\in\mathbb{R}^{q}$ measurable signals and $\theta\in\mathbb{R}^{q}$ a constant vector of unknown parameters. (Footnote 1: We consider the case of scalar $y(t)$ to simplify the notation; as will be seen below, all results can be directly extended to the case of vector $y(t)$.) The main motivation of this research is to relax the highly restrictive assumption of persistent excitation (PE) imposed to guarantee global exponential convergence of classical gradient, least squares (LS) or Kalman-Bucy algorithms [16, 48, 52]. A second important motivation is to guarantee good transient performance, since the transient behavior of the aforementioned schemes is highly unpredictable and only a weak monotonicity property of the norm of the vector of estimation errors can be ensured.

1.1 Review of recent literature on LS estimators

It has recently been shown [3, 43] that global asymptotic (but not exponential) convergence of the error equation for standard continuous-time (CT) gradient estimators is ensured under a strictly weaker condition of generalized PE; see [3, 55] for its definition and [13] for some robustness properties of the algorithm. Unfortunately, this condition is still too restrictive to be of practical use. In [7] it is shown that the classical discrete-time (DT) LS algorithm is asymptotically convergent if and only if the regressor $\phi(t)$ satisfies a new excitation property, called weak PE, that is strictly weaker than PE. This result is of limited interest because, on one hand, the definition is extremely technical and difficult to verify in applications. On the other hand, and more importantly, the analysis is limited to standard LS without forgetting factor or covariance resetting, which, as is well known [16, 48], has a decreasing adaptation gain and therefore loses its alertness to track parameter variations, which is the main motivation for recursive algorithms. In [29] the underexcited scenario where the Gram matrix of the regressor has a $q_{0}$-dimensional kernel, with $q_{0}\leq q$, is considered. It is shown that by incorporating into a CT LS (or gradient) estimator the information of a basis spanning this kernel (the columns of the matrix $N\in\mathbb{R}^{q_{0}\times q}$ in equation (14) or (24), whose columns $v_{i}\in\mathbb{R}^{q},\;i=1,\dots,q_{0}$, satisfy equation (4)) it is possible to guarantee consistent estimation on its complementary space. Clearly, if the regressor is PE the dimension $q_{0}$ of the aforementioned kernel is zero and convergence of all the parameters is guaranteed, confirming the well-known result of the PE case. This result is related to the partial convergence property of [48, Theorem 2.7.4], where a similar fact is proven in the context of systems identification in underexcited situations. Although of theoretical interest, the result has little practical relevance because the matrix $N$ mentioned above cannot be computed on-line. In [10] a slight modification of the DT LS algorithm is proposed to deal with parameter variations in the LPRE; however, the main convergence (to a compact set) results still rely on PE assumptions. In [22], an LS version of the well-known I&I estimator [2] is proposed, while in [28] a variation of LS that defines a passive operator is proposed.

In [49] an interesting generalization of the classical CT LS with forgetting factor is introduced, where the latter is allowed to be a time-varying matrix. See [50] for a similar result in DT. The convergence analysis in both papers still relies on the PE assumption. In the very recent paper [5] a general structure to design and analyze DT LS-like recursive estimators parameterized in terms of some free functions is proposed; see equation (7). A new definition of excitation, called $t^{\star}$-excitation, is given in [5, Definition 1]. Interestingly, this definition involves not only the regressor but also two of the aforementioned functions. A particular choice of these functions yields the classical LS estimator, for which $t^{\star}$-excitation is equivalent to PE. Other choices of these free functions may yield weaker excitation conditions, for instance the choice given in (27). However, this selection again drives the adaptation gain to zero, so the estimator loses its alertness, and it is not clear whether there are other choices that do not suffer from this drawback. Another novelty of [5] is that it incorporates a very interesting analysis of robustness to additive disturbances in the LPRE, captured by the input-to-state stability property. The interested reader is referred to [5, 49, 50] for a review of the extensive literature on LS with forgetting factors.

1.2 Relaxing the PE condition

A major breakthrough in the design of recurrent estimators is the proof that it is possible to establish global convergence under the extremely weak assumption of interval excitation (IE) [20], called initial excitation in [41] and excitation over a finite interval in [52]. (Footnote 2: It should be pointed out that IE is strictly weaker than the generalized PE of [55], the weak PE property of [7] and the $t^{\star}$-excitation of [5].) To the best of the authors’ knowledge the first estimators where such a result was established are the concurrent and the composite learning schemes reported in [9] and [40], respectively; see [34] for a recent survey on new estimators. These algorithms, which incorporate the monitoring of past data to build a stack of suitable regressor vectors, are closer in spirit to off-line estimators. See also [19, 31] for two early references where a similar idea is explored. As is well known, the main drawback of off-line estimators is their inability to track parameter variations, which is very often the main objective in applications. This situation motivates the interest in developing bona fide on-line estimators that relax the PE condition while preserving the scheme’s alertness [26].

New on-line estimators relying on the use of the dynamic regressor extension and mixing (DREM) technique with weaker excitation requirements have been recently proposed. DREM was first proposed in [1] for CT and in [4] for DT systems. In Appendix A it is recalled that the main step in the derivation of DREM estimators is the construction of a new extended LPRE $Y(t)=\Phi(t)\theta$, with $Y(t)\in\mathbb{R}^{q}$ and $\Phi(t)\in\mathbb{R}^{q\times q}$ a new square matrix regressor. Two procedures to construct the extended LPRE, reported in [21] and [24], respectively, were originally considered; for the sake of completeness both constructions are reviewed in Appendix A. The final, and critical, mixing step consists of the multiplication of this extended LPRE by the adjugate of $\Phi(t)$. (Footnote 3: It is interesting to note that this operation was independently reported in [8] in the context of stochastic estimator convergence analysis.) Clearly, this operation creates a set of scalar LPREs of the form ${\cal Y}_{i}(t)=\Delta(t)\theta_{i},\;i\in\bar{q}$, with ${\cal Y}(t):=\mbox{adj}\{\Phi(t)\}Y(t)$ and $\Delta(t):=\det\{\Phi(t)\}$ a scalar regressor, which is the essential feature of the approach. (Footnote 4: In Appendix A DREM is applied to nonlinearly parameterized regression equations (NLPRE) of the form (1), which are also considered in this paper.) DREM estimators have been successfully applied in a variety of identification and adaptive control problems, both theoretical and practical; see [34, 38] for an account of some of these results.
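The mixing step described above can be illustrated numerically. The following Python sketch assumes NumPy and uses a hypothetical constant $3\times 3$ matrix standing in for $\Phi(t)$ at one time instant; the helper names `adjugate` and `mix` are illustrative and do not come from the paper.

```python
import numpy as np


def adjugate(M):
    """Adjugate (transposed cofactor matrix); defined even when M is singular."""
    n = M.shape[0]
    if n == 1:
        return np.ones((1, 1))
    cof = np.empty((n, n))
    for i in range(n):
        for j in range(n):
            minor = np.delete(np.delete(M, i, axis=0), j, axis=1)
            cof[i, j] = (-1) ** (i + j) * np.linalg.det(minor)
    return cof.T


def mix(Phi, Y):
    """Mixing step: returns calY = adj(Phi) Y and Delta = det(Phi)."""
    return adjugate(Phi) @ Y, np.linalg.det(Phi)


# Hypothetical data, chosen only for illustration.
theta = np.array([1.0, -2.0, 0.5])
Phi = np.array([[2.0, 1.0, 0.0],
                [0.0, 1.0, 3.0],
                [1.0, 0.0, 1.0]])
Y = Phi @ theta                      # extended LPRE: Y = Phi theta

calY, Delta = mix(Phi, Y)
# Each component now obeys a scalar regression calY_i = Delta * theta_i.
print(np.allclose(calY, Delta * theta))
```

Because $\mbox{adj}\{\Phi\}\Phi=\det\{\Phi\}I_{q}$ holds for every square matrix, the decoupling does not require $\Phi$ to be invertible; invertibility only matters for whether the scalar regressor $\Delta$ carries excitation.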

The convergence properties of DREM-based estimators clearly depend on the scalar regressor $\Delta(t)$. Due to the scalar nature of $\Delta(t)$, the parameter error converges if and only if $\Delta(t)$ is not square integrable (summable for DT systems) and convergence is exponential if and only if $\Delta(t)$ is PE, facts that were proven in [1]. In [14] a DREM-based algorithm using the extended regressor of [21] that ensures convergence in finite time imposing the IE assumption on $\Delta(t)$ was proposed. An interesting open question was to establish the relation between the excitation of $\Delta(t)$ and that of the original regressor $\phi(t)$, which was studied in [17] and [55] for the extended regressors of [21] and [24], respectively. The equivalence between PE of $\Delta(t)$ and PE of the original regressor $\phi(t)$ was established for both extended regressors, proving that DREM-based estimators are at least as good as standard gradient or LS schemes for excited LPRE. On the other hand, in [17] it is shown that if $\phi(t)$ is IE then $\Delta(t)$ is also IE for the extended regressor of [21], while in [55] it was shown that the scheme of [24] ensures the stronger property that $\Delta(t)$ is bounded away from zero in the interval $[t_{c},\infty)$ with $t_{c}>0$. Finally, in [38, Proposition 3] a new extended regressor which guarantees exponential convergence under conditions that are strictly weaker than regressor PE was presented.

Three major developments in this line of research reported recently are:

(i) the proposal in [18] and [53] of two new, fully on-line, DREM-based estimators where exponential convergence is established imposing only the IE condition to $\phi(t)$;

(ii) the proof in [53] that IE of the regressor $\phi(t)$ is equivalent to identifiability of the LPRE. It should be recalled that identifiability of a LPRE is the existence of $q$ linearly independent regressor vectors [53, Definition 2] $\phi(t_{i}),\,i\in\bar{q}$, and is a necessary and sufficient condition for the on- or off-line estimation of the parameters [16];

(iii) the proof in [53] that the proposed estimator is applicable also to separable NLPRE of the form (1), provided the mapping ${\cal G}(\theta)$ verifies a monotonicity condition. Estimators for this kind of NLPRE were reported before in [36], but the convergence condition was expressed in terms of the scalar regressor $\Delta(t)$.

The estimators of [18] and [53] rely on the generation of new LPRE using the main idea of the generalized parameter estimation based observer (GPEBO), which is a technique to design state observers for state-affine nonlinear systems, first proposed in [32] and later generalized in [35]. GPEBO translates the problem of state estimation into one of parameter estimation from an LPRE. The latter is generated exploiting the well-known property [47, Property 4.4] that the trajectories of an LTV system can be expressed as linear combinations of the columns of its fundamental matrix. Besides the addition of the computationally demanding calculation of the fundamental matrix, a potential drawback of GPEBO is that it essentially reconstructs the initial conditions of some error equation, an operation which may adversely affect the robustness of the estimator; see [38, Remark 7] and [37], and also [54]. The construction of the estimator of [18] first applies DREM and then invokes GPEBO, hence we refer to it as D+G. On the other hand, the estimator of [53] also uses GPEBO and DREM, but in the opposite order, so we refer to it in the sequel as G+D.

1.3 Contributions of the paper

In this paper we provide an alternative to the D+G and G+D estimators that also ensures global exponential convergence under the weak assumption of IE of the original regressor $\phi(t)$ . The main features of this new estimator are summarized as follows.

F1 In contrast to the D+G and G+D estimators that implement a gradient descent search, we use the classical LS technique, hence we refer to it in the sequel as LS+D estimator. The superior convergence properties of LS estimators, as opposed to gradient-based, are widely recognized [16, 25, 45].

F2 We avoid the use of the GPEBO technique but instead exploit some structural properties of the LS estimator to construct the extended regressor. This fact removes the need to calculate the computationally demanding fundamental matrix.

F3 Similarly to the G+D scheme, the stability mechanism and, consequently, the stability analysis of the LS+D estimator are much more transparent than those of the D+G estimator. A consequence of this fact is that the procedure of tuning the estimator to achieve a satisfactory transient performance, which is difficult for the D+G scheme, is straightforward for the LS+D one.

F4 A time-varying forgetting factor that allows the estimator to preserve its alertness to time-varying parameters is incorporated.

F5 Besides the case of LPRE we consider (separable and monotonic) NLPRE, with the associated estimator preserving all the properties of the case of LPRE. Also, we show that the proposed estimator is applicable to NLPRE with switched time-varying parameters.

F6 We show that the new estimator is robust with respect to additive disturbances, by proving that it defines a bounded-input-bounded-state (BIBS) stable system.

F7 The behaviour of many physical systems is described via CT models. On the other hand, DT implementations of estimators are of significant practical relevance. Therefore, similarly to [18, 38, 53], to comply with both scenarios we consider in the paper both kinds of LPREs. Interestingly, in contrast to [18], the construction and analysis tools of both cases are essentially the same—however, for the sake of clarity, they are presented in separate sections.

The remainder of the paper is organized as follows. In Section 2 we present the main result of the paper for CT systems, while the DT version is given in Section 3. For the sake of brevity we give both results for the general case of NLPRE, presenting the LPRE case as a corollary. Section 4 is devoted to the derivation of the proposed extended NLPRE applying directly the DREM construction procedure. Section 5 is devoted to the proof of robustness of the new estimator. Simulation results of some examples reported in the literature are given in Section 6 to illustrate the superior performance of the proposed LS+D estimator. The paper is wrapped up with concluding remarks in Section 7.

Notation. $I_{n}$ is the $n\times n$ identity matrix and ${\bf 0}_{s\times r}$ is an $s\times r$ matrix of zeros. $\mathbb{R}_{>0}$ , $\mathbb{R}_{\geq 0}$ , $\mathbb{Z}_{>0}$ and $\mathbb{Z}_{\geq 0}$ denote the positive and non-negative real and integer numbers, respectively. For $q\in\mathbb{Z}_{>0}$ we defined the set $\bar{q}:=\{1,2,\dots,q\}$ . For $a\in\mathbb{R}^{n}$ , we denote $|a|^{2}:=a^{\top}a$ , and for any matrix $A$ its induced norm is $\|A\|$ . $\mbox{vec}:\mathbb{R}^{p\times p}\to\mathbb{R}^{p^{2}}$ is an operator that piles up the columns of a matrix. CT signals $x:\mathbb{R}_{\geq 0}\to\mathbb{R}^{n}$ are denoted $x(t)$ , while for DT sequences $x:\mathbb{Z}_{\geq 0}\to\mathbb{R}^{n}$ we use $x_{k}:=x(kT_{s})$ , with $T_{s}\in\mathbb{R}_{>0}$ the sampling time. The action of an operator $\mathcal{H}$ on a CT signal $s(t)$ is denoted as $\mathcal{H}[s](t)$ , and $\mathcal{H}[s](k)$ for a sequence $s_{k}$ .

2 Main Result for Continuous-time Systems

In this section we present the proposed LS+D interlaced estimator for CT systems, with the first estimator being the LS with bounded-gain forgetting factor proposed in [51, Subsection 8.7.6]. First, we consider the case of NLPRE and then specialize to LPRE that, as expected, ensures stronger convergence properties.

2.1 Nonlinearly parameterized regression equations

Consider the following CT NLPRE

y(t)=\phi^{\top}(t){\cal G}(\theta)

(1)

where $y(t)\in\mathbb{R}$ , $\phi(t)\in\mathbb{R}^{p}$ and ${\cal G}:\mathbb{R}^{q}\to\mathbb{R}^{p},\;q\leq p$ , a smooth mapping verifying the following.

Assumption A1. [Monotonicity] There exists a matrix $Q\in\mathbb{R}^{q\times p}$ such that mapping ${\cal G}(\theta)$ verifies the linear matrix inequality

\displaystyle{Q\nabla{\cal G}(\theta)+\nabla^{\top}{\cal G}(\theta)Q^{\top}\geq\rho I_{q}>0,\;\forall\;\theta\in\mathbb{R}^{q},}

(2)

for some $\rho\in\mathbb{R}_{>0}$. Consequently [12, 42], the mapping $Q{\cal G}(\theta)$ is strongly monotone, that is,

(a-b)^{\top}[Q{\cal G}(a)-Q{\cal G}(b)]\geq\rho|a-b|^{2}>0,\;\forall\;a,b\in\mathbb{R}^{q},

with $a\neq b$.

Assumption A2. [Interval Excitation] The regressor $\phi(t)$ is interval exciting (IE) [52, Definition 3.1]. (Footnote 5: In [52, Definition 3.1] there is an initial time in the integral that, for simplicity and without loss of generality, is taken here as zero.) That is, there exist constants $C_{c}>0$ and $t_{c}>0$ such that

\int_{0}^{t_{c}}\phi(s)\phi^{\top}(s)ds\geq C_{c}I_{p}.

(3)

Proposition 1.

Consider the NLPRE (1) with ${\cal G}(\theta)$ satisfying Assumption A1 and $\phi(t)$ verifying Assumption A2. Define the LS+D interlaced estimator with time-varying forgetting factor

Assumption A3. [Lipschitz] The mapping ${\cal G}(\theta)$ satisfies the Lipschitz condition

|{\cal G}(a)-{\cal G}(b)|\leq\nu|a-b|,\;\forall a,b\in\mathbb{R}^{q},

(4ein)

for some $\nu>0$ .

Moreover, assume the regressor $\phi_{k}$ satisfies the following.

Assumption A4. [Interval Excitation] [52, Definition 3.3] The regressor $\phi_{k}$ is IE. That is, there exist constants $C_{d}>0$ and $k_{c}>0$ such that

\sum_{j=0}^{k_{c}}\phi_{j}\phi^{\top}_{j}\geq C_{d}I_{p}.

(4eip)
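As an illustration of Assumption A4, the following Python sketch checks the inequality (4eip) numerically through the smallest eigenvalue of the Gram sum. It assumes NumPy; the function name `is_interval_exciting` and the test regressor are hypothetical. The example regressor is non-zero only at the first two samples, so it is IE but clearly not PE.

```python
import numpy as np


def is_interval_exciting(phi, k_c, C_d):
    """Check sum_{j=0}^{k_c} phi_j phi_j^T >= C_d * I for rows phi_j of `phi`."""
    window = phi[: k_c + 1]                 # samples j = 0, ..., k_c
    gram = window.T @ window                # sum of outer products
    return bool(np.linalg.eigvalsh(gram)[0] >= C_d)


# Hypothetical regressor: excited during two samples, identically zero afterwards.
phi = np.zeros((50, 2))
phi[0] = [1.0, 0.0]
phi[1] = [0.0, 1.0]

print(is_interval_exciting(phi, k_c=1, C_d=0.5))
print(is_interval_exciting(phi, k_c=0, C_d=0.5))
```

Using the smallest eigenvalue is one of several equivalent ways to test the matrix inequality; a Cholesky factorization of the difference would serve the same purpose.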

Proposition 2.

Consider the NLPRE (LABEL:nlprek) with ${\cal G}(\theta)$ satisfying Assumption A1 and A3 and $\phi_{k}$ verifying Assumption A4. Define the normalized LS+DREM interlaced estimator


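\hat{\eta}_{k+1}=\hat{\eta}_{k}+\frac{1}{\beta+\phi^{\top}_{k}F_{k-1}\phi_{k}}F_{k-1}\phi_{k}(y_{k}-\phi^{\top}_{k}\hat{\eta}_{k}),

(4eiqa)

F_{k}=\frac{1}{\beta}\left(I_{p}-\frac{1}{\beta+\phi^{\top}_{k}F_{k-1}\phi_{k}}F_{k-1}\phi_{k}\phi^{\top}_{k}\right)F_{k-1},\quad z_{k+1}=\beta z_{k},\;z_{0}=1,

(4eiqb)

\hat{\theta}_{k+1}=\hat{\theta}_{k}+\gamma Q\frac{\Delta_{k}}{1+\Delta^{2}_{k}}[{\cal Y}_{k}-\Delta_{k}{\cal G}(\hat{\theta}_{k})],\;\hat{\theta}_{0}=:\theta_{0}\in\mathbb{R}^{q},

(4eiqc)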

with initial conditions $\hat{\eta}_{0}=:\eta_{0}\in\mathbb{R}^{p}$ , $F_{-1}={1\over f_{0}}I_{p}$ and the definitions


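\Delta_{k}:=\det\{I_{p}-f_{0}z_{k}F_{k-1}\},\quad{\cal Y}_{k}:=\mbox{adj}\{I_{p}-f_{0}z_{k}F_{k-1}\}(\hat{\eta}_{k}-f_{0}z_{k}F_{k-1}\eta_{0}),

(4eiqr)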

with tuning parameters the initial condition $f_{0}>0$ , the forgetting factor $\beta\in(0,1]$ and the adaptation gain $\gamma>0$ , which is selected such that

\sigma:=\rho-{\gamma\nu^{2}\over 2}\lambda_{\max}\{Q^{\top}Q\}>0.

(4eiqrs)

Define the parameter estimation error $\tilde{\theta}_{k}:=\hat{\theta}_{k}-\theta$ . Then, for all $f_{0}>0$ , $\eta_{0}\in\mathbb{R}^{p}$ and $\theta_{0}\in\mathbb{R}^{q}$ , we have that

\lim_{k\to\infty}|\tilde{\theta}_{k}|=0,\;(exp),

(4eiqrt)

with all signals bounded.
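The gain condition (4eiqrs) is equivalent to the explicit bound $\gamma<2\rho/(\nu^{2}\lambda_{\max}\{Q^{\top}Q\})$. The Python sketch below, assuming NumPy, evaluates this bound and $\sigma$; the function names and the numerical values of $\rho$, $\nu$ and $Q$ are hypothetical and serve only as an illustration.

```python
import numpy as np


def max_adaptation_gain(rho, nu, Q):
    """Supremum of gamma for which sigma in (4eiqrs) stays positive."""
    lam_max = np.linalg.eigvalsh(Q.T @ Q)[-1]      # largest eigenvalue of Q^T Q
    return 2.0 * rho / (nu ** 2 * lam_max)


def sigma(rho, nu, Q, gamma):
    """sigma = rho - (gamma * nu^2 / 2) * lambda_max(Q^T Q)."""
    lam_max = np.linalg.eigvalsh(Q.T @ Q)[-1]
    return rho - 0.5 * gamma * nu ** 2 * lam_max


Q = np.eye(2)            # hypothetical choice of Q
rho, nu = 1.0, 2.0       # hypothetical monotonicity and Lipschitz constants
gamma_sup = max_adaptation_gain(rho, nu, Q)
print(gamma_sup, sigma(rho, nu, Q, 0.5 * gamma_sup))
```

Any $\gamma$ strictly below `gamma_sup` satisfies the condition; choosing it well below the bound trades convergence speed for a larger margin $\sigma$.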

Proof.

To simplify the notation we define the normalization sequence

{m}_{k}:=\beta+\phi^{\top}_{k}F_{k-1}\phi_{k}.

(4eiqru)

With some abuse of notation, define the error signal

\tilde{\eta}_{k}:=\hat{\eta}_{k}-{\cal G}(\theta),

(4eiqrv)

whose dynamics is given by

\tilde{\eta}_{k+1}=\tilde{\eta}_{k}-\frac{1}{m_{k}}F_{k-1}\phi_{k}\phi^{\top}_{k}\tilde{\eta}_{k},

where we used (LABEL:nlprek) and (LABEL:thegk). Now, direct application of the matrix inversion lemma to (4eiqb) shows that

F^{-1}_{k}=\beta F^{-1}_{k-1}+\phi_{k}\phi^{\top}_{k}.

(4eiqrw)
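The identity (4eiqrw) can be checked numerically. The Python sketch below assumes NumPy and assumes that (4eiqb) is the standard forgetting-factor recursion $F_{k}=\frac{1}{\beta}\left(F_{k-1}-\frac{1}{m_{k}}F_{k-1}\phi_{k}\phi^{\top}_{k}F_{k-1}\right)$; the function name and the numerical values are hypothetical.

```python
import numpy as np


def ls_gain_update(F_prev, phi, beta):
    """One step of the gain recursion with forgetting factor beta (assumed form)."""
    m = beta + phi @ F_prev @ phi                     # normalization m_k
    return (F_prev - np.outer(F_prev @ phi, phi @ F_prev) / m) / beta


f0, beta = 2.0, 0.9                  # hypothetical tuning values
F_prev = np.eye(3) / f0              # F_{-1} = I / f0
phi = np.array([1.0, -0.5, 2.0])     # hypothetical regressor sample
F_new = ls_gain_update(F_prev, phi, beta)

# Matrix inversion lemma: inv(F_k) = beta * inv(F_{k-1}) + phi phi^T.
lhs = np.linalg.inv(F_new)
rhs = beta * np.linalg.inv(F_prev) + np.outer(phi, phi)
print(np.allclose(lhs, rhs))
```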

Combining (LABEL:tileta) and (4eiqrw) we can prove the following fundamental property of LS

F^{-1}_{k}\tilde{\eta}_{k+1}=\beta F^{-1}_{k-1}\tilde{\eta}_{k}.

Solving this difference equation we get

\tilde{\eta}_{k}=\beta^{k}F_{k-1}F^{-1}_{-1}\tilde{\eta}_{0}=f_{0}z_{k}F_{k-1}\tilde{\eta}_{0},

where we replaced the solution of (4eiq) and the initial condition choice $F_{-1}={1\over f_{0}}I_{p}$ to get the second identity. Using the definition (4eiqrv), the equation above may be rewritten as the extended LPRE

(I_{p}-f_{0}z_{k}F_{k-1}){\cal G}(\theta)=\hat{\eta}_{k}-f_{0}z_{k}F_{k-1}\eta_{0}.

Following the DREM procedure we multiply (LABEL:keyide) by $\mbox{adj}\{I_{p}-f_{0}z_{k}F_{k-1}\}$ to get the following NLPRE

{\cal Y}_{k}=\Delta_{k}{\cal G}(\theta),

(4eiqrx)

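where we used (LABEL:delk) and (4eiqr). Replacing (4eiqrx) in (4eiqc) we get the dynamics of the parameter error

\tilde{\theta}_{k+1}=\tilde{\theta}_{k}-\gamma\bar{\Delta}^{2}_{k}Q[{\cal G}(\hat{\theta}_{k})-{\cal G}(\theta)],

where, to simplify the notation, we defined the normalized scalar regressor sequence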

\bar{\Delta}^{2}_{k}:={\Delta^{2}_{k}\over 1+\Delta^{2}_{k}}\leq 1.

(4eiqry)

To analyze the stability of this equation define the Lyapunov function candidate

V_{k}=\frac{1}{2\gamma}|\tilde{\theta}_{k}|^{2},

(4eiqrz)

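that satisfies

V_{k+1}=V_{k}-\bar{\Delta}^{2}_{k}\tilde{\theta}^{\top}_{k}Q[{\cal G}(\hat{\theta}_{k})-{\cal G}(\theta)]+{\gamma\over 2}\bar{\Delta}^{4}_{k}|Q[{\cal G}(\hat{\theta}_{k})-{\cal G}(\theta)]|^{2}\leq V_{k}-\rho\bar{\Delta}^{2}_{k}|\tilde{\theta}_{k}|^{2}+{\gamma\nu^{2}\over 2}\lambda_{\max}\{Q^{\top}Q\}\bar{\Delta}^{4}_{k}|\tilde{\theta}_{k}|^{2}

(4eiqraa)

V_{k+1}\leq V_{k}-\left(\rho-{\gamma\nu^{2}\over 2}\lambda_{\max}\{Q^{\top}Q\}\right)\bar{\Delta}^{2}_{k}|\tilde{\theta}_{k}|^{2}=V_{k}-\sigma\bar{\Delta}^{2}_{k}|\tilde{\theta}_{k}|^{2},

(4eiqrab)

where we invoked Assumption A1 and Assumption A3 to get the first bound, (4eiqry) for the second one and used (4eiqrs) in the last identity. Summing the inequality above we get

V_{k+1}\leq V_{0}-\sigma\sum_{j=0}^{k}\bar{\Delta}^{2}_{j}|\tilde{\theta}_{j}|^{2}.

Taking the limit as $k\to\infty$ we conclude that $\bar{\Delta}_{k}|\tilde{\theta}_{k}|\in\ell_{2}$, consequently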

\lim_{k\to\infty}\bar{\Delta}_{k}|\tilde{\theta}_{k}|=0.

(4eiqrac)

Now, from the Algebraic Limit Theorem [46, Theorem 3.3] we know that the limit of the product of two convergent sequences is the product of their limits. On the other hand, from the fact that

V_{k+1}\leq V_{k}\leq V_{0},\;\forall k\in\mathbb{Z}_{\geq 0},

we have that $|\tilde{\theta}_{k}|$ is a bounded monotonic sequence, hence it converges [46, Theorem 3.14]. Finally, if $\bar{\Delta}_{k}$ converges to a non-zero limit, we conclude from (4eiqrac) that $|\tilde{\theta}_{k}|\to 0$.

We now prove that (4eip) of Assumption A4 ensures this property of $\Delta_{k}$, which suffices because, if $\Delta_{k}$ converges to a non-zero limit, then $\bar{\Delta}_{k}$ also converges to a non-zero limit. Indeed, the solution of the difference equation (4eiqrw) is given by

F^{-1}_{k}=\beta^{k+1}f_{0}I_{p}+\beta^{k}\sum_{j=0}^{k}\beta^{-j}\phi_{j}\phi^{\top}_{j}.

Evaluating this expression for $k=k_{c}$ yields

I_{p}-f_{0}z_{k_{c}+1}F_{k_{c}}=\beta^{k_{c}}F_{k_{c}}\sum_{j=0}^{k_{c}}\beta^{-j}\phi_{j}\phi^{\top}_{j}.

The IE assumption ensures that the summation term is positive definite; since $F_{k_{c}}$ is nonsingular, the matrix on the left-hand side is nonsingular as well. That this property holds for any $k>k_{c}$ follows from the observation that, for any $k_{b}>0$, we have

\sum_{j=0}^{k_{c}+k_{b}}\beta^{-j}\phi_{j}\phi^{\top}_{j}=\sum_{j=0}^{k_{c}}\beta^{-j}\phi_{j}\phi^{\top}_{j}+\sum_{j=k_{c}+1}^{k_{c}+k_{b}}\beta^{-j}\phi_{j}\phi^{\top}_{j},

preserving the positivity property mentioned above. This completes the proof. $\Box\Box\Box$
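The key identity behind the proof, $\tilde{\eta}_{k}=f_{0}z_{k}F_{k-1}\tilde{\eta}_{0}$, and the nonsingularity of $I_{p}-f_{0}z_{k}F_{k-1}$ under IE can be verified on noise-free data. The Python sketch below assumes NumPy and the standard forgetting-factor LS recursion with $z_{k+1}=\beta z_{k}$, $z_{0}=1$ (an assumption, since it is consistent with the proof but not quoted verbatim from the paper); the data and the name `run_ls` are hypothetical.

```python
import numpy as np


def run_ls(phis, ys, eta0, f0, beta):
    """Run the LS part; returns (eta_hat_N, F_{N-1}, z_N) after N samples."""
    p = len(eta0)
    eta_hat, F, z = eta0.copy(), np.eye(p) / f0, 1.0
    for phi, y in zip(phis, ys):
        m = beta + phi @ F @ phi
        eta_hat = eta_hat + (F @ phi) * (y - phi @ eta_hat) / m
        F = (F - np.outer(F @ phi, phi @ F) / m) / beta
        z = beta * z
    return eta_hat, F, z


# Hypothetical noise-free data with an interval-exciting regressor.
eta = np.array([1.0, -2.0, 0.5])                 # plays the role of G(theta)
phis = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0],
                 [1.0, 1.0, 0.0], [0.0, 1.0, 1.0]])
ys = phis @ eta
eta0, f0, beta = np.zeros(3), 2.0, 0.9

eta_hat, F, z = run_ls(phis, ys, eta0, f0, beta)
Phi_ext = np.eye(3) - f0 * z * F                 # extended regressor
Y_ext = eta_hat - f0 * z * F @ eta0              # extended output
Delta = np.linalg.det(Phi_ext)
eta_rec = np.linalg.solve(Phi_ext, Y_ext)        # meaningful because Delta != 0
print(np.allclose(eta_rec, eta))
```

The direct solve is used here only to expose the algebra; the estimator in the paper avoids the inversion by mixing with the adjugate and running a scalar gradient update.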

3.2 Linearly parameterized regression equations

In this section, we use the result of Proposition 2 for the case of LPRE—obviously, in this linear case Assumption A1 and Assumption A3 are automatically satisfied. As a first step we recall [53] that Definition LABEL:def1 and Lemma LABEL:lem1, given for continuous functions, are also valid for sequences.

As a second step notice that for the LPRE case (LABEL:keyide) takes the form

(I_{p}-f_{0}z_{k}F_{k-1})\theta=\hat{\eta}_{k}-f_{0}z_{k}F_{k-1}\eta_{0},

consequently (4eiqrx) now becomes

{\cal Y}_{i}(k)=\Delta_{k}\theta_{i},\;i\in\bar{q},

and the dynamics of the parameter error (LABEL:errequk) is now given by

\tilde{\theta}_{k+1}=\tilde{\theta}_{k}-{\gamma\bar{\Delta}_{k}^{2}}\tilde{\theta}_{k},

whose stability follows immediately from the IE assumption [53, Proposition 2].

Corollary 2.

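Consider the LPRE and assume it is identifiable. Define the normalized LS+D interlaced estimator with forgetting factor

\hat{\eta}_{k+1}=\hat{\eta}_{k}+\frac{1}{\beta+\phi^{\top}_{k}F_{k-1}\phi_{k}}F_{k-1}\phi_{k}(y_{k}-\phi^{\top}_{k}\hat{\eta}_{k}),

F_{k}=\frac{1}{\beta}\left(I_{p}-\frac{1}{\beta+\phi^{\top}_{k}F_{k-1}\phi_{k}}F_{k-1}\phi_{k}\phi^{\top}_{k}\right)F_{k-1},\quad z_{k+1}=\beta z_{k},\;z_{0}=1,

\hat{\theta}_{k+1}=\hat{\theta}_{k}+\gamma\frac{\Delta_{k}}{1+\Delta^{2}_{k}}[{\cal Y}_{k}-\Delta_{k}\hat{\theta}_{k}],\;\hat{\theta}_{0}=:\theta_{0}\in\mathbb{R}^{q},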

with initial conditions $\hat{\eta}_{0}=:\eta_{0}\in\mathbb{R}^{q}$ , $F_{-1}={1\over f_{0}}I_{p}$ , tuning gains $f_{0}>0,\;\beta\in(0,1]$ and $\gamma>0$ , and we used the definitions (4eiqr). Then, for all $f_{0}>0$ , $\eta_{0}\in\mathbb{R}^{q}$ and $\theta_{0}\in\mathbb{R}^{q}$ , we have that (4eiqrac) holds with all signals bounded. Moreover, the individual parameter errors verify the monotonicity condition

|\tilde{\theta}_{i}(k_{a})|\leq|\tilde{\theta}_{i}(k_{b})|,\;\forall\,k_{a}\geq k_{b}\geq 0,\;i\in\bar{q}.
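A minimal simulation of the LPRE case may help to see the corollary at work. The Python sketch below assumes NumPy, the standard forgetting-factor LS recursion, $z_{k+1}=\beta z_{k}$ with $z_{0}=1$, and $\gamma=1$; these choices, the function names and the data are assumptions made for illustration. The regressor is IE but not PE, since it stays fixed at $(1,0)$ after the second sample.

```python
import numpy as np


def adjugate(M):
    """Adjugate (transposed cofactor matrix); defined even when M is singular."""
    n = M.shape[0]
    if n == 1:
        return np.ones((1, 1))
    cof = np.empty((n, n))
    for i in range(n):
        for j in range(n):
            minor = np.delete(np.delete(M, i, axis=0), j, axis=1)
            cof[i, j] = (-1) ** (i + j) * np.linalg.det(minor)
    return cof.T


def ls_d_lpre(phis, ys, f0=1.0, beta=0.98, gamma=1.0):
    """LS+D sketch for y_k = phi_k^T theta; returns the history of theta_hat_k."""
    q = phis.shape[1]
    eta0 = np.zeros(q)                       # eta_hat_0 (hypothetical choice)
    eta_hat, theta_hat = eta0.copy(), np.zeros(q)
    F, z = np.eye(q) / f0, 1.0               # F_{-1} and z_0
    history = [theta_hat.copy()]
    for phi, y in zip(phis, ys):
        # Mixing: scalar regressions calY_i = Delta * theta_i.
        Phi_ext = np.eye(q) - f0 * z * F
        Delta = np.linalg.det(Phi_ext)
        calY = adjugate(Phi_ext) @ (eta_hat - f0 * z * F @ eta0)
        theta_hat = theta_hat + gamma * Delta / (1 + Delta**2) * (calY - Delta * theta_hat)
        # LS update with forgetting factor.
        m = beta + phi @ F @ phi
        eta_hat = eta_hat + (F @ phi) * (y - phi @ eta_hat) / m
        F = (F - np.outer(F @ phi, phi @ F) / m) / beta
        z = beta * z
        history.append(theta_hat.copy())
    return np.array(history)


theta = np.array([2.0, -1.0])                          # hypothetical parameters
phis = np.array([[1.0, 0.0], [0.0, 1.0]] + [[1.0, 0.0]] * 80)
ys = phis @ theta
hist = ls_d_lpre(phis, ys)
err = np.abs(hist - theta)                             # element-wise |theta_tilde_i|
print(err[-1].max())
```

In this noise-free run each component of the error is scaled at every step by the common factor $1-\gamma\bar{\Delta}^{2}_{k}\in[0,1]$, which is the mechanism behind the element-wise monotonicity stated above.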

Remark 2.

The importance of the element-by-element monotonicity property of the parameter error can hardly be overestimated. It played a key role in the relaxation of the assumption of known sign of the high-frequency gain in model reference adaptive control [14, 53] as well as in the solution of the adaptive pole placement problem [44].

3.3 Switching parameters case

In this section, we consider the case of switched parameters estimation. Whereas the results are presented in DT only, similar results can be formulated for the CT case in a straightforward manner. Rewrite (LABEL:nlprek) as

y_{k}=\phi_{k}^{\top}{\cal G}(\theta^{*}_{\sigma_{k}}),

(4eiqrad)

where $\theta^{*}_{\sigma_{k}}$ denotes the switched unknown parameter vector with $\theta^{*}_{\sigma_{k}}\in\{\theta^{*}_{1},\theta^{*}_{2},\ldots,\theta^{*}_{s}\}$, $s\in\mathbb{Z}_{>0}$. The switching signal $\sigma_{k}:\mathbb{Z}_{\geq 0}\to\bar{s}$ is a known piecewise-constant function defining the behavior of $\theta^{*}_{\sigma_{k}}$, i.e., $\theta^{*}_{\sigma_{k}}=\theta^{*}_{i}$ when $\sigma_{k}=i$, $i\in\bar{s}$. (Footnote 7: Such a situation arises in several practical control scenarios, when the known switching signal $\sigma_{k}$ characterizes known changes in operation regimes [23].) The known time instants when $\sigma_{k}$ changes its value are further denoted as $t_{r,i}$, $i\in\mathbb{Z}_{\geq 0}$.

The estimator (4eiq), (4eiqr) is not capable of estimating switched parameters since, for $\beta<1$, the sequence $z_{k}$ converges to zero, and for $\beta=1$ the LS estimator loses its alertness. To deal with switching parameters, we propose a resetting-based modification of the estimator (4eiq), (4eiqr):


\hat{\eta}_{k+1}=\hat{\eta}_{k}+\frac{1}{1+\phi^{\top}_{k}F_{k-1}\phi_{k}}F_{k-1}\phi_{k}(y_{k}-\phi^{\top}_{k}\hat{\eta}_{k}),

(4eiqraea)

\hat{\theta}_{k+1}=\hat{\theta}_{k}+\gamma Q\frac{\Delta_{k}}{1+\Delta^{2}_{k}}[{\cal Y}_{k}-\Delta_{k}{\cal G}(\hat{\theta}_{k})],\;\hat{\theta}_{0}=:\theta_{0}\in\mathbb{R}^{q},

(4eiqraeb)

F_{k}=\begin{cases}{\mathcal{N}}_{k}F_{k-1}&\mbox{ if }t_{k}\neq t_{r,i}\;\forall i,\\ \frac{1}{f_{0}}I_{p}&\mbox{ otherwise,}\end{cases}

(4eiqraec)

{\mathcal{N}}_{k}=\left(I_{p}-\frac{1}{1+\phi^{\top}_{k}F_{k-1}\phi_{k}}F_{k-1}\phi_{k}\phi^{\top}_{k}\right),

\psi_{k+1}=\begin{cases}\psi_{k}&\mbox{ if }t_{k}\neq t_{r,i}\;\forall i,\\ \hat{\eta}_{k+1}&\mbox{ otherwise,}\end{cases}\quad\psi_{0}=\eta_{0},

(4eiqraed)

where $\hat{\eta}_{0}=:\eta_{0}\in\mathbb{R}^{p}$ , $F_{-1}={1\over f_{0}}I_{p}$ , and


\Delta_{k}=\det\{I_{p}-f_{0}F_{k-1}\},

(4eiqrafa)

{\cal Y}_{k}:=\mbox{adj}\{I_{p}-f_{0}F_{k-1}\}(\hat{\eta}_{k}-f_{0}F_{k-1}\psi_{k}).

(4eiqrafb)

Between the resetting instants $t_{r,i}$, the estimator (4eiqrae), (4eiqraf) reproduces the estimator (4eiq), (4eiqr) with $\beta=1$ and thus $z_{k}\equiv 1$. Then, at each reset instant $t_{r,i}$, the matrix $F_{k}$ is reset to its initial condition $F_{-1}$, and the state $\psi_{k}$ saves the value of $\hat{\eta}_{k}$. The state $\psi_{k}$ thus plays the same role as $\eta_{0}$ in (LABEL:keyide); compare (4eiqrafb) and (4eiqr). Following the properties of (4eiq), (4eiqr), the proposed estimator ensures the boundedness of the states and is capable of estimating $\theta^{*}_{\sigma_{k}}$ if the following assumption holds.

Assumption A5. [Switching Interval Excitation]. The switching signal $\sigma_{k}$ is such that the regressor $\phi_{k}$ is IE between two subsequent switching instants. That is, there exist constants $C_{d}>0$ and $k_{c}>0$ such that for any $i\in\mathbb{Z}_{\geq 0}$

t_{r,i}+k_{c}\leq t_{r,i+1}

and

\sum_{\ell=0}^{k_{c}}\phi_{t_{r,i}+\ell}\phi^{\top}_{t_{r,i}+\ell}\geq C_{d}I_{p}.

Remark 3.

In words, Assumption A5 means that the regressor satisfies the IE condition inside each subinterval $[t_{r,i},t_{r,i+1}]$ . For simplicity we have taken that the constants $k_{c}$ and $C_{d}$ that appear in the definition of IE are the same for all subintervals $[t_{r,i},t_{r,i+1}]$ , but this is clearly not necessary.
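The resetting mechanism of (4eiqrae), (4eiqraf) can be sketched in Python as follows. The sketch assumes NumPy and specializes to the linear case ${\cal G}(\theta)=\theta$ with $Q=I_{p}$ and $\gamma=1$; the function names, the alternating regressor and the two parameter vectors are hypothetical choices made only to satisfy Assumption A5 in a transparent way.

```python
import numpy as np


def adjugate(M):
    """Adjugate (transposed cofactor matrix); defined even when M is singular."""
    n = M.shape[0]
    if n == 1:
        return np.ones((1, 1))
    cof = np.empty((n, n))
    for i in range(n):
        for j in range(n):
            minor = np.delete(np.delete(M, i, axis=0), j, axis=1)
            cof[i, j] = (-1) ** (i + j) * np.linalg.det(minor)
    return cof.T


def ls_d_switched(phis, ys, resets, f0=1.0, gamma=1.0):
    """Resetting LS+D sketch for linear G(theta) = theta and Q = I."""
    q = phis.shape[1]
    eta_hat, theta_hat = np.zeros(q), np.zeros(q)
    psi = eta_hat.copy()                     # psi_0 = eta_0
    F = np.eye(q) / f0                       # F_{-1}
    history = [theta_hat.copy()]
    for k, (phi, y) in enumerate(zip(phis, ys)):
        Phi_ext = np.eye(q) - f0 * F
        Delta = np.linalg.det(Phi_ext)
        calY = adjugate(Phi_ext) @ (eta_hat - f0 * F @ psi)
        theta_hat = theta_hat + gamma * Delta / (1 + Delta**2) * (calY - Delta * theta_hat)
        m = 1.0 + phi @ F @ phi
        eta_hat = eta_hat + (F @ phi) * (y - phi @ eta_hat) / m
        if k in resets:
            F = np.eye(q) / f0               # reset the gain matrix to F_{-1}
            psi = eta_hat.copy()             # store eta_hat_{k+1}
        else:
            F = F - np.outer(F @ phi, phi @ F) / m
        history.append(theta_hat.copy())
    return np.array(history)


theta_a, theta_b = np.array([1.0, 2.0]), np.array([-1.0, 0.5])
N, k_switch = 80, 40
phis = np.array([[1.0, 0.0] if k % 2 == 0 else [0.0, 1.0] for k in range(N)])
thetas = np.array([theta_a if k < k_switch else theta_b for k in range(N)])
ys = np.einsum("ij,ij->i", phis, thetas)     # y_k = phi_k^T theta_{sigma_k}
hist = ls_d_switched(phis, ys, resets={k_switch})
print(np.linalg.norm(hist[k_switch] - theta_a), np.linalg.norm(hist[-1] - theta_b))
```

Right after a reset $\Delta_{k}=0$, so the parameter estimate is frozen until the regressor has excited all directions again, which is why IE is required inside every subinterval.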

4 Derivation of the Extended NLPRE (LABEL:keyide1) via DRE

To simplify the reading of the material presented in this section we refer the reader to Appendix A where the procedure to derive DREM is recalled.

In Proposition 1 it is shown that the dynamic extension (LABEL:thegt1) and (LABEL:dotf) generates the extended NLPRE (LABEL:keyide1) to which we apply the mixing step S4 of Appendix A to generate the scalar NLPRE (LABEL:ydelc). In this section we prove that this extended NLPRE can also be derived directly applying the DREM step S2 of Appendix A for a suitably defined LTV operator ${\cal H}$. (Footnote 8: We refer the interested reader to [53, Proposition 3] where the DREM operator ${\cal H}$ for the G+D estimator reported in [53, Proposition 2] is identified.) For the sake of brevity we only consider the CT case, with the DT case following verbatim.

Proposition 3.

Define the state space realization of the LTV operator ${\cal H}:u\to U$ used in step S2 of Appendix A as in (LABEL:dotu) with

A(t):=-\alpha F(t)\phi(t)\phi^{\top}(t),\;b(t):=\alpha F(t)\phi(t),

with $F(t)$ defined in (LABEL:dotf). Starting from the NLPRE $y(t)=\phi^{\top}(t){\cal G}(\theta)$, construct $Y(t)\in\mathbb{R}^{p}$ and $\Phi(t)\in\mathbb{R}^{p\times p}$ via (2), that is, as the solutions of the dynamic extension

\dot{Y}(t)=A(t)Y(t)+b(t)y(t),\quad\dot{\Phi}(t)=A(t)\Phi(t)+b(t)\phi^{\top}(t),

(4eiqrag)

and initial conditions $Y(0)={\bf 0}_{p\times 1}$ and $\Phi(0)={\bf 0}_{p\times p}$ .

i) The extended NLPRE $Y(t)=\Phi(t){\cal G}(\theta)$ holds.

ii) The signals $Y(t)$ and $\Phi(t)$ satisfy (LABEL:ylsd) and (LABEL:philsd), respectively, with $\hat{\eta}(t)\in\mathbb{R}^{p}$ and $F(t)\in\mathbb{R}^{p\times p}$ solutions of the differential equations (LABEL:thegt1) and (LABEL:dotf), respectively.

Proof.

The fact that the extended NLPRE $Y(t)=\Phi(t){\cal G}(\theta)$ holds follows trivially from linearity of the operator ${\cal H}$ .

To prove the claim (ii) we invoke (LABEL:ylsd) and do the following calculations

In the same spirit as above we compute the time derivative of $\Phi(t)$ as defined in (LABEL:philsd) to get

This completes the proof. $\Box\Box\Box$

Remark 4.

It is important to note that the relation $Y(t)=\Phi(t){\cal G}(\theta)$ imposes the constraint $Y(0)=\Phi(0){\cal G}(\theta)$ , which is satisfied with the zero initial conditions imposed in Proposition 3. As expected, this choice is consistent with the choice of initial conditions for $\hat{\eta}(t)$ and $F(t)$ given in Proposition 1.

Remark 5.

The dynamic extension (LABEL:dotf) and (4eiqrag) provides an alternative to the construction of the proposed estimator. The relationship between the two implementations boils down to a standard diffeomorphic change of coordinates. Indeed, while the state of the system in (4) and (LABEL:dotzet) is given by $\mbox{col}(\hat{\eta}(t),\mbox{vec}(F(t)),z(t),\hat{\theta}(t))\in\mathbb{R}^{(p+p^{2}+1+q)}$, the state of the system of Proposition 3 is $\mbox{col}(Y(t),\mbox{vec}(\Phi(t)),z(t),\hat{\theta}(t))\in\mathbb{R}^{(p+p^{2}+1+q)}$, and the first two components are related by a simple invertible coordinate change

\begin{bmatrix}\hat{\eta}(t)\\ \mbox{vec}(F(t))\end{bmatrix}=\begin{bmatrix}Y(t)+[I_{p}-\Phi(t)]\eta_{0}\\ {1\over z(t)f_{0}}\mbox{vec}(I_{p}-\Phi(t))\end{bmatrix}.

However, the original implementation (4) clearly reveals the mechanism underlying the operation of the estimator, namely, the use of a classical LS update and the creation of the extended NLPRE exploiting the well-known property of LS (LABEL:prols).⁹

⁹ To the best of the authors’ knowledge, this property was first reported in [11, equation (17)] and was widely used for the implementation of projections in indirect adaptive controllers [27].
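The coordinate change of Remark 5 can be evaluated directly. The following is a minimal sketch in plain Python, assuming small dense matrices stored as nested lists; the function names and the numerical values of $\eta_{0}$ , $f_{0}$ and $z$ are illustrative choices, not taken from the propositions.

```python
def identity(p):
    """p x p identity matrix as nested lists."""
    return [[1.0 if i == j else 0.0 for j in range(p)] for i in range(p)]


def to_ls_coordinates(Y, Phi, z, eta0, f0):
    """Map (Y, Phi, z) to (eta_hat, F) with the coordinate change of Remark 5.

    eta_hat = Y + (I_p - Phi) eta0
    F       = (I_p - Phi) / (z f0)   (vec is linear, so it acts entrywise)
    """
    p = len(Y)
    eye = identity(p)
    M = [[eye[i][j] - Phi[i][j] for j in range(p)] for i in range(p)]
    eta_hat = [Y[i] + sum(M[i][j] * eta0[j] for j in range(p)) for i in range(p)]
    F = [[M[i][j] / (z * f0) for j in range(p)] for i in range(p)]
    return eta_hat, F


# Zero initial conditions Y(0) = 0, Phi(0) = 0 as in Proposition 3.
# eta0, f0 and z = 1 are hypothetical values used only for illustration.
eta_hat0, F0 = to_ls_coordinates(
    Y=[0.0, 0.0],
    Phi=[[0.0, 0.0], [0.0, 0.0]],
    z=1.0,
    eta0=[0.1, 0.1],
    f0=4.0,
)
```

With zero $Y$ and $\Phi$ the map returns $\hat{\eta}=\eta_{0}$ and $F=\frac{1}{zf_{0}}I_{p}$ , which is what the displayed formula gives when evaluated at the initial time.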

5 Robustness Analysis of the CT LS+D Estimator

In this section we analyze the robustness vis-à-vis additive perturbations of the CT LS+D estimator of Proposition 1. That is, we consider the perturbed NLPRE

y(t)=\phi^{\top}(t){\cal G}(\theta)+d(t),

(4eiqragah)

where $d(t)$ represents an additive perturbation signal. This signal may come from additive noise in the measurements of $y(t)$ and $\phi(t)$ or time variations of the parameters, that is, $d(t)$ may be decomposed as

d(t)=d_{y}(t)+d^{\top}_{\theta}(t)\phi(t)+d^{\top}_{\phi}(t){\cal G}(\theta),

where $d_{y}(t)\in\mathbb{R}$ and $d_{\phi}(t)\in\mathbb{R}^{p}$ represent the measurement noise added to $y(t)$ and $\phi(t)$ , respectively, and $d_{\theta}(t)\in\mathbb{R}^{p}$ captures time variations in the parameters. We make the reasonable assumption that these signals are all bounded and prove that the CT LS+D estimator defines a bounded-input-bounded-state (BIBS) stable system.
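As a small numerical illustration of the decomposition above, the sketch below evaluates the lumped perturbation $d(t)$ at one time instant and compares it with the bound $|d|\leq|d_{y}|+|d_{\theta}||\phi|+|d_{\phi}||{\cal G}(\theta)|$ that follows from the Cauchy-Schwarz inequality. All numerical values and function names are hypothetical.

```python
import math


def dot(a, b):
    """Euclidean inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def norm(a):
    """Euclidean norm."""
    return math.sqrt(dot(a, a))


def lumped_disturbance(d_y, d_theta, d_phi, phi, G_theta):
    """d = d_y + d_theta^T phi + d_phi^T G(theta)."""
    return d_y + dot(d_theta, phi) + dot(d_phi, G_theta)


def disturbance_bound(d_y, d_theta, d_phi, phi, G_theta):
    """Cauchy-Schwarz bound on |d| in terms of the individual norms."""
    return abs(d_y) + norm(d_theta) * norm(phi) + norm(d_phi) * norm(G_theta)


# Illustrative values only (p = 3).
phi = [1.0, -2.0, 0.5]
G_theta = [2.0, 3.0, 1.0]
d_y, d_theta, d_phi = 0.01, [0.02, 0.0, -0.01], [0.0, 0.03, 0.01]

d = lumped_disturbance(d_y, d_theta, d_phi, phi, G_theta)
bound = disturbance_bound(d_y, d_theta, d_phi, phi, G_theta)
```

The bound makes explicit why boundedness of $\phi(t)$ is needed, in addition to boundedness of the three noise terms, for $d(t)$ to be bounded.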

The main result is summarized in the proposition below.

Proposition 4.

Consider the perturbed NLPRE (4eiqragah) with $d(t)$ a bounded signal. Assume the regressor $\phi(t)$ is IE. The LS+D estimator of Proposition 1 applied to this NLPRE is BIBS stable.

Proof.

In the light of Remark 3, to carry out the proof we rely on the alternative implementation of the extended NLPRE given in Proposition 3. Applying the operator ${\cal H}$ of Proposition 3 to the perturbed NLPRE (4eiqragah) yields the perturbed version of the extended NLPRE (LABEL:extnlpre) as

Y(t)=\Phi(t){\cal G}(\theta)+{\cal H}[d](t),

(4eiqragai)

where we exploited the property of linearity of ${\cal H}$ . Next we proceed to show that the operator ${\cal H}$ is BIBO-stable. This is done by proving that, for all bounded $d(t)$ , the signal ${\cal H}[d](t)$ is also bounded.

The signal ${\cal H}[d](t)$ is generated via the CT LTV system

Defining $W(x_{d}):={1\over 2}x_{d}^{\top}(t)F^{-1}(t)x_{d}(t)$ , we have

As $\beta(t)>0$ and $d(t)$ is bounded, this proves that $x_{d}(t)={\cal H}[d](t)$ is also bounded.

From the analysis above, we conclude that the operator ${\cal H}$ is BIBO-stable. Consequently, since $\phi(t)$ and $y(t)$ are bounded, it follows that $Y(t)={\cal H}[y](t)$ and $\Phi(t)={\cal H}[\phi^{\top}](t)$ are also bounded. It only remains to prove that $\hat{\theta}(t)$ is bounded. To this end, multiplying (4eiqragai) by $\mbox{adj}\{\Phi(t)\}$ we get the following perturbed NLPRE

{\cal Y}(t)=\Delta(t){\cal G}(\theta)+\xi(t),

(4eiqragaj)

where we defined the signal

\xi(t):=\mbox{adj}\{\Phi(t)\}x_{d}(t).

(4eiqragak)

We notice that this signal is bounded. Replacing (4eiqragak) in the estimator (LABEL:thet1c) yields

Computing the derivative of the Lyapunov function candidate (LABEL:lyafunv) we get

The proof of boundedness of $\tilde{\theta}(t)$ is completed recalling that in Proposition 1 it is shown that $\Delta(t)$ is PE. $\Box\Box\Box$
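The mechanism behind this last step can be illustrated on a scalar toy problem. The sketch below assumes a standard scalar gradient update $\dot{\hat{\theta}}=\gamma\Delta({\cal Y}-\Delta\hat{\theta})$ with ${\cal G}(\theta)=\theta$ , which is not necessarily the estimator (LABEL:thet1c), so that the error obeys $\dot{\tilde{\theta}}=-\gamma\Delta^{2}\tilde{\theta}+\gamma\Delta\xi$ . The constant $\Delta$ , the perturbation $\xi$ , the gain and the step size are all hypothetical, and the integration is a plain forward Euler scheme.

```python
import math


def simulate_error(gamma, delta, xi, theta_tilde0, dt, steps):
    """Forward-Euler integration of
    d(theta_tilde)/dt = -gamma*Delta^2*theta_tilde + gamma*Delta*xi."""
    theta_tilde = theta_tilde0
    history = [theta_tilde]
    for k in range(steps):
        t = k * dt
        dlt = delta(t)
        theta_tilde += dt * (-gamma * dlt ** 2 * theta_tilde + gamma * dlt * xi(t))
        history.append(theta_tilde)
    return history


def delta(t):
    """Constant scalar regressor (hence PE); assumed for illustration."""
    return 0.5


def xi(t):
    """Bounded perturbation with |xi| <= 0.2; assumed for illustration."""
    return 0.2 * math.sin(3.0 * t)


history = simulate_error(gamma=4.0, delta=delta, xi=xi,
                         theta_tilde0=1.0, dt=1e-3, steps=20000)
peak = max(abs(v) for v in history)          # largest error over the run
tail = max(abs(v) for v in history[-5000:])  # largest error over the last 5 s
```

In this toy setting the error never exceeds $\max\{|\tilde{\theta}(0)|,\sup|\xi|/|\Delta|\}$ , so a bounded $\xi$ produces a bounded $\tilde{\theta}$ as long as $\Delta$ stays away from zero often enough, which is the role played by the PE property of $\Delta(t)$ in the proof.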

6 Simulation Examples

In this section we present simulations of the proposed CT and DT estimators using different examples recently reported in the literature.

6.1 Example 5 of [29]

Consider the second order stable, CT, linear system described by

\begin{aligned}\dot{x}_{1}(t)&=x_{2}(t)\\ \dot{x}_{2}(t)&=-\theta_{1}x_{1}(t)-\theta_{2}x_{2}(t)+\theta_{3}u(t)\\ y(t)&=x_{1}(t),\end{aligned}

or equivalently

\ddot{x}_{1}(t)=-\theta_{1}x_{1}(t)-\theta_{2}\dot{x}_{1}(t)+\theta_{3}u(t)

(4eiqragal)

where $\theta_{1}$ , $\theta_{2}$ and $\theta_{3}$ are unknown parameters. Applying the filter

H({\mathfrak{p}})={1\over{{\mathfrak{p}}+\lambda}}

where ${\mathfrak{p}}:=\frac{d}{dt}$ , to both sides of (4eiqragal) and rearranging the terms, we get the LPRE (LABEL:lrect) with

y(t)={\mathfrak{p}}H({{\mathfrak{p}}})[x_{2}](t),\quad\phi(t)=H({{\mathfrak{p}}})[\mbox{col}(-x_{1}(t),\,-{\mathfrak{p}}x_{1}(t),\,u(t))],

(4eiqragam)

and $\theta:=\mbox{col}(\theta_{1},\,\theta_{2},\,\theta_{3})$ .
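The construction of the filtered regression can be checked numerically. The sketch below integrates the plant and the three first-order filters with forward Euler and records the largest residual $|y-\phi^{\top}\theta|$ . It uses the identity $\frac{{\mathfrak{p}}}{{\mathfrak{p}}+\lambda}=1-\frac{\lambda}{{\mathfrak{p}}+\lambda}$ to avoid differentiating $x_{2}$ . The values $\theta=\mbox{col}(2,3,1)$ and $u=5$ are the ones used in the simulations of this example; the filter constant $\lambda=1$ , the zero plant initial state and the step size are assumptions made only for this illustration.

```python
def check_lpre(theta=(2.0, 3.0, 1.0), lam=1.0, u=5.0, dt=1e-3, steps=5000):
    """Return max |y - phi^T theta| along a forward-Euler trajectory."""
    th1, th2, th3 = theta
    x1 = x2 = 0.0        # plant state (assumed zero initial conditions)
    w1 = w2 = wu = 0.0   # filter states H[x1], H[x2], H[u], zero initial conditions
    worst = 0.0
    for _ in range(steps):
        # y = p H[x2] = x2 - lam * H[x2]
        y = x2 - lam * w2
        # phi = H[col(-x1, -p x1, u)], using p x1 = x2
        phi = (-w1, -w2, wu)
        residual = y - (phi[0] * th1 + phi[1] * th2 + phi[2] * th3)
        worst = max(worst, abs(residual))
        # Derivatives are evaluated before any state is updated.
        dx1 = x2
        dx2 = -th1 * x1 - th2 * x2 + th3 * u
        dw1 = -lam * w1 + x1
        dw2 = -lam * w2 + x2
        dwu = -lam * wu + u
        x1, x2 = x1 + dt * dx1, x2 + dt * dx2
        w1, w2, wu = w1 + dt * dw1, w2 + dt * dw2, wu + dt * dwu
    return worst


max_residual = check_lpre()
```

With zero initial conditions the residual stays at rounding level. For a nonzero $x_{2}(0)$ an additional term decaying as $e^{-\lambda t}$ would appear in the residual.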

To carry out the simulations we use the same conditions as in [29], that is, we set to zero the initial conditions of the filters and the initial value of the parameter estimate, ${\hat{\theta}}(0)=0$ , and we take $\hat{\eta}(0)=\mbox{col}(0.1,\,0.1,\,0.1)$ , $u(t)=5$ and $\theta=\mbox{col}(2,3,1)$ . The tuning parameters of the proposed estimator of Corollary LABEL:cor1 were $\alpha=20.3$ , $f_{0}=4$ , $\beta=0.07$ and $\gamma=700$ . Fig. 1 shows the transient behavior of the estimated parameters, which converge to their true values. This result should be contrasted with the non-converging behavior of the estimates reported in [29] for the gradient scheme and for the modified gradient scheme proposed there.

Figure 1: Transient behavior of the estimated parameters $\hat{\theta}_{i}(t)$ with $i=1,2,3$ .

To illustrate the use of the NLPRE (1), we note that, for the chosen values of $\theta$ , the third parameter satisfies $\theta_{3}=\theta_{2}-\theta_{1}$ . Hence, after the application of the filter $H({\mathfrak{p}})$ , the system (4eiqragal) can be written as the NLPRE (1) with ${\mathcal{G}}(\theta):=\mbox{col}(\theta_{1},\,\theta_{2},\,\theta_{2}-\theta_{1})$ . Thus, using the same initial conditions and estimator gains, and verifying Assumption A1 with

Q=\left[\begin{array}[]{ccc}1&0&0\\ 0&1&0\end{array}\right],

and $\rho=1$ , we carry out a simulation to estimate only $\theta_{1}$ and $\theta_{2}$ with the estimator of Proposition 1. Fig. 2 shows the transient behavior of the estimated parameters, showing again parameter convergence.
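A quick numerical sanity check of the monotonicity requirement for this ${\mathcal{G}}$ and $Q$ is sketched below. It assumes that the condition takes the strong monotonicity form $(a-b)^{\top}Q[{\mathcal{G}}(a)-{\mathcal{G}}(b)]\geq\rho|a-b|^{2}$ , which is one common way of stating such a condition and may differ in detail from Assumption A1. The sampling range and the helper names are arbitrary.

```python
import random

Q = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 0.0]]
RHO = 1.0


def G(theta):
    """G(theta) = col(theta1, theta2, theta2 - theta1)."""
    th1, th2 = theta
    return [th1, th2, th2 - th1]


def QG(theta):
    """Product Q G(theta), a vector in R^2."""
    g = G(theta)
    return [sum(Q[i][j] * g[j] for j in range(3)) for i in range(2)]


def monotonicity_margin(a, b):
    """(a-b)^T Q [G(a)-G(b)] - rho*|a-b|^2; nonnegative when the condition holds."""
    diff = [a[i] - b[i] for i in range(2)]
    qa, qb = QG(a), QG(b)
    lhs = sum(diff[i] * (qa[i] - qb[i]) for i in range(2))
    return lhs - RHO * sum(d * d for d in diff)


random.seed(0)
margins = []
for _ in range(1000):
    a = [random.uniform(-5.0, 5.0) for _ in range(2)]
    b = [random.uniform(-5.0, 5.0) for _ in range(2)]
    margins.append(monotonicity_margin(a, b))
worst_margin = min(margins)
```

Since $Q{\mathcal{G}}(\theta)=\mbox{col}(\theta_{1},\theta_{2})=\theta$ for this choice, the inequality holds with equality for $\rho=1$ , which is consistent with the value of $\rho$ quoted in the text.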

6.2 Example 4 of [15]

Consider the first order linear system

y_{k}=\frac{\theta_{2}}{{\mathfrak{q}}-\theta_{1}}u_{k},

(4eiqragap)

where ${\mathfrak{q}}$ is the forward-shift operator and $\theta_{1}$ and $\theta_{2}$ are unknown parameters. After some simple calculations, we have that (4eiqragap) can be written as an LPRE $y_{k}=\phi_{k}^{\top}\theta$ with

\phi_{k}=\mbox{col}(y_{k-1},\,u_{k-1}),\quad\theta=\mbox{col}(\theta_{1},\,\theta_{2}).

To carry out the simulations we have also used the same initial conditions and parameters as in [15], that is, $\theta_{1}=0.4$ , $\theta_{2}=0.8$ , $\theta_{0}=\mbox{col}(0,\,0)$ and the input signal $u_{k}=1$ .¹⁰ The tuning gains of the estimator of Corollary 2 were chosen as $\beta=1$ , $f_{0}=0.14$ , $\gamma=0.4$ and initial conditions ${\hat{\eta}}_{0}=\mbox{col}(1,\,1)$ . It is important to note that for the system (4eiqragap), the estimator proposed in [15] only ensures the boundedness of $\hat{\theta}_{i}$ , with $i=1,2$ (see Fig. 3 of [15]). This should be contrasted with our estimator, which, as can be seen in Fig. 3, converges to the true value.

¹⁰ We notice that there is an unfortunate typo in the definition of $u_{k}$ in [15, Example 4].
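The excitation properties of this example can be seen from a short simulation. With a constant input the regressor $\phi_{k}=\mbox{col}(y_{k-1},u_{k-1})$ converges to a constant vector, so it is not PE, but the initial transient already makes the Gram matrix over the first two steps nonsingular. The sketch below uses the parameter values quoted above; the initial output $y_{0}=0$ is an assumption, and the batch solve at the end is only a check of identifiability, not the estimator of Corollary 2.

```python
theta1, theta2 = 0.4, 0.8


def simulate(n_steps, y0=0.0):
    """y_k = theta1*y_{k-1} + theta2*u_{k-1} with u_k = 1 (y0 is assumed)."""
    y = [y0]
    for _ in range(n_steps):
        y.append(theta1 * y[-1] + theta2 * 1.0)
    return y


y = simulate(50)

# Regressors phi_k = col(y_{k-1}, u_{k-1}) for k = 1, 2.
phi1, phi2 = (y[0], 1.0), (y[1], 1.0)

# Determinant of the Gram matrix phi1 phi1^T + phi2 phi2^T.
g11 = phi1[0] ** 2 + phi2[0] ** 2
g12 = phi1[0] * phi1[1] + phi2[0] * phi2[1]
g22 = phi1[1] ** 2 + phi2[1] ** 2
gram_det = g11 * g22 - g12 * g12

# Solve y_1 = phi_1^T theta, y_2 = phi_2^T theta by Cramer's rule.
det = phi1[0] * phi2[1] - phi2[0] * phi1[1]
theta1_rec = (y[1] * phi2[1] - phi1[1] * y[2]) / det
theta2_rec = (phi1[0] * y[2] - phi2[0] * y[1]) / det

# The output converges to the DC gain theta2 / (1 - theta1).
steady_state = theta2 / (1.0 - theta1)
```

Once $y_{k}$ has settled, every new regressor is essentially the same vector, so no further information about $\theta$ is collected. This is the situation in which interval excitation is still satisfied while persistency of excitation is not.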

6.3 Example 8 of [30]

We consider the DT system

y_{k}=-0.5y_{k-1}+0.1y_{k-2}+u_{k-1}-0.4u_{k-2},

which switches for $k\geq k_{c}$ to

y_{k}=1.4y_{k-1}-0.3y_{k-2}+u_{k-1}-1.3u_{k-2}.

Note that for $k\geq k_{c}$ the plant is unstable and not minimum-phase. The initial conditions are $y_{-1}=-0.2$ , $y_{-2}=0.4$ , and $u_{-1}=u_{-2}=0$ , and $k_{c}=50$ .

The system can be written in the form (4eiqrad) setting ${\cal G}(\theta^{*}_{\sigma_{k}})=\theta^{*}_{\sigma_{k}}$ ,

\begin{gathered}\phi_{k}:=\begin{bmatrix}-y_{k-1}&-y_{k-2}&u_{k-1}&u_{k-2}\end{bmatrix}^{\top},\\ \theta^{*}_{1}:=\begin{bmatrix}0.5&-0.1&1&-0.4\end{bmatrix}^{\top},\\ \theta^{*}_{2}:=\begin{bmatrix}-1.4&0.3&1&-1.3\end{bmatrix}^{\top},\end{gathered}

and

\sigma_{k}=\begin{cases}1&\mbox{for }k<k_{c},\\ 2&\mbox{for }k\geq k_{c},\end{cases}

which corresponds to $t_{r,0}=0$ and $t_{r,1}=k_{c}$ .
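The regression form can be cross-checked against the two difference equations. The sketch below runs the plant in open loop with an arbitrary test input (an assumption, since the example itself operates in closed loop) and compares the output of the difference equations with $\phi_{k}^{\top}\theta^{*}_{\sigma_{k}}$ . It also computes the poles and the zero of the plant after the switch from the coefficients given above.

```python
import math

THETA = {1: [0.5, -0.1, 1.0, -0.4],
         2: [-1.4, 0.3, 1.0, -1.3]}
K_C = 50


def sigma(k):
    """Switching signal: mode 1 before k_c, mode 2 afterwards."""
    return 1 if k < K_C else 2


def plant_step(k, y1, y2, u1, u2):
    """Difference equations of the example (y1 = y_{k-1}, y2 = y_{k-2}, ...)."""
    if k < K_C:
        return -0.5 * y1 + 0.1 * y2 + u1 - 0.4 * u2
    return 1.4 * y1 - 0.3 * y2 + u1 - 1.3 * u2


def regression_step(k, y1, y2, u1, u2):
    """Same output computed as phi_k^T theta*_{sigma_k}."""
    phi = [-y1, -y2, u1, u2]
    return sum(p * t for p, t in zip(phi, THETA[sigma(k)]))


def u_test(k):
    """Arbitrary open-loop test input, used only for this check."""
    return math.sin(0.3 * k)


y1, y2, u1, u2 = -0.2, 0.4, 0.0, 0.0   # initial conditions of the example
max_rel_gap = 0.0
for k in range(80):
    y_plant = plant_step(k, y1, y2, u1, u2)
    y_reg = regression_step(k, y1, y2, u1, u2)
    gap = abs(y_plant - y_reg) / (1.0 + abs(y_plant))
    max_rel_gap = max(max_rel_gap, gap)
    y2, y1 = y1, y_plant
    u2, u1 = u1, u_test(k)

# After the switch: poles are the roots of z^2 - 1.4 z + 0.3, zero is at z = 1.3.
disc = 1.4 ** 2 - 4.0 * 0.3
poles_after = [(1.4 + math.sqrt(disc)) / 2.0, (1.4 - math.sqrt(disc)) / 2.0]
zero_after = 1.3
```

One pole and the zero lie outside the unit circle after the switch, in agreement with the remark that the plant is then unstable and not minimum-phase.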

In this example, we consider indirect adaptive pole placement for reference tracking, where the reference signal is denoted by $r_{k}$ . The control signal $u_{k}$ is then given by

u_{k}=c_{1,k}y_{k}+c_{2,k}y_{k-1}+c_{3,k}u_{k-1}+c_{4,k}r_{k},

where the time-varying coefficients $c_{1,k}$ , $c_{2,k}$ , $c_{3,k}$ , and $c_{4,k}$ are computed based on the current parameter estimate $\hat{\theta}_{k}$ to provide the desired poles and unit gain of the closed-loop system; if for a value of $\hat{\theta}_{k}$ the computations are ill-conditioned, then $u_{k}=0$ is chosen. For this example, the desired closed-loop poles are $e^{-1}$ , $e^{-0.5+0.86\sqrt{-1}}$ , and $e^{-0.5-0.86\sqrt{-1}}$ , and the reference signal is $r_{k}\equiv 1$ .
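For reference, the desired closed-loop characteristic polynomial implied by these three poles can be obtained by multiplying out the factors $(z-p_{i})$ . The sketch below does only this step; the computation of the controller coefficients $c_{i,k}$ from $\hat{\theta}_{k}$ is not reproduced, and the helper name is hypothetical.

```python
import cmath
import math

# Desired closed-loop poles quoted in the example.
desired_poles = [
    cmath.exp(-1.0),
    cmath.exp(complex(-0.5, 0.86)),
    cmath.exp(complex(-0.5, -0.86)),
]


def poly_from_roots(roots):
    """Coefficients, highest power first, of prod_i (z - r_i)."""
    coeffs = [1.0 + 0.0j]
    for r in roots:
        nxt = coeffs + [0.0j]          # multiply the current polynomial by z
        for i, c in enumerate(coeffs):
            nxt[i + 1] -= r * c        # subtract r times the current polynomial
        coeffs = nxt
    return coeffs


coeffs = poly_from_roots(desired_poles)
char_poly = [c.real for c in coeffs]          # real, since complex poles come in a conjugate pair
max_imag = max(abs(c.imag) for c in coeffs)   # should be at rounding level
all_stable = all(abs(p) < 1.0 for p in desired_poles)
```

All three poles have modulus $e^{-1}$ or $e^{-0.5}$ , so they lie strictly inside the unit circle, and the constant term of the polynomial equals $-e^{-2}$ .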

To estimate the parameters, we apply the resetting-based estimator (4eiqrae), (4eiqraf), where we set $f_{0}=0.4$ , $\gamma=500$ , $\eta_{0}=0$ , and $\theta_{0}=\begin{bmatrix}0.1&-0.3&0.5&-0.05\end{bmatrix}$ . Note that $\theta_{0}$ cannot be chosen as zero, since such a choice yields zero input to the system, in which case the regressor $\phi$ is not IE; for a nonzero choice of $\theta_{0}$ , the interval excitation is provided by the transients of the plant.

The simulation results are depicted in Fig. 4 for the output signal $y$ and in Fig. 5 for the estimation errors $\tilde{\theta}_{k}=\hat{\theta}_{k}-\theta^{*}_{\sigma_{k}}$ . It can be observed that, after the switch, the parameter estimation errors remain almost constant for approximately $30$ steps, and then quickly converge. Further investigation shows that the regressor $\phi$ is not exciting on this initial interval, and thus the estimation does not progress. As soon as the IE condition is satisfied, the estimates $\hat{\theta}_{k}$ converge to the true value $\theta^{*}_{2}$ .

7 Concluding Remarks

We have presented in this paper a new robust DREM-based parameter estimator that ensures global exponential convergence of the parameter errors under the weakest excitation assumption, namely, identifiability of the LPRE, which is in fact necessary for the off-line or on-line estimation of the parameters. The main features of the estimator are: (i) it relies on the use of a high performance LS search, in contrast to the usually slower gradient descents; (ii) it ensures component-wise monotonicity of the parameter estimation errors; (iii) it incorporates a forgetting factor avoiding the well-known covariance wind-up problem of LS; (iv) it is applicable to NLPREs that are separable and monotonic, as well as to switched parameters; (v) it constructs the extended regressor avoiding the use of the computationally demanding GPEBO technique, exploiting instead the key structural property of the LS estimator captured in (LABEL:prols); and (vi) CT and DT implementations of the estimator are given. Several simulation results, borrowed from the literature, show the superior performance of the proposed estimator.

Acknowledgments

The first author is grateful to Dr. Michelangelo Bin for a detailed explanation of his important results reported in [5] and to Prof. Han-Fu Chen for bringing to his attention the use of the mixing step of DREM in [8]. He also thanks Dr. Lei Wang for his help in the derivation of the CT version of the LS+D with a forgetting factor.

CRediT authorship contribution statement

All authors equally contributed to the paper.

References

[1] S. Aranovskiy, A. Bobtsov, R. Ortega and A. Pyrkin, Performance enhancement of parameter estimators via dynamic regressor extension and mixing, IEEE Trans. Automatic Control, vol. 62, pp. 3546-3550, 2017.
[2] A. Astolfi, D. Karagiannis and R. Ortega, Nonlinear and Adaptive Control with Applications, vol. 187, Springer, London, 2008.
[3] N. E. Barabanov and R. Ortega, On global asymptotic stability of $\dot{x}=\phi(t)\phi^{\top}(t)x$ with $\phi(t)$ bounded and not persistently exciting, Systems and Control Letters, vol. 109, pp. 24-27, 2017.
[4] A. Belov, R. Ortega and A. Bobtsov, Guaranteed performance adaptive identification scheme of discrete-time systems using dynamic regressor extension and mixing, 18th IFAC Symposium on System Identification, (SYSID 2018), Stockholm, Sweden, July 9-11, 2018.
[5] M. Bin, Generalized recursive least squares: Stability, robustness, and excitation, Systems & Control Letters, vol. 161, 105144, 2022.
[6] A. Bobtsov, B. Yi, R. Ortega and A. Astolfi, Generation of new exciting regressors for consistent on-line estimation of a scalar parameter, IEEE Trans. Automatic Control, DOI: 10.1109/TAC.2022.3159568, 2022.
[7] A. L. Bruce, A. Goel and D. S. Bernstein, Necessary and sufficient regressor conditions for the global asymptotic stability of recursive least squares, Systems & Control Letters, vol. 157, 10500527, 2021.
[8] H. F. Chen and W. Zhao, Recursive Identification and Parameter Estimation, CRC Press, 2014.
[9] G. Chowdhary, T. Yucelen, M. Muhlegg and E. N. Johnson, Concurrent learning adaptive control of linear systems with exponentially convergent bounds, International Journal of Adaptive Control and Signal Processing, vol. 27, no. 4, pp. 280-301, 2013.
[10] Y. Cui, J. E. Gaudio and A. M. Annaswamy, A new algorithm for discrete-time parameter estimation, arXiv:2103.16653, 2021.
[11] Ph. de Larminat, On the stabilizability condition in indirect adaptive control, Automatica, vol. 20, no. 6, pp. 793-795, 1984.
[12] B. P. Demidovich, Dissipativity of nonlinear systems of differential equations, Vestnik Moscow State University, Ser. Mat. Mekh., Part I-6, (1961) pp. 19-27; Part II-1, (1962), pp. 3-8, (in Russian).
[13] D. Efimov, N. Barabanov and R. Ortega, Robustness of linear time-varying systems with relaxed excitation, Int. J. on Adaptive Control and Signal Processing, vol. 33, no. 12, pp. 1885-1900, 2019.
[14] D. Gerasimov, R. Ortega and V. Nikiforov, Adaptive control of multivariable systems with reduced knowledge of high frequency gain: Application of dynamic regressor extension and mixing estimators, 18th IFAC Symposium on System Identification, (SYSID 2018), Stockholm, Sweden, July 9-11, 2018.
[15] A. Goel, A. L. Bruce and D. S. Bernstein, Recursive least squares with variable-direction forgetting: Compensating for the loss of persistency, IEEE Control Systems Magazine, vol. 40, no. 4, pp. 80-102, 2020.
[16] G. Goodwin and K. Sin, Adaptive Filtering Prediction and Control, Prentice-Hall, 1984.
[17] M. Korotina, S. Aranovskiy, R. Ushirobira and A. Vedyakov, On parameter tuning and convergence properties of the DREM procedure, 2020 European Control Conference (ECC20), Saint Petersburg, Russia, May 12-15, 2020.
[18] M. Korotina, J. G. Romero, S. Aranovskiy, A. Bobtsov and R. Ortega, Persistent excitation is unnecessary for on-line exponential parameter estimation: a new algorithm that overcomes this obstacle, Systems & Control Letters, vol. 159, doi.org/10.1016/j.sysconle.2021.105079, 2022.
[19] J. Krause and P. Khargonekar, Parameter information content of measurable signals in direct adaptive control, IEEE Trans. on Automatic Control, vol. 32, no. 9, pp. 802-810, 1987.
[20] G. Kreisselmeier and G. Rietze-Augst, Richness and excitation on an interval—with application to continuous-time adaptive control, IEEE Trans. Automatic Control, vol. 35, no. 2, pp. 165-171, 1990.
[21] G. Kreisselmeier, Adaptive observers with exponential rate of convergence, IEEE Trans. Automatic Control, vol. 22, no. 1, pp. 2-8, 1977.
[22] M. Krstic, On using least-squares updates without regressor filtering in identification and adaptive control of nonlinear systems, Automatica, vol. 45, pp. 731-735, 2009.
[23] D. Liberzon, Switching in Systems and Control, vol. 190, Springer, 2003.
[24] P.M. Lion, Rapid identification of linear and nonlinear systems, AIAA Journal, vol. 5, pp. 1835-1842, 1967.
[25] L. Ljung, System Identification: Theory for the User, Prentice Hall, New Jersey, 1987.
[26] L. Ljung and T. Soderstrom, Theory and Practice of Recursive Identification, MIT Press, 1983.
[27] R. Lozano and X. Zhao, Adaptive pole placement without excitation probing signals, IEEE Trans. Automatic Control, vol. 39, no. 1, pp. 47-58, 1994.
[28] R. Lozano and C. Canudas, Passivity-based adaptive control of mechanical manipulators using LS-type estimation, IEEE Trans. Automatic Control, vol. 35, no. 12, pp. 1363-1365, 1990.
[29] R. Marino and P. Tomei, On exponentially convergent parameter estimation with lack of persistency of excitation, Systems & Control Letters, vol. 159, 105080, 2022.
[30] T. Nguyen, S. Islam, D. Bernstein and I. Kolmanovsky, Predictive cost adaptive control: A numerical investigation of persistency, consistency, and exigency, IEEE Control Systems Magazine, vol. 41, pp. 64-96, 2021.
[31] R. Ortega, An on-line least-squares parameter estimator with finite convergence time, Proc. IEEE, vol. 76, no. 7, 1988.
[32] R. Ortega, A. Bobtsov, A. Pyrkin and S. Aranovskiy, A parameter estimation approach to state observation of nonlinear systems, Systems and Control Letters, vol. 85, pp. 84-94, 2015.
[33] R. Ortega, D. Gerasimov, N. Barabanov and V. Nikiforov, Adaptive control of linear multivariable systems using dynamic regressor extension and mixing estimators: Removing the high-frequency gain assumption, Automatica, vol. 110, 108589, 2019.
[34] R. Ortega, V. Nikiforov and D. Gerasimov, On modified parameter estimators for identification and adaptive control: a unified framework and some new schemes, Annual Reviews in Control, vol. 50, pp. 278-293, 2020.
[35] R. Ortega, A. Bobtsov, N. Nikolayev, J. Schiffer and D. Dochain, Generalized parameter estimation-based observers: Application to power systems and chemical-biological reactors, Automatica, vol. 129, 109635, 2021.
[36] R. Ortega, V. Gromov, E. Nuño, A. Pyrkin and J. G. Romero, Parameter estimation of nonlinearly parameterized regressions: application to system identification and adaptive control, Automatica, vol. 127, 109544, 2021.
[37] R. Ortega, Comments on recent claims about trajectories of control systems valid for particular initial conditions, Asian Journal of Control, DOI: 10.1002/asjc.2512, 2021.
[38] R. Ortega, S. Aranovskiy, A. Pyrkin, A. Astolfi and A. Bobtsov, New results on parameter estimation via dynamic regressor extension and mixing: Continuous and discrete-time cases, IEEE Trans. Automatic Control, vol. 66, no. 5, pp. 2265-2272, 2021.
[39] R. Ortega, A. Bobtsov and N. Nikolayev, Parameter identification with finite-convergence time alertness preservation, IEEE Control Systems Letters, vol. 6, pp. 205-210, 2022.
[40] Y. Pan and H. Yu, Composite learning robot control with guaranteed parameter convergence, Automatica, vol. 89, pp. 398-406, 2018.
[41] Y. Pan, S. Aranovskiy, A. Bobtsov, and H. Yu, Efficient learning from adaptive control under sufficient excitation, International Journal of Robust and Nonlinear Control, vol. 29, pp. 3111-3124, 2019.
[42] A. Pavlov, A. Pogromsky, N. van de Wouw and H. Nijmeijer, Convergence dynamics, a tribute to Boris Pavlovich Demidovich, Systems & Control Letters, vol. 52, pp. 257-261, 2004.
[43] L. Praly, Convergence of the gradient algorithm for linear regression models in the continuous and discrete-time cases, Int. Rep. MINES ParisTech, Centre Automatique et Systèmes, December 26, 2017.
[44] A. Pyrkin, R. Ortega, V. Gromov, A. Bobtsov and A. Vedyakov, A globally convergent direct adaptive pole-placement controller for nonminimum phase systems with relaxed excitation assumptions, Int. J. on Adaptive Control and Signal Processing, vol. 33, pp. 1491-1505, 2019.
[45] C. Rao and H. Toutenburg, Linear Models: Least Squares and Alternatives, Springer Series in Statistics (3rd ed.), Berlin, Springer, 2008.
[46] W. Rudin, Principles of Mathematical Analysis, 3rd ed., NY: McGraw-Hill, Inc, 1976.
[47] W.J. Rugh, Linear Systems Theory, 2nd ed., Prentice Hall, NJ, 1996.
[48] S. Sastry and M. Bodson, Adaptive Control: Stability, Convergence and Robustness, Prentice-Hall, New Jersey, 1989.
[49] V. Shaferman, M. Schwegel, T. Gluck and A. Kugi, Continuous-time least-squares forgetting algorithms for indirect adaptive control, European Journal of Control, vol. 62, pp. 105-112, 2021.
[50] H. Shin and H. Lee, A new exponential forgetting algorithm for recursive least-squares parameter estimation, arXiv:2004.03910, 2020.
[51] J.-J. E. Slotine and W. Li, Applied Nonlinear Control, Prentice-Hall, New Jersey, USA, 1991.
[52] G. Tao, Adaptive Control Design and Analysis, vol. 37, John Wiley & Sons, New Jersey, 2003.
[53] L. Wang, R. Ortega, A. Bobtsov, J. G. Romero and B. Yi, Identifiability implies robust, globally exponentially convergent on-line parameter estimation: Application to model reference adaptive control, arXiv:2108.08436, 2021.
[54] Z. Wu, M. Ma, X. Xu, B. Liu and Z. Yu, Predefined-time parameter estimation via modified dynamic regressor extension and mixing, Journal of the Franklin Institute, doi.org/10.1016/j.jfranklin.2021.06.028, 2021.
[55] B. Yi and R. Ortega, Conditions for convergence of dynamic regressor extension and mixing parameter estimators using LTI filters, IEEE Trans. Automatic Control, 10.1109/TAC.2022.3149964, 2022.

Appendix A

In this appendix we briefly review the main steps in the construction of DREM-based estimators proceeding from the NLPRE (1). For the sake of brevity we restrict ourselves to CT versions, with the DT ones constructed verbatim. The interested reader is referred to [38] for further details on these constructions.

Derivation of classical DREM-based estimators

S1

Starting from the NLPRE $y(t)=\phi^{\top}(t){\cal G}(\theta)$ , with $y(t)\in\mathbb{R},\;\phi(t)\in\mathbb{R}^{p}$ measurable signals, ${\cal G}:\mathbb{R}^{q}\to\mathbb{R}^{p},\;p\geq q$ and $\theta\in\mathbb{R}^{q}$ a constant vector of unknown parameters.
S2

(Creation of the extended regressor) Inclusion of a free, stable, linear operator ${\cal H}:u(t)\to U(t)$ , with $u(t)\in\mathbb{R}$ and $U(t)\in\mathbb{R}^{p}$ , via its state space realization

with $A(t)\in\mathbb{R}^{p\times p},\;b(t)\in\mathbb{R}^{p}$ . Upon application to the NLPRE above, create a new extended NLPRE

$Y(t)=\Phi(t){\cal G}(\theta)$ (4eiqragat)

with

We underscore the fact that the new extended regressor $\Phi(t)$ is a square matrix.
S3

(Lion’s and Kreisselmeier REs) For Lion’s RE [24] we select for ${\cal H}$ the LTI filter

$A:=\mbox{diag}\{-a_{i}\},\;b:=\mbox{col}(b_{1},\dots,b_{p}),$

with $\;b_{i}\neq b_{j},a_{i}\neq a_{j}>0,\;(i,j)\in\bar{p}$ .

For Kreisselmeier RE we select LTV operators with

$A:=\mbox{diag}\{-a_{i}\},\;b(t):=\phi(t),\;a_{i}>0,\;i\in\bar{p}.$
S4

(Mixing step) Multiplication of the extended NLPRE (4eiqragat) by the adjugate of $\Phi(t)$ to create the new NLPRE

${\cal Y}(t)=\Delta(t){\cal G}(\theta),$ (4eiqragau)

with

and scalar regressor $\Delta(t)$ . Notice that in the case of LPRE $y(t)=\phi^{\top}(t)\theta$ we obtain $q$ scalar LPREs of the form

${\cal Y}_{i}(t)=\Delta(t)\theta_{i},\;i\in\bar{q}.$
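The mixing step rests on the identity $\mbox{adj}\{\Phi\}\Phi=\det\{\Phi\}I$ . The sketch below illustrates it for a $2\times 2$ linear case at a single time instant, taking ${\cal Y}:=\mbox{adj}\{\Phi\}Y$ and $\Delta:=\det\{\Phi\}$ , the usual DREM definitions (assumed here). The matrix $\Phi$ and the vector $\theta$ are arbitrary illustrative values.

```python
def adjugate_2x2(M):
    """Adjugate (transpose of the cofactor matrix) of a 2x2 matrix."""
    (a, b), (c, d) = M
    return [[d, -b], [-c, a]]


def det_2x2(M):
    """Determinant of a 2x2 matrix."""
    (a, b), (c, d) = M
    return a * d - b * c


def matvec(M, v):
    """Matrix-vector product for nested-list matrices."""
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]


theta = [0.4, 0.8]                   # "unknown" parameters, illustrative
Phi = [[1.0, 2.0], [0.5, 3.0]]       # extended regressor at one time instant
Y = matvec(Phi, theta)               # extended LPRE: Y = Phi * theta

Delta = det_2x2(Phi)                 # scalar regressor
calY = matvec(adjugate_2x2(Phi), Y)  # mixed outputs: calY_i = Delta * theta_i

# Each component can now be recovered independently whenever Delta != 0.
theta_rec = [calY[i] / Delta for i in range(2)]
```

The coupled vector equation is thus replaced by decoupled scalar equations sharing the same regressor $\Delta$ , which is what allows each parameter to be estimated separately.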