This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Safety Embedded Control of Nonlinear Systems via Barrier States*

Hassan Almubarak1,4, Nader Sadegh2 and Evangelos A. Theodorou3 *This research was supported by the NSF CPS grant no. 1932288.1School of Electrical and Computer Engineering2The George W. Woodruff School of Mechanical Engineering3The Daniel Guggenheim School of Aerospace EngineeringGeorgia Institute of Technology, Atlanta, GA 30332, USA4 Department of Control and Instrumentation Engineering, King Fahd University of Petroleum & Minerals, Dhahran 31261, Saudi Arabiahalmubarak, sadegh, [email protected]
Abstract

In many safety-critical control systems, possibly opposing safety restrictions and control performance objectives arise. To confront such a conflict, this letter proposes a novel methodology that embeds safety into stability of control systems. The development enforces safety by means of barrier functions used in optimization through the construction of barrier states (BaS) which are embedded in the control system’s model. As a result, as long as the equilibrium point of interest of the closed loop system is asymptotically stable, the generated trajectories are guaranteed to be safe. Consequently, a conflict between control objectives and safety constraints is substantially avoided. To show the efficacy of the proposed technique, we employ barrier states with the simple pole placement method to design safe linear controls. Nonlinear optimal control is subsequently employed to fulfill safety, stability and performance objectives by solving the associated Hamilton-Jacobi-Bellman (HJB) which minimizes a cost functional that can involve the BaS. Following this further, we exploit optimal control with barrier states on an unstable, constrained second dimensional pendulum on a cart model that is desired to avoid low velocities regions where the system may exhibit some controllability loss and on two mobile robots to safely arrive to opposite targets with an obstacle on the way.

I Introduction

Control theory has been a central element in today’s fast growing, interdisciplinary technologies from simple decision making problems to terrifically complex autonomous systems. Undoubtedly, advancements in technologies are faced with unprecedented challenges. One vital challenge is safety. Nonetheless, conflicting safety restrictions and control objectives potentially appear in safety-critical control systems. The main goal of this letter is to develop a control design methodology that satisfies safety constraints and evades the possible conflict between safety constraints and control objectives. The objective is achieved by embedding the safety constraints in the system’s model through barrier states (BaS) which are provided to the control law used to attain performance objectives.

For a dynamical system, safety is classically verified by invariance of the set of permitted states. The set of permitted states is (forward) invariant if at some point in time it contains the system’s state then it contains the state for all (future) times [1]. To formally prove invariance and verify safety, barrier certificates were introduced in the control literature [2] (for a brief historical overview, see [3]). Extending the idea to control dynamical systems, a set is said to be controlled invariant, also called viable, if for any initial condition in the set, the associated trajectory is forced to be in the set for all future times using a proper control. Influenced by barrier certificates and control Lyapunov functions (CLFs), control barrier functions (CBFs) were introduced in [4], which were further developed in [5, 6, 7, 8]. CBFs can be looked at as state-dependent hard input constraints which are commonly used in quadratic programming (QP) fashion to produce a safe controller [6, 8, 8]. CBFs are becoming increasingly popular in multi-objective control due to their ability of rendering sets invariant and flexible unification with CLFs. Nonetheless, to avoid conflicts in the CLF-CBF QP, one of the conditions needs to be relaxed. For a complete review on CBFs, the reader may refer to [3].

In this letter, inspired by adopting barrier functions to establish inequality constraints forming CBFs, we utilize barrier functions to construct barrier states. Those barrier states (BaS) create barriers in the state space forcing the search of a stabilizing feedback control law to the set of safe controls. Specifically, the barrier states are appended to the model of the safety-critical dynamical system to generate a system that is safe if the equilibrium point of interest is asymptotically stable. Therefore, since safety and stability have been unified, designing a stabilizing controller for the new model means a stabilizing safe control for the original dynamical system. In other words, the safety property is embedded in the stability of the system as the feedback controller is a function of the system’s states and the barrier states, hence the name safety embedded control. The proposed method is general enough to be used with any valid barrier function. For standard barrier functions, such as Log or inverse, we show that it is possible to generate smooth or even analytic state equations for barrier states if desired. Unlike control barrier functions, no explicit knowledge of the relative degree of the system with respect to the output describing the safe region is required.

The letter is organized as follows. Section II presents the problem statement. In Section III, utilizing barrier functions, we develop barrier states used to embed safety in the stabilization problem. Subsequently, we implement the developed concept to design simple safe linear controls in Section IV. In Section V, we employ barrier states in the context of constrained optimal control to meet safety and performance objectives through minimizing an infinite horizon cost functional. In Section VI, we show the effectiveness of the proposed technique through safely stabilizing an unstable pendulum on a cart model where it is desired to avoid low velocity regions near the 90o90^{\text{o}} angle. Moreover, a multi-robot example is used where two simple mobile robots are to go to specific locations safely, i.e. without colliding while avoiding obstacles on the way. Lastly, conclusion remarks are driven in Section VII.

II Problem Statement

Consider the nonlinear control-affine dynamical system

x˙=f(x)+g(x)u\dot{x}=f(x)+g(x)u (1)

where x𝒟nx\in\mathcal{D}\subset\mathbb{R}^{n}u𝒰mu\in\mathcal{U}\subset\mathbb{R}^{m}, f:nnf:\mathbb{R}^{n}\rightarrow\mathbb{R}^{n} and g:nn×mg:\mathbb{R}^{n}\rightarrow\mathbb{R}^{n\times m} are continuously differentiable and f(0)=0f(0)=0, without loss of generality. We wish to formulate a continuous stabilizing feedback controller u=K(x)u=K(x) that renders the nonempty open subset 𝒞\mathcal{C} of 𝒟\mathcal{D} defined as 𝒞={x𝒟:h(x)>0}\mathcal{C}=\{x\in\mathcal{D}:h(x)>0\} controlled invariant, where h:𝒟h:\mathcal{D}\rightarrow\mathbb{R} is a continuously differentiable scalar valued function that represents the safe set. The set 𝒟\mathcal{D} represents the domain of operation which will be further prescribed later in the letter. The set 𝒞\mathcal{C}, referred to as the safe set, is said to be forward invariant with respect to the closed-loop system x˙=f(x)+g(x)K(x)\dot{x}=f(x)+g(x)K(x) if x(0)𝒞x(0)\in\mathcal{C} implies x(t)𝒞x(t)\in\mathcal{C}, t0\forall t\geq 0.

Definition 1.

The continuous feedback controller u=K(x)u=K(x) is said to be safe if 𝒞\mathcal{C} is forward invariant with respect to the resulting closed-loop system x˙=f(x)+g(x)K(x)\dot{x}=f(x)+g(x)K(x). That is,

h(x(t))>0t0;x(0)𝒞h(x(t))>0\ \forall t\geq 0;\ x(0)\in\mathcal{C} (2)

We refer to this as the safety condition.

Enforcement of the safety constraint may be facilitated by means of a smooth scalar valued function B:𝒞B:\mathcal{C}\rightarrow\mathbb{R} known as a barrier function (BF) in the optimization literature. Popular barrier functions include the inverse barrier function (a.k.a. Carroll barrier) B(η)=1/ηB(\eta)=1/\eta and the logarithmic barrier function B(η)=log(1+ηη)B(\eta)=\log(\frac{1+\eta}{\eta}). The main properties of those types of barrier functions are that they blow up at the boundaries of the complement of 𝒞\mathcal{C}, i.e. limη0B(η)=\lim_{\eta\rightarrow 0}B(\eta)=\infty, limηB(η)=0\lim_{\eta\rightarrow\infty}B(\eta)=0 and infη+B(η)0\inf_{\eta\in\mathbb{R}^{+}}B(\eta)\geq 0. Another choice that has the advantage of being analytic with a known power series expansion is B(η)=tanh1(eη)B(\eta)=\tanh^{-1}(e^{-\eta}). The composite BF corresponding to hh that defines 𝒞\mathcal{C} is defined to be β(x):=Bh(x)\beta(x):=B\circ h(x). Note that β(x)\beta(x)\rightarrow\infty if and only if h(x)0h(x)\rightarrow 0. The following Proposition follows from Definition 1 and the choice of BFs:

Proposition 1.

For the control system in (1), the feedback controller u=K(x)u=K(x) satisfies the safety condition (2) if and only if β(x(0))<\beta(x(0))<\infty implies β(x(t))<t>0\beta(x(t))<\infty\ \forall t>0.

III Barrier States for Safety Embedded Stabilization

As possibly conflicting safety constraints and control performance objectives need to be avoided in safety-critical control, current multi-objective frameworks, e.g. CBF-CLF safe stabilization framework, avoid possible conflicts by relaxing one of the constraints to ensure feasibility of the solution. This may result in an undesirable performance. To untangle such a problem, we propose a provable safe control technique that satisfies the safety constraints and the performance objectives simultaneously with no relaxation.

For a barrier function β(x)=B(h(x))\beta(x)=B(h(x)) where h(x)h(x) defines the safe set 𝒞\mathcal{C}, the idea is to augment the open loop system with a new state variable zz that is related to the recentered barrier function [9], β(x)β(0)\beta(x)-\beta(0) to ensure that z=0z=0 is an equilibrium state. If we simply set zz to β(x)β(0)\beta(x)-\beta(0), the resulting state equation would be

β˙(x)\displaystyle\dot{\beta}(x) =B(h(x))(Lfh(x)+Lgh(x)u)\displaystyle=B^{\prime}(h(x))(L_{f}h(x)+L_{g}h(x)u)
=ϕ0(β(x))(Lfh(x)+Lgh(x)u)\displaystyle=\phi_{0}(\beta(x))(L_{f}h(x)+L_{g}h(x)u)

where ϕ0=BB1\phi_{0}=B^{\prime}\circ B^{-1} and B1B^{-1} is the inverse of the BF. To safely stabilize the origin of (1), however, we need to ensure stabilizability of the origin of the augmented system. The main issue with augmenting β˙\dot{\beta} as the state equation is that β(x)\beta(x) becomes a redundant state and the resulting augmented system may not be stabilizable in some cases. Fortunately, this issue can be resolved by perturbing the barrier state equation through an auxiliary function ϕ1\phi_{1} that ensures stabilizability of the origin of the augmented system without affecting the safety guarantees. More specifically, we modify the state equation for zz according to

z˙=ϕ0(z+β0)h˙(x)γϕ1(z+β0,h(x))\dot{z}=\phi_{0}(z+\beta_{0})\dot{h}(x)-\gamma\phi_{1}(z+\beta_{0},h(x)) (3)

where β0=β(0)\beta_{0}=\beta(0), h˙(x)=Lfh(x)+Lgh(x)u\dot{h}(x)=L_{f}h(x)+L_{g}h(x)u, γ+\gamma\in\mathbb{R}^{+} and ϕ1(ζ,η)\phi_{1}(\zeta,\eta) is an analytic function of two variables satisfying

ϕ1(β(x),h(x))=0andϕ1ζ(β0,h(0))>0\phi_{1}(\beta(x),h(x))=0\ \text{and}\ \frac{\partial\phi_{1}}{\partial\zeta}(\beta_{0},h(0))>0

It can be seen that for all three BFs mentioned earlier, the function ϕ0:=BB1\phi_{0}:=B^{\prime}\circ B^{-1} is analytic with no singularities. Moreover, it is worth noting that both ϕ0\phi_{0} and ϕ1\phi_{1} are formulated independently of the system’s model based on the barrier function and h(x)h(x). For instance, if ζ=B(η)=1/η\zeta=B(\eta)=1/\eta, then ϕ0(ζ)=B(B1(ζ))=ζ2\phi_{0}(\zeta)=B^{\prime}(B^{-1}(\zeta))=-\zeta^{2}. Furthermore, letting ϕ1(ζ,η)=ζ(ηζ1)\phi_{1}(\zeta,\eta)=\zeta(\eta\zeta-1) satisfies the first condition ϕ1(ζ,η)=0\phi_{1}(\zeta,\eta)=0 along ζ=β(x)\zeta=\beta(x) and η=h(x)\eta=h(x) since β(x)h(x)=1\beta(x)h(x)=1 and the second condition ϕ1ζ=2ζη1|β0,h(0)=2β0h(0)1=1>0\frac{\partial\phi_{1}}{\partial\zeta}=2\zeta\eta-1|_{\beta_{0},h(0)}=2\beta_{0}h(0)-1=1>0. Table I provides the explicit expressions for ϕ0\phi_{0} and possible ϕ1\phi_{1} for most commonly used barrier functions. It should be noted that the positive scalar γ\gamma is a design parameter that adjusts the rate at which zz returns to β(x)β0\beta(x)-\beta_{0} if it deviates from it at any instant of time. As a consequence, γ\gamma has some influence on the design of the safe feedback control gains as we will demonstrate in the simulation examples.

ζ\zeta=B(η\eta) ϕ0(ζ)\phi_{0}(\zeta) ϕ1(ζ,η)\phi_{1}(\zeta,\eta)
log(1+ηη)\log(\frac{1+\eta}{\eta}) 4sinh2(ζ/2)-4\sinh^{2}(\zeta/2) η(eζ1)2eζ+1\eta(e^{\zeta}-1)^{2}-e^{\zeta}+1
1/η1/\eta ζ2-\zeta^{2} ηζ2ζ\eta\zeta^{2}-\zeta
2tanh1(eη)2\tanh^{-1}(e^{-\eta}) sinh(ζ)-\sinh(\zeta) tanh(ζ/2)eη\tanh(\zeta/2)-e^{-\eta}
TABLE I: Functions ϕ0\phi_{0} and ϕ1\phi_{1} for three barrier functions.

The following proposition shows that if z(0)z(0) is defined properly, then z(t)=β(x)β0z(t)=\beta(x)-\beta_{0} and ϕ1(z+β0,h(x))=0\phi_{1}(z+\beta_{0},h(x))=0, t0\forall t\geq 0, implying that boundedness of zz guarantees satisfaction of the safety constraint.

Proposition 2.

Suppose that z(0)=β(x(0))β(0)z(0)=\beta(x(0))-\beta(0) and β(x(0))<\beta(x(0))<\infty. Then, the auxiliary state variable z(t)z(t) generated by the perturbed state equation in (3) along the trajectories of (1) is bounded if and only if β(x(t))\beta(x(t)) is bounded t\forall t.

Proof.

We prove the Proposition by establishing that

z(t)=β(x(t))β(0),t0z(t)=\beta(x(t))-\beta(0),\ \forall t\geq 0 (4)

Subtracting β˙(x)=ϕ0(β(x))h˙(x)\dot{\beta}(x)=\phi_{0}(\beta(x))\dot{h}(x) from both sides of (3), we have z~˙(t)=ϕ~0(z~,x)h˙(x)γϕ1(z~+β(x),h(x))\dot{\tilde{z}}(t)=\tilde{\phi}_{0}(\tilde{z},x)\dot{h}(x)-\gamma\phi_{1}(\tilde{z}+\beta(x),h(x)) where z~=z+β(0)β(x)\tilde{z}=z+\beta(0)-\beta(x) and ϕ~0(z~,x)=ϕ0(z~+β(x))ϕ0(β(x))\tilde{\phi}_{0}(\tilde{z},x)=\phi_{0}(\tilde{z}+\beta(x))-\phi_{0}(\beta(x)). The preceding differential equation has an equilibrium point at z~=0\tilde{z}=0 since ϕ1(β(x),h(x))=0\phi_{1}(\beta(x),h(x))=0 and ϕ~0(0,x)=0\tilde{\phi}_{0}(0,x)=0. Thus z~(t)=0\tilde{z}(t)=0, or equivalently z(t)=β(x(t))β(0)z(t)=\beta(x(t))-\beta(0), t0\forall t\geq 0, provided that z~(0)=z(0)+β(0)β(x(0))=0\tilde{z}(0)=z(0)+\beta(0)-\beta(x(0))=0 as assumed by the hypothesis. ∎

If desired, multiple constraints can be combined to form one BF, as done in the optimization literature, to create a single BaS. Creating a single BaS is favorable in many applications and helps avoiding adding many nonlinear state equations to the safety embedded model which may increase the complexity of the controller. A single BaS that represents different constraints, however, may be highly nonlinear and may be more difficult to use to design a safety embedded control. Additionally, some flexibility on choosing design parameters for each constraint, such as penalization of the barrier states in the case of optimal control, will be lost since we will have only one barrier state. It is important to mention that in some applications, one could represent multiple constraints with one function h(x)h(x) defining the safety set and then use a single BaS.

For qq constraints, let β(x)=i=1qB(hi(x))\beta(x)=\sum_{i=1}^{q}B(h_{i}(x)). Then,

β˙(x)=i=1qBB1(z+β0j=1,jiqB(hj(x)))h˙i\dot{\beta}(x)=\sum_{i=1}^{q}B^{\prime}\circ B^{-1}\Big{(}z+\beta_{0}-\sum_{j=1,j\neq i}^{q}B\big{(}h_{j}(x)\big{)}\Big{)}\dot{h}_{i}

and therefore the BaS can be found to be

z˙=i=1q[ϕ0(z+β0j=1,jiqB(hj(x)))h˙iγiϕ1(z+β0j=1,jiqB(hj(x)),hi)]\displaystyle\begin{split}\dot{z}=\sum_{i=1}^{q}&\Bigg{[}\phi_{0}\Big{(}z+\beta_{0}-\sum_{j=1,j\neq i}^{q}B\big{(}h_{j}(x)\big{)}\Big{)}\dot{h}_{i}\\ &-\gamma_{i}\phi_{1}\Big{(}z+\beta_{0}-\sum_{j=1,j\neq i}^{q}B\big{(}h_{j}(x)\big{)},h_{i}\Big{)}\Bigg{]}\end{split} (5)

In the simulation examples, we use a single BaS to represent multiple constraints for one problem and we use multiple ones in the other to validate the proposed technique.

Now, we are in a position to create the safety embedded model. By defining z=[z1,,zq]Tz=[z_{1},\dots,z_{q}]^{\rm{T}}, if more than one BaS is used, and augmenting it to the system (1), we get

x˙=f(x)+g(x)uz˙=fb(x,z)+gb(x,z)u\begin{split}&\dot{x}=f(x)+g(x)u\\ &\dot{z}=f_{b}(x,z)+g_{b}(x,z)u\end{split} (6)

where fb(x,z)f_{b}(x,z) and gb(x,z)g_{b}(x,z) are defined according to (3) or (5). By the definition of ϕ0\phi_{0} and the restrictions imposed on ϕ1\phi_{1}, it follows that both fbf_{b} and gbg_{b} are analytic if ff, gg, and hh are analytic, and the subsystem z˙=fb(x,z)+gb(x,z)u\dot{z}=f_{b}(x,z)+g_{b}(x,z)u is stabilizable at the origin. Hence, the combined system, which can be more compactly described as

x¯˙=f¯(x¯)+g¯(x¯)u\begin{split}&\dot{\bar{x}}=\bar{f}(\bar{x})+\bar{g}(\bar{x})u\\ \end{split} (7)

where x¯=[xz],f¯=[ffb]\bar{x}=\begin{bmatrix}x\\ z\end{bmatrix},\bar{f}=\begin{bmatrix}f\\ f_{b}\end{bmatrix} with f¯(0)=0\bar{f}(0)=0 and g¯=[ggb]\bar{g}=\begin{bmatrix}g\\ g_{b}\end{bmatrix}, preserves the continuous differentiability and stabilizability of the original control system (1). Therefore, the safety constraint is embedded in the closed-loop system’s dynamics and stabilizing the safety embedded system (LABEL:new_system,_safety_augmented) implies enforcing safety for the safety-critical system (1), i.e. forward invariance of the safe set 𝒞\mathcal{C} with respect to (1).

Theorem 3.

Suppose there exists a continuous feedback controller u=K(x¯)u=K(\bar{x}) such that the origin of the safety embedded closed-loop system, x¯˙=f¯(x¯)+g¯(x¯)K(x¯)\dot{\bar{x}}=\bar{f}(\bar{x})+\bar{g}(\bar{x})K(\bar{x}), is asymptotically stable. Then, there exists an open neighborhood 𝒟\mathcal{D} of the origin such that u=K(x¯)u=K(\bar{x}) is safe with respect to the safety region 𝒞={x𝒟:h(x)>0}\mathcal{C}=\{x\in\mathcal{D}:h(x)>0\}.

Proof.

Let us assume that the origin of the embedded closed-loop system is asymptotically stable with a domain of attraction 𝒜\mathcal{A}. Then, there exist open neighborhoods 𝒳n\mathcal{X}\subset\mathbb{R}^{n} and 𝒵q\mathcal{Z}\subset\mathbb{R}^{q} of the origin with 𝒵\mathcal{Z} bounded such that 𝒳×𝒵𝒜\mathcal{X}\times\mathcal{Z}\subset\mathcal{A}. Letting β:=[β1βq]T\beta:=[\beta_{1}\;\cdots\beta_{q}]^{\rm T}, by the continuity of β~(x)=β(x)β(0)\tilde{\beta}(x)=\beta(x)-\beta(0) on {xn:h(x)>0}\{x\in\mathbb{R}^{n}:h(x)>0\}, the inverse image β~1(𝒵)\tilde{\beta}^{-1}(\mathcal{Z}) of 𝒵\mathcal{Z} by β~\tilde{\beta} is an open neighborhood of the origin. Thus 𝒟:=𝒳β1(𝒵)\mathcal{D}:=\mathcal{X}\cap\beta^{-1}(\mathcal{Z}) is also an open neighborhood of the origin. For any initial condition x(0)𝒞𝒟x(0)\in\mathcal{C}\subset\mathcal{D}, it can be seen that z(0)=β~(x(0))𝒵z(0)=\tilde{\beta}(x(0))\in\mathcal{Z} and x¯(0)𝒜\bar{x}(0)\in\mathcal{A}. Thus the trajectories x¯(t)\bar{x}(t), t0t\geq 0, are bounded and converge to zero implying that z(t)z(t) is bounded as well. By Propositions 1 and 2, β(x)\beta(x) is also bounded guaranteeing the safety of u=K(x¯)u=K(\bar{x}). ∎

The remainder of the letter is devoted to synthesizing safe feedback controllers that simultaneously stabilize (1) and satisfy the required safety constraint (2) by stabilizing the dynamical system (LABEL:new_system,_safety_augmented) which unifies the two requirements. In the next sections, we apply the proposed methodology to constrained linear control systems to generate safe linear controls using pole placement and then to constrained optimal control problems to synthesize optimal safe controllers.

IV Safety Embedded Linear Control

Consider the linear time-invariant system x˙=Ax+Bu\dot{x}=Ax+Bu subject to some safety constraint defined by a smooth function hh. Defining the associated barrier state, augmenting it and linearizing the safety embedded model (LABEL:new_system,_safety_augmented) around the origin yields the linearized safety embedded system x¯˙=A¯x¯+B¯u\dot{\bar{x}}=\bar{A}\bar{x}+\bar{B}u where

A¯=[A0n×1γϕ1x(β0,hx(0))+ϕ0(β0)hx(0)Aγϕ1z(β0,h(0))]\displaystyle\bar{A}=\begin{bmatrix}A&0_{n\times 1}\\ -\gamma\phi_{1x}\big{(}\beta_{0},h_{x}(0)\big{)}+\phi_{0}(\beta_{0})h_{x}(0)A&-\gamma\phi_{1z}\big{(}\beta_{0},h(0)\big{)}\end{bmatrix}
B¯=[Bϕ0(β0)hx(0)B]\displaystyle\bar{B}=\begin{bmatrix}B\\ \phi_{0}(\beta_{0})h_{x}(0)B\end{bmatrix}

It should be noted that this system may not be controllable. This should not pose any problem, however, as ϕ1\phi_{1} guarantees stabilizability which should be enough for us to design a safe stabilizing controller. Although we may not be able to change the location of the barrier state’s pole, it is sufficient to use the states and the barrier state to construct a safe stabilizing control. Next, we show an example where we form a barrier state to design a safe stabilizing control while the linearized system is stabilizable but not controllable.

IV-A Constrained Linear System Numerical Example

Consider the open loop unstable linear system given by

[x˙1x˙2]=[1501][x1x2]+[01]u\begin{bmatrix}\dot{x}_{1}\\ \dot{x}_{2}\end{bmatrix}=\begin{bmatrix}1&-5\\ 0&-1\end{bmatrix}\begin{bmatrix}x_{1}\\ x_{2}\end{bmatrix}+\begin{bmatrix}0\\ 1\end{bmatrix}u

Assume that it is desired to stay in the safe set 𝒞={x:(x12)2+(x22)20.52>0}\mathcal{C}=\{x:(x_{1}-2)^{2}+(x_{2}-2)^{2}-0.5^{2}>0\} and that the closed-loop system’s poles are 3-3 and 5-5. Using the inverse barrier function, the linearized safety embedded system that we will use to design the linear control will be

x¯˙=[1500104γ+47.7524γ247.752γ]x¯+[0147.752]u\dot{\bar{x}}=\begin{bmatrix}1&-5&0\\ 0&-1&0\\ \frac{4\gamma+4}{7.75^{2}}&\frac{4\gamma-24}{7.75^{2}}&-\gamma\end{bmatrix}\bar{x}+\begin{bmatrix}0\\ 1\\ \frac{4}{7.75^{2}}\end{bmatrix}u

where x¯=[x1x2z]T\bar{x}=[x_{1}\ \ x_{2}\ \ z]^{\rm{T}}. We use this augmented linear system to design a safe linear controller using the pole placement method to place the poles of the closed-loop controllable subsystem at 3-3 and 5-5, achieving the desired performance. When γ=2\gamma=2, the safe stabilizing linear control is found to be u=4.43x1+8.38x25.63zu=-4.43x_{1}+8.38x_{2}-5.63z. Note that although the controller is linear with respect to the safety embedded system, it is a nonlinear function of the original state xx provided z(0)=β(x(0))β(0)z(0)=\beta(x(0))-\beta(0). Fig. 1 shows that indeed the designed linear controller is able to safely stabilize the system.

Refer to caption
(a)
Refer to caption
(b)
Figure 1: The top figure shows numerical simulations of the closed loop system starting from different initial conditions (small circles) with different γ\gammas using the pole placement method to place the poles at (3,5)(-3,-5). The bottom figure shows a phase portrait of the closed loop system under the safety embedded linear control u=22.63x1+7.14x2+102.96zu=-22.63x_{1}+7.14x_{2}+102.96z with γ=1\gamma=1.

It is worth noting that this is a linear control design for an inherently nonlinear control problem and thus limitations and difficulties of linear controls to stabilize nonlinear systems apply. One could use any nonlinear control technique to design a safely stabilizing control. In the next section, we leverage the infinite horizon optimal control in the process of designing an efficacious optimal safe control as optimal control is a well suited paradigm for such a problem and it facilitates the design of nonlinear controllers.

V Safety Embedded Optimal Control

Consider the infinite horizon optimal control problem of minimizing some cost functional subject to the dynamics (1) and the safety condition (2). This has been a sought-after problem recently [10, 11, 12] where CBFs are used to solve constrained optimal control problems. In these efforts, it can be seen how difficult the problem is and thus various complex techniques have been developed to solve the problem. Using the proposed BaS, this problem can be solved directly using well-known unconstrained optimal control methods. A systematic approach toward solving this problem is to seek an optimal feedback controller u=K(x¯)u=K(\bar{x}) that minimizes

V(x(0),u(t))=120Q(x¯)+uTRudtV(x(0),u(t))=\frac{1}{2}\int_{0}^{\infty}Q(\bar{x})+u^{\rm{T}}Ru\;dt (8)

where Q:n+q+x0Q:\mathbb{R}^{n+q}\rightarrow\mathbb{R}^{+}\ \forall x\neq 0 is analytic and its Hessian is positive definite and R0R\succ 0, subject to (LABEL:new_system,_safety_augmented). By Theorem 3, if this optimal control problem is successfully solved, then both safety and stability requirements are met. For such an infinite horizon optimal control problem, a necessary and sufficient condition is that the well-known Hamilton-Jacobi-Belmman (HJB) equation is satisfied,

HJB:=minuVx¯(f¯(x¯)+g¯(x¯)u)+12uTRu+12Q(x¯)=0\text{HJB}:=\min_{u}\ V_{\bar{x}}^{*}\big{(}\bar{f}(\bar{x})+\bar{g}(\bar{x})u\big{)}+\frac{1}{2}u^{\rm{T}}Ru+\frac{1}{2}Q(\bar{x})=0 (9)

with a boundary condition V(0)=0V^{*}(0)=0 where VV^{*} is the optimal solution, a.k.a. the value function, and Vx¯=Vx¯V_{\bar{x}}^{*}=\frac{\partial V^{*}}{\partial\bar{x}}.

Theorem 4.

Consider the optimal control problem (LABEL:new_system,_safety_augmented)-(8) with analytic f(x),g(x)f(x),g(x) and h(x)h(x) and suppose that the pair (fx(0),g(0))\big{(}\frac{\partial f}{\partial x}(0),g(0)\big{)} is stabilizable, QQ is analytic with positive definite Hessian and R0R\succ 0. Then, there exists a unique analytic value function V(x¯)V^{*}(\bar{x}) satisfying the HJB equation (9), which yields an optimal safe feedback control

usafe(x¯)=R1g¯(x¯)Vx¯(x¯)u^{*}_{safe}(\bar{x})=-R^{-1}\bar{g}(\bar{x})V^{*}_{\bar{x}}(\bar{x}) (10)

Moreover, V(x¯)V^{*}(\bar{x}) is a Lyapunov function and usafeu^{*}_{safe} renders the origin of the closed loop system f¯(x¯)+g¯(x¯)usafe(x¯)\bar{f}(\bar{x})+\bar{g}(\bar{x})u^{*}_{safe}(\bar{x}) asymptotically stable. Therefore, the barrier state zz is bounded guaranteeing the generation of safe trajectories.

Proof.

As remarked earlier in the letter, the embedded system (LABEL:new_system,_safety_augmented) preserves the continuous differentiability and stabilizability properties of (1) and thus satisfies the analyticity and stabilizability assumptions in [13, 14, 15] hence guaranteeing the existence and uniqueness of the value function V(x¯)V^{*}(\bar{x}) and the corresponding optimal controller usafe(x¯)u^{*}_{safe}(\bar{x}) described in (10). Furthermore, the origin of the resulting closed loop system is asymptotically stable by Lyapunov stability theory [14, 16] and by Theorem 3, usafe(x¯)u^{*}_{safe}(\bar{x}) is safe which completes the proof. ∎

Various techniques have been proposed in the literature to approximate the solution to the HJB equation or the associated optimal control [17, 15, 18, 19, 20, 13, 14]. In the next section, we utilize these efforts to produce a power series solution of the value function and its gradient to produce the optimal safe control (10) for the optimal control problem (8). Specifically, we mainly utilize the recursive analytic solution proposed in [14] and the nonlinear quadratic regulator (NLQR) in [13].

VI Numerical Implementation and Examples

To demonstrate the efficacy of the presented technique to produce a safe and stabilizing control, we use a single BaS to enforce safety for a model of an inverted pendulum on a cart, and multiple barrier states for a multi-mobile robot navigation task where two point-robots are asked to go to intersecting targets while avoiding collision and avoiding some unsafe region.

VI-A Second Order Inverted Pendulum on a Cart

This system is an unstable version with a state dependent input matrix of the mechanical system with unity parameters used in [7] to show the applicability of the Control Lyapunov–Barrier Function (CLBF) approach. The system is given by

x˙1=x2x˙2=sin(x1)0.5(tanh(10x2)+x2)+cos(x1)u\begin{split}&\dot{x}_{1}=x_{2}\\ &\dot{x}_{2}=\sin(x_{1})-0.5(\tanh(10x_{2})+x_{2})+\cos(x_{1})u\end{split}

It is desired to avoid low velocities when the angle is half way through to stabilize the pendulum at the upright position to steer clear of the 90o90^{\text{o}} angle where the system loses controllability due to the cos(x1)\cos(x_{1}) term in the input matrix g(x)g(x). That is, there are two symmetrical decoupled unsafe sets and therefore the safe set can be represented by two functions h1h_{1} and h2h_{2} such that 𝒞={x𝒟|(x12)2+x22>1(x1+2)2+x22>1}\mathcal{C}=\{x\in\mathcal{D}\ |\ (x_{1}-2)^{2}+x_{2}^{2}>1\ \cap\ (x_{1}+2)^{2}+x_{2}^{2}>1\}. We pick the Carroll BF with γ=5\gamma=5 and use equation (5) with z(0)=β(x(0))β(0)z(0)=\beta(x(0))-\beta(0) to generate a single BaS. To generate the optimal safe controller, it is chosen to minimize the functional (8), with R=1,Q=1x12+50x22+0.5zR=1,\ Q=1x_{1}^{2}+50x_{2}^{2}+0.5z. It is worth mentioning that this is an optimal control problem and thus different performances can be achieved using different cost functionals.

Fig. 2 shows numerical simulations for the closed-loop system under a 3rd3^{\text{rd}} order nonlinear quadratic regulator (NLQR) [13], i.e. the approximated value function for the optimal control problem is of order four. Clearly, the proposed technique is powerful enough to generate a safe and asymptotically stable closed loop system safety is embedded in the stability of the overall system. It can be seen that there is a small cusp when the trajectories cross the 90o90^{\text{o}} angle which is a result of the lose of controllability when cos(90o)=0\cos(90^{\text{o}})=0. If low velocities were allowed at that specific region, that is if no barrier states are used, the closed loop system will go unstable.

Refer to caption
Figure 2: Numerical simulations of the closed-loop safety-critical system starting from different initial conditions (small circles). The goal is to stabilize the pendulum and avoid crossing the 90o90^{\text{o}} angle with low velocities, where the unsafe region is represented by the red circles. The proposed technique successfully generates safe trajectories and safely stabilizes the origin.

VI-B Multi Simple Mobile Robot Collision Avoidance

In this example, two simple mobile robots are asked to navigate their way toward prespecified targets. The robots are to avoid colliding as well as avoid an obstacle on their way. The robots dynamics are

x˙i=[ui1ui2],x˙j=[uj1uj2]\begin{split}\dot{x}_{i}=\begin{bmatrix}u_{i1}\\ u_{i2}\end{bmatrix},\dot{x}_{j}=\begin{bmatrix}u_{j1}\\ u_{j2}\end{bmatrix}\end{split}

To avoid collision, a BaS is featured through a barrier function that prevents the robots from getting too close to each other. We pick the maximum distance between the two robots to be δ=0.1\delta=0.1. Therefore, the associated safe set is {xi,xj𝒟|xixj2>δ2}\{x_{i},x_{j}\in\mathcal{D}\ |\ ||x_{i}-x_{j}||^{2}>\delta^{2}\}. Furthermore, we add an obstacle which is represented by a circle at (0,0)(0,0) with a radius of 0.250.25. This calls for a BaS for each agent. Hence, the overall safe set for each agent is given by 𝒞={xk𝒟|xixj2>δandxk12+xk22>0.252,k=i,j}\mathcal{C}=\{x_{k}\in\mathcal{D}\ |\ ||x_{i}-x_{j}||^{2}>\delta\;\text{and}\;x_{k1}^{2}+x_{k2}^{2}>0.25^{2},\;k=i,j\}. For this example, we select the Log BF, βl=log(1+hlhl)\beta_{l}=\log(\frac{1+h_{l}}{h_{l}}) for l=1,2,3l=1,2,3, to construct three barrier states, where the first represents the distance constraint between the two robots and the second and third barrier states represent the barriers needed to avoid the obstacle for agent ii and agent jj respectively. Using Table I, the barrier states are given by

z˙l=γl(hl(ezcl1)2ezcl+1)4sinh2(zcl/2)hlxul\dot{z}_{l}=-\gamma_{l}\big{(}h_{l}(e^{z_{cl}}-1)^{2}-e^{z_{cl}}+1\big{)}-4\sinh^{2}(z_{cl}/2)h_{lx}u_{l}

with zl(0)=βl(x(0))βl(0)z_{l}(0)=\beta_{l}(x(0))-\beta_{l}(0) for l=1,2,3l=1,2,3 where γ1=15,γ2=γ3=0.5,zcl=(zl+cl),cl=log(1+hl(0)hl(0))\gamma_{1}=15,\gamma_{2}=\gamma_{3}=0.5,z_{cl}=(z_{l}+c_{l}),c_{l}=\log(\frac{1+h_{l}(0)}{h_{l}(0)}), u1=[uiT,ujT]Tu_{1}=[u_{i}^{\rm{T}},u_{j}^{\rm{T}}]^{\rm{T}}, u2=uiu_{2}=u_{i} and u3=uju_{3}=u_{j}. The cost functional (8) is selected such that R=I4R=I_{4} and Q=xiTxi+xjTxj+0.001z12+0.5z22+0.5z32Q=x_{i}^{\rm{T}}x_{i}+x_{j}^{\rm{T}}x_{j}+0.001z_{1}^{2}+0.5z_{2}^{2}+0.5z_{3}^{2}. A 3rd3^{\text{rd}} order NLQR is used in this example. As shown in Fig. 3, using the proposed technique, we are effectively able to send the two robots safely to the targeted positions while avoiding the obstacle as well as colliding with each other. The robots get very close to each other at sometime but never get too close, i.e. the distance between them is never less than or equal to δ\delta.

Refer to caption
(a)
Refer to caption
(b)
Figure 3: Two mobile robots starting from two opposite points sent to two opposite target points with an obstacle in-between. The two robots arrive at the desired locations safely. It can be seen that the two robots start by moving in a straight line toward their target points, which is the optimal behavior, but start to steer away to avoid the obstacle and then take some turns to avoid crossing the same point at the same time and to avoid getting too close (less than δ\delta distance) to each other as shown in the bottom figure.

VII Conclusions and Future Works

A novel construction of safety embedded controls through the development of barrier states was presented. Through a proper conversion of barrier functions, barrier states were augmented to generate a nominal model which is safe if is asymptotically stable. Using barrier states, the constrained control problem was transformed to an unconstrained control problem, which makes the safety problem easier to be considered with various control techniques such as optimal control. Moreover, the BaS method is agnostic to the relative degree of the function describing the safe set with respect to the system unlike existing CBF based methods. Furthermore, there is no need to relax the stability requirement to guarantee safety as the two conditions are coupled and achieved simultaneously. The disadvantages of this approach include increasing the dimension of the model and adding nonlinearity to the model. The safety embedded model was used in constrained linear control and in the context of optimal control to generate safe stabilizing controllers. Simple linear controls and nonlinear quadratic controls were used to show the efficacy of the proposed method in various simulation examples.

Future work will include generalizations to nonlinear stochastic systems and applications to infinite and finite horizon stochastic optimal control as well as sampling-based model predictive control formulations. Another line of research includes generalizations of the proposed optimal control framework to min-max and HH_{\infty} optimal control problem formulations. Finally, incorporating uncertainty quantification methods into the augmented state space representation in (LABEL:new_system,_safety_augmented) is an active research.

References

  • [1] Franco Blanchini “Set invariance in control” In Automatica 35.11 Elsevier, 1999, pp. 1747–1767
  • [2] Stephen Prajna and Ali Jadbabaie “Safety verification of hybrid systems using barrier certificates” In International Workshop on Hybrid Systems: Computation and Control, 2004, pp. 477–492 Springer
  • [3] Aaron D Ames et al. “Control barrier functions: Theory and applications” In 2019 18th European Control Conference (ECC), 2019, pp. 3420–3431 IEEE
  • [4] Peter Wieland and Frank Allgöwer “Constructive safety using control barrier functions” In IFAC Proceedings Volumes 40.12 Elsevier, 2007, pp. 462–467
  • [5] Muhammad Zakiyullah Romdlony and Bayu Jayawardhana “Uniting control Lyapunov and control barrier functions” In 53rd IEEE Conference on Decision and Control, 2014, pp. 2293–2298 IEEE
  • [6] Aaron D Ames, Jessy W Grizzle and Paulo Tabuada “Control barrier function based quadratic programs with application to adaptive cruise control” In 53rd IEEE Conference on Decision and Control, 2014, pp. 6271–6278 IEEE
  • [7] Muhammad Zakiyullah Romdlony and Bayu Jayawardhana “Stabilization with guaranteed safety using control Lyapunov–barrier function” In Automatica 66 Elsevier, 2016, pp. 39–47
  • [8] Aaron D Ames, Xiangru Xu, Jessy W Grizzle and Paulo Tabuada “Control barrier function based quadratic programs for safety critical systems” In IEEE Transactions on Automatic Control 62.8 IEEE, 2016, pp. 3861–3876
  • [9] Adrian G Wills and William P Heath “Barrier function based model predictive control” In Automatica 40.8 Elsevier, 2004, pp. 1415–1422
  • [10] Yuxiao Chen, Mohamadreza Ahmadi and Aaron D Ames “Optimal safe controller synthesis: A density function approach” In 2020 American Control Conference (ACC), 2020, pp. 5407–5412 IEEE
  • [11] Max H Cohen and Calin Belta “Approximate optimal control for safety-critical systems with control barrier functions” In 2020 59th IEEE Conference on Decision and Control (CDC), 2020, pp. 2062–2067 IEEE
  • [12] Hassan Almubarak, Evangelos A Theodorou and Nader Sadegh “HJB Based Optimal Safe Control using Control Barrier Functions” In arXiv preprint arXiv:2106.15560, 2021
  • [13] Hassan Almubarak, Nader Sadegh and David G. Taylor “Infinite horizon nonlinear quadratic cost regulator” In American Control Conference, 2019. Proceedings of the 2019, 2019 IEEE
  • [14] Nader Sadegh and Hassan Almubarak “Recursive Analytic Solution of Nonlinear Optimal Regulators” In arXiv preprint arXiv:2006.15685, 2020
  • [15] Dahlard L Lukes “Optimal regulation of nonlinear dynamical systems” In SIAM Journal on Control 7.1 SIAM, 1969, pp. 75–100
  • [16] Hassan K Khalil “Nonlinear systems” Prentice Hall, 2002
  • [17] EG Al’Brekht “On the optimal stabilization of nonlinear systems” In Journal of Applied Mathematics and Mechanics 25.5 Elsevier, 1961, pp. 1254–1266
  • [18] Randal W Beard, George N Saridis and John T Wen “Approximate solutions to the time-invariant Hamilton–Jacobi–Bellman equation” In Journal of Optimization theory and Applications 96.3 Springer, 1998, pp. 589–626
  • [19] Tayfun Cimen “State-dependent Riccati equation (SDRE) control: A survey” In IFAC Proceedings Volumes 41.2 Elsevier, 2008, pp. 3761–3775
  • [20] Noboru Sakamoto and Arjan J Schaft “Analytical approximation methods for the stabilizing solution of the Hamilton–Jacobi equation” In IEEE Transactions on Automatic Control 53.10 IEEE, 2008, pp. 2335–2350