Distributed Optimal Control of Graph Symmetric Systems via Graph Filters
Abstract
Designing distributed optimal controllers subject to communication constraints is a difficult problem unless structural assumptions are imposed on the underlying dynamics and information exchange structure, e.g., sparsity, delay, or spatial invariance. In this paper, we borrow ideas from graph signal processing and define and analyze a class of Graph Symmetric Systems (GSSs), which are systems that are symmetric with respect to an underlying graph topology. We show that for linear quadratic problems subject to dynamics defined by a GSS, the optimal centralized controller is given by a novel class of graph filters with transfer function valued filter taps and can be implemented via distributed message passing. We then propose several methods for approximating the optimal centralized graph filter by a distributed controller only requiring communication with a small subset of neighboring subsystems. We further provide stability and suboptimality guarantees for the resulting distributed controllers. Finally, we empirically demonstrate that our approach allows for a principled tradeoff between communication cost and performance while guaranteeing stability. Our results can be viewed as a first step towards bridging the fields of distributed optimal control and graph signal processing.
I Introduction
Computing a distributed optimal controller in which subcontrollers have access to subsets of global system information is in general a computationally intractable problem. Indeed, even when restricted to quadratic costs, Gaussian noise, and linear dynamics, the resulting optimal controller can be nonlinear and difficult to compute \citep{witsenhausen1968counterexample}. Nevertheless, significant progress has been made in distributed optimal controller synthesis over the past two decades by identifying structural assumptions on the underlying dynamics and information exchange structure such that the resulting distributed controller synthesis problem is convex.
One such structural assumption that has been shown to lead to tractable distributed optimal control problems is spatial invariance \citep{bamieh2002distributed} (and other closely related notions of symmetry \citep{massioni2009distributed}). Such systems are invariant under subsystem permutations, and have been shown to have optimal centralized controllers that are approximately distributed. In particular, this allows for distributed controllers that enjoy stability and near-optimality guarantees to be computed by appropriately truncating the centralized controller.
Contributions: In this paper, inspired by results from graph signal processing, we introduce the notion of Graph Symmetric Systems (GSSs), which are systems that are symmetric with respect to an underlying graph topology (formalized in §II-A). We show that for such systems, the resulting Linear Quadratic (LQ) centralized optimal controller admits an efficient message passing implementation in the form of a novel class of graph filters defined by transfer function filter taps. We subsequently propose and analyze two complementary approaches to computing near-optimal distributed controllers by truncating the centralized optimal controller subject to stability constraints. By leveraging tools from robust System Level Synthesis (SLS) \citep{matni2017scalable, anderson_system_2019}, we show that these truncation algorithms can be solved via convex optimization, and that the resulting distributed controllers enjoy sub-optimality guarantees relative to the centralized optimal controller. These results constitute an important first step towards bridging the complementary, but traditionally disparate, fields of distributed optimal control and graph signal processing.
Related work: An alternative structural assumption for tractable distributed optimal control of linear systems can be specified in terms of the sparsity and delay patterns of the control system. In particular, it is possible to characterize conditions on the sparsity and delay patterns of the information exchanged between subcontrollers, relative to the propagation of signals through sparse and delayed distributed plants, such that distributed optimal control is tractable. The seminal paper [rotkowitz2005characterization] introduced the notion of quadratic invariance,\footnote{We note that spatially invariant systems, as defined in [bamieh2002distributed], also satisfy quadratic invariance. We show in Appendix B that graph symmetric systems and controllers lead to optimal control problems satisfying quadratic invariance.} which built upon and generalized funnel causality \citep{bamieh2005convex}, and showed that so long as subcontrollers could communicate as quickly as control signals propagated through the plant, the resulting distributed optimal control problem could be solved via convex optimization. This convex parameterization of sparse and delayed controllers has since been further generalized in the System Level Synthesis (SLS) \citep{anderson_system_2019} and Input-Output Parameterization (IOP) \citep{furieri2019input} frameworks, which allow for even richer classes of sparsity and delay patterns to be imposed on distributed controllers.
A related class of distributed controllers are those based on Graph Neural Networks (GNNs). GNNs can be viewed as graph filters followed by pointwise nonlinear activation functions \citep{Ruiz2021-GNN}, and, among other favorable properties, enjoy stability to graph perturbations \citep{Gama2020-Stability}. While recent use of GNNs for distributed control has shown promise \citep{Gama2022-DistributedLQR, Gama2022-ControlGNN, yang2021communication}, such results currently lack strong guarantees of stability. We believe the results in this paper are a first step towards addressing this gap in the literature, by explicitly connecting graph filters and distributed optimal controllers. The direct relationship between graph filters and GNNs suggests that understanding the former will give insight into the effects of the latter.
Notation: We use upper- and lower-case letters such as $A$ and $x$ to denote matrices and vectors respectively, although lower-case letters might also be used for scalars or functions (the distinction will be apparent from the context). For both upper- and lower-case letters, we use boldface such as $\mathbf{A}$ and $\mathbf{x}$ to denote transfer matrices or vector/scalar transfer functions.
II The Linear Quadratic Regulator Problem for Graph Symmetric Systems
Consider a discrete-time linear time-invariant (LTI) system composed of $N$ interconnected scalar subsystems, each with state $x_i(t) \in \mathbb{R}$ and control input $u_i(t) \in \mathbb{R}$, which evolves under the dynamics

$x_i(t+1) = \sum_{j=1}^{N} [A]_{ij}\, x_j(t) + \sum_{j=1}^{N} [B]_{ij}\, u_j(t) + w_i(t), \qquad (1)$

for suitable coefficients $[A]_{ij}, [B]_{ij}$ describing the interaction between subsystems. Here $w_i(t)$ is an i.i.d. zero-mean noise. We can compactly express the dynamics of the full system in terms of the joint state $x(t) = [x_1(t), \dots, x_N(t)]^\top$ and joint control action $u(t) = [u_1(t), \dots, u_N(t)]^\top$ as

$x(t+1) = A\, x(t) + B\, u(t) + w(t), \qquad (2)$

where $A, B \in \mathbb{R}^{N \times N}$ are defined such that the global dynamics (2) are consistent with the subsystem dynamics (1).
Our goal is to find a (potentially time-varying) state-feedback controller that minimizes the cost
$J = \lim_{T \to \infty} \frac{1}{T}\, \mathbb{E}\left[ \sum_{t=0}^{T-1} x(t)^\top Q\, x(t) + u(t)^\top R\, u(t) \right], \qquad (3)$

where $Q \succeq 0$ and $R \succ 0$ are known symmetric matrices. The Linear Quadratic Regulator (LQR) problem is then given by

$\min_{u(0), u(1), \dots} \; J \quad \text{subject to the dynamics (2)}. \qquad (4)$

In the centralized setting where each subsystem has access to the global state, it is well-known that the controller that solves (4) is a linear static controller $u(t) = -K x(t)$, where $K = (R + B^\top P B)^{-1} B^\top P A$ and $P$ is the unique positive semidefinite solution to the discrete-time algebraic Riccati equation:

$P = A^\top P A - A^\top P B \left( R + B^\top P B \right)^{-1} B^\top P A + Q. \qquad (5)$
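As a minimal numerical sketch of (5), the centralized solution can be computed by iterating the Riccati map to a fixed point; the matrices below are illustrative choices (not from the paper), with a fully actuated system for simplicity:

```python
import numpy as np

# Illustrative 3-subsystem example; A, B, Q, R are stand-in choices.
rng = np.random.default_rng(0)
n = 3
A = 0.5 * rng.standard_normal((n, n))
B = np.eye(n)          # fully actuated, so (A, B) is trivially stabilizable
Q = np.eye(n)
R = np.eye(n)

# Solve the discrete-time algebraic Riccati equation (5) by fixed-point iteration.
P = np.eye(n)
for _ in range(500):
    P = A.T @ P @ A - A.T @ P @ B @ np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A) + Q

# Optimal static feedback gain: u(t) = -K x(t).
K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)

# The closed loop A - B K is stable (spectral radius < 1).
rho = max(abs(np.linalg.eigvals(A - B @ K)))
```

In practice a dedicated solver (e.g. a Schur-based DARE routine) would be preferred; the iteration above merely mirrors equation (5) directly.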
In this work, we consider a distributed variant of (4) where each subsystem can only exchange information with a small subset of subsystems. Specifically, this communication constraint is encoded as a graph $\mathcal{G} = (\mathcal{V}, \mathcal{E})$, where $\mathcal{V} = \{1, \dots, N\}$ is the set of components (nodes) and $\mathcal{E} \subseteq \mathcal{V} \times \mathcal{V}$ is the set of the corresponding interconnections (edges). It is assumed that the graph is undirected, i.e. $(i,j) \in \mathcal{E}$ if and only if $(j,i) \in \mathcal{E}$. As described in the introduction, general information exchange constraints can lead to non-convex optimal control problems \citep{witsenhausen1968counterexample}. However, as we show later, under suitable graph symmetry assumptions on the dynamics matrices $A, B$ and the cost matrices $Q, R$, the optimal centralized controller admits a distributed message passing implementation allowing for a principled tradeoff between communication complexity and controller performance.
In the rest of this section, we borrow ideas from graph signal processing \citep{Sandryhaila2013-DSPG} and introduce the notion of graph symmetric systems. First, we introduce a convenient way to define operations that respect the underlying communication graph structure via the graph matrix description (GMD) $S \in \mathbb{R}^{N \times N}$. The matrix $S$ is such that the entry $[S]_{ij}$ is zero whenever there is no connection between components $i$ and $j$, i.e. $[S]_{ij} = 0$ if $i \neq j$ and $(i,j) \notin \mathcal{E}$. Note that, since the graph is undirected, the matrix $S$ is symmetric. Therefore, it has an eigendecomposition $S = V \Lambda V^\top$ in terms of an orthonormal basis of eigenvectors, where $\Lambda$ is a diagonal matrix with elements $\lambda_i$ such that $S v_i = \lambda_i v_i$ for $v_i$ being the $i$-th column of $V$. We now introduce the notion of a graph symmetric system.
Definition 1 (Graph Symmetric System).
Given a GMD $S = V \Lambda V^\top$ for a graph $\mathcal{G}$, a linear system (2) is graph symmetric with respect to $S$ if the dynamics matrices are simultaneously diagonalized by $V$, i.e.,

$A = V \Lambda_A V^\top, \qquad B = V \Lambda_B V^\top, \qquad (6)$

where $\Lambda_A, \Lambda_B$ are diagonal.
Note that Definition 1 does not require the dynamics to be sparse. In fact, matrices $A$ and $B$ of the form in Definition 1 can be arbitrarily dense, i.e., the evolution of a subsystem state $x_i$ can depend on subsystems that cannot directly communicate with subsystem $i$ \citep{Gama2019-LinearControl}. This is distinct from the sparsity/delay constraints used in [rotkowitz2005characterization, anderson_system_2019, furieri2019input], and encodes a different notion of symmetry than that exploited in the distributed control of spatially invariant systems \citep{bamieh2002distributed}.
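The density remark can be made concrete with a small sketch that constructs a graph symmetric system per Definition 1 from the eigenvectors of a sparse GMD; the path-graph GMD and random spectra below are illustrative assumptions:

```python
import numpy as np

# A graph symmetric system per Definition 1 need not be sparse.
# S is the GMD of a path graph on 5 nodes (sparse, symmetric); A and B share
# its eigenvectors but are generically dense.
N = 5
S = np.zeros((N, N))
for i in range(N - 1):                       # path graph adjacency
    S[i, i + 1] = S[i + 1, i] = 1.0

lam, V = np.linalg.eigh(S)                   # S = V diag(lam) V^T, V orthonormal

rng = np.random.default_rng(1)
Lam_A = np.diag(rng.standard_normal(N))      # arbitrary diagonal spectra
Lam_B = np.diag(rng.standard_normal(N))
A = V @ Lam_A @ V.T                          # graph symmetric, generically dense
B = V @ Lam_B @ V.T

# V^T A V is diagonal even though A has far more nonzero entries than S.
D = V.T @ A @ V
```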
By well-known results in graph signal processing \citep{Sandryhaila2013-DSPG}, simultaneous diagonalizability of the system matrices and the GMD implies\footnote{Under the assumption that $S$ corresponds to a finite graph and has all distinct eigenvalues. At a high level, the result follows directly from the Cayley-Hamilton theorem.} that they can be written as matrix polynomials of $S$ of degree at most $N-1$,

$A = \sum_{k=0}^{N-1} a_k S^k, \qquad B = \sum_{k=0}^{N-1} b_k S^k. \qquad (7)$

Matrices that can be expressed in this matrix polynomial form are called graph filters \citep{Segarra2017-GraphFilterDesign}, and the coefficients $\{a_k\}, \{b_k\}$ are referred to as the filter weights or filter taps.
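The polynomial representation (7) can be checked numerically: with distinct GMD eigenvalues, the filter taps are recovered by solving a Vandermonde system that matches the filter's spectral response to the matrix's spectrum. The GMD and spectrum below are illustrative:

```python
import numpy as np

# Any matrix simultaneously diagonalizable with S (distinct eigenvalues)
# is a polynomial in S; the taps solve a Vandermonde system.
N = 5
S = np.diag(np.ones(N - 1), 1) + np.diag(np.ones(N - 1), -1)  # path graph GMD
lam, V = np.linalg.eigh(S)                    # eigenvalues 2cos(k pi/6): distinct

rng = np.random.default_rng(2)
alpha = rng.standard_normal(N)                # desired spectrum of A
A = V @ np.diag(alpha) @ V.T                  # graph symmetric dynamics matrix

# Filter taps a_k satisfy sum_k a_k * lam_i^k = alpha_i for every eigenvalue.
Vand = np.vander(lam, N, increasing=True)     # Vandermonde matrix [lam_i^k]
a = np.linalg.solve(Vand, alpha)

# Reassemble A as the graph filter sum_k a_k S^k.
A_filter = sum(a_k * np.linalg.matrix_power(S, k) for k, a_k in enumerate(a))
```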
We now give a message-passing interpretation of graph symmetric systems. First, it can be seen from the sparsity pattern of the GMD that the output of $S x$ can be computed entirely as a linear combination of the states of nodes one hop away in $\mathcal{G}$. To see this, consider the operation $y = S x$, whose $i$-th entry yields

$y_i = [S x]_i = \sum_{j \,:\, (i,j) \in \mathcal{E}} [S]_{ij}\, x_j + [S]_{ii}\, x_i. \qquad (8)$

More generally, when considering polynomials, computing $S^k x$ is equivalent to exchanging information $k$ times with one-hop neighbors. Therefore, if the system matrix $A$ and the control matrix $B$ are polynomials of $S$, then the evolution of the system can be computed entirely by means of exchanges with neighboring nodes. Hence, the system dynamics can be viewed as implementing distributed message passing \citep{Ruiz2021-GNN}. Examples of such linear, distributed systems include both discrete-time and continuous-time diffusions and solutions to the heat equation, among many others; see [Gama2019-LinearControl] and references therein.
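The message-passing interpretation above can be sketched as follows, where each round of neighbor-to-neighbor exchange implements one application of $S$; the path-graph GMD and filter taps are illustrative:

```python
import numpy as np

# Message-passing evaluation of a graph filter output sum_k a_k S^k x:
# each round, every node updates using only its one-hop neighbors' values.
N, K = 6, 3
S = np.diag(np.ones(N - 1), 1) + np.diag(np.ones(N - 1), -1)  # path graph GMD
neighbors = [np.nonzero(S[i])[0] for i in range(N)]

rng = np.random.default_rng(3)
x = rng.standard_normal(N)
a = rng.standard_normal(K)        # filter taps (illustrative)

y = a[0] * x                      # k = 0 term: no communication needed
z = x.copy()                      # z holds S^k x after k exchange rounds
for k in range(1, K):
    z_new = np.zeros(N)
    for i in range(N):            # node i combines messages from its neighbors
        z_new[i] = sum(S[i, j] * z[j] for j in neighbors[i])
    z = z_new
    y += a[k] * z                 # accumulate the k-hop term

# Reference: direct (centralized) evaluation of the same filter.
y_direct = sum(a[k] * np.linalg.matrix_power(S, k) @ x for k in range(K))
```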
For the rest of the paper, we further assume that the cost matrices $Q, R$ of the LQR problem (4) can also be simultaneously diagonalized with the dynamics matrices. Formally, we make the following assumption.

Assumption 1.
The system (2) is graph symmetric with respect to the GMD $S = V \Lambda V^\top$, and the cost matrices satisfy $Q = V \Lambda_Q V^\top$ and $R = V \Lambda_R V^\top$, where $\Lambda_Q, \Lambda_R$ are diagonal.
III Optimal Distributed Linear Controller via System Level Synthesis
SLS provides a convex parameterization of achievable closed-loop system responses \citep{wang_separable_2018, anderson_system_2019}, which can be leveraged to show that the optimal controller for graph symmetric systems under Assumption 1 can be written as a novel class of graph filters defined by transfer function valued filter taps.
III-A Background: System-Level Synthesis
As noted in §4 of [anderson_system_2019], we can compactly write the system dynamics (2) in the frequency domain as

$z\, \mathbf{x} = A\, \mathbf{x} + B\, \mathbf{u} + \mathbf{w},$

where $\mathbf{x}(z) = \sum_{t \ge 0} x(t)\, z^{-t}$ is the signal in the $z$-domain, and idem for $\mathbf{u}$ and $\mathbf{w}$. For a (dynamic) linear state-feedback controller $\mathbf{u} = \mathbf{K} \mathbf{x}$, it follows immediately that
$\mathbf{x} = (zI - A - B\mathbf{K})^{-1} \mathbf{w} =: \mathbf{\Phi}_x\, \mathbf{w}, \qquad \mathbf{u} = \mathbf{K} (zI - A - B\mathbf{K})^{-1} \mathbf{w} =: \mathbf{\Phi}_u\, \mathbf{w}, \qquad (9)$

where $\mathbf{\Phi}_x$ and $\mathbf{\Phi}_u$ are system responses that map the disturbance $\mathbf{w}$ to the state $\mathbf{x}$ and control input $\mathbf{u}$, respectively. The following SLS theorem states that all achievable responses lie in an affine subspace of strictly proper stable rational transfer functions $\frac{1}{z}\mathcal{RH}_\infty$.
Theorem 1.
[anderson_system_2019, Thm. 4.1] For the LTI system evolving under the dynamics (2) and control policy $\mathbf{u} = \mathbf{K} \mathbf{x}$, the following statements are true:
1. The affine subspace defined by

$\begin{bmatrix} zI - A & -B \end{bmatrix} \begin{bmatrix} \mathbf{\Phi}_x \\ \mathbf{\Phi}_u \end{bmatrix} = I, \qquad \mathbf{\Phi}_x, \mathbf{\Phi}_u \in \tfrac{1}{z}\mathcal{RH}_\infty, \qquad (10)$

parameterizes all system responses from $\mathbf{w}$ to $(\mathbf{x}, \mathbf{u})$ as defined in (9), achievable by an internally stabilizing state feedback controller $\mathbf{K}$.

2. For any transfer matrices $(\mathbf{\Phi}_x, \mathbf{\Phi}_u)$ satisfying (10), the controller $\mathbf{K} = \mathbf{\Phi}_u \mathbf{\Phi}_x^{-1}$ is internally stabilizing and achieves the desired system response (9).

In particular, Theorem 1 allows us to rewrite the LQR problem (4) as the equivalent convex problem

$\min_{\mathbf{\Phi}_x, \mathbf{\Phi}_u} \; \left\| \begin{bmatrix} Q^{1/2} & \\ & R^{1/2} \end{bmatrix} \begin{bmatrix} \mathbf{\Phi}_x \\ \mathbf{\Phi}_u \end{bmatrix} \right\|_{\mathcal{H}_2} \quad \text{subject to (10)}. \qquad (11)$
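The achievability constraint (10) can be checked numerically in the time domain, where it reads $\Phi_x[1] = I$ and $\Phi_x[t+1] = A\,\Phi_x[t] + B\,\Phi_u[t]$. The sketch below verifies this for the closed-loop responses of an illustrative static stabilizing gain $K$ (all matrices are assumptions, not from the paper):

```python
import numpy as np

# For a static stabilizing K, the closed-loop responses are
#   Phi_x[t] = (A + B K)^(t-1),  Phi_u[t] = K (A + B K)^(t-1),  t >= 1,
# and they satisfy the time-domain form of the SLS constraint (10).
n, T = 3, 20
A = np.array([[0.5, 0.2, 0.0],
              [0.1, 0.4, 0.3],
              [0.0, 0.2, 0.6]])
B = np.eye(n)
K = -0.5 * A                      # then A + B K = 0.5 A, which is stable

Acl = A + B @ K
Phi_x = [np.linalg.matrix_power(Acl, t - 1) for t in range(1, T + 1)]
Phi_u = [K @ P for P in Phi_x]

# Check the affine achievability constraint at every time step.
residuals = [Phi_x[t] - (A @ Phi_x[t - 1] + B @ Phi_u[t - 1]) for t in range(1, T)]
```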
III-B SLS for Graph Symmetric Systems
We now proceed to show that under Assumption 1, the optimal system response for a graph symmetric system that solves the LQR problem (11) can be written as a graph filter.
Theorem 2.
Given a GMD $S = V \Lambda V^\top$, consider an instance of the LQR problem (11) where the underlying system and cost satisfy Assumption 1. Then, there exists a global optimum where both $\mathbf{\Phi}_x$ and $\mathbf{\Phi}_u$ are diagonalizable by $V$, i.e.,

$\mathbf{\Phi}_x = V \mathbf{\Lambda}_x V^\top, \qquad \mathbf{\Phi}_u = V \mathbf{\Lambda}_u V^\top,$

where $\mathbf{\Lambda}_x$ and $\mathbf{\Lambda}_u$ are diagonal transfer matrices. Hence, the optimal controller $\mathbf{K} = \mathbf{\Phi}_u \mathbf{\Phi}_x^{-1}$ can also be diagonalized by $V$.
Proof.
See Appendix. ∎
Remark 1.
Note that the elements defined by the diagonal responses $[\mathbf{\Lambda}_x(z)]_{ii}, [\mathbf{\Lambda}_u(z)]_{ii}$ are transfer functions of $z$. Thus the resulting graph filter taps are transfer functions as well, i.e., a transfer matrix $\mathbf{\Phi}(z)$ that is simultaneously diagonalizable with the matrix $S$ can be written as:

$\mathbf{\Phi}(z) = \sum_{k=0}^{N-1} \mathbf{h}_k(z)\, S^k. \qquad (12)$
The main implication of Theorem 2 is that the optimal linear state-feedback controller for graph symmetric systems under Assumption 1 is a graph filter and can thus be implemented via distributed message passing. We note, however, that the resulting optimal system response could be dense, as $S^k$ is dense for large $k$ whenever $S$ defines a connected graph. This can be undesirable in practice, as it requires up to $N-1$ rounds of communication exchanges with one-hop neighbors, potentially causing significant delays if the size of the graph is large. In the next section, we leverage a robust variant of the SLS parameterization given in Theorem 1 to restrict the optimal system responses to only the first $F < N$ filter taps while guaranteeing stability and near-optimal performance.
We end this section by noting that in the graph signal processing literature \citep{Sandryhaila2013-DSPG}, a controller of the form of (12) is known as a linear, shift-invariant (LSI) graph filter, and is analogous to an LTI filter. Note that $S\, \mathbf{\Phi}(z) = \mathbf{\Phi}(z)\, S$, hence the name. In particular, Equation (12) is a spatially finite impulse response (FIR) graph filter \citep{Segarra2017-GraphFilterDesign} that is completely characterized by a finite set of filter taps, conveniently described by the collection of transfer functions $\{\mathbf{h}_k(z)\}_{k=0}^{N-1}$. We emphasize that the transfer functions themselves are not restricted to be temporally FIR. Spatially FIR graph filters are also known as convolutional graph filters \citep{Ruiz2021-GNN} due to their sum-and-shift nature, understanding that the effect of the operation $S x$ is to shift the signal around the graph (thus, oftentimes, the GMD is referred to as the graph shift operator). Furthermore, spatially FIR graph filters satisfy the convolution theorem, which indicates that a convolution in the vertex domain can be computed by means of an elementwise multiplication in the spectrum domain \citep{Sandryhaila2013-DSPG}. Finally, in the context of finite graphs, it is observed that the space of FIR graph filters of the form (12), characterized by $N$ filter taps, is equivalent to the space of spatially infinite impulse response (IIR) graph filters as well as autoregressive moving average (ARMA) graph filters \citep{Isufi2017-ARMA}.
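The convolution theorem mentioned above can be illustrated numerically: filtering in the vertex domain coincides with elementwise multiplication by the filter's frequency response in the graph spectral domain. The GMD and taps below are illustrative:

```python
import numpy as np

# Convolution theorem for graph filters: vertex-domain filtering equals
# elementwise spectral multiplication.
N, K = 6, 4
S = np.diag(np.ones(N - 1), 1) + np.diag(np.ones(N - 1), -1)  # path graph GMD
lam, V = np.linalg.eigh(S)

rng = np.random.default_rng(4)
x = rng.standard_normal(N)
h = rng.standard_normal(K)                     # filter taps (illustrative)

# Vertex domain: y = sum_k h_k S^k x.
y_vertex = sum(h[k] * np.linalg.matrix_power(S, k) @ x for k in range(K))

# Spectral domain: transform, multiply by the frequency response, invert.
x_hat = V.T @ x                                # graph Fourier transform of x
freq_response = np.polynomial.polynomial.polyval(lam, h)  # sum_k h_k lam^k
y_spectral = V @ (freq_response * x_hat)
```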
IV Localized Approximations to the Optimal Distributed Linear Controller
In this section, we discuss several methods to approximate the optimal dense system response in the form of (12) with one that is localized and uses only $F < N$ filter taps. We start with a projection method based on graph filter design. We then present a robust SLS formulation of the approximation problem that guarantees the stability of the resulting localized controller. Finally, we show how these two can be combined into a robust projection method that also ensures stability. In the following, we define $\mathbf{\Phi}^\star := [\mathbf{\Phi}_x^{\star\top} \; \mathbf{\Phi}_u^{\star\top}]^\top$, and recall that $\mathbf{\Phi}^\star$ can be written as a graph filter (12) defined by transfer function filter taps $\mathbf{h}^\star_k(z)$, $k = 0, \dots, N-1$. We further recall that each transfer function filter tap admits the following expansion in terms of its Markov parameters: $\mathbf{h}_k(z) = \sum_{t=1}^{\infty} h_k[t]\, z^{-t}$.
IV-A Naive Projection
We propose an approach inspired by the graph signal processing literature, wherein we exploit the graph filter structure of the optimal system responses \citep{Segarra2017-GraphFilterDesign}. More specifically, we project the optimal system responses onto graph filters of order $F$ in the $\mathcal{H}_2$ norm by solving the following optimization problem

$\min_{\mathbf{h}_0, \dots, \mathbf{h}_{F-1}} \; \left\| \mathbf{\Phi}^\star - \sum_{k=0}^{F-1} \mathbf{h}_k(z)\, S^k \right\|_{\mathcal{H}_2}^2. \qquad (13)$

Here $\mathbf{h} = (\mathbf{h}_0, \dots, \mathbf{h}_{F-1})$ collects the transfer function filter taps defining $\mathbf{\Phi}$. If we further restrict each transfer function filter tap to be FIR of order $T$, i.e., if we write $\mathbf{h}_k(z) = \sum_{t=1}^{T} h_k[t]\, z^{-t}$, then this reduces to solving the following unconstrained quadratic program

$\min_{\{h_k[t]\}} \; \sum_{t=1}^{T} \left\| \Phi^\star[t] - \sum_{k=0}^{F-1} h_k[t]\, S^k \right\|_F^2, \qquad (14)$

where $\Phi^\star[t]$ denotes the $t$-th Markov parameter of $\mathbf{\Phi}^\star$.
Proposition 3 (Approximating filter taps).
If the eigenvalues of the GMD $S$ are all distinct, then the filter taps $\{\mathbf{h}_k\}_{k=0}^{F-1}$ that solve (13) are given by

$\mathbf{h} = \mathbf{h}^\star_{0:F-1} + \left( \mathcal{V}_F^\top \mathcal{V}_F \right)^{-1} \mathcal{V}_F^\top\, \mathbf{e}, \qquad (15)$

where $\mathbf{e}$ is the error vector computed as

$\mathbf{e} = \mathcal{V}_{\bar{F}}\, \mathbf{h}^\star_{F:N-1}, \qquad (16)$

with $\mathcal{V}_F \in \mathbb{R}^{N \times F}$ the (Vandermonde) collection of the first $F$ powers of the eigenvalues of $S$, $\mathcal{V}_{\bar{F}} \in \mathbb{R}^{N \times (N-F)}$ the collection of the remaining powers, and $\mathbf{h}^\star_{F:N-1}$ the collection of the tail optimal filter taps.
Proof.
It follows from the convexity of (14), matrix calculus, and properties of Vandermonde matrices. ∎
Prop. 3 gives a closed-form expression for the filter of order $F$ that best approximates the optimal linear distributed controller in the $\mathcal{H}_2$ norm. It also shows that each filter tap transfer function is obtained as the corresponding optimal tap plus a corrective term that accounts for the taps that could not be included.
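The structure of Prop. 3 can be sketched numerically for a single Markov parameter: with distinct eigenvalues, the projection reduces to a Vandermonde least-squares problem, and the "leading taps plus a tail correction" closed form matches a direct least-squares fit. All values below are illustrative, and the notation mirrors, but is not taken verbatim from, (15) and (16):

```python
import numpy as np

# Projection of an order-N graph filter onto order F, one Markov parameter
# at a time, via a Vandermonde least-squares problem.
N, F = 6, 3
S = np.diag(np.ones(N - 1), 1) + np.diag(np.ones(N - 1), -1)  # path graph GMD
lam, V = np.linalg.eigh(S)                   # distinct eigenvalues

rng = np.random.default_rng(5)
h_star = rng.standard_normal(N)      # one Markov parameter of the N optimal taps

Vand = np.vander(lam, N, increasing=True)
V_F = Vand[:, :F]                    # first F powers of the eigenvalues
V_tail = Vand[:, F:]                 # remaining powers
h_tail = h_star[F:]

# Closed form in the spirit of (15)-(16): leading taps plus a corrective
# term accounting for the discarded tail.
e = V_tail @ h_tail
h_proj = h_star[:F] + np.linalg.solve(V_F.T @ V_F, V_F.T @ e)

# Sanity check against a direct least-squares fit of the full spectral response.
target = Vand @ h_star               # spectral response of the optimal filter
h_lstsq, *_ = np.linalg.lstsq(V_F, target, rcond=None)
```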
This approach is easy to implement computationally, as it only requires solving a least squares problem to minimize the projection cost. However, the resulting controller cannot be guaranteed to be stabilizing. As we show later via numerical simulation, approximations with a small number of filter taps are often unstable. This motivates an approach that takes into account the stability of the resulting controller.
IV-B Localized Approximations via Robust SLS
Robust SLS \citepmatni2017scalable, anderson_system_2019 offers a systematic way to reason about approximate system responses, i.e., system responses that do not exactly satisfy the achievability constraint (10). In particular, as shown in the following result, robust SLS allows for an explicit characterization of the effects of using approximate system responses for controller design.
Theorem 4 (Corollary 4.4 of [anderson_system_2019]).
Let $(\hat{\mathbf{\Phi}}_x, \hat{\mathbf{\Phi}}_u, \mathbf{\Delta})$ be a solution to

$\begin{bmatrix} zI - A & -B \end{bmatrix} \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} = I + \mathbf{\Delta}, \qquad \hat{\mathbf{\Phi}}_x, \hat{\mathbf{\Phi}}_u \in \tfrac{1}{z}\mathcal{RH}_\infty. \qquad (17)$

Then if $\|\mathbf{\Delta}\|_{\mathcal{H}_\infty} < 1$, the controller $\mathbf{K} = \hat{\mathbf{\Phi}}_u \hat{\mathbf{\Phi}}_x^{-1}$ stabilizes the system (2), and the actual system response that is achieved is given by

$\begin{bmatrix} \mathbf{\Phi}_x \\ \mathbf{\Phi}_u \end{bmatrix} = \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} (I + \mathbf{\Delta})^{-1}.$
We leverage this result to provide an upper bound on the amount of truncation that can be applied to $\mathbf{\Phi}^\star$ without destabilizing the system.
Corollary 5.
Let $(\hat{\mathbf{\Phi}}_x, \hat{\mathbf{\Phi}}_u, \mathbf{\Delta})$ satisfy (17) with $\|\mathbf{\Delta}\|_{\mathcal{H}_\infty} < 1$. Then the controller $\mathbf{K} = \hat{\mathbf{\Phi}}_u \hat{\mathbf{\Phi}}_x^{-1}$ stabilizes the system (2) and achieves a cost bounded by

$\hat{J} \le \frac{1}{1 - \|\mathbf{\Delta}\|_{\mathcal{H}_\infty}} \left\| \begin{bmatrix} Q^{1/2} & \\ & R^{1/2} \end{bmatrix} \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} \right\|_{\mathcal{H}_2}.$
Proof.
First, we note that by Theorem 4, the system responses achieved by $\mathbf{K} = \hat{\mathbf{\Phi}}_u \hat{\mathbf{\Phi}}_x^{-1}$ are given by

$\begin{bmatrix} \mathbf{\Phi}_x \\ \mathbf{\Phi}_u \end{bmatrix} = \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} (I + \mathbf{\Delta})^{-1}.$

Thus, the cost achieved by the controller is bounded by

$\left\| \begin{bmatrix} Q^{1/2} & \\ & R^{1/2} \end{bmatrix} \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} (I + \mathbf{\Delta})^{-1} \right\|_{\mathcal{H}_2} \le \left\| \begin{bmatrix} Q^{1/2} & \\ & R^{1/2} \end{bmatrix} \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} \right\|_{\mathcal{H}_2} \left\| (I + \mathbf{\Delta})^{-1} \right\|_{\mathcal{H}_\infty} \le \frac{1}{1 - \|\mathbf{\Delta}\|_{\mathcal{H}_\infty}} \left\| \begin{bmatrix} Q^{1/2} & \\ & R^{1/2} \end{bmatrix} \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} \right\|_{\mathcal{H}_2},$

where we used Cauchy-Schwarz (submultiplicativity of the norms) in the first inequality, and the small gain theorem together with the fact that $\|\mathbf{\Delta}\|_{\mathcal{H}_\infty} < 1$ in the last step. ∎
Corollary 5 offers a way to synthesize stable truncated system responses. Specifically, to synthesize a system response that uses only $F$ filter taps while guaranteeing both stability and performance, we propose the following optimization problem:
$\min_{\{\mathbf{h}_k\},\, \mathbf{\Delta}} \; \frac{1}{1-\gamma} \left\| \begin{bmatrix} Q^{1/2} & \\ & R^{1/2} \end{bmatrix} \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} \right\|_{\mathcal{H}_2} \qquad (18a)$

$\text{s.t.} \quad \begin{bmatrix} zI - A & -B \end{bmatrix} \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} = I + \mathbf{\Delta}, \quad \|\mathbf{\Delta}\|_{\mathcal{H}_\infty} \le \gamma < 1, \quad \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} = \sum_{k=0}^{F-1} \mathbf{h}_k(z)\, S^k. \qquad (18b)$
By Corollary 5, the solution to (18) defines a controller that is stabilizing. We further show in the next result that it enjoys guaranteed suboptimality bounds relative to the optimal controller defined by $\mathbf{\Phi}^\star$. We first introduce the following notation: for a system response of the form $\mathbf{\Phi} = \sum_{k=0}^{N-1} \mathbf{h}_k(z) S^k$, we define the $F$-truncation and the $F$-tail as

$\mathbf{\Phi}_{\le F} := \sum_{k=0}^{F-1} \mathbf{h}_k(z)\, S^k, \qquad \mathbf{\Phi}_{> F} := \sum_{k=F}^{N-1} \mathbf{h}_k(z)\, S^k.$
Theorem 6.
Suppose the tail of the optimal response satisfies $\left\| \begin{bmatrix} zI - A & -B \end{bmatrix} \mathbf{\Phi}^\star_{>F} \right\|_{\mathcal{H}_\infty} \le \gamma$ for the value $\gamma < 1$ used in (18). Then (18) is feasible, the resulting controller $\mathbf{K} = \hat{\mathbf{\Phi}}_u \hat{\mathbf{\Phi}}_x^{-1}$ is stabilizing, and the cost $\hat{J}$ it achieves satisfies

$\hat{J} \le \frac{1}{1-\gamma} \left( J^\star + \left\| \begin{bmatrix} Q^{1/2} & \\ & R^{1/2} \end{bmatrix} \mathbf{\Phi}^\star_{>F} \right\|_{\mathcal{H}_2} \right), \qquad (19)$

where $J^\star$ is the cost achieved by the optimal centralized controller.
Proof.
By the constraints (18b) and Theorem 4, we immediately have that $\mathbf{K} = \hat{\mathbf{\Phi}}_u \hat{\mathbf{\Phi}}_x^{-1}$ is stabilizing. To show the given suboptimality bound, we first note that there exists some $\tilde{\mathbf{\Delta}}$ such that $(\mathbf{\Phi}^\star_{\le F}, \tilde{\mathbf{\Delta}})$ is a feasible solution to the robust optimization problem (18). To see this, observe that

$\begin{bmatrix} zI - A & -B \end{bmatrix} \mathbf{\Phi}^\star_{\le F} = \begin{bmatrix} zI - A & -B \end{bmatrix} \left( \mathbf{\Phi}^\star - \mathbf{\Phi}^\star_{>F} \right) = I - \begin{bmatrix} zI - A & -B \end{bmatrix} \mathbf{\Phi}^\star_{>F} =: I + \tilde{\mathbf{\Delta}},$

where the last step follows from the achievability of the optimal response $\mathbf{\Phi}^\star$ and the definition of $\tilde{\mathbf{\Delta}}$. Thus, $(\mathbf{\Phi}^\star_{\le F}, \tilde{\mathbf{\Delta}})$ is a feasible solution under our assumption that $\|\tilde{\mathbf{\Delta}}\|_{\mathcal{H}_\infty} \le \gamma$. Denote the robust SLS objective (18) as $g(\mathbf{\Phi}, \mathbf{\Delta})$. By the optimality of the solution $(\hat{\mathbf{\Phi}}, \hat{\mathbf{\Delta}})$, we have that

$\hat{J} \le g(\hat{\mathbf{\Phi}}, \hat{\mathbf{\Delta}}) \le g(\mathbf{\Phi}^\star_{\le F}, \tilde{\mathbf{\Delta}}) = \frac{1}{1-\gamma} \left\| \begin{bmatrix} Q^{1/2} & \\ & R^{1/2} \end{bmatrix} \mathbf{\Phi}^\star_{\le F} \right\|_{\mathcal{H}_2},$

where we applied Corollary 5 in the first inequality. The desired result then follows from the fact that

$\mathbf{\Phi}^\star_{\le F} = \mathbf{\Phi}^\star - \mathbf{\Phi}^\star_{>F}$

and an application of the triangle inequality. ∎
This optimization problem is jointly quasi-convex in the decision variables and $\gamma$, and can be solved efficiently using bisection on $\gamma$. Further, feasibility provides a stability certificate in the form of $\|\mathbf{\Delta}\|_{\mathcal{H}_\infty} \le \gamma < 1$.
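The small-gain certificate of Theorem 4 can be illustrated with a simple temporal truncation of an achievable response (the same reasoning applies to filter-tap truncation); the system and gain below are illustrative:

```python
import numpy as np

# Temporally truncating an achievable response leaves a residual Delta as in
# (17); if ||Delta||_Hinf < 1, Theorem 4 still guarantees stability.
n, T = 3, 8
A = np.array([[0.5, 0.2, 0.0],
              [0.1, 0.4, 0.3],
              [0.0, 0.2, 0.6]])
B = np.eye(n)
K = -0.5 * A                          # stabilizing: A + B K = 0.5 A
Acl = A + B @ K

# Truncated responses Phi_x[t] = Acl^(t-1), Phi_u[t] = K Acl^(t-1), t = 1..T.
Phi_x = [np.linalg.matrix_power(Acl, t) for t in range(T)]
Phi_u = [K @ P for P in Phi_x]

# Algebra shows [zI - A, -B][Phi_x; Phi_u] = I - Acl^T z^{-T}, so Delta has a
# single Markov parameter -Acl^T at lag T (the truncation error).
Delta_taps = {T: -np.linalg.matrix_power(Acl, T)}

# Estimate ||Delta||_Hinf by sampling the unit circle.
ws = np.linspace(0, 2 * np.pi, 200)
hinf = max(
    np.linalg.norm(sum(M * np.exp(-1j * w * t) for t, M in Delta_taps.items()), 2)
    for w in ws
)
```

For a longer truncation horizon the residual shrinks geometrically with the closed-loop spectral radius, so the certificate `hinf < 1` is eventually satisfied.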
IV-C Robust Projection
Lastly, we can combine the robustness constraints used in robust SLS with the signal-processing-based projection method. Specifically, we solve the following optimization problem
$\min_{\{\mathbf{h}_k\},\, \mathbf{\Delta}} \; \left\| \mathbf{\Phi}^\star - \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} \right\|_{\mathcal{H}_2} \qquad (20)$

$\text{s.t.} \quad \begin{bmatrix} zI - A & -B \end{bmatrix} \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} = I + \mathbf{\Delta}, \quad \|\mathbf{\Delta}\|_{\mathcal{H}_\infty} \le \gamma < 1, \quad \begin{bmatrix} \hat{\mathbf{\Phi}}_x \\ \hat{\mathbf{\Phi}}_u \end{bmatrix} = \sum_{k=0}^{F-1} \mathbf{h}_k(z)\, S^k.$
We note that solving this problem does not give an upper bound on the cost of the resulting controller, but the robustness constraint ensures that the resulting controller is stabilizing.
IV-D Implementation
We end this section with practical implementation details for optimization problems (18) and (20). First, we note that computationally, one cannot directly optimize over the IIR system responses as written in (18) and (20). In practice, we use an FIR approximation of the strictly proper transfer functions $\hat{\mathbf{\Phi}}_x$ and $\hat{\mathbf{\Phi}}_u$, i.e., each filter tap is parameterized as

$\mathbf{h}_k(z) = \sum_{t=1}^{T} h_k[t]\, z^{-t}$

for some given FIR order $T$. As shown in [anderson_system_2019], the suboptimality incurred by such an FIR approximation decays exponentially in the horizon $T$.
The $\mathcal{H}_\infty$-norm constraints on the (also FIR) transfer matrix $\mathbf{\Delta}$ can then be enforced via semidefinite programming (see Theorem 5.8 in [dumitrescu2007positive]), potentially introducing a nontrivial computational burden. However, we note that one can replace the $\mathcal{H}_\infty$-norm constraint in optimization problems (18) and (20) with any induced norm constraint. A particularly appealing option is the induced $\ell_1 \to \ell_1$ norm (which defines the $\mathcal{L}_1$-norm of the transpose system), as this norm decomposes columnwise. As shown in [anderson_system_2019, wang_separable_2018], the resulting robustness constraints are linear and embarrassingly parallelizable. We defer this extension to future work.
V Numerical Experiments
We show that our approach offers a principled way to trade off performance and communication complexity through numerical experiments. We also demonstrate the importance of the robustness constraints in synthesizing stable distributed controllers and compare the performance of robust SLS and projection-based methods on synthesizing localized controllers. All code needed to reproduce the examples found in this section is available at https://github.com/unstable-zeros/graph-symmetric-systems.
V-A Setup
In the following experiments, we consider the distributed linear quadratic regulator (LQR) problem (4) over a network of $N$ scalar subsystems. We generate the GMD $S$ and dynamics matrices $A$ and $B$ using a process similar to that in [Gama2022-DistributedLQR]. To generate a problem instance, we start by creating the communication network $\mathcal{G}$ by randomly sampling $N$ points on the unit interval, and creating a bi-directional link between each point and a fixed number of its nearest points under the standard metric on the interval. We then take $S$ to be a symmetric matrix that shares the sparsity pattern of the Laplacian of $\mathcal{G}$, with its entry values sampled independently at random. The GMD is then normalized in spectral radius. We generate the dynamics matrices $A$ and $B$ to share the same eigenvectors as $S$, and sample their eigenvalues i.i.d. from the standard normal distribution; hence both $A$ and $B$ are symmetric matrices. We take the cost matrices $Q$ and $R$ to be identity. For both of the following experiments, we randomly generate a collection of problem instances using this process. We end by noting that graphs $\mathcal{G}$ generated this way have, on average, a diameter of only a few hops.
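A hedged sketch of this generation process is given below; the network size, neighbor count, and entry distribution are illustrative stand-ins where the text leaves values unspecified:

```python
import numpy as np

# Illustrative instance generation: k-nearest-neighbor graph on the unit
# interval, symmetric S with Laplacian sparsity, normalized spectral radius,
# and A, B sharing the eigenvectors of S.
rng = np.random.default_rng(6)
N, k_nearest = 10, 2                           # assumed sizes, not from the paper

p = rng.uniform(0, 1, N)                       # points on the unit interval
Adj = np.zeros((N, N))
for i in range(N):
    dists = np.abs(p - p[i])
    for j in np.argsort(dists)[1:k_nearest + 1]:
        Adj[i, j] = Adj[j, i] = 1.0            # bi-directional links

# Symmetric S sharing the sparsity pattern of the Laplacian of the graph.
mask = (Adj + np.eye(N)) > 0
W = rng.standard_normal((N, N))                # assumed entry distribution
S = np.where(mask, (W + W.T) / 2, 0.0)
S /= max(abs(np.linalg.eigvals(S)))            # normalize spectral radius

# A and B share the eigenvectors of S, with i.i.d. standard normal eigenvalues.
lam, V = np.linalg.eigh(S)
A = V @ np.diag(rng.standard_normal(N)) @ V.T
B = V @ np.diag(rng.standard_normal(N)) @ V.T
```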
For the implementation of the optimization problems, we approximate the transfer functions with a finite FIR horizon $T$. Further, for the robust SLS problem (18), instead of using bisection to determine the best value of $\gamma$, we fix $\gamma$ to a constant, as empirically the value of $\gamma$ does not significantly affect the cost achieved by the controllers.
V-B Importance of Stability Constraints
In this experiment, we demonstrate the importance of the robust SLS-based stability constraints in synthesizing stable distributed controllers. We vary the number of allowed filter taps and apply both the naive projection (13) and robust SLS (18) methods to the randomly generated problem instances. For the naive projection method, we report the percentage of resulting controllers that are stable. For robust SLS, we report the percentage of optimization problems (18) that are feasible, as feasible solutions of optimization problem (18) are guaranteed to be stabilizing. The results are shown in Figure 1.
First, we observe that, as expected, a higher number of filter taps results in a higher probability of synthesizing stable responses for both methods. However, the naive projection method has nonzero probability of resulting in an unstable controller even with a large number of filter taps, i.e., even when the projection error between the stable optimal system responses and the projected responses is small. On the other hand, the percentage of stable solutions resulting from the robust SLS problem increases monotonically with the number of filter taps, which is expected for a principled way of synthesizing stable controllers. Robust SLS also generally achieves a higher percentage of certifiably stable responses than naive projection, except in the extremely sparse regime with very few filter taps. This suggests that the robustness constraint might be too restrictive for synthesizing extremely sparse responses. Combined with the low computation cost of naive projection, this suggests a potential benefit of applying both methods in the sparse regime.
V-C Truncation Performance
In this experiment, we compare the performance of robust SLS and robust projection for different numbers of filter taps on the same randomly generated problem instances. We show the median (solid lines) and lower and upper percentiles (shaded regions) of the costs achieved by both methods in Figure 2.

First, we note that the median costs decrease monotonically for both methods as the number of hops increases. This shows that the optimization problems can leverage the increased expressivity of the graph filters to achieve better performance, which matches our intuition. Second, we note that the robust SLS-based method achieves a lower cost than robust projection for all numbers of filter taps considered. We also note that for small numbers of hops, the upper boundary of the shaded region, which represents the upper percentile of the cost, is infinite, indicating that a nontrivial fraction of the robust synthesis problems are infeasible. This again suggests a need to develop more flexible methods in the sparse regime.
VI Conclusion
In this work, we introduced the notion of graph symmetric systems and showed that for linear quadratic problems, the optimal system response for graph symmetric systems can be written as a (potentially dense) graph filter. We then proposed three methods to approximate the optimal responses with localized responses and validated their performance in numerical simulation. Directions of future work include relaxing the GSS constraints, applying the results on separable norms to enable distributed computation, and understanding how this can better inform GNN-based controllers with nonlinear activation functions.
-A Proof for Theorem 2
We show that for any optimal system response $\mathbf{\Phi}_x^\star, \mathbf{\Phi}_u^\star$ of the optimization problem (11) that is not diagonalizable by $V$, i.e., for which

$V^\top \mathbf{\Phi}_x^\star V, \qquad V^\top \mathbf{\Phi}_u^\star V$

are not diagonal, we can construct a simultaneously diagonalizable system response $\tilde{\mathbf{\Phi}}_x, \tilde{\mathbf{\Phi}}_u$ that is equally optimal. In particular, we construct such a system response as follows:

$\tilde{\mathbf{\Phi}}_x = V \operatorname{diag}\!\left( V^\top \mathbf{\Phi}_x^\star V \right) V^\top, \qquad (21)$

$\tilde{\mathbf{\Phi}}_u = V \operatorname{diag}\!\left( V^\top \mathbf{\Phi}_u^\star V \right) V^\top. \qquad (22)$
We first show that $\tilde{\mathbf{\Phi}}_x, \tilde{\mathbf{\Phi}}_u$ are feasible solutions of (11). From the achievability condition on $\mathbf{\Phi}_x^\star, \mathbf{\Phi}_u^\star$, we have that

$(zI - A)\, \mathbf{\Phi}_x^\star - B\, \mathbf{\Phi}_u^\star = I. \qquad (23)$

Using the simultaneous diagonalizability of $A$ and $B$, we have

$(zI - \Lambda_A)\, V^\top \mathbf{\Phi}_x^\star V - \Lambda_B\, V^\top \mathbf{\Phi}_u^\star V = I. \qquad (24)$

Since the matrices $zI$, $\Lambda_A$ and $\Lambda_B$ are diagonal, we have

$(zI - \Lambda_A) \operatorname{diag}\!\left( V^\top \mathbf{\Phi}_x^\star V \right) - \Lambda_B \operatorname{diag}\!\left( V^\top \mathbf{\Phi}_u^\star V \right) = I, \qquad (25)$

where $\operatorname{diag}(\cdot)$ projects a matrix onto its diagonal elements. Therefore, $\tilde{\mathbf{\Phi}}_x, \tilde{\mathbf{\Phi}}_u$ is also feasible.
Now, we show that $\tilde{\mathbf{\Phi}}_x, \tilde{\mathbf{\Phi}}_u$ gives a cost at least as good as that of $\mathbf{\Phi}_x^\star, \mathbf{\Phi}_u^\star$. By the simultaneous diagonalizability of the matrices $Q$ and $R$, and the fact that the $\mathcal{H}_2$-norm is invariant under unitary transformations, we have that

$\left\| \begin{bmatrix} Q^{1/2}\, \mathbf{\Phi}_x^\star \\ R^{1/2}\, \mathbf{\Phi}_u^\star \end{bmatrix} \right\|_{\mathcal{H}_2}^2 = \left\| \begin{bmatrix} \Lambda_Q^{1/2}\, V^\top \mathbf{\Phi}_x^\star V \\ \Lambda_R^{1/2}\, V^\top \mathbf{\Phi}_u^\star V \end{bmatrix} \right\|_{\mathcal{H}_2}^2. \qquad (26)$

Denoting the $i$-th eigenvalue of $Q$ and $R$ by $\lambda_{Q,i}$ and $\lambda_{R,i}$, respectively, we have the inequality

$\left\| \begin{bmatrix} \Lambda_Q^{1/2}\, V^\top \mathbf{\Phi}_x^\star V \\ \Lambda_R^{1/2}\, V^\top \mathbf{\Phi}_u^\star V \end{bmatrix} \right\|_{\mathcal{H}_2}^2 \ge \sum_{i=1}^{N} \lambda_{Q,i} \left\| \left[ V^\top \mathbf{\Phi}_x^\star V \right]_{ii} \right\|_{\mathcal{H}_2}^2 + \lambda_{R,i} \left\| \left[ V^\top \mathbf{\Phi}_u^\star V \right]_{ii} \right\|_{\mathcal{H}_2}^2, \qquad (27)$

where the right-hand side equals the cost achieved by $\tilde{\mathbf{\Phi}}_x, \tilde{\mathbf{\Phi}}_u$, which follows from the definition of $\tilde{\mathbf{\Phi}}_x$ and $\tilde{\mathbf{\Phi}}_u$ in equations (21) and (22). Reversing the steps in (26), we see that $\tilde{\mathbf{\Phi}}_x, \tilde{\mathbf{\Phi}}_u$ achieves a cost at least as good as that of $\mathbf{\Phi}_x^\star, \mathbf{\Phi}_u^\star$. We can thus conclude that there always exists an optimal simultaneously diagonalizable system response to the LQR problem (11). The controller $\mathbf{K} = \tilde{\mathbf{\Phi}}_u \tilde{\mathbf{\Phi}}_x^{-1}$ is thus optimal and simultaneously diagonalizable by $V$.
-B GSS and Controllers Satisfy Quadratic Invariance
Here we show that optimal control problems over graph symmetric systems and controllers satisfy quadratic invariance \citep{rotkowitz2005characterization}. Before proceeding, we remark that the analysis of the LQR optimal control problem over GSSs does not require quadratic invariance. In particular, in Theorem 2 we analyze the unconstrained optimal control problem and show that the resulting unconstrained optimal controller satisfies a corresponding notion of graph symmetry. However, in the interest of completeness, we show that if such a constraint were imposed on the controller during synthesis, the resulting problem would satisfy quadratic invariance.
To that end, the corresponding constrained controller synthesis problem can be stated as

$\min_{\mathbf{K}} \; J(\mathbf{K}) \quad \text{subject to} \quad \mathbf{K} \text{ internally stabilizing}, \; \mathbf{K} \in \mathcal{S},$

where $\mathcal{S} := \{ \mathbf{K} : V^\top \mathbf{K} V \text{ is diagonal} \}$ is the set of graph symmetric controllers. Denoting the plant input-output transfer function as

$\mathbf{P} = (zI - A)^{-1} B,$

we have the following proposition.
Proposition 7.
The set $\mathcal{S}$ of graph symmetric controllers is quadratically invariant under $\mathbf{P}$ if system (2) is graph symmetric.
Proof.
The proof follows directly from the definition of quadratic invariance [rotkowitz2005characterization, Def. 2] by straightforward calculation. First, we note that

$\mathbf{P} = (zI - A)^{-1} B = V (zI - \Lambda_A)^{-1} \Lambda_B V^\top$

is diagonalizable by $V$. For any controller $\mathbf{K} \in \mathcal{S}$, it then follows immediately that

$\mathbf{K}\, \mathbf{P}\, \mathbf{K} \in \mathcal{S},$

as the product of simultaneously diagonalizable matrices is also simultaneously diagonalizable, proving the claim. ∎