, , ,
Neural Operators for Adaptive Control of Freeway Traffic
Abstract
Uncertainty and delayed reactions in human driving behavior lead to stop-and-go traffic congestion on freeways. The freeway traffic dynamics are governed by the Aw-Rascle-Zhang (ARZ) traffic Partial Differential Equation (PDE) models with unknown relaxation time. Motivated by the adaptive traffic control problem, this paper presents a neural operator (NO) based adaptive boundary control design for the coupled 22 hyperbolic systems with uncertain spatially varying in-domain coefficients and boundary parameter. In traditional adaptive control for PDEs, solving backstepping kernel online is computationally intensive, as it requires significant resources at each time step to update the estimation of coefficients. To address this challenge, we use operator learning, i.e. DeepONet, to learn the mapping from system parameters to the kernels functions. DeepONet, a class of deep neural networks designed for approximating operators, has shown strong potential for approximating PDE backstepping designs in recent studies. Unlike previous works that focus on approximating single kernel equation associated with the scalar PDE system, we extend this framework to approximate PDE kernels for a class of the first-order coupled 22 hyperbolic kernel equations. Our approach demonstrates that DeepONet is nearly two orders of magnitude faster than traditional PDE solvers for generating kernel functions, while maintaining a loss on the order of . In addition, we mathematically validate the stability of the system through Lyapunov analysis when using DeepONet-approximated kernels in the adaptive controller. The proposed adaptive control is compared with reinforcement learning (RL) methods. Our approach guarantees stability and does not rely on initial values, which is essential for rapidly changing traffic scenarios. This is the first time this operator learning framework has been applied to the adaptive control of the ARZ traffic model, significantly enhancing the real-time applicability of this design framework for mitigating traffic congestion.
keywords:
Traffic Flow model; 22 Hyperbolic system; PDE backstepping; Neural operators; Adaptive control1 Introduction
Stop-and-go traffic congestion is a very common phenomenon in major cities around the world. The traffic congestion on highways leads to many unsafe driving behaviors, as well as increased fuel emissions, environmental pollution, and increased commuting time [1][2]. The traffic congestion is characterized by the propagation of shock waves on road, caused by delayed driver response. There have been many studies on traffic stabilization using PDE models, such as the first-order hyperbolic PDE model proposed by Ligthill and Whitham and Richards (LWR) [3][4] to describe traffic density waves on highways. Then Aw and Rascle [5] and Zhang [6] proposed the second-order nonlinear hyperbolic PDE model to describe the evolution of velocity and density states in traffic flow. The ARZ model is a 22 hyperbolic PDE system and widely used for describing dynamics of the stop-and-go traffic oscillations. In this paper, we adopt the ARZ model and develop adaptive boundary control designs for traffic stabilization.
1.1 PDE backstepping for traffic control
The control strategy for freeway traffic congestion is usually based on static road infrastructure to regulate traffic flow, such as ramp metering and varying speed limits. Various traffic boundary control designs have been proposed to smooth traffic in the works of Bekiaris-Liberis and Delis [7], Zhang [8] as well as Karafyllis, Bekiaris-Liberis, and Papageorgiou [9]. While Bekiaris-Liberis and Delis utilize Adaptive Cruise Control vehicles for in-domain actuation as control inputs [7], Karafyllis et al. design a boundary feedback law to manage inlet demand [9]. The boundary control strategy using PDE backstepping is first proposed in [10] to stabilize the linearized ARZ system, including full state feedback and output feedback. Recent efforts [11, 12, 13, 14, 15, 16] have further developed backstepping controllers for various traffic scenarios including multi-lane, multi-class and mixed-autonomy traffic. This paper primarily focuses on adaptive control of traffic PDE systems with uncertain parameters.
In traffic flow modeling, relaxation time is a critical parameter representing drivers’ reaction delays to evolving traffic conditions. However, heterogeneity and unpredictability of individual driver behavior makes it impossible to obtain the relaxation time in practice. This uncertainty in relaxation time can significantly impact the stability and performance of traffic systems. Traditional control methods struggle to handle such uncertainties, making it difficult to ensure system stability and optimal performance under varying traffic conditions. To address these challenges, we adopt adaptive control strategies that allow for real-time adjustment of the controller gains to accommodate unknown or time-varying system characteristics, ensuring the desired system performance.
Adaptive control for PDEs first appears in the late 2000s, primarily focused on parabolic PDEs [17, 18]. Adaptive control methods [19, 20, 21] can be categorized into Lyapunov-based design, identifier-based design and swapping-based design. After a decade of research, advancements in adaptive control have begun to be applied to coupled hyperbolic PDEs [22].
Although adaptive control for PDE systems with unknown parameters has been extensively studied [23, 24, 25, 26, 27, 28] and was first applied for the ARZ PDE model in [12]. The practical implementation of the adaptive controller for the traffic systems still faces challenge. This is because the adaptive control process simultaneously requires the estimation of unknown system parameters and PDE states. After each time step, it is necessary to recalculate the solution to the PDE corresponding to the gain kernel function in order to update the estimated system parameter functions. This places extremely high demands on real-time computation. The computational resources required for calculation of the gain function increase significantly with spatial sampling precision when applying traditional finite difference and finite element methods. In this paper, we adopt neural operators to accelerate computation of adaptive PDE backstepping controllers.
1.2 Advances in machine learning for PDE traffic control
With rapid advances in machine learning, data-driven methods for solving, modeling and control of PDEs have received widespread attention including physics-informed learning, reinforcement learning and operator learning. Physics-Informed Neural Networks (PINNs) directly incorporates physical constraints into neural networks training by embedding the physical laws of PDEs into the loss function. This enables PINNs to solve PDEs without large amounts of training data. Mowlavi and Nabi extend PINNs method to PDE optimal control problems in [29]. Zhao proposed a novel hybrid Traffic state estimation (TSE) approach called Observer-Informed Deep Learning (OIDL), which integrates a PDE observer and deep learning paradigm to estimate spatial-temporal traffic states from boundary sensing data in [30]. However, PINNs need to be retrained for each new set of boundary and initial conditions, which poses limitations in adaptive control applications.
Reinforcement learning (RL) has also been increasingly applied for PDE control problems, particularly in boundary and feedback control. RL continuously optimizes strategies to achieve real-time control of complex PDE systems. In the field of traffic management, researchers have been applying RL to various traffic issues. Wu et al. used the city mobility traffic micro-simulator SUMO to design a deep RL framework for hybrid autonomous traffic in various experimental scenarios [31]. Under the same framework, [32] proposed a reinforcement learning-based car-following model for electric, connected, and automated vehicles to reduce traffic oscillations and improve energy efficiency. [33] presented the exploration using RL for traffic PDE boundary control. However, RL has limited generalization ability in practical applications. RL may perform well under the specific initial conditions. However, for initial conditions outside the training range, there may be performance degradation or even failure. RL may be sensitive to hyperparameters and exhibit unpredictable behavior, making it difficult to ensure consistent and stable performance in different scenarios.
Traditional neural networks typically learn mappings between finite dimensional Euclidean spaces, but with the advancement of research, this method has been extended to the field of NO [34]. NO-based learning methods focus on mapping between function spaces and are specifically designed for solving PDEs and dynamical systems. Compared with traditional machine learning methods, NO have two unique advantages. Firstly, theoretically speaking, NO can learn the mapping of the entire system parameter set, rather than being limited to a single system parameter like standard neural networks. Secondly, from an empirical perspective, research work [34][35] has shown that NO have significantly better accuracy than traditional deep learning methods when simulating complex functions. Therefore, NO not only solves individual equation instances, but can also handle the problems of the entire PDE family.
Recent research has effectively utilized DeepONet for one-dimensional transport PDEs[36] and reaction-diffusion equations and observer designs[37]. These studies establish the stability of PDEs under approximate kernels by employing the general operator approximation theorem. Subsequent developments focus on hyperbolic PDEs[38], parabolic PDEs with delays[39], and the ARZ PDE system for traffic flow control[40]. NO are applied for gain scheduling to enable real-time control of nonlinear PDEs[41]. This framework integrates PDE backstepping and DeepONet methods for offline learning, ensuring closed-loop stability and enhancing the accuracy in managing more complex PDEs. The application of NO-approximated gain kernels becomes even more valuable for adaptive control, where the kernel must be recomputed online at each time step to accommodate updated estimates of the plant parameters. This was first explored for first-order hyperbolic PDE in [42] and extended to the reaction-diffusion equation in [43]. Different from [42][43], where the kernel equation involves a single kernel, in this work, we extend the results of [42] to the ARZ traffic models which involved the coupled heterogeneous hyperbolic PDEs. The technical challenges arise from both the more complex kernel computations and the proof analysis of the higher-order PDE systems with the approximated controllers.
Contributions: The main contributions are summarized as follows:
-
We present an NO-based adaptive control method to stabilize the ARZ traffic PDE model with unknown relaxation time. Additionally, we extend stability schemes for more general 22 hyperbolic systems with uncertain spatially varying in-domain coefficients and boundary parameter. Compared to the relevant works [42][43], which approximate single kernel, a key technical challenge is dealing with the approximation of coupled 22 Goursat-form PDE kernels in the stabilization of coupled 22 hyperbolic PDEs.
-
To address the computational challenges associated with solving gain kernel equations, we integrate DeepONet into the adaptive control framework. It is shown that the NO is almost two orders of magnitude faster than the PDE solver in solving kernel functions, and the loss remains on the order of . To the best of our knowledge, this is the first study to integrate DeepONet with adaptive control in traffic flow systems, demonstrating its potential to improve the computational efficiency of control schemes in congested traffic scenarios.
-
Through comparative experiments with RL, it has been proven that our method does not rely on initial values compared to RL and provides a model-based solution with guaranteed stability. In addition, we theoretically prove the system’s stability through Lyapunov analysis when replacing with the DeepONet approximation kernels in the adaptive controller.
Organization of paper:The paper is organized as follows. Section 2 introduces ARZ traffic PDE model and a nominal adaptive backstepping control scheme designed for 22 hyperbolic PDEs. Section 3 gives a series of properties for the gain kernel and its time derivative and introduces the approximation of feedback kernel operators. Section 4 presents the stabilization achieved through the application of approximate controller gain functions via DeepONet. Numerical simulations are presented in Section 5. Section 6 presents the conclusion.
Notation.
exact operator | |
---|---|
neural operator | |
unknown model parameters | |
estimated model parameters | |
exact kernel | |
exact estimated kernel | |
approximate estimated kernel |
We present the nomenclature for kernel learning with exact and approximate operators in Table 1. We define the -norm for as . For the convenience, we set . The supremum norm is denoted . denotes the projection operator
(4) |
2 Nominal Adaptive Control Design
2.1 ARZ PDE Traffic Model
The ARZ PDE model is used to describe the formation and dynamics of the traffic oscillations which refer to variations of traffic density and speed around equlibrium values. It consists of a set of 22 hyperbolic PDEs for traffic density and velocity. The ARZ model of -system is given by
(5) | ||||
(6) | ||||
(7) | ||||
(8) |
where , represents the traffic density, represents the traffic speed, and denotes the relaxation time, which refers to the time required for driver behavior to adapt to equilibrium. This parameter is used to describe the process by which vehicle speed adjusts to match the traffic density. The variable , defined as the traffic system pressure, is related to the density by the equation
(9) |
and . The equilibrium velocity-density relationship is given in Greenshield model:
(10) |
where is the free flow velocity, is the maximum density, and are the equilibrium points of the system with . We consider a constant traffic flux entering the domain from . We can apply the change of coordinates introduced in [14] to rewrite it in the Riemann coordinates and then map it to a decoupled first-order 22 hyperbolic system.
(11) | ||||
(12) | ||||
(13) | ||||
(14) |
where
(15) |
The relaxation time describes how fast drivers adapt their speed to equilibrium speed-density relations. Its value is usually difficult to measure in practice and is easily affected by various external factors. Therefore, we propose adaptive control law. Motivated by the second-order ARZ model, we first propose NO-based adaptive design for a more general framework of 22 hyperbolic PDEs with spatially varying coefficients, as the linearized ARZ model is a special case of such systems.
2.2 Adaptive Control for Coupled 22 Hyperbolic PDEs
We consider the first-order coupled 22 hyperbolic PDE system with four spatially variable coefficients,
(16) | ||||
(17) | ||||
(18) | ||||
(19) |
where is the time, is the space, the states are given by and the initial conditions are , where
(20) |
The transport speeds
(21) |
are known, while the spatially variable coefficients and boundary coefficient
(22) | ||||
(23) |
are unknown.
Note that system (16)-(19) is 22 hyperbolic system with spatially variable coefficients in domain, which is different from the system in [25] with the constant coefficients. System (16)-(19) is a direct extension of the system in [25], where the difference lies in the designed adaptive update law.
To ensure the well-posedness of the kernel PDEs, the adaptive control estimation requires bounded assumptions. Our basic assumption is as follows.
Assumption 1.
Bounds are known on all uncertain parameters, that is, there exists some constants , , and so that
(24) |
We first propose an adaptive control design using passive identifier design method, which includes the exact estimated backstepping kernels .
We consider the identifier
(25) | ||||
(26) | ||||
(27) | ||||
(28) |
where
(29) |
are errors between and and their estimates and , and are estimates and . We define
(30) |
for some initial conditions
The error signals (29) can straightforwardly be shown to have dynamics
(31) | ||||
(32) | ||||
(33) | ||||
(34) |
where , . We choose the following update laws
(35) | ||||
(36) | ||||
(37) | ||||
(38) | ||||
(39) |
where are scalar design gains. The adaptive law (35)-(39) have the following properties for all [44]
(40) |
where
(41) |
Lemma 1.
[Properties of passive identifier ] Consider the system (16)-(19) ,the identifier (2.2)-(28), with an arbitrary initial condition such that , along with the update law (35)-(39) with an arbitrary Lipschitz initial conditions satisfying the bounds (24), guarantees the following properties
(42) | |||||
(43) | |||||
(44) | |||||
(45) | |||||
(46) | |||||
(47) |
Proof.
The proof of this lemma follows a similar approach to the proof of Theorem 9.2 in [22]. The details can be found in the Appendix. ∎
Considering the plant (16)-(19) with unknown parameters and , we will design a nominal adaptive control law to achieve global stability.
We consider the following adaptive backstepping transformation
(48) | ||||
(49) |
where the kernels and satisfy the following kernel functions
(50) | ||||
(51) | ||||
(52) | ||||
(53) |
The coupled Goursat-form PDEs, governed by two gain kernels, are defined over the triangular domain , given by:
(54) |
Using the transformation (48) and (49), we get the following target system
(55) | ||||
(56) | ||||
(57) | ||||
(58) |
where the coefficient and are chosen to satisfy
(59) | ||||
(60) |
From the boundary conditions (19), (49) and (58), the nominal stabilizing controller is straightforwardly derived as follows [25]
(61) |
Next, we present the stability of exact adaptive backstepping control, which serves as a guide for what we aim to achieve under the NO-based approximate adaptive backstepping design.
Theorem 1.
Proof.
The proof of this theorem follows a similar approach to the proof of Theorem 9.2 in [22]. ∎
In summary, the exact adaptive backstepping feedback law (61) can achieve global stability at the equilibrium point, with the system states converging pointwise to zero. However, this method is computationally intensive because it requires solving Volterra equations (2.2)-(53) at each time step t. To simplify the computation, we propose using the NO to approximate the exact adaptive backstepping gain operator . This approach allows for neural network evaluation at each time step instead of solving the complex equations. In the following section, we will introduce how to approximate the operator using DeepONet and use the resulting approximate control gain functions for boundary stabilization of plant (16)-(19). And we will use the universal approximation theorem of DeepONet [45] to derive the stability theorem from control gain kernel approximations.
3 Neural Operator for Approximating Gain Kernels
To ensure the effectiveness of the control strategy, the time and spatial derivatives of the gain kernel PDEs must be well-defined, continuous, and bounded. By proving the continuity and boundedness mentioned above, it can be further deduced that for a set of continuously differentiable coefficients within a certain supremely bounded norm, there exists a NO with arbitrary accuracy. This means that the NO can be used to accurately approximate these reaction coefficients.
3.1 Properties of the gain kernel functions
3.2 Approximation of the NO
In the following discussion, we will first introduce the universal approximation theorem of DeepONet. This theorem demonstrates DeepONet’s ability to approximate operators, enabling us to use it to learn the backstepping gain kernel mapping of PDEs and theoretically guarantee the stability of the adaptive control system.
Theorem 2.
[Universal approximation theorem of DeepONet[45]].For sets and , they are compact sets of vectors and , respectively. Define and as sets of continuous functions and , respectively, and assume that is also a compact set. If the operator is continuous, then for any , there exist and such that when , there exist neural networks with parameters , and their corresponding , such that satisfies the following conditions.
(69) |
where
(70) |
for all and of .
Figure 1 presents a schematic diagram of the control circuit, illustrating the utilization of neural operators to accelerate the generation process of gain kernel functions in PDE adaptive control.
We use the deepxde package to train DeepONet. As shown in Figure 2, we constructed a branch network consisting of two layers of convolutional neural networks and two layers of fully connected networks, and a trunk network consisting of two layers of fully connected networks. This framework contains a total of 32,094,465 parameters. represents the number of discrete lattices in the direction. This framework consists of a branch network and a trunk network, encoding the input function space and output function domain respectively. Firstly, based on the kernel functions (2.2)-(53), multiple input-output data pairs are calculated offline using a traditional PDE numerical solver. Subsequently, the computed kernel functions are used as training data for training the neural network. Through the combination of multi-layer neural networks, DeepONet learns the hidden nonlinear relationships in the system, namely the mapping relationship between system parameters and kernel functions. During the training process, DeepONet continuously adjusts the network parameters through the backpropagation algorithm to minimize the error between the predicted output and the true kernels.

Next, we define the operator
(71) |
Theorem 3.
Proof.
4 Stabilization under DeepONet-Approximated Gain Feedback
We will demonstrate that although the adaptive controller uses approximate estimated kernel functions, the stability of the system is still guaranteed. Based on Theorem 1, we present the system stability proof for the adaptive backstepping controller using approximate estimation kernels in the following theorem.
Theorem 4.
[Stabilization under approximate adaptive backstepping control] For all there exists a such that for all NO approximations of accuracy provided by Theorem 3, the plant (16)-(19) in feedback with the adaptive control law
(73) |
along with the update law for and given by (35)-(39) with any Lipschitz initial condition such that and the passive identifier , given by (2.2)-(28) with any initial condition such that , the following properties hold:
(74) | |||
(75) |
Moreover, for the equilibrium the following global stability estimate holds
(76) |
where
(77) |
and , , and are strictly positive constants.
Proof.
This proof mainly refers to [22, Chperter 9], and makes necessary supplements to the gain approximation error while reducing repetition.
Part A: DeepONet-perturbed target system
We consider the following adaptive backstepping transformation (48) and (49)
(78) | ||||
(79) |
where and are exact solutions to kernel function (2.2)-(53). The transformation is an invertible backstepping transformation, with inverse in the same form
(80) | ||||
(81) |
where is an operator similar to . According to [47], from Lemma 2, are continuous, there exists a unique continuous inverse kernels defined on and there exits a constant so that , . We will derive the DeepONet-perturbed target system with exact estimated kernels. Because the controller we have chosen is (73), where the kernels and are approximated by NO. This transformation lead to the following target system
(82) | ||||
(83) | ||||
(84) | ||||
(85) |
where
(86) | ||||
(87) |
The main difference between the current system (82)-(85) and the system described in (55)-(58) is the perturbation in the boundary conditions (85). This difference is due to the controller (73) using an approximated estimated kernels and instead of the exact estimated kernels and . The specific derivation process of (85) is as follows
Spatial boundedness and regulation of plant and
observer states
We use the following Lyapunov function candidate
(88) |
where and
(89) | ||||
(90) |
Before we start the formal calculations of the Lyapunov function, we will present the inequalities derived from Lemma 1.
(91) | |||
(92) | |||
(93) | |||
(94) | |||
(95) | |||
(96) |
where
The current work is based on the previous work [22, Chapter 9], with a key difference being that . This leads to the terms in (98) as follows. Let . There exist positive constants and nonnegative, integrable function such that
(97) | ||||
(98) |
Thus, we obtain the following upper bound calculation
(99) |
for positive constant and the nonnegative, integrable functions and
(100) | ||||
(101) | ||||
(102) | ||||
(103) | ||||
(104) | ||||
(105) | ||||
(106) |
where , are positive constants. We introduce
(107) |
Thus, if we choose we have . It then follows from Lemma B.6 in [48] that
and hence
Due to the invertibility of the backstepping transformation
From Lemma 1 it follows that
Part B: Pointwise-in-space boundedness and regulation
The paper [49] proved that the system (16)-(19) is equivalent to the following system through an invertible backstepping transformation.
(108) | ||||
(109) | ||||
(110) | ||||
(111) |
for some bounded functions fo the unknown parameters. Equation (108)-(111) can be explicitly be solved for to yield
(112) | ||||
(113) |
From (111), the control law and , it follows that . Since and are simple, cascaded transport equations, this implies
(114) |
With the invertibility of the transformation, then yields
(115) |
From the structure of the identifier (16), we will also have , and hence
(116) |
Part C: Global stability
Here, we will prove the global stability of the system, specifically by proving (76), and thus we introduce the following function
(117) | ||||
(118) |
The goal of the proof is to demonstrate the existence of a function such that the following inequality holds.
(119) |
We will reuse the Lyapunov function from Appendix to show that the system’s state remains stable over time.
(120) |
where
(121) |
leads to the following upper bound:
(122) |
which shows that is non-increasing and hence bounded. Thus implies that the and limit exists. By integrating (122) from zero to infinity, we obtain the following upper bound:
(123) |
From (100) and (103), it can be concluded that there are constants and such that
(124) | ||||
(125) |
Recalling (99), we have that
(126) |
We also have from Lemma B.6 in [48] that
(127) |
We then introduce the function
(128) |
Noticing that
(129) |
we achieve from (127), (129), (124) and (125) the following
(130) |
This Lyapunov functional can be represented by an equivalent norm, and the bounds of this equivalent norm are determined by two positive constants and .
(131) |
So we have
(132) |
∎
5 Simulations
This section will present and analyze the performance of the proposed NO-based adaptive controllers for two PDE models: (i) a general 22 hyperbolic system (16)-(19) (ii) the ARZ PDE system. Through these examples, we will demonstrate the effectiveness of the NO-based adaptive control design.
5.1 Simulation of the Coupled 22 Hyperbolic System
A. Simulation configuration
The coefficients of system (16)-(19) are defined as , , and with the shape parameters , , and . We choose the spatially varying coefficient from three classes of functions—namely the Chebyshev polynomial forms, sine function and cosine function. Because these coefficients are both bounded and equicontinuous, satisfying the compactness requirement of Theorem 3. And We can generate a rich and diverse set of kernel functions for training data by varying a single parameter , , and . Although this paper uses specific Chebyshev polynomial forms, sine functions, and cosine functions, our framework is applicable to any compact set of continuous functions. We use the finite difference scheme to solve PDEs, where the time step , the spatial step , the total time , and the length . The initial conditions are , . We demonstrate in the Figure 3 that the system is open-loop unstable.
B. Dataset generation and NO training
In order to establish a mapping , one must build a sufficiently large dataset containing the values that may be encountered. In this work, we choose 10 sets of randomly sampled with , , , and , where denotes the uniform distribution over the interval . We simulate trajectories using adaptive control methods and calculate the corresponding kernel functions using numerical solvers. Each trajectory was sampled at time points, resulting in a dataset of sets of for training. We trained the model on an Nvidia RTX 4060 Ti GPU. After 600 epochs of training, the error of kernel reached , and the test error was . The error of kernel reached , and the testing error was , as shown in Figure 4.
C. Computation time comparison
We begin to discuss the computational acceleration performance of NO. Table 2 provides a comparison of the computation time of solving kernels at each time step using the numerical solver and the trained DeepONet model. The last column of Table 2 represents the multiple of acceleration calculation. As the sampling accuracy increases (i.e.,the discrete space step size decreases), the calculation time of the finite-difference method will significantly increase. In contrast, the calculation time of NO only slightly increases with the decrease of spatial step size. We can see that as the sampling accuracy improves, the acceleration obtained by the NO becomes substantial. Especially when the spatial step size dx=0.0005, the kernel computation time is shortened from 3.191s to 4.631 s. We computed the average absolute error between numerical solutions and NO solutions with different step sizes. Although the error slightly increases with the decrease of step size, they are quite small at all step sizes. Because adaptive control requires calculating control gain at every step of updating parameter estimation, quickly solving the kernel function can help improve the performance of adaptive control.
D. Simulation results
We test the performance of the closed-loop system stability with test values unseen during training. Figure 5 shows the kernels calculated by the numerical solver, the kernels learned by DeepONet, and the error between them. In Figure 6, we demonstrate closed-loop stability with the NO approximated kernel function for the control feedback law. The NO approximation of the kernels is illustrated in Figure 5, where most of the absolute pointwise errors for the learned kernel do not exceed 0.03. Figure 5 and Figure 6 confirm that the kernels approximated by NO can effectively simulate the backstepping kernels while maintaining the stability of the system. All estimated parameters and are shown in Figure LABEL:coefficient_ci. We emphasize that although in adaptive control the system parameters and may not precisely converge to their true values, this does not affect the control performance. This phenomenon is not a problem but rather a characteristic of adaptive control. The goal of adaptive control is not perfect system identification, but rather the estimation of parameters that ensure system stability.
E. Comparative experiment with RL
We will evaluate the performance of NO-based (DeepONet) adaptive control method and RL method for stabilization results under different initial conditions. Specifically, we assume that the initial condition of state is a sine function, and the specific form is
(133) |
where is the frequency of a sine wave. To evaluate the performance of these two methods, we train DeepONet and RL at the same frequency , ensuring all other parameters remained consistent with those in Figure 5. In the testing phase, we will use sine initial conditions of different frequencies to verify the model stability of NO-based adaptive control and RL. Figure LABEL:Comparison_with_RL shows the stabilization results of the RL and NO control under different initial conditions. The comparative experiments highlight a significant advantage of the NO-based adaptive control method, which consistently demonstrates robustness across different initial conditions. Specifically, the NO-based adaptive control method maintains system stability without requiring retraining even when the initial conditions are changed. This characteristic underscores its adaptability in dynamic environments. In contrast, the RL method shows a significant dependency on initial conditions. Although it performs well under specific conditions encountered during training, it is unstable when faced with unforeseen initial conditions().In real-world scenarios where initial conditions are often variable and unpredictable, DeepONet ensures stability and adaptability without the need for retraining. In summary, this demonstrates DeepONet’s potential for more reliable applications in adaptive control systems, where maintaining performance across diverse conditions is crucial.

|
|
|
Speedup | Error | |||
---|---|---|---|---|---|---|---|
dx= | |||||||
dx= | |||||||
dx= | |||||||
dx= |



5.2 Application Simulation of the ARZ Traffic System
A. NO-based adaptive controller
Following the steps in the section 2, we can obtain the adaptive controller for ARZ traffic system (5)-(8) as follows
(134) |
with the parameter update law
(135) |
where
(136) |
and the kernels satisfy the following kernel functions
(137) | ||||
(138) | ||||
(139) | ||||
(140) |
According to the approximation of NO in Theorem 3, we get the NO-based adaptive controller
(141) |
|
|
|
||||
---|---|---|---|---|---|---|
Nominal Adaptive Controller | ||||||
NO-based adaptive Controller |
B. Simulation results
Then, we analyze the performance of the proposed NO-based adaptive control law for the ARZ traffic PDE system through simulations on a L=600m road over T=300 seconds. The parameters are set as follows: free-flow velocity m/s, maximum density veh/km, equilibrium density veh/km, driver reaction time s. Let . Initial conditions are sinusoidal inputs and to mimic stop-and-go traffic. To generate a sufficient dataset for training, we use 10 different functions with and simulate the resulting PDEs under the adaptive controller for seconds. We sub-sample each pair every 0.1 seconds, resulting in a total of 30,000 distinct pairs for training the NO. Using the trained NO, we simulate with the same parameters. Figure 11 shows the ARZ system is open-loop unstable. Figures LABEL:rho_v show the density and velocity of ARZ traffic system with the adaptive controller (134) with the update law (135) and NO-based adaptive controller (141). The blue line indicates the initial condition, whereas the red line represents the boundary condition of the system. The results indicate that both the NO-based adaptive method and the adaptive backstepping control method effectively stabilize the transportation system. The traffic density and velocity converge to the equilibrium values of veh/km and m/s, respectively. The relative error between the closed-loop states of the nominal adaptive controller and the NO-based adaptive controller is shown in Figures LABEL:rho_v. The maximum error does not exceed 10. The estimated parameter is shown in Figure 17.
Table 3 presents the computation times for both the nominal adaptive controller and the NO-based adaptive controller. As the baseline result, the nominal adaptive control method is compared with the NO-based adaptive control method. Notably, the NO-based adaptive control method not only achieves significantly faster average computation times but also maintains superior accuracy with a lower relative error. These advantages of the NO-based adaptive control method not only enhance computational efficiency but also make it highly suitable for real-time traffic system applications. The NO method’s efficiency and accuracy represent a substantial advancement, promising more effective and scalable traffic control strategies in practical scenarios.


6 Conclusion
This paper builds upon prior studies such as [42] and [43], which primarily focused on approximating a single kernel PDE. In contrast, our work accelerates the computation of 22 coupled Goursat-form PDEs that arise from the application of adaptive control designs to a 22 linear first-order hyperbolic system. In this paper, the DeepONet is used to learn the adaptive control gains for stabilizing the traffic PDE system, and it is shown that under the DeepONet-approximated kernels the stabilization of 22 hyperbolic PDEs can still be achieved with significant improvement for computational speeds. Experimental results show that compared to traditional numerical solvers, our method improves computational efficiency by two orders of magnitude. Additionally, compared with RL, the NO-based adaptive control strategy is independent of the system’s initial conditions, making it more robust for rapidly changing traffic scenarios. Our method significantly accelerates the process of obtaining adaptive controllers in PDE systems, greatly improving the real-time applicability of adaptive control strategies for mitigating traffic congestion. In the future, we will incorporate real traffic data into the training of the neural operator.
Appendix
The proof of Lemma 1.
Property (42) follows trivially from projection in (35)-(38) and Lemma A.1 in [22]. The result can be easily obtained using the following Lyapunov function candidate:
(142) |
where
(143) |
(144) |
We put into the dynamics (31)-(34), integrating by parts, thus we have
(145) |
Inserting the adaptive laws (35)-(39), and using the property (40), give
(146) |
From the boundary condition (33), we have
(147) |
By substituting this, we obtain
(148) |
This establishes that is bounded. By the definitions of and , it follows that . When (Appendix) is integrated over time from zero to infinity, we conclude that . Additionally, from the properties (44), and the adaptive laws (35)-(38), we derive that (45). we choose the following Lyapunov function candidate
(149) |
and use the property (40), we find
(150) |
This implies that is upper-bounded, and hence we have . By integrating (150) from zero to infinity, we obtain (47). Using (147) and (33), we derive that
(151) |
References
- [1] F. Belletti, M. Huo, X. Litrico, A. M. Bayen, Prediction of traffic convective instability with spectral analysis of the aw–rascle–zhang model, Physics Letters A 379 (38) (2015) 2319–2330.
- [2] A. De Palma, R. Lindsey, Traffic congestion pricing methodologies and technologies, Transportation Research Part C: Emerging Technologies 19 (6) (2011) 1377–1399.
- [3] M. J. Lighthill, G. B. Whitham, On kinematic waves ii. a theory of traffic flow on long crowded roads, Proceedings of the royal society of london. series a. mathematical and physical sciences 229 (1178) (1955) 317–345.
- [4] P. I. Richards, Shock waves on the highway, Operations research 4 (1) (1956) 42–51.
- [5] A. Aw, M. Rascle, Resurrection of” second order” models of traffic flow, SIAM journal on applied mathematics 60 (3) (2000) 916–938.
- [6] H. M. Zhang, A non-equilibrium traffic model devoid of gas-like behavior, Transportation Research Part B: Methodological 36 (3) (2002) 275–290.
- [7] N. Bekiaris-Liberis, A. Delis, Feedback control of freeway traffic flow via time-gap manipulation of acc-equipped vehicles: A pde-based approach, IFAC-PapersOnLine 52 (6) (2019) 1–6.
- [8] L. Zhang, C. Prieur, J. Qiao, Pi boundary control of linear hyperbolic balance laws with stabilization of arz traffic flow models, Systems & Control Letters 123 (2019) 85–91.
- [9] I. Karafyllis, N. Bekiaris-Liberis, M. Papageorgiou, Feedback control of nonlinear hyperbolic pde systems inspired by traffic flow models, IEEE Transactions on Automatic Control 64 (9) (2018) 3647–3662.
- [10] H. Yu, M. Krstic, Traffic congestion control for aw–rascle–zhang model, Automatica 100 (2019) 38–51.
- [11] M. Burkhardt, H. Yu, M. Krstic, Stop-and-go suppression in two-class congested traffic, Automatica 125 (2021) 109381.
- [12] H. Yu, M. Krstic, Adaptive output feedback for aw-rascle-zhang traffic model in congested regime, in: 2018 Annual American Control Conference (ACC), IEEE, 2018, pp. 3281–3286.
- [13] H. Yu, M. Krstic, Traffic congestion control by PDE backstepping, Springer, 2022.
- [14] H. Yu, M. Krstic, Varying speed limit control of aw-rascle-zhang traffic model, in: 2018 21st international conference on intelligent transportation systems (ITSC), IEEE, 2018, pp. 1846–1851.
- [15] H. Yu, J. Auriol, M. Krstic, Simultaneous downstream and upstream output-feedback stabilization of cascaded freeway traffic, Automatica 136 (2022) 110044.
- [16] Y. Zhang, H. Yu, J. Auriol, M. Pereira, Mean-square exponential stabilization of mixed-autonomy traffic pde system, arXiv preprint arXiv:2310.15547 (2023).
- [17] M. Krstic, A. Smyshlyaev, Adaptive boundary control for unstable parabolic pdes—part i: Lyapunov design, IEEE Transactions on Automatic Control 53 (7) (2008) 1575–1591.
- [18] A. Smyshlyaev, M. Krstic, Adaptive boundary control for unstable parabolic pdes—part ii: Estimation-based designs, Automatica 43 (9) (2007) 1543–1556.
- [19] F. Di Meglio, D. Bresch-Pietri, U. J. F. Aarsnes, An adaptive observer for hyperbolic systems with application to underbalanced drilling, IFAC Proceedings Volumes 47 (3) (2014) 11391–11397.
- [20] M. C. Belhadjoudja, M. Maghenem, E. Witrant, C. Prieur, Adaptive stabilization of the kuramoto-sivashinsky equation subject to intermittent sensing, in: 2023 American Control Conference (ACC), IEEE, 2023, pp. 1608–1613.
- [21] C. Kawan, A. Mironchenko, M. Zamani, A lyapunov-based iss small-gain theorem for infinite networks of nonlinear systems, IEEE Transactions on Automatic Control 68 (3) (2022) 1447–1462.
- [22] H. Anfinsen, O. M. Aamo, Adaptive control of hyperbolic PDEs, Springer, 2019.
- [23] M. Krstic, A. Smyshlyaev, Backstepping boundary control for first-order hyperbolic pdes and application to systems with actuator and sensor delays, Systems & Control Letters 57 (9) (2008) 750–758.
- [24] A. Smyshlyaev, E. Cerpa, M. Krstic, Boundary stabilization of a 1-d wave equation with in-domain antidamping, SIAM journal on control and optimization 48 (6) (2010) 4014–4031.
- [25] H. Anfinsen, O. M. Aamo, Adaptive control of linear2 2 hyperbolic systems, Automatica 87 (2018) 69–82.
- [26] L. Hu, F. Di Meglio, R. Vazquez, M. Krstic, Control of homodirectional and general heterodirectional linear coupled hyperbolic pdes, IEEE Transactions on Automatic Control 61 (11) (2015) 3301–3314.
- [27] J. Wang, M. Krstic, Event-triggered backstepping control of 2 2 hyperbolic pde-ode systems, IFAC-PapersOnLine 53 (2) (2020) 7551–7556.
- [28] J. Auriol, Output feedback stabilization of an underactuated cascade network of interconnected linear pde systems using a backstepping approach, Automatica 117 (2020) 108964.
- [29] S. Mowlavi, S. Nabi, Optimal control of pdes using physics-informed neural networks, Journal of Computational Physics 473 (2023) 111731.
- [30] C. Zhao, H. Yu, Observer-informed deep learning for traffic state estimation with boundary sensing, IEEE Transactions on Intelligent Transportation Systems (2023).
- [31] C. Wu, A. Kreidieh, E. Vinitsky, A. M. Bayen, Emergent behaviors in mixed-autonomy traffic, in: Conference on Robot Learning, PMLR, 2017, pp. 398–407.
- [32] X. Qu, Y. Yu, M. Zhou, C.-T. Lin, X. Wang, Jointly dampening traffic oscillations and improving energy consumption with electric, connected and automated vehicles: A reinforcement learning based approach, Applied Energy 257 (2020) 114030.
- [33] H. Yu, S. Park, A. Bayen, S. Moura, M. Krstic, Reinforcement learning versus pde backstepping and pi control for congested freeway traffic, IEEE Transactions on Control Systems Technology 30 (4) (2021) 1595–1611.
- [34] L. Lu, P. Jin, G. Pang, Z. Zhang, G. E. Karniadakis, Learning nonlinear operators via deeponet based on the universal approximation theorem of operators, Nature machine intelligence 3 (3) (2021) 218–229.
- [35] Y. Shi, Z. Li, H. Yu, D. Steeves, A. Anandkumar, M. Krstic, Machine learning accelerated pde backstepping observers, in: 2022 IEEE 61st Conference on Decision and Control (CDC), IEEE, 2022, pp. 5423–5428.
- [36] L. Bhan, Y. Shi, M. Krstic, Neural operators for bypassing gain and control computations in pde backstepping, arXiv preprint arXiv:2302.14265 (2023).
- [37] M. Krstic, L. Bhan, Y. Shi, Neural operators of backstepping controller and observer gain functions for reaction–diffusion pdes, Automatica 164 (2024) 111649.
- [38] J. Qi, J. Zhang, M. Krstic, Neural operators for pde backstepping control of first-order hyperbolic pide with recycle and delay, Systems & Control Letters 185 (2024) 105714.
- [39] S. Wang, M. Diagne, M. Krstić, Deep learning of delay-compensated backstepping for reaction-diffusion pdes, arXiv preprint arXiv:2308.10501 (2023).
- [40] Y. Zhang, R. Zhong, H. Yu, Neural operators for boundary stabilization of stop-and-go traffic, in: 6th Annual Learning for Dynamics & Control Conference, PMLR, 2024, pp. 554–565.
- [41] M. Lamarque, L. Bhan, R. Vazquez, M. Krstic, Gain scheduling with a neural operator for a transport pde with nonlinear recirculation, arXiv preprint arXiv:2401.02511 (2024).
- [42] M. Lamarque, L. Bhan, Y. Shi, M. Krstic, Adaptive neural-operator backstepping control of a benchmark hyperbolic pde, arXiv preprint arXiv:2401.07862 (2024).
- [43] L. Bhan, Y. Shi, M. Krstic, Adaptive control of reaction-diffusion pdes via neural operator-approximated gain kernels, arXiv preprint arXiv:2407.01745 (2024).
- [44] M. Krstic, Delay compensation for nonlinear, adaptive, and pde systems (2009).
- [45] B. Deng, Y. Shin, L. Lu, Z. Zhang, G. E. Karniadakis, Approximation rates of deeponets for learning operators arising from advection–diffusion equations, Neural Networks 153 (2022) 411–426.
- [46] F. Di Meglio, R. Vazquez, M. Krstic, Stabilization of a system of coupled first-order hyperbolic linear pdes with a single boundary input, IEEE Transactions on Automatic Control 58 (12) (2013) 3097–3111.
- [47] R. V. Valenzuela, Boundary control laws and observer design for convective, turbulent and magnetohydrodynamic flows, University of California, San Diego, 2006.
- [48] M. Krstic, P. V. Kokotovic, I. Kanellakopoulos, Nonlinear and adaptive control design, John Wiley & Sons, Inc., 1995.
- [49] R. Vazquez, M. Krstic, J.-M. Coron, Backstepping boundary stabilization and state estimation of a 2 2 linear hyperbolic system, in: 2011 50th IEEE conference on decision and control and european control conference, IEEE, 2011, pp. 4937–4942.