A Hybrid Queuing Model for Coordinated Vehicle Platooning on Mixed-Autonomy Highways: Training and Validation
Abstract
Platooning of connected and autonomous vehicles (CAVs) is an emerging technology with a strong potential for throughput improvement and fuel reduction. Adequate macroscopic models are critical for system-level efficiency and reliability of platooning. In this paper, we consider a hybrid queuing model for a mixed-autonomy highway section and develop an easy-to-use training algorithm. The model predicts CAV and non-CAV counts according to the traffic demand as well as key parameters of the highway section. The training algorithm learns the highway parameters from observed data in real time. We test the model and the algorithm in Simulation of Urban Mobility (SUMO) and show that the prediction error is around 15% in a stationary setting and around 25% in a non-stationary setting. We also show that the trained model leads to a platoon headway regulation policy very close to the simulated optimum. The proposed model and algorithm can directly support model-predictive decision making for platooning in mixed autonomy.
Keywords: vehicle platooning, mixed autonomy, queuing model, model training.
1 Introduction
Platooning of connected and autonomous vehicles (CAVs) is an emerging highway operation with a strong potential for throughput improvement and fuel savings [1, 2, 3]. Platooning technology has been progressing fast recently [4, 5, 6, 7]. The mixed-autonomy traffic flow stability and throughput are extensively studied [8, 9, 10, 11]. However, the macroscopic interaction between CAV platoons and non-CAV background traffic has not been well modeled or understood. In particular, platoons require much larger spaces for lane-changing than non-CAVs, and platoons may hinder non-CAVs’ lane-changing; see Fig. 1.


Consequently, platoons may disrupt local traffic flow, which may prevent platooning from attaining its maximal benefits; see [12] for quantification of such compromise.
In this paper, we develop a hybrid queuing model as well as an easy-to-use training algorithm specifically for highways with a mixture of CAV platoons and non-CAV flow. The model predicts the CAV and non-CAV states based on the traffic demand and highway parameters. We develop a simulation-based gradient descent algorithm to train the model parameters in an online manner. We show that the proposed training algorithm can attain a prediction error less than 15% (resp. 25%) in a stationary (non-stationary) setting with respect to results given by the Simulation of Urban Mobility (SUMO [13]). We also show that the online trained model can be used for platoon headway regulation.
Our hybrid queuing model (HQM) consists of two traffic classes: CAVs and non-CAVs. CAVs arrive at the system as discrete batches (platoons), and non-CAVs arrive as a continuous flow. CAV platoons are “condensed” traffic: a platoon of CAVs is treated equivalently as non-CAVs, where captures the impact of reduced spacing between CAVs. A bottleneck restricts the discharge to downstream. Traffic arrives at the bottleneck after traversing the distance from the highway entrance to the bottleneck, and a constant, nominal traverse time is assumed. The vehicle discharge rate is upper-bounded by the highway capacity. The HQM can be viewed as a simplification of a class of more sophisticated models for mixed-autonomy traffic, which are designed for more detailed analysis or control purposes [10, 14, 15, 16]. The HQM can be further abstracted and used for higher-level decisions such as routing [17]. This model is a significant refinement of a preliminary version proposed some of us in [18].
Our training algorithm determines model parameters by minimizing the square error for the vehicle states on the highway section. The parameters to be determined include the nominal traverse time, the nominal capacity, and the non-CAV’s priority ratio. The algorithm improves the parameters heuristically with constant step sizes. The HQM is simulated to compute the square error with respect to input data. The algorithm terminates when the reduction in square error is less than a threshold. Since the fluid queuing model is simple and the space for parameter search is not very large, the computational efficiency is acceptable.
We show via simulation that the proposed HQM and the training algorithm are valid for control design. We consider the regulation of between-platoon headways in response to real-time traffic condition. The HQM can be used to determine the minimal allowable headway between two platoons, which is critical for the headway regulation policy. We show that the policy based on the HQM is very close to the simulated optimal policy.
The rest of this paper is organized as follows. Section 2 defines the HQM and formulates the model training problem. Section 3 presents the training algorithms in both stationary and non-stationary settings. Section 4 validates the model and evaluates the control algorithm in SUMO. Section 5 gives concluding remarks.
2 Hybrid Queuing Model
In this section, we introduce the hybrid queuing model (HQM) and formulate the training problem.

We model the generic highway section in Fig. 1 using the HQM in Fig. 2. Cell 1 models the bottleneck. Cells 2 through are virtual cells to model the traverse time from the entrance to the bottleneck; the nominal traverse time is . Every vehicle entering the highway first goes to cell and then switch from one cell to the immediately downstream one. No queuing dynamics exists in cells 2 through . Non-CAVs enter the highway as a continuous traffic flow with rate . CAVs enter the highway as discrete platoons with size . The number of CAV arrivals during the th time step is , where . The mixture of continuous and discrete arrivals leads to the hybrid nature of our model.
The state of the model includes (i) the non-CAV states in cells and (ii) the CAV states in cells . Hence, the state space is . To define the state space, let be a scaling factor for CAVs in a platoon.

That is, a platoon of CAVs is equivalent to non-CAVs in terms of its queuing effect; see Fig. 3. A typical value for is 2 [2, 19]. Hence, the set of is , while the set of is , where .
Consider an arbitrary initial condition . At the th time step, the state is updated as follows.
-
1.
For ,
-
2.
For ,
-
3.
For , the discharged flows are given by
(1a) (1b) where is the bottleneck’s capacity and specifies the priority of non-CAV traffic with respect to CAVs; means that non-CAVs have full priority over CAVs. The traffic state in cell 1 is updated as follows.
The above queuing dynamics can be interpreted as follows. CAVs and non-CAVs first enter cell and then enter cell 1 after time steps. No queuing occurs in cells , while a two-class traffic queue may exist in cell 1. The parameters that define the model are the nominal traverse time , the priority , and the capacity . The observable quantities are the inflows and the traffic states
where and stand for number of non-CAV and CAV based on the HQM respectively.
Thus, the estimation problem can be formulated as follows:
Given , ,
Find
such that some cost is minimized.
The objective of the HQM is to accurately predict the traffic states over the highway section.
Therefore, the cost function penalizes the error between the SUMO-simulated values and the predicted values .
The observed values are obtained from SUMO, while the predicted values are from the HQM.
The specific form of the cost function depends on the specific setting, which we discuss in the next section.
3 Model training algorithm
In this section, we consider the model training algorithm in two typical settings, viz. the stationary and the non-stationary settings. We use
to denote the vector of parameters of the HQM. These parameters are the decision variables of the training problem. Since the state and (and thus and ) depends on , we write , , , to emphasize such dependency.
Fig. 4 illustrates the mechanism of the training algorithm. SUMO data (vehicle trajectories) are aggregated as CAV and non-CAV counts on the highway section. The HQM uses its internal dynamics to predict the vehicle counts (model prediction), which is supposed to best match the counts (experimental data) given by SUMO. The training algorithm aims to minimizes the prediction error.

In the stationary setting, all parameters are time-invariant, i.e., for all , where is a constant vector. In this setting, SUMO first generates the experimental data first, and then the training algorithm matches the model prediction and the experimental data by varying the model parameters. In the non-stationary setting, is a non-constant function of . In this setting, as SUMO keeps generating data, the training algorithm computes model prediction and adjusts model parameters simultaneously.
3.1 Stationary setting
The objective in this setting is to develop an algorithm that gradually learns the constant parameters and minimizes the prediction error for the traffic count over the highway section. The one-step cost is given by
(3) |
where and ) are the traffic states observed in SUMO. The cumulative cost is given by
The optimal parameter is such that
The above optimization problem is solved by a simulation-based heuristic algorithm. Hence, the estimation gradually evolves over time, noticing that serves as a parameter for the HQM.
The optimization and iteration are implemented in Algorithm 1. Suppose that is given according to, say, prior knowledge. At time , the training algorithm initializes with an initial value . At the th iteration, we randomly perturb the latest estimate and keep updating until a termination threshold is reached.
3.2 Non-stationary setting
The objective in this setting is to develop an algorithm that tracks the drift of the parameters. The one-step cost is again given by (3) The cumulative cost is given by
where stands for the discounted factor. The optimal parameter is such that
The above optimization problem is solved by a simulation-based heuristic algorithm. Hence, the estimation gradually evolves over time.
The optimization and iteration are implemented as follows. The algorithm starts with some initial value according to prior knowledge. Then, the th estimate is randomly perturbed until improved to The termination rule is analogous to that in the stationary case. The pseudo code can be obtained by substituting in Algorithm 1 with .
4 Evaluation and Validation
4.1 Micro-simulation environment
The simulations are conducted in SUMO, see Fig. 5. SUMO simulations are incorporated with built-in road dynamics, including car-following models and lane-changing models, which realistically represent the ground-truth traffic conditions. For these simulation sets, Gipp’s car following model [20] and SUMO’s lane-changing model [21] are selected with other SUMO’s default settings.


Each individual experimental run simulates 120 minutes of traffic conditions on a highway section. A bottleneck appears where the 3-lane highway section becomes a 2-lane section, delaying the downstream traffic. Based on the traffic demand, vehicle trips are generated randomly to populate the testbed. Generated approaching vehicles are categorized as CAVs and non-CAVs according to CAV penetration rate. To maintain the proposed penetration rate, an arbitrary number of consecutive CAVs are initialized together to populate the highway section. During their trips traversing through this section, they are controlled together as a platoon. In this study, we are adopting the soft platoon state, which means individual CAVs in the same platoon are allowed to appear on separate lanes when the platoon is performing lane changing.
Without applying the proposed control strategies on platoons when they are approaching the highway section, all platoons are set to depart with the nominal travel speed. Taking the advantage of the proposed control scheme, neighboring platoons are designed to distance from each other. When an incoming platoon departs, it is assigned with a guideline velocity based on the difference between the residual time of its leading platoon and itself.
Therefore, under the stationary setting where the traffic demand is fixed during each simulation run, the threshold, based on which a controlled speed applied on the departing platoon, is also a fixed constant. However, for the non-stationary setting under which the traffic demand gradually increase over time until it reaches the desired level, the corresponding threshold also varies to ensure the average travel time of all vehicles is minimized.
4.2 Stationary setting
We first apply the training algorithm to the stationary setting. In this setting, traffic enter the highway section with a constant rate.
Quantity [unit] | Notation | Nominal value | Learned value |
---|---|---|---|
Traverse time [sec] | 37 | 31 | |
Capacity [veh/hr] | 3600 | 4000 | |
Scaling factor | 2 | 3 | |
Non-CAV priority ratio | 0.5 | 0.9 |
The results listed in Table 1 are worth discussing. It is interesting that the nominal values are more conservative than the learned values. Specifically, the learned travel time, capacity, and scaling factor are all more optimistic than their respective nominal values. The reason may be that platooning leads to more space on the road, which can further improve the non-CAV traffic flow. In addition, the nominal priority ratio for non-CAVs (0.5) is much smaller than the learned value (0.9); the learned value is very close to one. This implies that without intervention from the RSU, non-CAVs almost enjoy natural priority over CAV platoons. The reason is probably that non-CAVs are more flexible in terms of lane-changing and overtaking maneuvers, while CAV platoons require much more space for those maneuvers.

Fig. 6 shows that the HQM adequately tracks the vehicle count in SUMO. The time-average percentage prediction error is 14.6%, which is satisfactory given the simplicity of the HQM. In Fig. 6, a relatively significant difference is observed during the early stage, where the algorithm starts from initial guess and gradually learns the model parameters. Observed vehicle count based on SUMO simulations demonstrates consistency with the HQM predictions. The oscillations in vehicle count through time by HQM can be illustrated by the lack of explicit road dynamics, such as car-following models and lane-changing models, which usually “damp” oscillations. These road dynamics contribute to the relatively stable vehicle counts in the SUMO simulation.
We also use the trained HQM to implement a theoretically optimal headway regulation policy. This policy ensures that any two consecutive platoons cannot pass the bottleneck within a time interval less than amount of time. For this policy, is the major design parameter. The optimal value depending on the HQM is given by
where the value for is directly obtained from simulation data. More details about the headway regulation policy are available in [12, 22].

We apply a range of values for in SUMO and recorded the average delay; see Fig. 7. The simulated optimal headway is 4.5 sec, which is very close to computed based on the HQM.
4.3 Non-stationary setting
In this subsection, we gradually vary the free-flow speed over time and adjust the parameters to track such non-stationarity. Over a 2-hour time interval, the free-flow speed is linearly reduced from 100km/h to 60km/h, which mimics e.g. the change of weather or visibility. The algorithm was described in Section 3.2

Fig. 8 shows how the learned traverse time varies over time. The nominal value is given by
where is the length of the highway section and is the linearly decreasing free-flow speed. The learned value tracks the general trend of the nominal value. Similar to the observation in Fig. 6, the HQM changes more rapidly than SUMO. An explanation is that, since demand is fixed, vehicles may start to interact as free-flow speed decreases.

Fig. 9 shows that the model provides a good estimation of total vehicle count on the highway. The overall prediction error is 24.38%, which is higher than the stationary case. However, the error mainly comes from the beginning stage (before 400 sec), where the model parameters are learned from scratch. After 400 sec, the model tracks the vehicle counts much better than at the beginning.
5 Concluding remarks
In this paper, we considered the training and validation of a hybrid queuing model for highway bottlenecks with CAV platoons. This model describes CAV platoons as discrete bulk arrivals and non-CAV traffic as continuous flow. We studied model training algorithms in both stationary and non-stationary settings. We validate the hybrid queuing model with respect to SUMO results. In the stationary setting, our model gives a prediction error of about 15%. In the non-stationary setting, the model parameters are able to track the change in the environment, and a 25% prediction error is attained. The above results provide the basis for analysis and design of learning-based platoon coordination algorithms, which are promising directions for future work.
References
- [1] R. Horowitz and P. Varaiya, “Control design of an automated highway system,” Proceedings of the IEEE, vol. 88, no. 7, pp. 913–925, 2000.
- [2] B. Besselink, V. Turri, S. van de Hoef, K.-Y. Liang, A. Alam, J. Mårtensson, and K. H. Johansson, “Cyber-physical control of road freight transport,” Proceedings of IEEE, vol. 104, no. 5, pp. 1128–1141, 2016.
- [3] T. Litman, Autonomous vehicle implementation predictions. Victoria Transport Policy Institute Victoria, Canada, 2017.
- [4] G. J. L. Naus, R. P. A. Vugts, J. Ploeg, M. J. G. van de Molengraft, and M. Steinbuch, “String-stable CACC design and experimental validation: A frequency-domain approach,” IEEE Transactions on Vehicular Technology, vol. 59, no. 9, pp. 4268–4279, 2010.
- [5] J. Ploeg, N. van de Wouw, and H. Nijmeijer, “ string stability of cascaded systems: Application to vehicle platooning,” IEEE Transactions on Control Systems Technology, vol. 22, no. 2, pp. 786–793, 2014.
- [6] S. Coogan and M. Arcak, “A dissipativity approach to safety verification for interconnected systems,” IEEE Transactions on Automatic Control, vol. 60, no. 6, pp. 1722–1727, 2015.
- [7] S. Tsugawa, S. Jeschke, and S. E. Shladover, “A review of truck platooning projects for energy savings,” IEEE Transactions on Intelligent Vehicles, vol. 1, no. 1, pp. 68–77, 2016.
- [8] H. Yu, S. Amin, and M. Krstic, “Stability analysis of mixed-autonomy traffic with cav platoons using two-class aw-rascle model,” in 2020 59th IEEE Conference on Decision and Control (CDC). IEEE, 2020, pp. 5659–5664.
- [9] J. Zhou and F. Zhu, “Modeling the fundamental diagram of mixed human-driven and connected automated vehicles,” Transportation research part C: emerging technologies, vol. 115, p. 102614, 2020.
- [10] C. Wu, A. Kreidieh, E. Vinitsky, and A. M. Bayen, “Emergent behaviors in mixed-autonomy traffic,” in Conference on Robot Learning. PMLR, 2017, pp. 398–407.
- [11] A. Talebpour and H. S. Mahmassani, “Influence of connected and autonomous vehicles on traffic flow stability and throughput,” Transportation Research Part C: Emerging Technologies, vol. 71, pp. 143–163, 2016.
- [12] L. Jin, M. Čičić, K. H. Johansson, and S. Amin, “Analysis and design of vehicle platooning operations on mixed-traffic highways,” IEEE Transactions on Automatic Control, 2020, available at https://ieeexplore.ieee.org/document/9246221.
- [13] M. Behrisch, L. Bieker, J. Erdmann, and D. Krajzewicz, “Sumo–simulation of urban mobility,” in The Third International Conference on Advances in System Simulation (SIMUL 2011), Barcelona, Spain, vol. 42, 2011.
- [14] S. C. Calvert, W. J. Schakel, and B. V. Arem, “Evaluation and modelling of the traffic flow effects of truck platooning,” Transportation Research Part C: Emerging Technologies, vol. 105, no. AUG., pp. 1–22, 2019.
- [15] I. Martínez and W.-L. Jin, “Stochastic lwr model with heterogeneous vehicles: Theory and application for autonomous vehicles,” Transportation Research Procedia, vol. 47, pp. 155–162, 2020.
- [16] G. Piacentini, P. Goatin, and A. Ferrara, “A macroscopic model for platooning in highway traffic,” SIAM Journal on Applied Mathematics, vol. 80, no. 1, pp. 639–656, 2020.
- [17] D. Lazar, S. Coogan, and R. Pedarsani, “Routing for traffic networks with mixed autonomy,” IEEE Transactions on Automatic Control, pp. 1–1, 2020.
- [18] L. Jin, M. Čičić, S. Amin, and K. H. Johansson, “Modeling the impact of vehicle platooning on highway congestion: A fluid queuing approach,” in Proceedings of the 21st International Conference on Hybrid Systems: Computation and Control (part of CPS Week). ACM, 2018, pp. 237–246.
- [19] S. Santini, A. Salvi, A. S. Valente, A. Pescapé, M. Segata, and R. L. Cigno, “A consensus-based approach for platooning with intervehicular communications and its validation in realistic scenarios,” IEEE Transactions on Vehicular Technology, vol. 66, no. 3, pp. 1985–1999, 2016.
- [20] S. Krauß, “Microscopic modeling of traffic flow: Investigation of collision free vehicle dynamics,” 1998.
- [21] J. Erdmann, “Lane-changing model in sumo,” Proceedings of the SUMO2014 modeling mobility with open data, vol. 24, pp. 77–88, 2014.
- [22] M. Čičić, L. Jin, and K. H. Johansson, “Coordinating vehicle platoons for highway bottleneck decongestion and throughput improvement,” IEEE Transactions on Intelligent Transportation Systems, 2021, accepted, available at https://arxiv.org/pdf/1907.13049.pdf.