This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Throughput of Freeway Networks under Ramp Metering Subject to Vehicle Safety Constraints

Milad Pooladsanj1, Ketan Savla2, and Petros A. Ioannou1 1M. Pooladsanj and P. A. Ioannou are with the Department of Electrical Engineering, University of Southern California, Los Angeles, CA 90007 USA [email protected]; [email protected].2K. Savla is with the Sonny Astani Department of Civil and Environmental Engineering, University of Southern California, Los Angeles CA 90089 USA [email protected]. K. Savla has financial interest in Xtelligent, Inc. This work was supported in part by NSF CMMI 1636377 and METRANS 19-17.
Abstract

Ramp metering is one of the most effective tools to combat traffic congestion. In this paper, we present a ramp metering policy for a network of freeways with arbitrary number of on- and off-ramps, merge, and diverge junctions. The proposed policy is designed at the microscopic level and takes into account vehicle following safety constraints. In addition, each on-ramp operates in cycles during which it releases vehicles as long as the number of releases does not exceed its queue size at the start of the cycle. Moreover, each on-ramp dynamically adjusts its release rate based on the traffic condition. To evaluate the performance of the policy, we analyze its throughput, which is characterized by the set of arrival rates for which the queue sizes at all on-ramps remain bounded in expectation. We show that the proposed policy is able to maximize the throughput if the merging speed at all the on-ramps is equal to the free flow speed and the network has no merge junction. We provide simulations to illustrate the performance of our policy and compare it with a well-known policy from the literature.

I Introduction

Ramp metering, which controls the inflow of traffic to a freeway, is one of the most efficient tools to combat traffic congestion. In this paper, we propose a microscopic (vehicle)-level ramp metering policy for freeway networks with arbitrary number of on- and off-ramps, merge and diverge junctions. Our approach builds up on our previous work [1], where we considered a single freeway with no merge or diverge junctions.

Ramp metering policies are commonly designed using macroscopic traffic flow models, which use spatio-temporal averaging of vehicle interactions to model the traffic. Many ramp metering policies have been proposed in the literature using this approach [2, 3]. These policies are either: (i) fixed-time [4], where no real-time traffic measurement is used, or (ii) local traffic-responsive [5], where on-ramps use only local measurements, or (iii) coordinated traffic-responsive [6, 7, 8], where on-ramps also use measurements from other parts of the network. In spite of its generality, the macroscopic-level approach is not suitable in the context of Connected and Automated Vehicles (CAVs) as it does not capture some of their capabilities, such as the ability to dissipate traffic disturbance and provide accurate measurements of traffic through Vehicle-to-Vehicle (V2V) and Vehicle-to-Infrastructure (V2I) communication systems [9, 10, 11]. Additionally, macroscopic-level ramp metering policies do not account for vehicle following safety constraints. As a consequence, the vehicles may find themselves in unsafe situations near an on-ramp, which can trigger congestion or even an accident. While adding safety filters on top of the ramp metering policy is a potential solution, it may adversely affect other performance criteria such as the total travel time [1].

A more systematic approach is to explicitly include vehicle safety, connectivity, and automation protocols in the ramp metering design by considering the vehicles at the microscopic level. Somewhat surprisingly, the literature on the microscopic-level ramp metering design is sparse and only limited to isolated on-ramps [12, 13]. Relatively little attention has been given to analyzing the overall performance of a freeway network. For example, are certain system-level performance metrics such as throughput optimized? To bridge this gap and expand our previous work for a single freeway [1], we consider a network of freeways to design and analyze a microscopic ramp metering policy that can maximize the throughput.

The freeway network is modeled as a directed acyclic graph, with arbitrary number of on- and off-ramps, merge, and diverge junctions. Vehicles arrive from outside the network to the on-ramps and join the on-ramp queues. The on-ramp arrivals are sampled from i.i.d Bernoulli processes that are independent across different on-ramps, and the vehicle routes are sampled independently from a routing matrix. Once released from an on-ramp, each vehicle follows standard safety and speed rules until it reaches its destination off-ramp. Vehicles in the network have the same acceleration and braking capabilities, and are equipped with V2V and V2I communication systems. We leverage these connectivity capabilities to design a coordinated traffic-responsive ramp metering policy. The proposed policy operates under vehicle following safety constraints, where new vehicles are released only if there is enough gap on the mainline. Additionally, the on-ramps operate under synchronous cycles during which each on-ramp does not release more vehicles than its queue size at the start of the cycle. Moreover, an on-ramp’s release rate is adjusted during a cycle according to the traffic condition in the network.

We evaluate the performance of a policy in terms of its throughput, which is characterized by the set of arrival rates under which the queue sizes at every on-ramp remain bounded in expectation. We show that the proposed policy is able to maximize the throughput when the merging speed at all the on-ramps equals the free flow speed and the network has no merge junction, e.g., a single freeway.

The remainder of the paper is organized as follows: in Section II, we describe the network structure, the vehicles behavior in the network, the demand model, and the performance metric. We provide our ramp metering policy and its performance analysis in Section III. We provide simulation results in Section IV, and conclude the paper in Section V.

We collect key notations to be used throughout the paper. Let \mathbb{N} and 0\mathbb{N}_{0} respectively denote the set of positive integers, and non-negative integers. For mm\in\mathbb{N}, [m][m] denotes the set {1,,m}\{1,\ldots,m\}. For a set AA, |A||A| denotes its cardinality.

II Problem Formulation

II-A Network Topology

Refer to caption
Figure 1: An acyclic network with 44 on-ramps (blue), 11 off-ramp (orange), 66 freeway segments, 11 merge node, and 11 diverge node.

The network is modeled as a directed acyclic graph. The nodes represent either junctions (merge or diverge), or sources/sinks; the edges represent either single-lane freeway segments that connect two junctions, or on- and off-ramps that connect sources to merge nodes and diverge nodes to sinks, respectively; see Figure 1. We refer to any subset of the freeway segments as the mainline. For simplicity, we assume that each source has exactly one outgoing on-ramp, and each sink has exactly one incoming off-ramp. We distinguish between two types of merge node: on-ramp nodes, which have at least one incoming on-ramp, and regular merge nodes (or simply merge nodes), which have no incoming on-ramp. Similarly, we distinguish between an off-ramp node and a diverge node. We denote the set of on-ramp nodes by \mathcal{M}. For simplicity, we assume that each on-ramp node has exactly one incoming on-ramp edge. Thus, \mathcal{M} also specifies the set of on-ramp edges. For on-ramp ii\in\mathcal{M}, we let i+\mathcal{M}_{i}^{+} be the set of its successor on-ramps, i.e., the set of all on-ramps that are immediately downstream of on-ramp ii. An element of i+\mathcal{M}_{i}^{+} is denoted by i+i^{+}. Similarly, we let i\mathcal{M}_{i}^{-} be the set of its predecessor on-ramps. We say that two edges are adjacent if the head of one edge is the tail of the other edge. A route is a sequence of adjacent edges, with the first edge being an on-ramp and the last edge being an off-ramp. The collection of all routes is denoted by 𝒫\mathcal{P}.

Vehicles arrive to on-ramps according to some exogenous stochastic process and join the on-ramp queues. For any on-ramp, we assume a point queue model with unlimited capacity. We also assume that vehicles that reach their destination off-ramp node, immediately leave the network without creating any obstruction. Once a vehicles is released from an on-ramp into the mainline, it follows standard speed and safety rules, which will be explained in the following section.

II-B Vehicle-Level Objectives

Vehicles have the same length LL, the same acceleration and braking capabilities, and are equipped with V2V and V2I communication systems. The exact information communicated by a vehicle will be specified in Section III, where we introduce our ramp metering policy. We use the term ego vehicle to refer to a specific vehicle under consideration, and denote it by ee. Consider a vehicle following scenario and let vev_{e} (resp. vlv_{l}) be the speed of the ego vehicle (resp. its leading vehicle), and SeS_{e} be the safety distance between the two vehicles required to avoid collision. We assume that SeS_{e} satisfies Se=hve+S0+ve2vl22|amin|S_{e}=hv_{e}+S_{0}+\frac{v^{2}_{e}-v^{2}_{l}}{2|a_{\min}|}, which is calculated based on an emergency stopping scenario with details given in [14]. Here, h>0h>0 is a safe time headway constant, S0>0S_{0}>0 is an additional constant gap, and amin<0a_{\min}<0 is the minimum possible deceleration of the leading vehicle. We consider two general modes of operation for each vehicle: the speed tracking mode and the safety mode. The main objective in the speed tracking mode is to adjust and maintain the speed to the free flow speed VfV_{f} when the ego vehicle is far from any leading vehicle; the main objective in the safety mode is to avoid rear-end collision once the ego vehicle gets close to the leading vehicle.

We assume that vehicles obey the following rules:

  • (VC1)

    The ego vehicle maintains the constant speed VfV_{f} if:

    • (a)

      it is at a safe distance with respect to its leading vehicle 111At a diverge node, the leading vehicle is uniquely determined by the ego vehicle’s route choice., i.e., yeSey_{e}\geq S_{e}, where yey_{e} is the distance between the two vehicles. Note that if its leading vehicle is also at the constant speed VfV_{f}, then yeSe=hVf+S0y_{e}\geq S_{e}=hV_{f}+S_{0}. This distance is equivalent to a time headway of at least τ:=h+(S0+L)/Vf\tau:=h+(S_{0}+L)/V_{f} between the front bumpers of the two vehicles.

    • (b)

      it is near a merge node and predicts to be at a safe distance with respect to its leading vehicle after reaching the merging node, i.e., y^e(tm)S^e(tm):=hv^e(tm)+S0+v^e2(tm)v^l^2(tm)2|amin|\hat{y}_{e}(t_{m})\geq\hat{S}_{e}(t_{m}):=h\hat{v}_{e}(t_{m})+S_{0}+\frac{\hat{v}_{e}^{2}(t_{m})-\hat{v}_{\hat{l}}^{2}(t_{m})}{2|a_{\min}|}, where tmt_{m} is the time the ego vehicle predicts to reach the merge node, y^e\hat{y}_{e} is the predicted distance between the two vehicles, and v^\hat{v} is the predicted speed. Each vehicle uses the following speed prediction rule to calculate its predicted time of merging, position, and speed: if in the speed tracking mode, it assumes that it remains in this mode in the future; if in the safety mode, it assumes a constant speed trajectory. These information are then communicated among all vehicles near a merge node via V2V communication.

  • (VC2)

    there exists TfreeT_{\text{free}} such that for any initial condition, vehicles reach the free flow equilibrium after at most TfreeT_{\text{free}} time if no other vehicle is released from the on-ramps. The free flow equilibrium refers to a state where each vehicle is moving at the constant speed VfV_{f} and will maintain this speed in the future if no additional vehicle is released from the on-ramps.

Remark 1.

Due to lack of space, we assume that the ego vehicle immediately joins the mainline after being released. In other words, we ignore the vehicle dynamics between release from an on-ramp and joining the mainline. We also assume the speed at which the ego vehicle joins the mainline, i.e., the merging speed, is VfV_{f}. The general case where these assumptions are relaxed can be treated similar to [1]. We should emphasize, however, that the merging speed assumption can affect the performance of a ramp metering policy; see Remark 3.

In order to have a complete description of the network, we also need to specify the demand at the microscopic level. We will do this in the next section.

II-C Demand Model

For convenience in the analysis, we adopt a discrete time setting with time steps of size τ\tau, where τ\tau is the minimum safe time headway at the free flow speed defined in Section II-B. Let vehicles arrive to on-ramp ii\in\mathcal{M} according to an i.i.d. Bernoulli process with parameter λi[0,1]\lambda_{i}\in[0,1]. The arrival processes are assumed to be independent across different on-ramps. That is, at any given time step, the probability that a vehicle arrives at the ii-th on-ramp is λi\lambda_{i} independent of everything else. We let λ:=[λi]\lambda:=[\lambda_{i}]. A vehicle arriving to on-ramp ii, chooses to take route p𝒫p\in\mathcal{P} with probability Rip[0,1]R_{ip}\in[0,1] independent of everything else. Naturally, for every on-ramp ii, pRip=1\sum_{p}R_{ip}=1, and Rip=0R_{ip}=0 for any pp that is not compatible with the network topology. We assume that the vehicles do not change their route choice while in the network. We stack the routing probabilities in a routing matrix R=[Rip]R=[R_{ip}]. Let ρi=jp:ipλjRjp\rho_{i}=\sum_{j}\sum_{p:i\in p}\lambda_{j}R_{jp} be the average induced load on on-ramp ii, where the notation ipi\in p means that on-ramp ii’s node is on route pp. This is the average number of arrivals in the network that wants to cross on-ramp ii’s node in order to reach their destination. The performance of a ramp metering policy is quantified in terms of the maximum demand it can handle. We will formalize this in the next section.

II-D Ramp Metering and Performance Metric

To conveniently track vehicle locations in discrete time, we introduce the notion of slot. A slot is associated with a particular point on the mainline at a particular time. Consider the maximum number of distinct slots that can be placed on a freeway segment, such that the distance between adjacent slots is hVf+S0+LhV_{f}+S_{0}+L, i.e., the minimum safety distance at the free flow speed as per (VC1) plus the vehicle length. Consider a configuration of these slots at t=0t=0. At the end of each time step τ\tau, each slot replaces the next slot with the last slot replacing the first slot of that segment. Without loss of generality, we let the location of the first and last slots of each segment coincide with the tail and head of that segment at the end of each time step. Thus, if the ego vehicle is released from an on-ramp at the end of a given time step, its location coincides with a slot.

For ii\in\mathcal{M}, let Qi(t)Q_{i}(t) be the set of the route choices of the vehicles waiting at on-ramp ii, arranged in the order of their arrival, at time tt. We use |Qi(t)||Q_{i}(t)| to denote the queue size at on-ramp ii at time tt. Let |Q(t)|=[|Qi(t)|]|Q(t)|=[|Q_{i}(t)|] be the vector of queue sizes at all the on-ramps at time tt. A ramp metering policy is a function π(t)=[πi(t)]\pi(t)=[\pi_{i}(t)], where πi(t)=1\pi_{i}(t)=1 if a vehicle is released from on-ramp ii at time tt, and πi(t)=0\pi_{i}(t)=0 otherwise. The key metric we use to evaluate the performance of a ramp metering policy is its throughput, which is defined as follows: for a given ramp metering policy π\pi and routing matrix RR, let Uπ,RU_{\pi,R} be the set of λ\lambda’s for which lim supt𝔼[|Qi(t)|]<\limsup_{t\to\infty}\mathbb{E}\left[|Q_{i}(t)|\right]<\infty for all ii\in\mathcal{M}. We say that the network is under-saturated if λUπ,R\lambda\in U_{\pi,R}; otherwise, it is called saturated. The throughput is the boundary of the set Uπ,RU_{\pi,R}. We are interested in finding a ramp metering policy π\pi such that for every RR and any other policy π\pi^{\prime}, we have Uπ,RUπ,RU_{\pi^{\prime},R}\subseteq U_{\pi,R}. We say such a policy maximizes the throughput.

III Main Results

III-A Dynamic Release Rate Policy with Rate Allocation

Refer to caption
Figure 2: A configuration of slots (empty circles) at time t=0t=0. For this configuration and by choosing bi=3b_{i}=3, ai=1a_{i}=1, i[3]i\in[3], and b4=a4=1b_{4}=a_{4}=1 the release times can be made conflict-free.

In this section we present and analyze our ramp metering policy. The policy presented is coordinated, in the sense that each on-ramp may require information from other parts of the network in addition to the information obtained from its vicinity. However, it does not require any information about the demand or the route choice of the vehicles, i.e., it is reactive [15]. The policy operates under synchronous cycles of fixed length during which each on-ramp does not release more vehicles than its queue size at the beginning of the cycle. The synchronization of cycles is done once offline. This policy allocates a fixed maximum release rate to each on-ramp. In addition, it imposes dynamic minimum time gap criterion between release of successive vehicles from the same on-ramp. Changing the time gap between release of successive vehicles by an on-ramp is akin to changing its release rate. An inner approximation to the throughput of this policy is provided, and is then compared to an outer approximation.

Definition 1.

(Dynamic Release Rate policy with rate Allocation (DRRA)) The policy works in cycles of fixed length TτT\tau, where TT\in\mathbb{N}. At the beginning of the kk-th cycle at tk=(k1)Tτt_{k}=(k-1)T\tau, each on-ramp allocates itself a “quota” equal to the queue size at that on-ramp at tkt_{k}. At time t[tk,tk+1]t\in[t_{k},t_{k+1}] during the kk-th cycle, on-ramp ii releases the ego vehicle if:

  • (M1)

    t=(bim+n)τt=(b_{i}m+n)\tau, where m0m\in\mathbb{N}_{0}, n{n1,,nai}[bi]n\in\{n_{1},\ldots,n_{a_{i}}\}\subseteq[b_{i}], and ai,bia_{i},b_{i}\in\mathbb{N}, aibia_{i}\leq b_{i}, are such that the release times at all the on-ramps are conflict-free, meaning that no two vehicles that move at the constant speed VfV_{f} after being released can simultaneously reach a merging node; see Figure 2. The term ai/bia_{i}/b_{i} is the maximum release rate allocated to on-ramp ii.

  • (M2)

    the on-ramp has not reached its quota.

  • (M3)

    the ego vehicle is at a safe distance with respect to its leading and following vehicles at the moment of release.

  • (M4)

    at least g(t)g(t) time has passed since the release of the last vehicle from on-ramp ii.

Once an on-ramp reaches its quota, it does not release a vehicle during the rest of the cycle. The minimum time gap g(t)g(t) is piecewise constant, updated periodically at t=Tper,2Tper,t=T_{\text{per}},2T_{\text{per}},\ldots, as described in Algorithm 1. In Algorithm 1, XfX_{f} is defined as follows: for the ego vehicle, let xeT=((yeSe)𝟙{ye<Se},veVf,ae)x_{e}^{T}=((y_{e}-S_{e})\mathds{1}_{\{y_{e}<S_{e}\}},v_{e}-V_{f},a_{e}), where 𝟙{ye<Se}\mathds{1}_{\{y_{e}<S_{e}\}} is the indicator function, and aea_{e} is the acceleration. The state of all the vehicles is collectively denoted as XT=(xeT)e[n]X^{T}=\left(x_{e}^{T}\right)_{e\in[n]}, where nn is the total number of vehicles on the mainline. We let Xf1(t):=X(t)X_{f_{1}}(t):=\|X(t)\|. Moreover, for tTpert\geq T_{\text{per}}, let Xf2(t):=|y^e(tm)S^e(tm)|X_{f_{2}}(t):=\sum|\hat{y}_{e}(t_{m})-\hat{S}_{e}(t_{m})|, where the sum is over all the vehicles in the time interval [tTper,t][t-T_{\text{per}},t] that predicted to violate the safety distance after reaching a merging node. We let Xf(t)=Xf1(t)+Xf2(t)X_{f}(t)=X_{f_{1}}(t)+X_{f_{2}}(t). Roughly, Xf(t)X_{f}(t) measures the distance to the free flow equilibrium state at time tt.

Algorithm 1 Update rule for the minimum time gap between release of successive vehicles
design constants: Tper>0,γ1>0T_{\text{per}}>0,\gamma_{1}>0, γ2>0\gamma_{2}>0, θ>0\theta^{\circ}>0,                         β>1\beta>1  initial condition: g(0)=0,θ(0)=θg(0)=0,\leavevmode\nobreak\ \theta(0)=\theta^{\circ},                         Xf(0)=Xf1(0)X_{f}(0)=X_{f_{1}}(0) for t=Tper,2Tper,t=T_{\text{per}},2T_{\text{per}},\cdots do
if Xf(t)max{Xf(tTper)γ1,0}X_{f}(t)\leq\max\{X_{f}(t-T_{\text{per}})-\gamma_{1}{},0\}  then
     θ(t)θ(tTper)\theta(t)\leftarrow\theta(t-T_{\text{per}})
     g(t)max{g(tTper)γ2,0}g(t)\leftarrow\max\{g(t-T_{\text{per}})-\gamma_{2},0\}
else
     θ(t)βθ(tTper)\theta(t)\leftarrow\beta\theta(t-T_{\text{per}})
     g(t)g(tTper)+θ(t)g(t)\leftarrow g(t-T_{\text{per}})+\theta(t)
end if end for
Theorem 1.

For any initial condition, TT\in\mathbb{N}, design constants in Algorithm 1, and ai,bia_{i},b_{i}\in\mathbb{N} such that aibia_{i}\leq b_{i}, ii\in\mathcal{M}, and the release times are conflict-free, the DRRA policy keeps the network under-saturated if ρi<ai/bi\rho_{i}<a_{i}/b_{i} for all ii\in\mathcal{M}.

Proof.

See Appendix B. ∎

V2I communication requirements: This policy uses information about |Q||Q| (if T>1T>1) and XfX_{f}. After a finite time, the mainline vehicles only communicate Xf1X_{f_{1}} to all the on-ramps at the end of each update period TperT_{\text{per}}. The number of vehicles that constitute Xf1X_{f_{1}} is no more than the total number of slots. Furthermore, at the end of each time step during a cycle, the vehicles near an on-ramp communicate their state to that on-ramp for the safety gap evaluation.

Remark 2.

A non-reactive version of the DRRA policy with better throughput: suppose that the first vehicle in every on-ramp’s queue also communicates its route choice to that on-ramp. Consider a version of the DRRA policy, where, in addition to the releasing times in (M1), on-ramp ii releases the ego vehicle at t=mτt=m\tau, m0m\in\mathbb{N}_{0}, if the ego vehicle’s route contains no merge node. This policy is non-reactive since it uses the route choice of the vehicles, but gives a better throughput since more vehicles are released compared to the DRRA policy. We compare the performance of these two policies via simulations in Section IV.

Remark 3.

Recall from Section II-B that we made the simplifying assumption that the merging speed at all the on-ramps is VfV_{f}. One can relax this assumption, but the throughput would suffer due to the larger safety distance required at the moment of merging. However, the main steps of the proof of Theorem 1 remain the same and a similar bound for the throughput can be obtained; see [1, Theorem 2].

III-B A Necessary Condition

We now provide a necessary condition for under-saturation, against which we benchmark the sufficient condition in Theorem 1. Let D¯π,p(mτ)\bar{D}_{\pi,p}(m\tau) be the cumulative number of vehicles that has crossed point pp on the mainline up to time mτm\tau, m0m\in\mathbb{N}_{0}, under the ramp-metering policy π\pi. Then, the crossing rate at point pp is defined as D¯π,p(mτ)/m\bar{D}_{\pi,p}(m\tau)/m and the “long-run” crossing rate is lim supmD¯π,p(mτ)/m\limsup_{m\rightarrow\infty}\bar{D}_{\pi,p}(m\tau)/m. Recall the definition of ρi\rho_{i} for on-ramp ii’s node. We can extend this definition to every node in the network that is not a source or sink. Let ρ:=maxiρi\rho:=\max_{i}\rho_{i}, where ii is any node that is not a source or sink.

Theorem 2.

Suppose that the long-run crossing rate is no more than one vehicle per τ\tau seconds for at least one point on each freeway segment. If the network is under-saturated under the policy π\pi, then ρ1\rho\leq 1.

Proof.

The proof is similar to [1, Theorem 4]. ∎

Remark 4.

The long-run crossing rate condition in Theorem 2 is very mild in practice. In particular, if the traffic flow at some point on each freeway segment is less than the mainline capacity, then the long-run crossing rate condition is also satisfied [1].

Remark 5.

Consider a network with no merge node, and note that for any node jj that is not a source or sink, we have ρjρi\rho_{j}\leq\rho_{i} for some ii\in\mathcal{M}. For such a network, one can set ai=bi=1a_{i}=b_{i}=1 for all ii\in\mathcal{M} in the DRRA policy. The sufficient condition of Theorem 1 then becomes maxiρi=ρ<1\max_{i\in\mathcal{M}}\rho_{i}=\rho<1. Combining with Theorem 2, it follows for any routing matrix RR and any other ramp metering policy π\pi^{\prime} that Uπ,RUDRRA,RU_{\pi^{\prime},R}\subseteq U_{\text{DRRA},R}, except, maybe at the boundary of UDRRA,RU_{\text{DRRA},R}. Since the boundary of UDRRA,RU_{\text{DRRA},R} has zero volume (measure), we can conclude that the DRRA policy maximizes the throughput for all practical purposes.

IV Simulations

Refer to caption
Figure 3: The average total travel time (TTTn\text{TTT}_{n}) under the Safe-ALINEA and DRRA policies, where nn is the number of completed trips.
Refer to caption
(a)
Refer to caption
(b)
Figure 4: The two networks used in Sections IV-B and IV-C. Figure (a) is a three-legged merge junction, and figure (b) is constructed by making the first network cyclic.

We simulate the DRRA policy and compare its performance with its non-reactive version introduced in Remark 2 and a well-known policy from the literature. All the simulations were performed in MATLAB. We use the following setup in all the simulations: let h=1.5[s]h=1.5\leavevmode\nobreak\ [\text{s}], S0=4[m]S_{0}=4\leavevmode\nobreak\ [\text{m}], L=4.5[m]L=4.5\leavevmode\nobreak\ [\text{m}], and Vf=15[m/s]V_{f}=15\leavevmode\nobreak\ [\text{m/s}]. The initial queue size at all the on-ramps is set to zero, and vehicles arrive at the on-ramps according to i.i.d Bernoulli processes with the same rate λ\lambda.

IV-A Total Travel Time

In this section, we compare the total travel time under the DRRA policy and ALINEA [5], which is a macroscopic ramp metering policy. We also add the safety filter (M3) on top of ALINEA, and use the name Safe-ALINEA to refer to the resulting policy. An on-ramp’s outflow in the ALINEA policy is determined according to the following equation:

r(k)=r(k1)+Kr(o^o(k)).r(k)=r(k-1)+K_{r}(\hat{o}-o(k)).

Here, r()r(\cdot) is the on-ramp’s outflow, KrK_{r} is a positive design constant, o()o(\cdot) is the occupancy of the mainline downstream of the on-ramp, and o^\hat{o} is the desired occupancy. We use time steps of size 60[s]60\leavevmode\nobreak\ [\text{s}], Kr=70[veh/h]K_{r}=70\leavevmode\nobreak\ [\text{veh/h}] [5], and o^=13%\hat{o}=13\leavevmode\nobreak\ \% corresponding to the capacity flow. For the DRRA policy, we use the following parameters: T=13T=13, Tper=2τT_{\text{per}}=2\tau, γ1=50\gamma_{1}=50, γ2=10\gamma_{2}=10, θ=0.1\theta^{\circ}=0.1, β=1.01\beta=1.01, and ai=bi=1a_{i}=b_{i}=1, i=1,2,3i=1,2,3. The network is a ring road with perimeter 1860[m]1860\leavevmode\nobreak\ [\text{m}] and 33 on- and off-ramps. The on-ramps are located symmetrically at locations 0, 600600, and 12001200; the off-ramps are also located symmetrically 155[m]155\leavevmode\nobreak\ [\text{m}] upstream of each on-ramp. We consider a congested initial condition where the initial number of vehicles on the mainline is 100100; each vehicle is at the constant speed of 6.7[m/s]6.7\leavevmode\nobreak\ [\text{m/s}], and is at the distance h×6.7+S014.1[m]h\times 6.7+S_{0}\approx 14.1\leavevmode\nobreak\ [\text{m}] from its leading vehicle. The details of the vehicle following behavior is given in [1]. We also let λ=0.54\lambda=0.54 and

R1=(0.20.70.100.80.20.500.5),R_{1}=\begin{pmatrix}0.2&0.7&0.1\\ 0&0.8&0.2\\ 0.5&0&0.5\end{pmatrix},

where columns specify the off-ramps.

We evaluate the average Total Travel Time (TTTn), which is computed by averaging the total travel time of the first nn vehicles that complete their trips. The total travel time of a vehicle equals its waiting time in an on-ramp queue plus the time it spends on the freeway to reach its destination. Figure 3 shows the simulation result. From the figure, we can see that the DRRA policy provides significant improvement in the TTTn compared to the Safe-ALINEA policy. In fact, the network becomes saturated under the Safe-ALINEA policy, which implies that the TTTn will grow steadily with nn.

IV-B Acyclic Network with Merge Junction

The second network is a symmetric three-legged merge junction; see Figure 4(a). The total length of freeway segments on each leg is 310[m]310\leavevmode\nobreak\ [\text{m}]. on-ramps 11 and 22 are located at the beginning, while on-ramp 33 is located in the middle of their corresponding legs. Similarly, off-ramps 11 and 22 are in the middle, while off-ramp 33 is at the end of their corresponding legs. The mainline is initially empty and

R2=(0.600.400.60.4001).R_{2}=\begin{pmatrix}0.6&0&0.4\\ 0&0.6&0.4\\ 0&0&1\end{pmatrix}.

The on-ramps use the DRRA policy with T=1T=1 and other parameters chosen similar to Section IV-A. In addition, the release times of on-ramps 11 and 22 are set to t=(2m+1)τt=(2m+1)\tau and t=(2m+2)τt=(2m+2)\tau, respectively, while t=mτt=m\tau for on-ramp 33, m0m\in\mathbb{N}_{0} (thus, ai=1a_{i}=1, bi=2b_{i}=2, i[2]i\in[2], and a3=b3=1a_{3}=b_{3}=1). For the given routing matrix, the inner and outer estimates of the under-saturation region given by Theorems 1 and 2 are λ<1/2\lambda<1/2 and λ<5/9\lambda<5/9, respectively. From simulations, the under-saturation regions of the DRRA policy and its non-reactive version are estimated to be λ<1/2\lambda<1/2 and λ<5/9\lambda<5/9, respectively. This suggest that there is a gap between the inner and outer estimates of the under-saturation region in the DRRA policy, but not in its non-reactive version.

IV-C Cyclic Network with Merge Juction

The third network is constructed by making the second network cyclic; see Figure 4(b). The length of the additional segment in this network is 610[m]610\leavevmode\nobreak\ [\text{m}]. The mainline is initially empty and

R3=(0.600.400.60.40.500.5).R_{3}=\begin{pmatrix}0.6&0&0.4\\ 0&0.6&0.4\\ 0.5&0&0.5\end{pmatrix}.

The on-ramps again use the DRRA policy with the same parameters as in the previous section. The inner and outer estimates of the under-saturation region are λ<1/3\lambda<1/3 and λ<5/9\lambda<5/9, respectively. From simulations, the under-saturation regions of the DRRA policy and its non-reactive version are estimated to be λ<0.4\lambda<0.4 and λ<5/9\lambda<5/9, respectively. Therefore, Theorem 1 holds for this cyclic network. Based on the simulation results and the results in [1] given for a ring road network, we conjecture that Theorem 1 holds for general cyclic networks.

V Conclusion and Future Work

In this paper, we provided a microscopic ramp metering policy for freeway networks with arbitrary number of on- and off-ramps, merge, and diverge junctions. We analyzed the throughput of our policy for acyclic networks, and showed via simulations that it achieves a similar throughput for cyclic networks. We plan to confirm this observation analytically in the future. Moreover, we plan to extend our analysis to policies with tighter estimates of the throughput.

References

  • [1] M. Pooladsanj, K. Savla, and P. A. Ioannou, “Saturation region of freeway networks under safe microscopic ramp metering,” arXiv preprint arXiv:2207.02360, 2022.
  • [2] M. Papageorgiou and A. Kotsialos, “Freeway ramp metering: An overview,” IEEE transactions on intelligent transportation systems, vol. 3, no. 4, pp. 271–281, 2002.
  • [3] M. Papageorgiou, C. Diakaki, V. Dinopoulou, A. Kotsialos, and Y. Wang, “Review of road traffic control strategies,” Proceedings of the IEEE, vol. 91, no. 12, pp. 2043–2067, 2003.
  • [4] J. A. Wattleworth, “Peak-period analysis and control of a freeway system,” tech. rep., Texas Transportation Institute, 1965.
  • [5] M. Papageorgiou, H. Hadj-Salem, J.-M. Blosseville, et al., “Alinea: A local feedback control law for on-ramp metering,”
  • [6] M. Papageorgiou, J.-M. Blosseville, and H. Haj-Salem, “Modelling and real-time control of traffic flow on the southern part of boulevard périphérique in paris: Part ii: Coordinated on-ramp metering,” Transportation Research Part A: General, vol. 24, no. 5, pp. 361–370, 1990.
  • [7] I. Papamichail, A. Kotsialos, I. Margonis, and M. Papageorgiou, “Coordinated ramp metering for freeway networks–a model-predictive hierarchical control approach,” Transportation Research Part C: Emerging Technologies, vol. 18, no. 3, pp. 311–331, 2010.
  • [8] G. Gomes and R. Horowitz, “Optimal freeway ramp metering using the asymmetric cell transmission model,” Transportation Research Part C: Emerging Technologies, vol. 14, no. 4, pp. 244–262, 2006.
  • [9] R. E. Stern, S. Cui, M. L. Delle Monache, R. Bhadani, M. Bunting, M. Churchill, N. Hamilton, H. Pohlmann, F. Wu, B. Piccoli, et al., “Dissipation of stop-and-go waves via control of autonomous vehicles: Field experiments,” Transportation Research Part C: Emerging Technologies, vol. 89, pp. 205–221, 2018.
  • [10] M. Pooladsanj, K. Savla, and P. Ioannou, “Vehicle following over a closed ring road under safety constraint,” in 2020 IEEE Intelligent Vehicles Symposium (IV), pp. 413–418, IEEE.
  • [11] M. Pooladsanj, K. Savla, and P. A. Ioannou, “Vehicle following on a ring road under safety constraints: Role of connectivity and coordination,” IEEE Transactions on Intelligent Vehicles, vol. 8, no. 1, pp. 628–638, 2022.
  • [12] J. Rios-Torres and A. A. Malikopoulos, “Automated and cooperative vehicle merging at highway on-ramps,” IEEE Transactions on Intelligent Transportation Systems, vol. 18, no. 4, pp. 780–789, 2016.
  • [13] J. Rios-Torres and A. A. Malikopoulos, “A survey on the coordination of connected and automated vehicles at intersections and merging at highway on-ramps,” IEEE Transactions on Intelligent Transportation Systems, vol. 18, pp. 1066–1077, May 2017.
  • [14] P. Ioannou and C. Chien, “Autonomous intelligent cruise control,” IEEE Transactions On Vehicular Technology, vol. 42, no. 4, pp. 657–672, 1993.
  • [15] I. Papamichail and M. Papageorgiou, “Traffic-responsive linked ramp-metering control,” IEEE Transactions on Intelligent Transportation Systems, vol. 9, no. 1, pp. 111–121, 2008.
  • [16] S. P. Meyn and R. L. Tweedie, Markov chains and stochastic stability. Springer Science & Business Media, 2012.
  • [17] G. B. Folland, Real analysis: modern techniques and their applications, vol. 40. John Wiley & Sons, 1999.

Appendix A Performance Analysis Tool

Consider an initial condition where the vehicles are in the free flow equilibrium, and the location of each vehicle coincides with a slot. Thus, their location coincide with a slot in the future. Thereafter, during each time step, the following sequence of events occurs: (i) the slots on each segment replace the next slot with the last slot replacing the first slot; the numbering of slots is reset with the new first slot numbered 11, and the rest numbered in an increasing order in the direction of the flow, (ii) vehicles that reach their destination off-ramp exit the network, (iii) if permitted by the ramp metering policy, a vehicle is released into a slot at the free flow speed. Since the initial location of all vehicles coincide with a slot and the released times are conflict-free, the location of the released vehicle will also coincide with a slot in the future.

For the aforementioned initial condition, we can cast the network dynamics as a discrete-time Markov chain, with time steps of size τ\tau. Let M(n)M(n) be the set of the routes of the occupants of all the slots: M(n)=pM_{\ell}(n)=p if the vehicle occupying slot \ell at time nn has to take route pp, and M(n)=0M_{\ell}(n)=0 if slot \ell is empty at time nn. Consider the following discrete-time Markov chain with the state Z(n):=Y(n)Z_{\triangle}(n):=Y(n\triangle), n0n\geq 0, where Y(n):=(Q(n),M(n))Y(n):=(Q(n),M(n)), and \triangle\in\mathbb{N} will be specified for the policy being analyzed. The transition probabilities are determined by the ramp metering policy being analyzed, but will not be specified explicitly for brevity. For the ramp metering policy considered in this paper, the state Y(n)=(0,0)Y(n)=(0,0) is reachable from all other states, and (Y(n+1)=(0,0)|Y(n)=(0,0))>0\mathbb{P}\left(Y(n+1)=(0,0)\leavevmode\nobreak\ |\leavevmode\nobreak\ Y(n)=(0,0)\right)>0. Hence, the Markov chain ZZ_{\triangle} is irreducible and aperiodic.

Appendix B Proof of Theorem 1

For any initial condition, we first show that there exists some k0k_{0}\in\mathbb{N} such that for all kk0k\geq k_{0}, Xf(kTper)=0X_{f}(kT_{\text{per}})=0. This and (VC1) imply that every vehicle moves at the free flow speed for all tt0t\geq t_{0}. Hence, after a finite time after t0t_{0}, g()=0g(\cdot)=0, and the location of every vehicle coincides with a slot thereafter. We can then use the Markov chain setting from Appendix A.

Using an argument similar to [1], one can easily show that Xf(kTper)X_{f}(kT_{\text{per}}) is uniformly bounded for all kk\in\mathbb{N}. To show Xf()=0X_{f}(\cdot)=0 after a finite time, we use a proof by contradiction. Suppose that Xf(kTper)0X_{f}(kT_{\text{per}})\neq 0 for infinitely many kk\in\mathbb{N}. Since Xf(kTper)X_{f}(kT_{\text{per}}) is uniformly bounded, there exists an infinite sequence {kn}n1\{k_{n}\}_{n\geq 1} such that Xf(knTper)>Xf((kn1)Tper)γ1X_{f}(k_{n}T_{\text{per}})>X_{f}((k_{n}-1)T_{\text{per}})-\gamma_{1} for all n1n\geq 1. This implies that θ(knTper)=βθ((kn1)Tper)\theta(k_{n}T_{\text{per}})=\beta\theta((k_{n}-1)T_{\text{per}}) for all n1n\geq 1. Since θ()\theta(\cdot) is non-decreasing and β>1\beta>1, it follows that limtθ(t)=\lim_{t\rightarrow\infty}\theta(t)=\infty, which in turn implies that lim suptg(t)=\limsup_{t\rightarrow\infty}g(t)=\infty. Let kfk_{f} be such that g(kfTper)>||(Tfree+Tempty)(1+γ2/Tper)g(k_{f}T_{\text{per}})>|\mathcal{M}|(T_{\text{free}}+T_{\text{empty}})(1+\gamma_{2}/T_{\text{per}}), where TemptyT_{\text{empty}} is an upper bound on the additional time required for all vehicles on the mainline to leave the network after reaching the free flow state, if no additional vehicle is released. Let tf:=kfTpert_{f}:=k_{f}T_{\text{per}}. Note that for all t[tf,tf+||(Tfree+Tempty)]t\in[t_{f},t_{f}+|\mathcal{M}|(T_{\text{free}}+T_{\text{empty}})], g(t)>||(Tfree+Tempty)g(t)>|\mathcal{M}|(T_{\text{free}}+T_{\text{empty}}). Thus, each on-ramp releases at most one vehicle during the interval [tf,tf+||(Tfree+Tempty)][t_{f},t_{f}+|\mathcal{M}|(T_{\text{free}}+T_{\text{empty}})]. Hence, there exists a time interval of length at least (Tfree+Tempty)(T_{\text{free}}+T_{\text{empty}}) in [tf,tf+||(Tfree+Tempty)][t_{f},t_{f}+|\mathcal{M}|(T_{\text{free}}+T_{\text{empty}})] during which no on-ramp releases a vehicle. Condition (VC2) then implies that all vehicles reach the free flow state after at most TfreeT_{\text{free}} time, and leave the network after at most an additional TemptyT_{\text{empty}} time, at the end of which (say at time t0t_{0}), the network becomes empty. Thus, for all kk0:=t0/Tperk\geq k_{0}:=\lceil t_{0}/T_{\text{per}}\rceil, Xf(kTper)=0X_{f}(kT_{\text{per}})=0, which is a contradiction to the assumption that Xf(kTper)0X_{f}(kT_{\text{per}})\neq 0 for infinitely many kk.

Refer to caption
Figure 5: For on-ramp 44, U40={{4}}U_{4}^{0}=\{\{4\}\}, U41={{2,3}}U_{4}^{1}=\{\{2,3\}\}, and U42={{1,1},{1,3},{1,2}}U_{4}^{2}=\{\{1,1\},\{1,3\},\{1,2\}\}.

We assume that the vehicles are initially in the free flow equilibrium state, and the location of each vehicle coincides with a slot at t=0t=0 as in Appendix A. For the sake of readability, we present proofs of intermediate claims at the end. We adopt the Markov chain setting from Appendix A with {Z(n)}n0\{Z_{\triangle}(n)\}_{n\geq 0} as the Markov chain, where =kT\triangle=kT and kk\in\mathbb{N} is to be determined. For on-ramp ii\in\mathcal{M}, let Ni(n)N_{i}(n) be the degree of on-ramp ii representing the number of vehicles in the network at time nn that need to cross on-ramp ii’s node in order to reach their destination. We recursively define the following collections of index sets:

Ui0={{i}},Ui1={i},U_{i}^{0}=\{\{i\}\},\leavevmode\nobreak\ U_{i}^{1}=\{\mathcal{M}_{i}^{-}\},

and for k2k\geq 2,

Uik={jIIj:IUik1,Ij={j}orj}Uik1,U_{i}^{k}=\{\cup_{j\in I}I_{j}:\leavevmode\nobreak\ I\in U_{i}^{k-1},I_{j}=\{j\}\leavevmode\nobreak\ \text{or}\leavevmode\nobreak\ \mathcal{M}_{j}^{-}\}\setminus U_{i}^{k-1},

where each jIIj\cup_{j\in I}I_{j} is treated as a multiset; see Figure 5. In words, IUi1I\in U_{i}^{1}, i.e., I=iI=\mathcal{M}_{i}^{-}, is the set of all on-ramps that precede on-ramp ii; if IUi2Ui1I\in U_{i}^{2}\cup U_{i}^{1}, then for any on-ramp jj that precedes on-ramp ii, either jIj\in I or all on-ramps that precede jj belong to II; and so on. Note that since the network is acyclic, the number of sets UikU_{i}^{k} is finite for every ii. Let U=i,k0UikU=\cup_{i\in\mathcal{M},k\geq 0}U_{i}^{k}. Consider the candidate Lyapunov function V:𝒮[0,)V:\mathcal{S}\rightarrow[0,\infty) defined as:

V(n)V(Z(n)):=N2(n),V(n)\equiv V\left(Z_{\triangle}(n)\right):=N^{2}(n\triangle), (1)

where 𝒮\mathcal{S} is the range of values of YY in Appendix A, and N(m)=maxIUjINj(m)N(m)=\max\limits_{I\in U}\sum_{j\in I}N_{j}(m).

Note that (1) implies V(n+1)V(n)=N2((n+1))N2(n)V(n+1)-V(n)=N^{2}((n+1)\triangle)-N^{2}(n\triangle). We show at the end of the proof that if V(n)V\left(n\right) is large enough, specifically if V(n)>L:=((C1+1)2+C2)2V\left(n\right)>L:=\left((C_{1}+1)\triangle^{2}+C_{2}\right)^{2}, where C1:=(||+ρ+δ)|||𝒫|C_{1}:=(|\mathcal{M}|+\rho+\delta)|\mathcal{M}||\mathcal{P}| and C2:=(2T+maxj(rj+aj))|||𝒫|C_{2}:=(2T+\max_{j\in\mathcal{M}}(r_{j}+a_{j}))|\mathcal{M}||\mathcal{P}|, then

N((n+1))N(n)+A~()+C2,\displaystyle N((n+1)\triangle)\leq N(n\triangle)+\tilde{A}(\triangle)+C_{2}, (2)

where A~()\tilde{A}(\triangle) satisfies the following: it is independent of N(n)N(n\triangle), and there exists 1,ε,C3>0\triangle_{1},\varepsilon,C_{3}>0 such that for all >1\triangle>\triangle_{1} we have

𝔼[A~()]<ε,𝔼[A~2()]<C32.\mathbb{E}\left[\tilde{A}(\triangle)\right]<-\varepsilon\triangle,\quad\mathbb{E}\left[\tilde{A}^{2}(\triangle)\right]<C_{3}\triangle^{2}. (3)

The inequality (2) implies that

N2((n+1))N2(n)\displaystyle N^{2}((n+1)\triangle)\leq N^{2}(n\triangle) +2N(n)(A~()+C2)\displaystyle+2N(n\triangle)\left(\tilde{A}(\triangle)+C_{2}\right) (4)
+A~2()+2C2A~()+C22.\displaystyle+\tilde{A}^{2}(\triangle)+2C_{2}\tilde{A}(\triangle)+C_{2}^{2}.

By choosing 1\triangle\geq\triangle_{1} and taking conditional expectation from both sides of (4) we obtain

𝔼[\displaystyle\mathbb{E}\left[\right. V(n+1)V(n)|V(n)>L]N(n)\displaystyle\left.V\left(n+1\right)-V\left(n\right)\>|\>V(n)>L\right]\leq-N(n\triangle)
+2N(n)(ε+C2+12)+C322εC2+C22.\displaystyle+2N(n\triangle)(-\varepsilon\triangle+C_{2}+\frac{1}{2})+C_{3}\triangle^{2}-2\varepsilon C_{2}\triangle+C_{2}^{2}.

Since for any ii\in\mathcal{M}, |Qi(n)|Ni(n)N(n)|Q_{i}(n\triangle)|\leq N_{i}(n\triangle)\leq N(n\triangle), we have Q(n):=maxi|Qi(n)|N(n)\|Q(n\triangle)\|_{\infty}:=\max_{i\in\mathcal{M}}|Q_{i}(n\triangle)|\leq N(n\triangle). Hence, choosing >max{1,2:=C2+1/2ε}\triangle>\max\{\triangle_{1},\triangle_{2}:=\frac{C_{2}+1/2}{\varepsilon}\} gives

𝔼[\displaystyle\mathbb{E}\left[\right. V(n+1)V(n)|V(n)>L]Q(n)\displaystyle\left.V\left(n+1\right)-V\left(n\right)\>|\>V(n)>L\right]\leq-\|Q(n\triangle)\|_{\infty}
2ε(C1+1)3+((C1+1)(2C2+1)+C3)2\displaystyle-2\varepsilon(C_{1}+1)\triangle^{3}+\left((C_{1}+1)(2C_{2}+1)+C_{3}\right)\triangle^{2}
4εC2+3C22+1.\displaystyle-4\varepsilon C_{2}\triangle+3C_{2}^{2}+1.

If 3\triangle_{3} is chosen such that for all >max{1,2,3}\triangle>\max\{\triangle_{1},\triangle_{2},\triangle_{3}\},

2ε(C1+1)3\displaystyle-2\varepsilon(C_{1}+1)\triangle^{3} +((C1+1)(2C2+1)+C3)2\displaystyle+\left((C_{1}+1)(2C_{2}+1)+C_{3}\right)\triangle^{2} (5)
4εC2+3C22+1<0,\displaystyle-4\varepsilon C_{2}\triangle+3C_{2}^{2}+1<0,

then 𝔼[V(n+1)V(n)|V(n)>L]Q(n)\mathbb{E}\left[V\left(n+1\right)-V\left(n\right)|\>V\left(n\right)>L\right]\leq-\|Q(n\triangle)\|_{\infty}. Such a 3\triangle_{3} always exists because the 2ε(C1+1)3-2\varepsilon(C_{1}+1)\triangle^{3} term in (5) dominates for sufficiently large \triangle. We let ~>max{1,2,3}\tilde{\triangle}>\max\{\triangle_{1},\triangle_{2},\triangle_{3}\} be the minimum natural number that is divisible by TT.

Finally, if V(n)LV(n)\leq L, then Q(n)L\|Q(n\triangle)\|_{\infty}\leq\sqrt{L}. Also, V(n+1)V(n)+(L+||~)2V(n+1)\leq V(n)+(\sqrt{L}+|\mathcal{M}|\tilde{\triangle})^{2} because the total number of arrivals at each time step does not exceed |||\mathcal{M}|. Combining this with the previously considered case of V(n)>LV\left(n\right)>L, we get

𝔼[\displaystyle\mathbb{E}\left[\right. V(n+1)V(n)|Z~(n)]Q(n)\displaystyle\left.V\left(n+1\right)-V\left(n\right)|\>Z_{\tilde{\triangle}}(n)\right]\leq-\|Q(n\triangle)\|_{\infty}
+((L+||~)2+L+1)𝟙B,\displaystyle+\left((\sqrt{L}+|\mathcal{M}|\tilde{\triangle})^{2}+\sqrt{L}+1\right)\mathds{1}_{B},

where B={Z~(n)𝒮:V(n)L}B=\{Z_{\tilde{\triangle}}(n)\in\mathcal{S}:V(n)\leq L\} (a finite set). The result then follows from the well-known Foster-Lyapunov drift criterion [16, Theorem 14.0.1].

Proof of (2)

Consider on-ramp ii\in\mathcal{M}. If for every interval [bim,bi(m+1))[b_{i}m,b_{i}(m+1)) in [n,(n+1)1][n\triangle,(n+1)\triangle-1], aia_{i} vehicles cross on-ramp ii, then

Ni\displaystyle N_{i} ((n+1))Ni(n)+Ai(n+1,(n+1))biai\displaystyle((n+1)\triangle)\leq N_{i}(n\triangle)+A_{i}(n\triangle+1,(n+1)\triangle)-\lfloor\frac{\triangle}{b_{i}}\rfloor a_{i}
N(n)+Ai(n+1,(n+1))aibi+ai,\displaystyle\leq N(n\triangle)+A_{i}(n\triangle+1,(n+1)\triangle)-\frac{a_{i}}{b_{i}}\triangle+a_{i},

where Ai(m1,m2)A_{i}(m_{1},m_{2}) is the total number of arrivals in the network during the time interval [m1,m2][m_{1},m_{2}] that want to cross on-ramp ii’s node in order to reach their destination. If not, then there exist some time step in [n,(n+1)1][n\triangle,(n+1)\triangle-1] during which a valid slot that is empty crosses on-ramp ii’s node. Let mi[n,(n+1)1]m_{i}\in[n\triangle,(n+1)\triangle-1] be the last time step at which this happens, i.e., Ni(mi+1)=Ni(mi)+Ai(mi+1,mi+2)N_{i}(m_{i}+1)=N_{i}(m_{i})+A_{i}(m_{i}+1,m_{i}+2). Hence,

Ni((n+1))\displaystyle N_{i}((n+1)\triangle) Ni(mi+1)+Ai(mi+2,(n+1))+ai\displaystyle\leq N_{i}(m_{i}+1)+A_{i}(m_{i}+2,(n+1)\triangle)+a_{i} (6)
aibi((n+1)mi1).\displaystyle-\frac{a_{i}}{b_{i}}((n+1)\triangle-m_{i}-1).

By the assumption of the theorem, i.e., ρi<aibi\rho_{i}<\frac{a_{i}}{b_{i}} for all ii\in\mathcal{M}, there exists δ>0\delta>0 such that for all ii\in\mathcal{M},

ρi+δ<aibi.\rho_{i}+\delta<\frac{a_{i}}{b_{i}}.

Thus, (6) can be rewritten as

Ni((n+1))\displaystyle N_{i}((n+1)\triangle) Ni(mi+1)+Ai(mi+2,(n+1))\displaystyle\leq N_{i}(m_{i}+1)+A_{i}(m_{i}+2,(n+1)\triangle) (7)
(ρi+δ)((n+1)mi1)+ai.\displaystyle-(\rho_{i}+\delta)((n+1)\triangle-m_{i}-1)+a_{i}.

Furthermore, the queue size at on-ramp ii at time mi+1m_{i}+1 cannot exceed TT, i.e., |Qi(mi+1)|T|Q_{i}(m_{i}+1)|\leq T. This is because, a valid slot that is empty crosses on-ramp ii at time step mim_{i}. Therefore, the quotas of on-ramp ii at time mim_{i} must be zero. This implies that Qi(mi+1)Q_{i}(m_{i}+1) only consists of vehicles arriving after the start of the most recent cycle before mi+1m_{i}+1. Since the number of arrivals to on-ramp ii is no more than one per time step, it follows that |Qi(mi+1)|T|Q_{i}(m_{i}+1)|\leq T. Hence, Ni(mi+1)jiNj(mi+1)+|Qi(mi+1)|+rijiNj(mi+1)+T+riN_{i}(m_{i}+1)\leq\sum_{j\in\mathcal{M}_{i}^{-}}N_{j}(m_{i}+1)+|Q_{i}(m_{i}+1)|+r_{i}\leq\sum_{j\in\mathcal{M}_{i}^{-}}N_{j}(m_{i}+1)+T+r_{i}, where rir_{i} is the total number of slots between on-ramp ii and any on-ramp jij\in\mathcal{M}_{i}^{-}. Combining this with (7) gives

Ni((n+1)\displaystyle N_{i}((n+1) )jiNj(mi+1)+Ai(mi+2,(n+1))\displaystyle\triangle)\leq\sum_{j\in\mathcal{M}_{i}^{-}}N_{j}(m_{i}+1)+A_{i}(m_{i}+2,(n+1)\triangle) (8)
(ρi+δ)((n+1)mi1)+T+ri+ai.\displaystyle-(\rho_{i}+\delta)((n+1)\triangle-m_{i}-1)+T+r_{i}+a_{i}.

Repeating the above steps for any on-ramp jij\in\mathcal{M}_{i}^{-} gives

Nj(mi+1)\displaystyle N_{j}(m_{i}+1) kjNk(mj+1)+Aj(mj+2,mi+1)\displaystyle\leq\sum_{k\in\mathcal{M}_{j}^{-}}N_{k}(m_{j}+1)+A_{j}(m_{j}+2,m_{i}+1)
(ρj+δ)(mimj)+T+rj+aj,\displaystyle-(\rho_{j}+\delta)(m_{i}-m_{j})+T+r_{j}+a_{j},

where mj[n,mi+1]m_{j}\in[n\triangle,m_{i}+1] is defined similar to mim_{i} if at least one valid slot that is empty crosses on-ramp jj, and mj=n1m_{j}=n\triangle-1 otherwise. This process can be repeated until we find a multiset JiUJ_{i}\in U of on-ramps, such that for each jJij\in J_{i} with multiplicity mult(j)\text{mult}(j), either mjc=n1m_{j}^{c}=n\triangle-1, or mjc>n1m_{j}^{c}>n\triangle-1 and j=\mathcal{M}_{j}^{-}=\emptyset, c[mult(j)]c\in[\text{mult}(j)]. Indeed, such a JiJ_{i} always exist since the network is acyclic. Let J~i\tilde{J}_{i} be a multiset containing all the on-ramps visited in this process (for example, if Ji=iJ_{i}=\mathcal{M}_{i}^{-}, then J~i={i}i\tilde{J}_{i}=\{i\}\cup\mathcal{M}_{i}^{-}). For simplicity of notation, we drop the superscript cc in mjcm_{j}^{c} with the understanding that mjm_{j} could take different values for on-ramps with multiplicity greater than one. We also use j+j^{+} to denote an on-ramp that succeeds on-ramp jj. By combining all the inequalities in this process we obtain

Ni((n+1))\displaystyle N_{i}((n+1)\triangle) jJiNj(mj+1)\displaystyle\leq\sum_{j\in J_{i}}N_{j}(m_{j}+1)
+jJ~i(Aj(mj+2,mj++1)\displaystyle+\sum_{j\in\tilde{J}_{i}}\left(A_{j}(m_{j}+2,m_{j^{+}}+1)\right.
(ρj+δ)(mj+mj)+T+rj+aj).\displaystyle\left.-(\rho_{j}+\delta)(m_{j^{+}}-m_{j})+T+r_{j}+a_{j}\right).

Note that if mj>n1m_{j}>n\triangle-1 for some jJij\in J_{i}, then Nj(mj+1)TNj(n)+TN_{j}(m_{j}+1)\leq T\leq N_{j}(n\triangle)+T, which implies jJiNj(mj+1)jJiNj(n)+|||𝒫|T\sum_{j\in J_{i}}N_{j}(m_{j}+1)\leq\sum_{j\in J_{i}}N_{j}(n\triangle)+|\mathcal{M}||\mathcal{P}|T. Therefore, Ni((n+1))N(n)+A~i()+C2N_{i}((n+1)\triangle)\leq N(n\triangle)+\tilde{A}_{i}(\triangle)+C_{2}, where

A~i()=\displaystyle\tilde{A}_{i}(\triangle)= (9)
jJ~i(Aj(mj+2,mj++1)(ρj+δ)(mj+mj)),\displaystyle\sum_{j\in\tilde{J}_{i}}\left(A_{j}(m_{j}+2,m_{j^{+}}+1)-(\rho_{j}+\delta)(m_{j^{+}}-m_{j})\right),

where we have dropped the dependence of A~i()\tilde{A}_{i}(\triangle) on other parameters for brevity. Generally, if we start with any IUI\in U, we can repeat the above steps to obtain kINk((n+1))N(n)+A~I()+C2\sum_{k\in I}N_{k}((n+1)\triangle)\leq N(n\triangle)+\tilde{A}_{I}(\triangle)+C_{2}, where A~I()\tilde{A}_{I}(\triangle) has the same form as (9), with J~I\tilde{J}_{I} defined similarly. Hence, (2) follows with

A~()=maxIUA~I().\tilde{A}(\triangle)=\max_{I\in U}\tilde{A}_{I}(\triangle). (10)

Note that for each IUI\in U, we may assume that there exists some jJIj\in J_{I} for which mj=n1m_{j}=n\triangle-1. For if mj>n1m_{j}>n\triangle-1 for all jJIj\in J_{I}, then Nj(mj+1)TN_{j}(m_{j}+1)\leq T for all jJIj\in J_{I}, which implies

kINk((n+1))C1+C2<N(n)N((n+1)),\sum_{k\in I}N_{k}((n+1)\triangle)\leq C_{1}\triangle+C_{2}<N(n\triangle)-\triangle\leq N((n+1)\triangle), (11)

where in the first inequality we have used the following fact: the total number of arrivals to the network is bounded by |||\mathcal{M}| at each time step. Hence, for each jJ~Ij\in\tilde{J}_{I}, Aj(mj+2,mj++1)(mj+mj)||||A_{j}(m_{j}+2,m_{j^{+}}+1)\leq(m_{j^{+}}-m_{j})|\mathcal{M}|\leq|\mathcal{M}|\triangle. Also, in the third inequality we have used the following fact: N()N(\cdot) decreases by at most one during each time step. Therefore, N((n+1))N(n)N((n+1)\triangle)\geq N(n\triangle)-\triangle. The inequality (11) implies that we can exclude II from the maximum in (10). Thus, we may assume such jJ~Ij\in\tilde{J}_{I} with mj=n1m_{j}=n\triangle-1 exists.

Proof of (3)

Given IUI\in U with the corresponding multiset JIJ_{I}, let mj=n1m_{j}=n\triangle-1 for some jJIj\in J_{I}. Hence, there exists a sequence of on-ramps {i1,,iK}J~I\{i_{1},\ldots,i_{K}\}\subset\tilde{J}_{I} with i1=ji_{1}=j and iKIi_{K}\in I such that for k<Kk<K, ik+=ik+1i_{k}^{+}=i_{k+1}. Therefore, k[K](mik+mik)=\sum_{k\in[K]}(m_{i_{k}^{+}}-m_{i_{k}})=\triangle. Consider the sequence {Aik(m,m+1)}m=n+1\{A_{i_{k}}(m,m+1)\}_{m=n\triangle+1}^{\infty}, where the indices ik{i1,,iK}i_{k}\in\{i_{1},\ldots,i_{K}\} are allowed to depend on time mm. For a given m[n+1,)m\in[n\triangle+1,\infty), the term Aik(m,m+1)A_{i_{k}}(m,m+1) is independent of the other terms in the sequence, ρik=𝔼[Aik(m,m+1)]\rho_{i_{k}}=\mathbb{E}\left[A_{i_{k}}(m,m+1)\right] is bounded, and σik2:=𝔼[Aik2(m,m+1)ρik2]\sigma^{2}_{i_{k}}:=\mathbb{E}\left[A_{i_{k}}^{2}(m,m+1)-\rho_{i_{k}}^{2}\right] is (uniformly) bounded for all ik{i1,,iK}i_{k}\in\{i_{1},\ldots,i_{K}\}. As a result, limm=n+1(n+1)(mn)2σik2\lim_{\triangle\rightarrow\infty}\sum_{m=n\triangle+1}^{(n+1)\triangle}(m-n\triangle)^{-2}\sigma^{2}_{i_{k}} is also bounded. From Kolmogorov’s strong law of large numbers [17, Theorem 10.12], we have, with probability 11,

lim1(m=n+1(n+1)Aik(m,m+1)m=n+1(n+1)ρik)=0.\lim_{\triangle\rightarrow\infty}\frac{1}{\triangle}\left(\sum_{m=n\triangle+1}^{(n+1)\triangle}A_{i_{k}}(m,m+1)-\sum_{m=n\triangle+1}^{(n+1)\triangle}\rho_{i_{k}}\right)=0.

Hence, with probability one, there exists 1,ε>0\triangle_{1},\varepsilon>0 such that ε<δ\varepsilon<\delta and for >1\triangle>\triangle_{1} we have

m=n+1(n+1)Aik(m,m+1)m=n+1(n+1)ρik<ε.\sum_{m=n\triangle+1}^{(n+1)\triangle}A_{i_{k}}(m,m+1)-\sum_{m=n\triangle+1}^{(n+1)\triangle}\rho_{i_{k}}<\varepsilon\triangle.

This implies that, with probability one, for >1\triangle>\triangle_{1} we have

k[K](Aik(mik+2,mik++1)(ρk+δ)(mik+\displaystyle\sum_{k\in[K]}\left(A_{i_{k}}(m_{i_{k}}+2,m_{i_{k}^{+}}+1)-(\rho_{k}+\delta)(m_{i_{k}^{+}}\right. mik))\displaystyle\left.-m_{i_{k}})\right)
<(εδ).\displaystyle<(\varepsilon-\delta)\triangle.

Furthermore, for any other jJ~I{i1,,iK}j\in\tilde{J}_{I}\setminus\{i_{1},\ldots,i_{K}\}, with probability one, there exists Mj>0M_{j}>0 such that mj+mj>Mjm_{j^{+}}-m_{j}>M_{j} implies Aj(mj+2,mj++1)(ρj+δ)(mj+mj)<(εδ)(mj+mj)A_{j}(m_{j}+2,m_{j^{+}}+1)-(\rho_{j}+\delta)(m_{j^{+}}-m_{j})<(\varepsilon-\delta)(m_{j^{+}}-m_{j}), and if mj+mjMjm_{j^{+}}-m_{j}\leq M_{j}, then Aj(mj+2,mj++1)(ρj+δ)(mj+mj)(||ρjδ)MjA_{j}(m_{j}+2,m_{j^{+}}+1)-(\rho_{j}+\delta)(m_{j^{+}}-m_{j})\leq(|\mathcal{M}|-\rho_{j}-\delta)M_{j}, since the total number of arrivals to the network is bounded by |||\mathcal{M}| at each time step. Therefore, with probability one, for >max{1,2:=1δεj(||ρjδ)Mj}\triangle>\max\{\triangle_{1},\triangle_{2}:=\frac{1}{\delta-\varepsilon}\sum_{j}(|\mathcal{M}|-\rho_{j}-\delta)M_{j}\} we have

1jJ~I\displaystyle\frac{1}{\triangle}\sum_{j\in\tilde{J}_{I}} (Aj(mj+2,mj++1)(ρj+δ)(mj+mj))\displaystyle\left(A_{j}(m_{j}+2,m_{j^{+}}+1)-(\rho_{j}+\delta)(m_{j^{+}}-m_{j})\right) (12)
<(εδ)+1j(||ρjδ)Mj<0\displaystyle<(\varepsilon-\delta)+\frac{1}{\triangle}\sum_{j}(|\mathcal{M}|-\rho_{j}-\delta)M_{j}<0

which in turn implies that, with probability one, lim supA~I()/<0\limsup_{\triangle\rightarrow\infty}\tilde{A}_{I}(\triangle)/\triangle<0. It follows that, with probability one, lim supA~()/<0\limsup_{\triangle\rightarrow\infty}\tilde{A}(\triangle)/\triangle<0.

Finally,

|A~I()|1jJ~I(||+ρj+δ)(mj+mj)C2,\left|\frac{\tilde{A}_{I}(\triangle)}{\triangle}\right|\leq\frac{1}{\triangle}\sum_{j\in\tilde{J}_{I}}(|\mathcal{M}|+\rho_{j}+\delta)(m_{j^{+}}-m_{j})\leq C_{2}, (13)

where we have used Aj(mj+2,mj++1)(mj+mj)||A_{j}(m_{j}+2,m_{j^{+}}+1)\leq(m_{j^{+}}-m_{j})|\mathcal{M}| in the first inequality using the fact that the total number of arrivals to the network is bounded by |||\mathcal{M}| at each time step. Therefore, the sequences {A~()/}=1\left\{\tilde{A}(\triangle)/\triangle\right\}_{\triangle=1}^{\infty} and {(A~()/)2}=1\left\{\left(\tilde{A}(\triangle)/\triangle\right)^{2}\right\}_{\triangle=1}^{\infty} are upper bounded by integrable functions. Fatou’s lemma [17] then implies

lim sup𝔼[A~()]\displaystyle\limsup_{\triangle\rightarrow\infty}\mathbb{E}\left[\frac{\tilde{A}(\triangle)}{\triangle}\right] 𝔼[lim supA~()]<0,\displaystyle\leq\mathbb{E}\left[\limsup_{\triangle\rightarrow\infty}\frac{\tilde{A}(\triangle)}{\triangle}\right]<0,
lim sup𝔼[(A~())2]\displaystyle\limsup_{\triangle\rightarrow\infty}\mathbb{E}\left[\left(\frac{\tilde{A}(\triangle)}{\triangle}\right)^{2}\right] 𝔼[lim sup(A~())2]<C3,\displaystyle\leq\mathbb{E}\left[\limsup_{\triangle\rightarrow\infty}\left(\frac{\tilde{A}(\triangle)}{\triangle}\right)^{2}\right]<C_{3},

where C3>C22C_{3}>C_{2}^{2} is some constant. This in turn gives (3).