Competitive Analysis of Online Path Selection: Impacts of Path Length, Topology, and System-Level Costs

Ying Cao, Siyuan Yu, Xiaoqi Tan, and Danny H.K. Tsang

Y. Cao is with the Department of Electronic and Computer Engineering, HKUST, HKSAR. E-mail: [email protected]
Siyuan Yu and Xiaoqi Tan are with the Department of Computing Science, University of Alberta, Edmonton, AB, Canada. E-mail: syu3, [email protected]
H.K. Tsang is with the Thrust of Internet of Things, HKUST(GZ), GZ, China. E-mail: [email protected]

Abstract

Consider a communication network to which a sequence of self-interested users come and send requests for data transmission between nodes. This work studies the question of how to guide the path selection choices made by those online-arriving users and maximize the social welfare. Competitive analysis is the main technical tool. Specifically, the impacts of path length bounds and topology on the competitive ratio of the designed algorithm are analyzed theoretically and explored experimentally. We observe intricate and interesting relationships between the empirical performance and the studied network parameters, which shed some light on how to design the network. We also investigate the influence of system-level costs on the optimal algorithm design.

1 Introduction

From communication networks to transportation networks, strategically allocating network resources to competing users in a decentralized way is a recurring theme in network research. Network operators design resource allocation strategies to achieve specific goals, including revenue maximization, social welfare maximization, and cost minimization. In transportation networks, designing efficient local routing strategies for vehicles to best accommodate users’ dynamic demands is one of the key challenges. In parallel, in communication networks, routing, congestion control, and scheduling are practical real-time resource allocation mechanisms for improving network performance. This work focuses on routing data traffic in communication networks. Following the convention of economics, data source nodes are considered as self-interested agents who demonstrate sufficient autonomy and pure rationality when making decisions. Each agent decides whether to send data through the network and selects routing paths at will, as is the case with source routing in the future Internet [1]. Specifically, we are concerned with how to navigate data traffic from different agents through the network such that the social welfare is maximized.

One concern among others that network operators typically have is the allocation efficiency of the limited network capacity. Given the welfare of each agent for routing her data through a certain path, integer linear programming can model and deal with this concern. However, the set of agents in the network is usually dynamic. Which agents will join is generally unknown to the network operator a priori, and multiple methodologies are devoted to dealing with this uncertainty, each making a different set of assumptions. Assuming that the data (e.g., the welfare and resource consumption level of each agent) follow a known or unknown probability distribution, one can employ stochastic optimization methods to provide a performance guarantee when an underlying distribution exists, but little can be guaranteed when outliers do not belong to the modelled distribution. Thus, to avoid the dependency on any stochastic assumption, we follow the worst-case analysis framework, which is a robust modeling and analysis framework that makes the fewest assumptions about the environment.

The worst-case analysis framework compares the performance of a decision-maker who is uninformed about the future with the performance of a hypothetical oracle who has perfect foresight and can make optimal decisions. It provides a comparative performance guarantee that holds even in the worst-possible scenario. In other words, this framework assumes an adversary who can manipulate the sequential data and the environment. Two performance measures dominate this field of analysis – the competitive ratio and the adversarial regret. The primary difference between them is the type of performance guarantee they provide. The regret provides an additive guarantee, bounding the difference between the decision-maker’s performance and the optimal one. In contrast, the competitive ratio offers a multiplicative guarantee, meaning the decision-maker’s performance is bounded by a constant factor of the optimal performance. This work adopts the competitive ratio as the performance measure of interest.

Another concern of network operators is the performance evaluation of routing algorithms under various network configurations [2]. By understanding the impacts of network configurations on the algorithmic performance, one can better configure the network to improve that performance. For instance, the more nodes connecting the source and destination nodes, the longer it takes to transfer the data. Taking time as a limited and precious resource, network operators usually prefer shortest-path routing, which minimizes the data transfer time. Also, the average path length of a network topology, defined as the average path length between any two nodes, is usually linked to the ease of communication within the network [3]. For example, a larger average path length in a communication network indicates a slower and less efficient information transfer process. Thus, to investigate the impact of the path length and the topology on the social welfare, we assume that possible path lengths fall into a known interval, characterizing the uncertainty perceived by the network operator in users’ path lengths, and ask the question: how does the uncertainty in path lengths affect the social welfare in networks with different topologies?

In addition to network configuration parameters, the social welfare is also deeply influenced by the presence of costs. For example, the queuing delay experienced by users is usually viewed as a type of cost. Packets will be discarded by applications if their waiting times are unacceptably long, and the user experience is usually inversely related to the experienced delay. An innate property of the queuing delay is that it is experienced and influenced by all agents in the system, leading to coupling effects between agents. A well-designed admission control or packet scheduling strategy can exert a distributed system-level control over the queuing delay and improve the overall service experience. As more agents join the system, the social welfare increases from serving more demands, while the competition intensifies and incurs a higher cost, which in turn decreases the social welfare. Queuing theory has been partially devoted to understanding and characterizing the queuing delay within the system. We incorporate the mathematically well-defined queuing delay therein as a cost and investigate the influence of such system-level costs on the algorithm design.

1.1 Related Work

Online Competitive Resource Allocation. The worst-case analysis technique of this paper falls under the umbrella of the competitive analysis framework [4]. There exists a series of works closely related to the problem studied here, such as competitive online routing [5, 6] and variants of online knapsack [7, 8]. Surprisingly, none of the above works studies either the impact of the topology or the path length on the performance, which partially motivates this work. In comparison, online resource allocation with costs has been studied under different scenarios. For example, online combinatorial auction with convex costs was studied in [9], and welfare maximization with polynomial costs was studied in [10]. However, a cost function that goes to infinity when reaching the capacity limit, such as the queuing delay, has not been studied before.

Routing Games. Similar routing problems have also been studied through the lens of game theory, such as selfish routing games [11, 12] and network congestion games [13]. The performance measure of interest is the so-called price of anarchy, which compares the system performance when agents make decisions locally and selfishly with that produced by a centralized optimizer. It characterizes the inefficiency of an equilibrium caused by the absence of a centralized regulator. In general, the prior works mentioned here are concerned with the decentralized aspect of the problem, specifically the interplay between agents, while this work is focused on remaining competitive against the uncertainty of the future. It is worth mentioning that the combination of these two aspects is an interesting and challenging direction that deserves more attention.

Regret Analysis. The regret offers an alternative, additive guarantee to the decision maker against the uncertainty of the future. The field of online convex optimization is dedicated to analyzing the regret compared with static or dynamic benchmarks. Online convex optimization with constraints is close to our system setting. The authors of [14] studied an online convex optimization problem with time-varying constraints and compared the designed online algorithm with a dynamic benchmark. Regret and constraint violations were analyzed for the proposed algorithm. The constraints in our setting are time-varying but coupled across time slots, and we do not allow constraint violations. Moreover, it has been shown that best-of-both-worlds guarantees, i.e., a sublinear regret and a bounded competitive ratio at the same time, can be achieved under specific settings, such as in metrical task systems [15]. To the best of our knowledge, it remains unknown whether best-of-both-worlds guarantees exist in other online problems.

Online Algorithm with Constrained Adversary. Another recent trend in the field of online algorithms is to consider a constrained adversary. Instead of granting the adversary arbitrary power to manipulate future arrivals, additional control can be exerted on the possible arrival instances. For example, future prices were restricted to a known interval in the online selection problems [16]. Under different constrained adversaries, the design strategy of competitive online algorithms could be completely different. For example, in the online selection problem, an adversary that constrains the price range and one that constrains the horizon admit completely different competitive algorithms. This work can be viewed as designing a competitive algorithm against an adversary who is constrained in the path length and the topology.

1.2 Contributions and Paper Organization

Our main contributions in this work are as follows. First, we consider the online path selection problem with constraints on path lengths and network topology. A set of topology-dependent competitive ratios are derived for the considered line and tree topologies. Second, we study the impacts of the minimum path length $m$ for the first time and recover existing results about the logarithmic dependence of the competitive ratio on the maximum path length $M$ . We show that the minimum path length $m$ plays a role of varying importance in different networks. Specifically, the competitive ratio decreases more slowly with $m$ in tree networks than in line networks, which also highlights the influence of the topological structure. Finally, we conduct extensive experiments to examine our theoretical findings and explore the performance under different levels of price aggressiveness, which is a parameter in the designed algorithm. Our results show that the relationship between the empirical ratio and the path length bound varies across topologies and levels of price aggressiveness, for which we provide an in-depth analysis and offer insights into the network design.

The rest of this paper is organized as follows. Section 2 introduces the system model and derives competitive ratios that are dependent on the network topology and the path length. Specifically, two fundamental networks, line networks and hierarchical tree networks, are studied. We also provide an analysis on the impact of system-level costs on the competitive algorithm design in this section. In Section 3, we conduct extensive experiments to examine the empirical performance of the online algorithm. Specifically, we first examine the gap between worst-case theoretical guarantees and empirical performance against stochastic arrivals, and then we verify the theoretical results by running the algorithm on certain hard instances. For the path selection with cost problem, we show the logarithmic trend of the competitive ratio with respect to the maximum value density by numerical methods. Finally, we conclude the paper in Section 4.

2 System Description and Results

The system of concern is described as follows. We consider a network $\mathcal{N}$ whose topology is fixed. Nodes in the network can be viewed as routers or servers. Edges connecting nodes are endowed with fixed capacities, which model the capacities of fixed physical wires.

(Arrival Instance) Agents who need to be routed from one node to another node arrive at the network in an online fashion. There are $N$ agents in total, but the decision-maker does not know the value of $N$ . An agent submits a reservation request on her arrival to use a certain bandwidth for transmitting data along some routing path. The strategic setting is considered here, where each agent holds a private value of her demand if fulfilled and could report false values to the network for her benefit. In summary, each agent can be characterized by the following parameters:

• Value: $v_{i}$
• Rate requirement: $r_{i}$
• Source and destination nodes: $s_{i}$ and $t_{i}$
• A set of possible paths connecting $s_{i}$ and $t_{i}$ : $\mathbb{P}_{i}$

Without loss of clarity, we use the terms agents and requests interchangeably throughout the paper.

Assumption 1 (Small Request Size).

The rate requirement of any agent is upper-bounded by $\epsilon\in\mathbb{R}^{+}$ .

Assumption 1 holds naturally in many applications. For example, in cloud data centers, individual workloads are negligible compared to the computing capacity of servers. In communication networks of considerable size, it is also expected that individual requests are negligible in comparison to the transmission capacity of any edge in the network. The assumption is also important for the theoretical analysis, for example, in the proof of Theorem 1.

Assumption 2 (Bounded Path Length).

The path that a request can select is bounded in its length, i.e., $m\leq|P_{i}|\leq M$ .

Assumption 2 further constrains the set of possible arrivals by limiting the number of edges on any routing path.

In addition, no reasonable agent should hold an infinite value for a finite rate. The following modeling assumption formalizes this observation.

Assumption 3 (Bounded Value Density).

For any agent $i$ , the ratio between her value and her total resource consumption is bounded, i.e., $\frac{v_{i}}{|P_{i}|r_{i}}\in[1,\bar{p}]$ .

Assumption 3 sets uniform bounds on the per-unit value of any request, which makes the value of user $i$ proportional to the number of edges on path $P_{i}$ , i.e., $v_{i}\in[r_{i}d_{i},r_{i}d_{i}\bar{p}]$ , where $d_{i}$ is the path length of $P_{i}$ . This is a well-accepted assumption for various online decision problems such as the online time series search [17, 18] and the online knapsack problem [7, 8]. Online path selection can be viewed as a special case of the online knapsack problem [19].
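As a minimal sketch of how Assumptions 1-3 constrain a single arrival, the following Python snippet checks a request against the three bounds. The names `Request` and `satisfies_assumptions` are hypothetical and introduced only for illustration; they do not appear in the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Request:
    """Hypothetical container for one arrival (illustration only)."""
    value: float     # v_i
    rate: float      # r_i
    path_len: int    # |P_i|, number of edges on the selected path


def satisfies_assumptions(req, eps, m, M, p_bar):
    """Return True if the request meets Assumptions 1-3."""
    small_request = 0.0 < req.rate <= eps                  # Assumption 1
    bounded_length = m <= req.path_len <= M                # Assumption 2
    density = req.value / (req.path_len * req.rate)        # v_i / (|P_i| r_i)
    bounded_density = 1.0 <= density <= p_bar              # Assumption 3
    return small_request and bounded_length and bounded_density


ok = satisfies_assumptions(Request(2.0, 0.5, 2), eps=1.0, m=1, M=4, p_bar=3.0)
too_valuable = satisfies_assumptions(Request(10.0, 0.5, 2), eps=1.0, m=1, M=4, p_bar=3.0)
```

In this example the first request has value density 2 and passes all three checks, while the second has density 10 and violates the upper bound $\bar{p}=3$.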

Following the standard competitive analysis framework, the performance of an online algorithm is characterized by the competitive ratio, defined as

	$\displaystyle\max_{I\in\Omega}\frac{\textsf{OPT}(I)}{\textsf{ALG}(I)},$

where $I$ denotes an instance and $\Omega$ represents the family of instances that satisfy Assumptions 1-3.

2.1 Posted-Price Mechanism

We propose Algorithm 1 below, whose nature is a posted-price mechanism. A central operator maintains a price for each edge in correspondence to its utilization level and announces the edge prices publicly. When an agent arrives, she calculates the minimum price over all paths that can fulfill her demand, compares it with her private value, and decides to join or leave the network based on the comparison. Importantly, a posted-price mechanism is both incentive-compatible and privacy-preserving as the agent does not need to report her private value to a central operator [20].

Algorithm 1 Posted-Price Mechanism for Path Selection (PPM-PS_ϕ)

0: Initialize utilization $\omega_{e}^{(0)}\leftarrow 0$ for every edge $e$
1: while a new agent $i$ arrives do
2:   Find the path with the minimum price: $P_{i}=\arg\min_{P\in\mathbb{P}_{i}}r_{i}\lambda^{(i-1)}(P)$
3:   if $v_{i}>r_{i}\lambda^{(i-1)}(P_{i})$ then
4:     if $\omega_{e}^{(i-1)}+r_{i}\leq C_{e},\forall e\in P_{i}$ then
5:       Join the network in full on $P_{i}$
6:       for each edge $e\in P_{i}$ do
7:         Update edge utilization $\omega_{e}^{(i)}=\omega_{e}^{(i-1)}+r_{i}$
8:         Update edge price $\lambda^{(i)}_{e}=\phi_{e}\left(\omega_{e}^{(i)}\right)$
9:       end for
10:     end if
11:     Leave the network.
12:   end if
13: end while

The price of a path $P$ , i.e., $\lambda^{(i-1)}(P)$ , is calculated by summing over the price of all edges on this path, i.e., $\lambda^{(i-1)}(P)=\sum_{e\in P}\lambda^{(i-1)}_{e}$ , and the pricing function $\phi_{e}(\cdot)$ is parameterized by a parameter $\gamma$ : $\phi_{e}^{\gamma}(\omega)=e^{\gamma\omega/C_{e}}-1$ . The exponential pricing function is a classic choice in online algorithm design, where $\gamma$ is a parameter that guides the aggressiveness of the price. If $\gamma$ is larger, the price increases much faster with the utilization, indicating that more capacity is reserved for the future, and thus leading to a more conservative allocation.
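The mechanism and the exponential pricing function described above can be sketched in a few lines of Python. This is only an illustrative reading of Algorithm 1, not the authors' implementation: the function names `phi` and `ppm_ps`, the tuple-based request format, and the toy instance are assumptions made here, and a request that fails either the price test or the capacity test is simply treated as rejected.

```python
import math


def phi(gamma, utilization, capacity):
    """Exponential price of one edge: exp(gamma * w / C) - 1."""
    return math.exp(gamma * utilization / capacity) - 1.0


def ppm_ps(requests, capacities, gamma):
    """Sketch of the posted-price mechanism.

    requests: list of (value, rate, candidate_paths); a path is a tuple of edge ids.
    capacities: dict mapping edge id -> capacity C_e.
    Returns (welfare, decisions); decisions[i] is the chosen path or None.
    """
    util = {e: 0.0 for e in capacities}    # omega_e, initially 0
    price = {e: 0.0 for e in capacities}   # lambda_e = phi(0) = 0
    welfare, decisions = 0.0, []
    for value, rate, paths in requests:
        # Cheapest candidate path under the currently posted edge prices.
        best = min(paths, key=lambda p: rate * sum(price[e] for e in p))
        cost = rate * sum(price[e] for e in best)
        fits = all(util[e] + rate <= capacities[e] for e in best)
        if value > cost and fits:
            for e in best:
                util[e] += rate
                price[e] = phi(gamma, util[e], capacities[e])
            welfare += value
            decisions.append(best)
        else:
            decisions.append(None)
    return welfare, decisions


# Toy instance (assumed): two unit-capacity edges "a" and "b", gamma = 2.
toy_requests = [
    (1.0, 0.5, [("a",)]),
    (0.5, 0.5, [("a",), ("b",)]),
    (0.5, 0.5, [("a",)]),
]
welfare, decisions = ppm_ps(toy_requests, {"a": 1.0, "b": 1.0}, gamma=2.0)
```

In this toy run the first request is admitted on edge "a", which raises its price to $e-1$. The second request therefore picks the still-free edge "b", and the third request is priced out because $0.5(e-1)>0.5$, even though edge "a" still has spare capacity. This illustrates how a larger $\gamma$ reserves capacity for future, more valuable requests.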

2.2 Main Results I: Topology-Dependent Theoretical Guarantees of PPM-PS_ϕ

To understand what effect the network structure has on the performance of the mechanism PPM-PS_ϕ, a set of fundamental and analytically tractable network topologies, i.e., line networks and tree networks, are studied in this subsection.

2.2.1 Line Network

Line networks are the simplest networks. We consider a bi-directional line network with $N+1$ nodes and $N$ edges. The following theorem shows the competitive ratio of PPM-PS_ϕ for a line network.

Theorem 1.

For line networks, when $\epsilon\leq\frac{C_{\min}}{\gamma}$ , PPM-PS_ϕ is $\max\{O(\ln M\bar{p}),O(\beta\ln(\frac{M\bar{p}}{2m\beta}+1))\}$ -competitive, where $\beta$ is the ratio between the largest capacity and the smallest capacity in the network.

Proof.

For line networks, partition the line network $\mathcal{N}$ into $J=\lfloor\frac{N}{M}\rfloor$ disjoint lines $\mathcal{N}=\cup_{j}\mathcal{N}_{j}$ , each containing $M$ consecutive edges and $M+1$ nodes. Requests can travel through at most two $M$ -long lines. Those starting from $\mathcal{N}_{j}$ and ending in $\mathcal{N}_{j+1}$ are grouped in $I_{j}^{1}$ , and those starting from $\mathcal{N}_{j}$ and ending in $\mathcal{N}_{j-1}$ are grouped in $I_{j}^{2}$ . Let $I^{1}=\cup_{j}I_{j}^{1}$ , $I^{2}=\cup_{j}I_{j}^{2}$ and $I=I^{1}\cup I^{2}$ . Because the data transmission is usually unidirectional, requests in $I^{1}$ do not affect those in $I^{2}$ . Built on this decoupling, we have

	$\displaystyle\frac{\textsf{OPT}(I)}{\textsf{ALG}(I)}$	$\displaystyle=\frac{\textsf{OPT}(I^{1})+\textsf{OPT}(I^{2})}{\textsf{ALG}(I^{1})+\textsf{ALG}(I^{2})}$
		$\displaystyle\leq\frac{\textsf{OPT}(I^{1})}{\textsf{ALG}(I^{1})}+\frac{\textsf{OPT}(I^{2})}{\textsf{ALG}(I^{2})},$

where $\textsf{OPT}(I)$ and $\textsf{ALG}(I)$ denote the optimal revenue and the revenue of the online algorithm given the instance $I$ , respectively. To simplify the notation, we drop the superscripts $1$ and $2$ from now on and focus on requests in one direction, e.g., $I_{j}$ is the set of requests that start from line network $\mathcal{N}_{j}$ .
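The inequality in the decoupling step above is elementary. As a brief sketch, write $a=\textsf{OPT}(I^{1})$ , $b=\textsf{OPT}(I^{2})$ , $c=\textsf{ALG}(I^{1})$ , and $d=\textsf{ALG}(I^{2})$ (shorthand introduced here only for illustration), and assume both algorithm outputs are strictly positive:

```latex
\frac{a+b}{c+d}
  = \frac{a}{c+d} + \frac{b}{c+d}
  \le \frac{a}{c} + \frac{b}{d},
\qquad a, b \ge 0,\quad c, d > 0.
```

Each fraction can only grow when a positive term is removed from its denominator, so bounding the ratio in each direction separately costs at most an additive combination of the two one-directional ratios.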

Let $\tilde{I}_{j}=I_{j-1}\cup I_{j},j\geq 2$ . All requests in $\tilde{I}_{j}$ affect edges in $\mathcal{N}_{j}$ . In the sequel, we focus on the upper bound for the ratio $\frac{\textsf{OPT}(I_{j})}{\textsf{ALG}(\tilde{I}_{j})}$ over instances in any $M$ -long line network $\mathcal{N}_{j}$ because the competitive ratio is upper-bounded as:

	$\displaystyle\frac{\textsf{OPT}(I)}{\textsf{ALG}(I)}$	$\displaystyle=\frac{\sum_{j}\textsf{OPT}(I_{j})}{\sum_{j}\textsf{ALG}(I_{j})}$
		$\displaystyle=\frac{2\sum_{j}\textsf{OPT}(I_{j})}{\textsf{ALG}(I_{1})+\sum_{j}\textsf{ALG}(\tilde{I}_{j})+\textsf{ALG}(I_{J})}$
		$\displaystyle\leq\max_{j}\frac{2\textsf{OPT}(I_{j})}{\textsf{ALG}(\tilde{I}_{j})}.$

Let $\tilde{\mathcal{N}}_{j}=\mathcal{N}_{j-1}\cup\mathcal{N}_{j}\cup\mathcal{N}_{j+1}$ . Any edge in $\tilde{\mathcal{N}}_{j}$ can be possibly affected by requests in $\tilde{I}_{j}$ . When there is no saturated edge in $\tilde{\mathcal{N}}_{j}$ , i.e., no edge $e\in\tilde{\mathcal{N}}_{j}$ with $\omega_{e}^{(i)}>C_{e}-\epsilon$ , after the $i$ th request with $P_{i}\subset\tilde{\mathcal{N}}_{j}$ is routed, the value generated by the online algorithm increases by $\Delta\textsf{ALG}=v_{i}$ . The increase to the optimal solution $\Delta\textsf{OPT}$ is upper-bounded by $\Delta C$ based on the weak duality, and we have

	$\displaystyle\Delta C$	$\displaystyle=\sum_{e\in P_{i}}C_{e}\left(\lambda_{e}^{(i)}-\lambda_{e}^{(i-1)}\right)+v_{i}$
		$\displaystyle=\sum_{e\in P_{i}}C_{e}\left(\lambda_{e}^{(i-1)}+1\right)\left[\exp\left(\frac{r_{i}\gamma}{C_{e}}\right)-1\right]+v_{i}$
		$\displaystyle=\sum_{e\in P_{i}}r_{i}\left(\lambda_{e}^{(i-1)}+1\right)\frac{C_{e}}{r_{i}}\left[\exp\left(\frac{r_{i}\gamma}{C_{e}}\right)-1\right]$
		$\displaystyle+\Delta\textsf{ALG}.$

It follows that

	$\displaystyle\Delta C$	$\displaystyle\leq\frac{C_{\min}}{r_{i}}\left[\exp\left(\frac{r_{i}\gamma}{C_{\min}}\right)-1\right]\sum_{e\in P_{i}}r_{i}\left(\lambda_{e}^{(i-1)}+1\right)$
		$\displaystyle+\Delta\textsf{ALG}$
		$\displaystyle\leq\frac{C_{\min}}{r_{i}}\left[\exp\left(\frac{r_{i}\gamma}{C_{\min}}\right)-1\right](v_{i}+r_{i}d_{i})+\Delta\textsf{ALG}$
		$\displaystyle\leq\left\{2\frac{C_{\min}}{r_{i}}\left[\exp({\frac{r_{i}\gamma}{C_{\min}}})-1\right]+1\right\}\Delta\textsf{ALG}$

because $r_{i}\sum_{e\in P_{i}}\lambda_{e}^{(i-1)}\leq v_{i}$ , $\frac{v_{i}}{r_{i}d_{i}}\in[1,\bar{p}]$ , and $x\left[\exp(\frac{\gamma}{x})-1\right]$ is decreasing in $x\in[\gamma,\infty)$ . When $r_{i}\leq\epsilon\leq\frac{C_{\min}}{\gamma}$ , we have $\frac{\Delta C}{\Delta\textsf{ALG}}\leq 2\frac{C_{\min}}{r_{i}}\left(e^{\frac{r_{i}\gamma}{C_{\min}}}-1\right)+1\leq 2(e-1)\gamma+1$ . Summing over $i$ , we have $\textsf{OPT}(I_{j})\leq C(I_{j})$ , and $\frac{\textsf{OPT}(I_{j})}{\textsf{ALG}(\tilde{I}_{j})}\leq\frac{C(I_{j})}{\textsf{ALG}(\tilde{I}_{j})}\leq 2(e-1)\gamma+1$ . Therefore, accounting for the factor of 2 in the bound $\max_{j}\frac{2\textsf{OPT}(I_{j})}{\textsf{ALG}(\tilde{I}_{j})}$ , the competitive ratio for the case without edge saturation is at most $4(e-1)\gamma+2$ .
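The monotonicity argument used above can be checked numerically. The sketch below evaluates the per-request bound $2\frac{C_{\min}}{r_{i}}(e^{r_{i}\gamma/C_{\min}}-1)+1$ for shrinking request sizes and compares it with $2(e-1)\gamma+1$ . The function names and the sample values of $\gamma$ and $C_{\min}$ are assumptions chosen for illustration.

```python
import math


def increment_ratio_bound(rate, c_min, gamma):
    """2 * (C_min / r) * (exp(r * gamma / C_min) - 1) + 1 for one request."""
    x = c_min / rate                      # x >= gamma whenever r <= C_min / gamma
    return 2.0 * x * (math.exp(gamma / x) - 1.0) + 1.0


def worst_case_bound(gamma):
    """Value of the bound at the largest admissible request size."""
    return 2.0 * (math.e - 1.0) * gamma + 1.0


gamma, c_min = 3.0, 1.0
eps = c_min / gamma                       # largest rate allowed by Theorem 1
rates = [eps, eps / 2, eps / 10, eps / 100]
bounds = [increment_ratio_bound(r, c_min, gamma) for r in rates]
```

Under these sample values, the bound is largest at $r_{i}=\epsilon$ , where it coincides with $2(e-1)\gamma+1$ , and it decreases toward $2\gamma+1$ as requests become smaller.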

When there exist almost-saturated edges (case with edge saturation), i.e., $\exists\tilde{e}\in\tilde{\mathcal{N}}_{j},\omega_{\tilde{e}}^{(i)}>C_{\tilde{e}}-\epsilon$ , we need to show that the algorithm output is still lower-bounded by a portion of the optimal value when requests are rejected due to the capacity limit instead of an insufficient value. Corollary 1 characterizes the largest possible lower bound by an optimization problem. Then we use Corollary 2 to bound the competitive ratio. Both corollaries are useful for the analysis of the tree network as well.

Corollary 1.

If there exists a link $\tilde{e}$ whose utilization $\omega_{\tilde{e}}^{(N)}$ is close to its capacity $C_{\tilde{e}}$ , the minimum value of $\sum_{e}C_{e}\phi_{e}\left(\omega_{e}^{(N)}\right)$ is the optimal value of the following optimization problem:

	$\displaystyle\min_{x\geq 0}\quad$	$\displaystyle\sum_{e\in\mathcal{E}_{\tilde{e}}}\phi_{e}\left(\omega_{e}^{(N)}\right)+\phi_{\tilde{e}}(C_{\tilde{e}})$
	$\displaystyle s.t.$	$\displaystyle\sum_{j:e\in P_{j}}x_{j}=\omega_{e}^{(N)},\forall e\in\mathcal{E}_{\tilde{e}},$
		$\displaystyle\sum_{j\in I_{\tilde{e}}}x_{j}=C_{\tilde{e}},$

where $x_{j},j\in[m]$ is the amount of rate allocated to the $j$ th request that uses edge $\tilde{e}$ , $\mathcal{E}_{\tilde{e}}$ is the set of edges that can share a path of length $m$ with edge $\tilde{e}$ , and $I_{\tilde{e}}$ is the set of requests that utilize edge $\tilde{e}$ .

Corollary 2.

The competitive ratio is the maximum of the ratio for the case with edge saturation and the ratio for the case without edge saturation.

Assume that $M$ is a multiple of $m$ . A commonly employed lower bound on the algorithm output is the capacity-weighted sum of the final prices of all edges in the network, as stated in the following lemma.

Lemma 1.

$\textsf{ALG}(\tilde{I}_{j})\geq\frac{\sum_{e\in\tilde{\mathcal{N}}_{j}}C_{e}\lambda_{e}^{(N)}}{2\gamma(e-1)}$ .

Proof.

We have

	$\displaystyle\textsf{ALG}(\tilde{I}_{j})$	$\displaystyle=\sum_{i}v_{i}/2+\sum_{i}v_{i}/2$
		$\displaystyle\geq\sum_{i}\frac{r_{i}}{2}\sum_{e\in P_{i}}\lambda_{e}^{(i-1)}+\sum_{i}\frac{r_{i}d_{i}}{2}$
		$\displaystyle=\sum_{i}\frac{r_{i}}{2}\sum_{e\in P_{i}}\left(\lambda_{e}^{(i-1)}+1\right)$
		$\displaystyle=\sum_{i}\sum_{e\in P_{i}}\frac{r_{i}\left(\lambda_{e}^{(i)}-\lambda_{e}^{(i-1)}\right)}{2[\exp(\frac{r_{i}\gamma}{C_{e}})-1]}$
		$\displaystyle\geq\sum_{i}\sum_{e\in P_{i}}\frac{C_{e}(\lambda_{e}^{(i)}-\lambda_{e}^{(i-1)})}{2\gamma(e-1)}$
		$\displaystyle=\frac{\sum_{e}C_{e}\lambda_{e}^{(N)}}{2\gamma(e-1)},$

where the last inequality follows from the fact that $\frac{x}{e^{\gamma x}-1}$ is decreasing in $x\in[0,\frac{1}{\gamma}]$ . ∎
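Lemma 1 can be sanity-checked by simulation. The following sketch runs the posted-price rule on a line network with uniform capacities and randomly generated single-path requests that satisfy Assumptions 1-3, and then compares the collected value with $\frac{\sum_{e}C_{e}\lambda_{e}^{(N)}}{2\gamma(e-1)}$ . The function `simulate_line`, the random arrival model, and all parameter values are assumptions made here for illustration; they are not the experimental setup of the paper.

```python
import math
import random


def simulate_line(num_edges, capacity, gamma, m, M, p_bar, num_requests, seed=0):
    """Return (ALG, Lemma-1 lower bound) for one random instance on a line."""
    rng = random.Random(seed)
    eps = capacity / gamma                     # request-size cap used in Theorem 1
    util = [0.0] * num_edges                   # omega_e
    price = [0.0] * num_edges                  # lambda_e
    alg = 0.0
    for _ in range(num_requests):
        d = rng.randint(m, M)                  # path length within [m, M]
        start = rng.randint(0, num_edges - d)
        path = range(start, start + d)         # consecutive edges on the line
        r = rng.uniform(0.1 * eps, eps)        # Assumption 1
        v = rng.uniform(1.0, p_bar) * d * r    # Assumption 3
        affordable = v > r * sum(price[e] for e in path)
        fits = all(util[e] + r <= capacity for e in path)
        if affordable and fits:
            for e in path:
                util[e] += r
                price[e] = math.exp(gamma * util[e] / capacity) - 1.0
            alg += v
    lower = capacity * sum(price) / (2.0 * gamma * (math.e - 1.0))
    return alg, lower


alg_value, lemma_bound = simulate_line(
    num_edges=12, capacity=1.0, gamma=4.0, m=2, M=4, p_bar=8.0, num_requests=500
)
```

Because every admitted request satisfies both $v_{i}>r_{i}\sum_{e\in P_{i}}\lambda_{e}^{(i-1)}$ and $v_{i}\geq r_{i}d_{i}$ , the inequality `alg_value >= lemma_bound` should hold for any seed, not only for the default one.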

It then remains to bound the optimal revenue and final edge prices $\lambda_{e}^{(N)}$ for the case with edge saturation, which are dependent on the network topology and the edge capacities.

If $\forall e,C_{e}=C$ , for a $3M$ -long line network, the worst-case scenario is that each almost-saturated edge blocks an $M$ -long request with the highest valuation. There are at most 3 such long and valuable requests blocked by three almost-saturated edges. Thus, the optimal revenue is upper-bounded by $3M\bar{p}C$ , the online revenue is lower-bounded by

	$\displaystyle\frac{3(C\phi(C-\epsilon)+C(2m-1)\phi((C-\epsilon)/2))}{2\gamma(e-1)},$

and the ratio $\frac{\textsf{OPT}(I_{j})}{\textsf{ALG}(\tilde{I}_{j})}$ is upper-bounded by $\frac{2(e-1)\gamma M\bar{p}}{\phi(C)+(2m-2)\phi(C/2)}$ .

The overall competitive ratio is then upper-bounded by $\max\{4(e-1)\gamma+2,\frac{2(e-1)\gamma M\bar{p}}{\phi(C)+(2m-2)\phi(C/2)}\}=\max\{4(e-1)\gamma+2,\frac{2(e-1)\gamma M\bar{p}}{e^{\gamma}+(2m-2)e^{\gamma/2}-(2m-1)}\}=O(\ln\frac{M\bar{p}}{m})$ by choosing $\gamma=O(\ln\frac{M\bar{p}}{m})$ .
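To see how the uniform-capacity bound behaves as $m$ and $M$ vary, one can evaluate it numerically and search for a good $\gamma$ . The sketch below is a simple grid search and is only illustrative: `line_bound`, `best_gamma`, the grid resolution, and the sample parameters are assumptions made here, not the tuning procedure of the paper.

```python
import math


def line_bound(gamma, m, M, p_bar):
    """max{4(e-1)gamma + 2, 2(e-1) gamma M p_bar / (phi(C) + (2m-2) phi(C/2))}."""
    no_saturation = 4.0 * (math.e - 1.0) * gamma + 2.0
    # With uniform capacity C: phi(C) = e^gamma - 1 and phi(C/2) = e^(gamma/2) - 1.
    denom = math.expm1(gamma) + (2 * m - 2) * math.expm1(gamma / 2.0)
    saturation = 2.0 * (math.e - 1.0) * gamma * M * p_bar / denom
    return max(no_saturation, saturation)


def best_gamma(m, M, p_bar, grid=None):
    """Grid point in (0, 30] that minimizes the bound."""
    grid = grid or [0.01 * k for k in range(1, 3001)]
    return min(grid, key=lambda g: line_bound(g, m, M, p_bar))


g_short = best_gamma(m=1, M=64, p_bar=10.0)
g_long = best_gamma(m=8, M=64, p_bar=10.0)
bound_short = line_bound(g_short, 1, 64, 10.0)
bound_long = line_bound(g_long, 8, 64, 10.0)
```

For a fixed $\gamma$ the saturation term decreases in $m$ and increases in $M$ , so the optimized bound with $m=8$ is never worse than the one with $m=1$ , in line with the $O(\ln\frac{M\bar{p}}{m})$ scaling.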

For line networks with heterogeneous capacities, denote the minimum capacity in $\mathcal{N}_{j}$ as $\underline{C}_{j}$ and the edge as $\underline{e}_{j}$ and the maximum capacity of edges within $m$ reach of edge $\underline{e}_{j}$ in $\mathcal{N}_{j}$ as $\bar{C}_{j}$ . The worst-case scenario happens when edge $\underline{e}_{j}$ is almost-saturated and blocks an $M$ -long request. The optimal revenue is upper-bounded by $\sum_{k=j-1}^{j+1}M\bar{p}\underline{C}_{k}$ , the online revenue is lower-bounded by

	$\displaystyle\frac{\sum_{k=j-1}^{j+1}\left[\underline{C}_{k}(e^{\gamma}-1)+2(m-1)\bar{C}_{k}\left(e^{\frac{\gamma\underline{C}_{k}}{2\bar{C}_{k}}}-1\right)\right]}{2\gamma(e-1)},$

and the ratio $\frac{\textsf{OPT}(I_{j})}{\textsf{ALG}(\tilde{I}_{j})}$ is upper-bounded by $\frac{2(e-1)\gamma M\bar{p}}{e^{\gamma}-1+2(m-1)\beta(e^{\frac{\gamma}{2\beta}}-1)}$ , where $\beta=\max_{j}\frac{\bar{C}_{j}}{\underline{C}_{j}}$ . Thus, the overall competitive ratio is upper-bounded by $\max\{4(e-1)\gamma+2,\frac{2(e-1)\gamma M\bar{p}}{e^{\gamma}-1+2(m-1)\beta(e^{\frac{\gamma}{2\beta}}-1)}\}=O(\beta\ln(\frac{M\bar{p}}{2m\beta}+1))$ by choosing $\gamma=O(\beta\ln(\frac{M\bar{p}}{2m\beta}+1))$ . When $\beta$ is large, $\frac{2(e-1)\gamma M\bar{p}}{e^{\gamma}-1+2(m-1)\beta(e^{\frac{\gamma}{2\beta}}-1)}$ is upper-bounded by $\frac{2(e-1)\gamma M\bar{p}}{e^{\gamma}-1}=O(\ln(M\bar{p}))$ by choosing $\gamma=O(\ln(M\bar{p}))$ . Thus, the competitive ratio is upper-bounded by $\max\{O(\ln M\bar{p}),O(\beta\ln(\frac{M\bar{p}}{2m\beta}+1))\}$ .
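The role of the capacity ratio $\beta$ in the saturated-case bound can be illustrated numerically. In the sketch below, `hetero_sat_ratio` evaluates $\frac{2(e-1)\gamma M\bar{p}}{e^{\gamma}-1+2(m-1)\beta(e^{\gamma/(2\beta)}-1)}$ and `m_free_ratio` evaluates the $m$ -independent relaxation $\frac{2(e-1)\gamma M\bar{p}}{e^{\gamma}-1}$ . Both function names and the sample parameters are assumptions for illustration.

```python
import math


def hetero_sat_ratio(gamma, m, M, p_bar, beta):
    """Saturated-case ratio bound for heterogeneous capacities."""
    denom = math.expm1(gamma) + 2 * (m - 1) * beta * math.expm1(gamma / (2.0 * beta))
    return 2.0 * (math.e - 1.0) * gamma * M * p_bar / denom


def m_free_ratio(gamma, M, p_bar):
    """Relaxation that drops the m-dependent term in the denominator."""
    return 2.0 * (math.e - 1.0) * gamma * M * p_bar / math.expm1(gamma)


gamma, m, M, p_bar = 3.0, 4, 32, 10.0
betas = [1.0, 2.0, 8.0, 64.0, 1e4]
ratios = [hetero_sat_ratio(gamma, m, M, p_bar, b) for b in betas]
```

Since $\beta(e^{\gamma/(2\beta)}-1)$ decreases toward $\gamma/2$ as $\beta$ grows, the benefit of a larger minimum path length $m$ shrinks with more heterogeneous capacities, and the bound moves toward, but stays below, the $m$ -free relaxation.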

∎

Remark 1.

When edge capacities are identical ( $\beta=1$ ), PPM-PS_ϕ is $O(\ln{\frac{M\bar{p}}{m}})$ -competitive, recovering the result in [7]. When $\beta=\infty$ and $m=1$ , PPM-PS_ϕ is $O(\ln M\bar{p})$ -competitive, consistent with the result in [8].

2.2.2 Tree Network

We consider a directed full binary tree of depth $M$ and the following two typical arrival patterns in tree networks.

•

Start from Root (SR): All requests start from the root node. This corresponds to the case when there is one source node in the communication network.
•

End at Leaf (EL): Requests must end at any leaf node but not necessarily start from the root node. This corresponds to the case when every non-leaf node can be the source node and generate data.

Theorem 2.

For SR requests, when edge capacities are identical (uniform capacity case), the competitive ratio is $O(2^{m-1}\ln(\frac{M\bar{p}}{m\cdot 2^{m-1}}+1))$ . When the $i$ -level edges have capacity $\frac{C}{2^{i}}$ (exponentially-decreasing capacity case), the competitive ratio is $O(\max\{\max_{x\in[0,m-1]}2^{x}\ln(\frac{M\bar{p}}{m2^{x}}+1),\ln(M\bar{p}+1)\})$ .

Proof.

We start with the case when all edges have the same capacity $C$ . From Lemma 1, we have $\textsf{ALG}\geq\frac{\sum_{e}C\phi_{e}(\omega_{e}^{(N)})}{2(e-1)\gamma}$ . The optimal revenue is upper-bounded by allocating the capacity of the top-level edge to the highest-valued requests: $\textsf{OPT}\leq M\bar{p}C$ . If some edge reaches its capacity, it must be the top-level edge. Since there are at most $2^{k}$ edges at level $k$ , among which the load is evenly distributed, we have

	$\displaystyle\sum_{e}C\phi_{e}(\omega_{e}^{(N)})$	$\displaystyle\geq C\left[\phi(C)+\sum_{k=1}^{m-1}2^{k}\phi\left(\frac{C}{2^{k}}\right)\right]$
		$\displaystyle=C\sum_{k=0}^{m-1}2^{k}\phi\left(\frac{C}{2^{k}}\right)$
		$\displaystyle=C\sum_{k=0}^{m-1}2^{k}(e^{\frac{\gamma}{2^{k}}}-1).$

The ratio for the case with edge saturation is then upper-bounded by $\frac{2(e-1)\gamma\bar{p}CM}{C\sum_{k=0}^{m-1}2^{k}(e^{\frac{\gamma}{2^{k}}}-1)}\leq\frac{2(e-1)\gamma\bar{p}M}{m2^{m-1}(e^{\frac{\gamma}{2^{m-1}}}-1)}$ , and the competitive ratio is eventually bounded by $O(2^{m-1}\ln(\frac{M\bar{p}}{m\cdot 2^{m-1}}+1))$ by choosing $\gamma=O(2^{m-1}\ln(\frac{M\bar{p}}{m\cdot 2^{m-1}}+1))$ .
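The inequality $\sum_{k=0}^{m-1}2^{k}(e^{\gamma/2^{k}}-1)\geq m2^{m-1}(e^{\gamma/2^{m-1}}-1)$ used in the last step holds because $x(e^{\gamma/x}-1)$ is decreasing in $x>0$ , so every term of the sum is at least the last one. A small numerical check, with function names assumed here for illustration:

```python
import math


def level_price_sum(gamma, m):
    """sum_{k=0}^{m-1} 2^k * (exp(gamma / 2^k) - 1)."""
    return sum(2 ** k * math.expm1(gamma / 2 ** k) for k in range(m))


def level_price_lower(gamma, m):
    """m * 2^(m-1) * (exp(gamma / 2^(m-1)) - 1): m copies of the smallest term."""
    return m * 2 ** (m - 1) * math.expm1(gamma / 2 ** (m - 1))


def sr_sat_ratio(gamma, m, M, p_bar):
    """Saturated-case ratio 2(e-1) gamma p_bar M / sum_k 2^k (e^(gamma/2^k) - 1)."""
    return 2.0 * (math.e - 1.0) * gamma * p_bar * M / level_price_sum(gamma, m)


checks = [
    (g, m, level_price_sum(g, m), level_price_lower(g, m))
    for g in (0.5, 2.0, 8.0)
    for m in range(1, 7)
]
```

For $m=1$ the two expressions coincide, and the gap widens as $m$ grows, which indicates that the relaxed bound becomes looser for deeper minimum path lengths.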

We then consider the influence of heterogeneous capacities on the performance of our mechanism by studying the case where edges at the $i$ th level are endowed with a capacity of $\frac{C}{2^{i}}$ . In this case, edges of the first level may not be the first to saturate. Let the first saturated edge be of the $l$ th level ( $l\leq m$ ) and let $\phi_{k}(\omega)=e^{\frac{2^{k}\gamma\omega}{C}}-1$ . The algorithmic output is then lower-bounded as

	$\displaystyle 2(e-1)\gamma\textsf{ALG}$	$\displaystyle\geq\sum_{k=0}^{l}\frac{C}{2^{k}}\phi_{k}(\frac{C}{2^{l}})+\sum_{k=l+1}^{m}\frac{C}{2^{k}}2^{k-l}\phi_{k}(\frac{C}{2^{k}})$
		$\displaystyle=\sum_{k=0}^{l}\frac{C}{2^{k}}(e^{\gamma 2^{k-l}}-1)+\frac{C}{2^{l}}\sum_{k=l+1}^{m}(e^{\gamma}-1)$
		$\displaystyle\geq\frac{l\cdot 2^{l}(e^{\frac{\gamma}{2^{l}}}-1)+(m-l+1)(e^{\gamma}-1)}{2^{l}}$
		$\displaystyle\geq\max\{l\cdot(e^{\frac{\gamma}{2^{l}}}-1),\frac{(m-l+1)(e^{\gamma}-1)}{2^{l}}\}.$

The above inequalities hold for the following reasons. Saturating an edge of the $l$ th level (with capacity $\frac{C}{2^{l}}$ ) implies that each of the $l-1$ ancestor edges carries a load of at least $\frac{C}{2^{l}}$ , and the edges of the $m-l$ child levels are also saturated because each child edge has only one ancestor edge and the total capacity over an edge’s child edges is the same as its own capacity. The optimal value is upper-bounded as $\textsf{OPT}\leq\frac{M\bar{p}C}{2^{l}}$ because the bottleneck link is of capacity $\frac{C}{2^{l}}$ .

The competitive ratio is then upper-bounded by $O(\max\{2^{l}\ln(\frac{M\bar{p}}{m2^{l}}+1),\ln(\frac{M\bar{p}}{m-l+1}+1)\})$ . We thus complete the proof of Theorem 2. ∎

Theorem 3.

For EL requests, when edge capacities are identical, the competitive ratio is $O(\max_{x\in[m,M]}2^{x-1}\ln(\frac{M\bar{p}}{x2^{x-1}}+1))$ . When the $i$ -level edges have capacity $\frac{C}{2^{i}}$ , the competitive ratio is $O(\ln(1+\frac{M\bar{p}}{m}))$ .

Proof.

In a full binary tree, the source nodes of concern must be of depth $d\leq M-m$ . For the identical-capacity case, the optimal value is upper-bounded by $M\bar{p}C$ . Denote the saturated edge at the lowest level by $\underline{e}$ and the level of its parent node by $\underline{d}$ . It is obvious that any ancestor edge of $\underline{e}$ is also saturated. The algorithm output ALG is lower-bounded by

	$\displaystyle\frac{C\phi(C)+C\sum_{i=1}^{M-\underline{d}-1}2^{i}\phi(\frac{C}{2^{i}})}{2(e-1)\gamma}$
	$\displaystyle=\frac{C\sum_{i=0}^{M-\underline{d}-1}2^{i}(e^{\gamma/2^{i}}-1)}{2(e-1)\gamma}$
	$\displaystyle\geq\frac{C(M-\underline{d})2^{M-\underline{d}-1}(e^{\gamma/2^{M-\underline{d}-1}}-1)}{2(e-1)\gamma}.$

The competitive ratio is upper-bounded by

	$\displaystyle\max\{4(e-1)\gamma+2,\frac{2(e-1)M\bar{p}\gamma}{(M-\underline{d})2^{M-\underline{d}-1}(e^{\gamma/2^{M-\underline{d}-1}}-1)}\}$
	$\displaystyle\leq O(2^{M-\underline{d}-1}\ln(\frac{M\bar{p}}{(M-\underline{d})2^{M-\underline{d}-1}}+1)),$

by choosing $\gamma=O(2^{M-\underline{d}-1}\ln(\frac{M\bar{p}}{(M-\underline{d})2^{M-\underline{d}-1}}+1))$ . Thus, the competitive ratio is upper-bounded by $O(\max_{x\in[m,M]}2^{x-1}\ln(\frac{M\bar{p}}{x2^{x-1}}+1))$ .

For the case where the capacity of the $i$ -th level edge is $\frac{C}{2^{i}}$ , the optimal value is upper-bounded by $M\bar{p}\frac{C}{2^{\underline{d}}}$ , where $\underline{d}$ is defined as above. The algorithmic output is lower-bounded as $2(e-1)\gamma\textsf{ALG}\geq\frac{C}{2^{\underline{d}}}\sum_{k=0}^{M-\underline{d}-1}(e^{\gamma}-1)$ . The ratio for the saturated case is thus upper-bounded by $\frac{2(e-1)\gamma M\bar{p}\frac{C}{2^{\underline{d}}}}{\frac{C}{2^{\underline{d}}}\sum_{k=0}^{M-\underline{d}-1}(e^{\gamma}-1)}=\frac{2(e-1)\gamma M\bar{p}}{(M-\underline{d})(e^{\gamma}-1)}=O(\ln(1+\frac{M\bar{p}}{M-\underline{d}}))$ by choosing $\gamma=O(\ln(1+\frac{M\bar{p}}{M-\underline{d}}))$ . Since $M-\underline{d}\geq m$ , the competitive ratio is upper-bounded by $O(\ln(1+\frac{M\bar{p}}{m}))$ .

∎
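To get a feel for the two bounds in Theorem 3, the expressions inside the $O(\cdot)$ can be evaluated directly. The sketch below is an illustration only: it drops the constants hidden by the $O(\cdot)$ notation, treats $x$ as an integer in $\{m,\dots,M\}$ , and uses function names that are our own.

```python
import math


def el_bound_identical(m, M, p_bar):
    """max over integer x in [m, M] of 2^(x-1) * ln(M*p_bar / (x*2^(x-1)) + 1)."""
    return max(
        2 ** (x - 1) * math.log(M * p_bar / (x * 2 ** (x - 1)) + 1.0)
        for x in range(m, M + 1)
    )


def el_bound_decaying(m, M, p_bar):
    """ln(1 + M*p_bar / m), the bound when level-i edges have capacity C / 2^i."""
    return math.log(1.0 + M * p_bar / m)


if __name__ == "__main__":
    for m in (1, 2, 4, 8):
        print(m, el_bound_identical(m, 8, 6.0), el_bound_decaying(m, 8, 6.0))
```

For every tested pair the identical-capacity expression is at least as large as the decaying-capacity one, in line with the comparison drawn in Remark 2.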

Remark 2.

We make the following observations:

• For SR requests, the competitive ratio for the uniform-capacity case is smaller than that for the exponentially-decreasing capacity case, demonstrating the difficulty of the latter.

• The EL requests are harder than SR requests when edge capacities are identical, while the opposite holds for the exponentially-decreasing capacity case. This shows the intricate interaction between the request pattern and the network design, especially the configuration of link capacities.

• The competitive ratio of EL requests with exponentially-decreasing capacity is the same as that of the line network with uniform capacity. A good match between the request pattern and the network configuration is thus vital for a good competitive ratio.

Remark 3.

Theorem 2 and Theorem 3 can be extended trivially to the case when the branching factor of the tree is $b>2$ .

2.3 Main Results II: Impact of System-Level Costs on Optimal Design of PPM-PS_ϕ

As mentioned in the Introduction, the waiting time incurred at nodes leads to a system-level cost, namely the degradation in service quality caused by network congestion, which must be balanced against revenue to further improve social welfare. We model the cost as follows. The sum of mean rates of all flows passing through edge $e$ is defined as $\omega_{e}=\sum_{i\in[N]}r_{i}\delta_{i}^{e}$ , where $\delta_{i}^{e}=1$ indicates that edge $e$ is on the path $P_{i}$ of the $i$ th agent, and $\delta_{i}^{e}=0$ indicates the opposite. In this work, each edge is modeled as an $M/M/1$ queue with arrival rate $\omega_{e}$ and service rate $C_{e}$ . The cost is quantified by the total number of packets in the network $\sum_{e}f(\rho_{e})$ , where

f(\rho_{e})=\begin{cases}\frac{\rho_{e}}{1-\rho_{e}},&0\leq\rho_{e}<1,\\ \infty,&\rho_{e}\geq 1,\end{cases}

and $\rho_{e}=\frac{\omega_{e}}{C_{e}}$ is the utilization of edge $e$ . It is well established in queueing theory that the number of packets in the network is positively related to the average network delay, and it is thus a useful indicator of network congestion. Other typical preferences of network operators, such as maximizing the minimum load, can also be incorporated by choosing a different $f$ . In this regard, $f$ can be viewed as a regularizer of the network state.
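The cost model above translates directly into code. The following is a minimal sketch with hypothetical function and argument names; it only restates $f(\rho_{e})$ and the sum $\sum_{e}f(\rho_{e})$ and is not taken from the authors' implementation.

```python
import math


def mm1_cost(rho):
    """Mean number of packets in an M/M/1 queue with utilization rho."""
    if rho < 0.0:
        raise ValueError("utilization must be non-negative")
    return rho / (1.0 - rho) if rho < 1.0 else math.inf


def network_cost(loads, capacities):
    """Total cost sum_e f(omega_e / C_e); both arguments are dicts keyed by edge."""
    return sum(mm1_cost(loads[e] / capacities[e]) for e in capacities)


if __name__ == "__main__":
    loads = {"a": 50.0, "b": 0.0}
    capacities = {"a": 100.0, "b": 100.0}
    print(network_cost(loads, capacities))
```

The cost is finite only while every edge stays strictly below full utilization, which is what makes $f$ act as a soft capacity constraint.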

When the minimum path length $m>1$ , the worst-case instance is topology-dependent as shown in the previous subsection. To avoid over-complicating the problem and enable the analysis for the case with costs, we consider $m=1$ here.

Theorem 4 provides a set of sufficient conditions on the pricing function $\phi$ to remain competitive after considering the system-level cost.

Theorem 4 (Sufficiency).

For any given $\gamma\geq 1$ , $\textsf{PPM-PS}_{\bm{\phi}}$ is $\gamma$ -competitive if $\bm{\phi}=(\phi_{e})_{\forall e\in\mathcal{E}}$ and each $\phi_{e}:[0,\bar{\rho}_{e}]\rightarrow\mathbb{R}^{+}$ is an analytic and non-decreasing solution to the following differential equation with boundary conditions:

\displaystyle\begin{cases}\left(1-{(C_{e}\phi_{e})^{-1/2}}\right)\phi^{\prime}_{e}=\gamma\left(\phi_{e}-f^{\prime}/C_{e}\right),\\ \phi_{e}(0)=\frac{1}{C_{e}},\phi_{e}(\bar{\rho}_{e})\geq\bar{p},\end{cases}

(1)

where $\bar{\rho}_{e}$ satisfies $f^{\prime}(\bar{\rho}_{e})=\bar{p}C_{e}$ .

Proof.

Before any agent arrives, $\textsf{ALG}=0$ , and the optimal OPT is upper-bounded by $\sum_{e}f^{*}(\phi_{e}(0)C_{e})\geq 0$ , where $f^{*}(y)=\sup_{\rho\in[0,\infty)}[y\rho-f(\rho)]=\begin{cases}(\sqrt{y}-1)^{2},\text{if }y\geq 1,\\ 0,\text{if }y<1.\end{cases}$ By setting $\phi_{e}(0)=\frac{1}{C_{e}}$ , we have $\textsf{ALG}=\textsf{OPT}=0$ .
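The closed form of the conjugate $f^{*}$ can be sanity-checked against a brute-force maximization over a grid of utilizations. The sketch below is illustrative; the helper names are ours. It also evaluates the maximizer $1-1/\sqrt{y}$ , which at $y=\bar{p}C_{e}$ gives the effective utilization $\bar{\rho}_{e}$ defined by $f^{\prime}(\bar{\rho}_{e})=\bar{p}C_{e}$ .

```python
import math


def f(rho):
    """M/M/1 cost rho / (1 - rho) on [0, 1), infinite otherwise."""
    return rho / (1.0 - rho) if 0.0 <= rho < 1.0 else math.inf


def f_conj(y):
    """Closed-form conjugate: (sqrt(y) - 1)^2 for y >= 1, else 0."""
    return (math.sqrt(y) - 1.0) ** 2 if y >= 1.0 else 0.0


def argmax_rho(y):
    """Utilization attaining the supremum; solves f'(rho) = y when y >= 1."""
    return 1.0 - 1.0 / math.sqrt(y) if y >= 1.0 else 0.0


def f_conj_grid(y, n=20000):
    """Brute-force sup of y*rho - f(rho) over a uniform grid on [0, 1)."""
    return max(y * (i / n) - f(i / n) for i in range(n))


if __name__ == "__main__":
    print(f_conj(4.0), f_conj_grid(4.0), argmax_rho(4.0))
    # Effective utilization for p_bar = 6 and C_e = 40 (values from Section 3).
    print(argmax_rho(6.0 * 40.0))
```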

If agent $i$ joins the system, the social welfare increases by

	$\displaystyle\Delta\textsf{ALG}$	$\displaystyle=v_{i}-\sum_{e\in P_{i}}\left[f(\rho_{e}^{(i)})-f(\rho_{e}^{(i-1)})\right]$
		$\displaystyle=\mu_{i}+\sum_{e\in P_{i}}r_{i}p_{e}^{(i-1)}-\sum_{e\in P_{i}}\left[f(\rho_{e}^{(i)})-f(\rho_{e}^{(i-1)})\right],$

where $\mu_{i}$ is the value margin of agent $i$ , i.e., $\mu_{i}=v_{i}-r_{i}\lambda^{(i-1)}(P_{i})\geq 0$ , and $\rho_{e}^{(i)}=\frac{\omega_{e}^{(i)}}{C_{e}}$ .

The optimal social welfare increases at most

\displaystyle D_{i}-D_{i-1}=\mu_{i}+\sum_{e\in P_{i}}\left[f^{*}(\lambda_{e}^{(i)}C_{e})-f^{*}(\lambda_{e}^{(i-1)}C_{e})\right].

Being $\gamma$ -competitive is implied by the following inequality:

	$\displaystyle\sum_{e\in P_{i}}r_{i}\lambda_{e}^{(i-1)}-\sum_{e\in P_{i}}\left[f(\rho_{e}^{(i)})-f(\rho_{e}^{(i-1)})\right]+\left(1-\frac{1}{\gamma}\right)\mu_{i}$
	$\displaystyle\geq\frac{1}{\gamma}\sum_{e\in P_{i}}\left[f^{*}(\lambda_{e}^{(i)}C_{e})-f^{*}(\lambda_{e}^{(i-1)}C_{e})\right],$

which, since $\mu_{i}\geq 0$ and $\gamma\geq 1$ , is further implied by the following inequality holding for each edge $e\in P_{i}$ :

	$\displaystyle r_{i}\lambda_{e}^{(i-1)}-\left[f(\rho_{e}^{(i)})-f(\rho_{e}^{(i-1)})\right]$
	$\displaystyle\geq\frac{1}{\gamma}\left[f^{*}(\lambda_{e}^{(i)}C_{e})-f^{*}(\lambda_{e}^{(i-1)}C_{e})\right].$

Dividing both sides by $r_{i}=C_{e}(\rho_{e}^{(i)}-\rho_{e}^{(i-1)})$ , we have $\forall e\in P_{i},\quad\lambda_{e}^{(i-1)}-\frac{1}{C_{e}}\frac{f(\rho_{e}^{(i)})-f(\rho_{e}^{(i-1)})}{\rho_{e}^{(i)}-\rho_{e}^{(i-1)}}\geq\frac{1}{\gamma}\cdot\frac{f^{*}(\lambda_{e}^{(i)}C_{e})-f^{*}(\lambda_{e}^{(i-1)}C_{e})}{(\lambda_{e}^{(i)}-\lambda_{e}^{(i-1)})C_{e}}\cdot\frac{\lambda_{e}^{(i)}-\lambda_{e}^{(i-1)}}{\rho_{e}^{(i)}-\rho_{e}^{(i-1)}},$ which is implied by the differential inequality

\phi_{e}\left(\rho_{e}^{(i-1)}\right)-\frac{1}{C_{e}}f^{\prime}_{e}\left(\rho_{e}^{(i-1)}\right)\geq\frac{1}{\gamma}f^{*^{\prime}}(\phi_{e}(\rho_{e}^{(i-1)})C_{e})\phi^{\prime}_{e}(\rho_{e}^{(i-1)}).

The right boundary condition is determined by identifying the worst-case instance and ensuring $\gamma$ -competitiveness for it. One worst-case instance consists of agents who can be routed via unit-length paths. This instance has two phases: in the first phase, agents arrive with valuations increasing from $1$ to $\bar{p}$ ; in the second phase, they are followed by agents with valuation $\bar{p}-\epsilon$ . ∎

Theorem 5 reinforces the significance of Eq. (1) by showing that when the rate-to-capacity ratio $\epsilon$ is infinitesimal, the existence of a $\gamma$ -competitive online mechanism for the case with cost is equivalent to the existence of a solution to the integral version of Eq. (1).

Theorem 5 (Necessity).

For any $\gamma>0$ , if there exists a $\gamma$ -competitive deterministic online mechanism (not necessarily a PPM) for the online path selection problem with convex costs and an infinitesimal rate-to-capacity ratio, then the integral version of Eq. (1) has at least one solution.

Proof.

Denote a group of agents with value density $\nu$ and total demand $C_{e}f^{*^{\prime}}(\nu C_{e})$ as $G_{\nu}$ . Consider the following instance $I_{p}$ indexed by $p\in[0,\bar{p})$ : groups $G_{\nu}$ arrive with $\nu$ increasing continuously from $0$ to $p$ , followed by a group $G_{\nu}$ with $\nu=p-\epsilon$ . The optimal solution is composed of all agents in the last group of the instance $I_{p}$ , i.e., group $G_{p-\epsilon}$ , and its welfare is $(p-\epsilon)C_{e}f^{*^{\prime}}((p-\epsilon)C_{e})-f(f^{*^{\prime}}((p-\epsilon)C_{e}))=f^{*}((p-\epsilon)C_{e}).$ Define the utilization of link $e$ under any $\gamma$ -competitive online algorithm after processing $G_{\nu}$ as $\psi_{e}(\nu)$ . Denote the output of an online algorithm as ALG. Given the $\gamma$ -competitiveness, the following inequality holds for all $p\in\left(1/C_{e},\bar{p}\right]$ :

	ALG	$\displaystyle=\int_{1/C_{e}}^{p}\nu C_{e}d\psi_{e}(\nu)-f(\psi_{e}(p))$
		$\displaystyle\geq\frac{1}{\gamma}\textsf{OPT}=\frac{1}{\gamma}f^{*}((p-\epsilon)C_{e}).$		(2)

If there exists a $\gamma$ -competitive online algorithm (not necessarily PPM), we can always construct a PPM with $\psi_{e}(\bar{p})=\bar{\rho}_{e}$ and $\psi_{e}(\frac{1}{C_{e}})=0$ to be at least $\gamma$ -competitive.

The construction is as follows. By the definition of $\psi_{e}(p)$ , we have $\psi_{e}(\nu)\geq\psi_{e}(\frac{1}{C_{e}})\geq 0$ for all $\nu\in[\frac{1}{C_{e}},\bar{p}]$ . If $\psi_{e}(\frac{1}{C_{e}})>0$ , agents that incur negative welfare ( $v_{i}/r_{i}\leq\frac{1}{C_{e}}$ ) will join, and thus there always exists a more competitive online algorithm with $\psi_{e}(\frac{1}{C_{e}})=0$ . If $\psi_{e}(\bar{p})>\bar{\rho}_{e}$ , we can always construct an algorithm that is at least $\gamma$ -competitive by stopping the allocation right before the utilization hits the effective utilization $\bar{\rho}_{e}$ , because the increase in link cost beyond the effective utilization is greater than the increase in value; if $\psi_{e}(\bar{p})<\bar{\rho}_{e}$ , we can always allocate the remaining $\bar{\rho}_{e}-\psi_{e}(\bar{p})$ of link $e$ to $I_{\bar{p}}$ and achieve a competitive ratio no worse than $\gamma$ .

Thus, for any $\gamma$ -competitive online algorithm, there is a $\psi_{e}$ that satisfies Eq. (3):

\displaystyle\begin{cases}\int_{1/C_{e}}^{p}\nu d\psi_{e}(\nu)-\frac{1}{C_{e}}f(\psi_{e}(p))\geq\frac{1}{\gamma C_{e}}f^{*}(pC_{e}),\forall p\in(1/C_{e},\bar{p})\\ \psi_{e}(\frac{1}{C_{e}})=0,\psi_{e}(\bar{p})=\bar{\rho}_{e}.\end{cases}

(3)

Define $\underline{\psi_{e}}(\nu)$ as the infimum over all feasible solutions to Eq. (3): $\underline{\psi_{e}}(\nu)=\inf\{\psi_{e}(\nu)|\psi_{e}$ is non-decreasing and feasible for Eq. (3) $\}$ . One can show that $\underline{\psi_{e}}$ is feasible for Eq. (3), satisfies it with equality, and is strictly increasing.

Construct $\varphi_{e}$ as the inverse of $\underline{\psi_{e}}$ : $\varphi_{e}(\rho)=\underline{\psi_{e}}^{-1}(\rho)$ for all $\rho\in(0,\bar{\rho}_{e})$ , with $\varphi_{e}(0)=\frac{1}{C_{e}}$ and $\varphi_{e}(\bar{\rho}_{e})=\bar{p}$ . By replacing $\nu$ with $\varphi_{e}(s)$ in Eq. (3), we have $\int_{0}^{\rho}\varphi_{e}(s)ds-\frac{1}{C_{e}}f(\rho)=\frac{1}{\gamma C_{e}}f^{*}(\varphi_{e}(\rho)C_{e}),\forall\rho\in(0,\bar{\rho}_{e}),$ which shows that $\varphi_{e}$ is a solution to the following inequality:

		$\displaystyle\int_{0}^{\rho}\phi_{e}(s)ds-\frac{1}{C_{e}}(f(\rho)-f(0))$
		$\displaystyle\geq\frac{1}{\gamma C_{e}}\cdot\left(f^{*}(\phi_{e}(\rho)C_{e})-f^{*}(\phi_{e}(0)C_{e})\right)$
		$\displaystyle=\frac{1}{\gamma C_{e}}\cdot f^{*}(\phi_{e}(\rho)C_{e}),$		(4)

which is the integral version of Eq. (1). ∎
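To make the phrase "integral version" concrete, the following worked step sketches how differentiating Eq. (4), taken with equality, recovers the differential equation in Eq. (1). It assumes that $\phi_{e}$ is differentiable and that $\phi_{e}(\rho)C_{e}\geq 1$ , so that the closed form $f^{*}(y)=(\sqrt{y}-1)^{2}$ from the proof of Theorem 4 applies; these regularity assumptions are ours.

```latex
\begin{align*}
\frac{d}{d\rho}\left[\int_{0}^{\rho}\phi_{e}(s)\,ds-\frac{1}{C_{e}}f(\rho)\right]
  &=\phi_{e}(\rho)-\frac{1}{C_{e}}f^{\prime}(\rho),\\
\frac{d}{d\rho}\left[\frac{1}{\gamma C_{e}}f^{*}(\phi_{e}(\rho)C_{e})\right]
  &=\frac{1}{\gamma}f^{*\prime}(\phi_{e}(\rho)C_{e})\,\phi_{e}^{\prime}(\rho),\\
f^{*}(y)=(\sqrt{y}-1)^{2}\ \Rightarrow\ f^{*\prime}(y)
  &=1-y^{-1/2},\\
\text{hence}\quad\left(1-(C_{e}\phi_{e})^{-1/2}\right)\phi_{e}^{\prime}
  &=\gamma\left(\phi_{e}-\frac{f^{\prime}}{C_{e}}\right).
\end{align*}
```

Equating the first two derivatives and substituting the third line gives the last line, which is the differential equation of Eq. (1).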

3 Experiments

3.1 Description of Experiment Setting

Figure 1: Illustration of Network Topology

In line networks, 100 nodes in a line are connected by edges of uniform capacity 100. We conduct $20$ independent simulations, each generating $300$ requests with $\bar{p}=6$ . In particular, the path length and the value density of a request are drawn uniformly at random from $[m,M]\cap\mathbb{N}$ and $[1,\bar{p}]$ , respectively.
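The line-network workload described above could be generated along the following lines. This is a sketch under stated assumptions, not the authors' generator: the text does not specify how the starting position of a path is chosen, so a uniformly random feasible start is assumed, and the default values of $m$ and $M$ as well as the dictionary layout are placeholders.

```python
import random


def sample_line_requests(num_nodes=100, num_requests=300, m=1, M=10,
                         p_bar=6.0, seed=0):
    """Draw requests on a line of num_nodes nodes (num_nodes - 1 edges).

    Each request is a dict with the list of consecutive edge indices it uses
    and its value density. Requires M <= num_nodes - 1.
    """
    rng = random.Random(seed)
    num_edges = num_nodes - 1
    requests = []
    for _ in range(num_requests):
        length = rng.randint(m, M)  # uniform on {m, ..., M}
        # Assumption: the first edge is uniform over all feasible positions.
        start = rng.randrange(num_edges - length + 1)
        density = rng.uniform(1.0, p_bar)  # uniform on [1, p_bar]
        requests.append({"edges": list(range(start, start + length)),
                         "density": density})
    return requests


if __name__ == "__main__":
    sample = sample_line_requests(m=3, M=7, seed=1)
    print(len(sample), sample[0])
```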

In tree networks, a full binary tree of depth $M$ is generated, with the capacity of the two edges closest to the root set at 2560, and the capacity of subsequent edges decaying exponentially at each level. We conduct $40$ independent simulations, each generating $3000$ requests that originate from the root and follow a random trajectory down the tree. Similarly, the path length and the value density of a request are drawn uniformly at random from $[m,M]\cap\mathbb{N}$ and $[1,\bar{p}]$ , respectively. Figure 1 illustrates the setting.
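A tree workload of this kind might be sketched as follows. Two points are assumptions on our part: the decay factor is taken to be one half per level (matching the $\frac{C}{2^{i}}$ capacities analyzed in Section 2.2.2), and the random trajectory picks the left or right child with equal probability. The default $\bar{p}=6$ is carried over from the line-network setting, and an edge is identified by the (depth, index) pair of its child node, which is our own encoding.

```python
import random


def edge_capacity(level, top_capacity=2560.0):
    """Capacity of an edge at the given level (level 1 touches the root).

    Assumption: capacity halves at every level below the first.
    """
    return top_capacity / 2 ** (level - 1)


def sample_root_path(length, rng):
    """Random root-to-descendant path, returned as a list of (depth, index) edges."""
    index = 0
    path = []
    for depth in range(1, length + 1):
        index = 2 * index + rng.randint(0, 1)  # go to the left or right child
        path.append((depth, index))
    return path


def sample_tree_requests(num_requests=3000, m=1, M=8, p_bar=6.0, seed=0):
    """Requests start at the root; length uniform on {m..M}, density on [1, p_bar]."""
    rng = random.Random(seed)
    return [{"edges": sample_root_path(rng.randint(m, M), rng),
             "density": rng.uniform(1.0, p_bar)}
            for _ in range(num_requests)]


if __name__ == "__main__":
    print(edge_capacity(1), edge_capacity(3))
    print(sample_tree_requests(num_requests=2, m=2, M=5, seed=3))
```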

In both cases, we compute $\textsf{OPT}(I)$ by solving an integer programming problem with Gurobi for each arrival instance $I$ .
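For intuition about what the offline benchmark computes, the following brute-force routine enumerates all subsets of requests on a tiny instance and returns the best feasible total value. It is a stand-in for illustration only and is not the Gurobi model used in the experiments; the request fields `edges`, `rate` and `value` are hypothetical, and the enumeration is exponential in the number of requests.

```python
def brute_force_opt(requests, capacity):
    """Maximum total value of a subset of requests respecting edge capacities.

    requests: list of dicts with keys "edges", "rate", "value".
    capacity: dict mapping each edge to its capacity.
    """
    best = 0.0
    n = len(requests)
    for mask in range(1 << n):
        load = {}
        value = 0.0
        feasible = True
        for i in range(n):
            if (mask >> i) & 1:
                req = requests[i]
                value += req["value"]
                for e in req["edges"]:
                    load[e] = load.get(e, 0.0) + req["rate"]
                    if load[e] > capacity[e] + 1e-9:
                        feasible = False
        if feasible and value > best:
            best = value
    return best


if __name__ == "__main__":
    reqs = [
        {"edges": [0], "rate": 1.0, "value": 1.0},
        {"edges": [0, 1], "rate": 1.0, "value": 3.0},
        {"edges": [1], "rate": 1.0, "value": 1.0},
    ]
    print(brute_force_opt(reqs, {0: 1.0, 1: 1.0}))
```

In the small example, the two short requests together are worth less than the single long request that blocks both edges, so the benchmark keeps only the long one.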

Then, in an effort to simulate a hard arrival instance that challenges the online algorithm, we construct requests with progressively longer paths and increasing value densities for each given path length. This strategy aims to trick the online algorithm into accepting shorter and lower-valued requests and therefore leading to bottlenecks. In contrast, the optimal strategy in hindsight is to reject such requests and accept requests of higher value and longer length.

Lastly, we run an experiment to examine the relationship between the competitive ratio and the maximum price $\bar{p}$ for the case with cost.

(Choices of $\gamma$ ) Optimal designs that hedge against worst cases usually exhibit overly cautious and conservative empirical behaviour. To address this issue, many attempts have been made to develop data-driven designs that also deliver effective practical performance [21]. In our design, the larger the $\gamma$ , the more rapidly the link price increases and therefore the more conservative the online algorithm is. We conduct experiments to examine how price aggressiveness impacts the empirical performance under different topologies and path length bounds.
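The effect of $\gamma$ on conservativeness can be illustrated with a toy posted-price loop. The sketch assumes the exponential price form $e^{\gamma\omega/C_{e}}-1$ that appears in the proofs of Section 2.2; the actual pricing function of the mechanism may include additional scaling, each request here has a fixed path rather than choosing one, and all names are hypothetical. An agent joins only if its value covers its rate times the sum of edge prices on its path, which mirrors the non-negative value margin $\mu_{i}$ .

```python
import math


def edge_price(load, capacity, gamma):
    """Assumed price form exp(gamma * load / capacity) - 1."""
    return math.expm1(gamma * load / capacity)


def run_posted_price(requests, capacity, gamma):
    """Process requests in order; return (total accepted value, accepted indices)."""
    load = {e: 0.0 for e in capacity}
    welfare = 0.0
    accepted = []
    for i, req in enumerate(requests):
        fits = all(load[e] + req["rate"] <= capacity[e] for e in req["edges"])
        path_price = sum(edge_price(load[e], capacity[e], gamma)
                         for e in req["edges"])
        # The agent joins only if its value margin is non-negative.
        if fits and req["value"] >= req["rate"] * path_price:
            for e in req["edges"]:
                load[e] += req["rate"]
            welfare += req["value"]
            accepted.append(i)
    return welfare, accepted


if __name__ == "__main__":
    reqs = [{"edges": [0], "rate": 1.0, "value": 1.0} for _ in range(10)]
    for g in (0.5, 2.0, 4.0):
        print(g, len(run_posted_price(reqs, {0: 10.0}, g)[1]))
```

On ten identical unit-value requests sharing one edge, a larger $\gamma$ admits fewer of them, because the price overtakes the value at a lower utilization.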

3.2 Experimental Findings and Discussions

3.2.1 Impacts of Maximum Path Length

(Line Network) In Figure 2, we see that when $\gamma\in\{0.5,2,4\}$ , the empirical ratio increases logarithmically with $M$ , which is consistent with the theoretical competitive ratio. The intuition is that the optimal allocation OPT has more freedom to select higher-valued requests with longer paths as $M$ increases. Additionally, we observe consistently increasing trends in edge utilization levels as requests with longer paths penetrate further into the line network. Meanwhile, there still exist edges with relatively low utilization levels, as the maximum path length is still small compared to the total number of edges in the network. On the other hand, a longer request stands a greater chance of passing through a highly utilized edge, leading to an overall higher price to fulfill this request and thus a drop in the acceptance rate. This decline in the acceptance rate is particularly pronounced when $\gamma=4$ , which yields the sharpest increase in edge prices with utilization and is therefore the most conservative setting.

(Tree Network) Figure 3 exhibits conflicting relations between $M$ and the empirical ratio. When $\gamma=0.5$ , the empirical ratio increases with $M$ ; when $\gamma=2$ , the empirical ratio also increases with $M$ with a drop when $M=2$ ; when $\gamma=4$ , the empirical ratio decreases, which is contrary to our theoretical findings. It sheds light on the gap between the worst-case analysis and the empirical performance.

Firstly, when $\gamma=4$ , both the acceptance rate of the online algorithm and the maximum utilization over all edges experience a significant drop compared to the cases when $\gamma\in\{0.5,2\}$ due to its high price. In this case, the network is far from being saturated and is overall under-utilized. Thus, the worst-case scenario of having bottlenecks due to edge saturation has not come into the picture yet. It is also not hard to understand why the empirical ratio decreases with the maximum length $M$ in the unsaturated case. When $M$ increases, the average value of users increases, which implies that being conservative at larger $M$ values is more advantageous because future agents are more likely to bring a much higher value.

Moreover, the empirical ratio is the smallest when $\gamma=2$ , which illustrates that a good $\gamma$ should balance between being overly aggressive ( $\gamma=0.5$ ) and overly conservative ( $\gamma=4$ ), and thus achieve a good trade-off between maximizing resource utilization and reserving enough resources for the future. This can be done by increasing $\gamma$ for larger $M$ values while maintaining sufficient utilization levels. Beyond the choice of $\gamma$ , the results also indicate that the maximum path length has a negative effect on the empirical ratio. We advise that network operators keep this in mind and scale the network without increasing the average path length too much. A possible guideline is to avoid deep trees in the network and keep the structure as flat as possible.

3.2.2 Impacts of Minimum Path Length

The trends of the empirical ratio with $m$ in line and tree networks and results for hard instances are discussed in the following. In short, we observe interesting trends for stochastic instances that are partially inconsistent with the theoretical results from the worst-case analysis framework and offer detailed analysis for each of them.

(Line Network) Figure 4 shows the relationship between the empirical ratio and the minimum path length $m$ for line networks under different price aggressiveness $\gamma\in\{0.5,2,4\}$ . As the minimum path length $m$ increases towards the maximum path length $M$ , the range of possible starting points that can guarantee a minimum path length $m$ narrows, making it more likely that requests share a common set of edges. These common edges are thus frequently requested and more likely to be rather saturated, hindering the online algorithm from selecting highest-valued requests whereas the optimal allocation can perform the maximization ex post.

When there exist saturated edges as shown in the utilization row of Figure 4, as $m$ increases, in a line network with homogeneous capacities, the neighbouring edges tend to be extensively utilized as well. Therefore, more requests are rejected due to the resulting high price. It is thus better to be somewhat conservative and avoid using up too much capacity too fast by accepting many lower-valued requests. Due to such susceptibility to the saturation, it is thus more challenging for the online algorithm to perform as close as possible to the optimal allocation, leading to an increase in the empirical ratio.

Interestingly, we also observe a sudden drop in the empirical ratio when $m$ approaches $M=50$ . In this case, requests become increasingly similar in that there are many common edges requested. We conjecture that this results in a significantly reduced ability for even the optimal allocation to optimize, as this phenomenon can be largely explained by the drop in its average utilization rate.

(Tree Network) Figure 5 shows the relationship between the empirical ratio and the minimum path length $m$ for tree networks under different price aggressiveness $\gamma\in\{0.5,2,4\}$ . We see that when $\gamma=0.5$ , the empirical ratio decreases with $m$ , which is consistent with our theoretical findings. We also observe intricate trends in the empirical ratio not explained by our theoretical results. We argue that this is due to the inherent limitation of the worst-case analysis framework as it cannot effectively characterize empirical performance on stochastic inputs. In particular, when $\gamma=2$ , the empirical ratio decreases at first and then increases with $m$ ; when $\gamma=4$ , the empirical ratio increases with $m$ with a minor drop when $m=M=8$ . We offer discussions on the observed phenomena in the following.

When $m$ increases, the requests will be required to travel through more edges in the network, and thus the average edge utilization levels of both the online algorithm and the optimal allocation increase. When $m=M=8$ , all requests are directed all the way down to leaf nodes. Therefore, the minimum utilization of the optimal allocation becomes positive, as all edges, even at the lowest level, will be requested in expectation.

Additionally, we observe that the utilization levels and the acceptance rate of the online algorithm ( $\gamma=0.5$ ) and the optimal allocation are almost identical. It means that the optimal allocation strategy acts closely to a greedy scheme when the environment is stochastic. However, in terms of the empirical ratio, the online algorithm ( $\gamma=2$ ) still outperforms the other two. It clearly shows that an online algorithm does not need to closely mimic the optimal allocation to have a small empirical ratio.

It is no surprise that the empirical ratio of the online algorithm ( $\gamma=4$ ) increases with $m$ given that the maximum utilization of the optimal allocation is strictly smaller than $1$ . The generated instances do not incur any edge saturation for any $m$ , which means that in the online algorithm, every rejected request is turned away because its value is insufficient to cover the posted price. However, as $m$ increases, it is more likely for two requests to share more edges, leading to stronger coupling between requests. Consider two requests whose path lengths differ by $1$ , where one path contains the other. Assume that the longer request provides a higher value but the shorter one has a larger value density. The longer one is easily rejected if the shorter one is accepted first, which makes it harder for the online algorithm to distinguish between them. When $m$ is smaller, it is much easier for the algorithm to drop the requests with smaller values, because it is less likely to encounter this conundrum of choosing between requests with conflicting values and value densities.

When there exists edge saturation (the maximum utilization is 1 for $\gamma\in\{0.5,2\}$ ), the results agree with the theoretical prediction that the ratio decreases with $m$ . However, the theory does not explain why the ratio increases in the right half of the $\gamma=2$ curve. Observing the utilization, as soon as the online algorithm starts to accept fewer requests than the optimal allocation, the empirical ratio increases. One possible reason is that with $\gamma=2$ , there is a mix of saturated and unsaturated edges in the network, and the empirical ratio increases for the same reason as for $\gamma=4$ . This again reveals the limitation of the worst-case analysis approach when applied to complex multi-dimensional systems, for which worst-case instances are typically hard to pin down.

3.2.3 Hard Instances

Having observed mixed results on stochastic inputs, we are interested in how the online algorithm performs against challenging instances. On strategically constructed hard instances, we observe consistent trends that match our theoretical findings.

The construction of hard instances is as follows. In the first part, we generate requests with progressively longer path lengths, and for each path length, we feed the online algorithm requests with gradually increasing value densities, starting from the lower bound. In the second part, we provide high-valued requests with long path lengths. In essence, we want to trick the online algorithm into accepting shorter, lower-valued requests in the first part and thereby create bottlenecks that prevent it from accepting requests in the second part. On the other hand, the optimal allocation strategy should simply ignore requests in the first part and accept requests in the second part.
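The two-part construction can be sketched as a simple generator. The text does not give the number of density steps per path length, the number of second-part requests, or their exact length and density, so the sketch below assumes evenly spaced densities from $1$ to $\bar{p}$ in the first part and requests of length $M$ with density $\bar{p}$ in the second; all parameter names are hypothetical.

```python
def build_hard_instance(m, M, p_bar, densities_per_length=5, num_final=10):
    """Return an ordered list of {"length", "density"} request descriptors.

    Part 1: for each length m..M, densities rise from 1 to p_bar.
    Part 2: num_final requests of length M and density p_bar (assumed values).
    Requires densities_per_length >= 2.
    """
    first_part = []
    for length in range(m, M + 1):
        for j in range(densities_per_length):
            density = 1.0 + (p_bar - 1.0) * j / (densities_per_length - 1)
            first_part.append({"length": length, "density": density})
    second_part = [{"length": M, "density": p_bar} for _ in range(num_final)]
    return first_part + second_part


if __name__ == "__main__":
    for request in build_hard_instance(1, 3, 6.0, densities_per_length=3,
                                       num_final=2):
        print(request)
```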

Figure 6 shows the relationship between the empirical ratio and $M/m$ in line networks, where $M/m$ indicates the fluctuation ratio of path lengths. Intuitively, a larger ratio indicates greater uncertainty and brings additional challenges. As shown before, the order-optimal choice of $\gamma$ is given in Theorem 1, and we select $opt=2\ln(\frac{(e-1)M\bar{p}}{m}+1)$ in the experiment. For $\gamma\in\{0.5,2\}$ , the empirical ratio increases linearly, whereas when $\gamma=opt$ , the empirical ratio grows logarithmically, reasserting the order optimality of our results.
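The value of $\gamma$ labelled $opt$ above is a closed-form expression and can be computed directly. The helper name below is ours, and the example arguments are illustrative.

```python
import math


def gamma_opt(m, M, p_bar):
    """gamma = 2 * ln((e - 1) * M * p_bar / m + 1), as selected in the experiment."""
    return 2.0 * math.log((math.e - 1.0) * M * p_bar / m + 1.0)


if __name__ == "__main__":
    for ratio in (1, 5, 10, 50):
        print(ratio, gamma_opt(1, ratio, 6.0))
```

Because the expression depends on $M$ and $m$ only through $M/m$ and grows logarithmically in it, doubling the fluctuation ratio adds roughly a constant to $\gamma$ rather than doubling it.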

Figure 7 shows that empirical ratios decrease with $m$ and increase with $M$ in tree networks when faced with hard instances for $\gamma\in\{0.5,2,4\}$ , which confirms our theoretical guarantees. Specifically, we observe that when $\gamma=4$ , it delivers the best performance in both cases. Intuitively, a conservative strategy is beneficial when faced with a particularly challenging instance. Lastly, the empirical ratio is logarithmic in $M$ when $\gamma=4$ , further verifying previous theoretical results.

3.2.4 Path Selection with Cost

For online path selection with cost, we have shown that if a solution to Eq. (1) with parameter $\gamma$ exists, there is a corresponding online algorithm that is $\gamma$ -competitive. However, Eq. (1) is notoriously difficult to analyze, being a non-autonomous differential equation with singular boundary conditions. Therefore, we resort to numerically finding the smallest $\gamma$ for which a solution exists, and show its logarithmic growth w.r.t. $\bar{p}$ in Figure 8, in which the link capacity is set to $40$ .

4 Conclusions

In this paper, we investigated the role of the network topology and path lengths in determining the performance of a posted-price mechanism in the online path selection problem. We established new results on the dependence of the competitive ratio on the path length bounds, recovered existing results on the dependence on the maximum path length, and elucidated in particular how the influence of the path length bounds varies across networks, specifically line and hierarchical tree networks. Moreover, we studied the impact of system-level costs on the algorithm design and established sufficient and necessary conditions for competitiveness. Finally, we conducted extensive experiments, which not only confirm our theoretical findings but also uncover the subtler effects of network structure on algorithmic performance in stochastic scenarios. These findings offer valuable insights for the future development of more adaptive online algorithms tailored to specific network characteristics such as bounded path lengths, and we hope they pave the way for further research on online algorithms against other types of constrained adversaries and other system-level coupling effects.

References

[1] C. I. S. WG, “Source packet routing in networking,” 2018.
[2] S.-J. Yang, “Performance evaluation of routing algorithms under various network configuration parameters,” International Journal of Network Management, vol. 7, no. 4, pp. 183–197, 1997.
[3] Q. Ye, B. Wu, and B. Wang, “Distance distribution and average shortest path length estimation in real-world networks,” in Advanced Data Mining and Applications: 6th International Conference, ADMA 2010, Chongqing, China, November 19-21, 2010, Proceedings, Part I 6. Springer, 2010, pp. 322–333.
[4] A. Borodin and R. El-Yaniv, Online computation and competitive analysis. Cambridge University Press, 2005.
[5] P. Bose and P. Morin, “Competitive online routing in geometric graphs,” Theoretical Computer Science, vol. 324, no. 2-3, pp. 273–288, 2004.
[6] N. Buchbinder and J. Naor, “Improved bounds for online routing and packing via a primal-dual approach,” in 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS’06). IEEE, 2006, pp. 293–304.
[7] B. Sun, L. Yang, M. Hajiesmaili, A. Wierman, J. C. Lui, D. Towsley, and D. H. Tsang, “The online knapsack problem with departures,” Proceedings of the ACM on Measurement and Analysis of Computing Systems, vol. 6, no. 3, pp. 1–32, 2022.
[8] L. Yang, A. Zeynali, M. H. Hajiesmaili, R. K. Sitaraman, and D. Towsley, “Competitive algorithms for online multidimensional knapsack problems,” Proceedings of the ACM on Measurement and Analysis of Computing Systems, vol. 5, no. 3, pp. 1–30, 2021.
[9] X. Tan, B. Sun, A. Leon-Garcia, Y. Wu, and D. H. K. Tsang, “Mechanism design for online resource allocation: A unified approach,” Proceedings of the ACM on Measurement and Analysis of Computing Systems, vol. 4, no. 2, pp. 1–46, 2020.
[10] Z. Huang and A. Kim, “Welfare maximization with production costs: A primal dual approach,” Games and Economic Behavior, vol. 118, pp. 648–667, 2019.
[11] T. Roughgarden and É. Tardos, “How bad is selfish routing?” Journal of the ACM (JACM), vol. 49, no. 2, pp. 236–259, 2002.
[12] R. Banner and A. Orda, “Bottleneck routing games in communication networks,” IEEE Journal on Selected Areas in Communications, vol. 25, no. 6, pp. 1173–1179, 2007.
[13] B. Hao and C. Michini, “Inefficiency of pure Nash equilibria in network congestion games: the impact of symmetry and network structure,” ACM Transactions on Economics and Computation, 2024.
[14] T. Chen, Q. Ling, and G. B. Giannakis, “An online convex optimization approach to proactive network resource allocation,” IEEE Transactions on Signal Processing, vol. 65, no. 24, pp. 6350–6364, 2017.
[15] A. Daniely and Y. Mansour, “Competitive ratio vs regret minimization: achieving the best of both worlds,” in Algorithmic Learning Theory. PMLR, 2019, pp. 333–368.
[16] Z. Jiang, P. Lu, Z. G. Tang, and Y. Zhang, “Online selection problems against constrained adversary,” in International Conference on Machine Learning. PMLR, 2021, pp. 5002–5012.
[17] R. El-Yaniv, A. Fiat, R. M. Karp, and G. Turpin, “Optimal search and one-way trading online algorithms,” Algorithmica, vol. 30, pp. 101–139, 2001.
[18] X. Tan, S. Yu, R. Boutaba, and A. Leon-Garcia, “Threshold policies with tight guarantees for online selection with convex costs,” in Proceedings of the 19th Conference on Web and Internet Economics (WINE), 2023.
[19] Y. Cao, S. Yu, X. Tan, and D. H. K. Tsang, “Competitive online path-aware path selection,” ACM SIGMETRICS Performance Evaluation Review, vol. 51, no. 4, pp. 66–72, 2024.
[20] M. Feldman, N. Gravin, and B. Lucier, “Combinatorial auctions via posted prices,” in Proceedings of the twenty-sixth annual ACM-SIAM symposium on Discrete algorithms. SIAM, 2014, pp. 123–135.
[21] A. Zeynali, B. Sun, M. Hajiesmaili, and A. Wierman, “Data-driven competitive algorithms for online knapsack and set cover,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 12, 2021, pp. 10 833–10 841.
