Courteous MPC for Autonomous Driving with CBF-inspired
Risk Assessment

Yanze Zhang¹, Yiwei Lyu², Sude E. Demir³, Xingyu Zhou³
Yupeng Yang¹, Junmin Wang³, IEEE Fellow, and Wenhao Luo¹ ¹Yanze Zhang, Yupeng Yang, and Wenhao Luo are with the Department of Computer Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA. Email: {yzhang94, yyang52, wenhao.luo}@charlotte.edu.²Yiwei Lyu is with the Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, 15213, USA. Email: [email protected].³Sude E. Demir, Xingyu Zhou, and Junmin Wang are with the Walker Department of Mechanical Engineering, University of Texas at Austin, Austin, TX, 78712, USA. Email: {sude.demir, xingyu.zhou, jwang}@austin.utexas.edu.This work was partially supported by NSF Awards 2312465 and 2312466.

Abstract

With more autonomous vehicles (AVs) sharing roadways with human-driven vehicles (HVs), ensuring safe and courteous maneuvers that respect HVs’ behavior becomes increasingly important. To promote both safety and courtesy in AV’s behavior, an extension of Control Barrier Functions (CBFs)-inspired risk evaluation framework is proposed in this paper by considering both noisy observed positions and velocities of surrounding vehicles. The perceived risk by the ego vehicle can be visualized as a risk map that reflects the understanding of the surrounding environment and thus shows the potential for facilitating safe and courteous driving. By incorporating the risk evaluation framework into the Model Predictive Control (MPC) scheme, we propose a Courteous MPC for ego AV to generate courteous behaviors that 1) reduce the overall risk imposed on other vehicles and 2) respect the hard safety constraints and the original objective for efficiency. We demonstrate the performance of the proposed Courteous MPC via theoretical analysis and simulation experiments.

I Introduction

As more autonomous vehicles (AVs) are deployed on public roads, the coexistence of AVs and human-driven vehicles (HVs) is expected to lead to frequent interactions between AVs and HVs, as well as among AVs themselves. Although AVs are designed to engineer for a combination of safety, efficiency, and driving quality, the underlying ego-centric design often overlooks the impact of AV behaviors to the surrounding environments, which may result in unwelcome or even dangerous actions by AVs to other vehicles [1, 2].

To realize safe and courteous driving behavior for AVs, some methods incorporate a courtesy term as a reward feature, and use Inverse Reinforcement Learning (IRL) to learn appropriate weights for such a feature from actual human driving data [1, 3, 4, 5]. Courtesy terms that have been quantified include costs related to the human driver’s tasks [1], active information gathering [3], and measures of selfishness or altruism through Social Value Orientation [4, 5]. Other methods integrate social factors with deep learning to foster socially compliant behaviors, e.g. Social GAN [6]. However, the vehicle’s social preferences, subject to some specific scenario or dataset, may not perform well in unfamiliar conditions. Furthermore, it is challenging to incorporate uncertainties into the generation of these methods or to estimate the failure probability in uncertain trafﬁc scenarios, resulting in a lack of theoretical safety guarantee.

Refer to caption — Figure 1: On-ramp merging scenario where the human-driven vehicle (blue) is trying to merge onto the main lane occupied by the ego vehicle (red). To accommodate the human-driven vehicle (HV) in a courteous manner, the ego vehicle could consider lane-change behavior that makes space for the HV without sacrificing much on safety and efficiency.

Some probabilistic approaches incorporate uncertainties into Partially Observable Markov Decision Process (POMDP) [7] for decision making. Specifically, research in [8] categorizes safety and courtesy into different levels and employs Hidden Markov Models (HMM) to effectively interpret the long-term patterns of behavior despite observational noise, enhancing trajectory prediction accuracy. Other methods [9, 10] formulate uncertainty-aware distance constraints as chance constraints and integrate them into the optimization problem to enhance safety in dynamic environments. However, enforcing these state-based constraints often require enlarged bounding volumes to accommodate braking distances, potentially overestimating the probability and leading to conservative behaviors.

Benefiting from the less restrictive constraints using Control Barrier Functions (CBFs) [11], recent methods [12, 13, 14, 15, 16] reformulated the state-based chance constraints for collision avoidance into deterministic control constraints that enable safety guarantees with high probability as well as task efficiency. Additionally, work in [17] has introduced a risk evaluation framework inspired by CBF [18] under motion uncertainty and integrated it into a risk-aware controller to facilitate effective decision-making in complex dynamic environments. Building on these foundations, we develop an extension to this risk evaluation framework that takes into account both noisy observed positions and motions, which are then integrated with a model predictive controller (MPC) to ensure smooth and robust decision-making for AVs. This integrated controller not only provides effective safety guarantees but also demonstrates courtesy by allowing additional space for the maneuvers of surrounding vehicles. Our contributions in this paper are threefold:

•

A novel extension to the existing CBF-inspired risk evaluation framework is proposed, which incorporates both position error and motion error to robustify the risk measurement under uncertainty;
•

A Courteous MPC method for autonomous driving is proposed that integrates the developed risk evaluation to consider heterogeneous surrounding human-driven vehicles with a provable probabilistic safety guarantee;
•

Simulations on interaction with vehicles modeled by intelligent driver model (IDM) [19] and NGSIM dataset [20] are shown to validate the effectiveness of the proposed risk evaluation framework and Courteous MPC.

II Background

II-A Control Barrier Functions

Consider a dynamical system in control-affine form:

\dot{x}=f(x)+g(x)u

(1)

where $x\in\mathbb{R}^{d}$ and $u\in\mathbb{R}^{m}$ are the system state vector and control input vector, respectively. The vector fields $f:\mathbb{R}^{d}\mapsto\mathbb{R}^{d}$ and matrix-valued function $g:\mathbb{R}^{d}\mapsto\mathbb{R}^{d\times m}$ are assumed to be locally Lipschitz continuous.

Given the control affine system in Eq. (1) and the safe set $\mathcal{H}$ defined as the zero-superlevel set of a continuously differentiable function $h:\mathbb{R}^{d}\mapsto\mathbb{R}$ , Safety Barrier Certificates [11] has been used to enforce the safety set forward invariance, as defined in the following lemma.

Lemma 1.

[Summarized from [11]] Given a dynamical system affine defined in Eq. (1) and a safe set $\mathcal{H}$ as the 0-super level set of a continuously differentiable function $h:\mathbb{R}^{d}\mapsto\mathbb{R}$ , the function $h$ is called a control barrier function, if there exists an extended class- $\mathcal{K}$ function $\alpha(\cdot)$ such that $\sup_{u\in\mathbb{R}^{m}}\{\dot{h}(x,u)\}\geq-\alpha(h(x))$ for all $x\in\mathbb{R}^{d}$ . The admissible control space for any Lipschitz continuous controller $u\in\mathbb{R}^{m}$ rendering $\mathcal{H}$ forward invariant thus becomes:

\displaystyle K_{\textrm{cbf}}(x)=\{u\in\mathbb{R}^{m}|L_{f}h(x)+L_{g}h(x)u\geq-\alpha(h(x))\}

(2)

where $L_{f}h(x)=\nabla h^{T}(x)f(x)$ and $L_{g}h(x)=\nabla h^{T}(x)g(x)$ , respectively.

This defines the set of controllers enforcing the system state $x$ staying in $\mathcal{H}$ over time if the initial system state $x(t=0)$ is inside the set $\mathcal{H}$ .

II-B Sample-Data Control Barrier Functions

However, when the system states are updated discretely and so does the controller, such controller implemented under a discrete-time system may not satisfy the safety condition using the continuous constraints defined in Eq. (2) between sample time steps [21]. To this end, the Sampled-data Control Barrier Functions (Sampled-data CBFs) [22] are introduced to enforce the forward invariance of safe set using Controller Margin¹¹1An additional term compare with Eq. (2) added to the right-hand side in Eq. (2) is called the controller margin., which is summarized in the following lemma.

Lemma 2.

[Summarized from [22]] Given the control affine system defined in Eq. (1) where the state is measured discretely (i.e, the sampled states $x_{k}=x(kdt)$ , where $k$ is the time step and $dt$ is the one time step duration and the control law is updated discretely (i.e., $u(t)=u_{k}(x_{k})$ ), and the safe set $\mathcal{H}$ , the admissible control space for any Lipschitz continuous controller $u_{k}\in\mathbb{R}^{m}$ at the time step $k$ rendering $\mathcal{H}$ forward invariant is defined as below:

	$\displaystyle S_{\textrm{cbf}}(x_{k})=\{u_{k}\in\mathbb{R}^{m}$	$\displaystyle\|L_{f}h(x_{k})+L_{g}h(x_{k})u_{k}$
		$\displaystyle\geq-\alpha(h(x_{k}))+l\delta dt\}$		(3)

where $x_{k}$ is the observed state at time step k, $l=l_{L_{f}h}+l_{L_{g}h}u_{max}+l_{\alpha}(h)$ ( $l_{L_{f}h},l_{L_{g}h}$ and $l_{\alpha}(h)$ are the Lipschitz constants of $L_{f}h,L_{g}h$ and $\alpha(h)$ , respectively), $\delta=\sup_{x\in S,u\in U}||f(x)+g(x)u||$ .

This defines the set of controllers for the system whose states are observed discretely to stay in $\mathcal{H}$ over time, especially during $[t_{k},t_{k+1})$ where $t_{k}=kdt$ .

II-C Kinematic Vehicle Model

By assuming that the tire slip ratios and slip angles are small, the vehicle’s orientation and position evolve based on its velocity, steering angle, and wheelbase. The kinematics of ego vehicle $\mathcal{V}_{e}$ as an AV can be described by the Kinematic Bicycle Model [23] as follows:

\begin{cases}\dot{X}_{e}&=v_{e}\cos(\phi_{e}+\beta_{e})\\ \dot{Y}_{e}&=v_{e}\sin(\phi_{e}+\beta_{e})\\ \dot{v}_{e}&=a_{e}\\ \dot{\phi}_{e}&=\frac{v_{e}}{l}\sin\beta_{e}\\ \beta_{e}&=\tan^{-1}(\frac{1}{2\tan\delta_{e}})\end{cases}

(4)

where $(X_{e},Y_{e})$ represents the vehicle’s longitude and lateral positions. $\phi_{e}$ is the vehicle heading angle and $v_{e}$ is the velocity of the vehicle. $\beta_{e}$ is the side slip angle at the center of gravity and $a_{e}$ the acceleration. $\delta_{e}$ is the front wheel steering angle.

When the vehicle travels on a highway, the environment and the vehicle dynamics may evolve rapidly, requiring quick responses from the control system [24]. Discretizing the kinematic bicycle model allows the computation to be performed efficiently. In this case, the system in Eq. (4) can be discretized as a linear parameter-varying system using Euler’s method. To simplify the discussion, we use the mapping $\tilde{f}$ to represent the system:

\mathbf{s}_{e,k+1}=\tilde{f}(\mathbf{s}_{e,k},\mathbf{u}_{e,k})

(5)

where $\mathbf{s}_{e,k}=[X_{e,k},Y_{e,k},v_{e,k},\phi_{e,k}]^{\text{T}}\in\mathbb{R}^{4}$ and $\mathbf{u}_{e,k}=[a_{e,k},\delta_{e,k}]^{\text{T}}\in\mathbb{R}^{2}$ are the state and control input of the vehicle $e$ at time step $k$ , respectively. In the following sections, we use $\mathbf{P}_{e,k}$ to represent the position $(X_{e,k},Y_{e,k})$ of the ego vehicle $\mathcal{V}_{e}$ at time step $k$ .

II-D Problem Statement

We consider the scenario where an AV operates on the road and can observe the neighboring vehicles’ positions and velocities. For the observed vehicle $\mathcal{V}_{i}$ at time step $k$ :

	$\displaystyle\mathbf{P}_{i,k}=(X_{i,k},Y_{i,k})$
	$\displaystyle\mathbf{V}_{i,k}=(v^{x}_{i,k},v^{y}_{i,k})$

where $\mathbf{P}_{i,k},\mathbf{V}_{i,k}$ represent the position and velocity, respectively. $v^{x}_{i,k},v^{y}_{i,k}$ are the velocities of the vehicle $i$ in the $x$ and $y$ direction. Both $\mathbf{P}_{i,k},\mathbf{V}_{i,k}$ are considered as Gaussian random variables with $\mathbf{P}_{i,k}\sim(\hat{\mathbf{P}}_{i,k},\Sigma^{P}_{i})$ and $\mathbf{V}_{i,k}\sim(\hat{\mathbf{V}}_{i,k},\Sigma^{V}_{i})$ , where $\hat{\mathbf{P}}_{i,k}\in\mathbb{R}^{2}$ and $\hat{\mathbf{V}}_{i,k}\in\mathbb{R}^{2}$ are the observed location and velocity of the neighboring vehicle, respectively, and $\Sigma^{P}_{i}\in\mathbb{R}^{2\times 2}$ and $\Sigma^{V}_{i}\in\mathbb{R}^{2\times 2}$ are the diagonal covariance matrix.

Remark 1.

Neighboring vehicles are the vehicles located within a specific distance that reflects the sensing range.

Motivated by CBF-inspired risk evaluation framework in [17], we seek to quantify the risk when there is both position and velocity uncertainty and exploit the potential of this risk evaluation framework in safe and courteous driving.

III Methods

III-A Control Barrier Functions-inspired Conditional Value at Risk under Noisy Observations

The vehicles operating on roads are assumed to be covered by an ellipse [25, 26], hence motivated by [27] the safety set between any pairwise vehicles at time step $k$ can be formulated as follows:

\mathcal{H}^{P_{k}}_{e,i}=\{(\mathbf{P}_{e,k},\mathbf{P}_{i,k})|h_{e,i}(\mathbf{P}_{k})\geq 0\}

(6)

h_{e,i}(\mathbf{P}_{k})=(X_{e,k}-X_{i,k})^{2}+(\frac{Y_{e,k}-Y_{i,k}}{\tau})^{2}-\mathcal{D}_{s}^{2}

(7)

where $\mathbf{P}_{k}$ is the joint position space of the ego vehicle $e$ and its neighboring vehicle $\forall i$ at the time step $k$ , $\tau$ is the scaling factor and $\mathcal{D}_{s}$ is the safety distance.

Inspired by Lemma 2, we use the function²²2In this paper, we select the class $\kappa$ function $\alpha(h)=\gamma h$ as the same setting in [28, 17], where $\gamma\in\mathbb{R}\geq 0$ . $H_{e,i}(\mathbf{P}_{k},\dot{\mathbf{P}}_{k})$ to represent the severity of the potential pairwise collision, which is defined as:

\displaystyle H_{e,i}(\mathbf{P}_{k},\dot{\mathbf{P}}_{k})=-\dot{h}_{e,i}(\mathbf{P}_{k},\dot{\mathbf{P}}_{k})-\gamma h_{e,i}(\mathbf{P}_{k})+l_{H}\delta_{H}dt

(8)

where $l_{H}=l_{L_{f}h_{e,i}}+l_{L_{g}h_{e,i}}u_{max}+l_{\alpha}(h_{e,i})$ with $\delta_{H}=\sup_{\mathbf{P}_{e,k}\in\mathcal{H}^{P_{k}}_{e,i}}|\dot{\mathbf{P}}_{e,k}|$ and $dt$ is the duration of one time step (the dependence of $h_{e,i}$ is omitted for notation simplicity.).

With the Gaussian distributed observed positions and velocities of the neighboring vehicles, $H_{e,i}(\mathbf{P}_{k},\dot{\mathbf{P}}_{k})$ is distributed as a noncentral chi-square distribution. Using the Delta method [29], we can approximate the distribution of $H_{e,i}(\mathbf{P}_{k},\dot{\mathbf{P}}_{k})$ as a Gaussian distribution where the mean and variance are estimated with the ﬁrst-order Taylor series expansion, i.e., $H_{e,i}(\mathbf{P}_{k},\dot{\mathbf{P}}_{k})\sim\mathcal{N}(\boldsymbol{\mu}_{H_{e,i}},\Sigma_{H_{e,i}})$ with:

	$\displaystyle\boldsymbol{\mu}_{H_{e,i}}$	$\displaystyle=-\dot{h}_{e,i}(\hat{\mathbf{P}}_{k},\dot{\hat{\mathbf{P}}}_{k})-\gamma h_{e,i}(\hat{\mathbf{P}}_{k})+l_{H}\delta_{H}dt$		(9)
	$\displaystyle\Sigma_{H_{e,i}}$	$\displaystyle=\nabla H_{e,i}(\boldsymbol{\mu}_{H_{e,i}})^{\text{T}}\Sigma_{e,i}\nabla H_{e,i}(\boldsymbol{\mu}_{H_{e,i}})$		(10)

where $\Sigma_{e,i}\in\mathbb{R}^{8\times 8}$ is the covariance of the $[\mathbf{P}_{e,k},\mathbf{P}_{i,k},\dot{\mathbf{P}}_{e,k},\dot{\mathbf{P}}_{i,k}]$ .

We employ conditional value-at-risk³³3With a user-defined confidence level $\alpha$ , $\mathrm{CVaR}$ is the expected cost in the worst $\alpha$ -percentile of the distribution function $H_{e,i}(\mathbf{P}_{k},\dot{\mathbf{P}}_{k})$ . (CVaR) [8] to quantify the ego vehicle’s perceived risk from potential pairwise collision and observation uncertainty:

	$\displaystyle\mathrm{CVaR}_{\alpha}(H_{e,i}):$	$\displaystyle=\frac{\int_{1-\alpha}^{1}\mathrm{V}\mathrm{a}\mathrm{R}_{1-\lambda}(H_{e,i})d\lambda}{\alpha}$
		$\displaystyle=E(H_{e,i}\|H_{e,i}\geq\mathrm{V}\mathrm{a}\mathrm{R}_{\alpha}(H_{e,i}))$		(11)

where $\mathrm{V}\mathrm{a}\mathrm{R}_{\alpha}(H_{e,i}):=\text{inf}\{H\in\mathbb{R}|\mathrm{Pr}(H_{e,i}\geq H)\leq\alpha\}$ , and $\alpha$ is a user-defined confidence level. $\mathrm{Pr}(\cdot)$ indicates the probability of an event. To simplify the notation, we use $H_{e,i}$ to represent $H_{e,i}(\mathbf{P}_{k},\dot{\mathbf{P}}_{k})$ by omitting the dependence on $\mathbf{P}_{k}$ and $\dot{\mathbf{P}}_{k}$ .

Theorem 3.

Consider the safe set $\mathcal{H}^{P_{k}}_{e,i}$ defined in Eq. (6). Let $l_{H}=l_{L_{f}h_{e,i}}+l_{L_{g}h_{e,i}}u_{max}+l_{\alpha}(h_{e,i})$ , $\delta_{H}=\sup_{\mathbf{P}_{e,k}\in\mathcal{H}^{P_{k}}_{e,i}}|\dot{\mathbf{P}}_{e,k}|$ , and with a user-defined confidence level $\alpha$ . If the ego vehicle is in a safe state at the initial time step, then the constraint defined in Eq. (12) at time step $k$ will guarantee the pairwise vehicle-to-vehicle safety with at least $1-\alpha$ probability during $[t_{k},t_{k+1})$ .

\mathrm{CVaR}_{\alpha}(H_{e,i})\leq 0.

(12)

Proof.

To simplify the notation, let us denote $L=\{H_{e,i}\geq\mathrm{VaR}_{\alpha}(H_{e,i})\}$ . Since $\mathrm{CVaR}_{\alpha}\leq 0$ suggests that the mean of the worst $\alpha$ portion of the outcomes for $H_{e,i}$ is negative, we can get $\mathrm{VaR}_{\alpha}(H_{e,i})\leq 0$ . Assume for contradiction that $\mathrm{VaR}_{\alpha}\geq 0$ . Then according to the definition, every element in the set $L$ is larger than $0$ , which contradicts the assumption that $\mathrm{CVaR}_{\alpha}(H_{e,i})=E(H_{e,i}|L)\leq 0$ .

Given $\mathrm{VaR}_{\alpha}(H_{e,i})\leq 0$ and from the definition of $\mathrm{VaR}$ , we have $\mathrm{Pr}(H_{e,i}\geq\mathrm{VaR}_{\alpha}(H_{e,i}))\leq\alpha$ , we can get that $\mathrm{Pr}(H_{e,i}\geq 0)\leq\alpha$ . Then it is straightforward that:

\mathrm{Pr}(H_{e,i}\leq 0)=1-\mathrm{Pr}(H_{e,i}\geq 0)\geq 1-\alpha.

(13)

According to the definition of $H_{e,i}$ in Eq. (8) and using Lemma 2, we can conclude that the vehicle-to-vehicle safety can be guaranteed with at least $1-\alpha$ probability. $\square$ ∎

Remark 2.

Note that Eq. (III-A) defines the probabilistic safety for a single time step. When extending the consideration to cover the high-probability safety throughout the entire trajectory, the step-wise probability for the vehicle-to-vehicle safety is lower bounded by $(1-\alpha)^{k_{t}}$ , where $k_{t}$ is the time steps of the entire trajectory. Readers are referred to [9, 12] for more detailed discussions.

Based on the pairwise risk evaluation defined in Eq. (III-A), we further define the resultant perceived risk $R_{e,k}$ of the ego vehicle $\mathcal{V}_{e}$ on the highway at time step $k$ as:

R_{e,k}=\max\{\mathrm{CVaR}_{\alpha}(H_{e,i}),\forall i\neq e\}.

(14)

Remark 3.

By assuming optimal behavior from both human drivers and automated vehicles, it is reasonable to conclude that the vehicles in the environment are well-spaced [30]. Consequently, we utilize the maximum risk derived from all pairwise risks to represent the perceived risk from neighboring vehicles. This approach is reasonable by the fact that in real-world scenarios, human drivers tend to focus more on the vehicles that pose the greatest perceived risk. A similar strategy using maximum risk to represent the risk perceived by the ego vehicle has been employed in [8].

III-B Ego-perceived Risk Map for Highway Decision Making

Using the CBF-inspired risk evaluation framework, we can construct a comprehensive risk map that enables deeper insights into how perceived risk is aggregated, enhancing the decision-making processes for the ego vehicle.

We employ a three-lane highway as a running example with two vehicles running on the road where the red vehicle is the ego vehicle $\mathcal{V}_{e}$ and the blue one $\mathcal{V}_{b}$ is the observed neighboring vehicle. To construct the risk map depicted in Fig. 2, we counterfactually place the ego vehicle at various points across the map and evaluate the perceived risk at each point using Eq. (14). This evaluated risk is then attributed to the corresponding point on the map and visualized using a gradient color scheme to explicitly depict risk variations.

Using the risk maps depicted in Fig. 2, we illustrate how risk is aggregated based on various velocity configurations and their influence on the ego-perceived risk map. In all three subfigures, the positions of the ego vehicle $\mathcal{V}_{e}$ and the neighboring vehicle $\mathcal{V}_{b}$ are the same. This consistent positioning allows us to isolate and analyze the effects of velocity changes on the perceived risk without the confounding factor of varying spatial relationships. The state and velocity noise covariance are both set as diag $\{0.1,0.1\}$ and the confidence level is set as $\alpha=0.1$ . The dashed blue contour in every subfigure delineates the area where the evaluated risk exceeds $0$ . In Fig. 2(a), both $\mathcal{V}_{e}$ and $\mathcal{V}_{b}$ are traveling at a velocity of $(15,0)$ , and it is obvious that the closer the ego vehicle is to the neighboring vehicle, the larger the perceived risk. Fig. 2(b) illustrates a shift in risk perception when $\mathcal{V}_{b}$ ’s velocity changes to $(10,0)$ , showing a noticeable bias. This is reasonable given that the ego vehicle’s velocity exceeds that of the leading vehicle.

Furthermore, we adjust $\mathcal{V}_{b}$ ’s velocity to $(15,1.5)$ , which significantly reduces the safe region in the third lane as shown in Fig. 2(c). If $\mathcal{V}_{e}$ intends to overtake $\mathcal{V}_{b}$ , although there appears to be sufficient space in Lane three, the risk map suggests that overtaking in Lane one is a preferable strategy. This decision not only offers a safer alternative but also exhibits courteous behavior towards $\mathcal{V}_{b}$ .

Remark 4.

This discussion focuses solely on the impact of the observed vehicle’s velocity on the risk map, serving to illustrate the effectiveness of the proposed CBF-inspired risk evaluation method. In the context of highway scenarios, velocity is the predominant factor. However, other elements e.g. the safety radius, parameter $\gamma$ , or noise distribution also significantly influence the ego-perceived risk map.

Based on these observations, it is reasonable to conclude that: 1) the risk map based on the risk evaluation method is applicable and illustrates varying performance under different positions and motions, and 2) it shows the potential for aiding AV’s decision-making for safe and courteous driving.

III-C Courteous Model Predictive Control

Inspired by the analysis of the CBF-inspired risk map and Theorem 3, to realize safe and courteous driving performance, we integrate the perceived risk by ego vehicle into the MPC framework.

For a MPC framework, a general task-related cost function $J(\mathbf{s}_{e,k})$ is defined as below:

	$\displaystyle J(\mathbf{s}_{e,k})$	$\displaystyle=\sum_{t=0}^{N-1}\bigl{(}l_{\text{S}}(\mathbf{s}_{e,k+t\|k},\mathbf{u}_{e,k+t\|k})\bigr{)}$
		$\displaystyle+l_{\text{T}}(\mathbf{s}_{e,k+N\|k},\mathbf{u}_{e,k+N\|k})$		(15)

where $N$ is the prediction horizon, $l_{\text{S}}(\mathbf{s}_{e,k+t|k},\mathbf{u}_{e,k+t|k})$ and $l_{\text{T}}(\mathbf{s}_{e,k+N|k},\mathbf{u}_{e,k+N|k})$ represent the stage cost and terminal cost, respectively. $\mathbf{s}_{e,k+t|k}$ denotes the state vector at time step $t+k$ predicted at time step $k$ obtained by starting from the current state $\mathbf{s}_{e,k}$ by applying the input sequence $\mathbf{u}_{k:k+N-1|k}$ . To realize safe and courteous driving, we further integrate the risk perceived by the ego vehicle to the cost function and constraints to synthesize the MPC controller, which is defined as:


	$\displaystyle\min_{\mathbf{u}_{k:k+N-1\|k}}J(\mathbf{s}_{e,k})+\sum_{t=0}^{N}\bigl{(}P_{\text{S}}R_{e,k+t\|k}\bigr{)}$	(16)
$\displaystyle s.t.\quad$	$\displaystyle\mathbf{s}_{e,k+t+1\|k}=\tilde{f}(\mathbf{s}_{e,k+t\|k},\mathbf{u}_{e,k+t\|k}),t=0,\cdots,N-1$
	$\displaystyle\mathbf{s}_{e,k\|k}=\mathbf{s}_{e,k}$
	$\displaystyle R_{e,k+t\|k}\leq 0,t=0,\cdots,N-1$	(16a)
	$\displaystyle\mathbf{s}_{min}\leq\mathbf{s}_{e,k+t\|k}\leq\mathbf{s}_{max},t=0,\cdots,N-1$
	$\displaystyle\mathbf{u}_{min}\leq\mathbf{u}_{e,k+t\|k}\leq\mathbf{u}_{max},t=0,\cdots,N-1$

where $P_{\text{S}}$ is a user-defined weight parameter and $R_{e,k+t|k}$ is the predicted risk at time step $k$ . $\mathbf{s}_{min}$ and $\mathbf{s}_{max}$ represent the lower and upper bounds of the state constraints. $\mathbf{u}_{min}$ and $\mathbf{u}_{max}$ represent the lower and upper bounds of the control inputs. By the constraint of Eq. (16a), we can guarantee safety with at least $1-\alpha$ probability. By integrating the risk evaluation into the cost function, the ego vehicle may make decisions to reduce the perceived risk to yield more space for the neighboring vehicles.

IV Experiments and Discussion

To validate the effectiveness of the proposed Courteous MPC, the parameter analysis experiment is conducted first to discuss the influence of weight parameter $P_{\mathrm{S}}$ on the ego vehicle courtesy performance. Then, the simulation experiments on interaction with both IDM vehicles and the NGSIM US101 dataset are conducted to validate the effectiveness of our proposed Courteous MPC. We employ the do-mpc library [31] to formulate the MPC controller and utilize IPOPT [32] as the nonlinear solver.

IV-A Parameter Analysis

To exploit the influence of the weight parameter $P_{\mathrm{S}}$ on the ego vehicle performance, we conducted a cruise driving task with varying $P_{\mathrm{S}}$ in the highway-env [33] simulation environment, where the behavior of surrounding vehicles is controlled by IDM [19]. The state and velocity noise covariances are both set as diag $\{0.1,0.1\}$ with the confidence level set as $\alpha=0.1$ . The target speed of the ego vehicle is $15\;m/s$ . As depicted in Fig. 3, with the increase of $P_{\mathrm{S}}$ in Eq. 16, the minimum distance between the ego vehicle and neighboring vehicles also increases. This indicates that the penalty of the risk term allows more space around the ego vehicle, enhancing comfort for surrounding vehicles. Additionally, we can observe that even with a lower $P_{\mathrm{S}}$ , a reasonable space is still maintained to ensure safe driving.

IV-B Simulation Experiment on IDM Vehicles

To further validate the performance of the proposed Courteous MPC, three methods including Courteous MPC, Risk-aware MPC and Batch MPC [26] are conducted for the cruise control task in the highway-env simulation environment [33]. For clarity, the controller is designated as ”Risk-aware MPC” when the risk term is removed from the cost function. For Courteous MPC, we use the same parameter setting as Section. IV-A and set $P_{\mathrm{S}}=0.25$ .

As depicted in Fig. 4, we provide snapshots demonstrating two examples of overtaking behavior for each method. Both Risk-aware MPC and Courteous MPC initiate overtaking maneuvers earlier than Batch MPC, which supports the efficacy of our CBF-inspired risk evaluation framework in preventing collisions proactively. Compared to Risk-aware MPC, Courteous MPC initiates overtaking maneuvers even earlier. This suggests that integrating the risk term into the cost function allows more space around neighboring vehicles, thereby enhancing comfort and safe margins. Additionally, Fig. 5 shows the minimum distance maintained by the ego vehicle from IDM-controlled vehicles over time. This analysis also reveals that Courteous MPC consistently allows greater space around the ego vehicle, enhancing comfort for surrounding vehicles⁴⁴4The simulation details can be found at https://youtu.be/s5aeT6MPhio .

To prove the effectiveness of Courteous MPC, we compare the cruise task performance of the three methods as shown in Table I. Performance metrics include the average speed of the ego vehicle and the total travel distance in the longitudinal direction. The data reveals that Courteous MPC significantly outperforms the other two methods on both metrics. Notably, the relatively lower accuracy of Batch MPC aligns with the findings reported in [34]. These improvements suggest that providing additional space does not detrimentally affect overall task performance. Instead, it likely enables the ego vehicle to seek more open areas, allowing it to achieve higher speeds rather than confining itself within narrow spaces.

TABLE I: Task Performance Comparison

	Average speed ( $m/s$ )	Longitude distance ( $m$ ) within 30 seconds
Courteous MPC	14.96	446.09
Risk-aware MPC	14.89	441.89
Batch MPC	13.80	414.60

To further validate the robustness of the Courteous MPC, we conduct $10$ trials for the three methods and plot the average minimum distances with error bars, as shown in Figure 6. For every trial, the positions of the vehicles are randomly initiated. The results corroborate our earlier findings, indicating that Courteous MPC consistently allows for greater spacing around surrounding vehicles, thereby facilitating more courteous interactions on the road.

IV-C Simulation Experiment on NGSIM Dataset

To further verify the performance of the Courteous MPC on real-world data set, we conduct the simulation experiment in the NGSIMENV simulation environment [30] with Next Generation Simulation (NGSIM) US101 dataset [20], where a human-driven vehicle is replaced by the intelligent vehicle empowered by the Courteous MPC. As we can observe from Fig. 7, the Courteous MPC can successfully travel through heterogeneous vehicles, which is realized by setting different scaling factors $\tau$ based on the vehicles’ sizes in Eq. (7). Note that in Fig. 7(b), the ego vehicle is running on the dashed white lane due to the neighboring two vehicles are running too close to the lane. The best way for the ego vehicle to show courtesy to both two vehicles is to maintain its position on the dashed white lane ⁵⁵5The simulation details can be found at https://youtu.be/9Cekw5xafWw .

V Conclusions

In this paper, we proposed an extension of the Control Barrier Functions-inspired risk evaluation framework while considering both noisy observed positions and velocities of neighboring vehicles. The visualization of the risk map suggests that the proposed risk evaluation framework has the potential to aid the decision-making of AVs for safe and courteous driving. The resultant Courteous Model Predictive Control (Courteous MPC) which integrates the risk term into the cost function and constraints shows superior performance through a detailed analysis of the efficacy, efficiency, and robustness using simulation experiments.

References

[1] L. Sun, W. Zhan, M. Tomizuka, and A. D. Dragan, “Courteous autonomous cars,” in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2018, pp. 663–670.
[2] Y. Wang, Y. Ren, S. Elliott, and W. Zhang, “Enabling courteous vehicle interactions through game-based and dynamics-aware intent inference,” IEEE Transactions on Intelligent Vehicles, vol. 5, no. 2, pp. 217–228, 2019.
[3] S. Wang, Y. Lyu, and J. M. Dolan, “Active probing and influencing human behaviors via autonomous agents,” in IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2023, pp. 1514–1521.
[4] B. Toghi, R. Valiente, D. Sadigh, R. Pedarsani, and Y. P. Fallah, “Social coordination and altruism in autonomous driving,” IEEE Transactions on Intelligent Transportation Systems, vol. 23, no. 12, pp. 24 791–24 804, 2022.
[5] Y. Lyu, W. Luo, and J. M. Dolan, “Responsibility-associated multi-agent collision avoidance with social preferences,” in IEEE 25th International Conference on Intelligent Transportation Systems (ITSC). IEEE, 2022, pp. 3645–3651.
[6] A. Gupta, J. Johnson, L. Fei-Fei, S. Savarese, and A. Alahi, “Social gan: Socially acceptable trajectories with generative adversarial networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 2255–2264.
[7] C. Hubmann, J. Schulz, M. Becker, D. Althoff, and C. Stiller, “Automated driving in uncertain environments: Planning with interaction and uncertain maneuver prediction,” IEEE transactions on intelligent vehicles, vol. 3, no. 1, pp. 5–17, 2018.
[8] K.-C. Hsu, K. Leung, Y. Chen, J. F. Fisac, and M. Pavone, “Interpretable trajectory prediction for autonomous vehicles via counterfactual responsibility,” in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2023, pp. 5918–5925.
[9] H. Zhu and J. Alonso-Mora, “Chance-constrained collision avoidance for mavs in dynamic environments,” IEEE Robotics and Automation Letters, vol. 4, no. 2, pp. 776–783, 2019.
[10] T. Nyberg, C. Pek, L. Dal Col, C. Norén, and J. Tumova, “Risk-aware motion planning for autonomous vehicles with safety specifications,” in IEEE intelligent vehicles symposium (iv). IEEE, 2021, pp. 1016–1023.
[11] A. D. Ames, S. Coogan, M. Egerstedt, G. Notomista, K. Sreenath, and P. Tabuada, “Control barrier functions: Theory and applications,” in 18th European Control Conference (ECC). IEEE, 2019, pp. 3420–3431.
[12] W. Luo, W. Sun, and A. Kapoor, “Multi-robot collision avoidance under uncertainty with probabilistic safety barrier certificates,” Advances in Neural Information Processing Systems, vol. 33, pp. 372–383, 2020.
[13] Y. Lyu, W. Luo, and J. M. Dolan, “Probabilistic safety-assured adaptive merging control for autonomous vehicles,” in IEEE International conference on robotics and automation (ICRA). IEEE, 2021, pp. 10 764–10 770.
[14] Y. Zhang, Y. Yang, and W. Luo, “Occlusion-free image-based visual servoing using probabilistic control barrier certificates,” IFAC-PapersOnLine, vol. 56, no. 2, pp. 4381–4387, 2023.
[15] Y. Yang, Y. Lyu, Y. Zhang, S. Yi, and W. Luo, “Decentralized multi-robot line-of-sight connectivity maintenance under uncertainty,” arXiv preprint arXiv:2406.12802, 2024.
[16] H. Zhou, Y. Zhang, and W. Luo, “Safety-critical control with uncertainty quantification using adaptive conformal prediction,” arXiv preprint arXiv:2407.03569, 2024.
[17] Y. Lyu, W. Luo, and J. M. Dolan, “Risk-aware safe control for decentralized multi-agent systems via dynamic responsibility allocation,” in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2023, pp. 1–8.
[18] Y. Lyu, J. M. Dolan, and W. Luo, “Cbf-inspired weighted buffered voronoi cells for distributed multi-agent collision avoidance,” in American Control Conference (ACC). IEEE, 2023, pp. 4513–4518.
[19] M. Treiber, A. Hennecke, and D. Helbing, “Congested traffic states in empirical observations and microscopic simulations,” Physical review E, vol. 62, no. 2, p. 1805, 2000.
[20] V. Alexiadis, J. Colyar, J. Halkias, R. Hranac, and G. McHale, “The next generation simulation program,” Institute of Transportation Engineers. ITE Journal, vol. 74, no. 8, p. 22, 2004.
[21] G. Yang, C. Belta, and R. Tron, “Continuous-time signal temporal logic planning with control barrier functions,” in American Control Conference (ACC). IEEE, 2020, pp. 4612–4618.
[22] J. Breeden, K. Garg, and D. Panagou, “Control barrier functions in sampled-data systems,” IEEE Control Systems Letters, vol. 6, pp. 367–372, 2021.
[23] P. Polack, F. Altché, B. d’Andréa Novel, and A. de La Fortelle, “The kinematic bicycle model: A consistent model for planning feasible trajectories for autonomous vehicles?” in IEEE intelligent vehicles symposium (IV). IEEE, 2017, pp. 812–818.
[24] H. Chu, D. Meng, S. Huang, M. Tian, J. Zhang, B. Gao, and H. Chen, “Autonomous high-speed overtaking of intelligent chassis using fast iterative model predictive control,” IEEE Transactions on Transportation Electrification, vol. 10, no. 1, pp. 1244–1256, 2024.
[25] A. Pierson, W. Schwarting, S. Karaman, and D. Rus, “Navigating congested environments with risk level sets,” in IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2018, pp. 5712–5719.
[26] V. K. Adajania, A. Sharma, A. Gupta, H. Masnavi, K. M. Krishna, and A. K. Singh, “Multi-modal model predictive control through batch non-holonomic trajectory optimization: Application to highway driving,” IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 4220–4227, 2022.
[27] L. Wang, A. D. Ames, and M. Egerstedt, “Safe certificate-based maneuvers for teams of quadrotors using differential flatness,” in IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2017, pp. 3293–3298.
[28] J. Zeng, B. Zhang, and K. Sreenath, “Safety-critical model predictive control with discrete-time control barrier function,” in American Control Conference (ACC). IEEE, 2021, pp. 3882–3889.
[29] J. M. Ver Hoef, “Who invented the delta method?” The American Statistician, vol. 66, no. 2, pp. 124–127, 2012.
[30] Z. Huang, J. Wu, and C. Lv, “Driving behavior modeling using naturalistic human driving data with inverse reinforcement learning,” IEEE transactions on intelligent transportation systems, vol. 23, no. 8, pp. 10 239–10 251, 2021.
[31] F. Fiedler, B. Karg, L. Lüken, D. Brandner, M. Heinlein, F. Brabender, and S. Lucia, “do-mpc: Towards fair nonlinear and robust model predictive control,” Control Engineering Practice, vol. 140, p. 105676, 2023.
[32] A. Wächter and L. T. Biegler, “On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming,” Mathematical programming, vol. 106, pp. 25–57, 2006.
[33] E. Leurent, “An environment for autonomous driving decision-making,” https://github.com/eleurent/highway-env, 2018.
[34] L. Zheng, R. Yang, M. Y. Wang, and J. Ma, “Barrier-enhanced homotopic parallel trajectory optimization for safety-critical autonomous driving,” arXiv preprint arXiv:2402.10441, 2024.


	$\displaystyle\min_{\mathbf{u}_{k:k+N-1\|k}}J(\mathbf{s}_{e,k})+\sum_{t=0}^{N}\bigl{(}P_{\text{S}}R_{e,k+t\|k}\bigr{)}$	(16)
$\displaystyle s.t.\quad$	$\displaystyle\mathbf{s}_{e,k+t+1\|k}=\tilde{f}(\mathbf{s}_{e,k+t\|k},\mathbf{u}_{e,k+t\|k}),t=0,\cdots,N-1$
	$\displaystyle\mathbf{s}_{e,k\|k}=\mathbf{s}_{e,k}$
	$\displaystyle R_{e,k+t\|k}\leq 0,t=0,\cdots,N-1$	(16a)
	$\displaystyle\mathbf{s}_{min}\leq\mathbf{s}_{e,k+t\|k}\leq\mathbf{s}_{max},t=0,\cdots,N-1$
	$\displaystyle\mathbf{u}_{min}\leq\mathbf{u}_{e,k+t\|k}\leq\mathbf{u}_{max},t=0,\cdots,N-1$

Courteous MPC for Autonomous Driving with CBF-inspired Risk Assessment