MAPS-X: Explainable Multi-Robot Motion Planning via Segmentation

Justin Kottinger¹, Shaull Almagor², and Morteza Lahijanian^1,3 J. Kottinger received funding from the Department of Education through a Graduate Assistantships in Areas of National Need Fellowship in Critical Aerospace Technologies. S. Almagor has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 837327.¹ Department of Aerospace Engineering Sciences, University of Colorado Boulder, USA {firstname.lastname}@colorado.edu² The Henry and Marilyn Taub Faculty of Computer Science, Technion, Israel shaull@cs.technion.ac.il³ Department of Computer Science, University of Colorado Boulder, USA

Abstract

Traditional multi-robot motion planning (MMP) focuses on computing trajectories for multiple robots acting in an environment, such that the robots do not collide when the trajectories are taken simultaneously. In safety-critical applications, a human supervisor may want to verify that the plan is indeed collision-free. In this work, we propose a notion of explanation for a plan of MMP, based on visualization of the plan as a short sequence of images representing time segments, where in each time segment the trajectories of the agents are disjoint, clearly illustrating the safety of the plan. We show that standard notions of optimality (e.g., makespan) may create conflict with short explanations. Thus, we propose meta-algorithms, namely multi-agent plan segmenting-X (MAPS-X) and its lazy variant, that can be plugged on existing centralized sampling-based tree planners X to produce plans with good explanations using a desirable number of images. We demonstrate the efficacy of this explanation-planning scheme and extensively evaluate the performance of MAPS-X and its lazy variant in various environments and agent dynamics.

I Introduction

Multi-robot motion planning (MMP) is a fundamental challenge in robotics and artificial intelligence (AI). The goal in MMP is to plan trajectories for multiple robots according to their dynamics to reach their respective goal regions such that, when the plans are executed simultaneously, every robot (agent) successfully completes its trajectory without colliding with other agents or obstacles. Applications of MMP can be found in many areas where several moving agents interact in a shared workspace. One limitation of various AI tools, including MMP, is their inability to explain their decisions and actions to human users [1]. In many safety-critical application domains, such as air-traffic control and hazardous material warehouses, MMP tools are rarely utilized if at all. Instead, the trajectories are either hand designed or, if a planner is used, the generated paths are given to a human-supervisor for safety verification before execution. Thus, these settings require the plan to be presented in a humanly-understandable manner. Specifically, the presentation should enable the supervisor to understand the path taken by individual agents and to verify that the agents do not collide. To this end, the goal of this work is to present a method of generating explainable motion plans for multi-agent systems.

In general, MMP is NP-Complete even in the discrete (finite space) setting and naturally becomes intractable in the continuous (space) setting as the number of robots grows. A significant body of work is dedicated to overcoming this difficulty, both in the discrete domain (e.g., [2, 3]) and continuous domain (e.g., [4, 5, 6, 7]). There are two general approaches to MMP: centralized methods, which work in the composed space of all the agents [8, 6, 7], and de-centralized approaches, which divide the problem into several subproblems and solve each separately [9, 10, 11, 12]. The focus of all these works is to overcome the general difficulty of computing a plan in a short amount of time and optimizing the plan according to a measure such as makespan and path length. These works do not take into consideration the explainability of the plan. In fact, explainability often conflicts with other optimality measures and requires separate treatment.

Significant effort has been dedicated to providing explanations for problems in AI and machine learning. For example, work [13] utilizes visualization to explain the result of certain machine learning algorithms that often come up with complicated classifiers. In [14], explanations are given by analyzing alternative plans with some user-defined properties. In [15], a user proposes a plan, and explanations are given as a minimal set of differences between the actual plan and the proposed plan. A broader approach was later given in [16], where multiple types of explanations are allowed. In that setting, the user can change the plan, motivating the planner to either explain why the original plan is better or to re-plan. None of these studies, however, focus on the MMP problem.

In [17], we propose an explanation scheme based on visualization for MMP in discrete domains. There, in order to convince a human supervisor that a suggested plan does not cause a collision between the agents, the plan is decomposed into time segments, such that within each segment the paths of the agents are disjoint (i.e., non-intersecting). Then, an explanation of the plan comprises a sequence of images representing each segment. An example of such explanations for three continuous agents is shown Fig. 1. It is important to note that, since identification of line intersections is made very early in the cognitive process (namely in the primary visual cortex) [18, 19], it is easy for a human to verify that in each segment, the paths do not intersect. Moreover, the sequence of images is potentially (and indeed, often in practice) much shorter than displaying, e.g., a slowed-down video of the agents taking their paths, and is hence easy to verify. In addition, [17] showed that finding optimal explanations for a given motion plan can be done in polynomial time, whereas generating plans for explainability, i.e., limiting the number of segments, is, at best, NP-Complete.

The algorithms proposed in [17] are based on a discrete (or discretized) environment, and thus overlook the challenges involved with motion planning in the continuous domain. In this work, we focus on MMP with explanations for realistic robotic systems in the continuous space with kinodynamical constraints. To this end, we treat explainability as an additional constraint on top of MMP, and incorporate it into existing sampling-based algorithms. As mentioned above and shown in [17], there is often a trade-off between planning for short explanations and short paths. Hence, explainability may conflict with state-of-the-art heuristics for MMP. In order to factor out precise heuristics, we devise generic meta-algorithms, that search for optimally-explainable plans using any centralized sampling-based algorithm. We demonstrate our meta algorithms by plugging them with classical motion planners such as rapidly-exploring random trees (RRT) [20].

The main contribution of this work is an explanation scheme for MMP that is based on path segmentation in the continuous domain. This scheme introduces a new constraint (challenge) to the motion planning problem that is not previously studied, to the best of our knowledge. We present two meta algorithms called multi-agent plan segmenting-X (MAPS-X), where X can be any existing centralized (kinodynamic) MMP planner. Another contribution is an extensive evaluation of these algorithms, highlighting their generality and differences in performance.

II Problem Formulation

Consider $k\in\mathbb{N}$ robotic systems (agents), in a shared workspace $W\subseteq\mathbb{R}^{2}$ which includes a finite set of obstacles $O$ , where each obstacle $o\in O$ is a closed subset of $W$ , i.e., $o\subset W$ . The motion of each agent $i\in\{1,2,\ldots,k\}$ is subject to the following dynamic constraint:

\displaystyle\mathbf{\dot{x}}_{i}

\displaystyle=f_{i}(\mathbf{x}_{i},\mathbf{u}_{i}),

\displaystyle\mathbf{x}_{i}

\displaystyle\in X_{i}\subseteq\mathbb{R}^{n_{i}},

\displaystyle\mathbf{u}_{i}

\displaystyle=U_{i}\subseteq\mathbb{R}^{m_{i}},

(1)

where $X_{i}$ and $U_{i}$ are the agent $i$ ’s state and input spaces, respectively, and $f_{i}:X_{i}\times U_{i}\rightarrow X_{i}$ is an integrable and possibly nonlinear function.

Given a time interval $[t_{0},t_{f}]$ , where $t_{0},t_{f}\in\mathbb{R}_{\geq 0}$ and $t_{0}<t_{f}$ , a controller $\mathbf{u}_{i}:[t_{0},t_{f}]\to U_{i}$ , and initial state $x_{i,0}\in X_{i}$ , function $f_{i}$ can be integrated up to time $t_{1}\leq t_{f}$ to form a trajectory segment $\mathbf{x}_{i}^{t_{0}:t_{1}}$ for agent $i$ , where $\mathbf{x}_{i}^{t_{0}:t_{1}}(t_{0})=x_{i,0}$ . For $s\in\mathbb{N}$ consecutive time intervals,

[t_{0},t_{1}],[t_{1},t_{2}],\ldots,[t_{s-1},t_{s}],

(2)

we define a trajectory

T_{i}=\{\mathbf{x}_{i}^{t_{0}:t_{1}},\mathbf{x}_{i}^{t_{1}:t_{2}},\ldots,\mathbf{x}_{i}^{t_{s-1}:t_{s}}\},

where $\mathbf{x}_{i}^{t_{l-1}:t_{l}}(t_{l})=\mathbf{x}_{i}^{t_{l}:t_{l+1}}(t_{l})$ for all $l\in\{1,2,\ldots,s-1\},$ to be a set of $s$ trajectory segments.

Let $X_{i}^{G}\subset X_{i}$ and $x_{i,0}\in X_{i}$ denote the goal region (destination) and initial state of agent $i$ , respectively. The goal multi-robot motion planning (MMP) is to find a trajectory $T_{i}$ with $\mathbf{x}_{i}(t_{0})=x_{i,0}$ for every agent $i\in\{1,2,\dots,k\}$ such that no agent collides with any obstacles nor with other agents, and $\mathbf{x}_{i}(t_{f})\in X_{i}^{G}$ . In explainable MMP we require that, in addition, the interval $[t_{0},t_{f}]$ can be segmented to at most $r\in\mathbb{N}$ consecutive time intervals (for some user-defined upper bound $r$ ), such that within each time interval, the trajectories are disjoint when presented to the user.

To formally define the explainable MMP problem, we start by formalizing the notion of disjoint segments. Let $\textsc{proj}_{W}^{X_{i}}:\mathbb{R}^{n_{i}}\rightarrow W$ be a function that projects a state $x_{i}\in X_{i}$ of agent $i$ onto workspace $W$ . Then, we call two trajectory segments $\mathbf{x}_{i}^{t_{1}:t_{2}}$ and $\mathbf{x}_{j}^{t_{1}:t_{2}}$ disjoint if

\textsc{proj}_{W}^{X_{i}}(\mathbf{x}_{i}^{t_{1}:t_{2}}(t))\;\neq\;\textsc{proj}_{W}^{X_{j}}(\mathbf{x}_{j}^{t_{1}:t_{2}}(t^{\prime}))\quad\forall t,t^{\prime}\in[t_{1},t_{2}].

We extend the notion of disjoint from segments to trajectories as follows. Given the $s$ time intervals from (2), we call a set of trajectories $T=\{T_{1},T_{2},\ldots,T_{k}\}$ segment-disjoint if for every segment $l\in\{1,2,\ldots,s\}$ the induced trajectory-segments $\mathbf{x}_{i}^{t_{l-1}:t_{l}}\in T_{i}$ and $\mathbf{x}_{j}^{t_{l-1}:t_{l}}\in T_{j}$ are disjoint for all $i,j\in\{1,2,\ldots,k\}$ and $i\neq j$ .

We can now formulate the explainable MMP problem as follows.

Problem 1 (Explainable MMP)

Given $k$ robotic agents with dynamics described in (1), initial states $x_{1,0},\ldots,x_{k,0}$ , goal regions $X^{G}_{1},\ldots,X^{G}_{k}$ , and a bound $r\in\mathbb{N}$ on the number of segments, find a controller $\mathbf{u}_{i}:[t_{0},t_{f}]\to U_{i}$ for each agent $i\in\{1,\ldots,k\}$ and time points $t_{1},\ldots,t_{s}$ , where $s\leq r$ and $t_{0}<t_{1}<\ldots<t_{s-1}<t_{s}=t_{f}$ , such that the obtained trajectory $T_{i}$ takes agent $i$ from $\mathbf{x}_{i}(t_{0})=x_{i,0}$ to $\mathbf{x}_{i}(t_{f})\in X^{G}_{i}$ while avoiding collisions with obstacles, and the set of trajectories $T=\{T_{1},\ldots,T_{k}\}$ is segment-disjoint.

Note that the segment-disjoint requirement for the trajectories in Problem 1 implies that the trajectories do not cause collisions between the agents. Further, a solution to this problem consists of not only $k$ valid trajectories but also a decomposition of these trajectories into $s$ set of disjoint segments. The explanations are then $s$ images, one image per set of segments projected onto the workspace $W$ , e.g., Fig. 1(b)-1(d).

We emphasize that our goal in this work is not to design an efficient algorithm that solves the general MMP problem. Rather, our goal is to design meta algorithms that turn an existing MMP planner into an explainable MMP that solves Problem 1. Specifically, we focus on centralized MMP, where planning takes place in the compound space of the agents. This approach has two advantages: first, centralized algorithms maintain most of the structure of the search space. This allows us to obtain optimal explanations (segments), which in turn sets a baseline by which to compare more involved algorithms for explainable MMP. Second, centralized planning uses well-understood algorithms, which enable us to reason about the explanations we obtain with respect to different algorithms. In contrast, de-centralized MMP algorithms may be too involved to separate their properties from the explanations they yield. We discuss incorporating explanations into more involved algorithms in Section V.

Finally, we remark that our definitions of disjoint trajectory segments is based on trajectory intersections. This definition (and the proposed algorithms) can be easily extended to intersections of robots with 2D shapes.

III Algorithms

In this section, we present three methodologies to solve Problem 1. These are based on a centralized approach to MMP, so we first present Planner X, which is a generic centralized sampling-based tree planner. Then, we define a post-process procedure that can minimally decompose (i.e., explain) an existing MMP solution, e.g., returned by Planner X, into disjoint segments. Next, we outline two frameworks that can be incorporated into Planner X to solve explainable MMP queries.

III-A Planner X: Centralized Sampling-based Tree MMP

Centralized tree-based planners grow a motion tree in the composed state space $\mathbb{X}=X_{1}\times X_{2}\times\ldots\times X_{k}$ of the agents according to the dynamics in (1) through sampling and extension procedures. A generalized sampling-based tree planner X is outlined in Alg. 1. It takes the composed state and input spaces, $\mathbb{X}$ and $\mathbb{U}=U_{1}\times\ldots\times U_{k}$ , respectively, the goal set $\mathbb{G}=X_{1}^{G}\times\ldots\times X_{k}^{G}$ , the set of obstacles $O$ , an initial configuration for all agents $x_{0}=(x_{1,0}^{T},\ldots,x_{k,0}^{T})^{T}\in\mathbb{X}$ , and a specified number of iterations $N\in\mathbb{N}$ and returns a valid trajectory if found in $N$ iterations.

G=\{V\leftarrow x_{0},E\leftarrow\emptyset\}

;

2 for $N$ iterations do

(x_{rand},x_{near})\leftarrow sample(\mathbb{X},V)

;

x_{new}=\textsc{multiAgentExtend}(x_{near},x_{rand},\mathbb{U})

;

5 if $isValid(\overrightarrow{x_{near},x_{new}})$ then

V\leftarrow V\cup\{x_{new}\};

E\leftarrow E\cup\{\overrightarrow{x_{near},x_{new}}\}

;

7 if $x_{new}\in\mathbb{G}$ then

8 return $\overrightarrow{x_{0},x_{new}}$ ;

return $G(V,E)$

Algorithm 1 Planner X (

\mathbb{X}

\mathbb{U}

\mathbb{G}

, O,

x_{0}

, N)

The algorithm first initializes the graph with the initial state (root node) $x_{0}$ . Next, an existing node $x_{near}$ and a random state $x_{rand}$ are picked through a sampling process in Line 1. After that, Line 1 samples control inputs $u\in\mathbb{U}$ and propagates the multi-agent system from $x_{near}$ toward $x_{rand}$ to generate a new state $x_{new}$ . If the newly generated trajectory $\overrightarrow{x_{near},x_{new}}$ is valid, i.e., it does not result in a collision with obstacles nor with other agents, then Line 1 adds the vertex $x_{new}$ and the edge $\overrightarrow{x_{near},x_{new}}$ to $G$ . This process repeats a maximum of $N$ times, exiting early if a solution is found. To speed up computation, it is common in the multi-agent setting to propagate only agents $i$ that are not in their respective goal region $X_{i}^{G}$ in the procedure multiAgentExtend. For the remainder of the paper, we refer to this planner as Planner X.

III-B Minimal Disjoint Segmentation

One method of explaining a multi-agent trajectory is to solve the explainability problem after the planning problem. This solution involves running a motion planner (e.g., Planner X) to generate a solution $T=\overrightarrow{x_{0},x_{goal}}$ , and then finding a minimal segmentation of the solution. This post-processing procedure, shown in Alg. 2, is called segmentSol.

d

s\leftarrow 1

;

v\leftarrow x_{0}

;

2 while $v\neq x_{goal}$ do

intersection\leftarrow\textsc{project2D}(v,d)

;

4 if intersection then

5 for $d-1$ $times$ do

v.segmentNum\leftarrow s

;

v\leftarrow v.child

;

d\leftarrow 1

;

s\leftarrow s+1

;

10 else

d\leftarrow d+1

;

return $\overrightarrow{x_{0},x_{goal}}$

Algorithm 2 segmentSol(

T=\overrightarrow{x_{0},x_{goal}}

)

Alg. 2 is implemented using a greedy approach – we traverse the nodes of the solution trajectory $\overrightarrow{x_{0},x_{goal}}$ , and add nodes to a segment as long as the projection of the agents’ trajectories do not intersect (Line 2). Once an intersection occurs, we end the segment (collating the nodes before the intersection), and start a fresh segment (Lines 2 and 2).

Checking for intersections (procedure project2D, Line 2) can be implemented efficiently, e.g., using linear interpolations of the projected paths in some $\Delta t$ intervals. As proven in [17], the greedy approach of Alg. 2 is guaranteed to obtain a minimal segmentation among those whose end-points are multiples of $\Delta t$ . Thus, we have the following theorem.

Theorem 1 (Min. disjoint segmentation [17], Thm. 3.1)

Given a set of trajectories $T=\{T_{1},\ldots,T_{k}\}$ for $k$ agents, Alg. 2 computes a segmentation of $T$ such that the resulting trajectories are segment-disjoint with minimal number of segments¹¹1Minimal among segmentations whose end-points are multiples of $\Delta t$ ..

Observe that Alg. 2 makes a single pass on the trajectory, but for each new node along the trajectory, intersections need to be checked against the segment collated so far. Thus, the algorithm has a quadratic running time, parametrized by the implementation of project2D, which can be made efficient.

The post-processing procedure segmentSol provides explanations for a solution of the MMP problem, i.e., enables users to validate that multi-agent trajectories are collision free. While this procedure guarantees a minimal segmentation for the given trajectory, it cannot guarantee that this number is below the given bound $r$ (or indeed, the minimal segmentation of any trajectory). That is because the planning process and the segmentation process are completely separate. Thus, strictly relying on Alg. 2 could result in unsatisfiable explanations that require many segments. Next, we show how to solve the planning problem and the explanability problem simultaneously.

III-C Planning with Segmentation

III-C1 Lazy MAPS-X

The most intuitive solution for combining explanations into the planning procedure is to incorporate Alg. 2 into Planner X. The resulting algorithm is shown in Alg. 3. It runs Planner X until a solution is found. Then, rather than immediately exiting the loop with a solution, Line 3 calculates the number of disjoint segments of the solution. If it is satisfactory, i.e., it is at most the user-defined upper bound $r$ , it returns trajectory $T=\overrightarrow{x_{0},x_{new}}$ . Otherwise, Line 3 prunes the unsatisifiable portion of the solution and continues planning. The loop repeats until a satisfiable solution is found.

G=\{V\leftarrow x_{0},E\leftarrow\emptyset\}

;

2 for $N$ iterations do

(x_{rand},x_{near})\leftarrow sample(\mathbb{X},V)

;

x_{new}=\textsc{multiAgentExtend}(x_{near},x_{rand},\mathbb{U})

;

5 if $isValid(\overrightarrow{x_{near},x_{new}})$ then

V\leftarrow V\cup\{x_{new}\}

;

E\leftarrow E\cup\{\overrightarrow{x_{near},x_{new}}\}

;

7 if $\mathbf{x}_{new}\in\mathbb{G}$ then

s\leftarrow\textsc{segmentSol}(\overrightarrow{x_{0},x_{new}})

;

9 if $s\leq r$ then

10 return $\overrightarrow{x_{0},x_{new}}$

11 else

pruneSol(\overrightarrow{x_{0},x_{new}})

;

return $G(V,E)$

Algorithm 3 Lazy MAPS-X (

\mathbb{X}

\mathbb{U}

\mathbb{G}

, O,

x_{0}

, N,

r

)

We call this framework lazy multi-agent plan segmenting X (Lazy MAPS-X) because it ‘lazily’ turns Planner X into an explainable planner. The number of segments required to explain a trajectory segment from the root node $x_{0}$ to a node $x_{new}$ is only considered if it is part of a possible solution. All other nodes are ignored. In Section IV, we show that Lazy MAPS-X works well when the environment naturally admits small number of explanations, but its benefits are hindered when the system behavior must be changed to match optimal explanation schemes. In such a situation, a more involved framework is desired.

III-C2 MAPS-X

We provide another version of our framework by further integrating segmentation into the planner. This planner differs from its Lazy counterpart by calculating how many segments are required to explain each trajectory from the root node $x_{0}$ to a new node $x_{new}$ during the planning phase. In doing so, MAPS-X only adds nodes that have a satisfiable number of explanations to graph (tree) $G$ .

Alg. 4 outlines MAPS-X. Here, we define a cost for each node $x_{new}$ in the graph $G$ equivalent to the number of segments required to explain the trajectory from the root node $x_{0}$ to $x_{new}$ . After verifying that $x_{new}$ is valid, MAPS-X calculates the cost of the node in Line 4 through procedure findTotalCost. If it is satisfiable ( $x_{new}.cost\leq r$ ), then the node is added to the tree; otherwise, it is rejected.

Procedure findTotalCost computes the cost of a new node by calling Alg. 2. For every new node $x_{new}$ , findTotalCost first identifies the current trajectory segment, i.e., set of nodes with the same cost (segment number) as the parent of $x_{new}$ on the current branch of $G$ . Then, it calls Alg. 2 on the current segment to check whether the addition of $x_{new}$ requires a new segmentation, i.e., whether $\overrightarrow{x_{near},x_{new}}$ intersects with the current segment. If so, it updates the costs of the nodes on the current segment accordingly; otherwise, the cost of $x_{new}$ is set to the cost of its parent node, i.e., $x_{new}.cost=x_{new}.parent.cost$ . Note that this procedure is efficient since it only checks for the current segment, not the entire trajectory from the root to $x_{new}$ .

G=\{V\leftarrow x_{0},E\leftarrow\emptyset\}

;

2 for $N$ iterations do

(x_{rand},x_{near})\leftarrow sample(\mathbb{X},V)

;

x_{new}=\textsc{multiAgentExtend}(x_{near},x_{rand},\mathbb{U})

;

5 if $isValid(\overrightarrow{x_{near},x_{new}})$ then

\textsc{findTotalCost}(x_{new})

;

7 if $x_{new}.cost\leq r$ then

V\leftarrow V\cup\{x_{new}\}

;

E\leftarrow E\cup\{\overrightarrow{x_{near},x_{new}}\}

;

9 if $x_{new}\in\mathbb{G}$ then

10 return $\overrightarrow{x_{0},x_{new}}$ ;

return $G(V,E)$

Algorithm 4 MAPS-X (

\mathbb{X}

\mathbb{U}

\mathbb{G}

x_{0}

, N,

\Delta T

r

)

The resulting behavior is a planner that tracks the number of segments required to explain each of its branches as the tree grows. Because only satisfiable nodes are added to $G$ , MAPS-X guarantees that the number of disjoint segments (explanations) of a solution is less than or equal to $r$ . Furthermore, since all nodes individually track their segment count (cost), once a solution is found, the segmentation information is already embedded in the solution, and no further computation is needed.

III-C3 Completeness and Optimality

It is important to note that Alg. 3 and 4 inherent the completeness property of the underlying Planner X. For example, a common property for sampling-based planners is probabilistically completeness, i.e., as number of planning iterations $N$ approaches infinity, the probability of finding a solution, if one exists, approaches $1$ . Then, we can make the following statement for Lazy MAPS-X and MAPS-X.

Theorem 2 (Completeness)

Planners Lazy MAPS-X and MAPS-X are probabilistically complete if and only if Planner X is probabilistically complete.

The proof for MAPS-X follows from Theorem 1 (the computed cost of each node is the minimal number of disjoint segments from root to that node) and the fact that MAPS-X mimics the behavior of Planner X, while only being more restrictive in the validity of nodes in the graph $G$ with respect to bound $r$ , i.e., it adds a node to the tree only if the number of disjoint segments to the node from the root is less than or equal to $r$ . Similarly, Lazy MAPS-X mimics the behavior of Planner X. But rather than being more restrictive in the validity of nodes in the graph $G$ during the extension procedure, Lazy MAPS-X is more restrictive in the solutions of $G$ with respect to $r$ and prunes the tree in a post procedure. Therefore, if a solution to Problem 1 exists in a particular query, Lazy MAPS-X and MAPS-X will find it with probability approaching $1$ as $N$ approaches infinity.

Lazy MAPS-X and MAPS-X can be turned into asymptotically optimal planners. This is achieved by iteratively running a particular planning query and lowering the bound $r$ at each run. Hence, the resulting meta-planner monotonically decreases its cost (explanations) and hence asymptotically approaches the optimal value.

IV Experiments and Benchmarks

In this section, we demonstrate the efficacy of our explanation scheme and evaluate the performance of the proposed meta-algorithms Lazy MAPS-X and MAPS-X. We implemented these algorithms with the classical motion planners RRT [20] and EST [21] and evaluated their performance in several environments and agents dynamics. The benchmarking results are shown in Table I.

Our implementations use the Open Motion Planning Library (OMPL) [22] and are available here [23]. The benchmarks were performed on a machine with AMD Ryzen 7 3.9 GHz CUP and 64 GB of RAM.

IV-A MAPS Planning

We begin by showcasing our explanation scheme in several settings, and gaining some insight into its properties.

Fig. 2(a) shows an example solution of the continuous MMP problem produced by RRT. The example shows two agents, each with $2^{\text{nd}}$ -order car dynamics, that cross each others’ paths to get to their goals (indeed, the paths must cross in this environment). While it is difficult for a human to validate that this plan is collision free, this becomes easy with MAPS-RRT, using two images (Figs. 2(b)-2(c)). Observe that in this example, the plan found by RRT already admits a minimal decomposition, therefore MAPS-RRT merely finds it.

In contrast, the environment in Fig. 3 can easily admit non-intersecting paths as the blue agent can go around the red agent (Fig. 3(c)). However, with RRT, we often get intersecting paths as shown in Fig. 3(a)-3(b). By using MAPS-RRT and setting $r=1$ , we quickly get the easily-explainable solution in Fig. 3(c) with only one segment.

	Space	Planner	Dyn.	# Agents	r	# Solns. Found (%)	Runtime (s)	Runtime Success (s)	Ave. Cost	Segment Time (s)	States added per sec.
1	Open	MAPS-RRT	1C	2	$\infty$	100	$4\pm 3$	$4\pm 3$	$2\pm 1$	$0.8\pm 0.5$	$1300\pm 75$
2	Open	Lazy MAPS-RRT	1C	2	$\infty$	100	$3\pm 2$	$3\pm 2$	$2\pm 1$	$0.12\pm 0.01$	$1600\pm 94$
3	Open	MAPS-RRT	1C	2	1	90.5	$11\pm 9$	$9\pm 6$	$1\pm 0$	$4\pm 3$	$800\pm 99$
4	Open	Lazy MAPS-RRT	1C	2	1	7.2	$20\pm 11$	$12\pm 9$	$1\pm 0$	$2\pm 1$	$1200\pm 77$
5	Congested	MAPS-RRT	1C	2	1	85	$37.9\pm 33.1$	$26.9\pm 25$	$1\pm 0$	$10.1\pm 8.1$	$629.4\pm 135$
6	Congested	Lazy MAPS-RRT	1C	2	1	0.0	$100\pm 0$	n/a	n/a	$14.7\pm 7.5$	$598\pm 42$
7	Congested	MAPS-RRT	2C	2	$\infty$	100	$0.33\pm 0.28$	$0.33\pm 0.28$	$3\pm 1$	$0.09\pm 0.08$	$2100\pm 152$
8	Congested	MAPS-RRT	2C	2	3	100	$0.39\pm 0.39$	$0.39\pm 0.39$	$2\pm 1$	$0.1\pm 0.1$	$2000\pm 182$
9	Congested	MAPS-RRT	2C	2	1	100	$0.67\pm 0.9$	$0.67\pm 0.9$	$1\pm 0$	$0.25\pm 0.36$	$1400\pm 262$
10	Corridor	MAPS-RRT	1U	2	$\infty$	100	$0.53\pm 0.58$	$0.53\pm 0.58$	$3\pm 1$	$0.13\pm 0.15$	$1600\pm 120$
11	Corridor	MAPS-RRT	1U	2	4	100	$0.54\pm 0.47$	$0.54\pm 0.47$	$3\pm 1$	$0.13\pm 0.12$	$1600\pm 110$
12	Corridor	MAPS-RRT	1U	2	2	100	$0.77\pm 0.84$	$0.77\pm 0.84$	$2\pm 0$	$0.21\pm 0.26$	$1400\pm 187$
13	Corridor	MAPS-RRT	1U	2	1	100	$1.54\pm 1.9$	$1.54\pm 1.9$	$1\pm 0$	$0.56\pm 0.74$	$1000\pm 182$
14	Hallway	MAPS-RRT	2L	3	$\infty$	100	$4.15\pm 3.9$	$4.15\pm 3.9$	$3\pm 1$	$1.13\pm 0.9$	$1000\pm 123$
15	Hallway	MAPS-RRT	2L	3	3	100	$4.4\pm 5.2$	$4.4\pm 5.2$	$3\pm 0$	$1.17\pm 1.11$	$900\pm 154$
16	Hallway	MAPS-RRT	2L	3	2	100	$6.21\pm 7.18$	$6.21\pm 7.18$	$2\pm 0$	$1.78\pm 1.85$	$744\pm 167$
17	Hallway	MAPS-RRT	2L	3	1	93.5	$28.61\pm 27.58$	$23.64\pm 20.79$	$1\pm 0$	$10.6\pm 9.1$	$283\pm 69$

TABLE I: Benchmark results with 200 runs. The two character dynamics are

1^{st}

- and

2^{nd}

- order Linear, Unicycle, and Car, respectively. The maximum allowed time was

30s

for rows 1-4, and

100s

for rows 5-17. The mean and standard deviation over all planning runs are shown for columns 7-11.

Explainability can come at a cost of plan length, as we show in Fig. 4: when planning with a bound of $r=3$ segments, we obtain the short plan in Fig. 4(a), where the agents follow one another closely in the corridor. This example creates a zig-zag behavior between agents that follow $2^{nd}$ -order linear dynamics, making it difficult for human validation (and indeed requires a 3-segment explanation). However, a better-explainable plan, obtained by setting $r=2$ , is given in 4(b). There, the blue agent waits for the red agent to get far ahead, before entering the corridor, as is explained in Figs. 4(c) and 4(d).

When agents go in opposite directions through a corridor, as demonstrated in the environment of Fig. 5, a controller may want to assure that the planner does not get both robots in the corridor at the same time, as that would be risky (cause path crossing). The explanation given in Figs. 5(b), 5(c) shows that indeed, only one robot is at the corridor at a time.

Validating trajectories without explanations gets more difficult as the number of agents increases. For example, consider the solution shown in Fig. 6(a) where the agents have $2^{nd}$ -order linear dynamics. A human user could have a difficult time checking that the entire trajectory is collision free. However, planning with the MAPS framework presents an easily verified solution, as shown in Fig. 6. Moreover, by decreasing bound $r$ , we improve the explainability and as a result get a “cleaner” plan, e.g., the solution in Fig 1 was obtained with $r=3$ .

IV-B Performance Evaluation

We now turn to study the performance of our algorithms. Our results are summarized in Table I. Recall that MAPS-X is paramterized by Planner X and inherits its planning properties. As this is an orthogonal concern to planning (for the scope of this paper), we focus on RRT as the planner.

Our results confirm expected phenomena of explanations. When planning without a bound ( $r=\infty$ ), MAPS-RRT and Lazy MAPS-RRT perform the same search, but MAPS-RRT has additional computational cost of tracking segmentation. Thus, in e.g., Rows 1, 2 of Table I both algorithms obtain the same average number of segments ( $2\pm 1$ segments), but MAPS-RRT takes longer to compute the segmentations. As the bound $r$ decreases, (Rows 3, 4, 8, 9, 11-13, and 15-17), MAPS-RRT takes longer to find solutions, but gives a lower number of segments. For low values of $r$ , Lazy MAPS-RRT is required to prune very often. Since unlike MAPS-RRT, it does not store segmentation information in the nodes, these prunings are expensive, resulting in the higher runtimes and fewer solutions found, as can be seen in Rows 3-6.

V Conclusion and Discussion

This work outlines a new aspect of MMP by considering explanation schemes for plans. The MAPS-X framework can be readily plugged on to existing (centralized) planners in order to provide explainable plans. As we showed, using MAPS-X to find short explanations sometimes generates longer plans, but can also make plans neater (at the cost of computational time). In future work, we plan to add explanation generation to state-of-the-art techniques, and in particular to decentralized approaches, in order to improve scalability.

References

[1] M. Turek, “Explainable artificial intelligence,” 2018. [Online]. Available: https://www.darpa.mil/program/explainable-artificial-intelligence
[2] R. Stern, N. R. Sturtevant, D. Atzmon, T. Walker, J. Li, L. Cohen, H. Ma, T. K. S. Kumar, A. Felner, and S. Koenig, “Multi-agent pathfinding: Definitions, variants, and benchmarks,” Symposium on Combinatorial Search (SoCS), pp. 151–158, 2019.
[3] T. S. Standley, “Finding optimal solutions to cooperative pathfinding problems,” in Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010.
[4] F. Gravot and R. Alami, “A method for handling multiple roadmaps and its use for complex manipulation planning,” in 2003 IEEE International Conference on Robotics and Automation (Cat. No. 03CH37422), vol. 3. IEEE, 2003, pp. 2914–2919.
[5] M. Gharbi, J. Cortés, and T. Siméon, “Roadmap composition for multi-arm systems path planning,” in 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2009, pp. 2471–2476.
[6] G. Wagner and H. Choset, “Subdimensional expansion for multirobot path planning,” Artificial Intelligence, vol. 219, pp. 1–24, 2015.
[7] R. Shome, K. Solovey, A. Dobson, D. Halperin, and K. E. Bekris, “drrt*: Scalable and informed asymptotically-optimal multi-robot motion planning,” Autonomous Robots, vol. 44, no. 3, pp. 443–467, 2020.
[8] K. Solovey, O. Salzman, and D. Halperin, “Finding a needle in an exponential haystack: Discrete rrt for exploration of implicit roadmaps in multi-robot motion planning,” in Algorithmic Foundations of Robotics XI. Springer, 2015, pp. 591–607.
[9] J. D. Gammell, S. S. Srinivasa, and T. D. Barfoot, “Bit*: Batch informed trees for optimal sampling-based planning via dynamic programming on implicit random geometric graphs,” arXiv preprint arXiv:1405.5848, 2014.
[10] G. Sanchez and J.-C. Latombe, “Using a prm planner to compare centralized and decoupled planning for multi-robot systems,” in Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No. 02CH37292), vol. 2. IEEE, 2002, pp. 2112–2119.
[11] J. P. Van Den Berg and M. H. Overmars, “Prioritized motion planning for multiple robots,” in IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2005, pp. 430–435.
[12] S. Tang and V. Kumar, “A complete algorithm for generating safe trajectories for multi-robot teams,” in Robotics Research. Springer, 2018, pp. 599–616.
[13] S. Lapuschkin, S. Wäldchen, A. Binder, G. Montavon, W. Samek, and K.-R. Müller, “Unmasking clever hans predictors and assessing what machines really learn,” Nature Communications, vol. 10, no. 1, Mar 2019. [Online]. Available: http://dx.doi.org/10.1038/s41467-019-08987-4
[14] R. Eifler, M. Cashmore, H. Jorg, D. Magazzeni, and M. Steinmetz, “Explaining the space of plans through plan-property dependencies,” Proceedings of the 2nd Workshop on Explainable Planning (XAIP), 2019.
[15] S. Kambhampati, “Synthesizing explainable behavior for human-ai collaboration,” in Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems. Richland, SC: International Foundation for Autonomous Agents and Multiagent Systems, 2019, p. 1–2.
[16] M. Fox, D. Long, and D. Magazzeni, “Explainable planning,” 2017.
[17] S. Almagor and M. Lahijanian, “Explainable multi agent path finding,” in To appear in Int’l Conference on Autonomous Agents and Multi-agent Systems (AAMAS), 2020.
[18] D. H. Hubel and T. N. Wiesel, “Receptive fields of single neurones in the cat’s striate cortex,” The Journal of Physiology, vol. 148, no. 3, 1959.
[19] S. Tang, T. S. Lee, M. Li, Y. Zhang, Y. Xu, F. Liu, B. Teo, and H. Jiang, “Complex pattern selectivity in macaque primary visual cortex revealed by large-scale two-photon imaging,” 2018.
[20] S. M. Lavalle, “Rapidly-exploring random trees: A new tool for path planning,” Tech. Rep., 1998.
[21] D. Hsu, J.-C. Latombe, and R. Motwani, “Path planning in expansive configuration spaces,” in Proceedings of International Conference on Robotics and Automation, vol. 3. IEEE, 1997, pp. 2719–2726.
[22] I. A. Şucan, M. Moll, and L. E. Kavraki, “The Open Motion Planning Library,” IEEE Robotics & Automation Magazine, vol. 19, no. 4, pp. 72–82, December 2012, https://ompl.kavrakilab.org.
[23] J. Kottinger, “Multi-Agent Plan Segmentating-X (MAPS-X),” 2020. [Online]. Available: https://github.com/aria-systems-group/MAPS-X