Dynamical, value-based decision making among $N$ options:
a constructive approach to unfolding
the symmetric pitchfork bifurcation

Paul Reverdy P. Reverdy is with the Department of Aerospace and Mechanical Engineering, University of Arizona, Tucson, AZ 85721. Email: [email protected]

Abstract

Decision making is a fundamental capability of autonomous systems. As decision making is a process which happens over time, it can be well modeled by dynamical systems. Often, decisions are made on the basis of perceived values of the underlying options and the desired outcome is to select the option with the highest value. This can be encoded as a bifurcation which produces a stable equilibrium corresponding to the high-value option. When some options have identical values, it is natural to design the decision-making model to be indifferent among the equally-valued options, leading to symmetries in the underlying dynamical system. For example, when all $N$ options have identical values, the dynamical system should have $S_{N}$ symmetry. Unfortunately, constructing a dynamical system that unfolds the $S_{N}$ -symmetric pitchfork bifurcation is non-trivial. In this paper, we develop a method to construct an unfolding of the pitchfork bifurcation with a symmetry group that is a significant subgroup of $S_{N}$ . The construction begins by parsing the decision among $N$ options into a hierarchical set of $N-1$ binary decisions encoded in a binary tree. By associating the unfolding of a standard $S_{2}$ -symmetric pitchfork bifurcation with each of these binary decisions, we develop an unfolding of the pitchfork bifurcation with symmetries corresponding to isomorphisms of the underlying binary tree.

keywords:

symmetries of dynamical systems, bifurcations, binary trees

{AMS}

37C80, 37G10, 37E25

A fundamental characteristic of systems that exhibit autonomy is the ability to decide among a set of possible actions. Such systems arise in the natural world, composed of individuals or groups of humans or animals, and also in the artificial world, composed of robots or other algorithmic agents. Studying decision making in both types of systems is of scientific and engineering interest. In natural systems, one tends to focus on developing models that explain observed behavior, while in artificial systems, one tends to focus on developing models that perform some desired behavior. In both cases, it is natural to develop models which make decisions using a dynamic mechanism that reacts to some perceived notion of action value. This paper focuses on the case of a single agent and an arbitrary number of actions and constructs such a mechanism based on the pitchfork bifurcation with symmetries corresponding to interchange of action labels.

As done in numerous recent papers [1, 2, 3, 4, 5, 6], it is natural to model the decision making process using a dynamical system. Decision making is itself a dynamic process, affected by the set of actions available to the individual and the context, including the individual’s internal state and external inputs. For example, consider an animal running in a rough environment. The actions available to the animal correspond to locations it can place its feet on each stride, the internal state to the animal’s level of fatigue and fear, and the external inputs to stimuli revealing the quality of available foot placements or the presence of a predator. Decisions must be made quickly, since one is required for each step, and the decision-making process must be flexible, so that decisions can be revised smoothly if a previously attractive location turns out to be undesirable or the appearance of a predator requires a startle response. As argued in [6], these requirements suggest viewing the decision-making process as a dynamical system operating in continuous time and on a continuous state space.

The example of the running animal is an example of the ecological theory of affordances. An affordance is an opportunity for action which the environment affords the agent [7], and the theory of affordances has gained significant traction as a model of ecological decision making [8, 9]. In particular, the so-called affordance competition hypothesis [10, 11] suggests that animals perform physical behaviors by continuously identifying affordances and selecting among them by a process which is biased by the desirability of their predicted outcomes. More recently, affordance theory has been investigated in robotics as an approach for developing novel control laws [12, 13]. In our own work we have begun to use dynamical systems which we call motivation dynamics to develop both theoretical [14] and practical [15] robot control systems. Therefore, a dynamical system for making decisions on the basis of perceived action values could be valuable for applications in both ecology and robotics.

This paper constructs a continuous dynamical system that allows a single agent to make decisions on the basis of perceived action values. Motivated by recent work in this area [1, 3, 6, 5], we base our dynamical system on the pitchfork bifurcation with certain symmetries. In the nominal case where all $N$ actions have the same value, we require that the dynamical system be equivariant under interchange of actions and impose symmetries which correspond to permutations of the actions. This allows us to use the tools of equivariant bifurcation theory [16, 17]. We introduce asymmetric action values as unfolding parameters of this pitchfork bifurcation, and construct the system so that decisions correspond to bifurcations.

The key difficulty in our approach lies in the fact that developing an unfolding for the pitchfork bifurcation with $S_{N}$ -symmetry is non-trivial. In the case of a binary decision between $N=2$ options, i.e., $S_{2}$ symmetry, the unfolding properties are well understood [16], but generalizing to the case $N>2$ is complex. For $N=2$ , the model of value-based decision making introduced by [1] and analyzed by [3] can be shown to embed an unfolding of the pitchfork bifurcation [18], but this structure does not naturally generalize even to the case $N=3$ [4]. Reina et al. [4] show how to modify the Seeley et al. model to recover desirable bifurcation properties in the case of an arbitrary number of options $N$ but with only one best option, though they do not provide analysis for their general case. In the recent paper by Franci et al. [6] the authors use equivariant bifurcation theory to analyze a similar system in the general case of $N>2$ options (and indeed, for a generic number $M>1$ of interacting agents). However, Franci et al. present explicit equations only for the case of $N=3$ options and $M=3$ agents, and they consider only the symmetric case of equally-valued options.

In contrast to these preceding works, the present paper presents an explicit construction of an unfolding of a pitchfork bifurcation with a symmetry group that is a significant subgroup of $S_{N}$ for $N\geq 2$ . The core idea underlying our construction is that one can parse a decision among $N\geq 2$ options into a series of binary decisions; these decisions can be represented by a binary tree. This representation allows us to take advantage of the well-understood representation of binary decisions in terms of the unfolding of a standard pitchfork singularity. We then construct a dynamical system with the desired bifurcation for $N\geq 2$ by assembling a series of standard pitchfork systems in a way that reflects the parsing encoded by the tree structure. In the symmetric case where the options are equally-valued, the dynamical system is equivariant under transformations (i.e., relabeling of options) that correspond to isomorphisms of the binary tree. Breaking the symmetry of option values naturally results in an unfolding of the pitchfork bifurcation.

The contributions of this paper are three-fold. First, we develop a dynamical systems model of decision making where the structure of the decision, and of the dynamical system itself, is encoded in a tree graph structure. Such graph structures have often been used in studying multi-agent decision-making problems, [19, 5, 6], but to our knowledge this is the first time a graph structure has been used to organize the dynamical decision-making procedure of a single agent. Secondly, the dynamical system we develop has a novel feedforward structure due to its recursive definition, where the vector field is constructed by recursively parsing down the tree structure. The structure is such that dynamics flow from the root node towards the leaves, which represent options. Thirdly, in building our model we develop a constructive approach to unfolding the equivariant pitchfork bifurcation. Our model has symmetries which correspond to isomorphisms of the underlying binary tree structure, which results in our pitchfork having a symmetry group that is a significant subgroup of $S_{N}$ . This significantly extends existing results, e.g., in [4] and [6].

1 Dynamical systems as models of $N$ -ary decision making

In this section we summarize our requirements for dynamical systems models of decision making. We suppose we have $N\geq 2$ actions. In the following, we refer to actions by the more generic term option to reflect that decision making need not be directly tied to action. The $i^{th}$ option has value $v_{i}>0$ . Inspired by previous work in the decision making literature [1, 3], we encode the decision state of the system in a variable $m\in\Delta^{N}$ , where

\Delta^{N}=\left\{x\in\mathbb{R}^{N+1}\Big{|}x_{i}\geq 0,\sum_{i}x_{i}=1\right\}

denotes the $N$ -simplex. Let $e_{i}\in\mathbb{R}^{N+1},i\in\{1,\ldots,N+1\},$ denote the $i^{th}$ corner of $\Delta^{N}$ , i.e., the vector with the $i^{th}$ element equal to one and all other elements equal to zero. The $i^{th}$ element of $m$ , denoted $m_{i}$ , represents the degree to which the system has committed to option $i$ . Note that $m_{i}\in[0,1]$ . The $N+1$ element of $m$ represents the degree to which the system is uncommitted to any option, and the normalization condition ensures that the total commitment is finite.

We encode the decision-making process in a continuous vector field $f$ operating on the state space $\Delta^{N}$ . A decision for option $i$ is represented by the state $m$ approaching the corresponding corner of the $N$ -simplex, i.e., the state $e_{i}$ . See Figure 1 for the case $N=2$ . We want the system to commit to an option $i$ when its corresponding value $v_{i}$ is sufficiently high. Thus, we construct the vector field $f$ to have an attracting equilibrium near $e_{i}$ when $v_{i}$ is high. When several different options have high values but there is no clear maximum, it is desirable to have several equilibria that encode partial commitment to the attractive options.

Figure 1: Plot of the 2-simplex

\Delta^{2}

. The simplex is the shaded section of the plane

x_{1}+x_{2}+x_{3}=1

To structure the equilibria, we require symmetry in the vector field $f$ that reflects the symmetry in the options. We model options to be distinguished only by their associated values. Thus symmetries in the options are permutations of the set $\{v_{i},i\in\{1,\ldots,N\}\}$ that leave the set invariant. In the fully-symmetric case where $v_{i}=v>0\forall i\in\{1,\ldots,N\}$ , all options are equally valued and therefore identical, and the symmetry group is the symmetric group $S_{N}$ . When the option values are not equal, we want the symmetry in $f$ to be broken. Regardless of any symmetries that may be present, we want the system to remain uncommitted when the values $v_{i}$ are low and to commit when the values are above a threshold. In the symmetric case, we want the system to remain uncommitted when the value $v$ is low, and to commit symmetrically to all options when $v$ increases beyond a threshold. Conversely, in the asymmetric case, we want the system to commit asymmetrically to the high-value options.

1.1 Equivariant bifurcation theory

The desire for a switching commitment process strongly suggests the use of a bifurcation in the dynamical system, and the symmetry requirement encourages the use of equivariant bifurcation theory [16, 20]. In the case of $N=2$ options, our requirements are satisfied by selecting the vector field $f$ to embed a pitchfork bifurcation as shown in Figure 2. The mechanism can be viewed as follows. Let $x=m_{1}-m_{2}$ represent the degree of commitment for option 1 over option 2. Then, in the symmetric case, selecting $f$ to be a pitchfork bifurcation, i.e.,

(1)

\dot{x}=f(x,\mu)=x(\mu-x^{2}),\ \mu\in\mathbb{R}

will yield the desired behavior. Note that $f$ is an odd function of $x$ , so the system (1) obeys the $S_{2}$ symmetry corresponding to switching the option labels. When $\mu$ (which represents the value $v$ ) is less than zero, the unique equilibrium of (1) is $x=0$ , corresponding to equal commitment to both options, and this equilibrium is stable. As $\mu$ increases through zero, the equilibrium at $x=0$ becomes unstable and two new stable equilibria appear at $\pm\sqrt{\mu}$ . These facts are summarized in the bifurcation diagram shown in Figure 2(a). Equivariant bifurcation theory predicts that these two branches must appear symmetrically in the post-bifurcation regime; each branch represents a commitment to one option or the other. Due to symmetry, commitment to either option is possible; the option to which the system commits is determined by initial conditions.

The case of asymmetric option values is naturally modeled by breaking the symmetry of the vector field $f$ . The study of such symmetry breaking is a core part of equivariant bifurcation theory, and the fundamental concept is the so-called unfolding of the bifurcation. One begins with the concept of a perturbation of a bifurcation. Specifically, a perturbation $F$ of a bifurcation $f$ is a function $F(x,\mu,\alpha)$ such that $F(x,\mu,0)=f(x,\mu)$ . A universal unfolding is a $k$ -parameter (i.e., $\alpha\in\mathbb{R}^{k}$ ) perturbation $F$ such that any small perturbation of $f$ can be expressed in terms of the $k$ parameters that define $F$ . The two-parameter family

(2)

F(x,\mu,\alpha)=x(\mu-x^{2})+\alpha_{1}+\alpha_{2}x^{2},\ \alpha_{1},\alpha_{2}\in\mathbb{R}

is a universal unfolding of the pitchfork bifurcation (1) [16]. As the parameters $\alpha_{1},\alpha_{2}$ are varied, the bifurcation diagram for $\dot{x}=F(x,\mu,\alpha)$ varies as well. In particular, for appropriate parameter values, the system has a single equilibrium in the post-bifurcation regime, and this equilibrium is both nonzero and stable (see Figure 2(b)). Thus, by choosing appropriate values for the unfolding parameters, one can encode a globally-attracting preference for one option or the other. To complete the model for $N=2$ options, it remains to relate the option values $v_{i}$ to the bifurcation parameter $\mu$ and unfolding parameters $\alpha_{i}$ in (2).

Refer to caption — Figure 2: Bifurcation diagrams for (a) the symmetric pitchfork (1) and (b) the unfolded pitchfork (2). Stable equilibria are represented by solid lines and unstable equilibria by dashed lines. Equilibria above the $\mu$ axis represent a preference for option 1, while those below represent a preference for option 2. In panel (a) the system has a single stable equilibrium representing no preference when $\mu$ is small and two symmetric stable equilibria representing preferences for option 1 and option 2 respectively, when $\mu$ is large. Note particularly in panel (b) that, for intermediate values of $\mu$ , the system has a single stable equilibrium representing a preference for option 1.

1.2 Seeley et al. model for $N=2$

In their paper [1], Seeley et al. study the value-sensitive decision-making problem for $N=2$ options and develop a dynamical systems model. Their model can be shown to embed an unfolded pitchfork [18], thus completing the model whose mechanism we laid out in the preceding section. Concretely, Seeley et al. let the state of their model be $m=(m_{1},m_{2},m_{U})\in\Delta^{2}$ and set $\dot{m}=f(m,v)$ , with

(3)

\displaystyle\dot{m}_{i}=v_{i}m_{U}-m_{i}\left(\frac{1}{v_{i}}-v_{i}m_{U}+\sigma(1-m_{i}-m_{U})\right)

for each $i=1,2$ , and the dynamics of $m_{U}$ are determined by the normalization constraint. As above, $v_{i}$ denotes the value of option $i$ , and $\sigma>0$ is a constant parameter.

In the symmetric case of $v_{1}=v_{2}=v$ the dynamics (3) obey an $S_{2}$ symmetry. Seeley et al. [1] showed that, in the symmetric case, the dynamics (3) exhibits a pitchfork bifurcation as $v$ and $\sigma$ increase above a threshold. In the pre-bifurcation regime, the system has a single stable equilibrium with the symmetry $m_{1}=m_{2}$ . Because of the symmetry, the system does not commit to either option, and the equilibrium is said to be a deadlock. In the post-bifurcation regime, the deadlock equilibrium is unstable and two additional equilibria emerge, each representing a decision to commit to one of the two options. In previous work [1], the parameter values at the bifurcation point were found to satisfy

(4)

\sigma=\frac{4v^{3}}{(v^{2}-1)^{2}}.

Note that either $\sigma$ or $v$ can be interpreted as the bifurcation parameter, with (4) defining the bifurcation value. In other words, fixing $\sigma$ , the bifurcation occurs as $v$ increases through a threshold, while fixing $v$ , the bifurcation occurs as $\sigma$ increases.

In the asymmetric case of $v_{1}\neq v_{2}$ , the $S_{2}$ symmetry of the dynamics is broken and the pitchfork bifurcation unfolds as studied by Pais et al. [3]. For a fixed value of $\sigma$ , the number of equilibria of (3) depends on the parameters $v_{1}$ and $v_{2}$ . Certain parameter values result in a single stable equilibrium whose location is biased towards the high-value option, while others result in two stable equilibria representing each option and a saddle point in between. The complete phase diagram of the system is complex, but the two primary findings of [3] can be summarized as follows. First, the dynamics (3) remain deadlocked (i.e., have a single attractor with $m_{1}\approx m_{2}$ ) when the average option value $\bar{v}=(v_{1}+v_{2})/2$ is small. Second, the dynamics decide for the high value option (i.e., for $v_{1}>v_{2}$ , have a single attractor with $m_{1}\gg m_{2}$ ) when the difference in option values $\Delta v=v_{1}-v_{2}$ is sufficiently large relative to $\bar{v}=(v_{1}+v_{2})/2$ . In symbols, we have that the system makes a decision when

\frac{|\Delta v|}{\bar{v}}>\kappa(\sigma),

where $\kappa(\sigma)$ is a coefficient that depends on the parameter $\sigma$ . Pais et al. [3] note that this behavior is analogous to Weber’s law of just-noticeable differences from psychology, which states that the minimum difference in stimulus intensity required to discriminate between two different stimuli varies linearly with their mean intensity.

The implication of these two findings is that the decision-making dynamics (3) has several desirable properties. First, when both options are poor (corresponding to a low value of $\bar{v}$ ), the system remains deadlocked and avoids making a decision, e.g., to wait for more information. When at least one option is sufficiently satisfactory (corresponding to a high value of $\bar{v}$ ), the system will quickly commit to an option, and preferentially select the one with a higher value. These are properties that we seek to generalize to the case of $N>2$ options.

1.3 Reduction of the Seeley et al. model

As shown in the preceding sections, the Seeley et al. model (3) has desirable characteristics that we seek to generalize. However, the functional form of (3) obscures the unfolding of the pitchfork bifurcation which serves as the fundamental decision-making mechanism. In recent work [18], we studied the dynamics (3) using model reduction techniques to elucidate the unfolding.

The model reductions studied in [18] use singular perturbation theory. Specifically, the reduction approach maps $v_{i}\mapsto Kv_{i}$ for a constant gain $K>0$ and takes the singular limit $K\to+\infty$ . This approach is similar to an analysis performed in [3], where the authors studied the limit $\bar{v}\to+\infty$ ; however, the approach using the gain $K$ preserves the relative difference in values $\Delta v/\bar{v}$ . This ratio is key in defining the unfolding of the pitchfork bifurcation embedded in (3).

The bifurcation is more readily analyzed by expressing $m\in\Delta^{2}$ in terms of mean-difference coordinates defined by

\Delta m=m_{1}-m_{2},\bar{m}=\frac{m_{1}+m_{2}}{2}

which are analogous to the definitions of $\Delta v$ and $\bar{v}$ made above. Note that the definitions of these new coordinates and the definitions of $\Delta^{2}$ and $(v_{1},v_{2})\in\mathbb{R}_{+}^{2}$ imply that $\bar{m},\bar{v}>0$ and that $-2\bar{m}\leq\Delta m\leq 2\bar{m}$ and $-2\bar{v}<\Delta v<2\bar{v}$ .

In the mean-difference coordinates, the dynamics (3) of $m=({\Delta m},{\bar{m}})$ take the form

(5)		$\displaystyle\dot{\Delta m}$	$\displaystyle=f_{\Delta m}(\Delta m,\bar{m};\bar{v};\Delta v)$
		$\displaystyle=-\left(\frac{2\bar{m}+\Delta m}{K(2\bar{v}+\Delta v)}-\frac{2\bar{m}-\Delta m}{K(2\bar{v}-\Delta v)}\right)+K\bar{v}\Delta m(1-2\bar{m})+K\Delta v(1-2\bar{m})(1+\bar{m}),$

(6)		$\displaystyle\dot{\bar{m}}=$	$\displaystyle f_{\bar{m}}(\Delta m,\bar{m};\bar{v},\sigma;\Delta v)$
	$\displaystyle=$	$\displaystyle\frac{1}{2}\biggl{(}-\frac{2\bar{m}+\Delta m}{K(2\bar{v}+\Delta v)}-\frac{2\bar{m}-\Delta m}{K(2\bar{v}-\Delta v)}+\frac{K(2\bar{v}+\Delta v)}{2}(1-2\bar{m})(1+\frac{2\bar{m}+\Delta m}{2})$
		$\displaystyle\ \ \ \ \ +\frac{K(2\bar{v}-\Delta v)}{2}(1-2\bar{m})(1+\frac{2\bar{m}-\Delta m}{2})-\frac{\sigma}{2}(2\bar{m}+\Delta m)(2\bar{m}-\Delta m)\biggr{)}.$

A straightforward application of singular perturbation theory with small parameter $\epsilon=1/K$ and coordinates $x=\Delta m$ , and $y=(1-2\bar{m})/\epsilon$ yields the following result.

Theorem 1.1.

[18, Theorem 1] In the singular limit $\epsilon\to 0$ , the motivation dynamics (3) reduce to

(7)

\dot{x}=\frac{\sigma}{2\bar{v}}(1-x^{2})\frac{2x+3\alpha}{6+\alpha x},

where $\alpha=\Delta v/\bar{v}$ .

A standard nonlinear time scaling argument then allows one to eliminate the denominator $6+\alpha x$ from (7) and makes the connection to the unfolding of the pitchfork bifurcation (2) explicit.

Corollary 1.2.

[18, Corollary 2] The singularly-perturbed motivation dynamics (7) are equivalent to

x^{\prime}=x(1-x^{2})+\frac{3}{2}\alpha-\frac{3}{2}\alpha x^{2},

i.e., an unfolding of the pitchfork bifurcation (2) with bifurcation parameter $\mu\mapsto 1$ and unfolding parameters $\alpha_{1}=3\alpha/2$ and $\alpha_{2}=-3\alpha/2$ .

The equilibria of the singularly-perturbed system (7) are shown in Figure 3. Note that the system has three equilibria for all possible values of $\alpha\in[-2,2]$ . When $\alpha=0$ the options are equally valued, the pitchfork is unperturbed, and the equilibria correspond to those of the standard pitchfork (1) in the post-bifurcation regime. The equilibrium $x=0$ is unstable, while those at $x=\pm 1$ are stable. For nonzero $\alpha$ the pitchfork unfolds. In the singularly-perturbed regime, the non-zero equilibria remain at $x=\pm 1$ , while the intermediate equilibrium shifts to $x=-3\alpha/2$ . The intermediate equilibrium is unstable for the values of $\alpha\in[-2/3,2/3]$ where it exists. The equilibria at $x=\pm 1$ are stable when the value difference $\Delta v$ is not too biased against the corresponding option. For example, $x=+1$ is a stable equilibrium of (7) for $\alpha\geq-2/3$ .

The structure of equilibria shown in Figure 3 determines the decision-making behavior of the model (3) in the singular limit. The state $x=\Delta m=+1$ corresponds to the system committing fully to option 1, i.e., to $m=(1,0,0)$ . This state is stable for $\alpha=\Delta v/\bar{v}\geq-2/3$ , and globally attracting for $\alpha>2/3$ . In other words, the system can commit to option 1 when when $\Delta v\geq-2\bar{v}/3$ , and will be globally attracted to committing to option 1 when $\Delta v>2\bar{v}/3$ . Switching behavior can occur as $\alpha$ shifts. For example, suppose that the system (7) is initialized with $\alpha<0$ and $\Delta m=-0.9$ , representing a commitment to option 2. If $\alpha$ is then raised above the value $2/3$ , the state $\Delta m$ will be attracted to the value $+1$ , representing a decision to switch and commit to option 1. The rate at which $\Delta m$ is attracted to $+1$ is governed by the parameter $\sigma$ , as can be seen from the form of the dynamics (7).

We note that the coefficient $3/2$ arises from the last term $K\Delta v(1-2\bar{m})(1+\bar{m})$ in (5) and can be adjusted by changing the coefficient $1+\bar{m}$ to $\beta+\bar{m}$ , which changes the coefficient $3/2=1+1/2$ to $\beta+1/2$ . This observation can be used to control the bifurcation properties of the system, e.g., by making it more or less sensitive to the relative value difference $\alpha$ . We do not pursue this line of investigation further in the present paper, but recognize that it is a natural point of departure for further work.

In this section, we introduced our requirements for a dynamical system model of value-sensitive decision making We showed how an unfolding of the pitchfork bifurcation can provide the fundamental mechanism for such a model in the case of $N=2$ options, and reviewed a model due to Seeley et al. that embeds such a pitchfork mechanism along with some recent results reducing that model. In the following section we begin to construct a generalization of the Seeley et al. model to the case $N>2$ by parsing the decision among $N$ options into a series of binary decisions represented by a tree structure.

2 Parsing $N$ -ary decisions into binary trees

Inspired by the ideas presented in the previous section, we seek to develop a dynamical systems model of value-sensitive decision-making among $N$ options using an unfolding of the pitchfork bifurcation. The difficulty of this approach is that constructing an unfolding of the pitchfork bifurcation in $N$ dimensions is non-trivial. In this paper, we construct such an unfolding, and thus the desired model, by composing a series of unfoldings of a standard pitchfork bifurcations. The composition is structured by parsing a decision among $N$ options into a series of binary decisions.

In this section we introduce the idea of parsing a decision among $N$ options into a series of binary decisions represented by a tree structure, and review a number of concepts, primarily from the computer science literature, on binary trees. As an example of an $N$ -ary decision, consider the case of a professor who has a variety of tasks and must decide which one to focus on at any given time. She may decide on a task by making a series of binary decisions as shown in Figure 4. For example, she might first decide between working on research or on teaching; given a decision to work on teaching, she may work on preparing a lecture or some assignments. The decision among five options is thus parsed into a series of binary decisions represented by a binary tree. A decision among an arbitrary number $N$ options can be parsed in this way.

We now define a number of terms associated with binary trees. A (rooted) tree is a connected acyclic undirected graph where one node is identified as the root. The parent of a node $n$ is the node connected to $n$ on the path to the root, and the children of a node $n$ are the nodes for which $n$ is the parent. Similarly, a descendant of a node $n$ is any node which is a child of $n$ or is a descendent of any of the children of $n$ . A sibling of a node $n$ is any other node which shares a parent with $n$ . A leaf is a node with no children, while an internal node is a node which is not a leaf. Finally, a binary tree is a tree where each node has at most two children, referred to as the left and right children. For such a tree, we refer to the descendants of a node $n$ associated with the right and left children as the right and left descendants, respectively. Note that when an arbitrary number $N$ of options is parsed into a binary tree $T$ , the tree can be selected such that each internal node has two children. Such a tree is referred to as a full or proper binary tree.

We now formally define a parsing of a decision set.

Definition 2.1 (Parsing).

Given a set of $n_{o}\geq 2$ options, a parsing of these options is a proper rooted binary tree $T$ where each leaf node represents an option.

Often, we will label the nodes with an index $i$ . Then, the function $o$ maps leaf nodes to their associated option, i.e., $i\mapsto o(i)$ . Figure 5 shows the node labels associated with the parsing shown in Figure 4. In this case, we have $o(2)=$ Experiment, $o(3)=$ Code, $o(5)=$ Lecture, $o(7)=$ Homework, and $o(8)=$ Exam.

Figure 4: Parsing a decision among

N

options into a series of binary decisions.

Figure 5: Node labels associated with the parsing shown in Figure 4. The function

o

maps leaf nodes to options, e.g.,

o(2)=

Experiment.

2.1 Tree traversal

Tree traversal is a fundamental process acting on a tree, wherein the process visits (and carries out an action on) each node in the tree exactly once. Trees may be traversed in either depth-first or breadth-first orders, as depicted in Figure 6. As their names imply, depth-first order operates by going as deep down the tree as possible before going to the next sibling, while breadth-first order operates by going through each sibling before going to a descendant. The nodes in Figure 6 are labeled according to the order in which they will be visited in depth-first or breadth-first traversal.


Ordered node list: $(0,1,2,3,4,5,6,7,8)$	Ordered node list: (0,1,3,7,8,4,2,5,6)
Ordered option list: (3,4,5,7,8)	Ordered option list: (7,8,4,5,6)
(a)	(b)

Figure 6: Depth-first (a) versus breadth-first (b) traversal of a binary tree. The nodes are labeled with numbers according to the order in which they will be visited during traversal.

2.2 Tree paths

A path in a finite tree is a finite sequence of edges which joins a sequence of nodes. For a rooted tree, there is always a unique shortest path from the root to any other node. We denote the sequence of nodes along the shortest path from the root to node $i$ as $p_{i}$ and denote its $j^{th}$ element as $p_{ij}$ . The sequence $p_{i}$ begins with the root node and ends with the node $i$ . The number of nodes in $p_{i}$ is denoted $|p_{i}|$ .

2.3 Tree isomorphisms

Trees may have important symmetries. For example, the nodes of a tree may be rearranged without changing the structure it represents. Two trees which share the same structure are said to be isomorphic. The concept of tree isomorphism is inherited from the concept of graph isomorphism [21], for which tree isomorphisms are a special case.

Definition 2.2 (Rooted tree isomorphism).

Let $T_{1}$ and $T_{2}$ be two rooted trees with node sets $N_{1}$ and $N_{2}$ , edge sets $E_{1}$ and $E_{2}$ , and roots $r_{1}\in N_{1}$ and $r_{2}\in N_{2}$ , respectively. An isomorphism of $T_{1}$ and $T_{2}$ is a bijection between the node sets $\varphi:N_{1}\to N_{2}$ such that

\forall u,v\in N_{1}\ \ (u,v)\in E_{1}\Leftrightarrow(\varphi(u),\varphi(v))\in E_{2}

and $\varphi(r_{1})=r_{2}$ .

In words, a rooted tree isomorphism is a mapping between the node sets such that each edge is preserved, along with the root node. An example, Figure 7 shows two isomorphisms of the tree presented in Figure 6(a). Note that isomorphisms of binary trees are generated by flips at nodes, wherein the left and right descendants of a given node are exchanged.

Problems associated with tree isomorphisms arise in many applications. In particular, a standard problem in computer science is to determine whether two rooted trees $T_{1}$ and $T_{2}$ are isomorphic. A classic algorithm due to Aho, Hopcroft, and Ullman [22] solves the problem in $O(n)$ time for trees with $n$ vertices.


Ordered node list: (0,6,7,8,1,2,3,4,5)	Ordered node list: (0,1,5,2,3,4,6,7,8)
Ordered option list: (7,8,3,4,5)	Ordered option list: (5,3,4,7,8)
(a)	(b)

Figure 7: Tree isomorphisms are generated by flips at nodes. Here we show two isomorphisms of the tree presented in Figure 6(a), keeping the node numbers from the previous figure. Panel (a): isomorphism generated from flipping at node 0, i.e., applying

\gamma_{0}

. Panel (b): isomorphism generated from flipping at node 1, i.e., applying

\gamma_{1}

2.4 Symmetry group of a tree and of options

The set of isomorphisms of a given tree exhibit a group structure. The elements of the group are generated by the flips at internal nodes described above and the group operation is given by composition. It is clear that each flip is its own inverse, as exchanging left and right descendants of a node twice leaves the tree unchanged. Flips may be carried out in any order, so the operation is associative, and the identity is the element consisting of no flips.

When a tree $T$ is a parsing of a set of $n_{o}$ options, isomorphisms of the tree generate isomorphisms of the option set $\{1,\ldots,n_{o}\}$ . Recall that an isomorphism of $T$ is a bijection $\varphi$ from the node set of $T$ to itself. Thus, a node $i$ is mapped to $j=\varphi(i)$ and the set $\{o(i):i\in\{1,\ldots,n_{o}\}\}$ is mapped to $\{o(\varphi(i)):i\in\{1,\ldots,n_{o}\}\}$ . The group of all possible isomorphisms of $n_{o}$ objects is $S_{n_{o}}$ . Note, however, that not all such isomorphisms can be generated by the set of tree isomorphisms. For example, nodes that are siblings must remain siblings even under isomorphism operations.

Let the tree $T$ be a parsing of a set of $n_{o}$ options. The set of isomorphisms the options that can be generated by isomorphisms of $T$ forms a group $\Gamma_{T}\leq S_{n_{o}}$ whose structure is given by a wreath product of copies of $S_{2}$ . This can be seen as follows. Let $i$ be an internal node in the tree $T$ and let $ro(i)$ and $lo(i)$ denote the set of options associated with its right and left descendants, respectively. For example, for the tree in Figure 7(a), $ro(0)=\{o(3),o(4),o(5)\}$ and $lo(0)=\{o(7),o(8)\}$ . Let $\gamma_{i}$ represent the action of performing a flip at node $i$ . Then $\gamma_{i}$ exchanges the sets $ro(i)$ and $lo(i)$ . Explicitly, we have

(8)

\gamma_{i}:(ro(i),lo(i))\mapsto(lo(i),ro(i)).

Note that applying $\gamma_{i}$ twice results in the identity mapping, so $\gamma_{i}$ generates the permutation group $S_{2}$ acting on the set $\{ro(i),lo(i)\}$ . Furthermore, any options which are not associated with the descendants of node $i$ are unaffected. Thus, one can think of $\gamma_{i}$ as generating a representation of $S_{2}$ acting on the set $\{1,\ldots,n_{o}\}$ ; this is trivially a subgroup of $S_{n_{o}}$ . Applying $\gamma_{j}$ for another internal node $j\neq i$ generates another representation of $S_{2}$ . The group generated by application of both actions $\gamma_{i}$ and $\gamma_{j}$ is the wreath product $S_{2}\wr S_{2}$ . This process can be extended for each internal node $i$ , yielding a symmetry group which is the repeated wreath product of copies of $S_{2}$ . Formally we have the following Proposition.

Proposition 2.3 (Symmetry group of a parsing).

Let the tree $T$ be a parsing of a set of $n_{o}$ options. Denote the number of internal nodes of $T$ by $n_{i}$ . The symmetry group $\Gamma_{T}$ associated with the isomorphisms of $T$ is given by the $n_{i}$ -fold wreath product of $S_{2}$

(9)

\Gamma_{T}=\underbrace{S_{2}\wr S_{2}\wr\cdots\wr S_{2}}_{n_{i}\text{ times}}\leq S_{n_{o}}.

3 A recursively-defined vector field

In this section we show how to construct a decision-making vector field for a given parsing of a finite set of options. We suppose we have a finite set of $n_{o}$ options and a tree $T$ which is a parsing of the options. Furthermore, each option $i$ is associated with a scalar $v_{i}>0$ that encodes its importance or value. We seek a vector field $f$ operating on the state space $\Delta^{n_{o}}$ with attracting fixed points associated with the high-value options.

We label the $n_{n}$ nodes of $T$ with an index $i$ , with the root node having the index $i=0$ . We define the following notation to describe the tree structure. For a node $i$ , we denote its parent by $\operatorname{p}(i)$ and its left and right child nodes by $\operatorname{lc}(i)$ and $\operatorname{rc}(i)$ , respectively. The descendants of a node $i$ consist recursively of the node’s children $\operatorname{lc}(i)$ and $\operatorname{rc}(i)$ along with the descendants of the children. We denote the descendants of a node $i$ by $\operatorname{d}(i)$ . The descendants can be partitioned into left descendants and right descendants, which consist of the left child and its descendants and the right child and its descendants, respectively. For a node $i$ , we denote the left descendants by $\operatorname{ld}(i)$ and the right descendants by $\operatorname{rd}(i)$ , respectively. Recall that leaf nodes represent options. As above, let $o(i)$ be the option associated with a leaf node $i$ .

To each node $i$ we associate the state $z_{i}\in\mathbb{R}$ and the value $u_{i}>0$ . Furthermore, to each internal (non-leaf) node $i$ we associate states $\mathbf{z}_{i}\in\mathbb{R}^{2},\mathbf{m}_{i}\in\Delta^{2}$ , and $\mathbf{v}_{i}\in\mathbb{R}_{+}^{2}$ . These additional states represent quantities associated with the node’s children. We denote the $j^{th}$ component of $\mathbf{z}_{i},\mathbf{m}_{i}$ , and $\mathbf{v}_{i}$ by $z_{ij},m_{ij},$ and $v_{ij}$ , respectively.

The states $z_{i}$ are defined recursively as follows. Let $z_{0}=1$ . Then, for $i\geq 0,z_{lc(i)}=z_{i}m_{i1}$ and $z_{rc(i)}=z_{i}m_{i2}$ . Alternatively, for $i\geq 1,z_{i}=z_{\operatorname{p}(i)}m_{\operatorname{p}(i)j}$ , where

j=\begin{cases}1,&\text{if }i=\operatorname{lc}(\operatorname{p}(i)),\\ 2,&\text{if }i=\operatorname{rc}(\operatorname{p}(i)).\end{cases}

The state vector $\mathbf{z}_{i}$ associated with internal node $i$ has components $\mathbf{z}_{i}=(z_{\operatorname{lc}(i)},z_{\operatorname{rc}(i)})^{T}=z_{i}\mathbf{m}_{i}$ . Note that the definition of $z_{i}$ is invertible for $z_{\operatorname{p}(i)}\neq 0$ : in this case, we have $\mathbf{m}_{i}=\mathbf{z}_{i}/z_{i}$ . The components are $m_{\operatorname{p}(i)j}=z_{i}/z_{\operatorname{p}(i)}$ , where $j=1$ if $i=\operatorname{lc}(\operatorname{p}(i))$ and $j=2$ if $i=\operatorname{rc}(\operatorname{p}(i))$ . Similarly, for a node $i$ , $u_{i}$ is equal to the mean of the associated children’s values

(10)

u_{i}=\begin{cases}v_{o(i)},&\text{if $i$ is a leaf},\\ (v_{\operatorname{lc}(i)}+v_{\operatorname{rc}(i)})/2,&\text{if $i$ is an internal node.}\end{cases}

Then, for an internal node $i$ , $v_{i1}=u_{\operatorname{lc}(i)}$ and $v_{i2}=u_{\operatorname{rc}(i)}$ .

We endow the motivation states $\mathbf{m}_{i}$ associated with an internal node $i$ with dynamics $\dot{}\mathbf{m}_{i}=f(\mathbf{m}_{i},\mathbf{v}_{i})$ , where $f$ is the Seeley et al. dynamics (3). The dynamics of the overall tree $T$ consists of copies of the dynamics $f$ defined as follows. Recall that $n_{i}$ is the number of internal nodes of $T$ and let $\mathbf{m}\in\mathbb{R}^{2n_{i}}$ be the vector consisting of the stacked node states $\mathbf{m}_{i}$ . Note that the definition of $\mathbf{m}$ , i.e., the order in which the $\mathbf{m}_{i}$ are stacked, is arbitrary: different orders correspond to permutations of the coordinates. The structure of the dynamics is encoded in the tree structure, i.e., the parent-child relationships given by the functions $\operatorname{p},\operatorname{lc}$ , and $\operatorname{rc}$ . For practical purposes of performing computations, one chooses a scheme for numbering the coordinates that corresponds to a scheme for traversing the nodes of the tree. We choose to traverse the tree $T$ in a depth-first manner and define $\mathbf{m}$ by

\mathbf{m}=(\mathbf{m}_{0}^{T},\underbrace{\mathbf{m}_{\operatorname{lc}(0)}^{T},\mathbf{m}_{\operatorname{lc}(\operatorname{lc}(0))}^{T},\ldots}_{\text{left descendants of node }0},\underbrace{\mathbf{m}_{\operatorname{rc}(0)}^{T},\mathbf{m}_{\operatorname{lc}(\operatorname{rc}(0))}^{T},\ldots}_{\text{right descendants of node }0})^{T}.

The dynamics of $\mathbf{m}$ are defined by stacking the dynamics of the component states $\mathbf{m}_{i}$ :

(11)	$\displaystyle\dot{}\mathbf{m}=f_{m}(\mathbf{m},\mathbf{v})$	$\displaystyle=(\dot{}\mathbf{m}_{0}^{T},\dot{}\mathbf{m}_{\operatorname{lc}(0)}^{T},\dot{}\mathbf{m}_{\operatorname{lc}(\operatorname{lc}(0))}^{T},\ldots,\dot{}\mathbf{m}_{\operatorname{rc}(0)}^{T},\dot{}\mathbf{m}_{\operatorname{lc}(\operatorname{rc}(0))}^{T},\ldots)^{T}$
	$\displaystyle=(f(\mathbf{m}_{0},\mathbf{v}_{0})^{T},f(\mathbf{m}_{\operatorname{lc}(0)},\mathbf{v}_{\operatorname{lc}(0)})^{T},f(\mathbf{m}_{\operatorname{lc}(\operatorname{lc}(0))},\mathbf{v}_{\operatorname{lc}(\operatorname{lc}(0))})^{T},\ldots,$
	$\displaystyle\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ f(\mathbf{m}_{\operatorname{rc}(0)},\mathbf{v}_{\operatorname{rc}(0)})^{T},f(\mathbf{m}_{\operatorname{lc}(\operatorname{rc}(0))},\mathbf{v}_{\operatorname{lc}(\operatorname{rc}(0))})^{T},\ldots)^{T},$

where $\mathbf{v}\in\mathbb{R}^{2n_{i}}$ is the vector of the node value states $\mathbf{v}_{i}$ stacked in depth-first order.

The dynamics of $\mathbf{m}_{i}$ then defines dynamics of the states $\mathbf{z}_{i}$ by a simple change of coordinates. Recall that $\mathbf{z}_{i}=(z_{\operatorname{lc}(i)},z_{\operatorname{rc}(i)})^{T}=z_{i}\mathbf{m}_{i}$ with $z_{0}=1$ . Then $\dot{}\mathbf{z}_{i}=\dot{z}_{i}\mathbf{m}_{i}+z_{i}\dot{}\mathbf{m}_{i}=\dot{z}_{i}\mathbf{m}_{i}+z_{i}f(\mathbf{m}_{i},\mathbf{v}_{i})$ . As above, we construct the vector $\mathbf{z}\in\mathbb{R}^{2n_{i}}$ by stacking the individual $\mathbf{z}_{i}$ in depth-first order. When $z_{i}\neq 0$ , we have $\mathbf{m}_{i}=\mathbf{z}_{i}/z_{i}$ , so $\dot{}\mathbf{z}_{i}$ can be written in terms of $\mathbf{z}$ . We denote the resulting dynamics of $\mathbf{z}$ by

(12)

\dot{}\mathbf{z}=f_{z}(\mathbf{z},\mathbf{v}).

In the following section we show that the functions $f_{m}$ and $f_{z}$ are equivariant under changes of coordinates that correspond to isomorphisms of the tree $T$ .

Let $\mathbf{m}_{o}\in\Delta^{n_{o}}$ be the state that represents the system’s decision among the $n_{o}$ options. We relate the state $\mathbf{z}$ to $\mathbf{m}_{o}$ by projection onto $\mathbb{R}^{n_{o}+1}$ . Let $o^{-1}:\{1,\ldots,n_{o}\}\to\{1,\ldots,n_{n}\}$ be the function that maps from an option $i$ to its associated leaf node.

Definition 3.1 (Projected dynamics).

Let $T$ be a parsing of $n_{o}$ options, let $g$ be the dynamics defined by (12), and let $h:\mathbb{R}^{n_{n}}\to\mathbb{R}^{n_{o}+1}$ be the projection that reads off the elements of $\mathbf{z}$ that correspond to the leaves of the tree $T$ . Explicitly, $h:\mathbf{z}\mapsto\mathbf{m}_{o}$ by

(13)

(\mathbf{m}_{o})_{i}=m_{oi}=\begin{cases}z_{o^{-1}(i)},&i\in\{1,2,\ldots,n_{o}\}\\ 1-\sum_{i=1}^{n_{o}}m_{oi},&i=n_{o}+1.\end{cases}

The projection $h$ then defines the dynamics of the projected state $\mathbf{m}_{o}$ by

(14)

\dot{}\mathbf{m}_{o}=f_{o}(\mathbf{z},\mathbf{v})=h(f_{z}(\mathbf{z},\mathbf{v})).

The projected dynamics (14) leave the simplex $\Delta^{n_{o}}$ invariant.

Theorem 3.2.

Let $T$ be a parsing of $n_{o}$ options, let $f_{z}$ be the dynamics defined by (12), and let $h:\mathbb{R}^{n_{n}}\to\mathbb{R}^{n_{o}+1}$ be the projection (13) that reads off the elements of $\mathbf{z}$ that correspond to the leaves of the tree $T$ . Let $f_{o}$ be the dynamics defined by (14). Then $f_{o}$ leaves the simplex $\Delta^{n_{o}}$ invariant.

Proof 3.3.

Let $\mathbf{m}$ be the projected state. Proving the claim reduces to showing that $m_{i}\geq 0$ and that $\sum_{i=1}^{n_{o}}m_{i}\leq 1$ . We proceed by induction from the bottom of the tree. Let $i$ be a generic leaf node of $T$ . By definition, $T$ is a proper binary tree, so $i$ has a sibling. Furthermore, since $i$ is a leaf node, it has a parent. Let $j$ denote the sibling of $i$ and $\operatorname{p}(i)$ the common parent of $i$ and $j$ . By the definition of the projection $h$ , we have $m_{o(i)}=z_{i},m_{o(j)}=z_{j}$ . By the definition of $\mathbf{z}_{i}$ , we have

\mathbf{z}_{\operatorname{p}(i)}=(z_{i},z_{j})=z_{\operatorname{p}(i)}\mathbf{m}_{\operatorname{p}(i)}=z_{\operatorname{p}(i)}(m_{\operatorname{p}(i)1},m_{\operatorname{p}(i)2}).

Note that the dynamics (3) of $\mathbf{m}_{\operatorname{p}(i)}$ leave the simplex $\Delta^{2}$ invariant. This implies that the components $m_{\operatorname{p}(i)j}\geq 0$ for $i=1,2$ , and

(15)

m_{\operatorname{p}(i)1}+m_{\operatorname{p}(i)2}\leq 1.

Furthermore, we have that $z_{i}=m_{o(i)}$ and $z_{j}=m_{o(j)}$ are both non-negative. Multiplying the expression (15) by $z_{\operatorname{p}(i)}$ yields the bound

m_{o(i)}+m_{o(j)}=z_{i}+z_{j}\leq z_{\operatorname{p}(i)}.

Thus, the sum of the $m_{i}$ associated with descendants of node $k=\operatorname{p}(i)$ is upper bounded by $z_{k}$ . The analogous argument holds for the parent of node $k$ , and thus we can inductively work our way up the tree. At each node $l$ , the sum of the $m_{i}$ associated with descendants of node $l$ is upper bounded by $z_{l}$ .

The base case of the inductive argument is the root node $i=0$ . All $n_{o}$ options are descendant from the root node, so we have $\sum_{i=1}^{n_{o}}m_{i}\leq z_{0}=1$ , as desired.

4 The vector field is equivariant under tree isomorphisms

Recall from Definition 2.2 above that two rooted trees $T_{1}$ and $T_{2}$ are said to be isomorphic if there exists a bijection mapping between the nodes of $T_{1}$ and $T_{2}$ that preserves the root node. The vector field $f_{z}$ defined by (12) and its projection $f_{o}$ defined by (14) are equivariant under changes of coordinates which correspond to tree isomorphisms.

Let $T_{1}$ and $T_{2}$ be two isomorphic trees. By definition they must have the same number $n_{n}$ of nodes. The isomorphism between the trees is a bijection between the node sets of $T_{1}$ and $T_{2}$ . In other words, it is a bijective map $\sigma:\{1,\ldots,n_{n}\}\to\{1,\ldots,n_{n}\}$ . This is precisely the definition of a permutation.

Definition 4.1.

Let $T_{1}$ and $T_{2}$ be two isomorphic trees each with $n_{n}$ nodes. The map $\sigma:\{1,\ldots,n_{n}\}\to\{1,\ldots,n_{n}\}$ associated with the isomorphism between the trees is called the node permutation corresponding to the isomorphism.

The dynamics (12) obey symmetries that correspond to isomorphisms of the underlying tree $T$ . Formally, the dynamics (12) are said to be equivariant.

Definition 4.2.

Let $X=\mathbb{R}^{n}$ and suppose that $\Gamma$ is a compact Lie group acting on $X$ . Then a mapping $F:X\times\mathbb{R}\to X$ is $\Gamma$ -equivariant if and only if

F(\gamma x,\lambda)=\gamma F(x,\lambda)

for all $\gamma\in\Gamma$ , where $\lambda\in\mathbb{R}$ is a parameter.

The Seeley et al. dynamics (3) obey a symmetry that correspond to swapping the labels of the two options. When the option values are identical, the dynamics are $S_{2}$ -equivariant.

Lemma 4.3.

Let $\pi_{2}\in S_{2}$ represent the permutation of two elements. The Seeley et al. dynamics $f$ defined by (3) are preserved under the action of $\pi_{2}$ . Specifically, we have

f(\pi_{2}\mathbf{m},\pi_{2}\mathbf{v})=\pi_{2}f(\mathbf{m},\mathbf{v}).

When $\mathbf{v}=(v,v)^{T}$ , $f(\pi_{2}\mathbf{m},\mathbf{v})=\pi_{2}f(\mathbf{m},\mathbf{v})$ , and the dynamics are $S_{2}$ -equivariant.

Proof 4.4.

The first statement is proven by straightforward substitution. From (3), we have

f(\pi_{2}\mathbf{m},\pi_{2}\mathbf{v})=\begin{bmatrix}v_{2}(1-m_{1}-m_{2})-m_{2}\left(\frac{1}{v_{2}}-v_{2}(1-m_{1}-m_{2})+\sigma(m_{1}m_{2})\right)\\ v_{1}(1-m_{1}-m_{2})-m_{1}\left(\frac{1}{v_{1}}-v_{1}(1-m_{1}-m_{2})+\sigma(m_{1}m_{2})\right)\end{bmatrix}=\pi_{2}f(\mathbf{m},\mathbf{v}).

The second statement follows by noting that $\mathbf{v}=(v,v)^{T}$ implies $\pi_{2}\mathbf{v}=\mathbf{v}$ . Then, we have $f(\pi_{2}\mathbf{m},\mathbf{v})=f(\pi_{2}\mathbf{m},\pi_{2}\mathbf{v})=\pi_{2}f(\mathbf{m},\mathbf{v})$ .

Let $T$ be a parsing of $n_{o}$ options. Recall from Proposition 2.3 that the set of isomorphisms associated with $T$ form a group denoted $\Gamma_{T}$ . These isomorphisms are represented by permutations $\sigma$ . The dynamics (12) obey symmetries corresponding to the permutations associated with $\Gamma_{T}$ . When the option values are all identical, the dynamics are $\Gamma_{T}$ -equivariant.

Lemma 4.5.

The dynamics (11) defined by a tree $T$ are preserved under isomorphisms of $T$ . Explicitly, let $T$ be a parsing of $n_{o}$ options, $f_{m}$ be the dynamics (11), and let $\gamma\in\Gamma_{T}$ . Then, $f_{m}(\gamma\mathbf{m},\gamma\mathbf{v})=\gamma f_{m}(\mathbf{m},\mathbf{v})$ .

Proof 4.6.

Let $\mathbf{m}$ represent the coordinates of (11) that result from the default depth-first parsing of the tree $T$ . Let $\gamma_{i}\in\Gamma_{T}$ represent the operation of flipping the tree $T$ at internal node $i$ . Note that any $\gamma\in\Gamma_{T}$ can be represented as a composition of several flips $\gamma_{i}$ , so it suffices to show that $f_{m}(\gamma_{i}\mathbf{m},\gamma_{i}\mathbf{v})=\gamma_{i}f_{m}(\mathbf{m},\mathbf{v})$ for any flip $\gamma_{i}$ .

Let $T_{i}^{\prime}$ be the tree that results from flipping $T$ at the internal node $i$ , and let $\mathbf{m}^{\prime}$ represent the coordinates of (11) that result from the depth-first traversal of the tree $T$ . The dynamics (11) take the form $\dot{}\mathbf{m}=f_{m}(\mathbf{m},\mathbf{v})$ in the coordinates associated with tree $T$ and $\dot{}\mathbf{m}^{\prime}=f_{m}(\mathbf{m}^{\prime},\mathbf{v}^{\prime})$ in the coordinates associated with $T^{\prime}$ . Note that $\mathbf{v}^{\prime}$ represents $\mathbf{v}$ in the coordinates associated with $T^{\prime}$ .

The action of $\gamma_{i}$ permutes the descendants of node $i$ , and in particular swaps the right and left children of $i$ : $\gamma_{i}:(m_{i1},m_{i2})\mapsto(m_{i2},m_{i1})$ . Compactly, this can be written as $\pi_{2}(m_{i1},m_{i2})=(m_{i2},m_{i1}),$ where $\pi_{2}\in S_{2}$ represents the permutation of two elements. The relation between $\mathbf{m}$ and $\mathbf{m}^{\prime}$ is as follows

	$\displaystyle\mathbf{m}$	$\displaystyle=(\mathbf{m}_{0}^{T},\ldots,\mathbf{m}_{i}^{T},\underbrace{\mathbf{m}_{\operatorname{lc}(i)}^{T},\mathbf{m}_{\operatorname{lc}(\operatorname{lc}(i))}^{T},\ldots}_{\text{left descendants of node }i},\underbrace{\mathbf{m}_{\operatorname{rc}(i)}^{T},\mathbf{m}_{\operatorname{lc}(\operatorname{rc}(i))}^{T},\ldots}_{\text{right descendants of node }i},\ldots)^{T}$
	$\displaystyle\mathbf{m}^{\prime}$	$\displaystyle=(\mathbf{m}_{0}^{T},\ldots,\pi_{2}\mathbf{m}_{i}^{T},\underbrace{\mathbf{m}_{\operatorname{rc}(i)}^{T},\mathbf{m}_{\operatorname{lc}(\operatorname{rc}(i))}^{T},\ldots}_{\text{right descendants of node }i},\underbrace{\mathbf{m}_{\operatorname{lc}(i)}^{T},\mathbf{m}_{\operatorname{lc}(\operatorname{lc}(i))}^{T},\ldots}_{\text{left descendants of node }i},\ldots)^{T},$

where $\operatorname{lc}$ and $\operatorname{rc}$ are the child relationships associated with tree $T$ .

Recall that $\mathbf{m}_{i}=(m_{i1},m_{i2})$ obeys the dynamics $\dot{}\mathbf{m}_{i}=f(\mathbf{m}_{i},\mathbf{v}_{i})$ given by (3). By Lemma 4.3, we have $f(\pi_{2}\mathbf{m}_{i},\pi_{2}\mathbf{v}_{i})=\pi_{2}f(\mathbf{m}_{i},\mathbf{v}_{i})$ , so the action of $\gamma_{i}$ leaves the dynamics of $\mathbf{m}_{i}$ equivariant. It remains to study the action of $\gamma_{i}$ on the descendants of node $i$ . As seen above, the action of $\gamma_{i}$ on these descendants is a block permutation, mapping $(\mathbf{m}_{\operatorname{lc}(i)},\mathbf{m}_{\operatorname{rc}(i)})\mapsto(\mathbf{m}_{\operatorname{rc}(i)},\mathbf{m}_{\operatorname{lc}(i)})$ , etc. The dynamics of each block $j$ is given by $\dot{}\mathbf{m}_{j}=f(\mathbf{m}_{j},\mathbf{v}_{j})$ and the overall dynamics $f_{m}$ is a simple stacking of copies of $f$ . Since the action of $\gamma_{i}$ permutes both the blocks of $\mathbf{m}$ and $\mathbf{v}$ in the same way, the dynamics $f_{m}(\mathbf{m}^{\prime},\mathbf{v}^{\prime})$ consists of a permutation of the blocks of $f_{m}(\mathbf{m},\mathbf{v})$ . Thus, we have

f_{m}(\mathbf{m}^{\prime},\mathbf{v}^{\prime})=f_{m}(\gamma_{i}\mathbf{m},\gamma_{i}\mathbf{v})=\gamma_{i}f_{m}(\mathbf{m},\mathbf{v})

for any flip $\gamma_{i}$ . The result follows by recalling that a generic $\gamma\in\Gamma_{T}$ can be represented by the composition of several flips $\gamma_{i}$ .

Theorem 4.7.

Let the conditions for Lemma 4.5 be satisfied and suppose that $\mathbf{v}=v\mathbf{1}_{n_{o}}$ , where $\mathbf{1}_{n_{o}}\in\mathbb{R}^{n_{o}}$ is the vector with all entries equal to 1, i.e., when $v_{i}=v\forall i\in\{1,\ldots,n_{o}\}$ . Then, the dynamics $f_{z}$ defined by (12) are equivariant under permutations of $\mathbf{z}$ corresponding to isomorphisms of $T$ .

Proof 4.8.

Let $\dot{}\mathbf{z}=f_{z}(\mathbf{z},\mathbf{v})$ be the dynamics (12). Let $\dot{}\mathbf{m}=f_{m}(\mathbf{m},\mathbf{v})$ be the dynamics (11). Note that the vector fields $f_{z}$ and $f_{m}$ are related by a change of coordinates $\mathbf{z}=g(\mathbf{m})$ that is invertible away from the origin $\mathbf{m}=0$ . It is clear that $g(\gamma\mathbf{m})=\gamma g(\mathbf{m})\forall\gamma\in\Gamma_{T}$ . Then, elementary calculus yields

(16)

f_{z}(\mathbf{z},\mathbf{v})=\dot{}\mathbf{z}=\frac{d}{dt}g(\mathbf{m})=\frac{\partial g}{\partial\mathbf{m}}\dot{}\mathbf{m}=\frac{\partial g}{\partial\mathbf{m}}f_{m}(\mathbf{m},\mathbf{v})=\frac{\partial g}{\partial\mathbf{m}}(g^{-1}(\mathbf{z}))f_{m}(g^{-1}(\mathbf{z}),\mathbf{v}).

Analogously, we have $f_{z}(\gamma\mathbf{z},\gamma\mathbf{v})=\frac{\partial g}{\partial\mathbf{m}}(g^{-1}(\gamma\mathbf{z}))f_{m}(g^{-1}(\gamma\mathbf{z}),\gamma\mathbf{v})$ . The fact that $g(\gamma\mathbf{m})=\gamma g(\mathbf{m})=\gamma\mathbf{z}$ implies that $\gamma g^{-1}(\mathbf{z})=\gamma\mathbf{m}=g^{-1}(\gamma\mathbf{z})$ . The chain rule yields $\frac{\partial g(\gamma\mathbf{m})}{\partial\mathbf{m}}=\frac{\partial g(\gamma\mathbf{m})}{\partial\mathbf{m}}\gamma$ . Similarly, $g(\gamma\mathbf{m})=\gamma g(\mathbf{m})$ implies that $\frac{\partial g(\gamma\mathbf{m})}{\partial\mathbf{m}}=\gamma\frac{\partial g(\mathbf{m})}{\partial\mathbf{m}}$ . Finally, note that $\gamma\cdot\gamma$ is equal to the identity for any $\gamma\in\Gamma_{T}$ . Putting these facts together yields

	$\displaystyle f_{z}(\gamma\mathbf{z},\gamma\mathbf{v})$	$\displaystyle=\frac{\partial g}{\partial\mathbf{m}}(g^{-1}(\gamma\mathbf{z}))f_{m}(g^{-1}(\gamma\mathbf{z}),\gamma\mathbf{v})$
		$\displaystyle=\gamma\frac{\partial g^{-1}(\mathbf{z})}{\partial\mathbf{m}}\gamma f_{m}(\gamma g^{-1}(\mathbf{z}),\gamma\mathbf{v})$
		$\displaystyle=\gamma\frac{\partial g^{-1}(\mathbf{z})}{\partial\mathbf{m}}\gamma\cdot\gamma f_{m}(g^{-1}(\mathbf{z}),\mathbf{v})=\gamma f_{z}(\mathbf{z},\mathbf{v}).$

Thus, the dynamics $f_{z}$ obey the same tree isomorphism symmetry as the dynamics $f_{m}$ . When $\mathbf{v}=v\mathbf{1}$ , $\gamma\mathbf{v}=\mathbf{v}\forall\gamma\in\Gamma_{T}$ . Then $f_{z}(\gamma\mathbf{z},\mathbf{v})=g(\gamma\mathbf{z},\gamma\mathbf{v})=\gamma g(\mathbf{z},\mathbf{v})$ , the desired result.

The implication of Lemma 4.5 and Theorem 4.7 is that the fundamental structure of the dynamics (11) and (12) is encoded in structure of the parsing $T$ . Furthermore, when all the options have equal values, they are treated the same in the sense that by the dynamics of the corresponding states are unchanged by permutation of the coordinates. When the option values differ, however, these symmetries can be broken. The symmetry breaking can be understood by studying the bifurcation properties of the vector field.

5 Bifurcation properties of the equivariant vector field

The Seeley et al. dynamics (3) decide between two options using a pitchfork bifurcation that unfolds as the values of the two options differ. The dynamics (11) and (12) introduced in Section 3 embed multiple copies of the pitchfork bifurcation inherited from (3). In this section we make this statement precise. We begin by recalling the definition of a $k$ -parameter unfolding of a bifurcation.

Definition 5.1 ([16]).

Let $f(x,\lambda)=0$ be an equation which undergoes a bifurcation as $\lambda\in\mathbb{R}$ is varied. An unfolding of $f$ is a parametrized family of functions $F(x,\lambda,\alpha),\alpha\in\mathbb{R}^{k}$ , such that $F(x,\lambda,0)=f(x,\lambda)$ . One refers to $F$ as a $k$ -parameter unfolding of $f$ .

We now recall the formal bifurcation result concerning the Seeley et al. dynamics (3).

Theorem 5.2.

[1, 3] Let $\dot{}\mathbf{m}=f(\mathbf{m},\mathbf{v})$ be the dynamics (3) and let $\mathbf{v}=v\mathbf{1}\in\mathbb{R}_{+}^{2}$ be the vector with both entries equal to $v>1$ . The dynamics undergo a pitchfork bifurcation as the parameter $\sigma$ increases through a critical value given by

(17)

\sigma=\frac{4v^{3}}{(v^{2}-1)^{2}}.

Equivalently, for fixed $\sigma$ , the dynamics undergo a pitchfork bifurcation as the parameter $v$ increases through the critical value $v=v^{*}$ solving (17).

Proof 5.3.

When $\mathbf{v}=v\mathbf{1}$ , straightforward computation shows that the dynamics $f$ defined by (3) have an equilibrium $\mathbf{m}=\bar{m}\mathbf{1}$ , where $\bar{m}$ satisfies

(18)

\bar{m}=\frac{-(1+v^{2})+\sqrt{1+2v^{2}+4\sigma v^{3}+9v^{4}}}{2v(2v+\sigma)}.

Evaluating the Jacobian of $f$ yields

(19)		$\displaystyle J$	$\displaystyle=\left.\begin{bmatrix}-\frac{1}{v_{1}}-v_{1}(1+m_{1})+v_{1}m_{U}-\sigma m_{2}&-v_{1}(1+m_{1})-\sigma m_{1}\\ -v_{2}(1+m_{2})&-\frac{1}{v_{2}}-v_{2}(1+m_{2})+v_{2}m_{U}-\sigma m_{1}\end{bmatrix}\right\|_{(\mathbf{m},\mathbf{v})=(\bar{m}\mathbf{1},v\mathbf{1})}$
		$\displaystyle=\begin{bmatrix}-\frac{1}{v}-3v\bar{m}-\sigma\bar{m}&-v(1+\bar{m})-\sigma\bar{m}\\ -v(1+\bar{m})-\sigma\bar{m}&-\frac{1}{v}-3v\bar{m}-\sigma\bar{m}\end{bmatrix}.$

The eigenvalues of $J$ are $\lambda_{1}=\frac{-2\bar{m}v^{2}+v^{2}-1}{v},\lambda_{2}=\frac{-4\bar{m}v^{2}-2\bar{m}\sigma v-v^{2}-1}{v}$ . Consider $\lambda_{1}$ and $\lambda_{2}$ as functions of $\sigma$ . Simple substitution shows that $\lambda_{2}<0\forall\sigma>0$ and that $\lambda_{1}$ smoothly increases through zero as $\sigma$ increases through the value $\sigma^{*}$ defined by (17).

As shown in Corollary 1.2, the pitchfork bifurcation embedded in the $S_{2}$ -equivariant dynamics (3) unfolds as a function of a single parameter $\alpha=2(v_{1}-v_{2})/(v_{1}+v_{2})$ . Note that when $v_{1}=v_{2},\alpha=0$ . The bifurcation and unfolding properties of (3) carry over naturally to the dynamics (11).

Note that the dynamics $f_{m}$ defined by (11) consist of stacked copies of (3), so the Jacobian of $f_{m}$ is a block diagonal matrix whose diagonal blocks are copies of $J$ defined in (19). The singularity of $f_{m}$ then unfolds as a function of $n_{i}=n_{o}-1$ parameters $\alpha_{i}=2(v_{i1}-v_{i2})/(v_{i1}+v_{i2})$ . Formally, we have the following theorem.

Theorem 5.4.

Let $T$ be a parsing of $n_{o}$ options consisting of $n_{i}$ internal nodes. Let $\Gamma_{T}$ be the isomorphism group of $T$ . Let $\dot{}\mathbf{m}=f_{m}(\mathbf{m},\mathbf{v})$ be the dynamics (11) defined by $T$ and let $v_{i}=v>1$ for each option $i\in\{1,\ldots,n_{o}\}$ Then,

i).

The vector field $f_{m}$ has an equilibrium $\mathbf{m}=\bar{m}\mathbf{1}_{2n_{i}}$ , where $\bar{m}$ is defined by (18).
ii).

The vector field has a singularity at $(\mathbf{m},v,\sigma)=(\bar{m}\mathbf{1}_{2n_{i}},v,\sigma)$ , where $\sigma$ and $v$ are related by (17). The singularity is a $\Gamma_{T}$ -equivariant bifurcation that consists of $n_{i}$ copies of the standard $S_{2}$ pitchfork bifurcation.
iii).

When $\mathbf{v}$ is perturbed away from $\mathbf{v}=v\mathbf{1}$ the system $f_{m}(\mathbf{m},\mathbf{v})$ is a $n_{o}-1$ -parameter unfolding of the $\Gamma_{T}$ -equivariant bifurcation.

Proof 5.5.

Recall from (11) that $f_{m}(\mathbf{m},\mathbf{v})$ consists of stacked copies of the dynamics $f(\mathbf{m}_{i},\mathbf{v}_{i})$ defined by (3). Since $v_{i}=v\forall i\in\{1,\ldots,n_{o}\}$ , $\mathbf{v}_{i}=v\mathbf{1}_{2}$ and $\mathbf{v}=v\mathbf{1}_{2n_{i}}$ . Thus, the $i^{th}$ block of $f_{m}(\mathbf{m},\mathbf{v})$ is equal to $f(\mathbf{m}_{i},\mathbf{v}_{i})$ , which has an equilibrium $\mathbf{m}_{i}=\bar{m}\mathbf{1}_{2}$ as seen in the proof of Theorem 5.2. The equilibrium of $f$ follows by stacking the blocks $\mathbf{m}_{i}$ , which proves statement i).

For statement ii), note that since the elements of $\mathbf{m}$ are stacked in the same order as those of $f_{m}(\mathbf{m},\mathbf{v})$ , the Jacobian of $f_{m}(\mathbf{m},\mathbf{v})$ is a block diagonal matrix with the $i^{th}$ diagonal block being the Jacobian of $f(\mathbf{m}_{i},\mathbf{v}_{i})$ . Thus, evaluating the Jacobian of $f_{m}(\mathbf{m},\mathbf{v})$ at the equilibrium in statement i) yields a block diagonal matrix

J_{m}=\left.\frac{\partial f_{m}(\mathbf{m},\mathbf{v})}{\partial\mathbf{m}}\right|_{(\mathbf{m},\mathbf{v})=(\bar{m}\mathbf{1}_{2n_{i}},v\mathbf{1}_{2n_{i}})}=\begin{bmatrix}J&0&\ldots&0\\ 0&J&\ldots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\ldots&J\end{bmatrix}\in\mathbb{R}^{2n_{i}\times 2n_{i}},

where $J$ is the matrix defined in (19). The eigenvalues of $J_{m}$ are $\lambda_{1}=\frac{-2\bar{m}v^{2}+v^{2}-1}{v}$ and $\lambda_{2}=\frac{-4\bar{m}v^{2}-2\bar{m}\sigma v-v^{2}-1}{v}$ , each with multiplicity $n_{i}$ . As shown in the proof of Theorem 5.2, $\lambda_{1}=0$ when $\sigma$ and $v$ are related by (17), so there is a singularity at $(\mathbf{m},v,\sigma)=(\bar{m}\mathbf{1}_{2n_{i}},v,\sigma)$ . This singularity consists of $n_{i}$ copies of the $S_{2}$ pitchfork bifurcation embedded in $J$ .

For statement iii), consider how the option values $v_{i},i\in\{1,\ldots,n_{o}\}$ are related to the value vector $\mathbf{v}\in\mathbb{R}^{2n_{i}}$ . The vector $\mathbf{v}$ consists of $n_{i}$ blocks $\mathbf{v}_{i}$ whose components are defined by (10), one for each internal node. Note that, since $T$ is a full binary tree, it is a well-established fact [23] that $n_{i}=n_{0}-1$ . Each $\mathbf{v}_{i},i\in\{1,\ldots,n_{i}\}$ corresponds to an unfolding parameter $\alpha_{i}=2(v_{i1}-v_{i2})/(v_{i1}+v_{i2})$ . Each $\alpha_{i}$ is an unfolding parameter, since $\mathbf{v}=v\mathbf{1}$ implies that $\alpha_{i}$ . The result follows by noting that $n_{i}=n_{o}-1$ .

The implication of this result is that the dynamics (11) embeds a bifurcation which consists of multiple copies of the standard pitchfork bifurcation (1). The structure of the equilibria of (11) in the post-bifurcation regime reflects the symmetry properties of the vector field and can be studied in detail by appeal to the equivariant branching lemma [20, Theorem 3.3]. The detailed analysis is beyond the scope of this paper, but we show numerical results in Section 7.1 below.

6 Model reduction via singular perturbation

The dynamics $f_{m}$ defined in (11) inherit a complicated rational form from the Seeley et al. dynamics (3). As shown in Theorem 1.1, the dynamics (3) can be reduced by singular perturbation. In this section, we carry out an analogous model reduction for (11) and show that the equilibria of the resulting reduced model can be readily understood.

6.1 Change of coordinates

As in [18], we apply singular perturbation theory to the dynamics (11) by mapping $\mathbf{v}\mapsto K\mathbf{v}$ for a constant gain $K>0$ and take the singular limit $K\to+\infty$ , or equivalently $\epsilon=1/K\to 0$ . The singular perturbation allows us to eliminate half of the state variables and thus to express equilibria in a straightforward way as a function of the option values $v_{i}$ . The singular perturbation is more readily analyzed by expressing $\mathbf{m}_{i}=(m_{i1},m_{i2})\in\Delta^{2}$ and $\mathbf{v}_{i}=(v_{i1},v_{i2})\in\mathbb{R}^{2}_{+}$ in terms of mean-difference coordinates defined by

\Delta m_{i}=m_{i1}-m_{i2},\ \bar{m}_{i}=\frac{m_{i1}+m_{i2}}{2},\text{and }\Delta v_{i}=v_{i1}-v_{i2},\ \bar{v}_{i}=\frac{v_{i1}+v_{i2}}{2},

respectively. Expressing $\mathbf{m}_{i}$ in mean-difference coordinates results in expressing $\mathbf{z}_{i}\in\mathbb{R}^{2}$ in corresponding mean-difference coordinates

(z_{i1},z_{i2})=\mathbf{z}_{i}=z_{i}\mathbf{m}_{i}=(z_{i}m_{i1},z_{i}m_{i2})^{T}=\left(z_{i}\frac{2\bar{m}_{i}+\Delta m_{i}}{2},z_{i}\frac{2\bar{m}_{i}-\Delta m_{i}}{2}\right)^{T}.

Note that the recursive definition of the $z_{i}$ coordinates as $\mathbf{z}_{i}=z_{i}\mathbf{m}_{i}$ is such that the value of $z_{i}$ can be expressed as the product of $\mathbf{m}_{i}$ along the path $p_{i}$ from the root to node $i$ . Recall that we define $p_{i}$ as the sequence of nodes traversed along the unique shortest path from the root to node $i$ . The sequence $p_{i}$ begins with the root node and ends with node $i$ . We denote the number of nodes in the sequence by $|p_{i}|$ , and the $j^{th}$ node of $p_{i}$ by $p_{ij}$ . Explicitly, we have

(20)

z_{i}=\prod_{j=1}^{|p_{i}|-1}\frac{2\bar{m}_{p_{ij}}+a_{j}\Delta m_{p_{ij}}}{2},\text{ where }a_{j}=\begin{cases}+1,&p_{i(j+1)}=\operatorname{lc}(p_{ij})\\ -1,&p_{i(j+1)}=\operatorname{rc}(p_{ij}).\end{cases}

The dynamics of $z_{i}$ follow from the dynamics (11) and can be reduced by applying singular perturbation theory. As in [14, 18], we map $\mathbf{v}$ to $K\mathbf{v}$ , where $K>0$ is a constant gain. To apply singular perturbation theory, we set $\epsilon=1/K$ to be a small parameter and define coordinates $x,y$ with components

x_{i}=\Delta m_{i},\ y_{i}=\frac{1-2\bar{m}_{i}}{\epsilon}.

Since the dynamics $f_{m}$ defined in (11) is composed of stacked copies of the dynamics $f$ defined in (3), singular perturbation of (11) can be carried out by singularly perturbing its components which consist of copies of $f$ . We can express the dynamics in the coordinates $(\Delta m_{i},\bar{m}_{i})$ using the dynamics (5), (6)

(21)		$\displaystyle\dot{}\Delta m_{i}$	$\displaystyle=f_{\Delta m}(\Delta m_{i},\bar{m}_{i};K\bar{v}_{i};K\Delta v_{i})$
(22)		$\displaystyle\dot{}\bar{m}_{i}$	$\displaystyle=f_{\bar{m}}(\Delta m_{i},\bar{m}_{i},\sigma;K\Delta v_{i}).$

In the singular perturbation coordinates $(x_{i},y_{i})$ , these dynamics become

(23)	$\displaystyle\dot{x}_{i}$	$\displaystyle=f_{x}(x_{i},y_{i};\Delta v_{i},\bar{v}_{i},\epsilon)$
	$\displaystyle=-\epsilon\left(\frac{1-\epsilon y_{i}+x_{i}}{2\bar{v}_{i}+\Delta v_{i}}-\frac{1-\epsilon y_{i}-x_{i}}{2\bar{v}_{i}-\Delta v_{i}}\right)+\bar{v}_{i}x_{i}y_{i}+\Delta v_{i}y_{i}\frac{3-\epsilon y_{i}}{2}$
(24)	$\displaystyle\epsilon\dot{y}_{i}$	$\displaystyle=g_{y}(x_{i},y_{i};\Delta v_{i},\bar{v}_{i},\epsilon)$
	$\displaystyle=\epsilon\left(\frac{1-\epsilon y_{i}+x_{i}}{2\bar{v}_{i}+\Delta v_{i}}-\frac{1-\epsilon y_{i}-x_{i}}{2\bar{v}_{i}-\Delta v_{i}}\right)+\frac{\sigma}{2}((1-\epsilon y_{i})^{2}-x_{i}^{2})$
	$\displaystyle\ \ \ \ -\frac{y_{i}}{2}\left((2\bar{v}_{i}+\Delta v_{i})\left(1+\frac{1-\epsilon y_{i}+x_{i}}{2}\right)\right)-\frac{y_{i}}{2}\left((2\bar{v}_{i}-\Delta v_{i})\left(1+\frac{1-\epsilon y_{i}-x_{i}}{2}\right)\right).$

6.2 Reduced node dynamics

Taking the singular limit of the dynamics (23), (24) associated with node $i$ yields a reduced system whose dynamics are given by a rational polynomial. This is formalized in the following theorem, which is a straightforward application of [18, Theorem 1] stated above as Theorem 1.1 and whose proof is reproduced here.

Theorem 6.1.

In the singular limit $\epsilon\to 0$ , the dynamics (23), (24) reduce to

(25)

\dot{x}_{i}=\frac{\sigma}{2\bar{v}_{i}}(1-x_{i}^{2})\frac{2x_{i}+3\alpha_{i}}{6+\alpha_{i}x_{i}},

where $\alpha_{i}=\Delta v_{i}/\bar{v}_{i}$ .

Proof 6.2.

The proof follows the standard procedure for analyzing singularly-perturbed systems. First note that $x_{i}$ is the slow and $y_{i}$ the fast variable. Taking the singular limit $\epsilon\to 0$ of (23) and (24) yields

(26)		$\displaystyle\dot{x}_{i}$	$\displaystyle=f_{x}(x_{i},y_{i};\Delta v_{i},\bar{v}_{i},0)=\bar{v}_{i}x_{i}y_{i}+\frac{3\Delta v_{i}y_{i}}{2}$
(27)		$\displaystyle 0$	$\displaystyle=g_{y}(x_{i},y_{i};\Delta v_{i},\bar{v}_{i},0)=-\frac{y_{i}}{2}\left(6\bar{v}_{i}+\Delta v_{i}x_{i}\right)+\frac{\sigma}{2}\left(1-x_{i}^{2}\right).$

Solving Equation (27) for the fast variable $y_{i}$ yields

y_{i}=h(x_{i}):=\frac{\sigma(1-x_{i}^{2})}{6\bar{v}_{i}+\Delta v_{i}x_{i}}=\frac{\sigma}{\bar{v}_{i}}\frac{1-x_{i}^{2}}{6+(\Delta v_{i}/\bar{v}_{i})x_{i}},

which defines the slow manifold $\{(x_{i},y_{i})=(x_{i},h(x_{i}))\}$ . The system quickly converges to the slow manifold and then $x_{i}$ slowly evolves on the slow manifold. Using the expression $y_{i}=h(x_{i})$ for the fast variable $y_{i}$ in terms of the slow variable $x_{i}$ yields the reduced slow dynamics

\dot{x}_{i}=f_{x}(x_{i},h(x_{i});\Delta v_{i},\bar{v}_{i},0)=\frac{\sigma}{2\bar{v}_{i}}(1-x_{i}^{2})\frac{2x_{i}+3\Delta v_{i}/\bar{v}_{i}}{6+(\Delta v_{i}/\bar{v}_{i})x_{i}}.

Defining $\alpha_{i}=\Delta v_{i}/\bar{v}_{i}$ yields the desired result (25).

The implication of this result is that, in the singular limit $\epsilon\to 0$ , $\bar{m}_{i}\to 1/2$ and $\Delta m_{i}=x_{i}$ follows the dynamics (25). The coordinates of $\mathbf{m}_{i}$ associated with a node $i$ then reduce to

\mathbf{m}_{i}=\left(\frac{2\bar{m}_{i}+\Delta m_{i}}{2},\frac{2\bar{m}_{i}-\Delta m_{i}}{2}\right)^{T}=\left(\frac{1+\Delta m_{i}}{2},\frac{1-\Delta m_{i}}{2}\right)^{T},

where $\Delta m_{i}$ evolves according to (25). As shown in in Figure 3, the dynamics (25) have equilibria $x_{i}=\pm 1,-3\alpha_{i}/2$ whose existence and stability properties depend on the unfolding parameter $\alpha_{i}$ .

6.3 Reduced tree dynamics

The reduced dynamics of the full tree then follow from the reduction at each internal node $i$ introduced in Theorem 6.1. As noted above, the dynamics (11) of the tree state $\mathbf{m}$ consists of stacked copies of the node dynamics $\dot{}\mathbf{m}_{i}=f_{m}(\mathbf{m}_{i},\mathbf{v}_{i})$ . The reduced dynamics consist of stacked copies of the reduced dynamics (25). The equilibria of each state follow from the component dynamics. Formally, we have the following.

Corollary 6.3.

In the singular limit $\epsilon\to 0$ , the dynamics (11) reduce to

(28)

\dot{}\mathbf{x}=f_{\mathbf{x}}(\mathbf{x},\mathbf{v}),

where the $i^{th}$ component of $\mathbf{x}$ is equal to $x_{i}$ , the states $x_{i}$ associated with each node $i$ are stacked in depth-first order, and $f_{\mathbf{x}}$ consists of stacked copies of the singularly-reduced dynamics (25).

The dynamics (28) have equilibrium states $\mathbf{x}_{0}$ whose $i^{th}$ component $x_{i,0}$ is equal to $\pm 1$ or $-3\alpha_{i}/2$ . The existence and stability properties of these equilibrium values depends on the unfolding parameter $\alpha_{i}$ , as follows:

x_{i,0}=\begin{cases}+1,&\forall\alpha_{i}\in[-2,2],\text{ stable if }\alpha_{i}>-2/3\\ -1,&\forall\alpha_{i}\in[-2,2],\text{ stable if }\alpha_{i}<2/3\\ -2\alpha_{i}/3,&\forall\alpha_{i}\in[-2/3,2/3],\text{ unstable}.\end{cases}

The equilibria of the reduced dynamics (28) imply a set of equilibria of the projected dynamics $f_{o}$ defined in (14) whose values can be cleanly expressed in terms of the recursive definition (20). Formally, we have the following.

Corollary 6.4.

Take the singular limit $\epsilon\to 0$ of the dynamics (11) and consider the projection of these dynamics to the leaf states $\dot{}\mathbf{m}_{o}=f_{o}(\mathbf{z},\mathbf{v})$ given by (14). Denote the $i^{th}$ component of $\mathbf{m}_{o}$ as $m_{oi}=z_{o^{-1}(i)}$ . The projected dynamics have equilibria with components

(29)

m_{oi}=z_{o^{-1}(i)}=\prod_{j=1}^{|p_{i}|-1}\frac{1+a_{j}\Delta m_{j}^{*}}{2},

where $a_{j}$ and $\Delta m_{j}^{*}$ are given by

	$\displaystyle a_{j}$	$\displaystyle=\begin{cases}+1,&p_{o^{-1}(i)(j+1)}=\operatorname{lc}(p_{o^{-1}(i)j})\\ -1,&p_{o^{-1}(i)(j+1)}=\operatorname{rc}(p_{o^{-1}(i)j})\end{cases}\text{ and }$
	$\displaystyle\Delta m_{j}^{*}$	$\displaystyle=\Delta m_{p_{o^{-1}(i)j}}=\begin{cases}+1,&\forall\alpha_{p_{o^{-1}(i)j}}\in[-2,2],\\ -1,&\forall\alpha_{p_{o^{-1}(i)j}}\in[-2,2],\\ -2\alpha_{p_{o^{-1}(i)j}}/3,&\forall\alpha_{p_{o^{-1}(i)j}}\in[-2/3,2/3].\\ \end{cases}$

Equilibria are stable if each $\Delta m_{j}^{*}$ is a stable equilibrium, where the stability of each $\Delta m_{j}^{*}$ is given in Corollary 6.3.

Proof 6.5.

Recall that the value of $m_{oi}=z_{o^{-1}(i)}$ associated with an option $i$ can be expressed using the recursive definition (20) in terms of $\bar{m}_{j}$ and $\Delta m_{j}$ associated with nodes on the shortest path from the root node to the leaf node that represents the option $i$ . In the singular limit $\epsilon\to 0$ , we have $\bar{m}_{j}\to 1/2\forall j$ and $\dot{}\Delta m_{j}$ following the dynamics (28) with equilibria given in Corollary 6.3. Substituting the values of $\bar{m}_{j}$ and $\Delta m_{j}$ yields the expression (29).

The stability property follows by contradiction. If any $\Delta m_{j}^{*}$ in the product (29) corresponds to an unstable equilibrium of the singularly-perturbed dynamics (28), then the overall equilibrium will be unstable.

The implication of this result is clear from noting that the intermediate equilibrium $\Delta m_{j}^{*}=-2\alpha_{p_{o^{-1}(i)j}}/3$ is always unstable when it exists. Furthermore, the negative signs from $a_{j}$ and $\Delta m_{j}^{*}$ cancel out when the path $p_{i}$ passes from a parent to a right child. Thus, when the option value $v_{i}$ is sufficiently high so that $\alpha_{j}>-2/3$ whenever $p_{i}$ passes from a parent to a left child and that $\alpha_{j}<2/3$ whenever $p_{i}$ passes from a parent to a right child. In this case, each element in the product (29) is equal to one. Thus, when $v_{i}$ is sufficiently large relative to the other option values, the unique stable equilibrium of the projected dynamics (14) is the state $\mathbf{m}_{o}=e_{i}$ , where $e_{i}$ is the indicator vector with entry $i$ equal to 1 and all other entries equal to zero. Therefore, in the singular limit and when one option value is sufficiently large relative to the others, the dynamics (14) carries out an $\arg\max$ operation on the value vector $\mathbf{v}$ . When several option values are relatively large, the dynamics (14) effectively performs a sort of dynamical $\arg\max$ operation whose output depends on initial conditions. See Section 7.2 for a numerical example.

7 Numerical examples

In this section we show the results of numerical simulations of the dynamics. All the computations have been carried out with code that is publicly available from the author’s website [24]. The code is completely general in the sense that it implements the dynamics (11) for a generic binary tree $T$ . For clarity of presentation, all the simulations in this section are based on a binary tree parsing four options, as shown in Figure 8. In all simulations, the parameter $\sigma$ in (3) is set equal to 4 wherever it appears (once in (11) for each internal node).

Ordered node list: $(0,1,2,3,4,5,6)$
Ordered option list: (2,3,5,6)

Figure 8: Tree

T

used in the simulations.

T

is a parsing of four options, corresponding to the nodes

(2,3,5,6)

. The nodes are labeled with numbers according to the order in which they will be visited during a depth-first traversal.

7.1 Bifurcation characteristics

Here we present the results of several simulations illustrating the bifurcation characteristics of the system as studied in Section 5. Figures 9 and 10 study the case of equal option values (i.e., $v_{i}=v$ for each option $i\in\{1,\ldots,4\}$ ) and show how the system (11) bifurcates from having a single stable equilibrium to having five equilibria as the option value $v$ is increased past the critical value $v^{*}\approx 1.9058$ defined by (17).

For the simulation shown in Figure 9, $v=1.25<1.9058\approx v^{*}$ , so the system is in the pre-bifurcation regime. As predicted by Theorem 5.4, the dynamics (11) have an equilibrium $\mathbf{m}=\bar{m}\mathbf{1}$ , where $\bar{m}$ is defined by (18). This equilibrium value of $\mathbf{m}$ gets projected to an equilibrium value $\mathbf{m}_{o}$ of (14) whose $i^{th}$ component, corresponding to the $i^{th}$ option, is equal to $\bar{m}^{d}$ , with $d=|p_{o^{-1}(i)}|$ being the distance from the root of the tree $T$ to the $i^{th}$ option.

The simulation shown in Figure 10 is identical to that shown in Figure 9, except that now we set $v_{i}=v=5$ so that the system is in the post-bifurcation regime. The four panels show trajectories resulting from four different initial conditions along with the (now unstable) deadlock equilibrium located at $\mathbf{m}_{o}=\bar{m}\mathbf{1}$ . As suggested by Corollary 11, in the post-bifurcation regime there is an additional set of four symmetric stable equilibria corresponding to a clear preference for each of the four equally-valued options. The equilibrium to which a trajectory is attracted depends on initial conditions. In panel (a), initial conditions were $\mathbf{m}=(\mathbf{m}_{0}^{T},\mathbf{m}_{1}^{T},\mathbf{m}_{4}^{T})^{T}=(0.2,0.1,0.3,0.2,0.4,0.2)^{T}$ , corresponding to a weak initial preference for option 1. This initial preference determines the attracting equilibrium. The other three panels (b)–(d) use initial conditions that are permutations of those from panel (a). These permutations correspond to tree isomorphisms that exchange option 1 with options 2–4, respectively. As expected, the attracting equilibrium changes from option 1 to option 2–4 accordingly.

7.2 Singularly-perturbed system

In Figure 11 we present the results of a simulation illustrating the results of Section 6, particularly Corollary 6.4. The values of the four options are set equal to $(100,100,300,100)^{T}=100(1,1,3,1)^{T}$ , which puts the system close to the singularly-perturbed regime with $\epsilon=1/100\ll 1$ . These option values are such that the unfolding parameters of the internal nodes are $\alpha_{0}=-2/3,\alpha_{1}=0,$ and $\alpha_{4}=1$ . In this case, Corollary 6.4 predicts that there should be a unique stable equilibrium at $\mathbf{m}_{o}=(0,0,1,0)^{T}$ corresponding to an absolute preference for the high-value option 3.

The results shown in Figure 11 confirm this prediction of a unique stable equilibrium, as the trajectories of the projected dynamics (14) all converge to $\mathbf{m}_{0}=(0,0,1,0)^{T}$ for four different simulations with initial conditions corresponding to the simulations shown in Figure 10(a)–(d).

8 Conclusion

In this paper, we have developed a dynamical systems model of value-based decision making. The structure of the decision, and of the dynamical system itself, is encoded in a binary tree structure. The binary tree structure allows us to decompose a decision among $N$ options into a set of $N-1$ binary decisions arranged in a hierarchical structure. We then represent this decomposed decision as a dynamical system (11) whose vector field is defined recursively by parsing down the binary tree. At each internal node of the tree, the system makes decisions based on the values associated with the node’s two children, putting higher weight on the higher-value child. The $N$ leaf nodes of the tree represent the $N$ options.

The vector field (11) has symmetries that correspond to isomorphisms of the underlying tree and associated option values. When the $N$ options all have the same value, all isomorphisms of the tree leave the vector field equivariant; when only some options have the same value, a smaller set of isomorphisms leave the vector field equivariant. The equilibria of the vector field have significant structure, organized around an $N-1$ -parameter unfolding of a pitchfork singularity as shown in Theorem 5.4. The unfolding parameters of the pitchfork consist of the relative difference in values between the children of each internal node of the underlying tree. As shown in Corollary 6.4, the system equilibria correspond to point attractors at states that correspond to a preference for the high-value option. In a singular limit, this preference becomes absolute in the sense that no weight is accorded to any other option.

Further work remains to be done to understand the structure of the symmetry group of the vector fields (11) and (14), particularly in the case that only a subset of the options have identical values. In this case, the symmetry group will be a subgroup of the original group $\Gamma_{T}$ , and such subgroups likely have interesting structure. Similarly, further work remains to be done to understand the structure of equilibria of the vector fields in the post-bifurcation regime. The main tool here is the equivariant branching lemma [20, Theorem 3.3], which again leverages the subgroup structure of the symmetry group $\Gamma_{T}$ .

There are a number of interesting questions raised by the binary tree structure of our model. For example, consider a generic case of deciding among $N>2$ options. The binary tree structure appears to be a strong constraint on the structure of the decision-making process. A more general value-based decision-making model, such as the one whose analysis was begun in [6], could have similar unfolding characteristics with fewer structural constraints. An open question is to understand the effect of the constraints imposed by the binary tree structure. Are there decisions that can be made by a model encoded as a flat graph (i.e., without a hierarchy structure) that cannot be made by our binary-tree-based model?

As discussed in the introduction, we anticipate the model developed in this paper to be valuable for a variety of problems requiring models of value-based decision-making behavior. We are actively pursuing applications in the area of control systems and robotics where options correspond to control vector fields and the present model affords a method to compose multiple such vector fields. In particular, we are developing methods to derive dynamics of the option values $v_{i}$ such that the overall system achieves a complex behavior specified, e.g., in terms of temporal logic. This work has the potential to unite dynamical systems with so-called formal methods tools [25] for control.

Acknowledgement

We thank Daniel Koditschek for discussions that led to the concept of a parsing. This work was supported in part by Air Force Research Laboratory grant FA8650-15-D-1845 subcontract 669737-6 and grant FA8650-19-C-1712 subcontract 670956-1.

References

[1] T. D. Seeley, P. K. Visscher, T. Schlegel, P. M. Hogan, N. R. Franks, and J. A. Marshall, “Stop signals provide cross inhibition in collective decision-making by honeybee swarms,” Science, vol. 335, no. 6064, pp. 108–111, 2012.
[2] J. A. Marshall, R. Bogacz, A. Dornhaus, R. Planqué, T. Kovacs, and N. R. Franks, “On optimal decision-making in brains and social insect colonies,” Journal of the Royal Society Interface, vol. 6, no. 40, pp. 1065–1074, 2009.
[3] D. Pais, P. M. Hogan, T. Schlegel, N. R. Franks, N. E. Leonard, and J. A. Marshall, “A mechanism for value-sensitive decision-making,” PloS one, vol. 8, no. 9, p. e73216, 2013.
[4] A. Reina, J. A. R. Marshall, V. Trianni, and T. Bose, “Model of the best-of- $n$ nest-site selection process in honeybees,” Physical Review E, vol. 95, no. 5, 2017.
[5] R. Gray, A. Franci, V. Srivastava, and N. E. Leonard, “Multi-agent decision-making dynamics inspired by honeybees,” IEEE Transactions on Control of Network Systems, vol. 5, no. 2, pp. 793–806, 2018.
[6] A. Franci, M. Golubitsky, and N. E. Leonard, “The dynamics of multi-agent multi-option decision making,” 2019.
[7] J. J. Gibson, The theory of affordances. The ecological approach to visual perception. Houghton Mifflin Boston, MA, 1979.
[8] K. Friston, P. Schwartenbeck, T. FitzGerald, M. Moutoussis, T. Behrens, and R. J. Dolan, “The anatomy of choice: active inference and agency,” Frontiers in human neuroscience, vol. 7, p. 598, 2013.
[9] N. F. Lepora and G. Pezzulo, “Embodied choice: how action influences perceptual decision making,” PLoS computational biology, vol. 11, no. 4, p. e1004110, 2015.
[10] P. Cisek and J. F. Kalaska, “Neural mechanisms for interacting with a world full of action choices,” Annual review of neuroscience, vol. 33, pp. 269–298, 2010.
[11] P. Cisek, “Cortical mechanisms of action selection: the affordance competition hypothesis,” Philosophical Transactions of the Royal Society B: Biological Sciences, vol. 362, no. 1485, pp. 1585–1599, 2007.
[12] A. P. Duchon, L. P. Kaelbling, and W. H. Warren, “Ecological robotics,” Adaptive Behavior, vol. 6, no. 3-4, pp. 473–507, 1998.
[13] P. Zech, S. Haller, S. R. Lakani, B. Ridge, E. Ugur, and J. Piater, “Computational models of affordance in robotics: a taxonomy and systematic classification,” Adaptive Behavior, vol. 25, no. 5, pp. 235–271, 2017.
[14] P. B. Reverdy and D. E. Koditschek, “A dynamical system for prioritizing and coordinating motivations,” SIAM Journal on Applied Dynamical Systems, vol. 17, no. 2, pp. 1683–1715, 2018.
[15] P. Reverdy, V. Vasilopoulos, and D. E. Koditschek, “Motivation dynamics for autonomous composition of navigation tasks,” Submitted, 2020.
[16] M. Golubitsky and D. Schaeffer, Singularities and Groups in Bifurcation Theory, ser. Applied Mathematical Sciences. Springer, 1985, vol. 51.
[17] M. Golubitsky and I. Stewart, The symmetry perspective: from equilibrium to chaos in phase space and physical space. Springer Science & Business Media, 2003, vol. 200.
[18] P. B. Reverdy, “Two paths to finding the pitchfork bifurcation in motivation dynamics,” in Proc. IEEE Conf. Decision and Control, 2019, pp. 8030–8035.
[19] R. Olfati-Saber and R. M. Murray, “Consensus problems in networks of agents with switching topology and time-delays,” IEEE Transactions on automatic control, vol. 49, no. 9, pp. 1520–1533, 2004.
[20] M. Golubitsky, I. Stewart, and D. G. Schaeffer, Singularities and groups in bifurcation theory: Vol. II, ser. Applied Mathematical Sciences. New York: Springer-Verlag, 1988, no. 69.
[21] W. Dicks and M. J. Dunwoody, Groups acting on graphs. Cambridge University Press, 1989, vol. 17.
[22] A. V. Aho, J. E. Hopcroft, and J. D. Ullman, The design and analysis of computer algorithms. Addison-Wesley, 1974.
[23] M. Hazewinkel. Binary tree. Encyclopedia of Mathematics. [Online]. Available: {http://www.encyclopediaofmath.org/index.php?title=Binary_tree&oldid=31607}
[24] P. B. Reverdy. (2020, Mar.) preverdy/binary-tree-decisions: First release. [Online]. Available: https://doi.org/10.5281/zenodo.3698325
[25] H. Kress-Gazit, M. Lahijanian, and V. Raman, “Synthesis for Robots: Guarantees and Feedback for Robot Behavior,” Annual Review of Control, Robotics, and Autonomous Systems, vol. 1, no. 1, pp. 211–236, 2018. [Online]. Available: https://doi.org/10.1146/annurev-control-060117-104838

Dynamical, value-based decision making among NN options: a constructive approach to unfolding the symmetric pitchfork bifurcation