This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

thanks: Presented at the Traffic and Granular Flow 22 conference, held in New Delhi, India.

Collective traffic of agents that remember

Danny Raj M [email protected]; [email protected]    Arvind Nayak Dept of Chemical Engineering, Indian Institute of Science Bangalore, Bengaluru 560012, Karnataka, India
Abstract

Traffic and pedestrian systems consist of human collectives where agents are intelligent and capable of processing available information, to perform tactical manoeuvres that can potentially increase their movement efficiency. In this study, we introduce a social force model for agents that possess memory. Information of the agent’s past affects the agent’s instantaneous movement in order to swiftly take the agent towards its desired state. We show how the presence of memory is akin to an agent performing a proportional–integral control to achieve its desired state. The longer the agent remembers and the more impact the memory has on its motion, better is the movement of an isolated agent in terms of achieving its desired state. However, when in a collective, the interactions between the agents lead to non-monotonic effect of memory on the traffic dynamics. A group of agents with memory exiting through a narrow door exhibit more clogging with memory than without it. We also show that a very large amount of memory results in variation in the memory force experienced by agents in the system at any time, which reduces the propensity to form clogs and leads to efficient movement.

I Introduction

The ability to think is a key distinction between agents in human collectives, like vehicular and pedestrian traffic flows [1, 2, 3, 4], and granular collectives [5, 6], which are made up of grains that respond and move only to external forcing. Therefore, when one writes down a set of governing equations either in the form of rules of interaction or social forces, it is important to specify those that distinguish a living agent from one that is non-living. It would be an interesting problem to explore how this distinction manifests in the dynamics of the collective. Since a comprehensive model for human cognition (or thinking) is incredibly hard to formulate, we adopt a bottom-up approach where we investigate how a certain facet of intelligence affects the collective dynamics.

One of the key characteristics of a living agent is its ability to assimilate information, process it and alter the agent’s decisions based on it. A critical aspect in this process is the agent’s memory. An agent can remember its past actions and change its current actions accordingly to achieve better performance. Memory of an agent can affect the dynamics across several time scales: from learning route-specific rules over several days and months, to remembering its recent trajectory and reacting to immediate changes in the environment/jams. Here, we concentrate on the latter. We develop a social force model for the memory of an agent based on its past movement. Remembering how it moved in the recent past, the agent evaluates how well it has achieved its desired velocity and takes a tactical decision—to make up for the sub-optimal past movement.

In this article, we introduce a social force model for agents that exhibit memory. Memory is characterised by two parameters: i) how long the agent remembers and, ii) impact of memory on the movement. We first analyse the model equations to understand the impact of memory on the dynamics of an isolated agent. Then, we proceed to test the effect of memory on the collective. We simulate agents attempting to exit via narrow door where agents are known to exhibit clogging behaviour near the exits. Our objective is to understand how memory aids in the collective motion: Does the presence of memory always guarantee efficient movement? How does it impact the propensity to form or displace agents in a clog?

II Model for traffic dynamics

II.1 Agents with memory

The motion of agents is modelled using a social force model, similar to Helbing et al [7, 8, 9]. The velocity of an agent evolves in time based on Eq 1, where: i) the first term in the RHS is the restitution force that restores the agent to its desired direction and speed 𝐯0\mathbf{v}_{0}, ii) the second term is the force due to the effect of the memory of an agent and, iii) the third term 𝐈j,bi\mathbf{I}_{j,b}^{i} is the net social interaction of the agent to avoid collision with other agents and the boundary.

md𝐯idt=mτ(𝐯0𝐯i)+β~𝐌i+Ij,bim\frac{d\mathbf{v}_{i}}{dt}=\frac{m}{\tau}(\mathbf{v}_{0}-\mathbf{v}_{i})+\tilde{\beta}\mathbf{M}_{i}+I_{j,b}^{i} (1)

Here, the memory 𝐌i(t)\mathbf{M}_{i}(t) of an agent ii at time tt, is defined as the total deviation of the velocity of the agent from its desired velocity 𝐯0\mathbf{v}_{0} in the time window [tT,t][t-T,t] of the agent’s recent past (see Eq 2).

𝐌i=tTt(𝐯0𝐯i(t))𝑑t\mathbf{M}_{i}=\int_{t-T}^{t}\big{(}\mathbf{v}_{0}-\mathbf{v}_{i}(t^{\prime})\big{)}dt^{\prime} (2)

When the agent is unable to travel at its desired speed and direction, the memory term takes a positive value and offers an additional force to help restore the agent’s motion sooner.

Ij,bi=j𝒩i[𝐅i,jn+𝐅i,jt]+𝐅bn+𝐅btI_{j,b}^{i}=\sum_{\forall j\in\mathcal{N}_{i}}\Big{[}\mathbf{F}_{i,j}^{n}+\mathbf{F}_{i,j}^{t}\Big{]}+\mathbf{F}_{b}^{n}+\mathbf{F}_{b}^{t} (3)

The term 𝐈j,bi\mathbf{I}_{j,b}^{i} can take different forms depending on the context of the traffic problem. For pedestrian dynamics [8], Helbing and co-workers considered interaction forces similar to that used in granular flows: a sum of the total normal and tangential (frictional) forces due to contact. However, 𝐈j,bi\mathbf{I}_{j,b}^{i} can also include small-ranged forces that prevent collisions [9], which are more suitable for traffic flow problems. The qualitative results shown in the paper do not depend on the exact choice of the models for interactions.

II.2 Connections to control theory.

If one were to imagine 𝐯0\mathbf{v}_{0} as the set-point for a given agent, i.e., the desired state that the agent wants to achieve, then the restitution force and the memory term in Eqs 1 and 2, exactly resemble a proportional and an integral parts of a controller (PI), respectively. Presence of an integral component can result in overshoots and oscillations before the agent reaches its set-point. In the absence of memory and when there are continuous collisions with obstacles as agents move, we can expect the dynamics of the agent to show an offset (not reach 𝐯0\|\mathbf{v}_{0}\|)—since, it is well known that a proportional controller cannot take the system exactly to its set-point. Addition of memory could guarantee reaching the set-point, even in the presence of obstacles.

One could also conceive a dynamics resembling a PID controller. This modification does not qualitatively change the dynamics of the system. Since, adding a derivative term for the deviation 𝐯0𝐯i\mathbf{v}_{0}-\mathbf{v}_{i}, gets absorbed into the inertia (LHS term of Eq 1). In other words, the derivative term acts like an effective mass term that slows the response of the agent to any social force.

II.3 Memory as a state of the agent.

The memory term in Eq 2 is computed within a time window of [tT,t][t-T,t]. Any information from outside the time window is not used and all the information within are equally weighted. A simpler and more elegant formulation can be arrived at, if we weight the information with an exponential weighting as shown in Eq 4.

𝐌i=0tettα~(𝐯0𝐯i(t))𝑑t\mathbf{M}_{i}=\int_{0}^{t}e^{\frac{t^{\prime}-t}{\tilde{\alpha}}}\big{(}\mathbf{v}_{0}-\mathbf{v}_{i}(t^{\prime})\big{)}dt^{\prime} (4)

An exponential weighting prioritises information close to the current time instant than from a distant past. This allows us to move away from a discontinuous time window set by TT and towards a time scale α~\tilde{\alpha}. Then, we can differentiate Eq 4 with time, using the Leibniz rule for differentiating under the integral sign to get Eq 5.

d𝐌idt=𝐯0𝐯i𝐌iα~\frac{d\mathbf{M}_{i}}{dt}=\mathbf{v}_{0}-\mathbf{v}_{i}-\frac{\mathbf{M}_{i}}{\tilde{\alpha}} (5)

This allows us to convert the memory term, which was previously an integral, into a dynamic state of the agent. Eq 5 describes the evolution of this memory-state of the agent: 𝐯0𝐯i\mathbf{v}_{0}-\mathbf{v}_{i} serves as the instantaneous source for the memory while 𝐌iα~-\frac{\mathbf{M}_{i}}{\tilde{\alpha}} is the rate at which memory decays.

III Results and Discussion

III.1 Dynamics of a single agent

The equations for the dynamics of an isolated agent, far away from the boundary, can be written compactly as in Eq 6. This is after: i) the governing equations in Eq 1 are scaled (tt with τ\tau, 𝐯\|\mathbf{v}\| with 𝐯0\|\mathbf{v}_{0}\|, 𝐌\|\mathbf{M}\| with 𝐯0/τ\|\mathbf{v}_{0}\|/\tau), ii) the interaction terms are dropped and, iii) the scaled memory parameters become α=α~/τ\alpha=\tilde{\alpha}/\tau and β=β~/m\beta=\tilde{\beta}/m.

ddt(vxvyMxMy)=(10β0010β101α00101α)×(vxvyMxMy)+(1010)\frac{d}{dt}\begin{pmatrix}v_{x}\\ v_{y}\\ M_{x}\\ M_{y}\end{pmatrix}=\begin{pmatrix}-1&0&\beta&0\\ 0&-1&0&\beta\\ -1&0&-\frac{1}{\alpha}&0\\ 0&-1&0&-\frac{1}{\alpha}\end{pmatrix}\times\begin{pmatrix}v_{x}\\ v_{y}\\ M_{x}\\ M_{y}\end{pmatrix}+\begin{pmatrix}1\\ 0\\ 1\\ 0\end{pmatrix} (6)

To understand the effect of memory (α\alpha and β\beta) on the movement characteristics of a single agent, it is enough to look at the eigen values of the matrix in Eq 6. The eigen values are 12α×[(α+1)±4α2β+α22α+1]\frac{1}{2\alpha}\times\big{[}-(\alpha+1)\pm\sqrt{-4\alpha^{2}\beta+\alpha^{2}-2\alpha+1}~{}\big{]}, repeated twice. Analysing the eigen values helps us partition the βα\beta-\alpha space into four regions (see Fig 1, i).

Refer to caption
Figure 1: (i) Regions with qualitatively different dynamics in the βα\beta-\alpha parameter space. Coloured (shaded) regions exhibit stable dynamics, which can be oscillatory, non-oscillatory and opposing (negative β\beta). The unshaded region marks the unstable region, which is found only when β\beta takes a sufficiently large negative value. (ii)-(vii) Time evolution of the x components of velocity and memory of the isolated agent for different memory parameters. (ii) α=0.3;β=2\alpha=0.3;\beta=2, (iii) α=3;β=0.2\alpha=3;\beta=0.2, (iv) α=0.75;β=3\alpha=0.75;\beta=3, (v) α=3;β=3\alpha=3;\beta=3, (vi) α=0.4;β=1\alpha=0.4;\beta=-1, (vii) α=3;β=1\alpha=3;\beta=-1.

Stability: When β>0\beta>0, the memory term aids in restoring the agent to its desired state at steady state (see Fig 1, ii). And when β<0\beta<0, memory opposes the agent’s attempt to reach its desired state. However, since the restitution force (𝐯0𝐯i)(\mathbf{v}_{0}-\mathbf{v}_{i}) always acts to restore the agent’s dynamics, a small negative β\beta memory does not immediately destabilise the dynamics and only increases the time taken to reach steady state (see Fig 1, vi). However, when β<1α\beta<-\frac{1}{\alpha}, the eigen values begin to have a positive real part and the memory term overpowers the restitution force leading to unstable dynamics (see Fig 1, vii).

Overshoots & oscillations: When β>1412α+14α2\beta>\frac{1}{4}-\frac{1}{2\alpha}+\frac{1}{4\alpha^{2}}, the eigen values become complex conjugates and the dynamics become under-damped which results in oscillations in both the memory and the velocity of the agent (Fig 1, iv and v). Also, when α=1\alpha=1, any β\beta greater than 0 results in oscillations. This divides the region exhibiting non-oscillatory dynamics into two. The high-α\alpha part of this region exhibits dynamics where we observe overshoots; i.e. the velocity takes a larger value before asymptotically tapering to the desired, steady state value (see Fig 1, ii and iii to compare the dynamics in the low-α\alpha and high-α\alpha regions).

III.2 Collective escape through a narrow exit

To understand the effect of memory on the dynamics of the collective, we consider a well known system: agents exiting a room via a narrow door (See inset of figure 2 i). We use the conditions similar to that in ref [8]. The governing equations in Eq 1 and 5 are simulated where every agent has a 𝐯0\mathbf{v}_{0}, directed along the line joining the agent centre and the mid-point of the exit of the door. The inter-agent interactions lead to collisions between agents as they crowd near the exit, slowing them down and giving rise to temporary clogging. Now as agents slow down, memory force increases since the source for the memory is 𝐯0𝐯i\mathbf{v}_{0}-\mathbf{v}_{i}. This makes the agents push harder as they attempt to exit the room. To understand how these forces experienced by the agents lead to the collective escape of these agents, we introduce an order parameter 𝒞\mathcal{C} that characterises the clogging propensity. Here, 𝒞\mathcal{C} is simply the fraction of the total time when the agent-number in the room remains a constant. With this definition, a high value of 𝒞\mathcal{C} would correspond to the prevalence of a large number of clogging events during which agent number in the room remains unchanged.

Refer to caption
Figure 2: (i) - Heat map of the clogging order parameter 𝒞\mathcal{C}, averaged over 7575 independent realisations, plotted for the memory parameters β[0,10]\beta\in[0,10] and α[0.1,30]\alpha\in[0.1,30]. INSET: a snapshot of agents exiting a room through a narrow exit. (ii) - Time evolution of the mean and the standard deviation of the memory of agents within the room, averaged over 7575 independent realisations. This is plotted for three different memory configurations one from each of the regions II, III and IV. Parameters considered are α=22.2 and β=0.3\alpha=22.2\text{ and }\beta=0.3 for region II, α=22.2 and β=2\alpha=22.2\text{ and }\beta=2 for region III and α=22.2 and β=10\alpha=22.2\text{ and }\beta=10 for region IV.

Figure 2 i, shows the heat map of 𝒞\mathcal{C} in the βα\beta-\alpha space. We find the landscape of 𝒞\mathcal{C} to be non-monotonic with respect to both the principle axes: i.e., an increase in the memory does not necessarily reduce the propensity to form clogs. This observation is similar to the so-called faster-is-slower effect (FIS), reported by Helbing et al [8, 10, 11], where agents trying harder do not necessarily lead to more efficient escape. In addition, we find that the observed non-monotonic effect is more pronounced in certain regions of the αβ\alpha-\beta space. For instance, when either α\alpha or β\beta are high (see figure 2 i), the non-monotonicity with respect to the other parameter is well pronounced in comparison to when they are low. We believe this feature arises due to the intrinsic dependence of the memory force on the parameters α\alpha and β\beta as seen in figure 1. When an agent remembers only its recent past, i.e. when α\alpha is small, a high value of β\beta is required to produce a force of a similar magnitude to transition from non-oscillatory to oscillatory behaviour and vice versa.

We partition the αβ\alpha-\beta space in figure 2 i into four regions while preserving the αβ\alpha-\beta relationship discussed previously. Region I is monotonic with respect to 𝒞\mathcal{C} and completely covers and extends over the region in the αβ\alpha-\beta space corresponding to non-oscillatory behaviour at the agent-level (compare with figure 1). Region II corresponds to the ideal amount of memory that agents can possess to efficiently escape through the exit. Region III corresponds to the FIS phenomenon and it is sandwiched between regions II and IV which exhibit efficient collective escape.

To understand how memory affects the escape dynamics and the formation of clogs, we look at the averaged dynamics of the memory term for the agents within the room. Figure 2 ii shows the time evolution of the mean and standard deviation of the memory Mi(t)\|M_{i}(t)\| in system, averaged over all the independent realisations of the escape dynamics. The initial transient dynamics show that the effect of memory is more pronounced for the agents with lesser strength of memory (i.e., regions II>III>IVII>III>IV). This is because memory increases when the agents begin to slow down and agents with a higher strength of memory reach their desired state faster. This trend disappears as the agents clog near the entrance: both the mean and standard deviation of memory of the agents within the room increases rapidly in time. The dynamics of agents with memory in region III begin to have a higher overall memory with smaller variation between the agents within the room in comparison to II and IV. In other words, the strength and time scale of memory is such that all the agents try hard to exit by a similar amount which favours the formation of clogs and gives rise to a high value of 𝒞\mathcal{C}. However, the higher variation and the lower mean memory of agents with memory in region IV implies that the some agents have more memory than the rest, which favours a more efficient exit strategy giving rise to lower values for 𝒞\mathcal{C}.

IV Conclusion

A key feature of the traffic characteristics of living collectives is the ability of individual agents to assimilate dynamic information and alter their movement appropriately. In this article, we introduce memory, a facet of intelligence where an agent remembers its trajectory from its recent past and quantifies how well it was able to achieve its desired velocity. The agent makes up for any non-optimal movement in its recent past with a social force proportional to the memory. We show that an agent’s memory has an effect akin to a Proportional–Integral controller. Depending on the memory parameters α\alpha and β\beta, the agent dynamics can be stable or unstable, and give rise to overshoots and oscillations. The eigen values of the system shed light on the boundaries that partition the different regions in the βα\beta-\alpha space that show qualitatively different dynamics.

While the effect of memory on the dynamics of the agent is monotonic when only a single agent is considered, it is not the case when agents are in a crowd. We study the effect of memory on the dynamics of agents exiting a room through a narrow door. We find that the presence of memory does not always improve the movement of the agents: the clogging order parameter 𝒞\mathcal{C} has a non-monotonic landscape in the βα\beta-\alpha space resulting in a behaviour similar to the well known faster-is-slower effect. This allows us to partition the βα\beta-\alpha space into four regions based on a order parameter that quantifies clogging propensity. We find the eigen values corresponding to the motion of individual agents, in the non-monotonic region of the βα\beta-\alpha space, to be complex. In other words, the observed FIS effect is a result of the under-damped response of agents arising due to the integral component of the memory force term. The time dynamics of the mean and standard deviation of the memory of agents within the room, reveal why some regions in the βα\beta-\alpha space favour the formation of clogs giving rise to non-monotonicity.

Acknowledgements

The authors thank the DST INSPIRE faculty award (grant number: DST/INSPIRE/04 /2017/002985) for the funding. DRM thanks V Kumaran, IISc Bangalore for the discussions during the early stages of the work.

References

  • Moussaid et al. [2009] M. Moussaid, S. Garnier, G. Theraulaz, and D. Helbing, Collective Information Processing and Pattern Formation in Swarms, Flocks, and Crowds, Topics in Cognitive Science 1, 469 (2009).
  • Seitz et al. [2016] M. J. Seitz, N. W. Bode, and G. Köster, How cognitive heuristics can explain social interactions in spatial movement, Journal of the Royal Society Interface 1310.1098/rsif.2016.0439 (2016).
  • Rahmati and Talebpour [2018] Y. Rahmati and A. Talebpour, Learning-based game theoretical framework for modeling pedestrian motion, Physical Review E 98, 032312 (2018).
  • Nicolas et al. [2019] A. Nicolas, M. Kuperman, S. Ibañez, S. Bouzat, and C. Appert-Rolland, Mechanical response of dense pedestrian crowds to the crossing of intruders, Scientific Reports 9, 105 (2019).
  • Candelier and Dauchot [2010] R. Candelier and O. Dauchot, Journey of an intruder through the fluidization and jamming transitions of a dense granular media, Physical Review E 81, 011304 (2010).
  • Guo and Curtis [2015] Y. Guo and J. S. Curtis, Discrete element method simulations for complex granular flows, Annual Review of Fluid Mechanics 47, 21 (2015).
  • Helbing and Molnár [1995] D. Helbing and P. Molnár, Social force model for pedestrian dynamics, Physical Review E 51, 4282 (1995).
  • Helbing et al. [2000a] D. Helbing, I. Farkas, and T. Vicsek, Simulating dynamical features of escape panic, Nature 407, 487 (2000a).
  • Helbing et al. [2000b] D. Helbing, I. J. Farkas, and T. Vicsek, Freezing by Heating in a Driven Mesoscopic System, Physical Review Letters 84, 1240 (2000b).
  • Garcimartín et al. [2014] A. Garcimartín, I. Zuriguel, J. M. Pastor, C. Martín-Gómez, and D. R. Parisi, Experimental evidence of the ”faster is slower” effect, Transportation Research Procedia 2, 760 (2014).
  • Pastor et al. [2015] J. M. Pastor, A. Garcimartín, P. a. Gago, J. P. Peralta, C. Martín-Gómez, L. M. Ferrer, D. Maza, D. R. Parisi, L. a. Pugnaloni, and I. Zuriguel, Experimental proof of faster-is-slower in systems of frictional particles flowing through constrictions, Physical Review E 92, 062817 (2015).