This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Free utility model for explaining the social gravity law

Hao Wang Institute of Transportation System Science and Engineering, Beijing Jiaotong University, Beijing 100044, China    Xiao-Yong Yan [email protected] Institute of Transportation System Science and Engineering, Beijing Jiaotong University, Beijing 100044, China Compleχ\chi Lab, University of Electronic Science and Technology of China, Chengdu 611731, China    Jinshan Wu School of Systems Science, Beijing Normal University, Beijing 100875, China
Abstract

Social gravity law widely exists in human travel, population migration, commodity trade, information communication, scientific collaboration and so on. Why is there such a simple law in many complex social systems is an interesting question. Although scientists from fields of statistical physics, complex systems, economics and transportation science have explained the social gravity law, a theoretical explanation including two dominant mechanisms, namely individual interaction and bounded rationality, is still lacking. Here we present a free utility model, whose objective function is mathematically consistent with the Helmholtz free energy in physics, from the perspective of individual choice behavior to explain the social gravity law. The basic assumption is that bounded rational individuals interacting with each other will trade off the expected utility and information-processing cost to maximize their own utility. The previous explanations of the social gravity law including the maximum entropy model, the free cost model, the Logit model and the destination choice game model are all special cases under our model. Further, we extend the free utility model to the network. This model not only helps us to better understand the underlying mechanisms of spatial interaction patterns in complex social systems, but also provides a new perspective for understanding the potential function in game theory and the user equilibrium model in transportation science.

I Introduction

Predicting the mobility of people, goods and information between locations is an important problem in fields as diverse as sociology Garra16 , economics Niedercorn69 , demography Roy03 , epidemiology Jia20 , transportation science Dios11 and network science Bar11 . For more than a hundred years, scholars have proposed a variety of models to predict such mobility Dios11 ; Bar11 ; Stouffer40 ; Sen12 ; Simini12 ; Yan14 ; Yan17 ; Liu20 . These models are named spatial interaction models in economics Sen12 and trip distribution models in transportation science Dios11 . The gravity model is the most influential mobility prediction model and has been applied in many fields Bar11 . For example, it is used to predict population migration Fagiolo13 , commodity trade Karpiarz14 , commuting flows Masucci13 and public transportation flows Masucci13 ; Goh12 . The gravity model is popular because spatial interaction phenomena in these fields all obey a law known as the social gravity law under which the flow between two locations is proportional to the activity (usually quantified by population, GDP and so on) of these two locations and decays with the power function of the distance between them, similar to Newton’s law of universal gravitation. As early as 1846, Desart found that the railway passenger flow between stations in Belgium obeyed the social gravity law Odlyzko15 . This may be the earliest record of the discovery of the social gravity law. Later, Carey Carey58 and Ravenstein Ravenstein89 found that population migrations of the United States and the United Kingdom, respectively, obeyed the social gravity law. Additionally, Reilly Reilly29 found that retail business drawn from cities obeyed the social gravity law. In recent years, with the continuous development of modern electronic and information technology, there have been many approaches (such as GPS, mobile phones and social networking sites) to recording the mobility data of people, goods and information over long periods. By analyzing these data, scientists have found many phenomena showing obedience to the social gravity law in various systems Bar11 . For example, commuting flow in the United States Viboud06 , highway traffic flow in Korea Jung08 , telecommunication flow in Belgium Krings09 , global airline flow Balcan09 , global cargo ship movement Kaluza10 and even global scientific collaboration Pan12 followed the social gravity law.

Why does such a simple law apply to so many complex social systems? This problem has long excited the curiosity of many scholars. In the past half century, scientists have proposed different explanations of the roots of the social gravity law, among which Wilson’s maximum entropy model Wilson67 is the prevailing explanation. He proposed an approach to deriving the doubly-constrained gravity model by maximizing the entropy of the trip distribution between locations in the transportation system under the constraints of the total cost, the trip production volume and attraction volume of each location. However, unlike the total amount of internal energy available to all gas molecules in a system, which is determined exogenously, the total cost cannot be estimated in a real transportation system Hua79 . Tomlin and Tomlin Tomlin68 constructed a free cost model by analogy with the Helmholtz free energy in physics. This model can lead to the same results as Wilson’s maximum entropy model but does not need the prior constraint of the total cost. However, both the maximum entropy model and the free cost model are macroscopic explanations of the social gravity law. They provide only the most probable macroscopic distribution state but do not take into account individual choice behavior in destination choice Sheppard78 . On the other hand, economists have explained the social gravity law by describing individual choice behavior using utility theory, in which the most influential study is that of Domencich and McFadden, who applied random utility theory to model individual destination choice behavior DoMc75 . They assumed that the traveler always selects the destination with the highest utility, but her perception of the destination utility exhibits random error. If these errors follow the independent and identical Gumbel distribution, the Logit model Hensher18 can be derived. If the destination utility consists only of the destination attractiveness and the travel cost to the destination, the Logit model can derive the singly-constrained gravity model. However, all the aforementioned models neglect individual interaction, which is a ubiquitous phenomenon in many complex systems Vicsek12 . For example, in a real transportation system, a traveler considers not only the constant values of destination attractiveness and travel cost but also possible crowding at the destination and congestion on the way. Recently, Yan and Zhou Yan19 modeled the individual destination choice process as a congestion game, including interaction among travelers, and further derived the singly-constrained gravity model. However, they assumed that all travelers are perfectly rational and can accurately perceive the utilities of all destinations. In practice, individual rationality is bounded because of the intractability of the alternative choice problem and limited information-processing resources Simon72 . However, an explanation of the social gravity law that simultaneously reflects individual interaction and bounded rationality is still lacking.

In this paper, we develop a free utility model from the perspective of individual choice behavior to explain the social gravity law in the context of the destination choice problem in transportation science. First, we establish the individual choice model on the basis of how a bounded rational traveler trades off expected utility and information-processing cost. Then, we extend the individual choice model to the collective choice model, including infinite noninteractive or interactive travelers, and further derive the gravity model from the collective model. We next extend the collective model of interactive travelers, named the free utility model, to a network. Finally, we contrast the similarities and differences between the free utility model and the free energy in physics and further discuss the potential application value of the free utility model in different scientific fields.

II Model

II.1 Gravity model

Before presenting our model to explain the social gravity law, we first briefly introduce the gravity model that is widely used to predict the spatial interaction flow obeying the social gravity law. The earliest gravity model Odlyzko15 , generated from an analogy with Newton’s law of universal gravitation, has the following functional form:

Tij=αmimjdijβ,T_{ij}=\alpha\frac{m_{i}m_{j}}{d_{ij}^{\beta}}, (1)

where TijT_{ij} is the flow from origin ii to destination jj, mim_{i} is the activity (usually quantified by population) of location ii, dijd_{ij} is the distance between ii and jj, and α\alpha and β\beta are parameters. This original gravity model has a simple form and can be used to reproduce the distribution pattern of spatial interaction flow, but difficulties arise when predicting future flows. For example, the TijT_{ij} calculated by Eq. (1) would increase by a factor of 100 if the future populations mim_{i} and mjm_{j} both increase by factors of 10, which is clearly not realistic.

An improvement to overcome this shortcoming of the original gravity model is to discard the fixed parameter α\alpha and use the following equation to calculate the flow:

Tij=Oimjdijβjmjdijβ,T_{ij}=O_{i}\frac{m_{j}d_{ij}^{-\beta}}{\sum_{j}m_{j}d_{ij}^{-\beta}}, (2)

where OiO_{i} is the outflow of location ii, which is usually roughly proportional to the population of location ii, i.e., OiθmiO_{i}\approx\theta m_{i} Simini12 . The results calculated by Eq. (2) satisfy the constraint jTij=Oi\sum_{j}T_{ij}=O_{i}, so Eq. (2) is named the singly-constrained gravity model Dios11 .

The more commonly used gravity model in transportation science is the doubly-constrained gravity model Dios11 , which is constructed to satisfy two constraints, i.e., jTij=Oi\sum_{j}T_{ij}=O_{i} and iTij=Dj\sum_{i}T_{ij}=D_{j}, where DjD_{j} is the inflow of location jj. In addition, travel cost cijc_{ij} is used more often than distance dijd_{ij} in trip distribution forecasting, so the doubly-constrained gravity model can be written as

Tij=aibjOiDjf(cij),T_{ij}=a_{i}b_{j}O_{i}D_{j}f(c_{ij}), (3)

where f(cij)f(c_{ij}) is the travel cost function and ai=1/jbjDjf(cij)a_{i}=1/{\sum_{j}b_{j}D_{j}f(c_{ij})} and bj=1/iaiOif(cij)b_{j}=1/{\sum_{i}a_{i}O_{i}f(c_{ij})} are balancing factors.

Next, we will establish the free utility model from the perspective of individual choice behavior to derive the singly- and doubly-constrained gravity models and explain the social gravity law.

II.2 Individual choice model

Similar to some of the aforementioned studies explaining the social gravity law, we conduct our research in the context of individual destination choice behavior in the transportation system. In this system, there are MM origins labeled ii (i=1,2,,M)(i=1,2,\dots,M) and NN destinations labeled jj (j=1,2,,N)(j=1,2,\dots,N). OiO_{i} is the number of trips leaving origin ii. We initially start with the simplest system, in which there is only one origin (M=1)(M=1) and one trip (Oi=1)(O_{i}=1) made by a traveler who can select NN (N>1)(N>1) destinations. The utility of destination jj for this traveler is uiju_{ij}, which describes her satisfaction degree concerning the attractiveness of destination jj and the travel cost from origin ii to destination jj. According to the utility maximization principle, this traveler will select the destination with the highest utility only if she is perfectly rational Fishburn82 . However, it is difficult for a bounded rational traveler to exactly perceive the utilities of all destinations in reality. In this case, her choice is probabilistic, i.e., destination jj is selected with probability pijp_{ij}, which depends on uiju_{ij} Luce59 . If the traveler is completely uncertain about the utilities of all destinations, she can select destination jj only with probability pij=1Np_{ij}=\frac{1}{N}, and her expected utility jpijuij\sum_{j}p_{ij}u_{ij} is the average of the utilities of all destinations. If she wants to obtain higher expected utility, she must acquire knowledge about the utilities of the destinations through information processing Marsili99 . A natural measure of information processing is negative information entropy H=jpijlnpij-H=\sum_{j}p_{ij}\ln p_{ij} Marsili99 ; Wolpert12 ; Ortega13 . Assuming that the price of unit information is τ\tau, then the information-processing cost is τH-\tau H. If τ\tau\to\infty, which means that the information-processing cost is extremely high, the traveler does not care about the expected utility and focuses only on the information-processing cost. Hence, she will make a totally random choice of destination, that is, a uniform distribution over the set of destinations. If τ=0\tau=0, which means that information processing has no cost, she will select only the destination with the highest utility. In general case τ>0\tau>0, she must trade off her expected utility and the information-processing cost Guan20 ; Tkacik16 to achieve the total utility maximization goal, i.e.,

maxw=\displaystyle\max w= jpijuij+τH,\displaystyle\sum_{j}p_{ij}u_{ij}+\tau H, (4)
s.t.\displaystyle\mathrm{s.t.} jpij=1,\displaystyle\sum_{j}p_{ij}=1,

where jpijuij+τH\sum_{j}p_{ij}u_{ij}+\tau H is the objective function subjected to jpij=1\sum_{j}p_{ij}=1. Using the Lagrange multiplier method, we can obtain the solution of Eq. (4) as follows:

L(pij,λ)=jpijuij+τHλ(jpij1),L(p_{ij},\lambda)=\sum_{j}p_{ij}u_{ij}+\tau H-\lambda(\sum_{j}p_{ij}-1), (5)

where λ\lambda is a Lagrange multiplier. According to Lpij=0\frac{\partial L}{\partial p_{ij}}=0 for all destinations, we can obtain

uijτ(lnpij+1)=λ,u_{ij}-\tau(\ln p_{ij}+1)=\lambda, (6)

which means that all destinations have the same utility minus the marginal information-processing cost under the traveler’s optimal choice strategy. This situation is very similar to consumer equilibrium in microeconomics Tewari03 , in which the marginal utility of each good is equal. Thus, we name uijτ(lnpij+1)u_{ij}-\tau(\ln p_{ij}+1) the marginal utility of destination jj. The total utility of the system is the sum of the integrals of the marginal utilities of all destinations. The optimal choice strategy for the traveler is to follow the equimarginal principle Tewari03 to select destinations in order to achieve maximum total utility. Combining Eq. (6) and jpij=1\sum_{j}p_{ij}=1, we can derive

pij=euij/τjeuij/τ,p_{ij}=\frac{\mathrm{e}^{u_{ij}/\tau}}{\sum_{j}\mathrm{e}^{u_{ij}/\tau}}, (7)

which is the equilibrium solution of Eq. (4). In economics, the reciprocal of the parameter τ\tau in Eq. (7) is commonly referred to as the intensity of choice parameter Brock97 , which measures how sensitive a traveler is with respect to differences in the destination utilities. The greater 1/τ1/\tau is, the more sensitive the traveler is to differences in the destination utilities, and vice versa. In mathematical form, Eq. (7) is the Logit model derived in terms of random utility theory DoMc75 . However, our derivation does not need to assume in advance that the destination utility perception errors follow the Gumbel distribution.

II.3 Collective choice model

We further extend the transportation system to the case where origin ii has infinite homogeneous travelers (i.e., Oi1O_{i}\gg 1), each of whom still has only one trip. The question then becomes how these trips are distributed among various destinations. If the utility of destination jj is not affected by the number of trips to jj, Eq. (7) can still be applied in this system. On this occasion, the number of trips from ii to jj is Tij=OipijT_{ij}=O_{i}p_{ij}, and the system entropy SS is the sum of the information entropy HH of each trip, i.e., S=OiHS=O_{i}H. Therefore, we can rewrite Eq. (4) as

maxW=\displaystyle\max W= jTijuij+τS,\displaystyle\sum_{j}T_{ij}u_{ij}+\tau S, (8)
s.t.\displaystyle\mathrm{s.t.} jTij=Oi.\displaystyle\sum_{j}T_{ij}=O_{i}.

Similarly, all homogeneous travelers follow the equimarginal principle so that all destinations have the same marginal utility for them. If the utility of destination jj is abstractly written as uij=Ajciju_{ij}=A_{j}-c_{ij}, where AjA_{j} is the constant attractiveness of destination jj reflecting the activity opportunities (including variables such as retail activity and employment density) Sheffi85 available there, and cijc_{ij} is the constant travel cost from ii to jj, we can use the Lagrange multiplier method to obtain the equilibrium solution of Eq. (8) as

Tij=Oie(Ajcij)/τje(Ajcij)/τ,T_{ij}=O_{i}\frac{\mathrm{e}^{(A_{j}-c_{ij})/\tau}}{\sum_{j}\mathrm{e}^{(A_{j}-c_{ij})/\tau}}, (9)

which is the same as the singly-constrained gravity model derived from the Logit model DoMc75 .

However, in practice, the utility of destination jj is affected by the number of trips from ii to jj Fujita89 ; that is, uiju_{ij} is a function of TijT_{ij}. For example, an increase in the trip number TijT_{ij} from origin ii to destination jj will increase the travel cost from ii to jj and decrease the attractiveness of destination jj Yan19 , both of which will change the utility of destination jj. Hence, the utility can be abstractly written as uij(Tij)=Ajlj(Tij)cijgij(Tij)u_{ij}(T_{ij})=A_{j}-l_{j}(T_{ij})-c_{ij}-g_{ij}(T_{ij}), where lj(Tij)l_{j}(T_{ij}) is the variable attractiveness function of destination jj and gij(Tij)g_{ij}(T_{ij}) is the congestion function on the way from ii to jj. To solve the subsequent optimization problems, we assume that lj(Tij)l_{j}(T_{ij}) and gij(Tij)g_{ij}(T_{ij}) are both monotonically nondecreasing differentiable functions. We already know that each traveler follows the equimarginal principle to make the optimal choice. In this context, a traveler’s choice of destination depends on how other travelers are distributed over all destinations Conlisk76 . The phenomenon of one individual’s behavior being dependent on the behavior of other individuals, known as individual interaction, is widespread in social systems. In this interactive system, the optimal choice strategy for all travelers is still to assign all destinations the same marginal utility. Since the total utility of the system is the sum of the integrals of the marginal utilities of all destinations, the utility maximization model of the interactive system can be written as

maxW=\displaystyle\max W= j0Tijuij(x)dx+τS,\displaystyle\sum_{j}\int_{0}^{T_{ij}}u_{ij}(x)\mathrm{d}x+\tau S, (10)
s.t.\displaystyle\mathrm{s.t.} jTij=Oi.\displaystyle\sum_{j}T_{ij}=O_{i}.

Interestingly, the objective function of Eq. (10) is mathematically consistent with the Helmholtz free energy in physics Kittel80 ; thus, we name Eq. (10) the free utility model. Analogous to the thermodynamic system, the equilibrium trip distribution maximizes the free utility function in the transportation system. In addition, the first term of the free utility function, j0Tijuij(x)dx\sum_{j}\int_{0}^{T_{ij}}u_{ij}(x)\mathrm{d}x, is analogous to internal energy, including potential energy in a thermodynamic system. Monderer and Shapley Monderer96 therefore named this term the potential function. The free utility model with τ=0\tau=0 is essentially a potential game with an infinite number of agents.

Refer to caption
Figure 1: The effect of parameter changes on the optimal solution of the free utility model. In this simple system, there is only one origin, labeled a, and two destinations, labeled b and c. The origin has 200 travelers, each of whom has only one trip. The utilities of b and c are uab=203γTabu_{ab}=20-3\gamma T_{ab} and uac=2γTacu_{ac}=2-\gamma T_{ac}, respectively, where TabT_{ab} and TacT_{ac} are the number of travelers selecting destinations b and c, respectively, and γ\gamma is the interaction strength parameter. The information-processing cost for these travelers is τS-\tau S, where τ\tau is the information-processing price. The surface of the figure describes probability pbp_{b} that destination b is selected in the optimal solution of the free utility model under different parameter combinations.

Here, to generate gravity-like behavior, the utility needs a logarithmic dependence on the number of travelers from ii to jj, i.e., uij(Tij)=AjcijγlnTiju_{ij}(T_{ij})=A_{j}-c_{ij}-\gamma\ln T_{ij}, where γlnTij=lj(Tij)+gij(Tij)\gamma\ln T_{ij}=l_{j}(T_{ij})+g_{ij}(T_{ij}) is the simplified cost function, including the travel congestion cost and the destination variable attractiveness, and γ\gamma is a non-negative parameter. The logarithmic form of the cost function can be explained by the Weber-Fechner law in behavioral psychology Takemura14 . This law holds that the magnitude of human perception (e.g., the traveler’s perception of travel cost) is proportional to the logarithm of the magnitude of the physical stimulus (e.g., the number of travelers TijT_{ij}). Equation (10) can be specifically written as

maxW=\displaystyle\max W= j0Tij(Ajcijγlnx)dx+τS,\displaystyle\sum_{j}\int_{0}^{T_{ij}}(A_{j}-c_{ij}-\gamma\ln x)\mathrm{d}x+\tau S, (11)
s.t.\displaystyle\mathrm{s.t.} jTij=Oi.\displaystyle\sum_{j}T_{ij}=O_{i}.

Using the Lagrange multiplier method, we can obtain

Tij=Oie(Ajcij)/(γ+τ)je(Ajcij)/(γ+τ).T_{ij}=O_{i}\frac{\mathrm{e}^{(A_{j}-c_{ij})/(\gamma+\tau)}}{\sum_{j}\mathrm{e}^{(A_{j}-c_{ij})/(\gamma+\tau)}}. (12)

This result is the singly-constrained gravity model Dios11 . Parameter τ\tau reflects the information-processing price for bounded rational travelers, and parameter γ\gamma reflects travelers’ interaction strength. Figure 1 shows the effect of changes in parameters τ\tau and γ\gamma on the free utility optimal solution in a simple system with one origin and two destinations. The surface in Fig. 1 describes the optimal solution of the free utility model in this simple system. From Eq. (11), we can see that when γ>0\gamma>0 and τ=0\tau=0 (meaning no information-processing cost), the free utility model is equivalent to the degenerated destination choice game model Yan19 in terms of potential game theory, as shown in the left dashed line of Fig. 1; when γ>0\gamma>0 and τ\tau\to\infty (meaning that the information-processing cost is too high), travelers can only uniformly randomly select each destination, as shown in the lower-right solid line of Fig. 1; when τ>0\tau>0 and γ=0\gamma=0 (meaning that there is no interaction among travelers), the free utility model is equivalent to the Logit model DoMc75 , as shown in the upper-right dotted line of Fig. 1; and when τ=0\tau=0 and γ=0\gamma=0, all travelers will select the destination with the highest constant utility, as shown in the intersection point of the dashed line and dotted line of Fig. 1.

Above, we considered only a simple system with a single origin. However, there is more than one origin (M>1)(M>1) in a real transportation system. In such a system, the utility form of destination jj is changed. It is affected by not only the number of trips from ii to jj but also the total number of trips attracted to destination jj. Hence, the utility of destination jj for travelers at origin ii can be abstractly written as uij(Tij,Dj)=Ajlj(Dj)cijgij(Tij)u_{ij}(T_{ij},D_{j})=A_{j}-l_{j}(D_{j})-c_{ij}-g_{ij}(T_{ij}), where DjD_{j} is the total number of trips attracted to destination jj, i.e., Dj=iTijD_{j}=\sum_{i}T_{ij}, and used as the independent variable of the variable attractiveness function lj(Dj)l_{j}(D_{j}) of destination jj. If DjD_{j} is as constant as OiO_{i}, that is, not only the total number of trips emanating from origin ii but also the number of trips attracted to any destination is fixed, then, the variable attractiveness function lj(Dj)l_{j}(D_{j}) of destination jj is also constant and thus can be merged into the constant attractiveness AjA_{j} of destination jj. In other words, only the travel congestion function gij(Tij)g_{ij}(T_{ij}) influences the destination choice behavior of travelers. If gij(Tij)=γlnTijg_{ij}(T_{ij})=\gamma\ln T_{ij}, the free utility model of the system can be written as

maxW=\displaystyle\max W= ij0Tij(Ajcijγlnx)dx+τiSi,\displaystyle\sum_{i}\sum_{j}\int_{0}^{T_{ij}}(A_{j}-c_{ij}-\gamma\ln x)\mathrm{d}x+\tau\sum_{i}S_{i}, (13)
s.t.\displaystyle\mathrm{s.t.} jTij=Oi,\displaystyle\sum_{j}T_{ij}=O_{i},
iTij=Dj,\displaystyle\sum_{i}T_{ij}=D_{j},

where the two constraints jTij=Oi\sum_{j}T_{ij}=O_{i} and iTij=Dj\sum_{i}T_{ij}=D_{j} are the fixed total number of trips emanating from origin ii and the fixed total number of trips attracted to destination jj, respectively. Using the Lagrange multiplier method (see Appendix), we can obtain

Tij=aibjOiDjecij/(γ+τ),T_{ij}=a_{i}b_{j}O_{i}D_{j}\mathrm{e}^{-c_{ij}/(\gamma+\tau)}, (14)

where ai=1/jbjDjecij/(γ+τ)a_{i}=1/\sum_{j}b_{j}D_{j}\mathrm{e}^{-c_{ij}/(\gamma+\tau)} and bj=1/iaiOiecij/(γ+τ)b_{j}=1/\sum_{i}a_{i}O_{i}\mathrm{e}^{-c_{ij}/(\gamma+\tau)}. This is the doubly-constrained gravity model widely used in transportation science Dios11 . This free utility model can be reduced to some classical models under specific parameter combinations. When τ>0\tau>0 and γ=0\gamma=0 (meaning that there is no interaction among travelers), Eq. (13) is equivalent to the free cost model proposed by Tomlin et al. Tomlin68 , and its solution has the same form as that of Wilson’s maximum entropy model Wilson67 . When τ=0\tau=0 and γ=0\gamma=0, Eq. (13) is equivalent to the Hitchcock-Koopmans problem Hitchcock41 , which asks, for every origin ii and destination jj, how many travelers must journey from ii to jj in order to minimize the total cost ij(cijAj)Tij\sum_{i}\sum_{j}(c_{ij}-A_{j})T_{ij}. When τ\tau\to\infty (meaning that the first term of the free utility function in Eq. (13) is negligible), Eq. (13) is equivalent to the equal priori probability Sasaki model Tomlin68 with the solution TijOiDjT_{ij}\propto O_{i}D_{j}.

In a real transportation system, the travel cost cijc_{ij} often follows an approximate logarithmic relationship with distance dijd_{ij}, i.e., cijβlndijc_{ij}\approx\beta\ln d_{ij} Yan13 . Using βlndij\beta\ln d_{ij} instead of cijc_{ij} in Eq. (14), we can obtain the social gravity law

Tij=aibjOiDjdijβ/(γ+τ).T_{ij}=a_{i}b_{j}O_{i}D_{j}d_{ij}^{-\beta/(\gamma+\tau)}. (15)

Thus far, we have achieved the goal of explaining the root of the social gravity law. The social gravity law is a macroscopic phenomenon caused by the interaction of bounded rational individuals in destination choice. In contrast, the destination choice game model considers only individual interaction but ignores individual bounded rationality, which is almost universal in practice, while the maximum entropy model, the free cost model and the Logit model reflect the randomness of individual choice decisions but do not reflect individual interaction.

II.4 Network expansion model

The above constraints jTij=Oi\sum_{j}T_{ij}=O_{i} and iTij=Dj\sum_{i}T_{ij}=D_{j} are essential in the classic four-step travel demand forecasting models in transportation science since the second-step model predicting trip distribution requires the input values of trip production and attraction resulting from the first-step model Dios11 . However, the total number of trips DjD_{j} attracted to destination jj is impossible to fix beforehand in a real transportation system since Dj=iTijD_{j}=\sum_{i}T_{ij} is the product of the individual destination choice process. In other words, DjD_{j} is variable. In this case, the free utility model can still describe traveler destination choice behavior. For a better understanding, we transform the destination choice problem in the transportation system with multiple origin-destination pairs into an equivalent route choice problem in a network, as shown in Fig. 2. In this network, there are MM origin nodes, NN destination nodes and one dummy node ss. There are M×NM\times N links leading from each origin node to each destination node and NN links leading from each destination node to the dummy node. Each origin node has OiO_{i} travelers whose final destination in this network is the dummy node ss. The traveler journeying from ii to ss has NN alternative paths. Each path is composed of two links, ijij and jsjs. The flow along link ijij is TijT_{ij}, and that along jsjs is iTij\sum_{i}T_{ij}. The path utility consists of the utility uij(Tij)=cijgij(Tij)u_{ij}(T_{ij})=-c_{ij}-g_{ij}(T_{ij}) of link ijij and the utility uj(iTij)=Ajlj(iTij)u_{j}(\sum_{i}T_{ij})=A_{j}-l_{j}(\sum_{i}T_{ij}) of link jsjs. Hence, the free utility model for this network can be written as

maxW=\displaystyle\max W= ij0Tijuij(x)dx\displaystyle\sum_{i}\sum_{j}\int_{0}^{T_{ij}}u_{ij}(x)\mathrm{d}x (16)
+j0iTijuj(x)dx+τiSi,\displaystyle+\sum_{j}\int_{0}^{\sum_{i}T_{ij}}u_{j}(x)\mathrm{d}x+\tau\sum_{i}S_{i},
s.t.\displaystyle\mathrm{s.t.} jTij=Oi,\displaystyle\sum_{j}T_{ij}=O_{i},

where Si=jTijlnTijOiS_{i}=-\sum_{j}T_{ij}\ln\frac{T_{ij}}{O_{i}}. In addition, some paths may not be selected since the number of trips OiO_{i} is finite in practice; thus, constraint Tij0T_{ij}\geq 0 must be added to Eq. (16). Interestingly, Eq. (16) is identical to the stochastic user equilibrium (SUE) model Fisk80 for traffic assignment in transportation science. There are many algorithms that can solve this model and calculate the traffic flow on all links in the network Sheffi85 . Although the free utility model in Eq. (16) cannot provide an explicit expression of TijT_{ij}, it can better characterize collective destination choice behavior than the doubly-constrained gravity model that specifies the number of trips DjD_{j} attracted to destination jj.

Refer to caption
Figure 2: An example network to illustrate destination choice behavior in a transportation system with multiple origin-destination pairs. In this network, the green nodes in the bottom layer is are origin nodes, labeled ii, and the orange nodes in the middle layer are destination nodes, labeled jj. The solid lines from the green nodes to the orange nodes are links ijij, with flow TijT_{ij} and utility uij(Tij)u_{ij}(T_{ij}). The blue node in the top layer is the dummy node ss. The dashed lines from the orange nodes to the dummy node are links jsjs, with flow iTij\sum_{i}T_{ij} and utility uj(iTij)u_{j}(\sum_{i}T_{ij}). Individual destination choice behavior is route choice behavior in this network. The path from ii to ss is composed of link ijij and corresponding link jsjs.

III Discussion and conclusions

In this paper, we developed a free utility model that can explain the social gravity law from the perspective of individual choice behavior. The model makes two basic assumptions: (1) the individual pursues utility maximization, and (2) the individual needs to pay the information-processing cost to acquire more knowledge about the utilities of the destinations. The objective function of the free utility model of the destination choice system with a single origin (see Eq. (10)) is mathematically consistent with the Helmholtz free energy in physics. In other words, this destination choice system is analogous to the isothermal and isochoric thermodynamic system that consists of several subsystems in thermal contact with a large reservoir: the number of travelers is analogous to the number of particles; the first term of the free utility model’s objective function is analogous to the thermodynamic system’s internal energy including potential energy; the information-processing price is analogous to the temperature of the reservoir; the information entropy is analogous to the entropy of the thermodynamic system; the information-processing cost is analogous to the heat transferred between the thermodynamic system and the reservoir; and the marginal utility is analogous to the chemical potential of the subsystem. However, the essence of these two systems is different: the thermodynamic system follows the minimum free energy principle to make the system reach the equilibrium state in which all subsystems have the same chemical potential Kittel80 , and the maximization free utility of destination choice system is the result of individuals following the equimarginal principle to make choices to maximize their own utility.

Some previous models, including the free cost model Tomlin68 and the maximum entropy model Wilson67 , have not shown a clear derivation of the gravity model from microscopic mechanisms Bar19 . The free cost model presented by Tomlin and Tomlin derived the gravity model by analogy with the Helmholtz free energy, but it did not explain the social gravity law from the perspective of individual choice behavior. In our opinion, this model is not essentially different from the unconstrained gravity model established by direct analogy with Newton’s law of universal gravitation. Similarly, Wilson’s maximum entropy model on the gravity model can provide only the most likely macrostate but cannot describe individual choice behavior. The maximum entropy model also requires the prior constraint of the total cost, which is actually the result of individual choice. Additionally, both of these physical analogy models ignore the interaction among individuals, which is a ubiquitous phenomenon in social systems Bar11 . In comparison, we provide a concise explanation for the social gravity law from the perspective of individual behavior. It simultaneously reflects two dominant mechanisms that are common in social systems, namely, individual interaction and bounded rationality. This line of explanation brings us to the idea that the social gravity law is a phenomenon resulting from bounded rational individuals interacting with each other.

The free utility model can explain not only the social gravity law but also the potential function in game theory from the perspective of individual behavior. The free utility model is a stochastic potential game model Goeree99 in mathematical form. The objective function of the potential game model proposed by Monderer and Shapley was established by analogy with the potential function in physics Monderer96 . The variation in a player’s individual payoff due to changes in the player’s strategy is equal to the variation in the potential function Grauwin09 ; Lemoy11 . Every equilibrium strategy profile of the potential game maximizes the potential function. Under these circumstances, all players have the same payoff, and they cannot unilaterally change their strategy to increase their respective payoff. However, Monderer and Shapley raised a question about the interpretation of potential function: “What do the firms try to jointly maximize?” Monderer96 . In fact, the potential function is not something that players try to jointly maximize. Now we know that players will maximize only their own utility (i.e., the payoff) through the modeling process of the free utility model. A player’s optimal choice strategy is to make the marginal utility of each alternative equal. When all players follow this equimarginal principle, the sum of the integral of the marginal utility of each alternative (called total utility in economics) is naturally maximized. In other words, the maximum potential function is the result of players’ optimal choice strategy but is not a goal that the players pursue jointly.

The free utility model can also provide an explanation for the objective function of the classic traffic assignment model in transportation science. From Eq. (16), we can see that the free utility model for the transportation network is mathematically consistent with the SUE model Fisk80 . Without considering the information-processing cost, the free utility model is mathematically consistent with the classic user equilibrium (UE) model established by Beckmann Beckmann55 . The equilibrium solution of the UE model, in which all paths used between each OD pair have equal and minimum costs, is simply the result of the optimal path choice strategy selected by travelers following the equimarginal principle. The objective function of the UE model, i.e., the sum of the integral of the marginal utility (negative cost) of each link, is actually the negative free utility without information-processing cost. This provides new insights into UE and SUE models.

Finally, although we use individual destination choice behavior in the transportation system as background to explain the social gravity law, such behavior of selecting alternatives exists not only in the transportation system but also in systems where the spatial interaction patterns follow the social gravity law. For example, population migration involves selecting locations as residences, social interaction involves selecting people as friends and scientific collaboration involves selecting researchers as partners. In these systems, individuals tend to select alternatives with relatively high activity and relatively low distance, and changes in the number of individuals who select the same alternative will affect the variation in the utility of the alternative. For example, an increase in commodity competitors for the same product will lead to a decrease in the product price, and an increase in collaborators with one scientist will lead to a decrease in cooperation intensity between this scientist and her collaborators. The spatial interaction behavior in these different systems can be described by the free utility model. Not only that, the free utility model of the destination choice system with a single origin (see Eq. (10)) can be used for generalized problems of human choice, where homogeneous individuals select from multiple alternatives with utility associated with the choices. Moreover, using the network method, we can extend the free utility model to individuals of different types (i.e., heterogeneous individuals). The network in Fig. 2 can be regarded as a case in which two types of individuals select from among three alternatives. In this network, the utility of the solid line represents the utility component of the alternative for the corresponding type of individual, and the utility of the dashed line represents the utility component of the alternative for both types of individuals. In short, the free utility model that explains the social gravity law not only helps us deeply understand collective choice behavior patterns emerging from the interaction of bounded rational individuals but also shows potential application in predicting, guiding or even controlling human choice behavior in various complex social systems.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grant Nos. 71822102, 71621001, 71671015).

Appendix

The Lagrangian expression of the optimization problem in Eq. (13) is

maxL(Tij)=\displaystyle\max L(T_{ij})= ij0Tij(Ajcijγlnx)dx\displaystyle\sum\limits_{i}\sum\limits_{j}\int_{0}^{T_{ij}}(A_{j}-c_{ij}-\gamma\ln x)\mathrm{d}x (A.1)
+τiSi\displaystyle+\tau\sum_{i}S_{i}
+iλi(jTijOi)\displaystyle+\sum\limits_{i}\lambda_{i}(\sum\limits_{j}T_{ij}-O_{i})
+jμj(iTijDj),\displaystyle+\sum\limits_{j}\mu_{j}(\sum\limits_{i}T_{ij}-D_{j}),

where λi\lambda_{i} and μj\mu_{j} are Lagrange multipliers. Since

Si\displaystyle S_{i} =Oijpijlnpij\displaystyle=-O_{i}\sum\limits_{j}p_{ij}\ln p_{ij} (A.2)
=Oij(Tij/Oi)ln(Tij/Oi)\displaystyle=-O_{i}\sum\limits_{j}(T_{ij}/O_{i})\ln(T_{ij}/O_{i})
=jTijln(Tij/Oi)\displaystyle=-\sum\limits_{j}T_{ij}\ln(T_{ij}/O_{i})
=jTijlnTij+OilnOi\displaystyle=-\sum\limits_{j}T_{ij}\ln T_{ij}+O_{i}\ln O_{i}
=jTijlnTij+Oi+OilnOiOi\displaystyle=-\sum\limits_{j}T_{ij}\ln T_{ij}+O_{i}+O_{i}\ln O_{i}-O_{i}
=j(TijlnTijTij)+OilnOiOi,\displaystyle=-\sum\limits_{j}(T_{ij}\ln T_{ij}-T_{ij})+O_{i}\ln O_{i}-O_{i},

the partial derivative of the Lagrangian expression Eq. (A.1) with respect to TijT_{ij} is

LTij=AjcijγlnTijτlnTij+λi+μj=0,\frac{\partial L}{\partial T_{ij}}=A_{j}-c_{ij}-\gamma\ln T_{ij}-\tau\ln T_{ij}+\lambda_{i}+\mu_{j}=0, (A.3)

therefore,

Tij=e(Ajcij+λi+μj)/(γ+τ).T_{ij}=\mathrm{e}^{(A_{j}-c_{ij}+\lambda_{i}+\mu_{j})/(\gamma+\tau)}. (A.4)

Since

Oi=\displaystyle O_{i}= jTij=je(Ajcij+λi+μj)/(γ+τ)\displaystyle\sum\limits_{j}T_{ij}=\sum\limits_{j}\mathrm{e}^{(A_{j}-c_{ij}+\lambda_{i}+\mu_{j})/(\gamma+\tau)} (A.5)
=\displaystyle= eλi/(γ+τ)je(Ajcij+μj)/(γ+τ)\displaystyle\mathrm{e}^{\lambda_{i}/(\gamma+\tau)}\sum\limits_{j}\mathrm{e}^{(A_{j}-c_{ij}+\mu_{j})/(\gamma+\tau)}

and

Dj=\displaystyle D_{j}= iTij=ie(Ajcij+λi+μj)/(γ+τ)\displaystyle\sum\limits_{i}T_{ij}=\sum\limits_{i}\mathrm{e}^{(A_{j}-c_{ij}+\lambda_{i}+\mu_{j})/(\gamma+\tau)} (A.6)
=\displaystyle= e(Aj+μj)/(γ+τ)ie(λicij)/(γ+τ),\displaystyle\mathrm{e}^{(A_{j}+\mu_{j})/(\gamma+\tau)}\sum\limits_{i}\mathrm{e}^{(\lambda_{i}-c_{ij})/(\gamma+\tau)},

we obtain

eλi/(γ+τ)=Oi/je(Ajcij+μj)/(γ+τ)\mathrm{e}^{\lambda_{i}/(\gamma+\tau)}=O_{i}/\sum\limits_{j}\mathrm{e}^{(A_{j}-c_{ij}+\mu_{j})/(\gamma+\tau)} (A.7)

and

e(Aj+μj)/(γ+τ)=Dj/ie(λicij)/(γ+τ).\mathrm{e}^{(A_{j}+\mu_{j})/(\gamma+\tau)}=D_{j}/\sum\limits_{i}\mathrm{e}^{(\lambda_{i}-c_{ij})/(\gamma+\tau)}. (A.8)

Let ai=eλi/(γ+τ)/Oia_{i}=\mathrm{e}^{\lambda_{i}/(\gamma+\tau)}/O_{i} and bj=e(Aj+μj)/(γ+τ)/Djb_{j}=\mathrm{e}^{(A_{j}+\mu_{j})/(\gamma+\tau)}/D_{j}; Eq. (A.4) can then be rewritten as Eq. (14).

In the actual calculation, aia_{i} and bjb_{j} are two sets of interdependent balancing factors, i.e., ai=1/jbjDjecij/(γ+τ)a_{i}=1/\sum_{j}b_{j}D_{j}\mathrm{e}^{-c_{ij}/(\gamma+\tau)} and bj=1/iaiOiecij/(γ+τ)b_{j}=1/\sum_{i}a_{i}O_{i}\mathrm{e}^{-c_{ij}/(\gamma+\tau)}. Thus, the calculation of one set requires the values of the other set: start with all bj=1b_{j}=1, solve for ai=1/jbjDjecij/(γ+τ)a_{i}=1/\sum_{j}b_{j}D_{j}\mathrm{e}^{-c_{ij}/(\gamma+\tau)}, use these values to re-estimate bj=1/iaiOiecij/(γ+τ)b_{j}=1/\sum_{i}a_{i}O_{i}\mathrm{e}^{-c_{ij}/(\gamma+\tau)}, and repeat until convergence of the two sets is achieved Dios11 .

References

  • (1) Carra G, Mulalic I, Fosgerau M and Barthelemy M, Modelling the relation between income and commuting distance, 2016 J. R. Soc. Interface 13 20160306
  • (2) Niedercorn J H and Bechdolt Jr B V, An economic derivation of the “gravity law” of spatial interaction, 1969 J. Regional Sci. 9 273-282
  • (3) Roy J R and Thill J C, Spatial interaction modelling, 2003 Papers in Reg. Sci. 83 339-361
  • (4) Jia J S, Lu X, Yuan Y, Xu G, Jia J and Christakis N A, Population flow drives spatio-temporal distribution of COVID-19 in China, 2020 Nature 582 389-394
  • (5) de Dios Ortúzar J and Willumsen L G 2011 Modelling Transport (West Sussex: John Wiley & Sons)
  • (6) Barthélemy M, Spatial networks, 2011 Phys. Rep. 499 1-101
  • (7) Stouffer S A, Intervening opportunities: A theory relating mobility and distance, 1940 Am. Sociol. Rev. 5 845-867
  • (8) Sen A and Smith T E 1995 Gravity Models of Spatial Interaction Behavior (Heidelberg: Springer-Verlag)
  • (9) Simini F, González M C, Maritan A and Barabási A-L, A universal model for mobility and migration patterns, 2012 Nature 484 96-100
  • (10) Yan X-Y, Zhao C, Fan Y, Di Z-R and Wang W-X, Universal predictability of mobility patterns in cities, 2014 J. R. Soc. Interface 11 20140834
  • (11) Yan X-Y, Wang W-X, Gao Z-Y and Lai Y-C, Universal model of individual and population mobility on diverse spatial scales, 2017 Nat. Commun. 8 1639
  • (12) Liu E-J and Yan X-Y, A universal opportunity model for human mobility, 2020 Sci. Rep. 10 4657
  • (13) Fagiolo G and Mastrorillo M, International migration network: Topology and modeling, 2013 Phys. Rev. E 88 012812
  • (14) Karpiarz M, Fronczak P and Fronczak A, International trade network: Fractal properties and globalization puzzle, 2014 Phys. Rev. Lett. 113 248701
  • (15) Masucci A P, Serras J, Johansson A and Batty M, Gravity versus radiation models: On the importance of scale and heterogeneity in commuting flows, 2013 Phys. Rev. E 88 022812
  • (16) Goh S, Lee K, Park J S and Choi M Y, Modification of the gravity model and application to the metropolitan Seoul subway system, 2012 Phys. Rev. E 86 026102
  • (17) Odlyzko A, The forgotten discovery of gravity models and the inefficiency of early railway networks, 2015 Œconomia 5 157-192
  • (18) Carey H C 1858 Principles of Social Science (Philadelphia: Lippincott)
  • (19) Ravenstein E G, The laws of migration, 1889 J. R. Stat. Soc. 52 241-305
  • (20) Reilly W J 1929 Methods for the Study of Retail Relationships (Austin: University of Texas)
  • (21) Viboud C, Bjørnstad O N, Smith D L, Simonsen L, Miller M A and Grenfell B T, Synchrony, waves, and spatial hierarchies in the spread of influenza, 2006 Science 312 447-451
  • (22) Jung W S, Wang F and Stanley H E, Gravity model in the Korean highway, 2008 Europhys. Lett. 81 48005
  • (23) Krings G, Calabrese F, Ratti C and Blondel V D, Urban gravity: A model for inter-city telecommunication flows, 2009 J. Stat. Mech. 2009 L07003
  • (24) Balcan D, Colizza V, Gonçalves B, Hu H, Ramasco J J and Vespignani A, Multiscale mobility networks and the spatial spreading of infectious diseases, 2009 Proc. Natl Acad. Sci. USA 106 21484-21489
  • (25) Kaluza P, Kölzsch A, Gastner M T and Blasius B, The complex network of global cargo ship movements, 2010 J. R. Soc. Interface 7 1093-1103
  • (26) Pan R K, Kaski K and Fortunato S, World citation and collaboration networks: uncovering the role of geography in science, 2012 Sci. Rep. 2 902
  • (27) Wilson A G, A statistical theory of spatial distribution models, 1967 Transport. Res. 1 253-269
  • (28) Hua C and Porell F, A critical review of the development of the gravity model, 1979 Int. Regional Sci. Rev. 4 97-126
  • (29) Tomlin J A and Tomlin S G, Traffic distribution and entropy, 1968 Nature 220 974-976
  • (30) Sheppard E S, Theoretical underpinnings of the gravity hypothesis, 1978 Geogr. Anal. 10 386-402
  • (31) Domencich T A and Mcfadden D 1975 Urban Travel Demand: A Behavioral Analysis (Amsterdam: North-Holland Publishing Co.)
  • (32) Hensher D A and Johnson L W 2018 Applied Discrete-Choice Modelling (New York: Routledge)
  • (33) Vicsek T and Zafeiris A, Collective motion, 2012 Phys. Rep. 517 71-140
  • (34) Yan X-Y and Zhou T, Destination choice game: A spatial interaction theory on human mobility, 2019 Sci. Rep. 9 9466
  • (35) Simon H A 1972 Theories of Bounded Rationality (Amsterdam: North-Holland Publishing Co.)
  • (36) Fishburn P 1982 The Foundations of Expected Utility (Dordrecht: D. Reidel Publishing Co.)
  • (37) Luce D 1959 Individual Choice Behavior: A Theoretical Analysis (West Sussex: John Wiley & Sons)
  • (38) Marsili M, On the multinomial logit model, 1999 Physica A 269 9-15
  • (39) Wolpert D H, Harré M, Olbrich E, Bertschinger N and Jost J, Hysteresis effects of changing the parameters of noncooperative games, 2012 Phys. Rev. E 85 036102
  • (40) Ortega P A and Braun D A, Thermodynamics as a theory of decision-making with information-processing costs, 2013 Proc. R. Soc. A 469 20120683
  • (41) Guan S, Xu L, Zhang Q and Shi H, Trade-offs between effectiveness and cost in bifunctional enzyme circuit with concentration robustness, 2020 Phys. Rev. E 101 012409
  • (42) Tkačik G and Bialek W, Information processing in living systems, 2006 Annu. Rev. of Condens. Matter Phys. 7 89-117
  • (43) Tewari D D 2003 Principles of Microeconomics (New Delhi: New Age International Ltd.)
  • (44) Brock W A and Hommes C H, A rational route to randomness, 1997 Econometrica 65 1059-1095
  • (45) Sheffi Y 1985 Urban Transportation Networks: Equilibrium Analysis with Mathematical Programming Methods (England Cliffs: Prentice Hall)
  • (46) Fujita M 1989 Urban Economic Theory: Land Use and City Size (Cambridge: Cambridge University Press)
  • (47) Conlisk J, Interactive Markov chains, 1976 J. Math. Sociol. 4 157-185
  • (48) Kittel C and Kroemer H 1980 Thermal Physics (New York: Freeman)
  • (49) Monderer D and Shapley L S, Potential games, 1996 Games Econ. Behav. 14 124-143
  • (50) Takemura K 2014 Behavioral Decision Theory: Psychological and Mathematical Descriptions of Human Choice Behavior (Tokyo: Springer)
  • (51) Hitchcock F L, The distribution of a product from several sources to numerous localities, 1941 J. Math. Phys. 20 224-230
  • (52) Yan X-Y, Han X-P, Wang B-H and Zhou T, Diversity of individual mobility patterns and emergence of aggregated scaling laws, 2013 Sci. Rep. 3 2678
  • (53) Fisk C, Some developments in equilibrium traffic assignment, 1980 Transport. Res. Part B: Methodol. 14 243-255
  • (54) Barthélemy M, The statistical physics of cities, 2019 Nat. Rev. Phys. 1 406-415
  • (55) Goeree J K and Holt C A, Stochastic game theory: For playing games, not just for doing theory, 1999 Proc. Natl Acad. Sci. USA 96 10564-10567
  • (56) Grauwin S, Bertin E, Lemoy R and Jensen P, Competition between collective and individual dynamics, 2009 Proc. Natl Acad. Sci. USA 106 20622-20626
  • (57) Lemoy R, Bertin E and Jensen P, Socio-economic utility and chemical potential, 2011 Europhys. Lett. 93 38002
  • (58) Beckmann M J, McGuire C B and Winsten C B 1956 Studies in the Economics of Transportation (New Haven: Yale University Press)