This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Why are U.S. Parties So Polarized?
A “Satisficing” Dynamical Model

Vicky Chuqiao Yang Santa Fe Institute, Santa Fe, NM    Daniel M. Abrams Department of Engineering Sciences and Applied Mathematics, and Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL    Georgia Kernell Departments of Communication and Political Science, University of California, Los Angeles, Los Angeles, CA    Adilson E. Motter Department of Physics and Astronomy, and Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL
Abstract

Since the 1960s, Democrats and Republicans in U.S. Congress have taken increasingly polarized positions, while the public’s policy positions have remained centrist and moderate. We explain this apparent contradiction by developing a dynamical model that predicts ideological positions of political parties. Our approach tackles the challenge of incorporating bounded rationality into mathematical models and integrates the empirical finding of satisficing decision making—voters settle for candidates who are “good enough” when deciding for whom to vote. We test the model using data from the U.S. Congress over the past 150 years, and find that our predictions are consistent with the two major political parties’ historical trajectory. In particular, the model explains how polarization between the Democrats and Republicans since the 1960s could be a consequence of increasing ideological homogeneity within the parties.

1 Introduction

The U.S. Democratic and Republican parties have polarized drastically since the 1960s [2, 22, 23]. Legislators’ ideological positions are distributed bimodally, with an increasingly vast distance separating the two parties’ modes [4, 17]. Although what it means to be moderate has changed over the past half century, the general public’s dispersion in ideology remains mostly the same: unimodal, centrist, and stationary [12, 16, 17]. Figure 1 illustrates contrasting ideological dispersion between the U.S. public and U.S. Congress. In a system designed to create democratic responsiveness, why have members of congress of the two major parties shifted while the general public appears to have remained unchanged?

Political party polarization is a complex social phenomenon. Recent research demonstrates that many key aspects of social processes can be explained by simple mathematical models. Examples include opinion dynamics [9], terrorist events [10], the spread of rumors [6], and shifts in religious affiliation [1]. A number of studies address aspects of voting and elections, such as consensus formation [27, 28], how opinion clusters form in the compromise process [5], and how democratic voting can lead to totalitarianism [14]. Party polarization has now started to gain attention in the dynamical systems community. A recent study, for example, used a group competition model and found that the two major parties in the U.S. are increasingly benefiting from polarization in Congress [20].

In political science, many theoretical models examine how individuals’ voting behavior shapes political parties’ positions. The classic Downsian model [11] assumes two-party competition in a one-dimensional ideology space, where citizens cast their vote for the ideologically closest party, and parties adjust their positions to maximize votes. The Downsian model predicts that both parties should converge to the median voter’s position. In reality, however, the two major U.S. parties have dramatically polarized over the past half century.

Refer to caption
Figure 1: Polarization in the U.S. public and Congress (House and Senate combined), as measured by the standard deviation of ideological positions. The ideological positions are drawn from [17] for the public and [7] for Congress. Congress’s standard deviation is scaled so that its mean for the first three data points matches that of the public.

One reason we believe the Downsian model fails empirical validation is the assumption that voters maximize their utility. Empirical research suggests that voters often behave in ways that are only boundedly rational [13]. Instead of maximizing, people tend to satisfice; that is, they accept what is “good enough” and do not obsess over other options [3, 24, 25, 26]. The decisions of a maximizer may approach those of a satisficer in the presence of noise, misinformation, missing information, decision fatigue, and other factors that limit one’s ability to choose optimally. Moreover, experimental evidence demonstrates that for complex decisions, maximizers often make sub-optimal choices [21]. Thus, even if some voters follow a maximizing decision-making strategy, in practice they will often fail to find the optimum and instead behave similarly to satisficers.

In this article, we (1) develop a simple mathematical model for political party positions, taking the more realistic satisficing decision making of voters into account; and (2) explain why political party polarization may develop in the absence of changes in voters’ ideological positions. This work helps address the outstanding challenge in political polarization, and social systems more generally [15], of establishing unified mathematical models that are grounded in empirical findings. In addition to explaining key aspects of political polarization, our model’s framework provides a foundation for future studies on other aspects of party dynamics, such as minor party influence and success.

2 Overview

Our modeling framework considers the voter population distributed in a one-dimensional ideology space, with two parties adjusting their positions along this continuum in order to win the most votes. Voters decide which party to support based on satisficing—choosing one party randomly out of those that are satisfactory. The probability that a party is satisfactory to any given voter decays with the distance between that party and the voter, with the speed of decay determined by a parameter describing the party’s inclusiveness, or tolerance of ideological diversity. We present a dynamical model for the parties’ positions in ideology space. We then validate our model’s predictions using empirical data on the distribution of U.S. legislators’ ideological positions from 1861 to 2015, from which we estimate the position and inclusiveness of the Democratic and Republican party. The model employs as an input party inclusiveness, estimated from the data in each Congress, and predicts the corresponding position of each party over time. We then compare the predicted positions against historical positions of the two major U.S. parties, to establish a relationship between party polarization and party inclusiveness. Our model proposes a possible mechanism for the polarization of political parties without any change in the distribution of the public’s ideological positions.

3 Deriving the mathematical model

Using congressional roll call voting records, empirical research has found that the U.S. political division can be well represented by a single dimension [23].111While a single dimension historically performs well at explaining the vast majority of legislative voting behavior, there are several notable time periods—such as during the civil rights movement—during which a second dimension is salient [7]. We refer to this dimension as the ideology space, with left being liberal and right being conservative. We also describe how liberal or conservative a party or voter is by their position in the same ideology space, henceforth referred to as ideological position, or simply as position. Survey data reveals that the American public’s ideology is distributed in this space in a unimodal manner, peaked at a moderate position and well approximated by a Gaussian (see the supplementary material, section 1). Thus, we consider the voters to be distributed in the ideology space according to a Gaussian function ρ(x)\rho(x). Without loss of generality, we set the mean of the Gaussian to 0, and the standard deviation to σ0\sigma_{0}. Each party ii adopts a position, denoted as μi\mu_{i}, along this ideology space. See Fig. 2 for a sketch of the model setup.

Refer to caption
Figure 2: Sketch of the model setup showing the interpretation of parameters in the ideology space. The function ρ\rho (solid curve) represents the opinion distribution in the population of voters. The parameter μi\mu_{i} (triangle) marks the position of one out of possibly nn parties competing for voters, and function sis_{i} (dashed curve) represents that party’s satisficing function.

Consider an election among nn parties. We assume the probability that a voter positioned at xx is satisfied with party ii decays with the voter’s distance from μi\mu_{i} and that this decay is symmetrical. We refer to this decaying probability as the satisficing function, si(di)s_{i}(d_{i}), where di=|xμi|d_{i}=|x-\mu_{i}|. We also assume si(0)=1s_{i}(0)=1 (meaning that a voter perfectly aligned with a party will be satisfied with that party), and si(di)0s_{i}(d_{i})\to 0 as did_{i}\rightarrow\infty. One function that satisfies these properties is, again, a Gaussian: si(di)=exp[di2/(2σi2)]s_{i}(d_{i})=\exp[-d_{i}^{2}/(2\sigma_{i}^{2})] (here scaled to have unit peak amplitude). The parameter σi\sigma_{i} represents how tolerant voters are of parties with ideologies different from their own (Fig. 2). It can also be interpreted as the inclusiveness of party ii; that is, how much a party appeals to voters at distant ideological positions. Parties with lower inclusiveness are more ideologically homogeneous.

We break down the satisficing decision-making process into explicit statements:

  1. 1.

    Voters who are satisfied with none of the nn parties abstain from voting.

  2. 2.

    Voters who are satisfied with a subset MM of the nn parties vote for each party in MM with probability pip_{i}, where Mpi=1\sum_{M}p_{i}=1. (Below, we assume for simplicity that pip_{i} is the same for all parties in this set, but this assumption is not essential.)

We first examine the case in which two parties, party 11 and party 22, compete. Since the governing equations for the two parties are similar, in the following equations, we use index ii to denote either party 1 or 2, and jj the other party. The expected share of potential voters at position xx who vote for party ii can be expressed as

pi(x|μi,μj)=si(di)(1sj(dj))A+12si(di)sj(dj)B,p_{i}(x|\mu_{i},\mu_{j})=\underbrace{s_{i}(d_{i})\left(1-s_{j}(d_{j})\right)}_{\text{A}}+\underbrace{\frac{1}{2}s_{i}(d_{i})s_{j}(d_{j})}_{\text{B}}\;, (1)

where term A represents the expected share of voters at position xx who are satisfied with party ii only, and term B represents half of those who are satisfied with both parties. The expected total number of voters for party ii, which we denote ViV_{i}, is obtained by integrating pi(x|μi,μj)p_{i}(x|\mu_{i},\mu_{j}) over the ideology space weighted by the population density ρ\rho:

Vi(μi,μj)=ρ(x)pi(x|μi,μj)𝑑x.V_{i}(\mu_{i},\mu_{j})=\int_{-\infty}^{\infty}\rho(x)p_{i}(x|\mu_{i},\mu_{j})dx\;. (2)

To model dynamical changes in positions over time, we assume parties move in the direction that increases the number of votes received at a speed proportional to the potential gain:

dμidt=kViμi,\frac{d\mu_{i}}{dt}=k\frac{\partial V_{i}}{\partial\mu_{i}}\;, (3)

where kk is a positive constant determined from data that sets the time scale. Equation (3) generates predictions for party ii’s position over time, where the left side represents the speed of position change and the right side represents the change in party’s votes. While we derived this equation for the two-party case, it admits an immediate extension to the nn-party case with the exact same form, except that the vote, ViV_{i}, is then defined relative to all nn parties. In supplementary material, section 4, we explore an alternative model where parties maximize vote share. This model produces similar outcomes (although it requires additional complexity).

4 Results and comparison with empirical data

4.1 Fixed points of the system

Refer to caption
Figure 3: (A) A vector field that demonstrates the movement of the two parties at various ideological positions. Arrows show the direction of the party positions’ movement, with the length of each arrow proportional to the speed of movement. The parameters used in this figure are the inclusiveness parameters σ1=σ2=0.5\sigma_{1}=\sigma_{2}=0.5 and time scale constant k=1k=1. (B) Fixed points for party positions (μ\mu) as a function of the inclusiveness parameter of the two major parties, assuming symmetrical inclusiveness σ1=σ2=σ\sigma_{1}=\sigma_{2}=\sigma. The stable fixed points are shown as solid curves, and the unstable fixed point is indicated by the dashed line. The shaded area indicates the inclusiveness parameter region corresponding to that estimated from the DW-NOMINATE data discussed in section 4.3 using methods discussed in section 4.5. (C) Polarization, as measured by the distance between parties at steady state, as a function of both σ1\sigma_{1} and σ2\sigma_{2}. In all three panels, the parameter σ0\sigma_{0} is set to 0.930.93, which is the same value used in Fig. 4.

We solve for the fixed points of the system both analytically and numerically with analytical results presented in section 4.2 and numerical results presented in Figs. 3 and 4. The results reveal that for sufficiently large inclusiveness parameters, σ1,2\sigma_{1,2}, the two parties are attracted to a single stable equilibrium: μ1=μ2=0\mu_{1}=\mu_{2}=0, at the center of the ideology space. However, for smaller σ1,2\sigma_{1,2}, the two parties stabilize at a finite separation. Fig. 3A visualizes the dynamics of a two-party system by plotting the derivatives dμ1/dtd\mu_{1}/dt and dμ2/dtd\mu_{2}/dt in Eq. (3) as a vector field. In this example, the two parties stabilize at a finite separation. The fixed points and their stability are shown in Fig. 3B for the symmetric case, σ1=σ2\sigma_{1}=\sigma_{2}. We note that Fig. 3B plots the positions of both parties on the same vertical axis. In the asymmetric case where σ1σ2\sigma_{1}\neq\sigma_{2}, the polarization, as measured by the distance between the two parties at steady state, is a function of both σ1\sigma_{1} and σ2\sigma_{2}. The numerical results for this case are shown in Fig. 3C. The system equilibrates at a finite separation in most of the parameter space.

An intuitive understanding of this result is that when the inclusiveness parameters are large, few voters abstain. Thus, parties are attracted to the center of the ideology space through the same mechanism as in the classic Downsian model that predicts the convergence of both parties to the median voter [11]. However, as the inclusiveness parameters decrease, convergence to the center increases the number of voters at the tails of the ideology distribution abstaining from voting. In that case, the parties benefit from moving away from the center, with an equilibrium distance determined by the parameters. Although the model was presented in a one-dimensional ideology space, the model’s behavior is similar in two dimensions (see supplementary material section1).

4.2 Analytical results for two identical parties

The right hand side of the system specified by Eq. (3) can be computed analytically for the symmetrical case σ1=σ2=σ\sigma_{1}=\sigma_{2}=\sigma:

dμidt=σ0[μi(σ2+σ02)μjσ02]2σ(σ2+2σ02)3/2e(μi2+μj2)σ2+(μiμj)2σ022σ2(σ2+2σ02)μiσ0σeμi22(σ2+σ02)(σ2+σ02)3/2.\frac{d\mu_{i}}{dt}=\frac{\sigma_{0}[\mu_{i}(\sigma^{2}+\sigma_{0}^{2})-\mu_{j}\sigma_{0}^{2}]}{2\sigma(\sigma^{2}+2\sigma_{0}^{2})^{3/2}}e^{-\frac{(\mu_{i}^{2}+\mu_{j}^{2})\sigma^{2}+(\mu_{i}-\mu_{j})^{2}\sigma_{0}^{2}}{2\sigma^{2}(\sigma^{2}+2\sigma_{0}^{2})}}-\frac{\mu_{i}\sigma_{0}\sigma e^{-\frac{\mu_{i}^{2}}{2(\sigma^{2}+\sigma_{0}^{2})}}}{(\sigma^{2}+\sigma_{0}^{2})^{3/2}}\;. (4)

Solving for the equilibria dμi/dt=0d\mu_{i}/dt=0, and imposing symmetry in party positions μ2=μ1=μ\mu_{2}=-\mu_{1}=\mu (this follows from the symmetry in σ1=σ2\sigma_{1}=\sigma_{2}), we find

μ[2σ2(σ2+2σ02)1/2eμ22(σ2+σ02)+(σ2+σ02)3/2eμ2σ2]=0.\mu\left[-2\sigma^{2}(\sigma^{2}+2\sigma_{0}^{2})^{1/2}e^{-\frac{\mu^{2}}{2(\sigma^{2}+\sigma_{0}^{2})}}+(\sigma^{2}+\sigma_{0}^{2})^{3/2}e^{-\frac{\mu^{2}}{\sigma^{2}}}\right]=0\;. (5)

Clearly μ=0\mu=0 is a solution, and thus μ=0\mu^{\star}=0 is a fixed point of the system for all values of σ\sigma and σ0\sigma_{0}. Other possible fixed points are given by the solution of the equation (σ2+σ02)3/2eμ2σ22σ2(σ2+2σ02)1/2eμ22(σ2+σ02)=0(\sigma^{2}+\sigma_{0}^{2})^{3/2}e^{-\frac{\mu^{2}}{\sigma^{2}}}-2\sigma^{2}(\sigma^{2}+2\sigma_{0}^{2})^{1/2}e^{-\frac{\mu^{2}}{2(\sigma^{2}+\sigma_{0}^{2})}}=0, which can be solved for μ2\mu^{2} explicitly as

(μ)2=σ2(σ2+σ02σ2+2σ02)ln[(σ2+σ02)34σ4(σ2+2σ02)];\left(\mu^{\star}\right)^{2}=\sigma^{2}\left(\frac{\sigma^{2}+\sigma_{0}^{2}}{\sigma^{2}+2\sigma_{0}^{2}}\right)\ln\left[\frac{\left(\sigma^{2}+\sigma_{0}^{2}\right)^{3}}{4\sigma^{4}\left(\sigma^{2}+2\sigma_{0}^{2}\right)}\right]\;; (6)

the solid line in Fig. 3B is this curve. Note that this expression can be rewritten in terms of nondimensional parameters {σ^σ/σ0,μ^μ/σ0}\{\hat{\sigma}\equiv\sigma/\sigma_{0},~{}\hat{\mu}\equiv\mu/\sigma_{0}\} simply by setting σσ^\sigma\mapsto\hat{\sigma} and σ01\sigma_{0}\mapsto 1.

The trivial solution μ=0\mu^{\star}=0 is the only fixed point for σ>σc\sigma>\sigma_{c} for some σc\sigma_{c}, and it is of interest to understand how σc\sigma_{c} depends on system parameters222The result that the trivial solution μ=0\mu^{\star}=0 is the only fixed point for σ>σc\sigma>\sigma_{c}, holds even for two parties with μ1μ2\mu_{1}\neq\mu_{2}, as can be seen with a series expansion for large σ\sigma retaining only leading order terms.. This critical value can be understood by noting that the nontrivial fixed points for μ\mu given in Eq. (6) cease to exist when the logarithm becomes negative, i.e., when the argument of the logarithm becomes less than one; σc\sigma_{c} is the critical value at which the argument is exactly one. Expressing this condition in the nondimensionalized form gives

(σ^c2+1)34σ^c4(σ^c2+2)=1,\frac{\left(\hat{\sigma}_{c}^{2}+1\right)^{3}}{4\hat{\sigma}_{c}^{4}\left(\hat{\sigma}_{c}^{2}+2\right)}=1\;, (7)

and rearranging the terms leads to a cubic polynomial in σ^c2\hat{\sigma}_{c}^{2}:

3σ^c6+5σ^c43σ^c21=0.3\hat{\sigma}_{c}^{6}+5\hat{\sigma}_{c}^{4}-3\hat{\sigma}_{c}^{2}-1=0\;. (8)

This equation has three real roots for σ^c2\hat{\sigma}^{2}_{c}, at σ^c2{2.07,0.247,0.652}\hat{\sigma}_{c}^{2}\approx\{-2.07,-0.247,0.652\}, and only the last root allows for a real-valued solution σ^c0.807\hat{\sigma}_{c}\approx 0.807. The system undergoes a subcritical pitchfork bifurcation at this point.

The exact shape of the fixed point curve is defined by Eq. (6), which cannot be solved explicitly for σ\sigma (or σ^\hat{\sigma} in nondimensional form). A convenient approximation can be written by expanding the right-hand side to the leading order in σ^\hat{\sigma}, yielding μ^2σ^2ln(8σ^2)\hat{\mu}^{2}\approx-\hat{\sigma}^{2}\ln(\sqrt{8}\hat{\sigma}^{2}). Define Wμ^2/σ^2W\equiv-\hat{\mu}^{2}/\hat{\sigma}^{2}, and rearrange the previous equation to get 8μ^2Wexp(W)-\sqrt{8}\hat{\mu}^{2}\approx W\exp(W). This equation is solved by W=W(8μ^2)W=W(-\sqrt{8}\hat{\mu}^{2}), where WW is now identified as the special function known as the Lambert WW function. Thus σ^2μ^2/W(8μ^2)\hat{\sigma}^{2}\approx-\hat{\mu}^{2}/W(-\sqrt{8}\hat{\mu}^{2}).

The Lambert WW function is real-valued for negative arguments only when the argument has magnitude less than 1/e1/e. This sets a bound on the maximum μ^2\hat{\mu}^{2} for which a solution exists since 8μ^max21/e\sqrt{8}\hat{\mu}_{\textrm{max}}^{2}\approx 1/e implies μ^max23/4e1/20.361\hat{\mu}_{\textrm{max}}\approx 2^{-3/4}e^{-1/2}\approx 0.361. The corresponding value of σ^\hat{\sigma} is then immediate since W(1/e)=1W(-1/e)=-1, yielding σ^maxμ^max23/4e1/2\hat{\sigma}_{\textrm{max}}\approx\hat{\mu}_{\textrm{max}}\approx 2^{-3/4}e^{-1/2}. These approximations are useful in understanding the general shape of the bifurcation curve, though the critical points of the exact expression in Eq. (6) can be easily computed numerically.

4.3 Empirical Data

To examine changes in real-world party positions over time, we employ ideological positions for Democratic and Republican legislators calculated from their congressional roll call voting records using the Dynamic, Weighted, Nominal Three-Step Estimation (DW-NOMINATE) method [7]. DW-NOMINATE is a multi-dimensional scaling method that first calculates a pairwise distance for every two members of the U.S. House of Representatives based on similarities in their roll call voting records. It then projects the resulting high-dimensional network of representatives to a low dimension while preserving the pairwise distance relation as much as possible. The representatives’ relative positions in this low-dimensional space are referred to as their ideology scores.333Note that this measure is relative in nature—it reflects the similarities and differences among members’ voting records across a large number of bills, not their positions on any specific issue. The same method can be used for scaling the positions of U.S. Senators, and in what follows we use a combined dataset of both representatives and senators (see supplementary material section 2 for data source).

As an example, Fig. 4A displays a histogram of Democratic and Republican legislators’ ideological positions during the 2013-15 term, the most recent in the data.

4.4 Model Validation

For purposes of comparing our model with data, we take the mean of each party’s distribution to represent its position, μi\mu_{i}, in the model and the standard deviation of each party’s distribution to represent its inclusiveness parameter, σi\sigma_{i}. As members of Congress move farther from the party mean, we infer that the party is more inclusive—and may thus appeal more to voters farther away from its center. Repeating that procedure for all available Congresses leads to a series of party positions over time since 1861, shown as the solid curves in Fig. 4B. We note that two distinct polarization metrics—the distance between party means (used in Fig. 4) and the standard deviation of the legislators’ ideology distribution (used in Fig. 1)—consistently demonstrate the striking increase in party polarization since the 1960s.

Refer to caption
Figure 4: Predictions and validations of the satisficing model. (A) Histogram of ideological positions for the Democratic and Republican parties in the 2013-15 Congress. (B) Comparison of the model prediction with time series data for party positions. The solid curves indicate the empirical party positions, estimated by the mean position of the party’s members of Congress, while the shaded areas mark the associated standard deviations. The dashed curves show our model predictions. (C) Comparison of party polarization predicted by our model and polarization observed in the data. The Pearson correlation between the prediction and the data is 0.75, with p-value 6.1×10156.1\times 10^{-15}. The dashed line corresponds to correlation 1 and is a reference. (D) Party polarization as a function of σ\sigma (average of the Democratic and Republican parties’ inclusiveness parameters). Each marker represents one Congress term. In both the model’s prediction and the data, party inclusiveness is negatively associated with polarization. In panels (C) and (D), party polarization is measured as the distance between party positions.

Importantly, Fig. 4B also compares the model’s predictions for the two major parties’ positions to historical data. We set the two parties’ positions in 1861 as their initial conditions and numerically simulate the dynamical system (3) to predict their positions in subsequent years. In the simulations, we update the inclusiveness parameter according to the data every Congress, and the model outputs the positions of both parties according to Eq. (3). Parameters σ0\sigma_{0}, kk, and a proportionality constant associating the standard deviation in the DW-NOMINATE score with the inclusiveness parameter are determined from the fit to the data. An animated visualization of Figs. 4A and 4B is included as a supplementary video.

Our theory shows good agreement with the data (Pearson correlation 0.75; see Figs. 4C and 4D). Notable deviations are observed around the times of the First and Second World Wars and following the recent rightward move by the Republican Party, as shown in Fig. 4B.

A central prediction of our model is that lower inclusiveness (more homogeneity within parties) will lead to higher party polarization. We find support for this prediction in empirical data, as shown in Fig. 4D. This finding suggests that it is not necessary for the voting population to become polarized in order for increased political party polarization to occur. Indeed, the electorate need not change at all; it is held constant in our simulations. We also perform robustness checks using an alternative measure of party polarization as well as an independent data source for the inclusiveness parameter and reach the same qualitative conclusions (see supplementary material section 3).

4.5 Parameter fitting

When determining the parameters from the data, we relate the units of the DW-NOMINATE ideology scores to units for variables in our model by assuming a linear scaling. For example, the party inclusiveness parameter, σi\sigma_{i}, is assumed to be linearly related to the DW-NOMINATE data through the scaling σi=bσdata,i\sigma_{i}=b\;\sigma_{\text{data},i}. We fitted the parameters in the model by minimizing the 1-norm for differences between the time series predicted by the model and the time series from data. Three parameters are fitted: (1) σ0\sigma_{0}, the standard deviation of the population ideology distribution; (2) bb, the proportionality constant relating the standard deviation in the DW-NOMINATE data with the party inclusiveness parameter in the model, σi\sigma_{i}; and (3) kk, the time scale constant. We set the initial conditions for the two parties as given by the data. The best fitting parameters are: σ0=0.93\sigma_{0}=0.93, b=3.73b=3.73, and k=2.54k=2.54.

5 Discussion

5.1 Comparison between satisficing and maximizing models

The satisficing assumption is essential to our model’s behavior. Here, we compare the results of our satisficing model with a maximizing model in the same framework (assuming two parties) and show that the predictions from the two models are fundamentally different. We consider the maximizing voters to be defined in the Downsian sense [11]—they maximize their utility function by voting for the ideologically closest party. With maximizing voters, the number of votes each party receives is

Vl(m)=(μl+μr)/2ρ(x)𝑑x,andVr(m)=(μl+μr)/2ρ(x)𝑑x,V^{(m)}_{l}=\int_{-\infty}^{(\mu_{l}+\mu_{r})/2}\rho(x)dx,\;\text{and}\;V^{(m)}_{r}=\int_{(\mu_{l}+\mu_{r})/2}^{\infty}\rho(x)dx\;, (9)

where (l,r)=(1,2)(l,r)=(1,2) if μ1<μ2\mu_{1}<\mu_{2}, and (l,r)=(2,1)(l,r)=(2,1) if μ1>μ2\mu_{1}>\mu_{2}. If μ1=μ2\mu_{1}=\mu_{2}, both parties receive the same number of votes,

V1(m)=V2(m)=12ρ(x)𝑑x.V^{(m)}_{1}=V^{(m)}_{2}=\frac{1}{2}\int_{-\infty}^{\infty}\rho(x)dx\;. (10)

We then solve for the fixed points of the system described by Eq. (3) with the two types of vote calculations and find their stabilities.

Figure 5 shows the stable fixed points for party positions as a function of σ1=σ2=σ\sigma_{1}=\sigma_{2}=\sigma, for normalized variables σ^=σ/σ0\hat{\sigma}=\sigma/\sigma_{0} and μ^=μ/σ0\hat{\mu}=\mu/\sigma_{0}. With satisficing voters, the two parties stabilize at a finite separation for a range of parameters (as shown above). However, with maximizing voters, the position (0,0)(0,0) is the only stable fixed point, recovering Downs’ result [11], a classic finding in political science.444This result can be immediately seen for the symmetric case μ1=μ2\mu_{1}=-\mu_{2}. In that case, each party is drawn toward the center to increase votes, despite having split the number of votes equally with the opposing party. This difference indicates that the voters’ satisficing decision making is indeed a key ingredient in producing the model’s results.

Refer to caption
Figure 5: Stable fixed points for party positions (μ^\hat{\mu}) as a function of the inclusiveness parameter of the two parties (σ^\hat{\sigma}) for (A) the satisficing model and (B) the maximizing model. For each model, the positions of both parties are shown on the same vertical axis.

5.2 Implications

Our model offers new contributions to the literature. First, we present empirical evidence of and an explanation for the relationship between party homogeneity and polarization, showing that simply changing the shape of parties’ satisficing functions (via changes to inclusiveness) is sufficient to lead to divergence in their positions. In particular, even though the impact of intra-party heterogeneity on a party’s competitiveness has been discussed before [18], the mechanism linking increasing ideological homogeneity with diverging party positions has remained unclear. Second, the model also shows why appealing to an extreme segment of the electorate can be a winning political strategy in times of greater intra-party ideological homogeneity (i.e., decreasing party inclusiveness), which may be especially relevant for interpreting current trends in U.S. politics. Ideological homogeneity itself may be driven by other factors, such as partisan redistricting [8] or media echo chambers [19], and explicitly modeling how these factors influence intra-party cohesion remains an open area of research. Third, our approach offers a new quantitative framework that incorporates satisficing behavior into a voting model.

In addressing the call for combining interdisciplinary methods to study human social behavior [15], this article offers insight into the complex process of political elections and democratic responsiveness through a parsimonious model, and suggests several directions for future work. For example, do all individuals engage in satisficing, and if so, do they do so in the same way? When and how can minor parties gain traction? And, how might outcomes change if parties could control both their position and their level of inclusiveness? For simplicity, we left out a number of electoral variables, such as party primaries, campaign financing, and Southern realignment, but it will be important for future research to understand how these factors interact with vote satisficing. We hope our work will spur further development of quantitative frameworks to incorporate human bounded rationality into mathematical models.

Acknowledgments

We thank Jeffrey Lewis and Keith Poole for in-depth discussions about DW-NOMINATE. We also gratefully acknowledge babies Miriam, Desiree, and Yara for their contributions.

Data and code availability

A copy of the DW-NOMINATE dataset used is included as a supplementary file. The computer code for numerically simulating the model is available in the following GitHub repository: https://github.com/vc-yang/satisficing_election_model.

Author Contributions

All authors contributed to the design of the research. V.C.Y. developed the model and performed computer simulations. All authors analyzed the results and contributed to the writing of the manuscript, which was led by V.C.Y. and D.M.A.

Funding

This work was supported by the James S. McDonnell Foundation through grant No. 220020230, the Northwestern University’s Data Science Initiative, Suzanne Hurst and Samuel Peters, and the Santa Fe Institute Omidyar Fellowship.

References

  • [1] D. M. Abrams, H. A. Yaple, and R. J. Wiener, Dynamics of social group competition: modeling the decline of religious affiliation, Phys. Rev. Lett., 107 (2011), p. 088701.
  • [2] C. Andris, D. Lee, M. J. Hamilton, M. Martino, C. E. Gunning, and J. A. Selden, The rise of partisanship and super-cooperators in the US House of Representatives, PLoS One, 10 (2015), p. e0123507.
  • [3] R. J. Aumann, Rationality and bounded rationality, Game. Econ. Behav., 21 (1997), pp. 2–14.
  • [4] D. Baldassarri and A. Gelman, Partisans without constraint: Political polarization and trends in American public opinion, Am. J. Sociol., 114 (2008), pp. 408–446.
  • [5] E. Ben-Naim, P. L. Krapivsky, and S. Redner, Bifurcations and patterns in compromise processes, Physica D, 183 (2003), pp. 190–204.
  • [6] J. Borge-Holthoefer and Y. Moreno, Absence of influential spreaders in rumor dynamics, Phys. Rev. E, 85 (2012), p. 026116.
  • [7] R. Carroll, J. Lewis, J. Lo, N. McCarty, K. Poole, and H. Rosenthal, “Common space” DW-NOMINATE scores with bootstrapped standard errors (joint House and Senate scaling). http://voteview.com/dwnomin_joint_house_and_senate.htm, 2015.
  • [8] J. L. Carson, G. Koger, M. J. Lebo, and E. Young, The electoral costs of party loyalty in Congress., American Journal of Political Science, 54 (2010), pp. 598–616.
  • [9] C. Castellano, S. Fortunato, and V. Loreto, Statistical physics of social dynamics, Rev. Mod. Phys., 81 (2009), p. 591.
  • [10] A. Clauset, M. Young, and K. S. Gleditsch, On the frequency of severe terrorist events, J Conflict Resolut, 51 (2007), pp. 58–87.
  • [11] A. Downs, An economic theory of political action in a democracy, J. Polit. Econ., 65 (1957), pp. 135–150.
  • [12] M. P. Fiorina and S. J. Abrams, Political polarization in the American public, Annu. Rev. Polit. Sci., 11 (2008), pp. 563–588.
  • [13] K. E. Fisher, S. Erdelez, and L. McKechnie, Theories of Information Behavior, Information Today, Inc., 2005.
  • [14] S. Galam, Application of statistical physics to politics, Physica A, 274 (1999), pp. 132–139.
  • [15] H. Gintis, Towards the unity of the human behavioral sciences, Polit. Philos. Econ., 3 (2004), pp. 37–57.
  • [16] M. J. Hetherington, Review article: Putting polarization in perspective, Brit. J. Polit. Sci., 39 (2009), pp. 413–448.
  • [17] S. J. Hill and C. Tausanovitch, A disconnect in representation? Comparison of trends in congressional and public polarization, J. Polit., 77 (2015), pp. 1058–1075.
  • [18] G. Kernell, Strategic party heterogeneity, J. Theor. Polit., 28 (2015), pp. 408–430.
  • [19] M. Levendusky, The Partisan Sort: How Liberals Became Democrats and Conservatives Became Republicans, The University of Chicago Press, 2009.
  • [20] X. Lu, J. Gao, and B. K. Szymanski, The evolution of polarization in the legislative branch of government, J Roy Soc Interface, 16 (2019).
  • [21] W. Mao, When one desires too much of a good thing: The compromise effect under maximizing tendencies, J. Consum. Psychol., 26 (2016), pp. 66–80.
  • [22] N. McCarty, K. T. Poole, and H. Rosenthal, Polarized America: The Dance of Ideology and Unequal Riches, MIT Press, 2016.
  • [23] K. T. Poole and H. Rosenthal, The polarization of American politics, J. Polit., 46 (1984), pp. 1061–1079.
  • [24] B. Schwartz, A. Ward, J. Monterosso, S. Lyubomirsky, K. White, and D. R. Lehman, Maximizing versus satisficing: Happiness is a matter of choice., J. Pers. Soc. Psychol., 83 (2002), p. 1178.
  • [25] H. A. Simon, Theories of bounded rationality, Decision and Organization, 1 (1972), pp. 161–176.
  • [26]  , Human nature in politics: The dialogue of psychology with political science, Am. Polit. Sci. Rev., 79 (1985), pp. 293–304.
  • [27] V. Sood and S. Redner, Voter model on heterogeneous graphs, Phys. Rev. Lett., 94 (2005), p. 178701.
  • [28] J. Xie, S. Sreenivasan, G. Korniss, W. Zhang, C. Lim, and B. K. Szymanski, Social consensus through the influence of committed minorities, Phys. Rev. E, 84 (2011), p. 011130.

Supplementary Material:
Why Are U.S. Parties So Polarized? A “Satisficing” Dynamical Model

Vicky Chuqiao Yang, Daniel M. Abrams, Georgia Kernell, and Adilson E. Motter

1 Additional discussion of model assumptions

We discuss the empirical basis for the assumed ideology distribution of the voting population and key properties of a higher-dimensional extension of the model.

1.1 Population ideology distribution

In the main text, we assumed the public’s ideology distribution to be unimodal and approximated it as Gaussian. Here we use empirical data to justify this assumption.

The American National Elections Studies (ANES) conducted yearly surveys from 1972 to 2012 asking people’s self-identification on a 7-point scale between extremely liberal and extremely conservative. The self identification shows a unimodal distribution centered at “moderate.” The data were downloaded in November 2015 from http://www.electionstudies.org/studypages/anes_timeseries_cdf/anes_timeseries_cdf.htm.
The histogram for the U.S. public self-identified ideology is shown in Fig. S1. Here, the distribution remains peaked at the moderate position for all years, despite fluctuations, and the Gaussian distribution is a good approximation of this distribution.

Refer to caption
Figure S1: Distribution of self-reported ideology in the ANES data (1972–2012 aggregated) compared with a Gaussian fit. The R2\text{R}^{2} of the Gaussian fit is 0.74.

1.2 Dimensionality of the ideology space

In the main text, we are motivated by the reality that political idealogy in the U.S. is largely one-dimensional. The model can be generalized to higher dimensions by replacing the scalar variables xx, μi\mu_{i}, and σi\sigma_{i} with vectors. Figure S2 shows some example trajectories for the two-party case in a two-dimensional ideology space. In these simulations, for ease of numerical computation, we used discrete time steps and restricted party movements in each period to either vertical or horizontal directions, but the steady states reached are equivalent to those found by integrating the differential equations. The behavior of the system is analogous to that of the one-dimensional problem. For large values of the parameters σi\sigma_{i}, both parties converge to the center. For small-to-moderate σi\sigma_{i}, however, the parties stabilize at a finite separation on opposite sides of a circle (Fig. S2).

Refer to caption
Figure S2: Example trajectories of the two-party system in a two-dimensional ideology space, where ideological positions are denoted by the coordinates (μ1,μ2)(\mu_{1},\mu_{2}). Solid circles denote the steady-state positions of the parties and the colored curves denote transient trajectories from various initial conditions. Curves of the same color indicate the two parties’ trajectories from one set of initial conditions. In these simulations, the parameters are set to σ0=1\sigma_{0}=1 and σ1,2=0.2\sigma_{1,2}=0.2. For simplicity, the simulation is restricted to parties moving parallel to the horizontal or vertical axis at each time step. The dashed circle is included to facilitate visualization of the steady-states positions.

2 Data for the U.S. Democratic and Republican party positions

We use congressional voting records compiled with the Dynamic, Weighted, Nominal Three-Step Estimation (DW-NOMINATE) algorithm to empirically measure the party positions, μ\mu, and the inclusiveness parameter, σ\sigma, for the Democratic and Republican parties. DW-NOMINATE is a multi-dimensional scaling method that first calculates a pairwise distance for every two members of Congress based on similarity in their roll call vote records. It then projects the resulting high-dimensional network of legislators to a low-dimensional space while preserving the pairwise distance relation as much as possible. The legislators’ relative positions in this low-dimensional space are referred to as their ideology scores. This procedure was conducted for the House and Senate separately. The ideology scores of the two chambers were then combined into one dataset, which is the dataset used in our analyses.

We used the version of the House and Senate combined dataset that was last updated in 2015. The data was downloaded from https://legacy.voteview.com/dwnomin.htm in May 2016. The website has since been updated. The new link (as of September 2018) for data download is https://voteview.com/data. A copy of the dataset that the authors downloaded in May 2016 and used in this analysis can be found as a supplementary data file.

According to Poole and Rosenthal [S1], despite the underlying complexity, the roll call votes in the House and the Senate can be organized and explained by no more than two dimensions throughout American history. The first dimension, also called the dominant dimension, is commonly thought of as tapping into the left-right (or liberal–conservative) spectrum on economic matters. The second dimension picks up attitudes on cross-cutting and salient issues (e.g., slavery, civil rights, and social/lifestyle issues).

We use the DW-NOMINATE first dimension score to measure a legislator’s ideological position. To estimate each party’s position, μi\mu_{i}, we use the mean position of a party’s legislators. We also estimate each party’s inclusiveness parameter, σi\sigma_{i}, as proportional to the standard deviation of the legislators’ positions. The second dimension accounts for at most an additional 6%, and at the lowest point an additional 1%, of explanatory power in legislators’ votes [S1].

3 Robustness checks

In this section, we provide evidence that the core conclusions of our paper do not change when using alternative datasets and slightly different model assumptions.

3.1 Validation for the satisficing function from independent data

For a robustness check of our findings, we employed an alternative estimate of the satisficing function parameter σ\sigma using a dataset that is independent of the DW-NOMINATE scores shown in the main text.

The data used is the American National Elections Studies (ANES) 1948-2012 time series dataset, as described in section 1 above. We used three variables from the survey: the self-reported liberal-conservative scale (VCF0803), the feeling thermometer for liberals (VCF0211), and the feeling thermometer for conservatives (VCF0212). Data on these variables are available for 18 years between 1972 and 2012, for 55,674 individuals in total.

The feeling thermometer questions ask participants to report their feeling towards a group on a 0-100 scale. Participants are told that 50–100 means they feel favorably towards the group, 0–50 means they do not feel favorably towards the group, and 50 means they do not feel particularly warm or cold.

We assume that thermometer 50\geq 50 means satisfied with the liberal/conservative position. The self-reported liberal-conservative scale is 1 to 7, where 1 represents most liberal, and 7 most conservative. Fig. S1 shows the aggregated distribution of the data from 1972 to 2012. Given the definition of the 7-point scale, we interpret the target group (liberal and conservative) as positioned at 1 and 7 of this scale, respectively. Then, we calculate the proportion of the population satisfied with the target group as a function of their distance to the group. We fit a Gaussian to this curve and get the parameter σ\sigma from the fit, which corresponds to the width of the satisficing function. Fig. S3 shows, as an example, the behavior of the data in 2012. The two satisficing curves for conservatives and liberals collapse, and are well approximated by a Gaussian function.

Refer to caption
Figure S3: ANES thermometer data from 2012. We assume that thermometer 50\geq 50 means satisfied. (A) Proportion of the population satisfied with liberals and conservatives by their ideological position, where 1 means extremely liberal and 7 means extremely conservative (B) Proportion satisfied as a function of the distance to the target groups (liberals or conservatives), and Gaussian fit. The two curves collapse and can be approximated by a Gaussian function. The best fitting Gaussian has a standard deviation of 3.33.3.
Refer to caption
Figure S4: Comparing σ\sigma fitted from the ANES data and metrics computed from the DW-NOMINATE data. Left: the σ\sigma parameters inferred from the two sources are positively correlated, giving us some confidence in using the DW-NOMINATE standard deviation to approximate the “tolerance” of voters. Right: the parameter σ\sigma fitted from the ANES data is negatively correlated with the distance between parties (i.e., party polarization), which supports the observation made in the main text using σ\sigma derived from the DW-NOMINATE data.

We repeat the analysis for all 18 years of data available. Fig. S4 plots the best fitting σ\sigma value to the ANES data of each year (horizontal axis) against the σ\sigma value estimated from the DW-NOMINATE data of the Congress that includes the same year. The Pearson correlation is 0.510.51 (p=0.03p=0.03). The σ\sigma value estimated from ANES correlates with distance between parties (i.e., party polarization) in the DW-NOMINATE data (Congress data) with Pearson correlation 0.58-0.58 (p=0.01p=0.01).

This analysis serves as a robustness check for the negative relationship between party separation (i.e., political polarization) and the narrowing of the satisficing function (i.e., increasing intra-party ideological homogeneity or reducing inclusiveness).

3.2 Polarization in alternative metrics of party position

The DW-
NOMINATE score has been criticized at times for its computational complexity and for the difficulty of interpreting its axes. Here, we present a simpler and easier-to-understand metric and show that a negative relationship between polarization and inclusiveness is still observed.

We use data from the U.S. House of Representatives’ roll call records for the period 1941–2015. The data were downloaded from https://www.govtrack.us/data/ in March 2016.

We calculate two scores for each representative. For each representative in each Congress, we calculate a Republican alignment score (SRS_{R}). This score measures the proportion of times she or he voted in agreement with the Republican majority position. For example, if 60% of Republican representatives supported a bill, then a representative voting in favor of that bill would be counted as voting in agreement with the Republican majority. More explicitly,

SR=NRN,S_{R}=\frac{N_{R}}{N}, (S1)

where SRS_{R} is the Republican alignment score, NRN_{R} is the number of times voting with the Republican majority, and NN is the total number of votes. Similarly, we also calculate a Democratic alignment score for each representative,

SD=NDN,S_{D}=\frac{N_{D}}{N}, (S2)

where NDN_{D} is the number of times a representative votes with the Democratic majority.

Finally, we combine the two scores into one alternative metric:

S=12(SRSD).S=\frac{1}{2}(S_{R}-S_{D}). (S3)

Results of this analysis are shown in Fig. S5. The polarization trend is observable even using this simple metric (shown in the second column of Fig. S5). The negative relationship between the interparty distance (i.e., party polarization) and party inclusiveness is still present. The Pearson correlation in the DW-NOMINATE data is 0.77-0.77 (p=2×1016p=2\times 10^{-16}), and the Pearson correlation in the alternative metric is 0.59-0.59 (p=2×108p=2\times 10^{-8}). Note that the Republican and Democratic scores (shown in the third and fourth columns of Fig. S5) exhibit asymmetry due to a number of unanimous votes.

Refer to caption
Figure S5: Party polarization trends from alternative metrics. Top row: time history of party positions and standard deviations measured by the various metrics. Bottom row: distance between parties (i.e., party polarization) vs. party inclusiveness based on various metrics. (A, B) DW-NOMINATE data (same as used in Fig. 4 of the main text). (C, D) Alternative metric SS. (E, F) Republican alignment score SRS_{R}. (G, H) Democratic alignment score SDS_{D}.

4 Alternative model that maximizes vote share

In Eq. (3.3) of the main text, we assume that each party acts to maximize its number of votes. But in many cases parties may seek instead to maximize their vote share. We analyze this alternative model and find that similar qualitative conclusions hold; specifically, lower inclusiveness is related to greater party polarization. In the main text we chose to present the vote-maximizing model because it requires fewer assumptions and parameters.

In the alternative model for elections among parties 11 through nn, Equation (3.3) is modified to

dμidt=kμi(Vij=1nVj),\frac{d\mu_{i}}{dt}=k\frac{\partial}{\partial\mu_{i}}\left(\frac{V_{i}}{\sum_{j=1}^{n}V_{j}}\right)\;, (S4)

where nn is the number of parties. Similar expressions for pp can be derived for multi-party systems. For example, in a three-party system,

p1=s1(1s2)(1s3)term 1+12(s1s2(1s3)+s1s3(1s2))term 2+13s1s2s3term 3,p_{1}=\underbrace{s_{1}(1-s_{2})(1-s_{3})}_{\text{term 1}}+\frac{1}{2}\underbrace{\left(s_{1}s_{2}(1-s_{3})+s_{1}s_{3}(1-s_{2})\right)}_{\text{term 2}}+\frac{1}{3}\underbrace{s_{1}s_{2}s_{3}}_{\text{term 3}}\;, (S5)

where the functions s1s_{1}, s2s_{2}, and s3s_{3} are the same satisficing functions as described in the main text. Term 1 gives the expected proportion of voters satisfied with party 11 only. Term 2 gives the proportion satisfied with party 11 and exactly one other party. Term 3 gives the proportion satisfied with all three parties.

In the case where exactly two parties compete, Eq. (S4) predicts that both parties will converge to the median voter’s position, recovering a classic result in political science [S4]. However, this strategy is sensitive to the presence of even very small third parties. In the multi-party version of Eq. (3.3), with a minor third party (characterized by a small σ3\sigma_{3} parameter), the outcome changes: the two major parties diverge from the median and stabilize at a finite distance from each other. Intuitively, because voters who are unsatisfied with either party choose to abstain, in a strictly two-party system, the median voter’s position does not maximize the number of votes for either party, although it does maximize the vote share. It is only with the threat of additional parties capturing lost votes that convergence to the median ceases to be optimal. This prediction is in agreement with previous research that finds that third-party candidates can shape election outcomes [S3]. We also perform analyses similar to those shown in Fig. 4 using a four-party version of Eq. (3.3), consisting of two major parties and two minor parties, and find qualitatively similar results (an example is shown in Fig. S6).

Refer to caption
Figure S6: The comparison of party position over time predicted by the alternative model for maximizing vote share with empirical data. In the alternative model, we assume two major parties (shown), and two minor parties on either side of the major parties (not shown).

References

  • [S1] K.T. Poole, H.L. Rosenthal, Ideology and Congress (Transaction Publishers, 2011).
  • [S2] H. Hermsen, A. Verbeek, Equilibria in multi-party systems, Public Choice, 73, 147 (1992).
  • [S3] Lee, D. J. Anticipating entry: Major party positioning and third party threat, Polit. Res. Quart., 65, 138–150 (2012).
  • [S4] Downs, A. An economic theory of political action in a democracy, J. Polit. Econ., 65 (1957), pp. 135–150.