

Bernard Chazelle, Department of Computer Science, Princeton University, United States, https://orcid.org/0000-0001-8542-0247
Kritkorn Karntikoon, Department of Computer Science, Princeton University, United States, https://orcid.org/0000-0002-6398-3097
Jakob Nogler, Department of Computer Science, ETH Zurich, https://orcid.org/0009-0002-7028-2595

Copyright: Bernard Chazelle, Kritkorn Karntikoon, and Jakob Nogler.
2012 ACM Subject Classification: Theory of computation → Randomness, geometry and discrete structures.
Funding: This work was supported in part by NSF grant CCF-2006125.

The Geometry of Cyclical Social Trends

Bernard Chazelle    Kritkorn Karntikoon    Jakob Nogler
Abstract

We investigate the emergence of periodic behavior in opinion dynamics and its underlying geometry. For this, we use a bounded-confidence model with contrarian agents in a convolution social network. This means that agents adapt their opinions by interacting with their neighbors in a time-varying social network. Being contrarian, the agents are kept from reaching consensus. This is the key feature that allows the emergence of cyclical trends. We show that the systems either converge to a nonconsensual equilibrium or are attracted to periodic or quasi-periodic orbits. We bound the dimension of the attractors and the period of cyclical trends. We exhibit instances where each orbit is dense and uniformly distributed within its attractor. We also investigate the case of randomly changing social networks.

keywords:
opinion dynamics, Minkowski sums, equidistribution, periodicity

1 Introduction

Much of the work in the area of opinion dynamics has focused on consensus and polarization [3, 14]. Typical questions include: How do agents come to agree or disagree? How do exogenous forces drive them to consensus? How long does it take for opinion formation to settle? Largely left out of the discussion has been the emergence of cyclical trends. Periodic patterns in opinions and preferences constitute a complex, multifactorial social phenomenon beyond the scope of this work [24]. A question worth examining, however, is whether the process conceals deeper mathematical structure. The purpose of this work is to show that this is, indeed, the case.

This work began with a thought experiment and a computer simulation. The latter revealed highly unexpected behavior, which in turn compelled us to search for an explanation. Our main result is a proof that adding a simple contrarian rule to the classic bounded-confidence model suffices to produce quasi-periodic trajectories. The model is a slight variant of the classic HK framework: a finite collection of agents hold opinions on several topics, which they update at discrete time steps by consulting their neighbors in a (time-varying) social network. The modification is the addition of a simple repulsive force field that keeps agents away from tight consensus. The idea is partly inspired by swarming dynamics. For example, birds refrain from flocking too closely. Likewise, near-consensus on a large enough scale tends to induce contrarian reactions among agents [1, 20]. Some political scientists have pointed to contrarianism as one of the reasons for the closeness of some national elections [19, 18].

One of the paradoxical observations we sought to elucidate was why cyclic trends in social networks seem oblivious to the initial opinions of one’s friends: specifically, it is not specific distributions of initial opinions that produce oscillations but, rather, the recurrence of certain symmetries in the networks. We prove that the condition is sufficient (though its necessity is still open). Another mystery was why contrarian opinions tend to orbit toward an attractor whose dimensionality is independent of the number of opinions held by a single agent. These attracting sets are typically Minkowski sums of ellipses. They emerge algorithmically and constitute a natural focus of interest in distributed computational geometry.

Our inquiry builds on the pioneering work of French [16], DeGroot [10], Friedkin & Johnsen [17], and Deffuant et al. [9]. The model we use is a minor modification of the bounded-confidence model [5, 21]. A Hegselmann-Krause (HK) system consists of $n$ agents, each one represented by a point in $\mathbb{R}^d$. The $d$ coordinates for each agent $i$ represent their current opinions on $d$ different topics: thus, $d$ is the dimension of the opinion space. At any (discrete) time, each agent $i$ moves to the mass center of the agents within a fixed distance $r_i$, which represents its radius of influence (Fig. 1). This step is repeated ad infinitum. Formally, the agents are positioned at $x_1(t),\ldots,x_n(t)\in\mathbb{R}^d$ at time $t$ and, for any $t=0,1,2,\ldots\,$,

$$x_i(t+1)=\frac{1}{|\mathcal{N}_i(t)|}\sum_{j\in\mathcal{N}_i(t)}x_j(t)\,,\quad\text{with }\mathcal{N}_i(t)=\bigl\{\,1\leq j\leq n\,:\,\|x_i(t)-x_j(t)\|_2\leq r_i\,\bigr\}. \tag{1}$$
Figure 1: The evolution of 20,000 random points in an HK system.
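For readers who wish to reproduce this behavior, the following is a minimal sketch of the update (1), under the simplifying assumption of a common radius of influence $r_i=R$; the agent count, radius, and number of steps are illustrative choices, not values from the paper.

```python
import numpy as np

def hk_step(x, radii):
    """One synchronous step of Eq. (1): each agent moves to the mass center of its neighbors."""
    diffs = x[:, None, :] - x[None, :, :]            # pairwise opinion differences
    dist = np.linalg.norm(diffs, axis=2)             # n-by-n distance matrix
    nbrs = dist <= radii[:, None]                    # N_i(t); note that i is always its own neighbor
    return nbrs @ x / nbrs.sum(axis=1, keepdims=True)

rng = np.random.default_rng(0)
x = rng.normal(size=(500, 2))                        # 500 agents, d = 2 topics
radii = np.full(500, 0.5)                            # common radius of influence R = 0.5
for _ in range(60):
    x = hk_step(x, radii)
print(len(np.unique(np.round(x, 6), axis=0)), "clusters after 60 steps")
```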

Interpreting each $\mathcal{N}_i(t)$ as the set of neighbors of agent $i$ defines the social network $G_t$ at time $t$. In the special case where all the radii of influence are equal ($r_i=R$), convergence into fixed-point clusters occurs within a polynomial number of steps [4, 13, 25]. Computer simulation suggests that the same remains true even when the radii differ, but a proof has remained elusive. For cyclical trends to emerge, the social networks require a higher degree of underlying structure. In this work, we assume vertex transitivity (via Cayley graphs), which stipulates that agents cannot be distinguished by their local environment. Before defining the model formally in the next section, we summarize our main findings.

  • Undirected networks always drive the agents to nonconsensual convergence, i.e., to fixed points at which they “agree to disagree.” For their behavior to become periodic or quasi-periodic, the social networks need to be directed. We prove that such systems either converge or are attracted to periodic or quasi-periodic orbits. We give precise formulas for the orbits.

  • We investigate the geometry of the attractors (Fig. 2). We bound the rotation number, which indicates the speed at which (quasi)-periodic opinions undergo a full cycle. We exhibit instances where each limiting orbit forms a set that is dense and, in fact, uniformly distributed on its attractor.

  • We explore the case of social networks changing randomly at each step. We prove the surprising result that the dimension of the attractor can decrease because of the randomization. This is a rare case where adding entropy to a system can reduce its dimensionality.

The dynamics of contrarian views has been studied before [1, 11, 15, 19, 18, 20, 26] but, to our knowledge, not for the purpose of explaining cyclical trends. Our mathematical findings can be viewed as a grand generalization of the affine-invariant evolution of planar polygons studied in [6, 8, 12, 22].

Figure 2: Typical attractors.

2 Contrarian Opinion Dynamics

The social network is a time-dependent Cayley graph over an abelian group. Every finite abelian group is isomorphic to a direct sum of cyclic groups $(\mathbb{Z}/n_1\mathbb{Z})\oplus\cdots\oplus(\mathbb{Z}/n_m\mathbb{Z})$. For notational convenience, we set $n_i=n$. We regard the toral grid $V=(\mathbb{Z}/n\mathbb{Z})^m$ as a vector space, and we write $N=|V|=n^m$. Let $x_v(t)$ be the position of agent $v$ in $\mathbb{R}^d$ at time $t$. We fix $x_v(0)$ and abbreviate it as $x_v$. Choose $p$ such that $1/N<p<1$ and let $(C_t)_{t\geq 0}$ be an infinite sequence of subsets of $V$. For technical convenience, we assume that each set $C_t$ spans the vector space $V$; hence $|C_t|\geq m$. In the spirit of HK systems, we define the dynamics as follows: for $t=0,1,\ldots,$

$$x_v(t+1)=p\,x_v(t)+\frac{1-p}{|C_t|}\sum_{w\in v+C_t}x_w(t). \tag{2}$$

Because of the presence of the “self-confidence” weight $p$, we may assume that the convolution set $C_t$ does not contain the origin $\mathbf{0}$. If we view each $x_v(t)$ as a row vector in $\mathbb{R}^d$, the update (2) specifies an $N$-by-$N$ stochastic matrix $F_{C_t}$. Let $x(t)$ denote the $N$-by-$d$ matrix whose rows are the $N$ agent positions $x_v(t)$, for $v\in V$. We have $x(t+1)=F_{C_t}x(t)$. The matrix $F_{C_t}$ may not be symmetric, but it is always doubly stochastic. This means that the mass center $\mathbf{1}^{\top}x(t)/N$ is time-invariant. Since the dynamics itself is translation-invariant, we are free to move the mass center to the origin, which we do by assuming $\mathbf{1}^{\top}x=\mathbf{0}^{\top}$, where $x$ denotes $x(0)$.

Obviously, some initial conditions are uninteresting: for example, $x=\mathbf{0}$. For this reason, we choose $x$ randomly; specifically, each $x_v$ is picked iid from the $d$-dimensional normal distribution $\mathcal{N}(\mathbf{0},1)$. In the following, we use the phrase “with high probability” to refer to an event occurring with probability at least $1-\varepsilon$, for any fixed $\varepsilon>0$. Once we have picked the matrix $x$ randomly, we place the mass center of the agents at the origin by subtracting its displacement from the origin: $x\leftarrow x-\frac{1}{N}\mathbf{1}\mathbf{1}^{\top}x$.

The agents will be attracted to the origin and form a single-point cluster of consensus in the limit. Responding to their contrarian nature, the agents will restore mutual differences by boosting their own opinions. For that reason, we consider the scaled dynamics: $y(0)=x$ and, for $t\geq 0$,

$$y(t+1)=\xi_t\,F_{C_t}\,y(t), \tag{3}$$

where $\xi_t$ is chosen so that the diameter of the system remains roughly constant. Since scaling leaves the salient topological and geometric properties of the dynamics unchanged, the precise definition of $\xi_t$ can vary to fit analytical (or even visual) needs.
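As a concrete illustration, here is a minimal simulation sketch of the update (2) together with the rescaling (3); the explicit matrix construction and the crude diameter normalization used for $\xi_t$ are implementation choices of ours, not part of the model's specification.

```python
import itertools
import numpy as np

def convolution_matrix(n, m, C, p):
    """The N-by-N doubly stochastic matrix F_C of Eq. (2), with N = n**m."""
    V = list(itertools.product(range(n), repeat=m))
    idx = {v: i for i, v in enumerate(V)}
    F = p * np.eye(len(V))
    for v in V:
        for c in C:
            w = tuple((vi + ci) % n for vi, ci in zip(v, c))
            F[idx[v], idx[w]] += (1 - p) / len(C)
    return F

n, m, d, p = 29, 2, 3, 0.25
F = convolution_matrix(n, m, [(1, 0), (0, 1)], p)
rng = np.random.default_rng(1)
x = rng.normal(size=(n**m, d))
x -= x.mean(axis=0)                    # place the mass center at the origin
y = x.copy()
for t in range(300):
    y = F @ y                          # Eq. (2) in matrix form
    y *= np.ptp(x) / np.ptp(y)         # crude xi_t: keep the diameter roughly constant
```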

2.1 Preliminaries

We define the directed graph $G_{C_t}=(V,E_t)$ at time $t\geq 0$, where $E_t=\{(v,v+c)\,|\,v\in V,\,c\in C_t\}$; the set of out-edges of $v$ is $N_v=\{(v,w)\,|\,w\in v+C_t\}$. For clarity, we drop the subscript $t$ for the remainder of this section; so we write $C$ for $C_t$.

Lemma 2.1.

The convolution set $C$ spans the vector space $V$ if and only if the graph $G_C$ is strongly connected.

Proof 2.2.

If $C$ spans $V$, then for any pair $u,v\in V$, there exist $a_h\in\mathbb{Z}/n\mathbb{Z}$, for each $h\in C$, such that $v-u=\sum_{h\in C}a_h h$. The right-hand side specifies $\sum_h a_h$ edges (sum taken over $\mathbb{N}$) that form a path from $u$ to $v$; therefore $G_C$ is strongly connected. Conversely, assuming the latter, there is a path from $u$ to $v$: $(w_1,w_2),\ldots,(w_{k-1},w_k)$, with $w_1=u$ and $w_k=v$. Thus, $v-u=\sum_i c_i$, where $c_i=w_i-w_{i-1}\in C$; therefore $C$ spans $V$.

Our assumption about $C$ implies that each $G_C$ is strongly connected. The presence of the weight $p>0$ in (2) ensures that the diagonal of $F_C$ is positive. Together with the strong connectivity assumption, this makes the matrix $F_C$ primitive, meaning that $F_C^k>0$ for some $k>0$. By the Perron-Frobenius theorem [27], all the eigenvalues of $F_C$ lie strictly inside the unit circle in $\mathbb{C}$, except for the dominant eigenvalue 1, which has multiplicity 1. For any $u,v\in V$, we write $\psi_u^v=\omega^{\langle u,v\rangle}$, where $\omega:=e^{2\pi i/n}$. We define the vector $\psi^v=(\psi_u^v\,|\,u\in V)$ and easily verify that $\{\psi^v\,|\,v\in V\}$ forms an orthogonal eigenbasis for $F_C$. The eigenvalue $\lambda_v$ corresponding to $\psi^v$ satisfies

$$\lambda_v\psi_u^v=p\,\psi_u^v+\frac{1-p}{|C|}\sum_{w\in u+C}\psi_w^v=p\,\psi_u^v+\frac{1-p}{|C|}\Big(\sum_{h\in C}\psi_h^v\Big)\psi_u^v\,.$$

We conclude:

Lemma 2.3.

Each $v\in V$ corresponds to a distinct eigenvector $\psi^v$, which together form an orthogonal basis for $\mathbb{C}^N$. The corresponding eigenvalue is given by

$$\lambda_v=p+\frac{1-p}{|C|}\sum_{h\in C}\omega^{\langle v,h\rangle}.$$

We define $\lambda=\max_{v\in V}\{|\lambda_v|<1\}$ and denote by $W=\{v\in V:|\lambda_v|=\lambda\}$ the set of subdominant eigenvectors. The argument of $\lambda_v$ plays a key role in our discussion, so we define $\theta_v$ such that $\lambda_v=|\lambda_v|\,\omega^{\theta_v}$, with $\theta_v\in(-n/2,n/2]$. By (6), $\lambda_v\neq 0$ for $v\in W$, so $\theta_v$ is well defined.
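Lemma 2.3 makes the spectrum easy to compute directly. The sketch below evaluates the eigenvalues $\lambda_v$, the subdominant modulus $\lambda$, the set $W$, and the angles $\theta_v$ for a small example; the tolerance used to compare moduli is an implementation choice.

```python
import itertools
import numpy as np

def spectrum(n, m, C, p, tol=1e-9):
    """Eigenvalues of F_C via Lemma 2.3, plus the subdominant modulus, W, and theta_v."""
    omega = np.exp(2j * np.pi / n)
    V = list(itertools.product(range(n), repeat=m))
    lam = {v: p + (1 - p) / len(C) * sum(omega ** (np.dot(v, h) % n) for h in C) for v in V}
    sub = max(abs(l) for v, l in lam.items() if abs(l) < 1 - tol)   # the subdominant modulus lambda
    W = [v for v, l in lam.items() if abs(abs(l) - sub) < tol]
    theta = {v: np.angle(lam[v]) * n / (2 * np.pi) for v in W}      # lambda_v = |lambda_v| * omega**theta_v
    return lam, sub, W, theta

lam, sub, W, theta = spectrum(29, 2, [(1, 0), (0, 1)], p=0.25)
print(sub, sorted(W), theta)
```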

2.2 The evolution of opinions

We begin with the case of a fixed convolution set $C_t=C$. The initial position of the agents is expressed in eigenspace as $x=\frac{1}{N}\sum_{v\in V}\psi^v(\psi^v)^{\mathrm{H}}x$. Let $z_v$ denote the row vector $(\psi^v)^{\mathrm{H}}x=\sum_{u\in V}\omega^{-\langle v,u\rangle}x_u$. Because $(\psi^v)^{\mathrm{H}}x=\mathbf{1}^{\top}x=\mathbf{0}^{\top}$, for $v=\mathbf{0}\in V$,

$$x(t)=\frac{1}{N}\sum_{v\in V\setminus\{\mathbf{0}\}}\lambda_v^t\,\psi^v z_v. \tag{4}$$
Lemma 2.4.

With high probability, for all $v\neq\mathbf{0}$,

$$\Omega\big(\sqrt{1/N}\,\big)=\|z_v\|_2=O\big(\sqrt{dN\log dN}\,\big).$$
Proof 2.5.

Let $a=(a_u)_{u\in V}$ be the first column of the matrix $x$. For each $u\in V$, by the initialization of the system, $a_u=\zeta_u-\delta$, where $\zeta_u\sim\mathcal{N}(0,1)$ and $\delta=\frac{1}{N}\mathbf{1}^{\top}\zeta$. Given $v\neq\mathbf{0}$, $\psi^v$ is orthogonal to $\psi^{\mathbf{0}}=\mathbf{1}$; hence $(\psi^v)^{\mathrm{H}}a=(\psi^v)^{\mathrm{H}}(\zeta-\delta\mathbf{1})=(\psi^v)^{\mathrm{H}}\zeta$. Since the random vector $\zeta$ is unbiased and $|\omega^{-\langle v,u\rangle}|=1$, it follows that $\mathrm{var}\,[(\psi^v)^{\mathrm{H}}a]=\sum_{u\in V}\mathrm{var}\,\zeta_u=N$. Thus, the first coordinate $z_{v,1}$ of $z_v$ is of the form $a+ib$, where $a$ and $b$ are sampled (not independently) from $\mathcal{N}(0,\sigma_1^2)$ and $\mathcal{N}(0,\sigma_2^2)$, respectively, such that $\sigma_1^2+\sigma_2^2=N$. Thus, $|z_{v,1}|\leq\delta$ with probability at most $2\delta/\sqrt{\pi N}$. Conversely, by the inequality $\mathrm{erfc}(z)\leq e^{-z^2}$ for $z>0$, we find that $|z_{v,1}|=O(\sqrt{N\log(dN/\varepsilon)}\,)$, with probability at least $1-\varepsilon/dN$, for any $0<\varepsilon<1$; hence $\|z_v\|_2=O(\sqrt{dN\log(dN/\varepsilon)}\,)$, with probability at least $1-\varepsilon/N$. Setting $\delta=\varepsilon\sqrt{\pi/4N}$ and using a union bound completes the proof.

We upscale the system by setting $\xi_t=1/\lambda$; hence $y(t+1)=F_C\,y(t)/\lambda$.

Theorem 2.6.

Let $a_h$ and $b_h$ be the row vectors whose $u$-th coordinates ($u\in V$) are $\cos(2\pi\langle h,u\rangle/n)$ and $\sin(2\pi\langle h,u\rangle/n)$, respectively. With high probability, for each $v\in V$, the agent $v$ is attracted to the trajectory of $y_v^*(t)$, where

$$y_v^*(t)=\frac{1}{N}\sum_{h\in W}\left(\cos\frac{2\pi(t\theta_h+\langle h,v\rangle)}{n}\,,\;\sin\frac{2\pi(t\theta_h+\langle h,v\rangle)}{n}\right)\begin{pmatrix}a_h\\ b_h\end{pmatrix}x. \tag{5}$$

Let $\mu:=\max\{|\lambda_v|/\lambda<1\}$ be the third largest (upscaled) eigenvalue, measured in distinct moduli. The error of the approximation decays exponentially fast as a function of $\mu$:

$$\frac{\|y_v^*(t)-y_v(t)\|_F}{\|y_v(t)\|_F}=O\big(\mu^t N^2\sqrt{d\log dN}\,\big).$$
Proof 2.7.

Since the eigenvalues sum up to $\mathrm{tr}\,F_C=pN$ and 1 has multiplicity 1, we have $pN\leq 1+(N-1)\lambda$; hence, by $p>1/N$,

$$\lambda\geq\frac{pN-1}{N-1}>0. \tag{6}$$

Writing $\mu_v=\lambda_v/\lambda$ and $\mu=\max\{|\mu_v|<1\}$, we have $|\mu_v|=1$ for $v\in W$; recall that $W=\{v\in V:|\lambda_v|=\lambda\}$. By (4), it follows that

$$y(t)=\frac{1}{N}\sum_{v\in W}\mu_v^t\,\psi^v z_v+\eta(t), \tag{7}$$

where, by Lemma 2.4, with high probability,

$$\|\eta(t)\|_F=\Big\|\frac{1}{N}\sum_{v\in V\setminus(W\cup\{\mathbf{0}\})}\mu_v^t\,\psi^v z_v\Big\|_F\leq\frac{1}{N}\sum_{v\in V\setminus(W\cup\{\mathbf{0}\})}\mu^t\,\|\psi^v\|_2\,\|z_v\|_2=O\big(\mu^t N\sqrt{d\log dN}\,\big).$$

The lower bound of the lemma implies that, for any $v\in W$,

$$\Big\|\sum_{v\in W}\mu_v^t\,\psi^v z_v\Big\|_F^2=\mathrm{tr}\,\Big(\sum_{v\in W}\mu_v^t\,\psi^v z_v\Big)^{\mathrm{H}}\Big(\sum_{v\in W}\mu_v^t\,\psi^v z_v\Big)=\mathrm{tr}\,\Big\{\sum_{v\in W}z_v^{\mathrm{H}}(\psi^v)^{\mathrm{H}}\psi^v z_v\Big\}=N\,\mathrm{tr}\,\Big\{\sum_{v\in W}z_v^{\mathrm{H}}z_v\Big\}=N\sum_{v\in W}\|z_v\|_2^2\geq\Omega(1).$$

For large enough $t=\Omega\big(\log(dN)/\log(1/\mu)\big)$, the sum in (7) dominates $\eta(t)$ with high probability, while the latter decays exponentially fast. Thus the dynamics $y(t)$ is asymptotically equivalent to $y^*(t)=\frac{1}{N}\sum_{v\in W}\mu_v^t\,\psi^v z_v$. Recall that $\lambda_v=|\lambda_v|\,\omega^{\theta_v}$; since, for $v\in W$, $\mu_v=\lambda_v/\lambda$ has modulus 1, it is equal to $\omega^{\theta_v}$. This implies that $y_v^*(t)=\frac{1}{N}\sum_{h\in W}\sum_{u\in V}\omega^{t\theta_h+\langle h,v-u\rangle}x_u$. Because $y_v^*(t)$ is real, we can ignore the imaginary part when expanding the expression above, which completes the proof.

2.3 Geometric investigations

The trajectory $y_v^*(t)$ is called the limiting orbit. (The phase space of the dynamical system is $\mathbb{R}^{dN}$, but by abuse of notation we use the word “orbit” to refer to the trajectory of a single agent, which lies in $\mathbb{R}^d$.) Theorem 2.6 indicates that, with high probability, every orbit is attracted to its limiting form at an exponential rate, so we may focus on the latter. Given the initial placement $x$ of the agents, all the limiting orbits lie in the set $\mathbb{S}$, expressed in parametric form by

$$\mathbb{S}=\frac{1}{N}\sum_{h\in W}\big\{(a_h x)\cos X_h+(b_h x)\sin X_h\big\}. \tag{8}$$

Recall that $a_h x$ and $b_h x$ are row vectors in $\mathbb{R}^d$. The attractor $\mathbb{S}$ is the Minkowski sum of a number of ellipses. We examine the geometric structure of $\mathbb{S}$ and explain how the limiting orbits embed into it. To do that, we break up the sum (5) into three parts. Given $h\in W$, we know that $\lambda_h\neq 0$ by (6), so there remain the following cases for the subdominant eigenvalues:

  • real $\lambda_h>0$: the contribution to the sum is $c_v x$, where $c_v$ is the row vector

    $$c_v:=\frac{1}{N}\sum_{h\in W:\,\theta_h=0}\Big\{\,a_h\cos\frac{2\pi\langle h,v\rangle}{n}+b_h\sin\frac{2\pi\langle h,v\rangle}{n}\,\Big\}. \tag{9}$$

  • real $\lambda_h<0$: the contribution is $(-1)^t d_v x$, where, likewise, $d_v$ is the row vector

    $$d_v:=\frac{1}{N}\sum_{h\in W:\,\theta_h=n/2}\Big\{\,a_h\cos\frac{2\pi\langle h,v\rangle}{n}+b_h\sin\frac{2\pi\langle h,v\rangle}{n}\,\Big\}. \tag{10}$$

  • nonreal $\lambda_h$: we can assume that $\theta_h>0$ since the conjugate eigenvalue $\bar{\lambda}_h=\lambda_{-h}$ is also present in $W$. The contribution of an eigenvalue is the same as that of its conjugate since $a_h=a_{-h}$ and $b_h=-b_{-h}$. So the contribution of a given $\theta>0$ is equal to $e_{v,\theta}\,x$, where

    $$e_{v,\theta}:=\frac{2}{N}\sum_{h\in W:\,\theta_h=\theta}\left\{\,a_h\cos\frac{2\pi(t\theta+\langle h,v\rangle)}{n}+b_h\sin\frac{2\pi(t\theta+\langle h,v\rangle)}{n}\,\right\},$$

    which we can expand as $a_{v,\theta}\cos\frac{2\pi\theta t}{n}+b_{v,\theta}\sin\frac{2\pi\theta t}{n}$, where, with $R(\alpha)=\begin{pmatrix}\cos\alpha&-\sin\alpha\\ \sin\alpha&\cos\alpha\end{pmatrix}$ denoting the rotation by $\alpha$,

    $$\begin{pmatrix}a_{v,\theta}\\ b_{v,\theta}\end{pmatrix}=\frac{2}{N}\sum_{h\in W:\,\theta_h=\theta}R\left(\frac{-2\pi\langle h,v\rangle}{n}\right)\begin{pmatrix}a_h\\ b_h\end{pmatrix}. \tag{11}$$

Putting all three contributions together, we find

$$y_v^*(t)=c_v x+(-1)^t d_v x+\sum_{\theta\in\vartheta}\Big\{\,a_{v,\theta}\cos\frac{2\pi\theta t}{n}+b_{v,\theta}\sin\frac{2\pi\theta t}{n}\,\Big\}x, \tag{12}$$

where $\vartheta$ is the set of distinct $\theta_h>0$ for $h\in W$, and all other quantities are defined in (9), (10), (11). See Figure 3 for an illustration of a doubly-elliptical orbit around its torus-like attractor.

Figure 3: Two orbits of a single agent around its attractor.
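Formula (5) can be evaluated directly. The sketch below assumes the spectrum() helper and the initial matrix x from the earlier snippets and returns the point $y_v^*(t)$ for a given agent $v$ and time $t$.

```python
import itertools
import numpy as np

def limiting_orbit(n, m, C, p, x, v, t):
    """Evaluate y_v^*(t) of Eq. (5) for agent v at time t; x has shape (N, d)."""
    _, _, W, theta = spectrum(n, m, C, p)              # helper from the earlier sketch
    V = list(itertools.product(range(n), repeat=m))
    N = len(V)
    y = np.zeros(x.shape[1])
    for h in W:
        ang = 2 * np.pi / n * np.array([np.dot(h, u) % n for u in V])
        a_h, b_h = np.cos(ang) @ x, np.sin(ang) @ x    # the row vectors a_h x and b_h x
        phase = 2 * np.pi * (t * theta[h] + np.dot(h, v)) / n
        y += (np.cos(phase) * a_h + np.sin(phase) * b_h) / N
    return y

# e.g., with the matrix x generated in the simulation sketch above:
print(limiting_orbit(29, 2, [(1, 0), (0, 1)], 0.25, x, v=(0, 0), t=100))
```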

2.3.1 Generic elliptical attraction

We prove that, for almost all values of the self-confidence weight $p$, the set $W$ generates either a single real eigenvalue or two complex conjugate ones. By (12), this shows that almost every limiting orbit is either a single fixed point or a subset of an ellipse in $\mathbb{R}^d$.

Theorem 2.8.

There exists a set $\Lambda$ of at most $\binom{N}{2}$ reals such that the set $W$ is associated with either a single real eigenvalue or two complex conjugate ones, for any $p\in(1/N,1)\setminus\Lambda$.

The system is called regular if $p\in(1/N,1)\setminus\Lambda$. For such a system, either (i) $\vartheta=\{\theta\}$ and $c_v=d_v=\mathbf{0}$, or (ii) $\vartheta=\emptyset$ and exactly one of $c_v$ or $d_v$ equals $\mathbf{0}$. In other words, by (12), we have three cases depending on the subdominant eigenvalues:

$$y_v^*(t)=\begin{cases}\;c_v x &:\ \text{real positive}\\ \;(-1)^t d_v x &:\ \text{real negative}\\ \;\big(\,a_{v,\theta}\cos\frac{2\pi\theta t}{n}+b_{v,\theta}\sin\frac{2\pi\theta t}{n}\,\big)x &:\ \text{conjugate pair}.\end{cases} \tag{13}$$
Lemma 2.9.

Consider a triangle $abc$ and let $e=pc+(1-p)a$ and $f=pc+(1-p)b$. Let $O$ be the origin and assume that the segments $Oe$ and $Of$ are of the same length (Fig. 4); then the identity $|a|^2-|b|^2=\frac{2p}{1-p}\,(b-a)\cdot c$ holds.

Proof 2.10.

Let $d:=\frac{1}{2}(e+f)$ be the midpoint of $ef$. The segment $Od$ lies on the perpendicular bisector of $ef$, so it is orthogonal to $ef$, hence to $ab$. Thus, $d\cdot(b-a)=0$. Since $d=\frac{1}{2}\big(2pc+(1-p)a+(1-p)b\big)$, the lemma follows from $\big(2pc+(1-p)(a+b)\big)\cdot(b-a)=0$.

Figure 4: A triangle identity.
Proof 2.11 (Proof of Theorem 2.8).

Pick two distinct $u,v\in W$. Applying Lemma 2.9 in the complex plane, we set $a=\frac{1}{|C|}\sum_{h\in C}\omega^{\langle u,h\rangle}$, $b=\frac{1}{|C|}\sum_{h\in C}\omega^{\langle v,h\rangle}$, and $c=1$; thus $e=\lambda_u$ and $f=\lambda_v$, which implies that the segments $Oe$ and $Of$ are of the same length. Abusing notation by treating $a,b,c$ as both vectors and complex numbers, we have $(b-a)\cdot c=\Re(b-a)$; therefore,

$$\big(2\Re(b-a)+|a|^2-|b|^2\big)\,p=|a|^2-|b|^2.$$
  1. If $2\Re(b-a)+|a|^2-|b|^2=0$, then $|a|=|b|$, which in turn implies that $\Re(b-a)=0$; hence $a=\bar{b}$ and $\lambda_u=\bar{\lambda}_v$.

  2. If $2\Re(b-a)+|a|^2-|b|^2\neq 0$, then $p$ is unique: $p=p_{u,v}$.

We form $\Lambda$ by including all of the numbers $p_{u,v}$, with $u,v\in W$.

In some cases, regularity holds with no need to exclude values of pp:

Theorem 2.12.

If $C$ forms a basis of $V$ and $n$ is prime, then $|W|=2m$ and $W$ produces exactly two eigenvalues: $p+\frac{1-p}{m}(m-1+\omega)$ and its conjugate.

Proof 2.13.

By Lemma 2.3, $\lambda_v=p+\frac{1-p}{|C|}\sum_{h\in C}\omega^{\langle v,h\rangle}$. Fix a nonzero $v\in V$. Because $n$ is prime and the vectors $h_1,\ldots,h_m$ from $C$ form a basis over the field $\mathbb{Z}/n\mathbb{Z}$, the $m$-by-$m$ matrix whose $i$-th row is $h_i$ is nonsingular. This implies that, in the sum $\sum_{h\in C}\omega^{\langle v,h\rangle}$, the exponent sequence $(1,0,\ldots,0)$ appears for exactly one value $v\in V$, and the same is true of $(-1,0,\ldots,0)$. This holds true of any one-hot vector with a single $\pm 1$ at any place and 0 elsewhere. This means that, for $2m$ values of $v$, the eigenvalue $\lambda_v$ is of the form $p+\frac{1-p}{m}(m-1+\omega)$ or its conjugate. Simple examination shows that these are precisely the subdominant eigenvalues.

2.3.2 The case of cycle convolutions

It is useful to consider the case of a single cycle: $m=1$. For convenience, we momentarily assume that $n$ is prime and that $\sum_{h\in C}h\neq 0\pmod{n}$; both assumptions will be dropped in subsequent sections.

Lemma 2.14.

Each eigenvalue $\lambda_v$ is simple.

Proof 2.15.

Because $n$ is prime, the cyclotomic polynomial for $\omega$ is $\Phi(z)=z^{n-1}+z^{n-2}+\cdots+z+1$. It is the minimal polynomial for $\omega$, which is unique. Note that $\langle v,h\rangle=vh$, since $m=1$. Given $v\in V$, we define the polynomial $g_v(z)=\sum_{h\in C}z^{vh}$ in the quotient ring of rational polynomials $\mathbb{Q}[z]/(z^n-1)$. Sorting the summands by degree modulo $n$, we have $g_v(z)=\sum_{k=0}^{n-1}q_{v,k}z^k$, for nonnegative integers $q_{v,k}$, where $\sum_k q_{v,k}=|C|$. If $\lambda_v=\lambda_u$, for some $u\in V$, then, by Lemma 2.3, $g_v(\omega)=g_u(\omega)$; hence $\Phi$ divides $g_v-g_u$. Because the latter is of degree at most $n-1$, it is either identically zero or equal to $\Phi$ up to a rational factor $r\neq 0$. In the second case,

$$(q_{v,n-1}-q_{u,n-1})z^{n-1}+\cdots+(q_{v,1}-q_{u,1})z+q_{v,0}-q_{u,0}=r\,\Phi.$$

This implies that $q_{v,k}-q_{u,k}=r\neq 0$, for all $0\leq k<n$, which contradicts the fact that $\sum_k q_{v,k}=\sum_k q_{u,k}=|C|$; therefore, $g_v=g_u$.

  1. If $v=0$, then $g_v(z)=|C|$; hence $g_u(z)=|C|$ and $u=0$, i.e., $v=u$.

  2. If $v\neq 0$, then let $S_v=\{\omega^{vh}\,|\,h\in C\}$. Because $\mathbb{Z}/n\mathbb{Z}$ is a field, the $|C|$ roots of unity in $S_v$ are distinct; hence $q_{v,k}\in\{0,1\}$. It follows that $S_v=S_u$ and $|S_v|=|S_u|=|C|$; therefore, for some permutation $\sigma$ of order $|C|$, we have $vh=u\sigma(h)$, for all $h\in C$. Summing up both sides over $h\in C$ gives us $v\sum_{h\in C}h=u\sum_{h\in C}h\pmod{n}$; hence $v=u$, since $\sum_{h\in C}h\neq 0\pmod{n}$.

By (13), the limiting orbit is of the form $y_v^*(t)=c_v x$ or $y_v^*(t)=(-1)^t d_v x$ if the subdominant eigenvalue is real. Otherwise, the orbit of an agent approaches a single ellipse in $\mathbb{R}^d$: for some $\theta>0$, $y_v^*(t)=\big(\,a_{v,\theta}\cos\frac{2\pi\theta t}{n}+b_{v,\theta}\sin\frac{2\pi\theta t}{n}\,\big)x$.

2.3.3 Opinion velocities

Assume that the system is regular, so $W$ is associated with either a single real eigenvalue or two complex conjugate ones. If $\vartheta=\emptyset$, by (12), every agent converges to a fixed point of the attractor $\mathbb{S}$ or its limiting orbit has a period of 2. The other case, $\vartheta=\{\theta\}$, is more interesting. The agent approaches its limiting orbit, which is periodic or quasi-periodic. The rotation number, $\alpha:=\theta/n$, is the (average) fraction of a whole rotation covered in a single step. It measures the speed at which the agent cycles around its orbit. We prove a lower bound on that speed (its upper bound is $1/2$).

Theorem 2.16.

The rotation number $\alpha$ of a regular system satisfies $\alpha\geq\frac{1-p}{n}\left(\frac{1}{2N}\right)^{N}$.

Proof 2.17.

Of course, this assumes that $\vartheta\neq\emptyset$. Fix $v\in V$ and let $\beta_v=\sum_{w\in C}\big(\omega^{\langle v,w\rangle}-\omega^{-\langle v,w\rangle}\big)$; further, assume that $\beta_v$ is nonzero, hence imaginary. We have $\beta_v\,\psi_u^v=g_u^{\top}\psi^v$, where $g_u$ is a vector in $\{-1,0,1\}^N$. It follows that $\beta_v\,\psi^v=A\psi^v$, for an $N$-by-$N$ matrix $A$ whose nonzero elements are $\pm 1$ and whose rows are given by $g_u^{\top}$. Thus, $\beta_v$ is an imaginary eigenvalue of $A$; hence a complex root of the characteristic polynomial $\det(A-\gamma\mathbb{I})$. Let $r\geq 1$ be the rank of $A$ and let $\gamma_1,\ldots,\gamma_r$ be its nonzero eigenvalues. Expansion of the determinant gives us a sum of monomials of the form $b_i\gamma^{l_i}$, for $1\leq i\leq 2^N N!$, where $b_i\in\{-1,0,1\}$. The subset of them given by $l_i=N-r$ adds up to the product of the nonzero eigenvalues (times $\pm\gamma^{N-r}$); hence $\prod_{i=1}^r|\gamma_i|\geq 1$. Label $\gamma_1$ the nonzero eigenvalue of smallest modulus. The sum of the squared moduli of the eigenvalues of a matrix is at most the square of its Frobenius norm; hence no eigenvalue of $A$ can have a modulus larger than $\sqrt{2N|C|}$ and, therefore,

$$|\beta_v|\geq|\gamma_1|=\frac{\prod_{i=1}^r|\gamma_i|}{\prod_{i=2}^r|\gamma_i|}\geq\left(\frac{1}{2N|C|}\right)^{\frac{r-1}{2}}. \tag{14}$$

Since $h\in W$, it follows from (6) that $0<\lambda=|\lambda_h|<1$. Thus,

$$|\theta_h|\geq|\sin\theta_h|=\frac{|\Im(\lambda_h)|}{|\lambda_h|}\geq\frac{1-p}{|C|}\,\Big|\Im\Big\{\sum_{w\in C}\omega^{\langle h,w\rangle}\Big\}\Big|\geq\frac{1-p}{2|C|}\,|\beta_h|\,.$$

With $\lambda_h$ assumed to be nonreal, Lemma 2.3 implies that so is $\beta_h$; hence $\beta_h\neq 0$. Applying (14) completes the proof.

Our next result formalizes the intuitive fact that self-confidence slows down motion. Self-assured agents are reluctant to change opinions.

Theorem 2.18.

The rotation number of a regular system cannot increase with $p$.

Proof 2.19.

We must have $|\vartheta|=1$. Let $\lambda_h$ be (an) eigenvalue corresponding to the unique angle in $\vartheta$; recall that $0<\theta_h<n/2$. As we replace $p$ by $p'>p$, we use the prime sign with all relevant quantities post-substitution. Thus, the subdominant eigenvalue for $p'$ associated with $\vartheta'$ is denoted by $\lambda_v'$; again, we assume that $|\vartheta'|=1$. Note that $v$ might not necessarily be equal to $h$; hence the case analysis:

  • $v=h$: Using the same notation for complex numbers and the points in the plane they represent (Fig. 5), we see that $\lambda_h'$ lies in (the relative interior of) the segment $1\lambda_h$; hence $\theta_h'<\theta_h$.

  • $v\neq h$: We prove that, as illustrated in Fig. 5, all three conditions $|\lambda_h|>|\lambda_v|$, $|\lambda_h'|<|\lambda_v'|$, and $\theta_h<\theta_v'\leq n/2$, cannot hold at the same time, which will establish the theorem. If we increase $q$ continuously from $p$ to $p'$, $\theta_h(q)$ decreases continuously. (We use the argument $q$ to denote the fact that $\theta_h$ corresponds to the eigenvalue defined with $p$ replaced by $q$.) Since, at the end of that motion, $|\lambda_h(q)|<|\lambda_v(q)|$, by continuity we have $p_o<p'$, where $p_o=\min\{q>p\,:\,|\lambda_h(q)|=|\lambda_v(q)|\}$. To simplify the notation, we repurpose the prime superscript to refer to $p_o$ (e.g., $p'=p_o$). So, we now have $|\lambda_h'|=|\lambda_v'|$ and $\theta_h<\theta_v'<\theta_v\leq n/2$. It follows that (i) the point $\lambda_v$ lies in the pie slice of radius $|\lambda_h|$ running counterclockwise from $\lambda_h$ to $-|\lambda_h|$ on the real axis. Also, because $|\lambda_h'|=|\lambda_v'|$ and $|\lambda_h|>|\lambda_v|$, setting $c=1$ as before in Lemma 2.9 shows that (ii) $\Re(\lambda_v)>\Re(\lambda_h)$. (The keen-eyed observer will notice that in the lemma we must plug in $(p_o-p)/(1-p)$ instead of $p$.) Putting (i) and (ii) together shows that $\theta_h\geq n/4$ (as shown in Fig. 5). Consequently, the slope of the segment $\lambda_h\lambda_v$ is negative. Since that segment is parallel to $\lambda_h'\lambda_v'$, the perpendicular bisector of the latter has positive slope. Since that bisector is above $\lambda_v'$ and $\Im(\lambda_v')\geq 0$, this implies that $0$ and $\lambda_h'$ are on opposite sides of that bisector; hence $|\lambda_v'|<|\lambda_h'|$, which is a contradiction.

Figure 5: Why self-confidence slows down the dynamics: proof by contradiction.

2.4 Equidistributed orbits

The attractor $\mathbb{S}$ is the Minkowski sum of a number of ellipses bounded by $|W|$. An agent orbits around an ellipse as it gets attracted to it exponentially fast. In a regular system with $\vartheta\neq\emptyset$, its limiting orbit is periodic if the unique angle $\theta_h$ of $\vartheta$ is rational; it is quasi-periodic otherwise. In fact, it then forms a dense subset of the ellipse. By (12), this follows from Weyl's ergodicity principle [23], which states that the set $\{\alpha t\ (\mathrm{mod}\ 1)\,|\,t\geq 0\}$ is uniformly distributed in $[0,1)$, for any irrational $\alpha$.

Dropping the regularity requirement may produce more exotic dynamics. We exhibit instances where a limiting orbit will not only be dense over the entire attracting set but, in fact, uniformly distributed. In other words, an agent will approach every possible opinion with equal frequency. This will occur when the following property holds (the coordinates of $a=(a_1,\ldots,a_k)$ are linearly independent over the rationals if $\mathbf{0}$ is the only rational vector normal to $a$):

Assumption 1.

The numbers in $\vartheta\cup\{1\}$ are linearly independent over the rationals.

We explain this phenomenon next. Order the angles of $\vartheta$ arbitrarily and define the vector $\alpha=(\alpha_1,\ldots,\alpha_s)\in\big[0,\frac{1}{2}\big]^s$, where $s=|\vartheta|$ and $\alpha_j=\theta/n$ for the $j$-th angle $\theta\in\vartheta$. We may assume that $c_v=d_v=\mathbf{0}$ in (12) since these cases are rotationally trivial. By Assumption 1, $\mathbf{0}$ is the only integer vector whose dot product with $\alpha$ is an integer. We use the standard notation $\|\alpha\|_{\mathbb{R}/\mathbb{Z}}=\max_{k\leq s}\min_{a\in\mathbb{Z}}|\alpha_k-a|$. By Kronecker's approximation theorem [7], for any $\beta\in[0,1]^s$ and any $\varepsilon>0$, there exists $q\in\mathbb{Z}$ such that $\|q\alpha-\beta\|_{\mathbb{R}/\mathbb{Z}}\leq\varepsilon$. It follows directly that, with high probability, any limiting orbit is dense over the attractor $\mathbb{S}$. We prove the stronger result:

Theorem 2.20.

Under Assumption 1, any limiting orbit is uniformly distributed over the attractor $\mathbb{S}$.

We mention that, in general, Assumption 1 might be difficult to verify analytically. Empirically, however, density is fairly easy to ascertain numerically with suitable visual evidence (Fig. 6).

Figure 6: Two examples where an agent approaches every point on its attractor with equal frequency. In each case, the curve traces the orbit of the agent.

We define the discrepancy $D(S_t)$ of $S_t=(p_1,\ldots,p_t)$, with $p_i\in\mathbb{R}^s$, as

$$D(S_t)=\sup_{B\in J}\Big|\,\frac{A(B;t)}{t}-\mu_s(B)\,\Big|,$$

where $\mu_s$ is the $s$-dimensional Lebesgue measure, $A(B;t)=|\{i\,|\,p_i\in B\}|$, and $J$ is the set of $s$-dimensional boxes of the form $\prod_{i=1}^s[a_i,b_i)\subset[0,1]^s$. The infinite sequence $S_\infty$ is said to be uniformly distributed if $D(S_t)$ tends to 0 as $t$ goes to infinity.
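The definition can be probed numerically. The sketch below estimates $D(S_t)$ for a Kronecker sequence $p_k=k\alpha\pmod 1$ with $s=2$, scanning only anchored boxes $[0,b_1)\times[0,b_2)$ on a grid; this is a rough proxy for the supremum over all of $J$, and the choice of $\alpha$ is ours (the coordinates $\sqrt{2}$ and $\sqrt{3}$ are rationally independent together with 1).

```python
import numpy as np

def discrepancy_estimate(alpha, t, grid=50):
    """Crude estimate of D(S_t) for p_k = k*alpha mod 1, using anchored boxes only."""
    pts = np.outer(np.arange(1, t + 1), alpha) % 1.0        # p_1, ..., p_t in [0,1)^2
    bs = np.linspace(0.0, 1.0, grid + 1)[1:]                # box corners b in (0,1]
    worst = 0.0
    for b1 in bs:
        for b2 in bs:
            inside = np.mean((pts[:, 0] < b1) & (pts[:, 1] < b2))
            worst = max(worst, abs(inside - b1 * b2))
    return worst

alpha = np.array([np.sqrt(2) % 1, np.sqrt(3) % 1])
for t in (100, 1000, 10000):
    print(t, discrepancy_estimate(alpha, t))                # should decrease toward 0
```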

Lemma 2.21.

(Erdős–Turán–Koksma [23], page 116.)   For any integer $L>0$,

$$D(S_t)\leq 2s^2 3^{s+1}\Big(\,\frac{1}{L}+\sum_{0<\|\ell\|_\infty\leq L}\frac{1}{r(\ell)}\,\Big|\frac{1}{t}\sum_{k=1}^t e^{2\pi i\langle\ell,p_k\rangle}\Big|\,\Big)\,,$$

where $r(\ell):=\prod_{j=1}^s\max\{1,|\ell_j|\}$ and $\ell=(\ell_1,\ldots,\ell_s)\in\mathbb{Z}^s$.

Proof 2.22 (Proof of Theorem 2.20).

We form the sequence $p_1,\ldots,p_t\in[0,1)^s$ such that $p_k=k\alpha\pmod 1$, where each coordinate of $k\alpha$ is replaced by its fractional part. By Lemma 2.21, its box discrepancy satisfies

$$D(S_t)\leq 2s^2 3^{s+1}\Big(\,\frac{1}{L}+\sum_{0<\|\ell\|_\infty\leq L}\frac{1}{r(\ell)}\,\Big|\frac{1}{t}\sum_{k=1}^t e^{2\pi i\langle\ell,k\alpha\rangle}\Big|\,\Big). \tag{15}$$

By Assumption 1, $\mathbf{0}$ is the only integer vector whose dot product with $\alpha$ is an integer; hence $\gamma_\ell:=e^{2\pi i\langle\ell,\alpha\rangle}\neq 1$, for any $\ell\neq\mathbf{0}$. It follows that

$$\Big|\sum_{k=1}^t e^{2\pi i\langle\ell,k\alpha\rangle}\Big|=\Big|\sum_{k=1}^t\gamma_\ell^k\Big|=\Big|\frac{\gamma_\ell-\gamma_\ell^{t+1}}{1-\gamma_\ell}\Big|\leq\frac{2}{|1-\gamma_\ell|}\,.$$

By (15), for any $\delta>0$,

$$D(S_t)\leq 2s^2 3^{s+1}\Big(\,\frac{1}{L}+\frac{1}{t}\sum_{0<\|\ell\|_\infty\leq L}\frac{2}{|1-\gamma_\ell|}\,\Big)\leq\delta,$$

for $L=\lceil 4s^2 3^{s+1}/\delta\rceil$ and $t\geq(8/\delta)\,s^2 3^{s+1}\sum_{0<\|\ell\|_\infty\leq L}|1-\gamma_\ell|^{-1}$.

2.5 Examples

We illustrate the range of contrarian opinion dynamics by considering a few specific examples for which calculations are feasible.

2.5.1 Fixed-point attractor

Set $m=2$ and $C=\{(1,0),(0,1),(-1,0),(0,-1)\}$. By Lemma 2.3, for any $v=(v_1,v_2)\in V$,

$$\lambda_v=p+\frac{1-p}{2}\Big(\cos\frac{2\pi v_1}{n}+\cos\frac{2\pi v_2}{n}\Big).$$

The eigenvalues are real and $\lambda=\max_{v\in V}\{|\lambda_v|<1\}=p+\frac{1}{2}(1-p)(1+\cos 2\pi/n)$. For any $h\in C$, $\lambda_h=\lambda$ and $\theta_h=0$; hence $C\subseteq W$. A simple examination shows that, in fact, $W=C$. By (9) and (12), given $j\in[d]$ (as usual, $[d]$ denotes $\{1,\ldots,d\}$),

$$y_v^*(t)_j=A_j\cos\frac{2\pi(v_1+\alpha_j)}{n}+B_j\cos\frac{2\pi(v_2+\beta_j)}{n}\,,$$

where $A_j,B_j,\alpha_j,\beta_j$ do not depend on $v$ but only on the initial position $x$. This produces a 2D surface in $\mathbb{R}^d$ formed by the Minkowski sum of two ellipses centered at the origin (Fig. 7).

Figure 7: The attractor on which each agent converges to a fixed point.

2.5.2 Periodic and quasi-periodic orbits

Set $m=2$ and $C=\{(1,0),(0,1)\}$. By Lemma 2.3, for any $v\in V$, $\lambda_v=p+\frac{1-p}{2}\big(\omega^{v_1}+\omega^{v_2}\big)$; hence $\lambda=\max_{v\in V}\{|\lambda_v|<1\}=\frac{1}{2}\big|1+p+(1-p)\omega\big|$ and $W=\{(1,0),(0,1),(-1,0),(0,-1)\}$. Specifically, $\lambda_v$ is equal to $\frac{1}{2}\big(1+p+(1-p)\omega\big)$, for $v\in\{(1,0),(0,1)\}$, and to its conjugate, for $v\in\{(-1,0),(0,-1)\}$. By (11) and (12), we have $\vartheta=\{\theta\}$, where

$$\theta=\Big(\frac{n}{2\pi}\Big)\arctan\left(\frac{(1-p)\sin 2\pi/n}{1+p+(1-p)\cos 2\pi/n}\right)\,,$$

and

$$y_v^*(t)=\Big(\,a_{v,\theta}\cos\frac{2\pi\theta t}{n}+b_{v,\theta}\sin\frac{2\pi\theta t}{n}\,\Big)x\,.$$

Fix a coordinate $j\in[d]$; we find that

$$y_v^*(t)_j=A_j\cos\frac{2\pi(\theta t+v_1+\alpha_j)}{n}+B_j\cos\frac{2\pi(\theta t+v_2+\beta_j)}{n}\,,$$

for suitable reals $A_j,B_j,\alpha_j,\beta_j$ that depend on the initial position $x$ but not on $v$. This again produces a two-dimensional attracting subset of $\mathbb{R}^d$ formed by the Minkowski sum of two ellipses. In the case of Figure 8, the attractor is a torus pinched along two curves. The main difference from the previous case comes from the limit behavior of the agents. They are not attracted to a fixed point but, rather, to a surface. With high probability, the orbits are asymptotically periodic if $\theta$ is rational, and quasi-periodic otherwise. For a case of the former, consider $p=0$, which gives us

$$\theta=\Big(\frac{n}{2\pi}\Big)\arctan\left(\frac{\sin 2\pi/n}{1+\cos 2\pi/n}\right)=\frac{1}{2}\,;$$

hence periodic orbits.

Figure 8: A periodic orbit on the left with the full attractor on the right.
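The closed form for $\theta$ above is easy to check against Lemma 2.3; the sketch below compares it with the positive angle returned by the spectrum() helper defined earlier (the chosen values of $p$ are arbitrary) and evaluates the $p\to 0$ limit.

```python
import numpy as np

def theta_closed_form(n, p):
    """The closed-form angle theta of Section 2.5.2, in the paper's units (powers of omega)."""
    return n / (2 * np.pi) * np.arctan((1 - p) * np.sin(2 * np.pi / n)
                                       / (1 + p + (1 - p) * np.cos(2 * np.pi / n)))

n = 29
for p in (0.25, 0.5, 0.9):
    _, _, W, theta = spectrum(n, 2, [(1, 0), (0, 1)], p)   # helper from the earlier sketch
    print(p, max(theta.values()), theta_closed_form(n, p)) # the two values should agree
print(theta_closed_form(n, 0.0))                           # the p -> 0 limit gives 1/2
```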

2.5.3 Equidistribution over the attractor

Put $m=2$ and $C=\{(1,0),(0,1),(2,3)\}$. We set $p=1/4$. For any $v\in V$, we have

$$\lambda_v=p+\frac{1-p}{3}\big(\omega^{v_1}+\omega^{v_2}+\omega^{2v_1+3v_2}\big).$$

We verified numerically that $W=\{(1,0),(1,-1),(-1,0),(-1,1)\}$ and $\vartheta=\{\theta_1,\theta_2\}$, where

$$\begin{cases}\;\theta_1=\big(\frac{n}{2\pi}\big)\arctan\left(\frac{\sin 2\pi/n+\sin 4\pi/n}{2+\cos 2\pi/n+\cos 4\pi/n}\right)\\[4pt] \;\theta_2=\big(\frac{n}{2\pi}\big)\arctan\left(\frac{-\sin 2\pi/n}{1+3\cos 2\pi/n}\right).\end{cases}$$

By (12),

$$y_v^*(t)=\sum_{k=1,2}\Big(\,a_{v,\theta_k}\cos\frac{2\pi\theta_k t}{n}+b_{v,\theta_k}\sin\frac{2\pi\theta_k t}{n}\,\Big)x\,.$$

Computer experimentation points to the linear independence of the numbers $1,\theta_1,\theta_2$ over the rationals. If so, then Assumption 1 from Section 2.4 holds and, by Theorem 2.20, any limiting orbit is uniformly distributed over the attractor $\mathbb{S}$ (Fig. 9).

Figure 9: A single agent’s orbit is uniformly distributed around its attractor.
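The numerical verification mentioned above is straightforward to reproduce with the spectrum() helper defined earlier; the value $n=29$ is an assumption on our part (the text leaves $n$ unspecified), and the small search for integer relations is only a heuristic consistency check, not a proof of Assumption 1.

```python
import itertools
import numpy as np

n, m, p = 29, 2, 0.25
_, _, W, theta = spectrum(n, m, [(1, 0), (0, 1), (2, 3)], p)
angles = sorted({round(th, 12) for th in theta.values() if th > 0})   # positive angles in vartheta
print("W =", sorted(W))
print("vartheta =", angles)

# Look for small integer relations a*theta_1 + b*theta_2 + c = 0; finding none is
# consistent with the conjectured rational independence of 1, theta_1, theta_2.
t1, t2 = angles
found = [(a, b, c) for a, b, c in itertools.product(range(-30, 31), repeat=3)
         if (a, b, c) != (0, 0, 0) and abs(a * t1 + b * t2 + c) < 1e-9]
print("small integer relations found:", found)
```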

3 Dynamic Social Networks

We define a mixed model of contrarian opinion dynamics. Let $\mathcal{M}=\{C_1,\ldots,C_s\}$ be a set of $s$ nonempty subsets, each one spanning the vector space $V$. At each time step $t$, we define the matrix $F_C$ by choosing, as convolution set $C$, a random, uniformly distributed element of $\mathcal{M}$. As before, we assume that $\mathbf{1}^{\top}x=0$. Let $\lambda_{j,v}$ be the eigenvalue of $F_{C_j}$ associated with $v\in V$. Given an infinite sequence $I_\infty$ of indices from $[s]$, we denote by $I_t=(k_1,\ldots,k_t)$ the first $t$ indices of $I_\infty$, and we write $\Lambda_v(I_t)=\prod_{k\in I_t}\lambda_{k,v}$. We generalize (4) into

$$x(t)=\frac{1}{N}\sum_{v\in V\setminus\{\mathbf{0}\}}\Lambda_v(I_t)\,\psi^v z_v\,, \tag{16}$$

where $z_v$ is the row vector $\sum_{u\in V}\omega^{-\langle v,u\rangle}x_u$.

3.1 Spectral decomposition

Write $\lambda_v^\times=\big|\prod_{j=1}^s\lambda_{j,v}\big|^{1/s}$ and $\lambda=\max_{v\in V\setminus\{\mathbf{0}\}}\lambda_v^\times$; because all the eigenvalues other than $\lambda_{j,\mathbf{0}}=1$ lie strictly inside the unit circle, we have $\lambda<1$. Without loss of generality, we can assume that $\lambda>0$. Indeed, suppose that $\lambda=0$; then, for every $v\in V\setminus\{\mathbf{0}\}$, there is $j=j(v)$ such that $\lambda_{j,v}=0$. This presents us with a “coupon collector's” scenario: with probability at most $N(1-1/s)^t\leq Ne^{-t/s}$, we have $\Lambda_v(I_t)\neq 0$ for at least one nonzero $v\in V$. In other words, with high probability, every coordinate of $x(t)$ in the eigenbasis will vanish after $O(s\log N)$ steps; hence $x(t)=0$ for all $t$ large enough. This case is of little interest, so we dismiss it and assume that $\lambda$ is positive. We redefine $W=\{v\in V\,|\,\lambda_v^\times=\lambda\}$. Let $W'=\{v\in V\,|\,\lambda_v^\times<\lambda\}$.
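The quantities $\lambda_v^\times$, $\lambda$, and the redefined $W$ can be computed directly from the eigenvalue formula of Lemma 2.3; the helper below (reused for the example of Figure 10 further down) is a sketch under that assumption.

```python
import itertools
import numpy as np

def mixed_W(n, m, sets, p, tol=1e-9):
    """Geometric-mean moduli lambda_v^x and the redefined W for a mixture of convolution sets."""
    omega = np.exp(2j * np.pi / n)
    V = [v for v in itertools.product(range(n), repeat=m) if any(v)]   # exclude the zero vector
    def lam(C, v):
        return p + (1 - p) / len(C) * sum(omega ** (np.dot(v, h) % n) for h in C)
    geo = {v: abs(np.prod([lam(C, v) for C in sets])) ** (1 / len(sets)) for v in V}
    lam_star = max(geo.values())
    return lam_star, [v for v in V if abs(geo[v] - lam_star) < tol]

# sanity check: a pure model (s = 1) recovers the W of Section 2.2
lam_star, W = mixed_W(29, 2, [[(1, 0), (0, 1)]], p=0.25)
print(lam_star, sorted(W))
```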

Lemma 3.1.

If $W'$ is nonempty, there exists $c<1$ such that, with high probability, for all $t$ large enough,

$$\max_{w'\in W'}|\Lambda_{w'}(I_t)|\leq c^t\min_{w\in W}|\Lambda_w(I_t)|.$$

Note that the high-probability event applies to all times $t$ larger than a fixed constant. The proof involves the comparison of two multiplicative random walks.

Proof 3.2.

Fix $w\in W$ and $w'\in W'$. We prove that $|\Lambda_{w'}(I_t)|\leq c^t|\Lambda_w(I_t)|$. If $\lambda_{w'}^\times=0$, then $\lambda_{j,w'}=0$, for some $j$. With high probability, the sequence $I_t$ includes the index $j$ at least once for any $t$ large enough; hence $|\Lambda_{w'}(I_t)|=0$ and the lemma holds. Assume now that $\lambda_{w'}^\times>0$; for all $j$, both $\lambda_{j,w}$ and $\lambda_{j,w'}$ are nonzero. Write $S_v(I_t)=\log\prod_{k\in I_t}|\lambda_{k,v}|$, for $v=w,w'$, and note that $S_v(I_t)=t\log\lambda_v^\times+\sum_{k\in I_t}\sigma_{k,v}$, where $\sigma_{k,v}=\log|\lambda_{k,v}|-\log\lambda_v^\times$. Let $\sigma=\max_{k,v}|\sigma_{k,v}|$. The random variables $\sigma_{k,v}$ are unbiased and mutually independent in $[-\sigma,\sigma]$. Classic deviation bounds [2] give us $\mathbb{P}\big[\,\big|\sum_{k\in I_t}\sigma_{k,v}\big|>b\,\big]<2e^{-b^2/(2t\sigma^2)}$. It follows that $\big|S_v(I_t)-t\log\lambda_v^\times\big|=O\big(\sigma\sqrt{t\ln(tN)}\,\big)$ with probability $1-a/(tN)^2$, for an arbitrarily small constant $a>0$. Since $\sum_{t>0}1/t^2=\pi^2/6$, it follows that, for arbitrarily small fixed $\varepsilon>0$ and all $t$ large enough, with probability at least $1-\varepsilon/N^2$,

$$\log\frac{|\Lambda_w(I_t)|}{|\Lambda_{w'}(I_t)|}=S_w(I_t)-S_{w'}(I_t)\geq t\log\frac{\lambda_w^\times}{\lambda_{w'}^\times}-O\big(\sigma\sqrt{t\log(tN)}\,\big)\geq\frac{t}{2}\log\frac{\lambda_w^\times}{\lambda_{w'}^\times}\,,$$

for any given $w\in W$ and $w'\in W'$. Setting $c=\max_{w\in W,\,w'\in W'}\sqrt{\lambda_{w'}^\times/\lambda_w^\times}$ and using a union bound completes the proof.

We define the scaled orbit $y(t)=x(t)/\lambda^t$. Reprising the argument from Theorem 2.6, we conclude from (16) that, with high probability, the limiting orbit is of the form

$$y^*(t)=\frac{1}{N}\sum_{h\in W}\Big(\prod_{k\in I_t}\frac{\lambda_{k,h}}{\lambda}\Big)\psi^h z_h=\frac{1}{N}\sum_{h\in W}\Big(\prod_{k\in I_t}\frac{|\lambda_{k,h}|}{\lambda}\Big)\,\omega^{\sum_{k\in I_t}\theta_{k,h}}\,\psi^h z_h,$$

where $\lambda_{k,h}:=|\lambda_{k,h}|\,\omega^{\theta_{k,h}}$. It follows that

$$y_v^*(t)=\frac{1}{N}\sum_{h\in W}\Big(\prod_{k\in I_t}\frac{|\lambda_{k,h}|}{\lambda}\Big)\,\omega^{\sum_{k\in I_t}\theta_{k,h}+\langle h,v\rangle}\sum_{u\in V}\omega^{-\langle h,u\rangle}x_u\,.$$

If we put $X_h=\frac{2\pi}{n}\big(\langle h,v\rangle+\sum_{k\in I_t}\theta_{k,h}\big)$, then, with $a_h$ and $b_h$ being the row vectors defined in Theorem 2.6,

$$y_v^*(t)=\frac{1}{N}\sum_{h\in W}\Big(\prod_{k\in I_t}\frac{|\lambda_{k,h}|}{\lambda}\Big)\Big((a_h x)\cos X_h+(b_h x)\sin X_h\Big). \tag{17}$$

3.2 Surprising attractors

Adding mixing to a model increases the entropy of the system. It is thus to be expected that the attractor of a mixed model should have higher dimensionality than its pure components. The surprise is that this need not be the case. We exhibit instances of contrarian opinion dynamics where mixing decreases the dimension of the attractor. To keep the notation simple, we consider two pure models $\mathcal{M}_1=\{C_1\}$, $\mathcal{M}_2=\{C_2\}$ alongside their mixture $\mathcal{M}_3=\{C_1,C_2\}$.

Theorem 3.3.

For any $k\in[m]$, there is a choice of $C_1$ and $C_2$ such that $\dim\mathcal{M}_3=k$ and $\dim\mathcal{M}_1=\dim\mathcal{M}_2=m$; in other words, the dimension of the mixture's attractor can be arbitrarily smaller than those of its pure components.

Proof 3.4.

We define $C_1=(e_1,\ldots,e_m)$ and $C_2=(e_1,\ldots,e_k,2e_{k+1},\ldots,2e_m)$, for any $k\in[m]$, where $e_i$ is the one-hot vector of $V$ whose $i$-th coordinate is 1 and all the others 0. Let $W_i$ denote the set $W$ corresponding to the system $\mathcal{M}_i$. We easily verify that $W_1=\pm C_1$ and $W_2=\pm\{e_1,\ldots,e_k,2^{-1}e_{k+1},\ldots,2^{-1}e_m\}$, where $2^{-1}$ is the inverse of 2 in the field $\mathbb{Z}/n\mathbb{Z}$. A vector $v\in W_i$ and its negative contribute to the same ellipse, so we have $\dim\mathcal{M}_1=\dim\mathcal{M}_2=m$. We note that $|\lambda_{k,h}|=\lambda=\big|1+\frac{1-p}{m}(\omega-1)\big|$, for $h\in W_1\cup W_2$; hence $\lambda_h^\times=\lambda$ for $h\in W_1\cap W_2$ and $\lambda_h^\times<\lambda$ for all other values of $h$. It follows that $\dim\mathcal{M}_3=k$.

Figure 10 illustrates Theorem 3.3. We have $m=2$ and $n=29$. The two convolution sets are $C_1=\{(1,0),(0,1)\}$ and $C_2=\{(1,0),(0,2)\}$. The initial positions are random and identical in all three cases.

Figure 10: The two attractors of the pure models i\mathcal{M}_{i} (i=1,2i=1,2) on the left, with the lower-dimensional attractor of the mixture on the right.
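Using the mixed_W() helper above, the example of Figure 10 can be checked numerically; the self-confidence weight $p=0.25$ is an assumed value, since the text does not specify it.

```python
n, m, p = 29, 2, 0.25
C1, C2 = [(1, 0), (0, 1)], [(1, 0), (0, 2)]
# Expected: each pure model has four subdominant directions (two ellipses, dimension 2),
# while the mixture keeps only the shared directions +-(1,0) (one ellipse, dimension 1).
for sets in ([C1], [C2], [C1, C2]):
    lam_star, W = mixed_W(n, m, sets, p)
    print(len(sets), "set(s):", sorted(W))
```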

We can generalize the mixed model by picking $C_1$ (resp. $C_2$) with probability $1-q$ (resp. $q$), where $0\leq q\leq 1$. For this, we redefine $\lambda_v^\times(q)=\big|\lambda_{1,v}^{1-q}\lambda_{2,v}^{q}\big|$ and $\lambda(q)=\max_{v\in V\setminus\{\mathbf{0}\}}\lambda_v^\times(q)$.

Theorem 3.5.

There is a choice of $C_1$ and $C_2$ such that $\dim\mathcal{M}_3>\dim\mathcal{M}_1=\dim\mathcal{M}_2=m$; in other words, the dimension of the mixture's attractor can be larger than those of its pure components.

Proof 3.6.

Borrowing the notation of the previous proof, we define $C_1=(e_1,\ldots,e_m)$ and $C_2=(2e_1,\ldots,2e_m)$ and verify that $W_1=\pm C_1$ and $W_2=\pm\{2^{-1}e_1,\ldots,2^{-1}e_m\}$; hence $\dim\mathcal{M}_1=\dim\mathcal{M}_2=m$. Assuming that $n>3$, we note that the sets $W_1$ and $W_2$ are disjoint. Regarding the mixed system, we have $W(q)=\{v\in V\,:\,\lambda_v^\times(q)=\lambda(q)\}$, where $W(0)=W_1$ and $W(1)=W_2$. Around $q=0$, we have, for all $v\in W(0)$,

$$\lambda_v^\times(q)=\Big|\,1+\frac{1-p}{m}(\omega-1)\,\Big|^{1-q}\times\Big|\,1+\frac{1-p}{m}(\omega^2-1)\,\Big|^{q}\,. \tag{18}$$

Since $W(0)\neq W(1)$, by continuity, there are $q\in(0,1)$ and $w\in W(q)\setminus W(0)$ such that $\lambda_w^\times(q)$ is equal to the right-hand side of (18). This implies that $W(q)\supseteq W(0)\cup\{w\}$, which completes the proof.

Figure 11 illustrates the theorem. We have $m=2$, $n=29$, and $p=0.9$. The two convolution sets are $C_1=\{(1,0),(0,1)\}$ and $C_2=\{(2,0),(0,2)\}$; the mixture probability is $q=0.0306$. The initial positions are random and identical in all three cases.

Figure 11: The two attractors of the pure models i\mathcal{M}_{i} (i=1,2i=1,2) on the left, with the higher-dimensional attractor of the mixture on the right.

References