Orbit decay in encounters between anisotropic spherical galaxies of equal mass

Lucas Saleh¹, Joshua E. Barnes²
¹Department of Physics and Astronomy, University of Hawaii at Manoa; [email protected]
²Institute for Astronomy, University of Hawaii at Manoa; [email protected]

(Accepted 2023 November 21. Received 2023 November 16; in original form 2023 August 7)

Abstract

We investigate the effect of radial anisotropy on the rate of orbit decay in parabolic encounters of identical spherical galaxies. Our galaxy models have Hernquist density profiles and Osipkov–Merritt velocity distributions. We find that radially anisotropic models merge in as little as half the time of their isotropic counterparts. Anisotropic models are more susceptible to tidal deformation; this accelerates the transfer of orbital angular momentum to internal degrees of freedom. Even during the initial approach, the anisotropic models become more distorted, and arrive at pericentre already having lost substantial amounts of angular momentum. Our results may have implications for estimates of merger rates and persistence of tidal tails.

keywords:

galaxies: evolution – galaxies: interactions – instabilities – methods: numerical

^†^†pubyear: 2023^†^†pagerange: Orbit decay in encounters between anisotropic spherical galaxies of equal mass–A

1 INTRODUCTION

Galaxy encounters are remarkably inelastic. Two equally-massive galaxies falling together on an interpenetrating parabolic orbit are left on a much smaller and more tightly-bound orbit after their first passage, and typically merge soon after their next approach. This was already clear in early numerical experiments (e.g., van Albada & van Gorkom 1977; White 1978) and has been affirmed many times since. Orbit decay is driven by the transfer of energy and angular momentum from the relative motion of the galaxies to internal motions within each galaxy (Binney & Tremaine, 2008, Ch. 8).

Early studies indicated that the internal velocity structure of interacting galaxies influences the rate of orbit decay. Working with spherical systems, White (1979) found that merging is faster if galaxies spin in the same direction as they orbit one another, and slower if they spin in a retrograde direction. This arises because particles on direct orbits experience a ‘broad resonance’ (Toomre & Toomre, 1972) with the relative motion of the two galaxies, yielding a stronger tidal response. While the role of rotation has been highlighted in subsequent studies of disc-galaxy mergers, little attention seems to have been paid to other options for velocity structure. White (1979) included non-rotating models with both isotropic and circular velocity distributions, and noted that the latter produced merger remnants with more compact cores, but remarked that ‘[t]he strength of interaction does not, however, seem to depend on the initial velocity structure.’

Subsequent simulations of encounters between spherical galaxies have explored a wide range of choices for mass ratios, initial orbits, and initial density profiles (e.g., Villumsen 1982, 1983; Vergne & Muzzio 1995; Makino & Hut 1997; Fulton & Barnes 2001; Boylan-Kolchin, Ma, & Quataert 2008; Drakos et al. 2019). These studies have been somewhat overshadowed by simulations of disc galaxy encounters, which have become increasingly realistic as they attempt to reproduce observed interacting systems and model the formation of elliptical galaxies by mergers. Yet even in disc simulations, most of the mass is invested in a roughly spherical dark halo. These haloes are usually set up with isotropic velocity distributions, although some workers (e.g., Cox et al. 2006, Springel & White 1999) added modest amounts of halo rotation.

A handful of studies have incorporated radial or tangential anisotropy in simulations of galaxy interactions. McMillan et al. (2007) examined encounters of equal-mass disc galaxies embedded in anisotropic haloes, and reported that initial anisotropy had very little effect on the haloes of merger remnants. When simulating interactions between large galaxies and small satellites, Arena & Bertin (2007) found that satellites orbiting radially anisotropic primaries merged in ‘somewhat shorter’ times, while Vasiliev et al. (2022) noted that radial (tangential) anisotropy caused satellite orbits to become more (less) elongated over time. Rozier et al. (2022) and Vasiliev (2024) included halo anisotropy in studies of the Milky Way–LMC system. Both found that radially anisotropic halos respond more strongly to the LMC’s gravitational field; the latter also reported that the present position and velocity of the LMC are consistent with an initial orbit with higher angular momentum if the dark halo is radially anisotropic.

In this paper we return to the study of encounters between identical spherical systems, and explore the influence of radial anisotropy on orbit decay. We focus on spherical systems for simplicity, and because there are straightforward, accurate ways to generate radially anisotropic models of such systems. Our immediate goal is to clarify the physics of orbit decay, rather than to model real galaxies. However, we note that inasmuch as dark haloes form partly by radial collapse, they are likely to have velocity distributions with preferentially radial orbits.

Our galaxy models, simulation methodology, and parameter choices are described in § 2. In § 3 we present results for simulated encounters and simplified models that explore the response to tidal perturbations. § 4 presents our conclusions. Appendix A discusses simulation uncertainties.

2 METHODS

In concept, our experimental approach is very simple. We construct spherical Hernquist (1990) models with anisotropic velocity distributions following Osipkov (1979) and Merritt (1985). Two such models are launched on a parabolic collision trajectory, and their evolution is followed until they have merged. We vary the amount of anisotropy while holding other parameters fixed, and use the time between first and second passages to measure the timescale of orbit decay.

To implement this approach, we must make a number of approximations. In particular, (1) the model density profile is exponentially tapered toward zero at a finite radius, (2) dynamical evolution is simulated using an N-body method with finite particle number, (3) gravitational interactions are ‘softened’ at small separations, (4) forces are evaluated using a hierarchical tree code which trades accuracy for speed, (5) particle trajectories are integrated using a finite time-step, and (6) the models are launched from a finite separation. We discuss each of these approximations below and estimate their effects on our results.

2.1 Mass Distribution and Gravitational Potential

We use the Hernquist (1990) density profile to model a simple, spherically symmetric galaxy. This profile is fully characterized by two parameters, the mass $M$ and scale radius $a$ . The density is

\rho_{\mathrm{H}}(r)=\frac{1}{2\pi}\,\frac{Ma}{r(r+a)^{3}}\,.

(1)

The corresponding gravitational potential is

\Phi_{\mathrm{H}}(r)=-\,\frac{GM}{r+a}\,.

(2)

where $G$ is the gravitational constant.

For simplicity, we use units in which $G=M=a=1$ . This entails no loss of generality, as the resulting models can be rescaled to any desired system of units. In these units, the half-mass radius is $r_{\mathrm{h}}=1+\sqrt{2}$ . A circular orbit at this radius has period $P_{\mathrm{h}}\simeq 33.332$ ; this is a characteristic timescale for dynamical evolution.

At large radii, $\rho_{\mathrm{H}}(r)$ falls off as $\rho\propto r^{-4}$ . When this density profile is sampled with $N$ particles, the outermost one will have radius $r\sim aN$ . Such extreme outliers are numerically inconvenient, so we use a general tapering function (Barnes, 2012, eq. 16) to smoothly taper the density profile, starting at a radius $b=100a$ . This effectively limits the outermost particle to a radius of roughly $500a$ . To keep the total mass of the model fixed, matter at large radii is redistributed within $r<b$ ; this increases the density within radius $b$ by just under $1$ per cent. In what follows, $\rho(r)$ is the tapered density profile.

Gravitational interactions between particles are softened at short range using the usual Plummer formula: at a distance $R$ , the potential of a particle is proportional to $(R^{2}+\epsilon^{2})^{-1/2}$ , where $\epsilon$ is a small ‘softening length’. This biases the overall gravitational potential, so $\Phi_{\mathrm{H}}(r)$ is no longer accurate. Since we need accurate potentials to compute velocity distributions (§ 2.2), the tapered density profile $\rho(r)$ is numerically convolved with a Plummer kernel (Barnes, 2012, eq. 6), and the result is used to integrate the potential as a function of radius. In what follows, $\Phi(r)$ is the resulting potential.

2.2 Velocity Distribution

We use the Osipkov–Merritt method to construct radially anisotropic DFs (distribution functions) for our models (Osipkov, 1979; Merritt, 1985). For a spherical isotropic system, the DF is $f(\mathbf{r},\mathbf{v})=f(E)$ , where $E\equiv\frac{1}{2}|\mathbf{v}|^{2}+\Phi(|\mathbf{r}|)$ is the specific binding energy. The Osipkov–Merritt DF is $f_{\mathrm{a}}(\mathbf{r},\mathbf{v})=f_{\mathrm{a}}(Q)$ , where $Q\equiv E+\frac{1}{2}J^{2}/r_{a}^{2}$ ; here, $J$ is the specific angular momentum and $r_{\mathrm{a}}$ is the anisotropy radius, a parameter which controls the degree of anisotropy. Resolving the velocity into a radial component $v_{\mathrm{r}}$ and a tangential component $v_{\mathrm{t}}$ , we obtain

Q=\frac{1}{2}v_{\mathrm{r}}^{2}+\frac{1}{2}\left(1+\frac{r^{2}}{r_{\mathrm{a}}^{2}}\right)v_{\mathrm{t}}^{2}+\Phi(|\mathbf{r}|)\,.

(3)

The velocity distribution at any radius is stratified on prolate spheroids. For $r\ll r_{\mathrm{a}}$ , the quantity $Q\simeq E$ and the velocity distribution is approximately isotropic. Conversely, for $r\gg r_{\mathrm{a}}$ , transverse velocities are weighted more than radial velocities, so the velocity distribution is preferentially radial. The anisotropy at radius $r$ is quantified by

\beta(r)\equiv 1-\frac{\langle v_{t}^{2}\rangle}{\langle 2v_{r}^{2}\rangle}=\frac{r^{2}}{r^{2}+r_{\mathrm{a}}^{2}}\,,

(4)

where the final equality is specific to Osipkov–Merritt DFs. Isotropic distributions have $\beta=0$ , while radially anisotropic distributions have $0<\beta\leq 1$ .

To compute the DF, given a density $\rho(r)$ , potential $\Phi(r)$ , and anisotropy radius $r_{\mathrm{a}}$ , we first set

\tilde{\rho}(r)=\left(1+\frac{r^{2}}{r_{\mathrm{a}}^{2}}\right)\rho(r)\,.

(5)

Since the potential is a monotonic function of radius, $\tilde{\rho}$ can be considered a function of $\Phi$ . The DF is then given by

f_{\mathrm{a}}(Q)=\frac{1}{\sqrt{8}\pi^{2}}\,\frac{\mathrm{d}}{\mathrm{d}Q}\int_{Q}^{0}\frac{1}{\sqrt{\Phi-Q}}\,\frac{\mathrm{d}\tilde{\rho}}{\mathrm{d}\Phi}\,\mathrm{d}\Phi\,.

(6)

In practice, these steps must be performed numerically, since the potential $\Phi(r)$ is not available analytically. We use spline interpolation to invert $\Phi(r)$ and compute $\mathrm{d}\tilde{\rho}/\mathrm{d}\Phi$ . An adaptive routine is used to compute the integral in equation (6); the result is tabulated as a function of $Q$ , and spline interpolation is used for the final differentiation. When given $\rho_{\mathrm{H}}(r)$ and $\Phi_{\mathrm{H}}(r)$ as inputs, our numerical procedure reproduces the corresponding Osipkov–Merritt DF (Hernquist, 1990) with relative errors $<0.1$ per cent.

2.3 N-body Simulations

We use self-consistent N-body simulations to follow the interactions and mergers of our anisotropic models. This approach represents the DF as a collection of $N$ particles; particle $i=1,\dots,N$ has mass $m_{i}=M/N$ , position $\mathbf{r}_{i}$ , and velocity $\mathbf{v}_{i}$ . Each particle obeys Newton’s laws of motion in the gravitational field generated by all other particles.

Particles are initialized so that the probability of finding one with coordinates $(\mathbf{r}_{i},\mathbf{v}_{i})$ is proportional to the value of $f_{\mathrm{a}}(\mathbf{r}_{i},\mathbf{v}_{i})$ . Each particle is initialized independently as follows. First, the particle’s radius $r_{i}$ is chosen by generating a random number $M_{i}$ uniformly distributed in the range $[0,M)$ , and then solving $M_{i}=M(r_{i})$ , where $M(r^{\prime})=\int_{0}^{r^{\prime}}4\pi r^{2}\rho(r)dr$ is the cumulative mass profile. We then generate a random unit vector $\hat{\mathbf{u}}_{i}$ and set the particle position $\mathbf{r}_{i}=r_{i}\,\hat{\mathbf{u}}_{i}$ . Second, at radius $r_{i}$ , we use rejection sampling to pick a speed $v^{\prime}_{i}$ from the speed distribution $v^{2}f_{\mathrm{a}}(\Phi(r_{i})+\frac{1}{2}v^{2})$ . Another random unit vector $\hat{\mathbf{u}}^{\prime}_{i}$ is generated, and we set $\mathbf{v}^{\prime}_{i}=v^{\prime}_{i}\,\hat{\mathbf{u}}^{\prime}_{i}$ . Finally, the tangential components of $\mathbf{v}^{\prime}_{i}$ are rescaled by a factor of $(1+r_{i}^{2}/r_{\mathrm{a}}^{2})^{-1/2}$ to yield the particle velocity $\mathbf{v}_{i}$ .

The first and second steps of this procedure, which suffice to generate isotropic N-body realizations, were extensively tested by Barnes (2012). The final step, which transforms an isotropic (spherical) velocity distribution into the spheroidal distribution required by the Osipkov–Merritt formalism, is new. Stability tests, to be described in the next section, confirm that our initial conditions are very close to equilibrium.

In our simulations we use $N=65536=2^{16}$ particles for each spherical model. Power-of-two values for $N$ ensure that the particle mass $m_{i}$ , and integer multiples thereof, are exactly represented as floating point numbers. Our specific choice for $N$ represents a compromise between sampling accuracy and computing resources. Since particles are initialized independently, integrated quantities will have Monte-Carlo uncertainties of order $N^{-1/2}$ , or $\sim 0.4$ per cent for our value of $N$ . ‘Quiet-start’ procedures which suppress Monte-Carlo fluctuations are available for disc simulations (e.g., Debattista & Sellwood, 2000), but not for a general $3$ -D problem like the encounters we study here. We therefore ran multiple realizations of each experiment with different random seeds to initialize the particle coordinates, and used run-to-run variation to estimate the uncertainty due to sampling.

Gravitational forces are computed using a tree algorithm (Barnes & Hut, 1986). In this algorithm, the force on each particle is approximated by a sum of $O(\log N)$ terms, which represent nearby particles and a hierarchy of cubical cells at larger distances. To be included in this hierarchy, a cell at a distance $R$ from a given particle must satisfy $d<\theta R$ , where $\theta$ is a fixed parameter which controls the trade-off between speed and accuracy. We set $\theta=0.75$ and expand the gravitational field of each cell to quadrupole order. With these choices, the median acceleration error is $\sim 0.03$ per cent.

As noted above, we adopt Plummer softening to limit gravitational accelerations due to short-range interactions. We use a softening length of $\epsilon=1/64$ , which is small enough to resolve the inner $\rho\propto r^{-1}$ slope of the Hernquist model (Barnes, 2012). Compared to the original potential (equation 2), this softening biases the central potential upward by $\sim 3$ per cent. However, at radius $r=a$ , the potential bias is only $\sim 0.1$ per cent, and at larger radii it’s even smaller. Since the models in our simulated encounters are typically separated by distances greater that $a$ , softening is unlikely to have any significant impact on our results.

We use a time-centred leapfrog with a uniform time-step to compute particle trajectories. This simple scheme is adequate for our purposes since softening limits the rate of change of particle accelerations. Our time-step is $\delta t=1/64=2^{-6}$ (again, a power-of-two value guarantees that times have exact representations). With this $\delta t$ , our simulations conserve energy and angular momentum to better than $0.1$ per cent.

All our calculations were run on $4$ GHz Intel processors, using single-precision arithmetic. A typical encounter simulation, following $131072$ particles for $500$ time units, took $\sim 58$ hours of CPU time.

2.4 Stability

Spherical systems with predominantly radial orbits are often unstable, spontaneously evolving into roughly prolate configurations (Antonov 1973; Merritt & Aguilar 1985; Barnes et al. 1986). While it’s certainly possible to simulate encounters of unstable systems, it’s not easy to motivate such experiments as analogs of encounters between real galaxies; to the extent that galaxies can be described as equilibrium systems, we expect them be stable or nearly so. Thus our initial simulations were designed to determine the maximum amount of anisotropy our models can bear without becoming noticeably unstable.

In addition to visually inspecting individual simulations, we quantified the effects of the radial orbit instability as follows. After generating each model realization (§ 2.3), all $N$ particles are sorted by binding energy $E$ ; this ordering will be largely preserved as the system evolves. Sectioning the particle array into $8$ equal segments thus defines $8$ Lagrangian volumes or ‘shells’ ordered by binding energy. The shape of shell $j$ is measured by computing the minor-to-major axis ratio $(c/a)_{j}$ from eigenvalues of the shell’s moment of inertia tensor. The most tightly-bound shells are relatively poor indicators of instability, remaining nearly spherical even when more outlying shells are clearly distorted. We therefore used the $7^{\mathrm{th}}$ shell (i.e., the $75^{\mathrm{th}}$ to $87.5^{\mathrm{th}}$ percentile), which primarily samples the mass distribution at radii between $\sim 5a$ and $\sim 14a$ , to track the development of the radial-orbit instability. (The $8^{\mathrm{th}}$ shell was not useful, as it includes a few very distant particles which individually bias the moment of inertia tensor; these particles have orbital periods much longer than the duration of our simulations.)

A trial run with $r_{\mathrm{a}}=1$ indicated that this model is unstable. We therefore tested anisotropy radii $r_{\mathrm{a}}=2^{k/8}$ where $k>0$ is an integer. This scheme samples $r_{\mathrm{a}}$ finely for small $k$ and more coarsely for larger $k$ . Single realizations with $k=1,\dots,8$ were run using the N-body parameters given in § 2.3. Models with $k\geq 4$ (i.e., $r_{\mathrm{a}}\geq 1.414$ ) appeared stable, with $(c/a)_{7}$ values fluctuating between $\sim 0.95$ and unity. In contrast, models with $k\leq 2$ (i.e., $r_{\mathrm{a}}\leq 1.189$ ) clearly evolved towards more non-spherical shapes, typically falling below $(c/a)_{7}<0.85$ after $500$ time units ( $\sim 15$ orbit periods at the half-mass radius).

Refer to caption — Figure 1: Evolution of $(c/a)_{7}$ , the minor-to-major axial ratio of shell $7$ , for models with various anisotropy radii $r_{\mathrm{a}}$ . Four independent realizations are shown for each choice of $r_{\mathrm{a}}$ .

To better constrain the boundary of stability, we ran another three realizations of each model with $k=1,\dots,4$ . Fig. 1 presents the results. The additional runs confirm that models with $k\leq 2$ are unstable, while those with $k\geq 4$ appear to be stable. As a group, the models with $k=3$ (i.e., $r_{\mathrm{a}}=1.297$ ) typically maintain axial ratios $(c/a)_{7}\simeq 0.95$ . Individual realizations fluctuate about this value, but none of the four realizations displays the systematically decreasing $(c/a)_{7}$ ratios characteristic of the unstable models.

After completing our tests, we learned that Meza & Zamorano (1997) and Buyle et al. (2007) also studied the stability of anisotropic Hernquist models with Osipkov–Merrit DFs. Meza & Zamorano estimated the stability boundary at $r_{\mathrm{a}}\simeq 1.0$ , while Buyle et al. placed it at $r_{\mathrm{a}}\simeq 1.1$ . Our results indicate that both of these models are unstable. The reason for this discrepancy may lie in the method used to detect the instability. Both of the earlier studies quantified the shapes of their simulations using the inner $\sim 70$ per cent of the mass distribution. Measuring shapes at smaller radii may have led these studies to overlook weak instabilities, which manifest most strongly at large radii.

We adopt the $r_{\mathrm{a}}=1.297$ model as the most anisotropic which is dynamically stable enough to be useful in this work. We cannot preclude the possibility that this model is unstable on timescales longer than the duration of our experiments; nor can we preclude an instability which manifests only at extremely large radii. Rather, we argue that such weak instabilities would have little practical consequence for our experiments. In support of this, we note that the merging behavior of our ‘maximally anisotropic’ model is continuous with the behavior of less anisotropic models.

2.4.1 Initial equilibria

As mentioned above, the stability calculations can also be analysed to check our procedure for initializing particle coordinates. Errors in generating equilibrium initial conditions generally manifest on a dynamical timescale. We therefore looked for abrupt evolution in scatter-plots of particle radii $r$ vs. radial velocity $v_{r}$ or tangential velocity $v_{t}$ , in the ratio of velocity dispersions $\langle v_{t}^{2}\rangle/\langle 2v_{r}^{2}\rangle$ for particles in nested spherical shells, and in radial mass profiles. No changes on a dynamical timescale were observed, implying our initial conditions, whether stable or not, are very close to equilibrium. In stable anisotropic models, we did notice modest evolution of the ratio $\langle v_{t}^{2}\rangle/\langle 2v_{r}^{2}\rangle$ , consistent with two-body relaxation slowly driving this quantity toward its isotropic value of unity. However, the duration of our simulations is too short to accurately characterize this effect.

2.5 Encounters

Our experiments mimic encounters of galaxies falling together on asymptotically parabolic trajectories. Initial positions and velocities are obtained from an idealized parabolic orbit, with pericentric separation $r_{\mathrm{p}}$ , of two point-like bodies each of mass $M=1$ . We adopt a coordinate system in which these bodies move clockwise in the $(x,y)$ plane. At pericentre, which defines time $t=0$ , they are symmetrically placed on the $x$ -axis, with positions $x=\pm\textstyle\frac{1}{2}r_{\mathrm{p}}$ and velocities $v_{y}=\mp\sqrt{GM/r_{\mathrm{p}}}$ . We then track the bodies back to a chosen starting time $t_{0}<0$ , yielding their initial separation vector $\mathbf{r}_{0}$ and relative velocity vector $\mathbf{v}_{0}$ .

We next replace these point-like bodies with N-body realizations of the anisotropic DF $f_{\mathrm{a}}(\mathbf{r},\mathbf{v})$ for a given anisotropy radius $r_{\mathrm{a}}$ . A different random number seed is used for each realization. These realizations are then translated by $\pm\frac{1}{2}\mathbf{r}_{0}$ in position, and $\pm\frac{1}{2}\mathbf{v}_{0}$ in velocity, and superimposed. Thus, the initial conditions realize a system with the DF

f_{\mathrm{sys}}=f_{\mathrm{a}}(\mathbf{r}-\textstyle\frac{1}{2}\mathbf{r}_{0},\mathbf{v}-\textstyle\frac{1}{2}\mathbf{v}_{0})+f_{\mathrm{a}}(\mathbf{r}+\textstyle\frac{1}{2}\mathbf{r}_{0},\mathbf{v}+\textstyle\frac{1}{2}\mathbf{v}_{0})\,.

(7)

Note that in constructing our initial conditions, we have not subtracted centre-of-mass positions and velocities from the individual galaxy realizations. As noted in § 2.3, bulk quantities have uncertainties due to Monte-Carlo statistics; as a result, each galaxy realization has an initial offset in position and velocity, of order $O(N^{-1/2})$ , with respect to its assigned coordinates. While these offsets are small, they do have measurable consequences, as we show in Appendix A.

Table 1: Encounter simulations. Primary parameters are the pericentric separation of the initial orbit,

r_{\mathrm{p}}

, and the Osipkov–Merritt anisotropy radius,

r_{\mathrm{a}}

(where

r_{\mathrm{a}}=\infty

indicates an isotropic model). The starting time is

t_{0}

. We ran

N_{\mathrm{real}}

realizations of each encounter.

$r_{\mathrm{p}}$	$r_{\mathrm{a}}$	$t_{0}$	$N_{\mathrm{real}}$
1.0	1.297	-200	8
	1.297	-400	8
	1.414	-200	8
	2.0	-200	4
	4.0	-200	4
	8.0	-200	4
	$\infty$	-200	8
	$\infty$	-400	8
2.0	1.297	-200	8
	1.297	-400	8
	1.414	-200	4
	2.0	-200	4
	4.0	-200	4
	8.0	-200	4
	$\infty$	-200	8
	$\infty$	-400	8
4.0	1.297	-200	8
	1.297	-400	8
	1.414	-200	4
	2.0	-200	4
	4.0	-200	4
	8.0	-200	4
	$\infty$	-200	8
	$\infty$	-400	8

The choice of starting time requires some attention. To be entirely consistent with the chosen two-body orbit, the galaxies should ideally start so far apart that they initially interact like disjoint spherical systems. This isn’t possible for models with extended outer envelopes; some overlap has to be tolerated. We quantified the degree of overlap using

\delta U=U_{\mathrm{tot}}-(U_{1}+U_{2}-GM^{2}/r_{0})\,,

(8)

where $U_{\mathrm{tot}}$ is the total potential energy of the initial configuration, and $U_{1}$ and $U_{2}$ are the internal potential energies of the two models. Separate spherical systems yield $\delta U=0$ ; overlap correlates with $\delta U>0$ .

Our experiments start $t_{0}=-200$ time units before pericentre, unless another time is explicitly stated. This places the models $r_{0}\simeq 69$ length units apart. For this configuration, the resulting $\delta U$ was $\sim 1.3$ per cent of $GM^{2}/r_{0}$ , which seemed a bit large but acceptable inasmuch as the same bias applies to all models with the same $t_{0}$ and $r_{\mathrm{p}}$ . To check, we re-ran some key experiments starting at $t_{0}=-400$ . This initially placed the models $\sim 111$ length units apart; the resulting $\delta U$ was $\sim 0.42$ per cent of $GM^{2}/r_{0}$ .

Table 1 lists parameters for our encounter simulations. We ran at least four realizations of each encounter, and an additional four for a subset. The extra realizations were run to test both choices for $t_{0}$ at the extremes for $r_{\mathrm{p}}$ and $r_{\mathrm{a}}$ , and to anchor our results by accurately characterizing the most extreme cases.

2.5.1 Orbital trajectories

To follow the process of orbit decay, we extract trajectories from the encounter simulations. Our method relies on the fact that particles which are initially tightly bound remain tightly bound as the system evolves. As noted in § 2.4, the particles in each realization are initially sorted in binding energy $E$ , with the most tightly bound particles first. At each time-step, we can therefore estimate the position and velocity of each realization by averaging over the coordinates of its first $N_{\mathrm{c}}$ particles. Our choice of $N_{\mathrm{c}}\,=4096$ represents a trade-off between statistical and systematic uncertainties; larger values would yield smoother trajectories, but loosely-bound particles could be detached from their centres following a close passage, potentially biasing the measured coordinates at later times. Mean and median positions for representative samples of $N_{\mathrm{c}}=4096$ tightly-bound particles, taken from a large ensemble of simulations, typically agree to $\sim 0.005$ length units ( $\sim 30$ per cent of the softening length, $\epsilon$ ), indicating that detached outliers have little effect. Key results are also robust against halving or doubling $N_{\mathrm{c}}$ .

Panel (a) of Fig. 2 shows how this works, using an encounter of two isotropic models with $r_{\mathrm{p}}=1$ . Here, the grey points are the most tightly bound $N_{\mathrm{c}}$ particles in each realization, shown $t=20$ time units after pericentre. These particle distributions are still symmetric and well-localized about the centres of their respective realizations. The solid curves are the trajectories extracted by averaging the positions of these particles at each time-step, while the dashed curves are the idealized parabolic orbits used to set up the initial conditions. All of our encounter simulations unfold in a broadly similar fashion, with the two models approximating idealized parabolic trajectories up to first passage, reaching apocentre shortly thereafter, and subsequently falling back for a second and much closer passage.

Panel (b) shows four simulations, all sampling the same initial DF as in panel (a), but initialized using different random number seeds. Here we plot a single relative trajectory for each simulation, fixing the other model at the origin; this reduces visual clutter at the cost of suppressing the slight asymmetries between individual trajectories seen in the top panel. The variation from run-to-run is due to Monte-Carlo sampling of the initial conditions; to narrow the range, we would need to increase the number of particles, $N$ .

Panel (c) plots separations as functions of time for the same ensemble of simulations. We note that the actual time of first pericentre (hereafter $t_{\mathrm{p1}}$ ) is delayed relative to the pericentre of the idealized parabolic trajectory, which occurs at $t_{\mathrm{p}}=0$ . Moreover, the actual pericentric separation (hereafter $r_{\mathrm{p1}}$ ) is larger than the idealized separation of $r_{\mathrm{p}}=1$ . Conservation of orbital energy and and angular momentum predicts differences of this kind for an interpenetrating encounter of two rigid spheres. As we will see, the actual dynamics are more complex; tides deform the models even during their initial approach, transferring energy and angular momentum to internal degrees of freedom.

Panel (c) also shows that each encounter has a well-defined second pericentre at time $t_{\mathrm{p2}}$ , and merges shortly thereafter. We will use $\Delta t=t_{\mathrm{p2}}-t_{\mathrm{p1}}$ to measure the timescale for orbit decay.

3 RESULTS

Fig. 3 compares encounters of anisotropic models (top row; $r_{\mathrm{a}}=1.297$ ) with equivalent encounters of isotropic models (bottom row). These models were all launched with the same initial positions and velocities, taken from a parabolic encounter of two point masses with pericentric separation $r_{\mathrm{p}}=1$ . Each frame is a stack of eight independent realizations, with a single contour of projected particle density tracing the shapes of the participants. This figure demonstrates that anisotropy has a significant effect on orbit decay.

The frames on the left depict the models during their approach. Some effects of anisotropy are already visible at these early stages, especially at time $t=-20$ , where the contour shows that the anisotropic models are already tidally distorted, while their isotropic counterparts remain nearly spherical. The middle frames, at $t=0$ , show both pairs of models deeply interpenetrating just before first passage; while these images are superficially similar, the anisotropic models continue to show marked distortion along the axis of their initial approach. After first passage, the effects of anisotropy become more dramatic: at $t=20$ , the isotropic models are loitering near apocentre, while the anisotropic models are well past apocentre and plunging toward their second passage. In the final frames, at $t=40$ , the anisotropic models have already merged, while the isotropic models are just beginning their second encounter.

3.1 Orbital Evolution

Fig. 4 compares trajectories from encounters between anisotropic models with their isotropic counterparts. Here, the colored curves represent models with maximal anisotropy $r_{\mathrm{a}}=1.297$ , while the light grey curves represent isotropic models. Each panel shows results for a different choice of idealized pericentric separation $r_{\mathrm{p}}$ . Across the entire set of encounters, it’s evident that anisotropy has a strong effect on orbit decay; this effect is much larger than the run-to-run variation between models in each ensemble. The anisotropic models all attain much smaller separations after first passage, and their subsequent trajectories are more nearly radial than those of their isotropic counterparts. Indeed, the anisotropic encounters with $r_{\mathrm{p}}=1$ come to a near-standstill at apocentre, and fall back together on nearly linear trajectories. This is an example of ‘radialization’ – the tendency for orbits to become more eccentric as they decay. Radialization in unequal-mass encounters is discussed by Amorisco (2017) and Vasiliev et al. (2022); the latter study showed that radial anisotropy in the host accelerates the radialization of satellite orbits. Barnes (2016) illustrates an extreme example of radialization, in which two equal-mass galaxies with isotropic halos reverse their sense of orbital motion after first passage; passages closer than those considered here would likely exhibit this reversal as well.

Careful inspection of Fig. 4 reveals a curious trend; during the initial approach, the trajectories of the anisotropic models lie inside the idealized parabolic orbits, while those of the isotropic models generally fall on the outside. This is not the result of run-to-run variations or a problem with the initial conditions; orbit decay is modifying the trajectories of these models even before first passage.

Fig. 5 shows the same simulations, with separations plotted as functions of time. This figure again shows that the anisotropic simulations have much smaller orbits after the first passage. As one might expect, the encounters between highly anisotropic systems also reach apocentre and fall back to a second pericentre much faster than their isotropic counterparts.

We focus on first pericentre passages in Fig. 6, which plots relative velocities $v_{\mathrm{p1}}$ vs. separations $r_{\mathrm{p1}}$ for all three values of $r_{\mathrm{p}}$ . This figure shows results for isotropic models (black; $r_{\mathrm{a}}=\infty$ ), moderately anisotropic models (blue; $r_{\mathrm{a}}=2$ ), and maximally anisotropic models (red; $r_{\mathrm{a}}=1.297$ ). Between $N_{\mathrm{real}}=4$ and $8$ realizations of each encounter are plotted as points to show run-to-run variations. For each ensemble, we plot a box centered on the sample means $\overline{r_{\mathrm{p1}}}$ and $\overline{v_{\mathrm{p1}}}$ . Each box extends $\pm 2N_{\mathrm{real}}^{-1/2}s_{r}$ in the horizontal direction, and $\pm 2N_{\mathrm{real}}^{-1/2}s_{v}$ in the vertical direction, where $s_{r}$ and $s_{v}$ are sample standard deviations in $r_{\mathrm{p1}}$ and $v_{\mathrm{p1}}$ , respectively (estimated assuming $1$ degree of freedom). Thus, to the extent that the central limit theorem applies for small samples, the boxes indicate $\sim 2\sigma$ uncertainties in the sample means. It is evident that pericentric separation and velocity are both influenced by anisotropy, with smaller $r_{\mathrm{a}}$ values yielding closer and faster passages.

The systematic trends with $r_{\mathrm{a}}$ seen here imply that a significant amount of orbital evolution is taking place during the approach to first passage. For equal-mass encounters, the specific orbital angular momentum at first passage is $J_{\mathrm{p1}}=\textstyle\frac{1}{2}r_{\mathrm{p1}}v_{\mathrm{p1}}$ ; the curves plotted in Fig. 6 are contours of constant angular momentum, passing through the mean $\overline{r_{\mathrm{p1}}}$ and $\overline{v_{\mathrm{p1}}}$ of each sample. Anisotropic models clearly lose more orbital angular momentum than their isotropic counterparts. This difference is most dramatic for close passages; among the $r_{\mathrm{p}}=1$ encounters, the most anisotropic models have lost $\sim 42$ per cent of their initial orbital angular momentum, while their isotropic counterparts have lost only $\sim 19$ per cent. The latter is consistent with a previous study of encounters between disc galaxy models dominated by isotropic dark haloes, which found average losses of $\sim 16$ per cent (Barnes, 2016).

3.1.1 Starting time

As noted in § 2.5, most of our simulations start at $t_{0}=-200$ time units before pericenter, which places the two models relatively close. We therefore ran experiments starting at $t_{0}=-400$ . These encounters used anisotropy radii $r_{\mathrm{a}}=1.297$ and $\infty$ , and initial orbits with pericentric separations $r_{\mathrm{p}}=1$ , $2$ , and $4$ . To overcome run-to-run variation, we simulated $8$ realizations of each encounter (Table 1).

We illustrate several consequences of changing $t_{0}$ in Fig. 7. First, starting earlier appears to reduce the maximum separations that the anisotropic models (top panels) attain after first pericentre. For these models, starting earlier also appears to reduce the time $\Delta t$ between pericentres, although this trend is harder to disentangle due to run-to-run variations. In contrast, the isotropic models exhibit no obvious trends in separation or $\Delta t$ with starting time. Finally, starting earlier seems to increase the run to run variation; dotted curves, which show results for the earlier starting times, appear more spread out then their solid counterparts. This is further discussed in Appendix A.

Fig. 8 illustrates the effect of starting times on pericentric parameters. Consistent with what we have just seen, anisotropic encounters which start earlier undergo more orbit decay, prior to first passage, than those which start later. The curves of constant angular momentum are clearly different in the anisotropic cases, but essentially indistinguishable in the isotropic cases. This difference arises because anisotropic systems begin transferring angular momentum earlier in the experiments starting at $t_{0}=-400$ . For these experiments, up to $\sim 48$ per cent of the initial angular momentum is lost before first pericentre.

3.2 Response To Tidal Forces

Orbit decay in galaxy encounters is driven by the transfer of energy and angular momentum. If galaxies could not deform during tidal encounters, this transfer would be impossible. We therefore studied the shape of our galaxies prior to first pericentre, when they have not yet intermingled. Tidal deformation is already visible in Fig. 3, but the response of each galaxy is hard to follow, as they partially overlap. By contouring the surface density for just one of the two galaxies, we can accurately visualize its response to the other galaxy’s tidal field; Fig. 9 shows examples. The anisotropic simulations in the top row are conspicuously elongated, roughly towards the other galaxy, from $t=-60$ onward. The isotropic simulations in the bottom row show little sign of distortion until $t=-20$ .

Fig. 10 compares shape responses of anisotropic and isotropic encounters for all $r_{\mathrm{p}}$ values. As in Sec. 2.4, we section each realization by binding energy into $8$ shells; here we plot $(c/a)_{7}$ , the minor-to-major axial ratio for the $7^{\mathrm{th}}$ shell. This probes the tidal response at the relatively large radii which drive orbit decay. As Fig. 9 already suggests, the anisotropic systems show a consistently earlier and stronger response to the other galaxy’s tidal field. We observe a similar pattern of behavior for more tightly-bound shells, although the response is weaker since tides have less effect at smaller radii.

Starting time has a clear effect on the orbital evolution of anisotropic models, and in Fig. 11 we show how starting time influences shape response. For isotropic models, the choice of starting time makes little difference; these systems remain nearly spherical until they draw close to each other. In contrast, starting time has a large effect on the shapes of the anisotropic models; the encounters started at $t_{0}=-400$ are already exhibiting significant distortions (e.g., $(c/a)_{7}\simeq 0.9$ ) by $t_{0}=-200$ .

Tidal distortion is necessary for orbit decay, but in order to actually transfer angular momentum, it’s also necessary for this distortion to lag behind the current tidal forcing. This lag arises because a stellar system doesn’t respond instantaneously to tidal perturbations; rather, a velocity perturbation imposed at time $t$ produces a density response at times $t^{\prime}>t$ . In the case of a pair of galaxies making their initial approach, the major axis of each galaxy’s tidal distortion does not point directly toward its companion, but rather toward where the companion was at some earlier time. This misalignment produces a tidal torque which transfers orbital angular momentum to internal rotation.

Fig. 12 shows how orientations evolve during the approach to first pericentre. We use the simulations starting at $t_{0}=-400$ , which follow pre-encounter evolution more completely. Each galaxy is represented by a thin curve plotting the angle $\theta_{7}$ of its $7^{\mathrm{th}}$ shell, measured clockwise from the $+x$ axis; this curve is plotted from the time when the shell’s axial ratio $(c/a)_{7}$ falls below $0.9$ ( $0.95$ ) for the anisotropic (isotropic) models, respectively. The heavier solid curves are means computed over the entire sample. Finally the filled circles show angles to companion galaxies, again measured from the $+x$ axis.

This figure illustrates the delayed response necessary for orbit decay. It also points up another factor which plausibly accelerates orbit decay in encounters of anisotropic galaxies: the size of the lag. At $t\ll t_{0}=-400$ the companion galaxy’s bearing would have been $\theta\simeq 0$ degrees (i.e., along the $+x$ axis), but by $t=t_{0}$ it has swung clockwise, reaching $\theta\simeq 11$ to $22$ degrees for $r_{\mathrm{p}}=1$ to $4$ , respectively. Until shortly before pericenter ( $t=0$ ), the distance between the galaxies scales approximately as $r\propto|t|^{2/3}$ , and their mutual angular velocity scales as $\dot{\theta}\propto Jr^{-2}\propto|t|^{-4/3}$ . Anisotropic galaxies respond earlier to the tidal forcing (growing as $r^{-3}$ ), and their outer regions align roughly with the bearing of the other galaxy at early times. In contrast, the isotropic galaxies don’t respond until later, by which time the companion galaxy has swung further away from the $x$ axis. As a result, anisotropic galaxies lag more than their isotropic counterparts. This increased misalignment, along with the stronger tidal distortions already described, enable the anisotropic encounters to lose orbital angular momentum more rapidly.

3.3 Perturbation Experiments

To examine the effects of anisotropy in a less complicated setting, we simulated the response of single galaxy disturbed by a tide-like field. This allowed us to focus on the response without trying to disentangle the detailed interaction of two galaxies. We implemented the tide-like field by impulsively altering particle velocities. Inasmuch as any perturbation may be approximated by a series of impulses, we expect that the response to a single impulse to be governed by the same physics which is relevant during the initial stages of a tidal encounter.

Our first experiments adopted a linear tidal field aligned with the $x$ -axis, mimicking the effect of a brief tidal perturbation due to a large mass located far away. However we found that this simple linear prescription had an oversized effect on particles at large radii. In particular, loosely bound particles far from the centre were given such large velocities that they became unbound.

To reduce the magnitude of the velocity perturbation on outlying particles, we next tried the following:


	$\displaystyle v_{x}^{\prime}=v_{x}+\frac{2A}{1+r/r_{\mathrm{0}}}\,x\,,$		(9a)
	$\displaystyle v_{y}^{\prime}=v_{y}-\frac{A}{1+r/r_{\mathrm{0}}}\,y\,,$		(9b)
	$\displaystyle v_{z}^{\prime}=v_{z}-\frac{A}{1+r/r_{\mathrm{0}}}\,\,z\,.$		(9c)

The parameter $A$ controls the strength of the perturbation. We rather arbitrarily set $r_{\mathrm{0}}=1+\sqrt{2}$ , which is the half mass radius of our original Hernquist model. In effect, this scheme follows the linear relationship at small radii but reduces the perturbation strength for $r>r_{\mathrm{0}}$ .

After perturbing the velocities, we simulated the model evolution and quantified the shape using the same methodology as in § 2.4. Experimenting with a range of values, we found that $A=0.01$ produced a clear response which relaxed in a few hundred time units. The $7^{\mathrm{th}}$ shell gave the most unambiguous results, with more tightly bound shells behaving in a similar but less dramatic fashion.

Fig. 13 compares results for three different levels of anisotropy. Clearly, anisotropic models are more easily deformed than their isotropic counterparts. The $(b/a)_{7}$ ratios closely track the $(c/a)_{7}$ ratios, indicating these distortions are prolate, as expected given the symmetry of the imposed perturbation. These results may be compared with the results shown in Fig. 10. Here the models are responding to a brief initial perturbation rather than the increasing tidal fields generated during initial approach, but in both cases the anisotropic models exhibit the strongest response. We argue that this stronger response accounts for the difference in orbit decay.

4 DISCUSSION

In encounters of equal-mass spherical systems, the rate of orbit decay is sensitive to radial velocity anisotropy. We demonstrated this by simulating encounters between idealized models with Hernquist (1990) density profiles and Osipkov (1979)-Merritt (1985) velocity distributions. Fig. 14 summarizes results for all simulations listed in Table 1. The vertical axis is the log of $\Delta t$ , the time between $1^{\mathrm{st}}$ and $2^{\mathrm{nd}}$ pericentric passages, averaged over all realizations in each ensemble. We plot the inverse of the anisotropy radius $r_{\mathrm{a}}$ on the horizontal axis to encompass both anisotropic and isotropic ( $1/r_{\mathrm{a}}=0$ ) models. Across the range of parabolic orbits in this study, the most anisotropic models merge in roughly half the time their isotropic counterparts require.

This result was prefigured by some earlier studies. Arena & Bertin (2007) simulated the evolution of satellite galaxy systems with mass ratios of $10\!:\!1$ , using primary galaxies with Osipkov-Merritt velocity distributions and Plummer (1911) or Jaffe (1983) density profiles. They found that anisotropy reduced the orbit decay time by $\sim 8$ per cent for the former, and by $\sim 22$ per cent for the latter. Their results are qualitatively consistent with, but smaller than, the effects we observe; the difference may be due to adopted mass ratio, emphasis on initially bound versus parabolic orbits, or degree of anisotropy.

Curiously, McMillan et al. (2007) did not report any effect of halo anisotropy on orbit decay in equal-mass disk galaxy mergers. As part of a multi-parameter study of remnant structure, they ran experiments with both radially and tangentially anisotropic haloes. They found that initial anisotropy has a modest effect on remnant halo density profiles (consistent with White, 1979), but very little influence on other remnant properties. We estimate that their models were only mildly anisotropic, perhaps comparable to our models with anisotropy radius $r_{\mathrm{a}}=4$ , and conjecture that the resulting decrease of orbit decay times was not dramatic enough to be mentioned in a study focused on remnant haloes.

More recently, Vasiliev et al. (2022) simulated orbital evolution in unequal-mass systems to study radialization of satellite orbits. They present two simulations involving anisotropic primary galaxies, with primary-to-satellite mass ratios of $10\!:\!1$ , and find that radial (tangential) anisotropy enhances (suppresses) radialization. Close inspection of their Fig. 3 also suggests that their radially anisotropic model may have merged more rapidly than its isotropic counterpart.

The immediate reason for faster mergers between anisotropic systems appears straightforward: anisotropic models are more easily deformed during tidal interactions. We have shown that anisotropic models become more non-spherical than their isotropic counterparts during the initial stages of an encounter; it’s likely that this earlier response also explains their larger misalignments with the present positions of their companions (§ 3.2). This accelerates the transfer of orbital angular momentum to internal degrees of freedom, speeding up the merging process. Since our interpretation is based on very general considerations, it’s likely to apply in any tidal encounter between galaxies with significant radial anisotropy.

Simulations of isolated models support the idea that radial anisotropy makes systems more sensitive to non-radial perturbations (§ 3.3). Several previous studies have also found that radial anisotropy increases galactic responses to tides. In simulated Milky Way-LMC encounters, Rozier et al. (2022) report that the wake induced in the Milky Way’s stellar halo by the passage of the LMC is larger if the halo is radially anisotropic, and smaller if it is tangentially anisotropic (in both cases the dark halo was isotropic). This suggests the strength of the response does not depend on self-gravity, since the stellar halo’s mass is negligible. Similarly, Vasiliev (2024) simulated Milky Way-LMC encounters and found that the wake induced by the LMC was larger when the Milky Way’s dark halo was radially anisotropic.

The reason for this larger response is not immediately obvious. It’s intuitively appealing to think that the anisotropic pressure in our models explains why these systems are more responsive to non-radial perturbations. But Palmer & Papaloizou (1987), in a closely related context, showed that pressure anisotropy fails to predict the onset of the radial orbit instability. They present a physical interpretation of the instability based on the precession rates of very eccentric orbits subject to a bar-like perturbation; we suggest that a similar mechanism may be at work here. Linear theory calculations (e.g., Binney & Tremaine, 2008, Ch. 5) and test-particle simulations may help clarify the basis for the stronger response of anisotropic systems.

We found that the choice of starting time has a significant influence on encounters of anisotropic systems. In Fig. 14, the open markers represent runs starting at $t_{0}=-400$ instead of the default $t_{0}=-200$ . The choice of starting time appears to have no effect on the evolution of the isotropic models; for such systems, the potential-energy criterion $\delta U\ll GM^{2}/r_{0}$ (see equation 8) appears adequate. However, the anisotropic models behave differently; those started earlier merge faster. This supports the idea that anisotropic galaxies are susceptible to tidal deformations quite early in the encounter process. It is possible that even $t_{0}=-400$ is not early enough to fully capture the effect of tidal interactions between these anisotropic systems. The importance of early tidal interaction was not appreciated in previous studies; computational considerations generally favor launching galaxies in relatively close proximity. Given that the choice of initial separation appears to have significant consequences for encounters involving anisotropic systems, it should be explicitly discussed in future studies.

Our results are relevant to ‘idealised’ simulations in which two or more equilibrium galaxy models are allowed to interact with each other. It seems plausible that haloes formed partly by gravitational collapse would have predominantly radial orbits. Idealised models could be made more realistic by including some level of radial anisotropy. Below, we outline three possible applications.

First, shorter merger time scales due to halo anisotropy may influence estimates of the merger rate. Lotz et al. (2008) used merger time scales derived from idealised simulations, together with observational counts of interacting galaxies, to infer rates of galaxy mergers. The haloes used in their simulations were very close to isotropic, potentially overestimating the merger time scale. If galaxies take less time to merge, then the merger rate must be higher to account for the number of merging galaxies observed.

Second, faster orbit decay may help explain the persistence of tidal tails in merger remnants. Tails formed during interactions of disc galaxies embedded in massive dark haloes generally remain bound and may fall back before the galaxies finally merge (e.g., Barnes, 2016, and references therein). Such complete recapture is incompatible with observations of long-tailed merger remnants such as NGC 7252 (Schweizer, 1982). Faster orbit decay in systems with anisotropic haloes may make it easier for galaxies to merge before their tails fall back. We hope to explore this question in future work.

Third, semi-analytic models of galaxy formation often estimate merger times from simple formulas based on the Chandrasekhar (1943) treatment of dynamical friction. Boylan-Kolchin, Ma, & Quataert (2008) use simulated mergers of isotropic haloes to argue that these simple formulas underestimate merging times by factors of $\sim 1.7$ or more, distorting key semi-analytic predictions such as the build-up of central galaxy stellar masses. Including anisotropy in the simulations would presumably – for purely fortuitous reasons – have reduced the discrepancy with the dynamical friction formulas.

Possible directions for future work include:

1.

Encounter dynamics. The present study could be extended to head-on encounters (in which angular momentum transfer does not play a role), and to encounters involving tangentially anisotropic models. The early stages of equal-mass tidal encounters may be amenable to linear theory calculations.
2.

Unequal-mass interactions. We expect that our results also apply to encounters between systems with different masses. While Arena & Bertin (2007) found that satellite galaxies orbiting in radially anisotropic haloes suffer more rapid orbit decay, it would be worthwhile to understand why the effects they report are smaller than those we observe.
3.

Remnant structure. McMillan et al. (2007) found that moderate levels of initial anisotropy have little effect on remnant anisotropy, and only slight effects on remnant shape. However, we speculate that the earlier transfer of angular momentum in encounters between anisotropic systems may produce remnants with more of their angular momentum retained at large radii.

The implications of our results for cosmological simulations, or for real galaxies, are somewhat more complex. Simulations in which structure grows from linear perturbations automatically incorporate halo anisotropy. In the specific case of $\Lambda$ CDM, Wojtak et al. (2005) found that haloes in dark matter only simulations are approximately isotropic at their centres but radially anisotropic further out, rising to $\beta\simeq 0.5$ at the virial radius. Such haloes, while less anisotropic than our most extreme examples, may be anisotropic enough to influence merger times. However, haloes in cosmological simulations don’t form as spherical systems; instead, they are generically triaxial, and typically aligned with their neighbors and with the filamentary pattern of large-scale structure (e.g., Bailin & Steinmetz, 2005; Veena et al., 2018; Delgrado et al., 2023). Idealised simulations don’t capture these phenomena. But the mutual alignments created when idealised anisotropic models interact qualitatively mimic the long-range alignments seen in cosmological simulations; the subsequent evolution of such models may be a useful analog to the behavior of real systems.

4.1 Summary

Our investigation of radial anisotropy in encounters of spherical galaxy models has shown that anisotropic models merge faster than their isotropic counterparts. In comparison to isotropic models, anisotropic models are more susceptible to tidal deformation. This accelerates the process of momentum transfer and orbit decay. Measurements of deformation confirm that anisotropic models become more elongated than their isotropic counterparts. During encounters, anisotropic models arrive at first passage having already lost significant amounts of angular momentum.

ACKNOWLEDGEMENTS

We thank the referee for a constructive and stimulating report, and Colby Haggerty and Jennifer Lotz for helpful discussions. JEB is grateful to the Yukawa Institute for Theoretical Physics, Kyoto University, for generous hospitality in Fall 2022 and Summer 2023, and especially to Atsushi Taruya for sponsoring me during these visits. LS thanks the members of the Spring 2022 Senior Research Project class for feedback and suggestions.

DATA AVABILITY

Simulation data and software are available by request to [email protected].

References

(1)
Amorisco (2017) Amorisco, N. C. 2017, MNRAS, 464, 2882
Antonov (1973) Antonov, V. A., 1973, Soviet Ast., 17, 428
Arena & Bertin (2007) Arena, S. E. & Bertin, G. 2007, A&A, 463, 921
Bailin & Steinmetz (2005) Bailin, J., Steinmetz, M. 2005, ApJ, 627, 647
Barnes (2012) Barnes, J. E., 2012, MNRAS, 425, 1104
Barnes (2016) Barnes, J. E., 2016, MNRAS, 455, 1957
Barnes & Hut (1986) Barnes, J., Hut, P., 1986, Nature, 324, 446
Barnes et al. (1986) Barnes, J., Goodman, J., Hut, P., 1986, ApJ, 300, 112
Binney & Tremaine (2008) Binney, J., Tremaine, S., 2008, Galactic Dynamics: Second Edition. Princeton University Press, Princeton, NJ
Boylan-Kolchin, Ma, & Quataert (2008) Boylan-Kolchin, M., Ma, C.-P., Quataert, E. 2008, MNRAS, 383, 93
Buyle et al. (2007) Buyle, P., van Hese, E., de Rijcke, S., Dejonghe, H., 2007, MNRAS, 375, 1157
Chandrasekhar (1943) Chandrasekhar, S. 1943, ApJ, 97, 255.
Cox et al. (2006) Cox, T. J., Jonsson, P., Primack, J. R., Somerville, R. S., 2006, MNRAS, 373, 1013
Debattista & Sellwood (2000) Debattista, V. P., Sellwood, J. A., 2000, ApJ, 543, 704
Delgrado et al. (2023) Delgrado, A. M. et al. 2023, MNRAS, 523, 5899
Drakos et al. (2019) Drakos, N. E., Taylor, J. E., Berrouet, A., Robotham, A. S. G., Power, C., 2019, MNRAS, 487, 993
Fulton & Barnes (2001) Fulton, E., Barnes, J. E., 2001, Ap&SS, 276, 851
Hernquist (1990) Hernquist, L., 1990, ApJ, 356, 359
Jaffe (1983) Jaffe, W., 1983, MNRAS, 202, 995
Lotz et al. (2008) Lotz, J. M., Jonsson, P., Cox, T. J., Primack, J., 2008, MNRAS, 391, 1137
Makino & Hut (1997) Makino, J., Hut, P., 1997, ApJ, 481, 83
McMillan et al. (2007) McMillan, P. J., Athanassoula, E., & Dehnen, W. 2007, MNRAS, 376, 1261.
Merritt (1985) Merritt, D., 1985, AJ, 90, 1027
Merritt & Aguilar (1985) Merritt, D., Aguilar, L. A., 1985, MNRAS, 217, 787
Meza & Zamorano (1997) Meza, A., Zamorano, N., 1997, ApJ, 490, 136.
Osipkov (1979) Osipkov, L. P., 1979, Azh, 5, 77L
Palmer & Papaloizou (1987) Palmer & Papaloizou, 1987, MNRAS 224, 1043
Plummer (1911) Plummer, H. C., 1911, MNRAS, 71, 460
Rozier et al. (2022) Rozier, S., Famaey, B., Siebert, A., et al. 2022, ApJ, 933, 113
Schweizer (1982) Schweizer, F., 1982, ApJ, 252, 455.
Springel & White (1999) Springel, V., White, S. D. M., 1999, MNRAS, 307, 162.
Toomre & Toomre (1972) Toomre, A., Toomre, J., 1972, ApJ, 178, 623.
van Albada & van Gorkom (1977) van Albada, T. S., van Gorkom, J. H., 1977, A&A, 54, 121
Vasiliev et al. (2022) Vasiliev, E., Belokurov, V., & Evans, N. W. 2022, ApJ, 926, 203
Vasiliev (2024) Vasiliev, E. 2024, MNRAS, 527, 437.
Veena et al. (2018) Veena, G. P., Cautun, M., van de Weygaert, R., Tempel, E., Jones, B. J. T., Rieder, S., Frenk, C. S., 2018, MNRAS, 481, 414
Vergne & Muzzio (1995) Vergne, M. M., Muzzio, J. C., 1995, MNRAS, 276, 439
Villumsen (1982) Villumsen, J. V., 1982, MNRAS, 199, 493
Villumsen (1983) Villumsen, J. V., 1983, MNRAS, 204, 219
White (1978) White, S. D. M., 1978, MNRAS, 184, 185
White (1979) White, S. D. M., 1979, MNRAS, 189, 831
Wojtak et al. (2005) Wojtak, R., Łokas, E. L., Gottlöber, S., Mamon, G. A., 2005, MNRAS, 361, L1

Appendix A RUN-TO-RUN VARIATIONS

We have taken advantage of the relatively low computational cost of our numerical experiments by running from $4$ to $8$ independent realizations of each physical scenario (Table 1). Run-to-run variations among ostensibly equivalent realizations can be surprisingly large; in some ensembles, the spread in the orbit decay time $\Delta t$ is $\sim 10$ per cent of the mean value. Below, we show that these variations are primarily driven by the run-to-run spread in orbital angular momentum.

A galaxy realization is generated by independently drawing $N$ particle positions $\mathbf{r}_{i}$ and velocities $\mathbf{v}_{i}$ from a DF (§ 2.3). Integrated quantities therefore have variations of $O(N^{-1/2})$ from one realization to the next. For example, consider the $x$ -component of the centre-of-mass position, which is $\overline{x}\equiv N^{-1}\sum_{i=1}^{N}x_{i}$ for $N$ equal-mass particles. In the large- $N$ limit, $\overline{x}$ has a normal distribution, with a mean of $0$ and a standard deviation of $\sigma_{\overline{x}}=x_{\mathrm{rms}}N^{-1/2}$ , where $x_{\mathrm{rms}}\equiv(N^{-1}\sum_{i=1}^{N}x_{i}^{2})^{1/2}$ is the RMS value of the $x$ coordinate. Since the models are spherically symmetric, the $y$ and $z$ -components have the same distribution. Similar reasoning applies to the centre-of-mass velocity, and to the net angular momentum. The net binding energy $\mathcal{E}$ also varies about its mean value $\mathcal{E}\simeq-0.0850$ ; numerical tests confirm that $\sigma_{\mathcal{E}}\propto N^{-1/2}$ . Table 2 lists dispersions in position, velocity, angular momentum, and energy for isotropic models realized with $N=65536$ particles.

Table 2: Standard deviations in 1-D centre-of-mass position, velocity, angular momentum, and in net binding energy for realizations with

N=65536

particles.

Quantity	Symbol	Value
C.M. Position	$\sigma_{r}$	0.0464
C.M. Velocity	$\sigma_{v}$	0.000938
Angular Momentum	$\sigma_{\mathcal{J}}$	0.00376
Binding Energy	$\sigma_{\mathcal{E}}$	0.000792

To set up an encounter, two such galaxy realizations are translated by $\pm\frac{1}{2}\mathbf{r}_{0}$ in position and $\pm\frac{1}{2}\mathbf{v}_{0}$ in velocity, as specified by equation (7). The initial parabolic orbit has angular momentum about the $z$ -axis

\mathcal{J}_{\mathrm{orb}}=\sqrt{GM^{3}r_{\mathrm{p}}}=\frac{1}{2}M\,(\mathbf{r}_{0}\mathbf{\times}\mathbf{v}_{0})\,\mathbf{\cdot}\,\hat{\mathbf{z}}\,,

(10)

where $M$ is the mass of a single model, and $\hat{\mathbf{z}}$ is a unit vector along the $z$ -axis. For finite $N$ , however, the initial centre-of-mass position $\mathbf{r}_{\ell}$ and velocity $\mathbf{v}_{\ell}$ of galaxy realization $\ell=1,\,2$ will be offset by random vectors of magnitude $\sim\sigma_{r}$ and $\sim\sigma_{v}$ , respectively. As a result, the effective orbital angular momentum

\mathcal{J}_{\mathrm{eff}}=\sum_{\ell=1}^{2}M\,(\mathbf{r}_{\ell}\mathbf{\times}\mathbf{v}_{\ell})\,\mathbf{\cdot}\,\hat{\mathbf{z}}\,,

(11)

includes stochastic terms proportional to $\sigma_{r}$ and $\sigma_{v}$ .

For the encounters considered here, the run-to-run variation in $\mathcal{J}_{\mathrm{eff}}$ is dominated by the variation in centre-of-mass velocity. This is a direct consequence of our efforts to keep the models from overlapping too much at the start of the simulations. As an example, starting the $r_{\mathrm{p}}=1$ orbit at time $t_{0}=-200$ initially places the models $r_{0}\simeq 70.2$ length units apart. The perpendicular orbital velocity of each model is then only $\sim 15$ times larger than the variation in centre-of-mass velocity for $N=2^{16}$ particles. In brief, it’s hard to come close to the intended angular momentum when galaxy realizations must be launched from far apart.

More formally, the variance of $\mathcal{J}_{\mathrm{eff}}$ is

\mathrm{Var}(\mathcal{J}_{\mathrm{eff}})=2\,(\textstyle\frac{1}{2}r_{0}\sigma_{v})^{2}+2\,(\textstyle\frac{1}{2}v_{0}\sigma_{r})^{2}\,.

(12)

The first term in this sum accounts for $\sim 97$ per cent of the variance for orbits starting at $t_{0}=-200$ , and $\sim 99$ per cent for orbits starting at $t_{0}=-400$ . For the various choices of $r_{\mathrm{p}}$ and $t_{0}$ used in this work, the standard deviation of $\mathcal{J}_{\mathrm{eff}}$ is between $2.2$ per cent and $7.2$ per cent of its mean value.

In passing, it’s worth noting that run-to-run variations in orbital binding energy $\mathcal{E}_{\mathrm{eff}}=\frac{1}{2}M(v_{1}^{2}+v_{2}^{2})-GM^{2}/|\mathbf{r}_{1}-\mathbf{r}_{2}|$ are much smaller. For all intents and purposes, the orbits used in our simulations are mono-energetic.

Other things being equal, systems started with more orbital angular momentum should take longer to merge. Figs. 5 and 14 suggest that $\Delta t$ , the time between first and second pericentric passages, is an increasing function of pericentric separation $r_{\mathrm{p}}$ , which in turn is $\propto\mathcal{J}_{\mathrm{orb}}^{2}$ . Fig. 15 provides further support. For each given $r_{\mathrm{a}}$ , there is clearly a well-defined relationship between $\Delta t$ and $\mathcal{J}_{\mathrm{eff}}$ . This figure also shows that, within each ensemble, run-to-run variation in $\mathcal{J}_{\mathrm{eff}}$ accounts for most of the variation in $\Delta t$ .

The sensitivity of anisotropic mergers to the choice of starting time $t_{0}$ becomes very clear when results are plotted on the $\Delta t$ vs. $\mathcal{J}_{\mathrm{eff}}$ plane. Fig. 16 compares $t_{0}=-400$ runs (open symbols, dotted lines) with their $t_{0}=-200$ counterparts (filled symbols, solid lines). This plot confirms that isotropic encounters (grey) merge in the same time for either choice of $t_{0}$ , while anisotropic encounters (red) merge faster when started earlier.

The correlation between $\Delta t$ and $\mathcal{J}_{\mathrm{eff}}$ may be useful to reduce uncertainties. For example, when averaging over the members of an ensemble, more weight could be given to those runs with $\mathcal{J}_{\mathrm{eff}}$ values closer to the ideal $\mathcal{J}_{\mathrm{orb}}$ . An even better strategy might be to use linear fits like those plotted in Fig. 15 to estimate what $\Delta t$ would result from an encounter with $\mathcal{J}_{\mathrm{eff}}=\mathcal{J}_{\mathrm{orb}}$ . The uncertainty in this estimate could be inferred from the residuals of the linear relationship.

Another option, of course, is to subtract the centre-of-mass position and velocity from each galaxy realization before it is translated to its launch coordinates. This strategy ensures that $\mathcal{J}_{\mathrm{eff}}$ for each encounter is exactly equal to $\mathcal{J}_{\mathrm{orb}}$ . The drawback, in our view, is that it introduces approximations into equation (7). In effect, subtracting centre-of-mass coordinates simply ‘papers over’ one obvious manifestation of Monte-Carlo sampling.

Finally, we note that simulations starting earlier that $t_{0}=-400$ would suffer from even more dramatic run-to-run variations. It may be interesting to try earlier starting times, since we are not sure that our experiments start early enough to capture the full effect of anisotropy on orbit decay. Large ensembles or larger values of $N$ might be needed to overcome uncertainties in such experiments.