Orbit decay in encounters between anisotropic spherical galaxies of equal mass
Abstract
We investigate the effect of radial anisotropy on the rate of orbit decay in parabolic encounters of identical spherical galaxies. Our galaxy models have Hernquist density profiles and Osipkov–Merritt velocity distributions. We find that radially anisotropic models merge in as little as half the time of their isotropic counterparts. Anisotropic models are more susceptible to tidal deformation; this accelerates the transfer of orbital angular momentum to internal degrees of freedom. Even during the initial approach, the anisotropic models become more distorted, and arrive at pericentre already having lost substantial amounts of angular momentum. Our results may have implications for estimates of merger rates and persistence of tidal tails.
keywords:
galaxies: evolution – galaxies: interactions – instabilities – methods: numerical1 INTRODUCTION
Galaxy encounters are remarkably inelastic. Two equally-massive galaxies falling together on an interpenetrating parabolic orbit are left on a much smaller and more tightly-bound orbit after their first passage, and typically merge soon after their next approach. This was already clear in early numerical experiments (e.g., van Albada & van Gorkom 1977; White 1978) and has been affirmed many times since. Orbit decay is driven by the transfer of energy and angular momentum from the relative motion of the galaxies to internal motions within each galaxy (Binney & Tremaine, 2008, Ch. 8).
Early studies indicated that the internal velocity structure of interacting galaxies influences the rate of orbit decay. Working with spherical systems, White (1979) found that merging is faster if galaxies spin in the same direction as they orbit one another, and slower if they spin in a retrograde direction. This arises because particles on direct orbits experience a ‘broad resonance’ (Toomre & Toomre, 1972) with the relative motion of the two galaxies, yielding a stronger tidal response. While the role of rotation has been highlighted in subsequent studies of disc-galaxy mergers, little attention seems to have been paid to other options for velocity structure. White (1979) included non-rotating models with both isotropic and circular velocity distributions, and noted that the latter produced merger remnants with more compact cores, but remarked that ‘[t]he strength of interaction does not, however, seem to depend on the initial velocity structure.’
Subsequent simulations of encounters between spherical galaxies have explored a wide range of choices for mass ratios, initial orbits, and initial density profiles (e.g., Villumsen 1982, 1983; Vergne & Muzzio 1995; Makino & Hut 1997; Fulton & Barnes 2001; Boylan-Kolchin, Ma, & Quataert 2008; Drakos et al. 2019). These studies have been somewhat overshadowed by simulations of disc galaxy encounters, which have become increasingly realistic as they attempt to reproduce observed interacting systems and model the formation of elliptical galaxies by mergers. Yet even in disc simulations, most of the mass is invested in a roughly spherical dark halo. These haloes are usually set up with isotropic velocity distributions, although some workers (e.g., Cox et al. 2006, Springel & White 1999) added modest amounts of halo rotation.
A handful of studies have incorporated radial or tangential anisotropy in simulations of galaxy interactions. McMillan et al. (2007) examined encounters of equal-mass disc galaxies embedded in anisotropic haloes, and reported that initial anisotropy had very little effect on the haloes of merger remnants. When simulating interactions between large galaxies and small satellites, Arena & Bertin (2007) found that satellites orbiting radially anisotropic primaries merged in ‘somewhat shorter’ times, while Vasiliev et al. (2022) noted that radial (tangential) anisotropy caused satellite orbits to become more (less) elongated over time. Rozier et al. (2022) and Vasiliev (2024) included halo anisotropy in studies of the Milky Way–LMC system. Both found that radially anisotropic halos respond more strongly to the LMC’s gravitational field; the latter also reported that the present position and velocity of the LMC are consistent with an initial orbit with higher angular momentum if the dark halo is radially anisotropic.
In this paper we return to the study of encounters between identical spherical systems, and explore the influence of radial anisotropy on orbit decay. We focus on spherical systems for simplicity, and because there are straightforward, accurate ways to generate radially anisotropic models of such systems. Our immediate goal is to clarify the physics of orbit decay, rather than to model real galaxies. However, we note that inasmuch as dark haloes form partly by radial collapse, they are likely to have velocity distributions with preferentially radial orbits.
2 METHODS
In concept, our experimental approach is very simple. We construct spherical Hernquist (1990) models with anisotropic velocity distributions following Osipkov (1979) and Merritt (1985). Two such models are launched on a parabolic collision trajectory, and their evolution is followed until they have merged. We vary the amount of anisotropy while holding other parameters fixed, and use the time between first and second passages to measure the timescale of orbit decay.
To implement this approach, we must make a number of approximations. In particular, (1) the model density profile is exponentially tapered toward zero at a finite radius, (2) dynamical evolution is simulated using an N-body method with finite particle number, (3) gravitational interactions are ‘softened’ at small separations, (4) forces are evaluated using a hierarchical tree code which trades accuracy for speed, (5) particle trajectories are integrated using a finite time-step, and (6) the models are launched from a finite separation. We discuss each of these approximations below and estimate their effects on our results.
2.1 Mass Distribution and Gravitational Potential
We use the Hernquist (1990) density profile to model a simple, spherically symmetric galaxy. This profile is fully characterized by two parameters, the mass and scale radius . The density is
(1) |
The corresponding gravitational potential is
(2) |
where is the gravitational constant.
For simplicity, we use units in which . This entails no loss of generality, as the resulting models can be rescaled to any desired system of units. In these units, the half-mass radius is . A circular orbit at this radius has period ; this is a characteristic timescale for dynamical evolution.
At large radii, falls off as . When this density profile is sampled with particles, the outermost one will have radius . Such extreme outliers are numerically inconvenient, so we use a general tapering function (Barnes, 2012, eq. 16) to smoothly taper the density profile, starting at a radius . This effectively limits the outermost particle to a radius of roughly . To keep the total mass of the model fixed, matter at large radii is redistributed within ; this increases the density within radius by just under per cent. In what follows, is the tapered density profile.
Gravitational interactions between particles are softened at short range using the usual Plummer formula: at a distance , the potential of a particle is proportional to , where is a small ‘softening length’. This biases the overall gravitational potential, so is no longer accurate. Since we need accurate potentials to compute velocity distributions (§ 2.2), the tapered density profile is numerically convolved with a Plummer kernel (Barnes, 2012, eq. 6), and the result is used to integrate the potential as a function of radius. In what follows, is the resulting potential.
2.2 Velocity Distribution
We use the Osipkov–Merritt method to construct radially anisotropic DFs (distribution functions) for our models (Osipkov, 1979; Merritt, 1985). For a spherical isotropic system, the DF is , where is the specific binding energy. The Osipkov–Merritt DF is , where ; here, is the specific angular momentum and is the anisotropy radius, a parameter which controls the degree of anisotropy. Resolving the velocity into a radial component and a tangential component , we obtain
(3) |
The velocity distribution at any radius is stratified on prolate spheroids. For , the quantity and the velocity distribution is approximately isotropic. Conversely, for , transverse velocities are weighted more than radial velocities, so the velocity distribution is preferentially radial. The anisotropy at radius is quantified by
(4) |
where the final equality is specific to Osipkov–Merritt DFs. Isotropic distributions have , while radially anisotropic distributions have .
To compute the DF, given a density , potential , and anisotropy radius , we first set
(5) |
Since the potential is a monotonic function of radius, can be considered a function of . The DF is then given by
(6) |
In practice, these steps must be performed numerically, since the potential is not available analytically. We use spline interpolation to invert and compute . An adaptive routine is used to compute the integral in equation (6); the result is tabulated as a function of , and spline interpolation is used for the final differentiation. When given and as inputs, our numerical procedure reproduces the corresponding Osipkov–Merritt DF (Hernquist, 1990) with relative errors per cent.
2.3 N-body Simulations
We use self-consistent N-body simulations to follow the interactions and mergers of our anisotropic models. This approach represents the DF as a collection of particles; particle has mass , position , and velocity . Each particle obeys Newton’s laws of motion in the gravitational field generated by all other particles.
Particles are initialized so that the probability of finding one with coordinates is proportional to the value of . Each particle is initialized independently as follows. First, the particle’s radius is chosen by generating a random number uniformly distributed in the range , and then solving , where is the cumulative mass profile. We then generate a random unit vector and set the particle position . Second, at radius , we use rejection sampling to pick a speed from the speed distribution . Another random unit vector is generated, and we set . Finally, the tangential components of are rescaled by a factor of to yield the particle velocity .
The first and second steps of this procedure, which suffice to generate isotropic N-body realizations, were extensively tested by Barnes (2012). The final step, which transforms an isotropic (spherical) velocity distribution into the spheroidal distribution required by the Osipkov–Merritt formalism, is new. Stability tests, to be described in the next section, confirm that our initial conditions are very close to equilibrium.
In our simulations we use particles for each spherical model. Power-of-two values for ensure that the particle mass , and integer multiples thereof, are exactly represented as floating point numbers. Our specific choice for represents a compromise between sampling accuracy and computing resources. Since particles are initialized independently, integrated quantities will have Monte-Carlo uncertainties of order , or per cent for our value of . ‘Quiet-start’ procedures which suppress Monte-Carlo fluctuations are available for disc simulations (e.g., Debattista & Sellwood, 2000), but not for a general -D problem like the encounters we study here. We therefore ran multiple realizations of each experiment with different random seeds to initialize the particle coordinates, and used run-to-run variation to estimate the uncertainty due to sampling.
Gravitational forces are computed using a tree algorithm (Barnes & Hut, 1986). In this algorithm, the force on each particle is approximated by a sum of terms, which represent nearby particles and a hierarchy of cubical cells at larger distances. To be included in this hierarchy, a cell at a distance from a given particle must satisfy , where is a fixed parameter which controls the trade-off between speed and accuracy. We set and expand the gravitational field of each cell to quadrupole order. With these choices, the median acceleration error is per cent.
As noted above, we adopt Plummer softening to limit gravitational accelerations due to short-range interactions. We use a softening length of , which is small enough to resolve the inner slope of the Hernquist model (Barnes, 2012). Compared to the original potential (equation 2), this softening biases the central potential upward by per cent. However, at radius , the potential bias is only per cent, and at larger radii it’s even smaller. Since the models in our simulated encounters are typically separated by distances greater that , softening is unlikely to have any significant impact on our results.
We use a time-centred leapfrog with a uniform time-step to compute particle trajectories. This simple scheme is adequate for our purposes since softening limits the rate of change of particle accelerations. Our time-step is (again, a power-of-two value guarantees that times have exact representations). With this , our simulations conserve energy and angular momentum to better than per cent.
All our calculations were run on GHz Intel processors, using single-precision arithmetic. A typical encounter simulation, following particles for time units, took hours of CPU time.
2.4 Stability
Spherical systems with predominantly radial orbits are often unstable, spontaneously evolving into roughly prolate configurations (Antonov 1973; Merritt & Aguilar 1985; Barnes et al. 1986). While it’s certainly possible to simulate encounters of unstable systems, it’s not easy to motivate such experiments as analogs of encounters between real galaxies; to the extent that galaxies can be described as equilibrium systems, we expect them be stable or nearly so. Thus our initial simulations were designed to determine the maximum amount of anisotropy our models can bear without becoming noticeably unstable.
In addition to visually inspecting individual simulations, we quantified the effects of the radial orbit instability as follows. After generating each model realization (§ 2.3), all particles are sorted by binding energy ; this ordering will be largely preserved as the system evolves. Sectioning the particle array into equal segments thus defines Lagrangian volumes or ‘shells’ ordered by binding energy. The shape of shell is measured by computing the minor-to-major axis ratio from eigenvalues of the shell’s moment of inertia tensor. The most tightly-bound shells are relatively poor indicators of instability, remaining nearly spherical even when more outlying shells are clearly distorted. We therefore used the shell (i.e., the to percentile), which primarily samples the mass distribution at radii between and , to track the development of the radial-orbit instability. (The shell was not useful, as it includes a few very distant particles which individually bias the moment of inertia tensor; these particles have orbital periods much longer than the duration of our simulations.)
A trial run with indicated that this model is unstable. We therefore tested anisotropy radii where is an integer. This scheme samples finely for small and more coarsely for larger . Single realizations with were run using the N-body parameters given in § 2.3. Models with (i.e., ) appeared stable, with values fluctuating between and unity. In contrast, models with (i.e., ) clearly evolved towards more non-spherical shapes, typically falling below after time units ( orbit periods at the half-mass radius).

To better constrain the boundary of stability, we ran another three realizations of each model with . Fig. 1 presents the results. The additional runs confirm that models with are unstable, while those with appear to be stable. As a group, the models with (i.e., ) typically maintain axial ratios . Individual realizations fluctuate about this value, but none of the four realizations displays the systematically decreasing ratios characteristic of the unstable models.
After completing our tests, we learned that Meza & Zamorano (1997) and Buyle et al. (2007) also studied the stability of anisotropic Hernquist models with Osipkov–Merrit DFs. Meza & Zamorano estimated the stability boundary at , while Buyle et al. placed it at . Our results indicate that both of these models are unstable. The reason for this discrepancy may lie in the method used to detect the instability. Both of the earlier studies quantified the shapes of their simulations using the inner per cent of the mass distribution. Measuring shapes at smaller radii may have led these studies to overlook weak instabilities, which manifest most strongly at large radii.
We adopt the model as the most anisotropic which is dynamically stable enough to be useful in this work. We cannot preclude the possibility that this model is unstable on timescales longer than the duration of our experiments; nor can we preclude an instability which manifests only at extremely large radii. Rather, we argue that such weak instabilities would have little practical consequence for our experiments. In support of this, we note that the merging behavior of our ‘maximally anisotropic’ model is continuous with the behavior of less anisotropic models.
2.4.1 Initial equilibria
As mentioned above, the stability calculations can also be analysed to check our procedure for initializing particle coordinates. Errors in generating equilibrium initial conditions generally manifest on a dynamical timescale. We therefore looked for abrupt evolution in scatter-plots of particle radii vs. radial velocity or tangential velocity , in the ratio of velocity dispersions for particles in nested spherical shells, and in radial mass profiles. No changes on a dynamical timescale were observed, implying our initial conditions, whether stable or not, are very close to equilibrium. In stable anisotropic models, we did notice modest evolution of the ratio , consistent with two-body relaxation slowly driving this quantity toward its isotropic value of unity. However, the duration of our simulations is too short to accurately characterize this effect.
2.5 Encounters
Our experiments mimic encounters of galaxies falling together on asymptotically parabolic trajectories. Initial positions and velocities are obtained from an idealized parabolic orbit, with pericentric separation , of two point-like bodies each of mass . We adopt a coordinate system in which these bodies move clockwise in the plane. At pericentre, which defines time , they are symmetrically placed on the -axis, with positions and velocities . We then track the bodies back to a chosen starting time , yielding their initial separation vector and relative velocity vector .
We next replace these point-like bodies with N-body realizations of the anisotropic DF for a given anisotropy radius . A different random number seed is used for each realization. These realizations are then translated by in position, and in velocity, and superimposed. Thus, the initial conditions realize a system with the DF
(7) |
Note that in constructing our initial conditions, we have not subtracted centre-of-mass positions and velocities from the individual galaxy realizations. As noted in § 2.3, bulk quantities have uncertainties due to Monte-Carlo statistics; as a result, each galaxy realization has an initial offset in position and velocity, of order , with respect to its assigned coordinates. While these offsets are small, they do have measurable consequences, as we show in Appendix A.
1.0 | 1.297 | -200 | 8 |
1.297 | -400 | 8 | |
1.414 | -200 | 8 | |
2.0 | -200 | 4 | |
4.0 | -200 | 4 | |
8.0 | -200 | 4 | |
-200 | 8 | ||
-400 | 8 | ||
2.0 | 1.297 | -200 | 8 |
1.297 | -400 | 8 | |
1.414 | -200 | 4 | |
2.0 | -200 | 4 | |
4.0 | -200 | 4 | |
8.0 | -200 | 4 | |
-200 | 8 | ||
-400 | 8 | ||
4.0 | 1.297 | -200 | 8 |
1.297 | -400 | 8 | |
1.414 | -200 | 4 | |
2.0 | -200 | 4 | |
4.0 | -200 | 4 | |
8.0 | -200 | 4 | |
-200 | 8 | ||
-400 | 8 |
The choice of starting time requires some attention. To be entirely consistent with the chosen two-body orbit, the galaxies should ideally start so far apart that they initially interact like disjoint spherical systems. This isn’t possible for models with extended outer envelopes; some overlap has to be tolerated. We quantified the degree of overlap using
(8) |
where is the total potential energy of the initial configuration, and and are the internal potential energies of the two models. Separate spherical systems yield ; overlap correlates with .
Our experiments start time units before pericentre, unless another time is explicitly stated. This places the models length units apart. For this configuration, the resulting was per cent of , which seemed a bit large but acceptable inasmuch as the same bias applies to all models with the same and . To check, we re-ran some key experiments starting at . This initially placed the models length units apart; the resulting was per cent of .
Table 1 lists parameters for our encounter simulations. We ran at least four realizations of each encounter, and an additional four for a subset. The extra realizations were run to test both choices for at the extremes for and , and to anchor our results by accurately characterizing the most extreme cases.
2.5.1 Orbital trajectories
To follow the process of orbit decay, we extract trajectories from the encounter simulations. Our method relies on the fact that particles which are initially tightly bound remain tightly bound as the system evolves. As noted in § 2.4, the particles in each realization are initially sorted in binding energy , with the most tightly bound particles first. At each time-step, we can therefore estimate the position and velocity of each realization by averaging over the coordinates of its first particles. Our choice of represents a trade-off between statistical and systematic uncertainties; larger values would yield smoother trajectories, but loosely-bound particles could be detached from their centres following a close passage, potentially biasing the measured coordinates at later times. Mean and median positions for representative samples of tightly-bound particles, taken from a large ensemble of simulations, typically agree to length units ( per cent of the softening length, ), indicating that detached outliers have little effect. Key results are also robust against halving or doubling .

Panel (a) of Fig. 2 shows how this works, using an encounter of two isotropic models with . Here, the grey points are the most tightly bound particles in each realization, shown time units after pericentre. These particle distributions are still symmetric and well-localized about the centres of their respective realizations. The solid curves are the trajectories extracted by averaging the positions of these particles at each time-step, while the dashed curves are the idealized parabolic orbits used to set up the initial conditions. All of our encounter simulations unfold in a broadly similar fashion, with the two models approximating idealized parabolic trajectories up to first passage, reaching apocentre shortly thereafter, and subsequently falling back for a second and much closer passage.
Panel (b) shows four simulations, all sampling the same initial DF as in panel (a), but initialized using different random number seeds. Here we plot a single relative trajectory for each simulation, fixing the other model at the origin; this reduces visual clutter at the cost of suppressing the slight asymmetries between individual trajectories seen in the top panel. The variation from run-to-run is due to Monte-Carlo sampling of the initial conditions; to narrow the range, we would need to increase the number of particles, .
Panel (c) plots separations as functions of time for the same ensemble of simulations. We note that the actual time of first pericentre (hereafter ) is delayed relative to the pericentre of the idealized parabolic trajectory, which occurs at . Moreover, the actual pericentric separation (hereafter ) is larger than the idealized separation of . Conservation of orbital energy and and angular momentum predicts differences of this kind for an interpenetrating encounter of two rigid spheres. As we will see, the actual dynamics are more complex; tides deform the models even during their initial approach, transferring energy and angular momentum to internal degrees of freedom.
Panel (c) also shows that each encounter has a well-defined second pericentre at time , and merges shortly thereafter. We will use to measure the timescale for orbit decay.
3 RESULTS
Fig. 3 compares encounters of anisotropic models (top row; ) with equivalent encounters of isotropic models (bottom row). These models were all launched with the same initial positions and velocities, taken from a parabolic encounter of two point masses with pericentric separation . Each frame is a stack of eight independent realizations, with a single contour of projected particle density tracing the shapes of the participants. This figure demonstrates that anisotropy has a significant effect on orbit decay.

The frames on the left depict the models during their approach. Some effects of anisotropy are already visible at these early stages, especially at time , where the contour shows that the anisotropic models are already tidally distorted, while their isotropic counterparts remain nearly spherical. The middle frames, at , show both pairs of models deeply interpenetrating just before first passage; while these images are superficially similar, the anisotropic models continue to show marked distortion along the axis of their initial approach. After first passage, the effects of anisotropy become more dramatic: at , the isotropic models are loitering near apocentre, while the anisotropic models are well past apocentre and plunging toward their second passage. In the final frames, at , the anisotropic models have already merged, while the isotropic models are just beginning their second encounter.
3.1 Orbital Evolution


Fig. 4 compares trajectories from encounters between anisotropic models with their isotropic counterparts. Here, the colored curves represent models with maximal anisotropy , while the light grey curves represent isotropic models. Each panel shows results for a different choice of idealized pericentric separation . Across the entire set of encounters, it’s evident that anisotropy has a strong effect on orbit decay; this effect is much larger than the run-to-run variation between models in each ensemble. The anisotropic models all attain much smaller separations after first passage, and their subsequent trajectories are more nearly radial than those of their isotropic counterparts. Indeed, the anisotropic encounters with come to a near-standstill at apocentre, and fall back together on nearly linear trajectories. This is an example of ‘radialization’ – the tendency for orbits to become more eccentric as they decay. Radialization in unequal-mass encounters is discussed by Amorisco (2017) and Vasiliev et al. (2022); the latter study showed that radial anisotropy in the host accelerates the radialization of satellite orbits. Barnes (2016) illustrates an extreme example of radialization, in which two equal-mass galaxies with isotropic halos reverse their sense of orbital motion after first passage; passages closer than those considered here would likely exhibit this reversal as well.
Careful inspection of Fig. 4 reveals a curious trend; during the initial approach, the trajectories of the anisotropic models lie inside the idealized parabolic orbits, while those of the isotropic models generally fall on the outside. This is not the result of run-to-run variations or a problem with the initial conditions; orbit decay is modifying the trajectories of these models even before first passage.
Fig. 5 shows the same simulations, with separations plotted as functions of time. This figure again shows that the anisotropic simulations have much smaller orbits after the first passage. As one might expect, the encounters between highly anisotropic systems also reach apocentre and fall back to a second pericentre much faster than their isotropic counterparts.

We focus on first pericentre passages in Fig. 6, which plots relative velocities vs. separations for all three values of . This figure shows results for isotropic models (black; ), moderately anisotropic models (blue; ), and maximally anisotropic models (red; ). Between and realizations of each encounter are plotted as points to show run-to-run variations. For each ensemble, we plot a box centered on the sample means and . Each box extends in the horizontal direction, and in the vertical direction, where and are sample standard deviations in and , respectively (estimated assuming degree of freedom). Thus, to the extent that the central limit theorem applies for small samples, the boxes indicate uncertainties in the sample means. It is evident that pericentric separation and velocity are both influenced by anisotropy, with smaller values yielding closer and faster passages.
The systematic trends with seen here imply that a significant amount of orbital evolution is taking place during the approach to first passage. For equal-mass encounters, the specific orbital angular momentum at first passage is ; the curves plotted in Fig. 6 are contours of constant angular momentum, passing through the mean and of each sample. Anisotropic models clearly lose more orbital angular momentum than their isotropic counterparts. This difference is most dramatic for close passages; among the encounters, the most anisotropic models have lost per cent of their initial orbital angular momentum, while their isotropic counterparts have lost only per cent. The latter is consistent with a previous study of encounters between disc galaxy models dominated by isotropic dark haloes, which found average losses of per cent (Barnes, 2016).
3.1.1 Starting time
As noted in § 2.5, most of our simulations start at time units before pericenter, which places the two models relatively close. We therefore ran experiments starting at . These encounters used anisotropy radii and , and initial orbits with pericentric separations , , and . To overcome run-to-run variation, we simulated realizations of each encounter (Table 1).

We illustrate several consequences of changing in Fig. 7. First, starting earlier appears to reduce the maximum separations that the anisotropic models (top panels) attain after first pericentre. For these models, starting earlier also appears to reduce the time between pericentres, although this trend is harder to disentangle due to run-to-run variations. In contrast, the isotropic models exhibit no obvious trends in separation or with starting time. Finally, starting earlier seems to increase the run to run variation; dotted curves, which show results for the earlier starting times, appear more spread out then their solid counterparts. This is further discussed in Appendix A.
Fig. 8 illustrates the effect of starting times on pericentric parameters. Consistent with what we have just seen, anisotropic encounters which start earlier undergo more orbit decay, prior to first passage, than those which start later. The curves of constant angular momentum are clearly different in the anisotropic cases, but essentially indistinguishable in the isotropic cases. This difference arises because anisotropic systems begin transferring angular momentum earlier in the experiments starting at . For these experiments, up to per cent of the initial angular momentum is lost before first pericentre.

3.2 Response To Tidal Forces
Orbit decay in galaxy encounters is driven by the transfer of energy and angular momentum. If galaxies could not deform during tidal encounters, this transfer would be impossible. We therefore studied the shape of our galaxies prior to first pericentre, when they have not yet intermingled. Tidal deformation is already visible in Fig. 3, but the response of each galaxy is hard to follow, as they partially overlap. By contouring the surface density for just one of the two galaxies, we can accurately visualize its response to the other galaxy’s tidal field; Fig. 9 shows examples. The anisotropic simulations in the top row are conspicuously elongated, roughly towards the other galaxy, from onward. The isotropic simulations in the bottom row show little sign of distortion until .


Fig. 10 compares shape responses of anisotropic and isotropic encounters for all values. As in Sec. 2.4, we section each realization by binding energy into shells; here we plot , the minor-to-major axial ratio for the shell. This probes the tidal response at the relatively large radii which drive orbit decay. As Fig. 9 already suggests, the anisotropic systems show a consistently earlier and stronger response to the other galaxy’s tidal field. We observe a similar pattern of behavior for more tightly-bound shells, although the response is weaker since tides have less effect at smaller radii.
Starting time has a clear effect on the orbital evolution of anisotropic models, and in Fig. 11 we show how starting time influences shape response. For isotropic models, the choice of starting time makes little difference; these systems remain nearly spherical until they draw close to each other. In contrast, starting time has a large effect on the shapes of the anisotropic models; the encounters started at are already exhibiting significant distortions (e.g., ) by .

Tidal distortion is necessary for orbit decay, but in order to actually transfer angular momentum, it’s also necessary for this distortion to lag behind the current tidal forcing. This lag arises because a stellar system doesn’t respond instantaneously to tidal perturbations; rather, a velocity perturbation imposed at time produces a density response at times . In the case of a pair of galaxies making their initial approach, the major axis of each galaxy’s tidal distortion does not point directly toward its companion, but rather toward where the companion was at some earlier time. This misalignment produces a tidal torque which transfers orbital angular momentum to internal rotation.
Fig. 12 shows how orientations evolve during the approach to first pericentre. We use the simulations starting at , which follow pre-encounter evolution more completely. Each galaxy is represented by a thin curve plotting the angle of its shell, measured clockwise from the axis; this curve is plotted from the time when the shell’s axial ratio falls below () for the anisotropic (isotropic) models, respectively. The heavier solid curves are means computed over the entire sample. Finally the filled circles show angles to companion galaxies, again measured from the axis.

This figure illustrates the delayed response necessary for orbit decay. It also points up another factor which plausibly accelerates orbit decay in encounters of anisotropic galaxies: the size of the lag. At the companion galaxy’s bearing would have been degrees (i.e., along the axis), but by it has swung clockwise, reaching to degrees for to , respectively. Until shortly before pericenter (), the distance between the galaxies scales approximately as , and their mutual angular velocity scales as . Anisotropic galaxies respond earlier to the tidal forcing (growing as ), and their outer regions align roughly with the bearing of the other galaxy at early times. In contrast, the isotropic galaxies don’t respond until later, by which time the companion galaxy has swung further away from the axis. As a result, anisotropic galaxies lag more than their isotropic counterparts. This increased misalignment, along with the stronger tidal distortions already described, enable the anisotropic encounters to lose orbital angular momentum more rapidly.
3.3 Perturbation Experiments
To examine the effects of anisotropy in a less complicated setting, we simulated the response of single galaxy disturbed by a tide-like field. This allowed us to focus on the response without trying to disentangle the detailed interaction of two galaxies. We implemented the tide-like field by impulsively altering particle velocities. Inasmuch as any perturbation may be approximated by a series of impulses, we expect that the response to a single impulse to be governed by the same physics which is relevant during the initial stages of a tidal encounter.
Our first experiments adopted a linear tidal field aligned with the -axis, mimicking the effect of a brief tidal perturbation due to a large mass located far away. However we found that this simple linear prescription had an oversized effect on particles at large radii. In particular, loosely bound particles far from the centre were given such large velocities that they became unbound.
To reduce the magnitude of the velocity perturbation on outlying particles, we next tried the following:
(9a) | |||
(9b) | |||
(9c) |
The parameter controls the strength of the perturbation. We rather arbitrarily set , which is the half mass radius of our original Hernquist model. In effect, this scheme follows the linear relationship at small radii but reduces the perturbation strength for .
After perturbing the velocities, we simulated the model evolution and quantified the shape using the same methodology as in § 2.4. Experimenting with a range of values, we found that produced a clear response which relaxed in a few hundred time units. The shell gave the most unambiguous results, with more tightly bound shells behaving in a similar but less dramatic fashion.

Fig. 13 compares results for three different levels of anisotropy. Clearly, anisotropic models are more easily deformed than their isotropic counterparts. The ratios closely track the ratios, indicating these distortions are prolate, as expected given the symmetry of the imposed perturbation. These results may be compared with the results shown in Fig. 10. Here the models are responding to a brief initial perturbation rather than the increasing tidal fields generated during initial approach, but in both cases the anisotropic models exhibit the strongest response. We argue that this stronger response accounts for the difference in orbit decay.
4 DISCUSSION

In encounters of equal-mass spherical systems, the rate of orbit decay is sensitive to radial velocity anisotropy. We demonstrated this by simulating encounters between idealized models with Hernquist (1990) density profiles and Osipkov (1979)-Merritt (1985) velocity distributions. Fig. 14 summarizes results for all simulations listed in Table 1. The vertical axis is the log of , the time between and pericentric passages, averaged over all realizations in each ensemble. We plot the inverse of the anisotropy radius on the horizontal axis to encompass both anisotropic and isotropic () models. Across the range of parabolic orbits in this study, the most anisotropic models merge in roughly half the time their isotropic counterparts require.
This result was prefigured by some earlier studies. Arena & Bertin (2007) simulated the evolution of satellite galaxy systems with mass ratios of , using primary galaxies with Osipkov-Merritt velocity distributions and Plummer (1911) or Jaffe (1983) density profiles. They found that anisotropy reduced the orbit decay time by per cent for the former, and by per cent for the latter. Their results are qualitatively consistent with, but smaller than, the effects we observe; the difference may be due to adopted mass ratio, emphasis on initially bound versus parabolic orbits, or degree of anisotropy.
Curiously, McMillan et al. (2007) did not report any effect of halo anisotropy on orbit decay in equal-mass disk galaxy mergers. As part of a multi-parameter study of remnant structure, they ran experiments with both radially and tangentially anisotropic haloes. They found that initial anisotropy has a modest effect on remnant halo density profiles (consistent with White, 1979), but very little influence on other remnant properties. We estimate that their models were only mildly anisotropic, perhaps comparable to our models with anisotropy radius , and conjecture that the resulting decrease of orbit decay times was not dramatic enough to be mentioned in a study focused on remnant haloes.
More recently, Vasiliev et al. (2022) simulated orbital evolution in unequal-mass systems to study radialization of satellite orbits. They present two simulations involving anisotropic primary galaxies, with primary-to-satellite mass ratios of , and find that radial (tangential) anisotropy enhances (suppresses) radialization. Close inspection of their Fig. 3 also suggests that their radially anisotropic model may have merged more rapidly than its isotropic counterpart.
The immediate reason for faster mergers between anisotropic systems appears straightforward: anisotropic models are more easily deformed during tidal interactions. We have shown that anisotropic models become more non-spherical than their isotropic counterparts during the initial stages of an encounter; it’s likely that this earlier response also explains their larger misalignments with the present positions of their companions (§ 3.2). This accelerates the transfer of orbital angular momentum to internal degrees of freedom, speeding up the merging process. Since our interpretation is based on very general considerations, it’s likely to apply in any tidal encounter between galaxies with significant radial anisotropy.
Simulations of isolated models support the idea that radial anisotropy makes systems more sensitive to non-radial perturbations (§ 3.3). Several previous studies have also found that radial anisotropy increases galactic responses to tides. In simulated Milky Way-LMC encounters, Rozier et al. (2022) report that the wake induced in the Milky Way’s stellar halo by the passage of the LMC is larger if the halo is radially anisotropic, and smaller if it is tangentially anisotropic (in both cases the dark halo was isotropic). This suggests the strength of the response does not depend on self-gravity, since the stellar halo’s mass is negligible. Similarly, Vasiliev (2024) simulated Milky Way-LMC encounters and found that the wake induced by the LMC was larger when the Milky Way’s dark halo was radially anisotropic.
The reason for this larger response is not immediately obvious. It’s intuitively appealing to think that the anisotropic pressure in our models explains why these systems are more responsive to non-radial perturbations. But Palmer & Papaloizou (1987), in a closely related context, showed that pressure anisotropy fails to predict the onset of the radial orbit instability. They present a physical interpretation of the instability based on the precession rates of very eccentric orbits subject to a bar-like perturbation; we suggest that a similar mechanism may be at work here. Linear theory calculations (e.g., Binney & Tremaine, 2008, Ch. 5) and test-particle simulations may help clarify the basis for the stronger response of anisotropic systems.
We found that the choice of starting time has a significant influence on encounters of anisotropic systems. In Fig. 14, the open markers represent runs starting at instead of the default . The choice of starting time appears to have no effect on the evolution of the isotropic models; for such systems, the potential-energy criterion (see equation 8) appears adequate. However, the anisotropic models behave differently; those started earlier merge faster. This supports the idea that anisotropic galaxies are susceptible to tidal deformations quite early in the encounter process. It is possible that even is not early enough to fully capture the effect of tidal interactions between these anisotropic systems. The importance of early tidal interaction was not appreciated in previous studies; computational considerations generally favor launching galaxies in relatively close proximity. Given that the choice of initial separation appears to have significant consequences for encounters involving anisotropic systems, it should be explicitly discussed in future studies.
Our results are relevant to ‘idealised’ simulations in which two or more equilibrium galaxy models are allowed to interact with each other. It seems plausible that haloes formed partly by gravitational collapse would have predominantly radial orbits. Idealised models could be made more realistic by including some level of radial anisotropy. Below, we outline three possible applications.
First, shorter merger time scales due to halo anisotropy may influence estimates of the merger rate. Lotz et al. (2008) used merger time scales derived from idealised simulations, together with observational counts of interacting galaxies, to infer rates of galaxy mergers. The haloes used in their simulations were very close to isotropic, potentially overestimating the merger time scale. If galaxies take less time to merge, then the merger rate must be higher to account for the number of merging galaxies observed.
Second, faster orbit decay may help explain the persistence of tidal tails in merger remnants. Tails formed during interactions of disc galaxies embedded in massive dark haloes generally remain bound and may fall back before the galaxies finally merge (e.g., Barnes, 2016, and references therein). Such complete recapture is incompatible with observations of long-tailed merger remnants such as NGC 7252 (Schweizer, 1982). Faster orbit decay in systems with anisotropic haloes may make it easier for galaxies to merge before their tails fall back. We hope to explore this question in future work.
Third, semi-analytic models of galaxy formation often estimate merger times from simple formulas based on the Chandrasekhar (1943) treatment of dynamical friction. Boylan-Kolchin, Ma, & Quataert (2008) use simulated mergers of isotropic haloes to argue that these simple formulas underestimate merging times by factors of or more, distorting key semi-analytic predictions such as the build-up of central galaxy stellar masses. Including anisotropy in the simulations would presumably – for purely fortuitous reasons – have reduced the discrepancy with the dynamical friction formulas.
Possible directions for future work include:
-
1.
Encounter dynamics. The present study could be extended to head-on encounters (in which angular momentum transfer does not play a role), and to encounters involving tangentially anisotropic models. The early stages of equal-mass tidal encounters may be amenable to linear theory calculations.
-
2.
Unequal-mass interactions. We expect that our results also apply to encounters between systems with different masses. While Arena & Bertin (2007) found that satellite galaxies orbiting in radially anisotropic haloes suffer more rapid orbit decay, it would be worthwhile to understand why the effects they report are smaller than those we observe.
-
3.
Remnant structure. McMillan et al. (2007) found that moderate levels of initial anisotropy have little effect on remnant anisotropy, and only slight effects on remnant shape. However, we speculate that the earlier transfer of angular momentum in encounters between anisotropic systems may produce remnants with more of their angular momentum retained at large radii.
The implications of our results for cosmological simulations, or for real galaxies, are somewhat more complex. Simulations in which structure grows from linear perturbations automatically incorporate halo anisotropy. In the specific case of CDM, Wojtak et al. (2005) found that haloes in dark matter only simulations are approximately isotropic at their centres but radially anisotropic further out, rising to at the virial radius. Such haloes, while less anisotropic than our most extreme examples, may be anisotropic enough to influence merger times. However, haloes in cosmological simulations don’t form as spherical systems; instead, they are generically triaxial, and typically aligned with their neighbors and with the filamentary pattern of large-scale structure (e.g., Bailin & Steinmetz, 2005; Veena et al., 2018; Delgrado et al., 2023). Idealised simulations don’t capture these phenomena. But the mutual alignments created when idealised anisotropic models interact qualitatively mimic the long-range alignments seen in cosmological simulations; the subsequent evolution of such models may be a useful analog to the behavior of real systems.
4.1 Summary
Our investigation of radial anisotropy in encounters of spherical galaxy models has shown that anisotropic models merge faster than their isotropic counterparts. In comparison to isotropic models, anisotropic models are more susceptible to tidal deformation. This accelerates the process of momentum transfer and orbit decay. Measurements of deformation confirm that anisotropic models become more elongated than their isotropic counterparts. During encounters, anisotropic models arrive at first passage having already lost significant amounts of angular momentum.
ACKNOWLEDGEMENTS
We thank the referee for a constructive and stimulating report, and Colby Haggerty and Jennifer Lotz for helpful discussions. JEB is grateful to the Yukawa Institute for Theoretical Physics, Kyoto University, for generous hospitality in Fall 2022 and Summer 2023, and especially to Atsushi Taruya for sponsoring me during these visits. LS thanks the members of the Spring 2022 Senior Research Project class for feedback and suggestions.
DATA AVABILITY
Simulation data and software are available by request to [email protected].
References
- (1)
- Amorisco (2017) Amorisco, N. C. 2017, MNRAS, 464, 2882
- Antonov (1973) Antonov, V. A., 1973, Soviet Ast., 17, 428
- Arena & Bertin (2007) Arena, S. E. & Bertin, G. 2007, A&A, 463, 921
- Bailin & Steinmetz (2005) Bailin, J., Steinmetz, M. 2005, ApJ, 627, 647
- Barnes (2012) Barnes, J. E., 2012, MNRAS, 425, 1104
- Barnes (2016) Barnes, J. E., 2016, MNRAS, 455, 1957
- Barnes & Hut (1986) Barnes, J., Hut, P., 1986, Nature, 324, 446
- Barnes et al. (1986) Barnes, J., Goodman, J., Hut, P., 1986, ApJ, 300, 112
- Binney & Tremaine (2008) Binney, J., Tremaine, S., 2008, Galactic Dynamics: Second Edition. Princeton University Press, Princeton, NJ
- Boylan-Kolchin, Ma, & Quataert (2008) Boylan-Kolchin, M., Ma, C.-P., Quataert, E. 2008, MNRAS, 383, 93
- Buyle et al. (2007) Buyle, P., van Hese, E., de Rijcke, S., Dejonghe, H., 2007, MNRAS, 375, 1157
- Chandrasekhar (1943) Chandrasekhar, S. 1943, ApJ, 97, 255.
- Cox et al. (2006) Cox, T. J., Jonsson, P., Primack, J. R., Somerville, R. S., 2006, MNRAS, 373, 1013
- Debattista & Sellwood (2000) Debattista, V. P., Sellwood, J. A., 2000, ApJ, 543, 704
- Delgrado et al. (2023) Delgrado, A. M. et al. 2023, MNRAS, 523, 5899
- Drakos et al. (2019) Drakos, N. E., Taylor, J. E., Berrouet, A., Robotham, A. S. G., Power, C., 2019, MNRAS, 487, 993
- Fulton & Barnes (2001) Fulton, E., Barnes, J. E., 2001, Ap&SS, 276, 851
- Hernquist (1990) Hernquist, L., 1990, ApJ, 356, 359
- Jaffe (1983) Jaffe, W., 1983, MNRAS, 202, 995
- Lotz et al. (2008) Lotz, J. M., Jonsson, P., Cox, T. J., Primack, J., 2008, MNRAS, 391, 1137
- Makino & Hut (1997) Makino, J., Hut, P., 1997, ApJ, 481, 83
- McMillan et al. (2007) McMillan, P. J., Athanassoula, E., & Dehnen, W. 2007, MNRAS, 376, 1261.
- Merritt (1985) Merritt, D., 1985, AJ, 90, 1027
- Merritt & Aguilar (1985) Merritt, D., Aguilar, L. A., 1985, MNRAS, 217, 787
- Meza & Zamorano (1997) Meza, A., Zamorano, N., 1997, ApJ, 490, 136.
- Osipkov (1979) Osipkov, L. P., 1979, Azh, 5, 77L
- Palmer & Papaloizou (1987) Palmer & Papaloizou, 1987, MNRAS 224, 1043
- Plummer (1911) Plummer, H. C., 1911, MNRAS, 71, 460
- Rozier et al. (2022) Rozier, S., Famaey, B., Siebert, A., et al. 2022, ApJ, 933, 113
- Schweizer (1982) Schweizer, F., 1982, ApJ, 252, 455.
- Springel & White (1999) Springel, V., White, S. D. M., 1999, MNRAS, 307, 162.
- Toomre & Toomre (1972) Toomre, A., Toomre, J., 1972, ApJ, 178, 623.
- van Albada & van Gorkom (1977) van Albada, T. S., van Gorkom, J. H., 1977, A&A, 54, 121
- Vasiliev et al. (2022) Vasiliev, E., Belokurov, V., & Evans, N. W. 2022, ApJ, 926, 203
- Vasiliev (2024) Vasiliev, E. 2024, MNRAS, 527, 437.
- Veena et al. (2018) Veena, G. P., Cautun, M., van de Weygaert, R., Tempel, E., Jones, B. J. T., Rieder, S., Frenk, C. S., 2018, MNRAS, 481, 414
- Vergne & Muzzio (1995) Vergne, M. M., Muzzio, J. C., 1995, MNRAS, 276, 439
- Villumsen (1982) Villumsen, J. V., 1982, MNRAS, 199, 493
- Villumsen (1983) Villumsen, J. V., 1983, MNRAS, 204, 219
- White (1978) White, S. D. M., 1978, MNRAS, 184, 185
- White (1979) White, S. D. M., 1979, MNRAS, 189, 831
- Wojtak et al. (2005) Wojtak, R., Łokas, E. L., Gottlöber, S., Mamon, G. A., 2005, MNRAS, 361, L1
Appendix A RUN-TO-RUN VARIATIONS
We have taken advantage of the relatively low computational cost of our numerical experiments by running from to independent realizations of each physical scenario (Table 1). Run-to-run variations among ostensibly equivalent realizations can be surprisingly large; in some ensembles, the spread in the orbit decay time is per cent of the mean value. Below, we show that these variations are primarily driven by the run-to-run spread in orbital angular momentum.
A galaxy realization is generated by independently drawing particle positions and velocities from a DF (§ 2.3). Integrated quantities therefore have variations of from one realization to the next. For example, consider the -component of the centre-of-mass position, which is for equal-mass particles. In the large- limit, has a normal distribution, with a mean of and a standard deviation of , where is the RMS value of the coordinate. Since the models are spherically symmetric, the and -components have the same distribution. Similar reasoning applies to the centre-of-mass velocity, and to the net angular momentum. The net binding energy also varies about its mean value ; numerical tests confirm that . Table 2 lists dispersions in position, velocity, angular momentum, and energy for isotropic models realized with particles.
Quantity | Symbol | Value |
---|---|---|
C.M. Position | 0.0464 | |
C.M. Velocity | 0.000938 | |
Angular Momentum | 0.00376 | |
Binding Energy | 0.000792 |
To set up an encounter, two such galaxy realizations are translated by in position and in velocity, as specified by equation (7). The initial parabolic orbit has angular momentum about the -axis
(10) |
where is the mass of a single model, and is a unit vector along the -axis. For finite , however, the initial centre-of-mass position and velocity of galaxy realization will be offset by random vectors of magnitude and , respectively. As a result, the effective orbital angular momentum
(11) |
includes stochastic terms proportional to and .
For the encounters considered here, the run-to-run variation in is dominated by the variation in centre-of-mass velocity. This is a direct consequence of our efforts to keep the models from overlapping too much at the start of the simulations. As an example, starting the orbit at time initially places the models length units apart. The perpendicular orbital velocity of each model is then only times larger than the variation in centre-of-mass velocity for particles. In brief, it’s hard to come close to the intended angular momentum when galaxy realizations must be launched from far apart.
More formally, the variance of is
(12) |
The first term in this sum accounts for per cent of the variance for orbits starting at , and per cent for orbits starting at . For the various choices of and used in this work, the standard deviation of is between per cent and per cent of its mean value.
In passing, it’s worth noting that run-to-run variations in orbital binding energy are much smaller. For all intents and purposes, the orbits used in our simulations are mono-energetic.
Other things being equal, systems started with more orbital angular momentum should take longer to merge. Figs. 5 and 14 suggest that , the time between first and second pericentric passages, is an increasing function of pericentric separation , which in turn is . Fig. 15 provides further support. For each given , there is clearly a well-defined relationship between and . This figure also shows that, within each ensemble, run-to-run variation in accounts for most of the variation in .

The sensitivity of anisotropic mergers to the choice of starting time becomes very clear when results are plotted on the vs. plane. Fig. 16 compares runs (open symbols, dotted lines) with their counterparts (filled symbols, solid lines). This plot confirms that isotropic encounters (grey) merge in the same time for either choice of , while anisotropic encounters (red) merge faster when started earlier.

The correlation between and may be useful to reduce uncertainties. For example, when averaging over the members of an ensemble, more weight could be given to those runs with values closer to the ideal . An even better strategy might be to use linear fits like those plotted in Fig. 15 to estimate what would result from an encounter with . The uncertainty in this estimate could be inferred from the residuals of the linear relationship.
Another option, of course, is to subtract the centre-of-mass position and velocity from each galaxy realization before it is translated to its launch coordinates. This strategy ensures that for each encounter is exactly equal to . The drawback, in our view, is that it introduces approximations into equation (7). In effect, subtracting centre-of-mass coordinates simply ‘papers over’ one obvious manifestation of Monte-Carlo sampling.
Finally, we note that simulations starting earlier that would suffer from even more dramatic run-to-run variations. It may be interesting to try earlier starting times, since we are not sure that our experiments start early enough to capture the full effect of anisotropy on orbit decay. Large ensembles or larger values of might be needed to overcome uncertainties in such experiments.