This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

The orientations of the binary black holes in GWTC-3

Salvatore Vitale [email protected]    Sylvia Biscoveanu    Colm Talbot LIGO, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA Kavli Institute for Astrophysics and Space Research and Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
Abstract

It is expected that the orbital planes of gravitational-wave (GW) sources are isotropically distributed. However, both physical and technical factors, such as alternate theories of gravity with birefringence, catalog contamination, and search algorithm limitations, could result in inferring a non-isotropic distribution. Showing that the inferred astrophysical distribution of the orbital orientations is indeed isotropic can thus be used to rule out some violations of general relativity, as a null test about the purity of the GW catalog sample, and as a check that selection effects are being properly accounted for. We augment the default mass/spins/redshift model used by the LIGO-Virgo-KAGRA Collaboration in their most recent analysis to also measure the astrophysical distribution of orbital orientations. We show that the 69 binary black holes in GWTC-3 are consistent with having random orbital orientations. The inferred distribution is highly symmetric around π/2\pi/2, with skewness 𝒮post=0.010.17+0.17\mathcal{S}_{\rm{post}}=0.01^{+0.17}_{-0.17}. Meanwhile, the median of the inferred distribution has a Jensen–Shannon divergence of 1.4×1041.4\times 10^{-4} bits when compared to the expected isotropic distribution.

Introduction– The latest gravitational-wave (GW) catalog released by the LIGO [1]-Virgo [2]-KAGRA [3, 4, 5] collaboration (LVK) – GWTC-3 [6] – contains 69 binary black holes (BBH) with false alarm rate smaller than 1 per year [7]. Groups not affiliated with the LVK have reported the discovery of other BBHs, usually with low signal-to-noise ratios (SNRs) [8, 9]. The growing BBH dataset has been used to study the underlying distribution of masses and spins, as well as the evolution of their merger rate with redshift. While the primary masses of the BBHs in GWTC-3 are still consistent with a rather simple phenomenological model (a power law plus a gaussian distribution [10]), there is now tentative evidence for extra structure at around 10 MM_{\odot} and 17 MM_{\odot} [11, 12, 13, 14, 7]. Meanwhile, no evidence has been found for either a lower (i.e. in the mass range between 22 MM_{\odot} and 5M\sim 5~{}M_{\odot} [15, 16]) or an upper mass (i.e. above 50M\sim 50~{}M_{\odot}, in the pair instability mass gap [17]) gap [18]. The distribution of the spin magnitudes peaks at around 0.20.2, if both spins are treated as identically distributed [19, 7], while that of the most rapidly spinning black hole in each binary peaks at 0.4\sim 0.4 [20]. This results in a distribution for the effective inspiral spin χeff\chi_{\rm{eff}} that favors small (0.1\sim 0.1) positive values [21, 7]. In fact, what can be learned from the spins of GWTC-3 is still debated in the literature [22, 23, 21], and more will be learned in the fourth observing run, starting in late 2022. Ref. [24] has reported evidence for an anti-correlation between the mass ratio and the effective inspiral spin of the BBHs in the 44 BBHs of the previous LVK catalog, GWTC-2 [25]. This finding has later been confirmed by the LVK in GWTC-3, at a similar level of significance [7]. Recent works have also found evidence of less significant correlations between the other binary mass parameters and the effective spin distribution [26, 27, 28]. While measurements described above pertain to the intrinsic parameters of the sources, the extrinsic source parameters can also be used to learn about the BBH population. The LVK has shown that the BBH merger rate evolves with redshift in a way that is consistent with the star formation rate [7]. Estimates of source distances can also be used to exclude alternatives to general relativity [29] and to measure the speed of gravitational waves [30].

In this Letter we measure the astrophysical distribution of another important extrinsic parameter, the orbital inclination θJN\theta_{\rm{JN}}, defined as the angle between the total angular momentum and the line of sight. It might seem obvious that the orientation of BBH orbits should be isotropic, i.e. that π(θJN)sinθJN\pi(\theta_{\rm{JN}})\propto\sin\theta_{\rm{JN}} [31, 32]. However, several factors may result in detecting a distribution for θJN\theta_{\rm{JN}} which is not isotropic. First, some alternative theories of gravity, e.g. birefringence, predict that the GW power emitted by a merging binary is not symmetric about the orbital plane [33, 34, 35]: this would result in a preference for detecting sources with inclination of e.g. π/6\pi/6 over sources with inclination of ππ/6\pi-\pi/6, whereas these two values would be equally detectable in general relativity. Second, the algorithms that search for compact binary coalescences (CBCs) assume that spins are perfectly aligned with the orbital angular momentum, and neglect higher order modes (HOMs) [6]. These approximations have been shown to be sufficient when spins are small or aligned with the orbit, and mass ratios are close to unity. However, their accuracy is also a function of the orbital inclination: indeed, the loss of SNR due to neglecting these effects is maximal when θJN=π/2\theta_{\rm{JN}}=\pi/2 (“edge-on” orientation). This could result in systematically missing sources with high orbital inclination and hence in a dip in the inferred θJN\theta_{\rm{JN}} distribution at θJN=π/2\theta_{\rm{JN}}=\pi/2. If this loss of SNR is properly accounted for when calculating selection effects, one should still measure an isotropic distribution. Finally, marginal triggers of non-astrophysical origin (often called glitches) could be mistakenly identified as BBHs, and added in the catalog. It is has been found that that running CBC source characterization algorithms on glitches results in distributions of θJN\theta_{\rm{JN}} peaked at π/2\pi/2. A sub-population of glitches in the catalog would thus result in an excess of posterior probability at around π/2\pi/2 for θJN\theta_{\rm{JN}}. Measuring a distribution of θJN\theta_{\rm{JN}} different from the expected isotropic one could thus indicate problems with our understanding of gravity, with the sensitivity of detection algorithms to highly inclined orbits, or with glitch contamination. Ref. [35] used the sources in GWTC-2 to recast the measured level of symmetry of the inclination angles around π/2\pi/2 into constraints on birefringence and Chern-Simons gravity. They did not perform full hierarchical inference on the astrophysical distribution of inclination angles, which is the focus of this work.

Method– We parametrize the distribution of the masses, spins, redshifts and inclination angles of BBHs in terms of vector of (unknown) hyper-parameters λ\vec{\lambda} (“hyper” to distinguish them from the parameters of individual sources, e.g. masses and spins). Our goal is to infer λ\vec{\lambda} given the dataset DD consisting of the 69 GWTC-3 BBHs with false alarm ratio smaller than 1 per year [7], D{di,i=169}D\equiv\{d_{i},i=1\ldots 69\}. If one is not interested in measuring it, the overall merger rate density can be analytically marginalized over to obtain [36, 37, 38]

p(λ|D)π(λ)i=169p(di|λ)α(λ).p(\vec{\lambda}|D)\propto{\pi(\vec{\lambda})}\prod_{i=1}^{69}\frac{p(d_{i}|\vec{\lambda})}{\alpha(\vec{\lambda})}\,. (1)

In this expression, α(λ)\alpha(\vec{\lambda}) represents the fraction of detectable BBHs given the population parameters λ\vec{\lambda}; π(λ)\pi(\vec{\lambda}) is the population prior and p(di|λ)p(d_{i}|\vec{\lambda}) is the likelihood of the stretch of data containing the i-th BBH. The likelihood of the individual sources can be written in terms of their posterior distributions by marginalizing over the individual source parameters θ(m1,m2,a1,τ1,a2,τ2,z,θJN)\vec{\theta}\equiv(m_{1},m_{2},a_{1},\tau_{1},a_{2},\tau_{2},z,\theta_{\rm{JN}}). In this expression, m1m_{1} and m2m_{2} are the masses of the heavier and lighter black hole in the binary, respectively; aia_{i} is the dimensionless spin of the i-th black hole in the binary, and τi\tau_{i} is the angle between the i-th spin vector and the orbital angular momentum, zz is the redshift and θJN\theta_{\rm{JN}} is the orbital inclinations. For the i-th source, one has [39, 38]:

p(di|λ)=dθp(di|θ)π(θ|λ)=dθp(θ|diPE)π(θ|λ)π(θ|PE),p(d_{i}|\vec{\lambda})=\int\mathrm{d}\vec{\theta}p(d_{i}|\vec{\theta})\pi(\vec{\theta}|\vec{\lambda})=\int\mathrm{d}\vec{\theta}\;\;\frac{p(\vec{\theta}|d_{i}\mathcal{H}_{\mathrm{PE}})\pi(\vec{\theta}|\vec{\lambda})}{\pi(\vec{\theta}|\mathcal{H}_{\mathrm{PE}})}, (2)

where p(θ|diPE)p(\vec{\theta}|d_{i}\mathcal{H}_{\mathrm{PE}}) is the posterior distribution for the source in the i-th stretch of data, and PE\mathcal{H}_{\mathrm{PE}} symbolizes all of the settings that went into the parameter estimation algorithm used to produce those samples. Similarly, π(θ|PE)\pi(\vec{\theta}|\mathcal{H}_{\mathrm{PE}}) are the priors used when sampling the posterior distribution. Finally, π(θ|λ)\pi(\vec{\theta}|\vec{\lambda}) is the population prior, i.e., our model for how the hyper parameters affect the true underlying distribution of the BBH parameters θ\vec{\theta}. We use the posterior samples of the 69 BBHs reported in GWTC-3, as released by the LVK in Refs [40, 41, 42, 43] and approximate the integral with a discrete sum

dθp(di|θ)π(θ|λ)Nsamples1kNsamplesπ(θik|λ)π(θik|PE)\int\mathrm{d}\vec{\theta}p(d_{i}|\vec{\theta})\pi(\vec{\theta}|\vec{\lambda})\simeq{N_{\rm{samples}}}^{-1}\sum_{k}^{N_{\rm{samples}}}\frac{\pi(\vec{\theta}^{k}_{i}|\vec{\lambda})}{\pi(\vec{\theta}^{k}_{i}|\mathcal{H}_{\mathrm{PE}})} (3)

where the Nsamples{N_{\rm{samples}}} samples are drawn from the posterior distribution of the i-th event. We sample the hyper posterior with the GWPopulation algorithm [44], using the dynesty [45] sampler. GWPopulation requires that the same number of Nsamples{N_{\rm{samples}}} is used for all sources. We are thus limited by the source for which fewest samples were made public. That is GW200129_065458, for which Nsamples=3194{N_{\rm{samples}}}=3194. For the sources reported in GWTC-1, we use the samples labelled IMRPhenomPv2_posterior in the data release; for GWTC-2 we use PublicationSamples; for GWTC-2.1 we use PrecessingSpinIMRHM, and for GWTC-3 we use C01:Mixed.

The detection efficiency α(λ)\alpha(\vec{\lambda}) can also be calculated through an approximated sum starting from a large collection of simulated BBHs for which the SNR (or other detection statistic) is recorded, as described in Ref. [46, 7]. We use the endo3_bbhpop-LIGO-T2100113-v12-
1238166018-15843600.hdf5
sensitivity file released by the LVK [47] to calculate α(λ)\alpha(\lambda), using a false alarm threshold of 1 per year to identify detectable sources, consistently with [7]. We stress that this file only contains BBHs which would be detectable by GW observatories at O3 sensitivity, whereas our dataset also includes sources detected in O1 and O2. Unfortunately, the sensitivity files released by the LVK in Ref. [48] which also cover GWTC-1 and GWTC-2 do not include the inclination of the simulated sources, and thus cannot be used in this work. We have generated our own sensitivity files which account for the changing sensitivity of the GW network since O1 and verified that the results presented below don’t change significantly. This makes sense because it is only with next-generation detectors that one should expect a different distribution of detectable inclinations [32, 49] and since O3 does dominate the overall surveyed time-volume. To make our findings easy to reproduce, we therefore only present results obtained with the LVK data products. Finally, we have verified that using only GW BBHs from O3 one obtain consistent results, though with larger error bars.

Models– The population distribution of masses, spins and redshifts are the same as those used by the LVK in their default analysis (Fiducial population mass and redshift and Fiducial population spin in Ref. [7]). Specifically, the distribution of primary masses m1m_{1} is parametrized as the mixture model of a power law and a gaussian distribution (7 parameters: power law slope, mixture fraction, minimum and maximum black hole mass, smoothing factor, mean and standard deviation of the gaussian component) [10]; the mass ratio is parametrized as a power law (1 parameter: power law slope) [37]; the spin magnitudes are parametrized as identically distributed beta distributions (2 parameters) [19]; the spin tilts are modeled as identically distributed mixtures of an isotropic component and a gaussian component centered at zero degrees (2 parameters: the mixture fraction and the width of the preferentially spin-aligned distribution) [50]. We use a beta distribution to parametrize the population distribution of inclination angles, rescaled to take values in the range [0,π][0,\pi], and parametrized by two hyper parameters αθJN\alpha_{\theta_{\rm{JN}}} and βθJN\beta_{\theta_{\rm{JN}}}:

p(θJN|αθJN,βθJN)=xαθJN1(πx)βθJN1B(αθJN,βθJN)παθJN+βθJN+1p(\theta_{\rm{JN}}|\alpha_{\theta_{\rm{JN}}},\beta_{\theta_{\rm{JN}}})=\frac{x^{\alpha_{\theta_{\rm{JN}}}-1}(\pi-x)^{\beta_{\theta_{\rm{JN}}}-1}}{B(\alpha_{\theta_{\rm{JN}}},\beta_{\theta_{\rm{JN}}})\pi^{\alpha_{\theta_{\rm{JN}}}+\beta_{\theta_{\rm{JN}}}+1}} (4)

with B(a,b)Γ(a)Γ(b)/Γ(a+b)B(a,b)\equiv\Gamma(a)\Gamma(b)/\Gamma(a+b). For αθJN=βθJN\alpha_{\theta_{\rm{JN}}}=\beta_{\theta_{\rm{JN}}} this distribution is symmetric around π/2\pi/2: for αθJN=βθJN2.15\alpha_{\theta_{\rm{JN}}}=\beta_{\theta_{\rm{JN}}}\simeq 2.15 it is very similar to sinθJN/2\sin\theta_{\rm{JN}}/2 (The Jensen–Shannon (JS) divergence [51, 52] between sin(θJN)/2\sin(\theta_{\rm{JN}})/2 and p(θJN|2.15,2.15)p(\theta_{\rm{JN}}|2.15,2.15) is 5×105\sim 5\times 10^{-5} bits.), whereas for αθJN=βθJN>2.15\alpha_{\theta_{\rm{JN}}}=\beta_{\theta_{\rm{JN}}}>2.15 (αθJN=βθJN<2.15\alpha_{\theta_{\rm{JN}}}=\beta_{\theta_{\rm{JN}}}<2.15) more (less) posterior weight is placed at θJN=π/2\theta_{\rm{JN}}=\pi/2

For the mass, spin, and redshift hyper parameters we use the same priors used by the LVK, as described in Table VI and Table XII of Ref. [7], with the exception of the two hyper parameters of the spin magnitude beta distribution, for which we used αχ:Uniform(1,5)\alpha_{\chi}:\texttt{Uniform(1,5)}, βχ:Uniform(1,5)\beta_{\chi}:\texttt{Uniform(1,5)}. For the inclination model we use the following priors: αθJN:Uniform(1,8)\alpha_{\theta_{\rm{JN}}}:\texttt{Uniform(1,8)}, βθJN:Uniform(1,8)\beta_{\theta_{\rm{JN}}}:\texttt{Uniform(1,8)}, where the model can be normalized.

Results– In Fig. 1 we show the join distribution of αθJN\alpha_{\theta_{\rm{JN}}} and βθJN\beta_{\theta_{\rm{JN}}}, with KDE contours that increment by steps of 20% of posterior mass. The point (αθJN,βθJN)=(2.15,2.15)(\alpha_{\theta_{\rm{JN}}},\beta_{\theta_{\rm{JN}}})=(2.15,2.15) which corresponds to a nearly isotropic distribution is found within the top 20% of posterior probability. The data yields a joint distribution which is strongly correlated and elongated along the diagonal, indicating a preference for a θJN\theta_{\rm{JN}} distribution which is symmetric around π/2\pi/2. The level of symmetry can be be quantified by calculating the skewness of the beta distribution, 𝒮=2βθJNαθJNβθJN+αθJN+2βθJN+αθJN+1βθJN+αθJN\mathcal{S}=2\frac{\beta_{\theta_{\rm{JN}}}-\alpha_{\theta_{\rm{JN}}}}{\beta_{\theta_{\rm{JN}}}+\alpha_{\theta_{\rm{JN}}}+2}\sqrt{\frac{\beta_{\theta_{\rm{JN}}}+\alpha_{\theta_{\rm{JN}}}+1}{\beta_{\theta_{\rm{JN}}}+\alpha_{\theta_{\rm{JN}}}}}. Drawing 10000 random samples from the hyper posterior, we find 𝒮post=0.010.17+0.17\mathcal{S}_{\rm{post}}=0.01^{+0.17}_{-0.17}. For comparison, 10000 random samples from the prior yield 𝒮prior=0.000.86+0.88\mathcal{S}_{\rm{prior}}=0.00^{+0.88}_{-0.86}, which shows how the GW data narrows down by a factor of 5\sim 5 the possible skewness of the orbital orientation distribution, relative to the prior we used.

Refer to caption
Figure 1: Joint distribution for the hyper parameters of the beta model. The posterior lies along the diagonal dashed line, corresponding to distributions which are symmetric around θJN=π/2\theta_{\rm{JN}}=\pi/2. The point αθJN=βθJN=2.15\alpha_{\theta_{\rm{JN}}}=\beta_{\theta_{\rm{JN}}}=2.15 (marked with a star), corresponding to an approximately isotropic distribution, is included in the top 20% of posterior mass.
Refer to caption
Figure 2: Measured population distribution of inclination angles. The dim red lines are 1000 individual draws from the posterior. The blue band shows the 90% credible interval and blue dashed line its median. Finally, the green dashed line show the expected curve if the source orientations were isotropic.

Next, we take 10000 random samples from the hyper posterior and calculate the corresponding p(θJN|αθJN,βθJN)p(\theta_{\rm{JN}}|\alpha_{\theta_{\rm{JN}}},\beta_{\theta_{\rm{JN}}}), Fig. 2. Each dim red line is an individual draw from the posterior (to enhance visibility, we only show 1000 such draws), whereas the blue band shows the 90% credible interval. The blue thick curve represents the median, which shows remarkable agreement with the expected isotropic distribution (green dashed curve). The JS divergence between the inferred population median and a perfectly isotropic distribution is 1.4×1041.4\times 10^{-4} bits. We stress that this level of agreement is not built-in in our model, nor in the shape or width of the hyper parameters’ priors. This is shown explicitly in Fig. 3: first, we draw 10000 points from the priors of αθJN\alpha_{\theta_{\rm{JN}}} and βθJN\beta_{\theta_{\rm{JN}}} and plot the resulting 90% credible interval as a grey band, and the median as a dashed grey curve. The posterior 90% credible interval and median from Fig. 2 are also shown in blue. It is apparent that the median and the 90% credible interval are significantly different from the prior.

However, one might still wonder whether the main information extracted from the data is that the distributions should be symmetric, i.e. that αθJNβθJN\alpha_{\theta_{\rm{JN}}}\simeq\beta_{\theta_{\rm{JN}}}. To check whether this is the case, we draw 10000 samples from the prior of αθJN\alpha_{\theta_{\rm{JN}}} and then calculate p(θJN|αθJN,βθJN=αθJN)p(\theta_{\rm{JN}}|\alpha_{\theta_{\rm{JN}}},\beta_{\theta_{\rm{JN}}}=\alpha_{\theta_{\rm{JN}}}), i.e. we consider the effective prior one would obtain by only considering distributions symmetric around π/2\pi/2. The resulting 90% credible interval is shown as a yellow band in Fig. 3, and its median as a dashed yellow curve. Here too it is clear that the measurement is informative, and the data excludes large regions of parameter space. The ratio of the JS divergence between the median obtained with prior draws and a perfectly isotropic distribution and the JS divergence calculated with the population posterior median is 288. Using samples from a prior that has been forced to be symmetric, the ratio is 372.

Refer to caption
Figure 3: 90% credible intervals (colored band) and medians (thick dashed line) for the inclination distribution obtained by sampling the full prior (grey) and a prior that enforces symmetry around π/2\pi/2 (yellow). The blue band and dashed line show the 90% credible interval of the hyper posterior and its median from Fig. 2, for comparison.

Conclusion and outlook– The growing set of GW sources can be used to infer the intrinsic properties of the underlying astrophysical populations, but also to learn about cosmology, dark matter, and to perform tests of general relativity. In this Letter we have augmented the main population model used by the LVK in the analysis of GWTC-3 [6] to also infer the underlying distribution of the orbital inclination for the 69 BBHs included in GWTC-3, which is expected to be isotropic. However several factors can result in inferring an anisotropic distribution: some violations of general relativity; inaccurate evaluation of selection effects due to limitations in the algorithms that search for compact binaries; the inclusion of non-astrophysical triggers (glitches) in the BBH catalog. A measurement of the population distribution of orbital orientations can thus be used either as a test of general relativity or, within general relativity, as a null test that all of the analysis components behave as expected.

We have modeled the astrophysical distribution of orbital inclinations as a non-singular beta distribution, and found that the data prefers inclination distributions which are symmetric around π/2\pi/2: the skewness of the inclination distribution is reduced by a factor of 5 relative to what would be obtained using prior samples. Furthermore, the inferred distribution for the inclination angle is consistent with being isotropic: the JS divergence between the inferred population median and a perfectly isotropic distribution is 1.4×1041.4\times 10^{-4} bits.

We have only used BBH sources reported by the LVK collaboration because a key part of any hierarchical analysis is a self-consistent evaluation of the selection effects, which is hard to achieve for sources found by other groups. However, we do encourage groups that produce independent catalogs of GW sources to also make public their own sensitivity files, to make this type of analysis possible. As the size of the GW network increases in the next few years, we expect this analyses will yield even stronger constraints, since a larger network can better reveal the polarization content, and hence the inclination angle, of individual sources. The improved inclination measurement for each source, together with the higher detection rates, implies we could set limits on even smaller departures from orientational isotropy, and use more sophisticated models that explicitly allow for these anisotropies. This will be explored in a future paper.

Acknowledgments– S.V., S.B. and C.T.  acknowledge support of the National Science Foundation and the LIGO Laboratory. LIGO was constructed by the California Institute of Technology and Massachusetts Institute of Technology with funding from the National Science Foundation and operates under cooperative agreement PHY-0757058. S.V. is also supported by NSF PHY-2045740. S.B. is also supported by the NSF Graduate Research Fellowship under Grant No. DGE-1122374. C.T. is supported by the MIT Kavli Postdoctoral Fellowship. The authors would like to thank Hsin-Yu Chen, Thomas Dent, Reed Essick, Will Farr, Carl Haster, Charlie Hoy, Ken Ng and Geraint Pratten for insightful discussions and comments. The authors are grateful for computational resources provided by the LIGO Lab and supported by NSF Grants PHY-0757058 and PHY-0823459. This paper carries LIGO document number LIGO-P2200104.

References