Resolving Galactic binaries using a network of space-borne gravitational wave detectors
Abstract
Extracting gravitational wave (GW) signals from individual Galactic binaries (GBs) against their self-generated confusion noise is a key data analysis challenge for space-borne detectors operating in the mHz to mHz range. Given the likely prospect that there will be multiple such detectors, namely LISA, Taiji, and Tianqin, with overlapping operational periods in the next decade, it is important to examine the extent to which the joint analysis of their data can benefit GB resolution and parameter estimation. To investigate this, we use realistic simulated LISA and Taiji data containing the set of GBs used in the first LISA data challenge (Radler), and an iterative source extraction method called GBSIEVER introduced in an earlier work. We find that a coherent network analysis of LISA-Taiji data boosts the number of confirmed sources by over that from a single detector. The residual after subtracting out the reported sources from the data of any one of the detectors is much closer to the confusion noise expected from an ideal, but infeasible, multisource resolution method that perfectly removes all sources above a given signal-to-noise ratio threshold. While parameter estimation for sources common to both the single detector and network improves broadly in line with the enhanced signal to noise ratio of GW sources in the latter, deviation from the scaling of error variance predicted by Fisher information analysis is observed for a subset of the parameters.
I Introduction
Gravitational wave (GW) astronomy is now well-established in the frequency band Hz using ground-based interferometric detectors. Starting with the discovery in 2015 of GW150914 [1], a binary black hole (BBH) merger signal, by the twin LIGO [2] detectors, the number of confirmed GW signals has grown to over three observing runs [3, 4, 5] of the LIGO and Virgo [6] detectors. While the majority of signals are from BBH mergers, there is also a double-neutron star [7] and two neutron star-black hole signals [8] in the haul so far.
The success of GW astronomy with ground-based detectors has further spurred the drive to open up the gravitational wave window at lower frequencies. Pulsar Timing Arrays (PTAs) are already being used for the Hz and a tantalizing signal, not yet confirmed to be from GWs, has emerged [9, 10, 11] in the most recent datasets collected by the NANOGrav [12], PPTA [13] and EPTA [14] collaborations. The Hz band is the target for the space-borne LISA (Laser Interferometer Space Antenna) [15] detector, scheduled for launch in 2037. Targeting a similar frequency band, intensive work is in progress on the design of the Taiji [16] and Tianqin [17] missions, both of which have expected launch dates in the middle of the next decade.
While both LISA and Taiji will have heliocentric orbits, Tianqin is planned to be geocentric. In other respects, however, these detectors share the common feature of having three spacecrafts in a triangular formation that sense GW-induced distance distortion using inter-spacecraft laser links. The nominal distance between the spacecrafts will be km for LISA and km for Taiji, while Tianqin will have shorter arm lengths of . The normals to the planes of both the LISA and Taiji constellations will have an approximate tilt of from the normal to the ecliptic and rotate once around the latter in a year. For Tianqin, the normal to the spacecraft plane will point towards a fixed sky location, namely that of the double white dwarf RX J0806.3+1527, at all times.
All planned space-based GW detectors will employ the technique of time-delay interferometry (TDI) [18], in which the laser frequency Doppler readouts from different arms are combined linearly after introducing time delays, to substantially reduce laser frequency noise. In this paper we consider the so-called , , and TDI combinations that have mutually independent noise.
The operational frequency bands of LISA, Taiji, and Tianqin are expected to be rich in astrophysical sources [19, 20]. These include a large population of compact object Galactic binaries (GBs) [21], comprised mostly of white dwarfs and reaching in number, tens to hundreds [22] of Extreme Mass Ratio Inspirals (EMRIs) – a system containing a massive black hole in the to range orbited by a stellar-mass range compact object [23], and a handful of Massive black hole binaries (MBHB) containing comparable to components [24]. Due to the generically broad antenna patterns of interferometric detectors and longer lifetimes of GW sources at lower frequencies, the data from these detectors will be simultaneously occupied by signals from a large number and variety of sources. The multisource resolution problem of disentangling these signals from each other presents a major data analysis challenge that has motivated a number of different approaches and algorithms.
To provide a benchmark for the development and comparison of data analysis methods, the LISA community has organized several mock LISA data challenges (MLDCs) [25, 26, 27, 28]. The most recent in this series are the LISA data challenges (LDCs) [29]. The first LDC (LDC1 or Radler), which is the challenge considered in this paper, consists of several subchallenges among which subchallenge 4 (LDC1-4) focuses purely on GB multisource resolution. LDC1-4 provides single realizations of TDI combinations, each containing Gaussian stationary instrumental noise added to the GW signals from GBs. A catalog of the parameters of the GBs is also provided, allowing the performance of a multisource resolution method to be quantified rigorously.
Investigations carried out with the mock data challenges [30, 28] indicate that, over an observation period of yr with a single detector, it would be possible to confidently resolve individual GBs in the mHz band [31, 32, 33, 34]. The remaining GBs will blend together to form a stochastic signal – the GB background – that will likely dominate over instrumental noise below mHz. Since the GB background will be the main factor limiting the sensitivity of searches for all other GW sources in this band, it is important to develop methods that can extract the maximum number of resolvable GBs without grossly overfitting the data.
Given that, until recently, LISA was the only anticipated mission, much of the literature on data analysis for space-based detectors has focused on a single detector. Some notable exceptions are the works that have analyzed a mission concept called the Big Bang Observer (BBO) [35], proposed as a successor to LISA. BBO is envisioned to contain LISA-like detectors but with shorter arm lengths of km. Three of the detectors will be spaced apart in heliocentric orbits while the fourth will be coincident in location with one of them. In addition to multiple detectors, enhancements such as higher laser power are assumed in order to reach the overall sensitivity expected to reveal the inflationary GW stochastic background.
While BBO remains a conceptual mission, the scenario of multiple space-based detectors does not appear to be a pipe dream any longer given the advent of Taiji and Tianqin, which will likely overlap in operation with LISA. As such, the question of what new science capabilities will be unlocked by the existence of a network of space-based detectors has become a timely one. Several recent works have approached this question using the computationally inexpensive Fisher information formalism [36, 37, 38, 39, 40, 41, 42] in the context of single sources. It has been shown that considerable benefits can be derived by combining the data from multiple space-based detectors. However, the analysis of single sources using the Fisher information approach, which only provides an asymptotic lower bound on parameter estimation errors, does not provide a realistic estimate of performance for the full multisource resolution problem. For the latter, there is no substitute for the analysis of realistic mock data with an actual, not ideal, data analysis pipeline.
In this paper, we report an analysis of mock data from a network of two space-based detectors and provide realistic estimates of the detection and estimation performance on the GB multisource resolution problem. The mock data is generated using the same set of GBs as in LDC1-4 under the assumption, which is adequate for a first study and simplifies mock data generation, that Taiji is identical in design to LISA, and that one detector trails the Earth while the other leads it by . The mock data is analyzed using the latest version of the GB resolution pipeline, GBSIEVER (Galactic Binary Separation by Iterative Extraction and Validation using Extended Range), introduced in [33], that has been generalized to handle TDI data from multiple detectors.
As in the earlier paper [33], referred to as P1 from here on, it is convenient to define the following types of GB sources when discussing our results. (i) True: sources in the LDC1-4 catalog. (ii) Reported: the final list of estimated sources returned by GBSIEVER. (iii) Identified: The initial list of sources produced by the single source search step in GBSIEVER before various cuts are imposed to construct the set of reported sources. (iv) Confirmed: The reported sources that match true sources as determined by a prescribed metric for association. The fraction of confirmed sources in the set of reported ones is called the detection rate.
Our key result is that, at comparable detection rates, a LISA-Taiji network is capable of resolving more confirmed binaries than either of the detectors alone. In addition, we find that the residual left after subtracting the reported signals from the data tends to have fewer cases of missed strong sources compared to the single detector case. The power spectral density (PSD) of the residual reaches the theoretical lower bound on confusion noise, obtained under the assumption of a perfect but infeasible multisource resolution method for a single detector [43], without significant overfitting. Thus, not only can a detector network probe deeper into the GB population for resolvable sources but the lowered GB background will enhance the detectability of non-GB sources. While parameter estimation errors are reduced with the LISA-Taiji network for sources that are common with the LISA-only analysis, the scaling of error variances with signal strength that is expected from a Fisher information based analysis is not observed for all the parameters.
The rest of the paper is organized as follows. The data used in this paper are described in Sec. II. The baseline single-source detection and parameter estimation method for a detector network is described in Sec. III. A self-contained but brief overview of GBSIEVER is described in Sec. IV. This is followed by the results of our analysis in Sec. V. Our conclusions and prospects for future work are presented in Sec. VI.
II Data description
Space-based GW detectors will use TDI to strongly suppress laser frequency noise. TDI is implemented by linearly combining readouts of frequency shifts between the incoming and outgoing light along each arm after introducing known time delays [18]. There are a variety of combinations and levels of approximations, called TDI generations, that have been proposed. We use the so-called and combinations, which have mutually independent instrumental noise, corresponding to the first generation of TDIs used in LDC1 [44]. (The combination is dropped because the GW signal in it is highly attenuated.)
To obtain the TDI GW signals, we start with the two polarizations in the transverse traceless (TT) gauge of a plane GW incident at the Solar System barycenter (SSB) origin. For a GB, the polarizations are well-modeled in the SSB frame as linear chirps,
(1) | ||||
(2) | ||||
(3) |
where is the amplitude of the wave, is the inclination angle between the GB orbital angular momentum and the line of sight from the SSB origin to the GB, is the initial phase, is the frequency at the start of observations, and is the secular frequency drift. We use the formalism in [31] to generate the TDI GW signals for GB sources.
The TDI time series for combination and detector is given by
(4) |
where denotes a row vector, denotes a single GB signal corresponding to source parameters , is a realization of the instrumental noise, and is the number of GBs. For a GB, consists of , where , , , , and were defined following Eq. 1 to Eq. 3), is the polarization angle defining the orientation of the binary orbit projected on the sky, and the longitude and the latitude of the source in the SSB frame are denoted by and , respectively. In the rest of the paper, will serve as a stand in for a GB source itself where convenient. Following LDC1-4 we set the sampling frequency for the uniformly sampled time series , for all and , to be Hz, and the number of samples to corresponding to an observation period yr.
For the generation of mock TDI data, the LISA community has adopted a standard code called LISACode [45] for generating instrumental noise and a code called FastGB for GB signals. (The latter is based on the formalism in [46] and is included in the software distribution associated with LDC data [47].) The simulated LDC1-4 data contains the TDI signals from million GBs, with GW signal frequencies in mHz. In the astrophysically realistic model used for the GB population [48, 49], the density of GBs drops with increasing frequency with most of the sources above mHz being individually resolvable. The instrumental noise realization is drawn from a Gaussian, stationary, stochastic process. The PSD of the noise is denoted by at Fourier frequency . In LISACode, the PSD of the instrumental noise in the A and E combinations are identical and derived from the design sensitivity of LISA.
We generate simulated Taiji TDI data by shifting the centroid of the orbit ahead of LISA and using the same software suite and GB catalog as LDC1-4. Statistical independence of the instrumental noise in the two detectors is ensured by using different seeds for the corresponding pseudo-random sequences in LISACode. We keep the arm length of Taiji the same as that of LISA. Besides allowing LISACode to be used with minimal changes at the code level, this leads to a simpler and more direct comparison, as befits a first study, of the performance gained in switching from a single detector to a network. To acknowledge this simplification, the simulated Taiji data is called Taiji-mod in the remainder of the paper. Extending our analysis to a realistic Taiji would be quite straightforward when its own data simulation framework is made available in the public domain.
The GW signal in each TDI combination has amplitude and frequency modulations arising from the time-varying antenna response due to the rotation of the spacecraft constellation and the Doppler effect due to its orbital motion, respectively. Consequently, the spectrum of the TDI GB signal is broadened over a frequency range that is times larger than the separation of Fourier frequencies for a yr observation period. The broadening increases the overlap of spectra from multiple GBs, with the number of overlaps increasing at lower frequencies and signal strengths. This precludes the differentiation of the signals using a simple Fourier transform and makes multisource resolution an extremely challenging data analysis problem.
III Single source estimation
GBSIEVER implements an iterative scheme in which the parameters of only a single source are estimated at a time and the corresponding signal is subtracted from the data. The parameter estimation step uses maximum likelihood estimation (MLE) where the log-likelihood function is constructed under the assumption that the data contains a single source added to Gaussian, stationary noise. The maximization of the log-likelihood in the case of a GB can be carried out analytically over a subset of the parameters, leaving behind a function, commonly called the -statistic in the GW literature [50], that needs to be numerically maximized over the remaining ones. In GBSIEVER, the latter maximization is carried out using particle swarm optimization (PSO) [51].
The principal difference between P1 and the present paper is the shift in the mathematical formalism implemented in GBSIEVER, which is a straightforward generalization from a single detector to, as described in this section, a network of detectors.
III.1 Network -statistic
The GW signal from a single GB in combination of detector can be expressed, schematically, as
(5) |
where the extrinsic parameters are obtained by reparametrizing , , , and . is a -by- matrix of template waveforms that depend on the intrinsic parameter set . [In the GW literature, the terms intrinsic and extrinsic generally refer to sets of parameters over which the log-likelihood is maximized using numerical and analytical (or computationally efficient) methods, respectively.]
The estimator, , of the parameters of a single source in iteration is given by,
(6) |
where is the set of detectors, is the set of TDI combinations (with mutually independent noise) per detector, and as defined in Eq. 4. (We drop the iteration index and the source parameter, where convenient, in the expressions below for simplicity of notation.) Here, is the norm induced by the noise-weighted inner product,
(7) |
where is the sampling frequency, is the discrete Fourier transform (DFT) of [defined by ], ‘’ denotes element-wise division, and is the sequence of samples at the DFT frequencies. A note on implementation: It is computationally efficient to confine searches for single sources to narrow frequency bands (see Sec. IV.1) and analyze different bands in parallel on a multi-processor machine. The PSD in a sufficiently narrow search band is well approximated by a constant, turning the inner product in Eq. 7 to an ordinary Euclidean one between bandpassed time series (see Sec. IV.2).
Let denote row and the element in row and column of a matrix . Let denote the column vector with the th element
(8) |
and denote the matrix with
(9) |
Then, the minimization problem in Eq. 6 can be recast as a maximization,
(10) |
where and . For fixed , the maximization over is trivial,
(11) |
The estimator of is then given by
(12) |
where
(13) |
is widely known in the GW literature as the -statistic. The estimated extrinsic parameters are obtained by substituting in Eq. 11.
III.2 Signal-to-noise ratio and association metric
It is convenient to use the signal-to-noise ratio (SNR) defined below to represent the overall strength of a signal relative to instrumental noise.
(14) |
where the norm is defined under the constant PSD approximation described earlier. For the LDC1-4 and Taiji-mod and TDI combinations, is independent of both and , thereby appearing as a constant overall factor in Eq. 6 for a given search band. As such, the value of used for a search band affects the conversion of the overall amplitude to SNR but does not affect . (The error made in the estimation of , on the other hand, does depend on the true as it governs the level of noise in .)
The independence of from also implies that the SNR of a GB signal is amplified by a factor of in the two detector network relative to a single detector. For a multi-year long observation period, the signal from a source in both LISA and Taiji-mod undergoes the same degree of amplitude and frequency modulations, which makes the summands in Eq. 14 comparable. Hence, the increase in SNR is close to for all sources. While one expects both source identification and parameter estimation to improve for an individual source due to the increase in SNR, this is not sufficient in itself to explain the impact of a detector network on multisource resolution. The details of how the sources mutually interfere with each other also play an important role and need to be taken into account.
In analyzing the output of a multisource resolution method, a metric is required to quantify the degree of association between a given pair of sources. As in P1, and across the LISA mock data challenges in general, it is convenient for this purpose to use the correlation coefficient, , between the TDI signals corresponding to a given pair of sources, and , as follows.
(15) | ||||
(16) |
Since the correlation coefficient only measures similarity between the shapes of the signals, it needs to be supplemented by an SNR-based criterion to account for the closeness of sources in their overall amplitudes. For a pair of reported and true sources, denoted by and , respectively, GBSIEVER uses the scheme laid out in MLDC-3 [28] wherein such a pair is eligible for association only when has (a) an , (b) a frequency within DFT frequencies of , and (c) the lowest distance, defined as , from . Given an eligible association, the status of a reported source is elevated to confirmed only if .
The test outlined above does not prevent multiple reported sources from being associated with the same true source. We follow [31] in handling such an ambiguous case by retaining only the reported source with the highest value provided . This approach is a conservative one since it lowers the detection rate by reducing the count of confirmed sources but not that of reported ones. We actually did not find any such case in the analysis of the network data while, for single detector search, there is only one instance. Hence, the effect on the detection rate is negligible in practice for the data considered in this paper.
IV Overview of GBSIEVER and its settings
The principal algorithmic components of GBSIEVER and their corresponding settings are reviewed in this section. The description is self-contained but brief since further details are available in P1. We do not provide an extensive review of PSO here, which is used for the global optimization task in Eq. 12, since it is a widely known algorithm. A pedagogical introduction to PSO and its application to MLE can be found in [52]. The specific variant of PSO used in GBSIEVER is discussed in P1 and its parameter settings are described in detail in Sec. III of [53].
IV.1 Narrow frequency band search and edge effects
As mentioned earlier, single source searches in GBSIEVER are performed in parallel over narrow frequency bands. The width of each search band is set at mHz but only the sources found in the central mHz, called the acceptance zone, are admitted into the set of identified sources. This restriction is imposed to counter the excessive occurrence of spurious identified sources near the band edges due to the spread of signal power from strong sources across the search bands. The frequency band limits are imposed by applying a Tukey window to the DFT of a TDI time series. The central flat part of the window is kept slightly wider, at mHz, than the acceptance zone. The sources discarded in a given search band are not lost but recovered in adjacent ones because they are overlapped to make all the acceptance zones contiguous.
IV.2 Undersampling
After applying the Tukey window for a given search band, an inverse DFT brings the bandpassed TDI data back into the time domain. Here, a key step that follows is the undersampling [54] of the time series, which drastically reduces the number of samples without any loss of information. Undersampling is a clever technique that exploits the aliasing error caused by sampling below the Nyquist rate to move the information content in bandpassed data to low frequencies. Recalling that the -statistic in a given search band is evaluated under the white noise approximation, allowing the inner product in Eq. 7 to be evaluated in the time domain, the reduction in the number of samples by undersampling leads to a much faster computation of the -statistic.
IV.3 Termination rule
The single-source estimation and subtraction iterations in a search band are terminated when (i) sources have been identified, or (ii) the estimated source SNRs in consecutive iterations fall below . We note that in P1, the maximum number of iterations was set at but this was never reached for any of the search bands as criterion (ii) was satisfied first. Hence, the increase in the maximum number of iterations here does not affect the single detector results. However, the larger number of iterations is required in the case of the detector network because, for the same SNR threshold, sources have a lower amplitude, hence are more numerous, relative to a single detector.
IV.4 Cross-validation
Besides PSO, a key feature of GBSIEVER that differentiates it from other multisource resolution methods that have been applied to mock LISA data is the step called cross-validation. In this step, the entire single source search is rerun on the data with all settings held the same except for the search range used for the secular frequency drift . Thus, we obtain two sets of identified sources from the two runs. In one of the runs, called the primary, the range is governed by the expected astrophysical one, while in the other (called secondary) it is set much wider. Finally, for each identified source in the primary run, we compute
(17) |
with belonging to the set of identified sources from the secondary run. The identified source is elevated to the status of a reported source only if crosses a preset threshold. As demonstrated in P1, cross-validation is highly effective in eliminating spurious identified sources. This is because they are much less likely to recur in the two runs unlike the identified sources that are associated with true sources.
We impose different thresholds in different blocks that tile the SNR and frequency plane contiguously. This addresses the expectation that the incidence of spurious sources is not constant across this plane: In a given frequency interval, the fraction of spurious sources increases at lower SNRs, requiring a higher cutoff to weed them out, while the fall-off in the density of GBs with increasing frequency leads to less confusion and fewer spurious sources, requiring a less stringent cutoff.
The combination of blocks in the SNR-frequency plane and the associated thresholds are part of the user-defined settings in GBSIEVER. In P1, we carried out a comparative analysis of different combinations. Here, we simply run GBSIEVER for the particular combination below that was used in P1 to obtain the principal results.
(20) | ||||
(23) |
Cross-validation is not required for mHz because sources can be well resolved without generating a significant number of spurious ones.
V Results
The presentation of results in this section is organized as follows. Sec. V.1 contains results on source resolution. An investigation of the residual is presented in Sec. V.2. Sec. V.3 contains results on GB parameter estimation.
V.1 Source resolution performance
To quantify the performance of GBSIEVER in GB resolution, we use the detection rate, defined as the fraction of reported sources that are elevated to the status of confirmed sources using the test of association described in Sec. III.2.
Table 1 provides a summary of the output from GBSIEVER for the LISA and Taiji network using the LDC1-4 and Taiji-mod data. For comparison, the results from the analysis of LDC1-4 data alone are reported in Table 2. The primary observation is that the number of confirmed GBs is boosted from for LISA to for the LISA-Taiji network, a remarkable increase of . Similarly the number of confirmed sources in corresponding frequency ranges are also significantly higher for the LISA-Taiji network. For example, an increase of in the frequency range, mHz, was achieved.
mHz | SNR | mHz | SNR | mHz | SNR | mHz | SNR | mHz | SNR | |
---|---|---|---|---|---|---|---|---|---|---|
Identified | ||||||||||
Reported | ||||||||||
Detection rate | ||||||||||
Lowest SNR (confirmed) | ||||||||||
Total reported | ||||||||||
Total confirmed | ||||||||||
Detection rate |
mHz | SNR | mHz | SNR | mHz | SNR | mHz | SNR | mHz | SNR | |
---|---|---|---|---|---|---|---|---|---|---|
Identified | ||||||||||
Reported | ||||||||||
Detection rate | ||||||||||
Lowest SNR (confirmed) | ||||||||||
Total reported | ||||||||||
Total confirmed | ||||||||||
Detection rate |
The overall detection rate of the LISA-Taiji network is , which is slightly smaller than the for LISA. However, if the first frequency range ( mHz), which has the lowest detection rate is ignored, the detection rate for the LISA-Taiji network becomes while that for LISA becomes smaller at . Thus, while the overall detection rate does not show a consistent preference for a single detector or a network, the latter finds far more reported sources overall, causing a corresponding jump in the number of confirmed sources.
An auxiliary metric that is useful for gauging the performance of a multisource resolution method is the lowest SNR among confirmed sources. This tells us how deep into the source population can we dive to extract resolvable sources. While it would appear from Tables 1 and 2 that there is no significant change in this quantity, it is important to note that the SNRs are computed for different numbers of detectors. In terms of the signal amplitude, , which is the invariant quantity here, the SNR for a two detector network would be times higher than that for a single detector. Conversely, the same SNR implies that the signal in the network has that is a factor of lower. Hence, the LISA-Taiji network is able to increase the search depth quite substantially.
V.2 Residuals
Fig. 1 compares the spectral properties of the TDI data and the residual obtained by subtracting out all the reported sources from it. There is a significant reduction of spectral power in the LISA-Taiji residual relative to the case of a single detector in the mHz band. The residuals begin to converge to the instrumental noise in the mHz band. The Pearson correlation coefficient between the residual and instrumental noise time series in the combination bandpassed to mHz improves from for the single detector case to for the LISA-Taiji network. If one could remove all true sources perfectly from the data, the residual would have a correlation of unity with the instrumental noise. Thus, the significantly improved correlation above shows that the set of reported sources found in this band by the network is more reliable and complete than the one from a single detector.

The LISA-Taiji residual is also less spiky between to mHz because a cluster of loud sources that was missed in the single detector analysis was identified and extracted out. In general, defining loud sources as those with a single-detector true , the LISA-Taiji network finds additional loud sources across the entire frequency range, and the fraction of true loud sources that were not detected falls from to .
The subtraction of reported GB sources from the data is a prerequisite for mounting a search for any other type of GW source. Since the signals from the GBs in the data will superpose to form the GB background noise that will limit the sensitivity of these searches, it is important to understand how close the actual GB background comes to the expected one after the subtraction of reported sources. However, what constitutes the irreducible expected GB background is not a straightforward question. One approach, reviewed in brief below, is the scheme proposed in [55] and developed further in [43].
It is assumed that one has an ideal method that is capable of estimating the parameters of a source perfectly as long as it has a minimum SNR. The SNR here is defined relative to the floor of the PSD of the data, estimated using a smoothing method such as running mean or median [56], not the instrumental noise PSD. For clarity, let us denote it as and the minimum value as . The subtraction of sources and the estimation of the PSD floor follow each other iteratively until no sources with are left in the data. The remaining data is then deemed to be devoid of resolvable sources, hence an estimate of the GB background. There are some important caveats to this approach, one being that it is only applicable to data from a simulated population of sources with known parameters, and the other being the unrealizable assumption of an ideal method. The choice of , which is ad hoc in itself, is also an important factor that governs the final GB background. Nonetheless, it is reasonable to accept the ideal GB background obtained from this approach to be a lower bound on the actual one that can be achieved by a practical multisource resolution method.
Fig. 2 compares the LISA-Taiji residual (same as in Fig. 1) with the ideal GB background obtained, as outlined above, for LISA alone using . We see that the LISA-Taiji residual lies well below the single-detector ideal GB background for and nearly coincides with that for at frequencies below mHz. The latter is an unrealistically optimistic lower bound on the background since one does not expect any algorithm to confidently resolve sources perfectly at this SNR. However, using the LISA-Taiji network will still allow this level of the background to be reached.

While the LISA-Taiji residual actually dips a little under the ideal background below mHz, this is the effect of overfitting the data caused by the presence of spurious sources at low SNR. Overfitting is an important effect to consider for any multisource resolution method, not just GBSIEVER, since spurious sources are unavoidable. The overfitting of data can result in a loss in SNR for a non-GB source if its spectral power is close in level to the ideal background. As seen in Fig. 2, the observed overfitting is quite mild for GBSIEVER running on network data and we do not expect it to be a major issue. However, this discussion also points to the necessity of subjecting all multisource methods to a comparison with the ideal background in order to gauge their tendency to overfit the data.
V.3 Parameter estimation performance
For assessing the parameter estimation performance of GBSIEVER, we consider the difference between the values of a parameter for a confirmed source and its associated true source. Fig. 3 shows the estimated probability density function (PDF) of this difference for each GB signal parameter. Comparing them with the corresponding ones in P1 for the single-detector case, we find no significant differences. [In terms of the actual counts per bin, however, there would be a substantial difference due to the larger number of confirmed sources (c.f., Tables 1 and 2) for the network.]

The fact that there is no significant change in the PDFs in going from LISA to LISA-Taiji may appear surprising at first since one expects parameter estimation accuracy to improve due to the higher SNR of a source in the latter. However, one must also account for the fact that the network finds a larger number of weaker sources that, in general, have worse errors in the estimated parameters. This is supported by Fig. 4, which shows the scatterplot of the differences in the parameter values of confirmed and associated true sources as a function of the strain amplitude of the latter. As can be seen clearly, the scatterplot for network sources extends to lower amplitudes where the error in the parameters tends to be higher.
For a fair comparison between the single-detector and network performance, we must look at the distribution of parameter differences for only the confirmed sources that they have in common. Fig. 5 compares the PDFs of the parameter differences for only this restricted set of sources and, indeed, it is seen clearly that they are all more concentrated towards zero difference in the case of the network. We have also listed in the caption of Fig. 5, the value of for each parameter where and are the standard deviations of the PDFs for the network and single-detector, respectively. Based on Fisher information analysis of parameter estimation errors for a single source, in which the standard deviation of estimation error is inversely proportional to the SNR [57, 58] and the factor of accounts for the naive enhancement in SNR for a network of two detectors relative to a single one (all of them being identical with mutually independent noise), one expects this quantity to be close to unity. Indeed, we see that this is so for the subset of intrinsic parameters , , , and . However, it departs quite a bit from unity for all the extrinsic parameters. This result suggests that a Fisher information analysis confined to a single sources is not adequate for quantifying parameter estimation errors in the GB resolution problem.


VI Conclusions
We investigated the GB multisource resolution problem in the context of a network of comparable space-based GW detectors, namely, LISA and Taiji, using a previously developed iterative source subtraction pipeline, GBSIEVER, that was extended to analyze TDI combinations from multiple detectors. Our results provide a realistic assessment of the benefits expected from a network of space-based detectors that goes beyond Fisher information studies of parameter estimation errors for single GBs.
In the present work, we looked at detection and estimation performance of the LISA-Taiji network relative to LISA alone. There is a significant increase, by , in the number of confirmed sources that can be extracted while maintaining approximately the same detection rate. This increase was obtained without changing any of the cuts (SNR and ) imposed on the identified sources or the cross-validation setup for primary and secondary runs.
One of the key results we have reported is the comparison of the residual after subtraction of reported sources found in the network analysis with the ideal GB background of a single detector. We find that a network of space-based detectors will allow the GB background to be reduced substantially below the lowest level possible with a single detector. The PSD of the residual lies entirely below the ideal GB background obtained with a detection threshold of . It coincides with the background over much of the frequency range in which the latter dominates over instrumental noise. Since the residual from subtracting GBs forms the noise background for non-GB searches, one expects that their performance will be significantly improved in the mHz band. Further studies need to be carried out to quantify this expectation.
Our study of parameter estimation errors brings forth two salient points that highlight the difference between a single-source and multisource problem. (i) There is no significant change in the distribution of the differences in parameter values of confirmed and true sources between a single-detector and a network of comparable ones. (ii) Fisher information based error analysis of single sources may not be adequate in the multi-source resolution problem since the expected scaling of errors with SNR is observed only for the subset of intrinsic parameters but not the extrinsic ones. While there is a clear explanation for (i), we do not understand the origin of (ii) yet.
We have assumed simplified models for both the LISA and Taiji detectors, treating them as identical in design except for an angular separation in their heliocentric orbit. In future work, we will use more realistic models (the orbits and individual TDI combinations) for LISA and Taiji data to investigate the performance improvements for LISA-Taiji network. We will also take into account the uncertainty in prior knowledge of the PSDs of instrumental or GB background noise. This can be achieved, in principle, by a modest expansion of the search space for PSO by including the ratios of the noise PSDs in the same frequency band as free parameters to be optimized along with those of a single GB. We have further assumed that the LISA and Taiji data are collected over the same observation period. Since the GBs are persistent sources, and since the -statistic treats data from different detectors and TDI combinations additively, much of our analysis would go through, barring code modifications, even if the observation periods were non-overlapping. The inclusion of strongly evolving GBs could be an interesting exception here, if the observation periods are far apart, that will require a more careful approach.
Acknowledgements
Zhang, Zhao and Liu are supported by the National Key Research and Development Program of China grants No. 2021YFC2203000 and No. 2020YFC2201400, the National Natural Science Foundation of China through Grant No. 12047501, the 111 Project under grant No. B20063, and Lanzhou City’s scientific research funding subsidy to Lanzhou University. Group activities are financially supported by the Morningside Center of Mathematics and a part of the MPG-CAS collaboration in low frequency gravitational wave physics. Partial support from the Xiandao B project in gravitational wave detection is also acknowledged. We are grateful to Prof. Shing-Tung Yau for his long term and unconditional support and Prof. Yun-Kau Lau for assembling our research group. We gratefully acknowledge the use of high performance computers at the Supercomputing Center of Lanzhou University and State Key Laboratory of Scientific and Engineering Computing, CAS.
References
- [1] B. P. Abbott, R. Abbott, T. D. Abbott, M. R. Abernathy, F. Acernese, et al., “Observation of Gravitational Waves from a Binary Black Hole Merger,” Phys. Rev. Lett., vol. 116, p. 061102, Feb 2016.
- [2] J. Aasi, B. Abbott, R. Abbott, T. Abbott, M. Abernathy, K. Ackley, C. Adams, T. Adams, P. Addesso, R. Adhikari, et al., “Advanced LIGO,” Classical and quantum gravity, vol. 32, no. 7, p. 074001, 2015.
- [3] B. P. Abbott, R. Abbott, T. D. Abbott, S. Abraham, F. Acernese, et al., “GWTC-1: A Gravitational-Wave Transient Catalog of Compact Binary Mergers Observed by LIGO and Virgo during the First and Second Observing Runs,” Phys. Rev. X, vol. 9, p. 031040, Sep 2019.
- [4] R. Abbott, T. D. Abbott, S. Abraham, F. Acernese, et al., “GWTC-2: Compact Binary Coalescences Observed by LIGO and Virgo during the First Half of the Third Observing Run,” Phys. Rev. X, vol. 11, p. 021053, Jun 2021.
- [5] R. Abbott, T. Abbott, F. Acernese, K. Ackley, C. Adams, N. Adhikari, R. Adhikari, V. Adya, C. Affeldt, D. Agarwal, et al., “GWTC-3: Compact binary coalescences observed by LIGO and Virgo during the second part of the third observing run,” arXiv preprint arXiv:2111.03606, 2021.
- [6] F. Acernese, M. Agathos, K. Agatsuma, D. Aisa, N. Allemandou, et al., “Advanced Virgo: a second-generation interferometric gravitational wave detector,” Classical and Quantum Gravity, vol. 32, no. 2, p. 024001, 2014.
- [7] B. P. Abbott, R. Abbott, T. D. Abbott, F. Acernese, et al., “GW170817: Observation of Gravitational Waves from a Binary Neutron Star Inspiral,” Phys. Rev. Lett., vol. 119, p. 161101, Oct 2017.
- [8] R. Abbott, T. D. Abbott, S. Abraham, F. Acernese, et al., “Observation of Gravitational Waves from Two Neutron Star–Black Hole Coalescences,” The Astrophysical Journal Letters, vol. 915, p. L5, jun 2021.
- [9] Z. Arzoumanian, P. T. Baker, H. Blumer, B. Bécsy, et al., “The NANOGrav 12.5 yr Data Set: Search for an Isotropic Stochastic Gravitational-wave Background,” Astrophys. J. Lett., vol. 905, no. 2, p. L34, 2020.
- [10] B. Goncharov, R. Shannon, D. Reardon, G. Hobbs, A. Zic, M. Bailes, M. Curyło, S. Dai, M. Kerr, M. Lower, et al., “On the evidence for a common-spectrum process in the search for the nanohertz gravitational-wave background with the Parkes Pulsar Timing Array,” The Astrophysical Journal Letters, vol. 917, no. 2, p. L19, 2021.
- [11] S. Chen, R. Caballero, Y. Guo, A. Chalumeau, K. Liu, G. Shaifullah, K. Lee, S. Babak, G. Desvignes, A. Parthasarathy, et al., “Common-red-signal analysis with 24-yr high-precision timing of the European Pulsar Timing Array: Inferences in the stochastic gravitational-wave background search,” Monthly Notices of the Royal Astronomical Society, vol. 508, no. 4, pp. 4970–4993, 2021.
- [12] M. A. McLaughlin, “The North American nanohertz observatory for gravitational waves,” Classical and Quantum Gravity, vol. 30, no. 22, p. 224008, 2013.
- [13] R. N. Manchester, G. Hobbs, M. Bailes, W. Coles, W. Van Straten, M. Keith, R. Shannon, et al., “The parkes pulsar timing array project,” Publications of the Astronomical Society of Australia, vol. 30, 2013.
- [14] G. Desvignes, R. Caballero, L. Lentati, J. Verbiest, D. Champion, B. Stappers, G. Janssen, P. Lazarus, S. Osłowski, S. Babak, et al., “High-precision timing of 42 millisecond pulsars with the European Pulsar Timing Array,” Monthly Notices of the Royal Astronomical Society, vol. 458, no. 3, pp. 3341–3380, 2016.
- [15] P. Amaro-Seoane, H. Audley, S. Babak, J. Baker, E. Barausse, P. Bender, E. Berti, P. Binetruy, M. Born, D. Bortoluzzi, et al., “Laser Interferometer Space Antenna,” arXiv preprint arXiv:1702.00786, 2017.
- [16] W.-H. Ruan, Z.-K. Guo, R.-G. Cai, and Y.-Z. Zhang, “Taiji program: Gravitational-wave sources,” International Journal of Modern Physics A, vol. 35, no. 17, p. 2050075, 2020.
- [17] J. Luo, L.-S. Chen, H.-Z. Duan, Y.-G. Gong, S. Hu, J. Ji, Q. Liu, J. Mei, V. Milyukov, M. Sazhin, et al., “TianQin: a space-borne gravitational wave detector,” Classical and Quantum Gravity, vol. 33, no. 3, p. 035010, 2016.
- [18] M. Tinto and S. V. Dhurandhar, “Time-delay interferometry,” Living Reviews in Relativity, vol. 24, no. 1, pp. 1–73, 2021.
- [19] W.-T. Ni, “Gravitational wave detection in space,” Int. J. Mod. Phys. D, vol. 25, no. 14, p. 1630001, 2016.
- [20] E. Barausse, E. Berti, T. Hertog, S. A. Hughes, et al., “Prospects for fundamental physics with LISA,” General Relativity and Gravitation, vol. 52, no. 8, pp. 1–33, 2020.
- [21] S. Nissanke, M. Vallisneri, G. Nelemans, and T. A. Prince, “Gravitational-Wave emission from compact Galactic binaries,” Astrophys. J., vol. 758, p. 131, 2012.
- [22] J. Baker, J. Bellovary, P. L. Bender, E. Berti, R. Caldwell, et al., “The Laser Interferometer Space Antenna: unveiling the millihertz gravitational wave sky,” arXiv preprint arXiv:1907.06482, 2019.
- [23] S. Babak, J. Gair, A. Sesana, E. Barausse, C. F. Sopuerta, C. P. L. Berry, E. Berti, P. Amaro-Seoane, A. Petiteau, and A. Klein, “Science with the space-based interferometer LISA. V. Extreme mass-ratio inspirals,” Phys. Rev. D, vol. 95, p. 103012, May 2017.
- [24] M. L. Katz, L. Z. Kelley, F. Dosopoulou, S. Berry, L. Blecha, and S. L. Larson, “Probing massive black hole binary populations with LISA,” Monthly Notices of the Royal Astronomical Society, vol. 491, pp. 2301–2317, 11 2019.
- [25] K. A. Arnaud, S. Babak, J. Baker, M. Benacquista, N. Cornish, C. Cutler, et al., “The Mock LISA Data Challenges: An overview,” AIP Conf. Proc., vol. 873, no. 1, pp. 619–624, 2006.
- [26] K. A. Arnaud, S. Babak, J. G. Baker, M. J. Benacquista, N. J. Cornish, C. Cutler, L. Finn, S. Larson, T. Littenberg, E. Porter, et al., “An overview of the second round of the Mock LISA Data Challenges,” Classical and Quantum Gravity, vol. 24, no. 19, p. S551, 2007.
- [27] S. Babak, J. G. Baker, M. J. Benacquista, N. J. Cornish, J. Crowder, S. L. Larson, E. Plagnol, E. K. Porter, M. Vallisneri, A. Vecchio, et al., “The Mock LISA Data Challenges: from Challenge 1B to Challenge 3,” Classical and Quantum Gravity, vol. 25, no. 18, p. 184026, 2008.
- [28] S. Babak, J. G. Baker, M. J. Benacquista, N. J. Cornish, et al., “The Mock LISA Data Challenges: From Challenge 3 to Challenge 4,” Class. Quant. Grav., vol. 27, p. 084009, 2010.
- [29] Q. Baghi, “The LISA Data Challenges,” arXiv preprint arXiv:2204.12142, 2022.
- [30] S. Babak, J. G. Baker, M. J. Benacquista, N. J. Cornish, J. Crowder, C. Cutler, et al., “Report on the second Mock LISA Data Challenge,” Class. Quant. Grav., vol. 25, p. 114037, 2008.
- [31] A. Błaut, S. Babak, and A. Królak, “Mock LISA data challenge for the Galactic white dwarf binaries,” Phys. Rev. D, vol. 81, p. 063008, Mar 2010.
- [32] T. B. Littenberg, “Detection pipeline for Galactic binaries in LISA data,” Phys. Rev. D, vol. 84, p. 063009, Sep 2011.
- [33] X.-H. Zhang, S. D. Mohanty, X.-B. Zou, and Y.-X. Liu, “Resolving Galactic binaries in LISA data using particle swarm optimization and cross-validation,” Phys. Rev. D, vol. 104, p. 024023, 2021.
- [34] Y. Lu, E.-K. Li, Y.-M. Hu, J.-d. Zhang, and J. Mei, “An implementation of Galactic white dwarf binary data analysis for MLDC-3.1,” arXiv preprint arXiv:2205.02384, 2022.
- [35] S. Phinney, P. Bender, R. Buchman, R. Byer, N. Cornish, P. Fritschel, and S. Vitale, “The Big Bang Observer: Direct detection of gravitational waves from the birth of the Universe to the Present,” NASA mission concept study, 2004.
- [36] G. Wang, W.-T. Ni, W.-B. Han, S.-C. Yang, and X.-Y. Zhong, “Numerical simulation of sky localization for LISA-TAIJI joint observation,” Phys. Rev. D, vol. 102, p. 024089, Jul 2020.
- [37] S.-J. Huang, Y.-M. Hu, V. Korol, P.-C. Li, Z.-C. Liang, Y. Lu, H.-T. Wang, S. Yu, and J. Mei, “Science with the TianQin Observatory: Preliminary results on Galactic double white dwarf binaries,” Phys. Rev. D, vol. 102, p. 063021, Sep 2020.
- [38] C. Zhang, Y. Gong, H. Liu, B. Wang, and C. Zhang, “Sky localization of space-based gravitational wave detectors,” Phys. Rev. D, vol. 103, p. 103013, May 2021.
- [39] G. Wang, W.-T. Ni, W.-B. Han, P. Xu, and Z. Luo, “Alternative LISA-TAIJI networks,” Phys. Rev. D, vol. 104, no. 2, p. 024012, 2021.
- [40] C.-Y. Zhang, Y.-G. Gong, and C. Zhang, “Parameter estimation for space-based gravitational wave detectors with ringdown signals,” Phys. Rev. D, vol. 104, no. 8, p. 083038, 2021.
- [41] W.-H. Ruan, C. Liu, Z.-K. Guo, Y.-L. Wu, and R.-G. Cai, “The LISA-Taiji Network: Precision Localization of Coalescing Massive Black Hole Binaries,” Research (Washington, DC), vol. 2021, p. 6014164, 2021.
- [42] K. J. Shuman and N. J. Cornish, “Massive black hole binaries and where to find them with dual detector networks,” Phys. Rev. D, vol. 105, p. 064055, Mar 2022.
- [43] N. Karnesis, S. Babak, M. Pieroni, N. Cornish, and T. Littenberg, “Characterization of the stochastic signal originating from compact binary populations as measured by LISA,” Phys. Rev. D, vol. 104, no. 4, p. 043019, 2021.
- [44] S. Babak and A. Petiteau, “LISA Data Challenge Manual,” Tech. Rep. LISA-LCST-SGS-MAN-002, APC, Paris, 2020.
- [45] A. Petiteau, G. Auger, H. Halloin, O. Jeannin, E. Plagnol, S. Pireaux, T. Regimbau, and J.-Y. Vinet, “LISACode: A Scientific simulator of LISA,” Phys. Rev. D, vol. 77, p. 023002, 2008.
- [46] N. J. Cornish and T. B. Littenberg, “Tests of bayesian model selection techniques for gravitational wave astronomy,” Phys. Rev. D, vol. 76, p. 083006, Oct 2007.
- [47] “LISA Data Challenge 1: Radler.” https://lisa-ldc.lal.in2p3.fr/challenge1. Accessed: 2022-06-22.
- [48] G. Nelemans, “Galactic binaries with eLISA,” arXiv preprint arXiv:1302.0138, 2013.
- [49] V. Korol, S. Toonen, A. Klein, V. Belokurov, F. Vincenzo, R. Buscicchio, D. Gerosa, C. Moore, E. Roebber, E. Rossi, et al., “Populations of double white dwarfs in Milky Way satellites and their detectability with LISA,” Astronomy & Astrophysics, vol. 638, p. A153, 2020.
- [50] P. Jaranowski, A. Krolak, and B. F. Schutz, “Data analysis of gravitational - wave signals from spinning neutron stars. 1. The Signal and its detection,” Phys. Rev. D, vol. 58, p. 063001, 1998.
- [51] R. Eberhart and J. Kennedy, “Particle swarm optimization,” in Proceedings of the IEEE international conference on neural networks, vol. 4, pp. 1942–1948, Citeseer, 1995.
- [52] S. D. Mohanty, Swarm Intelligence Methods for Statistical Regression. Chapman and Hall/CRC, 2018.
- [53] M. E. Normandin, S. D. Mohanty, and T. S. Weerathunga, “Particle swarm optimization based search for gravitational waves from compact binary coalescences: Performance improvements,” Physical Review D, vol. 98, no. 4, p. 044029, 2018.
- [54] D. L. Donoho and J. Tanner, “Precise undersampling theorems,” Proceedings of the IEEE, vol. 98, no. 6, pp. 913–924, 2010.
- [55] S. E. Timpano, L. J. Rubbo, and N. J. Cornish, “Characterizing the galactic gravitational wave background with LISA,” Phys. Rev. D, vol. 73, p. 122001, Jun 2006.
- [56] S. D. Mohanty, “Efficient algorithm for computing a running median,” Max Planck Institut Gravitationsphysik, Golm, Germany, Tech. Rep. LIGO, 2003.
- [57] C. Cutler and E. E. Flanagan, “Gravitational waves from merging compact binaries: How accurately can one extract the binary’s parameters from the inspiral wave form?,” Phys. Rev. D, vol. 49, pp. 2658–2697, 1994.
- [58] C. Zhang, Y.-G. Gong, B. Wang, and C.-Y. Zhang, “Accuracy of parameter estimations with a spaceborne gravitational wave observatory,” Phys. Rev. D, vol. 103, no. 10, p. 104066, 2021.