Toward the unambiguous identification of supermassive binary black holes through Bayesian inference
Abstract
Supermassive binary black holes at sub-parsec orbital separations have yet to be discovered, with the possible exception of blazar OJ 287. In parallel to the global hunt for nanohertz gravitational waves from supermassive binaries using pulsar timing arrays, there has been a growing sample of candidates reported from electromagnetic surveys, particularly searches for periodic variations in optical light curves of quasars. However, the periodicity search is prone to false positives from quasar red noise and quasi-periodic oscillations from the accretion disc of a single supermassive black hole—especially when the data span fewer than a few signal cycles. We present a Bayesian method for the detection of quasar (quasi-)periodicity in the presence of red noise. We apply this method to the binary candidate PG1302102, and show that a) there is very strong support (Bayes factor ) for quasi-periodicity, and b) the data slightly favour a quasi-periodic oscillation over a sinusoidal signal, which we interpret as modest evidence against the binary black hole hypothesis. We also find that the prevalent damped random walk red-noise model is disfavored with more than 99.9% credibility. Finally, we outline future work that may enable the unambiguous identification of supermassive binary black holes.
1 Introduction
Supermassive binary black holes are thought to be common in the Universe as natural products of galaxy mergers (Begelman et al., 1980). They are likely to be the primary sources of nanohertz gravitational waves, which have been actively searched for using pulsar timing arrays in the past decade (see, e.g., Hobbs et al., 2010; Verbiest et al., 2016; Perera et al., 2019, and references therein). Current pulsar timing arrays are sensitive to individual binaries, with component black hole masses and binary orbital periods years (or orbital separations pc), up to distances of Mpc (Zhu et al., 2014; Babak et al., 2016; Aggarwal et al., 2019), though see Rosado et al. (2016) for the possibility of detecting high-redshift binaries. No detection has been made so far, although it has been suggested that the detection of a stochastic gravitational-wave background formed by the combined emission from binaries across the Universe is likely in a few years (e.g., Taylor et al., 2016; Cordes & McLaughlin, 2019).
The electromagnetic identification of sub-parsec supermassive binary black holes has proven to be challenging. Direct imaging of such close binaries is only possible for sources within Mpc via very-long-baseline radio interferometry observations. The closest binary with confident direct images, found in the radio galaxy 0402+379, has a separation of 7.3 pc (Rodriguez et al., 2006), corresponding to an orbital period of years (Bansal et al., 2017); see also Kharb et al. (2017) for a binary candidate at in the Seyfert galaxy NGC 7674. How binaries like these can reach milli-parsec orbital separations, where the emission of gravitational waves can efficiently drive the binary to merge, is referred to as the “final-parsec problem”, which remains an active area of theoretical investigations (e.g., Milosavljević & Merritt, 2001; Yu, 2002; Colpi, 2014; Khan et al., 2016; Ryu et al., 2018; Goicovic et al., 2018; Muñoz et al., 2020; Taylor et al., 2017; Chen et al., 2019). Finding a sub-parsec supermassive binary black hole would not only provide insights into the final-parsec problem, but also shed light on the expected stochastic gravitational-wave signal strength for pulsar timing arrays (Zhu et al., 2019; Goulding et al., 2019).
Periodically variable active galactic nuclei provide an interesting class of candidates for sub-parsec supermassive binary black holes. The most prominent object is OJ 287, a blazar which exhibits quasi-periodic outbursts in optical light curves that date back to the 19th century (Sillanpaa et al., 1988; Valtonen et al., 2008). This periodicity has been interpreted as the secondary black hole crossing the accretion disc of the primary in an eccentric orbit (Valtonen et al., 2016; Dey et al., 2019).111After this paper was submitted, Laine et al. (2020) reported Spitzer observations of the predicted Eddington flare from OJ 287, adding significant weight to the theory that it is, in fact, a supermassive binary black hole. In the past five years, with the growth of time-domain astronomy, there have been a large number of sub-parsec supermassive binary black hole candidates claimed by various groups.
Based on the Catalina Real-time Transient Survey (CRTS), Graham et al. (2015a) put forward 111 binary black hole candidates from an optical variability analysis of 243,500 quasars. Among them, PG1302102 was the most significant candidate with a measured period of (Graham et al., 2015b). This periodicity has been attributed to the relativistic Doppler boosting of emission from a mini-disc around the secondary black hole (D’Orazio et al., 2015). Charisi et al. (2016) reported 33 binary candidates from a periodicity search in a sample of 35,383 quasars in the photometric database of the Palomar Transient Factory. Liu et al. (2015) identified a periodic variability of in quasar PSO J334.2+01.4 using data from the Pan-STARRS1 Medium Deep Survey. However, such a detection was found to be insignificant in a subsequent analysis of extended data (Liu et al., 2016); see Liu et al. (2019) for a systematic search over 9000 quasars which resulted in one candidate in their extensive analysis.
Following the periodicity report of PG1302102, Vaughan et al. (2016) cautioned that the stochastic variability of normal quasars (i.e., those that do not host binary black holes) can resemble a periodic feature in light curves that span only a few periods and highlighted the importance of careful evaluation of false alarm rate. Through Bayesian model comparison, Vaughan et al. (2016) found a red-noise model is significantly favoured over a sinusoidal model for PG1302102 based on the CRTS data. More recently, Liu et al. (2018) revisited the periodicity of PG1302102 using additional data from the All-Sky Automated Survey for Supernovae (ASAS-SN). They employed a maximum likelihood method to search for a periodic signal in the presence of quasar red noise and found that the inclusion of ASAS-SN data reduced the periodicity significance. Therefore, Liu et al. (2018) concluded that the binary black hole model was disfavoured for PG1302102. Kovačević et al. (2019) proposed a model that posits a cold spot in the accretion disc of the primary black hole. Such a model can produce a perturbed sinusoidal feature and thus explain the apparent decrease in periodicity significance found by Liu et al. (2018).
Here, we propose a fully Bayesian framework for the identification of supermassive binary black hole candidates in time-domain electromagnetic surveys. It is capable of dealing with generic signal forms in the presence of red noise. Our work improves on previous studies in several ways. First, our method allows the inference of noise properties and signal parameters simultaneously and thus accounts for potential covariance between a periodic signal and quasar red noise. Second, it is robust to offsets in data collected with different surveys and possible over/under-estimation of measurement uncertainties. Third, we adopt a general form of quasar red noise and search for deviation from the commonly assumed damped random walk (DRW) model (Kelly et al., 2009). We also compare the sinusoidal signal hypothesis against a quasi-periodic oscillation (QPO) model based on behavior observed in many stellar-mass black hole X-ray binaries (see, e.g., Motta, 2016, for a recent review).
The remainder of this paper is organized as follows. In Section 2, we describe the Bayesian inference framework and introduce our signal and noise models. In Section 3, we apply the method to PG1302102 with data from CRTS, ASAS-SN, and the Lincoln Near-Earth Asteroid Research (LINEAR) survey, and present our analysis results. In Section 4 we discuss various aspects of a periodicity search through simulations. Last, we provide concluding remarks and outline directions for future work in Section 5.
2 Bayesian inference and model selection of time series
In this section, we describe the framework of Bayesian inference and model selection for the analysis of time-series data. We focus on the case of searching for periodicity in quasar light curves—quasar brightness measurements as a function of time. Assuming stationary Gaussian noise, the likelihood function for a quasar light curve is
where is the time-series light curve data with length , and include the noise and signal parameters, respectively. In this work we use measurements of optical magnitudes (i.e., the logarithmic light curve). The data is modeled as
(2) |
where is the noise vector, which contains measurement uncertainties and additional intrinsic stochastic quasar variability; is a constant vector with identical entries of , and accounts for the mean magnitude and any constant offset (e.g., constant level of contamination due to host galaxy light); the signal vector is given by
(3) |
The signal parameters are , where the signal frequency is related to the period by . In Equation (2), is the noise covariance matrix, which contains two components, , where is a diagonal matrix that represents white noise, and accounts for the stochastic quasar variability, usually termed “red noise.”
The white noise matrix takes the following form
(4) |
where is the measurement uncertainty for the th observation, is a scale factor used to quantify the over/under-estimation of measurement uncertainties222This is equivalent to the EFAC (Error FACtor) parameter used in pulsar timing data analysis. An additional EQUAD (Error added in QUADrature) parameter could be introduced to account for potential systematic errors that are not included in measurement uncertainties., is the Kronecker delta function.
In the DRW model (Kelly et al., 2009), the covariance function is an exponential function. The red-noise covariance matrix is
(5) |
where is the intrinsic variance between observations on short timescales ( day), and is the damping timescale, and . The noise power spectral density of the DRW model is
(6) |
To account for the possibility that the quasar red noise deviates from the DRW model (e.g., Zu et al., 2013; Guo et al., 2017), we also consider the following form of noise covariance matrix:
(7) |
This is also called the stretched exponential function333In the context of describing relaxation in disordered systems, it is called the Kohlrausch-Williams-Watts function.. Note that the special case of corresponds to the DRW model, means white noise, and reduces to the Gaussian function.
We further extend our red-noise covariance matrix to account for the QPO phenomenon
(8) |
where is the period of the QPO. In the case of , the power spectral density of the above covariance function becomes
(9) |
where is the QPO frequency. Note that in the limit of , the above equation reduces to Equation (6). In practice, this model is indistinguishable from the DRW model once is longer than the data span. The noise parameters are . We illustrate the red-noise models and discuss their phenomenology in detail in Appendix A.
We use Bayesian model selection to quantify the statistical significance of the presence of periodic signals in quasar light curves. We start with Bayes’ theorem, which states that
(10) |
Here is the posterior probability distribution function of parameters given data and hypothesis ; is the likelihood function given in Equation (2), which describes the probability of observing data given the hypothesis and parameters . Meanwhile, is the prior distribution of parameters while is the Bayesian evidence for hypothesis
(11) |

Given the observational data, we wish to compare two hypotheses: = the data are consistent with only noise (i.e., ), and = there is a periodic signal present in the data. The ratio of posterior probability, called the odds ratio, between hypotheses and is:
(12) |
where and are the prior probability for hypotheses and , respectively. Assuming equal prior probability for both hypotheses, Bayesian model selection is usually performed by computing the Bayes factor:
(13) |
In this work, we are concerned with the support for a sinusoidal signal quantified by . We also wish to compute the support for the presence of QPO by comparing two models involving Equations (8) and (7), and for the deviation from the DRW model, by comparing two models involving Equations (7) and (5). Following Kass & Raftery (1995), the interpretation of Bayes factors is as follows. In a natural logrithmic scale, indicates that the support (for the hypothesis in the numerator) “worth no more than a bare mention,” implies positive support, implies strong support and implies very strong support. These thresholds are somewhat arbitrary; is often adopted as the detection threshold in gravitational-wave astronomy (e.g., Thrane & Talbot, 2019). Throughout this paper, the log evidence or log Bayes factor refer to the natural logarithm.

3 Application to PG1302102
PG1302102 is a nearby bright quasar with a median V-band magnitude of at a redshift of . We apply our method to V-band magnitude measurements collected with CRTS (Drake et al., 2009), ASAS-SN (Shappee et al., 2014; Jayasinghe et al., 2019), and LINEAR (Sesar et al., 2011). The CRTS data are publicly available444http://nesssi.cacr.caltech.edu/DataRelease/ as part of the Catalina Surveys Data Release 2, including 290 phototmetric measurements taken between 6 May 2005 and 30 May 2013. The ASAS-SN data are downloaded from its photometry database555https://asas-sn.osu.edu/photometry as part of their Data Release 9, including 232 measurements acquired between 23 November 2013 and 27 November 2018. The LINEAR data contain 626 measurements made between 23 January 2003 and 12 June 2012. The median measurement uncertainties are 0.06, 0.05 and 0.01 mag for CRTS, ASAS-SN and LINEAR data, respectively. The span of the combined data set is . To reduce the computational cost, we average LINEAR data with an interval of , which reduces the number of data points to . The bin size is chosen such that any stochastic time-correlated variations over a timescale greater than are preserved. The new uncertainty for binned data is computed as the standard deviation of original measurements inside the binning window. The median uncertainty for averaged LINEAR data is .
Figure 1 shows the data used in our analysis: LINEAR in pink, CRTS in black and ASAS-SN in blue. The error bars indicate the 1- measurement uncertainties. We also show the 68% credible region of the sinusoidal signal (grey shaded band) with a period of inferred from the data, and a red-noise realization (thin light blue line) that contains a QPO with a period of and might fit the data well.
Before we present details of our analysis results, we show in Figure 2 the Lomb-Scargle periodogram of PG1302102 data (solid blue line), along with the averaged periodogram taken from noise realizations for three models: red noise (dotted red line), red noise plus a QPO (dash-dotted green line), and red noise plus a sinusoid (dashed cyan line). The vertical black dash line indicates the data span of for PG1302102. We simulate data using the same sampling in time and error bars as real data to obtain the average periodogram for three models. The following median posterior probability model parameters as inferred from real data are adopted: 1) , , , for the red-noise hypothesis; 2) , , , , for the “red noise + QPO” hypothesis; 3) , , , , , , for the “red noise + sinusoid” hypothesis. The purpose of Figure 2 is to provide some insights into how different models might fit the data, instead of a proper accounting of the goodness of these models. The periodogram of real data exhibits a strong peak at , with secondary peaks at around 1 year. As one can see, the major peak can be reproduced with both a QPO and sinusoidal model but not the pure red-noise model. The secondary peaks at around 1 year are probably artefacts of the irregular sampling of the data, since simulated data that contain no signal at that period also result in apparent peaks there.
3.1 Results
To implement the analysis outlined in Section 2 for PG1302102 data, we employ the Bilby software package (Ashton et al., 2019), which is a general and versatile Bayesian inference library developed primarily for gravitational-wave astronomy. For the stochastic sampling of posteriors and evidence calculation, we use the dynamic nested sampling method developed by Higson et al. (2019), which is available through the Dynesty package (Speagle, 2020). The uncertainty of the logarithmic model evidence () computed by Dynesty is . By performing independent runs of the same sampling process, we verify that the returned values of are consistent within the uncertainty, and that the posterior distributions have converged.
Parameter | Prior description |
---|---|
Same for prior & | |
(mag) | Uniform, min=0, max=0.5 |
(radian) | Uniform, min=0, max= |
(yr) | Uniform, min=0, max=10 |
(mag) | Uniform, min=14.5, max=15.5 |
Uniform, min=0.1, max=2 | |
Uniform, min=0, max=2 | |
prior | |
(mag2 yr-1) | Normal, mean=, width=1.15 |
(yr) | Normal, mean=, width=1.15 |
prior | |
(mag2 yr-1) | Uniform, min=, max=0 |
(yr) | Uniform, min=, max=4 |
prior , same as prior , with and fixed. |
Our priors are specified in Table 1. We consider three cases denoted by prior , , and . The prior employs log-normal priors for the DRW model parameters and , same as those used in Vaughan et al. (2016), which were based on earlier studies of quasar red noise under the DRW model (MacLeod et al., 2010; Kozłowski et al., 2010; Andrae et al., 2013). The prior employs a log-uniform prior for both and . Specifically, the prior edges on correspond to and . For the remaining parameters, we assign uniform priors. There is one mean magnitude and one white-noise scaling factor for each data set (CRTS, ASAS-SN and LINEAR). Prior is the same as prior , except that we fix these six parameters: is fixed at the weighted mean magnitudes - , , and for CRTS, ASAS-SN, and LINEAR, respectively; is fixed at 0.22, 1.15 and 1.0 for CRTS, ASAS-SN, and LINEAR, respectively, based on analysis of the data under the best-performing hypothesis as we discuss below. We find the posteriors on these six parameters are well constrained and do not change with respect to the choice of signal/noise terms included in the hypothesis or the choice of prior for red noise parameters.
In Table 2, we show the effect of the choice of priors on for the red-noise hypothesis. We find that prior results in slightly higher than prior . We also note that prior leads to posterior distribution of that is in the low-probability tail of the log-normal prior. Therefore, we adopt the less-informative log-uniform prior on red-noise parameters and . By fixing and , the evidence increases by 14.6 for the full combined data set (comparing prior against prior ). This shows that the inclusion of these parameters is unnecessary, and we only present results based on prior in the remaining sections unless otherwise specified.
C | C+L | C+A | C+L+A | |
---|---|---|---|---|
prior | 0 | 0 | 0 | 0 |
prior | 0.7 | 1.2 | 1.1 | 1.2 |
prior | 5.9 | 11.8 | 10.1 | 15.8 |
C | C+L | C+A | C+L+A | |
1. red | 0 | 0 | 0 | 0 |
2. red+sine | 9.1 | 14.3 | 4.6 | 12.7 |
3. DRW | 1.3 | 1.6 | ||
4. DRW+sine | 6.1 | 0.2 | 4.0 | 1.3 |
5. red+QPO | 9.3 | 16.5 | 6.9 | 14.5 |
6. red+QPO+sine | 9.2 | 17.2 | 6.2 | 14.1 |
3.0 | 14.1 | 0.6 | 11.4 |
In comparison to Vaughan et al. (2016), we find that the DRW model is overwhelmingly favoured over the sinusoidal model, with () using CRTS data alone (under prior ). This is consistent with the result of obtained in Vaughan et al. (2016). However, it may be argued that the choice between pure red noise and pure sinusoid is a false dilemma as we expect a realistic signal to be characterized by both red noise and a sinusoidal signal. The question investigated here therefore is whether or not there is a sinusoidal signal on top of red noise. The corresponding physical picture is that fluctuations in the accretion disc of the primary black hole (or the circumbinary disc) produces red noise and the binary motion results in periodic variations.
In Table 3, we list the log Bayes factor for six hypotheses with respect to the red-noise only hypothesis (Eq. 7, denoted as “red”). We examine the presence of a sinusoidal signal (denoted as “sine”) on top of additional two noise scenarios: the DRW model (Eq. 5) and the red noise plus a QPO term (Eq. 8, denoted as “red+QPO”). Moreover, compares the “red+sine” hypothesis to the “DRW+sine” hypothesis, and indicates the level of support for deviation from the DRW red noise.


There are a few take-away messages from Table 3. First, there is very strong evidence against the noise-only hypothesis. Using the full data set (C+L+A), we find strong support for either a sinusoidal signal () or a QPO term (), with the QPO slightly favoured with . Assuming equal prior probability for both signal hypotheses, the QPO hypothesis is favoured by an odds ratio of . While this is inconclusive, we discuss other hints toward the QPO later on. Second, the DRW red-noise model is strongly disfavoured, with (a Bayes factor of ) using the full data set. Third, there is a moderate reduction in associated with the inclusion of ASAS-SN data for almost all Bayes factors. For the sinusoidal hypothesis, this is interpreted as evidence against the periodicity by Liu et al. (2018). We look into such a feature in more detail in Section 4. Last, there is no support for the presence of sinusoidal signal on top of the “red+QPO” scenario.
The full posterior distribution of model parameters for the best performing hypothesis (“red+QPO”, using prior ) is presented in Appendix B. We find the CRTS measurement uncertainties are significantly overestimated by a factor of 5 (), whereas those of ASAS-SN are slightly underestimated (). The unusual value of for CRTS data is consistent with that found in Vaughan et al. (2016), and can be verified by binning the original measurements inside a window and computing the dispersion. For LINEAR data, we find , which is consistent with .
Figure 3 shows the posterior distributions for the “red+QPO” (left panel) and the “red+sine” hypothesis (right panel), for three combinations of data sets. It can be seen that the use of the full data set generally results in tighter constraints on model parameters. For the “red+QPO” hypothesis, the exponential index is measured to be using the full data set. This rules out (the DRW model) with more than credibility, consistent with the Bayes factor results in Table 3. This measurement of , which is consistent with that for the “red+sine” hypothesis, implies that the red noise in PG1302102 features a power-law spectral density with a power-law index less than 2. The red-noise time scale is long, , which is in the low-probability tail of the log-normal prior distribution (our prior ) as adopted in Vaughan et al. (2016).
Comparing two panels in Figure 3, we find that, 1) posteriors of the “red+QPO” hypothesis are better constrained; 2) there appear to be two posterior modes for “red+sine” hypothesis for the red-noise amplitude and timescale . These two parameters are correlated, and one of the two modes – large and small – is consistent with the posterior distribution found under the “red+QPO” hypothesis; 3) the posteriors for three combinations of data sets are consistent for the “red+QPO” hypothesis, whereas those for the “red+sine” hypothesis are unstable. Specifically, the credible region for the sinusoidal period is inconsistent between CRTS ( yr) and the full data set ( yr). This comparison is an example of what is known as a posterior predictive check, in which one performs sanity tests to make sure the models are suitable for Bayesian inference. The instability of posterior distributions with the inclusion of more data may suggest the “red+sine” hypothesis does not fully describe the data.
4 Discussion
4.1 What does it mean that the significance of the periodicity in PG1302-102 goes down when we add more data?
The growth in detection significance of a sinusoidal signal is not guaranteed in the presence of red noise. A red-noise process produces long-term correlations in the data, meaning the noise component in new data is not independent from old data. To demonstrate this effect, we inject a periodic signal into 10 random realizations of quasar red noise. We choose exactly the same sampling and error bars as shown in Figure 1. Each of the 10 data sets contain three subsets, from LINEAR, CRTS and ASAS-SN. We compute the Bayes factor, comparing the “red+sine” hypothesis against the red-noise-only hypothesis, for different combinations of subsets.
We choose the following parameters, which are consistent with posterior estimates of PG1302102: mag, , , , , , , and . The red noise realization is generated using . Here is a lower triangular matrix obtained from the Cholesky decomposition of the noise covariance matrix where is the conjugate transpose of . The vector of contains independent random numbers that follow the standard normal distribution.
Figure 4 shows the Bayes factors from 10 simulated light curves of PG1302102 as light blue dots with green lines. Several features are noteworthy. First, the general trend is that Bayes factors grow when we add LINEAR and ASAS-SN data to CRTS data. Second, there are two noise realizations where the addition of LINEAR data onto CRTS data results in a reduced detection significance of periodicity. We note that these two realizations have relatively low initial Bayes factors (). Therefore, we conclude that 1) whether or not the periodicity significance grows with the inclusion of a certain set of additional data cannot be used as a simple criterion for true periodicity; 2) the Bayes factor is more likely to grow with time once the initial Bayes factor is high (e.g., ). Third, the addition of ASAS-SN data generally leads to a greater improvement in detection significance than LINEAR. As a sanity check, we also show the log Bayes factors for 10 noise-only data sets as grey dots in Figure 4, using the same noise parameters mentioned earlier. In this case, the log Bayes factors are around , correctly disfavouring the presence of sinusoidal signals. The addition of new data does not change the Bayes factor significantly.


In Fig. 4, we also show results from the analysis of real data as blue diamonds. The evolution of , particularly the reduction in associated with ASAS-SN data, is in drastic contrast with our simulations. In Fig. 5, we plot the posterior distributions for one of the sinusoidal injections for different combinations of sub-data sets. In comparison to the right panel of Figure 3, which is the counterpart plot for real data, we find the posteriors are more stable. Taken together, our simulations hint toward a problem with the sinusoidal signal hypothesis for PG1302102.
Lastly, as a side note, the simulation study presented here highlights the challenge in the periodicity search in quasar light curves; namely, there is only one realisation of the red noise. This situation is in contrast with the search for continuous gravitational waves (also essentially sinusoidal signals) using pulsar timing arrays. Many millisecond pulsars in the timing array have been found to exhibit red noise for which the origin is largely unknown. However, these red noise processes, if intrinsic to pulsars themselves, are not expected to correlate among different pulsars. A false detection caused by red noise in one particular pulsar can be ruled out by cross-checking other pulsar data. This sort of cross-check is not always possible when searching for binary black holes in quasar light curves.
4.2 Can we distinguish a sinusoid from a QPO?
Our analysis of PG1302102 data reveals modest evidence for a QPO over a sinusoidal signal. It is natural to ask: under what circumstances can we distinguish a true sinusoid from a QPO? To find out, we compute the Bayes factor between the sinusoidal hypothesis and the QPO hypothesis for simulated data that contain a sinusoidal signal and some red noise. We take the sampling and error bars of PG1302102 data, and vary the signal period so that the data (spanning ) include 3, 6 and 9 wave cycles. For Injection , we choose the following parameters , , , , . Injection is the same except that we increase the signal amplitude () and reduce the red-noise amplitude ().
Number of signal cycles | 3 | 6 | 9 | |
---|---|---|---|---|
Injection | 15.0 | 25.5 | 36.4 | |
0.1 | 2.4 | 2.1 | ||
Injection | 40.6 | 68.9 | 101.3 | |
4.6 | 7.5 | 13.3 |

Table 4 lists the Bayes factors for both injections. One can see that the Bayes factor that supports the presence of a sinusoidal signal increases with the number of signal cycles, as expected. However, for Injection , the sinusoid is indistinguishable from a QPO even after 9 wave cycles. In fact, the Bayes factor slightly favours the QPO hypothesis for the 6-cycle and 9-cycle cases. For the stronger Injection , it becomes possible to tell the signal is a sinusoid instead of a QPO, and the corresponding Bayes factor grows with the number of wave cycles.

Figure 6 shows the posterior distributions for the 3-cycle case of Injection . Whereas the injection parameters are correctly recovered for the true “red+sine” hypothesis, red-noise parameters are mistaken by the (incorrect) “red+QPO” hypothesis. Noting that the quality factor of the QPO is given by , it is unsurprising that the QPO hypothesis results in posteriors that peak at large and small red-noise amplitude .
4.3 How does binning affect the detection significance and parameter estimation?
It is common that some sort of averaging or binning is applied when analysing time-series data. Information is inevitably lost in this process. Here we demonstrate the potentially adverse effect of binning on the periodicity detection significance and parameter estimation precision with an example. We choose the simulated data set that gives rise to the highest log Bayes factor in Figure 4. Because the LINEAR data used in this work have already gone through an averaging process with an interval of , we only use simulated data for CRTS and ASAS-SN. Figure 7 shows the simulated data, their binned versions with an interval of , and the injected sinusoidal signal.
We compute two Bayes factors: a) the “red+sine” hypothesis against the red-noise-only hypothesis, and b) the “red+sine” hypothesis against the “DRW+sine” hypothesis. Cases a) and b) indicate the level of support for periodicity and for deviation from the DRW model, respectively. The log Bayes factors are and for case a) and b), respectively, using the original data. These Bayes factors become and for case a) and b), respectively, using the binned data. Therefore, the binning process not only reduces our ability to detect a periodicity but also to identify deviation from the DRW red-noise model (noting the high-frequency red-noise fluctuations illustrated in Figure 1).
Posterior distributions for this injection study are shown in Appendix C in Figures 10 (unbinned data) and 11 (binned data). The binning process generally broadens the posterior distributions of both signal and noise parameters. Note, in particular, that posteriors for binned data are uninformative, which agrees with the Bayes factor () that shows the two red-noise models are indistinguishable. A multimodal feature in the 2-D posterior of can be seen in Figure 10. This can be attributed to the fact that we are estimating red noise and the periodic signal parameters simultaneously, and the data cover only a few () signal cycles.
4.4 On the implementation of our method to a large sample of quasar light curves
The computational cost in our analysis is dominated by the computation of the inverse and determinant of the noise covariance matrix associated with the likelihood evaluation, which scales as . For the entire data set of PG1302102 analysed in this work, . A single likelihood evaluation takes on the order of and the full parameter estimation and evidence calculation for such a data set take about on a single modern CPU core. The computational cost is the reason why we choose to average LINEAR data in an interval of , which reduce the total number of data points from to . Furthermore, there are a large number of quasars for which long-term photometric measurements are available, for example quasars from the CRTS as processed in Graham et al. (2015a). Therefore, further speed-up of our method is highly desirable.
This is a well-known problem in the analysis of astronomical time series. In pulsar timing arrays, there are normally hundreds to thousands of times of arrival measurements for each pulsar and the problem involves correlation analysis with data from dozens of pulsars. Various acceleration/approximation techniques have been proposed to enable full Bayesian analysis of pulsar timing array data. One popular method is to approximate the red noise as the sum of Fourier components, and thus its covariance matrix is given by where is (with ) and is a diagonal matrix. This turns the computationally heavy inversion of into the lower-rank diagonal matrix inversion through the Woodbury matrix lemma (e.g., van Haasteren & Vallisneri, 2015; Lentati et al., 2013). Elsewhere, Foreman-Mackey et al. (2017) proposed a fast Gaussian-process model that makes use of the feature where the covariance function can be expressed as a mixture of complex exponentials. Implemented as the Celerite package, its computational cost scales linearly with the size of the data set; the same scaling for computational cost is shared by the CARMA method (Kelly et al., 2014). All of these methods may prove useful for the analysis of light curves for a large number of quasars within the Bayesian framework developed here.
5 Conclusions
Sub-parsec supermassive binary black holes are crucial in our understanding of galaxy evolution. They are also the primary sources of nanohertz gravitational waves highly anticipated by international pulsar timing array campaigns. While nearly impossible to resolve through direct imaging, such close binaries are expected to produce periodic variations in light curves of active galactic nuclei. New candidates of periodicity are reported on a nearly monthly basis.
In this work, we propose a fully Bayesian method for the identification of periodicity in astronomical time-series that exhibit red noise. We apply this method to one of the most promising periodicity candidates, PG1302102, using data from CRTS, ASAS-SN and LINEAR surveys. Our main findings are:
-
1.
There is very strong support for the presence of either a sinusoidal signal () or a QPO () at a period of . The data slightly favours the QPO hypothesis with an odds ratio of 6.
-
2.
The inclusion of ASAS-SN data reduces the log Bayes factor that supports the presence of a sinusoidal signal by 1.6. This, combined with the fact that the posterior distribution of sinusoidal period is unstable when we combine new data with CRTS, provides further evidence against the sinusoidal hypothesis.
-
3.
There is also conclusive evidence for deviation from the DRW red-noise model, with . The noise power spectral density is shallower than a power law with an index of 2, and the red-noise damping timescale is long .
We perform simulations of sinusoidal signals and red noise and demonstrate the following:
-
1.
The growth of the periodicity significance with additional data is not guaranteed because of the stochastic fluctuations of red noise. For data sets like LINEAR, CRTS and ASAS-SN, the log Bayes factor is expected to grow once is achieved with initial data.
-
2.
Our method is capable of distinguishing a sinusoidal signal from a QPO. However, this is only possible for strong signals or a large number of wave cycles. A strong prior constraint on the red noise would also help.
-
3.
The use of data binning can reduce our ability to detect periodicity or deviation from the DRW model, because the binning throws away high-frequency information that helps estimate the spectral slope of the red noise (see Fig. 8).
We show that the data of PG1302102 favours the presence of a quasi-periodicity at around against the noise hypothesis with a Bayes factor of . However, given that PG1302102 was selected as the most periodic candidate out of quasars, it cannot be ruled out that such a quasi-periodicity is caused by pure red noise. Assuming that it is indeed a QPO and the central black hole is (Graham et al., 2015b), we infer a period corresponding to a Keplerian orbit at 344 Schwarzschild radii (). We note that the QPO found in PG1302102 roughly corresponds to the low-frequency QPOs seen in stellar-mass black hole X-ray binaries, although the QPO frequency of PG1302102 lies one order of magnitude below the linear scaling relation between black hole mass and QPO frequency fitted to several candidates in Smith et al. (2018). The physical origins of QPOs are poorly understood and therefore such a scaling relation might not exist if there are different mechanisms driving QPOs in small and big black holes. Nevertheless, the QPO hypothesis of PG1302102 can be tested with continued observations, and if the quasi-periodicity signature is confirmed, it may have significant implications for accretion physics of supermassive black holes.
Finally, our Bayesian framework can be adopted to establish unambiguous binary black hole detections with the following extensions:
-
1.
Apply this method to various physical models for the periodicity, such as periodic mass accretion rate of the binary (e.g., Farris et al., 2014) or jet precession of a single or binary supermassive black holes (e.g., Abraham & Carrara, 1998; Kudryavtseva et al., 2011; Britzen et al., 2018), in addition to relativistic Doppler boosting;
- 2.
-
3.
Combine multi-wavelength observations (e.g., Xin et al., 2020), and other signatures associated with a binary black hole, for example, Doppler velocity offsets in broad emission line profiles (Eracleous et al., 2012; Bon et al., 2012; Li et al., 2016), and flux deficits in the spectral energy distribution (Gültekin & Miller, 2012; Guo et al., 2020). See Zheng et al. (2016) for a binary black hole candidate for which different types of signatures are analyzed;
-
4.
Use astrophysically-motivated population priors. For example, the period distribution of binary black hole population is expected to be dominated by long periods, and the distribution of red-noise parameters can be obtained by applying our method to a large number of active galactic nuclei using hierarchical inference.
References
- Abraham & Carrara (1998) Abraham, Z., & Carrara, E. A. 1998, ApJ, 496, 172
- Aggarwal et al. (2019) Aggarwal, K., et al. 2019, ApJ, 880, 116
- Andrae et al. (2013) Andrae, R., Kim, D. W., & Bailer-Jones, C. A. L. 2013, A&A, 554, A137
- Ashton et al. (2019) Ashton, G., Hübner, M., Lasky, P. D., et al. 2019, ApJS, 241, 27
- Babak et al. (2016) Babak, S., Petiteau, A., Sesana, A., et al. 2016, MNRAS, 455, 1665
- Bansal et al. (2017) Bansal, K., Taylor, G. B., Peck, A. B., Zavala, R. T., & Romani, R. W. 2017, ApJ, 843, 14
- Begelman et al. (1980) Begelman, M. C., Blandford, R. D., & Rees, M. J. 1980, Nature, 287, 307
- Bon et al. (2012) Bon, E., Jovanović, P., Marziani, P., et al. 2012, ApJ, 759, 118
- Britzen et al. (2018) Britzen, S., Fendt, C., Witzel, G., et al. 2018, MNRAS, 478, 3199
- Charisi et al. (2016) Charisi, M., Bartos, I., Haiman, Z., et al. 2016, MNRAS, 463, 2145
- Chen et al. (2019) Chen, S., Sesana, A., & Conselice, C. J. 2019, MNRAS, 488, 401
- Colpi (2014) Colpi, M. 2014, Space Sci. Rev., 183, 189
- Cordes & McLaughlin (2019) Cordes, J., & McLaughlin, M. A. 2019, BAAS, 51, 447
- Dey et al. (2018) Dey, L., Valtonen, M. J., Gopakumar, A., et al. 2018, ApJ, 866, 11
- Dey et al. (2019) Dey, L., Gopakumar, A., Valtonen, M., et al. 2019, Universe, 5, 108
- D’Orazio et al. (2015) D’Orazio, D. J., Haiman, Z., & Schiminovich, D. 2015, Nature, 525, 351
- Drake et al. (2009) Drake, A. J., Djorgovski, S. G., Mahabal, A., et al. 2009, ApJ, 696, 870
- Eracleous et al. (2012) Eracleous, M., Boroson, T. A., Halpern, J. P., & Liu, J. 2012, ApJS, 201, 23
- Farris et al. (2014) Farris, B. D., Duffell, P., MacFadyen, A. I., & Haiman, Z. 2014, ApJ, 783, 134
- Foreman-Mackey et al. (2017) Foreman-Mackey, D., Agol, E., Ambikasaran, S., & Angus, R. 2017, AJ, 154, 220
- Goicovic et al. (2018) Goicovic, F. G., Maureira-Fredes, C., Sesana, A., Amaro-Seoane, P., & Cuadra, J. 2018, MNRAS, 479, 3438
- Goulding et al. (2019) Goulding, A. D., Pardo, K., Greene, J. E., et al. 2019, ApJ, 879, L21
- Graham et al. (2015a) Graham, M. J., Djorgovski, S. G., Stern, D., et al. 2015a, MNRAS, 453, 1562
- Graham et al. (2015b) —. 2015b, Nature, 518, 74
- Gültekin & Miller (2012) Gültekin, K., & Miller, J. M. 2012, ApJ, 761, 90
- Guo et al. (2020) Guo, H., Liu, X., Tayyaba, Z., & Liao, W.-T. 2020, MNRAS, 492, 2910
- Guo et al. (2017) Guo, H., Wang, J., Cai, Z., & Sun, M. 2017, ApJ, 847, 132
- Higson et al. (2019) Higson, E., Handley, W., Hobson, M., & Lasenby, A. 2019, Statistics and Computing, 29, 891
- Hinton (2016) Hinton, S. R. 2016, The Journal of Open Source Software, 1, 00045
- Hobbs et al. (2010) Hobbs, G., et al. 2010, Class. Quant. Grav., 27, 084013
- Jayasinghe et al. (2019) Jayasinghe, T., Stanek, K. Z., Kochanek, C. S., et al. 2019, MNRAS, 485, 961
- Kass & Raftery (1995) Kass, R. E., & Raftery, A. E. 1995, Journal of the American Statistical Association, 90, 773
- Kelly et al. (2009) Kelly, B. C., Bechtold, J., & Siemiginowska, A. 2009, ApJ, 698, 895
- Kelly et al. (2014) Kelly, B. C., Becker, A. C., Sobolewska, M., Siemiginowska, A., & Uttley, P. 2014, ApJ, 788, 33
- Khan et al. (2016) Khan, F. M., Fiacconi, D., Mayer, L., Berczik, P., & Just, A. 2016, ApJ, 828, 73
- Kharb et al. (2017) Kharb, P., Lal, D. V., & Merritt, D. 2017, Nature Astronomy, 1, 727
- Kovačević et al. (2019) Kovačević, A. B., Popović, L. Č., Simić, S., & Ilić, D. 2019, ApJ, 871, 32
- Kozłowski et al. (2010) Kozłowski, S., Kochanek, C. S., Udalski, A., et al. 2010, ApJ, 708, 927
- Kudryavtseva et al. (2011) Kudryavtseva, N. A., Britzen, S., Witzel, A., et al. 2011, A&A, 526, A51
- Laine et al. (2020) Laine, S., Dey, L., Valtonen, M., et al. 2020, ApJ, 894, L1
- Lentati et al. (2013) Lentati, L., Alexander, P., Hobson, M. P., et al. 2013, Phys. Rev. D, 87, 104021
- Li et al. (2016) Li, Y.-R., Wang, J.-M., Ho, L. C., et al. 2016, ApJ, 822, 4
- Liu et al. (2018) Liu, T., Gezari, S., & Miller, M. C. 2018, ApJ, 859, L12
- Liu et al. (2015) Liu, T., Gezari, S., Heinis, S., et al. 2015, ApJ, 803, L16
- Liu et al. (2016) Liu, T., Gezari, S., Burgett, W., et al. 2016, ApJ, 833, 6
- Liu et al. (2019) Liu, T., Gezari, S., Ayers, M., et al. 2019, ApJ, 884, 36
- MacLeod et al. (2010) MacLeod, C. L., Ivezić, Ž., Kochanek, C. S., et al. 2010, ApJ, 721, 1014
- Milosavljević & Merritt (2001) Milosavljević, M., & Merritt, D. 2001, ApJ, 563, 34
- Motta (2016) Motta, S. E. 2016, Astronomische Nachrichten, 337, 398
- Muñoz et al. (2020) Muñoz, D. J., Lai, D., Kratter, K., & Mirand a, R. 2020, ApJ, 889, 114
- Perera et al. (2019) Perera, B. B. P., et al. 2019, MNRAS, 490, 4666
- Rodriguez et al. (2006) Rodriguez, C., Taylor, G. B., Zavala, R. T., et al. 2006, ApJ, 646, 49
- Rosado et al. (2016) Rosado, P. A., Lasky, P. D., Thrane, E., et al. 2016, Phys. Rev. Lett., 116, 101102
- Ryu et al. (2018) Ryu, T., Perna, R., Haiman, Z., Ostriker, J. P., & Stone, N. C. 2018, MNRAS, 473, 3410
- Sesar et al. (2011) Sesar, B., Stuart, J. S., Ivezić, Ž., et al. 2011, AJ, 142, 190
- Shappee et al. (2014) Shappee, B. J., Prieto, J. L., Grupe, D., et al. 2014, ApJ, 788, 48
- Sillanpaa et al. (1988) Sillanpaa, A., Haarala, S., Valtonen, M. J., Sundelius, B., & Byrd, G. G. 1988, ApJ, 325, 628
- Smith et al. (2018) Smith, K. L., Mushotzky, R. F., Boyd, P. T., & Wagoner, R. V. 2018, ApJ, 860, L10
- Speagle (2020) Speagle, J. S. 2020, MNRAS, 493, 3132
- Taylor et al. (2017) Taylor, S. R., Simon, J., & Sampson, L. 2017, Phys. Rev. Lett., 118, 181102
- Taylor et al. (2016) Taylor, S. R., Vallisneri, M., Ellis, J. A., et al. 2016, ApJ, 819, L6
- Thrane & Talbot (2019) Thrane, E., & Talbot, C. 2019, PASA, 36, e010
- Valtonen et al. (2008) Valtonen, M. J., et al. 2008, Nature, 452, 851
- Valtonen et al. (2016) —. 2016, ApJ, 819, L37
- van Haasteren & Vallisneri (2015) van Haasteren, R., & Vallisneri, M. 2015, MNRAS, 446, 1170
- Vaughan et al. (2016) Vaughan, S., Uttley, P., Markowitz, A. G., et al. 2016, MNRAS, 461, 3145
- Verbiest et al. (2016) Verbiest, J. P. W., et al. 2016, MNRAS, 458, 1267
- Xin et al. (2020) Xin, C., Charisi, M., Haiman, Z., et al. 2020, MNRAS, 496, 1683
- Yu (2002) Yu, Q. 2002, MNRAS, 331, 935
- Zheng et al. (2016) Zheng, Z.-Y., Butler, N. R., Shen, Y., et al. 2016, ApJ, 827, 56
- Zhu et al. (2019) Zhu, X.-J., Cui, W., & Thrane, E. 2019, MNRAS, 482, 2588
- Zhu et al. (2014) Zhu, X.-J., et al. 2014, MNRAS, 444, 3709
- Zu et al. (2013) Zu, Y., Kochanek, C. S., Kozłowski, S., & Udalski, A. 2013, ApJ, 765, 106
Appendix A Red-noise models
In Fig. 8, we show the covariance function (left panel) and power spectral density (right panel) for the red-noise models considered in this work. Black dashed lines are for the DRW model, i.e., , and coloured lines are for different values of . Additionally, we show the case of a QPO on top of the DRW red noise in green line. The grey vertical line on the right panel indicates where yr is the data span of PG1302102. The time scale and the QPO period are both set to be 1 yr in these examples.

The prevalent DRW red-noise model features a spectral turnover at a frequency that corresponds to the damping timescale , and a power-law spectral density with a power-law index of at high frequency (). As can be seen in the right panel of Fig. 8, our general red-noise model covers a broad range of power-law spectral shapes, from a power-law index of zero () to extremely steep spectrum (). In order to measure the spectral shape of quasar red noise, i.e., the parameter, the high-frequency part is critical since that is where has the largest influence. In the case that , various models reduce to the power-law model. The QPO model, on the other hand, is drastically different as it features a peak in the power spectral density. The quality factor of the QPO, defined as the ratio of peak frequency of the QPO to its width, is determined by .
We note that the DRW model and the QPO prescription adopted here are special cases of the flexible CARMA (continuous-time autoregressive moving average) models. Kelly et al. (2014) demonstrated that CARMA models are useful in identifying new features in the power spectral density of astronomical time-series data. Since the power spectral density of CARMA models can be expressed as a sum of Lorentzian functions, in a similar fashion to the Celerite model described in Foreman-Mackey et al. (2017), it allows fast evaluation of the likelihood function which scales linearly with the number of data points. This makes it particularly powerful for analyzing massive time-domain data from a large number of objects. We leave the exploration of CARMA and Celerite models in our Bayesian inference framework for future work.
Appendix B Posterior distributions from analysis of real data
Here we present full posterior distributions from the analysis of the combined data set (Fig. 1) from CRTS, LINEAR and ASAS-SN for PG1302102. Fig. 9 shows the distributions for the red-noise plus a QPO hypothesis, which is the favoured hypothesis given the data. Note that the mean offset parameter for LINEAR data and CRTS data are highly covariant because there is a significant overlap in time for the two.

Appendix C Posterior distributions from analysis of simulated data
Here we present full posterior distributions from the analysis of a simulated data set (Fig. 7) that includes an injected sinusoidal signal. Fig. 10 shows the distributions for original data, whereas Fig. 11 shows results for the binned data with an interval of 100 days.

