This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Coronal height constraint in IRAS 13224–3809 and 1H 0707–495 by the random forest regressor

N. Mankatwit1, P. Chainakun1,2, W. Luangtip3,4, A. J. Young5
1School of Physics, Institute of Science, Suranaree University of Technology, Nakhon Ratchasima 30000, Thailand
2Centre of Excellence in High Energy Physics and Astrophysics, Suranaree University of Technology, Nakhon Ratchasima 30000, Thailand
3Department of Physics, Faculty of Science, Srinakharinwirot University, Bangkok 10110, Thailand
4National Astronomical Research Institute of Thailand, Chiang Mai 50180, Thailand
5H. H. Wills Physics Laboratory, Tyndall Avenue, Bristol BS8 1TL, UK
E-mail: [email protected]
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract

We develop a random forest regressor (RFR) machine learning model to trace the coronal evolution in two highly variable active galactic nuclei (AGNs) IRAS 13224–3809 and 1H 0707–495 observed with XMM-Newton, by probing the X-ray reverberation features imprinted on their power spectral density (PSD) profiles. Simulated PSDs in the form of a power-law, with similar frequency range and bins to the observed data, are produced. Then, they are convolved with relativistic disc-response functions from a lamp-post source before being used to train and test the model to predict the coronal height. We remove some bins that are dominated by Poisson noise and find that the model can tolerate the frequency-bin removal up to 10\sim 10 bins to maintain a prediction accuracy of R2>0.9R^{2}>0.9. The black hole mass and inclination should be fixed so that the accuracy in predicting the source height is still >0.9>0.9. The accuracy also increases with the reflection fraction. The corona heights for both AGN are then predicted using the RFR model developed from the simulated PSDs whose frequency range and bins are specifically adjusted to match those from each individual observation. The model suggests that their corona varies between 518rg\sim~{}5-18~{}r_{\rm g}, with R2>0.9R^{2}>0.9 for all observations. Such high accuracy can still be obtained if the difference between the true mass and the trained value is 10%\lesssim 10\%. Finally, the model supports the height-changing corona under the light-bending scenario where the height is correlated to source luminosity in both IRAS 13224–3809 and 1H 0707–495.

keywords:
accretion, accretion discs – black hole physics – galaxies: active – X-rays: galaxies
pubyear: 2021pagerange: Coronal height constraint in IRAS 13224–3809 and 1H 0707–495 by the random forest regressorReferences

1 Introduction

X-ray reverberation in active galactic nucleus (AGN) occurs when the primary X-ray continuum from the corona illuminates an accretion disc, producing the reflected X-rays that are observed with a time delay after the direct continuum (see Uttley et al., 2014; Cackett, Bentz, & Kara, 2021, for a review). The reverberation lags due to the delay between the variations of the reflection and the direct continuum components were detected via the Fourier-based technique (e.g. Fabian et al., 2009; De Marco et al., 2013; Kara et al., 2016). The lag amplitude depends on the light travel distance between the corona and the disc, providing valuable information about the geometry of the system. Most previous work investigated X-ray reverberation signatures in the lag-frequency spectra using the standard lamp-post geometry where the corona is an isotropic point-like source located on the rotational axis of the black hole (Cackett et al., 2014; Emmanoulopoulos et al., 2014; Chainakun et al., 2016; Epitropakis et al., 2016; Caballero-García et al., 2018), while some investigated it in an extended corona environment (Wilkins et al., 2016; Chainakun & Young, 2017; Chainakun et al., 2019).

The AGN emission fluctuates rapidly on a wide range of timescales. The amount of the X-ray variability power that the AGN emits can be quantified using the power spectral density (PSD), whose intrinsic shape can also be imprinted with X-ray reverberation effects (Papadakis et al., 2016; Chainakun, 2019). The observed PSD profiles tend to be relatively flat at low Fourier frequencies, with a steep drop-off at higher frequencies (González-Martín & Vaughan, 2012). The intrinsic shape of the PSD depends on the specific characteristics of the accretion disc (Arévalo & Uttley, 2006; Ashton & Middleton, 2022). An attempt at probing the reverberation signature (appeared as an oscillatory structure) in the PSD data of AGN to constrain the coronal height were carried out by Emmanoulopoulos et al. (2016); Chainakun et al. (2022). In this way, the coronal geometry can be constrained, but high accuracy prediction may still require the intrinsic PSD shape to be known in advance.

Furthermore, Chainakun et al. (2021) developed a machine learning (ML) model using dictionary-learning and support vector machine to predict the discrete values of the coronal height. Interestingly, while the reverberation-imprinted and power-law based PSD profiles were used to train the model, it was suggested that the accuracy of predicting the source height was still high even when the model was tested with a bending power-law PSD. Here, we aim to further develop the model so that it can predict a continuous value of the coronal height and can directly be applied to real observational data. We use a random forest regressor (RFR) which is an ensemble ML algorithm that is composed of many individual decision trees being trained on subsets of the data. Each tree makes its individual prediction, and the final prediction is achieved by statistical averaging the predictions of all individual decision trees.

We select to apply our developed RFR model to the AGN IRAS 13224–3809 and 1H 0707–495, both of which are of particular interest due to their unique X-ray spectral properties and extreme variability. IRAS 13224–3809 is a type 1 AGN that is known to have a supermassive black hole at its center, with a mass of 2×106M\sim 2\times 10^{6}\ M_{\odot} (Alston et al., 2020). It has been thoroughly studied via both spectral and timing analysis (e.g. Fabian et al., 2013; Kara et al., 2013a; Chiang et al., 2015; Parker et al., 2017; Jiang et al., 2018). Along the line of these studies, the changing of the coronal height with luminosity was evidenced (Alston et al., 2020; Caballero-García et al., 2020; Chainakun et al., 2022), where the height increases from 3\sim 35rg5~{}r_{\rm g} to 10\sim 1025rg25~{}r_{\rm g} (rg=GM/c2r_{\rm g}=GM/c^{2}, GG is the gravitational constant, MM is the black hole mass and cc is the speed of light) when the source changes from lowest to highest flux state. The model that simultaneously fitted the energy and lag spectra in various flux states also favoured a maximum spin black hole (Caballero-García et al., 2020). Meanwhile, the 1H 0707–495 corona geometry is very uncertain and under debate. While an analysis using the emissivity profiles suggested an extended corona for 1H 0707–495 (Wilkins & Fabian, 2012; Wilkins et al., 2014), a much more compact corona was obtained using a newly developed spectral model accounting for the spatial extent and the rotation of the corona (Szanecki et al., 2020). Both intrinsic and environmental absorption were also found when analysing the fractional excess variance of 1H 0707–495 (Parker et al., 2021).

This paper is organised as follows. The observations and data reduction are described in Section 2. How the PSD profiles are produced is explained in Section 3. In Section 4 we present the RFR algorithms used in this work and explain how the RFR model is developed. The results and analysis towards the performance of the model is given in Section 5, as well as the results when we apply the model to predict the coronal height in IRAS 13224–3809 and 1H 0707–495. The discussion and conclusion is provided in Section 6.

2 Observations and data reduction

The X-ray data of the AGN IRAS 13224–3809 and 1H 0707–495 used in this study were previously observed by XMM-Newton observatory (Jansen et al., 2001) and were obtained from the XMM-Newton Science Archive.111Available online at http://nxsa.esac.esa.int/ All observational data are shown in Table 1. Here, only pn data are used to obtain high quality data. To analyse the data, we created the PSDs from these observations following the method and criteria explained in Chainakun et al. (2022). In brief, the pn data were reprocessed using the Science Analysis Software (SAS) following the procedure suggested in XMM-Newton Science Analysis System’s user guide.222https://xmm-tools.cosmos.esa.int/external/xmm_user_support/documentation/sas_usg/USG.pdf The observational periods which were affected by high background flaring activity were also removed during these steps. The remaining useful exposure time of each observation after removing the background flaring events is shown in column 4 of Table 1. Note that, in this step, we only considered the observations with a remaining exposure time of \ga30 ks so that the signal to noise of their PSDs is sufficiently high.

Table 1: XMM-Newton observations of the AGN sample.
Observation ID Revolution number Observational date Exposure timea Count rateb
(ks) (count s-1)
IRAS 13224–3809
0673580101 2126 2011-07-19 34.08 0.37
0673580201 2127 2011-07-21 49.26 0.24
0673580301 2129 2011-07-25 52.03 0.09
0673580401 2131 2011-07-29 85.11 0.28
0780560101 3037 2016-07-08 39.38 0.19
0780561301 3038 2016-07-10 112.35 0.26
0780561401 3039 2016-07-12 99.76 0.22
0780561501 3043 2016-07-20 93.22 0.13
0780561601 3044 2016-07-22 97.85 0.38
0780561701 3045 2016-07-24 100.13 0.16
0792180101 3046 2016-07-26 110.64 0.13
0792180201 3048 2016-07-30 110.55 0.19
0792180301 3049 2016-08-01 86.44 0.07
0792180401 3050 2016-08-03 98.63 0.75
0792180501 3052 2016-08-07 102.66 0.23
0792180601 3053 2016-08-09 101.50 0.68
1H 0707–495
110890201 159 2000-10-21 29.94 0.13
148010301 521 2002-10-13 57.98 0.66
506200301 1360 2007-05-14 33.38 0.25
506200501 1379 2007-06-20 20.52 1.03
511580101 1491 2008-01-29 68.69 0.60
511580201 1492 2008-01-31 43.88 1.01
511580301 1493 2008-02-02 37.39 0.66
511580401 1494 2008-02-04 50.62 0.56
653510301 1971 2010-09-13 93.30 0.53
653510401 1972 2010-09-15 78.46 0.80
653510501 1973 2010-09-17 81.51 0.49
653510601 1974 2010-09-19 79.65 0.57
554710801 2032 2011-01-12 48.54 0.04
  • Note: aUseful exposure time used in the analysis. bThe background-subtracted count rate in the 1–4 keV energy band of the AGN obtained from the pn detector.

We then extracted the background-subtracted light curves of the AGN in the soft (0.3–1 keV) and hard (1.2–5 keV) energy bands – defined as reflection dominated and continuum dominated energy bands, respectively – from the observational data flagged with PATTERN \leqslant 4 and #XMMEA_EP for each observation. The corresponding PSDs were derived from the light curves which were divided into the length of \sim20 ks and re-binned to have a time resolution of 179 s. To improve the data signal to noise, we also re-binned the PSDs in which every bin width is equal in logarithmic space of base 1.06. These PSDs were then used as a representative of the real AGN sample in the RFR analysis in Section 5.2.

3 Reverberation and PSD model

The variability power of the AGN on different timescales can be represented using the PSD whose intrinsic shape follows a broadband power-law profile usually simplified in the form of

P0(f)fγ,P_{0}(f)\propto f^{-\gamma}\;, (1)

where ff is the Fourier frequency and γ\gamma is the power spectral index.

However, some characteristic features such as the dip and oscillatory structures caused by the X-ray reverberation can also be imprinted on the PSD profiles. We can assume that the total light curve includes both direct continuum and reverberation flux, and the total response function of the system can be written in the form of (e.g. Papadakis et al., 2016)

Ψ(t)=δ(t)+RFψ(t),\Psi(t)=\delta(t)+R_{F}\psi(t)\;, (2)

where δ(t)\delta(t) is a delta function representing the continuum flux response and ψ(t)\psi(t) is the disc response function representing the amount of the reflection flux detected by a distant observer, as a function of time, after the primary continuum flux. The precise ψ(t)\psi(t) is obtained via the ray-tracing simulations (e.g. Cackett et al., 2014; Chainakun et al., 2016; Wilkins et al., 2016; Epitropakis et al., 2016). Here, we use the kynxilrev model Caballero-García et al. (2018, 2020) to compute the disc response functions under the lamp-post geometry where an accretion disc is optically thick and geometrically thin and is illuminated by a point-like corona located on the rotational axis of the central black hole. The area under the disc response function is normalized to 1, so that the reflection fraction RFR_{F} determines the ratio of (reflection flux / continuum flux).

Based on the convolution theorem, the PSD including the X-ray reverberation effects can be obtained by Uttley et al. (2014); Papadakis et al. (2016); Chainakun et al. (2022)

Pobs(f)=|Ψ(f)|2P0(f),P_{\text{obs}}(f)=\left|\Psi(f)\right|^{2}P_{0}(f)\;, (3)

where Ψ(f)\Psi(f) is the Fourier form of the total response function. The intrinsic P0(f)P_{0}(f) is then an input signal which is processed by the total response function resulting in a reverberation feature appearing as an oscillatory structure in the Pobs(f)P_{\text{obs}}(f) (Papadakis et al., 2016; Chainakun, 2019).

By using the kynxilrev model (Caballero-García et al., 2018, 2020) to produce the disc response functions, the continuum spectrum is represented by a cut-off power-law: F(E)EΓEE/EcF(E)\propto E^{-\Gamma}E^{-E/E_{c}}, where Ec=300E_{c}=300  keV. The disc extends from the innermost stable circular orbit (ISCO) to 1000 rgr_{\rm g}. We initially fix the black hole spin to be a=1a=1. Other parameters in the kynxilrev model, if not specified, are set to their default values.

Examples of the simulated PSD profiles are presented in Fig. 1. In this illustration, we assume the black hole mass of 106M10^{6}~{}M_{\odot} and the inclination i=45i=45^{\circ}. The orange lines in the left panels represent the PSD that already include the reverberation effects due to the lamp-post corona. The main reverberation dip, as expected, shifts towards higher frequencies (shorter timescales) for lower coronal height. The corresponding noisy PSD profiles (blue lines) are simulated using the Timmer & Koenig (1995) method, then are binned into 30\sim 30 frequency bins within the frequency range of 3×105\sim 3\times 10^{-5}3×1033\times 10^{-3} Hz (red points in the right panel), similar to what is normally extracted from the XMM-Newton observations of IRAS 13224–3809. After being binned, these noisy PSD data are then used to train the machine, as their shapes and frequency bins are quite close to the observational ones.

Refer to caption h=4rgh=4~{}r_{\rm g}Refer to caption

Refer to caption h=26rgh=26~{}r_{\rm g}Refer to caption

Figure 1: Left panel: Clean PSD (orange lines) in the form of a power law and the corresponding noisy PSD (blue lines) generated using the method of Timmer & Koenig (1995), imprinted with the reverberation echoes with the source height h=4rgh=4~{}r_{\rm g} (top panels) and h=26rgh=26~{}r_{\rm g} (bottom panels). Right panel: Noisy PSD data after being binned and limited the frequency range to be 3×105\sim 3\times 10^{-5}3×1033\times 10^{-3} Hz (red points), as expected from IRAS 13224–3809 observations.

4 Machine learning algorithms

The machine learning (ML) algorithm used here is the random forest regressor (RFR). It aims to probe the characteristic features of the reverberation signatures that appear on the PSD profiles in order to predict the disc corona geometry, such as the coronal height and the disc inclination. We follow the standard procedure by splitting the PSD data into 80/20 for training/testing. All ML algorithms and related functions for the RFR model development are adopted from scikit-learn333https://scikit-learn.org/ (Pedregosa et al., 2012).

4.1 Random forest regressor

We use sklearn.ensemble.RandomForestRegressor() to implement the RFR algorithm to the model. The RFR is an ensemble method known as the bootstrapping random forest. The RFR algorithm combines the prediction result of multiple decision trees (DTs) to obtain a final powerful model for regression. Each DT starts at the top with the root node that is split into a left and right node, which is then split further in a tree structure until the model has a satisfying result. At each node, one of the data features is evaluated so that a certain path (i.e. branch) is determined toward the final node of the tree (referred to as the leaf node) where a numerical value is best estimated.

During the bootstrapping process (Fig. 2), the RF algorithm randomly selects, with replacements, the PSD samples in the training set (e.g. some PSD data may be repeated in the subset while others may not be included at all). This creates a new subset with a different set of PSD profiles for one decision tree. This process is repeated TT times meaning that we have TT decision trees, fnf_{n}, (n=1,2,3,,Tn=1,2,3,...,T) in the forest. For each individual tree, the data are trained and tested via the model given by fnf_{n}. The ensemble learning is initiated by using fnf_{n} to predict the unseen data set XX. After that, the result is obtained by averaging each individual prediction on unseen samples. This process leads to a final RFR model that can be written as

f^=1Tn=1Tfn(X).\hat{f}=\frac{1}{T}\sum_{n=1}^{T}f_{n}(X)\;. (4)

Refer to caption

Figure 2: Flowchart of the random forest bootstrap processing, where [x][x] is the complete PSD data set with the corresponding response yy. [P]n[P]_{n} is the new PSD data set for individual decision tree after bootstrapping process with the corresponding response ZnZ_{n}. The data are trained and tested individually via the decision-tree model fnf_{n} (n=1,2,3,,Tn=1,2,3,...,T), where TT is the total number of trees in the forest. The final model is then obtained by averaging the results from all decision-tree models.

4.2 Hyperparameter tuning and cross-validation

Hyperparameters are variables that are set before the learning phase. They help control and optimize the learning algorithm and are essential for achieving an accurate model. There are two main hyperparameters for the RFR model that we select to fine-tune, which are the maximum depth of the trees and the number of trees (estimators) in the forest.

To fine-tune these hyperparameters, we use sklearn.model_selection.GridSearchCV() that allows us to define a grid of possible values of each hyperparameter, before training the model using each combination of these values. The model is trained and validated via the KK-fold cross-validation process, which randomly divides the training set further into KK folds. The model is trained using the data in K1K-1 folds (new training set) and is evaluated using the remaining fold (validation set). The process is repeated KK times, with the model being trained and validated on different folds. Here, we fix K=5K=5, meaning that the training data are split into 5 folds having 4 folds (80%) for training and 1 fold (20%) for validating. The data that are initially split into the test set are then kept unseen all the time for the final evaluation.

After the cross-validation process, the best combination of hyperparameters is identified. For the final evaluation, the developed model is used to predict the data in the test set that are kept unseen during the training phase, so they represent the data that are completely new to the machine.

4.3 Data preparation and preprocessing

To prepare our data set, we simulated the response functions, ψ(t)\psi(t), using the kynxilrev model by varying the source heights, hh, between 2.330rg2.3-30~{}r_{\rm g} and the reflection fraction, RFR_{F}, between 0.61.60.6-1.6. The PSD index, γ\gamma, is varied between 0.52.50.5-2.5. For the black hole mass and inclination, we investigate the cases when they are either fixed or free. We also investigate other scenarios such as using different binning or removing some frequency bins (e.g. those that are dominated by Poisson noises) when we train and test the machine. These aim to determine the best way of data processing for the model to accurately predict the coronal height from real observational PSD data.

5 Results and analysis

5.1 Efficiency of RFR algorithm

Our simulated PSD profiles in the training data set are used for fine-tuning the hyperparameters via 5-fold cross-validation. Fig. 3 shows the accuracy scores for predicting the coronal height obtained from the training/validation process. In this illustration, we fix the black hole mass M=2×106MM=2\times 10^{6}~{}M_{\odot}, the inclination i=45i=45^{\circ} and use the number of frequency bins Nbin=30N_{\rm bin}=30. The RFR model can accurately predict the coronal height when the number of estimators is 50\gtrsim 50 and the maximum tree depth is 10\gtrsim 10, with the highest accuracy score of Rtrain2R^{2}_{\rm train} 0.96\sim 0.96. The best number of estimators and maximum depth obtained here are 200 and 26, respectively. Note that the R2R^{2} score444More explanation on the R2R^{2} score at https://scikit-learn.org/stable/modules/generated/sklearn.metrics.r2_score.html is the proportion of the variation in the dependent variable that is predictable from the independent variable. It is usually used to evaluate the ML model’s performance. The maximum R2R^{2} score is 1 which means that the model fits the data perfectly, or all independent variables can explain all possible interpretations of the dependent variable.

Refer to caption R2R^{2}

Figure 3: Cross validation score (R2R^{2}) for predicting hh for different values of hyperparameters. The highest Rtrain2R^{2}_{\rm train} is 0.96, corresponding to the case when the number of estimators is 200 and the maximum depth is 26.

Then, the developed RFR model is tested with the new, unseen data spared in the test data set. The true versus predicted values of the coronal height in this case is shown in Fig. 4. The obtained Rtest2R^{2}_{\rm test} is 0.95. Since Rtest2Rtrain2R^{2}_{\rm test}\sim R^{2}_{\rm train}, the model is generalized well (i.e. not overfitting the data). However, the model accuracy is relatively small at the low and high ends for the source height. The low accuracy at h2.3rgh\sim 2.3\ r_{\rm g} is because, with this assumed mass, the reverberation features are clearly noticeable at the frequencies of 3×103\gtrsim 3\times 10^{-3} Hz, or beyond the highest frequency probed in the PSD profiles. For h28rgh\gtrsim 28\ r_{\rm g}, less accuracy may arise because the reverberation bump is too weak in the majority of the PSD data, especially when the reflection fraction is also low.

Refer to caption

Figure 4: Plot of the true versus predicted values of the coronal height using the test data set. The Rtest2R^{2}_{\rm test} is 0.95, which is comparable to the Rtrain2R^{2}_{\rm train}. The blue solid line represents the results obtained from the RFR model. The red dashed line shows the perfect prediction line. The regions between the blue-dashed lines and between the blue-dotted lines represent the deviations of the predicted heights within ±1σ\pm 1\sigma and ±2σ\pm 2\sigma, respectively.

Furthermore, Fig. 5 shows that the RFR model is more effective in cases of the reflection fraction RF1R_{F}\geq 1 (Rtest2=0.98R^{2}_{\rm test}=0.98). This means that it is best to apply the model to the observational data extracted in the reverberation dominated band (e.g. 0.310.3-1 keV band).

Refer to caption RF<1R_{F}<1

Refer to caption RF1R_{F}\geq 1

Figure 5: Same as in Fig. 4, but showing the separate cases when the reflection fraction RF<1R_{F}<1 (top panel) and RF1R_{F}\geq 1 (bottom panel).

When the model is allowed to predict both coronal height and inclination, the accuracy is relatively small (Rtest2<0.5R^{2}_{\rm test}<0.5). This suggests that the model can make an accurate prediction for the coronal height, but not for the inclination. The poor prediction also occurs when we try to predict both source height and black hole mass, even when the reflection fraction RF>1R_{F}>1. Making an accurate prediction for all these parameters simultaneously may require fine-tuning more hyperparameters, so at this stage we select to fix ii and MM, and investigate further the effects of the data binning. There are also intrinsic degeneracies between model parameters (see discussion Section 6).

Fig. 6 (top panel) shows how the number of frequency bins (NbinN_{\rm bin}) affects the Rtest2R^{2}_{\rm test} when we fix M=2×106MM=2\times 10^{6}~{}M_{\odot}, i=45i=45^{\circ} and produce the PSD in the frequency range of 105\sim 10^{-5}10310^{-3} Hz. The developed RFR model seems to work very well (Rtest20.9R^{2}_{\rm test}\gtrsim 0.9) when Nbin8N_{\rm bin}\gtrsim 8. The observed PSD data of IRAS 13224–3809 and 1H 0707–495 are extracted using the number of frequency bins of 30\sim 30, which is definitely large enough that the reverberation features can still be interpretable by the model. Nevertheless, some frequency bins of the observational data can be dominated by Poisson noise. In this case, we investigate whether it will affect the accuracy score if we remove those Poisson-dominated bins. This is illustrated in Fig. 6 (bottom panel) when 2102-10 bins are randomly removed from the simulated PSD profiles, and the machine is trained and tested with these modified data. The Rtest20.95R^{2}_{\rm test}\gtrsim 0.95 in all cases suggests that the observational PSD data can be pre-processed by removing the Poission-noise dominated bins, as long as the machine is trained using the simulated PSD profiles that are modified and binned in a similar way.

Refer to caption

     Refer to caption

Figure 6: Accuracy score (Rtest2R^{2}_{\rm test}) for predicting hh varying with the number of frequency bins, NbinN_{\rm bin}, when the frequency range of the PSD is fixed at 105\sim 10^{-5}10310^{-3} Hz (top panel). We also show the Rtest2R^{2}_{\rm test} profile when we fix Nbin=30N_{\rm bin}=30 before some of the frequency bins are randomly removed (bottom panel). The sub-figure highlights the error bars obtained by repeating the random process 20 times. This illustrates that the Rtest2R^{2}_{\rm test} is still 0.95\gtrsim 0.95 even if the observational data are pre-processed by removing the Poission-noise dominated bins, as long as the machine is trained using the simulated PSD profiles that are binned consistently with the observational data. Note that Rtrain2Rtest2R^{2}_{\rm train}\sim R^{2}_{\rm test}, so the Rtrain2R^{2}_{\rm train} profile is omitted for clarity of the plots.

Fig. 7 shows examples of the observational PSD data of IRAS 13224–3809 (Rev. no. 2127) and the corresponding PSD profiles simulated for training and testing the machine. Their bins, as well as frequency range, are adjusted to be the same. The frequency bins of the observed PSD that are dominated by Poisson noise are removed, and these bins are removed from the simulated data too. Therefore, to predict the coronal height from each observation, each RFR model must be newly developed using the simulated PSD profiles that are specific to each individual observation. In this way, we can always ensure, based on the performance of the model, that the predicted coronal height is highly accurate (R20.9R^{2}\gtrsim 0.9).

Refer to caption IRAS 13224–3809(Rev. no. 2127)

Refer to caption Simulatated profiles

Figure 7: Top panel: Example of the observed PSD profile of IRAS 13224–3809 (Rev. no. 2127) extracted in the 0.3–1 keV. The bins where the Poisson noise dominate are removed. Bottom panel: Simulated corresponding PSD profiles with the mass and inclination fixed at M=2×106MM=2\times 10^{6}~{}M_{\odot} and i=45i=45^{\circ}, as expected for IRAS 13224–3809. The simulated profiles are binned specifically to Rev. no. 2127. Some frequency bins are removed similar to what is done to the observed PSD data.

5.2 Fitting IRAS 13224–3809 and 1H 0707–495

For IRAS 13224–3809, we fix M=2×106MM=2\times 10^{6}~{}M_{\odot} (Alston et al., 2020) and i=45i=45^{\circ} (Caballero-García et al., 2020). Meanwhile, for 1H 0707–495, we fix M=2.3×106MM=2.3\times 10^{6}~{}M_{\odot} (Zhou & Wang, 2005) and i=45i=45^{\circ}, which is an intermediate value between what’s reported in Caballero-García et al. (2018) and Wilkins & Fabian (2011). The developed RFR algorithm is then applied to predict the coronal height in 16 and 13 observations of IRAS 13224–3809 and 1H 0707–495, respectively, where their PSDs are extracted in the 0.3–1 keV band. We input each individual observed PSD data, one by one, and the specific RFR model can be produced. We find both Rtrain2R^{2}_{\rm train} and Rtest2R^{2}_{\rm test} are comparable, which are 0.9\gtrsim 0.9 for all observational data. The predicted coronal heights versus the count rates for both AGN are presented in Fig. 8. We find a moderate, monotonic correlation between coronal height and count rate in both IRAS 13224–3809 and 1H 0707–495. The mean squared errors are varied between 0.22.8\sim 0.2-2.8 among different data sets. We investigate two further cases, referred to as Test A and Test B, where we sample 20 PSD indices in the range of 0.5–2 and 40 PSD indices in the wider range of 0.5–4, respectively. We find that the predicted source heights are slightly different if we assume different ranges of the power law PSD indices. This, however, does not change the trend of increasing source height with count rate.

Fig. 9 shows an example of how the obtained accuracy changes if the mass of the target AGN differs from the value used to train the machine. If the mass difference is larger, the accuracy is undoubtedly smaller. We find that the R20.9R^{2}\gtrsim 0.9 can still be achieved if the true mass is ±10%\pm 10\% from the trained value.

Refer to caption IRAS 13224–3809

Refer to caption 1H 0707–495

Figure 8: Coronal height, hh, versus count rate of the observations of IRAS 13224–3809 (upper panel) and 1H 0707–495 (lower panel) as predicted by the RFR models when using the PSD extracted in 0.3–1 keV band. Test A and B refer to the cases when the power index of the training PSD data is allowed to vary between 0.5–2.5 and 0.5–4.0, respectively. The Spearman correlation coefficient (rSr_{S}) and pp-value (pp) are also presented.

Refer to caption


Figure 9: Accuracy score (Rtest2R^{2}_{\rm test}) for predicting hh when the RFR model is trained with the data where the mass is fixed at 2×106M2\times 10^{6}~{}M_{\odot}, but is tested with new data that have different values of the black hole mass. In this illustration, we use the frequency range of 105\sim 10^{-5}10310^{-3} Hz and Nbin=30N_{\rm bin}=30. For clarity, the Rtrain2R^{2}_{\rm train} profile is omitted since Rtrain2Rtest2R^{2}_{\rm train}\sim R^{2}_{\rm test}.

6 Discussion and Conclusion

Firstly, we investigate the effectiveness of the RFR model in estimating three physical parameters of AGN including the source height, inclination, and black hole mass. Our results reveal that the accuracy for forecasting the inclination is poor (R2<0.5R^{2}<0.5) because either the reverberation feature is indistinct to be extracted or the model requires a larger number of PSDs varying with inclination to be trained. Similarly, the accuracy in determining the black hole mass simultaneously with the coronal height is poor as the characteristics of PSDs varying with the mass can resemble those changing with the source height (Papadakis et al., 2016). For example, the data with smaller mass and larger height can also be reasonably explained by the models with larger mass but smaller height. The degeneracy between the mass and the source height is also found in other timing profiles such as the lag-frequency spectrum (e.g. Cackett et al., 2014; Caballero-García et al., 2018, 2020), and still cannot be easily broken by the model.

In the case of predicting the source height alone, the obtained accuracy is excellent (R20.9R^{2}\gtrsim 0.9), particularly when the reflection fraction RF1R_{F}\geqslant 1. This is because the reverberation feature in the PSD, such as the dip, is more pronounced. As a result, we focus on the PSD data extracted in the energy band of 0.30.3–1 keV where the reflection flux is usually dominated and choose to predict the source height by fixing the black hole mass and the inclination angle.

Chainakun et al. (2021) proved that the ML technique can be used to extract the reverberation signatures on the PSD profiles to predict the source height. However, for the ML approach, there is for a classification problem where the source heights are categorized into several distinct classes (i.e. discrete values), each of which represents a range of source heights. Here, we use the RFR algorithms so that the source height can be predicted as continuous values. The simulated PSD profiles being used to train, cross-validate and test the machine are also specifically produced using a similar frequency range and a similar number of frequency bins to each observational data. Higher Poisson noise can lead to higher standard deviation of the measured lag, without affecting the lag value (Kara et al., 2013b). Rather than quantifying the Poisson noise level in each observation and involving the errors from the PSD measurement, we select to directly remove the Poisson-noise dominated bins instead. This is acceptable as long as the training and test data are processed in the same way. The prediction accuracy of R2>0.9R^{2}>0.9 can still be achieved if the number of Poisson-noise dominated bins being removed is less than 10, which is likely the case for our observational data.

Once the developed RFR model is applied to observational data, we find the IRAS 13224–3809 corona varies between 518rg\sim 5-18~{}r_{\rm g}. This is comparable with the constrained coronal height of 620rg\sim 6-20~{}r_{\rm g} using the lag-frequency spectra (Alston et al., 2020). The combined spectral-timing analysis by Caballero-García et al. (2020) also reveals the coronal height varying between 3101+10rg\sim 3-10^{+10}_{-1}~{}r_{\rm g}, assuming a maximally spinning black hole. Recently, tracing the corona evolution from the reverberation signatures that appear in the PSD of IRAS 13224–3809, but with conventional fitting technique, suggests the varying corona located at the height of 325rg\sim 3-25~{}r_{\rm g} (Chainakun et al., 2022). We also find a moderate monotonic correlation showing the trend of increasing source height with the count rate, rS0.55r_{S}\sim 0.55 (p0.03p\sim 0.03), similar to Alston et al. (2020); Caballero-García et al. (2020); Chainakun et al. (2022). This is because the photons emitted from the corona located further away from the black hole can escape the strong gravitational field more easily than those emitted from the corona closer to the black hole, leading to higher source luminosity. The corona evolution in IRAS 13224–3809 then is quite consistent among these studies, even though different methods are used.

Here, we fix the IRAS 13224–3809 mass to be 2×106M2\times 10^{6}~{}M_{\odot} (Alston et al., 2020) and employ the intermediate inclination of 4545^{\circ} (Caballero-García et al., 2020). The result suggests that high accuracy can still be obtained as long as the fixed mass is 10\lesssim 10 per cent different from the true value. The spectral fits by Jiang et al. (2022) using a high-density disc model suggested the inclination of 6060^{\circ}7070^{\circ}. The choice of inclination, however, should not significantly affect the accuracy in predicting the coronal height since the reverberation features in timing profiles are less dependent with the inclination (e.g. Cackett et al., 2014; Papadakis et al., 2016). Recently, Hancock, Young, & Chainakun (2023) fitted an extended corona model, simplified using dual lamp-post sources, to the lag-frequency spectra of IRAS 13224–3809. By adopting the inclination i=64i=64^{\circ} as reported by Fabian et al. (2013) and placing the first, lower source at 2rg2~{}r_{\rm g}, they found that the second, upper source could vary between 320rg\sim 3-20~{}r_{\rm g}. Although different coronal geometry and inclination are assumed, the disc-corona distance constrained here is comparable with the upper source height suggested in Hancock, Young, & Chainakun (2023). Developing the RFR model to probe the spatial extent of the corona is possible, but it is a subject for future research.

Recently, Chainakun et al. (2023) employed the Granger-causality test to probe the intrinsic reverberation lags in IRAS 13224–3809 to be 200\sim 200–500 s. They directly converted the intrinsic lags to the true light-travel distance of 20\sim 20–50 rgr_{\rm g}, and by assuming a face-on disc the coronal height could be roughly approximated to be 10\sim 10–25 rgr_{\rm g}. Furthermore, they reported the evidence of the corona variability within some individual observations (e.g. the corona can change from 10\sim 10–25 rgr_{\rm g} to 55\sim 55 rgr_{\rm g}, or vice versa, from the beginning to the end of the observation). The method outlined in Chainakun et al. (2023) is a statistical test and the large coronal height (e.g. >25>25 rgr_{\rm g}) is seen only in some partial segments of the light curves. The different lags from different segments can lead to an insignificance of the lags at a particular value when analysing the full light curve, or the multiple lags can be averaged out. However, Chainakun et al. (2023) still found, for the majority of the IRAS 13224–3809 observations, that the corona situated at 10\sim 10–25 rgr_{\rm g}, so with the measurement error of 5\sim 5 rgr_{\rm g} suggested by their method, the coronal height constrained there is mostly comparable to what obtained in this work.

For 1H 0707–495, the RFR model places the corona at h518rgh\sim 5-18~{}r_{\rm g} as well. The hint of increasing source height with count rate is also observed. Moreover, Chainakun et al. (2021) suggested that even if the ML is trained using the power-law PSDs but the observed data have a bending power-law shape, the coronal height can still be forecasted with the accuracy of 0.75\gtrsim 0.75. Then, the obtained accuracy reported here can be lower if the intrinsic shape of the PSD, which is not known prior, has a bending power-law form. Perhaps, the best way might be to train the machine with the PSD with various intrinsic shapes as much as possible, but this will progressively increase computational time consuming and is beyond the scope of this work. Also, when the range and number of the photon indices are changed, the regression results show slight modifications (see Test A and Test B in Fig. 8). This effect, however, should not change the overall trend of the correlation if the range of the photon indices are large enough.

Nevertheless, the nature of 1H 0707–495 is quite uncertain. Some suggested that it should have a low-spin black hole (Done & Jin, 2016), while the mass can possibly be 510×106M\sim 5-10\times 10^{6}~{}M_{\odot} (e.g. Zoghbi, Uttley, & Fabian, 2011; Pan et al., 2016), which is larger than the mass assumed in, e.g., this work and Szanecki et al. (2020) by a factor of 2\sim 2–5. Szanecki et al. (2020) analysed the relativistic reflection spectra from the disc around a maximally spinning black hole and found the corona is very compact, with an extended size no larger than 1rg\sim 1~{}r_{\rm g} and located at h3rgh\sim 3~{}r_{\rm g}. A compact corona at 3\sim 34rg4~{}r_{\rm g} was also proposed by Dauser et al. (2012) and Caballero-García et al. (2018). Contrarily, Wilkins & Fabian (2012) studied the emissivity profiles due to various coronal geometries along with previous constraints from reverberation lags, and inferred a presence of the 1H 0707–495 coronal height at 2rg\sim 2~{}r_{\rm g} that radially extends outwards to a radius of 30rg\sim 30~{}r_{\rm g}. The corona also seems to expand as the source luminosity increases (Wilkins et al., 2014). Dovčiak & Done (2016) suggested that the 1H 0707–495 corona must be large enough to intercept sufficient disc photons to produce the hard X-ray continuum and, during the maximum flux, it could be at 25rg\sim 2-5~{}r_{\rm g} extending further outside to 20rg\sim 20~{}r_{\rm g}. Furthermore, Chainakun et al. (2019) developed a spherical corona model to explain the frequency-dependent time lags of AGN, and found that the 1H 0707–495 corona could be radially extended up to 10rg\sim 10~{}r_{\rm g}. Previous study using a dual lamp-post model also suggested the 1H 0707–495 corona could extend upwards, up to 20rg\sim 20~{}r_{\rm g} (Hancock, Young, & Chainakun, 2023). Although this work assumes a lamp-post geometry, it suggests a dynamic corona of 1H 0707–495 that can move far beyond a few gravitational radii from the central black hole, and an increase of the coronal height with luminosity is also evidenced.

In conclusion, our study highlights the potential of machine learning, specifically regression techniques, for probing the coronal evolution in AGN. The constrained h518rgh\sim 5-18~{}r_{\rm g} in both AGN here supports the height-changing corona and the light-bending scenario (Miniutti & Fabian, 2004), in which the coronal height is correlated to source luminosity. The fluctuations in the gravitational and magnetic fields caused by the black hole and the accretion disc can induce the corona to move closer or farther away from the black hole, leading to variations in the X-ray emission. The results are also in line with recent studies on other AGN such as NGC 5548 that, by probing the UV/optical PSD variation, revealed a dynamic corona chainging its position and probably also its spatial extension on various timescales (Panagiotou et al., 2022).

Acknowledgements

We thank the anonymous referee for comments that improved the paper. This research has received funding support from the NSRF via the Program Management Unit for Human Resources & Institutional Development, Research and Innovation (grant number B16F640076). N.M. thanks Suranaree University of Technology (SUT) and National Astronomical Research Institute of Thailand (NARIT) for the financial support.

Data availability

The observational data were accessed from XMM-Newton Observatory (http://nxsa.esac.esa.int). The response function data analysed here were generated using the kynxilrev model available in https://projects.asu.cas.cz/stronggravity/kynreverb. The code for the developed RFR model to predict the coronal height is available in https://github.com/PChainakun/RanForPSD. Other derived data underlying this article will be shared on reasonable request to the corresponding author.

References

  • Alston et al. (2020) Alston W. N., Fabian A. C., Kara E., Parker M. L., Dovciak M., Pinto C., Jiang J., et al., 2020, NatAs, 4, 597.
  • Arévalo & Uttley (2006) Arévalo P., Uttley P., 2006, MNRAS, 367, 801.
  • Ashton & Middleton (2022) Ashton D. I., Middleton M. J., 2022, MNRAS, 513, 5245.
  • Caballero-García et al. (2018) Caballero-García M. D., Papadakis I. E., Dovčiak M., Bursa M., Epitropakis A., Karas V., Svoboda J., 2018, MNRAS, 480, 2650.
  • Caballero-García et al. (2020) Caballero-García M. D., Papadakis I. E., Dovčiak M., Bursa M., Svoboda J., Karas V., 2020, MNRAS, 498, 3184.
  • Cackett et al. (2014) Cackett E. M., Zoghbi A., Reynolds C., Fabian A. C., Kara E., Uttley P., Wilkins D. R., 2014, MNRAS438, 2980
  • Cackett, Bentz, & Kara (2021) Cackett E. M., Bentz M. C., Kara E., 2021, iSci, 24, 102557.
  • Chiang et al. (2015) Chiang C.-Y., Walton D. J., Fabian A. C., Wilkins D. R., Gallo L. C., 2015, MNRAS, 446, 759.
  • Chainakun et al. (2016) Chainakun, P., Young, A. J., & Kara, E. 2016, MNRAS, 460, 3076
  • Chainakun & Young (2017) Chainakun, P., & Young, A. J. 2017, MNRAS, 465, 3965
  • Chainakun (2019) Chainakun, P. 2019, ApJ, 878, 20
  • Chainakun et al. (2019) Chainakun, P., Watcharangkool, A., Young, A. J., & Hancock, S. 2019, MNRAS, 487, 667
  • Chainakun et al. (2021) Chainakun P., Mankatwit N., Thongkonsing P., Young A. J., 2021, MNRAS, 506, 5318.
  • Chainakun et al. (2022) Chainakun P., Luangtip W., Jiang J., Young A. J., 2022, ApJ, 934, 166.
  • Chainakun et al. (2023) Chainakun P., Nakhonthong N., Luangtip W., Young A. J., 2023, MNRAS, 523, 111.
  • Dauser et al. (2012) Dauser T., Svoboda J., Schartel N., Wilms J., Dovčiak M., Ehle M., Karas V., et al., 2012, MNRAS, 422, 1914.
  • De Marco et al. (2013) De Marco B., Ponti G., Cappi M., Dadina M., Uttley P., Cackett E. M., Fabian A. C., et al., 2013, MNRAS, 431, 2441.
  • Done & Jin (2016) Done C., Jin C., 2016, MNRAS, 460, 1716.
  • Dovčiak & Done (2016) Dovčiak M., Done C., 2016, AN, 337, 441.
  • Emmanoulopoulos et al. (2014) Emmanoulopoulos, D., Papadakis, I. E., Dovčiak, M., & McHardy, I. M. 2014, MNRAS, 439, 3931
  • Emmanoulopoulos et al. (2016) Emmanoulopoulos D., Papadakis I. E., Epitropakis A., Pecháček T., Dovčiak M., McHardy I. M., 2016, MNRAS, 461, 1642.
  • Epitropakis et al. (2016) Epitropakis, A., Papadakis, I. E., Dovčiak, M., et al. 2016, A&A, 594, A71
  • Fabian et al. (2009) Fabian A. C., Zoghbi A., Ross R. R., Uttley P., Gallo L. C., Brandt W. N., Blustin A. J., et al., 2009, Natur, 459, 540.
  • Fabian et al. (2013) Fabian A. C., Kara E., Walton D. J., Wilkins D. R., Ross R. R., Lozanov K., Uttley P., et al., 2013, MNRAS, 429, 2917.
  • González-Martín & Vaughan (2012) González-Martín, O., & Vaughan, S. 2012, A&A, 544, A80
  • Hancock, Young, & Chainakun (2023) Hancock S., Young A. J., Chainakun P., 2023, MNRAS, 520, 180.
  • Jansen et al. (2001) Jansen, F., Lumb, D., Altieri, B., Clavel, J., Ehle, M., Erd, C., et al., 2001, A&A, 365, L1.
  • Jiang et al. (2018) Jiang J., Parker M. L., Fabian A. C., Alston W. N., Buisson D. J. K., Cackett E. M., Chiang C.-Y., et al., 2018, MNRAS, 477, 3711.
  • Jiang et al. (2022) Jiang J., Dauser T., Fabian A. C., Alston W. N., Gallo L. C., Parker M. L., Reynolds C. S., 2022, MNRAS, 514, 1107.
  • Kara et al. (2013a) Kara E., Fabian A. C., Cackett E. M., Miniutti G., Uttley P., 2013, MNRAS, 430, 1408.
  • Kara et al. (2013b) Kara E., Fabian A. C., Cackett E. M., Steiner J. F., Uttley P., Wilkins D. R., Zoghbi A., 2013, MNRAS, 428, 2795.
  • Kara et al. (2016) Kara, E., Alston, W. N., Fabian, A. C., et al. 2016, MNRAS, 462, 511
  • Kara et al. (2019) Kara E., Steiner J. F., Fabian A. C., Cackett E. M., Uttley P., Remillard R. A., Gendreau K. C., et al., 2019, Natur, 565, 198.
  • Miniutti & Fabian (2004) Miniutti G., Fabian A. C., 2004, MNRAS, 349, 1435.
  • Pan et al. (2016) Pan H.-W., Yuan W., Yao S., Zhou X.-L., Liu B., Zhou H., Zhang S.-N., 2016, ApJL, 819, L19.
  • Papadakis et al. (2016) Papadakis, I., Pecháček, T., Dovčiak, M., et al. 2016, A&A, 588, A13
  • Parker et al. (2017) Parker M. L., Alston W. N., Buisson D. J. K., Fabian A. C., Jiang J., Kara E., Lohfink A., et al., 2017, MNRAS, 469, 1553.
  • Parker et al. (2021) Parker M. L., Alston W. N., Härer L., Igo Z., Joyce A., Buisson D. J. K., Chainakun P., et al., 2021, MNRAS, 508, 1798.
  • Pedregosa et al. (2012) Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., Blondel M., et al., 2012, arXiv, arXiv:1201.0490
  • Panagiotou et al. (2022) Panagiotou C., Papadakis I., Kara E., Kammoun E., Dovčiak M., 2022, ApJ, 935, 93.
  • Szanecki et al. (2020) Szanecki M., Niedźwiecki A., Done C., Klepczarek Ł., Lubiński P., Mizumoto M., 2020, A&A, 641, A89.
  • Timmer & Koenig (1995) Timmer J., Koenig M., 1995, A&A, 300, 707
  • Uttley et al. (2014) Uttley, P., Cackett, E. M., Fabian, A. C., Kara, E., & Wilkins, D. R. 2014, A&ARv, 22, 72
  • Wilkins & Fabian (2011) Wilkins D. R., Fabian A. C., 2011, MNRAS, 414, 1269.
  • Wilkins & Fabian (2012) Wilkins D. R., Fabian A. C., 2012, MNRAS, 424, 1284.
  • Wilkins et al. (2014) Wilkins D. R., Kara E., Fabian A. C., Gallo L. C., 2014, MNRAS, 443, 2746.
  • Wilkins & Gallo (2015) Wilkins D. R., Gallo L. C., 2015, MNRAS, 449, 129.
  • Wilkins et al. (2016) Wilkins D. R., Cackett E. M., Fabian A. C., Reynolds C. S., 2016, MNRAS, 458, 200.
  • Zhou & Wang (2005) Zhou X.-L., Wang J.-M., 2005, ApJL, 618, L83. doi:10.1086/427871
  • Zoghbi, Uttley, & Fabian (2011) Zoghbi A., Uttley P., Fabian A. C., 2011, MNRAS, 412, 59.