This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Systematics in Asteroseismic Modelling: Application of a Correlated Noise Model for Oscillation Frequencies

Tanda Li1,2,3,4, Guy R. Davies3,4, Martin Nielsen3,4,5, Margarida S. Cunha6, Alexander J. Lyttle3,4
1 Institute for Frontiers in Astronomy and Astrophysics, Beijing Normal University, Beijing 102206, China
2 Department of Astronomy, Beijing Normal University, Beijing, 100875, China
3 School of Physics and Astronomy, University of Birmingham, Birmingham, B15 2TT, United Kingdom
4Stellar Astrophysics Centre (SAC), Department of Physics and Astronomy, Aarhus University, Ny Munkegade 120, DK-8000 Aarhus C, Denmark
5Center for Space Science, NYUAD Institute, New York University Abu Dhabi, PO Box 129188, Abu Dhabi, United Arab Emirates
6Instituto de Astrofísica e Ciências do Espaço, Universidade do Porto, CAUP, Rua das Estrelas, PT4150-762 Porto, Portugal
E-mail: [email protected]
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract

The detailed modelling of stellar oscillations is a powerful approach to characterising stars. However, poor treatment of systematics in theoretical models leads to misinterpretations of stars. Here we propose a more principled statistical treatment for the systematics to be applied to fitting individual mode frequencies with a typical stellar model grid. We introduce a correlated noise model based on a Gaussian Process (GP) kernel to describe the systematics given that mode frequency systematics are expected to be highly correlated. We show that tuning the GP kernel can reproduce general features of frequency variations for changing model input physics and fundamental parameters. Fits with the correlated noise model better recover stellar parameters than traditional methods which either ignore the systematics or treat them as uncorrelated noise.

keywords:
Star: Modelling – Star: Oscillation – Methods: Statistical
pubyear: 2020pagerange: Systematics in Asteroseismic Modelling: Application of a Correlated Noise Model for Oscillation FrequenciesA.1

1 Introduction

One-dimension stellar models have been widely used for decades to predict the structure, evolution, and oscillations of stars. Systematic errors are expected because stellar models contain assumptions and approximations (e.g., using the mixing-length theory to describe the convection) which do not perfectly reflect the actual physics in stars. In asteroseismic mode frequencies, the most well-known systematic error is the so-called surface term, which appears as a frequency offsets between observations and theoretical predictions computed with the best-fitting structural model. The surface term is caused by the poor modelling of near-surface layers of the star (see details in Ball, 2017), and it is a major source of error in theoretical mode frequencies (\sim5μ\muHz at oscillation frequency with the largest amplitude, i.e., νmax\nu_{\rm max}, for the Sun; Christensen-Dalsgaard et al., 1996). Treatment of the surface term normally follows a deterministic approach with parameterisation based on the so-called ‘surface correction’ formulae (e.g. Kjeldsen et al., 2008; Ball & Gizon, 2014; Sonoi et al., 2015). Previous studies for the Kepler LEGACY sample (Silva Aguirre et al., 2017; Lund et al., 2017) showed that those correction formulae can give good fits to the frequency offsets (Compton et al., 2018). This surface correction is expected to be a smooth function of the mode frequency and some formulae also contain the mode inertia. Secondary systematic errors are caused by other missing physics. As an example, stellar magnetic activity, which is not included in most stellar codes, shifts mode frequencies to a noticeable degree. For the Sun, which is an aged and inactive G-type star, its frequencies of low angular degree modes (\ell\leq 3) shift up to 0.5μ\muHz during a solar cycle (Chaplin et al., 2007). Recent findings based on Kepler (Borucki et al., 2009) data have shown even larger frequency shifting (up to \sim2μ\muHz) in some sun-like stars (Kiefer et al., 2017; Salabert et al., 2018). Howe et al. (2017) notes that the impact of magnetic activity can be treated as a part of the surface term. Parametrising the time variation of the surface correction could be a way to account for the activity-related frequency variation in solar-like oscillators. There are also model errors that have not been well studied or properly treated in modelling. For instance, Ge et al. (2015) stated that systematic differences between observed and model frequencies would be expected for metal-poor stars if an incorrect α\alpha-enhancement value is used in model computations.

In grid-based modelling, systematic uncertainty could also undermine modelling solutions when the frequency resolution is comparable to the observed uncertainty. Here we define the frequency resolution as the difference between neighbouring points of the model grid. For a seismic model grid, neighbouring points are those with consecutive fundamental inputs (mass, matellicity, helium fraction, mixing-length parameter, etc.) and the same mean density. We use the mean density to locate neighbouring points because good-fitting models constrained by oscillation modes normally converge at a similar large separation (Δν\Delta\nu), which tightly correlates to the mean density (see Li et al., 2022). In Figure 1, we demonstrate an example from sequential models of a model grid. Here we take the 1M model at the solar mean density, and we compare the model frequencies with its neighbouring grid points in terms of mass (0.99M and 1.01M models with the closest mean density). The mean frequency resolution is \sim1μ\muHz corresponding to a mass step of 0.01M. This frequency resolution is larger than the typical observed uncertainty of many well-studied solar-like oscillators(Appourchaux et al., 2012; Davies et al., 2016; Lund et al., 2017). For instance, Li et al. (2020a) achieved an average uncertainty at about 0.2μ\muHz when measuring mode frequencies of 36 Kepler subgiants. This is to say, a sparse model grid could be significantly under-sampled for fitting to observed oscillation frequencies.

Refer to caption
Figure 1: Systematic errors and uncertainties in theoretical mode frequencies. Black filled circles are radial (\ell = 0) mode frequencies of the Sun observed by BiSON (Howe et al., 2017) . Open symbols represent mode frequencies of three theoretical models with solar mean density but different input masses: MM = 0.99, 1.00, and 1.01M. Note that the models have the same input helium fraction (YY = 0.26), metallicity ([Fe/H]= 0.0), and mixing-length parameter (αMLT\alpha_{\rm MLT} = 2.1).

Computing a fine model grid with a number of free model inputs is computationally expensive. Interpolating model frequencies based on established model grids like AIMS (Rendle et al., 2019) and BASTA (Aguirre Børsen-Koch et al., 2022) or using a simplex method (e.g. the ‘simplex search’ function in MESA Astero Module Paxton et al., 2015) are possible solutions to this issue. These methods are statistically-sound but not efficient enough to apply to a large sample of stars. A fast approach is homologously scaling mode frequencies of a closely fitting model by a correction factor (rr) to approximate a better fitting model (Kjeldsen et al., 2008). Scaling the mode frequencies by rr changes the seismic large separation (Δν\Delta\nu) by the same ratio and changes the mean density by a factor r2r^{2}. The downside of this method is that the rr-scale transformation is not easily applied to other parameters, such as mass and age.

Traditional fitting strategies normally treat model systematics as white noise. For example, Li et al. (2020b) adopted a uniform white systematic noise when modelling the Kepler subgiants. The scale of noise is determined by the average frequency difference between the observation and the best-fitting model. This treatment is too simple to properly describe the model systematic uncertainty and hence leads to poorly measured uncertainties.

The goal of this work is developing a better statistical treatment for systematics in model frequencies to improve the reliability of detailed modelling based on a stellar model grid. We propose a correlated noise model based on a Gaussian process (GP) kernel (also known as the covariance function) to describe the systematics. As a demonstration of the principle, we discuss the method with radial modes only, but it is extendable to all acoustic modes and also possibly to mixed modes. We introduce the underlying functions of the correlated noise model and discuss the fitting procedure in Section 2. We then apply the new fitting method to characterise fake model star in Section 3 to examine whether fits with the correlated noise model better recover the true stellar parameters. Lastly, we close with some discussions about the new fitting method and conclusions of the paper in Section 4.

2 Method

2.1 Model Systematic Function

2.1.1 Understanding model systematics

Understanding the systematics that affect the oscillation frequencies in a stellar grid is the key to finding proper functional forms with which to describe the model systematics. As seen in Figure 1, the model systematics can be described as a combination of two components. The first is the frequency-dependent offset between the best-fitting model and the observed frequencies. Instead of calling it the surface term, we refer to it as the model systematic error (\mathcal{E}) to represent the systematics caused by all improper and missing physics in a theoretical stellar model. We use a smooth functional form, similar to a surface correction formulae, to model \mathcal{E}.

The second component, as mentioned in Section 1, is the systematic uncertainty (𝒰\mathcal{U}) caused by the frequency resolution of the model grid. The detailed modelling uses global parameters (effective temperature, luminosity, metallicity, etc.) and individual mode frequencies to characterise stars. Comparing the models in Figure 1 we observe that, at a given mean density, changing the mass by a typical mass interval in a grid shifts the mode frequencies horizontally by an amount which increases smoothly with frequency. Changing one of the other model inputs like metallicity, helium fraction, and mixing-length parameter shifts mode frequencies in a similar way. There is also a secondary term to be considered in the systematic uncertainty related to the signature of rapid structural variations in the oscillation frequencies (known as the helium ‘glitch’ signature). The structural variation can be seen in the first adiabatic index (Γ1\Gamma_{1}). Gough (1990) and Houdek & Gough (2007) assumed that the helium glitch signature arises from the second helium ionisation zone. Later studies indicate that this signature is from the Γ1\Gamma_{1} peak between the first and second helium ionisation zones (see Houdayer et al., 2021, for details on modelling of the ionisation region). The helium glitch strongly correlates with the helium fraction in the convective envelope. In Figure 2, we illustrate the signature of the helium glitches extracted from theoretical models with approximately the solar mean density. The glitch signatures follow the sinusoidal function with decaying amplitudes and the signature parameters (amplitude, period, and phase) change with the fundamental inputs (Houdek & Gough, 2007; Verma et al., 2019). For the models presented here, the average frequency shift is at a level of \sim0.1μ\muHz, which are comparable or larger than the observed frequency uncertainties on some Kepler stars (e.g., Li et al., 2020a) thus should not be ignored. Hence, the systematic uncertainty can be described with a two-term functional form. The primary term (𝒰1\mathcal{U}_{1}) which is a very smooth function of frequency and the secondary term (𝒰2\mathcal{U}_{2}) which is a fast-varying function of frequency.

As a result, we describe the model systematics as the combination of the systematic errors (\mathcal{E}) and two systematic uncertainty terms (𝒰1\mathcal{U}_{1} and 𝒰2\mathcal{U}_{2}).

Refer to caption
Figure 2: The helium glitch features extracted with the tool Asterion (https://github.com/alexlyttle/asterion) from theoretical radial mode frequencies. 1st: models with different mass (MM = 0.99, 1.00, and 1.01M{\rm M_{\odot}}), but same metallicity ([Fe/H]= 0.0), helium fraction (YY= 0.26), and mixing-length parameters (αMLT\alpha_{\rm MLT} = 1.9); 2nd: models with same mass (MM = 1.00M{\rm M_{\odot}}), metallicity ([Fe/H]= 0.0), and mixing-length parameters (αMLT\alpha_{\rm MLT} = 1.9) but different input helium fractions (YY= 0.28, 0.26, and 0.24); 3rd: models with same mass (MM = 1.00M{\rm M_{\odot}}), helium fraction(YY= 0.26), and mixing-length parameters (αMLT\alpha_{\rm MLT} = 1.9) but different metallicity([Fe/H]= -0.1, 0.0, +0.1); 4th: models with same mass (MM = 1.00M{\rm M_{\odot}}), metallicity([Fe/H]= 0.0), and helium fraction (YY = 0.26) but different mixing-length parameters (αMLT\alpha_{\rm MLT}= 1.9, 2.1 and 2.3). All models have approximate solar mean density.

2.1.2 The Application of a GP Kernel

As demonstrated above, proper descriptions of systematic errors and uncertainties are smooth functions of frequency. This is expected because mode frequencies are highly correlated following the so-called asymptotic relation. Thus, a white noise model for the systematic noise term is inherently not a good model. Here we suggest a noise model with a smooth function form which is able to consider the correlation between mode frequencies, and we refer to it as correlated noise model (CNM).

The Gaussian Process (GP) kernel, which is used to generate a covariance matrix for a multivariate Normal distribution, is particularly suitable for building up the CNM. Rasmussen & Williams (2006) consider a GP in a functional form domain whereby the conditional GP acts to marginalize over all possible functional forms weighted by the prior (i.e. the kernel) and the data. Here we chose a Squared Exponential (SE) kernel as it follows a smooth functional form and is able to consider the correlation between oscillation frequencies. The SE kernel is described by the covariance matrix

k(ν,ν)=σ2exp((νν)22l2).k(\nu,\nu^{\prime})=\sigma^{2}{\rm exp}\left(-\frac{(\nu-\nu^{\prime})^{2}}{2l^{2}}\right).\\ (1)

The kernel k(ν,ν)k(\nu,\nu^{\prime}) models the joint variability of the Gaussian Process random variables. It returns the modelled covariance between each pair of frequencies ν\nu and ν\nu^{\prime}. When using the kernel to represent the systematic uncertainties, ν\nu are the computed mode frequencies in the model grid and ν\nu^{\prime} are the predicted frequencies based on ν\nu. There are two free parameters in the function: the lengthscale ll and the noise variance σ\sigma. The lengthscale describes how smooth a generated function is (length of the ‘wiggles’), and the noise variance specifies the average distance of the function away from its mean. Tuning these two parameters allows us to generate functional noise forms with different smoothness and varying ranges. In sampling process, the kernel randomly generates frequency noise (ν\nu - ν\nu^{\prime}) for a given ν\nu following the multivariate normal distribution specified by the lengthscale and the noise variance.

In Figure 3, we demonstrate some randomly generated noise realizations using Eq 1 based on a set of computed model frequencies (ν\nu). We use a lengthscale of 5Δν\Delta\nu and a variance of 1.0μ\muHz. As seen, the predicted frequency sets (ν\nu^{\prime}) change in a highly correlated way. We also show a set of random samples from the white noise model (WNM) for comparison. The WNM follows a Gaussian distribution and has the same variance value (σ\sigma = 1.0μ\muHz). The predicted frequencies based on the WNM include spiky features, which obviously do not well reflect the true curvature of radial mode frequencies. We suggest here that a CNM based on the SE kernel is the better representation for model systematics because it is more consistent with our prior belief in the nature of the oscillation frequencies.

Refer to caption
Figure 3: Mode frequency sets sampled with the white noise model and the correlated noise model (based on the SE kernel in Eq. 1) on the Échelle diagram. Black dots represent a set of mode frequencies of a stellar model computed with MESA (Paxton et al., 2011) and GYRE (Townsend & Teitler, 2013) codes. Red dots in the left panel and blue dots in the right panel represent generated noise with the white noise model and the correlated models plus the computed mode frequency.

2.2 Determining the Systematic Function with the SE Kernel

Now we determine the functional form of CNM. Firstly, we describe the systematic error kernel as

k=σ2exp((ν+μν)22l2),k_{\mathcal{E}}=\sigma_{\mathcal{E}}^{2}{\rm exp}\left(-\frac{(\nu+\mu_{\mathcal{E}}-\nu^{\prime})^{2}}{2l_{\mathcal{E}}^{2}}\right),\\ (2)

where μ\mu_{\mathcal{E}} is the mean function, σ\sigma_{\mathcal{E}} and ll_{\mathcal{E}} are the error kernel variance and the error kernel lengthscale, respectively. Secondly, the systematic uncertainty function contains two terms (𝒰1\mathcal{U}_{1}, and 𝒰2\mathcal{U}_{2}) which represent frequency shifts between model grid points. We therefore use two kernels to describe them, written as

k𝒰,1+k𝒰,2=σ𝒰,12exp((νν)22l𝒰,12)+σ𝒰,22exp((νν)22l𝒰,22).\begin{split}&k_{\mathcal{U},1}+k_{\mathcal{U},2}=\\ &\sigma_{\mathcal{U},1}^{2}{\rm exp}\left(-\frac{(\nu-\nu^{\prime})^{2}}{2l_{\mathcal{U},1}^{2}}\right)+\sigma_{\mathcal{U},2}^{2}{\rm exp}\left(-\frac{(\nu-\nu^{\prime})^{2}}{2l_{\mathcal{U},2}^{2}}\right).\\ \end{split} (3)

Here we refer to two kernels (k𝒰,1k_{\mathcal{U},1}, and k𝒰,2k_{\mathcal{U},2}) as the primary uncertainty kernel and the secondary uncertainty kernel, in which σ𝒰\sigma_{\mathcal{U}} and l𝒰l_{\mathcal{U}} are the uncertainty kernel variance and the uncertainty kernel lengthscale, respectively.

Finally, we write the full expression for our CNM as

𝒮=k+k𝒰,1+k𝒰,2.\mathcal{S}=k_{\mathcal{E}}+k_{\mathcal{U},1}+k_{\mathcal{U},2}.\\ (4)

The systematic function considers the systematic errors caused by improper physics in stellar models and the systematic uncertainty due to the model grid steps. To apply the function to stellar mode frequency fits, we need to choose appropriate kernel parameters (μ\mu_{\mathcal{E}}, ll_{\mathcal{E}}, σ\sigma_{\mathcal{E}}, l𝒰,1l_{\mathcal{U},1}, σ𝒰,1\sigma_{\mathcal{U},1}, l𝒰,2l_{\mathcal{U},2}, and σ𝒰,2\sigma_{\mathcal{U},2}). In the following section, we will demonstrate how these terms are determined in an actual fitting process.

2.3 Determining Kernel Parameters

We demonstrate the determination of the kernel parameters of the CNM in the case of the Sun-as-a-star. We use the same stellar model grid adopted by Lyttle et al. (2021). The grid covers parameter ranges (steps) of 0.8 – 1.2 (0.01) M for mass (MM), -0.5 – +0.5 (0.1) for metallicity (M/H), 0.24 – 0.32 (0.02) for initial helium mass fraction (YY), and 1.5 – 2.5 (0.2) for the mixing-length parameter (αMLT\alpha_{\rm MLT}). Observed frequencies of the Sun are taken from the BiSON network (Howe et al., 2017).

2.3.1 Free parameters in the Systematic Error Kernel

Here we discuss the determination of the mean function (μ\mu_{\mathcal{E}}), error kernel lengthscale (ll_{\mathcal{E}}), and error kernel variance (σ\sigma_{\mathcal{E}}). The systematic error kernel is normally determined from frequency differences between observations and the best-fitting model. However, the best-fitting model is unknown to us a priori. Instead, we use specific knowledge of model errors of similar stars to characterise the systematic error and determine three adjusted parameters. In what follows, we justify our choices but note that the method itself could incorporate alternative choices.

Previous studies of the surface correction on stars similar to the Sun provide some useful references for model errors of the Sun. Compton et al. (2018) estimated the surface terms of 66 Kepler main-sequence stars. For stars within a parameter range of TeffT_{\text{eff}} = 5777 ±\pm 250K and logg\log g = 4.44 ±\pm 0.5dex, the relative surface corrections at νmax\nu_{\rm max} (δν(νmax)\delta\nu(\nu_{\rm max})) vary from -0.0015νmax\nu_{\rm max} to -0.0045νmax\nu_{\rm max}. We use this prior information in the form of a weight wνmaxw_{\rm\nu_{max}} given by a super Gaussian function with the exponent raised to a power of 10 (flat-top Gaussian function)

wνmax=exp(((δν(νmax)μνmax)22σνmax2)10).w_{\rm\nu_{max}}={\rm exp}\left(-\left(\frac{(\delta\nu(\nu_{\rm max})-\mu_{\nu_{\rm max}})^{2}}{2\sigma_{\nu_{\rm max}}^{2}}\right)^{10}\right). (5)

Here, μνmax\mu_{\nu_{\rm max}} and σνmax\sigma_{\nu_{\rm max}} are -0.003νmax\nu_{\rm max} and 0.0015νmax\nu_{\rm max}, respectively. This likelihood function returns a weight that is flat when δν(νmax)\delta\nu(\nu_{\rm max}) is in the μνmax±σνmax\mu_{\nu_{\rm max}}\pm\sigma_{\nu_{\rm max}} range, and quickly drops toward zero at μνmax±1.5σνmax\mu_{\nu_{\rm max}}\pm 1.5\sigma_{\nu_{\rm max}}. Moreover, previous studies (Ball & Gizon, 2014; Compton et al., 2018; Li et al., 2020b) have also inferred that the model frequency errors at the low-frequency range, below \sim0.7νmax\nu_{\rm max}, are not as significant as those in high-frequency range. In sun-like stars, the low-frequency-range errors are on average approximately zero with a small amount of 0.001νmax\nu_{\rm max} spread. This can be used as another constraint and we describe it as a Gaussian function as

wlowν=exp(δν(νobs)~22σlowν2),(forνobs0.7νmax),w_{\rm low-\nu}={\rm exp}\left(-\frac{\widetilde{\delta\nu(\nu_{\rm obs})}^{2}}{2\sigma_{\rm low-\nu}^{2}}\right),({\rm for\;}\nu_{\rm obs}\leq 0.7\nu_{\rm max}), (6)

where δν(νobs)~\widetilde{\delta\nu(\nu_{\rm obs})} represents the median of frequency offsets for given observed frequencies, σlowν\sigma_{\rm low-\nu} is adjusted and defines for how much the δν(ν)~\widetilde{\delta\nu(\nu)} deviating from zero is sensible. According to earlier studies (Ball & Gizon, 2014; Compton et al., 2018), we adopt σlowν\sigma_{\rm low-\nu} = 0.001νmax\nu_{\rm max}.

Refer to caption
Refer to caption
Figure 4: Left: The determination of mean function (μ\mu_{\mathcal{E}}) and variation (σ\sigma_{\mathcal{E}}) of the systematic error kernel (\mathcal{E}) while fitting to the Sun-as-a-star. Dots are frequency differences between observed solar data (from BiSON) and theoretical models. These models are from the same evolutionary track with MM = 1.00 M, YY = 0.26, [Fe/H]= 0.0, and αMLT\alpha_{\rm MLT} = 2.1. The colour code indicates the joint weight determined with the two likelihood functions (Eq. 5 and 6). Note that we use all models on the track to determine the mean function and variation but only plot eight models with a joint weight larger than 0.01. The solid line represents the polynomial function that fits the frequency differences (weighted by the joint weight), and we use this as the mean function. The grey shade indicates the weighted standard deviation which is adopted as the variance. Right: Five random draws from the systematic error kernel. The mean function and variance are determined with the method demonstrated in the left panel, and the error kernel lengthscale is 5Δν\Delta\nu.

Now we demonstrate how these two functions are used to characterise the model errors of the Sun. Firstly, we compare observed and model frequencies to calculate frequency differences. We then use Eq. 5 and 6 to obtain a joint weight (wlowνw_{\rm low-\nu}\cdotwνmaxw_{\rm\nu_{max}}) for each model. As a simple example, we use stellar models on an 1M evolutionary track to demonstrate this estimation in the left panel of Figure 4. When the joint weight of each model is obtained, we find a couple of potential good-fitting models (with weight value larger than 0.01) on this track and plot their frequency differences as a function of the frequency. The weight distribution indicates the model error is about -5μ\muHz at 3,000 μ\muHz and goes down to about -10μ\muHz at 4,000μ\muHz. Now we could fit a cubic polynomial function to all frequency differences in Figure 4 against the frequency weighted by the joint weight to estimate the mean of frequency offsets (the solid line). Moreover, we calculate the weighted standard deviations of frequency differences (as illustrated by the grey shade). The fitted polynomial function represents the mean estimate of systematic error in these theoretical models, and the standard deviation infer the range for it to vary. We thus use them as the mean function and noise variance in the systematic error kernel. In our CNM, the lenghtscale determines the degree of correlation between mode frequency noise. By inspecting the surface correction results obtained on the Kepler LEGACY sample (Compton et al., 2018), we notice that the frequency offsets between model and observations mostly follow slowly-varying smooth functions (higthly-correlated). This is to say the error kernel lengthscale ought to be much larger than the large separation. We test a selection of lengthscale values and compare the kernel predictions with the surface correction results. We find a suitable lengthscale of 5Δν\Delta\nu because the kernel with this lengthscale value well reproduces both the very smooth surface terms expected for stars with approximately one solar mass and the slightly curved ones, expected for the more massive F-type stars(see Fig. 8 in Compton et al., 2018). Using the mean function, variance, and lengthscale obtained above, we illustrate some random draws from the systematic error kernel in the right panel of Figure 4.

2.3.2 Free Parameters in the Uncertainty Kernels

Now we determine the lengthscales and variances of the two uncertainty kernels. The two lengthscales and variances are different because the primary kernel describes the general frequency change (a very smooth function of frequency) and the secondary kernel corresponds to the helium glitch signatures (a fast-varying function of frequency). The uncertainty kernels represent the systematic uncertainty at a grid point and their free parameters depend on the local frequency changes between the point and its neighbouring points. Thus, the free parameters in the uncertainty kernels vary for different grid points.

Here we use a simple method to determine free parameters for the grid point at MM = 1M, [Fe/H] = 0.0, YinitY_{\rm init} = 0.26, αMLT\alpha_{\rm MLT} = 2.1, and ρ\rho = ρ\rho_{\odot}. A model grid always contains multiple input dimensions. For the grid in this work, the independent inputs are mass, initial metallicity, initial helium abundance, and initial mixing-length parameters. We therefore inspect the frequency changes for each fundamental input (see details in Figure 8). We find that the primary uncertainty kernel needs a large lengthscale to make it act as a very smooth function. We test different lengthscale from 10Δν\Delta\nu to 30Δν\Delta\nu and adopt l𝒰,1l_{\mathcal{U},1} =20Δν\Delta\nu because it best recovers the general shape of frequency changes in Figure 8. On the other hand, we use a small lengthscale for the secondary uncertainty kernel to match the quick variation of the glitch signature. By inspecting the second-order variations in Figure 8, we notice that the lengths of the ‘wiggles’ are mostly between 2–3 radial orders. The choice of lengthscale needs to be in a similar range to reproduce the fast variations. We hence test different lengthscale values from 1Δν\Delta\nu to 4Δν\Delta\nu and find the kernel with l𝒰,2l_{\mathcal{U},2} = 2Δν\Delta\nu best recovers the test case. Considering the frequency resolution for all input dimensions, we find an average varying range is \sim0.75μ\muHz at 0.5νmax\nu_{\rm max}, \sim1.5μ\muHz at νmax\nu_{\rm max}, and \sim2.0μ\muHz at 1.5νmax\nu_{\rm max}. The variance is hence frequency dependent and we use σ𝒰,1\sigma_{\mathcal{U},1} = 1.5νobs/νmax1.5\nu_{\rm obs}/\nu_{\rm max}. The secondary kernel variance is also frequency-dependent given that the signature of the helium glitch is a damped sine wave. As shown in Figure 2, the secondary uncertainty is up to \sim0.4μ\muHz at 0.5νmax\nu_{\rm max} and gradually reduces to \sim 0 at 1.5νmax\nu_{\rm max}. We therefore set up the secondary variance as σ𝒰,2\sigma_{\mathcal{U},2} = 0.1(νmax/νobs\nu_{\rm max}/\nu_{\rm obs})2. Note that the method of measuring frequency differences and estimating the kernel variation is fairly rough. Firstly, the kernel variation is not uniform through a model grid. Secondly, we measure frequency changes in each input dimension independently, but the four fundamental inputs highly degenerate. The grid resolution of oscillation frequency and the free parameters of the uncertainty kernels should be solved as multiple-dimesions problems. We leave these in future studies.

In Figure 5, we use the two uncertainty kernels to generate some frequency noise as a function of frequency. As seen, the primary uncertainty kernel generates very smooth and highly correlated noise, and what the secondary uncertainty kernel provides is similar to the damped sine wave. The combination of the two kernels gives reasonable predictions of frequency noise between stellar model grids.

Refer to caption
Figure 5: Generated frequency noise using the primary uncertainty kernel (top), the secondary uncertainty kernel (middle), and the combination of two kernels (bottom). The primary kernel has a lengthscale of 20Δν\Delta\nu and a frequency-dependent variance of 1.5νobs/νmax\nu_{\rm obs}/\nu_{\rm max}. The secondary kernel, which describes the change in glitch signature, has a lengthscale of 2Δν\Delta\nu and a variance of 0.1(νmax/νobs\nu_{\rm max}/\nu_{\rm obs})2. In each panel, we demonstrate 10 randomly generated sets of frequency noise, i.e., the (ν\nu - ν\nu^{\prime}) in the systematic uncertainty kernel. Note that we use observed solar frequencies as ν\nu when generating the noise in this plot.

2.4 Likelihood Function

In this section, we discuss the likelihood function that works for the CNM fitting. We start with the likelihood function used in traditional methods. When no systematics are considered and errors on the observed frequencies are assumed to be uncorrelated, it is normally written as

=exp((νobsνmod+δνsc)22σνobs2),\mathcal{L}={\rm exp}\left(-\frac{(\nu_{\rm obs}-\nu_{\rm mod}+\delta\nu_{\rm sc})^{2}}{2\sigma^{2}_{\nu_{\rm obs}}}\right),\\ (7)

where the subscripts ‘obs’ and ‘mod’ stand for the observed and the model quantities. The term δνsc\delta\nu_{\rm sc} represents the surface correction to model frequencies. When we consider model systematic uncertainties and treat them as white noises, the likelihood function becomes

WNM=exp((νobsνmod+δνsc)22(σνobs2+σνsys2)),\mathcal{L}_{\rm WNM}={\rm exp}\left(-\frac{(\nu_{\rm obs}-\nu_{\rm mod}+\delta\nu_{\rm sc})^{2}}{2(\sigma^{2}_{\nu_{\rm obs}}+\sigma^{2}_{\nu_{\rm sys}})}\right),\\ (8)

where σνsys\sigma_{\nu_{\rm sys}} represents the white systematic noise.

In Figures 4 and 5, we generate ν\nu^{\prime} for justifying our choices of kernel parameters. In the fitting process, generating ν\nu^{\prime} with some samplers (e.g., MCMC) and fitting ν\nu^{\prime} to observations is applicable but computationally expensive. In the fitting procedure, we could simply use likelihood functions instead of sampling ν\nu^{\prime}. Because a GP kernel is essentially a covariance matrix for a multivariate Normal distribution, whose probability function follows a multivariate normal distribution. Thus, we can describe CNM as a multivariate normal distribution in the fits and write the likelihood function as

CNM,Nexp(12(νmod+μνobs)T(νmod+μνobs)1).\begin{split}&\mathcal{L}_{\rm CNM,N}\propto&{\rm exp}\left(-\frac{1}{2}(\nu_{\rm mod}+\mu_{\mathcal{E}}-\nu_{\rm obs})^{T}\sum{}^{-1}(\nu_{\rm mod}+\mu_{\mathcal{E}}-\nu_{\rm obs})\right).\\ \end{split} (9)

The subscript ‘N’ stands for the Normal distribution, and the \sum{} represents the covariance matrix based on the SE kernel

=kcom(νmod,νmod)+Iσνobs2.\sum=k_{\rm com}(\nu_{\rm mod},\nu_{\rm mod})+I\cdot\sigma^{2}_{\nu_{\rm obs}}. (10)

The term kcom(νmod,νmod)k_{\rm com}(\nu_{\rm mod},\nu_{\rm mod}) is the combination of three kernels which describe correlated noise. The term Iσνobs2I\cdot\sigma^{2}_{\nu_{\rm obs}} (II is the identity matrix) describes the observed uncertainty of mode frequencies which should be white noise. It should be noted that the kcom(νmod,νmod)k_{\rm com}(\nu_{\rm mod},\nu_{\rm mod}) term does not contain μ\mu_{\mathcal{E}} in the systematic error kernel as we already consider μ\mu_{\mathcal{E}} when comparing model and observed frequencies as shown in Eq. 9.

Moreover, we could consider in addition some potential errors which are unknown or unpredictable. In some cases, a few per cent of observed mode frequencies can be misreported or poorly measured. Some noisy spikes in the oscillation power spectrum could be misclassified as modes; mixed modes close to the =0\ell=0 ridge could be misidentified as radial modes; and frequency uncertainties could be underestimated for un-resolved modes. Due to these additional errors, the probability distribution should contain a small fraction of ‘exceptions’. The tt-distribution is a good option for dealing with these cases. The probability density function of the tt-distribution is similar to the normal distribution but has long and fat tails at both sides. The probability in the tail region represents the chance of a mode frequency being misreported. The likelihood function following the multivariate t-distribution is written as

CNM,t(1+1d(νmodνobs)T(νmodνobs)1)(d+p)/2.\begin{split}&\mathcal{L}_{\rm CNM,{\it t}}\propto\\ &\left(1+\frac{1}{d}(\nu_{\rm mod}-\nu_{\rm obs})^{T}\sum{}^{-1}(\nu_{\rm mod}-\nu_{\rm obs})\right)^{-(d+p)/2}.\\ \end{split} (11)

The subscript ‘tt’ represents the tt-distribution, pp is the dimension of the vector of frequencies, and dd is the number of degrees of freedom that determines the possibility of incorrect measurement. The degrees of freedom d=2d=2 corresponds to a \sim10% probability in the tail regions (outside 3 times half width at half maximum) and we adopt this value in the following analysis.

3 Application in asteroseismic fitting

3.1 Fitting a fake model star

To test whether this new fitting method better recovers the truths of stellar parameters. We fit to a fake model star which has similar surface properties to the Sun. The fake star is computed with same input physics but with off-grid input fundamental parameters, which are M = 1.005M, YY = 0.256, αMLT\alpha_{\rm MLT} = 1.99. The true age, radius, and mean density are τ\tau = 3.99Gyr, RR = 1.007R, and ρ¯\overline{\rho} = 0.984ρ¯\overline{\rho}_{\odot}, respectively. Classical ‘observed’ quantities are TeffT_{\text{eff}} = 5771K, logg\log g = 4.43dex, [Fe/H] = -0.05dex. We adopt observed uncertainties of ±\pm50K for TeffT_{\text{eff}}, ±\pm0.1dex for logg\log g, and ±\pm0.1dex for [Fe/H]. Radial mode frequencies for nn = 12 – 30 are selected as seismic constraints. The global seismic parameters are νmax\nu_{\rm max} = 3058μ\muHz (calculated with the scaling relation given by Kjeldsen & Bedding (1995)) and Δν\Delta\nu = 134.4μ\muHz (from the linear fitting of radial mode frequencies). A uniform observed uncertainty of σνobs=0.5μ\sigma_{\nu_{\rm obs}}=0.5\muHz for mode frequencies is applied. Regarding the free parameters of the kernels, we use fixed parameters determined in Sections 2.3.2 and 2.3.1 in all following tests.

Because the star is a fake model star, no model error is included. The systematic function only contains the uncertainty term. We consider three cases for comparison. We do not consider the systematics for the first case and use Eq. 7 as the likelihood function. The second case includes the white systematic noise and Eq. 8 as the likelihood function. In the third case, we apply the correlated noise model and use the multivariate normal distribution (Eq. 9) to determine the likelihood function. Note that we use the same Maximum Likelihood Estimation (MLE) to fit classical observed frequencies in all these three cases. The posterior distribution is the joint probability of likelihoods from the classical and the seismic fits.

We start with the first and the second cases which either ignore model systematics or treat them as white noise. Frequency-dependent systematic uncertainties as σsys\sigma_{\rm sys} = 1.5νobs/νmax1.5\nu_{\rm obs}/\nu_{\rm max} are used in the second case. Figure 6 shows posterior distributions of five fundamental parameters. Posterior distributions for the first case are spiky because the model grid is significantly under-sampled, leading to poor accuracy and precision. When the white systematic uncertainty is considered, we obtain continuous posterior distributions and more reliable estimates of stellar parameters. We then apply the CNM as described in Section 2 in the fits and illustrate the fitting results at the bottom in Figure 6. Comparing with above traditional methods, we find significant improvements in estimated precisions for mass, radius, and age because CNM is a more principled statistical treatment for the systematics.

Refer to caption
Refer to caption
Refer to caption
Figure 6: Probability distributions of five star parameters of the fake star on the Corner (Foreman-Mackey, 2016) plot. Results include three cases which are fits without any model systematics (top left), with white systematic noise (top right), with a correlated noise model (bottom). The input values of the five stellar parameters are indicated by blue solid lines.

3.2 Fitting a realistic fake star

As a further test of our method, we make the above fake star more realistic by adding the surface term and some random observed noise to oscillation frequencies. To be specific, we refer to it as the ‘realistic fake star’. We add the solar surface term using the correction formula given by Kjeldsen et al. (2008) with the two adjusted parameters as aa = -4.73 and bb = 4.90. Some random noise following the Gaussian distribution (σobs\sigma_{\rm obs} = 0.5μ\muHz) is added to the frequencies. We generate several sets of ‘observed’ frequencies and choose the one with a few frequencies obviously deviating from the asymptotic relation (as shown on the top left in Figure 7). This is to mimic the case of poorly measured modes. All the observed quantities are the same except Δν\Delta\nu because adding the surface term changes its value. We fit the new mode frequencies with a linear function and obtain an observed Δν\Delta\nu of 133.4μ\muHz.

The full systematic function is applied in the fitting. We first fit to three classical observed quantities (TeffT_{\text{eff}}, logg\log g and [Fe/H]) using the MLE method and select models with likelihood greater than 10410^{-4}. We then determine the mean function (μ\mu) and the variance (σ\sigma_{\mathcal{E}}) for the error kernel (𝒦\mathcal{K}_{\mathcal{E}}) with the selected models. As illustrated on the top right in Figure 7, we calculate the prior likelihoods with Eqs 5 and 6 and calculate the weighted median and the weighted standard deviation for observed frequencies. The uncertainty kernel is defined in the same way as in the fits to the fake model star.

We fit models to the data and calculate the likelihood with the multivariate Normal distribution (Eq 9) as well as the multivariate tt-distribution (Eq. 11). We show posterior distributions for the two cases at the bottom of Figure 7. Estimated mass, radius, and age for both cases are consistent with the input parameters. Comparing results with two different likelihood functions, we find better age precision from the fits with multivariate tt-distribution but no obvious improvements for estimated mass and radius. From the joint distributions, we find larger age spreads against the helium fraction and the mixing-length parameter for the Normal-distribution case. This is because those poorly-measured modes form some small-scale curvatures which affect the glitch signature. This is to say, those bad modes weaken the indications of mode frequencies to the helium abundance. The Normal distribution likelihood function does not consider any exceptions in oscillation frequencies and hence treat those mode shifts as helium glitches, leading to larger spreading in the posterior of the helium fraction. On the other hand, the likelihood function with tt-distribution can properly interpret them as potential errors and hence avoid over-explaining the data.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 7: Fitting procedure for the ‘realistic fake star’. Top left: ‘observed’ radial mode frequencies (blue dots) generated with model frequencies (black dots) plus the solar surface term and some random noises. Top right: the determination of mean function of model errors (μ\mu in Eq. 4). Dots indicates frequency offsets between models and observations, and colour code represent the joint probability of two priors (lowννmax\mathcal{L}_{\rm low-\nu}\cdot\mathcal{L}_{\rm\nu_{max}}) of each model. Black solid line and grey shade are weighted median and weighted standard deviation of frequency offsets. Black dashed line stands for true model errors. Bottom left: posterior distributions of five stellar parameters based on the multivariate Normal distribution (Eq. 9) on the Corner plot. Truths are presented by blue lines. Bottom right: same as the left, but for the the multivariate tt-distribution (Eq. 9) case.

4 Discussions and Conclusions

We propose a correlated noise model based on a Gaussian Process kernel (covariance function) to better describe model systematics in stellar models constrained by asteroseismic mode frequencies. The work is motivated by poor treatments of model systematics in theoretical mode frequencies which undermine modelling solutions. The model systematics include errors caused by improper physics and uncertainties due to the grid resolution. We use a GP kernel because it has infinitely many derivatives in its prior to represent all possible frequency variations between the points of a model grid. We show that this CNM generates mode frequencies that better match our expectations than the white noise model (Figure 3). In practice, we describe the systematics as a mean function of frequency offsets and several kernels. We manage to use these kernels to reproduce frequency variations at different scales by tuning the two free parameters, i.e., kernel lengthscale and kernel variance. We apply the method to fitting a simulated set of model frequencies. In this work, fits with this new CNM outperform the other two traditional methods which either ignores the systematics or treat them as white noises. We also suggest using the tt-distribution likelihood function in the fitting to cope with potentially misreported mode frequencies. Our testing shows that the tt-distribution likelihood function better recovers the properties of the simulated star compared with the Normal-distribution.

The new fitting approach includes a better description for the model systematics and hence improves the reliability of modelling solutions. This is a novel alternative to account for the effect of the limited resolution of the grid. It mitigates the issues in seismic modelling, as found by Cunha et al. (2021), that when the errors on the frequencies are not inflated the uncertainties on the inferred stellar properties are often underestimated. We also treat the surface term (model errors) as a mean function plus a covariance function instead of parameterising a specific correction formula. The method can be applied to any established model grid. It is fast and statistically-sound in the grid-based framework and hence suitable for modelling a large sample of stars. The method could be useful for the PLATO stellar analysis pipeline (Gent et al., 2021), which is developed for fast determinations of stellar parameters in the core program of the mission. This work only demonstrates the application to radial modes, but the method is extendable to all acoustic modes and also possibly to mixed modes.

There are also some limitations. Although the correlated noise model is a reasonable prediction of model systematics, fitting with it is still not as good as interpolating the grid (assuming the error on interpolation is significantly smaller than the frequency errors). The fitting approach is therefore not the best option when precision is essential. The second limitation is in the determination of the mean function appearing in the systematic error kernel. It relies on previous studies on similar stars to give reliable priors. Here we obtain a good mean function for the example star, because its parameters fit in many well-studied Sun-like stars. To apply this method to other types of stars, a well-studied star sample that covers wide parameter ranges is required for estimating the mean function. Moreover, we note that the functional form of kernels can change for different input physics or free parameters of model grids. Analysing model systematics is required before applying the method to model grids. In future work, we will introduce additional machine-learning tools to learn the model systematics across the HR diagram to determine the functional form in a robust way. This will make the method applicable for many stars without customising the GP kernels.

Acknowledgements

This paper has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (CartographY GA. 804752). This work is supported by the Joint Research Fund in Astronomy (U2031203) under cooperative agreement between the National Natural Science Foundation of China (NSFC) and Chinese Academy of Sciences (CAS), and NSFC grants (12090040, 12090042). This work is also supported by the Fundamental Research Funds for the Central Universities. MBN acknowledges support from the UK Space Agency. MSC acknowledges the support by FCT/MCTES through the research grants UIDB/04434/2020, UIDP/04434/2020 and PTDC/FIS-AST/30389/2017 and the contract CEECIND/02619/2017, and by FEDER - Fundo Europeu de Desenvolvimento Regional through COMPETE2020 - Programa Operacional Competitividade e Internacionalização (grant: POCI-01-0145-FEDER-030389).

Data availability

The data underlying this article are available in GitHub, at https://github.com/litanda/csnm_fitting.

References

  • Aguirre Børsen-Koch et al. (2022) Aguirre Børsen-Koch V., et al., 2022, MNRAS, 509, 4344
  • Appourchaux et al. (2012) Appourchaux T., et al., 2012, A&A, 543, A54
  • Ball (2017) Ball W. H., 2017, in EPJ Web of Conferences. p. 02001
  • Ball & Gizon (2014) Ball W. H., Gizon L., 2014, A&A, 568, A123
  • Borucki et al. (2009) Borucki W., et al., 2009, in Pont F., Sasselov D., Holman M. J., eds, Vol. 253, Transiting Planets. pp 289–299, doi:10.1017/S1743921308026513
  • Chaplin et al. (2007) Chaplin W. J., Elsworth Y., Miller B. A., Verner G. A., New R., 2007, ApJ, 659, 1749
  • Christensen-Dalsgaard et al. (1996) Christensen-Dalsgaard J., et al., 1996, Science, 272, 1286
  • Compton et al. (2018) Compton D. L., Bedding T. R., Ball W. H., Stello D., Huber D., White T. R., Kjeldsen H., 2018, MNRAS, 479, 4416
  • Cunha et al. (2021) Cunha M. S., et al., 2021, MNRAS, 508, 5864
  • Davies et al. (2016) Davies G. R., et al., 2016, MNRAS, 456, 2183
  • Foreman-Mackey (2016) Foreman-Mackey D., 2016, The Journal of Open Source Software, 1, 24
  • Ge et al. (2015) Ge Z. S., Bi S. L., Li T. D., Liu K., Tian Z. J., Yang W. M., Liu Z. E., Yu J., 2015, MNRAS, 447, 680
  • Gent et al. (2021) Gent M. R., et al., 2021, arXiv e-prints, p. arXiv:2111.06666
  • Gough (1990) Gough D. O., 1990, in Osaki, Yoji and Shibahashi, Hiromoto, Progress of Seismology of the Sun and Stars. Vol. 367, doi:10.1007/3-540-53091-6,
  • Houdayer et al. (2021) Houdayer P. S., Reese D. R., Goupil M.-J., Lebreton Y., 2021, A&A, 655, A85
  • Houdek & Gough (2007) Houdek G., Gough D. O., 2007, MNRAS, 375, 861
  • Howe et al. (2017) Howe R., Basu S., Davies G. R., Ball W. H., Chaplin W. J., Elsworth Y., Komm R., 2017, MNRAS, 464, 4777
  • Kiefer et al. (2017) Kiefer R., Schad A., Davies G., Roth M., 2017, A&A, 598, A77
  • Kjeldsen & Bedding (1995) Kjeldsen H., Bedding T. R., 1995, A&A, 293, 87
  • Kjeldsen et al. (2008) Kjeldsen H., Bedding T. R., Christensen-Dalsgaard J., 2008, ApJ, 683, L175
  • Li et al. (2020a) Li Y., Bedding T. R., Li T., Bi S., Stello D., Zhou Y., White T. R., 2020a, MNRAS, 495, 2363
  • Li et al. (2020b) Li T., Bedding T. R., Christensen-Dalsgaard J., Stello D., Li Y., Keen M. A., 2020b, MNRAS, 495, 3431
  • Li et al. (2022) Li T., Li Y., Bi S., Bedding T. R., Davies G., Du M., 2022, ApJ, 927, 167
  • Lund et al. (2017) Lund M. N., et al., 2017, ApJ, 835, 172
  • Lyttle et al. (2021) Lyttle A. J., et al., 2021, MNRAS, 505, 2427
  • Paxton et al. (2011) Paxton B., Bildsten L., Dotter A., Herwig F., Lesaffre P., Timmes F., 2011, The Astrophysical Journal Supplement Series, 192, 3
  • Paxton et al. (2015) Paxton B., et al., 2015, The Astrophysical Journal Supplement Series, 220, 15
  • Rasmussen & Williams (2006) Rasmussen C. E., Williams C. K. I., 2006, Gaussian Processes for Machine Learning
  • Rendle et al. (2019) Rendle B. M., et al., 2019, MNRAS, 484, 771
  • Salabert et al. (2018) Salabert D., Régulo C., Pérez Hernández F., García R. A., 2018, A&A, 611, A84
  • Silva Aguirre et al. (2017) Silva Aguirre V., et al., 2017, ApJ, 835, 173
  • Sonoi et al. (2015) Sonoi T., Samadi R., Belkacem K., Ludwig H. G., Caffau E., Mosser B., 2015, A&A, 583, A112
  • Townsend & Teitler (2013) Townsend R. H. D., Teitler S. A., 2013, MNRAS, 435, 3406
  • Verma et al. (2019) Verma K., Raodeo K., Basu S., Silva Aguirre V., Mazumdar A., Mosumgaard J. R., Lund M. N., Ranadive P., 2019, MNRAS, 483, 4678

Appendix A

A.1 Inspecting frequency differences between the grid points

There are two free parameters in the RBF kernel, i.e., the lengthscale and the variance. Proper choices of kernel parameters are the key to the noise model. We inspect the frequency change between consecutive grid points to determine the kernel parameters. We present frequency differences between models with approximately the solar mean density in Figure 8. As shown, frequency differences can be described as a smooth function of the frequency plus a damped sine function (the signature of the helium glitch). The systematic uncertainty hence contains two kernels. The primary kernel should have a large lengthscale and a frequency-dependent variance for each input dimension. The secondary kernel represents the signature of the helium glitch. It hence has a relatively small lengthscale and some variances decreasing with the frequency.

Refer to caption
Figure 8: Mode frequency changes between consecutive grid points in four input dimensions. All presented models have approximately solar mean density.