This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Evidence that the Occurrence Rate of Hot Jupiters around Sun-like Stars Decreases with Stellar Age

Shota Miyazaki Institute of Space and Astronautical Science, Japan Aerospace Exploration Agency, 3-1-1 Yoshinodai, Chuo, Sagamihara, Kanagawa 252-5210, Japan Kento Masuda Department of Earth and Space Science, Graduate School of Science, Osaka University, 1-1 Machikaneyama, Toyonaka, Osaka 560-0043, Japan
Abstract

We investigate how the occurrence rate of giant planets (minimum mass >0.3MJup>0.3\,M_{\mathrm{Jup}}) around Sun-like stars depends on the age, mass, and metallicity of their host stars. We develop a hierarchical Bayesian framework to infer the number of planets per star (NPPS) as a function of both planetary and stellar parameters. The framework fully takes into account the uncertainties in the latter by utilizing the posterior samples for the stellar parameters obtained by fitting stellar isochrone models to the spectroscopic parameters, Gaia DR3 parallaxes, and 2MASS KsK_{\rm s}-band magnitudes adopting a certain bookkeeping prior. We apply the framework to 46 Doppler giants found around a sample of 382 Sun-like stars from the California Legacy Survey catalog that publishes spectroscopic parameters and search completeness for all the surveyed stars. We find evidence that the NPPS of hot Jupiters (orbital period P=1P=110days10\,\mathrm{days}) decreases roughly in the latter half of the main sequence over the timescale of 𝒪(Gyr)\mathcal{O}(\mathrm{Gyr}), while that of cold Jupiters (P=1P=110yr10\,\mathrm{yr}) does not. Assuming that this decrease is real and caused by tidal orbital decay, the modified stellar tidal quality factor QQ^{\prime}_{\star} is implied to be 𝒪(106)\mathcal{O}(10^{6}) for a Sun-like main-sequence star orbited by a Jupiter-mass planet with P3daysP\approx 3\,\mathrm{days}.

planets and satellites: dynamical evolution and stability
software: corner (Foreman-Mackey, 2016), JAX (Bradbury et al., 2021), Numpyro (Phan et al., 2019; Bingham et al., 2019), jaxstar (Masuda, 2022, 2023)

1 Introduction

Quantifying the occurrence rate of hot Jupiters (HJs) is important for understanding their formation and evolution (Dawson & Johnson, 2018), and this has been the focus of many previous works. The rates estimated for Sun-like stars are all roughly around 1%, but the results based on different survey samples show differences of marginal significance (e.g., Wright et al., 2012; Howard et al., 2012). This may partly be due to the unaccounted correlation between the HJ occurrence and certain properties of the stars, which may vary among the stellar samples that were used to derive the occurrence rates. Stellar metallicity shows a strong correlation with the occurrence rate of giant planets and has long been recognized as one such important variable (Santos et al., 2004; Fischer & Valenti, 2005; Guo et al., 2017). More recently, it has been argued that stellar environments may also play an important role. For example, Brucalassi et al. (2016, 2017) estimated the HJ occurrence rate in the solar-metallicity, solar-age open cluster M67 to be 5.73.0+5.5%5.7^{+5.5}_{-3.0}\%, which may be higher than that around field stars. Winter et al. (2020) also claimed that the occurrence rate of close-in planets including HJs is enhanced around stars that are clustered in the position-velocity phase space, although the key driver of this correlation is still debated (e.g., Kruijssen et al., 2021). Mustill et al. (2022), in particular, pointed out that the correlation found by Winter et al. (2020) can instead be a manifestation of the age dependence: stars in phase-space overdensity are also kinematically cold and tend to be younger, and so the correlation may arise if HJs prefer younger hosts.

This paper primarily focuses on the age dependence of the HJ occurrence, which has been less well explored than the dependence on the other parameters. Given that the orbital periods of HJs are often shorter than the rotation periods of their host stars, the possibility that HJs may spiral into the central star due to tides raised on it has been discussed since the discovery of 51 Peg b (Rasio et al., 1996; Valsecchi & Rasio, 2014; Valsecchi et al., 2014). While it has been difficult to theoretically predict the orbital decay timescale due to the complex nature of tidal dissipation (Ogilvie, 2014), there is growing observational evidence in favor of ongoing decays. Long-term transit monitoring of WASP-12b (Maciejewski et al., 2016; Patra et al., 2017; Yee et al., 2020) and Kepler-1658b (Vissapragada et al., 2022) indicate that the orbital periods as measured directly by successive transits are decreasing. The rapid rotation reported for HJ hosts may also be a signature of stellar spin-up associated with orbital decay, i.e., angular momentum transfer from the orbit to spin (e.g., Penev et al., 2018; Tejada Arevalo et al., 2021). Even the detection of an infrared transient associated with the engulfment of a short-period planet has been reported (De et al., 2023). Of particular relevance to our paper is the work by Hamer & Schlaufman (2019), who showed that Sun-like stars known to host HJs have a smaller Galactic velocity dispersion than their control stars without known planets, and those with known cold Jovian planets. They interpreted this difference as evidence that HJ hosts are on average younger than their control stars, suggesting that a non-negligible fraction of HJs are tidally disrupted by their host stars during the main sequence lifetime.

However, it still remains quantitatively unclear how the orbital decay drives the evolution of the entire population of HJs. The two systems with detected decay, for example, may well be atypical cases showing the quickest decays that are easiest to detect, perhaps aided by enhanced tidal dissipation due to host stars evolving off the main sequence (e.g., Schlaufman & Winn, 2013; Weinberg et al., 2017). The study of Hamer & Schlaufman (2019) does not relate the velocity dispersion with the absolute age scale. Even after such a conversion, it will not be straightforward to interpret the inferred mean age of the HJ hosts, because it depends not only on the occurrence rate of HJs as a function of age but also on the age distribution of the stars from which these planets have been drawn (see also Section 3.1). It is difficult to test whether the latter is the same as their control stars or stars with cold Jupiters because of the inhomogeneous nature of their HJ sample. The HJ occurrence around red giants reported by Temmink & Snellen (2023) may not even support the qualitative conclusion of Hamer & Schlaufman (2019), although the estimate based on four HJs still has a large uncertainty.

Here we attempt to infer the HJ occurrence rate as a function of the stellar age, which directly probes what fraction of HJs is affected by orbital decay on what timescale, and potentially provides insights into the physics of tidal dissipation and formation of HJs (e.g., Matsakos & Königl, 2016; Owen & Lai, 2018). Unlike in Hamer & Schlaufman (2019), this requires the knowledge of not only planet-hosting stars but also of all the stars from which these planets have been detected, as well as the age estimates for all of them. To meet these requirements, here we focus on the California Legacy Survey (CLS) sample (Rosenthal et al., 2021, hereafter R21) — which published the information of all the surveyed stars including the search completeness — and leverage isochronal ages that can be estimated homogeneously for all the stars. This is still a challenging task because main-sequence stars change little with age and thus the uncertainty of the isochronal age is typically large due to degeneracy with other stellar properties and also varies depending on the evolutionary phase of individual stars (Soderblom, 2010a). In this paper, we develop a hierarchical Bayesian framework that allows us to properly handle the large (but statistically well-defined) uncertainty of the isochrone-based age, as well as the degeneracy with stellar mass and metallicity.

The structure of this paper is as follows. In Section 2, we describe our isochrone analysis of the CLS stars and construct a subsample of the CLS stars that we use for the occurrence rate analysis. In Section 3, we present our Bayesian framework for inferring the occurrence rate of planets as a function of both planetary properties (mass, orbital period, radius, etc.) and stellar properties (mass, metallicity, age, etc.). In Section 4, we apply the framework to the Sun-like star sample defined in Section 2 and infer the occurrence rate of hot and cold Jupiters as a function of stellar parameters. In Section 5, we discuss constraints on the stellar tidal quality factor and implications for other survey results. Section 6 summarizes the paper.

Refer to caption
Figure 1: Black crosses show the absolute Gaia GG-band magnitudes MGM_{G} and the effective temperatures TeffT_{\rm eff} of 697 CLS stars for which we derived isochrone parameters. 382 stars in our Sun-like sample defined in Section 2 are shown as orange circles. Among them, 37 stars (cyan squares) host cold Jupiters, and 9 stars (red stars) host hot Jupiters.
Refer to caption
Figure 2: The minimum RV masses MsiniM\sin{i} and orbital periods PP of planets in our Sun-like star sample (filled circles). Planets are colored by median values of ages in the posterior PDFs that are obtained by the stellar isochrone fitting described in Section 2.2. The red and cyan rectangles show the definition of hot and cold Jupiters in our analysis, respectively. The background color map and white contours represent the mean detection efficiencies (survey completeness) of planets in the Sun-like star sample calculated from the injection-recovery simulation results published in Rosenthal et al. (2021).
Refer to caption
Figure 3: The result of the isochrone fitting for the 382 stars in the Sun-like sample. (Left panel): Medians of the posterior distributions in the plane of metallicity [Fe/H] and age. The red stars and cyan circles represent hosts of hot Jupiter (HJ) and cold Jupiter (CJ), respectively. The typical 68% credible interval in each posterior distribution is shown at the bottom left. (Right panel): Same as the left panel, but for the stellar mass and age.
Refer to caption
Figure 4: Corner plots for the posterior samples of the parameters of 9 HJ hosts in the Sun-like sample. The inferred age, mass, and metallicity are highly correlated. This is also the case for the other stars in the Sun-like sample. This figure was created using corner.py (Foreman-Mackey, 2016).
Refer to caption
Figure 5: Cumulative distribution functions (CDFs) of the ages for the HJ hosts (red), CJ hosts (blue), and all the stars (gray) in the Sun-like sample, respectively. Here the “age” for each star is evaluated as the median of the marginalized posterior distribution.

2 Sample Description

2.1 The California Legacy Survey

The CLS catalog presented by R21 contains 179 planets detected around 719 nearby FGKM dwarfs based on the radial velocity (RV) data from the California Planet Search team (Howard et al., 2010). R21 made various data related to the CLS survey available, including spectroscopic parameters of the stars, properties of detected planets derived from RVs, and planet detection efficiencies (completeness),111https://github.com/leerosenthalj/CLSI which makes the catalog an ideal resource for occurrence rate studies. In the following subsections, we will define a subset of Sun-like stars from the CLS catalog for our occurrence analysis. The stars in our sample are shown in Figure 1 with filled orange circles. We will use the spectroscopic parameters provided in the catalog for stellar isochrone fitting in Section 2.2, and utilize the planetary parameters and detection efficiencies for inferring the planet occurrence rate in Section 4.

2.2 Isochrone fitting

We fit the observed effective temperature TeffT_{\rm eff}, iron metallicity [Fe/H], KsK_{\rm s}-band magnitude KsK_{\rm s}, and parallax ϖ\varpi of the CLS stars with the MIST models (Paxton et al., 2011, 2013, 2015; Choi et al., 2016), and derive physical parameters of the stars including mass and age. The outcome is the samples drawn from the joint posterior distribution for the stellar parameters of each star, obtained using the jaxstar software222https://github.com/kemasuda/jaxstar (Masuda, 2022, 2023). The samples will be used for inferring the occurrence rate of planets in Section 3.

We use the measurements of TeffT_{\rm eff} and [Fe/H] from the CLS catalog, which were obtained in R21 by applying the Specmatch (Petigura, 2015) algorithm to the Keck-HIRES high-resolution spectra and using the Isoclassify (Morton, 2015; Huber et al., 2017; Berger et al., 2020) software package.333Strictly speaking, it is ideal here to use the direct outputs of Specmatch that do not rely on stellar models. We implicitly assume that this minor inconsistency does not affect the inferred age dependence given the errors we assigned for these parameters. We assume Gaussian errors of 100 K and 0.1 dex for TeffT_{\rm eff} and [Fe/H], respectively. We exclude five stars without TeffT_{\rm eff} measurements and eight stars with [Fe/H]<1<-1 in the CLS catalog. We collect the parallax measurements (ϖ\varpi) from Gaia DR3 (Gaia Collaboration et al., 2016, 2022) by cross-referencing using their coordinates and magnitudes. We correct the zero-points of the parallax values following the recipe of Lindegren et al. (2021) and their error bars following El-Badry et al. (2021). We obtain KsK_{\rm s}-band magnitudes from the Two Micron All Sky Survey (2MASS; Skrutskie et al., 2006) using the Gaia DR3 source IDs and correct the measurements for extinction using the dust map of Bayestar17 (Green et al., 2019), although the corrections turned out to be negligible in most cases due to their proximity and near-infrared magnitudes. We removed stars without the 2MASS IDs. Six stars in the CLS catalog do not have KsK_{\rm s}-band measurements, so we instead use HH-band 2MASS magnitudes for these stars. In the end, all the necessary information was found for 697 stars.

We then perform isochrone fitting for the 697 stars. By fitting, we mean drawing samples from the joint posterior probability density function (PDF) of stellar parameters 𝜽\bm{\theta}, p0(𝜽|D)p_{0}(\bm{\theta}|D), conditioned on the data DD and a certain prior PDF p0(𝜽)p_{0}(\bm{\theta}) for “bookkeeping.”444The inference of the occurrence rate will not depend explicitly on the prior adopted here; see Section 3. Given the measured values of 𝒚obs={Teff,[Fe/H],ϖ,Ks}\bm{y}^{\rm obs}=\{T_{\rm eff},\mathrm{[Fe/H]},\varpi,K_{\rm s}\} and their errors 𝝈obs\bm{\sigma}^{\rm obs}, the likelihood function (D|𝜽)\mathcal{L}(D|\bm{\theta}) based on the assumption that errors follow Gaussian distributions and are independent is:

(D|𝜽)=i12πσiobsexp[12(yiobsyi(𝜽)σiobs)2],\displaystyle\mathcal{L}(D|\bm{\theta})=\prod_{i}\frac{1}{\sqrt{2\pi}\sigma^{\rm obs}_{i}}\exp{\left[-\frac{1}{2}\left(\frac{y^{\rm obs}_{i}-y_{i}(\bm{\theta})}{\sigma^{\rm obs}_{i}}\right)^{2}\right]}, (1)

where yi(𝜽)y_{i}(\bm{\theta}) is computed (in a deterministic manner) by linearly interpolating the MIST grids of (t,[Fe/H],e)(t_{\star},\mathrm{[Fe/H]},e) and by ϖ=1/d\varpi=1/d, where tt_{\star}, ee, and dd represent the stellar age, equivalent evolutionary phase (EEP; Dotter, 2016), and distance to the star, respectively. We then sample from:

p0(𝜽|D)(D|𝜽)p0(𝜽).\displaystyle p_{0}(\bm{\theta}|D)\propto\mathcal{L}(D|\bm{\theta})\,p_{0}(\bm{\theta}). (2)

We assume that the prior PDF p0(𝜽)p_{0}(\bm{\theta}) is separable as p0(𝜽)=p0(t,[Fe/H],e)p0(d)p_{0}(\bm{\theta})=p_{0}(t_{\star},\mathrm{[Fe/H]},e)\,p_{0}(d) and set:

p0(t,[Fe/H],e)|(t,[Fe/H],M)(t,[Fe/H],e)|\displaystyle p_{0}(t_{\star},\mathrm{[Fe/H]},e)\propto\left|\frac{\partial(t_{\star},\mathrm{[Fe/H]},M_{\star})}{\partial(t_{\star},\mathrm{[Fe/H]},e)}\right| (3)

in order that the prior PDF is constant in the parameter space of (t,[Fe/H],M)(t_{\star},\mathrm{[Fe/H]},M_{\star}) where valid stellar isochrone models exist (Morton, 2015; Masuda, 2022). The priors for age, [Fe/H], and EEP are bounded within (0.1,13.8)(0.1,13.8) Gyr, (0.5,0.5)(-0.5,0.5), and (0,600)(0,600), respectively. For the distance prior p0(d)p_{0}(d), we adopt an exponentially decreasing volume density prior with a length scale of 1.35 kpc (Bailer-Jones, 2015; Astraatmadja & Bailer-Jones, 2016). We obtain 10,000 posterior samples for each of the 697 stars in the CLS catalog (black crosses in Figure 3). For all the parameters, the Gelman–Rubin convergence diagnostics R^\hat{R} (Gelman & Rubin, 1992) computed by splitting a single chain were R^<1.01\hat{R}<1.01 in most stars and R^<1.2\hat{R}<1.2 in all the stars.

2.3 The Sun-like Star Sample

In this work, we focus on planets around Sun-like stars, for which models are more accurately calibrated than for lower- and higher-mass stars (Soderblom, 2010b). We first removed 25 poorly-fitted stars whose marginalized posterior PDFs do not contain any of the measured values within 90% credible intervals, resulting in 672 stars. We construct the subsample of Sun-like stars by choosing the stars whose mass and [Fe/H], as evaluated using the median of the posterior samples, satisfy 0.8<Mmed/M<1.20.8<M^{\rm med}_{\star}/M_{\odot}<1.2 and 0.4<[Fe/H]med<0.4-0.4<\mathrm{[Fe/H]}^{\rm med}<0.4, respectively, and then we are left with 382 stars. Typical 68%68\% credible intervals for each posterior are approximately 0.05M0.05\,M_{\odot} in mass, 0.09dex0.09\,{\rm dex} in [Fe/H], and 2.8Gyr2.8\,{\rm Gyr} in age. A previous comparison with asteroseismic stars demonstrated no significant bias larger than these uncertainties for the stars in the above mass-metallicity range (Masuda, 2022).

The Sun-like sample contains 124 planets shown in Figure 2. We focus on the following two classes of planets:

  • Hot Jupiter (HJ): Planets with orbital period PP ranging from 1–10days\;{\rm days} and minimum mass MsiniM\sin{i} between 0.3–10MJup\;M_{\rm Jup}.

  • Cold Jupiter (CJ): Planets with orbital period PP between 1–10years\;{\rm years} (corresponding to approximately 1–5 au in semi-major axis) and minimum mass MsiniM\sin{i} in the range of 0.3–10MJup\;M_{\rm Jup}.

The corresponding regions of the parameter space are shown by red and cyan rectangles in Figure 2. Also shown by the background color map and white contours are the mean detection efficiencies as computed using the results of injection-recovery simulation reported in R21. The minimum masses of the planets MsiniM\sin{i} and the detection efficiencies are recalculated using the host masses MM_{\star} derived from isochrone fitting. For most of the HJs and CJs as defined here, the uncertainties in the minimum mass and orbital period are smaller than the bin size adopted for displaying the detection efficiency.

2.4 Initial Investigation of the Age Dependence: A Simple Statistical Test and its Problems

In Figure 2, each planet is color-coded based on the age of the host star, here evaluated as the median of the marginal posterior. Albeit with large uncertainties (see also Figures 4), this figure already hints that stars hosting HJs (all having reddish colors) tend to be younger than those with CJs (mixture of blue and red).555One may think that the lower limit of the HJ’s mass, Msini=0.3MJupM\sin{i}=0.3M_{\rm Jup}, can impact our result because a relatively older planet-host exists at (P,Msini)=(6day,70M)(P,M\sin{i})=(6\;{\rm day},70M_{\oplus}) in Figure 2. We confirmed our conclusions remained unchanged when the planets down to Msini=0.1MJupM\sin{i}=0.1\,M_{\rm Jup} were considered as HJs. The same trend is also seen in Figure 3, which displays the distribution of the posterior medians of ([Fe/H],M,t)(\mathrm{[Fe/H]},M_{\star},t_{\star}) in the Sun-like sample: the stars hosting HJs (red stars) are in the lower parts of the figures at a given [Fe/H] (left panel) or mass (right panel), compared to stars with CJs (cyan circles) or stars without known planets (black crosses).

We perform a simple statistical test for the age distribution as an attempt to quantify this difference further and to highlight its issues as well. Figure 5 shows the cumulative distribution functions of ages (median of marginal posterior) for all stars in the sample (gray), stars hosting HJs (red), and stars with CJs (blue). We tested the null hypothesis that two of the subsamples are drawn from the same population using the Anderson–Darling test for kk samples and found the pp-values of 0.022, 0.18, and 1.6×1031.6\times 10^{-3} when comparing all-stars vs. HJ hosts, all-stars vs. CJ hosts, and HJ hosts vs. CJ hosts, respectively. These results confirm the above visual impression that HJ hosts are younger than CJ hosts as well as all surveyed stars.

We note, however, that interpreting this kind of statistical test is by no means straightforward. First, this does not take into account heterogeneous uncertainties in the age estimates. In general, the uncertainty varies from star to star for a variety of reasons; this may be due to the difference in the precision of the spectroscopic parameters that depends on the signal-to-noise of the observed spectrum, or due to the difference in other stellar parameters (e.g., mass) that affect the age precision. When this is the case, the above test may give low pp-values even for two populations of stars with the same true age distributions; the distribution of the medians of the posteriors conditioned on a certain prior (i.e., the estimated ages) could still be different.666Consider an extreme case where there exist two populations of stars with the same age distributions and the ages can be inferred extremely precisely for one population, and only very poorly for the other. Then the distribution of the estimated ages will be very similar to the true distribution in the former but will be totally different for the latter, depending on how the estimate is given, and so the distributions of the estimated ages will look very different. Second, the apparent difference in the age distributions could be caused by the difference in other stellar parameters, such as the mass, that are correlated with age. If the occurrence rate of HJs increases with the stellar mass more rapidly than that of CJs, for example, then the HJ hosts will be younger than CJs even if their occurrences have the same age dependence at a given mass, because more massive stars are on average younger. One conceptually straightforward solution is to control the other stellar parameters in the sample by, for example, focusing on subsets of stars with similar masses and metallicities and computing the occurrence rates separately. When this is not practical — as in our case — one needs to consider the distribution of all the relevant stellar parameters in the sample and infer the dependence of the occurrence rate on these parameters simultaneously. Such an analysis must take into account the fact that the constraints on stellar parameters are often given in a degenerate manner; for example, the isochronal mass and age are strongly degenerate (Figure 4). The degeneracy in the inferred stellar parameters should not be confused with the correlation that actually exists in the stellar sample.

The rest of this paper describes our attempts to analyze the age difference taking into account the above issues associated with large and heterogeneous uncertainties in the degenerate stellar parameters. The following Section 3 presents a framework for doing so.

3 A General Framework for Inferring the Number of Planets per Star

3.1 Definition of Planet Occurrence Rate

Two types of “occurrence rates” have been mainly discussed in the literature: (1) the number of planets per star (NPPS; e.g. Youdin, 2011; Fulton et al., 2021), and (2) the fraction of stars with planets (FSWP; e.g. Fischer & Valenti, 2005; Johnson et al., 2010). In this work, we discuss the NPPS as a function of both planet and stellar properties.

Let 𝒛\bm{z} and 𝒙\bm{x} represent the physical parameters of stars (mass, metallicity, age, etc.) and planets (mass, orbital period, radius, etc.), respectively. We define the NPPS function f(𝒙|𝒛)f(\bm{x}|\bm{z}) so that, around a star with given 𝒛\bm{z}, the expected number of planets that fall within a small volume of the parameter space d𝒙d\bm{x} around 𝒙\bm{x} is (e.g., Tabachnik & Tremaine, 2002; Youdin, 2011):

dn¯p=f(𝒙|𝒛)d𝒙.\displaystyle d\overline{n}_{\rm p}=f(\bm{x}|\bm{z})d\bm{x}. (4)

Thus f(𝒙|𝒛)f(\bm{x}|\bm{z}) denotes the expected number of planets per star and per d𝒙\mathrm{d}\bm{x}, but not per d𝒛d\bm{z}.

By integrating over a certain region 𝒫\mathcal{P} in the 𝒙\bm{x}-space, which defines the “planet” of consideration, we obtain the expected number of planets with 𝒙𝒫\bm{x}\in\mathcal{P} per star, in a manner that depends on the stellar properties:

n¯𝒙𝒫(𝒛)=𝒫f(𝒙|𝒛)𝑑𝒙.\displaystyle\overline{n}_{\bm{x}\in\mathcal{P}}(\bm{z})=\int_{\mathcal{P}}f(\bm{x}|\bm{z})d\bm{x}. (5)

Further integrating over 𝒛\bm{z}, the expected NPPS around NN_{\star} stars whose properties 𝒛\bm{z} follow the probability distribution p(𝒛)p_{\star}(\bm{z}) is:

n¯𝒙𝒫,𝒛p(𝒛)\displaystyle\overline{n}_{\bm{x}\in\mathcal{P},\bm{z}\sim p_{\star}(\bm{z})} =1N𝒮n¯𝒙𝒫(𝒛)Np(𝒛)𝑑𝒛\displaystyle={1\over N_{\star}}\int_{\mathcal{S}}\overline{n}_{\bm{x}\in\mathcal{P}}(\bm{z})N_{\star}p_{\star}(\bm{z})\,d\bm{z}
=𝒫𝒮f(𝒙|𝒛)p(𝒛)𝑑𝒛𝑑𝒙,\displaystyle=\int_{\mathcal{P}}\int_{\mathcal{S}}f(\bm{x}|\bm{z})\,p_{\star}(\bm{z})\,d\bm{z}\,d\bm{x}, (6)

where 𝒮\mathcal{S} denotes the domain of 𝒛\bm{z}.

We emphasize again that f(𝒙|𝒛)f(\bm{x}|\bm{z}) does not have the unit of 1/𝒛1/\bm{z}, and neither does n¯𝒙𝒫(𝒛)\overline{n}_{\bm{x}\in\mathcal{P}}(\bm{z}). The PDF p(𝒛)p_{\star}(\bm{z}) does, and satisfies 𝒮p(𝒛)𝑑𝒛=1\int_{\mathcal{S}}p_{\star}(\bm{z})\,d\bm{z}=1. One cannot integrate out 𝒛\bm{z} in f(𝒙|𝒛)f(\bm{x}|\bm{z}) without specifying p(𝒛)p_{\star}(\bm{z}). This means the properties of all the surveyed stars need to be specified to relate ff with the observed number of planets. To put it the other way, the parameter distribution of the surveyed stars necessarily needs to be specified/inferred for inferring the NPPS as a function of stellar parameters.

3.2 Inferring the NPPS from the Data

Consider a planet survey in which the total number of stars is NN_{\star}. We assume that the survey provides the following data sets:

  • 𝑫{Dj}j=1N\bm{D}\equiv\{D_{j}\}^{N_{\star}}_{j=1}: This data has information about the stellar properties (𝒛j)(\bm{z}_{j}) for each star. In our case, this is the stellar data used for isochrone fitting.

  • 𝑯={Hj}j=1N\bm{H}=\{H_{j}\}^{N_{\star}}_{j=1}: This data set includes the parameters of planets around each star. For the jj-th star with detected planets, the set HjH_{j} contains the parameters for all the detected planets, represented as Hj={𝒙j(1),𝒙j(2),,𝒙j(nj)}H_{j}=\{\bm{x}^{(1)}_{j},\bm{x}^{(2)}_{j},...,\bm{x}^{(n_{j})}_{j}\}, where njn_{j} is the number of detected planets orbiting the jj-th star. If no planets are detected, the set will be empty. In our application, 𝒙\bm{x} consists of the minimum mass and orbital period of the planet.

We parameterize the stellar distribution p(𝒛)p_{\star}(\bm{z}) and the NPPS function f(𝒙|𝒛)f(\bm{x}|\bm{z}) by the sets of parameters 𝜶\bm{\alpha} and 𝜸\bm{\gamma}, respectively. We then aim to determine the joint posterior PDF of these parameters, given the data 𝑫\bm{D} and 𝑯\bm{H}:

p(𝜶,𝜸|𝑫,𝑯)\displaystyle p(\bm{\alpha},\bm{\gamma}|\bm{D},\bm{H}) \displaystyle\propto (𝜶,𝜸)p(𝜶,𝜸),\displaystyle\mathcal{L}(\bm{\alpha},\bm{\gamma})\,p(\bm{\alpha},\bm{\gamma}), (7)

where (𝜶,𝜸)\mathcal{L}(\bm{\alpha},\bm{\gamma}) is the likelihood function and p(𝜶,𝜸)p(\bm{\alpha},\bm{\gamma}) is the prior PDF for 𝜶\bm{\alpha} and 𝜸\bm{\gamma}. We assume that the data for each star are independent when conditioned on 𝜶\bm{\alpha} and 𝜸\bm{\gamma}. Then the likelihood is represented as the product of the probabilities of the data for each star:

(𝜶,𝜸)\displaystyle\mathcal{L}(\bm{\alpha},\bm{\gamma}) =\displaystyle= j=1Np(Dj,Hj|𝜶,𝜸).\displaystyle\prod^{N_{\star}}_{j=1}p(D_{j},H_{j}|\bm{\alpha},\bm{\gamma}). (8)

For the jj-th star, the likelihood may be evaluated as:

p(Dj,Hj|𝜶,𝜸)\displaystyle p(D_{j},H_{j}|\bm{\alpha},\bm{\gamma}) =\displaystyle= p(Dj,Hj|𝜶,𝜸,𝒛j)p(𝒛j|𝜶,𝜸)𝑑𝒛j\displaystyle\int p(D_{j},H_{j}|\bm{\alpha},\bm{\gamma},\bm{z}_{j})\,p(\bm{z}_{j}|\bm{\alpha},\bm{\gamma})d\bm{z}_{j}
=\displaystyle= p(Dj|𝒛j)p(Hj|𝜸,𝒛j)p(𝒛j|𝜶)𝑑𝒛j.\displaystyle\int p(D_{j}|\bm{z}_{j})\,p(H_{j}|\bm{\gamma},\bm{z}_{j})p_{\star}(\bm{z}_{j}|\bm{\alpha})d\bm{z}_{j}.

In the second line, we assume that DjD_{j} and HjH_{j} are independent when conditioned on 𝒛j\bm{z}_{j}, and that DjD_{j} does not depend on 𝜶\bm{\alpha} nor 𝜸\bm{\gamma} when conditioned on 𝒛j\bm{z}_{j}. We compute p(Hj|𝜸,𝒛j)p(H_{j}|\bm{\gamma},\bm{z}_{j}) assuming that the number of detected planets in a given small partition Δl\Delta_{l} of the parameter space follows the Poisson distribution, and that the detections in different partitions are made independently. From Equation (4), the expected number of planets in a given partition is ηj(𝒙l)f(𝒙l|𝒛j)Δl\eta_{j}(\bm{x}_{l})f(\bm{x}_{l}|\bm{z}_{j})\Delta_{l}, where ηj(𝒙)\eta_{j}(\bm{x}) is the detection efficiency of planets as a function of 𝒙\bm{x} for the jj-th star. Note that ηj(𝒙)\eta_{j}(\bm{x}) is not a density function per 𝒙\bm{x}. Then p(Hj|𝜸,𝒛j)p(H_{j}|\bm{\gamma},\bm{z}_{j}), the probability to find one planet in partitions around 𝒙j(1),,𝒙j(nj)\bm{x}_{j}^{(1)},\dots,\bm{x}_{j}^{(n_{j})}, but none elsewhere, is (Tabachnik & Tremaine, 2002; Youdin, 2011):

p(Hj|𝜸,𝒛j)\displaystyle p(H_{j}|\bm{\gamma},\bm{z}_{j})
=\displaystyle= (l,𝒙lHj[ηj(𝒙l)f(𝒙l|𝜸,𝒛j)Δl]exp[ηj(𝒙l)f(𝒙l|𝜸,𝒛j)Δl])forpartitionswithdetectedplanets\displaystyle\underbrace{\left(\prod_{l,\bm{x}_{l}\in H_{j}}\left[\eta_{j}(\bm{x}_{l})f(\bm{x}_{l}|\bm{\gamma},\bm{z}_{j})\Delta_{l}\right]\exp{\left[-\eta_{j}(\bm{x}_{l})f(\bm{x}_{l}|\bm{\gamma},\bm{z}_{j})\Delta_{l}\right]}\right)}_{\rm for\;partitions\;with\;detected\;planets}
×(l,𝒙lHjexp[ηj(𝒙l)f(𝒙l|𝜸,𝒛j)Δl])forpartitionswithoutdetectedplanets\displaystyle\times\underbrace{\left(\prod_{l,\bm{x}_{l}\notin H_{j}}\exp{\left[-\eta_{j}(\bm{x}_{l})f(\bm{x}_{l}|\bm{\gamma},\bm{z}_{j})\Delta_{l}\right]}\right)}_{\rm for\;partitions\;without\;detected\;planets}
\displaystyle\propto (𝒙jHjηj(𝒙j)f(𝒙j|𝜸,𝒛j))exp[ηj(𝒙)f(𝒙|𝜸,𝒛j)𝑑𝒙].\displaystyle\left(\prod_{\bm{x}_{j}\in H_{j}}\eta_{j}(\bm{x}_{j})f(\bm{x}_{j}|\bm{\gamma},\bm{z}_{j})\right)\exp{\left[-\int\eta_{j}(\bm{x})f(\bm{x}|\bm{\gamma},\bm{z}_{j})d\bm{x}\right]}.

Substituting this expression into Equation (3.2), we find

p(Dj,Hj|𝜶,𝜸)\displaystyle p(D_{j},H_{j}|\bm{\alpha},\bm{\gamma}) =\displaystyle= p(Dj|𝒛j)p(𝒛j|𝜶)(𝒙jHjf~(𝒙j|𝜸,𝒛j))\displaystyle\int\;p(D_{j}|\bm{z}_{j})\,p_{\star}(\bm{z}_{j}|\bm{\alpha})\left(\prod_{\bm{x}_{j}\in H_{j}}\widetilde{f}(\bm{x}_{j}|\bm{\gamma},\bm{z}_{j})\right) (11)
×exp[f~j(𝒙|𝜸,𝒛j)𝑑𝒙]d𝒛j,\displaystyle\times\exp{\left[-\int\widetilde{f}_{j}(\bm{x}|\bm{\gamma},\bm{z}_{j})d\bm{x}\right]}d\bm{z}_{j},

where we defined f~j(𝒙|𝒛)=ηj(𝒙)f(𝒙|𝒛)\widetilde{f}_{j}(\bm{x}|\bm{z})=\eta_{j}(\bm{x})f(\bm{x}|\bm{z}).

In this work, we model the distribution of stellar parameters in the sample, p(𝒛)p_{\star}(\bm{z}), as follows:

p(𝒛|𝜶)=k=1Kexp(αk) 1k(𝒛).\displaystyle p_{\star}(\bm{z}|\bm{\alpha})=\sum^{K}_{k=1}\exp{(\alpha_{k})}\;\bm{1}_{k}(\bm{z}). (12)

Here, kk represents the index for bins that divide the domain of 𝒛\bm{z}, and 𝟏k(𝒛)\bm{1}_{k}(\bm{z}) is a step function that returns 1 if the kk-th bin contains 𝒛\bm{z}, and 0 otherwise. The term exp(αk)\exp{(\alpha_{k})} represents the height of the kk-th bin, and must satisfy kexp(αk)Δk=1\sum_{k}\exp{(\alpha_{k})}\Delta_{k}=1 with Δk\Delta_{k} being the volume of the kk-th bin, so that pp_{\star} is a normalized PDF. In this form, we assume that p(𝒛|𝜶)p_{\star}(\bm{z}|\bm{\alpha}) is constant within a given bin, and αk\alpha_{k} represents the log probability density in the kk-th bin. Using this expression for p(𝒛|𝜶)p_{\star}(\bm{z}|\bm{\alpha}), Equation (11) can be rewritten as:

p(Dj,Hj|𝜶,𝜸)\displaystyle p(D_{j},H_{j}|\bm{\alpha},\bm{\gamma}) =\displaystyle= k=1Kexp(αk)Lj,k(𝜸),\displaystyle\sum^{K}_{k=1}\exp{(\alpha_{k})}L_{j,k}(\bm{\gamma}), (13)

where

Lj,k(𝜸)\displaystyle L_{j,k}(\bm{\gamma}) =\displaystyle= 𝑑𝒛jp(Dj|𝒛j)𝟏k(𝒛j)(𝒙jHjf~j(𝒙j|𝜸,𝒛j))\displaystyle\int d\bm{z}_{j}\,p(D_{j}|\bm{z}_{j})\bm{1}_{k}(\bm{z}_{j})\left(\prod_{\bm{x}_{j}\in H_{j}}\widetilde{f}_{j}(\bm{x}_{j}|\bm{\gamma},\bm{z}_{j})\right)
×\displaystyle\times exp[f~j(𝒙|𝜸,𝒛j)𝑑𝒙].\displaystyle\exp{\left[-\int\widetilde{f}_{j}(\bm{x}|\bm{\gamma},\bm{z}_{j})d\bm{x}\right]}.

Following Hogg et al. (2010), we evaluate the integral of Lj,kL_{j,k} via an importance sampling using the samples from the posterior PDF p0(𝒛|Dj)p_{0}(\bm{z}|D_{j}) conditioned on a certain bookkeeping prior p0(𝒛)p_{0}(\bm{z}), as we obtained in Section 2.2. Namely, considering Bayes’ theorem as

p(D|𝒛)=p0(𝒛|D)p0(D)p0(𝒛),p(D|\bm{z})=\frac{p_{0}(\bm{z}|D)p_{0}(D)}{p_{0}(\bm{z})}, (14)

we approximate Lj,kL_{j,k} as:

Lj,k(𝜸)\displaystyle L_{j,k}(\bm{\gamma}) \displaystyle\approx p0(Dj)Mm=1M𝟏k(𝒛j(m))p0(𝒛j(m))(𝒙jHjf~j(𝒙j|𝜸,𝒛j))\displaystyle\frac{p_{0}(D_{j})}{M}\sum^{M}_{m=1}\frac{\bm{1}_{k}(\bm{z}^{(m)}_{j})}{p_{0}(\bm{z}^{(m)}_{j})}\left(\prod_{\bm{x}_{j}\in H_{j}}\widetilde{f}_{j}(\bm{x}_{j}|\bm{\gamma},\bm{z}_{j})\right)
×\displaystyle\times exp[f~j(𝒙|𝜸,𝒛j)𝑑𝒙],𝒛j(m)p0(𝒛|Dj).\displaystyle\exp{\left[-\int\widetilde{f}_{j}(\bm{x}|\bm{\gamma},\bm{z}_{j})d\bm{x}\right]},\;\;\;\bm{z}^{(m)}_{j}\sim p_{0}(\bm{z}|D_{j}).

Here, MM represents the number of samples drawn from the posterior PDF, and we can drop the constant factor p0(D)p_{0}(D) that is irrelevant to the inference. We note that the inference does not depend explicitly on the choice of the bookkeeping prior p0(𝒛)p_{0}(\bm{z}). This is because the evaluation of LL only involves p0(𝒛|D)/p0(𝒛)p(D|𝒛)p_{0}(\bm{z}|D)/p_{0}(\bm{z})\propto p(D|\bm{z}), which is the likelihood function that depends on the data alone, but not on p0(𝒛)p_{0}(\bm{z}) used to draw posterior samples in the isochrone fitting of individual stars.

Finally, the joint posterior PDF in Equation (7) is evaluated as

p(𝜶,𝜸|𝑫,𝑯)=jNp(Dj,Hj|𝜶,𝜸)p(𝜶,𝜸).\displaystyle p(\bm{\alpha},\bm{\gamma}|\bm{D},\bm{H})=\prod^{N_{\star}}_{j}p(D_{j},H_{j}|\bm{\alpha},\bm{\gamma})\,p(\bm{\alpha},\bm{\gamma}). (16)

In this work, we perform posterior sampling using Hamiltonian Monte Carlo and No-U-Turn Sampler (Duane et al., 1987; Betancourt, 2017) as implemented in NumPyro (Phan et al., 2019; Bingham et al., 2019). The code is implemented using JAX (Bradbury et al., 2021).

4 Planet Occurrence Rates around the CLS Sun-like Stars

Refer to caption
Figure 6: The stellar parameter distribution p(𝒛)p_{\star}(\bm{z}) inferred for the Sun-like sample using the histogram model in Section 4.1. (Left): The 3D view of the posterior in the parameter space of ([Fe/H],M,t)(\mathrm{[Fe/H]},M_{\star},t_{\star}), where the circles represent the mean values of p(𝒛)p_{\star}(\bm{z}) at the center of each grid. (Right): The panels show the marginalized posterior PDFs for [Fe/H], mass, and age (top to bottom). The solid lines and translucent regions represent the medians and 90% equal-tail credible intervals (CIs) in each bin.
Refer to caption
Figure 7: The NPPS function n¯p(𝒛)\bar{n}_{p}(\bm{z}) inferred for the Sun-like sample using the histogram model in Section 4.1. (Left): The posterior PDFs of the NPPS for HJ (orange) and for CJ (blue). These are the numbers of planets per star in the searched sample, computed taking into account the stellar parameter dependence of the occurrence rate as well as the distribution of the stellar parameters in the sample; see Equation (6). (Right): The posterior PDFs for the NPPS of HJs (top) and CJs (bottom) as functions of [Fe/H], mass, and age are shown as violin plots, whose widths represent the probability densities. The error bars represent their peaks (modes) and 68% highest density intervals (HDIs). This is essentially the NPPS as a function of 𝒛\bm{z} as given in Equation 5 but has been marginalized for the stellar parameters that are not shown in the horizontal axis as in Equation (6) (see also the main text and Equation (26)).
Refer to caption
Figure 8: The posterior PDFs for the NPPS as a function of age derived using the histogram model in Section 4.1. Unlike in the right panels of Figure 7, here we show the NPPS evaluated at given stellar masses shown at the top of each column, after marginalizing over the metallicity distribution.

In this section, we apply the framework described in Section 3 to the Sun-like sample defined in Section 2 and infer the occurrence rate of hot Jupiters (HJs) and cold Jupiter (CJs), taking into account the dependencies on stellar and planetary properties. For the stellar properties, we consider metallicity, mass, and age, i.e., 𝒛=([Fe/H],M,t)\bm{z}=(\mathrm{[Fe/H]},M_{\star},t_{\star}), and ignore possible dependencies on other parameters including the Galactic position and velocity. As we discussed in Section 2.4, it is essential to consider all these variables jointly so that the dependencies on mass and/or metallicity do not mimic the age dependence through the correlations between these parameters. For the planet properties, we consider the logarithms of the minimum mass and orbital period that are well determined from RVs: 𝒙=(logMsini,logP)\bm{x}=(\log M\sin{i},\log P).

We assume the following functional form for the NPPS function f(𝒙|𝜸,𝒛)f(\bm{x}|\bm{\gamma},\bm{z}):

f(𝒙|𝜸,𝒛)\displaystyle f(\bm{x}|\bm{\gamma},\bm{z}) =\displaystyle= 2n¯p(logMsini)(logP)\displaystyle\frac{\partial^{2}\bar{n}_{p}}{\partial({\log{M\sin{i}}})\partial(\log{P})} (19)
=\displaystyle= {𝒢(𝒛,𝜸s)(Msini/MJup)m(P/day)p,(forHJ)𝒢(𝒛,𝜸s)(Msini/MJup)m(P/year)p,(forCJ)\displaystyle\left\{\begin{array}[]{lc}\mathcal{G}(\bm{z},\bm{\gamma}_{s})\left({M\sin{i}}/{M_{\rm Jup}}\right)^{m}\left({P}/{\rm day}\right)^{p},&\mathrm{(for\;HJ)}\\ \mathcal{G}(\bm{z},\bm{\gamma}_{s})\left({M\sin{i}}/{M_{\rm Jup}}\right)^{m}\left({P}/{\rm year}\right)^{p},&\mathrm{(for\;CJ)}\end{array}\right.

where 𝜸={𝜸s,m,p}\bm{\gamma}=\{\bm{\gamma}_{s},m,p\} are the parameters that specify ff. Here, 𝒢(𝒛,𝜸s)\mathcal{G}(\bm{z},\bm{\gamma}_{s}) is a shape function that describes the dependency on stellar properties, and it includes a normalization factor for the NPPS. We follow previous works (e.g., Howard et al., 2010; Petigura et al., 2018) to model the dependence on the planet properties as a double power law with indices mm and pp. Although recent studies report a possible deviation from a single power law function at long orbital periods corresponding to a few au (Fernandes et al., 2019; Fulton et al., 2021), a single power law still remains a reasonable model for the narrow range of our CJs’ period (1–10 yr). By Equation (19), we implicitly assume that the mass/period dependences are separable, and are common for all stars with different 𝒛\bm{z}.

For evaluating the detection efficiency for each star, ηj(𝒙)\eta_{j}(\bm{x}), we divide the parameter space of planetary properties 𝒙\bm{x} into cells with dimensions of Δ(logP)×Δ(logMsini)0.14dex×0.13dex\Delta(\log{P})\times\Delta(\log{M\sin{i}})\sim 0.14\;{\rm dex}\times 0.13\;{\rm dex}, and computed the value of ηj\eta_{j} for each of these cells using the results of injection-recovery simulations (Rosenthal et al., 2021). The efficiency averaged over all the sample stars is shown in Figure 2.

4.1 Histogram Model

Table 1: Priors and Posteriors of Parameters in the Histogram Inference
Parameters Prior Posterior summary
Hot Jupiter Cold Jupiter
Parameters for the NPPS function f(𝐱|𝐳)f(\bm{x}|\bm{z})
{logfk}\{\log f_{k}\} 𝒰(4,1)\mathcal{U}(-4,1) see Figure 7
mm 𝒰(5,5)\mathcal{U}(-5,5) 0.680.33+0.32-0.68^{+0.32}_{-0.33} 0.180.15+0.15-0.18^{+0.15}_{-0.15}
pp 𝒰(5,5)\mathcal{U}(-5,5) 0.340.46+0.45-0.34^{+0.45}_{-0.46} 0.510.25+0.250.51^{+0.25}_{-0.25}
Parameters for the stellar distribution p(𝐳)p_{\star}(\bm{z})
{αk}\{\alpha_{k}\} Eq. (22) see Figure 6

Note. — The values shown are the medians and 68% equal-tail intervals of the marginal posterior distribution.

Throughout this paper, the stellar distribution p(𝒛)p_{\star}(\bm{z}) is modeled as a histogram, as shown in Equation (12). In this subsection, we model 𝒢(𝒛,𝜸s)\mathcal{G}(\bm{z},\bm{\gamma}_{s}) as a histogram too, using the common bins in the 𝒛\bm{z} space as adopted for p(𝒛)p_{\star}(\bm{z}). This is meant to serve as an analysis with minimal assumptions on the 𝒛\bm{z} dependence of the NPPS, which guides the interpretation of the subsequent results from other more constrained parametric models. We set up 4×4×54\times 4\times 5 bins for [Fe/H], mass, and age spanning [0.4,0.4][-0.4,0.4], [0.8,1.2][0.8,1.2], and [0,14][0,14], respectively, resulting in the total bin number of K=80K=80. The bin widths are determined to be comparable to the typical uncertainty of these parameters (see also Section 4.3); we do not expect that the data provide useful information on finer parameter dependencies. Then we represent the expected NPPS values in each bin (assumed to be constant within the bin) by 𝜸s={f1,f2,,fK}\bm{\gamma}_{s}=\{f_{1},f_{2},...,f_{K}\}. Consequently, we have

𝒢(𝒛,𝜸s)=k=1KfkΔmΔp𝟏k(𝒛),\displaystyle\mathcal{G}(\bm{z},\bm{\gamma}_{s})=\sum^{K}_{k=1}{f_{k}\over\Delta_{m}\Delta_{p}}\bm{1}_{k}(\bm{z}), (21)

where ΔmΔp\Delta_{m}\Delta_{p} denotes the integral of the power law part of Equation (19) in the HJ/CJ domain, and depends both on mm and pp. This factor makes fkf_{k} to be the number of planets in the domain per star with parameters 𝒛\bm{z}; see Equation (5).

The prior p(𝜶,𝜸)p(\bm{\alpha},\bm{\gamma}) is assumed to be separable as p(𝜶,𝜸)=p(𝜶)p(𝜸)p(\bm{\alpha},\bm{\gamma})=p(\bm{\alpha})\,p(\bm{\gamma}). For 𝜶\bm{\alpha} that describes the stellar distribution p(𝒛|𝜶)p_{\star}(\bm{z}|\bm{\alpha}), we assume

p(𝜶)\displaystyle p(\bm{\alpha}) \displaystyle\propto δ(kKexp(αk)Δk1)\displaystyle\delta\left(\sum^{K}_{k}\exp(\alpha_{k})\Delta_{k}-1\right) (22)
×\displaystyle\times [k𝒦𝒰(αk;5,αmax)][k𝒦δ(αk+10)],\displaystyle\left[\prod_{k\notin\mathcal{K}}\mathcal{U}(\alpha_{k};-5,\alpha_{\rm max})\right]\cdot\left[\prod_{k\in\mathcal{K}}\delta(\alpha_{k}+10)\right],

where δ(x)\delta(x) denotes the Dirac delta function, and 𝒰(x;a,b)\mathcal{U}(x;a,b) denotes the uniform distribution for xx between aa and bb. The first term on the right side comes from the normalization requirement of the PDF. In the second line, we assign uniform priors on most αk\alpha_{k}, but assign essentially zero probability density αk=10\alpha_{k}=-10 at bins in the set 𝒦\mathcal{K} that satisfies

t(Gyr)>30(M/M1.25)+5.\displaystyle t_{\star}\;{\rm(Gyr)}>-30(M_{\star}/M_{\odot}-1.25)+5. (23)

This represents our prior knowledge that stars do not exist in these regions. The value of αmax\alpha_{\rm max} is chosen so that the normalized p(𝒛|𝜶)p_{\star}(\bm{z}|\bm{\alpha}) is positive (i.e., exp(αk)Δk\exp(\alpha_{k})\Delta_{k} never exceeds unity for any kk). For the parameters 𝜸\bm{\gamma} in the NPPS function f(𝒙|𝜸,𝒛)f(\bm{x}|\bm{\gamma},\bm{z}), we assume independent and uniform priors:

p(𝜸)=[k=1K𝒰(logfk;4,1)]𝒰(m;5,5)𝒰(p;5,5)\displaystyle p(\bm{\gamma})=\left[\prod^{K}_{k=1}\mathcal{U}(\log{f_{k}};-4,1)\right]\,\mathcal{U}(m;-5,5)\,\mathcal{U}(p;-5,5)

We obtain 10,000 samples from the posterior distribution in Equation (16) and confirm R^<1.01\hat{R}<1.01 for all the parameters. Figure 6, Figure 7, Figure 8, and Table 1 present a summary of the result for the joint inference of the stellar parameter distribution p(𝒛)p_{\star}(\bm{z}) and the NPPS function f(𝒙|𝜸,𝒛)f(\bm{x}|\bm{\gamma},\bm{z}). Figure 6 shows the stellar parameter distribution p(𝒛)p_{\star}(\bm{z}) inferred from the whole data. The left panel displays the joint distribution:

p(𝒛|𝑫,𝑯)=p(𝒛|𝜶)p(𝜶,𝜸|𝑫,𝑯)𝑑𝜶𝑑𝜸.\displaystyle p(\bm{z}|\bm{D},\bm{H})=\int p_{\star}(\bm{z}|\bm{\alpha})\,p(\bm{\alpha},\bm{\gamma}|\bm{D},\bm{H})\,d\bm{\alpha}d\bm{\gamma}. (25)

The right panels show the marginalized distributions of this joint PDF p(𝒛|𝑫,𝑯)p(\bm{z}|\bm{D},\bm{H}) for [Fe/H], mass, and age. Figure 7 shows the inferred NPPS functions for HJs and CJs. What is shown in the right panels is essentially Equation (5) conditioned on the data, but for 1D visualization as a function of each of the mass, metallicity, and age (which we denote by z^\hat{z}), it has been marginalized over the other two stellar parameters (𝒛\z^\bm{z}_{\backslash\hat{z}}). Namely, shown here by the violin plots are the distributions of

n¯𝒙𝒫(𝜸,𝒛)p(𝒛\z^|𝜶,z^)𝑑𝒛\z^\displaystyle\int\overline{n}_{\bm{x}\in\mathcal{P}}(\bm{\gamma},\bm{z})\,p_{\star}(\bm{z}_{\backslash\hat{z}}|\bm{\alpha},\hat{z})\,d\bm{z}_{\backslash\hat{z}} (26)

(see also Equation (5)) computed for the samples of (𝜶,𝜸)(\bm{\alpha},\bm{\gamma}) drawn from p(𝜶,𝜸|𝑫,𝑯)p(\bm{\alpha},\bm{\gamma}|\bm{D},\bm{H}). From these plots, we see that the NPPS of both HJs and CJs is likely higher for more metal-rich stars. This is a known trend, but we recover this even considering the correlations with other stellar parameters. Furthermore, the NPPS of HJs shows a decreasing trend with increasing age, to a similar degree to the metallicity dependence. We further investigate these trends with the parametric models below. The left panel of Figure 7 shows the posterior PDFs for the NPPS of HJs and CJs in the Sun-like sample, averaged over the stellar parameters: the histograms show the distributions of Equation (6), n¯𝒙𝒫,𝒛p(𝒛)(𝜶,𝜸)\overline{n}_{\bm{x}\in\mathcal{P},\bm{z}\sim p_{\star}(\bm{z})}(\bm{\alpha},\bm{\gamma}), computed for the samples of (𝜶,𝜸)(\bm{\alpha},\bm{\gamma}) drawn from p(𝜶,𝜸|D,H)p(\bm{\alpha},\bm{\gamma}|D,H). The posterior median and equal-tail 68% intervals are 4.01.2+1.9%4.0^{+1.9}_{-1.2}\,\% for HJs (orange) and 18.03.3+5.1%18.0^{+5.1}_{-3.3}\,\% for CJs (blue), respectively. They are both consistent with the values reported by Zhu (2022), who analyzed a similar subset of Sun-like stars from the CLS catalog. In Figure 8, we show the posterior PDFs for the NPPS as a function of both mass and age, now marginalizing only over the metallicity. Although the results for separate stellar mass bins are more uncertain, the NPPS of HJs still shows a decreasing trend with increasing age at each stellar mass. This confirms that the age dependence seen in Figure 7 is not solely due to the mass dependence (cf. Section 2.4); even if we control the stellar mass, the NPPS of HJs tends to be higher at younger ages.

Refer to caption
Figure 9: (Left) The panels show the marginalized posterior PDFs for [Fe/H], mass, and age (top to bottom). The solid lines and translucent regions represent the medians and 90% credible intervals (CIs) in each bin. (Right) The inferred NPPS are functions of age for hot Jupiters and cold Jupiters, adopting the exponential model (left panel) and the sigmoid model (right panel). This is Equation (5) evaluated at the solar mass and metallicity. In each panel, solid lines present median values of the posterior predictions at ages and dark and light-shaded regions means 68%68\% and 90%90\% credible intervals of the posterior predictions, respectively.

4.2 Other Parametric Models

Table 2: Priors and Posteriors of Parameters in the Parametric Inference
Parameters Prior Exponential Sigmoid
Hot Jupiter Cold Jupiter Hot Jupiter Cold Jupiter
Parameters for the NPPS function f(𝐱|𝐳)f(\bm{x}|\bm{z})
logf0\log{f_{0}} 𝒰(5,0)\mathcal{U}(-5,0) 1.650.90+0.76-1.65^{+0.76}_{-0.90} 2.170.60+0.48-2.17^{+0.48}_{-0.60} 2.250.67+0.57-2.25^{+0.57}_{-0.67} 1.510.20+0.20-1.51^{+0.20}_{-0.20}
mm 𝒰(5,5)\mathcal{U}(-5,5) 0.750.38+0.35-0.75^{+0.35}_{-0.38} 0.160.16+0.16-0.16^{+0.16}_{-0.16} 0.750.37+0.35-0.75^{+0.35}_{-0.37} 0.180.15+0.15-0.18^{+0.15}_{-0.15}
pp 𝒰(5,5)\mathcal{U}(-5,5) 0.340.51+0.530.34^{+0.53}_{-0.51} 0.660.25+0.260.66^{+0.26}_{-0.25} 0.330.52+0.530.33^{+0.53}_{-0.52} 0.670.27+0.270.67^{+0.27}_{-0.27}
β\beta 𝒰(20,20)\mathcal{U}(-20,20) 4.622.37+3.914.62^{+3.91}_{-2.37} 1.860.71+0.771.86^{+0.77}_{-0.71} 3.361.78+2.383.36^{+2.38}_{-1.78} 1.330.63+0.651.33^{+0.65}_{-0.63}
κ\kappa 𝒰(30,30)\mathcal{U}(-30,30) 12.526.87+8.8812.52^{+8.88}_{-6.87} 1.462.60+2.601.46^{+2.60}_{-2.60} 8.985.90+6.618.98^{+6.61}_{-5.90} 2.132.24+2.352.13^{+2.35}_{-2.24}
ϵ\epsilon 𝒰(5,5)\mathcal{U}(-5,5) 0.740.54+0.34-0.74^{+0.34}_{-0.54} 0.200.14+0.150.20^{+0.15}_{-0.14} - -
logλ\log{\lambda} 𝒰(0.5,1.5)\mathcal{U}(-0.5,1.5) - - 0.350.48+0.770.35^{+0.77}_{-0.48} 0.510.66+0.660.51^{+0.66}_{-0.66}
CageC_{\rm age} 𝒰(0,14)\mathcal{U}(0,14) - - 5.132.34+1.805.13^{+1.80}_{-2.34} 11.941.77+1.4311.94^{+1.43}_{-1.77}
Parameters for the stellar distribution p(𝐳)p_{\star}(\bm{z})
{αk}\{\alpha_{k}\} Eq. (22) see Figure 9 left not shown but similar to Figure 9

Note. — The values shown are the medians and 68% equal-tail intervals of the marginal posterior distribution. f0f_{0}: normalization factor, mm: power law index for planet mass, pp: power law index for orbital period, β\beta: power law index for metallicity, κ\kappa: power law index for stellar mass, ϵ\epsilon: index for the exponential form, λ\lambda: decrease timescale for the sigmoid function, CageC_{\rm age}: cut-off age for the sigmoid function. See Equation (27), (28), and (29).

In this section, we model 𝒢(𝒛,𝜸s)\mathcal{G}(\bm{z},\bm{\gamma}_{s}) as a parametric function of the form:

𝒢(𝒛,𝜸s)=f010β[Fe/H](M/M)κ𝒢(t,𝜸),\displaystyle\mathcal{G}(\bm{z},\bm{\gamma}_{s})=f_{0}10^{\beta\mathrm{[Fe/H]}}(M_{\star}/M_{\odot})^{\kappa}\mathcal{G}^{\prime}(t_{\star},\bm{\gamma}^{\prime}), (27)

where f0f_{0} is a normalization factor for the NPPS, and the function 𝒢(t,𝜸)\mathcal{G}^{\prime}(t_{\star},\bm{\gamma}^{\prime}) that accounts for the age dependence will be specified below. The metallicity and mass dependencies are modeled as power-law functions with indices β\beta and κ\kappa, respectively, following the prior studies (e.g., Johnson et al., 2010). We set uniform priors for β\beta and κ\kappa within [20,20][-20,20] and [30,30][-30,30], respectively, and a log-uniform prior for f0f_{0} within [105,1][10^{-5},1]; see Table 2.

We continue to model the stellar distribution as a histogram as in Section 4.1, but given the fewer number of free parameters in 𝒢(𝒛,𝜸s)\mathcal{G}(\bm{z},\bm{\gamma}_{s}), we increase the resolution. We set up 10 bins for [Fe/H] spanning [0.5,0.5][-0.5,0.5] at 0.1 dex intervals, 5 bins for mass spanning [0.75,1.25]M[0.75,1.25]\;M_{\odot} at 0.1M0.1\,M_{\odot} intervals, 7 bins for ages spanning [0,14][0,14] Gyr at 2 Gyr intervals, which results in the total number of bins of K=10×5×7=350K=10\times 5\times 7=350. The prior PDF for 𝜶\bm{\alpha} is again given by Equation (22).

4.2.1 Exponential Function for Age

First, we assume an exponential function for the age dependence:

𝒢(t,𝜸)=eϵ(t/Gyr).\mathcal{G}^{\prime}(t_{\star},\bm{\gamma}^{\prime})=e^{\epsilon(t_{\star}/\mathrm{Gyr})}. (28)

This form is not necessarily motivated physically but will be useful to characterize the timescale of a monotonic change in the NPPS, which is given by 1/ϵ1/\epsilon. We set a uniform prior for ϵ\epsilon within [5,5][-5,5], allowing for the NPPS to increase, decrease, or remain constant as a function of age.

We obtain 10,000 samples from the posterior distribution in Equation (16), where 𝜸={m,p,logf0,β,κ,ϵ}\bm{\gamma}=\{m,p,\log f_{0},\beta,\kappa,\epsilon\}. The posterior constraints on the parameters 𝜸\bm{\gamma} are summarized in Table 2. The corner plot for 𝜸\bm{\gamma} is shown in Figure 10 in Appendix. The left panel of Figure 9 presents the inferred stellar distribution, and the right panel shows the inferred NPPS as a function of age for HJs (red) and CJs (blue). Here we show the median and 68%/90% equal-tail intervals of f(𝒙|𝜸,𝒛={t,[Fe/H]=0,M=M}f(\bm{x}|\bm{\gamma},\bm{z}=\{t_{\star},\mathrm{[Fe/H]}=0,M_{\star}=M_{\odot}\} integrated over 𝒙\bm{x} and evaluated for the posterior samples of 𝜸\bm{\gamma}. We obtain ϵHJ=0.740.98+0.54\epsilon_{\rm HJ}=-0.74_{-0.98}^{+0.54} for HJs and ϵCJ=0.200.23+0.28\epsilon_{\rm CJ}=0.20_{-0.23}^{+0.28} for CJs (both 90% equal-tail credible intervals) and find that the NPPS of HJs likely decreases with increasing age, while that for CJs is consistent with a constant value, in agreement with the result of the histogram inference. Assuming that ϵHJ<0\epsilon_{\rm HJ}<0, which is satisfied by more than 99% of the posterior samples, the timescale for the decrease of HJ NPPS is found to be 1/ϵHJ=1.350.57+1.09Gyr-1/\epsilon_{\rm HJ}=1.35^{+1.09}_{-0.57}\,\mathrm{Gyr} (68% credible interval).

We find positive metallicity dependency β\beta for both HJ and CJ, which is consistent with prior studies (e.g., Petigura et al., 2018). As expected from the histogram result, we do not find very strong evidence for the stellar mass dependence (i.e., non-zero κ\kappa) in our Sun-like star sample. This is possibly due to the narrower mass range than investigated by Johnson et al. (2010); or such dependence may not really exist (e.g., Zhou et al., 2019). Because we model these dependencies simultaneously with the age dependence, the analysis suggests that the metallicity dependence of giant planets’ NPPS found in previous works is unlikely to be an artifact caused by the dependence on age and its correlation with metallicity.

To check on the possible prior dependence, we repeated a similar analysis by re-parametrizing ϵ=tanθ\epsilon=\tan\theta and by assigning a uniform prior between π/2-\pi/2 and π/2\pi/2 for θ\theta. This choice results in the prior PDF for ϵ\epsilon that is more narrowly peaked around zero than in the model above, putting more weight on the solutions without age dependence. The result was found to be consistent with the above analysis adopting a uniform prior for ϵ\epsilon.

4.2.2 Sigmoid Function for Age

Next, we assume a sigmoid function of the form

𝒢(t,𝜸)=[1+eλ(t/GyrCage)]1.\mathcal{G}^{\prime}(t_{\star},\bm{\gamma}^{\prime})=\left[1+e^{\lambda(t_{\star}/\mathrm{Gyr}-C_{\rm age})}\right]^{-1}. (29)

This function remains almost constant up to around the cut-off age CageC_{\rm age}, and decreases over a timescale characterized by the parameter λ(>0)\lambda\;(>0). We assume a log-uniform prior for λ\lambda within [100.5,101.5][10^{-0.5},10^{1.5}] and a uniform prior for the cut-off age CageC_{\rm age} between 0 and 14. With the two free parameters, this model has a larger flexibility than the exponential model. When λ\lambda is large, 𝒢\mathcal{G}^{\prime} approaches a step function, while when it is small 𝒢\mathcal{G}^{\prime} behaves in a similar manner to the exponential function as adopted in the previous subsection.

From Equation (16), we obtain 10,000 posterior samples, where 𝜸={m,p,logf0,β,κ,λ,Cage}\bm{\gamma}=\{m,p,\log f_{0},\beta,\kappa,\lambda,C_{\rm age}\}. The result of this inference is summarized in Table 2, Figure 9, and Figure 11. The right panel of Figure 9 presents the inferred NPPS for the sigmoid model. Although the parameters λ\lambda and CageC_{\rm age} are strongly degenerate for HJs, the inferred NPPS functions show a similar tendency as seen for the exponential model: the NPPS of HJs decreases with age, while that of CJs does not, at least up to Cage12GyrC_{\rm age}\sim 12\,\mathrm{Gyr}. While the NPPS of HJs at ages less than a few Gyr appears different from the exponential case by the construction of the sigmoid model, the 68% regions from the two models overlap. Similar to the exponential model, the sigmoid model also suggests that the age dependences (i.e., shapes of 𝒢(t,γ)\mathcal{G}^{\prime}(t_{\star},\gamma^{\prime})) for HJs and CJs are likely different; see also the corner plot (Figure 11) in Appendix.

4.3 Conclusions

Both the histogram analysis (Section 4.1) and the other parametric analysis adopting two different functional forms (Section 4.2) consistently suggest that the NPPS of HJs decreases with increasing age. The parametric models give tighter constraints on the NPPS than the histogram one, presumably because of the additional assumption that the NPPS changes smoothly as a function of age. The consistency of the results based on two different parametric models suggests that the inferred age dependence is not too sensitive to how this smoothness is implemented in the NPPS function.

The NPPS of HJs decreases in the latter half of the main-sequence lifetime, while the behavior in the first half is not well constrained partly due to the small number of stars searched (see Figure 9 left); the lack of young stars may be due to the selection of the Doppler survey against active stars that are not suited for precise RV measurements. Although the uncertainty in the inferred NPPS is still large due to the limited number of HJs, the result suggests that the change in the NPPS around older main-sequence stars could well be quite rapid: both of the parametric models suggest that the NPPS decreases by an order of magnitude over a few Gyrs.

In contrast, the NPPS of CJs does not show strong evidence for age dependency during the main sequence. Given that we do not know of physical processes that change their occurrence rate during the main sequence, this absence of age dependence supports that our inference framework is working properly. For evolved stars, on the other hand, the sigmoid model may hint at the decrease of NPPS; we are not fully sure if this is a real trend, given that this may partly be driven by the construction of the sigmoid model, and that the NPPS is not well constrained due to the lack of such old stars. The slight inconsistency in the inferred NPPS of CJs around the oldest stars based on the two parametric models implies that the NPPS of CJs may have a more complicated dependence on age than we assume here and that the available data may already provide such information. It is beyond the scope of the present work to perform a more detailed study of the NPPS evolution for CJs.

In this work, we adopted the MIST models for the stellar isochrone fitting, and have not examined the dependence of the results on the adopted stellar model. Tayar et al. (2022) demonstrated that the model-dependent offsets are typically 5%\sim 5\% in mass and 20%\sim 20\% in age for main-sequence and sub-giant stars. Considering the current precision of the inference, these model-dependent offsets do not significantly affect the above conclusions but may become more important in future analyses using larger samples. Because our analysis is based on the homogeneous analysis of stars using the same stellar model and the observing data with similar qualities, the conclusions based on relative comparisons, such as the NPPS difference between younger and older stars, and the difference between HJs and CJs, are more robust against the model-dependent systematics than those based on the absolute age (e.g., at what age the NPPS starts decreasing).

5 Discussion

5.1 Implications for the Tidal Evolution of HJs

Tidal orbital decay appears to be the most plausible explanation for the decrease of HJ’s NPPS around older stars (Section 4). In principle, the NPPS as a function of age, as obtained in this work, enables the joint inference of the mass-period distribution of HJs before tidal evolution as well as the efficiency of tidal dissipation inside the star. Because the NPPS analysis uses the information for all the stars that have been surveyed, such an analysis can consider both HJs that currently survive and that have already been engulfed in a consistent manner, which is otherwise difficult. It is even possible, at least in principle, to include the dependence of the dissipation efficiency on the star’s mass and age, tidal forcing frequency, and planet’s mass. The information on the “initial” mass–period distribution may be useful to distinguish the origin scenarios of HJs. The empirical constraint on the tidal dissipation efficiency will be useful for improving the theory of tidal dissipation, and also for better understanding the orbital evolution of stellar binaries in general.

We leave this “full” analysis for future work, given that the NPPS function based on the CLS sample still has a large uncertainty. Instead, here we give a simple order-of-magnitude estimate for the tidal dissipation efficiency based on our inference, assuming that the decrease of the HJ’s NPPS is real and solely due to the tidal orbital decay. We characterize the efficiency of tidal dissipation by the modified stellar tidal quality factor, Q=3Q/2k2Q^{\prime}_{\star}=3Q_{\star}/2k_{2}, where QQ_{\star} represents the ratio of the maximum energy stored in the tide to the energy dissipated in one cycle, and k2k_{2} denotes the tidal Love number (equals 3/2 for a homogeneous sphere). Highly dissipative stars have small QQ^{\prime}_{\star} values, and the QQ^{\prime}_{\star} value, in general, depends on tidal forcing frequency and amplitude as well as on the internal structure of the star (e.g., Barker, 2020). If QQ^{\prime}_{\star} is assumed to be constant, the inspiral time for a planet with orbital period PP is analytically given by (e.g., Ogilvie, 2014):

tin\displaystyle t_{\rm in} \displaystyle\simeq 2.3Gyr(Q106)(q103)1(ρ¯ρ¯)53(P3days)133,\displaystyle 2.3\,\mathrm{Gyr}\left(Q^{\prime}_{\star}\over 10^{6}\right)\left(q\over 10^{-3}\right)^{-1}\left(\frac{\bar{\rho}_{\star}}{\bar{\rho}_{\odot}}\right)^{\frac{5}{3}}\left(\frac{P}{3\,{\rm days}}\right)^{\frac{13}{3}},

where qq is the planet-to-star mass ratio, and ρ¯\bar{\rho}_{\star} and ρ¯\bar{\rho}_{\odot} are the mean densities of the host star and the Sun, respectively. The values of qq and PP in the above equation correspond to typical HJs in our sample (see Figure 2), which should be considered as planets that have not yet been engulfed by the stars and mostly preserve their initial orbits (because the change in the orbit is small except for the last moment of tidal engulfment). The results of the parametric inferences in Figure 9 imply that they will be lost during the first half of the main sequence, roughly within 6Gyr\sim 6\,\mathrm{Gyr}. Thus Equation (5.1) implies Q106Q^{\prime}_{\star}\sim 10^{6} for a typical HJ orbiting a typical Sun-like star in our sample. We note that the corresponding inspiral time in Equation (5.1) is also comparable to the 1Gyr\sim 1\,\mathrm{Gyr} timescale inferred from our exponential model. This estimate of the tidal quality factor is essentially similar to log10Q<6.50.6+0.5\log_{10}{Q^{\prime}_{\star}}<6.5^{+0.5}_{-0.6} given by Hamer & Schlaufman (2019) who required tint_{\rm in} to be shorter than the main-sequence lifetime of Sun-like stars. The estimate is also comparable to the theoretical estimate assuming the efficient non-linear dissipation of internal gravity waves for the tidal forcing frequency of 3days/2=1.5days\approx 3\,\mathrm{days}/2=1.5\,\mathrm{days} (see Equation 54 of Barker, 2020).777Barker (2020) derived QQ^{\prime}_{\star} scales as Ptide8/3P_{\rm tide}^{8/3} with PtideP_{\rm tide} being tidal forcing frequency. If we modify Equation 5.1 considering this dependence and assuming Ptide=P/2P_{\rm tide}=P/2, and apply the same argument, we estimate QQ^{\prime}_{\star} at Ptide=0.5daysP_{\rm tide}=0.5\,\mathrm{days} to be 105\sim 10^{5} instead. This also agrees with his Equation 54.

5.2 Implications for Past and Future Survey Results

In this work, we focused on 382382 Sun-like stars from the CLS survey, which includes nine HJs. Due to this small number, the timescale for the decline of the NPPS is still consistent with a wide range of values. Our framework can be extended to transit surveys, and will also be useful to investigate HJ samples that are more than an order of magnitude larger (e.g., Yee et al., 2023). Such applications would provide more precise age dependence of the NPPS, and might also reveal the age dependence of the mass–period distribution of HJs: in this work, we ignored possible stellar parameter dependence of the shape of f(𝒙|𝜸,𝒛)f(\bm{x}|\bm{\gamma},\bm{z}) (i.e., mm and pp in Equation 19 do not depend on stellar parameters) but this assumption may be relaxed with a larger sample.

Our result suggests that the age dependence of the HJ occurrence could be as important as that on metallicity. Then it is crucial to consider the age distribution of the sample stars as well in interpreting the results of different surveys. A general tendency would be that surveys targeting a larger fraction of older, brighter, more evolved Sun-like stars tend to give lower HJ occurrence rates, all else being equal. The age dependency might turn out to be the key to explaining marginally different survey results as mentioned in Section 1 in a unified manner. Such consideration will be even more important as the Nancy Grace Roman Space Telescope will uncover a large number of HJs orbiting stars across the Milky Way, including those in the Galactic Center (Montet et al., 2017; Miyazaki et al., 2021; Wilson et al., 2023) where we expect a large difference in the age distribution from the solar neighborhood. It is also possible for our framework to take into account the NPPS dependence on stellar properties other than mass, age, and metallicity, such as the position within the Galaxy.

6 Summary

In this paper, we developed a Bayesian framework to infer the number of planets per star (NPPS) as a function of both planetary and stellar properties, which must involve the simultaneous inference of the stellar parameter distribution in the surveyed stars (Section 3). Our framework can handle large and heterogeneous uncertainties in the degenerate stellar parameters by incorporating the information on the entire likelihood function for the stellar parameters. We applied the framework to hot Jupiters (HJs) and cold Jupiters (CJs) orbiting Sun-like stars from the California Legacy Survey (Section 2) to derive their NPPSs as a function of the stellar mass, age, and metallicity, which were estimated by fitting the isochrone models to their measured spectroscopic parameters, Gaia DR3 parallaxes, and 2MASS KsK_{\rm s}-band magnitudes. We found evidence that the NPPS of HJs decreases in the latter half of the main-sequence lifetime of their Sun-like host stars over the timescale of several Gyr, while the NPPS of CJs does not show the same trend (Section 4). The result implies that the modified stellar tidal quality factor QQ^{\prime}_{\star} is on the order of 10610^{6} for a typical HJ in the sample on a 3day\approx 3\,\mathrm{day} orbit (Section 5).

Acknowledgments

The authors thank Luke Bouma, Morgan MacLeod, Kaloyan Penev, and Josh Winn for their helpful comments on the manuscript. We also thank the anonymous reviewers for constructive comments that helped us improve the manuscript. We sincerely appreciate all the efforts and contributions to constructing the California Legacy Survey and making their data and codes publicly available. This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular, the institutions participating in the Gaia Multilateral Agreement. This publication makes use of data products from the Two Micron All Sky Survey, which is a joint project of the University of Massachusetts and the Infrared Processing and Analysis Center/California Institute of Technology, funded by the National Aeronautics and Space Administration and the National Science Foundation.

Appendix A Special Cases of the Formulation in Section 3

Here we describe special cases of our general formulation presented in Section 3, and show that the results in previous works are recovered.

A.1 Ignoring Dependency on Stellar Properties

When the dependency of the NPPS function on stellar properties 𝒛\bm{z} is ignored, Equation (11) reduces to:

p(Dj,Hj|𝜶,𝜸)=p(Hj|𝜸)p(Dj|𝜶)\displaystyle p(D_{j},H_{j}|\bm{\alpha},\bm{\gamma})=p(H_{j}|\bm{\gamma})p(D_{j}|\bm{\alpha}) (A1)
=\displaystyle= (𝒙jHjf~j(𝒙j|𝜸))exp[f~j(𝒙|𝜸)𝑑𝒙]\displaystyle\left(\prod_{\bm{x}_{j}\in H_{j}}\widetilde{f}_{j}(\bm{x}_{j}|\bm{\gamma})\right)\exp{\left[-\int\widetilde{f}_{j}(\bm{x}|\bm{\gamma})d\bm{x}\right]}
×p(Dj|𝒛j)p(𝒛j|𝜶)d𝒛j.\displaystyle\times\int p(D_{j}|\bm{z}_{j})p_{\star}(\bm{z}_{j}|\bm{\alpha})d\bm{z}_{j}.

Therefore

(𝜶,𝜸)=(𝜸)(𝜶),\displaystyle\mathcal{L}(\bm{\alpha},\bm{\gamma})=\mathcal{L}(\bm{\gamma})\,\mathcal{L}(\bm{\alpha}), (A2)

where

(𝜸)\displaystyle\mathcal{L}(\bm{\gamma}) =\displaystyle= jN(𝒙jHjf~j(𝒙j|𝜸))exp[f~j(𝒙|𝜸)𝑑𝒙],\displaystyle\prod^{N_{\star}}_{j}\left(\prod_{\bm{x}_{j}\in H_{j}}\widetilde{f}_{j}(\bm{x}_{j}|\bm{\gamma})\right)\exp{\left[-\int\widetilde{f}_{j}(\bm{x}|\bm{\gamma})d\bm{x}\right]},
(𝜶)\displaystyle\mathcal{L}(\bm{\alpha}) =\displaystyle= jNp(Dj|𝒛j)p(𝒛j|𝜶)𝑑𝒛j.\displaystyle\prod^{N_{\star}}_{j}\int p(D_{j}|\bm{z}_{j})p_{\star}(\bm{z}_{j}|\bm{\alpha})d\bm{z}_{j}. (A4)

This means that the NPPS function and the stellar parameter distribution can be inferred independently. The likelihood in Equation (A.1) has been adopted in previous studies of the occurrence rate that do not take into account its dependence on stellar parameters (e.g., Foreman-Mackey et al., 2014; Fulton et al., 2021).

A.2 Ignoring Dependency on Planet Properties

Let us consider the other extreme case, where we instead ignore the dependency of the NPPS function on the properties of planets. In this case,

f(𝒙|𝒛)𝒙=0f(𝒙|𝒛)=h(𝒛)Δ𝒙,\displaystyle{\partial f(\bm{x}|\bm{z})\over\partial\bm{x}}=0\quad\rightarrow\quad f(\bm{x}|\bm{z})={h(\bm{z})\over\Delta\bm{x}}, (A5)

where h(𝒛)h(\bm{z}) denotes the NPPS as a function of 𝒛\bm{z} alone, and Δ𝒙\Delta\bm{x} is the volume of the domain of 𝒙\bm{x}. The latter domain implicitly defines what is counted as planets in the NPPS hh. Then the part of Equation (11) relevant for the planets becomes as follows:

(𝒙jHjf~j(𝒙j|𝒛j,𝜸))exp[f~j(𝒙|𝒛j,𝜸)𝑑𝒙]\displaystyle\left(\prod_{\bm{x}_{j}\in H_{j}}\widetilde{f}_{j}(\bm{x}_{j}|\bm{z}_{j},\bm{\gamma})\right)\exp{\left[-\int\widetilde{f}_{j}(\bm{x}|\bm{z}_{j},\bm{\gamma})d\bm{x}\right]} (A6)
=\displaystyle= (𝒙jHjηj(𝒙j)h(𝒛j,𝜸)Δ𝒙)exp[h(𝒛j,𝜸)Δ𝒙ηj(𝒙)𝑑𝒙]\displaystyle\left(\prod_{\bm{x}_{j}\in H_{j}}\eta_{j}(\bm{x}_{j})\frac{h(\bm{z}_{j},\bm{\gamma})}{\Delta\bm{x}}\right)\exp{\left[-\frac{h(\bm{z}_{j},\bm{\gamma})}{\Delta\bm{x}}\int\eta_{j}(\bm{x})d\bm{x}\right]}
\displaystyle\propto (𝒙jHjηj(𝒙j))[h(𝒛j,𝜸)]njexp[η¯jh(𝒛j,𝜸)],\displaystyle\left(\prod_{\bm{x}_{j}\in H_{j}}\eta_{j}(\bm{x}_{j})\right)\left[h(\bm{z}_{j},\bm{\gamma})\right]^{n_{j}}\exp{\left[-\bar{\eta}_{j}h(\bm{z}_{j},\bm{\gamma})\right]},

where

η¯j=1Δ𝒙ηj(𝒙)𝑑𝒙\bar{\eta}_{j}={1\over\Delta\bm{x}}\int\eta_{j}(\bm{x})\,d\bm{x} (A7)

denotes the detection efficiency ηj(𝒙)\eta_{j}(\bm{x}) averaged over the domain of 𝒙\bm{x}. The product 𝒙jHjηj(𝒙j)\prod_{\bm{x}_{j}\in H_{j}}\eta_{j}(\bm{x}_{j}) is constant when ηj\eta_{j} is given and so can be ignored in the inference. Then Equation (11) is simplified as:

p(Dj,Hj|𝜶,𝜸)\displaystyle p(D_{j},H_{j}|\bm{\alpha},\bm{\gamma})
\displaystyle\propto p(Dj|𝒛j)p(𝒛j|𝜶)[h(𝒛j,𝜸)]njexp[η¯jh(𝒛j,𝜸)]𝑑𝒛j.\displaystyle\int p(D_{j}|\bm{z}_{j})\,p_{\star}(\bm{z}_{j}|\bm{\alpha})\left[h(\bm{z}_{j},\bm{\gamma})\right]^{n_{j}}\exp{\left[-\bar{\eta}_{j}h(\bm{z}_{j},\bm{\gamma})\right]}d\bm{z}_{j}.

This is essentially the likelihood function adopted in Masuda (2022), who used the binomial distribution (instead of Poisson) to model the fraction of stars showing significant rotational modulation.

Appendix B Corner plots for the Parameters in the Parametric Inference

In Figures 10 and 11, we present the corner plots for the posterior distributions of the parameters in the NPPS models adopting the exponential and sigmoid forms (Section 4.2), respectively. The left panels show the results for HJs, and the right ones are for CJs.

Refer to caption
Refer to caption
Figure 10: Corner plots for the posterior distributions of the parameters in the exponential NPPS model. The left panel is for HJs and the right panel is for CJs. The figure was created using corner.py (Foreman-Mackey, 2016).
Refer to caption
Refer to caption
Figure 11: Same as Figure 10 but for the sigmoid model.

References