Constraints from dwarf galaxies on black hole seeding and growth models
with current and future surveys
Abstract
Dwarf galaxies are promising test beds for constraining models of supermassive and intermediate-mass black holes (MBH) via their black hole occupation fraction (BHOF). Disentangling seeding from the confounding effects of mass assembly over a Hubble time is a challenging problem, that we tackle in this study with a suite of semi-analytical models (SAMs). We show how measured BHOF depends on the lowest black hole mass or AGN luminosity achieved by a survey. To tell seeding models apart, we need to detect or model all AGN brighter than in galaxies of . Shallower surveys, like eRASS, cannot distinguish between seed models even with the compensation of a much larger survey volume. We show that the AMUSE survey, with its inference of the MBH population underlying the observed AGN, strongly favors heavy seed models, growing with either a power-law Eddington Ratio Distribution Function (ERDF) or one in which black hole accretion is tagged to the star-formation rate (AGN-MS). These two growth channels can then be distinguished by the AGN luminosity function at , with the AGN-MS model requiring more accretion than observed at z 0. Thus, current X-ray observations favour heavy seeds whose Eddington ratios follow a power-law distribution. The different models also predict different radio scaling relations, which we quantify using the fundamental plane of black hole activity. We close with recommendations for the design of upcoming multi-wavelength campaigns that can optimally detect MBHs in dwarf galaxies.
1 Introduction
Massive black holes (MBHs) are expected to reside in the core of every galaxy at , but their occupation fraction in galaxies at lower stellar masses are poorly constrained. Unlike stellar mass black holes, the origin of MBHs remains an open question, in particular, if there exist multiple pathways for their formation. Key candidate formation channels include remnants of Population III stars (Bromm & Loeb, 2003), merged remnants in nuclear star clusters (Portegies Zwart et al., 2004; Gürkan et al., 2004; Baumgardt, 2017; Natarajan, 2021), the direct collapse of pre-galactic gas disks in which fragmentation and star formation is suppressed (DCBH) Begelman et al. (2006); Lodato & Natarajan (2006); Mayer et al. (2010); Tanaka & Li (2014); Inayoshi et al. (2015)), or dark stars, powered by DM annihilation rather than nuclear fusion Spolyar et al. (2008); Freese et al. (2010); see also the recent review by Volonteri et al. (2021). Several seeding models are capable of reproducing observations in higher-mass galaxies under different assumptions for black hole growth (Milosavljević et al., 2009; Valiante et al., 2016; Ricarte & Natarajan, 2018a), because observations largely probe later times in the Universe relative to the extremely early seed formation epochs. The models are all calibrated to match observations and reproduce observations of high black hole/galaxy masses as a function of time. However, to truly distinguish between the models, we either require observations at when seeds are expected to assemble (Treister et al., 2013; Ricarte & Natarajan, 2018b; Lusso et al., 2022) and/or lower mass galaxies at recent epochs (see Reines, 2022, for a review).
Despite being modulated by complex growth mechanisms over cosmic time, some signatures of the initial MBH seeding mechanisms have been demonstrated to survive in late-time observables, primarily the low-redshift occupation fraction of MBH in dwarf galaxies (Ricarte & Natarajan, 2018a; Volonteri et al., 2021). While most, if not all, massive galaxies seem to host central MBHs with masses that are empirically seen to be correlated with host galaxy properties (Kormendy & Ho, 2013; McConnell & Ma, 2013; Bogdán & Goulding, 2015), many low-mass galaxies may not host a central MBH at all; and if they do, may possibly follow different scaling relations (Fontanot et al., 2015) or none at all (Baldassare et al., 2020). Multi-wavelength observations have been finding ever fainter MBHs in dwarf galaxies in the radio (Reines et al., 2020), via stellar dynamics (Baldassare et al., 2015; Nguyen et al., 2018), and from optical line ratios/variability (Reines et al., 2013; Moran et al., 2014; Baldassare et al., 2018; Mezcua & Domínguez Sánchez, 2020). Any census of these low-mass MBHs is bound to be incomplete since some are too faint – perhaps even completely inactive – and therefore evade detection by these methods that rely on accretion activity. To infer the full occupation fraction of MBH, regardless of activity, one must incorporate detections as well as upper limits across multiple wavelengths and rely on some, preferably flexible models. Such work has been attempted successfully in the X-ray (Miller et al., 2015; Gallo & Sesana, 2019) and optical (Kelly & Shen, 2013).
But how to test MBH seed and growth models with the observed occupation fractions? The black holes that exist at , by and large, do not reflect the masses of initial seeds, as these are the result of growth averaged over cosmic time (Mezcua, 2019). We require models that trace the MBH population detected locally back to their high redshift seeds via growth histories. Semi-analytic models (SAMs) have proved to be a computationally efficient way to do so, as they involve populating and assembling halos over cosmic time, using well-defined physical models for black hole growth. In one such set of models, halos are populated with MBHs adopting various scaling relations, that permit tracking their growth over cosmic time in secular as well as merger-driven accretion modes (Ricarte & Natarajan, 2018b, a; Ricarte et al., 2019). SAMs enable fairly comprehensive probing of the parameter space of growth trajectories coupled with the different seeding mechanisms, quantifying how their combinations produce very different populations at . In this paper, we probe these differences in detail and examine which observables offer powerful discrimination, specifically on the issue of initial seeding.
The paper is structured as follows. In §2, we present an overview of the semi-analytical model and the scaling relations used to create the mock catalogs. §3 compares the black hole occupation fraction using different cuts on black hole mass and luminosity and compares to X-ray observations. §4 makes recommendations for upcoming multi-wavelength surveys, discussing the capabilities and limitations of each and the synergies between them. We close with conclusions and future prospects in §5.
2 Tracking black hole growth over cosmic time
2.1 Semi-Analytic Models
By construction, SAMs are set up to explore a range of seeding and growth modes for a population of black holes as a function of redshift. Time slices from these SAMs can then be compared to measured black holes. In this work, we focus on the SAM developed by Ricarte & Natarajan (2018b) and explore new predictions for the low mass end of the local black hole mass function. In particular, we include the properties of the host galaxies explicitly in the scheme and do so flexibly by exploring multiple scaling relations between them and MBH properties. It is this new feature that allows us to delve more deeply into further predictions in combination with the Bayesian inference framework in this work.

Intriguingly, as first noted and discussed in detail in Ricarte & Natarajan (2018a), signatures of seeding survive to late times via the observed occupation fraction though there is dependence on the details of their mass assembly history. There are currently other SAMs from Somerville et al. (2008) & Bower et al. (2010) and updated versions thereof, that do not focus on seeding and include recipes for star and galaxy formation as well as MBH growth, and these models inevitably have many more tuneable parameters. Here, we stick with a pared-down, phenomenologically driven model that is primarily focused on seeding signatures from nearly local observations (). The Ricarte & Natarajan (2018a) SAM considers two seed models: either a massive and rare “heavy” seed typically produced from direct collapse following the seeding prescription of Lodato & Natarajan (2006); Volonteri et al. (2008), or abundant “light” seeds drawn from a power law initial mass function between 30 and 100 solar masses motivated by cosmological simulations Hirano et al. (2015). For each seed, the SAM explored in Ricarte & Natarajan (2018b) follows three different MBH growth prescriptions:
-
•
Power Law (PL) Model: In this model, major halo mergers trigger a burst of growth at the Eddington rate until MBHs reach the mass determined by the observed local relation. When MBHs are not in the burst mode, their accretion rates are drawn from a power law tuned to help reproduce low-redshift luminosity functions (Ricarte & Natarajan, 2018b).
-
•
AGN Main Sequence (AGNMS) Model: Major halo mergers trigger burst mode growth at the Eddington rate until MBHs reach the relation. When MBHs are growing secularly, their accretion rates are set to be the star formation rate of their host. This mode is motivated by empirical relation determined by Mullaney et al. (2012). Since star formation is not actually tracked in this SAM, scaling relations are instead used to estimate stellar masses of galaxies as a function of halo mass and redshift (Ricarte & Natarajan, 2018b).
-
•
Broad-line Quasars (BLQ) Model: Major halo mergers (defined to be those with a halo mass ratio of at least 1:10) trigger burst mode growth. Additionally, at each time step the accretion rate is drawn from a distribution of Eddington rates observationally inferred for broad-line quasars in SDSS (Kelly & Shen, 2013). Specifically, we adopt a redshift-evolving log-normal distribution fit by Tucci & Volonteri (2017)111Here, the probability distribution of Eddington ratios is given by , where and . No additional steady accretion mode for mass growth is added since the AGN luminosity function is adequately predicted with this prescription (Ricarte et al., 2019). However, as discussed in this work, this assumption fails when considering deep surveys of dwarf galaxies that are not selected a priori to be AGN.
In this model, only 10 of the halo mergers lead to black hole mergers. This is done to prevent the overgrowth of BHs in the most massive halos, whose BHs would otherwise grow mostly from BH-BH mergers at low-redshift. We do not expect this assumption to have an impact on the dwarf galaxies considered in this work, which have much lower merger rates. In our study, we do not include satellite halos, since the SAM does not include a prescription for environmental processes such as ram-pressure/tidal stripping, which in turn is expected to affect the stellar mass as well as the growth of MBH. To increase our sample size, we compile the output from three snapshots at: , , and . This yields a total of 6,579 halos per seed growth model, so the uncertainties from Poisson (small number) statistics on the predictions are small, even in the lower stellar mass bins. The scaling relations between galaxy and MBH properties evolve very marginally in this redshift range, and therefore no further modification is applied to the halos.
2.2 Creating mock catalogs from the SAM
The SAMs provide the mass and mass accretion rates for MBHs at every snapshot. The simplest way to compute the luminosity is to convert the accretion rate into a bolometric luminosity with some efficiency , and then use a bolometric correction factor to infer the luminosity in some specific energy/wavelength range. Like much of the literature, we assume the radiative efficiency of accretion = 0.1. The SAM is known to contain too little scatter (Ricarte & Natarajan, 2018a) since the AGN are essentially always on. In practice, a duty cycle would increase the scatter in the observed instantaneous luminosity of the black holes. Meanwhile, we know that relatively complete surveys like AMUSE (Gallo et al., 2008; Miller et al., 2012), with a limiting luminosity of at 0.5-7.0 keV, find AGN in of galaxies with . AMUSE is a remarkably uniform and complete survey of AGN in nearby galaxies performed with the Chandra X-ray telescope, observing 200 galaxies – some in the field, some in the Virgo galaxy cluster – to a limiting flux of , shown in Figure 1 with a dotted line. While more recent catalogs exist, they are compiled from the Chandra archive and are thus necessarily non-uniform in exposure (e.g. Gallo & Sesana, 2019). We measure the detected fraction of AGN in each SAM as:
(1) | ||||
(2) |
where is the relative abundance of the host galaxy stellar mass in the SAM compared to the observations. The sum in the numerator is over the galaxies with and in the denominator is over all galaxies. The weighting corrects for the different stellar mass functions of the SAMs and the observed sample. Both numerator and denominator include galaxies at . The power-law and AGN-MS growth models predict the existence of many AGN below the AMUSE flux limit, shown as the dotted horizontal line in each panel.
If the SAMs are taken at face value, 75-90 of the galaxies in the AGN-MS and PL models would have had AGN detections in an AMUSE-like survey. We, therefore, add scatter to the in these SAMs until the detected fraction, as defined above, is for an AMUSE-like survey. These values are shown in Table 1, and the relations for the SAMs with the added scatter are shown in Figure 2.

Shen et al. (2020) measured the bolometric correction for the keV and keV X-ray bands and for the infrared B-band, as a function of bolometric luminosity. This corresponds to in each of the X-ray bands and in the B-band, each with a scatter of about 50. For each black hole, we draw from the bolometric correction from a normal distribution with the mean and scatter based on its bolometric luminosity.

3 Results
In this section, we compare the BHOF for the six SAM models (3 accretion modes 2 seeding models) as detailed above with published values in the literature, applying cuts to the SAM outputs to match the observational selection functions.
3.1 Occupation fractions & selection criteria
We focus first on how observational selection can impact the derived BHOF and explore the optimal future survey strategies that will enable potentially discriminating between seeding models. We assume that black hole masses are available from the full range of possible observational probes and documented empirical correlations - either from dynamical modeling of galaxy cores or other techniques. AGN, for instance, have been identified via their optical variability (Baldassare et al., 2018, 2020) or line ratio diagnostics (Polimera et al., 2022; Cid Fernandes et al., 2010; Kewley et al., 2013); masses are then measured via reverberation mapping (e.g. Peterson et al., 2004) or direct dynamical modeling (e.g. Walsh et al., 2013; Seth et al., 2014; Nguyen et al., 2017), and then used to construct scaling relations based on easily observable quantities like emission line strengths. Therefore, we proceed with the premise that we have in hand measurements/upper limits for every single galaxy in our SAM survey sample. We first investigate the impact of the completeness of mass measurement on our inference of the BHOF.
Figure 3 shows the occupation fraction for different values of black hole mass completeness. If we assume in the best-case scenario that all black holes above can be detected, then the Pop III seeds produce a higher occupation fraction than DCBH models. This is particularly noticeable for galaxies with . We note that this clearly reflects the efficiency and abundance of initial seeds, light seeds are more ubiquitous and are expected to form more efficiently compared to heavy seeds in the SAMs. If, however, we can only detect BHs more massive than , then light seeds growing through the AGN-MS channel predicts for all stellar masses; for the other two growth channels, seed models become indistinguishable for . If we can only find MBHs above , information about seeding cannot be cleanly disentangled from accretion. These findings clearly show that the detectable present-day mass needs to be close to the initial assumed ”heavy” seed mass from high redshifts, namely, , in order to be disentangled from the confounding effects of the accretion physics.
In summary, if the initial seeds have not grown much by any of the accretion modes or mergers, then their memory of seeding is retained in the BHOF as measured even at low redshift.

In practice, MBHs are usually detected when they are accreting and luminous. The scaling relations used to measure black hole masses regardless of wavelength deployed or method used, all suffer from large-intrinsic scatter (Woo & Urry, 2002) and are most reliable at X-ray wavelengths where the obscuration is the lowest. Therefore, we stick to X-ray observations for comparison with SAMs for the rest of this study. Figure 4 shows the fraction of galaxies with AGN detected at keV as a function of the survey detection limit. The BLQ prescription is ruled out because it predicts almost no AGN in dwarf galaxies. A relatively shallow survey of would only be able to marginally distinguish between the AGN-MS and PL growth channels modes, with the former predicting 10-20 more detections in dwarf galaxies; if all AGN above are found, the detected fraction is identical for both these growth channels and both growth channels. We need a survey that either directly detects, or can model the presence of, all AGN with luminosities down to if the heavy and light seed models are to be clearly distinguished.
Figure 3 and 4 reveal that most of the differences between the models are apparent at the lowest halo masses. Firstly, this highlights the importance of using a parent sample of galaxies that is stellar mass complete down to , pushing down an order of magnitude compared to current observational limits. Figure 5 therefore shows the occupation fraction of galaxies with stellar masses in the range as a function of the minimum detectable black hole mass. If all the information we have is the black hole and stellar masses, we can only tell different seed models apart if we find every MBH at least as massive as . If we can separately confirm that the growth channel is like the AGN-MS model, then we can distinguish seeds using all MBH above .

In practice, again, we almost always detect black holes when they are luminous. Figure 6 shows the fraction of galaxies with stellar mass with detectable AGN, as a function of survey limiting luminosity. Heavy and light seeds can be distinguished if the survey has a limiting luminosity better than .
Figure 7 shows the AGN bolometric luminosity function for the six models, with observations from Ueda et al. (2011) in black for comparison. The uncertainties incorporate 1000 draws from the uncertainty in the bolometric corrections; the BLQ model uncertainties are not shown because, due to its low fraction of active MBHs, has error bars down to 0 at all luminosities. The AGN-MS and PL models have diverging luminosity functions above , with the former model requiring more growth at late times to match the relation. The PL model is thoroughly consistent with the observations, although the AGN-MS model cannot be entirely ruled out due to the large scatter required to bring it into an agreement with the observed active fraction in the AMUSE survey.


3.2 Coupling the SAMs to Bayesian inference
As noted above, the BHOF depends on the minimum AGN luminosity that we can measure or model. Studies like Miller et al. (2015) compute the BHOF from the observed AGN luminosities and upper limits, using a model where the relation contains log-normal scatter. Therefore, this BHOF accounts for AGN even below the nominal luminosity limit of the survey, which is . Since they did not publish the scaling relations inferred by their Bayesian model, we re-measure it here. To recap, the original study assumed that the BHOF had the functional form of a sigmoid function:
(3) |
The tuning parameter dictates how rapidly the occupation fraction jumps from 0 to 1; in other words, a higher value for this parameter means that galaxies with lower stellar masses are less likely to host SMBH than their more massive counterparts. Put another way, a higher means a lower BHOF.
A Bayesian inference model then constrains simultaneously with the normalisation, slope and scatter of the relation:
(4) |
This approach thus assumes 1) that the AGN duty cycle can be described in terms of the time spent below the detection limit if the scaling relation is a power law with log-normal scatter and 2) that we have no prior knowledge of this scaling relation. Again, Figure 2 shows the relation for each SAM, with points below the AMUSE detection threshold shown as translucent.
Right away, we can rule out the BLQ prescription as the primary growth channel, because this predicts that only of the galaxies at contain AGN, far below the detection fraction in AMUSE. This prescription was motivated from observations of the brightest quasars, the high luminosity end of the SDSS quasar LF. Since the abundance of bright quasars drops precipitously at low redshifts, the BLQ SAM is self-consistent in predicting low active fractions at low-z. The fact that this differs with observations of low-mass galaxies shows that this growth channel, which is bursty and merger-driven, cannot be important in dwarf galaxies. Next, we aim to distinguish between the AGN-MS and PL growth channels.
Model | ||||
---|---|---|---|---|
PopIII - PL | 0.5 | 0.50 | 34.0 | 1.13 |
PopIII - AGN-MS | 1.9 | 0.36 | 34.8 | 0.28 |
DCBH.- PL | 0.6 | 0.6 | 37.3 | 1.28 |
DCBH - AGN-MS | 1.8 | 0.35 | 34.7 | 0.29 |
Both the power-law and AGN-MS, models predict very high (75-99) occupation fractions. From Figure 4, we know that we need to apply a luminosity cut to the SAMs before comparing them to observations. For the Miller et al. (2015) study, this is not the same as the limiting luminosity of the observations. Rather, the BHOF in that study includes all the AGN whose luminosity falls within log-normal scatter from the mean value of the power-law relation. Therefore, we start by measuring this .
The probability of a parameter set for detections is defined as:
(5) |
where is the normal probability distribution function centered at with standard deviation . For a -th non-detection, the probability that there is a black hole is given by the latent variable , which in Miller et al. (2015) depended only on the scaling relation and occupation fraction.
(6) |
where is the cumulative probability of the normal distribution. This says that a non-detection means either that there is no black hole, or there is a black hole and it is on but the luminosity is below the detection limit . The latent variable is then set to either 1 or 0 by drawing from a binomial distribution with . For further details of the inference pipeline, we refer the reader to §2 of Miller et al. (2015).

As a test, we run the AMUSE sample through an MCMC pipeline four times, each time using a different scaling relation between in Table 1, inferred from Figure 1. The resulting posterior on , the stellar mass at which occupation fraction is , is shown in Figure 8. Surprisingly, these posteriors are almost indistinguishable from each other.
The best-fit value of for AMUSE is 2; as seen in Table 1, this is very comparable to the total scatter required in each SAM to match the AMUSE detection fraction. Thus, the BHOF inferred assuming an relation with such a large scatter would capture all the AGN in the SAMs with both the PL and AGN-MS growth channels, and no luminosity cut needs to be applied on the SAM MBHs.
Finally, Figure 9 compares the AMUSE constraints applied to each of the four SAMs. The observations are clearly consistent with the heavy seed models growing via either the PL or AGN-MS channels. The light seed models, even with large scatter in their relation, cannot be reconciled with an X-ray observed sample like AMUSE.



3.3 Prospects for future surveys
A key result of our study is that distinguishing between MBH seeding and growth models relies crucially on detecting the least massive MBH (or least luminous AGN) in galaxy samples that are stellar mass complete down to at least . We show here how such a survey could be constructed by following up on existing surveys.
The RESOLVE survey (Eckert et al., 2015) was designed to be complete in baryonic mass down to , and thus to even lower stellar masses, within ( Mpc). Figure 10 shows the spatial (left) and stellar mass (right) distribution of the RESOLVE galaxies, with the latter demonstrating stellar mass completeness above . The orange histogram shows only the galaxies above , where emission from the Milky Way is negligible, and even faint extra-galactic AGN could be detected. In principle, a survey could include galaxies at as long as they are in the direction opposite to the Galactic Center; we are being conservative in our recommendation here to account for telescope scheduling difficulties, which may make it difficult to point at extragalactic fields at all times. The RESOLVE survey contains such galaxies.
Using this subset of galaxies as a parent sample, we create three different mock catalogs. Each assumes a different value of and thus a different BHOF. Where a MBH exists, we assign an X-ray luminosity drawn from the AMUSE observed relation, including the scatter. We pass it through the Bayesian inference pipeline described above, assuming survey luminosity limits of and . Figure 11 shows the posterior distributions recovered for each of the mocks. Higher occupation fractions are harder to rule out than lower ones, due to the large scatter inherent in the relation. While broad, the mode of each distribution matches the input value for . For shallower surveys, if the intrinsic BHOF is high, the survey would return a flat posterior on even for this high level of completeness. On the one hand, this suggests that the uncertainties obtained from the AMUSE survey may be biased low. On the other hand, it confirms that higher values of would have been ruled out with high confidence.
On the other hand, we have checked that the eFEDS survey, despite its uniform coverage overlapping with 500 times more galaxies than AMUSE and detection of several new AGN candidates in dwarf galaxies (Latimer et al., 2021a), yields flat posteriors on , and that this situation is not expected to improve even with the deepest, polar regions of eRASS8. The key takeaway from this analysis is that survey depth, and the resulting completeness in stellar mass and MBH mass/luminosity, are much more powerful than survey volume when measuring the underlying BHOF.

4 Discussion
4.1 Confusion with stellar black hole systems
A key challenge at low MBH masses, and therefore luminosities, is confusion with other sources such as Utra- or HyperLuminous X-ray sources (ULXs/HLXs) (Swartz et al., 2004, 2011), which are likely the high Eddington-ratio end of the accreting stellar black hole population (Stobbart et al., 2006; Sutton et al., 2013), although some studies argue that the brightest of these may indeed be intermediate-mass black holes (Miller et al., 2003; Bellovary et al., 2010; Sutton et al., 2012). ULXs may be differentiated from IMBHs by a lack of radio emission or a position offset from the galaxy center (Thygesen et al., 2023), although Bellovary et al. (2021) showed that in the shallow potentials of dwarf galaxies, MBHs could live off-center. Multi-band X-ray observations can enable spectral studies to further differentiate between MBHs and high-accreting stellar black holes, and multi-epoch observations may find different variability scales. Further contamination at the faintest ends may arise from populations of X-ray binaries (XRBs), although this can potentially be modeled via low-scatter scaling relations with the stellar mass and star-formation rate (Lehmer et al., 2010, 2019).
4.2 What would a low-redshift multi-wavelength campaign need to look like?
Whether the black hole selection is by mass (Figure 3) or luminosity (Figure 4), seeding models cannot be distinguished unless we find every MBH . This finding was also independently arrived at in recent work by Burke et al. (2022) using future variability surveys. These smallest MBHs have been recently discovered using optical variability (Baldassare et al., 2018) and line diagnostics Baldassare et al. (2015); Mezcua & Domínguez Sánchez (2020); Polimera et al. (2022). Emission line widths, such as , can yield black hole masses, although these can easily be confused with ongoing star formation.
MBHs are, however, notorious for the wide variety and variability of their Spectral Energy Distributions (SEDs). This means that we do not yet have a complete understanding of how the emission from the reprocessed material feeding a black hole is distributed at different wavelengths. Below, we summarize some of the opportunities and challenges at various wavelengths for future low redshift surveys.
4.2.1 Optical and infrared
Optical emission from AGN is highly susceptible to obscuration by dust, dust either in the torus of the AGN itself or elsewhere in the host galaxy (e.g. Barger et al., 2005). Dust-absorbed optical photons are then re-emitted in the infrared (IR); however, infrared surveys thus far have lacked the sensitivity and/or spatial resolution to capture this population well enough (Latimer et al., 2021b). Marleau et al. (2017) found 62 AGN candidates in dwarf galaxies () in the WISE all-sky survey, but black hole masses require an assumption about Eddington ratios and therefore span three orders of magnitude. The current situation stands to be transformed by low-redshift data from the James Webb Space Telescope (JWST), particularly if there are appropriately designed surveys with well-understood selection functions.
4.2.2 Spectroscopic line ratios
A common way to find AGN is using spectroscopic line ratios, but these selection criteria were developed for more massive black holes and can fail for dwarf AGN. Polimera et al. (2022) found a population of AGN in dwarf galaxies using new line ratio diagnostics, exceeding previous detection rates in the galaxy sample by 3-16. Mezcua & Domínguez Sánchez (2020) found that for 23 of 34 AGN in dwarf galaxies with resolved IFU spectroscopy, the AGN signal gets washed out if integrated over the entire galaxy. Recently, Cann et al. (2018) has shown that more IMBHs should show up via coronal emission lines in JWST observations of nearby dwarfs.
4.2.3 Optical/IR variability selection
Using a similarly phenomenologically motivated model, Burke et al. (2022) recently presented forecasts for the detection prospects of actively accreting black holes hosted in dwarf galaxies using their optical variability. Focusing on the intermediate black hole mass regime, they use observational constraints on optical variability as a function of black hole (BH) mass to generate mock light curves. In a similar vein to the work presented here, adopting several different models for the BH occupation function, including one for off-nuclear IMBHs, they quantify differences in the predicted properties of black hole mass and luminosity functions in local dwarf galaxies. Modeling the fraction of variable accreting black holes as a function of host galaxy stellar mass, including selection effects, they report that the BHOFs for ‘heavy’ and ‘light’ initial seeding scenarios can be discriminated with variability alone at the level for galaxy host stellar masses below with data from the upcoming planned synoptic surveys with the Vera C. Rubin Observatory. Current variability-centered searches for low mass BHs are starting to yield dividends, for instance, Ward et al. (2022) find 200 dwarf AGN using optical variability, and show that of them would be missed by spectroscopic searches.
4.2.4 Radio wavelengths
AGN have also been detected in nearby dwarf galaxies through their radio emission, which stands out due to its very steep spectral index (Yang et al., 2022; Sargent et al., 2022; Davis et al., 2022). Liodakis (2022) models the expected emission from jets in intermediate black mass holes in dwarf galaxies and predicts that of them will be detectable with ngVLA, SKA and the Rubin observatory.
Using the fundamental plane of black hole activity (Merloni et al., 2003), we can use black hole masses and X-ray luminosities to predict the expected radio emission for the different SAMs. The most recent measurement of this relation by Gültekin et al. (2019):
(7) |
where the is the radio continuum luminosity at 5 GHz and is the X-ray luminosity integrated over 2-10 keV. The best fit to their observations was , and , with an intrinsic scatter of -0.04 in log(). Figure 12 shows the predictions derived using this observed fit for the expected for each of the SAMs. The heavy seed models predict slightly higher radio emission for dwarf AGN, but almost no sources in galaxies with . Additionally, in the heavy seed scenario, the PL growth channel shows larger scatter in radio luminosities for the faint AGN in dwarf galaxies. This allows an additional way to distinguish between growth channels, other than the high-end luminosity function mentioned above, without having to assume that the same growth channels dominate for MBHs of all masses/luminosities.

Combining optical, infrared, and X-ray observations would allow a self-consistent determination of the bolometric luminosity, removing the need for the uncertain bolometric correction factors that are currently adopted (Thornton et al., 2008; Koliopanos et al., 2017). Such a follow-up campaign to measure the masses of all the black holes detected via the range of methods above would greatly improve measurements of the BHOF.
4.3 The possibility of multiple seeds
In this idealised study, we have examined in detail the effects of seed and growth channels in isolation. It is likely that black holes may grow form via multiple channels. Spinoso et al. (2022) used a semi-analytic model to populate halos from the Millennium-II with both heavy and light seeds, and evolve their growth to z=0 using a two-phase prescription, corresponding to the observed quasar and radio modes of AGN. They find that with such a multiple-channel-seeding model, the slope of the black hole-stellar mass relation is flatter for dwarf galaxies than their more massive counterparts. We see this in Figure 1 in the models that grow with a power law ERDF, respective of the seeding model, although the difference is starker for heavy seeds. Interestingly, this heavy-seed, power-law ERDF model is also the one that is most consistent with the AMUSE observations. This is a promising sign for future X-ray surveys, suggesting that we may find more dwarf AGN by simply pushing to slightly lower luminosities.
5 Conclusions
We have assessed the feasibility of distinguishing between models of MBH seeding and growth in the presence of observational limitations. Using a suite of semi-analytic models, we applied cuts on the black hole mass, black hole X-ray luminosity and host galaxy stellar mass to mimic observational selection functions. We find that:
-
•
If we can find every MBH above , light seeds predict a occupation fraction down to stellar masses of , whereas for heavy seeds, this occupation fraction falls to at . The total occupation fraction of MBHs at is and for the light and heavy seeds, respectively.
-
•
If galaxies grow via the AGN-MS channel, the seed models can be distinguished if we can find every MBH above and are stellar mass complete down to . With a power-law ERDF, the occupation fraction of MBHs is 20 at for the heavy seed model, vs. for a light seed. The two growth channels can then be told apart by looking only at black holes , which are much more abundant with the AGN-MS growth channel than the PL.
-
•
We use bolometric corrections from Shen et al. (2020) to determine X-ray luminosities for the simulated AGN. An X-ray survey of limiting luminosity in the keV band can distinguish clearly between heavy and light seed models. If this limit is , both seed models with either AGN-MS or PL growth channels are indistinguishable; at higher luminosity limits, most models produce at all stellar masses.
-
•
The BLQ prescription, which assumes purely merger-driven growth and reproduced the broad-line quasar luminosity function at high redshifts, predicts almost no luminous AGN at z=0, inconsistent with observations. We therefore conclude that it is not a viable mechanism for the growth of dwarf AGN.
-
•
The constraining power of a flux-limited survey can be enhanced by physical modeling of the underlying MBH population. We use mock catalogs to confirm that Bayesian pipelines such as Miller et al. (2015) are able to recover the true intrinsic occupation fraction, using a combination of AGN detections and upper limits. The inferred BHOF is consistent with heavy seeds growing via either the PL or AGN-MS channels. The two growth channels produce different luminosity functions at z = 0, however, with the PL model being more consistent with the observations of Ueda et al. (2011).
-
•
Using the fundamental plane of black hole activity, we predict that heavy seeds with produce more luminous dwarf AGN, but almost no sources in galaxies with . While for light seeds the two growth channels produce identical radio signals, for heavy seeds, the PL model has a larger scatter in the relation. This will be testable with upcoming radio surveys like ASKAP and MeerKAT.
-
•
We emphasise that future optical/infrared, variability-selected and radio surveys of AGN in dwarf galaxies must emphasise stellar mass completeness of the galaxy sample, since AGN non-detections are likely to be correlated with galaxy detectability ways that are non-trivial to model.
In summary, we show concretely that it is possible to robustly test models of both the seeding and growth of supermassive black holes using low-redshift surveys. The surveys need to be carefully designed, achieving high completeness at low stellar and black hole masses and luminosities. Multi-wavelength campaigns will help break degeneracies. In most cases, pushing current surveys just one order of magnitude deeper will help distinguish between radically different early seeding models.
Acknowledgements. We thank Paul Tiede for helpful discussions of the statistical technique, and Ned Taylor for discussion on optical surveys. U.C. and Á.B. acknowledge support from the Smithsonian Institution through NASA contract NAS8-03060. UC was also supported by NASA grant GO0-21070X. A.R. and P.N. gratefully acknowledge support from the Black Hole Initiative at Harvard University, which is funded by grants from the John Templeton Foundation and the Gordon and Betty Moore Foundation.
References
- Almeida et al. (2021) Almeida, I., Duarte, R., & Nemmen, R. 2021, arXiv e-prints, arXiv:2102.05809. https://arxiv.org/abs/2102.05809
- Baldassare et al. (2020) Baldassare, V. F., Dickey, C., Geha, M., & Reines, A. E. 2020, ApJ, 898, L3, doi: 10.3847/2041-8213/aba0c1
- Baldassare et al. (2018) Baldassare, V. F., Geha, M., & Greene, J. 2018, ApJ, 868, 152, doi: 10.3847/1538-4357/aae6cf
- Baldassare et al. (2015) Baldassare, V. F., Reines, A. E., Gallo, E., & Greene, J. E. 2015, ApJ, 809, L14, doi: 10.1088/2041-8205/809/1/L14
- Barger et al. (2005) Barger, A. J., Cowie, L. L., Mushotzky, R. F., et al. 2005, AJ, 129, 578, doi: 10.1086/426915
- Baumgardt (2017) Baumgardt, H. 2017, MNRAS, 464, 2174, doi: 10.1093/mnras/stw2488
- Becker et al. (1995) Becker, R. H., White, R. L., & Helfand, D. J. 1995, ApJ, 450, 559, doi: 10.1086/176166
- Begelman et al. (2006) Begelman, M. C., Volonteri, M., & Rees, M. J. 2006, MNRAS, 370, 289, doi: 10.1111/j.1365-2966.2006.10467.x
- Bellovary et al. (2010) Bellovary, J. M., Governato, F., Quinn, T. R., et al. 2010, ApJ, 721, L148, doi: 10.1088/2041-8205/721/2/L148
- Bellovary et al. (2021) Bellovary, J. M., Hayoune, S., Chafla, K., et al. 2021, MNRAS, 505, 5129, doi: 10.1093/mnras/stab1665
- Bogdán & Goulding (2015) Bogdán, Á., & Goulding, A. D. 2015, ApJ, 800, 124, doi: 10.1088/0004-637X/800/2/124
- Böttcher et al. (2017) Böttcher, M., van Soelen, B., Britto, R., et al. 2017, Galaxies, 5, 52, doi: 10.3390/galaxies5030052
- Bower et al. (2010) Bower, R. G., Vernon, I., Goldstein, M., et al. 2010, MNRAS, 407, 2017, doi: 10.1111/j.1365-2966.2010.16991.x
- Bromm & Loeb (2003) Bromm, V., & Loeb, A. 2003, ApJ, 596, 34, doi: 10.1086/377529
- Burke et al. (2022) Burke, C. J., Shen, Y., Liu, X., et al. 2022, arXiv e-prints, arXiv:2207.04092. https://arxiv.org/abs/2207.04092
- Cann et al. (2018) Cann, J. M., Satyapal, S., Abel, N. P., et al. 2018, ApJ, 861, 142, doi: 10.3847/1538-4357/aac64a
- Cid Fernandes et al. (2010) Cid Fernandes, R., Stasińska, G., Schlickmann, M. S., et al. 2010, MNRAS, 403, 1036, doi: 10.1111/j.1365-2966.2009.16185.x
- Davis et al. (2022) Davis, F., Kaviraj, S., Hardcastle, M. J., et al. 2022, MNRAS, 511, 4109, doi: 10.1093/mnras/stac068
- Duras et al. (2020) Duras, F., Bongiorno, A., Ricci, F., et al. 2020, A&A, 636, A73, doi: 10.1051/0004-6361/201936817
- Eckert et al. (2015) Eckert, K. D., Kannappan, S. J., Stark, D. V., et al. 2015, ApJ, 810, 166, doi: 10.1088/0004-637X/810/2/166
- Fontanot et al. (2015) Fontanot, F., Monaco, P., & Shankar, F. 2015, MNRAS, 453, 4112, doi: 10.1093/mnras/stv1930
- Freese et al. (2010) Freese, K., Ilie, C., Spolyar, D., Valluri, M., & Bodenheimer, P. 2010, ApJ, 716, 1397, doi: 10.1088/0004-637X/716/2/1397
- Gallo & Sesana (2019) Gallo, E., & Sesana, A. 2019, ApJ, 883, L18, doi: 10.3847/2041-8213/ab40c6
- Gallo et al. (2008) Gallo, E., Treu, T., Jacob, J., et al. 2008, ApJ, 680, 154, doi: 10.1086/588012
- Gallo et al. (2010) Gallo, E., Treu, T., Marshall, P. J., et al. 2010, ApJ, 714, 25, doi: 10.1088/0004-637X/714/1/25
- Giommi et al. (2009) Giommi, P., Colafrancesco, S., Padovani, P., et al. 2009, A&A, 508, 107, doi: 10.1051/0004-6361/20078905
- Gültekin et al. (2019) Gültekin, K., King, A. L., Cackett, E. M., et al. 2019, ApJ, 871, 80, doi: 10.3847/1538-4357/aaf6b9
- Gürkan et al. (2004) Gürkan, M. A., Freitag, M., & Rasio, F. A. 2004, ApJ, 604, 632, doi: 10.1086/381968
- Hirano et al. (2015) Hirano, S., Hosokawa, T., Yoshida, N., Omukai, K., & Yorke, H. W. 2015, MNRAS, 448, 568, doi: 10.1093/mnras/stv044
- Inayoshi et al. (2015) Inayoshi, K., Visbal, E., & Kashiyama, K. 2015, MNRAS, 453, 1692, doi: 10.1093/mnras/stv1654
- Kelly & Shen (2013) Kelly, B. C., & Shen, Y. 2013, ApJ, 764, 45, doi: 10.1088/0004-637X/764/1/45
- Kewley et al. (2013) Kewley, L. J., Dopita, M. A., Leitherer, C., et al. 2013, ApJ, 774, 100, doi: 10.1088/0004-637X/774/2/100
- Koliopanos et al. (2017) Koliopanos, F., Ciambur, B. C., Graham, A. W., et al. 2017, A&A, 601, A20, doi: 10.1051/0004-6361/201630061
- Kormendy & Ho (2013) Kormendy, J., & Ho, L. C. 2013, ARA&A, 51, 511, doi: 10.1146/annurev-astro-082708-101811
- Latimer et al. (2021a) Latimer, L. J., Reines, A. E., Bogdan, A., & Kraft, R. 2021a, ApJ, 922, L40, doi: 10.3847/2041-8213/ac3af6
- Latimer et al. (2021b) Latimer, L. J., Reines, A. E., Hainline, K. N., Greene, J. E., & Stern, D. 2021b, ApJ, 914, 133, doi: 10.3847/1538-4357/abfe0c
- Lehmer et al. (2010) Lehmer, B. D., Alexander, D. M., Bauer, F. E., et al. 2010, ApJ, 724, 559, doi: 10.1088/0004-637X/724/1/559
- Lehmer et al. (2019) Lehmer, B. D., Eufrasio, R. T., Tzanavaris, P., et al. 2019, ApJS, 243, 3, doi: 10.3847/1538-4365/ab22a8
- Liodakis (2022) Liodakis, I. 2022, MNRAS, 512, 291, doi: 10.1093/mnras/stac181
- Lodato & Natarajan (2006) Lodato, G., & Natarajan, P. 2006, MNRAS, 371, 1813, doi: 10.1111/j.1365-2966.2006.10801.x
- Lusso et al. (2022) Lusso, E., Valiante, R., & Vito, F. 2022, arXiv e-prints, arXiv:2205.15349. https://arxiv.org/abs/2205.15349
- Lusso et al. (2012) Lusso, E., Comastri, A., Simmons, B. D., et al. 2012, MNRAS, 425, 623, doi: 10.1111/j.1365-2966.2012.21513.x
- Marleau et al. (2017) Marleau, F. R., Clancy, D., Habas, R., & Bianconi, M. 2017, A&A, 602, A28, doi: 10.1051/0004-6361/201629832
- Mayer et al. (2010) Mayer, L., Kazantzidis, S., Escala, A., & Callegari, S. 2010, Nature, 466, 1082, doi: 10.1038/nature09294
- McConnell & Ma (2013) McConnell, N. J., & Ma, C.-P. 2013, ApJ, 764, 184, doi: 10.1088/0004-637X/764/2/184
- Merloni et al. (2003) Merloni, A., Heinz, S., & di Matteo, T. 2003, MNRAS, 345, 1057, doi: 10.1046/j.1365-2966.2003.07017.x
- Mezcua (2019) Mezcua, M. 2019, Nature Astronomy, 3, 6, doi: 10.1038/s41550-018-0662-2
- Mezcua & Domínguez Sánchez (2020) Mezcua, M., & Domínguez Sánchez, H. 2020, ApJ, 898, L30, doi: 10.3847/2041-8213/aba199
- Miller et al. (2012) Miller, B., Gallo, E., Treu, T., & Woo, J.-H. 2012, ApJ, 747, 57, doi: 10.1088/0004-637X/747/1/57
- Miller et al. (2015) Miller, B. P., Gallo, E., Greene, J. E., et al. 2015, ApJ, 799, 98, doi: 10.1088/0004-637X/799/1/98
- Miller et al. (2003) Miller, J. M., Fabbiano, G., Miller, M. C., & Fabian, A. C. 2003, ApJ, 585, L37, doi: 10.1086/368373
- Milosavljević et al. (2009) Milosavljević, M., Bromm, V., Couch, S. M., & Oh, S. P. 2009, ApJ, 698, 766, doi: 10.1088/0004-637X/698/1/766
- Moran et al. (2014) Moran, E. C., Shahinyan, K., Sugarman, H. R., Vélez, D. O., & Eracleous, M. 2014, AJ, 148, 136, doi: 10.1088/0004-6256/148/6/136
- Mullaney et al. (2012) Mullaney, J. R., Daddi, E., Béthermin, M., et al. 2012, ApJ, 753, L30, doi: 10.1088/2041-8205/753/2/L30
- Natarajan (2021) Natarajan, P. 2021, MNRAS, 501, 1413, doi: 10.1093/mnras/staa3724
- Nguyen et al. (2017) Nguyen, D. D., Seth, A. C., den Brok, M., et al. 2017, ApJ, 836, 237, doi: 10.3847/1538-4357/aa5cb4
- Nguyen et al. (2018) Nguyen, D. D., Seth, A. C., Neumayer, N., et al. 2018, ApJ, 858, 118, doi: 10.3847/1538-4357/aabe28
- Peterson et al. (2004) Peterson, B. M., Ferrarese, L., Gilbert, K. M., et al. 2004, ApJ, 613, 682, doi: 10.1086/423269
- Polimera et al. (2022) Polimera, M. S., Kannappan, S. J., Richardson, C. T., et al. 2022, ApJ, 931, 44, doi: 10.3847/1538-4357/ac6595
- Portegies Zwart et al. (2004) Portegies Zwart, S. F., Baumgardt, H., Hut, P., Makino, J., & McMillan, S. L. W. 2004, Nature, 428, 724, doi: 10.1038/nature02448
- Reines (2022) Reines, A. E. 2022, Nature Astronomy, 6, 26, doi: 10.1038/s41550-021-01556-0
- Reines et al. (2020) Reines, A. E., Condon, J. J., Darling, J., & Greene, J. E. 2020, ApJ, 888, 36, doi: 10.3847/1538-4357/ab4999
- Reines et al. (2013) Reines, A. E., Greene, J. E., & Geha, M. 2013, ApJ, 775, 116, doi: 10.1088/0004-637X/775/2/116
- Ricarte & Natarajan (2018a) Ricarte, A., & Natarajan, P. 2018a, MNRAS, 481, 3278, doi: 10.1093/mnras/sty2448
- Ricarte & Natarajan (2018b) —. 2018b, MNRAS, 474, 1995, doi: 10.1093/mnras/stx2851
- Ricarte et al. (2019) Ricarte, A., Pacucci, F., Cappelluti, N., & Natarajan, P. 2019, MNRAS, 489, 1006, doi: 10.1093/mnras/stz1891
- Richards et al. (2006) Richards, G. T., Lacy, M., Storrie-Lombardi, L. J., et al. 2006, ApJS, 166, 470, doi: 10.1086/506525
- Sargent et al. (2022) Sargent, A. J., Johnson, M. C., Reines, A. E., et al. 2022, ApJ, 933, 160, doi: 10.3847/1538-4357/ac74be
- Seth et al. (2014) Seth, A. C., van den Bosch, R., Mieske, S., et al. 2014, Nature, 513, 398, doi: 10.1038/nature13762
- Shen et al. (2020) Shen, X., Hopkins, P. F., Faucher-Giguère, C.-A., et al. 2020, MNRAS, 495, 3252, doi: 10.1093/mnras/staa1381
- Somerville et al. (2008) Somerville, R. S., Hopkins, P. F., Cox, T. J., Robertson, B. E., & Hernquist, L. 2008, MNRAS, 391, 481, doi: 10.1111/j.1365-2966.2008.13805.x
- Spinoso et al. (2022) Spinoso, D., Bonoli, S., Valiante, R., Schneider, R., & Izquierdo-Villalba, D. 2022, arXiv e-prints, arXiv:2203.13846. https://arxiv.org/abs/2203.13846
- Spolyar et al. (2008) Spolyar, D., Freese, K., & Gondolo, P. 2008, Phys. Rev. Lett., 100, 051101, doi: 10.1103/PhysRevLett.100.051101
- Stobbart et al. (2006) Stobbart, A. M., Roberts, T. P., & Wilms, J. 2006, MNRAS, 368, 397, doi: 10.1111/j.1365-2966.2006.10112.x
- Sutton et al. (2013) Sutton, A. D., Roberts, T. P., & Middleton, M. J. 2013, MNRAS, 435, 1758, doi: 10.1093/mnras/stt1419
- Sutton et al. (2012) Sutton, A. D., Roberts, T. P., Walton, D. J., Gladstone, J. C., & Scott, A. E. 2012, MNRAS, 423, 1154, doi: 10.1111/j.1365-2966.2012.20944.x
- Swartz et al. (2004) Swartz, D. A., Ghosh, K. K., Tennant, A. F., & Wu, K. 2004, ApJS, 154, 519, doi: 10.1086/422842
- Swartz et al. (2011) Swartz, D. A., Soria, R., Tennant, A. F., & Yukita, M. 2011, ApJ, 741, 49, doi: 10.1088/0004-637X/741/1/49
- Tanaka & Li (2014) Tanaka, T. L., & Li, M. 2014, MNRAS, 439, 1092, doi: 10.1093/mnras/stu042
- Thornton et al. (2008) Thornton, C. E., Barth, A. J., Ho, L. C., Rutledge, R. E., & Greene, J. E. 2008, ApJ, 686, 892, doi: 10.1086/591519
- Thygesen et al. (2023) Thygesen, E., Plotkin, R. M., Soria, R., et al. 2023, MNRAS, 519, 5848, doi: 10.1093/mnras/stad002
- Treister et al. (2013) Treister, E., Schawinski, K., Volonteri, M., & Natarajan, P. 2013, ApJ, 778, 130, doi: 10.1088/0004-637X/778/2/130
- Tucci & Volonteri (2017) Tucci, M., & Volonteri, M. 2017, A&A, 600, A64, doi: 10.1051/0004-6361/201628419
- Ueda et al. (2011) Ueda, Y., Hiroi, K., Isobe, N., et al. 2011, PASJ, 63, S937, doi: 10.1093/pasj/63.sp3.S937
- Valiante et al. (2016) Valiante, R., Schneider, R., Volonteri, M., & Omukai, K. 2016, MNRAS, 457, 3356, doi: 10.1093/mnras/stw225
- Vasudevan & Fabian (2007) Vasudevan, R. V., & Fabian, A. C. 2007, MNRAS, 381, 1235, doi: 10.1111/j.1365-2966.2007.12328.x
- Volonteri et al. (2021) Volonteri, M., Habouzit, M., & Colpi, M. 2021, Nature Reviews Physics, 3, 732, doi: 10.1038/s42254-021-00364-9
- Volonteri et al. (2008) Volonteri, M., Lodato, G., & Natarajan, P. 2008, MNRAS, 383, 1079, doi: 10.1111/j.1365-2966.2007.12589.x
- Walsh et al. (2013) Walsh, J. L., Barth, A. J., Ho, L. C., & Sarzi, M. 2013, ApJ, 770, 86, doi: 10.1088/0004-637X/770/2/86
- Ward et al. (2022) Ward, C., Gezari, S., Nugent, P., et al. 2022, ApJ, 936, 104, doi: 10.3847/1538-4357/ac8666
- Woo & Urry (2002) Woo, J.-H., & Urry, C. M. 2002, ApJ, 579, 530, doi: 10.1086/342878
- Yang et al. (2022) Yang, X., Mohan, P., Yang, J., et al. 2022, arXiv e-prints, arXiv:2211.00050. https://arxiv.org/abs/2211.00050
Appendix A Appendix
A.1 Bolometric corrections
The most uncertain part of our model is the connection between the MBH mass and its X-ray luminosity. Observationally, we know that the MBH population is diverse - there are Type I (Richards et al., 2006) and Type II AGN, Seyfert galaxies, FR I and FR II jets, blazars (Giommi et al., 2009; Böttcher et al., 2017), broad-line quasars (BLQs), narrow-line quasars (NLQs), low-luminosity AGN (Almeida et al., 2021) and a variety of radio sources. A large part of this diversity has to do with dust obscuration, which affects X-ray wavelengths far less than optical/infrared (IR). However, there is expected to be intrinsic variability in the spectral energy distributions (SED) at different black hole accretion rates, depending on what process is driving the luminosity on (black hole event) horizon scales.
We adopted the bolometric correction to the keV band from Shen et al. (2020), which is a function of the bolometric luminosity; they found a similar scaling for the keV band. Their measurement agrees very well with Duras et al. (2020); for the earlier Lusso et al. (2012) the corrections agree at , but diverge significantly for fainter sources. Using Lusso et al. (2012) would thus yield much more pessimistic predictions for the X-ray surveys; we note, however, that these scaling relations were computed for AGN with erg/s, far above the dwarf AGN regime. Vasudevan & Fabian (2007) found that the bolometric correction to the keV band is better correlated to the Eddington ratio. Woo & Urry (2002) went so far as to argue that there was no correlation between black hole luminosity and mass other than due to circular reasoning in inferring the former from the latter. A relationship between the two is crucial for inferring black hole demographics unless we rely on dynamical measurements for the black hole mass, which is only possible for the nearest galaxies, or reverberation mapping, which is possible only for Type I AGN with an unobscured broad-line region.
A.2 Cosmic downsizing
We do not know whether the Eddington distribution - the ratio of a black holes luminosity to the Eddington luminosity - is the same as for more massive galaxies, or if there is a “downsizing” effect where less massive galaxies host, on average, more active black holes. There is then a degeneracy between the scaling relation and the occupation fraction. The most robust constraints must jointly model these two effects, using an unbiased sample of host galaxies (Gallo et al., 2010).