The Hubble Space Telescope Survey of M31 Satellite Galaxies. III.
Calibrating the Horizontal Branch as an Age Indicator for Nearby Galaxies
Abstract
We present a new method for measuring the mean age of old/intermediate stellar populations in resolved, metal-poor () galaxies using only the morphology of the horizontal branch (HB) and an estimate of the average metallicity. We calculate the ratio of blue-to-red HB stars and the mass-weighted mean ages of 27 M31 satellite galaxies that have star formation histories (SFHs) measured from Hubble Space Telescope-based color-magnitude diagrams (CMDs) that include the oldest Main Sequence Turn-off (MSTO) ages. We find a strong correlation between mean age, metallicity, and HB morphology, for stellar populations older than Gyr. The correlation allows us to predict a galaxy’s mean age from its HB morphology to a precision of Gyr. We validate our method by recovering the correct ages of Local Group galaxies that have robust MSTO-based ages and are not in our calibration sample. We also use our technique to measure the mean ages of isolated field galaxies KKR25 ( Gyr) and VV124 ( Gyr), which indicate that their main star formation episodes may have lasted several Gyr and support the picture that they achieved their early-type characteristics (e.g., low gas content, low star formation activity) in isolation and not through environment. Because the HB is brighter than the oldest MSTO, our method can provide precise characteristic ages of predominantly old galaxies at distances times farther. We provide our calibrations in commonly used HST/ACS filters.
1 Introduction
The color-magnitude diagrams (CMDs) of nearby galaxies provide detailed insight into their formation histories. Several regions of a galaxy’s CMD, such as the main sequence turnoff (MSTO), the sub-giant branch (SGB), and the red giant branch (RGB) are sensitive to age and/or metallicity, and provide the means to reconstruct detailed, ‘non-parametric’ star formation and enrichment histories of galaxies over cosmic time (e.g., Gallart et al., 2005; Tolstoy et al., 2009; Cignoni & Tosi, 2010). The exquisite angular resolution and sensitivity of the Hubble Space Telescope (HST) has enabled the construction of CMDs and measurement of SFHs for hundreds of galaxies in and around the Local Group (LG), providing rich insight into galaxy evolution throughout the local Universe (e.g., Dalcanton et al., 2009; McQuinn et al., 2010; Brown et al., 2014; Weisz et al., 2014a; Gallart et al., 2015; Skillman et al., 2017).
However, the accuracy of CMD-based star formation histories (SFHs) depends on the depth of the CMD. The most robust SFHs are derived from CMDs that reach the oldest MSTO (oMSTO). This faint feature () is sensitive to age and metallicity, while the relative simplicity of stellar physics for this phase of evolution translates to small systematic uncertainties on MSTO-based SFHs. SFHs derived from CMDs that reach the oMSTO are considered to be accurate over all cosmic time (e.g., Gallart et al., 2005).
On the other hand, faintness and crowding effectively limit observations of the oMSTO to galaxies located within the LG, even with the resolving power of HST. Expanding galaxy ages and SFHs beyond the LG has required modeling more luminous, evolved phases of stellar evolution (e.g., red giants, red clump; Rejkuba et al., 2005; Weisz et al., 2011), all of which are less sensitive to age than the MSTO and subject to much less certain physics. The result is that SFHs from shallower CMDs only provide coarse age resolution over cosmic time and systematics in the stellar libraries themselves can be the dominant source of uncertainty.
The horizontal branch (HB) offers a promising compromise between SFH fidelity and observational access. In the optical, it is 3-4 magnitudes brighter than the oMSTO, while its sensitivity to age and metallicity makes it a suitable tracer of the old SFH (e.g., Rejkuba et al., 2011; Savino et al., 2018, 2020). Due to these characteristics, the HB has long been used as a qualitative SFH indicator in nearby galaxies (e.g., Da Costa et al., 1996; Harbeck et al., 2001; Grebel & Gallagher, 2004; Martin et al., 2017). However, a more quantitative calibration of the HB morphology as an age indicator has so far been hindered by the complex physics of this evolutionary phase. While this issue was originally referred to as the “second parameter problem” (e.g., van den Bergh, 1967; Sandage & Wildey, 1967), decades of studies of Galactic globular clusters have revealed that the HB morphology is likely influenced by a large number of astrophysical parameters, the most relevant of which are stellar metallicity, age, helium abundance, and the amount of mass lost during RGB evolution (e.g., Catelan et al., 2009; Dotter et al., 2010; Gratton et al., 2010; Milone et al., 2014; Tailo et al., 2020). This complexity has long hampered the use of the HB as a reliable age tracer.
However, two major developments in our understanding of stellar populations have opened a promising path to calibrate the age dependence of the HB. First, we now know that globular clusters, once thought to be the archetype of simple stellar population, have very complex chemical abundance patterns (commonly reffered to as the “multiple population” phenomenon, e.g., Bastian & Lardo, 2018, and references therein). These include large helium spreads, which have a central role in the diversity of clusters’ HB morphologies (e.g., Piotto et al., 2007; Dalessandro et al., 2011; Milone et al., 2018; Tailo et al., 2020). While a small fraction of these chemically peculiar stars have been found in the Milky Way (MW) field (e.g., Martell et al., 2011; Schiavon et al., 2017), the multiple population phenomenon appears to be predominantly a feature of globular cluster formation. To date, there is no indication that similar abundance anomalies are present in the field population of nearby dwarf galaxies (e.g., Geisler et al., 2007; Salaris et al., 2013; Fabrizio et al., 2015). This makes interpreting the HB of nearby dwarfs potentially simpler compared to Galactic globular clusters.
Second, we have made great strides in measuring the RGB mass loss with increasing accuracy (e.g., Gratton et al., 2010; Salaris et al., 2013; Savino et al., 2019; Tailo et al., 2020). The growing evidence is that, for old stars, RGB mass loss is a predictable function of other stellar population parameters, primarily metallicity. Furthermore, once the effect of helium abundance is removed, there is remarkable agreement between the amount of mass loss inferred from nearby dwarfs (Savino et al., 2019) and Galactic globular clusters (Tailo et al., 2020), further supporting the argument that RGB evolution does not have a strong environmental dependence.
This means that it should be possible to use the density distribution across the HB in nearby dwarfs as a tracer of age. Specifically, as the age of the stellar population increases, the HB will progressively be occupied by hotter stars. Metallicity also affects the temperatures of HB stars, with higher metallicity producing cooler HB stars. Therefore, after factoring in the effect of metallicity, the ratio of blue HB (BHB) to red HB (RHB) stars is expected to be a good proxy for the characteristic stellar population age. The combination of age sensitivity and a luminosity that is times brighter than the MSTO means that the HB has the potential to provide robust ages for galaxies out to several Mpc. This is particularly powerful in light of the thousands of faint galaxies that next generation surveys are expected to uncover in the coming decade (e.g., Simon, 2019; Mutlu-Pakdil et al., 2021; Qu et al., 2023).
Here, we present a step forward toward empirically using the HB morphology to estimate the characteristic ages of distant galaxies. Specifically, we use the MSTO-based SFHs of 27 M31 satellite galaxies from the HST survey of M31 satellites to estimate the correlation of the ratio of BHB to RHB stars with mean galaxy stellar age. The M31 satellite sample provides a large, diverse, and uniformly analyzed set of SFHs and high-precision photometry that can be used to calibrate the HB as an age indicator for a variety of galaxy types. In comparison, the MW satellites have excellent uniform photometry (e.g., Muñoz et al., 2018a, b), but the SFH methodologies are inhomogeneous across the MW satellite populations (e.g. Lee et al., 2009; de Boer et al., 2014; Rusakov et al., 2021), which is not suitable for this type of analysis. Similarly, as discussed above, the MW globular clusters have excellent photometry and well-determined ages (e.g., Sarajedini et al., 2007; Dotter et al., 2010; VandenBerg et al., 2013), but the impact of multiple populations on their HB morphologies makes them not well-suited for direct ties to entire galaxies. As a result, our M31 sample is among the best available for this type of work.
From our M31 data, we show that we can robustly predict the mean age of old stars in a galaxy not in our calibration sample, using only HB morphology and an estimate of the galaxy’s average metallicity. The HB provides access exclusively to star formation older than Gyr, as more massive (and younger) stars do not evolve through the HB phase. The mean age of such ancient stars is useful for understanding galaxy formation mechanisms in the early Universe (e.g., quenching, connection to reionization, the ages of stellar halos in more massive systems) and for deciding which objects may warrant deeper (e.g., ancient MSTO depth) follow-up observations. There are obvious limitations to our approach such as the insensitivity to star formation younger than Gyr and the low time resolution relative to full SFHs from the oMSTO. Nevertheless, virtually all galaxies known in the local Universe are predominantly ancient (e.g., Weisz et al. 2011; though see Cole et al. 2007, 2014 for a few exceptions) and even HB-depth CMDs have proven observationally challenging/prohibitive in the era of HST outside Mpc. Thus, in the broader context, a robust HB age provides an important first step in a more detailed understanding of resolved galaxies out to larger volumes.
This paper is organized as follows. In §2 we summarize the data (i.e., photometry, SFHs) we use to calibrate the relationship between a galaxy’s age, metallicity, and its HB morphology. In §3, we present the technical details for how we relate the HB morphology to the galaxy’s age and metallicity. In §4 we validate our results on datasets not used in our calibration and apply our findings to estimate the ages for a handful of galaxies outside the LG.
2 Data
2.1 Photometry and sample selection
We use HST/ACS F475W, F606W, and F814W photometric catalogs uniformly produced by the HST M31 Treasury Survey of M31 Satellites (GO-15902; PI: Weisz). The photometry used in this paper comes from new observations, taken as part of GO-15902, as well as additional archival data of the M31 system (GO-13028/13739, PI: Skillman; GO-13699, PI: Martin). The photometric reduction process with DOLPHOT (Dolphin, 2000) and subsequent catalog construction is summarized in Savino et al. (2022, 2023) and will be fully detailed in Weisz et al. (in prep). Here, we focus on highly complete and high signal-to-noise ratio (SNR) stars around the HB, which are robustly measured regardless of the specific choices in the reduction set-up. We use this M31 sample, as opposed to the Milky Way (MW), because of its uniformity of data quality, areal coverage, and SFH and relatively small degree of MW foreground contamination relative to the angularly larger MW satellites. Similarly, the chemical abundance peculiarities in MW globular clusters appear to influence the morphology of the HB in ways that are different from galaxy field populations, limiting the utility of globular clusters for our purposes (Gratton et al., 2011; Dalessandro et al., 2011; Bastian & Lardo, 2018, and references therein).
From the survey’s full sample of 36 dwarf galaxies (Savino et al., 2022), we exclude M32, NGC147, NGC185, NGC205, and And XIX as their large angular extent, off-center position of the HST pointings, and our incomplete knowledge of population gradients in these galaxies (e.g., Rose et al., 2005; Ho et al., 2015; Taibi et al., 2022) make it challenging to estimate representative metallicities that are matched to our fields (cf. § 2.3). Moreover, due to crowding, M32 and NGC205 do not have sufficiently deep photometry to ensure reliable stellar population ages (i.e., their CMDs do not reach the oMSTO).
We also exclude IC1613, the Pisces dwarf (LGS3) and the Pegasus dwarf irregular, as these galaxies have had recent star formation, and contamination from bright main sequence (MS) stars makes a clean HB selection more challenging for our calibration purposes. In fact, we use the Pisces and Pegasus dwarf galaxies in § 4.2, to validate how our methodology performs in the presence of main-sequence contamination of the HB. We also exclude And IX, which suffers from significant contamination from M31 itself, making it challenging to isolate genuine member HB stars.
Figures 1 and 2 show the CMDs of our final sample, focused on the HB region. Overall we have 21 galaxies with F606W/F814W data and 6 galaxies with F475W/F814W data. The photometry has been transformed to the absolute magnitude space using distances from Savino et al. (2022) and foreground extinction from Green et al. (2019). All the analysis in this paper will be carried out on absolute magnitudes. The typical SNR at the HB is between 60 and 150, and the completeness, as estimated from artificial star tests, is %.

2.2 Stellar Ages
The stellar population ages used in this work are based on a set of homogeneous SFHs measured as part of the M31 Treasury survey (Savino et al., in prep; HST-GO-15902, PI D. Weisz). The general process of measuring these SFHs, along with the SFHs of the ultra-faint M31 satellites, are detailed in Savino et al. (2023).
In short, we derived the SFHs with the widely-used MATCH software package (Dolphin, 2002, 2012, 2013). MATCH models the density distribution of stars in the CMD, down to the oMSTO in the case of all our data. We used the same distance and extinction values used in this paper (Green et al., 2019; Savino et al., 2022). The CMDs were modeled using the BaSTI stellar model library (Hidalgo et al., 2018), covering an age range , with bins of size dex, and a metallicity range of , with bins of size dex. The HB was explicitly excluded from the CMD fit and the age information we derive is primarily from the MSTO. The uncertainties on the SFH are then calculated using the methodology described in Dolphin (2012, 2013). Random uncertainties are based on Hamiltonian Monte Carlo sampling of the solution parameter space, while systematic uncertainties are estimated by introducing systematic perturbations in the shapes of the stellar models.
For each galaxy, we use the MATCH-based SFH to derive the mass-weighted age of the stellar population, , defined as:
(1) |
where is the median age in the star formation bin and is the total stellar mass formed in that bin. As the HB is only composed of low-mass stars, we only average over the star formation bins older than 6 Gyr, which corresponds to stellar masses on the HB of roughly . As the precise age upper limit probed by our analysis is somewhat uncertain, we experimented with other age cuts and found that 6 Gyr provides the strongest correlation with HB morphology. The values of obtained in this way are listed in Tab. 1. We calculate random and systematic uncertainties on by sampling from the respective SFH uncertainties.
As our analysis is based on ages from a homogeneous dataset and methodology, we only use the random uncertainties in our models. This is sufficient to achieve a robust relative age calibration of the HB morphology and explore galaxy-to-galaxy differences. However, the effect of systematic uncertainties needs to be taken into account when placing our HB-based ages in a broader context. Systematic uncertainties on the value of , inferred from the MSTO, are listed in Tab. 1. While the precise value of this systematic term changes from galaxy to galaxy, the HB-based age scale we present in this paper is derived from the full sample of galaxies. For this reason, we adopt the median value of the systematic uncertainty distribution, which is 0.86 Gyr, as our systematic uncertainty on the HB-based ages.
2.3 Metallicities
At present, only a modest subset of M31 satellites, typically the brightest systems, have published spectroscopic metallicities of a large number of resolved RGB stars (Vargas et al., 2014; Ho et al., 2015; Kirby et al., 2020). We thus cannot rely on direct metallicity information for all galaxies in our sample. Instead, in order to maintain uniformity across the sample, we adopt a mean metallicity value for the stellar populations in our fields using the well-established luminosity-metallicity (LZ) scaling relation from Kirby et al. (2013b):
(2) |
which has been obtained from spectroscopic studies of Milky Way dwarf spheroidal galaxies and that has been shown to be adequate for M31 satellites as well (Ho et al., 2015; Kirby et al., 2020). To calculate the metallicities we use stellar luminosities from Savino et al. (2022). The uncertainty on is calculated by propagating the uncertainties on the coefficients of eq. 2 and those on the luminosities, and adding the resulting term in quadrature with the LZ scatter of 0.17 dex reported in Kirby et al. (2013b). The values of obtained in this way are listed in Tab. 1. In Appendix A, we explore the effect of adopting LZ-based metallicities, instead of spectroscopic values, and find differences compatible with the uncertainties in our model.
The value of obtained through Eq. 2 is a mean metallicity of the galaxy stellar population (as measured from RGB stars, in the original study of Kirby et al., 2013b); it is not guaranteed to be representative of the HB population (e.g., Nagarajan et al., 2022; Savino et al., 2022), because HB and RGB stars do not come from identical stellar sub-populations and because of evolutionary timescale differences. However, direct metallicities of HB stars are impossible to obtain in any extragalactic system except the closest ones (e.g. Clementini et al., 2005). The next best approach is to rely on RGB-based tracers in both our calibration sample and in future applications to distant galaxies. This potential source of uncertainty is treated as a nuisance parameter in our models, as further discussed in § 3.2.


3 Methodology
3.1 Measuring Horizontal Branch Morphology
The morphology of the HB can be parametrized in a number of ways (e.g., Mackey & van den Bergh, 2005; Dotter et al., 2010; Milone et al., 2014). Here we adopt a commonly used (e.g, Da Costa et al., 1996, 2000, 2002; Martin et al., 2017) population ratio , defined as:
(3) |
where and are the number of blue and red HB stars, respectively. The selection of BHB and RHB stars is based on the Martin et al. (2017) definition, which makes use of CMD boxes. However, we have refined our box locations to accommodate our improved photometric precision, the use of multiple photometric bands, and to explicltly address contamination from the RGB and, if present, the MS. We now provide a detailed description of our HB member selection.
3.1.1 F606W/F814W Photometry
The majority of our sample galaxies were observed with the F606W and F814W filter combination (Fig. 1). For these galaxies, we select BHB stars in the color range . This extends the BHB further to the blue from the definition used in Martin et al. (2017). The extended BHB makes decontamination from MS stars (see § 4.2), if they are present, more robust. Additionally, a wide color range reduces the impact of photometric uncertainties from scattering stars outside the BHB bounds. We expect these observational uncertainties to be insignificant for galaxies in our sample, but they may become significant for more distant galaxies.
The RHB star sample spans the color range . This expands the original Martin et al. (2017) definition to include the entirety of the RGB over this magnitude range. Given the variety of HB morphologies and occasional blending between the HB and the RGB, we find that this approach allows for a uniform treatment of contaminants of the RHB population, as opposed to guessing at an appropriate red edge of the selection box. We describe this choice, and the RGB decontamination process, in § 3.1.3. The vertical height of the BHB and RHB boxes is set at 0.8 mag: for the BHB and for the RHB. These values have been determined on our sample to roughly be centered around the HB, and are large enough to mitigate uncertainties in the distance while sufficiently small to mitigate the effect of spurious CMD contaminants. As our selection is primarily based on color, the precise magnitude range is less critical. The corresponding selection regions are shown as blue and red boxes in Fig. 1.
\topruleName | from HB | from MSTO | ||||
dex | Gyr | Gyr | ||||
\topruleAnd I | -1.56±0.18 | ) | ||||
And II | -1.52±0.18 | ) | ||||
And III | -1.78±0.18 | ) | ||||
And V | -1.80±0.18 | ) | ||||
And VI | -1.53±0.18 | ) | ||||
And VII | -1.34±0.19 | ) | ||||
And X | -2.03±0.19 | ) | ||||
And XI | -2.14±0.20 | ) | ||||
And XII | -2.11±0.20 | ) | ||||
And XIII | -2.09±0.19 | ) | ||||
And XIV | -1.88±0.19 | ) | ||||
And XV | -1.91±0.19 | ) | ||||
And XVI | -2.01±0.19 | ) | ||||
And XVII | -1.98±0.19 | ) | ||||
And XX | -2.14±0.20 | ) | ||||
And XXI | -1.85±0.18 | ) | ||||
And XXII | -2.14±0.20 | ) | ||||
And XXIII | -1.74±0.18 | ) | ||||
And XXIV | -2.00±0.19 | ) | ||||
And XXV | -1.82±0.18 | ) | ||||
And XXVI | -2.18±0.21 | ) | ||||
And XXVIII | -1.86±0.22 | ) | ||||
And XXIX | -1.93±0.19 | ) | ||||
And XXX | -1.99±0.19 | ) | ||||
And XXXI | -1.53±0.20 | ) | ||||
And XXXII | -1.45±0.20 | ) | ||||
And XXXIII | -1.71±0.20 | ) | ||||
\toprule |
3.1.2 F475W/F814W Photometry
Six galaxies in our sample have data from the ISLandS program (Skillman et al., 2017), which used the F475W and F814W filters (Fig. 2). For these galaxies, we need to ensure consistency with the sample selection that we defined for the F606W/F814W photometry, so that we can derive a homogeneous age scale. We therefore use filter transformations to map the color ranges defined in § 3.1.1 to colors.
We derive preliminary filter transformations using theoretical zero-age HB loci from the BaSTI stellar library (Hidalgo et al., 2018), which allow us to examine a large range of metallicities. Using HB tracks with , which is representative of the M31 satellite metallicity range (Collins et al., 2013; Ho et al., 2015; Kirby et al., 2020), we derive the following second-order transformation:
(4) |
We validate this transformation using HST data for the dwarf galaxy Eridanus II, which is one of the few LG galaxies to have spatially overlapping F475W/F814W (Gallart et al., 2021) and F606W/F814W (Simon et al., 2021) imaging. We use a photometric catalog derived with the same methodology of our primary sample (Savino et al., 2023) containing 497 stars, and compare the measured (F475W-F814W) color for HB and RGB stars, with the predicted values using eq. 4. We find that the theoretical filter transformations are accurate, with the exception of a 0.056 mag zero-point offset. No other significant trend was found in the residuals, so we apply this term to our filter transformations, resulting in:
(5) |
Through eq. 5, the BHB color selection becomes , while the RHB selection becomes . The vertical position of the selection boxes has been set at for the BHB stars, and for the RHB stars. Those values have been derived to roughly center the HB in our selection boxes. The corresponding selection regions are shown as blue and red boxes in Fig. 2.
Figure 3 shows F606W/F814W and F475W/F814W CMDs of Eridanus II zoomed in on the HB regions. Overplotted are the HB selection regions in the two filter pairs. Our BHB selection captures virtually the same stars in both CMDs. However, not all the RHB candidates selected in F606W/F814W are contained in the F475W/F813W RHB box, leading to a slight difference in measured . This discrepancy is partially due to the presence of RR Lyrae stars (Martínez-Vázquez et al., 2021) in the bluest region of the RHB selection box, which experience time-dependent changes in magnitude and color. For datasets consisting of a high number of epochs, the light-curve is sufficiently sampled that the recovered magnitude and color will converge towards average values, mitigating this effect. However, for more sparsely sampled photometry, the position of RR Lyrae on the CMD can vary between datasets ovserved at different epochs.
The amount of scatter introduced by the discrete light-curve sampling depends on the number of photometric epochs, on the observing cadence, and on the photometric band, with blue bands being more affected due to the higher pulsation amplitude. For Eridanus II this is particularly a problem, as the F475W photometry only consists of 8 epochs. Using mock light curves, we estimate that such sparse sampling introduces a scatter in the F475W-F814W color of roughly 0.15 mag. This is one of the worst cases for RR Lyrae bias, but it is our only option for illustrative purposes due to a paucity of overlapping data in these two filter combinations. Reaching the HB in most galaxies, which are more distant than Eridanus II, requires deeper photometry and more epochs, thus making the RR Lyrae issue much less severe. For comparison, the typical effect on our M31 galaxies is of the order of 0.05 mag, for both the F606W and F475W datasets. Despite the RR Lyrae issue in Eridanus II, we note that the measured values of are consistent within uncertainty and that the resulting age differences (cf.§ 4.1) are subdominant compared to the other uncertainty sources in our analysis as discussed below.
3.1.3 Contamination from the RGB
The RHB selection outlined in Martin et al. (2017) partially overlaps with the RGB, introducing some degree of contamination into their RHB boxes. This is unavoidable given the modest photometric precision of their dataset. In many cases, our deeper photometry allows in many cases to define a much cleaner selection of RHB stars, avoiding contamination from the adjacent RGB (e.g., Fig. 1). However, instead of simply using more refined boxes, we opt to model RGB contamination for two reasons.
First, some galaxies (e.g., And II, And VII, And XXXII) do not have a clear distinction between the reddest HB stars and the RGB, even with our high precision data. This is likely due to the higher metallicity (which shifts the HB to redder colors) and larger metallicity dispersion (which increases the color spread of both the HB and the RGB) of these galaxies.
Second, observations of galaxies outside the Local Group, for which our calibration is of particular interest, will likely not achieve sufficiently deep photometry to achieve a clean separation of the RHB and RGB. By adopting a statistical treatment of contamination from RGB stars, our method can be applied to virtually any CMD depth and HB morphology.
Figure 4 illustrates our decontamination procedure using And XXX, a galaxy that has a clear distinction between the RHB and RGB. This allows us to validate our procedure. First, we select candidate RHB stars in a box that contains both the RHB and the adjacent RGB stars. Second, we use stellar counts above and below the selection box to model the luminosity function of the RGB. The luminosity function is fit as a power law to the measured density of RGB stars in vertical bins. Bins were defined to contain a roughly constant number of stars. The color extent of the RGB bins are and . The bins extend from magnitude above the RHB to magnitude below the RHB. Bins below the RHB have a 0.2 mag height, while bins above the RHB have a 0.3 mag height since the RGB density decreases at brighter magnitudes.
We then integrate the model luminosity function to estimate the number of RGB stars present in the RHB selection box, . This value is subtracted from the total number of stars in the red box, , to get . We use this decontaminated in Eq. 3 to calculate . For galaxies with MS contamination of the BHB (see § 4.2), we use the exact same method to find , where is the total number of stars in the BHB selection box and is the estimated number of MS contaminants.
The uncertainty in is calculated using Monte-Carlo realizations of , , , and , which are assumed to follow a Poisson distribution, and calculating the resultant distribution of through eq. 3. The and percentiles of this distribution are taken as our uncertainty in . It should be noted that, in most cases, the distribution of is well approximated by a Gaussian. Only for values of close to 0 or 1 significant deviation from Gaussianity is appreciable.

3.2 The HB morphology as an age indicator

The left panel of Fig. 5 shows the mass-weighted age, , of the old ( Gyr) stellar population in our galaxy sample, as a function of and value. Consistent with expectations from stellar models and previous observational work (e.g., Sarajedini et al., 1995; Dotter et al., 2010; Salaris et al., 2013), we observe a general trend that galaxies with a red HB morphology (low values) tend to have younger stellar populations, while galaxies with blue HB morphologies (high values) are on average older. There is also indication that, at fixed , galaxies of higher metallicity might be comparatively younger although the trend is less defined.
A clear takeaway from Fig. 5 is that galaxies with particularly blue HB morphologies (), all have predominantly old and metal-poor stellar populations, having Gyr and . This is in accordance with expectations from stellar evolution theory (e.g., Cassisi et al., 2013), as hot temperatures on the HB can only be reached by low-mass metal-poor models. As the HB morphology becomes redder, the scatter of the relation increases. This is the manifestation of the dependence of the HB temperature on age and metallicity (both mean and dispersion). Similar HB morphologies, in fact, can be obtained from different combinations of age and chemical composition.
Given this degeneracy, we model the relationship between and age (Fig. 5) using a bivariate function:
(6) |
where , and are assumed to follow a linear relation with an intrinsic scatter of variance . We fit this model using the affine invariant ensemble Markov chain Monte Carlo (MCMC) sampler emcee (Foreman-Mackey et al., 2013). The free parameters of this model are , , , as well as . We adopt uniform priors listed in Tab. 2.
We sample the posterior distribution using 32 walkers, setting the burn-in to first 100 steps, and defining the convergence length of the MCMC chain as 50 times the autocorrelation length. As both and have asymmetric uncertainties, we calculate our model likelihood under the assumption that these quantities follow a split-normal distribution. Figure 6 shows the corner plot of our MCMC runs. Our posterior distributions are well-constrained and unimodal. Adopting the 50th percentiles as our fiducial parameters, and the 16th and 84th percentiles as our confidence interval, the fiducial relation is:
(7) |
with intrinsic scatter , which corresponds to an rms of in . For a population of 11 Gyr, this corresponds to a Gyr scatter. Our fiducial model is shown as a black line in the right panel of Fig. 5. Using eq. 7, the value of can be determined from the value of and , with uncertainty:
(8) |
where is the covariance matrix
(9) |
which is derived from our MCMC chains, and is the Jacobian matrix of Eq. 7.
We comment on a few aspects of Eq. 7. The first regards the equation coefficients. The coefficient of the term is positive, meaning that bluer HB morphologies correspond to older stellar population ages. This is evident from Fig. 5 and is expected from stellar evolution. The coefficient of the metallicity term, on the other hand, is negative, meaning that, at higher metallicities, a given HB morphology traces a younger stellar population age. This seems at odds with stellar evolution expectations, as higher metallicities should require smaller masses (i.e., older ages) to achieve a given temperature on the HB. The interpretation of the term is more nuanced for a few reasons.
First, the metallicity obtained through Eq. 2 represents the stellar metallicity averaged across all stellar populations capable of producing an RGB. This is not guaranteed to be representative of the metallicity on the HB (which, given our selection, is produced only by stars older than Gyr), especially in galaxies with extended SFHs. In fact, the value of , as obtained from Eq. 2, is essentially a proxy for stellar luminosity, rather than a direct probe of the metallicity of the old stars.
\topruleName | Description | Prior | Posterior |
---|---|---|---|
\toprulea | slope | ||
b | slope | ||
c | zero point | ||
ln(V) | intrinsic scatter | ||
\toprule |
Second, and more importantly, ages and metallicities of stars in a galaxy are not truly independent variables. On the contrary, well defined age-metallicity relations (AMRs) are known to exist in nearby galaxies, as a result of chemical self-enrichment (e.g., de Boer et al., 2012a, b). These correlations exist within individual galaxies but also across the galaxy sample, with higher metallicity (i.e., higher luminosity) systems having on average more extended star formation histories and, therefore, younger mean ages. This is observed, for instance, in satellites of the MW (e.g., Weisz et al., 2015) and holds true for our M31 sample (Savino et al., in prep.). The negative coefficient of eq. 7 is likely capturing this correlation between galaxy mean age and metallicity. At fixed metallicity, the morphology of the HB then allows to quantify differences in mean stellar population age.
As this correlation between mean age and metallicity is encoded in our model, it follows that our calibration works best on galaxies that have broadly compatible AMRs to those of the M31 satellites. This could be a possible source of additional scatter when comparing galaxies across very different hosts. However, from eq. 7, this effect only becomes larger than the intrinsic scatter term if the galaxy AMR is offset by more than approximately 0.3 dex from those of our calibration sample.



Another interesting result in Eq. 7 regards the size of the intrinsic scatter term. The presence of a non-zero intrinsic scatter in our relation is not surprising, and this is likely arising from several sources of uncertainties in our methodology. These include the previously discussed discrepancy that could exist between and the HB metallicity. Another source of scatter may be that all the variables of our model (, , and ) describe global properties of the stellar population (being integrals of the age, HB color, and metallicity distributions, respectively). While stellar evolution suggests that a strong correlation should exist between age, metallicity, and HB color on a star-by-star basis, the correlation between global stellar population measurements might be somewhat reduced. For instance, two galaxies may have distinct stellar age distributions that result in similar values of , while their HB morphologies might not necessarily lead to comparable values of . Finally, we are assuming that the other variables affecting the HB (e.g., helium abundance or RGB mass loss) do not vary significantly across the galaxy sample, whereas some variation might be present.
Regardless, all these effects only introduce modest scatter in our fit. As shown in Fig. 5, the inclusion of a 0.022 dex term in our model (0.5 to 0.7 Gyr, depending on the ) is sufficient to capture these additional uncertainties for virtually all galaxies (with a few exceptions discussed below). As demonstrated in § 4.4 and § 4.5, this intrinsic scatter is our dominant source of uncertainty on a large range of galaxy luminosities and photometric depths. The resulting age uncertainty is comparable in size to the typical systematics that affect old stellar population ages (e.g., Tab. 3), meaning that our HB calibration is robust enough to be used as an effective archaeological tracer.
3.3 The Outliers: And VI, And XIII, and And XVI
The right panel of Fig. 5 demonstrates that our model is able to capture the relation between age, metallicity, and HB morphology, for most galaxies in our sample, within measurement uncertainties and the intrinsic scatter term. However, galaxies And VI, And XIII, and And XVI are clearly outliers relative to our model and most of the other systems. These three galaxies appear to be significantly younger than what we predict based on their HB morphology and metallicity.
Interestingly, these three galaxies also have unusual SFHs compared to the rest of the sample. As demonstrated in Savino et al. (2023), And XIII has experienced unusually low levels of early star formation, compared to other M31 satellites of similar luminosity, and ignited prominent star formation much later, at . A similarly delayed SFH has been shown for And XVI (Weisz et al., 2014b; Monelli et al., 2016; Skillman et al., 2017) and, from a recently measured SFH (Savino et al., in prep.), And VI also shows a strong burst of star formation at .
In the context where our model described by eq. 7 is capturing the correlation between and introduced by the AMR, it is plausible that the late dominant star formation episodes for these three galaxies resulted in a similarly unusual enrichment history. For instance, the ignition of late prominent star formation could have been fueled by the infall of fresh metal-poor gas. This would have altered drastically the AMR, with new stars forming at lower metallicity and resulting in a comparatively blue HB morphology.
At present, it is not clear how to identify whether more distant galaxies might exhibit similar behavior, save perhaps from spectroscopic data. However, we note that the incidence in our sample is low, being approximately 10%. Under the assumption that our galaxy sample is broadly representative, which is implicit in the use of our calibration in the first place, this 10% incidence can be considered as a rough estimate of the abundance of such objects.
4 Discussion
4.1 Validation on Local Group Galaxies
Eq. 7 allows us to estimate a mass-weighted stellar population age for distant resolved galaxies for which the oldest MSTO is not accessible. The two requirements for using this relation are (i) a CMD that reaches the HB with SNR (see § 4.4) and (ii) an estimate of the mean metallicity (e.g., through the LZ relation).
In this section, we validate our results by comparing our HB-based values to MSTO-based values for Local Group galaxies that are not in our calibration sample. Specifically, we use photometric catalogs obtained from deep HST imaging of the Cetus and Tucana dSph galaxies (HST-GO-10505, PI: Gallart) and of the Eridanus II dwarf (HST-GO-14224, PI: Gallart; HST-GO-14234, PI: Simon; cfr. § 3.1.2). The catalogs have been obtained following the same photometric reduction of our primary sample (Savino et al., 2023). For a consistent comparison, we re-derived the MSTO-based SFHs using the same methodology used for our calibration sample (Savino et al., 2023, Savino et al., in prep.). This choice minimizes systematics due to, e.g., stellar models and and ensures a consistent test of our HB-based calibration. The MSTO-based SFH of Eridanus II is derived from the F606/F814W dataset.
For consistency with the calibration sample, and with application on distant galaxies, we adopted a mean metallicity value from the LZ relation of eq. 2 (Kirby et al., 2013b). For Cetus and Tucana, we used absolute luminosities from McConnachie (2012), while for Eridanus II we used the measurement from Crnojević et al. (2016). Within the respective uncertanties, the metallicities obtained from eq. 2 are in good agreement with more direct metallicity determinations (e.g., Li et al., 2017; Taibi et al., 2018, 2020; Fu et al., 2022). The values and the derived are reported in Tab. 3.
Figure 7 shows the CMDs of our validation sample and the BHB/RHB selection used to calculate . The values of inferred from the HB, and those calculated from the MSTO-based SFH are reported in Tab. 3. The agreement between HB-based and MSTO-ages is excellent. In all cases, the two measurements differ by less than 0.4 Gyr, a difference that is small compared to our uncertainty budget ( Gyr). It is also comparable to other common sources of uncertainties in the ages of resolved galaxies (e.g., stellar models, distance and reddening uncertainties, Gallart et al., 2005). The case of Eridanus II also demonstrates consistency between the HB-based ages from F475W/F814W and F606W/F814W data. Despite differences in the exact BHB/RHB selection, highlighted in § 3.1.2, the ages obtained from the two datasets differ by only 0.1 Gyr.
4.2 Validation on Galaxies with Recent Star Formation
All galaxies analyzed so far in this paper have not experienced significant star-formation activity in the last few Gyr. This means that the number of sources that could contaminate our BHB selection is low. In galaxies with more recent star formation, however, massive stars on the MS will overlap in magnitude and color with the stars on the blue end of the HB, complicating the measurement of the HB morphology. In this section we apply our methodology to two such galaxies, to quantify the impact of this effect.
Figure 8 shows the F475W/F814W CMD of the Pisces and Pegasus dwarf irregular galaxies, which are part of the M31 satellite Treasury sample. The CMDs clearly show the presence of a plume of bright MS stars in our BHB selection box. We deal with this contamination using the same method described in § 3.1.3. We determine the luminosity function of the MS using stellar counts above and below the BHB box (magenta boxes in Fig. 8). The MS boxes span 0.5 mag in magnitude and are subdivided in five and three bins for the lower and upper boxes, respectively. The number of MS contaminants, , is then estimated by fitting the luminosity function with a power-law and integrating the resulting luminosity density over the span of the BHB box. The decontaminated sample of BHB stars is then used to calculate .
The values of we measure through the HB morphology of Pisces and Pegasus are Gyr and Gyr, respectively (assuming absolute luminosities from Savino et al. 2022). We compare these values with the corresponding obtained from the MSTO (Savino et al., in prep., listed in Tab. 3) and find that the two measurements differ by approximately 0.6 Gyr. This is somewhat larger than the difference measured for passive galaxies in § 4.1, which reflects the added uncertainty introduced by the MS decontamination. Nevertheless, HB and MSTO based ages are still compatible to the level of both statistical and systematic uncertainties. The conclusion is that, while the presence of the bright MS might have an effect on the fidelity of HB-based ages, the accuracy of our method is still marginally impacted.

\topruleName | from HB | from MSTO | |||||
---|---|---|---|---|---|---|---|
mag | Mpc | mag | dex | Gyr | Gyr | ||
\topruleEriII(F606W) | 22.67±0.04 | 0.34±0.01 | -7.1 | -2.06±0.19 | |||
EriII(F475W) | 22.67±0.04 | 0.34±0.01 | -7.1 | -2.06±0.19 | |||
Cetus | 24.46±0.12 | 0.78±0.04 | -11.2 | -1.58±0.18 | |||
Tucana | 24.74±0.12 | 0.89±0.05 | -9.5 | -1.78±0.18 | |||
Pisces | 23.91±0.05 | 0.61±0.01 | -9.8 | -1.74±0.18 | |||
PegDIG | 24.74±0.05 | 0.89±0.02 | -12.3 | -1.45±0.18 | |||
KKR25 | 26.42±0.07 | 1.92±0.06 | -10.5 | -1.66±0.18 | - | ||
VV124 | 25.67±0.11 | 1.36±0.07 | -12.5 | -1.43±0.18 | - | ||
\toprule |
4.3 Application to Data Beyond the Local Group: KKR 25 and VV 124
The ability to obtain reliable stellar ages from the morphology of the HB is of particular value for galaxies in the immediate vicinity ( Mpc) of the Local Group. Detecting the oMSTO of galaxies in this distance range is either impossible or impractically expensive with current telescopes. On the other hand, the brighter and less crowded HB stars can be observed much more efficiently in these galaxies and represent the best available tracer of their early star formation. In this section, we provide a first application of our new method by deriving stellar ages for the isolated galaxies VV 124 ( Mpc, Tully et al., 2013) and KKR 25 ( Mpc, Makarov et al., 2012).
VV 124 and KKR 25 are two relatively bright (), metal-poor () dwarf galaxies located just outside the LG (e.g., Karachentsev et al., 2001; Kopylov et al., 2008; Bellazzini et al., 2011b; Makarov et al., 2012; Kirby et al., 2012, 2013a). Both galaxies host predominantly old/intermediate stellar populations with a small amount of recent Gyr) star formation (Kopylov et al., 2008; Bellazzini et al., 2011a; Makarov et al., 2012; Neeley et al., 2021), have relatively low content (Begum & Chengalur, 2005; Bellazzini et al., 2011b), and (in the case of VV 124) pressure-supported kinematics (Kirby et al., 2012). Many of these observations have led to suggestions that these galaxies are transition dwarfs in a late stage of their evolution.
The properties of KKR 25 and VV 124 are particularly intriguing in the context of low-mass galaxy evolution. The long-established morphology-density relationship has been central to the idea that environment is a primary driver in the evolution and quenching of luminous () dwarf galaxies (e.g., Mateo, 1998; Grebel et al., 2003; Geha et al., 2012; Putman et al., 2021). Yet, early-type dwarfs such as KKR 25 and VV 124, are distant enough from other massive galaxies that past interactions are virtually impossible. This poses a challenge in our understanding of why such objects have relatively low levels of recent star formation and small neutral gas reservoirs in light of environmentally-driven theories of galaxy quenching.
While the distances of these two galaxies make their oMSTOs very challenging to observe, their HBs are well within the reach of existing HST observations. To measure the HB morphology of these two targets we have therefore performed PSF-photometry (using the same procedure as in § 2.1) on the archival HST data of program GO-15244 (PI: Monelli). Each galaxy has 16 orbits of ACS F475W and F814W imaging that have previously been used to study their RR Lyrae (Neeley et al., 2021).
Figure 9 shows the CMDs of KKR 25 and VV 124 around the HB region. The helium burning sequences of these galaxies are extended from the blue end of the HB to more massive helium-burning stars above the red clump, suggestive of an extended SFH. A notable feature in this figure is that both galaxies have a bright plume of young main-sequence (MS) stars which overlaps with the BHB. We account for this using the same procedure described in § 4.2.

The resulting HB morphology parameters are for KKR 25 and for VV 124. Using absolute magnitudes from McConnachie (2012) and deriving average metallicities through eq. 2 (listed in Tab. 3), we find Gyr for KKR 25 and Gyr for VV 124. These mass-weighted mean ages are similar to Cetus and Pisces, and quite younger than Tucana and Eri II.
Given that is an integrated measurement of the SFH, the interpretation of the mean ages of KKR 25 and VV 124 depends on our assumptions on when star formation began in these objects. Under the assumption that star formation began at the earliest epochs, our age measurements indicate a substantially extended SFH, akin to the aforementioned Pisces and Cetus dwarfs (Monelli et al., 2010; Hidalgo et al., 2011). On the other hand, it is possible that star formation remained low for the first one or two Gyr, and then experienced a short duration burst around . While we cannot discard this option, we note that VV 124 presents a prominent BHB population (clearly seen through the MS contamination), which suggest the presence of a sizable population of truly ancient stars.
Regardless, the presence of significant star formation at Gyr suggests that reionization had little impact on the early star formation of these galaxies. In comparison, isolated galaxies Leo A and DDO 210 have similar present day luminosities, but only formed 10% of their stellar mass at ages 10 Gyr, which, in some interpretations, could be due to suppression from the cosmic ultra-violet background (e.g., Cole et al., 2007, 2014). Similarly, the degree of isolation of these galaxies, combined with their star formation activity at low () redshift, suggests that the transition from gas-rich to gas-poor dwarf galaxies cannot solely be a function of environment. According to the 2023 version of the Nearby Galaxy Catalog (Karachentsev et al., 2004, 2013), the nearest massive neighbors to KKR 25 and VV 124 are more than 1 Mpc away (UGC8508 and Sextans B, for KKR 25 and VV 124, respectively), and even those are relatively low-mass dwarfs (). Any environmental interaction these galaxies might have had with a large galaxy must have happened at ancient times and, if responsible for their nature of early type galaxies, would have resulted in a very early depression of star formation.
For reference, the Cetus dSph, which has been proposed as a candidate backsplash galaxy from the MW or M31 (e.g., Sales et al., 2007; Teyssier et al., 2012), has a similar to VV 124 and KKR 25 but it currently lies at kpc from either large spirals (McConnachie, 2012). This is roughly two times closer than VV 124 and three times closer than KKR 25. Tucana, another candidate backsplash galaxy, lies at a larger distance of kpc from the MW (McConnachie, 2012), but has a that is Gyr older. It therefore seems unlikely that VV 124 and KKR 25 could have had an interaction with the MW or M31 and managed to sustain star formation for as long as their HB seems to indicate.
Finally, if, these galaxies had ancient SFHs that are more like MW satellites, i.e., dominant ancient episodes, than other isolated galaxies (e.g., Leo A, DDO 210), then they provide interesting counter examples to the slow/fast scenario proposed by Gallart et al. (2015), in which the dominance of ancient star formation can be mapped to the local environmental density at high-redshifts. While this limited sample hints at new insight into isolated dwarf galaxy formation, a large systematic study (e.g., if HB-depth photometry were available for the entire ANGST sample; Dalcanton et al. 2009) is needed to enable a truly statistical study.
4.4 Application to Data Beyond the Local Group: The Effect of Photometric Depth
A challenge of characterizing the HB morphology of galaxies beyond the LG will be the limited photometric precision that observations of these distant targets will achieve at the HB magnitudes. In contrast, our calibration sample has exquisite photometry of the HB region, with mean SNR ratios between 147 (And XVI) and 58 (And XXVI). To quantify the effect of lower photometric precision on the recovered ages, we have artificially degraded the photometry of one of our calibration galaxies and re-applied our methodology.
Figure 10 illustrates our experiment. We start with the original photometry of And XIV as our ground truth. The mean magnitude SNR of this dataset, at the magnitude level of the HB, is 74. The measured HB morphology index is , which results in an age of Gyr. We then perturbed the photometry of individual stars in And XIV with a random Gaussian term . The term is chosen to emulate a target SNR: . We adopt the same for the whole photometry, as we are considering a relatively small magnitude range in the CMD. The upper panel of Fig. 10 illustrates select examples of the CMDs we obtain. We then measure from the new degraded CMDs and derive values using eq. 7.
Because the morphology of the HB is parameterized using large selection boxes, the value of only begins to significantly change as a function of SNR for SNRs as low as . We find that below SNR, the measured value of can deviate significantly from the truth, in excess of measurement uncertainties. Though the is not heavily affected in our example, because the scatter term in Eq. 8 dominates, we find that the shallowest photometric depths affect our accuracy to Myr. Moreover, at such low SNRs, the lower RGB will no longer be accessible, meaning that our decontamination will not work. In contrast, for HBs with SNR , and age are statistically consistent and the lower RGB can be accessed for decontamination purposes.
4.5 Application to Data Beyond the Local Group: The Effect of CMD Population
Thanks to future deep, large-area surveys (e.g., Rubin, Roman, Euclid), our census of low-mass, nearby galaxies is poised to dramatically increase in the coming decade. For example, a wealth of new low-mass galaxies down to within Mpc should be discoverable (e.g., Mutlu-Pakdil et al., 2021). Characterizing the ages of these galaxies from the MSTO will be challenging due to the large distances. While the HB will be much brighter, it may also be sparsely populated (e.g., a few dozen HB stars for bright ultra-faint dwarfs). Accordingly, it is important to quantify the accuracy of our method as a function of stellar population size.
To do so, we start by using two of our calibration galaxies, with different values of , for ground truth. We have selected And V (; Savino et al. 2022), which has a population of stars (before decontamination) and a relatively red HB morphology (), and And XVII (; Savino et al. 2022), which has a population of stars and a much bluer HB morphology (). We then re-sample (without replacement) the CMD of these two galaxies until a given number of HB stars, , is reached. For a given we generate 1000 CMD realizations and calculate the corresponding and values. We also scale the absolute luminosity of both galaxies accordingly, as we decrease the HB counts, to estimate the approximate galaxy luminosity at which HB sampling becomes relevant.


Figure 11 shows the difference between the original measurements of and , and those measured from the re-sampled CMDs, as a function of . We do not report any significant bias in the measured ages, even in sparsely populated CMDs. This is likely because stochastic sampling affects stars equally regardless of their position on the HB. The reduced HB population, however, does steadily increase the measurement uncertainty on , as it is reasonably expected. Nevertheless, the bottom panels of Fig. 11 show that the contribution of stochastic sampling to the uncertainty on remains subdominant with respect to our calibration scatter for HB populations of stars for And V and stars for And XVII. Below these HB population regimes, the low number of HB stars becomes the primary source of uncertainty in age.
The effect of low HB population kicks in at different HB counts for And V (70 stars) and And XVII (30 stars) due to their different HB morphology. Given that eq. 7 depends on the logarithm of , galaxies with blue HB morphologies (i.e., ) will be affected less by a given change in than galaxies with red HBs (i.e., ). The corresponding absolute luminosity at which HB sampling becomes relevant is for And V and for And XVII. The galaxy luminosity limit for our method applicability is therefore slightly dependent on HB morphology. We note, however, that at such low magnitudes, blue HBs are typically more common.
The above experiment assumes that the only source of uncertainty introduced by a low HB population is due to stochastic sampling. In reality, at low HB counts the issue of identifying bona fide HB stars become increasingly more relevant. Spurious sources in the HB region, such as photometric artifacts, foreground stars, or background galaxies, can potentially bias the inferred value of and, consequently, of . Such an effect is much more difficult to quantify, as it depends on a number of other factors, such as the depth of the photometry, the distance of the galaxy, its galactic latitude, its SFH, and the adopted methodology to reject non-stellar sources. As a general rule, we advise using our calibration with caution whenever the number of potential contaminants, as estimated by visual inspection of the CMD, is a non-negligible fraction of the stars in the HB region.
5 Conclusion
In this paper we explored the connection between the HB morphology of nearby resolved galaxies and their characteristic stellar population ages and metallicities, with the aim of developing a method for estimating the characteristic age of a resolved galaxy outside the LG.
1. We use the photometric catalogs and SFHs of 27 M31 satellites developed as part the M31 Satellite Treasury survey to measure a relationship between the ratio of blue-to-red HB stars and the mass-weighted oMSTO age of old/intermediate age stars ( Gyr) in the same galaxy. We find a strong correlation between , , and , with older galaxies having, on average, bluer HBs. We derive a linear relationship between these quantities that allows us to estimate the mean-mass weighted age of a stellar population given its HB morphology and luminosity. The resulting precision is Gyr. We perform extensive tests to validate these results and explore uncertainties (e.g., filter sets, photometric depth, CMD population).
2. We use this calibration to measure the mass-weighted age of the distant, isolated galaxies KKR 25 and VV 124. For those two galaxies, we measure values of ( Gyr) and ( Gyr) for KKR 25 and VV 124, respectively. These values indicate that early star formation in these targets may have proceeded for an extended period of several Gyr. Combined with the current isolated position of these galaxies, this suggests that KKR 25 and VV 124 reached their low specific star formation rate and neutral gas content in isolation, strongly suggesting that internal processes played a major role in their transformation from gas-rich to gas-poor dwarf galaxies.
3. We characterize the performance of our methodology for a range of photometric depths and galaxy masses. We conclude that our methodology is robust down to SNR of on the HB, and for galaxy luminosities as faint as . This demonstrates that our method can reliably be used to infer characteristic stellar population ages for the hundreds of faint dwarf galaxies that next-generation surveys will discover within a volume of 3-4 Mpc.
Appendix A Spectroscopic Metallicites
In our main analysis, we use metallicity values derived from the local LZ relation (Kirby et al., 2013a). This choice was made for the sake of homogeneity and to provide a calibration readily applied to more distant galaxies, which might not have spectroscopic data. Some of our galaxy sample, however, do possess metallicity measurements from RGB spectroscopy. In this appendix, we take advantage of these spectroscopic data to quantify systematics due to our LZ-based metallicity assumptions.
From our sample of 27 galaxies, six have spectroscopic metallicities from a large resolved sample of RGB stars (Vargas et al., 2014; Ho et al., 2015; Kirby et al., 2020). For these, we adopt the from Kirby et al. (2020), with the exception of And II, which we take from Ho et al. (2015). Furthermore, 15 additional galaxies have measurements from stacked RGB spectra (Collins et al., 2013). This gives us a total sample of 21 galaxies with spectroscopic information. Fig. 12 shows the comparison between the mean spectroscopic metallicities, , and the values we inferred through eq. 2, . As demonstrated by the original spectroscopic studies, the two values are in very good agreement, with typical differences comparable to the measurement uncertainties.
Rerunning our analysis with yields the set of model coefficients, for eq. 6, listed in Tab. 4. While slightly different, this alternative prescription does not result in significantly different values of . Fig. 13 reports the difference in measured , between the LZ-based and spectroscopy-based calibrations, as function of and . While some differences, upwards of 0.5 Gyr, may arise in some locations of this parameter space, those do not seem to be regions occupied by observed galaxies. In fact, in the - region spanned by our M31 sample, the mean magnitude difference is 0.12 Gyr and the highest difference is 0.38 Gyr. As an additional test, we have re-measured the ages of our validation galaxies (§ 4.1 and § 4.2), as well as those of KKR 25 and VV 124 (§ 4.3). For these galaxies, we estimate - using the LZ relation, but we calculate using the spectroscopy-based calibration. This emulates a typical use case on more distant galaxies. The resulting values are reported in Tab. 5. Also in this case, the inferred ages differ by a small amount, i.e., by less than 200 Myr. The conclusion is that adopting the LZ relation as our calibration baseline does not introduce a significant bias in our analysis.

\topruleName | LZ | Spectroscopy |
---|---|---|
\toprulea | ||
b | ||
c | ||
ln(V) | ||
\toprule |

\topruleName | LZ-based | Spectroscopy-based | from MSTO | |||
mag | dex | Gyr | Gyr | Gyr | ||
\topruleEriII(F606W) | -7.1 | -2.06±0.19 | ||||
EriII(F475W) | -7.1 | -2.06±0.19 | ||||
Cetus | -11.2 | -1.58±0.18 | ||||
Tucana | -9.5 | -1.78±0.18 | ||||
Pisces | -9.8 | -1.74±0.18 | ||||
PegDIG | -12.3 | -1.45±0.18 | ||||
KKR25 | -10.5 | -1.66±0.18 | - | |||
VV124 | -12.5 | -1.43±0.18 | - | |||
\toprule |
References
- Astropy Collaboration et al. (2013) Astropy Collaboration, Robitaille, T. P., Tollerud, E. J., et al. 2013, A&A, 558, A33, doi: 10.1051/0004-6361/201322068
- Bastian & Lardo (2018) Bastian, N., & Lardo, C. 2018, ARA&A, 56, 83, doi: 10.1146/annurev-astro-081817-051839
- Begum & Chengalur (2005) Begum, A., & Chengalur, J. N. 2005, MNRAS, 362, 609, doi: 10.1111/j.1365-2966.2005.09342.x
- Bellazzini et al. (2011a) Bellazzini, M., Perina, S., Galleti, S., & Oosterloo, T. 2011a, A&A, 533, A37, doi: 10.1051/0004-6361/201117275
- Bellazzini et al. (2011b) Bellazzini, M., Beccari, G., Oosterloo, T. A., et al. 2011b, A&A, 527, A58, doi: 10.1051/0004-6361/201016159
- Brown et al. (2014) Brown, T. M., Tumlinson, J., Geha, M., et al. 2014, ApJ, 796, 91, doi: 10.1088/0004-637X/796/2/91
- Cassisi et al. (2013) Cassisi, S., Mucciarelli, A., Pietrinferni, A., Salaris, M., & Ferguson, J. 2013, A&A, 554, A19, doi: 10.1051/0004-6361/201321311
- Catelan et al. (2009) Catelan, M., Grundahl, F., Sweigart, A. V., Valcarce, A. A. R., & Cortés, C. 2009, ApJ, 695, L97, doi: 10.1088/0004-637X/695/1/L97
- Cignoni & Tosi (2010) Cignoni, M., & Tosi, M. 2010, Advances in Astronomy, 2010, 158568, doi: 10.1155/2010/158568
- Clementini et al. (2005) Clementini, G., Ripepi, V., Bragaglia, A., et al. 2005, MNRAS, 363, 734, doi: 10.1111/j.1365-2966.2005.09478.x
- Cole et al. (2014) Cole, A. A., Weisz, D. R., Dolphin, A. E., et al. 2014, ApJ, 795, 54, doi: 10.1088/0004-637X/795/1/54
- Cole et al. (2007) Cole, A. A., Skillman, E. D., Tolstoy, E., et al. 2007, ApJ, 659, L17, doi: 10.1086/516711
- Collins et al. (2013) Collins, M. L. M., Chapman, S. C., Rich, R. M., et al. 2013, ApJ, 768, 172, doi: 10.1088/0004-637X/768/2/172
- Crnojević et al. (2016) Crnojević, D., Sand, D. J., Zaritsky, D., et al. 2016, ApJ, 824, L14, doi: 10.3847/2041-8205/824/1/L14
- Da Costa et al. (2002) Da Costa, G. S., Armandroff, T. E., & Caldwell, N. 2002, AJ, 124, 332, doi: 10.1086/340965
- Da Costa et al. (1996) Da Costa, G. S., Armandroff, T. E., Caldwell, N., & Seitzer, P. 1996, AJ, 112, 2576, doi: 10.1086/118204
- Da Costa et al. (2000) —. 2000, AJ, 119, 705, doi: 10.1086/301223
- Dalcanton et al. (2009) Dalcanton, J. J., Williams, B. F., Seth, A. C., et al. 2009, ApJS, 183, 67, doi: 10.1088/0067-0049/183/1/67
- Dalessandro et al. (2011) Dalessandro, E., Salaris, M., Ferraro, F. R., et al. 2011, MNRAS, 410, 694, doi: 10.1111/j.1365-2966.2010.17479.x
- de Boer et al. (2014) de Boer, T. J. L., Tolstoy, E., Lemasle, B., et al. 2014, A&A, 572, A10, doi: 10.1051/0004-6361/201424119
- de Boer et al. (2012a) de Boer, T. J. L., Tolstoy, E., Hill, V., et al. 2012a, A&A, 539, A103, doi: 10.1051/0004-6361/201118378
- de Boer et al. (2012b) —. 2012b, A&A, 544, A73, doi: 10.1051/0004-6361/201219547
- Dolphin (2016) Dolphin, A. 2016, DOLPHOT: Stellar photometry, Astrophysics Source Code Library, record ascl:1608.013. http://ascl.net/1608.013
- Dolphin (2000) Dolphin, A. E. 2000, PASP, 112, 1383, doi: 10.1086/316630
- Dolphin (2002) —. 2002, MNRAS, 332, 91, doi: 10.1046/j.1365-8711.2002.05271.x
- Dolphin (2012) —. 2012, ApJ, 751, 60, doi: 10.1088/0004-637X/751/1/60
- Dolphin (2013) —. 2013, ApJ, 775, 76, doi: 10.1088/0004-637X/775/1/76
- Dotter et al. (2010) Dotter, A., Sarajedini, A., Anderson, J., et al. 2010, ApJ, 708, 698, doi: 10.1088/0004-637X/708/1/698
- Fabrizio et al. (2015) Fabrizio, M., Nonino, M., Bono, G., et al. 2015, A&A, 580, A18, doi: 10.1051/0004-6361/201525753
- Foreman-Mackey et al. (2013) Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306, doi: 10.1086/670067
- Fu et al. (2022) Fu, S. W., Weisz, D. R., Starkenburg, E., et al. 2022, ApJ, 925, 6, doi: 10.3847/1538-4357/ac3665
- Gallart et al. (2005) Gallart, C., Zoccali, M., & Aparicio, A. 2005, ARA&A, 43, 387, doi: 10.1146/annurev.astro.43.072103.150608
- Gallart et al. (2015) Gallart, C., Monelli, M., Mayer, L., et al. 2015, ApJ, 811, L18, doi: 10.1088/2041-8205/811/2/L18
- Gallart et al. (2021) Gallart, C., Monelli, M., Ruiz-Lara, T., et al. 2021, ApJ, 909, 192, doi: 10.3847/1538-4357/abddbe
- Geha et al. (2012) Geha, M., Blanton, M. R., Yan, R., & Tinker, J. L. 2012, ApJ, 757, 85, doi: 10.1088/0004-637X/757/1/85
- Geisler et al. (2007) Geisler, D., Wallerstein, G., Smith, V. V., & Casetti-Dinescu, D. I. 2007, PASP, 119, 939, doi: 10.1086/521990
- Gratton et al. (2010) Gratton, R. G., Carretta, E., Bragaglia, A., Lucatello, S., & D’Orazi, V. 2010, A&A, 517, A81, doi: 10.1051/0004-6361/200912572
- Gratton et al. (2011) Gratton, R. G., Lucatello, S., Carretta, E., et al. 2011, A&A, 534, A123, doi: 10.1051/0004-6361/201117690
- Grebel & Gallagher (2004) Grebel, E. K., & Gallagher, John S., I. 2004, ApJ, 610, L89, doi: 10.1086/423339
- Grebel et al. (2003) Grebel, E. K., Gallagher, John S., I., & Harbeck, D. 2003, AJ, 125, 1926, doi: 10.1086/368363
- Green et al. (2019) Green, G. M., Schlafly, E., Zucker, C., Speagle, J. S., & Finkbeiner, D. 2019, ApJ, 887, 93, doi: 10.3847/1538-4357/ab5362
- Harbeck et al. (2001) Harbeck, D., Grebel, E. K., Holtzman, J., et al. 2001, AJ, 122, 3092, doi: 10.1086/324232
- Hidalgo et al. (2011) Hidalgo, S. L., Aparicio, A., Skillman, E., et al. 2011, ApJ, 730, 14, doi: 10.1088/0004-637X/730/1/14
- Hidalgo et al. (2018) Hidalgo, S. L., Pietrinferni, A., Cassisi, S., et al. 2018, ApJ, 856, 125, doi: 10.3847/1538-4357/aab158
- Ho et al. (2015) Ho, N., Geha, M., Tollerud, E. J., et al. 2015, ApJ, 798, 77, doi: 10.1088/0004-637X/798/2/77
- Hunter (2007) Hunter, J. D. 2007, Computing in Science Engineering, 9, 90
- Karachentsev et al. (2004) Karachentsev, I. D., Karachentseva, V. E., Huchtmeier, W. K., & Makarov, D. I. 2004, AJ, 127, 2031, doi: 10.1086/382905
- Karachentsev et al. (2013) Karachentsev, I. D., Makarov, D. I., & Kaisina, E. I. 2013, AJ, 145, 101, doi: 10.1088/0004-6256/145/4/101
- Karachentsev et al. (2001) Karachentsev, I. D., Sharina, M. E., Dolphin, A. E., et al. 2001, A&A, 379, 407, doi: 10.1051/0004-6361:20011344
- Kirby et al. (2012) Kirby, E. N., Cohen, J. G., & Bellazzini, M. 2012, ApJ, 751, 46, doi: 10.1088/0004-637X/751/1/46
- Kirby et al. (2013a) —. 2013a, ApJ, 768, 96, doi: 10.1088/0004-637X/768/1/96
- Kirby et al. (2013b) Kirby, E. N., Cohen, J. G., Guhathakurta, P., et al. 2013b, ApJ, 779, 102, doi: 10.1088/0004-637X/779/2/102
- Kirby et al. (2020) Kirby, E. N., Gilbert, K. M., Escala, I., et al. 2020, AJ, 159, 46, doi: 10.3847/1538-3881/ab5f0f
- Kopylov et al. (2008) Kopylov, A. I., Tikhonov, N. A., Fabrika, S., Drozdovsky, I., & Valeev, A. F. 2008, MNRAS, 387, L45, doi: 10.1111/j.1745-3933.2008.00482.x
- Lee et al. (2009) Lee, M. G., Yuk, I.-S., Park, H. S., Harris, J., & Zaritsky, D. 2009, ApJ, 703, 692, doi: 10.1088/0004-637X/703/1/692
- Li et al. (2017) Li, T. S., Simon, J. D., Drlica-Wagner, A., et al. 2017, ApJ, 838, 8, doi: 10.3847/1538-4357/aa6113
- Mackey & van den Bergh (2005) Mackey, A. D., & van den Bergh, S. 2005, MNRAS, 360, 631, doi: 10.1111/j.1365-2966.2005.09080.x
- Makarov et al. (2012) Makarov, D., Makarova, L., Sharina, M., et al. 2012, MNRAS, 425, 709, doi: 10.1111/j.1365-2966.2012.21581.x
- Martell et al. (2011) Martell, S. L., Smolinski, J. P., Beers, T. C., & Grebel, E. K. 2011, A&A, 534, A136, doi: 10.1051/0004-6361/201117644
- Martin et al. (2017) Martin, N. F., Weisz, D. R., Albers, S. M., et al. 2017, ApJ, 850, 16, doi: 10.3847/1538-4357/aa901a
- Martínez-Vázquez et al. (2021) Martínez-Vázquez, C. E., Monelli, M., Cassisi, S., et al. 2021, MNRAS, 508, 1064, doi: 10.1093/mnras/stab2493
- Mateo (1998) Mateo, M. L. 1998, ARA&A, 36, 435, doi: 10.1146/annurev.astro.36.1.435
- McConnachie (2012) McConnachie, A. W. 2012, AJ, 144, 4, doi: 10.1088/0004-6256/144/1/4
- McQuinn et al. (2010) McQuinn, K. B. W., Skillman, E. D., Cannon, J. M., et al. 2010, ApJ, 721, 297, doi: 10.1088/0004-637X/721/1/297
- Milone et al. (2014) Milone, A. P., Marino, A. F., Dotter, A., et al. 2014, ApJ, 785, 21, doi: 10.1088/0004-637X/785/1/21
- Milone et al. (2018) Milone, A. P., Marino, A. F., Renzini, A., et al. 2018, MNRAS, 481, 5098, doi: 10.1093/mnras/sty2573
- Monelli et al. (2010) Monelli, M., Hidalgo, S. L., Stetson, P. B., et al. 2010, ApJ, 720, 1225, doi: 10.1088/0004-637X/720/2/1225
- Monelli et al. (2016) Monelli, M., Martínez-Vázquez, C. E., Bernard, E. J., et al. 2016, ApJ, 819, 147, doi: 10.3847/0004-637X/819/2/147
- Muñoz et al. (2018a) Muñoz, R. R., Côté, P., Santana, F. A., et al. 2018a, ApJ, 860, 65, doi: 10.3847/1538-4357/aac168
- Muñoz et al. (2018b) —. 2018b, ApJ, 860, 66, doi: 10.3847/1538-4357/aac16b
- Mutlu-Pakdil et al. (2021) Mutlu-Pakdil, B., Sand, D. J., Crnojević, D., et al. 2021, ApJ, 918, 88, doi: 10.3847/1538-4357/ac0db8
- Nagarajan et al. (2022) Nagarajan, P., Weisz, D. R., & El-Badry, K. 2022, ApJ, 932, 19, doi: 10.3847/1538-4357/ac69e6
- Neeley et al. (2021) Neeley, J. R., Monelli, M., Marengo, M., et al. 2021, ApJ, 920, 152, doi: 10.3847/1538-4357/ac1a7a
- Perez & Granger (2007) Perez, F., & Granger, B. E. 2007, Computing in Science Engineering, 9, 21
- Piotto et al. (2007) Piotto, G., Bedin, L. R., Anderson, J., et al. 2007, ApJ, 661, L53, doi: 10.1086/518503
- Putman et al. (2021) Putman, M. E., Zheng, Y., Price-Whelan, A. M., et al. 2021, ApJ, 913, 53, doi: 10.3847/1538-4357/abe391
- Qu et al. (2023) Qu, H., Yuan, Z., Doliva-Dolinsky, A., et al. 2023, MNRAS, 523, 876, doi: 10.1093/mnras/stad1352
- Rejkuba et al. (2005) Rejkuba, M., Greggio, L., Harris, W. E., Harris, G. L. H., & Peng, E. W. 2005, ApJ, 631, 262, doi: 10.1086/432462
- Rejkuba et al. (2011) Rejkuba, M., Harris, W. E., Greggio, L., & Harris, G. L. H. 2011, A&A, 526, A123, doi: 10.1051/0004-6361/201015640
- Rose et al. (2005) Rose, J. A., Arimoto, N., Caldwell, N., et al. 2005, AJ, 129, 712, doi: 10.1086/427136
- Rusakov et al. (2021) Rusakov, V., Monelli, M., Gallart, C., et al. 2021, MNRAS, 502, 642, doi: 10.1093/mnras/stab006
- Salaris et al. (2013) Salaris, M., de Boer, T., Tolstoy, E., Fiorentino, G., & Cassisi, S. 2013, A&A, 559, A57, doi: 10.1051/0004-6361/201322501
- Sales et al. (2007) Sales, L. V., Navarro, J. F., Abadi, M. G., & Steinmetz, M. 2007, MNRAS, 379, 1475, doi: 10.1111/j.1365-2966.2007.12026.x
- Sandage & Wildey (1967) Sandage, A., & Wildey, R. 1967, ApJ, 150, 469, doi: 10.1086/149350
- Sarajedini et al. (1995) Sarajedini, A., Lee, Y.-W., & Lee, D.-H. 1995, ApJ, 450, 712, doi: 10.1086/176177
- Sarajedini et al. (2007) Sarajedini, A., Bedin, L. R., Chaboyer, B., et al. 2007, AJ, 133, 1658, doi: 10.1086/511979
- Savino et al. (2018) Savino, A., de Boer, T. J. L., Salaris, M., & Tolstoy, E. 2018, MNRAS, 480, 1587, doi: 10.1093/mnras/sty1954
- Savino et al. (2020) Savino, A., Koch, A., Prudil, Z., Kunder, A., & Smolec, R. 2020, A&A, 641, A96, doi: 10.1051/0004-6361/202038305
- Savino et al. (2019) Savino, A., Tolstoy, E., Salaris, M., Monelli, M., & de Boer, T. J. L. 2019, A&A, 630, A116, doi: 10.1051/0004-6361/201936077
- Savino et al. (2022) Savino, A., Weisz, D. R., Skillman, E. D., et al. 2022, ApJ, 938, 101, doi: 10.3847/1538-4357/ac91cb
- Savino et al. (2023) —. 2023, arXiv e-prints, arXiv:2305.13360, doi: 10.48550/arXiv.2305.13360
- Schiavon et al. (2017) Schiavon, R. P., Johnson, J. A., Frinchaboy, P. M., et al. 2017, MNRAS, 466, 1010, doi: 10.1093/mnras/stw3093
- Simon (2019) Simon, J. D. 2019, ARA&A, 57, 375, doi: 10.1146/annurev-astro-091918-104453
- Simon et al. (2021) Simon, J. D., Brown, T. M., Drlica-Wagner, A., et al. 2021, ApJ, 908, 18, doi: 10.3847/1538-4357/abd31b
- Skillman et al. (2017) Skillman, E. D., Monelli, M., Weisz, D. R., et al. 2017, ApJ, 837, 102, doi: 10.3847/1538-4357/aa60c5
- Taibi et al. (2022) Taibi, S., Battaglia, G., Leaman, R., et al. 2022, A&A, 665, A92, doi: 10.1051/0004-6361/202243508
- Taibi et al. (2020) Taibi, S., Battaglia, G., Rejkuba, M., et al. 2020, A&A, 635, A152, doi: 10.1051/0004-6361/201937240
- Taibi et al. (2018) Taibi, S., Battaglia, G., Kacharov, N., et al. 2018, A&A, 618, A122, doi: 10.1051/0004-6361/201833414
- Tailo et al. (2020) Tailo, M., Milone, A. P., Lagioia, E. P., et al. 2020, MNRAS, 498, 5745, doi: 10.1093/mnras/staa2639
- Teyssier et al. (2012) Teyssier, M., Johnston, K. V., & Kuhlen, M. 2012, MNRAS, 426, 1808, doi: 10.1111/j.1365-2966.2012.21793.x
- Tolstoy et al. (2009) Tolstoy, E., Hill, V., & Tosi, M. 2009, ARA&A, 47, 371, doi: 10.1146/annurev-astro-082708-101650
- Tully et al. (2013) Tully, R. B., Courtois, H. M., Dolphin, A. E., et al. 2013, AJ, 146, 86, doi: 10.1088/0004-6256/146/4/86
- van den Bergh (1967) van den Bergh, S. 1967, AJ, 72, 70, doi: 10.1086/110203
- van der Walt et al. (2011) van der Walt, S., Colbert, S. C., & Varoquaux, G. 2011, Computing in Science Engineering, 13, 22
- VandenBerg et al. (2013) VandenBerg, D. A., Brogaard, K., Leaman, R., & Casagrande, L. 2013, ApJ, 775, 134, doi: 10.1088/0004-637X/775/2/134
- Vargas et al. (2014) Vargas, L. C., Geha, M. C., & Tollerud, E. J. 2014, ApJ, 790, 73, doi: 10.1088/0004-637X/790/1/73
- Virtanen et al. (2020) Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nature Methods, 17, 261, doi: https://doi.org/10.1038/s41592-019-0686-2
- Weisz et al. (2014a) Weisz, D. R., Dolphin, A. E., Skillman, E. D., et al. 2014a, ApJ, 789, 148, doi: 10.1088/0004-637X/789/2/148
- Weisz et al. (2011) Weisz, D. R., Dalcanton, J. J., Williams, B. F., et al. 2011, ApJ, 739, 5, doi: 10.1088/0004-637X/739/1/5
- Weisz et al. (2014b) Weisz, D. R., Skillman, E. D., Hidalgo, S. L., et al. 2014b, ApJ, 789, 24, doi: 10.1088/0004-637X/789/1/24
- Weisz et al. (2015) Weisz, D. R., Johnson, L. C., Foreman-Mackey, D., et al. 2015, ApJ, 806, 198, doi: 10.1088/0004-637X/806/2/198
- Wes McKinney (2010) Wes McKinney. 2010, in Proceedings of the 9th Python in Science Conference, ed. Stéfan van der Walt & Jarrod Millman, 56 – 61, doi: 10.25080/Majora-92bf1922-00a