Candidate high-redshift protoclusters and lensed galaxies in the Planck list of high-z sources overlapping with Herschel-SPIRE imaging
Abstract
The Planck list of high-redshift source candidates (the PHz catalogue) contains 2151 peaks in the cosmic infrared background, unresolved by Planck’s 5 arcmin beam. Follow-up spectroscopic observations have revealed that some of these objects are protoclusters and strong gravitational lenses, but an unbiased survey has not yet been carried out. To this end, we have used archival Herschel-SPIRE observations to study a uniformly-selected sample of 187 PHz sources. In contrast with follow-up studies that were biased towards bright, compact sources, we find that only one of our PHz sources is a bright gravitationally-lensed galaxy (peak flux mJy), indicating that such objects are rarer in the PHz catalogue than previously believed (< 1 per cent). The majority of our PHz sources consist of many red, star-forming galaxies, demonstrating that typical PHz sources are candidate protoclusters. However, our new PHz sources are significantly less bright than found in previous studies and differ in colour, suggesting possible differences in redshift and star-formation rate. Nonetheless, 40 of our PHz sources contain galaxy overdensities, comparable to the fraction of overdensities found in earlier biased studies. We additionally use a machine-learning approach to identify less extreme (peak flux mJy) gravitationally-lensed galaxies among Herschel-SPIRE observations of PHz sources, finding a total of seven candidates in our unbiased sample, and 13 amongst previous biased samples. Our new uniformly-selected catalogues of candidate protoclusters and strong gravitational lenses provide interesting targets for follow up with higher-resolution facilities, such as ALMA and JWST.
keywords:
galaxies: high-redshift – galaxies: clusters: general – submillimetre: galaxies – cosmology: observations – cosmology: large-scale structure of Universe – gravitational lensing: strong1 Introduction
Wide sky surveys at millimetre and submillimetre (submm) wavelengths are now routinely carried out to map the cosmic microwave background (CMB) at arcminute resolution for cosmological studies. In the process, these surveys inevitably find the brightest foreground (i.e., between the Earth and the surface of last scattering at ) millimetre and submm sources, which are themselves interesting for studying star-formation at various epochs. Some of these sources are bright star-forming regions in our Galaxy, while others are nearby galaxies or active galactic nuclei that can be filtered out with various selection criteria (e.g., Planck Collaboration XVI, 2011; Planck Collaboration XXIII, 2011; Planck Collaboration XXVIII, 2016; Planck Collaboration XXVI, 2016; Everett et al., 2020); however, after follow-up observations with higher-resolution imaging, the remaining sources are now known to be strong gravitational lenses (e.g., Hezaveh et al., 2013; Cañameras et al., 2015; Spilker et al., 2016) or overdensities of star-forming galaxies (e.g., Planck Collaboration XXVII, 2015; Flores-Cacho et al., 2016; Miller et al., 2018; Kneissl et al., 2019; Hill et al., 2020; Wang et al., 2020; Koyama et al., 2021; Polletta et al., 2021).
Strong gravitationally-lensed galaxies are useful targets for studying galaxy evolution. The magnification provided by lensing allows for the study of high- galaxies in much more detail than would be otherwise possible. This has significantly contributed to our understanding of the physical mechanisms driving star-formation at high redshift, through detailed studies of faint emission lines and resolved star-forming regions down to sub-kpc scales (e.g., Ebbels et al., 1996; Livermore et al., 2012; Massardi et al., 2018). Lensing can also be used to probe the substructure of dark matter halos, a key technique for constraining the properties of dark matter (e.g., Mao & Schneider, 1998; Dalal & Kochanek, 2002; Vegetti et al., 2014). To place tight constraints on dark matter properties, a large sample of bright gravitational lenses is required. However, currently only a few tens of sufficiently bright gravitationally-lensed systems are known. Previously, searching for such systems primarily relied on large optical surveys (e.g., Jacobs et al., 2019), but recently attention has been turning to the submm due to the efficiency at identifying high- sources, the ease of distinguishing between the lensed galaxy and foreground lens, and the potential for follow up with the Atacama Large Millimeter/Submillimeter Array (ALMA, Blain et al., 2002; Hezaveh et al., 2016; Ritondale et al., 2019). Submm all-sky surveys, such as the three highest frequency-channel maps from the Planck High-frequency Instrument (HFI), provide the opportunity to greatly increase the known sample of gravitationally-lensed systems, as well as to study the brightest ones in detail (Cañameras et al., 2015).
Despite the advances made in understanding galaxy evolution, the formation of galaxy clusters is not yet well understood. Studies of the progenitors of galaxy clusters, termed ‘protoclusters,’ can place important constraints on galaxy formation models and the ingredients going into cosmological simulations (Harrison & Coles, 2011; Miller et al., 2018). Furthermore, protoclusters are believed to represent the phase at which galaxies in galaxy clusters formed most of their stars, contributing per cent to the cosmic star-formation rate (SFR) at (Madau & Dickinson, 2014; Granato et al., 2015; Chiang et al., 2017). However, current galaxy formation models struggle to reproduce the SFRs observed in the most dense protoclusters (Hill et al., 2020; Lim et al., 2021). As such, the rare and extreme population of high- protoclusters can help to constrain models of galaxy evolution. Though difficult to identify, protoclusters are visible in the far-IR (FIR) and submm during this starbursting phase due to the copious amounts of dust-enshrouded star formation they host. Millimetre-wavelength data can select protoclusters at higher redshifts, but the Planck all-sky maps, which we focus on here, are effective for selecting the far-IR spectral maximum from , when the global star-formation activity peaked (Planck Collaboration XXVII, 2015; MacKenzie et al., 2017).
Planck’s ability to identify foreground objects of astrophysical significance was recognized early on. In particular, the Planck Collaboration released three catalogues of compact sources: the Planck Early Release Compact Source Catalogue (ERCSC; Planck Collaboration VII 2011); the Planck Catalogue of Compact Sources (PCCS; Planck Collaboration XXVIII 2014); and the Second Planck Catalogue of Compact sources (PCCS2; Planck Collaboration XXVI 2016). Several groups followed up these catalogues in search of high- clusters and protoclusters (Herranz et al., 2013; Clements et al., 2014; Baes et al., 2014; Greenslade et al., 2018). Specifically, Herranz et al. (2013), Clements et al. (2014) and Baes et al. (2014) looked for overdensities of dusty star-forming galaxies in the ERCSC, using early Herschel wide-field surveys (H-ATLAS, HerMES and HeViCS, respectively). Herranz et al. (2013) discovered a candidate protocluster, while Clements et al. (2014) found another four candidates. On the other hand, Baes et al. (2014) found no candidate protoclusters in the PCCS that fell in the HeViCS field. Most recently, Greenslade et al. (2018) found 27 candidate protoclusters in the PCCS and PCCS2 that fell within the 800 deg2 of available Herschel survey area.
In addition to the various compact source catalogues, the Planck Collaboration released a catalogue of 2151 potential high- sources (the Planck list of high- source candidates, or PHz) by looking at peaks in the cosmic infrared background (Planck Collaboration XXXIX, 2016). Planck Collaboration XXVII (2015), hereafter PlanckXXVII, performed a dedicated Herschel-SPIRE follow up of 228 Planck-selected high- candidates, consisting of red sources from the PCCS and sources selected in a similar manner to the PHz catalogue. PlanckXXVII found that 212 of their Planck sources are consistent with being overdensisities of high- star-forming galaxies (i.e., potential protoclusters) and 12 are strong gravitationally-lensed candidates that have all since been confirmed (one was previously discovered); the remaining four sources were dominated by Galactic cirrus. Many of the 212 PlanckXXVII sources are now confirmed submm overdensities based on follow-up studies (MacKenzie et al., 2017; Frye et al., 2019; Martinache et al., 2018; Kneissl et al., 2019; Cheng et al., 2020), and the newly-discovered lensed galaxies have been the focus of a number of analyses (e.g., Cañameras et al., 2015; Nesvadba et al., 2016; Cañameras et al., 2017a, b, 2018). Follow-up observations of individual PHz overdensities have now confirmed a handful of protoclusters (Flores-Cacho et al., 2016; Kneissl et al., 2019; Koyama et al., 2021; Polletta et al., 2021). Despite the success of the PHz in identifying high- protoclusters and gravitationally-lensed galaxies, no additional comprehensive follow-up studies of PHz sources have been done. Furthermore, PlanckXXVII followed up a biased sample of PHz sources, focusing on compact, high signal-to-noise objects and a comprehensive, unbiased study has not yet been carried out.
While PlanckXXVII was able to establish the existence of strong gravitational lenses and protoclusters in the PHz catalogue, the fact that the follow-up observations were biased towards bright and compact sources meant that the underlying statistics of the catalogue could not be robustly determined. In order to correctly deduce the nature of the PHz catalogue, in this paper we investigate a sample of 187 unbiased Herschel-SPIRE observations of PHz sources that serendipitously fall within the 1270 deg2 coverage of the Herschel wide-field surveys, now combined into a single data release by the Herschel Extragalactic Legacy Project (HELP; Shirley et al., 2021b). Additionally, we use our unbiased sample to produce sub-samples of the most statistically significant strong gravitational lenses and protocluster candidates that can be followed up with future high-resolution submm surveys, ultimately improving the sample sizes of these objects and enabling more detailed studies of dark matter halos, star formation and high-redshift clustering.
Section 2 describes the data used in this paper, including the PHz catalogue and maps obtained from HELP. In Section 3 we describe our Herschel-SPIRE photometry methods and in Section 4 we present the statistics of the PHz HELP sources and compare them to results from PlanckXXVII. In Section 5, we identify statistically-significant candidate protoclusters and gravitationally-lensed galaxies, and the discussion and conclusions are presented in Sections 6 and 7, respectively. Throughout this paper, we assume a standard CDM cosmology with parameters from Planck Collaboration VI (2020).
2 Data
Here we provide a brief overview of the PHz catalogue and the HELP data release. For more details on the PHz catalogue and HELP, see Planck Collaboration XXXIX (2016) and Shirley et al. (2021a), respectively.
2.1 The Planck List of High-z Source Candidates

Planck mapped the whole sky between 30 and 857 GHz, including channels centred at 217, 353 and 545 GHz (Planck Collaboration I, 2014). Using these maps, the Planck Collaboration generated a list of high- candidates by identifying red peaks in the CIB (Planck Collaboration XXXIX, 2016). Specifically, maps of the cleanest 25.8 per cent of the sky (i.e., the least contaminated by Galactic emission) were used, and the Infrared Astronomical Satellite (IRAS) 3-THz map was included in the analysis. The CMB was removed from each channel map using the 217-GHz map as a template, and Galactic cirrus was removed using the IRAS 3-THz map. S/N5 sources were identified in the 545-GHz ‘excess’ map, constructed by subtracting a linear interpolation between the 353- and 857-GHz maps from the 545-GHz map. In addition, sources were required to have S/N3 at 353 and 857 GHz. Lastly, to avoid selecting cold Galactic clumps and radio galaxies, all sources with or were removed. The resulting catalogue consists of 2151 potential high- sources, spread across the northern and southern Galactic hemispheres (see Fig. 1). The focus on picking out the apparently coldest, potentially highest redshift objects makes the PHz catalogue complementary to the Planck compact source catalogues, with almost no overlap (Planck Collaboration XXXIX, 2016).
Across three proposal periods, a total of 228 Planck-identified sources were followed up with dedicated Herschel-SPIRE observations (PlanckXXVII): 204 sources were selected with an algorithm that resembled the final PHz selection process and the other 24 came from the PCCS. The 228-source sample was constructed based on preliminary and evolving Planck data, as new products were made available throughout the three SPIRE observation periods. Additionally, there was a bias for preferentially selecting bright sources with regular shapes to follow up with Herschel-SPIRE (Planck Collaboration XXXIX, 2016). As a result, many of the sources catalogued in PlanckXXVII did not end up in the final PHz list, and the sample was biased towards sources with high S/N, small angular sizes, small ellipticities and low extinctions when compared with the full PHz. Nonetheless, the selection process for the 228 sources was qualitatively similar to the PHz catalogue, requiring compact (at 5-arcmin resolution) source-detections at 545 GHz.
Specifically, of the 228 Planck-identified sources presented in PlanckXXVII, 83/228 (36 per cent) ended up in the final PHz catalogue. Due to these differences, the authors of Planck Collaboration XXXIX (2016) remarked that ‘it is hard to draw definitive conclusions about the nature of the PHz sources based on the Herschel analysis’. In what follows, we study a large and unbiased sample of PHz sources using recently homogenized archival Herschel-SPIRE observations. For comparison, we repeated all of our analyses on the 228 PlanckXXVII sources (although in practice, we were unable to recover the Herschel-SPIRE maps of nine of the sources). Hereafter, the 228 Planck-identified sources followed up in PlanckXXVII will be referred to as the ‘PlanckXXVII sources’.
2.2 The Herschel Extragalactic Legacy Project

Name | RA min | RA max | Dec min | Dec max | Area | 250 m (1) | 350 m (1) | 500 m (1) |
[deg] | [deg] | [deg] | [deg] | [deg2] | [mJy] | [mJy] | [mJy] | |
AKARI-NEP | 264.6 | 275.3 | 64.5 | 68.5 | 9.2 | 11.0 | 10.1 | 11.3 |
AKARI-SEP | 66.2 | 75.4 | 55.9 | 51.7 | 8.7 | 7.3 | 8.1 | 9.1 |
Boötes | 215.7 | 220.6 | 32.2 | 36.1 | 11.4 | 6.2 | 7.1 | 8.0 |
CDFS-SWIRE | 50.8 | 55.4 | 30.4 | 26.0 | 13.0 | 4.9 | 6.0 | 6.7 |
COSMOS | 148.7 | 151.6 | 0.8 | 3.6 | 5.1 | 4.6 | 5.8 | 6.3 |
EGS | 212.4 | 217.5 | 51.2 | 54.2 | 3.6 | 5.3 | 6.4 | 7.2 |
ELAIS-N1 | 237.9 | 247.9 | 52.4 | 57.5 | 13.5 | 6.5 | 7.3 | 8.3 |
ELAIS-N2 | 246.1 | 252.3 | 39.1 | 43.0 | 9.2 | 7.7 | 8.3 | 9.5 |
ELAIS-S1 | 6.4 | 11.2 | 45.5 | 41.6 | 9.0 | 6.1 | 7.0 | 7.9 |
GAMA-09 | 127.2 | 142.2 | 2.5 | 3.5 | 62.0 | 10.9 | 11.2 | 12.3 |
GAMA-12 | 172.3 | 187.3 | 3.5 | 2.5 | 62.7 | 10.5 | 10.8 | 12.0 |
GAMA-15 | 210.0 | 225.2 | 2.5 | 3.4 | 61.7 | 10.4 | 10.7 | 12.0 |
HATLAS-NGP | 189.9 | 209.2 | 21.7 | 36.1 | 177.7 | 11.0 | 11.3 | 12.2 |
HATLAS-SGP | 337.2 | 26.9 | 35.6 | 24.5 | 294.6 | 11.5 | 11.7 | 12.7 |
HDF-N | 188.1 | 190.4 | 61.8 | 62.7 | 0.67 | 3.6 | 5.0 | 5.6 |
Herschel-Stripe-82 | 348.4 | 36.2 | 9.1 | 8.9 | 363.4 | 15.3 | 15.4 | 16.7 |
Lockman-SWIRE | 154.8 | 167.7 | 55.0 | 60.8 | 22.4 | 5.5 | 6.6 | 7.3 |
SA13 | 197.6 | 198.5 | 42.4 | 43.0 | 0.27 | 9.1 | 9.8 | 10.9 |
SPIRE-NEP | 263.7 | 266.4 | 68.6 | 69.4 | 0.6 | 4.0 | 5.0 | 5.5 |
SSDF | 341.5 | 2.1 | 60.5 | 48.5 | 110.4 | 10.9 | 11.1 | 12.7 |
XMM-13hr | 202.9 | 204.4 | 37.4 | 38.5 | 0.76 | 8.1 | 8.8 | 9.8 |
XMM-LSS | 32.2 | 38.1 | 7.5 | 1.6 | 21.8 | 6.4 | 7.3 | 8.2 |
xFLS | 255.6 | 262.5 | 57.9 | 60.8 | 7.4 | 7.3 | 8.0 | 9.0 |
The Herschel satellite carried out a number of wide-field extragalactic surveys using the SPIRE instrument (see Fig. 1 and Table 1). HELP combines the Herschel-SPIRE survey fields into a consistent single data release spanning a total of 1270 deg2 on the sky (Shirley et al., 2021a). In particular, HELP DR1 provides homogeneous SPIRE 250-, 350- and 500-m images for each of the HELP fields. The HELP fields avoid regions of bright dust emission from our own Galaxy, span a range of right ascension and include areas of sky near both ecliptic poles. As a result, there is significant overlap between the coverage of HELP fields and sources in the PHz catalogue, allowing us to select an unbiased sample of PHz sources at the SPIRE wavelengths.
A total of 194 sources from the PHz list fall within the HELP fields, six of which lie too close to the edge of the HELP field coverage and one falls within a region masked by HELP because of Galactic cirrus, resulting in a total of 187 usable sources. Hereafter, these 187 PHz sources that lie in the HELP fields will be referred to as the ‘PHz HELP sources’. By design of the follow up in PlanckXXVII, there is no overlap between the PHz HELP sources and the PlanckXXVII sources. However, the PHz HELP sample includes two sources that have previously been studied extensively, namely PHz G95.5061.59 (Flores-Cacho et al., 2016) and PHz G237.042.5 (Koyama et al., 2021; Polletta et al., 2021).
3 Observations and photometry
3.1 SPIRE-detected galaxies within PHz sources
In order to study the number counts of SPIRE-detected galaxies in the PHz catalogue, we define an ‘IN region’ in the same way as in PlanckXXVII.111To avoid confusion, we will reserve the word ‘source’ for the Planck-selected targets and refer to the SPIRE-detected objects as ‘galaxies’. We draw contours enclosing 50 per cent of the Planck flux density at 545 GHz (the frequency used to select PHz sources), and use the galaxies within this region for further analysis. Other follow-up studies have adopted a circular IN region centred on the Planck position (e.g., Greenslade et al., 2018). We find that using a contour on the Planck 545-GHz map generally encloses sources more effectively than a circular IN region. That is, when the galaxies making up a PHz source cover an extended region, Planck partially resolves the source and the contour is slightly larger than the Planck beam, whereas for individual bright galaxies, the contour is approximately the size of the beam (see Fig. 2). As a result, defining the IN region using a Planck contour misses fewer contributing galaxies than a circular IN region (although the differences are not dramatic). For comparison, the contours have a median area of 59.2 arcmin2 on the sky, corresponding to a search radius of 4.34 arcmin, which is slightly smaller than the 4.63 arcmin radius adopted in Greenslade et al. (2018). We determine the IN region contour by smoothing the 545-GHz map with a 2.0 arcmin full width at half-maximum (FWHM) Gaussian, then find the contour on the smoothed 545-GHz map corresponding to 50 per cent of the peak flux density of the source. For sources where the contour does not close or extends far beyond the Herschel-SPIRE map coverage, a circular IN region centred on the Planck position with a radius of 4.34 arcmin is adopted, corresponding to the median contour area.
3.2 Independent photometry
Before performing photometry, we measure the noise at each wavelength across all of the HELP fields in a similar fashion to PlanckXXVII (Table 1). Specifically, in each HELP field, for each SPIRE band, we construct histograms including all of the pixels in the map. The histograms have a mean of 0 mJy, and appear Gaussian in shape, except for a positive tail of bright pixels corresponding to detected galaxies. We fit a Gaussian to each histogram in order to calculate the standard deviation of the pixel value distribution, , which quantifies the noise of the map (i.e., the combined confusion and detector noise). Note that all pixels, including those in the positive tail, are included when performing the Gaussian fit, thereby accounting for confusion noise (if instead the fit is restricted to the negative half of the histogram, the RMS noise levels change only slightly). For consistency with previous studies (e.g., PlanckXXVII; Greenslade et al., 2018), we use the same flux density thresholds for galaxy detection across all HELP fields. Because the noise levels of the HELP fields vary (Table 1), we adopt 250, 350 and 500 m flux density thresholds that are significantly larger than the noise level of all the fields (). Specifically, we choose flux density cutoffs of 4 at 250, 350 and 500 m, that is, 4 times the mean noise of all the HELP fields. The cutoff flux densities are 32.0, 34.6 and 38.5 mJy at 250, 350 and 500 m, respectively. These thresholds are comparable to those adopted in PlanckXXVII, although they are higher than the levels used in Greenslade et al. (2018), where 25.4 mJy was used at all three wavebands. We find that our results are not sensitive to the choice of flux density thresholds.
We detect Herschel-SPIRE galaxies independently in the 250-, 350- and 500-m maps before matching galaxies across the three bands. Photometry is performed on sky-subtracted mean-0 maps using Gaussian point-spread functions (PSFs) with FWHMs of 18.15, 25.15 and 36.30 arcsec for the 250-, 350- and 500-m maps, respectively (Griffin et al., 2010). First, we convolve the signal map with the corresponding Gaussian PSF. Then, we search for peaks in the convolved signal map that fall within the IN region. All local peaks in the 250-, 350- and 500-m convolved maps above the corresponding galaxy detection thresholds are independently identified as potential galaxies. The values and positions of peak pixels in the convolved maps are used as initial guesses for the galaxy fluxes and positions at the wavelength of detection. A Gaussian PSF fit is then performed on each detected galaxy in the non-convolved map, with a fixed FWHM (depending on the wavelength), and initial position/amplitude guesses from the convolved map. During PSF fitting, the PSF amplitude is allowed total freedom, but the positions are allowed to shift only slightly from the initial position (FWHM in each direction) to avoid accidentally fitting the PSF to a separate nearby galaxy instead of the intended galaxy. Finally, the amplitudes and positions of the fitted Gaussian PSFs are added to the 250/350/500 m galaxy catalogue.
When galaxy blending is present, which is especially common in the overdense sources discovered with Planck, a more careful fitting procedure is required. Whenever multiple galaxies are detected within two FWHMs of each other (corresponding to 6.0, 5.1 and 5.2 pixels at 250, 350 and 500 m, respectively), galaxy flux densities and positions are measured with simultaneous PSF fitting. That is, Gaussian PSFs are simultaneously fit to each of the nearby galaxies in the non-convolved map, avoiding the risk of inadvertently including the fluxes from other nearby galaxies during PSF fitting. In practice, twice the FWHM is a conservative threshold; galaxies separated by FWHM are too far away for significant blending to occur, in which case simultaneously fitting the PSFs is equivalent to independently fitting Gaussian PSFs to both galaxies. After simultaneous PSF fitting, the amplitudes and positions of the fitted Gaussian PSFs are added to the 250/350/500 m galaxy catalogue. We find that our galaxy detection and photometry values are largely consistent with the results from other comparable codes, including codes developed specifically for Herschel-SPIRE observations (e.g., sussextractor; Pearson et al. 2014).
3.3 Galaxy matching
Galaxies are matched across bands in order to measure single-galaxy properties (e.g., colours), as well as to identify candidate strong gravitationally-lensed galaxies. Matching galaxies detected across the 250-, 350- and 500-m maps is done by focusing on the galaxies detected at 350 m (also see PlanckXXVII). The 350-m map has the advantage of higher angular resolution and lower noise than the 500-m map, while also avoiding many of the low- galaxies that appear in the 250-m map. Our band-merging procedure proceeds as follows. For each galaxy detected in the 350-m map, if there is a galaxy within of the 350-m galaxy position in either the 250-m map (FWHM 18.15 arcsec) or the 500-m map (FWHM 36.30 arcsec), then the independently-calculated flux density is adopted from the corresponding map. If a galaxy is not detected within in the 250- or 500-m maps, we perform a ‘forced’ Gaussian PSF fit in the 250- or 500-m maps (for which there are no enforced thresholds). That is, we use the 350-m galaxy position as the initial guess position and the value in the nearest pixel in the convolved 250- or 500-m map as the initial guess flux in order to fit a Gaussian PSF (as in Section 3.2). If multiple galaxies are detected within of the 350-m position at either 250 or 500 m, we instead perform simultaneous forced Gaussian PSF fitting at the positions of the close galaxies in the other two maps (i.e., if a galaxy detected at 350 m breaks up into multiple galaxies at 250 m, simultaneous PSF fitting is carried out at the positions of the galaxies in the 350- and 500-m maps). To be consistent with independent galaxy detection, when forced PSF fitting is performed, we carry out simultaneous PSF fits on all galaxies found within of the galaxy being matched across maps. For instance, if a galaxy at 350 m has a single corresponding galaxy within at 250 m, but there is a second galaxy away at 250 m, simultaneous Gaussian PSF fitting on both of the galaxies is performed at 350 m and 500 m, to avoid biases due to galaxy blending. If a 350-m galaxy is successfully matched to a single galaxy at 250 m, or is revealed to break up into multiple galaxies, we record the positions of the galaxy at 250 m because of the superior angular resolution of the 250-m map (otherwise, the 350-m position is used). With this procedure, we generate a catalogue of galaxies with flux densities measured at 250, 350 and 500 m.
4 Source statistics
4.1 Planck flux densities
As described in Section 2.1, the PHz sources were selected using Planck flux densities, or more specifically, the ‘excess’ flux density at 545 GHz. PlanckXXVII followed up a sample of 228 bright sources selected in a similar fashion as the PHz catologue, of which 83 ended up in the final version of the PHz catalogue. To evaluate the statistical differences resulting from this selection process, in Fig. 3 we show the distributions of the Planck flux densities for the 187 PHz HELP sources, about 100 of the PlanckXXVII sources (the sources for which Planck fluxes are publicly available) and all 2151 PHz sources. PlanckXXVII sources that came from earlier, non-public, PHz catalogue-like selection processes are necessarily excluded. The two-sample Kolmogorov–Smirnov (KS) test rejects the null hypothesis that the PlanckXXVII sources are drawn from the same sample as the full PHz catalogue with -values of , and at 857, 545 and 353 GHz, respectively. For the PHz HELP sources, on the other hand, the two-sample KS test gives a -value of , and at 857, 545 and 353 GHz, respectively. In other words, the 187 PHz HELP sources are drawn from the same distribution as the entire PHz sample, and are therefore representative of the overall PHz catalogue. These trends are similar across other PHz source properties, such as FWHMs and ellipticities. It is worth noting that differences between the PlanckXXVII source flux densities and the PHz source flux densities (as well as other properties) are not simply caused by the inclusion of PCCS sources in the PlanckXXVII sample. We see an equally significant difference when we restrict the comparison to the 83 PlanckXXVII sources that ended up in the final PHz catalogue, indicating that there were biases in the selection of PlanckXXVII sources (see Planck Collaboration XXXIX 2016 for a discussion of these selection effects).

4.2 Number counts
In Fig. 4 we show Euclidean-normalized differential number counts of SPIRE-detected galaxies, , where is the total number of galaxies per steradian in the IN regions and is the flux density at 250, 350 or 500 m. Number counts are calculated using the galaxies detected independently in each of the three bands and are intended to be compared relative to one another, so no corrections (such as for completeness) have been applied. As in PlanckXXVII, we use the Lockman-SWIRE field to compare between the PHz HELP and PlanckXXVII sources. To compare the number counts of IN region galaxies with the number counts of random locations, we compute the number counts at 10,000 random positions in Lockman-SWIRE within a circle of radius 4.34 arcmin (the median size of the IN regions).
Similarly to the PlanckXXVII sources, when compared with random locations on the sky, the PHz HELP sources have a significant excess in number counts below approximately 100 mJy across the 250-, 350- and 500-m bands, with the most substantial differences seen at 500 m. This confirms that typical PHz sources contain an exceptional number of galaxies, just as the preferentially-selected PlanckXXVII sources do. That being said, the PHz HELP sources have lower number counts at 250 m than the PlanckXXVII sources. Additionally, above about 100 mJy, there are substantial differences between the PHz HELP and PlanckXXVII number counts. As found in PlanckXXVII, the PlanckXXVII sources contain significantly higher numbers of SPIRE galaxies at large flux densities compared to random positions, the brightest of which are now known to be strong gravitational lenses. Across all three bands, the HELP sources have lower numbers of SPIRE galaxies at large flux densities compared to the PlanckXXVII sample, potentially indicating that fewer of the PHz HELP sources contain strong gravitationally-lensed galaxies. This is investigated further in Section 5.

4.3 Projected galaxy densities
We next quantify the projected overdensities of our PHz HELP sources. In Fig. 5 we compare the number densities of IN region galaxies for the PHz HELP sources, the PlanckXXVII sources and 10,000 random positions in the Lockman-SWIRE field. Just as we found in Section 4.2, the PHz HELP sources are significantly overdense when compared with random positions, across all three bands. The PlanckXXVII sources appear more overdense at 250 m compared to the PHz HELP sources, whereas galaxy densisites appear more comparable at 350 and 500 m.
In order to take into account the background densities, as was done in PlanckXXVII, we define the overdensity contrast of a Planck source as
(1) |
where is the galaxy density within the contour of the PHz HELP/PlanckXXVII source and is the mean galaxy density of the 10,000 random positions in Lockman-SWIRE. In Fig. 6 we show the overdensity contrasts across all three SPIRE bands for our sample of PHz HELP sources and for the PlanckXXVII sources. As suggested by Fig. 4, the overdensity contrast distribution at 250 m for the PlanckXXVII sources is shifted to the right when compared to the PHz HELP sources. At 350 m and 500 m, the galaxy densities are comparable, although the PHz HELP sources have a slightly greater positive tail at 350 m and more sources with low galaxy density at 500 m. The two-sample KS test confidently rejects (at ) the null-hypothesis that the PHz HELP source galaxy densities are drawn from the same distribution as the random location galaxy densities across 250, 350 and 500 m. Again, this confirms the exceptional nature of typical PHz sources, demonstrating that on average the PHz sources are similarly overdense at 350 and 500 m to the preferentially-selected PlanckXXVII sources.


4.4 Colours
After matching galaxies across the 250-, 350- and 500-m bands, colours are quantified by comparing flux density ratios. Figure 7 shows the distributions of and for all of the galaxies within in the PHz HELP sources, the PlanckXXVII sources and 10,000 random Lockman-SWIRE positions. The PHz HELP sources display a clear excess of and ratios compared to random positions, indicative of an excess of red galaxies. The PlanckXXVII sources display a more significant excess of ratios compared to random positions; however, the distribution of is more consistent with random positions (these distributions are comparable to Fig. 7 from PlanckXXVII). Quantitatively, when PHz HELP colours are compared to random sources, we find that the null hypothesis is rejected at 20.3 and 14.2 for the colours and the colours, respectively. These differences demonstrate that the average PHz source contains an excess of red (and potentially higher ) sources, as was previously concluded for the preferentially-selected PlanckXXVII sources. However, we also find significant differences between the PHz HELP and PlanckXXVII galaxy colours. The two-sample KS test rejects (at 16.7) the null hypothesis that the distribution of colours for the PHz HELP sources is drawn from the same distribution as the PlanckXXVII sources. Similarly, for the distribution of colours, the KS test rejects the null hypothesis at 12.6. Furthermore, the peak in PHz HELP colours better aligns with the peak for all PHz sources from Planck colours.
To further illustrate the differences in galaxy colours between the PHz HELP, PlanckXXVII and random Lockman-SWIRE positions, Fig. 8 shows 2D histograms of versus . The PHz HELP sources have a clear tail of red galaxies extending to the right, which is not seen in the PlanckXXVII sources or the random sources. However, the PlanckXXVII colour distribution appears shifted upwards when compared to the other two distributions. Based on the distribution of random galaxy colours, we set the thresholds and to select the reddest galaxies, falling in the upper-right region of Fig. 8. Overlaying these thresholds on the 2D colour histograms of the PHz HELP and PlanckXXVII sources illustrates the differences between the colour distributions; when compared with the PlanckXXVII sources, significantly more PHz HELP sources lie above the threshold and noticeably fewer lie above the threshold. Density histograms of red galaxies that fall above these two cuts confirm these trends (see Fig. 9). The two-sample KS test rejects the null hypothesis that the distribution of red PHz HELP sources is the same as the distribution the red PlanckXXVII sources at 8.0 for the sources and 5.3 for the sources.
Based on the 2D colour distributions, we can also infer that fewer PHz HELP galaxies peak in flux density at 250 m as compared with both random galaxies and PlanckXXVII galaxies. Quantitatively, of random galaxies detected in the Lockman-SWIRE field, a total of 38 per cent peak at 250 m, and comparatively, 37 per cent of PlanckXXVII galaxies peak at 250 m. For the PHz HELP sources, on the other hand, we find that only 21 per cent of galaxies peak at 250 m. Instead, most PHz HELP galaxies (66 per cent) peak at 350 m, whereas only 41 per cent of galaxies in the PlanckXXVII sources peak at 350 m. In this way, the PHz HELP sources typically consist of redder galaxies than the PlanckXXVII sources do. This suggests that the distribution of galaxy redshifts in PHz HELP sources may differ from the distribution of redshifts in PlanckXXVII sources, a possibility that is further explored in Section 4.5; however, it is difficult to draw conclusions about galaxy redshifts with SPIRE photometry alone. It is also worth noting that PlanckXXVII reports that none of the brightest galaxies within the PlanckXXVII sources peak at 250 m (i.e., within each PlanckXXVII source, the brightest galaxy across all three bands is brightest at 350/500 m), while we find that a significant fraction of the brightest PlanckXXVII galaxies peak at 250 m (56 per cent), and a smaller, but still significant, fraction of the brightest PHz HELP galaxies peak at 250 m (37 per cent).



4.5 Photometric redshifts and star-formation rates
Owing to the fact that the rest-frame far-IR emission observed by SPIRE probes a relatively simple, thermal portion of a galaxy’s spectral energy distribution (SED), SPIRE colours can be used to provide crude estimates of photometric redshifts, luminosities and therefore SFRs, after applying various simplifying assumptions (e.g., Blain et al., 2003; Draine, 2003). Assuming that any AGN contribution is sub-dominant, the SPIRE bands are sensitive to thermal dust emission generated by star formation, characterized by a modified blackbody distribution with a single dust temperature. However, due to redshifting, we observe a dust temperature that is decreased by a factor of , preventing us from determining both the intrinsic dust temperature and the redshift simultaneously. Nonetheless, if we assume a typical dust temperature motivated by studies of similar SPIRE-detected galaxies, we can obtain rough redshift estimates and SFRs, which should at least be useful for comparitive studies.
For each galaxy in our sample, we fit the three SPIRE flux densities to a modified blackbody SED of the form
(2) |
where is the observed frequency, is the redshift, is an arbitrary normalization constant fixed to 3000 GHz, is the Planck constant, is the Boltzman constant, is the dust emissivity index, which is fixed to 1.5 (the same value adopted in PlanckXXVII) and is the dust temperature. In order to make progress, we set K for all of the galaxies in the analysis. This is in line with the typical dust temperatures measured in large samples of –2 submm-bright galaxies (e.g., Roseboom et al., 2012; Magnelli et al., 2014). Of course in reality, galaxies exhibit a wide range of temperatures, but we can nonetheless compare results between the PHz HELP sources, the PlanckXXVII sources and random positions. In Fig. 10 (left panel) we show the resulting distribution of photometric redshifts for our PHz HELP sources, compared to the distribution for the PlanckXXVII sources and 10,000 random Lockman positions. We find similar trends when instead adopting a dust temperature of K or K, and use the change in the median of the redshift distribution to make an estimate of the redshift uncertainty of .
Next, we calculate far-IR luminosities () by integrating our fits to Eq. 2 between 8 and 1000 m (in the rest-frame), and convert these to SFRs using a conversion factor of , which assumes a Chabrier initial mass function (Chabrier, 2003). The distributions of and SFR are shown in Fig. 10 (right panel). We again find similar trends when the assumed dust temperature is changed by K, giving an uncertainty of and .
In agreement with PlanckXXVII, we find that the redshift distribution of galaxies in the PHz HELP sources peaks at . The PHz HELP galaxy redshifts are largely comparable to the PlanckXXVII galaxy redshifts and both have a positive tail of high- galaxies when compared with galaxies found in random positions. The KS test confidently rejects the null hypothesis (at ) that the PHz HELP redshifts are drawn from the same distribution as the random galaxy redshifts.
It remains unknown whether the PHz sources are predominantly protoclusters of physically-associated galaxies at the same redshift or line-of-sight alignments of multiple groups of galaxies (e.g., Negrello et al., 2017; Gouin et al., 2022), as discussed further in Sect. 6. By comparing the SPIRE photometric redshifts of multiple galaxies detected within the same PHz source, we can attempt to quantify the presence of line-of-sight alignments within the PHz HELP sources, but we find that the SPIRE photometric redshifts are too imprecise () to distinguish between overdensities at a single redshift from random alignments along the line-of-sight.
Both the PHz HELP sources and PlanckXXVII sources generally have higher far-IR luminosities/SFRs than random sources (the KS test puts the significance of the differences at ). Furthermore, the far-IR luminosity/SFR distribution of the PlanckXXVII sources is shifted to the right of the distribution of the PHz HELP sources; the difference between the PHz HELP and PlanckXXVII sources is significant at by the KS test. The larger far-IR luminosities/SFRs of galaxies within the PlanckXXVII sources agrees with the larger flux densities of the PlanckXXVII sources, which is seen in the galaxy number counts (Fig. 4). Nonetheless, in the more representative PHz HELP sources, we still find an excess of high far-IR/SFR galaxies when compared with random positions.

4.6 Source stacking and surface brightness profiles



We next investigate the spatial properties of the PHz HELP sources through a stacking analysis. Because we are primarily interested in the spatial distributions of the overdense sources, we remove the sources that consist of a single bright galaxy (namely the Planck GEMS from the PlanckXXVII sample, and the bright gravitational lens G160 from the PHz HELP sample). We create mean image stacks by averaging the 250-, 350- and 500-m maps centred on the position of the brightest 350-m galaxy in each IN region.222We explored other stacking choices as well, including stacking on the flux-weighted centroid positions, and found similar trends. Before computing the average, we set the background of each image to 0 Jy beam-1 by subtracting the mean intensity value in pixels that are arcmin from the centre. In Fig. 11 we show the resulting stacks at 350 m for the PHz HELP sources, PlanckXXVII sources and random positions in the Lockman-SWIRE field centred on the brightest galaxy in each 4.34 arcmin radius circular region. It is clear from the 350-m stacks that the PHz HELP sources are significantly less bright than the PlanckXXVII sources. Nonetheless, similar to the PlanckXXVII stack, the PHz HELP stack has clustering extending out to about 5 arcmin from the centre that is not seen around random galaxies in the Lockman-SWIRE field. This confirms that average PHz sources display clustering, just as the preferentially-selected bright PlanckXXVII sources do.
To quantify the extended nature of the PHz HELP and PlanckXXVII stacks, we plot the radial surface brightness profiles of the PHz HELP, PlanckXXVII and random position stacks (see Fig. 12). Confirming the visual conclusions drawn from Fig. 11, the PHz HELP stacks are less bright than the PlanckXXVII stacks, but still clearly extended when compared with random sources across 250, 350 and 500 m. In comparison with the random sources, the PHz HELP sources display the strongest clustering at 500 m and the weakest at 250 m. This is likely caused by the PHz selection process, which relied primarily on detecting excess emission at 500 m. To directly compare the PHz HELP and PlanckXXVII radial profiles, we arbitrarily normalize both profiles to 1.0 Jy beam-1 in the 1.0–2.0 arcmin bin (Fig. 13). The PHz HELP and PlanckXXVII radial profiles closely match outside the central 1.0 arcmin bin; however, the PHz HELP sources are brighter within this central bin, suggesting that the PHz HELP sources are (relatively) more centrally-concentrated.
To confirm that the PHz HELP sources are genuinely more centrally-concentrated than the PlanckXXVII sources, we compare the ratios of the flux density contained within 1.0 arcmin of the brightest galaxy to the total flux density of the IN region (; see Fig. 14). For the purpose of this plot, we adopt a 4.34 arcmin radius circle for the IN region of each source so that differences in are not caused by variations in the IN region sizes. As alluded to by the normalized radial profiles, a greater fraction of the PHz HELP source flux densities come from the 1.0 arcmin region around the brightest galaxy when compared to the PlanckXXVII sources. According to the KS test, the differences between the PHz HELP and PlanckXXVII sources is significant at across the 250-, 350- and 500-m distributions. We thus conclude that the PHz HELP sources are less bright, but more centrally-concentrated than the PlanckXXVII sources.

5 Protocluster and strong gravitational lens candidates
Since there are now a number of confirmed strong gravitational lenses (Cañameras et al., 2015) and protoclusters (Flores-Cacho et al., 2016; Kneissl et al., 2019; Koyama et al., 2021) originating from Planck observations, we expect that similar objects should exist in our PHz HELP sample. Here we pursue a comprehensive search for protocluster and strong gravitational lens candidates, with the motivation to provide a representative list of targets for future surveys to further investigate the nature of the PHz sources, and to bolster the sample size of such objects, thereby enabling more detailed studies of star formation, early galaxy clustering and dark matter substructure.
5.1 Protocluster candidates
Because the photometric redshift information offered by the three far-infrared SPIRE bands is too uncertain to identify redshift clustering about a single target, we turn to the spatial overdensity data to identify the most promising protocluster candidates. We then statistically search for clustering within the combined photometric redshift distribution of the best protocluster candidates.
5.1.1 3 spatial overdensities
To search for the most likely protocluster candidates in our sample of 187 PHz sources, we now estimate the significance of the observed spatial overdensities. Some studies (e.g., PlanckXXVII) have relied on kernel density estimator techniques to determine the statistical significance of spatial overdensities, which sometimes results in high significances for both overdense sources and random locations in the Lockman-SWIRE field. To obtain more easily-interpretable significance estimates, we instead turn to a comparison with the distributions of galaxy densities from random positions in the Lockman-SWIRE field, which allow us to estimate 3 overdensity thresholds (see e.g., Greenslade et al. 2018; Cheng et al. 2019, 2020). In order to construct a homogeneous, representative sample of promising PHz protocluster candidates, we use the total galaxy densities in Figs. 5 and 6 (we separately explore the possibility of instead selecting protocluster candidates with red galaxy densities below). Using the distribution of galaxy densities from 10,000 random positions, we interpret the number of random sources with or more galaxies as a probability (), thereby estimating the statistical significance of a source containing or more galaxies and subsequently the significance of the corresponding galaxy density. A total of 12 random sources contain 20 or more mJy galaxies, corresponding to a probability of , or 3.0. A random source made up of 20 galaxies has a surface density of deg-2, providing an approximate 3 galaxy surface density threshold. Likewise, at 350 m, 3 random sources of mJy galaxies contain 16 galaxies, corresponding to a 3 surface density of deg-2. Lastly, the 3 surface density of mJy galaxies is deg-2. In Figs. 5 and 6 we show these resulting 3 thresholds as vertical dashed lines.
With these thresholds, we identify the most promising potential protoclusters as those that are overdense at 250, 350 or 500 m (see Table 2). One downside of this approach is that some sources pass the overdensity threshold because they contain few (5) galaxies within a small Planck intensity contour (20 arcmin2) rather than containing many sources extended over several arcmin. To exclude these artificially-overdense sources from the list of candidate protoclusters, we adopt a conservative, arbitrary threshold of 5 galaxies at 350 m, which results in a total of 40 protocluster candidates (changing the exact minimum galaxy threshold would include or exclude only a few sources).
In total, these 40 promising protocluster candidates make up 21 per cent of the total PHz HELP sample, including nine sources that are overdense at 250 m (4.8 per cent), 26 at 350 m (14 per cent) and 36 at 500 m (19 per cent). When this analysis is repeated on the PlanckXXVII sources, we find that 57/228 (25 per cent) are overdense, of which 28 are overdense at 250 m (12 per cent), 24 at 350 m (10 per cent) and 41 at 500 m (18 per cent). Although the PHz HELP sources are significantly less bright than the preferentially-selected PlanckXXVII sources (see Section 4.6), there is a similar fraction of overdensities in the PHz HELP and PlanckXXVII sources. Additionally, a smaller fraction of the PHz HELP sources are overdense at 250 m and a larger fraction are overdense at 350 m, consistent with the observed differences in galaxy colours.
For comparison with the PCCS, Greenslade et al. (2018) found a total of 27 overdense protocluster candidates out of 354 PCCS sources (8 per cent; many of these sources are overdense at 250 m), a substantially lower fraction, supporting the uniqueness of the PHz catalogue in containing many overdensities of red galaxies. However, because the PCCS contains significantly more sources than the PHz catalogue, the density on the sky of overdense protocluster candidates in the PCCS catalogue ( deg-2) is comparable to the density in the PHz catalogue ( deg-2). It is worth noting that the PHz HELP sources that do not pass the 3 overdensity threshold are still interesting; the stacking analysis in Section 4.6 demonstrated that typical PHz sources are still significantly more extended than random locations on the sky. In fact, neither of the two previously-studied PHz HELP sources, PHz G95.5061.59 (Flores-Cacho et al., 2016) and PHz G237.042.5 (Koyama et al., 2021; Polletta et al., 2021), are overdense in the SPIRE maps with our galaxy detection thresholds.
To identify the most promising high- protocluster candidates in the PHz HELP sources, we could instead search for the sources that contain the most red galaxies (Fig. 9). Repeating the process of identifying 3 thresholds using the distribution of densities from the random Lockman sources, the 3 red galaxy density thresholds are shown as vertical lines in Fig. 9. After searching for PHz sources that lie above the 3 threshold of 1.2 galaxies or the 3 threshold of 0.8 galaxies (and requiring 5 galaxies at 350 m as before), we find 38 candidate protoclusters, a similar fraction to above. In addition to these criteria identifying a similar number of candidate protoclusters, there is significant overlap between the 40 candidates from above and the 38 red candidates; 27 of the 38 red candidates (71 per cent) are included among the 40 overdense locations, supporting the potential high- nature of the protocluster candidates. For completeness, we report the 11 lower-significance red protocluster candidates in Table 2; however, we focus on the 40 homogeneously-selected sources as our main protocluster candidates.
5.1.2 Photometric redshift clustering in 3 spatial overdensities
Although SPIRE photometric redshifts for single galaxies are imprecise (), we may still attempt to search for photometric redshift clustering by statistically averaging over the 40 best protocluster sources. To do so, we follow a simplified version of an algorithm detailed by Castignani et al. (2014), which utilizes a background photometric redshift distribution to calculate Poisson probabilities within redshift bins. We define the background distribution (i.e., the number of galaxies per square degree per redshift) to be the distribution we derive from 10,000 random positions in the Lockman-SWIRE field (Fig. 10).
Next, we compare the photometric redshift distributions in each protocluster source with the background distribution. The redshift interval from 0–5 is divided into small redshift bins, allowing us to count the number of galaxies with photometric redshifts within a width of each bin. If the background number density of galaxies within the same redshift bin (say, bin ) is , and the solid angle of the protocluster candidate source is , then the probability of obtaining or more galaxies in the absence of clustering is given by the Poisson distribution,
(3) |
Thus, the null hypothesis of no clustering is rejected with probability
(4) |
The results from this calculation depend on the chosen redshift bins, which should be comparable to the photometric redshift uncertainty. We therefore fix , and note that Castignani et al. (2014) provides a more generalized framework for incorporating arbitrary redshift bin sizes, but we do not expect a more detailed calculation to provide much more information because our photometric redshift uncertainties are large (). Thus, for each redshift bin in each of the 40 protocluster candidate sources, we calculate using Eq. 4 with , and show the results in Fig. 15 (faint grey lines). Note that the results here have been smoothed by a Gaussian kernel with a width of 0.1, which is much smaller than the photometric redshift uncertainty, and therefore should not introduce any biases.
Next, to quantify the photometric redshift clustering probability of the full protocluster candidate sample, we compute the product of the individual probability distributions (Fig. 15, solid black line). This operation effectively averages the likelihoods across the sample, leaving us with the mean probability of clustering over all 40 best protocluster candidates (see Fig. 15). The final probability distribution shows a significant peak around –3, which indicates that the spatial overdensities in these 40 sources are not spread out uniformly in photometric redshift space, but are instead predominately found within this redshift range; repeating this analysis on the 38 red protocluster candidates produces a very similar probability distribution. However, we cannot say whether these sources are dominated by line-of-sight alignments within this redshift range; solving this issue will require spectroscopic follow-up observations.

5.2 Strong gravitational lens candidates
In PlanckXXVII, Herschel-SPIRE follow-up observations of 228 Planck-selected sources led to the discovery of 11 of the brightest known gravitationally-lensed submm galaxies (‘Planck’s dusty GEMS’; Cañameras et al. 2015). Due to the exceptional brightness of the GEMS and the relative paucity of other SPIRE galaxies in these Planck sources, identification of such bright gravitational lenses turned out not to be very challenging. PlanckXXVII classified sources as either overdensities or candidate gravitationally-lensed galaxies using two quantities: the flux density of the brightest galaxy in the source (at any wavelength); and the wavelength at which the brightest galaxy peaks. Specifically, any sources containing a galaxy with flux density mJy were identified as candidate lens systems and all other sources whose brightest galaxy peaks at 350 or 500 m were identified as overdensities. This classification scheme identified seven of the PlanckXXVII sources as candidate lensed galaxies, and the authors selected an additional five candidate lensed galaxies by visual inspection (all of which have peak flux mJy and appear as the only bright galaxy in the source). These 12 sources consisted of one previously-discovered strong gravitationally-lensed galaxy and 11 new lensing candidates, which have since been confirmed as bright gravitationally-lensed galaxies making up the GEMS sample (Cañameras et al., 2015).

Only one PHz HELP source, G160, contains a single exceptionally bright SPIRE galaxy, like the GEMs, and turns out to be a previously-discovered strong gravitationally-lensed galaxy (Geach et al., 2015; Harrington et al., 2016). However, not all strong gravitationally-lensed galaxies are as bright as Planck’s dusty GEMS; the PHz sources could contain high- strong gravitationally-lensed galaxies of somewhat lower brightness, comparable to other known gravitational lenses (e.g., Negrello et al., 2010; Weiß et al., 2013). Furthermore, unlike the GEMS, overdense PHz sources may contain high- strong gravitationally-lensed galaxies, making them difficult to identify visually. Studies of Herschel wide-field surveys have used a simple detection threshold of 100 mJy at 500 m to identify candidate lensed galaxies. Applying this criterion to the galaxies within the PHz HELP sources, we find a total of just four sufficiently bright galaxies. To search for more candidates, here we use a machine-learning (ML) approach to select gravitationally-lensed galaxies in the PHz HELP sources.
5.2.1 Feature selection
The choice of features to input into an ML model plays a crucial role in successful classification (see, for example Liu et al., 2019), in this case the identification of gravitationally-lensed galaxies. Due to the small training sample size and the unresolved nature of high- galaxies in SPIRE maps, we find that convolutional neural networks trained on images of sources struggle to successfully distinguish between lensed and non-lensed galaxies. Instead of providing images as inputs to the ML model, we choose to train an ML model using information about individual galaxies. This has the advantage of avoiding biases in the classification of galaxies based on other nearby galaxies (unlike visual identification), as well as compressing the total amount of information fed to the ML model. Furthermore, physically, we expect the brightnesses and colours to be the main pieces of information used to identify candidate gravitationally-lensed galaxies in these lower-resolution images (Coppin et al., 2006; Negrello et al., 2007). As a result, we provide the flux densities of individual IN region galaxies at 250, 350 and 500 m as inputs to the ML model. With the three galaxy flux densities as the input, a simple single-layer neural network (NN) classifier consistently trains to 100 per cent accuracy on the test set (see below).
5.2.2 Training and test sets
In contrast to PlanckXXVII, we look for gravitationally-lensed galaxies by classifying the individual SPIRE galaxies in our Planck sources, as opposed to entire sources, allowing for the possibility of finding a gravitational lens within an overdensity. As such, we require a training set consisting of individual lensed galaxies (positive examples) and non-lensed galaxies (negative examples).
For the sample of lensed galaxies, we include the 11 bright Planck dusty GEMS, as well as H-ATLAS J114637.9-001132 (a strong gravitationally-lensed galaxy at discovered using the first Planck compact source catalogue; Fu et al. 2012) and HLS J091828.6+514223 (a bright gravitationally-lensed galaxy at ; Combes et al. 2012). We also include a number of less bright gravitational lenses, which are crucial for identifying similar candidate galaxies in the SPIRE maps. Specifically, we include the two highest redshift lensed galaxies from Negrello et al. (2010), namely SDP.81 at (ALMA Partnership et al., 2015) and SDP.130 at , as well as all of the lensed galaxies from Weiß et al. (2013) that have a peak flux density 100 mJy at 350 or 500 m (16 galaxies total). We exclude the lower redshift lensed galaxies from Negrello et al. (2010) because they are below the redshift range (–4) of galaxies expected in the PHz sample. We exclude the fainter lensed galaxies from Weiß et al. (2013) because they are virtually indistinguishable from non-lensed galaxies in the SPIRE maps (the threshold at 100 mJy is motivated by e.g., Paciga et al. 2009; Negrello et al. 2010; Nayyeri et al. 2016). In other words, this is a cutoff in the magnification of the training lensed galaxies. The final training set thus contains a total of 31 confirmed high- strong gravitationally-lensed galaxies.
To create a sample of non-lensed star-forming galaxies, we turn to the multi-wavelength follow-up programmes of candidate protoclusters of star-forming galaxies from PlanckXXVII. MacKenzie et al. (2017) and Cheng et al. (2020) followed up large samples of PHz sources with SCUBA-2 at 850 m and selected 45 sources as protocluster candidates, based on their overdensities at this wavelength. Although these sources are well studied, this does not guarantee that there are undiscovered lensed galaxies within these protocluster candidates. Again, motivated by the criteria used to select gravitationally-lensed galaxies, we do not include galaxies with flux densities mJy at 350/500 m in the unlensed training set (11 galaxies across the 45 sources). The flux densities of these individual galaxies are largely indistinguishable from the flux densities of individual galaxies in less extreme sources. The training set contains the remaining 388 non-obviously gravitationally-lensed galaxies from the protocluster candidate sources. For model validation, we randomly select 20 per cent of the training set (84 galaxies) to be a test set, on which the model is tested but not trained.
5.2.3 Neural-network model
We construct our ML model using PyTorch (Paszke et al., 2019). We find that a simple NN model consistently trains to about 100 per cent accuracy on the test set, however, it does not consistently identify the same new candidate lensed galaxies each time it is trained. That is, the training data set is not sufficiently large or representative to fully constrain the weights and biases of the NN model, causing the random values in the initialized weights and biases to influence the NN’s classification results. Even with small, single-layer models, the results of the NN are not fully consistent across multiple training attempts, despite consistently training to 100 per cent accuracy on the test set. This is a common problem in ML that is typically overcome by using ensemble methods to combine the results of multiple ML models, making predictions more reliable (Opitz & Maclin, 1999; Rokach, 2010). Fortunately, training a simple NN model is fast enough to make ensemble learning feasible (a simple single-layer NN take about 10 s to train for 5000 epochs on a standard desktop). We adopt an ensemble learning technique in which we simultaneously train many NN models and use the average of their outputs for more consistent classification (Hansen & Salamon, 1990).
Our final ML classification model consists of 100 ensemble NNs. Each NN has three input nodes (corresponding to galaxy flux densities at 250, 350 and 500 m), one hidden layer of 20 nodes, and a single output node, for which an output of 1.0 corresponds to a lensed galaxy and 0.0 corresponds to an un-lensed star-forming galaxy. We adopt a standard rectified linear unit (ReLU; Nair & Hinton 2010) activation function for the hidden layer, a sigmoid (or logistic) activation function for the output layer, and the adaptive moment estimation optimizer (ADAM; Kingma & Ba 2014). We simultaneously train all 100 NNs over 5000 epochs (using the full training set every epoch) with a learning rate of 0.01, and average their outputs to obtain a final ML score. We use a standard, albeit arbitrary, cutoff at 0.5 to identify new lensed galaxies.
5.2.4 New candidate lensed galaxies
The ML model identifies a total of eight candidate gravitationally-lensed galaxies among the PHz HELP sources (see Table 3). As expected, the ML model confidently identifies G160 as a candidate gravitationally-lensed galaxy, which, as noted above, has been confirmed in the Herschel Stripe 82 field (Geach et al., 2015; Harrington et al., 2016). G160 is easily identifiable without sophisticated methods due to its exceptional brightness, comparable to Planck’s Dusty GEMS (an order of magnitude above the other candidate galaxies in the PHz HELP sources), in addition to being the only notable galaxy in this PHz source. In contrast, most of the other identified candidate lensed galaxies are contained within PHz sources that consist of many SPIRE galaxies, including overdensities G088 and G245. Notice that in the PHz HELP sources, we find only one exceptionally bright (peak flux density mJy) strong gravitationally-lensed galaxy out of 187 sources (0.5 per cent), whereas PlanckXXVII found a total of 12 exceptionally bright strong gravitational lenses (the 11 Planck GEMS and one previously discovered galaxy) out of 228 sources (5 per cent). The larger number of bright lensed galaxies in PlanckXXVII can be explained by the inclusion of PCCS sources in the follow-up programmes and the bias towards targeting bright sources. As a result, the population of bright gravitationally-lensed galaxies (peak flux mJy) in the PHz has been previously overestimated; we find them to be significantly more rare (perhaps 0.5 per cent, as opposed to 5 per cent). Of course there could still be a significant population of somewhat fainter gravitationally lensed galaxies in these catalogues.
Applying our ML model to galaxies within the PlanckXXVII sources, we find that we can identify 13 new candidate lensed galaxies from the PlanckXXVII sources (see Table 4). Additionally, we note that two PlanckXXVII maps contain very bright sources that lie outside of the IN region but still within 10 arcmin of Planck position (we include them in Table 4). One of the best lensed candidates from the PlanckXXVII sources, G224, has recently been followed up with ALMA and confirmed as a new Planck-identified strong gravitationally-lensed galaxy (Fig. 16). The 870-m ALMA image has spatially resolved G224, revealing a clear Einstein ring. This supports the suspected presence of gravitationally-lensed galaxies among the PHz sources that are less extreme than the Planck GEMS. Additionally, unlike the GEMS, G224 contains many SPIRE-detected galaxies, similar to many of the other candidates, rather than having a single dominant galaxy.

The SEDs of the new candidates, both from the PHz HELP sources and the PlanckXXVII sources, are consistent with previously-discovered lensed galaxies (see Fig. 17). In particular, the rising SEDs and peak flux densities of the new candidates are most comparable to the sample of lensed galaxies from Weiß et al. (2013), which have an average redshift of 3.5. Their SEDs are also similar to the highest redshift lensed galaxies in Negrello et al. (2010), namely SDP.81 and SDP.130, which were included in our training data set. The other lower-redshift lensed galaxies from Negrello et al. (2010) clearly peak at lower flux densities than these new candidates. Additional observations are required to determine whether the candidate lenses are true strong gravitationally-lensed galaxies, as well as to confirm their potentially high-redshift nature.
5.2.5 500-m threshold
As briefly mentioned above, previous studies searching for strong gravitationally-lensed galaxies in the submm took advantage of the steep number counts of non-lensed star-forming submm galaxies by adopting a strict flux-density selection threshold (Coppin et al., 2006; Negrello et al., 2007). In particular, a cutoff of 100 mJy at 500 m has been used to search for candidate lensed galaxies in Herschel wide-field surveys (Negrello et al., 2010; Nayyeri et al., 2016). Due to the high efficiency of this threshold at selecting lensed galaxies, we implicitly relied on this threshold when constructing the training data set. Of course, this raises the question of whether the ML model picks up on additional features when identifying candidate lensed galaxies or simply reproduces the results of a 100-mJy threshold at 500 m.
Among the PHz HELP sources, a total of four galaxies (out of 1892) have a flux density above 100 mJy at 500 m. The ML model, on the other hand, identifies eight candidate lensed galaxies, including all four galaxies with flux densities above 100 mJy at 500 m. The additional four galaxies identified by the ML model are bright at 500 m and either have increasing SEDs or a flux density above 100 mJy at 350 m (see Table 3). In the PlanckXXVII sources, a total of 30 galaxies (out of 1916) are brighter than 100 mJy at 500 m, significantly more than the PHz HELP sources. Excluding previously discovered sources, 18 galaxies have flux density above 100 mJy at 500 m and the ML model identifies a total of 13 new candidate lensed galaxies. There is significant overlap between the 18 galaxies with mJy and the 13 candidate lensed galaxies from the ML model; 12/13 of the potential lensed galaxies have mJy and the last galaxy has an increasing SED with just below mJy. The six galaxies with mJy that the ML model does not identify as candidate lensed galaxies have decreasing or flat SEDs (nonetheless, we include them in Table 4).
Because the ML model identifies candidate lensed galaxies with increasing SEDs and excludes some galaxies with decreasing/flat SEDs, the ML model is clearly more sophisticated than a simple threshold at 500 m; however, it is unclear whether this additional effort is worthwhile. When dealing with the bright galaxies within the PlanckXXVII sources, a simple threshold of 100 mJy at 500 m identifies nearly all of the ML candidate galaxies, as well as a handful of additional galaxies with decreasing or flat SEDs. These galaxies could be gravitationally lensed of course (possibly at lower redshift), but one could add an additional colour cut (e.g., ) to the simple 500 m threshold to remove such galaxies if desired. For the generally less bright galaxies within the PHz HELP sources, we see more of an advantage to using the ML model, since it doubles the number of candidate lensed galaxies identified from four to eight. However, some of the additional galaxies would instead be picked up with the addition of other simple criteria (e.g., mJy). As such, it is not clear if the increased sophistication and potential for identifying additional lensed galaxies with ML approaches is worth the required effort. Nonetheless, the classifications of our ML model are helpful for informing the choices of simple classification criteria (see below).
5.2.6 New threshold selection criteria

One of the advantages of providing flux densities at 250, 350 and 500 m as inputs to the ML model, as opposed to the unprocessed images themselves, is the greater potential to interpret the classifications of the ML model and to tune more straight-forward selection criteria. Based on , and , Planck’s dusty GEMS are easily distinguished from non-lensed submm galaxies (see Fig. 18). Using flux ratios (i.e., colours) alone, there is no clear distinction, but considering both brightness and flux ratios, there is an obvious separation between Planck’s dusty GEMS and typical submm galaxies (see left and centre panels of Fig. 18). Furthermore, we see that the ML model identifies the galaxies that best follow the trends of Planck’s dusty GEMS in versus and versus at lower brightnesses.
In the hope of reproducing the classification done by the ML model with a threshold criteria, we introduce demarcation lines in versus , versus and versus spaces. We optimize the slope and intercept of each demarcation line based on the classifications of the ML model. To incorporate both the accuracy and precision of the demarcation line criteria in searching for the best line, we optimize for the line’s score (the harmonic mean of the precision and recall: ). In other words, for each of the plots in Fig. 18, we search the two-dimensional parameter space to find the slope and intercept of the line that maximizes the score with respect to the ML model’s classifications. As expected, the candidate lensed galaxies cannot be accurately distinguished from the non-lensed galaxies using their colours alone; although the candidate lensed galaxies are red in colour, so are the majority of non-lensed galaxies; the ‘optimal’ line () has score < 0.5. For the absolute flux density plots, on the other hand, we see that the demarcation lines are able to separate the lensed galaxy candidates from the non-lensed galaxies much more effectively. The optimal demarcation line in versus space ( mJy) classifies galaxies as lensed candidates/non-lensed galaxies with an score of 0.999, and similarly, the optimal line in versus space ( mJy) also classifies galaxies with an score of 0.999. We define the threshold selection criteria as follows: galaxies that fall above the versus demarcation line or the versus demarcation line are identified as candidate lensed galaxies. We choose to identify candidate lensed galaxies that fall above either of the two demarcation lines because this slightly increases the score of the individual lines alone and results in a higher score than requiring galaxies to lie above both lines.
The threshold selection criteria lead to six candidate lensed galaxies among the PHz HELP sources, all of which were identified by the ML model. The criteria miss G245 due to its lower flux density at and m, which was likely identified by the ML model as a lensed galaxy based on its increasing SED, and narrowly miss G010 due to its relatively high flux density at m (note that these galaxies also did not have a high ML score). Among the PlanckXXVII sources, the threshold criteria identifies 15 candidate lensed galaxies, including all of the galaxies identified by the ML model, except G223 due to its lower flux density at and m. The other three galaxies identified by the threshold criteria all have mJy; in fact, they are the three brightest galaxies at 500 m not identified as lensed galaxies by the ML model. Despite the simplicity of the criteria, two linear flux thresholds are able to reproduce the results of the ML model reasonably well, missing a few of the galaxies with increasing SEDs, while also identifying some additional plausible lensed galaxies. Furthermore, our criteria are less strict than a threshold of 100 mJy at 500 m, potentially allowing for the detection of additional lensed galaxies that are fainter. Further follow-up observations are required to confirm the lensed nature of candidate galaxies identified by either the ML model or linear threshold criteria.
6 Discussion
6.1 The nature of the PHz sources
Now that we have investigated the resolved rest-frame far-IR properties of a representative subsample of PHz sources, we can discuss the overall statistical properties of the PHz catalogue, including some corrections to the conclusions drawn with the biased subsample analysed in PlanckXXVII.
In the preferentially-selected PlanckXXVII subsample, which had a bias towards high S/N sources with compact/regular shapes and included sources from the PCCS, 12 sources turned out to be ultra-bright strong gravitationally-lensed galaxies (5 per cent of their subsample). Based on our more representative sample, we now see that the fraction of bright strong gravitational lenses (peak flux mJy) in the PHz catalogue has been overestimated; we find only one such galaxy (a rate of 0.5 per cent), namely G160. The larger fraction of bright strong gravitationally-lensed galaxies found in the PlanckXXVII sample suggests that such bright gravitational lenses are more easily found in the Planck compact source catalogues (see Trombetti et al. 2021), rather than the PHz catalogue.
On the other hand, we confirm that most PHz sources correspond to overdensities in SPIRE maps. This is particular illustrated through the stacking analysis, in which average stacks of the PHz HELP sources, similarly to the PlanckXXVII sources, show significant extended emission out to about 5 arcmin, well beyond the SPIRE beam at all three observed wavelengths (with the most substantial differences at m). Additionally, we found a total of 40 sources having overdensities of SPIRE galaxies, corresponding to 21 per cent of the PHz HELP sources; this is comparable to the 25 per cent of overdensities found in the PlanckXXVII sources and includes a larger fraction of sources that are overdense at 350 and 500 m.
Despite the similarity in these results, we find significant differences between the PHz HELP and PlanckXXVII sources when we examine the SEDs of the galaxies found across the two subsamples. For our PHz HELP sources, we measure lower overdensities at 250 and 500 m when compared to the PlanckXXVII sources, with a slightly greater positive tail at 350 m (see Fig. 5). This is related to the differences in colour seen between the galaxies in these two subsamples, for which we find larger values of in the PHz HELP galaxies and larger values of in the PlanckXXVII galaxies. This may indicate that galaxies found in the PlanckXXVII sources are at systematically higher redshift or contain systematically cooler dust temperatures, although we cannot provide any definitive conclusions because of the large uncertainties associated with the SPIRE photometric redshifts. Indeed, the distributions of photometric redshifts appear nearly identical for the two subsamples (see Fig. 10).
We know that the PHz sources are places on the sky corresponding to peaks in the CIB that are red. Hence we know that these are special directions, containing galaxies with red colours at these wavelengths. However, we would like to know whether the observed projected overdensities are predominantly protoclusters of galaxies at the same redshift or line-of-sight alignments of two or more groups of galaxies. Unfortunately, our current photometric redshifts are too uncertain to distinguish between these two situations. Instead, to make progress, we turn to a comparison with simulations. Specifically, we repeated the PHz selection algorithm on the Simulated Infrared Dusty Extragalactic Sky (SIDES; Béthermin et al., 2017) simulation data. Briefly, SIDES simulated 2 deg2 of the extragalactic sky at the wavelengths observed by the SPIRE instrument (i.e., 250, 350 and 500 m), as well as 850 m, using a two-scenario model of star formation, where galaxies are either actively forming stars or quenched. Clustering was included by populating dark matter halos from dark matter-only simulations using an abundance matching technique (see Béthermin et al. 2017 for more details). We smoothed the SIDES catalogue to the 5 arcmin resolution of Planck, and reproduced the 500-m excess selection criteria used to generate the PHz catalogue (see Planck Collaboration XXXIX 2016). Upon investigating the 30 brightest peaks in the resulting excess map, we did not find any strong evidence of large-scale clustering or line-of-sight alignments; instead, the selection criteria predominantly picked out individual galaxies bright at 500 m, with only mild clustering. This is likely caused by the small size of the SIDES simulation, preventing it from reproducing the rare objects that make up the PHz catalogue; the 2151 PHz sources were found across 10,700 deg2, meaning that their density is about 0.2 deg-2, and thus a simulation at least as large as would be needed to find comparably rare objects.
Given the limitations of the SIDES simulation, we also perform tests using an analytical formalism as an alternative way to investigate potential line-of-sight alignments. Specifically, we employ the analytic model developed by Negrello et al. (2017), which gives the luminosity function (LF) of clumps (either an isolated galaxy or a physically bound group of galaxies) at 545 GHz based on galaxy evolution models and the infrared LFs of individual galaxies; this model is found to match observations well. Following the simple Monte Carlo simulations done by Negrello et al. (2017), we draw clumps from the LF of their model, and distribute them randomly over 5,000 deg2 of a mock sky — we expect this area to be large enough to generate the rare peaks in the CIB found by Planck, and we note that the 40 protocluster candidates found here were drawn from 1,270 deg2.
We smooth the mock sky with a Gaussian corresponding to the Planck beam of 5 arcmin. From this smoothed map we select peaks with S/N5 to mimic the PHz selection, where the noise is the rms of the map, that is, purely the confusion noise. Finally, from this sample, we select sources with 3 overdensities of individual galaxies brighter than mJy at 545 GHz, corresponding to our selection of protocluster candidates as described in Sect. 5.1. The 500-m excess selection used in Planck Collaboration XXXIX (2016) to eliminate low-redshift sources and other potential contamination is not necessary here because the LF adopted for the test only includes clumps with redshifts between 2 and 4.
From this test we find about 220 simulated protocluster candidates (or line-of-sight alignments), selected in the same way as our protocluster candidate catalogue. This roughly matches the number of candidates projected for the given simulated area. Upon investigating these simulated protocluster candidates, we find that 96 per cent of them contain two or more clumps aligned along the line-of-sight that are not physically connected to one another. Similarly, 81 per cent are found to have more than three clumps aligned along the line-of-sight by chance. The average number of physically unbound clumps along the line-of-sight is about four, while that for random directions is about one.
This suggests that most of the PHz sources, and our protocluster candidates selected from that catalogue, may be line-of-sight combinations of star-forming galaxies at different redshifts, rather than single clumps of physically associated galaxies. This is similar to the conclusions drawn in Negrello et al. (2017) and Gouin et al. (2022), which found that the Planck-derived SFRs for the PHz sources can be explained only by a dominant contribution from background/foreground interlopers, rather than by single haloes with very high SFRs. This conclusion, that there are foreground and background interlopers, is of course not very surprising. The more interesting question is whether there is a dominant star-forming protocluster along these special lines of sight. From the same simulations we find that the ratios of flux densities between the brightest and second brightest clumps along the selected lines of sight are much higher than that in random directions, with the brightest clump being a factor of approximately 2 (3) times brighter than the second brightest clump in 76 (60) per cent of the cases. The average flux density ratio between the brightest and second brightest clumps is about 4.4, in comparison with 1.5 for random directions. These results suggest that while the PHz sources and our protocluster candidates do tend to have line-of-sight interlopers, they still tend to be dominated by a significantly bright clump of galaxies at a single redshift, motivating continued follow-up studies with protocluster science in mind.
To summarize the conclusions of this section, we find that the PHz catalogue consists primarily of angular overdensities of –3 galaxies, with only a very small fraction of bright strong gravitational lenses. Simulations suggest that there are multiple structures along each line of sight, but that they are often dominated by a single structure. In order to determine whether these spatial overdensities contain genuine protoclusters, further spectroscopic follow-up observations are needed. Our set of 40 overdensities provides an optimal list for such studies, and should help us better understand how efficient CMB surveys are at finding the population of star-forming protoclusters around redshifts of 2.
6.2 The spatial distribution of star-formation in protoclusters

Current models of galaxy formation underestimate the SFRs observed in high- protoclusters (Lim et al., 2021), including the PHz source G237 (Polletta et al., 2021). Even though simulations suggest that chance line-of-sight alignments are quite common among PHz sources, if each line-of-sight is at least dominated by one bright clump (as suggested above), then our Herschel-SPIRE observations may still be useful for constraining star-formation in typical protoclusters at . In particular, in Section 4.6 we stacked cutouts of the PHz HELP sources centered on the brightest galaxy detected at 350 m in each source. We expect that these brightest galaxies are part of the dominant clumps, and therefore our stacks may average out the fainter clumps along each line-of-sight.
Currently, statistical samples of protoclusters at higher redshift do not exist, but at Alberts et al. (2021) studied 53 near-IR-selected galaxy clusters observed by Herschel-SPIRE at 250, 350 and 500 m. In Fig. 19 we compare our stacked surface brightness profiles (as well as those obtained from PlanckXXVII for reference) to the stack from Alberts et al. (2021) at 250, 350 and 500 m. Here we have followed Alberts et al. (2021), adopting the same bin sizes and arbitrarily normalizing the profiles to a value of 0.5 at 27.5 arcsec.
Interestingly, we find that the average far-infrared extent of the PHz HELP sources is much larger than that of the average cluster; it is worth noting that angular diameter distances only differ by about 5 per cent between redshift 1 and 2, so the observed differences in size are physical. We know that the average SFR density of the Universe peaked around (e.g., Behroozi et al., 2013; Gruppioni et al., 2013; Madau & Dickinson, 2014; Chiang et al., 2017; Koprowski et al., 2017), and therefore we might expect protoclusters to be more extended in star-formation as well. However, this is speculation, and the exact mechanisms driving any increase in SFR and size would need to be investigated in more detail.
Similarly, it is interesting to note that the PHz HELP sources are more centrally concentrated than the PlanckXXVII sources. Since our stacking procedure relied on centering each image on the brightest 350 m-detected galaxy, the centre of each stack is effectively a point source with a flux density given by the average flux density of the brightest PHz HELP/PlanckXXVII galaxies. We found that across the PHz HELP sources, the brightest central galaxy is relatively brighter than its surroundings when compared with the PlanckXXVII sources. This could come from the bias towards brighter sources in selecting the PlanckXXVII sample, which not only picked brighter central galaxies, but also sources with bright galaxies distributed throughout Planck’s 5 arcmin beam. With a more representative sample of PHz targets, we find that typical sources are, in fact, more likely to contain bright central galaxies relative to the surrounding faint galaxies, as opposed to collections of relatively bright galaxies. This may indicate the presence of brightest cluster galaxies (BCGs) forming in protocluster cores, although further work is needed to test this speculation.
7 Conclusions
We have used archival Herschel-SPIRE data to perform a follow-up study of 187 representative sources from the PHz catalogue that serendipitously fall in the 1270 deg2 of sky covered by the HELP fields. This expands on the Herschel-SPIRE follow up of 228 preferentially-selected PlanckXXVII sources, of which only 83 ended up in the final PHz catalogue. We are thus able to determine statistical properties of the PHz catalogue, whereas the follow-up observations of PlanckXXVII suffered from a bias towards bright, compact sources with regular shapes, making the sample statistically distinct from the PHz catalogue.
In our sample of 187 PHz HELP sources, we find only one (previously-identified) exceptionally bright (peak flux mJy) strong gravitationally-lensed galaxy that is comparable to the 11 Planck GEMS found in PlanckXXVII. This implies that the proportion of exceptionally bright strong gravitationally-lensed galaxies in the PHz has been overestimated by a factor of about 10, and the true fraction of such galaxies in the PHz catalogue is likely to be < 1 per cent.
The other PHz HELP sources consist primarily of red galaxies, most of which are overdense when compared with random locations at 350 and 500 m, demonstrating that typical PHz sources are potential protoclusters. We investigated the average surface brightness profile of our representative sample of PHz sources using a stacking analysis, finding significant extended emission (out to about 5 arcmin) relative to stacks around random galaxies on the sky. We also compared our surface brightness profiles to similar stacks of galaxy clusters selected in the near-IR, and found our sources to be much more extended.
In search of the most promising protocluster candidates, we identify a total of 40 PHz HELP sources that are overdense at 250, 350 or 500 m (21 per cent). For comparison with the PCCS, Greenslade et al. (2018) found 27 protocluster candidates that are overdense out of a total of 354 PCCS sources (8 per cent). The larger sample of significant overdensities in the PHz catalogue (most of which are at 350/500 m) compared with the PCCS supports the usefulness of the PHz catalogue in selecting protocluster targets. Due to the larger number of sources in the PCCS, the density on the sky of overdense protocluster candidates in the PHz catalogue ( deg-2) and the PCCS ( deg-2) is comparable.
Although we find only one exceptionally bright (peak flux mJy) lensed galaxy in the PHz HELP, we have explored the possibility of finding more typical strong gravitationally-lensed galaxies in the remaining sources. Our machine-learning approach identified a total of seven new candidate lensed galaxies among the PHz HELP sources whose SEDs are consistent with being high- gravitational lenses. We also looked for lensed galaxies in the PlanckXXVII sources that may have been fainter than the bright GEMS. The algorithm identified 13 additional candidate lensed galaxies, including the PHz source G224 that is now confirmed by ALMA to be a gravitationally-lensed galaxy.
Our results support the high- nature of the PHz catalogue and confirm that the vast majority of the PHz sources are spatial overdensities of galaxies at 350 and 500 m. According to our results, we expect that the full catalogue of 2151 objects in the PHz contain overdensities of star-forming galaxies and are therefore interesting for follow-up studies in their own right. Our subset of the best 40 protocluster candidates and 713 new strong gravitational lenses provide a tractable and representative sample ideal for follow-up surveys (e.g., with ALMA or JWST) to further elucidate the nature of the PHz catalogue, and significantly bolster the number of known protoclusters and strong gravitational lenses.
Acknowledgements
We acknowledge Marta Frias Castillo for work on an early version of the extended gravitational lens catalogue. We acknowledge the support of the Natural Sciences and Engineering Research Council of Canada (NSERC). SL acknowledges partial support from the Canadian Institute for Theoretical Astrophysics. This work is based on observations obtained with Planck (http://www.esa.int/Planck), an ESA science mission with instruments and contributions directly funded by ESA Member States, NASA and Canada. Herschel is an ESA space observatory with science instruments provided by European-led Principal Investigator consortia and with important participation from NASA. The Herschel Extragalactic Legacy Project, (HELP), is a European Commission Research Executive Agency funded project under the SP1-Cooperation, Collaborative project, Small or medium-scale focused research project, FP7-SPACE-2013-1 scheme, Grant Agreement Number 607254. This paper makes use of the following ALMA data: ADS/JAO.ALMA#2019.1.01155.S. ALMA is a partnership of ESO (representing its member states), NSF (USA) and NINS (Japan), together with NRC (Canada), MOST and ASIAA (Taiwan), and KASI (Republic of Korea), in cooperation with the Republic of Chile. The Joint ALMA Observatory is operated by ESO, AUI/NRAO and NAOJ.
Data availability
All of the data used in this paper is publicly available. The PlanckXXVII SPIRE maps are available through the ESA Herschel Science Archive (http://archives.esac.esa.int/hsa/whsa). The wide-field HELP maps can be downloaded from the Herschel Extragalactic Legacy Project Data site (http://hedam.lam.fr/HELP). Catalogues of the candidate protoclusters and graviationally-lensed galaxies are included in the Appendices below.
References
- ALMA Partnership et al. (2015) ALMA Partnership et al., 2015, ApJ, 808, L4
- Alberts et al. (2021) Alberts S., et al., 2021, MNRAS, 501, 1970
- Baes et al. (2014) Baes M., et al., 2014, A&A, 562, A106
- Behroozi et al. (2013) Behroozi P. S., Wechsler R. H., Conroy C., 2013, ApJ, 770, 57
- Béthermin et al. (2017) Béthermin M., et al., 2017, A&A, 607, A89
- Blain et al. (2002) Blain A. W., Smail I., Ivison R. J., Kneib J. P., Frayer D. T., 2002, Phys. Rep., 369, 111
- Blain et al. (2003) Blain A. W., Barnard V. E., Chapman S. C., 2003, MNRAS, 338, 733
- Cañameras et al. (2015) Cañameras R., et al., 2015, A&A, 581, A105
- Cañameras et al. (2017a) Cañameras R., et al., 2017a, A&A, 600, L3
- Cañameras et al. (2017b) Cañameras R., et al., 2017b, A&A, 604, A117
- Cañameras et al. (2018) Cañameras R., et al., 2018, A&A, 620, A60
- Castignani et al. (2014) Castignani G., Chiaberge M., Celotti A., Norman C., 2014, ApJ, 792, 113
- Chabrier (2003) Chabrier G., 2003, ApJ, 586, L133
- Cheng et al. (2019) Cheng T., et al., 2019, MNRAS, 490, 3840
- Cheng et al. (2020) Cheng T., et al., 2020, MNRAS, 494, 5985
- Chiang et al. (2017) Chiang Y.-K., Overzier R. A., Gebhardt K., Henriques B., 2017, ApJ, 844, L23
- Clements et al. (2014) Clements D. L., et al., 2014, MNRAS, 439, 1193
- Combes et al. (2012) Combes F., et al., 2012, A&A, 538, L4
- Coppin et al. (2006) Coppin K., et al., 2006, MNRAS, 372, 1621
- Dalal & Kochanek (2002) Dalal N., Kochanek C. S., 2002, ApJ, 572, 25
- Draine (2003) Draine B. T., 2003, ARA&A, 41, 241
- Ebbels et al. (1996) Ebbels T. M. D., Le Borgne J. F., Pello R., Ellis R. S., Kneib J. P., Smail I., Sanahuja B., 1996, MNRAS, 281, L75
- Everett et al. (2020) Everett W. B., et al., 2020, arXiv e-prints, p. arXiv:2003.03431
- Flores-Cacho et al. (2016) Flores-Cacho I., et al., 2016, A&A, 585, A54
- Frye et al. (2019) Frye B. L., et al., 2019, ApJ, 871, 51
- Fu et al. (2012) Fu H., et al., 2012, ApJ, 753, 134
- Geach et al. (2015) Geach J. E., et al., 2015, MNRAS, 452, 502
- Gouin et al. (2022) Gouin C., Aghanim N., Dole H., Polletta M., Park C., 2022, arXiv e-prints, p. arXiv:2203.16276
- Granato et al. (2015) Granato G. L., Ragone-Figueroa C., Domínguez-Tenreiro R., Obreja A., Borgani S., De Lucia G., Murante G., 2015, MNRAS, 450, 1320
- Greenslade et al. (2018) Greenslade J., et al., 2018, MNRAS, 476, 3336
- Griffin et al. (2010) Griffin M. J., et al., 2010, A&A, 518, L3
- Gruppioni et al. (2013) Gruppioni C., et al., 2013, MNRAS, 432, 23
- Hansen & Salamon (1990) Hansen L., Salamon P., 1990, Pattern Analysis and Machine Intelligence, IEEE Transactions on, 12, 993
- Harrington et al. (2016) Harrington K. C., et al., 2016, MNRAS, 458, 4383
- Harrison & Coles (2011) Harrison I., Coles P., 2011, MNRAS, 418, L20
- Herranz et al. (2013) Herranz D., et al., 2013, A&A, 549, A31
- Hezaveh et al. (2013) Hezaveh Y. D., et al., 2013, ApJ, 767, 132
- Hezaveh et al. (2016) Hezaveh Y. D., et al., 2016, ApJ, 823, 37
- Hill et al. (2020) Hill R., et al., 2020, MNRAS, 495, 3124
- Jacobs et al. (2019) Jacobs C., et al., 2019, MNRAS, 484, 5330
- Kingma & Ba (2014) Kingma D., Ba J., 2014, International Conference on Learning Representations
- Kneissl et al. (2019) Kneissl R., et al., 2019, A&A, 625, A96
- Koprowski et al. (2017) Koprowski M. P., Dunlop J. S., Michałowski M. J., Coppin K. E. K., Geach J. E., McLure R. J., Scott D., van der Werf P. P., 2017, MNRAS, 471, 4155
- Koyama et al. (2021) Koyama Y., et al., 2021, MNRAS, 503, L1
- Lim et al. (2021) Lim S., Scott D., Babul A., Barnes D. J., Kay S. T., McCarthy I. G., Rennehan D., Vogelsberger M., 2021, MNRAS, 501, 1803
- Liu et al. (2019) Liu R. H., et al., 2019, MNRAS, 489, 1770
- Livermore et al. (2012) Livermore R. C., et al., 2012, MNRAS, 427, 688
- MacKenzie et al. (2017) MacKenzie T. P., et al., 2017, MNRAS, 468, 4006
- Madau & Dickinson (2014) Madau P., Dickinson M., 2014, ARA&A, 52, 415
- Magnelli et al. (2014) Magnelli B., et al., 2014, A&A, 561, A86
- Mao & Schneider (1998) Mao S., Schneider P., 1998, MNRAS, 295, 587
- Martinache et al. (2018) Martinache C., et al., 2018, A&A, 620, A198
- Massardi et al. (2018) Massardi M., et al., 2018, A&A, 610, A53
- Miller et al. (2018) Miller T. B., et al., 2018, Nature, 556, 469
- Nair & Hinton (2010) Nair V., Hinton G., 2010, Proceedings of ICML, 27, 807
- Nayyeri et al. (2016) Nayyeri H., et al., 2016, ApJ, 823, 17
- Negrello et al. (2007) Negrello M., Perrotta F., González-Nuevo J., Silva L., de Zotti G., Granato G. L., Baccigalupi C., Danese L., 2007, MNRAS, 377, 1557
- Negrello et al. (2010) Negrello M., et al., 2010, Science, 330, 800
- Negrello et al. (2017) Negrello M., et al., 2017, MNRAS, 470, 2253
- Nesvadba et al. (2016) Nesvadba N., et al., 2016, A&A, 593, L2
- Opitz & Maclin (1999) Opitz D., Maclin R., 1999, Journal of Artificial Intelligence Research, 11
- Paciga et al. (2009) Paciga G., Scott D., Chapin E. L., 2009, MNRAS, 395, 1153
- Paszke et al. (2019) Paszke A., et al., 2019, arXiv e-prints, p. arXiv:1912.01703
- Pearson et al. (2014) Pearson C., et al., 2014, Experimental Astronomy, 37, 175
- Planck Collaboration I (2014) Planck Collaboration I 2014, A&A, 571, A1
- Planck Collaboration VI (2020) Planck Collaboration VI 2020, A&A, 641, A6
- Planck Collaboration VII (2011) Planck Collaboration VII 2011, A&A, 536, A7
- Planck Collaboration XVI (2011) Planck Collaboration XVI 2011, A&A, 536, A16
- Planck Collaboration XXIII (2011) Planck Collaboration XXIII 2011, A&A, 536, A23
- Planck Collaboration XXVI (2016) Planck Collaboration XXVI 2016, A&A, 594, A26
- Planck Collaboration XXVII (2015) Planck Collaboration XXVII 2015, A&A, 582, A30
- Planck Collaboration XXVIII (2014) Planck Collaboration XXVIII 2014, A&A, 571, A28
- Planck Collaboration XXVIII (2016) Planck Collaboration XXVIII 2016, A&A, 594, A28
- Planck Collaboration XXXIX (2016) Planck Collaboration XXXIX 2016, A&A, 596, A100
- Polletta et al. (2021) Polletta M., et al., 2021, A&A, 654, A121
- Ritondale et al. (2019) Ritondale E., Vegetti S., Despali G., Auger M. W., Koopmans L. V. E., McKean J. P., 2019, MNRAS, 485, 2179
- Rokach (2010) Rokach L., 2010, Artif. Intell. Rev., 33, 1
- Roseboom et al. (2012) Roseboom I. G., et al., 2012, MNRAS, 419, 2758
- Shirley et al. (2021a) Shirley R., et al., 2021a, arXiv e-prints, p. arXiv:2105.05659
- Shirley et al. (2021b) Shirley R., et al., 2021b, MNRAS, 507, 129
- Spilker et al. (2016) Spilker J. S., et al., 2016, ApJ, 826, 112
- Trombetti et al. (2021) Trombetti T., Burigana C., Bonato M., Herranz D., De Zotti G., Negrello M., Galluzzi V., Massardi M., 2021, A&A, 653, A151
- Vegetti et al. (2014) Vegetti S., Koopmans L. V. E., Auger M. W., Treu T., Bolton A. S., 2014, MNRAS, 442, 2017
- Wang et al. (2020) Wang G., et al., 2020, arXiv e-prints, p. arXiv:2010.02909
- Weiß et al. (2013) Weiß A., et al., 2013, ApJ, 767, 88
Appendix A PHz HELP source catalogues
Here we provide source catalogues for the PHz protocluster candidates and candidate gravitationally-lensed galaxies in the HELP fields. The protocluster candidate catalogue (Table 2) consists of the 40 PHz HELP sources that are at least 3 overdense at 250, 350 or 500 m when compared with random locations in the Lockman-SWIRE field. The catalogue provides the source name in the PHz catalogue, the Planck position, the galaxy counts at 250, 350 and 500 m, and the overdensity contrasts at 250, 350 and 500 m (see Section 5.1). We include the galaxy counts because some protocluster candidates have fewer galaxies within a small contour area and others have many galaxies within a large contour area.
The source catalogue of candidate strong gravitational lenses among the HELP sources (Table 3) consists of the eight galaxies that were identified as potential lensed galaxies by the ML process (seven new sources and G160). The catalogue columns give the PHz source in which the galaxy was detected, the position of the candidate lensed galaxy in the source, the flux density at 250, 350 and 500 m, and the ML model score from 0.0 to 1.0.
Source name | Planck RA | Planck Dec | ||||||
---|---|---|---|---|---|---|---|---|
[J2000] | [J2000] | |||||||
PHz G245.1482.47 | 01:21:09.79 | 30:56:46.68 | 27 | 32 | 16 | 0.82 | 1.88 | 4.79 |
PHz G105.9457.19 | 00:14:52.98 | 04:28:16.39 | 38 | 27 | 13 | 2.24 | 2.07 | 4.94 |
PHz G288.3088.95 | 12:50:15.36 | 26:06:43.56 | 16 | 20 | 13 | 1.36 | 2.93 | 9.27 |
PHz G325.5056.97 | 23:23:16.56 | 56:07:12.72 | 22 | 19 | 8 | 1.67 | 2.08 | 4.21 |
PHz G332.1580.25 | 00:28:06.98 | 35:31:10.92 | 19 | 18 | 5 | 1.64 | 2.34 | 2.73 |
PHz G097.4486.77 | 12:57:51.60 | 30:02:11.04 | 10 | 17 | 8 | 0.45 | 2.29 | 5.21 |
PHz G112.5661.51 | 00:31:44.08 | 00:57:40.70 | 12 | 17 | 8 | 0.67 | 2.15 | 4.95 |
PHz G229.1127.02 | 08:56:06.96 | 00:47:18.23 | 14 | 17 | 7 | 1.46 | 2.98 | 5.59 |
PHz G113.7664.06 | 00:35:26.74 | 01:28:16.64 | 19 | 16 | 11 | 0.95 | 1.20 | 5.06 |
PHz G326.2553.62 | 23:04:54.72 | 58:24:59.04 | 11 | 16 | 10 | 1.03 | 2.95 | 8.91 |
PHz G088.8761.67 | 23:49:41.76 | 02:57:00.86 | 14 | 16 | 6 | 3.66 | 6.11 | 9.71 |
PHz G309.4972.52 | 00:40:25.18 | 44:27:57.60 | 16 | 15 | 10 | 1.11 | 1.65 | 6.09 |
PHz G131.1960.58 | 01:07:37.92 | 02:01:46.20 | 13 | 14 | 8 | 0.80 | 1.59 | 4.95 |
PHz G116.7963.75 | 00:40:35.11 | 01:00:45.68 | 9 | 14 | 7 | 0.69 | 2.52 | 6.06 |
PHz G315.6155.34 | 23:52:59.76 | 60:19:39.00 | 15 | 14 | 9 | 1.08 | 1.59 | 5.70 |
PHz G249.6581.40 | 01:23:58.15 | 32:02:08.16 | 10 | 13 | 8 | 1.90 | 4.04 | 11.46 |
PHz G325.4084.13 | 00:40:48.29 | 32:31:50.17 | 12 | 13 | 8 | 1.29 | 2.31 | 7.17 |
PHz G332.0657.45 | 23:07:16.80 | 53:22:54.11 | 14 | 13 | 7 | 1.82 | 2.49 | 6.55 |
PHz G103.7060.58 | 00:14:10.87 | 00:53:43.20 | 12 | 13 | 11 | 0.29 | 0.87 | 5.35 |
PHz G316.8183.76 | 00:44:17.78 | 33:10:35.39 | 8 | 13 | 5 | 0.58 | 2.43 | 4.30 |
PHz G012.0067.27 | 23:09:34.56 | 32:55:09.48 | 14 | 13 | 8 | 1.24 | 1.77 | 5.85 |
PHz G156.5188.24 | 12:47:00.48 | 28:35:18.96 | 8 | 12 | 5 | 1.03 | 3.07 | 5.81 |
PHz G092.2465.65 | 00:02:35.86 | 05:43:47.75 | 12 | 12 | 8 | 0.67 | 1.22 | 4.95 |
PHz G096.7962.76 | 00:04:52.45 | 02:16:40.01 | 8 | 12 | 6 | 0.87 | 2.74 | 6.52 |
PHz G259.7584.40 | 01:09:19.27 | 31:08:12.84 | 16 | 12 | 7 | 1.58 | 1.58 | 5.05 |
PHz G298.5688.58 | 00:51:55.94 | 28:32:51.00 | 15 | 12 | 7 | 1.94 | 2.14 | 6.36 |
PHz G012.9066.24 | 23:04:31.92 | 32:41:45.95 | 10 | 11 | 4 | 1.05 | 2.00 | 3.39 |
PHz G342.1581.32 | 00:25:05.34 | 33:41:56.76 | 12 | 11 | 8 | 1.19 | 1.68 | 6.82 |
PHz G117.6857.72 | 00:40:11.30 | 05:02:13.60 | 9 | 11 | 2 | 1.41 | 2.93 | 1.87 |
PHz G223.7279.87 | 01:36:49.78 | 28:34:01.92 | 7 | 10 | 3 | 2.05 | 4.82 | 6.01 |
PHz G006.0265.14 | 23:01:30.96 | 35:40:04.44 | 10 | 10 | 4 | 1.48 | 2.31 | 4.31 |
PHz G015.1265.76 | 23:01:52.80 | 31:50:25.80 | 7 | 10 | 2 | 0.86 | 2.54 | 1.85 |
PHz G355.1184.41 | 00:30:56.42 | 30:27:54.00 | 14 | 10 | 5 | 1.87 | 1.74 | 4.50 |
PHz G014.0064.56 | 22:56:22.32 | 32:24:00.72 | 9 | 10 | 4 | 1.38 | 2.52 | 4.66 |
PHz G007.5064.16 | 22:56:12.24 | 35:14:09.60 | 8 | 9 | 6 | 1.19 | 2.29 | 7.82 |
PHz G029.6080.99 | 13:31:34.56 | 26:14:52.44 | 9 | 8 | 5 | 2.57 | 3.23 | 9.62 |
PHz G317.5954.94 | 23:43:22.56 | 60:12:41.04 | 8 | 8 | 9 | 0.11 | 0.48 | 5.70 |
PHz G322.0163.40 | 23:56:50.16 | 51:39:36.00 | 10 | 8 | 4 | 1.69 | 1.88 | 4.78 |
PHz G242.2881.81 | 01:24:43.99 | 30:53:49.20 | 7 | 8 | 4 | 1.88 | 3.40 | 7.83 |
PHz G019.3368.20 | 23:12:49.20 | 30:04:16.68 | 5 | 7 | 4 | 0.29 | 1.41 | 4.52 |
aPHz G358.7582.52 | 00:22:31.59 | 31:08:39.84 | 13 | 19 | 7 | 0.40 | 1.74 | 3.05 |
aPHz G025.5877.88 | 13:44:33.84 | 24:57:16.56 | 16 | 19 | 7 | 0.68 | 1.66 | 2.94 |
aPHz G026.0285.28 | 00:30:17.78 | 27:35:48.84 | 10 | 15 | 7 | 0.14 | 1.28 | 3.28 |
aPHz G083.8864.06 | 23:47:03.36 | 06:10:55.85 | 12 | 14 | 6 | 0.85 | 1.88 | 3.95 |
aPHz G010.9778.47 | 00:01:33.29 | 30:52:58.44 | 6 | 14 | 6 | 0.17 | 1.59 | 3.46 |
a,bPHz G095.5061.59 | 00:00:46.47 | 01:26:57.77 | 10 | 13 | 5 | 0.39 | 1.41 | 2.72 |
aPHz G344.0689.34 | 12:53:22.32 | 26:37:55.56 | 8 | 13 | 3 | 0.28 | 1.78 | 1.58 |
aPHz G009.5883.12 | 13:19:07.68 | 24:14:06.00 | 9 | 12 | 2 | 0.25 | 1.22 | 0.49 |
aPHz G110.8963.19 | 00:29:50.64 | 00:49:30.02 | 6 | 11 | 2 | 0.07 | 1.29 | 0.67 |
aPHz G081.5387.74 | 12:58:16.56 | 28:49:00.12 | 11 | 10 | 4 | 0.98 | 1.41 | 2.87 |
aPHz G303.0386.33 | 12:51:27.84 | 23:27:44.64 | 8 | 8 | 2 | 0.83 | 1.44 | 1.45 |
Source name | RA | Dec | ML score | |||
---|---|---|---|---|---|---|
[J2000] | [J2000] | [mJy] | [mJy] | [mJy] | ||
aPHz G160.5756.79 | 02:09:41.19 | 00:15:57.70 | 891 10 | 1165 11 | 857 10 | 1.00 |
PHz G088.8761.67 | 23:49:39.97 | 02:55:54.29 | 58 8 | 92 8 | 109 8 | 0.98 |
PHz G115.6159.35 | 00:36:27.67 | 03:20:51.64 | 106 8 | 142 8 | 106 8 | 0.92 |
PHz G222.7555.98 | 03:25:01.90 | 27:33:08.76 | 79 6 | 107 6 | 109 6 | 0.91 |
PHz G325.8262.47 | 23:44:07.68 | 51:32:24.65 | 60 6 | 105 6 | 99 7 | 0.84 |
PHz G358.7582.52 | 00:22:32.56 | 31:11:36.99 | 71 4 | 112 6 | 91 7 | 0.63 |
PHz G245.1482.47 | 01:20:52.35 | 30:55:41.35 | 46 7 | 78 6 | 87 7 | 0.59 |
PHz G010.7170.28 | 23:24:01.94 | 32:55:30.32 | 85 5 | 116 5 | 93 6 | 0.50 |
Appendix B Candidate lensed galaxies in the PlanckXXVII sources
PlanckXXVII identified 11 new exceptionally bright (peak flux mJy) candidate gravitationally-lensed galaxies among the PlanckXXVII sources, which have all since been confirmed, but there may be less obvious undiscovered lensed galaxies in the PlanckXXVII sources. As with the PHz HELP sources, we use our ML model to identify more dim candidate lensed galaxies. The source catalogue of new candidate PlanckXXVII lensed galaxies (Table 4) provides the Herschel-SPIRE source name, the position of the candidate lensed galaxy, the flux densities at 250, 350 and 500 m, and the ML score. One of the best candidates, G224, has recently been followed up with ALMA and confirmed as a strong gravitational lens system (Fig. 16).
Source name | RA | Dec | ML score | |||
---|---|---|---|---|---|---|
[J2000] | [J2000] | [mJy] | [mJy] | [mJy] | ||
aPLCK_G224.640.7 | 04:33:19.24 | 25:46:32.22 | 154 6 | 215 5 | 199 6 | 1.00 |
PLCK_G003.763.3 | 22:53:38.49 | 37:03:22.31 | 85 6 | 123 5 | 128 6 | 1.00 |
PLCK_G026.674.4 | 13:59:38.09 | 24:23:16.15 | 169 5 | 191 4 | 146 5 | 1.00 |
PLCK_G058.564.6 | 14:44:26.76 | 35:12:35.47 | 220 6 | 227 5 | 180 5 | 0.99 |
PLCK_G016.254.0 | 15:12:17.55 | 12:36:27.95 | 61 6 | 101 5 | 106 5 | 0.99 |
PLCK_G120.451.6 | 13:06:14.00 | 65:25:17.97 | 182 6 | 197 5 | 171 6 | 0.99 |
PLCK_G186.372.7 | 01:56:33.97 | 18:28:36.80 | 73 6 | 113 5 | 105 6 | 0.98 |
PLCK_G319.745.3 | 22:43:26.17 | 67:23:58.05 | 97 5 | 124 5 | 130 6 | 0.96 |
PLCK_G143.669.4 | 12:10:04.73 | 46:06:26.77 | 69 6 | 99 5 | 132 5 | 0.96 |
PLCK_G176.659.0 | 10:37:07.85 | 41:25:32.20 | 185 6 | 195 5 | 148 6 | 0.92 |
PLCK_G256.869.3 | 02:09:29.96 | 40:07:26.51 | 43 5 | 120 5 | 111 5 | 0.63 |
PLCK_G223.361.4 | 10:52:10.68 | 18:52:54.40 | 68 5 | 97 5 | 95 6 | 0.62 |
PLCK_G239.069.8 | 11:35:55.44 | 17:04:50.51 | 91 6 | 118 5 | 103 5 | 0.60 |
bPLCK_G242.340.7 | 04:43:45.39 | 38:59:00.45 | 375 6 | 368 6 | 264 6 | 1.00 |
bPLCK_G180.165.1 | 11:04:27.10 | 38:12:32.99 | 267 8 | 269 7 | 284 7 | 0.92 |
c,dPLCK_G191.362.0 | 10:44:38.47 | 33:51:03.63 | 101 6 | 115 5 | 112 6 | 0.20 |
c,dPLCK_G106.883.3 | 00:43:25.58 | 20:42:00.88 | 161 3 | 162 3 | 129 3 | 0.11 |
cPLCK_G107.636.9 | 16:07:22.53 | 73:47:02.67 | 217 6 | 195 5 | 109 5 | 0.02 |
cPLCK_G160.741.0 | 09:07:27.92 | 56:09:00.57 | 111 5 | 119 5 | 101 5 | 0.01 |
c,dPLCK_G112.445.8 | 14:17:07.61 | 69:32:27.35 | 162 6 | 143 5 | 114 6 | 0.00 |
cPLCK_G280.277.8 | 01:15:01.27 | 38:10:32.77 | 172 6 | 152 5 | 101 6 | 0.00 |
-
•
a New confirmed strong gravitationally-lensed galaxy (see Fig. 16).
-
•
b Bright galaxy located outside the IN region but still within 10 arcmin of the Planck position.
-
•
c Galaxy identified with criteria from Negrello et al. (2010) ( mJy).
-
•
d Galaxy also identified with simple selection criteria (see Section 5.2.6).