Formulating and critically examining the assumptions of global 21-cm signal analyses: How to avoid the false troughs that can appear in single spectrum fits
Abstract
The assumptions inherent to global 21-cm signal analyses are rarely delineated. In this paper, we formulate a general list of suppositions underlying a given claimed detection of the global 21-cm signal. Then, we specify the form of these assumptions for two different analyses: 1) the one performed by the EDGES team showing an absorption trough in brightness temperature that they modeled separately from the sky foreground and 2) a new, so-called Minimum Assumption Analysis (MAA), that makes the most conservative assumptions possible for the signal. We show fits using the EDGES analysis on various beam-weighted foreground simulations from the EDGES latitude with no signal added. Depending on the beam used, these simulations produce large false troughs due to the invalidity of the foreground model to describe the combination of beam chromaticity and the shape of the Galactic plane in the sky, the residuals of which are captured by the ad hoc flattened Gaussian signal model. On the other hand, the MAA provides robust fits by including many spectra at different time bins and allowing any possible 21-cm spectrum to be modeled exactly. We present uncertainty levels and example signal reconstructions found with the MAA for different numbers of time bins. With enough time bins, one can determine the true 21-cm signal with the MAA to times the noise level.
1 Introduction
The highly redshifted 21-cm signal generated by the hyperfine, spin-flip transition of neutral hydrogen tracks the history of the Universe after recombination and before reionization (Pritchard & Loeb, 2012), from the Dark Ages, before there were any compact sources, through Cosmic Dawn, when the first stars were born. This signal comes in two forms, the power spectrum, which has both angular and spectral dependence, and the sky-averaged global signal, which is a single spectrum. While the former is being focused on by experiments like the Hydrogen Epoch of Reionization Array (HERA, DeBoer et al., 2017), the former, which is the focus of this paper, is actively being searched for by experiments such as the Large-Aperture Experiment to Detect the Dark Age (LEDA, Bernardi, 2018), the Shaped Antenna measurement of the background RAdio Spectrum (SARAS, Singh et al., 2018), the Cosmic Twilight Polarimeter (CTP, Nhan et al., 2019), the Radio Experiment for the Analysis of Cosmic Hydrogen (REACH, de Lera Acedo, 2019), and the Experiment to Detect the Global Epoch-of-Reionization Signature (EDGES, Monsalve et al., 2017a, 2018). In particular, in Bowman et al. (2018) the EDGES team reported a trough in the sky-averaged radio spectrum at 78 MHz using an analysis technique that we will examine in detail in this paper. Space-based mission concepts, such as the Dark Ages Polarimeter PathfindER (DAPPER; Burns et al., 2019; Burns, 2020) and its precursor the Dark Ages Radio Explorer (DARE, Burns et al., 2017), are also being developed to measure the global 21-cm signal from the vicinity of the Moon, where experiments do not have to deal with systematic effects such as human-generated Radio Frequency Interference (RFI) or attenuation and emission from the ionosphere or interactions of the antenna beam with the environment.
While all of these experiments must be designed carefully to have the sensitivity to measure the global signal, which is expected to be a few hundred mK deep, this paper will focus on the data analysis techniques necessary to extract the signal from the beam-weighted galactic foreground emission, which, at the relevant frequencies, is generally at a level of a few thousand K (for the magnitude of the beam-weighted foreground seen by EDGES, see Figure 1a of Bowman et al., 2018). The shape of the beam-weighted foreground in the absence of the global signal is not known a priori, so simple subtraction is not an option. Therefore, one must devise a model which is known (or at least reasonably assumed) to encapsulate the possible values of the beam-weighted foreground. This task would be easy if the beam would be achromatic, but such beams do not exist. Real antennas have beams that change with frequency over the wide bandwidth that these experiments need to measure. This beam chromaticity distorts the spectral shapes of the intrinsic foreground as measured by an achromatic beam (and which would be well fit by models such as the ones EDGES uses) in ways particular to the antenna being used. Therefore, to fit the beam-weighted foreground, a model that is particular to the given antenna and experiment location must be created. Such a model can be created via Singular Value Decomposition (SVD) of a training set as it is in the pipeline laid out in Tauscher et al. (2018), Rapetti et al. (2019), and Tauscher et al. (2020). The method presented for making this model is similar in spirit to that defined in Switzer & Liu (2014), except the foreground basis is derived from an a priori training set instead of the data itself. It is also very similar to the method performed by Vedantham et al. (2014), although in that paper, SVD is performed on spectra at different time bins to define modes that span frequencies, while in this paper, different simulations of spectra defined at multiple time bins are used to define modes that span both frequencies and time bins. This difference is crucial because the constraining power of our method comes from the fact that its foreground model encodes correlations between time bins, which is not true of the purely spectral modes arising from the methods of Vedantham et al. (2014). Our method, a new variant of which will be described in Section 4, connects with the idea from Liu et al. (2013) that angular information can effectively supplement spectral information in signal estimation. However, instead of requiring instruments with angular resolution, our method gleans spatial information from time-dependent drift-scan measurements from single antenna instruments with no angular resolution, where the spectra weight the entire spatial sky into a single pixel.
This paper is especially relevant in light of the large volume of literature written by the community in their attempts to explain the trough presented by Bowman et al. (2018), especially its depth. Some have explored the possibility that it could represent a larger radio background than expected (Feng & Holder, 2018; Ewall-Wice et al., 2018, 2019; Fialkov & Barkana, 2019; Mebane et al., 2019), while others have hypothesized that Rutherford-like scattering of dark matter could cause the hydrogen gas to cool faster than adiabatic cooling would imply (Muñoz et al., 2018; Barkana, 2018; Barkana et al., 2018; Fialkov et al., 2018; Loeb & Muñoz, 2018; Berlin et al., 2018). While none of these ideas perfectly describe the detection (see, e.g. Creque-Sarbinowski et al., 2019, for a criticism of dark matter explanations), they illustrate the temptation to use the results of Bowman et al. (2018) to explore possible exotic physics. In addition to these physical explanations, some have explored possible unmodeled systematic effects present in the data (Hills et al., 2018; Bradley et al., 2019; Draine & Miralda-Escudé, 2018; Singh & Subrahmanyan, 2019; Spinelli et al., 2019; Sims & Pober, 2020). In this paper, we layout yet another possible explanation of the EDGES results—that the modeling performed could be mistaking beam cromaticity distortion of the foreground for signal.
In Section 2, we lay out the general presumptions necessary to perform any global 21-cm signal experiment. In Section 3, we generate and fit multiple EDGES-like simulations of beam-weighted foregrounds with the analysis method of Bowman et al. (2018). In Section 4, we propose a new, minimum assumption analysis that allows for any possible 21-cm global signal and demonstrate it on simulated spectra with a signal and beam-weighted foreground. In Section 5, we discuss and compare the results of the two analyses. Finally, we conclude in Section 6.
2 Enumerating assumptions
The likelihood used when doing a foreground-only fit (like the one done to generate the residuals found in Figure 1b of Bowman et al., 2018) is
(1) |
where is the spectrum written as a column vector, the covariance matrix of the data’s noise distribution, and a matrix with the basis vectors used to fit the foreground as columns. The value of that maximizes is
(2) |
and the maximum likelihood reconstruction of the foreground, , is given by , or , where
(3) |
is the matrix that projects vectors into the column space of (i.e. the space of spectra formable by the linear foreground model). The residual, , unfit by this procedure is given by . The data are given by a sum of a foreground term, , a signal term , and random noise .
We can write where is the coefficient vector that best describes given the basis matrix 111If the signal, , was zero and there was no random noise, then would be equal to . However, in the general case where there is a nonzero signal, absorbs some of the signal spectrum. In addition, is subject to small biases caused by random noise, whereas in this formulation is the best fitting coefficient vector of the ideal, noiseless foreground (i.e. the one which would be measured given infinite integration time with a stable instrument). and is the part of that cannot be fit by the foreground model, i.e. the part of that is not in the column space of , which satisfies . This means that and the foreground-only residual is given by
(4) |
Therefore, to solve for the signal plus foreground inaccuracy and noise, which we define as , we must find the general solution of , which is the sum of any particular solution, , and the general solution, , to the homogeneous equation, . We guess that is a particular solution and verify by noting that because . The homogeneous solution is any vector in the null space of , which is equivalent to any vector that remains unchanged when multiplied on the left by . Since is a projection matrix onto the columns of , any vector in the column space of remains unchanged when being multiplied by . Therefore, for any . This means that the signal satisfies
(5) |
for some unknown vector . Equation 5 implies that, due to the filtering out of the foregrounds when fitting, the signal can have any foreground modes added to it without changing the quality of the fit. This shows that with only one spectrum and no more information, even if (i.e. the foreground model is perfect), the 21-cm signal cannot be uniquely determined. Thus, an extra assumption about the form of the signal is required. The full set of assumptions needed to extract the global 21-cm signal with useful, rigorous uncertainties is as follows:
-
1.
Sky-averaged radio data contains a sum of beam-weighted foreground emission and the global signal. This assumes a well-calibrated instrument.
-
2.
The noise of the data follows a known or estimated distribution.222Usually, this distribution is a zero-mean Gaussian specified entirely by its covariance matrix, , which must be known up to a constant of proportionality. The proportionality constant must be sufficiently close to 1 if we also wish to measure goodness-of-fit.
-
3.
The true beam-weighted foreground can be fit with the given foreground model, described by the basis matrix , to well below the noise level of the data. This is equivalent to , where is the number of channels in and is the unmodeled component of the foreground as above, given by .
-
4.
The signal follows a specific form.
Assumption 4 can take many forms, some much stronger and more unjustified than others. In Bowman et al. (2018), the signal is assumed to follow the form of a flattened Gaussian profile. Note that this is an assumption for analyzing the data and cannot be justified by examining the data itself, especially in light of the Bayesian evidence-based analysis of Sims & Pober (2020), which showed that many possible assumed signal models lead to essentially the same Bayesian evidence. Crucially, the errors obtained via fitting a given model of the signal do not account for the unknown likelihood that the true signal can be fit by that model.
The specific flattened Gaussian profile in Bowman et al. (2018) merely represents the value of from Equation 5 that best matches the assumed flattened Gaussian model. Different values of along with different assumed signal models lead to equally valid interpretations of the data (see, e.g. Hills et al., 2018; Bradley et al., 2019; Singh & Subrahmanyan, 2019; Sims & Pober, 2020).
3 EDGES-like analysis
3.1 Assumptions
3.1.1 Assumption 1: calibration
The EDGES team has worked extensively on the calibration of their receiver (Monsalve et al., 2017a, b). Assuming solar, weather, RFI, and other transient events are removed, due to the rigor of their lab measurements and calibration strategy, it is reasonable to assume that, to the mK level, their data consists solely of beam-weighted foreground and global 21-cm signal.
3.1.2 Assumption 2: noise level
Although the covariance distribution of the noise is not known, the residuals presented in Bowman et al. (2018) seem to have a relatively flat magnitude of 20 mK when many smooth modes are removed. So, for our simulations we use where mK and is the identity matrix.
3.1.3 Assumption 3: foreground model
The EDGES analysis involves multiple linear foreground models, but a common model in the literature is a polynomial in multiplied by where is the observed frequency and is a reference frequency,
(6) |
This is equivalent to assuming that is a matrix with the terms as its columns and the foreground coefficient vector is given by . This model is based on the Taylor series approximation of around , which should adequately describe the intrinsic synchrotron foreground in each sky direction (see also Hibbard et al., in preparation).
3.1.4 Assumption 4: signal model
In Bowman et al. (2018), the EDGES team uses a flattened Gaussian signal model of the form
(7) |
where
(8) |
In this section, we adopt the same model.




3.2 Simulations
To illustrate the potential problems with the EDGES analysis in Bowman et al. (2018), we have created simple simulations of the beam-weighted foreground expected to be seen from the Murchison Radio-astronomy Observatory (MRO) where the experiment is located.333 S, E This section will lay out those simulations as well as apply the EDGES analysis to them.
The foreground brightness temperature, , in the simulations presented throughout Sections 3 and 4 is given by
(9) |
where is the antenna beam, is the foreground emission, and and are the spherical coordinate angles.

3.2.1 Antenna beam
The antenna beam used in our simulations is a Gaussian beam with a frequency dependent Full Width at Half Maximum (FWHM), , i.e.
(10) |
The FWHMs we use are parabolas with fixed endpoints at and 444These endpoints are similar to those of the beam of the blade antenna used by EDGES (see extended data figure 4a of Bowman et al., 2018). and varying curvature, . This means, they have the form
(11) |
where is the line connecting the two endpoints and is the parabola with unit curvature and zeros at the endpoint frequencies. and are given by
(12a) | ||||
(12b) |
For illustration purposes, we use values of , , , and . The FWHM curves of the four beams used in our simulations are shown in the upper left panel of Figure 2. To ensure that results are not tainted by any fraction of the beam below the horizon, the beams are normalized such that
(13) |
where represents the half of the sphere above the horizon, i.e. and .
3.2.2 Foreground map
For the simulations used in this paper, we use the 408 MHz map from Haslam et al. (1982), scaled to lower frequencies via multiplication by a power law with spectral index -2.5, i.e.
(14) |
The time dependence indicated in both and is determined by the rotation of the Earth from the EDGES latitude as a function of sidereal time. The Haslam maps, rotated to match the zenith direction at 0, 3, and 6 hr LST (local sidereal time) at S latitude, are shown in the top left, top right, and bottom left panels of Figure 1.
3.2.3 Observation strategy
The simulations were performed by averaging together equally all beam-weighted foregrounds between LST hours 0-6 from the EDGES latitude. As the central bulge of the Milky Way is closest to the zenith between LST hours 17 and 18, this strategy amounts to averaging during low foreground times. This observation strategy leads to an effective foreground given by
(15) |
and shown at 80 MHz in the bottom right panel of Figure 1.
3.2.4 No 21-cm signal added
No 21-cm global signal was included in the simulations. All fits shown in Figure 2 are performed on the beam-weighted foreground alone with noise.
3.2.5 Random noise
To each simulated measurement of the beam-weighted foreground we add the same realization of noise at the 20 mK level (see Section 3.1.2) to control for the effect of noise without ignoring it.
3.3 Likelihood maximization techniques
For foreground-only fits, we maximize the likelihood through the analytical computation of the coefficient vector as in Equation 2. Then, the maximum likelihood foreground reconstruction is simply .
For fits with both the linear foreground model and the flattened Gaussian absorption trough model, we numerically explore the space of the trough parameters to find the maximum likelihood value.555We use the minimize function from scipy.optimize to minimize the negative log-likelihood. At the iteration of the optimization algorithm, the exploration is at . To evaluate the full likelihood at that point, we perform a foreground-only fit, as above, on the modified data spectrum given by . This allows for exploration of only the nonlinear absorption trough parameters instead of including the foreground parameters, which would slow down the analysis. This technique is similar to that used in Rapetti et al. (2019), except instead of exploring the entire posterior distribution of the trough parameters with a Markov Chain Monte Carlo simulation, here we merely wish to find its maximum through gradient ascent.
3.4 EDGES-like analysis results
The beam-weighted foregrounds with noise from the simulations described in Section 3.2 were fit with the model given by Equation 6, which is meant to describe the sky foreground completely. The residuals from these foreground-only fits are shown in the upper right panel of Figure 2. While the beam-weighted foreground from the lowest curvature FWHM beam case is fit to the noise level, it is clear that the residuals grow as the curvature increases, with the largest curvature beam case yielding residuals two times higher than the noise level. These foreground-only residuals correspond to the curves from Equation 4 with .
As in Bowman et al. (2018), we also performed fits that include both the linear foreground model and a flattened Gaussian absorption trough model. The resulting troughs are shown in the lower left panel of Figure 2. As the beam FWHM curvature increases, the size of the fit trough increases up to 800 mK, showing that inaccuracies in the foreground model (i.e. violations of assumption 3) can lead to false troughs appearing in fits. The residuals to these foreground and trough fits are shown in the lower right panel of Figure 2. In all four cases, these residuals are at the 20 mK noise level, showing that flattened Gaussian troughs can effectively complement the smooth foreground model from Equation 6 to fit the foreground down to the noise level.
The simulations presented here show that some frequency dependencies, such as curvature in the FWHM of an antenna beam (which there is some sign of in extended data Figure 4a of Bowman et al., 2018, although only three frequencies are shown), can induce structure in the beam-weighted foreground that cannot be fit out by a order polynomial. Not accounting for this structure can lead to artificial troughs being found when a signal model is fit simultaneously with the beam-weighted foreground. It is important to note that the simulations proposed here use simple beams, fully characterized by the FWHM function , whereas real antenna beams have many independent modes of variation corresponding to the geometry and electrical properties of antenna components. One complication in the specific case of the EDGES beam is the lack of azimuthal symmetry, leading to different E- and H-plane beam patterns.
4 Minimum assumption analysis
4.1 A more robust assumption 4
For a robust analysis, instead of assumption 4 as laid out for the EDGES-like analysis in Section 3.1.4, which is not properly motivated, we can utilize exclusively the fact—not even used in the previous analysis—that, by definition, the global 21-cm signal must be spatially uniform. If the data is a concatenation of spectra taken by the same instrument instead of a single spectrum, i.e.
(16) |
then the spatial uniformity of the signal implies that the signal contribution to each spectrum is identical, i.e. for all . On the other hand, the foregrounds of each spectrum will be different (unless the same sky is overhead in two or more of the spectra).
The analysis with the fewest possible assumptions about the signal would involve assuming nothing about the spectral behavior of the signal, i.e. using a signal model that can fit all values of in where is the number of frequencies. This can be achieved by computing a so-called “expansion matrix” (see Tauscher et al., 2018, for the initial definition) that encodes how the signal appears in the data, i.e. the signal term in the full data, , is . To encode the defining information aforementioned that all spectra have the same signal in them, we use . We term an analysis with this form of the signal the minimum assumption analysis (MAA).
can also be adapted to fit different experimental designs, such as full Stokes measurements (e.g., from CTP and DAPPER) or data for which a different amount of sky is blocked in each spectrum (such as it can be for DAPPER due to the shifting position of the moon).
4.2 Assumption 3: choosing foreground basis
4.2.1 Polynomial bases
Using identical and independent basis sets for each spectrum leads to a degeneracy between the foreground and signal models, causing infinite uncertainties. To see this, suppose that each spectrum has its own polynomial basis, represented by the matrix . In this case, the full foreground basis matrix is
(17) |
Multiplying this basis by a coefficient vector of the form , leads to a foreground vector of the form . Since, in this case, the foreground basis can generate vectors in the column space of , the foreground and signal bases are degenerate and the uncertainties diverge to infinity. This is true even if the basis can exactly fit the foregrounds in each spectrum. Note that this explanation holds no matter what the true beam-weighted foreground is because it only depends on the form of the beam-weighted foreground model.
4.2.2 General choice of basis
We wish to find a basis that satisfies and can accurately fit the beam-weighted foreground expected for each spectrum. We suggest generating a simulated training set of foregrounds of the form
(18) |
where is the spectrum of the simulation and there are total simulations. For a given number of basis vectors , weighted Singular Value Decomposition (SVD) of as in Tauscher et al. (2018) yields a basis satisfying the normalization conditions.666Weighted SVD refers to a decomposition of the form where , , and is a rectangular diagonal matrix with decreasing non-negative elements on the diagonal. The basis matrix is made from the first columns of . This basis will encode expected correlations between the different spectra in the foreground model directly, which is a crucial aspect of the analysis necessary to obtain finite errors (see also Tauscher et al., 2020) when allowing for any signal to be fit as discussed in Section 4.1.
4.3 Maximum likelihood calculations
With a data vector , a signal expansion matrix , a noise covariance matrix , and a foreground basis matrix , the channel covariance of the signal and the corresponding mean are given by
(19a) | ||||
(19b) |
where, under the normalization condition , is the foreground projection matrix described in Section 2, given by . The calculations leading to Equations 19a and 19b are presented in Appendix A. Appendices B and C give the forms of and for the specific cases of total power spectra, where the signal expansion matrix is , and spectra of all 4 Stokes parameters, where the expansion matrix is .777For the sake of simplicity, results using the MAA with polarization are not shown in this paper, but the equations of Appendix C will prove useful in future work.



4.4 Simulation details
To test the MAA, we apply it using a foreground training set that is made identically to that used in Tauscher et al. (2020), which we briefly review here. Like the simulations from Section 3, the foreground map used in this training set was the Haslam map scaled down in frequency using a spatially constant spectral index of -2.5. The pointing of the antenna is set by the EDGES latitude and the sources and beam below the horizon are set to zero as in Section 3. The beams used have FWHMs specified by a distribution of quadratic functions given by where are the order Legendre polynomials and the ’s are normal with means , , and and standard deviations , , and .
The day is split into 100 LST chunks,888In each of these 15 minute chunks, the foreground is smeared as shown in Figure 1. which are then averaged to the number of bins used for the given analysis. For example, in the 5 time bins case, LST chunks 1-20, 21-40, 41-60, 61-80, and 81-100 are used to generate the 5 time bins. If the number of time bins used does not divide 100, then the 100 chunks are reduced to the largest multiple of the number of bins less than 100 before binning.
4.5 MAA results
The results of applying MAA using the training set described in Section 4.4 are shown in Figures 3 and 4. Figure 3 shows the uncertainty levels at each frequency using different numbers of total power spectra in the training set. These uncertainties are very large for small numbers of bins (indeed, they diverge when only one time bin is used), but approach the noise level for large numbers of bins. Note that different training sets will lead to different results and that confidence levels of , , and only correspond to , , and when the training set adequately describes the beam-weighted foreground. If the training set is not adequate, any given percentage confidence interval will be wider than expected based on the -level. Appendix D derives the so-called signal bias statistic, , from Tauscher et al. (2018) in the MAA case when the foreground training set is imperfect. This statistic connects percentage confidence to the -level at which the signal is inside the interval in a root-mean-square sense.
The top panel of Figure 4 shows an example reconstruction found when applying the MAA with 6 time bins to the sum of a beam-weighted foreground curve from the training set and an additional signal component that is Gaussian in frequency. The two intervals show the and confidence levels at each individual frequency, which are 1 and 2 times the square roots of the diagonal elements of , respectively. Note, however, that even though a Gaussian signal is used in this example, as shown in the figure, the method would work equally well, providing the same errors, with any conceivable global 21-cm signal.
The bottom panel of Figure 4 shows the frequency correlation matrix, , which is a normalized form of the covariance matrix, . The correlation matrix is designed to contain only values between 1 and -1, which occur at perfect correlation and anti-correlation, respectively. It is defined through , which can also be written as
(20) |
where has the same diagonal elements as but has no nonzero off-diagonal elements. By construction, all diagonal elements of are equal to 1. It is clear from Figure 4 that all of the frequency channels are positively correlated with each other in this particular case.
To completely utilize the power of the MAA, the full covariance matrix, which would include such correlations, should be used when defining a likelihood function to extend MAA results to constrain any given signal model.
5 Discussion
5.1 EDGES-like analysis
5.1.1 Beam chromaticity distortions
The fits in Section 3 (Figure 2) show that false troughs can appear when fitting a single spectrum. Some may argue that troughs cannot be created by the foreground because the spectral dependence of the foreground does not allow it. However, one needs to keep in mind that what is measured is not the intrinsic spectral structure of the foreground radiation, but rather the spectral structure of the beam-weighted foreground. Since the angular structure of the beam weighting changes from frequency to frequency the foreground component of the measured spectrum is composed of different ratios of each direction’s radiation at each frequency.
This means that a linear model that fits the intrinsic foreground spectrum of each pixel, like the model from Equation 6, will not necessarily fit the beam-weighted foreground. To illustrate this, suppose that every sky direction’s intrinsic foreground spectrum can be fit by a model using a basis matrix , i.e. where the element of is the foreground spectrum at the frequency. If the beam is achromatic, then the beam-weighted foreground is given by
(21a) | ||||
(21b) |
where is the beam pattern at every frequency that satisfies . This means that a linear model that fits for every angle also fits the beam-weighted foreground , i.e. the foreground model residual is . In the case of beam chromaticity, the beam is a diagonal matrix with the diagonal entries being the beam patterns at each frequency, satisfying . In this case, the beam-weighted foreground is
(22a) | ||||
(22b) |
and the residual
(23) |
is not necessarily zero,999The only situation in which is necessarily zero for all beams, , occurs when . But, this implies that projecting into the span of the foreground basis has no effect, which can only happen if the foreground basis is complete. In this case, the signal model will be degenerate with the foreground model and therefore cannot be extracted. leading to a possible failure of the assumption that the foreground model is adequate (Assumption 3). Therefore, while troughs are unlikely to appear in intrinsic foreground spectra, they may well appear when beam-weighted foregrounds are fit with a model of a trough and an intrinsic foreground spectrum model simultaneously.
5.1.2 EDGES beam chromaticity factor
In several of their works (see, e.g., Bowman et al., 2018; Monsalve et al., 2018; Mozdzen et al., 2019), the EDGES team uses what they term the “beam chromaticity factor” (see Equation 14 of Monsalve et al., 2017a), defined at each LST through
(24) |
where the LST dependence comes from how the temperature map is rotated with respect to the beam and and are the assumed forms of the beam and foreground. The goal of this factor is to make the spectrum appear as if it was measured with an achromatic beam (specifically, the beam at ) so that, as discussed in Section 5.1.1, basis vectors that fit the intrinsic foreground spectra can be used to fit the beam-weighted foreground spectrum. If we denote a spectrum measured at a single LST by , then the EDGES beam calibration forms a corrected spectrum, . Neglecting noise and possible receiver errors for the sake of clarity, we can express the measured beam-weighted foreground spectrum through
(25) |
where and are the unknown forms of the true beam and foreground. With these definitions, the corrected spectrum is given by
(26) |
The intention is for the fraction on the second line of Equation 26 to be 1 so that is simply the reference foreground weighted by the reference beam at the reference frequency. However, in order to assume that the fraction is 1, one must assume that and ; but, this is not a reasonable assumption as foreground and beam models are likely imperfect.101010If the reference beam and foreground were equal to the true beam and foreground, then one would know the exact beam-weighted foreground and should simply subtract it from the measured spectrum, leaving only noise and the 21-cm signal. The beam chromaticity factor calibration method is therefore prone to introduce significant biases. On the contrary, our method of deriving modes describing the beam-weighted foreground from a large sample of simulations and observations allows uncertainties in both the beam and foreground to be included in the analysis. However, as will be elaborated on in Section 5.2, it is important to note that our method is contingent on the ability of the simulations and observations to accurately describe the data being analyzed. SVD can only provide eigenmodes in the space spanned by the training set curves.
5.1.3 Toy galaxy model
In Section 3, in addition to the Haslam 408 MHz map, we also tested a toy model of galactic emission described by
(27) |
This is a model that treats the Galactic plane as a FWHM Gaussian in colatitude, , with a maximum given by 300 K at the Galactic center and 200 K at the anticenter and asymptotes to 25 K away from the plane. This map is then scaled by . Due to the fact that the false troughs remain after this change (see e.g. the dashed green line in the lower left panel of Figure 2), we infer that the chromaticity of the quadratic beams used in Section 3 interacts with the galactic plane as seen from the EDGES latitude to generate residuals that are better fit with a flattened Gaussian plus foreground model than the foreground model alone, which could lead to analyses falsely concluding that there are troughs in the sky-averaged radio spectrum.
Since the false troughs found in Section 3 are fits to aspects of the beam-weighted foreground, we urge EDGES to perform the same experiment and analysis from the northern hemisphere where the galaxy behaves very differently in the sky than from the southern hemisphere. Because of the different orientation of the galactic plane, the distortions caused by chromatic beams in our simulations are more easily fit by foreground models like that in Equation 6 at northern latitudes than southern latitudes.
5.1.4 Exploring parameter distributions
The natural endpoint of the EDGES-like analysis is not a maximum likelihood fit like the one shown in Figure 1 of Bowman et al. (2018), on which Figure 2 was based. Instead, one wishes to explore the parameter distribution using a sampling method like Markov Chain Monte Carlo (MCMC) exploration. However, the resulting distribution is only meaningful if the pieces of the model (i.e. foreground and signal models) accurately describe their respective components. In fact, parameter distributions from global 21-cm signal fits to observations are generated by a complex combination of the following four factors: 1) the foreground model parameterization; 2) the bias of the foreground model; 3) the signal model parameterization; 4) the bias of the signal model.
Ideally, factors 2 and 4 are unimportant because the foreground and signal models can fit the true foreground and signal to within the noise level. However, it is impossible to fully verify that this is the case in practice; so, their effects must be considered. Section 3 is not meant to explain the exact form and parameter distribution of the trough observed by EDGES, but instead to show that generic polynomial foreground models like those used by EDGES do not account for beam chromaticity and may lead to untrustable results. Essentially, we are pointing out that factor 2 is likely affecting the EDGES analysis.
In addition to generating bias in fits, residuals from fits with inaccurate foreground models may have led EDGES to choose the unjustified flattened Gaussian signal model, possibly furthering the signal bias of factor 4 (with respect to a more conventional signal model) silently while significantly reducing overall residuals.111111If true, this would be a case of factor 2 affecting factor 3. For robustness reasons, it is important to justify the signal model in a way external to the observations themselves.
Due to the complex interactions between the four factors in generating a posterior distribution, MCMC exploration of distributions is outside the scope of this paper and is left for an extended examination in future work.
5.2 Minimum assumption analysis
5.2.1 Advantages and limitations
While moving the experiment to a different location could provide evidence that the reported trough is a product of an inadequate foreground model, this does not solve the underlying problem—that the foreground model does not account for beam chromaticity.
The MAA proposed in Section 4 avoids this problem by making a basis specific to the given antenna by performing SVD on a training set of simulations made with that antenna’s beam (see Section 4.2.2).
In addition, the MAA allows any possible global signal, avoiding unwanted bias from generating a signal model that interacts with the foreground model bias to produce false results. In fact, under the assumption that the foreground model generated by SVD is adequate, the MAA signal mean and covariance constitute a direct measurement not of a signal model, but of the signal itself. In fact, since all degrees of freedom are retained, a physical model can be fit directly to the constraints implied by and (such as those shown in the bottom panel of Figure 3).
Note that, for simplicity, the cosmic microwave background (CMB) is not included in the training set used in Section 4, which merely uses the Haslam map scaled with a spectral index of -2.5. If it was included (as it should be when analyzing data from a real experiment),121212More precisely, the CMB should be subtracted from the foreground model used to generate the training set so that it can be found by the signal fitting and subtracted after the fact. then the signal mean would include a 2.725 K flat spectrum component which would need to be subtracted out.
Another important note about the MAA is that it is predicated upon the beam-weighted foreground training set provided to it. The results may change if the training set is changed, even if the data being analyzed remain the same. The training set used in Section 4 contained a wide variety of beam FWHMs but only one intrinsic foreground map. A different training set built using many intrinsic foreground maps and one beam might produce errors that differ significantly from the ones presented in Figure 3. However, even as different training sets would lead to different levels of precision, the accuracy of the MAA should remain steady as the training set changes as long as the true beam-weighted foreground is encompassed by the SVD eigenmodes of the training sets.131313Here, we use precision to refer to the size of uncertainties, which is known even when the true 21-cm signal is unknown and accuracy to refer to the size of the difference between the signal reconstruction and the true signal with respect to the size of the uncertainties. One aspect of this effect is that some training sets will require more modes to be included in the foreground model than others, forcing one to degrade precision in order to retain accuracy.
5.2.2 Extension to motion induced dipole
Slosar (2017) pointed out that, much like with the Cosmic Microwave Background (CMB), Doppler shifting of the 21-cm background induces a dipole component that is related to the monopole. In particular, the dipole component, , of the 21-cm signal is related to the monopole spectrum, , through141414Note that this assumes that there is no intrinsic (i.e. non-motion-induced) dipole of the 21-cm signal.
(28) |
where is the magnitude of our velocity with respect to the CMB as a fraction of the speed of light and is the angle between that velocity and the line of sight. Based on CMB observations, . Deshpande (2018) suggested that Equation 28 should be considered an essential qualifying test of any measurement of the global signal. Due to the fact that (by Equation 28), is linear in , it could conceivably be included in the MAA in order for any fit to pass this test by default, essentially extending the minimal signal assumption introduced in Section 4.1. To do so, we note that the sum of the monopole and dipole components is
(29) |
where . To determine how this would appear to a single antenna experiment, we must multiply by the beam and integrate over all angles. Defining as the signature of the 21-cm signal in the spectrum measured at the time period (or, equivalently, pointing angle), it is clear that
(30) |
where .151515Note that will not be known exactly because the beam will not be known exactly. However, since the dipole component will be on the order of 10 mK (see Slosar, 2017) and will be of order unity, the expected imprecision levels in the beam, which should be at the sub-percent level, should not impact the results significantly. This can be written in matrix notation where frequency channels correspond to the elements of vectors. In this way, the signature of the 21-cm signal in the spectrum is
(31) |
where is the diagonal matrix with nonzero elements given by , is the diagonal matrix with the frequencies of the data channels on the diagonal, and is a derivative matrix (see Appendix E for subtleties involved in taking derivatives with a matrix). Now, combining all spectra into one vector leaves where
(32) |
To extend the MAA to include the motion-induced dipole, one must merely plug this into Equations 19a and 19b. The effect of including the motion-induced dipole on the results of the MAA is left for future work.
5.3 Between the two extremes
In past work (Tauscher et al., 2018; Rapetti et al., 2019; Tauscher et al., 2020), we laid out an SVD-based pipeline for extracting the 21-cm global signal from the large beam-weighted foregrounds. That analysis is identical in its foreground basis generation to the MAA from Section 4. However, it differs in that the signal is restricted to a specific model, either a physically motivated one or one created from performing SVD on a signal training set. So, in a sense, the pipeline described in those works is between the two extremes discussed in this paper of strong assumptions (EDGES-like analysis) and robust assumptions (MAA).
6 Conclusions
In this paper, we formulated a list of general assumptions for global 21-cm signal analyses. These include assumptions about the calibration of the instrument (Assumption 1), the distribution of the noise (Assumption 2), the adequacy of the foreground model (Assumption 3), and the form of the signal (Assumption 4). We then contrasted two different specific forms of these assumptions, an EDGES-like analysis and a new, so-called minimum assumption analysis (MAA).
The EDGES-like analysis is performed on single spectra with a polynomial-based foreground model and a flattened Gaussian signal model. We presented fits of simulations using zero signal and the Haslam map (as seen from the EDGES latitude) scaled with a constant spectral index and weighted by four different Gaussian beams with quadratic FWHMs. Two of these fits resulted in large flattened Gaussian troughs near 78 MHz, like the one reported by the EDGES team. These troughs also appeared when the Haslam map was replaced by a toy model of galactic emission that uses a 25 K background temperature (at 408 MHz) and models the galactic plane as a Gaussian in latitude with a peak of 300 K at the galactic center and 200 K at the anticenter, showing that the interaction of the beam with the Galactic plane introduces troughs.
This vulnerability to false troughs is due to the inadequacy of the foreground model, which does not account for beam chromaticity. While the only real solution to this problem is to modify the analysis technique, steps such as moving the experiment to the northern hemisphere, where the galaxy appears very differently, should at least modify (and potentially decrease) any false troughs that are found.
The MAA, on the other hand, is performed on many time-binned spectra (see also Tauscher et al., 2020) instead of one spectrum. Also, instead of assuming a specific model for the signal, it allows for any possible spectrum that is the same across each time bin. In addition, the foreground model accounts for beam chromaticity because the basis vectors are taken from applying Singular Value Decomposition (SVD) to a training set of foregrounds weighted by the beam of the specific antenna being used (Tauscher et al., 2018). Given the beam-weighted foreground training set employed, we found that, under these assumptions, any signal can be measured with uncertainties within an order of magnitude of the noise level.
While the MAA could be considered the most robust, conservative form of the assumptions specified in Section 2, if there are well motivated theoretical models for the signal, the pipeline we laid out in Tauscher et al. (2018), Rapetti et al. (2019), and Tauscher et al. (2020) can be applied. That method uses training sets for both the beam-weighted foreground and the global signal to create models for each of them. Ultimately, experimenters can decide if a theoretical model of the signal (not just based on residuals from the data with respect to the intrinsic foreground model, as in the case of the flattened Gaussian) is rigorous enough to be explored using our full pipeline, for stronger constraints. However, given the current theoretical uncertainties, if the MAA is possible for the selected beam-weighted foreground model, we recommend to start with this analysis before selecting a physical model of the signal because it allows for any signal—even those which are unexpected—and may guide the model selection procedure.
Appendix A Minimum assumption analysis general calculation
In this appendix, we will compute the minimum assumption maximum likelihood for signal reconstruction and the uncertainties on that reconstruction when the signal expansion matrix is (i.e. the signal appears in the data as ), the noise covariance is , and the foreground basis matrix is .
The signal assumption may be implemented by assuming for an invertible161616Note that invertible matrices are square, meaning that there are as many parameters in as there are frequencies in . basis matrix .171717 could be assumed to be the identity matrix, but the analytical calculations can be completed more easily if the signal basis matrix is normalized. The model for the full data is
(A1a) | ||||
(A1b) |
This can also be written , where , , and is the foreground basis that applies to all of the spectra simultaneously, as opposed to the single spectrum basis laid out in Section 2. The likelihood function is therefore
(A2) |
The maximum likelihood value of , , and its covariance, , are given by
(A3) |
These are easiest to calculate when and are normalized through and . Under these conditions, the covariance of the signal parameters is
(A4) |
where is the projection matrix described in Section 2, which is given by under the current normalization conditions. This means that the channel covariance of the signal distribution is , which can be written as
(A5) |
The mean of the signal parameters is
(A6) |
and the channel mean of the signal distribution is , or
(A7) |
To satisfy our normalization condition, the signal basis matrix must satisfy . Multiplying on the left by and on the right by , we find that . Therefore, Equation A5 becomes
(A8) |
This means that if any vector in the column space of , i.e. any vector of the form , can be represented by the foreground vectors, then the uncertainties are infinite. Plugging Equation A8 into Equation A7, we find that the signal channel mean is
(A9) |
Appendix B Minimum assumption analysis with only total power spectra
As mentioned in Section 4.1, when there are spectra concatenated together in the data curve being fit, i.e. , the signal expansion matrix is given by . Assuming the different spectra have independent noise, we can write the covariance in this case as
(B1) |
We split into different components through . This means that the normalization condition of the foreground basis matrix, , becomes , the foreground projection matrix is
(B2) |
the signal channel covariance matrix is
(B3) |
and the signal channel mean is
(B4) |
Appendix C Minimum assumption analysis with polarization
In this appendix, we will layout the form of the MAA when the data vector is the concatenation of spectra containing all four Stokes parameters at time bins, i.e.
(C1) |
The signal appears only in the Stokes I spectra, so the signal expansion matrix is
(C2) |
Since the four Stokes parameters of the time bin have roughly the same noise covariance,181818If , , and have magnitudes of order , then the fractional difference between the noise levels on , , , and is of order . See Tauscher et al. (2020) for exact calculations of Stokes parameters noise levels. For the purposes here, it is best just to assume all four Stokes parameters have the same noise level. , we can write the full noise covariance matrix as
(C3) |
In this case, the foreground basis returned by the SVD algorithm is given by
(C4) |
which implies that the normalization convention for the foreground basis matrix is , and the foreground projection matrix is
(C5) |
Very similarly to Equation B3, the signal channel covariance given by Equation 19a is
(C6) |
The signal channel mean is
(C7) |
For computation purposes, we define the total signal noise covariance, , the weighted total power basis, , the weighted total power data, , and the overlap vector of the data, , through
(C8) |
With these definitions, and are given by
(C9) |
These equations also work when calculating the channel mean and covariance with total power spectra only as long as the sum over in the definition of includes only .
Appendix D Signal bias statistic under minimum assumption analysis
In this appendix, we examine the effect of biases in the foreground model, i.e. the effect of non-zero on the uncertainties of minimum assumption analyses. To do this, we write the data as , where is the foreground bias (i.e. unmodeled foreground) that satisfies , is the modeled foreground, is the true signal, and is the Gaussian noise realization with covariance . With these definitions, the signal channel mean is given by
(D1) |
Since and , this means
(D2a) | ||||
(D2b) | ||||
(D2c) |
This means that the signal channel bias is
(D3) |
The so-called signal bias statistic, , defined through , which yields the squared number of sigma at which the signal is measured in an averaged sense across the band, satisfies
(D4) |
where is the number of frequency channels. Assuming that follows a normal distribution (with a singular covariance matrix), the expectation value and variance of are
(D5a) | ||||
(D5b) |
where , , and . Assuming that approximately follows a normal distribution, this means that, at a confidence level of ,
(D6) |
This equation connects the RMS number of sigma, , to confidence levels, , in the general case where the foreground model is imperfect.
Appendix E Taking derivative of finite-dimensional signal with matrix
In Section 5.2.2, the frequency derivative of a signal is written as . There are some subtleties to this representation. First, the discrete nature of means that any derivative taken will be a finite difference approximation. Second, note that the derivative of a vector with elements is only strictly defined on the midpoints. The derivative matrix defined in this way is and in the case of equally spaced frequency channels with resolution is given by
(E1) |
To define in the same space as , the derivative must be interpolated in some way (i.e. must be made square in some way). A natural interpolation scheme is to take the derivative at the first (last) endpoint to be the derivative at the midpoint of the first (last) pair of elements and to take the derivative at each interior point to be the average of the derivatives at the two adjacent midpoints. Defined in this way, the derivative matrix from Equation E1 is modified to
(E2a) | ||||
(E2b) |
This is the derivative matrix referred to in Section 5.2.2.
References
- Barkana (2018) Barkana, R. 2018, Nature, 555, 71, doi: 10.1038/nature25791
- Barkana et al. (2018) Barkana, R., Outmezguine, N. J., Redigol, D., & Volansky, T. 2018, Phys. Rev. D, 98, 103005, doi: 10.1103/PhysRevD.98.103005
- Berlin et al. (2018) Berlin, A., Hooper, D., Krnjaic, G., & McDermott, S. D. 2018, Physical Review Letters, 121, 011102, doi: 10.1103/PhysRevLett.121.011102
- Bernardi (2018) Bernardi, G. 2018, in IAU Symposium, Vol. 333, Peering towards Cosmic Dawn, ed. V. Jelić & T. van der Hulst, 98–101, doi: 10.1017/S1743921318000674
- Bowman et al. (2018) Bowman, J. D., Rogers, A. E. E., Monsalve, R. A., Mozdzen, T. J., & Mahesh, N. 2018, Nature, 555, 67, doi: 10.1038/nature25792
- Bradley et al. (2019) Bradley, R. F., Tauscher, K., Rapetti, D., & Burns, J. O. 2019, ApJ, 874, 153, doi: 10.3847/1538-4357/ab0d8b
- Burns et al. (2019) Burns, J., Bale, S., Bassett, N., et al. 2019, BAAS, 51, 6. https://arxiv.org/abs/1902.06147
- Burns (2020) Burns, J. O. 2020, arXiv e-prints, arXiv:2003.06881. https://arxiv.org/abs/2003.06881
- Burns et al. (2017) Burns, J. O., Bradley, R., Tauscher, K., et al. 2017, ApJ, 844, 33, doi: 10.3847/1538-4357/aa77f4
- Creque-Sarbinowski et al. (2019) Creque-Sarbinowski, C., Ji, L., Kovetz, E. D., & Kamionkowski, M. 2019, Phys. Rev. D, 100, 023528, doi: 10.1103/PhysRevD.100.023528
- de Lera Acedo (2019) de Lera Acedo, E. 2019, International Conference on Electromagnetics in Advanced Applications, 0626, doi: 10.1109/ICEAA.2019.8879199
- DeBoer et al. (2017) DeBoer, D. R., Parsons, A. R., Aguirre, J. E., et al. 2017, PASP, 129, 045001, doi: 10.1088/1538-3873/129/974/045001
- Deshpande (2018) Deshpande, A. A. 2018, ApJ, 866, L7, doi: 10.3847/2041-8213/aae318
- Draine & Miralda-Escudé (2018) Draine, B. T., & Miralda-Escudé, J. 2018, ApJ, 858, L10, doi: 10.3847/2041-8213/aac08a
- Ewall-Wice et al. (2018) Ewall-Wice, A., Chang, T.-C., Lazio, J., et al. 2018, ApJ, 868, 63, doi: 10.3847/1538-4357/aae51d
- Ewall-Wice et al. (2019) Ewall-Wice, A., Chang, T.-C., & Lazio, T. J. W. 2019, ArXiv e-prints, arXiv:1903.06788. https://arxiv.org/abs/1903.06788
- Feng & Holder (2018) Feng, C., & Holder, G. 2018, ApJ, 858, L17, doi: 10.3847/2041-8213/aac0fe
- Fialkov & Barkana (2019) Fialkov, A., & Barkana, R. 2019, MNRAS, 486, 1763, doi: 10.1093/mnras/stz873
- Fialkov et al. (2018) Fialkov, A., Barkana, R., & Cohen, A. 2018, Physical Review Letters, 121, 011101, doi: 10.1103/PhysRevLett.121.011101
- Haslam et al. (1982) Haslam, C. G. T., Salter, C. J., Stoffel, H., & Wilson, W. E. 1982, A&AS, 47, 1
- Hills et al. (2018) Hills, R., Kulkarni, G., Meerburg, P. D., & Puchwein, E. 2018, Nature, 564, E32, doi: 10.1038/s41586-018-0796-5
- Liu et al. (2013) Liu, A., Pritchard, J. R., Tegmark, M., & Loeb, A. 2013, Phys. Rev. D, 87, 043002, doi: 10.1103/PhysRevD.87.043002
- Loeb & Muñoz (2018) Loeb, A., & Muñoz, J. B. 2018, Physics Online Journal, 11, 69, doi: 10.1103/Physics.11.69
- Mebane et al. (2019) Mebane, R. H., Mirocha, J., & Furlanetto, S. R. 2019, ArXiv e-prints, arXiv:1910.10171. https://arxiv.org/abs/1910.10171
- Monsalve et al. (2018) Monsalve, R. A., Greig, B., Bowman, J. D., et al. 2018, ApJ, 863, 11, doi: 10.3847/1538-4357/aace54
- Monsalve et al. (2017a) Monsalve, R. A., Rogers, A. E. E., Bowman, J. D., & Mozdzen, T. J. 2017a, ApJ, 847, 64, doi: 10.3847/1538-4357/aa88d1
- Monsalve et al. (2017b) —. 2017b, ApJ, 835, 49, doi: 10.3847/1538-4357/835/1/49
- Mozdzen et al. (2019) Mozdzen, T. J., Mahesh, N., Monsalve, R. A., Rogers, A. E. E., & Bowman, J. D. 2019, MNRAS, 483, 4411, doi: 10.1093/mnras/sty3410
- Muñoz et al. (2018) Muñoz, J. B., Dvorkin, C., & Loeb, A. 2018, Phys. Rev. Lett., 121, 121301, doi: 10.1103/PhysRevLett.121.121301
- Nhan et al. (2019) Nhan, B. D., Bordenave, D. D., Bradley, R. F., et al. 2019, ApJ, 883, 126, doi: 10.3847/1538-4357/ab391b
- Pritchard & Loeb (2012) Pritchard, J. R., & Loeb, A. 2012, Reports on Progress in Physics, 75, 086901, doi: 10.1088/0034-4885/75/8/086901
- Rapetti et al. (2019) Rapetti, D., Tauscher, K., Mirocha, J., & Burns, J. O. 2019, arXiv e-prints, arXiv:1912.02205. https://arxiv.org/abs/1912.02205
- Sims & Pober (2020) Sims, P. H., & Pober, J. C. 2020, MNRAS, 492, 22, doi: 10.1093/mnras/stz3388
- Singh & Subrahmanyan (2019) Singh, S., & Subrahmanyan, R. 2019, ApJ, 880, 26, doi: 10.3847/1538-4357/ab2879
- Singh et al. (2018) Singh, S., Subrahmanyan, R., Udaya Shankar, N., et al. 2018, ApJ, 858, 54, doi: 10.3847/1538-4357/aabae1
- Slosar (2017) Slosar, A. 2017, Phys. Rev. Lett., 118, 151301, doi: 10.1103/PhysRevLett.118.151301
- Spinelli et al. (2019) Spinelli, M., Bernardi, G., & Santos, M. G. 2019, MNRAS, 489, 4007, doi: 10.1093/mnras/stz2425
- Switzer & Liu (2014) Switzer, E. R., & Liu, A. 2014, ApJ, 793, 102, doi: 10.1088/0004-637X/793/2/102
- Tauscher et al. (2020) Tauscher, K., Rapetti, D., & Burns, J. O. 2020, arXiv e-prints, arXiv:2003.05452. https://arxiv.org/abs/2003.05452
- Tauscher et al. (2018) Tauscher, K., Rapetti, D., Burns, J. O., & Switzer, E. 2018, ApJ, 853, 187, doi: 10.3847/1538-4357/aaa41f
- Vedantham et al. (2014) Vedantham, H. K., Koopmans, L. V. E., de Bruyn, A. G., et al. 2014, MNRAS, 437, 1056, doi: 10.1093/mnras/stt1878