Bayesian noise wave calibration for 21-cm global experiments

I. L. V. Roque,¹ W. J. Handley^1,2 and N. Razavi-Ghods¹
¹Astrophysics Group, Cavendish Laboratory, JJ Thomson Avenue, Cambridge, CB3 0HE, UK
²Kavli Institute for Cosmology, Madingley Road, Cambridge, CB3 0HA, UK
E-mail: [email protected]

(Accepted XXX. Received YYY; in original form ZZZ)

Abstract

Detection of millikelvin-level signals from the ‘Cosmic Dawn’ requires an unprecedented level of sensitivity and systematic calibration. We report the theory behind a novel calibration algorithm developed from the formalism introduced by the EDGES collaboration for use in 21-cm experiments. Improvements over previous approaches are provided through the incorporation of a Bayesian framework and machine learning techniques such as the use of Bayesian evidence to determine the level of frequency variation of calibration parameters that is supported by the data, the consideration of correlation between calibration parameters when determining their values and the use of a conjugate-prior based approach that results in a fast algorithm for application in the field. In self-consistency tests using empirical data models of varying complexity, our methodology is used to calibrate a 50 $\Omega$ ambient-temperature load. The RMS error between the calibration solution and the measured temperature of the load is 8 mK, well within the $1\sigma$ noise level. Whilst the methods described here are more applicable to global 21-cm experiments, they can easily be adapted and applied to other applications, including telescopes such as HERA and the SKA.

keywords:

instrumentation: interferometers – methods: statistical – dark ages, reionization, first stars

^†^†pubyear: 2020^†^†pagerange: Bayesian noise wave calibration for 21-cm global experiments–Bayesian noise wave calibration for 21-cm global experiments

1 Introduction

For nearly a century, scientists have been using radio-frequency instruments to advance the study of astronomy and complement information from the visual regime of the electromagnetic spectrum (Pritchard & Loeb, 2012). As we begin to take measurements of the early universe, these instruments must continue to evolve to support observations. Unexplored cosmic information from the Epoch of Reionisation and Cosmic Dawn redshifted into the radio spectrum could provide constraints on fundamental physics such as primordial black holes, galaxy formation, and universal curvature as discussed in Furlanetto et al. (2009). A unique probe of phenomena from the early cosmos is the hydrogen that inundates the intergalactic medium (IGM). Heating and cooling of the IGM associated with hydrogen’s absorption and emission of 21-cm photons produce a dynamic brightness temperature relative to the cosmic microwave background temperature, tracing the evolution of surrounding structure during the Cosmic Dawn. The brightness temperature of this 21-cm photon signal can be described by

	$\displaystyle T_{21}(z)\approx$	$\displaystyle\ 0.023\mathrm{K}\ \times$		(1)
		$\displaystyle x_{\text{H\,{i}}}(z)\left[\left(\frac{0.15}{\Omega_{\mathrm{m}}}\right)\left(\frac{1+z}{10}\right)\right]^{\frac{1}{2}}\left(\frac{\Omega_{\mathrm{b}}h}{0.02}\right)\left[1-\frac{T_{\mathrm{R}}(z)}{T_{\mathrm{S}}(z)}\right],$		(1)

which is heavily dependent on environmental factors of the early universe such as $x_{\text{H\,{i}}}$ , the fraction of neutral hydrogen, $\Omega_{\mathrm{m}}$ and $\Omega_{\mathrm{b}}$ , the matter and baryon densities with respect to the universal critical density for a flat universe and Hubble’s constant. Here, the $0.023$ is a constant from atomic-line physics. $T_{\mathrm{R}}$ is the background radiation temperature and $T_{\mathrm{S}}$ is known as the ‘21-cm spin temperature’, which is related to the kinetic temperature of neutral hydrogen gas in the IGM (Zaldarriaga et al., 2004; Field, 1958). This cosmic hydrogen signature measurable in the spectral sky has been redshifted to wavelengths under 200 MHz through the expansion of the universe as discussed in Pritchard & Loeb (2012).

There has been a recent surge in the field of 21-cm cosmology following the reported detection of an absorption feature consistent with a Cosmic Dawn signature. This was reported by the Experiment to Detect the Global EoR Signature (EDGES) in early 2018 from measurements of a sky-averaged radio spectrum (Bowman et al., 2018). The signal, centred at 78 MHz with a width corresponding to a period between 180 million and 270 million years after the Big Bang, matches the theoretical position in frequency, but its depth of $\sim 0.5$ K is a factor of two greater than the largest predictions from theoretical models (Cohen et al., 2017). This discrepancy would suggest that the temperature difference between the IGM and the cosmic microwave background was much larger than previously thought and would require new physics to explain, such as dark matter-baryon interactions (Barkana, 2018) or excess radio backgrounds (Ewall-Wice et al., 2020).

Another possible explanation for this discrepancy is that the measured signal is not cosmological but of systematic origin. This may be the case in EDGES due to some of the methodology used, such as a potentially unphysical foreground removal method and calibration of the receiver in a separate environment from the data acquisition (Hills et al., 2018; Razavi-Ghods, 2018). In this paper, we present a novel calibration algorithm that improves on the work of the EDGES team (Rogers & Bowman, 2012) through the utilisation of a Bayesian framework to promote efficient use of the data to remove systematics. Using conjugate priors and machine learning techniques, our pipeline can be applied in the field with the collection of data with additional capabilities for optimising individual noise wave parameters and incorporating correlations between them.

This paper is organised as follows. In section 2 we review the methodology behind calibration using noise waves as well as present a Bayesian framework that provides greater flexibility in radiometer calibration. Section 3 describes the process of using mock data sets modelled after empirical measurements of reflection coefficients with the incorporation of a realistic noise model to evaluate our pipeline.

2 Methods

In this section, we detail the methodology behind radiometer calibration using noise wave parameters. An overview of global signal measurement are outlined in section 2.1. Section 2.2 summarises the basic procedure with some mathematical improvements while section 2.3 describes our Bayesian framework and its associated advantages.

2.1 Measuring the global signal

The noise necessitating calibration emerges during measurement-taking. In an averaged or global experiment, the sky temperature $T_{\mathrm{sky}}(\Omega,\nu,t)$ is a function of the direction $\Omega$ , frequency $\nu$ and time $t$ . This can be broken down into two primary components: the global 21-cm signal $T_{21}$ and astrophysical foregrounds $T_{\mathrm{f}}$

T_{\mathrm{sky}}(\Omega,\nu,t)=T_{21}(\nu)+T_{\mathrm{f}}(\Omega,\nu,t).

(2)

The antenna measures the sky signal convolved with the normalised antenna directivity $B$ . The process of measurement introduces the random noise term $N_{\mathrm{data}}$ .

D(\nu,t)=\int T_{\mathrm{sky}}(\Omega,\nu,t)B(\Omega,\nu)\mathrm{d}\Omega+N_{\mathrm{data}}.

(3)

Our 21-cm signature can thus be represented as

T_{21}\approx D(\nu,t)-\int T_{\mathrm{f}}(\Omega,\nu,t)B(\Omega,\nu)\mathrm{d}\Omega-N_{\mathrm{data}}.

(4)

Here, the integral is assessed through foreground and beam modelling techniques such as those discussed in Anstey et al. (2020) while modelling of $N_{\mathrm{data}}$ from the statistical properties of $D(\nu,t)$ is accomplished by a calibration algorithm as articulated in this paper and outlined in Fig. 1. Having a fully Bayesian framework when modelling the beam, the sky and the systematics has major advantages for global 21-cm experiments such as REACH (de Lera Acedo et al., 2021), as it provides the greatest flexibility in being able to model all effects and jointly fit for them.

Refer to caption — Figure 1: Diagram showing the evolution of the 21-cm signal hampered by astrophysical foregrounds, convolvution with the antenna beam and the emergence of measurement noise before calibration to retrieve the antenna temperature.

2.2 Calibration methodology

The standard calibration strategy follows the method introduced by Dicke to characterise systematic features in radio frequency instruments (Dicke, 1946) and is widely used in experiments such as EDGES (Monsalve et al., 2017) and LOFAR (Bilous, A. V. et al., 2016) to evaluate the spectral index of the sky’s diffuse radio background (Rogers & Bowman, 2012). This technique involves measurements of two internal reference standards; a load and a noise source, in addition to a series of external calibration sources attached to the receiver input in lieu of the antenna. These include an ambient-temperature ‘cold’ load, a ‘hot’ load heated to $\sim 400$ K, an open-ended cable and a shorted cable. A block diagram showing this arrangement is presented in Fig. 2.

When calibrating the receiver, reflection coefficients are taken of the calibration source connected to the receiver input ( $\Gamma_{\mathrm{cal}}$ ) and of the receiver itself ( $\Gamma_{\mathrm{rec}}$ ) as well as power spectral densities (PSDs) of the input ( $P_{\mathrm{cal}}$ ), the internal reference load ( $P_{\mathrm{L}}$ ) and the internal reference noise source ( $P_{\mathrm{NS}}$ ) (Monsalve et al., 2017). These measurements are used to calculate a preliminary ‘uncalibrated’ antenna temperature $T_{\mathrm{cal}}^{*}$

T_{\mathrm{cal}}^{*}=T_{\mathrm{NS}}\left(\frac{P_{\mathrm{cal}}-P_{\mathrm{L}}}{P_{\mathrm{NS}}-P_{\mathrm{L}}}\right)+T_{\mathrm{L}},

(5)

where $T_{\mathrm{L}}$ and $T_{\mathrm{NS}}$ are assumptions for the noise temperature of the internal reference load and excess noise temperature of the internal noise source above ambient, respectively. This initial calculation is used to calibrate out any time-dependent system gain that emerges from a series of filters, amplifiers and cables, as well as the analogue-to-digital converter within the experimental apparatus (Monsalve et al., 2017). Each PSD measurement can be expressed in terms of specific response contributions as detailed in Bowman et al. (2018)

$\displaystyle P_{\mathrm{cal}}=g_{\mathrm{sys}}\Bigg{[}$	$\displaystyle T_{\mathrm{cal}}\left(1-\|\Gamma_{\mathrm{cal}}\|^{2}\right)\left\|\frac{\sqrt{1-\|\Gamma_{\mathrm{rec}}\|^{2}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right\|^{2}$	(6)
$\displaystyle+$	$\displaystyle T_{\mathrm{unc}}\|\Gamma_{\mathrm{cal}}\|^{2}\left\|\frac{\sqrt{1-\|\Gamma_{\mathrm{rec}}\|^{2}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right\|^{2}$
$\displaystyle+$	$\displaystyle T_{\mathrm{cos}}\operatorname{Re}\left(\Gamma_{\mathrm{cal}}\frac{\sqrt{1-\|\Gamma_{\mathrm{rec}}\|^{2}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right)$
$\displaystyle+$	$\displaystyle T_{\mathrm{sin}}\operatorname{Im}\left(\Gamma_{\mathrm{cal}}\frac{\sqrt{1-\|\Gamma_{\mathrm{rec}}\|^{2}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right)+T_{0}\Bigg{]}.$

Here, $g_{\mathrm{sys}}$ is the system gain referenced to the receiver input and $T_{\mathrm{cal}}$ is our calibrated input temperature. $T_{\mathrm{unc}}$ , $T_{\mathrm{cos}}$ , and $T_{\mathrm{sin}}$ are the ‘noise wave parameters’ introduced by Meys (1978) to calibrate the instrument. $T_{\mathrm{unc}}$ represents the portion of noise reflected by the antenna that is uncorrelated with the output noise of the low noise amplifier (LNA). $T_{\mathrm{cos}}$ and $T_{\mathrm{sin}}$ are the portions of reflected noise correlated with noise from the LNA (Monsalve et al., 2017; Rogers & Bowman, 2012). In the EDGES experiment, these calibration quantities are modelled using seven-term polynomials in frequency.

The PSDs for the internal reference load and noise source can similarly be expressed as in equation 6. However, since the reflection coefficients of the internal references are typically less than 0.005, they are taken to be zero in order to simplify the equations

P_{\mathrm{L}}=g_{\mathrm{sys}}^{*}[T_{\mathrm{L}}\left(1-|\Gamma_{\mathrm{rec}}|^{2}\right)+T_{0}^{*}],

(7)

P_{\mathrm{NS}}=g_{\mathrm{sys}}^{*}[\left(T_{\mathrm{L}}+T_{\mathrm{NS}}\right)\left(1-|\Gamma_{\mathrm{rec}}|^{2}\right)+T_{0}^{*}].

(8)

As shown in Fig. 2, the internal references may be on a separate reference plane than the receiver input, resulting in a system gain $g_{\mathrm{sys}}^{*}$ and a noise offset $T_{0}^{*}$ different from those defined in equation 6. This effect is taken into account by two additional scale and offset parameters, $C_{1}$ and $C_{2}$ , introduced by EDGES (Monsalve et al., 2017).

Since $C_{1}$ and $C_{2}$ also correct for first-order assumptions in the noise temperatures of the internal reference load and noise source, we have chosen to absorb these terms into $T_{\mathrm{L}}$ and $T_{\mathrm{NS}}$ . This adjustment allows all calibration parameters, $T_{\mathrm{unc}}$ , $T_{\mathrm{cos}}$ , $T_{\mathrm{sin}}$ , and an ‘effective’ $T_{\mathrm{NS}}$ and $T_{\mathrm{L}}$ , to be solved for in units of kelvin, facilitating a joint solution of parameters. Expanding equation 5 using equations 6, 7 and 8 yields a linear identity providing a relationship between the uncalibrated input temperature and a final calibrated temperature of any device connected to the receiver input

$\displaystyle T_{\mathrm{NS}}\left(\frac{P_{\mathrm{cal}}-P_{\mathrm{L}}}{P_{\mathrm{NS}}-P_{\mathrm{L}}}\right)+T_{\mathrm{L}}$	$\displaystyle=T_{\mathrm{cal}}\left[\frac{1-\|\Gamma_{\mathrm{cal}}\|^{2}}{\|1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}\|^{2}}\right]$	(9)
	$\displaystyle+T_{\mathrm{unc}}\left[\frac{\|\Gamma_{\mathrm{cal}}\|^{2}}{\|1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}\|^{2}}\right]$
	$\displaystyle+T_{\mathrm{cos}}\left[\frac{\operatorname{Re}\left(\frac{\Gamma_{\mathrm{cal}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right)}{\sqrt{1-\|\Gamma_{\mathrm{rec}}\|^{2}}}\right]$
	$\displaystyle+T_{\mathrm{sin}}\left[\frac{\operatorname{Im}\left(\frac{\Gamma_{\mathrm{cal}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right)}{\sqrt{1-\|\Gamma_{\mathrm{rec}}\|^{2}}}\right],$

where all parameters are frequency-dependent. This is not explicitly shown for simplicity of notation. For estimation of the noise wave parameters, $T_{\mathrm{cal}}$ , $\Gamma_{\mathrm{cal}}$ and $\Gamma_{\mathrm{rec}}$ are measured along with the PSDs while $g_{\mathrm{sys}}$ and $T_{\mathrm{0}}$ are calibrated out. The cold and hot loads exhibit the main temperature references needed for $T_{\mathrm{L}}$ and $T_{\mathrm{NS}}$ . The cables facilitate the derivation of the noise wave parameters describing spectral ripples from the noise properties of the receiver by acting as antennas looking at an isotropic sky with temperatures equal to the cables’ physical temperatures (Rogers & Bowman, 2012).

2.3 Bayesian calibration framework

One possible source of systematics in the calibration methodology used by EDGES comes from measuring the response of the four external calibrators along with the receiver reflection coefficient in a laboratory away from where the instrument is actually deployed (Bowman et al., 2018). This process, especially with regards to how calibration parameters change, can be non-trivial. Furthermore, the fixed polynomial order used by EDGES for all noise wave parameters may underfit or overfit individual parameters and thus ‘fit out’ data useful for determining systematics or potentially even the 21-cm signal itself if a joint fit is performed.

In response to these issues, we have developed a calibration pipeline that improves on the strategies presented in section 2.2. We introduce a novel Bayesian methodology using conjugate priors for a dynamic application of our algorithm to be run with data collection regardless of system complexity. Also included are model selection methods using machine learning techniques for the optimisation of individual noise wave parameters to combat overfitting and underfitting, the results of which converge with that of a least-squares approach when wide priors are adopted. Our pipeline easily incorporates many more calibrators than the standard four shown in Fig. 2 to increase constraints on noise wave parameters while identifying possible correlations between them. A schematic of the improved calibration method is shown in Fig. 3.

In order to simplify our calibration approach, we first define the following terms

X_{\mathrm{unc}}=-\frac{|\Gamma_{\mathrm{cal}}|^{2}}{1-|\Gamma_{\mathrm{cal}}|^{2}},

(10)

X_{\mathrm{L}}=\frac{|1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}|^{2}}{1-|\Gamma_{\mathrm{cal}}|^{2}},

(11)

X_{\mathrm{cos}}=-\operatorname{Re}\left(\frac{\Gamma_{\mathrm{cal}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\times\frac{X_{\mathrm{L}}}{\sqrt{1-|\Gamma_{\mathrm{rec}}|^{2}}}\right),

(12)

X_{\mathrm{sin}}=-\operatorname{Im}\left(\frac{\Gamma_{\mathrm{cal}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\times\frac{X_{\mathrm{L}}}{\sqrt{1-|\Gamma_{\mathrm{rec}}|^{2}}}\right),

(13)

X_{\mathrm{NS}}=\left(\frac{P_{\mathrm{cal}}-P_{\mathrm{L}}}{P_{\mathrm{NS}}-P_{\mathrm{L}}}\right)X_{\mathrm{L}},

(14)

which represent initial calibration measurements on $D$ in the frequency domain for the characterisation of $N_{\mathrm{data}}$ from equation 3 via our noise wave parameters. It is expected that calibration-related deviations of $D$ in the time domain are sufficiently curtailed through practical strategies such as temperature control of the receiver environment. Incorporating these into equation 9, with some rearrangement, then gives the equation

X_{\mathrm{unc}}T_{\mathrm{unc}}+X_{\mathrm{cos}}T_{\mathrm{cos}}+X_{\mathrm{sin}}T_{\mathrm{sin}}+X_{\mathrm{NS}}T_{\mathrm{NS}}+X_{\mathrm{L}}T_{\mathrm{L}}=T_{\mathrm{cal}},

(15)

at each frequency. Here, there are no squared or higher-order terms, allowing us to take advantage of the linear form by grouping the data and noise wave parameters into separate matrices

	X	$\displaystyle\equiv\begin{pmatrix}X_{\mathrm{unc}}\quad X_{\mathrm{cos}}\quad X_{\mathrm{sin}}\quad X_{\mathrm{NS}}\quad X_{\mathrm{L}}\end{pmatrix},$
	$\displaystyle\boldsymbol{\Theta}$	$\displaystyle\equiv\begin{pmatrix}T_{\mathrm{unc}}\quad T_{\mathrm{cos}}\quad T_{\mathrm{sin}}\quad T_{\mathrm{NS}}\quad T_{\mathrm{L}}\end{pmatrix}^{\top}.$		(16)

In these equations, all of our data; the reflection coefficient measurements and power spectral densities, are grouped in an X vector which forms a matrix where one of the axes is frequency. The calibration parameters as frequency-dependent polynomials of varying degree are collected into a $\boldsymbol{\boldsymbol{\Theta}}$ vector which serves as our model describing $N_{\mathrm{data}}$ . Applying these definitions condenses the calibration equation into

\textbf{{T}}_{\mathrm{cal}}=\textbf{{X}}\boldsymbol{\boldsymbol{\Theta}}+\sigma,

(17)

where $\textbf{{T}}_{\mathrm{cal}}$ is a vector over frequency and $\sigma$ is a noise vector representing our error. Since EDGES assumes that each power spectral density measurement is frequency independent, we have assumed that $\sigma$ is a multivariate normal distribution. This assumption is implicit in the EDGES analysis in which they use a least-squares minimisation approach for solving model parameters.

For calibration of the receiver, we are concerned with the construction of predictive models of the noise wave parameters, $\boldsymbol{\Theta}$ , in the context of some dataset, T. We can use $\boldsymbol{\Theta}$ to calculate the probability of observing the data given a specific set of noise wave parameters:

	$\displaystyle p\big{(}\textbf{{T}}\>\big{\|}\>\boldsymbol{\Theta},\sigma^{2}\big{)}$	$\displaystyle=$		(18)
		$\displaystyle\frac{1}{2\pi\sigma^{2}}^{N/2}\exp{\Bigg{\{}-\frac{1}{2\sigma^{2}}\left(\textbf{{T}}-\textbf{{X}}\boldsymbol{\Theta}\right)^{\top}\left(\textbf{{T}}-\textbf{{X}}\boldsymbol{\Theta}\right)\Bigg{\}}},$		(18)

where, $N$ is the number of measurements. This distribution on the data is the likelihood. For the purposes of calibration, T may be $\textbf{{T}}_{\mathrm{cal}}$ measurements or alternatively, $\textbf{{T}}_{\mathrm{sky}}$ for prediction of a sky signal. Our model must also specify a prior distribution, quantifying our initial assumptions on the values and spread of our noise wave parameters which we specify as a multivariate normal inverse gamma distribution:

	$\displaystyle p\left(\boldsymbol{\Theta},\sigma^{2}\right)\propto$	$\displaystyle\left(\frac{1}{\sigma^{2}}\right)^{a+1+\left(d/2\right)}\times$		(19)
		$\displaystyle\exp\left[-\frac{1}{\sigma^{2}}\{b+\frac{1}{2}\left(\boldsymbol{\Theta}-\boldsymbol{\mu}_{\boldsymbol{\Theta}}\right)^{\top}\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}\left(\boldsymbol{\Theta}-\boldsymbol{\mu}_{\boldsymbol{\Theta}}\right)\}\right],$		(19)

which is proportional up to an integration constant. Here, $a$ and $b$ , which are greater than zero, along with $\textbf{{V}}_{\boldsymbol{\Theta}}$ and $\boldsymbol{\mu}_{\boldsymbol{\Theta}}$ represent our prior knowledge on the noise wave parameters. $d$ is the length of our vector $\boldsymbol{\Theta}$ .

Equation 18 is determined by a set of values for our model $\boldsymbol{\Theta}$ . We can marginalise out the dependence on $\boldsymbol{\Theta}$ and our noise term by integrating over the prior distribution by both $\boldsymbol{\Theta}$ and $\sigma^{2}$ at once. Following the steps in Banerjee (2009)

	$\displaystyle p\left(\textbf{{T}}_{\mathrm{cal}}\right)$	$\displaystyle=\int p\left(\textbf{{T}}_{\mathrm{cal}}\>\big{\|}\>\boldsymbol{\Theta},\sigma^{2}\right)p\left(\boldsymbol{\Theta},\sigma^{2}\right)\mathrm{d}\boldsymbol{\Theta}\mathrm{d}\sigma^{2}$		(20)
		$\displaystyle=\frac{b^{a}\Gamma\left(a^{}\right)\sqrt{\|\textbf{{V}}^{}\|}}{{b^{}}^{a^{}}\Gamma\left(a\right)\sqrt{\|\textbf{{V}}_{\boldsymbol{\Theta}}\|}}(2\pi)^{-N/2},$		(20)

where

$\displaystyle a^{*}$	$\displaystyle=a+\frac{N}{2},$	(21)
$\displaystyle b^{*}$	$\displaystyle=b+\frac{1}{2}[\boldsymbol{\mu}_{\boldsymbol{\Theta}}^{\top}\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}\boldsymbol{\mu}_{\boldsymbol{\Theta}}+\textbf{{T}}_{\mathrm{cal}}^{\top}\textbf{{T}}_{\mathrm{cal}}-\boldsymbol{\mu}^{\top}\textbf{{V}}^{-1}\boldsymbol{\mu}^{*}],$
$\displaystyle\boldsymbol{\mu}^{*}$	$\displaystyle=\left(\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}+\textbf{{X}}^{\top}\textbf{{X}}\right)^{-1}\left(\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}\boldsymbol{\mu}_{\boldsymbol{\Theta}}+\textbf{{X}}^{\top}\textbf{{T}}_{\mathrm{cal}}\right),$
$\displaystyle\textbf{{V}}^{*}$	$\displaystyle=\left(\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}+\textbf{{X}}^{\top}\textbf{{X}}\right)^{-1},$

and $\Gamma\left(x\right)$ represents the Gamma function, not to be confused with the notation for our reflection coefficients. Equation 20 is the evidence, which gives the probability of observing the data $\textbf{{T}}_{\mathrm{cal}}$ given our model.¹¹1It is in fact better to use the equivalent more numerically stable expression $b^{*}=b+\boldsymbol{q}^{\top}\boldsymbol{q}+\boldsymbol{q}^{\top}\textbf{{X}}\textbf{{V}}_{\boldsymbol{\Theta}}\textbf{{X}}^{\top}\boldsymbol{q}$ , where $\boldsymbol{q}=\textbf{{T}}_{\mathrm{cal}}-\textbf{{X}}\boldsymbol{\mu}^{*}$ to avoid cancellation of large terms.

With the prior distribution specified, we use Bayes’ equation to invert the conditioning of the likelihood and find the posterior using the likelihood, prior and evidence:

p\left(\boldsymbol{\Theta},\sigma^{2}\>\big{|}\>\textbf{{T}}_{\mathrm{cal}}\right)=\frac{p\left(\textbf{{T}}_{\mathrm{cal}}\>\big{|}\>\boldsymbol{\Theta},\sigma^{2}\right)p\left(\boldsymbol{\Theta},\sigma^{2}\right)}{p\left(\textbf{{T}}_{\mathrm{cal}}\right)}.

(22)

Similarly from Banerjee (2009), this can be written as

	$\displaystyle p\Bigl{(}\boldsymbol{\Theta},\sigma^{2}\>\big{\|}$	$\displaystyle\textbf{{T}}_{\mathrm{cal}}\Bigl{)}\propto\left(\frac{1}{\sigma^{2}}\right)^{a^{*}+\frac{d}{2}+1}\times$		(23)
		$\displaystyle\exp{\Bigg{\{}-\frac{1}{\sigma^{2}}\Bigg{[}b^{}+\frac{1}{2}\left(\boldsymbol{\Theta}-\boldsymbol{\mu}^{}\right)^{\top}\textbf{{V}}^{-1}\left(\boldsymbol{\Theta}-\boldsymbol{\mu}^{}\right)\Bigg{]}\Bigg{\}}}.$		(23)

The posterior distribution represents the uncertainty of our parameters after analysis, reflecting the increase in information (Nagel, 2017). We highlight the difference between the ‘likelihood-only’ least-squares approach versus the Bayesian approach with the former being a special case of the latter with very wide priors demonstrable when $\textbf{{V}}_{\boldsymbol{\Theta}}\rightarrow\infty\Rightarrow\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}\rightarrow 0$ , and $\boldsymbol{\mu}^{*}$ becomes $\boldsymbol{\Theta}$ . The transition from ‘non-starred’ variables to ‘starred’ variables represents our ‘Bayesian update’ of the prior to the posterior noise wave parameters in light of the calibration data $\textbf{{T}}_{\mathrm{cal}}$ .

As we can see, the posterior distribution is in the same probability distribution family as equation 19, making our prior a conjugate prior on the likelihood distribution. The use of conjugate priors gives a closed-form solution for the posterior distribution through updates of the prior hyperparameters via the likelihood function (Banerjee, 2009; Orloff & Bloom, 2013). The resulting numerical computation is many orders of magnitude faster than MCMC methods relying on full numerical sampling and permits an in-place calculation in the same environment as the data acquisition. This becomes particularly useful for the speed of the algorithm as frequency dependence is introduced in which the computations would not be manageable without conjugate gradients.

To allow for a smooth frequency dependency, we promote each of our noise wave parameters in section 2.3 to a vector of polynomial coefficients

T_{\mathrm{i}}=\begin{pmatrix}T_{\mathrm{i}}^{[0]},&T_{\mathrm{i}}^{[1]},&T_{\mathrm{i}}^{[2]},&...,&T_{\mathrm{i}}^{[n]}\end{pmatrix},

(24)

where $i$ is our noise wave parameter label; $i\in\{\mathrm{unc,\ cos,\ sin,\ NS,\ L}\}$ , modelled using $n+1$ polynomial coefficients. Likewise

\textbf{{X}}_{i}=\begin{pmatrix}\textbf{{X}}_{i},&\textbf{{X}}_{i}\left(\frac{\nu}{\nu_{0}}\right),&\textbf{{X}}_{i}{\left(\frac{\nu}{\nu_{0}}\right)}^{2},&...,&\textbf{{X}}_{i}{\left(\frac{\nu}{\nu_{0}}\right)}^{n}\end{pmatrix},

(25)

where $\nu$ is a vector of input frequencies which are raised to powers up to $n$ . For a vector of $n$ ’s attributed to our calibration parameters, under this notation multiplication in equation 17 is element-wise and equation 20 is effectively $p\left(\textbf{{T}}_{\mathrm{cal}}|\textbf{{n}}\right)$ . Assuming a uniform prior on n, inverting Bayes’ theorem gives $p\left(\textbf{{n}}|\textbf{{T}}_{\mathrm{cal}}\right)$ for use in model comparison in which the relative probabilities of models can be evaluated in light of the data and priors. Occam’s razor advises whether the extra complexity of a model is needed to describe the data (Trotta, 2008), permitting optimisation of the polynomial orders for individual noise wave parameters as detailed in section 3.3. By taking a random sampling of the resulting posterior, we characterise the noise wave parameters as multivariate distributions depicted in contour plots which exhibit a peak value accompanied by $1\sigma$ and $2\sigma$ variance as well as correlation between parameters inferred from a covariance matrix.

Following characterisation of the receiver, we next apply the $\textbf{{T}}_{\mathrm{cal}}$ from our calibration to a set of raw antenna data $\hat{\textbf{{X}}}$ for prediction of our sky signal, $\textbf{{T}}_{\mathrm{sky}}$ , from equation 3. The predictions for the data follow from the posterior predictive distribution

p\left(\textbf{{T}}_{\mathrm{sky}}\>\big{|}\>\textbf{{T}}_{\mathrm{cal}}\right)=\int p\left(\textbf{{T}}_{\mathrm{sky}}\>\big{|}\>\boldsymbol{\Theta},\sigma^{2}\right)p\left(\boldsymbol{\Theta},\sigma^{2}\>\big{|}\>\textbf{{T}}_{\mathrm{cal}}\right)\mathrm{d}\boldsymbol{\Theta}\mathrm{d}\sigma^{2}.

(26)

The first probability in the integral is the likelihood for our antenna measurement $\textbf{{T}}_{\mathrm{sky}}$ and the second is our posterior from equation 23. Following the steps in Banerjee (2009), this can be shown to be a multivariate Student’s t-distribution written as:

	$\displaystyle p\Big{(}$	$\displaystyle\textbf{{T}}_{\mathrm{sky}}\>\big{\|}\>\textbf{{T}}_{\mathrm{cal}}\Big{)}=\frac{\Gamma\left(a^{}+\frac{d}{2}\right)}{\Gamma\left(a^{}\right)\pi^{\frac{d}{2}}\|2b^{}\left(I+\hat{\textbf{{X}}}\textbf{{V}}^{}\hat{\textbf{{X}}}^{\top}\right)\|^{\frac{1}{2}}}$		(27)
		$\displaystyle\times\left[1+\frac{\left(\textbf{{T}}_{\mathrm{sky}}-\hat{\textbf{{X}}}\boldsymbol{\mu}^{}\right)^{\top}\left(I+\hat{\textbf{{X}}}\textbf{{V}}^{}\hat{\textbf{{X}}}^{\top}\right)^{-1}\left(\textbf{{T}}_{\mathrm{sky}}-\hat{\textbf{{X}}}\boldsymbol{\mu}^{}\right)}{2b^{}}\right]^{-\left(a^{*}+\frac{d}{2}\right)},$		(27)

where $I$ is the $N\times N$ identity matrix and $a^{*}$ , $b^{*}$ , $\boldsymbol{\mu}^{*}$ and $\textbf{{V}}^{*}$ are defined in equation 21. This new distribution on $\textbf{{T}}_{\mathrm{sky}}$ corresponds to a set of points with error bars and represents the calibrated sky temperature as the output of the receiver.

3 Empirical modelling and simulations

To verify the performance of our pipeline and highlight features of the algorithm, we evaluate the results of self-consistency checks using empirical models of data based on measurements taken in the laboratory. To make this data as realistic as possible, we used actual measurements of the reflection coefficients of many types of calibrators (see table 1) to generated power spectral densities using equations 6, 7 and 8 given a set of realistic model noise wave parameters along with some assumptions about the noise, which are described in section 3.4. The impedance of the calibrators which were measured with a vector network analyser (VNA) and used in our pipeline are shown on a Smith chart in Fig. 4

Table 1: Table of calibrators used in the creation of our empirical data models for analysis. Calibrators are added in pairs in the order below when increasing the number of calibration sources used by our algorithm.

Calibrator	Temperature
Cold load ( $50\ \Omega$ )	298 K
Hot load ( $50\ \Omega$ )	373 K
Gore cable $+5\ \Omega$	298 K
Gore cable $+500\ \Omega$	298 K
Gore cable $+31\ \Omega$	298 K
Gore cable $+81\ \Omega$	298 K
$25\ \Omega$ resistor	298 K
$100\ \Omega$ resistor	298 K

We start by demonstrating the importance of correlation between noise wave parameters when determining their values to provide a better calibration solution for the reduction of systematic features in the data such as reflections (section 3.1). We then show the increased constraints on these noise wave parameters attributed to the inclusion of more calibrators than the standard number of four (section 3.2). Following this, we illustrate the effectiveness of model selection for the optimisation of individual noise wave parameters to prevent the loss of information resulting from overfitting or underfitting of the data (section 3.3). Finally, these features are incorporated into a calibration solution applied to a $50\ \Omega$ load (section 3.4).

3.1 Correlation between noise wave parameters

In this section, we show the first major feature of our Bayesian pipeline; the consideration of correlation between noise wave parameters when deriving their values. This is best demonstrated when noise is introduced in an idealised way as to retain a form matching the Gaussian form of our mathematical model. To do this, empirical models of power spectral densities are calculated from equations 6, 7 and 8 using measurements of $\Gamma_{\mathrm{rec}}$ , $\Gamma_{\mathrm{cal}}$ and $T_{\mathrm{cal}}$ for the cold and hot loads, as well as a set of realistic noise wave parameters. Gaussian noise of one unit variation is then added to the $T_{\mathrm{cal}}$ measurements after the calculation to conserve its Gaussian form. This data is submitted to our algorithm and the resulting posterior distributions for coefficients of the polynomial noise wave parameters are compared to the initial values.

Such posterior distributions can be seen in Fig. 5 showing the results of models using only the cold load (grey posterior), only the hot load (red posterior) and using both loads in tandem (blue posterior). For these calculations we chose a set of model noise wave parameters as constants across the frequency band;

	$\displaystyle T_{\mathrm{unc}}=250\ \mathrm{K}$
	$\displaystyle T_{\mathrm{cos}}=190\ \mathrm{K}$
	$\displaystyle T_{\mathrm{sin}}=90\ \mathrm{K}$
	$\displaystyle T_{\mathrm{NS}}=1200\ \mathrm{K}$
	$\displaystyle T_{\mathrm{L}}=298\ \mathrm{K}$

In Fig. 5, a strong correlation between the $T_{\mathrm{L}}$ and $T_{\mathrm{NS}}$ is evident as the hot-load posterior is highly skewed as expected from equations 11 and 14. The resulting intersection of posteriors from the individual loads facilitate the derivation of noise wave parameters as the dual-load posterior is found within the region of posterior overlap crossing with the values of the model shown in the inset of Fig. 5. Retrieval of the noise wave parameter values using correlations between them found in the data demonstrate the relevance of this information which is not taken into account in previous calibration techniques.

3.2 Constraints with additional calibrators

A nice feature of our pipeline is the ability to include as many calibrators as required to constrain the calibration parameters. For analysis, six more calibrators are introduced in pairs following the order presented in table 1. We include data generated from measurements of multiple resistors terminating a high quality 25 m cable made by GORE^®. Data for these calibrators is once again generated using fixed terms and Gaussian noise of one unit variation added to $T_{\mathrm{cal}}$ as discussed above. Fig. 6 shows the results of models using four, six, and eight calibrators.

As shown, the inclusion of more calibrators increases the constraint on the resulting noise wave parameters. However, we note that after the inclusion of four calibrators, the relative additional constraint decreases with each additional calibrator and thus the use of more than eight calibrators would be unnecessary. The values of noise wave parameters used to generate the data as indicated by the cross hairs in Fig. 6 all fall within $1\sigma$ of our pipeline’s resulting posterior averages for models using all eight calibrators.

3.3 Optimisation of individual noise wave parameters

The final highlight of our Bayesian pipeline is a the use of machine learning techniques to optimise individual noise wave parameters. This is advantageous as a blanket set of order-seven polynomials applied to all noise wave parameters, such as done in the EDGES experiment, may underfit or overfit individual parameters and misidentify systematics or information about the signal being measured.

The optimisation procedure compares the evidences (equation 20) of different models to determine the vector of noise wave parameter polynomial coefficients n that best describes the data as briefly mentioned at the end of section 2.3. Since the model favoured by the data will have the highest evidence, we use a steepest descent procedure to compare models in ‘n-space’ and determine the direction of the gradient in ‘evidence-space’. After multiple iterations, this brings us to the model with the maximal evidence. Since n consists of five numbers corresponding to the number of polynomial coefficients for each of the five noise wave parameters, models are generated by individually increasing each index of n by 1. We expect the evidence to follow an ‘Occam’s cliff,’ in which the evidence sharply increases preceding the optimal n with a slow fall off following the maximum.

To demonstrate this, data is generated using measurements from all eight calibrators of table 1 and noise wave parameters as second-order polynomials

	$\displaystyle T_{\mathrm{unc}}=x^{2}-3x+250\ \mathrm{K}$
	$\displaystyle T_{\mathrm{cos}}=2x^{2}+190\ \mathrm{K}$
	$\displaystyle T_{\mathrm{sin}}=3x^{2}+8x+90\ \mathrm{K}$
	$\displaystyle T_{\mathrm{NS}}=4x^{2}+5x+1200\ \mathrm{K}$
	$\displaystyle T_{\mathrm{L}}=5x^{2}+10x+298\ \mathrm{K}$

where $x$ is our normalised frequency. Gaussian noise of one unit variation is applied to the calibrator input temperatures as before. The evidences of various models are plotted in Fig. 7 in which an Occam’s cliff can be seen peaking at polynomial order two. As expected from the plot, the steepest descent algorithm finds that noise wave parameters modelled as second-order polynomials best describe the data.

3.4 Application with realistic noise

To demonstrate the robustness of our pipeline, we conducted self-consistency checks using empirically modelled data with a more complicated noise model. This data was generated using reflection coefficients of eight calibrators and the receiver measured in the laboratory. These reflection coefficients were then smoothed using a cubic smoothing spline (Weinert, 2009) in order to maintain their approximate shape over frequency. The same second-order noise wave parameters detailed in section 3.3 are used with the reflection coefficients to generate our model power spectral densities. Following this, we added of order 1% Gaussian noise independently to the smoothed $\Gamma_{\mathrm{rec}}$ and $\Gamma_{\mathrm{cal}}$ as well as $P_{\mathrm{cal}}$ to more accurately represent the instrument noise from measurement equipment such as vector network analysers. No noise was added to the calibrator input temperatures. This results in a model that does not match the Gaussian form of our mathematical model as in the previous sections and thus does not demonstrate the features of our pipeline as explicitly, but is more representative of data set expected from measurements in the field. Data for the receiver and the cold load generated using this noise model are shown in Fig. 8.

Using data generated for all eight calibrators with our realistic noise model, the calibration algorithm selects optimal polynomial orders matching those of the model noise wave parameters whose values fall within within $1\sigma$ of the posterior peak values as shown in Fig. 9. For these higher order tests, we use fgivenx plots which condense noise wave parameter posteriors into samples that can be compared to the model parameter values instead of comparing each individual coefficient (Handley, 2018).

When this calibration model is used to calibrate an ambient-temperature $50\ \Omega$ load, the RMS error between the calibrated temperature and the measured temperature is 8 mK, well within the $1\sigma$ noise level (bottom right panel of Fig. 9). This level of accuracy is comparable to the 26 mK noise floor estimated for the EDGES pipeline in 2016 (Monsalve et al., 2017).

By individually adjusting each component of noise arising in our realistic noise model, we may determine what kind of noise our calibration algorithm is most sensitive to, as well as calculate the maximum amount of noise permissible for a specified level of systematic feature reduction. These topics are intended to be explored in a future work.

4 Conclusions

Here we presented the development of a calibration methodology based on the procedure used by EDGES but with key improvements to characterise reflections arising at connections within the receiver. Our pipeline utilises the Dicke switching technique and a Bayesian framework in order to individually optimise calibration parameters while identifying correlations between them using a dynamic algorithm to be applied in the same environment as the data acquisition. In a comprehensive investigation, we have evaluated our algorithm’s interpretation of empirical models of data which have been generated from known noise wave parameters and a realistic noise model. The solution, applied to an ambient-temperature $50\ \Omega$ load, produces a calibrated temperature with an RMS residual temperature of 8 mK. Future work for the pipeline regards application of real calibrator data, optimisation of noise wave parameter coefficients through marginalisation techniques and incorporation into an end-to-end simulation based on an entire experimental apparatus to better understand error tolerances. The flexibility of the algorithm attributed to our novel approach allows its application to any experiment relying on similar forms of calibration such as REACH (de Lera Acedo et al., 2021), were we intend to use this for in-the-field and on-the-fly radiometer calibration.

Acknowledgements

ILVR would like to thank S. M. Masur for her helpful comments. WJH was supported by a Gonville & Caius Research Fellowship, STFC grant number ST/T001054/1 and a Royal Society University Research Fellowship. We would like to thank the The Cambridge-Africa ALBORADA Research Fund for their support. We would also like to thank the Kavli Foundation for their support of the REACH experiment.

Data Availability

The data underlying this article will be shared on reasonable request to the corresponding author.

References

de Lera Acedo et al. (2021) de Lera Acedo E., et al., 2021, REACH: Radio Experiment for the Analysis of Cosmic Hydrogen, in press
Anstey et al. (2020) Anstey D., de Lera Acedo E., Handley W., 2020, arXiv e-prints, p. arXiv:2010.09644
Banerjee (2009) Banerjee S., 2009, Bayesian Linear Model: Gory Details, Online, http://www.biostat.umn.edu/~ph7440/pubh7440/BayesianLinearModelGoryDetails.pdf
Barkana (2018) Barkana R., 2018, Nature, 555, 71
Bilous, A. V. et al. (2016) Bilous, A. V. et al., 2016, A&A, 591, A134
Bowman et al. (2018) Bowman J. D., Rogers A. E. E., Monsalve R. A., Mozdzen T. J., Mahesh N., 2018, Nature, 555, 67
Cohen et al. (2017) Cohen A., Fialkov A., Barkana R., Lotem M., 2017, Monthly Notices of the Royal Astronomical Society, 472, 1915
Dicke (1946) Dicke R. H., 1946, Review of Scientific Instruments, 17, 268
Ewall-Wice et al. (2020) Ewall-Wice A., Chang T.-C., Lazio T. J. W., 2020, MNRAS, 492, 6086
Field (1958) Field G. B., 1958, The Astrophysical Journal, 129, 536
Furlanetto et al. (2009) Furlanetto S. R., et al., 2009, in astro2010: The Astronomy and Astrophysics Decadal Survey. p. 83 (arXiv:0902.3011)
Handley (2018) Handley W., 2018, The Journal of Open Source Software, 3, 849
Hills et al. (2018) Hills R., Kulkarni G., Meerburg P. D., Puchwein E., 2018, Nature, 564, E32
Meys (1978) Meys R. P., 1978, IEEE Transactions on Microwave Theory and Techniques, MTT-26, 34
Monsalve et al. (2017) Monsalve R. A., Rogers A. E. E., Bowman J. D., Mozdzen T. J., 2017, The Astrophysical Journal, 835, 49
Nagel (2017) Nagel J. B., 2017, PhD thesis, ETH Zurich
Orloff & Bloom (2013) Orloff J., Bloom J., 2013, Conjugate priors: Beta and normal, Online, https://math.mit.edu/~dav/05.dir/class15-prep.pdf
Pritchard & Loeb (2012) Pritchard J. R., Loeb A., 2012, Reports on Progress in Physics, 75, 8
Razavi-Ghods (2018) Razavi-Ghods N., 2018, A new instrument to measure the global EoR signal (Internal Engineering Memo), University of Cambridge, doi:10.5281/zenodo.4761931
Rogers & Bowman (2012) Rogers A. E. E., Bowman J. D., 2012, Radio Science, 47
Trotta (2008) Trotta R., 2008, Contemporary Physics, 49, 71
Weinert (2009) Weinert H. L., 2009, Computational Statistics and Data Analysis, 53, 932
Zaldarriaga et al. (2004) Zaldarriaga M., Furlanetto S. R., Hernquist L., 2004, ApJ, 608, 622

$\displaystyle T_{\mathrm{NS}}\left(\frac{P_{\mathrm{cal}}-P_{\mathrm{L}}}{P_{\mathrm{NS}}-P_{\mathrm{L}}}\right)+T_{\mathrm{L}}$	$\displaystyle=T_{\mathrm{cal}}\left[\frac{1-\|\Gamma_{\mathrm{cal}}\|^{2}}{\|1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}\|^{2}}\right]$	(9)
	$\displaystyle+T_{\mathrm{unc}}\left[\frac{\|\Gamma_{\mathrm{cal}}\|^{2}}{\|1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}\|^{2}}\right]$
	$\displaystyle+T_{\mathrm{cos}}\left[\frac{\operatorname{Re}\left(\frac{\Gamma_{\mathrm{cal}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right)}{\sqrt{1-\|\Gamma_{\mathrm{rec}}\|^{2}}}\right]$
	$\displaystyle+T_{\mathrm{sin}}\left[\frac{\operatorname{Im}\left(\frac{\Gamma_{\mathrm{cal}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right)}{\sqrt{1-\|\Gamma_{\mathrm{rec}}\|^{2}}}\right],$