This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Bayesian noise wave calibration for 21-cm global experiments

I. L. V. Roque,1 W. J. Handley1,2 and N. Razavi-Ghods1
1Astrophysics Group, Cavendish Laboratory, JJ Thomson Avenue, Cambridge, CB3 0HE, UK
2Kavli Institute for Cosmology, Madingley Road, Cambridge, CB3 0HA, UK
E-mail: [email protected]
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract

Detection of millikelvin-level signals from the ‘Cosmic Dawn’ requires an unprecedented level of sensitivity and systematic calibration. We report the theory behind a novel calibration algorithm developed from the formalism introduced by the EDGES collaboration for use in 21-cm experiments. Improvements over previous approaches are provided through the incorporation of a Bayesian framework and machine learning techniques such as the use of Bayesian evidence to determine the level of frequency variation of calibration parameters that is supported by the data, the consideration of correlation between calibration parameters when determining their values and the use of a conjugate-prior based approach that results in a fast algorithm for application in the field. In self-consistency tests using empirical data models of varying complexity, our methodology is used to calibrate a 50 Ω\Omega ambient-temperature load. The RMS error between the calibration solution and the measured temperature of the load is 8 mK, well within the 1σ1\sigma noise level. Whilst the methods described here are more applicable to global 21-cm experiments, they can easily be adapted and applied to other applications, including telescopes such as HERA and the SKA.

keywords:
instrumentation: interferometers – methods: statistical – dark ages, reionization, first stars
pubyear: 2020pagerange: Bayesian noise wave calibration for 21-cm global experimentsBayesian noise wave calibration for 21-cm global experiments

1 Introduction

For nearly a century, scientists have been using radio-frequency instruments to advance the study of astronomy and complement information from the visual regime of the electromagnetic spectrum (Pritchard & Loeb, 2012). As we begin to take measurements of the early universe, these instruments must continue to evolve to support observations. Unexplored cosmic information from the Epoch of Reionisation and Cosmic Dawn redshifted into the radio spectrum could provide constraints on fundamental physics such as primordial black holes, galaxy formation, and universal curvature as discussed in Furlanetto et al. (2009). A unique probe of phenomena from the early cosmos is the hydrogen that inundates the intergalactic medium (IGM). Heating and cooling of the IGM associated with hydrogen’s absorption and emission of 21-cm photons produce a dynamic brightness temperature relative to the cosmic microwave background temperature, tracing the evolution of surrounding structure during the Cosmic Dawn. The brightness temperature of this 21-cm photon signal can be described by

T21(z)\displaystyle T_{21}(z)\approx 0.023K×\displaystyle\ 0.023\mathrm{K}\ \times (1)
xi(z)[(0.15Ωm)(1+z10)]12(Ωbh0.02)[1TR(z)TS(z)],\displaystyle x_{\text{H\,{i}}}(z)\left[\left(\frac{0.15}{\Omega_{\mathrm{m}}}\right)\left(\frac{1+z}{10}\right)\right]^{\frac{1}{2}}\left(\frac{\Omega_{\mathrm{b}}h}{0.02}\right)\left[1-\frac{T_{\mathrm{R}}(z)}{T_{\mathrm{S}}(z)}\right],

which is heavily dependent on environmental factors of the early universe such as xix_{\text{H\,{i}}}, the fraction of neutral hydrogen, Ωm\Omega_{\mathrm{m}} and Ωb\Omega_{\mathrm{b}}, the matter and baryon densities with respect to the universal critical density for a flat universe and Hubble’s constant. Here, the 0.0230.023 is a constant from atomic-line physics. TRT_{\mathrm{R}} is the background radiation temperature and TST_{\mathrm{S}} is known as the ‘21-cm spin temperature’, which is related to the kinetic temperature of neutral hydrogen gas in the IGM (Zaldarriaga et al., 2004; Field, 1958). This cosmic hydrogen signature measurable in the spectral sky has been redshifted to wavelengths under 200 MHz through the expansion of the universe as discussed in Pritchard & Loeb (2012).

There has been a recent surge in the field of 21-cm cosmology following the reported detection of an absorption feature consistent with a Cosmic Dawn signature. This was reported by the Experiment to Detect the Global EoR Signature (EDGES) in early 2018 from measurements of a sky-averaged radio spectrum (Bowman et al., 2018). The signal, centred at 78 MHz with a width corresponding to a period between 180 million and 270 million years after the Big Bang, matches the theoretical position in frequency, but its depth of 0.5\sim 0.5 K is a factor of two greater than the largest predictions from theoretical models (Cohen et al., 2017). This discrepancy would suggest that the temperature difference between the IGM and the cosmic microwave background was much larger than previously thought and would require new physics to explain, such as dark matter-baryon interactions (Barkana, 2018) or excess radio backgrounds (Ewall-Wice et al., 2020).

Another possible explanation for this discrepancy is that the measured signal is not cosmological but of systematic origin. This may be the case in EDGES due to some of the methodology used, such as a potentially unphysical foreground removal method and calibration of the receiver in a separate environment from the data acquisition (Hills et al., 2018; Razavi-Ghods, 2018). In this paper, we present a novel calibration algorithm that improves on the work of the EDGES team (Rogers & Bowman, 2012) through the utilisation of a Bayesian framework to promote efficient use of the data to remove systematics. Using conjugate priors and machine learning techniques, our pipeline can be applied in the field with the collection of data with additional capabilities for optimising individual noise wave parameters and incorporating correlations between them.

This paper is organised as follows. In section 2 we review the methodology behind calibration using noise waves as well as present a Bayesian framework that provides greater flexibility in radiometer calibration. Section 3 describes the process of using mock data sets modelled after empirical measurements of reflection coefficients with the incorporation of a realistic noise model to evaluate our pipeline.

2 Methods

In this section, we detail the methodology behind radiometer calibration using noise wave parameters. An overview of global signal measurement are outlined in section 2.1. Section 2.2 summarises the basic procedure with some mathematical improvements while section 2.3 describes our Bayesian framework and its associated advantages.

2.1 Measuring the global signal

The noise necessitating calibration emerges during measurement-taking. In an averaged or global experiment, the sky temperature Tsky(Ω,ν,t)T_{\mathrm{sky}}(\Omega,\nu,t) is a function of the direction Ω\Omega, frequency ν\nu and time tt. This can be broken down into two primary components: the global 21-cm signal T21T_{21} and astrophysical foregrounds TfT_{\mathrm{f}}

Tsky(Ω,ν,t)=T21(ν)+Tf(Ω,ν,t).T_{\mathrm{sky}}(\Omega,\nu,t)=T_{21}(\nu)+T_{\mathrm{f}}(\Omega,\nu,t). (2)

The antenna measures the sky signal convolved with the normalised antenna directivity BB. The process of measurement introduces the random noise term NdataN_{\mathrm{data}}.

D(ν,t)=Tsky(Ω,ν,t)B(Ω,ν)dΩ+Ndata.D(\nu,t)=\int T_{\mathrm{sky}}(\Omega,\nu,t)B(\Omega,\nu)\mathrm{d}\Omega+N_{\mathrm{data}}. (3)

Our 21-cm signature can thus be represented as

T21D(ν,t)Tf(Ω,ν,t)B(Ω,ν)dΩNdata.T_{21}\approx D(\nu,t)-\int T_{\mathrm{f}}(\Omega,\nu,t)B(\Omega,\nu)\mathrm{d}\Omega-N_{\mathrm{data}}. (4)

Here, the integral is assessed through foreground and beam modelling techniques such as those discussed in Anstey et al. (2020) while modelling of NdataN_{\mathrm{data}} from the statistical properties of D(ν,t)D(\nu,t) is accomplished by a calibration algorithm as articulated in this paper and outlined in Fig. 1. Having a fully Bayesian framework when modelling the beam, the sky and the systematics has major advantages for global 21-cm experiments such as REACH (de Lera Acedo et al., 2021), as it provides the greatest flexibility in being able to model all effects and jointly fit for them.

Refer to caption
Figure 1: Diagram showing the evolution of the 21-cm signal hampered by astrophysical foregrounds, convolvution with the antenna beam and the emergence of measurement noise before calibration to retrieve the antenna temperature.

2.2 Calibration methodology

The standard calibration strategy follows the method introduced by Dicke to characterise systematic features in radio frequency instruments (Dicke, 1946) and is widely used in experiments such as EDGES (Monsalve et al., 2017) and LOFAR (Bilous, A. V. et al., 2016) to evaluate the spectral index of the sky’s diffuse radio background (Rogers & Bowman, 2012). This technique involves measurements of two internal reference standards; a load and a noise source, in addition to a series of external calibration sources attached to the receiver input in lieu of the antenna. These include an ambient-temperature ‘cold’ load, a ‘hot’ load heated to 400\sim 400 K, an open-ended cable and a shorted cable. A block diagram showing this arrangement is presented in Fig. 2.

When calibrating the receiver, reflection coefficients are taken of the calibration source connected to the receiver input (Γcal\Gamma_{\mathrm{cal}}) and of the receiver itself (Γrec\Gamma_{\mathrm{rec}}) as well as power spectral densities (PSDs) of the input (PcalP_{\mathrm{cal}}), the internal reference load (PLP_{\mathrm{L}}) and the internal reference noise source (PNSP_{\mathrm{NS}}) (Monsalve et al., 2017). These measurements are used to calculate a preliminary ‘uncalibrated’ antenna temperature TcalT_{\mathrm{cal}}^{*}

Refer to caption
Figure 2: Diagram of a typical calibration setup. For characterisation of the receiver, a switch cycles between a calibrator connected to the input and the two internal references.
Tcal=TNS(PcalPLPNSPL)+TL,T_{\mathrm{cal}}^{*}=T_{\mathrm{NS}}\left(\frac{P_{\mathrm{cal}}-P_{\mathrm{L}}}{P_{\mathrm{NS}}-P_{\mathrm{L}}}\right)+T_{\mathrm{L}}, (5)

where TLT_{\mathrm{L}} and TNST_{\mathrm{NS}} are assumptions for the noise temperature of the internal reference load and excess noise temperature of the internal noise source above ambient, respectively. This initial calculation is used to calibrate out any time-dependent system gain that emerges from a series of filters, amplifiers and cables, as well as the analogue-to-digital converter within the experimental apparatus (Monsalve et al., 2017). Each PSD measurement can be expressed in terms of specific response contributions as detailed in Bowman et al. (2018)

Pcal=gsys[\displaystyle P_{\mathrm{cal}}=g_{\mathrm{sys}}\Bigg{[} Tcal(1|Γcal|2)|1|Γrec|21ΓcalΓrec|2\displaystyle T_{\mathrm{cal}}\left(1-|\Gamma_{\mathrm{cal}}|^{2}\right)\left|\frac{\sqrt{1-|\Gamma_{\mathrm{rec}}|^{2}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right|^{2} (6)
+\displaystyle+ Tunc|Γcal|2|1|Γrec|21ΓcalΓrec|2\displaystyle T_{\mathrm{unc}}|\Gamma_{\mathrm{cal}}|^{2}\left|\frac{\sqrt{1-|\Gamma_{\mathrm{rec}}|^{2}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right|^{2}
+\displaystyle+ TcosRe(Γcal1|Γrec|21ΓcalΓrec)\displaystyle T_{\mathrm{cos}}\operatorname{Re}\left(\Gamma_{\mathrm{cal}}\frac{\sqrt{1-|\Gamma_{\mathrm{rec}}|^{2}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right)
+\displaystyle+ TsinIm(Γcal1|Γrec|21ΓcalΓrec)+T0].\displaystyle T_{\mathrm{sin}}\operatorname{Im}\left(\Gamma_{\mathrm{cal}}\frac{\sqrt{1-|\Gamma_{\mathrm{rec}}|^{2}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right)+T_{0}\Bigg{]}.

Here, gsysg_{\mathrm{sys}} is the system gain referenced to the receiver input and TcalT_{\mathrm{cal}} is our calibrated input temperature. TuncT_{\mathrm{unc}}, TcosT_{\mathrm{cos}}, and TsinT_{\mathrm{sin}} are the ‘noise wave parameters’ introduced by Meys (1978) to calibrate the instrument. TuncT_{\mathrm{unc}} represents the portion of noise reflected by the antenna that is uncorrelated with the output noise of the low noise amplifier (LNA). TcosT_{\mathrm{cos}} and TsinT_{\mathrm{sin}} are the portions of reflected noise correlated with noise from the LNA (Monsalve et al., 2017; Rogers & Bowman, 2012). In the EDGES experiment, these calibration quantities are modelled using seven-term polynomials in frequency.

The PSDs for the internal reference load and noise source can similarly be expressed as in equation 6. However, since the reflection coefficients of the internal references are typically less than 0.005, they are taken to be zero in order to simplify the equations

PL=gsys[TL(1|Γrec|2)+T0],P_{\mathrm{L}}=g_{\mathrm{sys}}^{*}[T_{\mathrm{L}}\left(1-|\Gamma_{\mathrm{rec}}|^{2}\right)+T_{0}^{*}], (7)
PNS=gsys[(TL+TNS)(1|Γrec|2)+T0].P_{\mathrm{NS}}=g_{\mathrm{sys}}^{*}[\left(T_{\mathrm{L}}+T_{\mathrm{NS}}\right)\left(1-|\Gamma_{\mathrm{rec}}|^{2}\right)+T_{0}^{*}]. (8)

As shown in Fig. 2, the internal references may be on a separate reference plane than the receiver input, resulting in a system gain gsysg_{\mathrm{sys}}^{*} and a noise offset T0T_{0}^{*} different from those defined in equation 6. This effect is taken into account by two additional scale and offset parameters, C1C_{1} and C2C_{2}, introduced by EDGES (Monsalve et al., 2017).

Since C1C_{1} and C2C_{2} also correct for first-order assumptions in the noise temperatures of the internal reference load and noise source, we have chosen to absorb these terms into TLT_{\mathrm{L}} and TNST_{\mathrm{NS}}. This adjustment allows all calibration parameters, TuncT_{\mathrm{unc}}, TcosT_{\mathrm{cos}}, TsinT_{\mathrm{sin}}, and an ‘effective’ TNST_{\mathrm{NS}} and TLT_{\mathrm{L}}, to be solved for in units of kelvin, facilitating a joint solution of parameters. Expanding equation 5 using equations 6, 7 and 8 yields a linear identity providing a relationship between the uncalibrated input temperature and a final calibrated temperature of any device connected to the receiver input

TNS(PcalPLPNSPL)+TL\displaystyle T_{\mathrm{NS}}\left(\frac{P_{\mathrm{cal}}-P_{\mathrm{L}}}{P_{\mathrm{NS}}-P_{\mathrm{L}}}\right)+T_{\mathrm{L}} =Tcal[1|Γcal|2|1ΓcalΓrec|2]\displaystyle=T_{\mathrm{cal}}\left[\frac{1-|\Gamma_{\mathrm{cal}}|^{2}}{|1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}|^{2}}\right] (9)
+Tunc[|Γcal|2|1ΓcalΓrec|2]\displaystyle+T_{\mathrm{unc}}\left[\frac{|\Gamma_{\mathrm{cal}}|^{2}}{|1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}|^{2}}\right]
+Tcos[Re(Γcal1ΓcalΓrec)1|Γrec|2]\displaystyle+T_{\mathrm{cos}}\left[\frac{\operatorname{Re}\left(\frac{\Gamma_{\mathrm{cal}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right)}{\sqrt{1-|\Gamma_{\mathrm{rec}}|^{2}}}\right]
+Tsin[Im(Γcal1ΓcalΓrec)1|Γrec|2],\displaystyle+T_{\mathrm{sin}}\left[\frac{\operatorname{Im}\left(\frac{\Gamma_{\mathrm{cal}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\right)}{\sqrt{1-|\Gamma_{\mathrm{rec}}|^{2}}}\right],

where all parameters are frequency-dependent. This is not explicitly shown for simplicity of notation. For estimation of the noise wave parameters, TcalT_{\mathrm{cal}}, Γcal\Gamma_{\mathrm{cal}} and Γrec\Gamma_{\mathrm{rec}} are measured along with the PSDs while gsysg_{\mathrm{sys}} and T0T_{\mathrm{0}} are calibrated out. The cold and hot loads exhibit the main temperature references needed for TLT_{\mathrm{L}} and TNST_{\mathrm{NS}}. The cables facilitate the derivation of the noise wave parameters describing spectral ripples from the noise properties of the receiver by acting as antennas looking at an isotropic sky with temperatures equal to the cables’ physical temperatures (Rogers & Bowman, 2012).

2.3 Bayesian calibration framework

One possible source of systematics in the calibration methodology used by EDGES comes from measuring the response of the four external calibrators along with the receiver reflection coefficient in a laboratory away from where the instrument is actually deployed (Bowman et al., 2018). This process, especially with regards to how calibration parameters change, can be non-trivial. Furthermore, the fixed polynomial order used by EDGES for all noise wave parameters may underfit or overfit individual parameters and thus ‘fit out’ data useful for determining systematics or potentially even the 21-cm signal itself if a joint fit is performed.

In response to these issues, we have developed a calibration pipeline that improves on the strategies presented in section 2.2. We introduce a novel Bayesian methodology using conjugate priors for a dynamic application of our algorithm to be run with data collection regardless of system complexity. Also included are model selection methods using machine learning techniques for the optimisation of individual noise wave parameters to combat overfitting and underfitting, the results of which converge with that of a least-squares approach when wide priors are adopted. Our pipeline easily incorporates many more calibrators than the standard four shown in Fig. 2 to increase constraints on noise wave parameters while identifying possible correlations between them. A schematic of the improved calibration method is shown in Fig. 3.

Refer to caption
Figure 3: Outline of the Bayesian calibration algorithm. Blue blocks represent data to be taken, red blocks represent calculations and green blocks represent calculation outputs.

In order to simplify our calibration approach, we first define the following terms

Xunc=|Γcal|21|Γcal|2,X_{\mathrm{unc}}=-\frac{|\Gamma_{\mathrm{cal}}|^{2}}{1-|\Gamma_{\mathrm{cal}}|^{2}}, (10)
XL=|1ΓcalΓrec|21|Γcal|2,X_{\mathrm{L}}=\frac{|1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}|^{2}}{1-|\Gamma_{\mathrm{cal}}|^{2}}, (11)
Xcos=Re(Γcal1ΓcalΓrec×XL1|Γrec|2),X_{\mathrm{cos}}=-\operatorname{Re}\left(\frac{\Gamma_{\mathrm{cal}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\times\frac{X_{\mathrm{L}}}{\sqrt{1-|\Gamma_{\mathrm{rec}}|^{2}}}\right), (12)
Xsin=Im(Γcal1ΓcalΓrec×XL1|Γrec|2),X_{\mathrm{sin}}=-\operatorname{Im}\left(\frac{\Gamma_{\mathrm{cal}}}{1-\Gamma_{\mathrm{cal}}\Gamma_{\mathrm{rec}}}\times\frac{X_{\mathrm{L}}}{\sqrt{1-|\Gamma_{\mathrm{rec}}|^{2}}}\right), (13)
XNS=(PcalPLPNSPL)XL,X_{\mathrm{NS}}=\left(\frac{P_{\mathrm{cal}}-P_{\mathrm{L}}}{P_{\mathrm{NS}}-P_{\mathrm{L}}}\right)X_{\mathrm{L}}, (14)

which represent initial calibration measurements on DD in the frequency domain for the characterisation of NdataN_{\mathrm{data}} from equation 3 via our noise wave parameters. It is expected that calibration-related deviations of DD in the time domain are sufficiently curtailed through practical strategies such as temperature control of the receiver environment. Incorporating these into equation 9, with some rearrangement, then gives the equation

XuncTunc+XcosTcos+XsinTsin+XNSTNS+XLTL=Tcal,X_{\mathrm{unc}}T_{\mathrm{unc}}+X_{\mathrm{cos}}T_{\mathrm{cos}}+X_{\mathrm{sin}}T_{\mathrm{sin}}+X_{\mathrm{NS}}T_{\mathrm{NS}}+X_{\mathrm{L}}T_{\mathrm{L}}=T_{\mathrm{cal}}, (15)

at each frequency. Here, there are no squared or higher-order terms, allowing us to take advantage of the linear form by grouping the data and noise wave parameters into separate matrices

X (XuncXcosXsinXNSXL),\displaystyle\equiv\begin{pmatrix}X_{\mathrm{unc}}\quad X_{\mathrm{cos}}\quad X_{\mathrm{sin}}\quad X_{\mathrm{NS}}\quad X_{\mathrm{L}}\end{pmatrix},
𝚯\displaystyle\boldsymbol{\Theta} (TuncTcosTsinTNSTL).\displaystyle\equiv\begin{pmatrix}T_{\mathrm{unc}}\quad T_{\mathrm{cos}}\quad T_{\mathrm{sin}}\quad T_{\mathrm{NS}}\quad T_{\mathrm{L}}\end{pmatrix}^{\top}. (16)

In these equations, all of our data; the reflection coefficient measurements and power spectral densities, are grouped in an X vector which forms a matrix where one of the axes is frequency. The calibration parameters as frequency-dependent polynomials of varying degree are collected into a 𝚯\boldsymbol{\boldsymbol{\Theta}} vector which serves as our model describing NdataN_{\mathrm{data}}. Applying these definitions condenses the calibration equation into

Tcal=X𝚯+σ,\textbf{{T}}_{\mathrm{cal}}=\textbf{{X}}\boldsymbol{\boldsymbol{\Theta}}+\sigma, (17)

where Tcal\textbf{{T}}_{\mathrm{cal}} is a vector over frequency and σ\sigma is a noise vector representing our error. Since EDGES assumes that each power spectral density measurement is frequency independent, we have assumed that σ\sigma is a multivariate normal distribution. This assumption is implicit in the EDGES analysis in which they use a least-squares minimisation approach for solving model parameters.

For calibration of the receiver, we are concerned with the construction of predictive models of the noise wave parameters, 𝚯\boldsymbol{\Theta}, in the context of some dataset, T. We can use 𝚯\boldsymbol{\Theta} to calculate the probability of observing the data given a specific set of noise wave parameters:

p(T|𝚯,σ2)\displaystyle p\big{(}\textbf{{T}}\>\big{|}\>\boldsymbol{\Theta},\sigma^{2}\big{)} =\displaystyle= (18)
12πσ2N/2exp{12σ2(TX𝚯)(TX𝚯)},\displaystyle\frac{1}{2\pi\sigma^{2}}^{N/2}\exp{\Bigg{\{}-\frac{1}{2\sigma^{2}}\left(\textbf{{T}}-\textbf{{X}}\boldsymbol{\Theta}\right)^{\top}\left(\textbf{{T}}-\textbf{{X}}\boldsymbol{\Theta}\right)\Bigg{\}}},

where, NN is the number of measurements. This distribution on the data is the likelihood. For the purposes of calibration, T may be Tcal\textbf{{T}}_{\mathrm{cal}} measurements or alternatively, Tsky\textbf{{T}}_{\mathrm{sky}} for prediction of a sky signal. Our model must also specify a prior distribution, quantifying our initial assumptions on the values and spread of our noise wave parameters which we specify as a multivariate normal inverse gamma distribution:

p(𝚯,σ2)\displaystyle p\left(\boldsymbol{\Theta},\sigma^{2}\right)\propto (1σ2)a+1+(d/2)×\displaystyle\left(\frac{1}{\sigma^{2}}\right)^{a+1+\left(d/2\right)}\times (19)
exp[1σ2{b+12(𝚯𝝁𝚯)V𝚯1(𝚯𝝁𝚯)}],\displaystyle\exp\left[-\frac{1}{\sigma^{2}}\{b+\frac{1}{2}\left(\boldsymbol{\Theta}-\boldsymbol{\mu}_{\boldsymbol{\Theta}}\right)^{\top}\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}\left(\boldsymbol{\Theta}-\boldsymbol{\mu}_{\boldsymbol{\Theta}}\right)\}\right],

which is proportional up to an integration constant. Here, aa and bb, which are greater than zero, along with V𝚯\textbf{{V}}_{\boldsymbol{\Theta}} and 𝝁𝚯\boldsymbol{\mu}_{\boldsymbol{\Theta}} represent our prior knowledge on the noise wave parameters. dd is the length of our vector 𝚯\boldsymbol{\Theta}.

Equation 18 is determined by a set of values for our model 𝚯\boldsymbol{\Theta}. We can marginalise out the dependence on 𝚯\boldsymbol{\Theta} and our noise term by integrating over the prior distribution by both 𝚯\boldsymbol{\Theta} and σ2\sigma^{2} at once. Following the steps in Banerjee (2009)

p(Tcal)\displaystyle p\left(\textbf{{T}}_{\mathrm{cal}}\right) =p(Tcal|𝚯,σ2)p(𝚯,σ2)d𝚯dσ2\displaystyle=\int p\left(\textbf{{T}}_{\mathrm{cal}}\>\big{|}\>\boldsymbol{\Theta},\sigma^{2}\right)p\left(\boldsymbol{\Theta},\sigma^{2}\right)\mathrm{d}\boldsymbol{\Theta}\mathrm{d}\sigma^{2} (20)
=baΓ(a)|V|baΓ(a)|V𝚯|(2π)N/2,\displaystyle=\frac{b^{a}\Gamma\left(a^{*}\right)\sqrt{|\textbf{{V}}^{*}|}}{{b^{*}}^{a^{*}}\Gamma\left(a\right)\sqrt{|\textbf{{V}}_{\boldsymbol{\Theta}}|}}(2\pi)^{-N/2},

where

a\displaystyle a^{*} =a+N2,\displaystyle=a+\frac{N}{2}, (21)
b\displaystyle b^{*} =b+12[𝝁𝚯V𝚯1𝝁𝚯+TcalTcal𝝁V1𝝁],\displaystyle=b+\frac{1}{2}[\boldsymbol{\mu}_{\boldsymbol{\Theta}}^{\top}\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}\boldsymbol{\mu}_{\boldsymbol{\Theta}}+\textbf{{T}}_{\mathrm{cal}}^{\top}\textbf{{T}}_{\mathrm{cal}}-\boldsymbol{\mu}^{*\top}\textbf{{V}}^{*-1}\boldsymbol{\mu}^{*}],
𝝁\displaystyle\boldsymbol{\mu}^{*} =(V𝚯1+XX)1(V𝚯1𝝁𝚯+XTcal),\displaystyle=\left(\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}+\textbf{{X}}^{\top}\textbf{{X}}\right)^{-1}\left(\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}\boldsymbol{\mu}_{\boldsymbol{\Theta}}+\textbf{{X}}^{\top}\textbf{{T}}_{\mathrm{cal}}\right),
V\displaystyle\textbf{{V}}^{*} =(V𝚯1+XX)1,\displaystyle=\left(\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}+\textbf{{X}}^{\top}\textbf{{X}}\right)^{-1},

and Γ(x)\Gamma\left(x\right) represents the Gamma function, not to be confused with the notation for our reflection coefficients. Equation 20 is the evidence, which gives the probability of observing the data Tcal\textbf{{T}}_{\mathrm{cal}} given our model.111It is in fact better to use the equivalent more numerically stable expression b=b+𝒒𝒒+𝒒XV𝚯X𝒒b^{*}=b+\boldsymbol{q}^{\top}\boldsymbol{q}+\boldsymbol{q}^{\top}\textbf{{X}}\textbf{{V}}_{\boldsymbol{\Theta}}\textbf{{X}}^{\top}\boldsymbol{q}, where 𝒒=TcalX𝝁\boldsymbol{q}=\textbf{{T}}_{\mathrm{cal}}-\textbf{{X}}\boldsymbol{\mu}^{*} to avoid cancellation of large terms.

With the prior distribution specified, we use Bayes’ equation to invert the conditioning of the likelihood and find the posterior using the likelihood, prior and evidence:

p(𝚯,σ2|Tcal)=p(Tcal|𝚯,σ2)p(𝚯,σ2)p(Tcal).p\left(\boldsymbol{\Theta},\sigma^{2}\>\big{|}\>\textbf{{T}}_{\mathrm{cal}}\right)=\frac{p\left(\textbf{{T}}_{\mathrm{cal}}\>\big{|}\>\boldsymbol{\Theta},\sigma^{2}\right)p\left(\boldsymbol{\Theta},\sigma^{2}\right)}{p\left(\textbf{{T}}_{\mathrm{cal}}\right)}. (22)

Similarly from Banerjee (2009), this can be written as

p(𝚯,σ2|\displaystyle p\Bigl{(}\boldsymbol{\Theta},\sigma^{2}\>\big{|} Tcal)(1σ2)a+d2+1×\displaystyle\textbf{{T}}_{\mathrm{cal}}\Bigl{)}\propto\left(\frac{1}{\sigma^{2}}\right)^{a^{*}+\frac{d}{2}+1}\times (23)
exp{1σ2[b+12(𝚯𝝁)V1(𝚯𝝁)]}.\displaystyle\exp{\Bigg{\{}-\frac{1}{\sigma^{2}}\Bigg{[}b^{*}+\frac{1}{2}\left(\boldsymbol{\Theta}-\boldsymbol{\mu}^{*}\right)^{\top}\textbf{{V}}^{*-1}\left(\boldsymbol{\Theta}-\boldsymbol{\mu}^{*}\right)\Bigg{]}\Bigg{\}}}.

The posterior distribution represents the uncertainty of our parameters after analysis, reflecting the increase in information (Nagel, 2017). We highlight the difference between the ‘likelihood-only’ least-squares approach versus the Bayesian approach with the former being a special case of the latter with very wide priors demonstrable when V𝚯V𝚯10\textbf{{V}}_{\boldsymbol{\Theta}}\rightarrow\infty\Rightarrow\textbf{{V}}_{\boldsymbol{\Theta}}^{-1}\rightarrow 0, and 𝝁\boldsymbol{\mu}^{*} becomes 𝚯\boldsymbol{\Theta}. The transition from ‘non-starred’ variables to ‘starred’ variables represents our ‘Bayesian update’ of the prior to the posterior noise wave parameters in light of the calibration data Tcal\textbf{{T}}_{\mathrm{cal}}.

As we can see, the posterior distribution is in the same probability distribution family as equation 19, making our prior a conjugate prior on the likelihood distribution. The use of conjugate priors gives a closed-form solution for the posterior distribution through updates of the prior hyperparameters via the likelihood function (Banerjee, 2009; Orloff & Bloom, 2013). The resulting numerical computation is many orders of magnitude faster than MCMC methods relying on full numerical sampling and permits an in-place calculation in the same environment as the data acquisition. This becomes particularly useful for the speed of the algorithm as frequency dependence is introduced in which the computations would not be manageable without conjugate gradients.

To allow for a smooth frequency dependency, we promote each of our noise wave parameters in section 2.3 to a vector of polynomial coefficients

Ti=(Ti[0],Ti[1],Ti[2],,Ti[n]),T_{\mathrm{i}}=\begin{pmatrix}T_{\mathrm{i}}^{[0]},&T_{\mathrm{i}}^{[1]},&T_{\mathrm{i}}^{[2]},&...,&T_{\mathrm{i}}^{[n]}\end{pmatrix}, (24)

where ii is our noise wave parameter label; i{unc,cos,sin,NS,L}i\in\{\mathrm{unc,\ cos,\ sin,\ NS,\ L}\}, modelled using n+1n+1 polynomial coefficients. Likewise

Xi=(Xi,Xi(νν0),Xi(νν0)2,,Xi(νν0)n),\textbf{{X}}_{i}=\begin{pmatrix}\textbf{{X}}_{i},&\textbf{{X}}_{i}\left(\frac{\nu}{\nu_{0}}\right),&\textbf{{X}}_{i}{\left(\frac{\nu}{\nu_{0}}\right)}^{2},&...,&\textbf{{X}}_{i}{\left(\frac{\nu}{\nu_{0}}\right)}^{n}\end{pmatrix}, (25)

where ν\nu is a vector of input frequencies which are raised to powers up to nn. For a vector of nn’s attributed to our calibration parameters, under this notation multiplication in equation 17 is element-wise and equation 20 is effectively p(Tcal|n)p\left(\textbf{{T}}_{\mathrm{cal}}|\textbf{{n}}\right). Assuming a uniform prior on n, inverting Bayes’ theorem gives p(n|Tcal)p\left(\textbf{{n}}|\textbf{{T}}_{\mathrm{cal}}\right) for use in model comparison in which the relative probabilities of models can be evaluated in light of the data and priors. Occam’s razor advises whether the extra complexity of a model is needed to describe the data (Trotta, 2008), permitting optimisation of the polynomial orders for individual noise wave parameters as detailed in section 3.3. By taking a random sampling of the resulting posterior, we characterise the noise wave parameters as multivariate distributions depicted in contour plots which exhibit a peak value accompanied by 1σ1\sigma and 2σ2\sigma variance as well as correlation between parameters inferred from a covariance matrix.

Following characterisation of the receiver, we next apply the Tcal\textbf{{T}}_{\mathrm{cal}} from our calibration to a set of raw antenna data X^\hat{\textbf{{X}}} for prediction of our sky signal, Tsky\textbf{{T}}_{\mathrm{sky}}, from equation 3. The predictions for the data follow from the posterior predictive distribution

p(Tsky|Tcal)=p(Tsky|𝚯,σ2)p(𝚯,σ2|Tcal)d𝚯dσ2.p\left(\textbf{{T}}_{\mathrm{sky}}\>\big{|}\>\textbf{{T}}_{\mathrm{cal}}\right)=\int p\left(\textbf{{T}}_{\mathrm{sky}}\>\big{|}\>\boldsymbol{\Theta},\sigma^{2}\right)p\left(\boldsymbol{\Theta},\sigma^{2}\>\big{|}\>\textbf{{T}}_{\mathrm{cal}}\right)\mathrm{d}\boldsymbol{\Theta}\mathrm{d}\sigma^{2}. (26)

The first probability in the integral is the likelihood for our antenna measurement Tsky\textbf{{T}}_{\mathrm{sky}} and the second is our posterior from equation 23. Following the steps in Banerjee (2009), this can be shown to be a multivariate Student’s t-distribution written as:

p(\displaystyle p\Big{(} Tsky|Tcal)=Γ(a+d2)Γ(a)πd2|2b(I+X^VX^)|12\displaystyle\textbf{{T}}_{\mathrm{sky}}\>\big{|}\>\textbf{{T}}_{\mathrm{cal}}\Big{)}=\frac{\Gamma\left(a^{*}+\frac{d}{2}\right)}{\Gamma\left(a^{*}\right)\pi^{\frac{d}{2}}|2b^{*}\left(I+\hat{\textbf{{X}}}\textbf{{V}}^{*}\hat{\textbf{{X}}}^{\top}\right)|^{\frac{1}{2}}} (27)
×[1+(TskyX^𝝁)(I+X^VX^)1(TskyX^𝝁)2b](a+d2),\displaystyle\times\left[1+\frac{\left(\textbf{{T}}_{\mathrm{sky}}-\hat{\textbf{{X}}}\boldsymbol{\mu}^{*}\right)^{\top}\left(I+\hat{\textbf{{X}}}\textbf{{V}}^{*}\hat{\textbf{{X}}}^{\top}\right)^{-1}\left(\textbf{{T}}_{\mathrm{sky}}-\hat{\textbf{{X}}}\boldsymbol{\mu}^{*}\right)}{2b^{*}}\right]^{-\left(a^{*}+\frac{d}{2}\right)},

where II is the N×NN\times N identity matrix and aa^{*}, bb^{*}, 𝝁\boldsymbol{\mu}^{*} and V\textbf{{V}}^{*} are defined in equation 21. This new distribution on Tsky\textbf{{T}}_{\mathrm{sky}} corresponds to a set of points with error bars and represents the calibrated sky temperature as the output of the receiver.

3 Empirical modelling and simulations

To verify the performance of our pipeline and highlight features of the algorithm, we evaluate the results of self-consistency checks using empirical models of data based on measurements taken in the laboratory. To make this data as realistic as possible, we used actual measurements of the reflection coefficients of many types of calibrators (see table 1) to generated power spectral densities using equations 6, 7 and 8 given a set of realistic model noise wave parameters along with some assumptions about the noise, which are described in section 3.4. The impedance of the calibrators which were measured with a vector network analyser (VNA) and used in our pipeline are shown on a Smith chart in Fig. 4

Table 1: Table of calibrators used in the creation of our empirical data models for analysis. Calibrators are added in pairs in the order below when increasing the number of calibration sources used by our algorithm.
Calibrator Temperature
Cold load (50Ω50\ \Omega) 298 K
Hot load (50Ω50\ \Omega) 373 K
Gore cable +5Ω+5\ \Omega 298 K
Gore cable +500Ω+500\ \Omega 298 K
Gore cable +31Ω+31\ \Omega 298 K
Gore cable +81Ω+81\ \Omega 298 K
25Ω25\ \Omega resistor 298 K
100Ω100\ \Omega resistor 298 K
Refer to caption
Figure 4: Smith chart (Argand diagram) showing the measured complex impedance of the calibrators used in the Bayesian pipeline across a range of frequencies.

We start by demonstrating the importance of correlation between noise wave parameters when determining their values to provide a better calibration solution for the reduction of systematic features in the data such as reflections (section 3.1). We then show the increased constraints on these noise wave parameters attributed to the inclusion of more calibrators than the standard number of four (section 3.2). Following this, we illustrate the effectiveness of model selection for the optimisation of individual noise wave parameters to prevent the loss of information resulting from overfitting or underfitting of the data (section 3.3). Finally, these features are incorporated into a calibration solution applied to a 50Ω50\ \Omega load (section 3.4).

3.1 Correlation between noise wave parameters

In this section, we show the first major feature of our Bayesian pipeline; the consideration of correlation between noise wave parameters when deriving their values. This is best demonstrated when noise is introduced in an idealised way as to retain a form matching the Gaussian form of our mathematical model. To do this, empirical models of power spectral densities are calculated from equations 6, 7 and 8 using measurements of Γrec\Gamma_{\mathrm{rec}}, Γcal\Gamma_{\mathrm{cal}} and TcalT_{\mathrm{cal}} for the cold and hot loads, as well as a set of realistic noise wave parameters. Gaussian noise of one unit variation is then added to the TcalT_{\mathrm{cal}} measurements after the calculation to conserve its Gaussian form. This data is submitted to our algorithm and the resulting posterior distributions for coefficients of the polynomial noise wave parameters are compared to the initial values.

Such posterior distributions can be seen in Fig. 5 showing the results of models using only the cold load (grey posterior), only the hot load (red posterior) and using both loads in tandem (blue posterior). For these calculations we chose a set of model noise wave parameters as constants across the frequency band;

Tunc=250K\displaystyle T_{\mathrm{unc}}=250\ \mathrm{K}
Tcos=190K\displaystyle T_{\mathrm{cos}}=190\ \mathrm{K}
Tsin=90K\displaystyle T_{\mathrm{sin}}=90\ \mathrm{K}
TNS=1200K\displaystyle T_{\mathrm{NS}}=1200\ \mathrm{K}
TL=298K\displaystyle T_{\mathrm{L}}=298\ \mathrm{K}

In Fig. 5, a strong correlation between the TLT_{\mathrm{L}} and TNST_{\mathrm{NS}} is evident as the hot-load posterior is highly skewed as expected from equations 11 and 14. The resulting intersection of posteriors from the individual loads facilitate the derivation of noise wave parameters as the dual-load posterior is found within the region of posterior overlap crossing with the values of the model shown in the inset of Fig. 5. Retrieval of the noise wave parameter values using correlations between them found in the data demonstrate the relevance of this information which is not taken into account in previous calibration techniques.

Refer to caption
Figure 5: Plot showing the joint posteriors of TLT_{\mathrm{L}} and TNST_{\mathrm{NS}} for models using the cold load, the hot load, and both loads concurrently shown as the grey, red and blue posteriors respectively. The black cross hairs mark the noise wave parameter values used to generate data submitted to the pipeline. A zoom-in of the posterior intersection is provided to illustrate the constraint of noise wave parameter values attributed to the correlation between parameters.

3.2 Constraints with additional calibrators

A nice feature of our pipeline is the ability to include as many calibrators as required to constrain the calibration parameters. For analysis, six more calibrators are introduced in pairs following the order presented in table 1. We include data generated from measurements of multiple resistors terminating a high quality 25 m cable made by GORE®. Data for these calibrators is once again generated using fixed terms and Gaussian noise of one unit variation added to TcalT_{\mathrm{cal}} as discussed above. Fig. 6 shows the results of models using four, six, and eight calibrators.

Refer to caption
Figure 6: Posterior results of our pipeline using data from four, six and eight calibrators shown in grey, red and blue, respectively. Cross hairs mark the values of noise wave parameters used to generate the data. These values fall within 1σ1\sigma of the posterior mean values. We can see that the constraint on noise wave parameter values increases with the number of calibrators used in our pipeline which is encouraging.

As shown, the inclusion of more calibrators increases the constraint on the resulting noise wave parameters. However, we note that after the inclusion of four calibrators, the relative additional constraint decreases with each additional calibrator and thus the use of more than eight calibrators would be unnecessary. The values of noise wave parameters used to generate the data as indicated by the cross hairs in Fig. 6 all fall within 1σ1\sigma of our pipeline’s resulting posterior averages for models using all eight calibrators.

3.3 Optimisation of individual noise wave parameters

The final highlight of our Bayesian pipeline is a the use of machine learning techniques to optimise individual noise wave parameters. This is advantageous as a blanket set of order-seven polynomials applied to all noise wave parameters, such as done in the EDGES experiment, may underfit or overfit individual parameters and misidentify systematics or information about the signal being measured.

The optimisation procedure compares the evidences (equation 20) of different models to determine the vector of noise wave parameter polynomial coefficients n that best describes the data as briefly mentioned at the end of section 2.3. Since the model favoured by the data will have the highest evidence, we use a steepest descent procedure to compare models in ‘n-space’ and determine the direction of the gradient in ‘evidence-space’. After multiple iterations, this brings us to the model with the maximal evidence. Since n consists of five numbers corresponding to the number of polynomial coefficients for each of the five noise wave parameters, models are generated by individually increasing each index of n by 1. We expect the evidence to follow an ‘Occam’s cliff,’ in which the evidence sharply increases preceding the optimal n with a slow fall off following the maximum.

To demonstrate this, data is generated using measurements from all eight calibrators of table 1 and noise wave parameters as second-order polynomials

Tunc=x23x+250K\displaystyle T_{\mathrm{unc}}=x^{2}-3x+250\ \mathrm{K}
Tcos=2x2+190K\displaystyle T_{\mathrm{cos}}=2x^{2}+190\ \mathrm{K}
Tsin=3x2+8x+90K\displaystyle T_{\mathrm{sin}}=3x^{2}+8x+90\ \mathrm{K}
TNS=4x2+5x+1200K\displaystyle T_{\mathrm{NS}}=4x^{2}+5x+1200\ \mathrm{K}
TL=5x2+10x+298K\displaystyle T_{\mathrm{L}}=5x^{2}+10x+298\ \mathrm{K}

where xx is our normalised frequency. Gaussian noise of one unit variation is applied to the calibrator input temperatures as before. The evidences of various models are plotted in Fig. 7 in which an Occam’s cliff can be seen peaking at polynomial order two. As expected from the plot, the steepest descent algorithm finds that noise wave parameters modelled as second-order polynomials best describe the data.

Refer to caption
Figure 7: Evidence of multiple models are plotted which display the Occam’s cliff. Data is generated using noise wave parameters as order-2 polynomials. We see that for the model with the highest evidence, that is, the model favoured by the data, the number of polynomial coefficients matches that of the model noise wave parameters.

3.4 Application with realistic noise

To demonstrate the robustness of our pipeline, we conducted self-consistency checks using empirically modelled data with a more complicated noise model. This data was generated using reflection coefficients of eight calibrators and the receiver measured in the laboratory. These reflection coefficients were then smoothed using a cubic smoothing spline (Weinert, 2009) in order to maintain their approximate shape over frequency. The same second-order noise wave parameters detailed in section 3.3 are used with the reflection coefficients to generate our model power spectral densities. Following this, we added of order 1% Gaussian noise independently to the smoothed Γrec\Gamma_{\mathrm{rec}} and Γcal\Gamma_{\mathrm{cal}} as well as PcalP_{\mathrm{cal}} to more accurately represent the instrument noise from measurement equipment such as vector network analysers. No noise was added to the calibrator input temperatures. This results in a model that does not match the Gaussian form of our mathematical model as in the previous sections and thus does not demonstrate the features of our pipeline as explicitly, but is more representative of data set expected from measurements in the field. Data for the receiver and the cold load generated using this noise model are shown in Fig. 8.

Refer to caption
Figure 8: Power spectral densities and reflection coefficients for the receiver and the cold load generated under our realistic noise model.

Using data generated for all eight calibrators with our realistic noise model, the calibration algorithm selects optimal polynomial orders matching those of the model noise wave parameters whose values fall within within 1σ1\sigma of the posterior peak values as shown in Fig. 9. For these higher order tests, we use fgivenx plots which condense noise wave parameter posteriors into samples that can be compared to the model parameter values instead of comparing each individual coefficient (Handley, 2018).

When this calibration model is used to calibrate an ambient-temperature 50Ω50\ \Omega load, the RMS error between the calibrated temperature and the measured temperature is 8 mK, well within the 1σ1\sigma noise level (bottom right panel of Fig. 9). This level of accuracy is comparable to the 26 mK noise floor estimated for the EDGES pipeline in 2016 (Monsalve et al., 2017).

By individually adjusting each component of noise arising in our realistic noise model, we may determine what kind of noise our calibration algorithm is most sensitive to, as well as calculate the maximum amount of noise permissible for a specified level of systematic feature reduction. These topics are intended to be explored in a future work.

Refer to caption
Figure 9: Results from 1000 samples using data generated with our more realistic noise model (shown in black). The second-order noise wave parameters shown in red are used to generate the data inputted to our pipeline. The polynomial order and values of the noise wave parameters that best suit the data according to our algorithm match that of the empirical model. This solution is applied to an ambient-temperature load, shown in the bottom right panel as our predictive y^\hat{y} from equation 27, and calibrates it to within 1σ1\sigma of ambient temperature.

4 Conclusions

Here we presented the development of a calibration methodology based on the procedure used by EDGES but with key improvements to characterise reflections arising at connections within the receiver. Our pipeline utilises the Dicke switching technique and a Bayesian framework in order to individually optimise calibration parameters while identifying correlations between them using a dynamic algorithm to be applied in the same environment as the data acquisition. In a comprehensive investigation, we have evaluated our algorithm’s interpretation of empirical models of data which have been generated from known noise wave parameters and a realistic noise model. The solution, applied to an ambient-temperature 50Ω50\ \Omega load, produces a calibrated temperature with an RMS residual temperature of 8 mK. Future work for the pipeline regards application of real calibrator data, optimisation of noise wave parameter coefficients through marginalisation techniques and incorporation into an end-to-end simulation based on an entire experimental apparatus to better understand error tolerances. The flexibility of the algorithm attributed to our novel approach allows its application to any experiment relying on similar forms of calibration such as REACH (de Lera Acedo et al., 2021), were we intend to use this for in-the-field and on-the-fly radiometer calibration.

Acknowledgements

ILVR would like to thank S. M. Masur for her helpful comments. WJH was supported by a Gonville & Caius Research Fellowship, STFC grant number ST/T001054/1 and a Royal Society University Research Fellowship. We would like to thank the The Cambridge-Africa ALBORADA Research Fund for their support. We would also like to thank the Kavli Foundation for their support of the REACH experiment.

Data Availability

The data underlying this article will be shared on reasonable request to the corresponding author.

References