This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

aainstitutetext: Department of Physics, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Koreabbinstitutetext: Center for Axion and Precision Physics Research, Institute for Basic Science (IBS), Daejeon 34141, Republic of Korea

Analytical estimation of the signal to noise ratio efficiency in axion dark matter searches using a Savitzky-Golay filter

A. K. Yi,111Now at SLAC National Accelerator Laboratory, 2575 Sand Hill Rd., Menlo Park, California 94025, USA b    S. Ahn b,2    B. R. Ko,222Corresponding author. b,a    and Y. K. Semertzidis [email protected] [email protected] [email protected] [email protected]
Abstract

The signal to noise ratio efficiency ϵSNR\epsilon_{\rm SNR} in axion dark matter searches has been estimated using large-statistic simulation data reflecting the background information and the expected axion signal power obtained from a real experiment. This usually requires a lot of computing time even with the assistance of powerful computing resources. Employing a Savitzky-Golay filter for background subtraction, in this work, we estimated a fully analytical ϵSNR\epsilon_{\rm SNR} without relying on large-statistic simulation data, but only with an arbitrary axion mass and the relevant signal shape information. Hence, our work can provide ϵSNR\epsilon_{\rm SNR} using minimal computing time and resources prior to the acquisition of experimental data, without the detailed information that has to be obtained from real experiments. Axion haloscope searches have been observing the coincidence that the frequency independent scale factor ξ\xi is approximately consistent with the ϵSNR\epsilon_{\rm SNR}. This was confirmed analytically in this work, when the window length of the Savitzky-Golay filter is reasonably wide enough, i.e., at least 5 times the signal window.

Keywords:
Dark Matter, Axions and ALPs, Beyond Standard Model

1 Introduction

According to cosmological measurements and the standard model of Big Bang cosmology PLANCK , cold dark matter (CDM) is responsible for about 85% of the total matter in our Universe. CDM does not belong to the Standard Model of particle physics (SM), and as of today the nature of CDM is unknown in spite of the strong evidence of its existence CDM-EVIDENCE . The axion AXION is one of the iconic CDM candidates and originally stems from a breakdown of a new symmetry proposed by Peccei and Quinn PQ , as a very natural solution to the strong CPCP problem in the SM strongCP . In light of the current galaxy formation dark matter has to be massive, stable, and nonrelativistic, and the axion is predicted to meet all of these conditions. The axion haloscope search proposed by Sikivie utilizes axion-photon coupling and a microwave resonant cavity, which results in the resonant conversion of axions to photons sikivie . This enhances the detected axion signal power drastically. Making the axion haloscope the most promising method for axion dark matter searches in the microwave region. Recent experimental efforts have reached the Dine-Fischler-Srednicki-Zhitnitskii (DFSZ) axion sensitivity ADMX-DFSZ ; 12TB-PRL , where the DFSZ axion can be implemented in grand unified theories (GUT) GUT . If DFSZ cold axion dark matter turns out to constitute 100% of the local dark matter density, that would not only explain the total matter in our Universe, but also support GUT.

Since the resonated axion signal power is only sensitive to the resonant frequency region, and information about axion mass is absent, the most significant and practical figure of merit in axion haloscope searches is the scanning rate scanrate . One of the experimental parameters determining the scanning rate is the signal to noise ratio (SNR) efficiency squared ϵSNR2\epsilon^{2}_{\rm SNR} JINST , where the SNR=Paaγγ/σPn{\rm SNR}=P^{a\gamma\gamma}_{a}/\sigma_{P_{n}} according to the radiometer equation DICKE and we define ϵSNR\epsilon_{\rm SNR} later in equation (2). In the SNR equation, PaaγγP^{a\gamma\gamma}_{a} is the expected axion signal power for an axion-photon coupling strength sikivie ; scanrate and σPn\sigma_{P_{n}} is the fluctuation in the noise power PnP_{n}. Estimates of the ϵSNR\epsilon_{\rm SNR} have relied on large-statistic simulation data that usually require a lot of computation time and storage resources ADMX ; HAYSTAC ; JHEP . With hints reported in refs. HAYSTAC ; JHEP , in this study we have developed an analytical method to estimate the ϵSNR\epsilon_{\rm SNR} for axion dark matter searches. The ϵSNR\epsilon_{\rm SNR} for axion haloscope searches is frequency-independent or, equivalently, background-independent with the relevant scale factor ξ\xi, as demonstrated by refs. HAYSTAC ; JHEP , where ξ\xi is necessary and affected by the combination of power excess bins that are correlated with each other due to background subtraction. In addition, one of our previous works showed that ϵSNR\epsilon_{\rm SNR} is practically the signal power efficiency due to background subtraction JHEP . In another one of our recent works 12TB-PRD , however, we found a subtle effect to the background fluctuation due to background subtraction with a specific condition, which was considered as well.

Inspired by that information, we calculated the signal power efficiency ϵPaaγγ=ϵsig\epsilon_{P^{a\gamma\gamma}_{a}}=\epsilon_{\rm sig} whose effects can first be seen in Fig. 1 and ξ\xi independently using only an arbitrary axion mass and the relevant signal shape information. They showed good agreement with those obtained from large-statistic simulation data. In the end, axion haloscope searches have been observing the coincidence that the frequency independent scale factor ξ\xi is approximately consistent with the ϵSNR\epsilon_{\rm SNR}. This was confirmed analytically in this work, if the window length of our background estimator is reasonably wide enough, i.e., at least 5 times the signal window.

2 Obtaining the SNR efficiency analytically

The general data analysis procedure for axion haloscope searches studied in this work aims to maximize the significance of an axion signal by weighting and combining data points HAYSTAC . Raw data is acquired through power spectra in the frequency domain. The digitized power spectrum data will have the background noise power stored in frequency bins that are spaced at intervals of the resolution bandwidth (RBW) Δν\Delta\nu. The background will feature a baseline characterized by the system noise and its fluctuations. If an axion signal exists amidst the background, it will appear as a lineshape which is distributed to multiple frequency bins. In the case of the standard halo model, the lineshape is a boosted Maxwellian TURNER

f(ν)=2π(321r1νaβ2)sinh(3r2(ννa)νaβ2)exp(3(ννa)νaβ23r22).f(\nu)=\frac{2}{\sqrt{\pi}}\left(\sqrt{\frac{3}{2}}\frac{1}{r}\frac{1}{\nu_{a}\expectationvalue{\beta^{2}}}\right)\sinh\left(3r\sqrt{\frac{2(\nu-\nu_{a})}{\nu_{a}\expectationvalue{\beta^{2}}}}\right)\exp\left(-\frac{3(\nu-\nu_{a})}{\nu_{a}\expectationvalue{\beta^{2}}}-\frac{3r^{2}}{2}\right). (1)

This lineshape depends on the axion frequency νa\nu_{a}, r=vE/vrmsr=v_{E}/v_{\mathrm{rms}}, and β2=1.5vrms2/c2\expectationvalue{\beta^{2}}=1.5v_{\mathrm{rms}}^{2}/c^{2}, where vEv_{E} is the velocity of Earth with respect to the galaxy halos, vrmsv_{\mathrm{rms}} is the local circular velocity of the galaxy, and cc is the speed of light. In the analysis stage an axion signal window Δνa\Delta\nu_{a} is used where the integral of f(ν)f(\nu) from νa\nu_{a} to νa+Δνa\nu_{a}+\Delta\nu_{a} nears unity. The axion signal power is weak; for instance in the CAPP-12TB experiment it was on the order of tens of yoctowatts 12TB-PRL , making it difficult to distinguish from noise fluctuations. Therefore the data is further processed in a way that increases the SNR of an expected signal.

Following the conventional methods of haloscope data analysis, an axion signal, should it exist, will appear as a sharp peak on the background as shown in Fig. 1. This is also replicable by inserting a software-injected axion signal into an experiment’s background. Then the SNR of an input signal, which will have its background perfectly removed, can be compared with the SNR of signal which has its background removed by a fit. Examples of a fit estimator are a χ2\chi^{2} fit such as in refs. 12TB-PRL ; CAPP-8TB-PRL or a Savitzky-Golay (SG) filter SGFILTER as seen in refs. 12TB-PRD ; HAYSTAC . In this work we will be focusing on the latter as it does not require a functional form of the background structure, can handle zero-fluctuation data like the axion signal power considered in axion dark matter searches, and its estimates are fully predictable.

Refer to caption
Refer to caption
Figure 1: The effect of an axion signal distorting a background estimation using an SG filter (left) and its effect in terms of normalized power excess (right). An axion signal with exaggerated signal power was added at 1.096 GHz.

Typically any estimate of the background will be affected by a signal such as the one shown in Fig. 1 and due to this the output SNR will not recover 100% of the input. The difference in the input and output SNR can be expressed through an efficiency term ϵSNR\epsilon_{\rm SNR}

ϵSNR=SNRoutputSNRinput1ξϵsigξ,\epsilon_{\rm SNR}=\frac{\mathrm{SNR}_{\mathrm{output}}}{\mathrm{SNR}_{\mathrm{input}}}\frac{1}{\xi}\simeq\frac{\epsilon_{\rm sig}}{\xi}, (2)

where we define ϵsig=δoutput/δinput\epsilon_{\rm sig}=\delta_{\rm output}/\delta_{\rm input} for input and output signal powers δinput\delta_{\rm input} and δoutput\delta_{\rm output}. ϵsig\epsilon_{\rm sig} is approximately equal to SNRoutput/SNRinput\mathrm{SNR}_{\mathrm{output}}/\mathrm{SNR}_{\mathrm{input}} as σPn\sigma_{P_{n}} for input and output are practically equal JHEP . The frequency-independent scale factor ξ\xi is used to offset the bin-to-bin correlations that are introduced from background subtraction HAYSTAC ; JHEP . This value is set to the width of the Gaussian fit for the normalized power excess of the output. In terms of input, the normalized power excess’ Gaussian width should be equal to one.

We will first proceed to discuss how to obtain the value of ϵsig\epsilon_{\rm sig}. Using an SG filter as a background estimate, it is possible to construct a signal following equation (1) on a flat background that does not include any noise fluctuations. The input signal in units of arbitrary power excess will be distributed into nc=Δνa/Δνn_{c}=\Delta\nu_{a}/\Delta\nu frequency bins that have a resolution bandwidth Δν\Delta\nu, assuming that the first bin’s frequency is equal to νa\nu_{a}. The input signal power in the kkth bin is

δk=δaLk=δaνa+(k12)Δννa+(k+12)Δνf(ν)dν,\delta_{k}=\delta_{a}L_{k}=\delta_{a}\int_{\nu_{a}+\left(k-\frac{1}{2}\right)\Delta\nu}^{\nu_{a}+\left(k+\frac{1}{2}\right)\Delta\nu}f(\nu)\differential\nu, (3)

where δa\delta_{a} is a multiplicative constant that represents PaaγγP_{a}^{a\gamma\gamma} and LkL_{k} are the fractions of axion signal power stored into the kkth bin. While our SG filter is designed to describe the background only, it will be applied to the set of δk\delta_{k} (the red line in the left panel of Fig. 2) to parametrize the spectrum containing the signal as well as the flat background. As also seen in the left panel of Fig. 2, the smoothing from the filter, shown as orange dashed lines, does not fit the background properly around the signal region when one is present. The exact response of the SG filter is

δ^k=k=WWckδk+k,\hat{\delta}_{k}=\sum_{k^{\prime}=-W}^{W}c_{k^{\prime}}\delta_{k+k^{\prime}}, (4)

with kck=1\sum_{k}c_{k}=1. δ^k\hat{\delta}_{k} is the SG filter’s estimate of the background for the cases considered in this work. What is considered to be the signal power after background subtraction will be δk=δkδ^k\delta^{\prime}_{k}=\delta_{k}-\hat{\delta}_{k} (the blue line in the left panel of Fig. 2). The coefficients ckc_{k} only depend on the SG filter parameters, window length 2W+12W+1 and polynomial order dd. As kck=1\sum_{k}c_{k}=1, an estimation done with the SG filter acts like a weighted average. The same 2W+12W+1 values are applied to the coefficients when obtaining each separate δ^k\hat{\delta}_{k} with different kk. A few examples of SG filter coefficients depending on the parameters can be seen in the right panel of Fig. 2. The coefficients can be obtained from its definition in the original paper SGFILTER , and it can be fit using a ddth-order polynomial (for even dd). It is also easy to obtain ckc_{k} in SG filter packages in programming languages PYTHON .

Refer to caption
Refer to caption
Figure 2: Left panel shows the power excess of an axion signal at 1.096 GHz before and after applying an SG filter (2W+1=4012W+1=401, d=4d=4). The power excess is in arbitrary units (a.u.) and the peak value of the lineshape was set to one. Right panel shows the SG filter coefficients ckc_{k} for different WW and dd. For any |k|>W\absolutevalue{k}>W, ckc_{k} is not defined and set to zero in any relations using it.

δk\delta_{k} and δk\delta_{k}^{\prime} are combined according to the analysis procedure HAYSTAC . Following this all further combination processes use inverse-variance weighting. For any stage of combination, this means that a combined power δcombined\delta^{\mathrm{combined}} and its error σcombined\sigma^{\mathrm{combined}} will follow

δcombined\displaystyle\delta^{\mathrm{combined}} =kwkδkbeforekwk\displaystyle=\frac{\sum_{k}w_{k}\delta_{k}^{\mathrm{before}}}{\sum_{k}w_{k}} (5)
σcombined\displaystyle\sigma^{\mathrm{combined}} =kwk2(σkbefore)2kwk\displaystyle=\frac{\sqrt{\sum_{k}w_{k}^{2}(\sigma_{k}^{\mathrm{before}})^{2}}}{\sum_{k}w_{k}}

for all relevant powers δkbefore\delta_{k}^{\mathrm{before}} and their errors σkbefore\sigma_{k}^{\mathrm{before}} used for the combination. In the presence of σkbefore\sigma_{k}^{\mathrm{before}}, the weights wk=(σkbefore)2w_{k}=(\sigma_{k}^{\mathrm{before}})^{-2}, except during the coadding process to obtain the grand spectrum. In this case wk=(σkbefore)2Lkw_{k}=(\sigma_{k}^{\mathrm{before}})^{-2}L_{k}.

Now we will obtain the SNR ratio through the axion signal power ratio: ϵsig=δoutput/δinput\epsilon_{\rm sig}=\delta_{\rm output}/\delta_{\rm input}. As the SG filter response of each δk\delta_{k} is up to the multiplicative constant δa\delta_{a}, its effects cancel out in the value of ϵsig\epsilon_{\rm sig}. Therefore the value of ϵsig\epsilon_{\rm sig} will not change when signal power is added in a way that changes δa\delta_{a} by the same ratio for both input and output, such as vertical combination. This was confirmed to be in agreement with large-statistic simulations which do include vertical combination and will be further discussed in Sec. 4. Rebinning, or equivalently RBW reduction, however, does affect the results and will be discussed in Sec. 3. If this process is not used, the only leftover contribution to the analysis process is coadding bins. We can now use equation (5) with δkbefore=δk\delta_{k}^{\rm before}=\delta_{k} or δk\delta^{\prime}_{k} and wk=Lkw_{k}=L_{k}. For the particular bin that coincides with the axion frequency, the signal power after coadding ncn_{c} bins is

δinput=k=0nc1Lkδkk=0nc1Lk=δak=0nc1Lk2k=0nc1Lk\delta_{\rm input}=\frac{\sum_{k=0}^{n_{c}-1}L_{k}\delta_{k}}{\sum_{k=0}^{n_{c}-1}L_{k}}=\frac{\delta_{a}\sum_{k=0}^{n_{c}-1}L_{k}^{2}}{\sum_{k=0}^{n_{c}-1}L_{k}} (6)

and

δoutput=k=0nc1Lkδkk=0nc1Lk=δa(k=0nc1Lk2i=0nc1j=0nc1cijLiLj)k=0nc1Lk\delta_{\rm output}=\frac{\sum_{k=0}^{n_{c}-1}L_{k}\delta_{k}^{\prime}}{\sum_{k=0}^{n_{c}-1}L_{k}}=\frac{\delta_{a}\left(\sum_{k=0}^{n_{c}-1}L_{k}^{2}-\sum_{i=0}^{n_{c}-1}\sum_{j=0}^{n_{c}-1}c_{i-j}L_{i}L_{j}\right)}{\sum_{k=0}^{n_{c}-1}L_{k}} (7)

resulting in

ϵsig=SNRoutputSNRinputδoutputδinput=1i=0nc1j=0nc1cijLiLjk=0nc1Lk2\epsilon_{\rm sig}=\frac{\mathrm{SNR}_{\rm output}}{\mathrm{SNR}_{\rm input}}\simeq\frac{\delta_{\rm output}}{\delta_{\rm input}}=1-\frac{\sum_{i=0}^{n_{c}-1}\sum_{j=0}^{n_{c}-1}c_{i-j}L_{i}L_{j}}{\sum_{k=0}^{n_{c}-1}L_{k}^{2}} (8)

where if |ij|>W\absolutevalue{i-j}>W, cij=0c_{i-j}=0. As ϵsig\epsilon_{\rm sig} does not take any noise fluctuations into account it does not carry any error. Hence, this can be used as an exact value instead of an estimate. The value of ϵsig\epsilon_{\rm sig} depending on WW is shown in the right panel of Fig. 3.

Refer to caption
Refer to caption
Figure 3: Left panel shows the input and output coadded axion signal power at 1.096 GHz. ϵsig\epsilon_{\rm sig} is calculated when Δνa=4050\Delta\nu_{a}=4050 Hz and Δν=50\Delta\nu=50 Hz at the bin coinciding with the axion frequency νa=1.096\nu_{a}=1.096 GHz, which is where the spectrum shows maximum power excess. Right panel shows the calculated ϵsig\epsilon_{\rm sig} value at the axion frequency. The SG filter window is also represented with a normalized value C=(2W+1)/(Δνa/Δν)C=(2W+1)/(\Delta\nu_{a}/\Delta\nu).

The remaining factor in ϵSNR\epsilon_{\rm SNR}, ξ\xi, will be obtained next. For this calculation there will be no axion signal, and only the background is considered. This time the inputs will be multiple independent random variables xkx_{k} that follow a Gaussian distribution 𝒩(x¯k,σxk2)\mathcal{N}\left(\bar{x}_{k},\sigma_{x_{k}}^{2}\right). The normalized power is

Xk=xkσxkX_{k}=\frac{x_{k}}{\sigma_{x_{k}}} (9)

making σXk\sigma_{X_{k}} one. The estimated background using the SG filter for XkX_{k} is

X^k=k=WWckXk+k.\hat{X}_{k}=\sum_{k^{\prime}=-W}^{W}c_{k^{\prime}}X_{k+k^{\prime}}. (10)

The power excess is Xk=XkX^kX^{\prime}_{k}=X_{k}-\hat{X}_{k}. Ideally the estimate X^k\hat{X}_{k} is equal to X¯k=x¯k/σxk\bar{X}_{k}=\bar{x}_{k}/\sigma_{x_{k}} as XkX_{k} follows an 𝒩(X¯k,σXk2)\mathcal{N}\left(\bar{X}_{k},\sigma_{X_{k}}^{2}\right) distribution. It was observed 12TB-PRD and is known that the standard deviation of XkX^{\prime}_{k} depends on WW and is reduced by

ξSG=1b2W+1,\xi_{\rm SG}=1-\frac{b}{2W+1}, (11)

where b=9/8b=9/8 for d=2d=2 and b=225/128b=225/128 for d=4d=4 if WW is large enough SGFILTER-PROPERTIES . Therefore XkX^{\prime}_{k} must be further divided by ξSG\xi_{\rm SG} to make it follow an 𝒩(0,σXk2)\mathcal{N}\left(0,\sigma_{X^{\prime}_{k}}^{2}\right) distribution with σXk=1\sigma_{X^{\prime}_{k}}=1. The normalized power excess for SG filter estimated data is

Xk=1ξSG(Xkk=WWckXk+k).X^{\prime}_{k}=\frac{1}{\xi_{\rm SG}}\left(X_{k}-\sum_{k^{\prime}=-W}^{W}c_{k^{\prime}}X_{k+k^{\prime}}\right). (12)

The use of normalized power excess to obtain ξ\xi is valid as it is independent of the background HAYSTAC ; JHEP and so are the coefficients ckc_{k}. The exception to this would be when background subtraction is done poorly or is biased, which will be explored in Sec. 4. Here we will continue with the assumption that X^k\hat{X}_{k} is an unbiased estimate of X¯k\bar{X}_{k}. Similar to the discussion from before, a vertical combination combines different raw spectra, so the random variables that are combined must be independent, and therefore it does not affect the distribution of XkX_{k} nor XkX^{\prime}_{k}.

Now equation (5) is used again with δkbefore=Xk\delta_{k}^{\rm before}=X_{k} or XkX^{\prime}_{k} and wk=Lkw_{k}=L_{k} with σkbefore=σXk\sigma_{k}^{\rm before}=\sigma_{X_{k}} or σXk\sigma_{X^{\prime}_{k}}. The normalized input noise at the axion frequency corresponding to k=0k=0 after coadding is

Xcoadded=k=0nc1LkXkk=0nc1Lk2,X_{\rm coadded}=\frac{\sum_{k=0}^{n_{c}-1}L_{k}X_{k}}{\sqrt{\sum_{k=0}^{n_{c}-1}L_{k}^{2}}}, (13)

which has a standard deviation of one as expected from ncn_{c} independent random variables. Coadding the set of XkX^{\prime}_{k} we have

Xcoadded=k=0nc1LkXkk=0nc1Lk2=1ξSGi=WW+nc1CiXik=0nc1Lk2.X_{\rm coadded}^{\prime}=\frac{\sum_{k=0}^{n_{c}-1}L_{k}X^{\prime}_{k}}{\sqrt{\sum_{k=0}^{n_{c}-1}L_{k}^{2}}}=\frac{1}{\xi_{\rm SG}}\frac{\sum_{i=-W}^{W+n_{c}-1}C_{i}X_{i}}{\sqrt{\sum_{k=0}^{n_{c}-1}L_{k}^{2}}}. (14)

XkX^{\prime}_{k} are no longer independent to each other since each can include overlapping terms that can range from XWX_{-W} to XW+nc1X_{W+n_{c}-1} according to equation (12). Therefore this equation is used to decouple XcoaddedX_{\rm coadded}^{\prime} into independent random variables XiX_{i}. The coefficient value in front of XiX_{i} is

Ci=Lik=0nc1cikLkC_{i}=L_{i}-\sum_{k=0}^{n_{c}-1}c_{i-k}L_{k} (15)

with the constraints Li=0L_{i}=0 for i<0i<0 or inci\geq n_{c}, ck=0c_{k}=0 for k<Wk<-W or k>Wk>W. The standard deviation of XcoaddedX_{\rm coadded}^{\prime}, which corresponds to the frequency independent scale factor ξ\xi, is

ξ=1ξSGi=WW+nc1Ci2k=0nc1Lk2.\xi=\frac{1}{\xi_{\rm SG}}\frac{\sqrt{\sum_{i=-W}^{W+n_{c}-1}C_{i}^{2}}}{\sqrt{\sum_{k=0}^{n_{c}-1}L_{k}^{2}}}. (16)

The value of ξ\xi for an SG filter with varying WW can be seen in Fig. 4.

Refer to caption
Figure 4: The calculated value of ξ\xi using an SG filter with d=4d=4. The LkL_{k} used for coadding is the same as those used to calculate ϵsig\epsilon_{\rm sig} above. The SG filter window is also represented with a normalized value C=(2W+1)/(Δνa/Δν)C=(2W+1)/(\Delta\nu_{a}/\Delta\nu).

It can be observed that both ϵsig\epsilon_{\rm sig} and ξ\xi become closer to unity as WW increases. This can be understood intuitively since the SG filter is an estimate for a background in the context of this raw spectrum. As WW becomes larger, the SG filter will tend to “ignore” the signal and therefore retain more of its power, without being too biased. On the other hand, for noise fluctuations when more points are taken into consideration, the bin-to-bin correlations are smaller as WW increases since the sum of all ckc_{k} is one, meaning that the contribution of each bin is smaller, as seen from the right panel of Fig. 2. This also tends to average out to X¯k\bar{X}_{k}.

3 Considering the effects of rebinning spectra

By merging nmn_{m} bins and increasing the RBW value to Δνmerged=nmΔν\Delta\nu^{\rm merged}=n_{m}\Delta\nu, the power spectrum undergoes a process called rebinning. This process combines the nmn_{m} bins according to equation (5). For the calculation of ϵsig\epsilon_{\rm sig}, the signal powers after rebinning with wk=1w_{k}=1 and δkbefore=δk\delta^{\rm before}_{k}=\delta_{k} are

δkmerged=iδinm,\displaystyle\delta_{k}^{\rm merged}=\frac{\sum_{i}\delta_{i}}{n_{m}}, (17)
δkmerged=iδinm,\displaystyle\delta_{k}^{\prime\rm merged}=\frac{\sum_{i}\delta^{\prime}_{i}}{n_{m}},

which are the averages of the signal powers over nmn_{m} bins.

A coadded bin after rebinning will coadd nc/nmn_{c}/n_{m} bins, with the weights used for coadding wk=Lkmergedw_{k}=L_{k}^{\rm merged} following

Lkmerged=νa+(k12)Δνmergedνa+(k+12)Δνmergedf(ν)dν.L_{k}^{\rm merged}=\int_{\nu_{a}+\left(k-\frac{1}{2}\right)\Delta\nu^{\rm merged}}^{\nu_{a}+\left(k+\frac{1}{2}\right)\Delta\nu^{\rm merged}}f(\nu)\differential\nu. (18)

Obtaining ϵsig\epsilon_{\rm sig} is straightforward in the sense that the rebinning is applied to each δk\delta_{k}, then the rebinned bins are coadded.

δinputmerged\displaystyle\delta^{\rm merged}_{\rm input} =k=0nc/nm1Lkmergedδkmergedk=0nc/nm1Lkmerged\displaystyle=\frac{\sum_{k=0}^{n_{c}/n_{m}-1}L_{k}^{\rm merged}\delta_{k}^{\rm merged}}{\sum_{k=0}^{n_{c}/n_{m}-1}L_{k}^{\rm merged}} (19)
δoutputmerged\displaystyle\delta^{\rm merged}_{\rm output} =k=0nc/nm1Lkmergedδkmergedk=0nc/nm1Lkmerged\displaystyle=\frac{\sum_{k=0}^{n_{c}/n_{m}-1}L_{k}^{\rm merged}\delta_{k}^{\prime\rm merged}}{\sum_{k=0}^{n_{c}/n_{m}-1}L_{k}^{\rm merged}}

and ϵsigmerged=δoutputmerged/δinputmerged\epsilon^{\rm merged}_{\rm sig}=\delta^{\rm merged}_{\rm output}/\delta^{\rm merged}_{\rm input}. For both signal input and output the resulting SNR values have been shown to decrease, as seen in ref. 12TB-PRL . While this is a drawback of rebinning, the number of rescan candidates is also reduced, which is an advantage that could be considered as a tradeoff 12TB-PRL ; 12TB-PRD .

Now the rebinned counterpart for noise, XkmergedX_{k}^{\rm merged} and XkmergedX_{k}^{\prime\mathrm{merged}}, will yield a modified ξmerged\xi^{\rm merged}. Using equation (16) and the fact XkX_{k} and XkX_{k}^{\prime} are combined in a linear fashion, the only part of the equation that needs modification is LkL_{k}, which must be expressed in terms of LkmergedL_{k}^{\rm merged}. For any nmn_{m} bins merged, XkmergedX_{k}^{\rm merged} and σXkmerged\sigma_{X_{k}^{\rm merged}} are as follows

Xkmerged\displaystyle X_{k}^{\rm merged} =iXinm,\displaystyle=\frac{\sum_{i}X_{i}}{n_{m}}, (20)
σXkmerged\displaystyle\sigma_{X_{k}^{\rm merged}} =1nm,\displaystyle=\frac{1}{\sqrt{n_{m}}},

where the total number of ii is nmn_{m}. The same applies for XkmergedX_{k}^{\prime\rm merged}. All nmn_{m} merged bins will now be coadded with the same LkmergedL_{k}^{\rm merged} value instead of the nmn_{m} different LkL_{k} values (e.g., L0L_{0}, L1L_{1}, \cdots, Lnm1L_{n_{m}-1}) from before. The coadded and normalized values are

𝒳coadded\displaystyle\mathcal{X}_{\rm coadded} =XcoaddedσXcoadded=nmk=0nc/nm1LkmergedXkmergedk=0nc/nm1(Lkmerged)2=k=0nc1Lk/nmmergedXkk=0nc1(Lk/nmmerged)2,\displaystyle=\frac{X_{\rm coadded}}{\sigma_{X_{\rm coadded}}}=\frac{\sqrt{n_{m}}\sum_{k=0}^{n_{c}/n_{m}-1}L^{\rm merged}_{k}X^{\rm merged}_{k}}{\sqrt{\sum_{k=0}^{n_{c}/n_{m}-1}\left(L^{\rm merged}_{k}\right)^{2}}}=\frac{\sum_{k=0}^{n_{c}-1}L^{\rm merged}_{\lfloor{k/n_{m}}\rfloor}X_{k}}{\sqrt{\sum_{k=0}^{n_{c}-1}\left(L^{\rm merged}_{\lfloor{k/n_{m}}\rfloor}\right)^{2}}}, (21)
𝒳coadded\displaystyle\mathcal{X}^{\prime}_{\rm coadded} =XcoaddedσXcoadded=k=0nc1Lk/nmmergedXkk=0nc1(Lk/nmmerged)2=1ξSGi=WW+nc1CimergedXik=0nc1(Lk/nmmerged)2,\displaystyle=\frac{X^{\prime}_{\rm coadded}}{\sigma_{X^{\prime}_{\rm coadded}}}=\frac{\sum_{k=0}^{n_{c}-1}L^{\rm merged}_{\lfloor{k/n_{m}}\rfloor}X^{\prime}_{k}}{\sqrt{\sum_{k=0}^{n_{c}-1}\left(L^{\rm merged}_{\lfloor{k/n_{m}}\rfloor}\right)^{2}}}=\frac{1}{\xi_{\rm SG}}\frac{\sum_{i=-W}^{W+n_{c}-1}C^{\rm merged}_{i}X_{i}}{\sqrt{\sum_{k=0}^{n_{c}-1}\left(L^{\rm merged}_{\lfloor{k/n_{m}}\rfloor}\right)^{2}}},

where the floor function x\lfloor{x}\rfloor returns the largest integer smaller or equal to xx. 𝒳coadded\mathcal{X}_{\rm coadded} has a standard deviation of one as expected. For 𝒳coadded\mathcal{X}^{\prime}_{\rm coadded}, the standard deviation, or equivalently, the frequency independent scale factor after rebinning ξmerged\xi^{\rm merged} is obtained as

ξmerged=1ξSGi=WW+nc1(Cimerged)2k=0nc1(Lk/nmmerged)2\xi^{\rm merged}=\frac{1}{\xi_{\rm SG}}\frac{\sqrt{\sum_{i=-W}^{W+n_{c}-1}(C_{i}^{\rm merged})^{2}}}{\sqrt{\sum_{k=0}^{n_{c}-1}(L^{\rm merged}_{\lfloor{k/n_{m}}\rfloor})^{2}}} (22)

and the relevant results are shown in table 1. CimergedC_{i}^{\rm merged} is also different from CiC_{i} since it also includes LkmergedL^{\rm merged}_{k} as

Cimerged=Li/nmmergedk=0nc1cikLk/nmmerged.C_{i}^{\rm merged}=L^{\rm merged}_{\lfloor{i/n_{m}}\rfloor}-\sum_{k=0}^{n_{c}-1}c_{i-k}L^{\rm merged}_{\lfloor{k/n_{m}}\rfloor}. (23)

As shown in equation (21), the rebinned set of XkmergedX^{\prime\rm merged}_{k} in equation (21) is separable into XkX^{\prime}_{k} using equation (20) and ultimately XkX_{k} via equation (12). By unpacking the merged variables fully into independent random variables, the calculation of ξmerged\xi^{\rm merged} in equation (22) becomes as straightforward as ξ\xi in equation (16).

Putting the effects of rebinning from both sides together, the SNR efficiency ϵSNRmerged=ϵsigmerged/ξmerged\epsilon_{\rm SNR}^{\rm merged}=\epsilon^{\rm merged}_{\rm sig}/\xi^{\rm merged} is smaller than ϵSNR\epsilon_{\rm SNR}, which is a separate effect from the aforementioned SNR decrease reported in ref. 12TB-PRL . This is later shown in table 1 (ϵSNR\epsilon_{\rm SNR} and ϵSNRmerged\epsilon_{\rm SNR}^{\rm merged}) and Fig. 6 (blue and orange lines), although the difference is smaller for large WW. While ϵsigmerged<ϵsig\epsilon^{\rm merged}_{\rm sig}<\epsilon_{\rm sig} and ξmerged<ξ\xi^{\rm merged}<\xi, the difference between ϵsig\epsilon_{\rm sig} and its rebinned counterpart is larger than the change in ξ\xi, leading to a decrease in SNR efficiency after rebinning.

4 Comparison with large-statistic simulations

So far the value of ϵSNR\epsilon_{\rm SNR} has been obtained given νa\nu_{a}, Δνa\Delta\nu_{a}, and Δν\Delta\nu. In order to see if this agrees with previously used large-statistic simulations, an axion signal was inserted by software using realistic CAPP-12TB background data. The input CAPP-12TB background shapes use the function and its parameters that were obtained from the χ2\chi^{2} fit in the original data analysis 12TB-PRL . Another background was constructed for comparison as well, which used the same parameters except for one that described the cavity bandwidth, which was multiplied by 20. This ensured that the structure of the cavity was not seen in this background, and instead was a “smooth” one. Both backgrounds are compared in Fig. 5. Each simulation was done 10000 times with pseudo-random-generated noise to obtain a sufficient amount of statistics, with an axion signal injected near 1.096 GHz. We will denote the results for the two types of backgrounds as “12TB” and “smooth,” respectively, e.g., ϵSNR12TB\epsilon_{\rm SNR}^{\rm 12TB} or ξsmooth\xi^{\rm smooth}. Since the simulations follow the CAPP-12TB analysis parameters, there is rebinning. Hence the relevant values should be compared with ϵSNRmerged\epsilon_{\rm SNR}^{\rm merged} and ξmerged\xi^{\rm merged}.

Refer to caption
Figure 5: A comparison between the backgrounds used for large-statistic simulations.

Compared to the time and disk space needed for large-statistic simulations, obtaining the values analytically for the same parameters is practically interactive, taking less than a minute while using only computer memory. The results can be compared in table 1 and Fig. 6.

Analytical Simulation Analytical Simulation
2W+12W+1 ξ\xi ξmerged\xi^{\rm merged} ξsmooth\xi^{\rm smooth} ξ12TB\xi^{\rm 12TB} ϵSNR\epsilon_{\rm SNR} ϵSNRmerged\epsilon_{\rm SNR}^{\rm merged} ϵSNRsmooth\epsilon_{\rm SNR}^{\rm smooth} ϵSNR12TB\epsilon_{\rm SNR}^{\rm 12TB}
201 0.6948 0.6567 0.6540 0.6537 0.7145 0.6779 0.6823 0.6819
401 0.8510 0.8328 0.8314 0.8315 0.8525 0.8356 0.8388 0.8342
601 0.9028 0.8912 0.8910 0.9019 0.9015 0.8905 0.8919 0.8543
801 0.9286 0.9196 0.9198 0.9849 0.9262 0.9180 0.9193 0.7840
1001 0.9429 0.9362 0.9371 1.1285 0.9410 0.9346 0.9353 0.6261
Table 1: A comparison of various ϵ\epsilon and ξ\xi obtained from analytic calculation or simulations. The polynomial order for all SG filters was d=4d=4. As the simulation results include rebinning, ξsmooth\xi^{\rm smooth} and ξ12TB\xi^{\rm 12TB} should be compared with ξmerged\xi^{\rm merged}, while ϵSNRsmooth\epsilon_{\rm SNR}^{\rm smooth} and ϵSNR12TB\epsilon_{\rm SNR}^{\rm 12TB} should be compared with ϵSNRmerged\epsilon_{\rm SNR}^{\rm merged}. Efficiency results on the simulation side include a 1% error. The results do not consider the additional 5% loss in SNR that occurs due to rebinning, i.e., SNRinput\mathrm{SNR}_{\mathrm{input}} and SNRoutput\mathrm{SNR}_{\mathrm{output}} are both at the Δν=450\Delta\nu=450 Hz level when calculating ϵSNRmerged\epsilon_{\rm SNR}^{\rm merged}, ϵSNRsmooth\epsilon_{\rm SNR}^{\rm smooth}, and ϵSNR12TB\epsilon_{\rm SNR}^{\rm 12TB}.
Refer to caption
Figure 6: A comparison of the calculated ϵSNR\epsilon_{\rm SNR} values and large-statistic simulation ϵSNR\epsilon_{\rm SNR} values for an SG filtered axion signal at νa=1.096\nu_{a}=1.096 GHz with Δνa=4050\Delta\nu_{a}=4050 Hz. The blue line represents ϵSNR\epsilon_{\rm SNR} after coadding at Δν=50\Delta\nu=50 Hz and the orange line represents ϵSNR\epsilon_{\rm SNR} after rebinning with nm=9n_{m}=9 bins merged (Δνmerged=450\Delta\nu^{\rm merged}=450 Hz) and then coadding. The green and red dots are results from simulations with different backgrounds and comes with 1% error (10000 averages). The SG filter window is also represented by a normalized value C=(2W+1)/(Δνa/Δν)C=(2W+1)/(\Delta\nu_{a}/\Delta\nu).

As seen from the simulation results, a smooth background matches the rebinned analytical calculations well, but the 12TB background does not. This is expected from ref. SGFILTER-PROPERTIES , i.e., as WW increases the background estimation begins to show bias for the 12TB background shown in Fig. 5, similar to the SG filter’s tendency to “ignore” the axion signal, as mentioned earlier. Looking at ξ12TB\xi^{\rm 12TB}, ξsmooth\xi^{\rm smooth}, and ξmerged\xi^{\rm merged} in table 1, this bias propagates to the grand spectrum and becomes evident from 2W+1=8012W+1=801. Furthermore, the bias even shows a positive correlation when 2W+1=10012W+1=1001, which has also been noted in HAYSTAC . In terms of ϵSNR\epsilon_{\rm SNR} in Fig. 6, which combines the effects of ϵsig\epsilon_{\rm sig} and ξ\xi, we can see the bias become evident from 2W+1=6012W+1=601.

Refer to caption
Figure 7: 100 averaged normalized power excesses for 2W+1=4012W+1=401, d=4d=4 and 2W+1=10012W+1=1001, d=4d=4 for the CAPP-12TB background.

The bias shown in the normalized power excess in the CAPP-12TB background suggests that there is a limit to using larger WW for SG filters and this can affect both ϵSNR\epsilon_{\rm SNR} and ξ\xi in an undesirable manner. Using the analytical calculations can work as a time-efficient way to obtain the SNR efficiency and a predictor for ξ\xi during data analysis. If the resulting ξ\xi from data analysis is noticeably different compared to its predicted value, this could indicate that there is bias in background subtraction. As seen in Fig. 7, there is a pattern in the bias that comes from the cavity response in the CAPP-12TB background. The bias is not always obvious as even the 2W+1=4012W+1=401 estimate shows a slight bump near the center of the normalized power excess, but only after one hundred different power spectra have been averaged. While this ultimately did not dramatically affect the results in table 1, there is a certain upper limit in WW which could be used before it becomes inaccurate. This, however, does not suggest a limitation of this analytical method but rather acts as a confidence check for background subtraction to look out for instances such as the one shown in Fig. 7. The verification of proper background subtraction is necessary in the analysis procedure not only for the correct estimation of SNR but also for obtaining unbiased power excesses.

In table 2, as further validation, two other experiments that used the SG filter for background subtraction and provided the required parameters (νa\nu_{a}, Δν\Delta\nu, WW, dd, nmn_{m}) have their ξ\xi and ϵSNR\epsilon_{\rm SNR} values compared with the analytical method.

Experiment νa\nu_{a} Δν\Delta\nu WW dd nmn_{m} ξ\xi^{\dagger} ξ\xi^{\dagger\dagger} ϵSNR\epsilon_{\rm SNR}^{\dagger} ϵSNR\epsilon_{\rm SNR}^{\dagger\dagger}
HAYSTAC HAYSTAC 5.75 GHz 100 Hz 500 4 10 0.93 0.919 0.90 0.909
CAST-CAPP CAST-CAPP 5.027 GHz 50 Hz 500 4 28 0.74 0.765 0.717 0.716
Table 2: Comparisons of reported values from other experiments (ξ\xi^{\dagger} and ϵSNR\epsilon_{\rm SNR}^{\dagger}) with their analytically calculated values from this work (ξ\xi^{\dagger\dagger} and ϵSNR\epsilon_{\rm SNR}^{\dagger\dagger}). HAYSTAC used an unboosted Maxwellian to obtain ξ\xi^{\dagger} and ϵSNR\epsilon_{\rm SNR}^{\dagger} in their paper, which was also considered in our calculations. The CAST-CAPP experiment initially reported W=1000W=1000 in their published paper, but the corresponding authors have confirmed that this was a typo and the actual parameter used was 2W+1=10012W+1=1001 (W=500W=500).

5 Summary

We report a fully analytical estimation of the signal to noise ratio efficiency ϵSNR\epsilon_{\rm SNR} in axion dark matter searches employing an SG filter for a background estimate. Provided the background estimation is properly done with appropriate SG filter parameters like the blue colored data shown in Fig. 7, this approach provides us with the ϵSNR\epsilon_{\rm SNR} interactively, using only an arbitrary axion mass and the relevant shape information, without relying on large-statistic simulation data, reflecting the background information and the expected axion signal power obtained from a real experiment. By comparing our ξ\xi or ξmerged\xi^{\rm merged} with the width of the normalized grand power spectrum from the experimental data, one can also quickly cross check the reasonableness of the error estimation as well as contamination of non-Gaussian components in the experimental data. Axion haloscope searches have been observing the coincidence that the frequency independent scale factor ξ\xi is approximately consistent with ϵSNR\epsilon_{\rm SNR}. This was confirmed analytically in this work, when the window length of the SG filter is reasonably wide enough, i.e., at least 5 times the signal window.

Acknowledgements.
This work is supported by the Institute for Basic Science (IBS) under Project Code No. IBS-R017-D1-2023-a00.

References

  • (1) P. A. R. Ade et al. (Planck Collaboration), Astron. Astrophys. 594 (2016) A13.
  • (2) V. Rubin and W. K. Ford Jr., ApJ 159 (1970) 379; Douglas Clowe et al., ApJ 648 (2006) L109.
  • (3) S. Weinberg, Phys. Rev. Lett. 40 (1978) 223; F. Wilczek, Phys. Rev. Lett. 40 (1978) 279.
  • (4) R. D. Peccei and H. R. Quinn, Phys. Rev. Lett. 38 (1977) 1440.
  • (5) G. ’t Hooft, Phys. Rev. Lett, 37 (1976) 8; Phys. Rev. D 14 (1976) 3432; 18 (1978) 2199(E); J. H. Smith, E. M. Purcell, and N. F. Ramsey, Phys. Rev. 108 (1957) 120; W. B. Dress, P. D. Miller, J. M. Pendlebury, P. Perrin, and N. F. Ramsey, Phys. Rev. D 15 (1977) 9; I. S. Altarev et al., Nucl. Phys. A341 (1980) 269.
  • (6) P. Sikivie, Phys. Rev. Lett. 51 (1983) 1415; Phys. Rev. D 32 (1985) 2988.
  • (7) N. Du et al. (ADMX Collaboration), Phys. Rev. Lett. 120 (2018) 151301; T. Braine et al. (ADMX Collaboration), Phys. Rev. Lett. 124 (2020) 101303; C. Bartram et al. (ADMX Collaboration), Phys. Rev. Lett. 127 (2021) 261803.
  • (8) Andrew K. Yi et al., Phys. Rev. Lett. 130 (2023) 071002.
  • (9) Howard Georgi and S. L. Glashow, Phys. Rev. Lett. 32 (1974) 438.
  • (10) L. Krauss, J. Moody, F. Wilczek, and D. E. Morris, Phys. Rev. Lett. 55 (1985) 1797.
  • (11) S. Ahn et al., JINST 17 (2022) P05025.
  • (12) R. H. Dicke, Rev. Sci. Instrum. 17 (1946) 268.
  • (13) S. J. Asztalos et al., Phys. Rev. D 64 (2001) 092003.
  • (14) B. M. Brubaker, L. Zhong, S. K. Lamoreaux, K. W. Lehnert, and K. A. van Bibber, Phys. Rev D 96 (2017) 123008.
  • (15) S. Ahn, S. Lee, J. Choi, B. R. Ko, and Y. K. Semertzidis, J. High Energ. Phys. 04 (2021) 297.
  • (16) Andrew K. Yi et al., Phys. Rev. D 108 (2023) L021304.
  • (17) M. S. Turner, Phys. Rev. D 42 (1990) 3572.
  • (18) S. Lee, S. Ahn, J. Choi, B. R. Ko, and Y. K. Semertzidis, Phys. Rev. Lett. 124 (2020) 101802.
  • (19) A. Savitzky and M. J. E. Golay, Anal. Chem. 36 (1964) 1627.
  • (20) Python Software Foundation, https://www.python.org/.
  • (21) M. Buschmann et al., Nat. Commun. 13 1049 (2022).
  • (22) H. Zielger, Appl. Spectroscopy, 35 (1981) 1.