This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Pixel domain multi-resolution minimum variance painting of CMB maps

Hao Liu
Abstract

This work introduces an unbiased minimum variance painting of the pixel domain CMB maps for both the missing and available sky regions. In the missing region, it generates CMB realizations that have identical statistical properties to the expected CMB signal; and in the available region, it significantly alleviate the unwanted residuals, e.g., the residual of EB-leakage. The time cost of this method follows \simO(Nside3)O(N_{side}^{3}) (NsideN_{side} is the map resolution parameter), which is similar to the fast spherical harmonic transforms and is much better than the traditional minimum variance method that follows \simO(Nside6)O(N_{side}^{6}). This method is the basis of a series of minimum variance estimations for analyzing CMB datasets, and more works based on it will follow in the future.

1 Introduction

In the measurement of cosmic microwave background (CMB) anisotropy, a few key problems were constantly studied from the beginning until the present day. For example, how to compute the full sky CMB angular power spectrum (APS) from various cosmological models and vice versa; and how to estimate the true CMB signal and APS from incomplete and contaminated sky maps. In this work, we focus on the latter, especially the estimation and reconstruction of pixel domain CMB signals, because it usually makes following CMB analysis much easier.

For the case of CMB temperature, the best unbiased solution of this kind of problem was given by [1], which is usually referred to as the fisher estimator, a method that finds the CMB APS with the maximum likelihood condition. It was also mentioned in [1], that the maximum likelihood and minimum variance conditions are equivalent for the Gaussian isotropic CMB signal; thus, one is free to start from either side.

The main problem of the fisher estimator is the computational cost. According to the tests in [2], a full implementation of the fisher estimator at a relatively low resolution (Nside=64N_{side}=64) of the HEALPix pixelization scheme [3] already requires about 4×1054\times 10^{5} CPU hours and 19 TB memory, which is a heavy task even for a super cluster. For higher resolutions, the time cost scales as O(Nside6)O(N_{side}^{6}); thus, the computation is essentially impossible. Therefore, many alternative methods were developed. Some of them keep pursuing the maximum likelihood (or minimum variance), but manage to do the job in a faster way ([4, 5, 6, 7, 8, 9, 10], and a review can be found in section 5 of [11]); and some others choose to accept sub-optimal results, but as an exchange, the time cost can be significantly reduced, e.g., the pseudo-CC_{\ell} method [12, 13] and the Spice estimator [14]. Overall speaking, the former still requires to use a super cluster, especially for the case of higher resolutions or higher-\ells; and the latter is the currently most practical way to estimate the high-\ell CMB APS, but the quality of the result is relatively lower, and especially, it cannot deal with a pixel-domain estimation.

For the case of polarization, a key problem that hinders the minimum variance solution is the leakage from the E-mode polarizations to the B-mode polarizations, called the EB-leakage [15, 16]. However, in our recent work [17], it was shown how to explicitly correct the EB-leakage in the pixel domain, and with some clearly defined preconditions, the results are proven to be the best ones. Thus, it is about time to design a series of minimum variance solution for both the temperature and polarization, and from map to the final cosmological parameters. Given the solution of EB-leakage, the most urgent problem at the moment is to greatly reduce the computational cost of the pixel domain minimum variance estimation, which is the basis of all following works in this series, and will be the main goal of this work.

Meanwhile, by greatly reducing the time cost of pixel domain minimum variance estimation, several interesting possibilities are also turned on, especially for studying the large-scale and low-\ell problems. For example, it will become possible to change the sky mask pixel-by-pixel to investigate the impact upon all kinds of large scale anomalies, in order to find out a possible association to a specific sky region.

The outline of this work is as follows: In section 2, we explain some mathematical basis of the method, and the technical details of implementation are introduced in section 3. The implementation and performance are tested and illustrated in section 4, and a conclusion and discussion is given in section 5.

2 The mathematical basis

The multi-resolution minimum variance painting (MrMVP) is based on the ideal of constrained in-painting by [18], but some similar ideas can also be found in [19, 20, 21]. Basically, assume we have an available dataset 𝑿=𝑺+𝑵\bm{X}=\bm{S}+\bm{N}, where 𝑺\bm{S} is the true signal and 𝑵\bm{N} is the noise. For convenience, here 𝑵\bm{N} includes all unwanted components like the noise, systematics and various residuals. The main goal of painting is to estimate 𝑺\bm{S} from 𝑿\bm{X} in both the missing and available regions, including the expectation and constrained realizations. Customarily, when we focus on the reconstruction of the missing region, the operation is called an in-painting.

A basic in-painting method only focus on estimating the expectation in the missing region, like the diffusive in-paint method [22, 23]; whereas a complete in-painting method will consider the expectation and constrained realizations at the same time, like the one in [18]. With the well known theory of multi-variant minimum variance regression (for example, see appendix A of [17]), if we only want to solve for the expectation, then the result is given by the following unbiased minimum variance solution:

𝑺~\displaystyle\bm{\widetilde{S}} =\displaystyle= 𝑷𝑸1𝑿,\displaystyle\bm{PQ}^{-1}\bm{X}, (2.1)

where 𝑺~\bm{\widetilde{S}} is the pixel domain expectation of 𝑺\bm{S} and 𝑷=𝑿𝑺T\bm{P}=\langle\bm{XS}^{T}\rangle, 𝑸=𝑿𝑿T\bm{Q}=\langle\bm{XX}^{T}\rangle are the covariance matrices of 𝑿\bm{X}-with-𝑺\bm{S} and 𝑿\bm{X}-with-𝑿\bm{X}, respectively. If we further assume 𝑺\bm{S} and 𝑵\bm{N} are uncorrelated (which is usually true), then the above equation becomes

𝑷=𝑺𝑺T,𝑸=𝑺𝑺T+𝑵𝑵T,\displaystyle\bm{P}=\langle\bm{SS}^{T}\rangle,~{}~{}\bm{Q}=\langle\bm{SS}^{T}\rangle+\langle\bm{NN}^{T}\rangle, (2.2)

and eq. (2.1) becomes the well known Wiener filter. Note that in eq. (2.1), if we are dealing with the missing region, then 𝑿\bm{X} and 𝑺\bm{S} belong to different regions, but 𝑷\bm{P} is not zero, which makes the estimation of the missing region possible.

It is important to emphasis that 𝑺~\bm{\widetilde{S}} is an “expectation” but not a “realization”; such that the true signal 𝑺\bm{S} is always different to 𝑺~\bm{\widetilde{S}}. In fact, in the view of statistics, the true signal 𝑺\bm{S} is included as a member of the ensemble {𝑺i}\{\bm{S}_{i}\} containing all possible realizations, and 𝑺\bm{S} is not superior compared to any other members. Therefore, the next immediate question is how to generate the constrained ensemble {𝑺i}\{\bm{S}_{i}\}, which is important because 𝑺i\bm{S}_{i}, as fullsky realizations of the CMB signal, make many estimations extremely convenient, as also mentioned in [18].

The generation of ensemble {𝑺i}\{\bm{S}_{i}\} is based on the following constraint and prior conditions:

  1. 1.

    Constraint: 𝑺i\bm{S}_{i} has to be consistent with the actual dataset 𝑿\bm{X}.

  2. 2.

    Prior condition: In case of CMB, 𝑺i\bm{S}_{i} should be Gaussian and isotropic.

  3. 3.

    Prior condition: 𝑷\bm{P} and 𝑸\bm{Q} are determined either by the prior APS or by an iterative solution.

With the above constraint and prior conditions, the ensemble {𝑺i}\{\bm{S}_{i}\} can be generated from the key equation provided in [24, 18]. For convenience of reading, it is retyped below using the above notations:

𝑺i=𝑺~+𝒔i𝑷𝑸1𝒙i,\displaystyle\bm{S}_{i}=\bm{\widetilde{S}}+\bm{s}_{i}-\bm{PQ}^{-1}\bm{x}_{i}, (2.3)

where 𝑺i\bm{S}_{i} is a constrained realization; 𝒔i\bm{s}_{i} is the ii-th unconstrained fullsky realization generated with the prior conditions 23, 𝒙i\bm{x}_{i} is 𝒔i\bm{s}_{i} plus noise, and 𝑺~\bm{\widetilde{S}} is the expectation given by 2.1 that carries the constraint 1. The physical meaning of the above equation is: a properly constrained realization is the expectation 𝑺~\bm{\widetilde{S}} solved from the real data, plus the difference between an unconstrained realization 𝒔i\bm{s}_{i} and the expectation 𝑷𝑸1𝒙i\bm{PQ}^{-1}\bm{x}_{i} solved from 𝒔i\bm{s}_{i}.

As explained in [1], the CMB statistical properties are fully described by its covariance matrix; thus, in order to prove 𝑺i\bm{S}_{i} is a properly constrained CMB realization, we need to show that it has the same covariance matrix with the expected CMB signal, i.e., it is statistically indistinguishable with a true CMB signal. The mathematical proof of this point is given as follows: With eq. (2.3), the covariance matrix of 𝑺i\bm{S}_{i} is

𝑺i𝑺it=\displaystyle\langle\bm{S}_{i}\bm{S}_{i}^{t}\rangle= (𝑺~+𝒔i𝑴𝒙i)(𝑺~+𝒔i𝑴𝒙i)t\displaystyle\langle(\bm{\widetilde{S}}+\bm{s}_{i}-\bm{M}\bm{x}_{i})\cdot(\bm{\widetilde{S}}+\bm{s}_{i}-\bm{M}\bm{x}_{i})^{t}\rangle (2.4)
=\displaystyle= 𝑺~𝑺~t+𝒔i𝒔it+𝑴𝒙i𝒙it𝑴t𝑴𝒙i𝒔it𝒔i𝒙it𝑴t\displaystyle\langle\bm{\widetilde{S}}\bm{\widetilde{S}}^{t}\rangle+\langle\bm{s}_{i}\bm{s}_{i}^{t}\rangle+\bm{M}\langle\bm{x}_{i}\bm{x}_{i}^{t}\rangle\bm{M}^{t}-\bm{M}\langle\bm{x}_{i}\bm{s}_{i}^{t}\rangle-\langle\bm{s}_{i}\bm{x}_{i}^{t}\rangle\bm{M}^{t}
=\displaystyle= 𝑴𝑸𝑴t+𝑷+𝑴𝑸𝑴t𝑴𝑷𝑷𝑴t\displaystyle\bm{M}\bm{Q}\bm{M}^{t}+\bm{P}+\bm{M}\bm{Q}\bm{M}^{t}-\bm{M}\bm{P}-\bm{P}\bm{M}^{t}
=\displaystyle= 𝑷+2𝑴𝑸𝑴t𝑴𝑷𝑷𝑴t\displaystyle\bm{P}+2\bm{M}\bm{Q}\bm{M}^{t}-\bm{M}\bm{P}-\bm{P}\bm{M}^{t}
=\displaystyle= 𝑷+2𝑷𝑸1𝑸𝑸1𝑷2𝑷𝑸1𝑷\displaystyle\bm{P}+2\bm{P}\bm{Q}^{-1}\bm{Q}\bm{Q}^{-1}\bm{P}-2\bm{P}\bm{Q}^{-1}\bm{P}
=\displaystyle= 𝑷.\displaystyle\bm{P}.

Therefore, the constrained realizations 𝑺i\bm{S}_{i} have identical statistical properties with the true CMB signal, and is hence a properly constrained fullsky CMB realization. Moreover, because the CMB angular power spectrum is completely determined by the pixel domain covariance matrix, we automatically get the conclusion that, the expected angular power of 𝑺i\bm{S}_{i} is also identical to the expected CMB angular power spectrum. Therefore, eq. (2.3) is enough to establish the mathematical basis of MrMVP for both the pixel and harmonic domains, and the rest of the problem is about the multi-resolution scheme and details of implementation, which will be described in the next section.

3 Implementation and technical details

In this section, we describe some technical details of the implementation of MrMVP.

3.1 Fast computation of the pixel domain covariance matrix

The implementation of eq. (2.3) depends on computing the pixel-domain covariance matrices 𝑷\bm{P} and 𝑸\bm{Q} that are consist of the CMB and non-CMB parts. Using the non-polarized case for illustration, the CMB part of the covariance matrix is given by the following equation:

Cij=14π(2l+1)W2CP[cos(θij)],C_{ij}=\frac{1}{4\pi}\sum_{\ell}{(2l+1)W_{\ell}^{2}C_{\ell}P_{\ell}[\cos{(\theta_{ij})}]}, (3.1)

where CijC_{ij} is the element of the CMB covariance matrix, CC_{\ell} is the CMB APS, WW_{\ell} is the transfer function, P(x)P_{\ell}(x) is the Legendre polynomial, and θij\theta_{ij} is the open angle between the ii-th and jj-th pixels. Let’s take Nside=512N_{side}=512 as an example: At this resolution, there are about 3×1063\times 10^{6} pixels on the sky, and we usually go up to max=1536\ell_{max}=1536. Thus there are about 101310^{13} elements in the covariance matrix, and the number of float point operations is at the order of 101610^{16}. Although eq. (3.1) is not very complicated, such a big number of operations should still be avoided.

In this work, the idea is to first greatly oversample θij\theta_{ij} from 0 to π\pi. In the HEALPix pixelization scheme, at a resolution of NsideN_{side}, 4Nside4N_{side} points between 0 and 2π2\pi are usually enough to support the spherical harmonic transforms, but we oversample it to 64Nside64N_{side} points to compute the function C(θ)C(\theta), which is more than enough to guarantee the precision. Then the full covariance matrix is computed by linear interpolation of the oversampled C(θ)C(\theta) function. By this way, the time cost is reduced from aNpix3aN_{pix}^{3} to bNpix2bN_{pix}^{2}. Note that not only the power index is reduced from 3 to 2, but also the constant coefficient bb is much smaller than aa, because it is much easier to compute the interpolation than to compute the Legendre polynomials.

In case of the polarization, the full pixel domain covariance matrices are given in eq. (10) of [25] or eq. (A10-A15) of  [26], and they are also in form of C(θ)C(\theta); thus, the computation can be accelerated with the same technique.

3.2 The degree-of-freedom problem

According to eq. (2.1), we need to invert the covariance matrix 𝑸\bm{Q} to get the solution, thus we want 𝑸\bm{Q} to be non-singular. For convenience of discussion, 𝑸\bm{Q} is converted to the harmonic domain: If the CMB signal is Gaussian and isotropic, then a full covariance matrix of the CMB in the harmonic domain is diagonal and takes the following look:

𝑪=(C000000C100000C100000C100000C2),\displaystyle\bm{C}=\begin{pmatrix}C_{0}&0&0&0&0&\cdots\\ 0&C_{1}&0&0&0&\cdots\\ 0&0&C_{1}&0&0&\cdots\\ 0&0&0&C_{1}&0&\cdots\\ 0&0&0&0&C_{2}&\cdots\\ \cdots&\cdots&\cdots&\cdots&\cdots&\cdots\end{pmatrix}, (3.2)

where each CC_{\ell} appears 2+12\ell+1 times. Thus for a finite max\ell_{max}, the maximum number of non-zero diagonal elements is (max+1)2(\ell_{max}+1)^{2}, which is equal to the matrix’s upper limit of degree-of-freedom (DOF).

However, the size of the covariance matrix is determined not by max\ell_{max}, but by the number of map pixels. With the HEALPix pixelization scheme, this number is equal to 12Nside212N_{side}^{2}, where Nside=2,4,8,N_{side}=2,4,8,\cdots is the resolution parameter. Therefore, to ensure that 𝑸\bm{Q} have enough DOF and is hence non-singular, the following inequality must be satisfied:

(max+1)212Nside2max12Nside13.5Nside.\displaystyle(\ell_{max}+1)^{2}\geq 12N_{side}^{2}\Longrightarrow\ell_{max}\geq\sqrt{12}N_{side}-1\approx 3.5N_{side}. (3.3)

However, according to the HEALPix manual, the pixelization scheme by HEALPix can only support a computation up to max=3Nside1\ell_{max}=3N_{side}-1. Higher \ells can still be computed but are no longer reliable. Thus we have a gap between 3Nside3N_{side} and 3.5Nside3.5N_{side}, which can make 𝑸\bm{Q} singular. Because of this degree of freedom problem, it is actually impossible to do a pixel domain minimum variance estimation at the original resolution without prior information — it has to be done either with a prior APS or by combining two neighboring resolutions.

In principle, both ways are allowed: If we use a prior APS, then the analysis becomes a typical Bayesian estimation and can be further developed into a suit of minimum variance estimations from map to cosmological parameters. This is our goal in the following works; however, if we choose to combine two neighboring resolutions, then the estimation is done in a bind way, which helps to make the result more robust by providing a crosscheck free from the prior APS. In this work, we are mainly dealing with the pixel domain estimation; thus, we shall focus on the latter.

3.3 The transfer function of downgrading/upgrading

In order to do the minimum variance estimation in a blind way and avoid the DOF problem, we need to downgrade the input map by one level to Nside=0.5NsideN_{side}^{\prime}=0.5N_{side}, and then upgrade it back to the original resolution to compute the APS up to max=2Nside=4Nside\ell_{max}=2N_{side}=4N_{side}^{\prime}. This provides a precise estimation of the APS up to 4Nside4N_{side}^{\prime} for the NsideN_{side}^{\prime} resolution except for the pixelization effect, because the upgraded map is morphologically identical to the map at NsideN_{side}^{\prime}. However, this also means, when we perform a blind pixel domain minimum variance estimation at Nside=512N_{side}=512, the input map resolution should be at least Nside=1024N_{side}=1024.

The above downgrading/upgrading operation will inevitably change the angular power spectrum due to pixelization; however, the corresponding effect cannot be computed by the HEALPix pixel window functions, because a downgraded-and-upgraded map has identical values for the sub-pixels of NsideN_{side}^{\prime} but different values for other pixels, which is non-isotropic at the sub-pixel level. Therefore, the effective transfer function should be computed by simulation. Fortunately, this kind of transfer function does not change significantly with the expected APS and does not need to be updated frequently. In figure 1, we compare the transfer functions given by simulation and the one computed directly from the HEALPix pixel window function without considering the sub-pixel non-isotropy, with Nside=64N_{side}^{\prime}=64. One can see that they are almost identical at <2Nside\ell<2N_{side}^{\prime}, but are slightly different at higher \ells.

Refer to caption
Figure 1: Comparison of the effective transfer function of downgrading/upgrading (black) and the one computed directly from the HEALPix window functions ignoring the sub-pixel level non-isotropy (red).

3.4 The multi-resolution scheme

A full implementation of eq. (2.3) at a high resolution requires too much time and memory, thus we use a multi-resolution scheme to make the computation feasible. Basically, this means to do a full implementation of eq. (2.3) at Nside=16N_{side}=16, and then go to a higher resolution and use less pixels for estimation. The schemes are different for the missing and available regions: For reconstructing the missing region at Nside>16N_{side}>16, only the available pixels within a certain angular distance from the edge of the mask are used, and for each higher resolution, this angular distance is reduced by 50%. For estimating the available sky region at Nside>16N_{side}>16, the computations are done in units of mosaic disks centered at the sky pixels of resolution Nsidemd=Nside/32N^{md}_{side}=N_{side}/32, and the radius are 1536/Nside1536^{\circ}/N_{side}. These values ensure that the mosaic disks will cover all sky pixels, but will not be too big to handle. In figure 2, the sky area used to estimate the missing/available regions are illustrated, respectively.

Refer to caption
Refer to caption
Figure 2: Illustration of the multi-resolution scheme for the missing region (left) and for the available region (right). In the left panel, the deep blue region is an example of the missing region (WMAP polarization mask), and the red area is the region used for reconstruction at working Nside=64N_{side}=64. In the right panel are the overlapping mosaic discs at working Nside=128N_{side}=128. When the resolution increases, the area in use decreases but the number of mosaic discs increases. In any case, each map pixel is covered at least once.

In the above schemes, for the case of estimating the missing region, the number of available pixels in use scales linearly with NsideN_{side}, whereas the number of pixels to be estimated follows Nside2N_{side}^{2}; thus, the time cost follows Nside3N_{side}^{3} for both the matrix inversion and matrix multiplication. As for the case of estimating the available region, the number of available pixels in use is nearly constant, but the number of mosaic discs follows Nside2N_{side}^{2}. Therefore, the overall time cost roughly follows Nside3N_{side}^{3}.

3.5 Lossless synthesis of multiple resolutions

By working with eq. (2.3) from Nside=16N_{side}=16 to NsideN_{side}^{\prime}, we get fullsky estimates of the CMB signal at a series of resolutions. These results should be synthesized to make a final output. The synthesis can be done either in the pixel domain or in the harmonic domain. Analysis and practical tests show that the pixel domain synthesis is not only lossless but also faster, more stable and more accurate; thus it is the default choice.

With the results at a series of resolutions starting from Nside=16N_{side}=16, the pixel domain synthesis is done as follows:

  1. 1.

    First upgrade the result at Nside=16N_{side}=16 to Nside=32N_{side}=32, which represents larger scale structures.

  2. 2.

    Then downgrade the result of Nside=32N_{side}=32 to Nside=16N_{side}=16 and upgrade back to Nside=32N_{side}=32.

  3. 3.

    Subtract the product of step 2 from the result at Nside=32N_{side}=32 to get the difference, which represents smaller scale structures.

  4. 4.

    Add the product of step 3 to the product of step 1 to make the synthesized result of Nside=32N_{side}=32.

  5. 5.

    Start again from step 1 with the product of step 4 to synthesize the next (higher) resolution, until reaching NsideN_{side}^{\prime}.

Practically, the above procedures can also be done in a descending order from NsideN_{side}^{\prime} to Nside=16N_{side}=16 to save the memory, and in this way, the original result at a higher resolution can be deleted after each round to free some memory.

It is easy to prove that the above synthesis procedures are lossless, i.e., when we do it with a series of multi-resolution maps downgraded from the same top-resolution map, the original map is restored by 100%. This also means the above synthesis procedures are safe for both temperature and polarizations, and will not cause a further E-to-B leakage by itself.

3.6 Matrix based fine tuning of the multi-resolution synthesis

The multi-resolution synthesis described in section 3.5 has the following matrix form: In the HEALPix pixelization scheme, each pixel at a lower resolution contains exactly four smaller pixels of the next (higher) resolution. Let the CMB signals be S0S_{0} at this bigger pixel (lower resolution) and S14S_{1-4} at the four smaller pixels (higher resolution), then the synthesis scheme in section 3.5 between two neighboring resolutions is described by the following matrix equation:

(S1S2S3S4)syn=14[(+31111+31111+31111+3)(S1S2S3S4)+(1111111111111111)(S0S0S0S0)],\displaystyle\begin{pmatrix}S_{1}\\ S_{2}\\ S_{3}\\ S_{4}\end{pmatrix}_{syn}=\frac{1}{4}\left[\begin{pmatrix}+3&-1&-1&-1\\ -1&+3&-1&-1\\ -1&-1&+3&-1\\ -1&-1&-1&+3\end{pmatrix}\cdot\begin{pmatrix}S_{1}\\ S_{2}\\ S_{3}\\ S_{4}\end{pmatrix}+\begin{pmatrix}1&1&1&1\\ 1&1&1&1\\ 1&1&1&1\\ 1&1&1&1\end{pmatrix}\cdot\begin{pmatrix}S_{0}\\ S_{0}\\ S_{0}\\ S_{0}\end{pmatrix}\right], (3.4)

which can be simplified to

(S1S2S3S4)syn=14(+4+3111+41+311+411+31+4111+3)(S0S1S2S3S4)=𝑴syn(S0S1S2S3S4)\displaystyle\begin{pmatrix}S_{1}\\ S_{2}\\ S_{3}\\ S_{4}\end{pmatrix}_{syn}=\frac{1}{4}\begin{pmatrix}+4&+3&-1&-1&-1\\ +4&-1&+3&-1&-1\\ +4&-1&-1&+3&-1\\ +4&-1&-1&-1&+3\end{pmatrix}\cdot\begin{pmatrix}S_{0}\\ S_{1}\\ S_{2}\\ S_{3}\\ S_{4}\end{pmatrix}=\bm{M}_{syn}\cdot\begin{pmatrix}S_{0}\\ S_{1}\\ S_{2}\\ S_{3}\\ S_{4}\end{pmatrix} (3.5)

If the self-consistency condition S0=(S1+S2+S3+S4)/4S_{0}=(S_{1}+S_{2}+S_{3}+S_{4})/4 is satisfied, then 𝑺syn=𝑺\bm{S}_{syn}=\bm{S}, which means the synthesis scheme is lossless by itself.

However, the lossless scheme is not unique, and it is possible to design other lossless synthesis schemes by fine tuning this 4×54\times 5 synthesis matrix 𝑴syn\bm{M}_{syn} to achieve a better effect. Unfortunately, this job is very time consuming; thus we leave it for the next work of this series.

3.7 Pre-processing of tiny holes in the mask

The multi-resolution scheme in section 3.4 does not work very well when the mask contains too many tiny holes, because each tiny hole will force to use some area around it for reconstruction, which is not really necessary. Therefore, if the mask contains too many tiny holes, like the ones for the point sources, then it is recommended to erase those tiny holes by the diffusive in-paint method [22, 23]. Although the diffusive in-paint method does not satisfy the minimum variance condition, it is especially safe and convenient for tiny holes in a mask, and the corresponding precision loss is usually negligible compared to the help it brings to us.

3.8 About the input CMB APS

The implementation of eq. (2.3) requires to use the CMB covariance matrix, which is determined by the input CMB APS; however, if we start from the pixel domain map directly, there is no CMB APS in advance. Therefore, we should either use a prior CMB APS, which turns the work into a typical Bayesian analysis; or we should assume an initial CMB APS, and improve it by the expectation of APS within {𝑺i}\{\bm{S}_{i}\} to get an iterative solution, which leads to a blind solution that does not depend on the prior APS. Both ways are allowed, and for the latter, the number of rounds needed to converge is roughly a few times of 1/fsky1/f_{sky}, which means when the available region is very small, the convergence can be slow. The reason for that is: when the available region is very small, it cannot provide enough correction to the fullsky APS in one round. However, in this case, the reconstruction of the missing region becomes much less important, so we can consider stopping at a lower resolution for the missing region reconstruction, and focus on the estimation of the available region, which can make the computation much faster. Another possible idea is to amplify the modification to the resulting APS by a certain factor to accelerate the convergence for a small fskyf_{sky}. Both ways will be tested in the following works.

4 Test results

We start testing MrMVP in the temperature case without noise at Nside=128N_{side}=128, with an input Λ\LambdaCDM power spectrum from the most recent Planck collaboration release [27], and the mask is chosen to be the WMAP KP2 mask without the point sources. With a 4-core laptop, we generate 1,000 realizations in the constrained ensemble {𝑺i}\{\bm{S}_{i}\}, and the total time cost is illustrated in figure 3 as a function of the resolution, which follows O(Nside3)O(N_{side}^{3}) nicely. According to the figure, the average time cost to generate one realization at Nside=128N_{side}=128 is about 0.1 second, which is a significantly improved speed compared to previous works.

Refer to caption
Figure 3: Illustration of the total time cost of generating 1000 realizations in {𝑺i}\{\bm{S}_{i}\} (black) from Nside=16N_{side}=16 to Nside=128N_{side}=128. The red line is for the O(Nside3)O(N_{side}^{3}) expectation. At low resolutions, the total time cost is higher than O(Nside3)O(N_{side}^{3}) due to the auxiliary operations like disk I/O, consistency test, mask and region processing, etc.

Then we compare the pixel domain expectation 𝑺~\widetilde{\bm{S}}, the input pixel domain CMB map and two constrained realizations 𝑺i\bm{S}_{i} in figure 4. Note that 𝑺~\widetilde{\bm{S}} is the minimum variance solution in the pixel domain, but its APS is not a minimum variance estimate of the CMB APS. The latter should be derived from the expectation of APS within {𝑺i}\{\bm{S}_{i}\}.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 4: Comparison of the pixel domain expectation 𝑺~\widetilde{\bm{S}} (upper-left), the input simulated CMB map (upper right), and two more constrained realizations 𝑺i\bm{S}_{i} derived by pixel domain operations (lower panels).

In figure 5, we compare the input Λ\LambdaCDM APS, the APS of the fullsky CMB map, the minimum variance estimate of the APS, and the RMS error of the minimum variance estimate. We can see that the minimum variance estimate of the APS is very well consistent with the fullsky CMB power spectrum.

Refer to caption
Figure 5: Comparison of the APS of the: input Λ\LambdaCDM (black), fullsky CMB map (red), minimum variance estimate (blue); and the RMS error of the minimum variance estimate (green).

Note that, because the entire ensemble {𝑺i}\{\bm{S}_{i}\} are constrained, the corresponding minimum variance estimate and the error amplitudes are also constrained. Therefore, the APS error bars derived from {𝑺i}\{\bm{S}_{i}\} are more informative than a general estimation of the error bars without considering the constraint. For example, the error bar at =2\ell=2 will follow the amplitude of the true quadrupole power; thus, if the true quadrupole power happens to be quite high (like the one in figure 5), then its error bar will increase accordingly, which faithfully reflects the actual constraint from the available dataset.

For the available region, the effect of MrMVP is to alleviate the unwanted residuals. The effectiveness of alleviation is shown in figure 6, where the input signal is a simulated B-mode map with r=0.05r=0.05, the available region is a disk in the center of the sky, and the contaminant is the residual of EB-leakage correction following the blind template correction method introduced in [28, 29, 17]. Such a residual is significantly correlated between different map pixels and is significantly non-isotropic. To make the residuals more visible and to test the method in a more difficult situation, the simulated residuals are amplified by factor 2, so they are bigger than what we may actually encounter. As we see from figure 6, the minimum variance estimate of the available region can efficiently alleviate the residual contamination, as far as its covariance matrix can be estimated.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 6: Illustration of the effectiveness of the minimum variance estimate of the available region. Panel 1 (from left to right) is an available region that is contaminated by the EB-leakage residuals after correction, and the residual component itself is shown in panel 2. After the minimum variance estimation, we obtain a clean region (panel 3), which is significantly improved compared to the contaminated one, and is better consistent with the true input (panel 4). The unit is μ\muK.

In summary, figures 45 confirm the correctness of the MrMVP implementation, figure 3 shows that it is very fast and scaled only as O(Nside3)O(N_{side}^{3}), and figure 6 shows that it can efficiently alleviate the residual errors with known covariance matrices, even if they are correlated and non-isotropic, like the residual of EB-leakage correction.

5 Conclusion and discussion

In this work, we have designed a pixel domain multi-resolution minimum variance painting method (MrMVP). Its time cost follows O(Nside3)O(N_{side}^{3}), which is much better than the O(Nside6)O(N_{side}^{6}) time cost of a traditional minimum variance method. The method helps to generate full sky CMB realizations that are properly constrained by the observed sky map, and a mathematical proof is given to confirm that these constrained realizations have identical statistical properties with the expected CMB signal, which is also consistent with the test results. When the method is applied to the missing region, it helps to alleviate the incomplete sky effect, and when it is applied to the available region, it can efficiently alleviate the unwanted pixel domain residuals with known covariance matrices, even if they are significantly correlated and non-isotropic, like the residual of EB-leakage. Both sides are useful in the data analysis of CMB experiments.

However, the method still need to be further developed and improved: additional routines needs to be developed to make it suitable for a complete Bayesian analysis from the pixel domain map to the final cosmological parameters; and the method itself needs to be fine tuned, especially for the synthesis matrix, to further improve its performance. Meanwhile, a GPU implementation of the method is also under consideration, because it can significantly reduce the time cost of inverting the covariance matrices. For small fskyf_{sky}, several ideas of improving the convergence speed are proposed and will be tested in the following works.

Acknowledgments

This research has made use of the HEALPix [3] package and data product from the Planck [30] collaboration. Hao Liu is also supported by the Youth Innovation Promotion Association, CAS.

References