This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

AI-Powered Reconstruction of Dark Matter Velocity Fields from Redshift-Space Halo Distribution

Xu Xiao School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Jiacheng Ding School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Xiao Lin Luo Sun Ke Lan School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Liang Xiao School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Shuai Liu School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Xin Wang School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Le Zhang [email protected] School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Peng Cheng Laboratory, Shenzhen, Guangdong 518066, China CSST Science Center for the Guangdong–Hong Kong–Macau Greater Bay Area, SYSU, Zhuhai 519082, China Xiao-Dong Li [email protected] School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Peng Cheng Laboratory, Shenzhen, Guangdong 518066, China CSST Science Center for the Guangdong–Hong Kong–Macau Greater Bay Area, SYSU, Zhuhai 519082, China
Abstract

In the study of cosmology and galaxy evolution, the peculiar velocity and density field of dark matter (DM) play a crucial role in studying many issues. Here, we propose a UNet-based deep learning to reconstruct the real-space DM velocity field from the spatial distribution of a sparse sample of DM halos in redshift space. By comparing and testing various properties, we demonstrate that the reconstructed velocity field is in good agreement with the actual situation. At k<0.3h/Mpck<0.3~{}h/{\rm Mpc}, the reconstruction of various velocity field components, including velocity magnitude and divergence, outperforms traditional linear perturbation theory. Additionally, the effects of redshift space distortions (RSD) are well corrected using the UNet model. Compared to the true real-space power spectra, the UNet reconstruction provides an unbiased estimate of the density, velocity, and momentum fields, remaining consistent within 2σ2\sigma level. We also demonstrate that the UNet model remains effective even with limited information about halo masses. Thus, our proposed UNet model has a wide range of applications in various aspects of cosmology, such as RSD, cosmic web analysis, the kinetic Sunyaev-Zel’dovich effect, BAO reconstruction, and so on.

1 Introduction

The large-scale peculiar velocity is increasingly being recognized as a valuable tool in cosmology, as it is directly influenced by the gravitational pulling. This renders it invaluable for studying the dark universe (Peacock et al., 2001; Linder & Jenkins, 2003; Zhang et al., 2007; Guzzo et al., 2008; Jain & Zhang, 2008; Wang, 2008; Reyes et al., 2010; Li et al., 2011; Clifton et al., 2012; Reid et al., 2012; Tojeiro et al., 2012; Weinberg et al., 2013; Joyce et al., 2015; Koyama, 2016), particularly its sensitivity to density fluctuations at horizon scales (Zhang & Stebbins, 2011; Zhang & Johnson, 2015), thereby providing insights into cosmic origin.

In a variety of circumstances, the measured velocity statistics may be given different weights. To illustrate, the kinetic Sunyaev-Zel’dovich (kSZ) effect is proportional to the gas momentum, which represents the gas density-weighted velocity. Conversely, the volume-weighted velocity power spectrum can be inferred from redshift-space distortions (RSD) by comparing the measured RSD power spectrum with theoretical models. In this approach, the theory of RSD includes the modelling of volume-weighted velocity statistics (Kaiser, 1987; Scoccimarro, 2004; Taruya et al., 2010; Zhang et al., 2013).

In the context of cosmological studies, volume-weighted velocity statistics are considered to be a more appropriate approach than density-weighted ones. It is preferable to employ density-weighted statistics when utilizing the distribution function approach (Seljak & McDonald, 2011; Okumura et al., 2012) and the streaming model (Peebles, 1980; White et al., 2015). In contrast to density-weighted statistics, volume-weighted statistics remain uninfluenced by uncertainties in galaxy density bias. Nevertheless, the measurement of volume-weighted velocity statistics presents a number of challenges, both in observational studies and in numerical simulations. Velocities in regions containing galaxies can be determined; however, velocities in regions lacking galaxies (i.e., those with no simulation particles) typically remain present. This sampling artifact inevitably distorts the measurement of volume-weighted velocity statistics, with the impact increasing as the particle number density decreases. Additionally, there exists a correlation between the halos/galaxies spatial distribution and the velocity field being measured, due to an underlying correlation between the large-scale structure (LSS) and the latter. Consequently, the sampling of a volume-weighted velocity field is subject to bias, which results in a biased measurement of volume-weighted velocity statistics (Bernardeau & van de Weygaert, 1996; Bernardeau et al., 1997; Schaap & van de Weygaert, 2000; Zheng et al., 2013, 2015a, 2015b; Zhang et al., 2015; Jennings et al., 2015).

It have been observed that the velocity sampling artifact increases in proportion to a decrease in the sample number density, ρh\rho_{h}. The effect is already discernible for ρh1(h/Mpc)3\rho_{h}\sim 1~{}(h/{\rm Mpc})^{3} (Zheng et al., 2013). In the case of sparse samples, such as those observed in massive halos, the problem is considerably more pronounced. For a sample density of ρh=103(h/Mpc)3\rho_{h}=10^{-3}~{}(h/{\rm Mpc})^{3}, the induced error in the velocity power spectrum reaches a value of approximately 10%, even at scales as large as k=0.2h/Mpck=0.2~{}h/\mathrm{Mpc} (Zheng et al., 2015b).

Several pioneering studies have been devoted to solving this long-standing problem. New velocity assignment methods have been developed, including the Voronoi tessellation method (Bernardeau & van de Weygaert, 1996), the Delaunay tessellation method (Schaap & van de Weygaert, 2000), the nearest-particle method (Zheng et al., 2013; Koda et al., 2016), the Kriging method (Yu et al., 2015, 2017), and a hybrid approach to determine the volume-weighted halo velocity bias (Chen et al., 2018). A theoretical model of the sampling artifact has been constructed (Zhang et al., 2015), validated in simulated DM velocity fields (Zheng et al., 2015b), and subsequently employed to correct the sampling artifact in the halo velocity field Zheng et al. (2015a).

Sampling artifacts can significantly impact cosmological constraints when velocity power spectra are derived from sparse galaxy samples. This issue also affects RSD, a key cosmological probe used by Stage IV surveys such as DESI (Levi et al., 2013) EUCLID (Laureijs et al., 2011), LSST (LSST Science Collaboration et al., 2009), WFIRST (Spergel et al., 2015), and CSST (Zhan, 2011). These surveys would measure the volume-weighted velocity power spectrum through RSD with a statistical precision of approximately 1% at 0.1h/Mpc0.1~{}h/{\rm Mpc}.

Both simulations and observations present challenges, prompting the question of whether it is possible to accurately infer the density-weighted or volume-weighted velocity field directly from the spatial distribution of halos? As this is a field-to-field mapping problem, it is non-trivial and complex. To date, there is no theoretical framework that can completely solve it. Given the large number of degrees of freedom in the field to be reconstructed, the traditional method is either invalid or inefficient for this purpose.

The recent advancements in machine learning algorithms, particularly those based on deep neural networks, present a significant opportunity to extract valuable insights from complex data. In more recent years, deep learning-based techniques have been applied with considerable success to almost all areas of cosmology and astrophysics (Mehta et al., 2019; Jennings et al., 2019; Carleo et al., 2019; Ntampaka et al., 2019), including weak gravitational lensing (Schmelzle et al., 2017; Gupta et al., 2018; Springer et al., 2018; Fluri et al., 2019; Jeffrey et al., 2019; Merten et al., 2019; Peel et al., 2019; Tewes et al., 2019), the cosmic microwave background (Caldeira et al., 2018; Rodriguez et al., 2018; Perraudin et al., 2019; Münchmeyer & Smith, 2019; Mishra et al., 2019), LSS for estimating cosmological parameters from the distribution of matter (Ravanbakhsh et al., 2017; Lucie-Smith et al., 2018; Pan et al., 2020; Lazanu, 2021), identifying DM halos, and reconstructing the initial conditions of the universe using machine learning (Modi et al., 2018; Berger & Stein, 2019; Lucie-Smith et al., 2019; Ramanah et al., 2019). In addition, this involves the mapping of coarse cosmology to the fine details (He et al., 2019; Li et al., 2021), extracting line intensity maps (Pfeffer et al., 2019), removing foregrounds in 21cm intensity mapping (Makinen et al., 2021), augmenting N-body simulations with gas (Tröster et al., 2019), mapping the 3D galaxy distribution in hydrodynamic simulations to its underlying DM distribution (Zhang et al., 2019), modeling small-scale galaxy formation physics in large cosmological volumes (Ni et al., 2021), reconstructing baryon acoustic oscillations (Mao et al., 2020) and the initial linear-regime matter density field (Shallue & Eisenstein, 2023), searching for gravitational waves (Dreissigacker et al., 2019; Gebhard:2019ldz), studying cosmic reionization (La Plante & Ntampaka, 2018; Gillet et al., 2019; Hassan et al., 2019b; Chardin et al., 2019; Hassan et al., 2019a), and analyzing supernovae (Lochner et al., 2016; Moss, 2018; Ishida et al., 2019; Li et al., 2019; Muthukrishna et al., 2019).

The pioneering work by (Wu et al., 2021) demonstrates that a UNet network can reconstruct the nonlinear velocity field of DM particles with high precision down to a scale of 2 h1Mpch^{-1}\rm Mpc, demonstrating a notable advantage in reconstructing the cosmic velocity field at nonlinear scales. However, reconstructing the velocity field from observed halos rather than particles is more challenging. As demonstrated by (Hong et al., 2021; Ganeshaiah Veena et al., 2023), the reconstruction of a density field or peculiar velocity field is possible from galaxy distributions. More recently,  (Wu et al., 2023) have proposed a method for reconstructing the various peculiar velocity fields down to k1.1k\lesssim 1.1 h/Mpch/\rm Mpc from the redshift-space distribution of DM halos.

However, none of these studies address the inference of velocity fields from the spatial distribution of DM halos in redshift space. Consequently, in this study, we propose a UNet model dedicated to reconstructing the real-space density, velocity, and momentum fields from the redshift-space spatial distribution of DM halos (and subhalos). The latter two essentially correspond to the volume-weighted and density-weighted velocity fields. This study employs a simulation data set to investigate the effectiveness of a UNet-structured neural network in velocity reconstruction. The data set is introduced in Sect. 2, where the architecture of the neural network and the training procedure are also detailed. The results of the network are presented in Sect. 3, and finally, the conclusion and discussion are presented in Sect. 4.

2 Methods

2.1 Dataset

In order to train and validate our deep learning model, we employed the CosmicGrowth simulations Jing (2018) as our training and test datasets. The CosmicGrowth simulation is a valuable tool for investigating the growth and evolution of DM halos and subhalos in the universe. The simulation was conducted in a box with a side length of 1.2 h1Gpch^{-1}{\rm Gpc}, containing 204832048^{3} DM particles with a mass resolution of MDM=3.8×109h1MM_{\rm DM}=3.8\times 10^{9}~{}h^{-1}M_{\odot}.

In this study, we adopt the standard flat Λ\LambdaCDM cosmological model, which is well consistent with the results of WMAP cosmology Komatsu et al. (2011); Hinshaw et al. (2013). The standard cosmological parameters adopted are as follows: {Ωc,Ωb,h,ns,σ8}={0.2235,0.0445,0.71,0.968,0.83}\{\Omega_{c},\Omega_{b},h,n_{s},\sigma_{8}\}=\{0.2235,0.0445,0.71,0.968,0.83\}.

In order to construct the halo/subhalo catalog in redshift space at z=0.59z=0.59, it is necessary to take the number density of the input data to be 3×103Mpc3h33\times 10^{-3}~{}{\rm Mpc}^{-3}h^{3}. This value is consistent with current spectroscopic observations. The relationship between the redshift space position, denoted by 𝒔\bm{s}, and the real space position, denoted by 𝒓\bm{r}, after accounting for the RSD effect is given by

𝒔=𝒓+𝒗z^aH(a)z^,\bm{s}=\bm{r}+\frac{\bm{v}\cdot\hat{z}}{aH(a)}\hat{z}\,, (1)

where 𝒗\bm{v} denotes the peculiar velocity, aa denotes the scale factor, H(a)H(a) denotes the Hubble parameter, and z^\hat{z} represents the unit vector along the line of sight (LoS). The density and velocity fields were computed based on the catalogue samples, with the haloes assigned to a 5123512^{3} mesh using the CIC (Cloud-in-Cell) scheme, with a cell resolution of 2.35h1Mpc32.35h^{-1}~{}{\rm Mpc}^{3}.

2.2 Training process

Refer to caption
Figure 1: Training scheme for reconstructing the DM velocity field in real space. The reconstruction process involves two major steps. The first step is to reconstruct the DM density field, ρDM\rho_{\rm DM}, from the sparse DM halo density field, ρs\rho_{s}, in redshift space. To achieve this, we trained a UNet model to predict ρDM\rho_{\rm DM}. Subsequently, we trained another UNet model that takes as input the concatenated data from the linear velocity prediction. This model is then used to reconstruct the direction and magnitude fields of velocity. This network structure is also applicable when the final output is the real-space momentum field.

The objective is to construct a mapping based on the UNet method, which will take as its input a sparse halo number density field in redshift space (to provide an accurate representation of real observations) and output the real-space fields, including the DM density field and DM velocity/momentum fields.

For clarification, let us first define two density fields that will be used during the UNet reconstruction process: 1) the redshift-space sparse halo number density field, ρs\rho_{s}; 2) the real-space DM density field, ρDM\rho_{\rm DM}.

Moreover, the real-space comoving velocity field is denoted by 𝒗\bm{v}, with its magnitude represented by |𝒗||\bm{v}| and its direction by v^\hat{v}. The momentum field in real space is defined as the number-weighted velocity, given by 𝒎=(1+δDM)𝒗\bm{m}=(1+\delta_{\rm DM})\cdot\bm{v}, where δDMρDM/ρ¯DM1\delta_{\rm DM}\equiv\rho_{\rm DM}/\bar{\rho}_{\rm DM}-1 is the DM density field relative to its mean.

The reconstruction process is divided into two principal steps, as illustrated in Fig. 1, which provides a detailed illustration of the UNet network in Fig. 2. For the illustrative purpose, the velocity will be taken as an example. The same methodology is employed for the processing of the momentum field.

In what follows, we will provide a comprehensive description of this process.

1) The mapping from ρs(𝒙)\rho_{s}(\bm{x}) to ρDM(𝒙)\rho_{\rm DM}(\bm{x}). The initial step is the reconstruction of the distribution of DM in real space, ρDM\rho_{\rm DM}, using the sparse halo number density field in redshift space, ρs\rho_{s}. Each channel of the input contains halos within a specific mass range. The DM halos (and subhalos) are catalogued in descending order of mass and divided into four mass intervals, with boundaries defined by the logarithmic mass ratio as follows: log10(M/M)[15.01,13.30,12.56,12.31,12.17]\log_{10}(M/M_{\odot})\in[15.01,13.30,12.56,12.31,12.17], corresponding to four number densities of halos ρh[0.0001,0.001,0.002,0.003](h/Mpc)3\rho_{h}\in[0.0001,0.001,0.002,0.003]~{}(h/{\rm Mpc})^{3}. Note that the mass information may be estimated to a reasonable degree of accuracy using empirical formulas, which allow for the inference of the masses of these objects from the observed apparent magnitudes of galaxies. The target data is the three-dimensional number density field of DM in real space. Our UNet model is constructed for the purpose of reconstructing the real-space ρDM\rho_{\rm DM}.

Two mass-weighting schemes are adopted for comparison: i) “with MhaloM_{\rm halo} weighting”: in this scheme, DM halos are divided into the four intervals based on mass, and each interval is assigned a weight proportional to its mass.

ii) “without MhaloM_{\rm halo} weighting”: in this scheme, DM halos are also divided into the four intervals based on mass, but no weights are assigned to these intervals.

The first scheme assumes an ideal scenario where we have complete knowledge of the mass information of DM halos. In contrast, the second one better reflects current observational capabilities, allowing only rough estimates of the mass distribution of galaxies, and thus corresponds more closely with actual observations. To assess the impact of mass information accuracy, results from both schemes will be presented simultaneously.

2) The mapping from a combination of ρs(𝒙)\rho_{s}(\bm{x}) and ρDM(𝒙)\rho_{\rm DM}(\bm{x}) to 𝒗(𝒙)\bm{v}(\bm{x}) or 𝒎(𝒙)\bm{m}(\bm{x}) is achieved using two UNet models. These models are constructed to reconstruct the direction and magnitude of the DM velocity and momentum fields in real space. The UNet models utilize ρs\rho_{s}, ρDM\rho_{\rm DM}, and the linear velocity in redshift space, 𝒗lin\bm{v}_{\rm lin}, derived from linear perturbation prediction as inputs.

2.3 Neural network model

Refer to caption Refer to caption
Figure 2: UNet neural network architecture employed for the reconstruction of the DM density and velocity/momentum fields. A 1283128^{3} box with a side length of 300 Mpc/h{\rm Mpc}/h is employed in the UNet architecture as the input. This architecture is comprised of a three-block structure: a up-convolution block, a down-convolution block, and a final convolution block. The output fields have dimensions of 1283128^{3}, with a volume of 3003(Mpc/h)3300^{3}~{}({\rm Mpc}/h)^{3}. The left panel provides detailed information on the components included in the three blocks of the UNet, while the right panel offers detailed descriptions of the “init”, “conv”, “trans”, and “output” layers. In the final convolution block, a dropout layer is employed between the convolutional transformation and batch normalization layers to enhance the performance of UNet and prevent overfitting. The values of N1,N2,N3N_{1},N_{2},N_{3}, and the dropout percentage are automatically provided by a machine learning tool called Optuna (Akiba et al., 2019).

In light of the neural network model proposed by (Wu et al., 2021), we employed the UNet architecture for the construction of our model. The architecture of our neural network and its components are depicted in Fig. 2. The input is data blocks with an arbitrary number of channels, with each channel corresponding to a specific mass range of ρs\rho_{s}.

Given that the velocity field is decomposed into the velocity magnitude and direction, we constructed two structurally similar neural networks to handle them separately. The networks conclude with an output layer comprising 1+3 channels. The three channels correspond to the components of the velocity direction, while the fourth channel corresponds to the velocity magnitude. The final step in the reconstruction process is the combination of the results from all output channels, which ultimately yields the complete reconstruction of the three-dimensional velocity field.

The notations in Fig. 2, “nin{n_{\rm in}}” and “nout{n_{\rm out}}”, represent the number of input and output channels, respectively. These values may vary across different reconstruction tasks, as shown in Tab. 1.

Table 1: Input and output channel numbers based on different reconstructed fields.
fields nin{n_{\rm in}} nout{\rm n_{\rm out}}
DM density (ρDM\rho_{\rm DM}) 4 1
magnitude of DM velocity (|𝒗||\bm{v}|) 8 1
direction of DM velocity (v^\hat{v}) 8 3
magnitude of DM momentum (|𝒎||\bm{m}|) 8 1
direction of DM momentum (m^\hat{m}) 8 3

The color blocks represent the various operations performed by the neural network, with arrow lines connecting the input and output. The dimensions of the input, intermediate, and output fields (channels ×\times spatial pixels) are defined, along with the size and number of labeled 3D convolutional kernels (“conv”).

Given the constraints of GPU memory, training time, and model size, we divided a large box with a physical scale of 1200 Mpc/h{\rm Mpc}/h into 434^{3} small boxes with a physical scale of 300 Mpc/h{\rm Mpc}/h, each containing 1283128^{3} grid points in the CIC interpolation scheme. A total of 32 small boxes were utilized for training and 32 for validation purposes. The neural network model receives inputs and generates outputs targeted at these small boxes, which consist of 1283128^{3} pixels each.

During the testing phase, We applied the neural network to large boxes, each consisting of 2563256^{3} pixels, as the input data for the model. The output also contains 2563256^{3} pixels, corresponding to a physical size of 600 Mpc/h{\rm Mpc}/h. Using the neural network on large boxes helps us eliminate less accurate outputs at the edges, which can be a significant issue when applying the UNet model to smaller boxes. For training, the kmink_{\rm min} for small boxes is 0.026, while the kmink_{\rm min} for validation with large boxes is 0.013.

Some points require further clarification, which will be detailed below.

  1. 1)

    To modify the field size, the stride=2\emph{stride}=2 parameter can be employed, which allows for a reduction or increase in the field size by a factor of two.

  2. 2)

    The “init” 3D convolutional layer enables the network to rapidly learn large-scale information due to its sufficiently large receptive field.

  3. 3)

    The convolutional layer, which is situated at the output of the network, serves to increase the network’s complexity. In order to prevent overfitting, a dropout layer is incorporated at the end of the convolutional layer. This layer also alters the number of channels.

  4. 4)

    The incorporation of batch normalization (BN) layers into the neural network facilitates the acceleration of training convergence and the prevention of overfitting. Additionally, the implementation of rectified linear units (ReLU) as activation layers subsequent to convolutional layers enhances the network’s nonlinearity.

  5. 5)

    The trained UNet enables the prediction of ρDM\rho_{\rm DM} by inputting ρs\rho_{s}. This, in turn, enables the prediction of the real-space fields, including the velocity (𝒗\bm{v}) and momentum (𝒎\bm{m}) fields, and the associated clustering statistics, which can be directly measured.

  6. 6)

    Additionally, due to the constraints of the limited dimensions of the training box, the model struggles to learn large-scale velocity modes. To mitigate potential bias from the small box dataset, the velocity field predicted by traditional linear reconstruction is used as one of the UNet inputs to reconstruct the velocity or momentum field. The linear velocity and momentum predictions in redshift space are determined through

    𝒗lin(𝒌)=afHi𝒌k2δs(𝒌)β,\bm{v}_{\rm lin}(\bm{k})=afH\frac{i\bm{k}}{k^{2}}\frac{\delta_{s}(\bm{k})}{\beta}\,, (2)

    and

    𝒎lin=(1+δsβ)𝒗lin.\bm{m}_{\rm lin}=(1+\frac{\delta_{s}}{\beta})\bm{v}_{\rm lin}\,. (3)

    Here δs\delta_{s} is divided by the linear bias β\beta, where β=\beta= 1.85 for “without MhaloM_{\rm halo} weighting” scheme and 2.57 for “with MhaloM_{\rm halo} weighting” scheme and is determined by the ratio of the integrated DM power spectrum to the integrated halo power spectrum over the relevant kk-range of interest, i.e.,

    β2=k<0.3Phalo(k)PDM(k)𝑑kk<0.3𝑑k.\beta^{2}=\frac{\int_{k<0.3}\frac{P_{\rm halo}(k)}{P_{\rm DM}(k)}dk}{\int_{k<0.3}dk}\,. (4)
  7. 7)

    The Tree Parzen Estimator (TPE) process (Bergstra et al., 2011), based on Bayesian optimization, is employed to optimize hyperparameters, with the Python package Optuna111https://optuna.readthedocs.io utilized to maximize the model accuracy (Akiba et al., 2019). The hyperparameters that are optimized include: 1) the learning rate; 2) dropout percentage; 3) the number of training data sets; 4) the number of channels in the intermediate layer, and 5) the training batch size. The hyperparameters are tuned by searching for values that minimize the validation loss of the model. A minimum of five experiments were conducted for each set of hyperparameter values, with the objective of evaluating the impact of these values on the performance.

2.4 Loss function

The objective of the training for UNet is to minimize the loss between the prediction for a given field and the simulation truth for each voxel.

  1. 1)

    For training our UNet model to reconstruct the real-space DM density fields ρDM\rho_{\rm DM}, we use the Mean Squared Error (MSE) loss function,

    =1Ni=1N(ρiρitrue)2,\mathcal{L}=\frac{1}{N}\sum_{i=1}^{N}\big{(}\rho_{i}-\rho^{\rm true}_{i}\big{)}^{2}\,, (5)

    where the index ii runs over all NN pixels of a field.

  2. 2)

    To account for the contributions of the velocity magnitude (v|𝒗|v\equiv|\bm{v}|) and the velocity direction (unit vector v^𝒗/|𝒗|\hat{v}\equiv\bm{v}/|\bm{v}|), we choose the following two-term loss function,

    =1Ni=1N[14(|vi||vitrue|)2+34(1cosϕi)],\mathcal{L}=\frac{1}{N}\sum_{i=1}^{N}\left[\frac{1}{4}(|v_{i}|-|v_{i}^{\rm true}|)^{2}+\frac{3}{4}\left(1-\cos\phi_{i}\right)\right]\,, (6)

    where cosϕiv^iv^itrue\cos\phi_{i}\equiv\hat{v}_{i}\cdot\hat{v}_{i}^{\rm true}, and the index ii denotes the ii-th pixel. As observed, the first term is responsible for |𝒗||\bm{v}| and corresponds to the MSE loss, which is essentially equivalent to the maximum likelihood solution under the Gaussian constant variance assumption. The second term, of course, measures the deviation between the reconstructed and true values of v^\hat{v}. The coefficients of these two terms can be considered as normalization factors and are determined by the number of channels: 1 for the magnitude |𝒗||\bm{v}| and 3 for the three directions, i.e., v^x\hat{v}_{x}, v^y\hat{v}_{y}, and v^z\hat{v}_{z}. Empirically, this loss function has proven to be stable and effective during our training process, yielding good results in velocity (momentum) reconstruction.

    Refer to caption
    Figure 3: Loss of the training (black) and validation (red) sets. The UNnet model achieves convergence after approximately 1000 epochs of training for the DM density, DM velocity, and DM momentum (from left to right), respectively. The loss function utilized for all models is the MSE loss function, as defined in Eqs. 5 and  6.
  3. 3)

    We trained our UNet using the popular Adam algorithm (Kingma & Ba, 2014) for training deep neural networks. This algorithm iteratively reduces the training loss by calculating its gradient with respect to the model parameters and taking a small step in the direction of maximum reduction. From Fig. 3, it can be seen that both the velocity model and the momentum model converge after approximately 1000 epochs of training.

3 result

This section will present the performance assessment of the trained UNet model, with the results presented based on predictions for 25 large boxes in the validation dataset that were not used in the model training and optimization of model structure and training parameters. Each large box have physical box size of 6003600^{3} (Mpc/h)3({\rm Mpc}/h)^{3} with the pixel number of 2563256^{3}. In order to ensure the most accurate results, we selected to test on the larger boxes. This is because the measurements on the larger boxes exhibit better statistical behavior, which reduces the sampling variance and thus statistical errors.

3.1 Visual inspection and point-wise comparison

The main objective of this study is to compare the UNet-predicted DM velocity/momentum in real space with the ground truth. In order to facilitate the analysis of the data, it is first necessary to describe the statistics that will be employed throughout the study. The two-point correlation function is a commonly used statistical measure for characterizing a density field,

ξ(𝒓)=δ(𝒙)δ(𝒙+𝒓),\xi(\bm{r})=\langle\delta(\bm{x})\delta(\bm{x}+\bm{r})\rangle\,, (7)

where the density contrast field, denoted by δ(𝒙)\delta(\bm{x}), is a function of any point 𝒙\bm{x}. The separation vector, 𝒓\bm{r}, is a vector that represents the distance between two points. The ensemble mean, represented by \langle\cdot\rangle, is computed by averaging over all points 𝒙\bm{x} in a spatial mean. The power spectrum of δ(𝒙)\delta(\bm{x}) is related to the Fourier transform of the correlation function, ξ(𝒓)\xi(\bm{r}), by a simple mathematical identity:

P(𝒌)=ξ(𝒓)ei𝒌𝒓d3𝒓,P(\bm{k})=\int\xi(\bm{r})\mathrm{e}^{i\bm{k}\cdot\bm{r}}\mathrm{d}^{3}\bm{r}\,, (8)

where the three-dimensional wavevector of the plane wave, denoted by 𝒌\bm{k}, has magnitude k|𝒌|k\equiv|\bm{k}|, related to the wavelength λ\lambda by k=2π/λk=2\pi/\lambda.

As with the scalar field δ\delta, we can also define power spectra for various velocity fields of interest. The velocity field, 𝒗\bm{v}, is completely described by its divergence, θ𝒗\theta\equiv\nabla\cdot\bm{v}, and its vorticity, 𝝎=×𝒗\bm{\omega}=\nabla\times\bm{v}, via the Helmholtz decomposition. In Fourier space, they become purely radial and transversal velocity modes, respectively, defined by θ(𝒌)=i𝒌𝒗(𝒌)\theta(\bm{k})=i\bm{k}\cdot\bm{v}(\bm{k}) and 𝝎(𝒌)=i𝒌×𝒗(𝒌)\bm{\omega}(\bm{k})=i\bm{k}\times\bm{v}(\bm{k}). The power spectra of the velocity, divergence, vorticity, and velocity magnitude are given by

θ(𝒌)θ(𝒌)=\displaystyle\langle\theta(\bm{k})\theta^{*}(\bm{k}^{\prime})\rangle= (2π)3Pθ(k)δ(𝒌𝒌),\displaystyle(2\pi)^{3}P_{\theta}(k)\delta(\bm{k}-\bm{k}^{\prime})\,, (9)
ωi(𝒌)ωj(𝒌)=\displaystyle\langle\omega^{i}(\bm{k})\omega^{*j}(\bm{k}^{\prime})\rangle= (2π)312(δijkikjk2)Pω(𝒌)δ(𝒌𝒌),\displaystyle(2\pi)^{3}\frac{1}{2}\bigg{(}\delta^{ij}-\frac{k^{i}k^{j}}{k^{2}}\bigg{)}P_{\omega}(\bm{k})\delta(\bm{k}-\bm{k}^{\prime})\,,
𝒗(𝒌)𝒗(𝒌)=\displaystyle\langle\bm{v}(\bm{k})\cdot\bm{v}^{*}(\bm{k}^{\prime})\rangle= (2π)3Pv(𝒌)δ(𝒌𝒌),\displaystyle(2\pi)^{3}P_{v}(\bm{k})\delta(\bm{k}-\bm{k}^{\prime})\,,

where indices ii and jj denote the components in the Fourier space coordinates.

According to the linear perturbation theory, the continuity equation leads to the following relationship: θ=fδ\theta=-\mathcal{H}f\delta, where =aH\mathcal{H}=aH is the conformal Hubble parameter, aa represents the cosmic scale factor, ff is the linear growth rate, defined as f=dlnD/dlnaf=d\ln D/d\ln a, with DD being the linear density growth factor. Under the Λ\LambdaCDM model, the growth rate is approximately given by fΩm0.55f\approx\Omega_{m}^{0.55} (Linder, 2005).

Refer to caption
Refer to caption
Figure 4: Comparison of the reconstructed fields by UNet with the true fields for velocity (upper) and momentum (lower) for slices from the test set, using the “without MhaloM_{\rm halo} weighting” weighting scheme. Each slice volume is 68×68×23(h1Mpc)368\times 68\times 23~{}(h^{-1}{\rm Mpc})^{3}. The sparse DM halo field (δs\delta_{s}), the true velocity and the reconstructed fields are displayed from left to right, along with the corresponding histograms. For each slice, the rightmost panel displays the statistical histogram distribution of the magnitude and direction of the velocity samples. The colored arrows in the velocity field indicate the magnitude and direction of the velocity. Through both visual and statistical analysis of the histogram, it can be observed that even in the absence of specific mass information for each DM halo, UNet is capable of accurately reconstructing the velocity/momentum fields.

Fig. 4 presents a comparison between the velocity magnitude and direction reconstructed by UNet and the ground truth values, using the “without MhaloM_{\rm halo} weighting” weighting scheme. As observed, the reconstructed velocity magnitude and the true velocity magnitude are 297.22±135.31297.22\pm 135.31 and 294.60±131.02294.60\pm 131.02 km/s, respectively. Additionally, the reconstructed momentum magnitude is 315.17±209.29315.17\pm 209.29 km/s, while the true one is 309.50±213.43309.50\pm 213.43 km/s. The two mean values are consistent with the true values across all slices, with significant small deviations. When considered their statistical uncertainties, these deviations are negligible.

Moreover, by defining the quantity as cosϕiv^iv^itrue\cos\phi_{i}\equiv\hat{v}_{i}\cdot\hat{v}^{\rm true}_{i}, where v^i\hat{v}_{i} is the velocity direction reconstructed by the model and v^itrue\hat{v}^{\rm true}_{i} is the true velocity direction, one can measure the deviation between the angles. The results of the tests conducted among these slices indicate that 1cosϕi1-\cos\phi_{i} is 0.01±0.060.01\pm 0.06 for velocity and 0.04±0.110.04\pm 0.11 for momentum, respectively. This implies that the directions of the reconstructed velocity fields are nearly identical to those of the true velocity field, and that our model performs high reconstruction accuracy.

Refer to caption
Refer to caption
Figure 5: Same as in Fig. 4, but for the divergence fields for velocity (upper) and momentum (lower).

Furthermore, Fig. 5 illustrates a comparison between the reconstructed velocity/momentum divergence and the true velocity/momentum divergence. The divergence of velocity reconstructed by UNet and the true value are 0.40±18.490.40\pm 18.49 and 0.92±17.550.92\pm 17.55 km/s, respectively. The divergence of momentum reconstructed by UNet and the true one are 0.34±25.830.34\pm 25.83 and 0.34±25.790.34\pm 25.79 km/s, respectively. At the same time, each of the histogram demonstrates that the reconstructed probability distribution is similar to that of the truth. The findings suggest that the discrepancies in the reconstruction for velocity/momentum divergence fields are well consistent with the true values, with an accuracy exceeding 1% relative to the statistical uncertainty.

3.2 Statistical analysis on reconstructed fields

Furthermore, let us introduce the metrics that will be employed for evaluating the reconstruction accuracy. For an arbitrary reconstructed field of halos from a UNet, denoted by the shorthand notation ff, where f{𝒗,θv}f\in\{\bm{v},\theta_{v}\} for velocity and its divergence, and f{𝒎,θm}f\in\{\bm{m},\theta_{m}\} for momentum and its divergence, the following metrics are employed to describe the relative deviation and correlation coefficient, respectively, in order to compare the reconstructed field with the true one.

The relative deviation, RR, and the correlation coefficient, CrC_{r}, are defined as follows:

R=𝒪f𝒪f1,Cr=1Npix1i(fif¯)(fif¯)σfσf,R=\frac{\mathcal{O}_{f}}{\mathcal{O}_{f^{\prime}}}-1\,,\quad C_{r}=\frac{1}{N_{\rm pix}-1}\sum_{i}\frac{(f_{i}-\bar{f})(f^{\prime}_{i}-\bar{f^{\prime}})}{\sigma_{f}\sigma_{f^{\prime}}}\,, (10)

where 𝒪f\mathcal{O}_{f} represents an arbitrary observable for ff. CrC_{r} is defined between the reconstructed field ff and the true field ff^{\prime}, both of which have the same total number of pixels, NpixN_{\rm pix}. The sample mean and standard deviation of field ff are denoted by f¯\bar{f} and σf\sigma_{f}, respectively. It is evident that both metrics provide a physical insight for comparison, such that the ideal reconstruction is equivalent to R=0R=0 and to Cr=1C_{r}=1.

The coefficients CrC_{r} and relative deviation RR for various velocity components are calculated using Eq. 10. The calculations are based on averaging over four test sets, each with a box size of 300 Mpc/h{\rm Mpc}/h on each side. The results are summarized in Tab. LABEL:tab:rc. It can be observed that in all cases, the CrC_{r} values for all fields, except the momentum divergence field, are at the level of 0.9. Additionally, the changes induced by the mass-weighting scheme on CrC_{r} are not significant. The velocity reconstruction appears to exhibit slightly superior performance compared to the momentum reconstruction. Moreover, the RR values approach zero, indicating that the reconstructed field has minimal bias compared to the true one. When comparing the ”with MhaloM_{\rm halo} weighting” scheme to the “without MhaloM_{\rm halo} weighting” scheme, we find that even without complete knowledge of the halo mass, we can still achieve accurate reconstruction of the field.

Refer to caption
Figure 6: Joint probability distributions of the reconstructed fields and the true fields for velocity magnitude (|𝒗||\bm{v}|) and momentum magnitude (|𝒎||\bm{m}|), using the weighting schemes of “with MhaloM_{\rm halo} weighting” and “without MhaloM_{\rm halo} weighting” for comparison. For each field, the distributions were calculated using a large box with a side length of 600 Mpc/h{\rm Mpc}/h. The predicted distributions are strongly correlated with the true ones, indicating that the model is capable of effective reconstruction.

In order to provide a more comprehensive visual representation of the reconstruction performance, Fig. 6 depicts the joint probability distributions of the reconstructed fields and the true fields for magnitudes of velocity and momentum. This illustration compares the results obtained with the two mass weighting schemes. For each of the fields, the distributions were calculated using a large box with a side length of 600Mpc/h600~{}{\rm Mpc}/h. As can be observed, the predicted distributions for each field exhibit a strong correlation with the true one, falling very close to the diagonal. These findings suggest that our neural network performs an effective reconstruction. Furthermore, when employing the “without MhaloM_{\rm halo} weighting” weighting scheme, even without knowledge of the specific halo mass values, our UNet model is still capable of accurately reconstructing the fields, closely matching the true values. This indicates that the network has learned the information about the mapping from the sparse halo number density field in redshift space to the velocity/momentum fields in real space. As demonstrated in Fig. 7 for their divergence fields, similar findings are evident.

Refer to caption
Figure 7: Similar to Fig 6, but for the velocity divergence (θv\theta_{v}) and the momentum divergence (θm\theta_{m}), with the two mass weighting schemes. The correlation between the predicted and true distributions remains strong even when the specific mass values are not considered. This demonstrates that our model is capable of accurately predicting the distribution of DM halos even in the absence of specific mass values. These findings indicate that the mapping from the sparse DM halo number density field in redshift space to the real-space DM velocity/momentum fields can be successfully and accurately learned by our model.
Refer to caption Refer to caption
Refer to caption Refer to caption
Figure 8: Comparison of the predicted joint distribution with the simulation truth for the “with MhaloM_{\rm halo} weighting” weighting scheme. The top-left and bottom-left panels show the joint distributions of density-velocity, ρ(δ|v)\rho(\delta|v), and density-momentum, ρ(δ|m)\rho(\delta|m), respectively. The top-right and bottom-right panels show the joint distributions of density-divergence, ρ(δ|θv)\rho(\delta|\theta_{v}) and ρ(δ|θm)\rho(\delta|\theta_{m}), respectively. These distributions were obtained using a large box with a side length of 600Mpc/h600~{}{\rm Mpc}/h. The results demonstrate that for all DM halo number densities with δ+1[0,8]\delta+1\in[0,8], the predicted velocity distributions exhibit good consistency with the true values.
Refer to caption Refer to caption
Refer to caption Refer to caption
Figure 9: Same as in Fig. 8, but for the weighting scheme of “without MhaloM_{\rm halo} weighting”. It is evident that the predicted joint distributions remain consistent with the simulation truth, even in the absence of precise mass values.

To further test the difference between the reconstructed velocity/momentum fields and the true ones, the joint probability distribution of density-divergence and density-velocity (momentum) are shown in Fig. 8 for the weighting scheme of “with MhaloM_{\rm halo} weighting” and Fig. 9 for “without halo mass”. It is evident that the predicted results are in strong agreement with the ground truth across a wide range of δDM\delta_{\rm DM} values, even in high-density regions where δDM4\delta_{\rm DM}\gtrsim 4. These high-density areas are sparse and highly non-linear, yet the model still performs effectively.

3.3 Comparison for DM density power spectra

Refer to caption
Figure 10: Comparison of the UNet-predicted density power spectra with the true one. Each plot displays the predicted (blue) and true (black) values, along with the redshift-space power spectrum of the DM halos (red). The left and right panels illustrate the weighting schemes of “with MhaloM_{\rm halo} weighting” and “without MhaloM_{\rm halo} weighting”. A total of 25 boxes (each with a side length of 600Mpc/h600~{}{\rm Mpc}/h) were employed in the test dataset to estimate the mean and standard deviation (shaded region) for the power spectra. The relative deviations, RR, are presented in the bottom panels. In all cases, the UNet model yields a good reconstruction performance, insensitive to the weighting scheme. Thus, using DM halo mass as a weight does not significantly affect the reconstruction of the DM density field.

As illustrated in Fig. 10, the resulting DM density power spectra in real space, calculated using Eq. 8 for various cases, are summarized. Two mass weighting schemes of “with MhaloM_{\rm halo} weighting” and “without MhaloM_{\rm halo} weighting” are presented in the left and right panels, respectively, for comparison. The auto power spectra for the predicted density (blue) and the true density (black) are shown, along with the auto power spectrum of sparse DM halo (red), providing a detailed comparison.

To accurately estimate the statistical uncertainty ΔP(k)\Delta P(k), the number of independent Fourier modes is taken into account, which leads to a rescaling of the standard deviation δP(k)\delta P(k) (estimated over the 25 test boxes) via

ΔP(k)=δP(k)VallVoverlap.\Delta P(k)=\delta P(k)\sqrt{\frac{V_{\rm all}}{V_{\rm overlap}}}\,. (11)

Here, VallV_{\rm all} represents the total independent volume, while VoverlapV_{\rm overlap} denotes the overlap volume for the calculated power spectrum. Since the boxe have a physical size of 1200×1200×600(Mpc/h)31200\times 1200\times 600~{}({\rm Mpc}/h)^{3}, divided into 25 test boxes with a size of 600 Mpc/h{\rm Mpc}/h on each side, the value of Vall/Voverlap\sqrt{V_{\rm all}/V_{\rm overlap}} is 0.4.

The relative deviation RR as defined in Eq. 10 are illustrated in the bottom panels. For “without MhaloM_{\rm halo} weighting” case, the discrepancies between the measured auto spectra and the true auto spectrum are significant small, essentially invisible in the left panel when k<0.3h/Mpck<0.3~{}h/{\rm Mpc}. It can be observed that the discrepancy across this scale range is |R|<0.13|R|<0.13, indicating a good recovery of both the amplitude information of the field, consistent with the findings in (Wang et al., 2024).

In comparison to the left panels, the reconstructed power spectra for “with MhaloM_{\rm halo} weighting” exhibit relatively larger deviations. The maximum deviation is R=0.15R=0.15 for auto correlation. By comparing these results, we demonstrate that the absence of this information does not significantly reduce reconstruction accuracy.

3.4 Comparison for velocity/momentum power spectra

Refer to caption Refer to caption
Figure 11: Comparison of the UNet-reconstructed momentum/velocity power spectra with the simulation truth for the two mass weighting schemes. The specific field and the weighting scheme are clearly indicated in each panel. The bottom panels present the resulting relative deviations. In each plot, the shaded region represents the 2σ2\sigma statistical uncertainty, estimated from a total of 25 boxes (each with a side length of 600Mpc/h600~{}{\rm Mpc}/h) in the test dataset. Based on Eqs. 2 and 3, the linear predictions are also provided for comparison, deviating increasingly from the true values as kk increases. As seen, the relative deviations are in the range of R[0.06,0.06]R\in[-0.06,0.06] at the scales of k[0.05,0.1]k\in[0.05,0.1]. The average relative deviations of auto power spectrum are 0.013±0.03,0.011±0.07,0.033±0.05,0.004±0.053-0.013\pm 0.03,-0.011\pm 0.07,0.033\pm 0.05,-0.004\pm 0.053 from left to right, respectively.
Refer to caption Refer to caption
Figure 12: Same as in Fig. 11, but for the momentum/velocity divergence fields with two different mass weighting schemes. The specific field and the weighting scheme are clearly indicated in each panel. The relative deviations fall within the range of [0.1,0.1][-0.1,0.1] at the scale of k[0.05,0.3]k\in[0.05,0.3], respectively, indicating a relatively accurate reconstruction. Compared to the linear predicted power spectra, the UNet model provides a more accurate reconstruction, especially when k>0.1h/Mpck>0.1~{}h/{\rm Mpc}.

Furthermore, we now examine the power spectrum reconstruction for velocity, momentum, and their divergences. In Fig. 11, the magnitudes of the momentum and velocity power spectra are presented. The auto correlation power spectra between the predicted and true fields based on the two weighting schemes are also shown for comparison. The estimated curves and shaded regions are based on the mean and standard deviation of the derived power spectra from a total of 25 boxes in the test dataset, each with a side length of 600Mpc/h600~{}{\rm Mpc}/h.

It can be observed that the auto correlation power spectra of the velocity and momentum can be reconstructed with a relative deviation of |R(k)|<0.07|R(k)|<0.07 for k[0.05,0.2]hMpc1k\in[0.05,0.2]~{}h{\rm Mpc}^{-1}. In contrast, the velocity and momentum spectra have |R(k)|[0,0.13]|R(k)|\in[0,0.13], indicating an overestimate in the scales of k[0.03,0.05]hMpc1k\in[0.03,0.05]~{}h{\rm Mpc}^{-1}. In general, the reconstruction performance for “with MhaloM_{\rm halo} weighting” across all scales appears better than that for “without MhaloM_{\rm halo} weighting”. This is evidenced by the fact that even without complete knowledge of the halo mass, we can still accurately reconstruct the auto-correlation of the DM velocity field and momentum field using UNet.

Moreover, since the momentum field is a density-weighted velocity, it is expected to be more sensitive to the reconstruction accuracy of the DM density field. However, the reconstruction accuracy for the momentum is comparable to that of the velocity, thanks to the good reconstruction of the DM density, as illustrated in Fig. 10. It is clear that as the reconstruction uncertainty from the DM density field increases, the statistical error in the momentum and its divergence power spectra will also increase.

Moreover, as shown in Fig. 12, the auto power spectra of the velocity and momentum divergence yield R(k)<0.06R(k)<0.06 for scales of kk from 0.050.05 to 0.1 h/Mpch/{\rm Mpc}. At k>0.05k>0.05 h/Mpch/{\rm Mpc}, the auto-correlation of velocity divergence agrees well with the true power spectrum within 2σ2\sigma, while the auto-correlation of momentum divergence slightly overestimates the true value. This discrepancy may arise from the inaccurate reconstruction of the DM density field, likely due to the limited number of large-mass halos in the current training samples.

In addition, a comparison of the results obtained by the two different weighting schemes reveals that there is not a significant difference between them. This suggests that precise mass information is not necessary for the velocity/momentum reconstruction. Overall, our UNet model demonstrates satisfactory performance on linear and nonlinear scales of k[0.03,0.3]h/Mpck\in[0.03,0.3]~{}h/{\rm Mpc}. These results highlight the ability of the UNet model to learn various velocity quantities from the halo number density field on the nonlinear scales, which is an important finding in this study. In light of the predicted power spectra, it can be reasonably concluded that the UNet model’s predictions are in relatively good agreement with the truth.

3.5 RSD corrections and power spectrum multipoles

To further validate the effectiveness of the UNet model, we will demonstrate comparisons between the reconstructed measurements and the simulation truth in real space. As the input observables in the neural network are in redshift space, the relevant comparison can be used to assess the accuracy of the peculiar velocities corrected. The following analysis was conducted based on a total of 18 simulation boxes, each with a side length of 600Mpc/h600~{}{\rm Mpc}/h.

Refer to caption
Figure 13: Comparison of the two-dimensional anisotropic 2PCFs, ξ(r,r)\xi(r_{\perp},r_{\|}), of density fields between the prediction and the simulation truth in real space for “with MhalosM_{\rm halos} weighting” scheme. The results are based on a total of 25 boxes, each with a side length of 600Mpc/h600~{}{\rm Mpc}/h. The contours were generated from the correlation values of [0.02,0.06,0.12][0.02,0.06,0.12], respectively. The gray shaded regions and blue dashed contours represent the derived ξ\xi of the true and reconstructed fields in real space, respectively. The red dotted contours represent the redshift-space 2PCF of the DM halos. It is evident that despite the input being that of the DM halo density in redshift space, the reconstructed 2PCF is in good agreement with the true value. This suggests that the UNet is capable of successfully recovering the real-space DM density field.

In Fig. 13, a comparison is performed between the two-dimensional anisotropic 2PCFs, ξ(r,r)\xi(r_{\perp},r_{\|}), of density fields between the prediction and the simulation truth in real space. As illustrated by the red dotted contours, the 2PCF measured in redshift space without any RSD corrections demonstrates the significant impact of RSD effects. The Kaiser effect results in the “squashed” structure along LoS, while the random velocities on the non-linear small scales produce the so-called “fingers-of-God” (FoG) effect, which causes structures to be elongated along the LoS. The velocity field reconstructed by UNet was utilized to correct the velocities of the halo. Note that the corrected 2PCF exhibits a high degree of agreement with the real-space 2PCF across all scales. Furthermore, the 2PCF corrections for both Kaiser and FoG effects are achieved with a high degree of accuracy. The successful corrections for the FoG effect indicate that the UNet is capable of accurately predicting the DM field in real space.

In order to accurately assess the statistical properties of an arbitrary real-space field, it is necessary to consider the higher-order power spectrum multipoles. In general, the power spectrum in redshift space P(k,μ)P(k,\mu), where μ\mu is the cosine of the angle between the wavevector 𝒌\bm{k} and the LoS direction, can be expressed by expanding its multipole components:

P(k)=2+1211𝑑μP(k,μ)(μ),P_{\ell}(k)=\frac{2\ell+1}{2}\int_{-1}^{1}d\mu P(k,\mu)\mathcal{L}_{\ell}(\mu)\,, (12)

where \mathcal{L}_{\ell} are the Legendre polynomials of order \ell. In this study, we focus on the Legendre polynomials of order \ell, specifically the quadrupole (=2\ell=2) and the hexadecapole (=4\ell=4), denoted as P2P_{2} and P4P_{4}, respectively.

Refer to caption Refer to caption
Figure 14: Comparison of the UNet-reconstructed power spectrum multipoles of the DM fields with the truth is presented. Top: a comparison of the measured quadrupole (P2P_{2}) of the DM halos in redshift space (red dashed), the UNet-reconstructed DM field (blue dashed) and the true value (black solid) in real space. The two mass weighting schemes: “without MhaloM_{\rm halo} weighting” (left) and “with MhaloM_{\rm halo} weighting” (right) are chosen to illustrate the reconstructed performance. The shaded regions represent the 2σ2\sigma confidence level, estimated from the test dataset. Bottom: same as in the top panels, but for the higher-order multipole of hexadecapole (P4P_{4}). As demonstrated, following the application of RSD corrections, the reconstructed multipoles of the quadrupole and hexadecapole exhibit well agreement with the true values at the 2σ2\sigma level within the range of k[0.04,0.4]h/Mpck\in[0.04,0.4]~{}h/{\rm Mpc}. Furthermore, the scheme of “with MhaloM_{\rm halo} weighting” is observed to provide a higher reconstruction accuracy and a smaller statistical uncertainty.

In Fig. 14, we compare the power spectrum multipoles of the UNet-reconstructed DM density field with the true values. The top panels depict the measured P2P_{2} in redshift space (red), the UNet-reconstructed DM field (blue) in real space, and the real-space true value (black). Two mass weighting schemes were employed for the purpose of demonstrating the reconstruction performance: “without MhaloM_{\rm halo} weighting” (left) and “with MhaloM_{\rm halo} weighting” (right). The shaded regions represent the 1σ1\sigma confidence interval, estimated based on the test dataset.

As demonstrated, the RSD effects can markedly contribute to anisotropic clustering, resulting in an amplitude of P2(k)P_{2}(k) that is approximately one to two orders of magnitude greater than that observed in real space across all kk values of interest. The quadrupole of the UNet-predicted DM density field in real space can significantly suppress the anisotropy clustering, yielding a reconstructed result that is in good agreement with the true one at the 2σ2\sigma level when k0.4h/Mpck\lesssim 0.4~{}h/{\rm Mpc}. Furthermore, it has been found that the “with MhaloM_{\rm halo} weighting” scheme yields a slightly more accurate reconstruction. The discrepancy between the reconstructed and true values averaged over the kk range is 174.29±702.27174.29\pm 702.27 for the “with MhaloM_{\rm halo} weighting” scheme and 175.05±711.88175.05\pm 711.88 for the “without MhaloM_{\rm halo} weighting” scheme. This indicates that we can reconstruct the quadrupole of the DM density field without complete knowledge of halo mass information.

The lower panels of Fig. 14 depict a similar comparison of a higher-order multipole, the hexadecapole, P4P_{4}. As a consequence of its higher order, the statistical uncertainty for halos in redshift space appears to exceed that of P2P_{2}. Nevertheless, the UNet is still capable of successfully reconstructing P4P_{4} of the DM density field in real space, with the associated statistical uncertainty being approximately comparable to that of P2P_{2}. The discrepancy between the reconstructed and true values falls within the 2σ2\sigma level. Furthermore, the use of a weighting scheme based on halo mass can result in a slight improvement in reconstruction accuracy, with the average deviation of 60.03±543.83-60.03\pm 543.83 reduced by 27%27\% compared to the “without MhaloM_{\rm halo} weighting” case.

The proposed UNet is capable of automatically applying RSD corrections. Furthermore, the reconstructed multipoles of both the quadrupole and hexadecapole have been found to agree with the true values at the 2σ2\sigma level, within the range of k[0.06,0.3]h/Mpck\in[0.06,0.3]~{}h/{\rm Mpc}. Moreover, the “with MhaloM_{\rm halo} weighting” scheme has been observed to yield higher reconstruction accuracy and smaller statistical uncertainty.

In conclusion, while the RSD effects can significantly contribute to anisotropic clustering, distorting P2(k)P_{2}(k) and P4(k)P_{4}(k), our proposed UNet effectively corrects for these RSD effects and yields accurate predictions for the real-space statistical measurements of density. The reconstructed results align well with the true values at the 2σ2\sigma level for k[0.03,0.4]h/Mpck\in[0.03,0.4]~{}h/{\rm Mpc}. Furthermore, the “with MhaloM_{\rm halo} weighting” scheme provides additional information, resulting in a slightly more accurate reconstruction.

4 Conclusions

The construction of three-dimensional velocity (and momentum) fields by galaxies and clusters is of great significance in the field of cosmology. These fields provide a wealth of information that is not available from the density field alone. They can be used to improve and correct various cosmological measurements. The reconstruction of high fidelity may even result in the discovery of unexpected findings. The reconstruction of cosmic volume-weighted velocity is susceptible to significant sampling artifacts, which presents a challenge for traditional reconstruction methods. Furthermore, traditional reconstruction methods frequently rely on numerous assumptions and approximations.

The objective of this study is to propose a deep learning approach based on the UNet neural network for the reconstruction of three-dimensional velocity/momentum fields in real space. This approach may provide a potential solution to the long-standing problem. This study has demonstrated the effectiveness of UNet in reconstructing fields directly from sparse samples of DM halo (and subhalo) spatial distribution in redshift space. The UNet architecture, with its sophisticated design, is capable of capturing diverse field characteristics and transforming high-dimensional, structured inputs, making it a valuable tool for such reconstructions.

The UNet was trained to transform the sparse halo density fields into velocity/momentum fields. To include the halo mass information and assess the sensitivity of the reconstruction performance to this knowledge, we adopted two weighting schemes: one that includes halo mass and one that does not. A comprehensive validation was conducted through a series of statistical tests, and the reconstructed velocity/momentum fields were found to exhibit a high degree of agreement with the ground truth. Moreover, the UNet-inferred DM velocity fields in real space essentially provide effective RSD corrections. Compared to the true values, we have demonstrated that the proposed UNet can reconstruct real-space velocity and momentum fields with a relative deviation of about R<0.13R<0.13 in highly non-linear regimes at k<0.3h/Mpck<0.3~{}h/{\rm Mpc}.

It is also worth noting that the reconstruction of our UNet model is effective even in the absence of precise DM halo mass information. A comparison of the pixel-by-pixel values and power spectra reveals that the UNet model is still capable of accurately reconstructing not only the DM density field and relevant power spectrum, but also the velocity/momentum fields and the corresponding power spectra. The reconstruction accuracy is not significantly different from those obtained when the accurate DM halo mass value is used. The findings indicate that the model is capable of performing reconstruction in a manner that is more consistent with real observations, which is a highly encouraging indication for the reconstruction of velocity/momentum fields using current and future real data.

The Stage IV galaxy surveys will yield more detailed measurements of LSS of the Universe than ever before. In consequence, novel computing technologies are required to analyze these high-dimensional, massive data sets. It is therefore anticipated that UNet-based neural networks will prove an invaluable tool in addressing the challenges inherent in traditional methodologies, thereby facilitating the extraction of cosmological information in a more profound and comprehensive manner.

Acknowledgments

This work is supported by National SKA Program of China (2020SKA0110401, 2020SKA0110402, 2020SKA0110100), the National Key R&D Program of China (2020YFC2201600), the National Science Foundation of China (12203107, 12073088, 12373005), the China Manned Space Project with No. CMS-CSST-2021 (A02, A03, B01), the Guangdong Basic and Applied Basic Research Foundation (2024A1515012309), and the 111 project of the Ministry of Education No. B20019. We also wish to acknowledge the Beijing Super Cloud Center (BSCC) and Beijing Beilong Super Cloud Computing Co., Ltd (http://www.blsc.cn/) for providing HPC resources that have significantly contributed to the research results presented in this paper.

References