AI-Powered Reconstruction of Dark Matter Velocity Fields from Redshift-Space Halo Distribution

Xu Xiao School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Jiacheng Ding School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Xiao Lin Luo Sun Ke Lan School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Liang Xiao School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Shuai Liu School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Xin Wang School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Le Zhang [email protected] School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Peng Cheng Laboratory, Shenzhen, Guangdong 518066, China CSST Science Center for the Guangdong–Hong Kong–Macau Greater Bay Area, SYSU, Zhuhai 519082, China Xiao-Dong Li [email protected] School of Physics and Astronomy, Sun Yat-Sen University, Zhuhai 519082, China Peng Cheng Laboratory, Shenzhen, Guangdong 518066, China CSST Science Center for the Guangdong–Hong Kong–Macau Greater Bay Area, SYSU, Zhuhai 519082, China

Abstract

In cosmology and galaxy evolution studies, the peculiar velocity and density fields of dark matter (DM) play a crucial role in many problems. Here, we propose a UNet-based deep learning model to reconstruct the real-space DM velocity field from the spatial distribution of a sparse sample of DM halos in redshift space. By comparing and testing various properties, we demonstrate that the reconstructed velocity field is in good agreement with the ground truth. At $k<0.3~h/{\rm Mpc}$, the reconstruction of various velocity field components, including velocity magnitude and divergence, outperforms traditional linear perturbation theory. Additionally, the effects of redshift-space distortions (RSD) are well corrected by the UNet model. Compared to the true real-space power spectra, the UNet reconstruction provides an unbiased estimate of the density, velocity, and momentum fields, remaining consistent within the $2\sigma$ level. We also demonstrate that the UNet model remains effective even with limited information about halo masses. Thus, our proposed UNet model has a wide range of applications in cosmology, such as RSD, cosmic web analysis, the kinetic Sunyaev-Zel’dovich effect, and BAO reconstruction.

1 Introduction

The large-scale peculiar velocity is increasingly being recognized as a valuable tool in cosmology, as it is directly driven by gravity. This makes it invaluable for studying the dark universe (Peacock et al., 2001; Linder & Jenkins, 2003; Zhang et al., 2007; Guzzo et al., 2008; Jain & Zhang, 2008; Wang, 2008; Reyes et al., 2010; Li et al., 2011; Clifton et al., 2012; Reid et al., 2012; Tojeiro et al., 2012; Weinberg et al., 2013; Joyce et al., 2015; Koyama, 2016), particularly given its sensitivity to density fluctuations at horizon scales (Zhang & Stebbins, 2011; Zhang & Johnson, 2015), which provides insights into the cosmic origin.

In a variety of circumstances, the measured velocity statistics may be given different weights. To illustrate, the kinetic Sunyaev-Zel’dovich (kSZ) effect is proportional to the gas momentum, which represents the gas density-weighted velocity. Conversely, the volume-weighted velocity power spectrum can be inferred from redshift-space distortions (RSD) by comparing the measured RSD power spectrum with theoretical models. In this approach, the theory of RSD includes the modelling of volume-weighted velocity statistics (Kaiser, 1987; Scoccimarro, 2004; Taruya et al., 2010; Zhang et al., 2013).

In the context of cosmological studies, volume-weighted velocity statistics are generally considered more appropriate than density-weighted ones, although density-weighted statistics are preferable when utilizing the distribution function approach (Seljak & McDonald, 2011; Okumura et al., 2012) and the streaming model (Peebles, 1980; White et al., 2015). In contrast to density-weighted statistics, volume-weighted statistics are not affected by uncertainties in galaxy density bias. Nevertheless, the measurement of volume-weighted velocity statistics presents a number of challenges, both in observational studies and in numerical simulations. Velocities in regions containing galaxies can be determined; however, velocities in regions lacking galaxies (or, in simulations, lacking particles) cannot be sampled, even though the underlying velocity field there is typically nonzero. This sampling artifact inevitably distorts the measurement of volume-weighted velocity statistics, with the impact increasing as the particle number density decreases. Additionally, the spatial distribution of halos/galaxies is correlated with the velocity field being measured, since the velocity field is itself correlated with the large-scale structure (LSS). Consequently, the sampling of a volume-weighted velocity field is biased, which results in a biased measurement of volume-weighted velocity statistics (Bernardeau & van de Weygaert, 1996; Bernardeau et al., 1997; Schaap & van de Weygaert, 2000; Zheng et al., 2013, 2015a, 2015b; Zhang et al., 2015; Jennings et al., 2015).

It has been observed that the velocity sampling artifact grows as the sample number density, $\rho_{h}$, decreases. The effect is already discernible for $\rho_{h}\sim 1~(h/{\rm Mpc})^{3}$ (Zheng et al., 2013). For sparse samples, such as massive halos, the problem is considerably more pronounced. For a sample density of $\rho_{h}=10^{-3}~(h/{\rm Mpc})^{3}$, the induced error in the velocity power spectrum reaches approximately 10%, even at scales as large as $k=0.2~h/\mathrm{Mpc}$ (Zheng et al., 2015b).

Several pioneering studies have been devoted to solving this long-standing problem. New velocity assignment methods have been developed, including the Voronoi tessellation method (Bernardeau & van de Weygaert, 1996), the Delaunay tessellation method (Schaap & van de Weygaert, 2000), the nearest-particle method (Zheng et al., 2013; Koda et al., 2016), the Kriging method (Yu et al., 2015, 2017), and a hybrid approach to determine the volume-weighted halo velocity bias (Chen et al., 2018). A theoretical model of the sampling artifact has been constructed (Zhang et al., 2015), validated in simulated DM velocity fields (Zheng et al., 2015b), and subsequently employed to correct the sampling artifact in the halo velocity field (Zheng et al., 2015a).

Sampling artifacts can significantly impact cosmological constraints when velocity power spectra are derived from sparse galaxy samples. This issue also affects RSD, a key cosmological probe used by Stage IV surveys such as DESI (Levi et al., 2013), EUCLID (Laureijs et al., 2011), LSST (LSST Science Collaboration et al., 2009), WFIRST (Spergel et al., 2015), and CSST (Zhan, 2011). These surveys are expected to measure the volume-weighted velocity power spectrum through RSD with a statistical precision of approximately 1% at $k=0.1~h/{\rm Mpc}$.

Both simulations and observations present challenges, prompting the question of whether it is possible to accurately infer the density-weighted or volume-weighted velocity field directly from the spatial distribution of halos. As this is a field-to-field mapping problem, it is non-trivial and complex. To date, there is no theoretical framework that can completely solve it. Given the large number of degrees of freedom in the field to be reconstructed, traditional methods are either invalid or inefficient for this purpose.

The recent advancements in machine learning algorithms, particularly those based on deep neural networks, present a significant opportunity to extract valuable insights from complex data. In more recent years, deep learning-based techniques have been applied with considerable success to almost all areas of cosmology and astrophysics (Mehta et al., 2019; Jennings et al., 2019; Carleo et al., 2019; Ntampaka et al., 2019), including weak gravitational lensing (Schmelzle et al., 2017; Gupta et al., 2018; Springer et al., 2018; Fluri et al., 2019; Jeffrey et al., 2019; Merten et al., 2019; Peel et al., 2019; Tewes et al., 2019), the cosmic microwave background (Caldeira et al., 2018; Rodriguez et al., 2018; Perraudin et al., 2019; Münchmeyer & Smith, 2019; Mishra et al., 2019), LSS for estimating cosmological parameters from the distribution of matter (Ravanbakhsh et al., 2017; Lucie-Smith et al., 2018; Pan et al., 2020; Lazanu, 2021), identifying DM halos, and reconstructing the initial conditions of the universe using machine learning (Modi et al., 2018; Berger & Stein, 2019; Lucie-Smith et al., 2019; Ramanah et al., 2019). 
Further applications include mapping coarse cosmological fields to fine details (He et al., 2019; Li et al., 2021), extracting line intensity maps (Pfeffer et al., 2019), removing foregrounds in 21cm intensity mapping (Makinen et al., 2021), augmenting N-body simulations with gas (Tröster et al., 2019), mapping the 3D galaxy distribution in hydrodynamic simulations to its underlying DM distribution (Zhang et al., 2019), modeling small-scale galaxy formation physics in large cosmological volumes (Ni et al., 2021), reconstructing baryon acoustic oscillations (Mao et al., 2020) and the initial linear-regime matter density field (Shallue & Eisenstein, 2023), searching for gravitational waves (Dreissigacker et al., 2019; Gebhard et al., 2019), studying cosmic reionization (La Plante & Ntampaka, 2018; Gillet et al., 2019; Hassan et al., 2019b; Chardin et al., 2019; Hassan et al., 2019a), and analyzing supernovae (Lochner et al., 2016; Moss, 2018; Ishida et al., 2019; Li et al., 2019; Muthukrishna et al., 2019).

The pioneering work by Wu et al. (2021) demonstrates that a UNet network can reconstruct the nonlinear velocity field of DM particles with high precision down to a scale of 2 $h^{-1}\rm Mpc$, showing a notable advantage in reconstructing the cosmic velocity field at nonlinear scales. However, reconstructing the velocity field from observed halos rather than particles is more challenging. As demonstrated by Hong et al. (2021) and Ganeshaiah Veena et al. (2023), the reconstruction of a density field or peculiar velocity field is possible from galaxy distributions. More recently, Wu et al. (2023) proposed a method for reconstructing the various peculiar velocity fields down to $k\lesssim 1.1$ $h/\rm Mpc$ from the redshift-space distribution of DM halos.

However, none of these studies address the inference of velocity fields from the spatial distribution of DM halos in redshift space. Consequently, in this study, we propose a UNet model dedicated to reconstructing the real-space density, velocity, and momentum fields from the redshift-space spatial distribution of DM halos (and subhalos). The latter two essentially correspond to the volume-weighted and density-weighted velocity fields. This study employs a simulation data set to investigate the effectiveness of a UNet-structured neural network in velocity reconstruction. The data set is introduced in Sect. 2, where the architecture of the neural network and the training procedure are also detailed. The results of the network are presented in Sect. 3, and finally, the conclusion and discussion are presented in Sect. 4.

2 Methods

2.1 Dataset

To train and validate our deep learning model, we employed the CosmicGrowth simulations (Jing, 2018) as our training and test datasets. The CosmicGrowth simulation is a valuable tool for investigating the growth and evolution of DM halos and subhalos in the universe. The simulation was conducted in a box with a side length of 1.2 $h^{-1}{\rm Gpc}$, containing $2048^{3}$ DM particles with a mass resolution of $M_{\rm DM}=3.8\times 10^{9}~h^{-1}M_{\odot}$.

In this study, we adopt the standard flat $\Lambda$CDM cosmological model, which is consistent with the WMAP results (Komatsu et al., 2011; Hinshaw et al., 2013). The cosmological parameters adopted are as follows: $\{\Omega_{c},\Omega_{b},h,n_{s},\sigma_{8}\}=\{0.2235,0.0445,0.71,0.968,0.83\}$.

To construct the halo/subhalo catalog in redshift space at $z=0.59$, we set the number density of the input sample to $3\times 10^{-3}~{\rm Mpc}^{-3}h^{3}$. This value is consistent with current spectroscopic observations. The relationship between the redshift-space position, denoted by $\bm{s}$, and the real-space position, denoted by $\bm{r}$, after accounting for the RSD effect is given by

$\bm{s}=\bm{r}+\frac{\bm{v}\cdot\hat{z}}{aH(a)}\hat{z}\,,$ (1)

where $\bm{v}$ denotes the peculiar velocity, $a$ denotes the scale factor, $H(a)$ denotes the Hubble parameter, and $\hat{z}$ represents the unit vector along the line of sight (LoS). The density and velocity fields were computed from the catalog samples, with the halos assigned to a $512^{3}$ mesh using the CIC (Cloud-in-Cell) scheme, giving a cell size of about $2.35~h^{-1}{\rm Mpc}$.
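As a minimal sketch of Eq. 1 and of the CIC assignment, the following NumPy code moves a catalog into redshift space and deposits it on a periodic mesh. The function names, the argument names, the choice of the third axis as the LoS, and the convention that $H(a)$ is supplied in $h$ km/s/Mpc (so that positions stay in ${\rm Mpc}/h$) are assumptions made for illustration; they are not taken from the CosmicGrowth pipeline.

```python
import numpy as np


def to_redshift_space(pos, vel, a, hubble, box_size, los_axis=2):
    """Apply Eq. 1: shift positions along the line of sight by v_los / (a H).

    pos      : (N, 3) comoving positions in Mpc/h
    vel      : (N, 3) peculiar velocities in km/s
    hubble   : H(a) in h km/s/Mpc (assumed unit convention)
    box_size : side length of the periodic box in Mpc/h
    """
    s = np.array(pos, dtype=float)  # copy, so the input is left untouched
    vel = np.asarray(vel, dtype=float)
    s[:, los_axis] += vel[:, los_axis] / (a * hubble)
    s[:, los_axis] %= box_size  # periodic wrap-around
    return s


def cic_assign(pos, box_size, n_grid, weights=None):
    """Cloud-in-Cell deposit of particles onto a periodic n_grid^3 mesh."""
    pos = np.asarray(pos, dtype=float)
    if weights is None:
        weights = np.ones(len(pos))
    grid = np.zeros((n_grid, n_grid, n_grid))
    # Position in cell units; cell centres sit at half-integer coordinates.
    x = pos / box_size * n_grid - 0.5
    i0 = np.floor(x).astype(int)
    frac = x - i0  # offset from the lower neighbouring cell centre, in [0, 1)
    for dx in (0, 1):
        wx = frac[:, 0] if dx else 1.0 - frac[:, 0]
        for dy in (0, 1):
            wy = frac[:, 1] if dy else 1.0 - frac[:, 1]
            for dz in (0, 1):
                wz = frac[:, 2] if dz else 1.0 - frac[:, 2]
                idx = ((i0[:, 0] + dx) % n_grid,
                       (i0[:, 1] + dy) % n_grid,
                       (i0[:, 2] + dz) % n_grid)
                # np.add.at accumulates correctly when indices repeat.
                np.add.at(grid, idx, weights * wx * wy * wz)
    return grid
```

Each particle spreads its weight over the eight nearest cells, so the total deposited weight equals the sum of the input weights.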

2.2 Training process

Figure 1: Training scheme for reconstructing the DM velocity field in real space. The reconstruction process involves two major steps. The first step is to reconstruct the DM density field, $\rho_{\rm DM}$, from the sparse DM halo density field, $\rho_{s}$, in redshift space. To achieve this, we trained a UNet model to predict $\rho_{\rm DM}$. Subsequently, we trained another UNet model that takes as input the concatenated data from the linear velocity prediction. This model is then used to reconstruct the direction and magnitude fields of velocity. This network structure is also applicable when the final output is the real-space momentum field.

The objective is to construct a mapping based on the UNet method, which will take as its input a sparse halo number density field in redshift space (to provide an accurate representation of real observations) and output the real-space fields, including the DM density field and DM velocity/momentum fields.

For clarification, let us first define two density fields that will be used during the UNet reconstruction process: 1) the redshift-space sparse halo number density field, $\rho_{s}$ ; 2) the real-space DM density field, $\rho_{\rm DM}$ .

Moreover, the real-space comoving velocity field is denoted by $\bm{v}$, with its magnitude represented by $|\bm{v}|$ and its direction by $\hat{v}$. The momentum field in real space is defined as the density-weighted velocity, given by $\bm{m}=(1+\delta_{\rm DM})\cdot\bm{v}$, where $\delta_{\rm DM}\equiv\rho_{\rm DM}/\bar{\rho}_{\rm DM}-1$ is the DM density field relative to its mean.
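The definitions above translate directly into array operations. The sketch below assumes gridded fields, with the density stored as an array of shape (N, N, N) and vector fields as (3, N, N, N); the function names and the small `eps` guard against zero-magnitude cells are illustrative choices, not part of the original pipeline.

```python
import numpy as np


def momentum_field(rho_dm, vel):
    """Return m = (1 + delta_DM) * v for a gridded density and velocity field."""
    delta_dm = rho_dm / rho_dm.mean() - 1.0
    return (1.0 + delta_dm)[np.newaxis] * vel


def split_magnitude_direction(vec, eps=1e-12):
    """Decompose a (3, ...) vector field into magnitude and unit direction."""
    mag = np.sqrt(np.sum(vec**2, axis=0))
    direction = vec / np.maximum(mag, eps)  # eps avoids division by zero
    return mag, direction
```

Multiplying the two outputs of `split_magnitude_direction` recovers the original vector field, which mirrors how the network outputs are recombined.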

The reconstruction process is divided into two principal steps, as illustrated in Fig. 1; the UNet architecture itself is detailed in Fig. 2. For illustration, the velocity field is taken as the example below. The same methodology is employed for the momentum field.

In what follows, we will provide a comprehensive description of this process.

1) The mapping from $\rho_{s}(\bm{x})$ to $\rho_{\rm DM}(\bm{x})$ . The initial step is the reconstruction of the distribution of DM in real space, $\rho_{\rm DM}$ , using the sparse halo number density field in redshift space, $\rho_{s}$ . Each channel of the input contains halos within a specific mass range. The DM halos (and subhalos) are catalogued in descending order of mass and divided into four mass intervals, with boundaries defined by the logarithmic mass ratio as follows: $\log_{10}(M/M_{\odot})\in[15.01,13.30,12.56,12.31,12.17]$ , corresponding to four number densities of halos $\rho_{h}\in[0.0001,0.001,0.002,0.003]~{}(h/{\rm Mpc})^{3}$ . Note that the mass information may be estimated to a reasonable degree of accuracy using empirical formulas, which allow for the inference of the masses of these objects from the observed apparent magnitudes of galaxies. The target data is the three-dimensional number density field of DM in real space. Our UNet model is constructed for the purpose of reconstructing the real-space $\rho_{\rm DM}$ .

Two mass-weighting schemes are adopted for comparison: i) “with $M_{\rm halo}$ weighting”: in this scheme, DM halos are divided into the four intervals based on mass, and each interval is assigned a weight proportional to its mass.

ii) “without $M_{\rm halo}$ weighting”: in this scheme, DM halos are also divided into the four intervals based on mass, but no weights are assigned to these intervals.

The first scheme assumes an ideal scenario where we have complete knowledge of the mass information of DM halos. In contrast, the second one better reflects current observational capabilities, allowing only rough estimates of the mass distribution of galaxies, and thus corresponds more closely with actual observations. To assess the impact of mass information accuracy, results from both schemes will be presented simultaneously.

2) The mapping from a combination of $\rho_{s}(\bm{x})$ and $\rho_{\rm DM}(\bm{x})$ to $\bm{v}(\bm{x})$ or $\bm{m}(\bm{x})$ is achieved using two UNet models. These models are constructed to reconstruct the direction and magnitude of the DM velocity and momentum fields in real space. The UNet models utilize $\rho_{s}$ , $\rho_{\rm DM}$ , and the linear velocity in redshift space, $\bm{v}_{\rm lin}$ , derived from linear perturbation prediction as inputs.
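The mass-binned input channels of step 1) can be sketched as follows. This is a simplified illustration: it uses nearest-grid-point counts from `np.histogramdd` instead of CIC, and it interprets the “with $M_{\rm halo}$ weighting” scheme as multiplying each channel by the mean halo mass of its interval (in units of $10^{12}M_{\odot}$). That interpretation, the function name, and the channel ordering are assumptions, since the text does not specify these details.

```python
import numpy as np

# Mass-interval boundaries in log10(M / M_sun), as quoted in the text.
LOG_MASS_EDGES = (12.17, 12.31, 12.56, 13.30, 15.01)


def halo_channels(pos, log_mass, box_size, n_grid, mass_weighted=False):
    """Build a (4, n_grid, n_grid, n_grid) input with one channel per mass bin.

    pos      : (N, 3) halo positions (redshift space), in [0, box_size]
    log_mass : (N,) log10 halo masses
    """
    pos = np.asarray(pos, dtype=float)
    log_mass = np.asarray(log_mass, dtype=float)
    edges = np.linspace(0.0, box_size, n_grid + 1)
    channels = []
    for lo, hi in zip(LOG_MASS_EDGES[:-1], LOG_MASS_EDGES[1:]):
        sel = (log_mass >= lo) & (log_mass < hi)
        counts, _ = np.histogramdd(pos[sel], bins=(edges, edges, edges))
        if mass_weighted and sel.any():
            # Assumed weighting: mean mass of the interval, in 1e12 M_sun.
            counts = counts * np.mean(10.0 ** (log_mass[sel] - 12.0))
        channels.append(counts)
    # Order channels from the most massive interval to the least massive.
    return np.stack(channels[::-1])
```

Without weighting, every channel is a plain number-count field, so only the coarse mass ranking of each halo enters the input.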

2.3 Neural network model

Following the neural network model proposed by Wu et al. (2021), we employed the UNet architecture for the construction of our model. The architecture of our neural network and its components are depicted in Fig. 2. The input consists of data blocks with an arbitrary number of channels, with each channel corresponding to a specific mass range of $\rho_{s}$.

Given that the velocity field is decomposed into the velocity magnitude and direction, we constructed two structurally similar neural networks to handle them separately. The networks conclude with an output layer comprising 1+3 channels. The three channels correspond to the components of the velocity direction, while the fourth channel corresponds to the velocity magnitude. The final step in the reconstruction process is the combination of the results from all output channels, which ultimately yields the complete reconstruction of the three-dimensional velocity field.

The notations in Fig. 2, “ ${n_{\rm in}}$ ” and “ ${n_{\rm out}}$ ”, represent the number of input and output channels, respectively. These values may vary across different reconstruction tasks, as shown in Tab. 1.

Table 1: Input and output channel numbers based on different reconstructed fields.

fields	${n_{\rm in}}$	${n_{\rm out}}$
DM density ( $\rho_{\rm DM}$ )	4	1
magnitude of DM velocity ( $\|\bm{v}\|$ )	8	1
direction of DM velocity ( $\hat{v}$ )	8	3
magnitude of DM momentum ( $\|\bm{m}\|$ )	8	1
direction of DM momentum ( $\hat{m}$ )	8	3

The color blocks represent the various operations performed by the neural network, with arrow lines connecting the input and output. The dimensions of the input, intermediate, and output fields (channels $\times$ spatial pixels) are defined, along with the size and number of labeled 3D convolutional kernels (“conv”).

Given the constraints of GPU memory, training time, and model size, we divided a large box with a physical scale of 1200 ${\rm Mpc}/h$ into $4^{3}$ small boxes with a physical scale of 300 ${\rm Mpc}/h$ , each containing $128^{3}$ grid points in the CIC interpolation scheme. A total of 32 small boxes were utilized for training and 32 for validation purposes. The neural network model receives inputs and generates outputs targeted at these small boxes, which consist of $128^{3}$ pixels each.
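The sub-box decomposition can be illustrated with a reshape, as in the sketch below. The function name and the ordering of the returned blocks are illustrative assumptions; with a $512^{3}$ mesh and `n_split=4` it yields the $4^{3}=64$ blocks of $128^{3}$ cells described above.

```python
import numpy as np


def split_into_subboxes(field, n_split):
    """Split a cubic (n, n, n) field into n_split^3 equal cubic sub-boxes.

    Returns an array of shape (n_split^3, m, m, m) with m = n // n_split.
    """
    n = field.shape[0]
    m = n // n_split
    blocks = field.reshape(n_split, m, n_split, m, n_split, m)
    # Bring the three block indices to the front, then flatten them.
    blocks = blocks.transpose(0, 2, 4, 1, 3, 5)
    return blocks.reshape(n_split**3, m, m, m)
```

For multi-channel inputs the same function can be applied channel by channel.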

During the testing phase, we applied the neural network to large boxes, each consisting of $256^{3}$ pixels, as the input data for the model. The output also contains $256^{3}$ pixels, corresponding to a physical size of 600 ${\rm Mpc}/h$. Using the neural network on large boxes helps us eliminate less accurate outputs at the edges, which can be a significant issue when applying the UNet model to smaller boxes. The $k_{\rm min}$ for the small training boxes is 0.026 $h/{\rm Mpc}$, while the $k_{\rm min}$ for validation with large boxes is 0.013 $h/{\rm Mpc}$.

Some points require further clarification, which will be detailed below.

1) To modify the field size, the $\emph{stride}=2$ parameter can be employed, which allows for a reduction or increase in the field size by a factor of two.
2) The “init” 3D convolutional layer enables the network to rapidly learn large-scale information due to its sufficiently large receptive field.
3) The convolutional layer, which is situated at the output of the network, serves to increase the network’s complexity. In order to prevent overfitting, a dropout layer is incorporated at the end of the convolutional layer. This layer also alters the number of channels.
4) The incorporation of batch normalization (BN) layers into the neural network facilitates the acceleration of training convergence and the prevention of overfitting. Additionally, the implementation of rectified linear units (ReLU) as activation layers subsequent to convolutional layers enhances the network’s nonlinearity.
5) The trained UNet enables the prediction of $\rho_{\rm DM}$ by inputting $\rho_{s}$. This, in turn, enables the prediction of the real-space fields, including the velocity ( $\bm{v}$ ) and momentum ( $\bm{m}$ ) fields, and the associated clustering statistics, which can be directly measured.
6) Additionally, due to the constraints of the limited dimensions of the training box, the model struggles to learn large-scale velocity modes. To mitigate potential bias from the small box dataset, the velocity field predicted by traditional linear reconstruction is used as one of the UNet inputs to reconstruct the velocity or momentum field. The linear velocity and momentum predictions in redshift space are determined through

$\bm{v}_{\rm lin}(\bm{k})=afH\frac{i\bm{k}}{k^{2}}\frac{\delta_{s}(\bm{k})}{\beta}\,,$ (2)

and

$\bm{m}_{\rm lin}=(1+\frac{\delta_{s}}{\beta})\bm{v}_{\rm lin}\,.$ (3)

Here $\delta_{s}$ is divided by the linear bias $\beta$, with $\beta=1.85$ for the “without $M_{\rm halo}$ weighting” scheme and $\beta=2.57$ for the “with $M_{\rm halo}$ weighting” scheme. The bias is determined from the ratio of the halo power spectrum to the DM power spectrum, averaged over the $k$-range of interest, i.e.,

$\beta^{2}=\frac{\int_{k<0.3}\frac{P_{\rm halo}(k)}{P_{\rm DM}(k)}dk}{\int_{k<0.3}dk}\,.$ (4)
7) The Tree-structured Parzen Estimator (TPE) algorithm (Bergstra et al., 2011), based on Bayesian optimization, is employed to optimize hyperparameters, using the Python package Optuna (https://optuna.readthedocs.io; Akiba et al., 2019). The hyperparameters that are optimized include: 1) the learning rate; 2) the dropout percentage; 3) the number of training data sets; 4) the number of channels in the intermediate layer; and 5) the training batch size. The hyperparameters are tuned by searching for values that minimize the validation loss of the model. At least five experiments were conducted for each set of hyperparameter values in order to evaluate their impact on the performance.
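The linear predictions of Eqs. 2 to 4 in item 6) can be sketched in NumPy as follows. The code assumes a real, cubic, periodic overdensity grid and uses NumPy's FFT convention, for which a spatial gradient corresponds to multiplication by $i\bm{k}$, in agreement with Eq. 2. The function names, the argument names, and the trapezoidal integration in `effective_bias` are illustrative assumptions.

```python
import numpy as np


def linear_velocity(delta_s, box_size, a, f, hubble, beta):
    """Eq. 2: v_lin(k) = a f H (i k / k^2) delta_s(k) / beta.

    delta_s : (n, n, n) redshift-space halo overdensity on a periodic grid
    Returns an array of shape (3, n, n, n).
    """
    n = delta_s.shape[0]
    k1d = 2.0 * np.pi * np.fft.fftfreq(n, d=box_size / n)
    kx, ky, kz = np.meshgrid(k1d, k1d, k1d, indexing="ij")
    k2 = kx**2 + ky**2 + kz**2
    k2[0, 0, 0] = 1.0  # avoid 0/0; the k = 0 mode is zeroed below
    amp = a * f * hubble * 1j * np.fft.fftn(delta_s) / (beta * k2)
    amp[0, 0, 0] = 0.0  # a uniform offset carries no velocity
    return np.stack([np.fft.ifftn(amp * ki).real for ki in (kx, ky, kz)])


def linear_momentum(delta_s, v_lin, beta):
    """Eq. 3: m_lin = (1 + delta_s / beta) * v_lin."""
    return (1.0 + delta_s / beta)[np.newaxis] * v_lin


def effective_bias(k, p_halo, p_dm, k_max=0.3):
    """Eq. 4: beta^2 is the k-average of P_halo / P_DM over k < k_max."""
    k = np.asarray(k, dtype=float)
    ratio = np.asarray(p_halo, dtype=float) / np.asarray(p_dm, dtype=float)
    sel = k < k_max
    ks, rs = k[sel], ratio[sel]
    # Trapezoidal rule for the numerator; the denominator is the k-range.
    numerator = np.sum(0.5 * (rs[1:] + rs[:-1]) * np.diff(ks))
    return np.sqrt(numerator / (ks[-1] - ks[0]))
```

For a single plane wave $\delta_{s}=\cos(k_{0}x)$ the first function returns $v_{x}=-\frac{afH}{\beta k_{0}}\sin(k_{0}x)$, whose divergence is $-afH\delta_{s}/\beta$ as expected from linear theory.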

2.4 Loss function

The objective of the training for UNet is to minimize the loss between the prediction for a given field and the simulation truth for each voxel.

1) For training our UNet model to reconstruct the real-space DM density field $\rho_{\rm DM}$, we use the Mean Squared Error (MSE) loss function,

$\mathcal{L}=\frac{1}{N}\sum_{i=1}^{N}\big{(}\rho_{i}-\rho^{\rm true}_{i}\big{)}^{2}\,,$ (5)

where the index $i$ runs over all $N$ pixels of a field.

2) To account for the contributions of the velocity magnitude ( $v\equiv|\bm{v}|$ ) and the velocity direction (unit vector $\hat{v}\equiv\bm{v}/|\bm{v}|$ ), we choose the following two-term loss function,

$\mathcal{L}=\frac{1}{N}\sum_{i=1}^{N}\left[\frac{1}{4}(|v_{i}|-|v_{i}^{\rm true}|)^{2}+\frac{3}{4}\left(1-\cos\phi_{i}\right)\right]\,,$ (6)

where $\cos\phi_{i}\equiv\hat{v}_{i}\cdot\hat{v}_{i}^{\rm true}$, and the index $i$ denotes the $i$-th pixel. The first term is responsible for $|\bm{v}|$ and corresponds to the MSE loss, which is equivalent to the maximum likelihood solution under the assumption of Gaussian errors with constant variance. The second term measures the deviation between the reconstructed and true values of $\hat{v}$. The coefficients of these two terms can be considered as normalization factors and are determined by the number of channels: 1 for the magnitude $|\bm{v}|$ and 3 for the three direction components, i.e., $\hat{v}_{x}$, $\hat{v}_{y}$, and $\hat{v}_{z}$. Empirically, this loss function has proven to be stable and effective during our training process, yielding good results in velocity (momentum) reconstruction.

3) We trained our UNet using the Adam optimizer (Kingma & Ba, 2014), a popular algorithm for training deep neural networks. This algorithm iteratively reduces the training loss by computing its gradient with respect to the model parameters and updating them with steps scaled by running estimates of the first and second moments of the gradient. From Fig. 3, it can be seen that both the velocity model and the momentum model converge after approximately 1000 epochs of training.
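The loss functions of Eqs. 5 and 6 can be written compactly as below. This is a plain NumPy sketch for evaluating the losses on arrays, not the training code itself (which would use an automatic-differentiation framework); the function names and the (3, ...) layout of the direction arrays are assumptions, and the direction inputs are assumed to be unit vectors.

```python
import numpy as np


def mse_loss(pred, true):
    """Eq. 5: mean squared error over all pixels."""
    return np.mean((pred - true) ** 2)


def velocity_loss(mag_pred, dir_pred, mag_true, dir_true):
    """Eq. 6: 1/4 * squared magnitude error + 3/4 * (1 - cos phi), averaged.

    mag_* : arrays of shape (...), velocity magnitudes
    dir_* : arrays of shape (3, ...), unit direction vectors
    """
    cos_phi = np.sum(dir_pred * dir_true, axis=0)
    per_pixel = 0.25 * (mag_pred - mag_true) ** 2 + 0.75 * (1.0 - cos_phi)
    return np.mean(per_pixel)
```

The direction term vanishes for perfectly aligned vectors and reaches its maximum of $3/2$ per pixel for anti-aligned ones.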

3 Results

This section presents the performance assessment of the trained UNet model. The results are based on predictions for 25 large boxes in the validation dataset that were not used in the model training or in the optimization of the model structure and training parameters. Each large box has a physical volume of $600^{3}$ $({\rm Mpc}/h)^{3}$ with $256^{3}$ pixels. We chose to test on the larger boxes because measurements on them exhibit better statistical behavior, which reduces the sampling variance and thus the statistical errors.

3.1 Visual inspection and point-wise comparison

The main objective of this study is to compare the UNet-predicted DM velocity/momentum in real space with the ground truth. In order to facilitate the analysis of the data, it is first necessary to describe the statistics that will be employed throughout the study. The two-point correlation function is a commonly used statistical measure for characterizing a density field,

$\xi(\bm{r})=\langle\delta(\bm{x})\delta(\bm{x}+\bm{r})\rangle\,,$ (7)

where $\delta(\bm{x})$ is the density contrast field at position $\bm{x}$ and $\bm{r}$ is the separation vector between two points. The ensemble mean, represented by $\langle\cdot\rangle$, is computed in practice as a spatial average over all points $\bm{x}$. The power spectrum of $\delta(\bm{x})$ is the Fourier transform of the correlation function, $\xi(\bm{r})$:

$P(\bm{k})=\int\xi(\bm{r})\mathrm{e}^{i\bm{k}\cdot\bm{r}}\mathrm{d}^{3}\bm{r}\,,$ (8)

where the three-dimensional wavevector of the plane wave, denoted by $\bm{k}$ , has magnitude $k\equiv|\bm{k}|$ , related to the wavelength $\lambda$ by $k=2\pi/\lambda$ .
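In practice, the power spectrum of a gridded periodic field is usually estimated from its FFT rather than from Eq. 7 directly. The sketch below uses the common estimator $P(k)=|\delta(\bm{k})|^{2}/V$ averaged in spherical shells; the function name, the linear binning, and the omission of mass-assignment window and shot-noise corrections are simplifying assumptions.

```python
import numpy as np


def power_spectrum(delta, box_size, n_bins=16):
    """Shell-averaged power spectrum of a periodic (n, n, n) field.

    Returns (k_mean, p_mean) for the non-empty bins between roughly the
    fundamental mode and the Nyquist frequency.
    """
    n = delta.shape[0]
    cell = box_size / n
    # Scale the discrete FFT so that it approximates the continuous transform.
    delta_k = np.fft.fftn(delta) * cell**3
    pk3d = (np.abs(delta_k) ** 2 / box_size**3).ravel()
    k1d = 2.0 * np.pi * np.fft.fftfreq(n, d=cell)
    kx, ky, kz = np.meshgrid(k1d, k1d, k1d, indexing="ij")
    kmag = np.sqrt(kx**2 + ky**2 + kz**2).ravel()
    k_fund = 2.0 * np.pi / box_size
    k_nyq = np.pi / cell
    # Start below the fundamental mode so that it is safely included,
    # while the k = 0 mode is excluded.
    edges = np.linspace(0.5 * k_fund, k_nyq, n_bins + 1)
    counts, _ = np.histogram(kmag, bins=edges)
    k_sum, _ = np.histogram(kmag, bins=edges, weights=kmag)
    p_sum, _ = np.histogram(kmag, bins=edges, weights=pk3d)
    good = counts > 0
    return k_sum[good] / counts[good], p_sum[good] / counts[good]
```

With this normalization, uncorrelated noise of unit variance gives a flat spectrum equal to the cell volume, which provides a quick sanity check.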

As with the scalar field $\delta$ , we can also define power spectra for various velocity fields of interest. The velocity field, $\bm{v}$ , is completely described by its divergence, $\theta\equiv\nabla\cdot\bm{v}$ , and its vorticity, $\bm{\omega}=\nabla\times\bm{v}$ , via the Helmholtz decomposition. In Fourier space, they become purely radial and transversal velocity modes, respectively, defined by $\theta(\bm{k})=i\bm{k}\cdot\bm{v}(\bm{k})$ and $\bm{\omega}(\bm{k})=i\bm{k}\times\bm{v}(\bm{k})$ . The power spectra of the velocity, divergence, vorticity, and velocity magnitude are given by

$\langle\theta(\bm{k})\theta^{*}(\bm{k}^{\prime})\rangle=(2\pi)^{3}P_{\theta}(k)\delta(\bm{k}-\bm{k}^{\prime})\,,$

$\langle\omega^{i}(\bm{k})\omega^{*j}(\bm{k}^{\prime})\rangle=(2\pi)^{3}\frac{1}{2}\bigg(\delta^{ij}-\frac{k^{i}k^{j}}{k^{2}}\bigg)P_{\omega}(\bm{k})\delta(\bm{k}-\bm{k}^{\prime})\,,$

$\langle\bm{v}(\bm{k})\cdot\bm{v}^{*}(\bm{k}^{\prime})\rangle=(2\pi)^{3}P_{v}(\bm{k})\delta(\bm{k}-\bm{k}^{\prime})\,,$ (9)

where indices $i$ and $j$ denote the components in the Fourier space coordinates.
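The Fourier-space definitions $\theta(\bm{k})=i\bm{k}\cdot\bm{v}(\bm{k})$ and $\bm{\omega}(\bm{k})=i\bm{k}\times\bm{v}(\bm{k})$ can be evaluated on a periodic grid as sketched below. The function name and the (3, n, n, n) array layout are assumptions for illustration.

```python
import numpy as np


def divergence_and_vorticity(vel, box_size):
    """Spectral divergence and curl of a periodic (3, n, n, n) vector field."""
    n = vel.shape[1]
    k1d = 2.0 * np.pi * np.fft.fftfreq(n, d=box_size / n)
    k = np.stack(np.meshgrid(k1d, k1d, k1d, indexing="ij"))
    v_k = np.fft.fftn(vel, axes=(1, 2, 3))
    theta_k = 1j * np.sum(k * v_k, axis=0)      # i k . v
    omega_k = 1j * np.cross(k, v_k, axis=0)     # i k x v
    theta = np.fft.ifftn(theta_k).real
    omega = np.fft.ifftn(omega_k, axes=(1, 2, 3)).real
    return theta, omega
```

A pure gradient flow has vanishing vorticity and a pure shear flow has vanishing divergence, so the two outputs separate the radial and transverse modes of the Helmholtz decomposition.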

According to the linear perturbation theory, the continuity equation leads to the following relationship: $\theta=-\mathcal{H}f\delta$ , where $\mathcal{H}=aH$ is the conformal Hubble parameter, $a$ represents the cosmic scale factor, $f$ is the linear growth rate, defined as $f=d\ln D/d\ln a$ , with $D$ being the linear density growth factor. Under the $\Lambda$ CDM model, the growth rate is approximately given by $f\approx\Omega_{m}^{0.55}$ (Linder, 2005).
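As a worked example of the growth-rate approximation, the sketch below evaluates $f\approx\Omega_{m}(z)^{0.55}$, where $\Omega_{m}(z)$ is the matter density parameter at redshift $z$. It assumes a flat $\Lambda$CDM background with radiation neglected, and takes $\Omega_{m,0}=\Omega_{c}+\Omega_{b}=0.268$ from the parameters quoted in Sect. 2.1; the function name is illustrative.

```python
def growth_rate(omega_m0, z, gamma=0.55):
    """Approximate linear growth rate f = Omega_m(z)^gamma in flat LCDM."""
    matter = omega_m0 * (1.0 + z) ** 3
    omega_m_z = matter / (matter + 1.0 - omega_m0)
    return omega_m_z**gamma


# Simulation cosmology (Omega_c + Omega_b) at the snapshot redshift z = 0.59.
f_snapshot = growth_rate(0.2235 + 0.0445, 0.59)
```

For these parameters the growth rate is about 0.75, noticeably larger than its present-day value because matter dominates more strongly at $z=0.59$.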

Fig. 4 presents a comparison between the velocity magnitude and direction reconstructed by UNet and the ground truth values, using the “without $M_{\rm halo}$ weighting” scheme. The reconstructed and true velocity magnitudes are $297.22\pm 135.31$ and $294.60\pm 131.02$ km/s, respectively. Additionally, the reconstructed momentum magnitude is $315.17\pm 209.29$ km/s, while the true one is $309.50\pm 213.43$ km/s. The reconstructed mean values are consistent with the true values across all slices, with only small deviations that are negligible compared with the statistical uncertainties.

Moreover, by defining $\cos\phi_{i}\equiv\hat{v}_{i}\cdot\hat{v}^{\rm true}_{i}$, where $\hat{v}_{i}$ is the velocity direction reconstructed by the model and $\hat{v}^{\rm true}_{i}$ is the true velocity direction, one can measure the angular deviation between the two. The tests conducted on these slices indicate that $1-\cos\phi_{i}$ is $0.01\pm 0.06$ for velocity and $0.04\pm 0.11$ for momentum. This implies that the directions of the reconstructed velocity fields are nearly identical to those of the true velocity field, and that our model achieves high reconstruction accuracy.

Furthermore, Fig. 5 illustrates a comparison between the reconstructed velocity/momentum divergence and the true velocity/momentum divergence. The divergence of velocity reconstructed by UNet and the true value are $0.40\pm 18.49$ and $0.92\pm 17.55$ km/s, respectively. The divergence of momentum reconstructed by UNet and the true one are $0.34\pm 25.83$ and $0.34\pm 25.79$ km/s, respectively. In addition, each histogram shows that the reconstructed probability distribution is similar to the true one. These findings suggest that the reconstructed velocity/momentum divergence fields are consistent with the true ones, with an accuracy exceeding 1% relative to the statistical uncertainty.

3.2 Statistical analysis on reconstructed fields

Furthermore, let us introduce the metrics that will be employed for evaluating the reconstruction accuracy. For an arbitrary reconstructed field of halos from a UNet, denoted by the shorthand notation $f$ , where $f\in\{\bm{v},\theta_{v}\}$ for velocity and its divergence, and $f\in\{\bm{m},\theta_{m}\}$ for momentum and its divergence, the following metrics are employed to describe the relative deviation and correlation coefficient, respectively, in order to compare the reconstructed field with the true one.

The relative deviation, $R$ , and the correlation coefficient, $C_{r}$ , are defined as follows:

$R=\frac{\mathcal{O}_{f}}{\mathcal{O}_{f^{\prime}}}-1\,,\quad C_{r}=\frac{1}{N_{\rm pix}-1}\sum_{i}\frac{(f_{i}-\bar{f})(f^{\prime}_{i}-\bar{f^{\prime}})}{\sigma_{f}\sigma_{f^{\prime}}}\,,$ (10)

where $\mathcal{O}_{f}$ represents an arbitrary observable for $f$ . $C_{r}$ is defined between the reconstructed field $f$ and the true field $f^{\prime}$ , both of which have the same total number of pixels, $N_{\rm pix}$ . The sample mean and standard deviation of field $f$ are denoted by $\bar{f}$ and $\sigma_{f}$ , respectively. It is evident that both metrics provide a physical insight for comparison, such that the ideal reconstruction is equivalent to $R=0$ and to $C_{r}=1$ .
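A minimal NumPy sketch of the two metrics in Eq. 10 is given below. The function names are illustrative, and the sample standard deviation (with the $N_{\rm pix}-1$ normalization) is assumed so that a field compared with itself gives exactly $C_{r}=1$.

```python
import numpy as np


def correlation_coefficient(field, field_true):
    """C_r of Eq. 10: pixel-wise correlation between two fields."""
    f = np.ravel(field).astype(float)
    g = np.ravel(field_true).astype(float)
    n_pix = f.size
    cov = np.sum((f - f.mean()) * (g - g.mean())) / (n_pix - 1)
    return cov / (f.std(ddof=1) * g.std(ddof=1))


def relative_deviation(observable, observable_true):
    """R of Eq. 10: fractional deviation of an observable from the truth."""
    return np.asarray(observable, dtype=float) / np.asarray(observable_true, dtype=float) - 1.0
```

Here `observable` can be any summary statistic of the field, for example a binned power spectrum, in which case $R$ is returned bin by bin.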

The correlation coefficient $C_{r}$ and relative deviation $R$ for various velocity components are calculated using Eq. 10. The calculations are based on averaging over four test sets, each with a box size of 300 ${\rm Mpc}/h$ on each side. The results are summarized in Tab. LABEL:tab:rc. In all cases, the $C_{r}$ values for all fields, except the momentum divergence field, are at the level of 0.9. Additionally, the changes induced by the mass-weighting scheme on $C_{r}$ are not significant. The velocity reconstruction appears to perform slightly better than the momentum reconstruction. Moreover, the $R$ values approach zero, indicating that the reconstructed field has minimal bias compared to the true one. When comparing the “with $M_{\rm halo}$ weighting” scheme to the “without $M_{\rm halo}$ weighting” scheme, we find that even without complete knowledge of the halo mass, we can still achieve accurate reconstruction of the field.

To provide a more comprehensive visual representation of the reconstruction performance, Fig. 6 depicts the joint probability distributions of the reconstructed fields and the true fields for the magnitudes of velocity and momentum. This illustration compares the results obtained with the two mass weighting schemes. For each of the fields, the distributions were calculated using a large box with a side length of $600~{}{\rm Mpc}/h$ . The predicted distributions for each field exhibit a strong correlation with the true ones, falling very close to the diagonal. These findings suggest that our neural network performs an effective reconstruction. Furthermore, when employing the “without $M_{\rm halo}$ weighting” scheme, even without knowledge of the specific halo mass values, our UNet model is still capable of accurately reconstructing the fields, closely matching the true values. This indicates that the network has learned the mapping from the sparse halo number density field in redshift space to the velocity/momentum fields in real space. Similar findings are evident for the divergence fields, as demonstrated in Fig. 7.
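A joint probability distribution of this kind can be estimated with a two-dimensional histogram of the pixel values. The sketch below assumes a shared, uniform binning for both axes and uses random toy fields; the function name `joint_pdf` and the number of bins are illustrative choices, not those used to produce the figures.

```python
import numpy as np

def joint_pdf(f_true, f_rec, bins=50):
    """Normalized 2D histogram of (true, reconstructed) pixel values.

    Both axes share the same uniform bin edges, so a perfect reconstruction
    places all probability on the diagonal of the returned array.
    """
    x = np.ravel(f_true).astype(float)
    y = np.ravel(f_rec).astype(float)
    lo = min(x.min(), y.min())
    hi = max(x.max(), y.max())
    edges = np.linspace(lo, hi, bins + 1)
    pdf, _, _ = np.histogram2d(x, y, bins=[edges, edges], density=True)
    return pdf, edges

# Toy fields (hypothetical): truth plus small Gaussian scatter
rng = np.random.default_rng(1)
truth = rng.normal(size=(16, 16, 16))
recon = truth + 0.1 * rng.normal(size=truth.shape)

pdf, edges = joint_pdf(truth, recon, bins=20)
bin_width = edges[1] - edges[0]
```

With `density=True` the histogram integrates to unity over the binned area, so contours drawn from `pdf` can be compared directly between the two weighting schemes.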

To further test the difference between the reconstructed velocity/momentum fields and the true ones, the joint probability distributions of density-divergence and density-velocity (momentum) are shown in Fig. 8 for the “with $M_{\rm halo}$ weighting” scheme and in Fig. 9 for the “without $M_{\rm halo}$ weighting” scheme. The predicted results are in strong agreement with the ground truth across a wide range of $\delta_{\rm DM}$ values, even in high-density regions where $\delta_{\rm DM}\gtrsim 4$ . These high-density areas are sparse and highly non-linear, yet the model still performs effectively.

3.3 Comparison for DM density power spectra

As illustrated in Fig. 10, the resulting DM density power spectra in real space, calculated using Eq. 8 for various cases, are summarized. Two mass weighting schemes of “with $M_{\rm halo}$ weighting” and “without $M_{\rm halo}$ weighting” are presented in the left and right panels, respectively, for comparison. The auto power spectra for the predicted density (blue) and the true density (black) are shown, along with the auto power spectrum of sparse DM halo (red), providing a detailed comparison.

To accurately estimate the statistical uncertainty $\Delta P(k)$ , the number of independent Fourier modes is taken into account, which leads to a rescaling of the standard deviation $\delta P(k)$ (estimated over the 25 test boxes) via

\Delta P(k)=\delta P(k)\sqrt{\frac{V_{\rm all}}{V_{\rm overlap}}}\,.

(11)

Here, $V_{\rm all}$ represents the total independent volume, while $V_{\rm overlap}$ denotes the overlap volume for the calculated power spectrum. Since the box has a physical size of $1200\times 1200\times 600~{}({\rm Mpc}/h)^{3}$ and is divided into 25 test boxes with a size of 600 ${\rm Mpc}/h$ on each side, the value of $\sqrt{V_{\rm all}/V_{\rm overlap}}$ is 0.4.
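The quoted factor of 0.4 can be checked with a short calculation. The sketch below assumes that $V_{\rm overlap}$ is the summed volume of the 25 overlapping test boxes, which is the reading that reproduces the stated value; the variable names are illustrative.

```python
import math

# Total independent volume of the parent box, in (Mpc/h)^3
v_all = 1200.0 * 1200.0 * 600.0

# Assumption: V_overlap is the summed volume of the 25 overlapping test boxes
n_boxes = 25
box_side = 600.0
v_overlap = n_boxes * box_side**3

# Rescaling factor applied to the box-to-box standard deviation in Eq. 11
factor = math.sqrt(v_all / v_overlap)
```

Under this assumption $V_{\rm all}/V_{\rm overlap}=0.16$, so the box-to-box scatter $\delta P(k)$ is multiplied by 0.4 to obtain $\Delta P(k)$.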

The relative deviation $R$ , as defined in Eq. 10, is illustrated in the bottom panels. For the “without $M_{\rm halo}$ weighting” case, the discrepancies between the measured auto spectra and the true auto spectrum are small, essentially invisible in the left panel when $k<0.3~{}h/{\rm Mpc}$ . The discrepancy across this scale range is $|R|<0.13$ , indicating a good recovery of the amplitude information of the field, consistent with the findings in (Wang et al., 2024).

In comparison to the left panels, the reconstructed power spectra for “with $M_{\rm halo}$ weighting” exhibit relatively larger deviations. The maximum deviation is $R=0.15$ for the auto-correlation. Comparing these results shows that the absence of halo mass information does not significantly reduce the reconstruction accuracy.

3.4 Comparison for velocity/momentum power spectra

Furthermore, we now examine the power spectrum reconstruction for velocity, momentum, and their divergences. In Fig. 11, the magnitudes of the momentum and velocity power spectra are presented. The auto correlation power spectra between the predicted and true fields based on the two weighting schemes are also shown for comparison. The estimated curves and shaded regions are based on the mean and standard deviation of the derived power spectra from a total of 25 boxes in the test dataset, each with a side length of $600~{}{\rm Mpc}/h$ .

The auto-correlation power spectra of the velocity and momentum can be reconstructed with a relative deviation of $|R(k)|<0.07$ for $k\in[0.05,0.2]~{}h{\rm Mpc}^{-1}$ . In contrast, on the larger scales of $k\in[0.03,0.05]~{}h{\rm Mpc}^{-1}$ , the velocity and momentum spectra have $|R(k)|\in[0,0.13]$ , indicating an overestimate. In general, the reconstruction performance for “with $M_{\rm halo}$ weighting” across all scales appears better than that for “without $M_{\rm halo}$ weighting”. Nevertheless, even without complete knowledge of the halo mass, we can still accurately reconstruct the auto-correlation of the DM velocity field and momentum field using UNet.

Moreover, since the momentum field is a density-weighted velocity, it is expected to be more sensitive to the reconstruction accuracy of the DM density field. However, the reconstruction accuracy for the momentum is comparable to that of the velocity, thanks to the good reconstruction of the DM density, as illustrated in Fig. 10. It is clear that as the reconstruction uncertainty from the DM density field increases, the statistical error in the momentum and its divergence power spectra will also increase.

Moreover, as shown in Fig. 12, the auto power spectra of the velocity and momentum divergence yield $R(k)<0.06$ for scales of $k$ from $0.05$ to 0.1 $h/{\rm Mpc}$ . At $k>0.05$ $h/{\rm Mpc}$ , the auto-correlation of velocity divergence agrees well with the true power spectrum within $2\sigma$ , while the auto-correlation of momentum divergence slightly overestimates the true value. This discrepancy may arise from the inaccurate reconstruction of the DM density field, likely due to the limited number of large-mass halos in the current training samples.

In addition, a comparison of the results obtained by the two different weighting schemes reveals that there is not a significant difference between them. This suggests that precise mass information is not necessary for the velocity/momentum reconstruction. Overall, our UNet model demonstrates satisfactory performance on linear and nonlinear scales of $k\in[0.03,0.3]~{}h/{\rm Mpc}$ . These results highlight the ability of the UNet model to learn various velocity quantities from the halo number density field on the nonlinear scales, which is an important finding in this study. In light of the predicted power spectra, it can be reasonably concluded that the UNet model’s predictions are in relatively good agreement with the truth.

3.5 RSD corrections and power spectrum multipoles

To further validate the effectiveness of the UNet model, we compare the reconstructed measurements with the simulation truth in real space. As the input observables in the neural network are in redshift space, this comparison can be used to assess the accuracy of the peculiar-velocity corrections. The following analysis was conducted based on a total of 18 simulation boxes, each with a side length of $600~{}{\rm Mpc}/h$ .

In Fig. 13, the two-dimensional anisotropic 2PCFs, $\xi(r_{\perp},r_{\|})$ , of the predicted density field and the simulation truth in real space are compared. As illustrated by the red dotted contours, the 2PCF measured in redshift space without any RSD corrections demonstrates the significant impact of RSD effects. The Kaiser effect results in the “squashed” structure along the LoS, while the random velocities on the non-linear small scales produce the so-called “fingers-of-God” (FoG) effect, which causes structures to be elongated along the LoS. The velocity field reconstructed by UNet was utilized to correct for the peculiar velocities of the halos. The corrected 2PCF exhibits a high degree of agreement with the real-space 2PCF across all scales. Furthermore, the 2PCF corrections for both Kaiser and FoG effects are achieved with a high degree of accuracy. The successful corrections for the FoG effect indicate that the UNet is capable of accurately predicting the DM field in real space.
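As a rough illustration of how a velocity estimate can be used for such an RSD correction, the plane-parallel mapping $s=r+v_{\|}/(aH)$ can be inverted once a LoS velocity is available for each halo. The sketch below is not the pipeline used in this work: the function name `undo_rsd`, the choice of the $z$ axis as the LoS, the periodic box, the toy catalogue, and the value `a_hubble = 100` km/s per ${\rm Mpc}/h$ (appropriate only at $z=0$) are all assumptions made for illustration.

```python
import numpy as np

def undo_rsd(pos_s, v_los, a_hubble, box_size, los_axis=2):
    """Map redshift-space positions back to real space (plane-parallel limit).

    pos_s    : (N, 3) redshift-space positions in Mpc/h
    v_los    : (N,) line-of-sight peculiar velocities in km/s
    a_hubble : a * H in km/s per (Mpc/h); 100 at z = 0
    box_size : side length of the periodic box in Mpc/h
    """
    pos_r = np.array(pos_s, dtype=float, copy=True)
    pos_r[:, los_axis] -= np.asarray(v_los, dtype=float) / a_hubble
    pos_r[:, los_axis] %= box_size  # periodic wrapping
    return pos_r

# Toy catalogue (hypothetical): random positions and Gaussian LoS velocities
rng = np.random.default_rng(2)
box, a_hubble = 600.0, 100.0
pos = rng.uniform(0.0, box, size=(1000, 3))
v_los = rng.normal(scale=300.0, size=1000)

# Forward RSD mapping, followed by the correction with the same velocities
pos_s = pos.copy()
pos_s[:, 2] = (pos_s[:, 2] + v_los / a_hubble) % box
pos_back = undo_rsd(pos_s, v_los, a_hubble, box)
```

In practice the velocities would come from the reconstructed field evaluated at the halo positions, so the quality of the correction is limited by the accuracy of that reconstruction rather than by the mapping itself.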

In order to accurately assess the statistical properties of an arbitrary real-space field, it is necessary to consider the higher-order power spectrum multipoles. In general, the power spectrum in redshift space $P(k,\mu)$ , where $\mu$ is the cosine of the angle between the wavevector $\bm{k}$ and the LoS direction, can be expressed by expanding its multipole components:

P_{\ell}(k)=\frac{2\ell+1}{2}\int_{-1}^{1}d\mu P(k,\mu)\mathcal{L}_{\ell}(\mu)\,,

(12)

where $\mathcal{L}_{\ell}$ is the Legendre polynomial of order $\ell$ . In this study, we focus on the quadrupole ( $\ell=2$ ) and the hexadecapole ( $\ell=4$ ), denoted as $P_{2}$ and $P_{4}$ , respectively.
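The multipole integral of Eq. 12 can be evaluated numerically with Gauss-Legendre quadrature. The sketch below is an illustration only: the function name `multipole`, the toy power-law spectrum, and the linear Kaiser form $P(k,\mu)=(1+\beta\mu^{2})^{2}P_{\rm r}(k)$ with $\beta=0.5$ are assumptions chosen because the Kaiser multipoles are known analytically, not quantities measured in this work.

```python
import numpy as np
from numpy.polynomial.legendre import leggauss, legval

def multipole(p_k_mu, ell, n_mu=64):
    """P_ell(k) = (2 ell + 1)/2 * int_{-1}^{1} dmu P(k, mu) L_ell(mu).

    p_k_mu : callable taking a scalar mu and returning P(k, mu) on a fixed k grid
    """
    mu, weights = leggauss(n_mu)          # quadrature nodes and weights on [-1, 1]
    coeffs = np.zeros(ell + 1)
    coeffs[ell] = 1.0                     # selects the Legendre polynomial L_ell
    leg = legval(mu, coeffs)
    p = np.array([p_k_mu(m) for m in mu])  # shape (n_mu, n_k)
    return 0.5 * (2 * ell + 1) * np.sum(weights[:, None] * leg[:, None] * p, axis=0)

# Toy real-space spectrum and Kaiser-like redshift-space spectrum (hypothetical)
k = np.linspace(0.03, 0.3, 10)
p_real = 1.0e4 * (k / 0.05) ** -1.5
beta = 0.5

def kaiser(m):
    return (1.0 + beta * m**2) ** 2 * p_real

p2 = multipole(kaiser, 2)
p4 = multipole(kaiser, 4)
p2_real = multipole(lambda m: p_real, 2)  # isotropic spectrum: quadrupole vanishes
```

For an isotropic real-space spectrum all multipoles with $\ell>0$ vanish, which is why a small reconstructed $P_{2}$ and $P_{4}$ signal a successful RSD correction.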

In Fig. 14, we compare the power spectrum multipoles of the UNet-reconstructed DM density field with the true values. The top panels depict the measured $P_{2}$ in redshift space (red), the UNet-reconstructed DM field (blue) in real space, and the real-space true value (black). Two mass weighting schemes were employed for the purpose of demonstrating the reconstruction performance: “without $M_{\rm halo}$ weighting” (left) and “with $M_{\rm halo}$ weighting” (right). The shaded regions represent the $1\sigma$ confidence interval, estimated based on the test dataset.

As demonstrated, the RSD effects can markedly contribute to anisotropic clustering, resulting in an amplitude of $P_{2}(k)$ that is approximately one to two orders of magnitude greater than that observed in real space across all $k$ values of interest. The quadrupole of the UNet-predicted DM density field in real space shows significantly suppressed anisotropic clustering, yielding a reconstructed result that is in good agreement with the true one at the $2\sigma$ level when $k\lesssim 0.4~{}h/{\rm Mpc}$ . Furthermore, the “with $M_{\rm halo}$ weighting” scheme yields a slightly more accurate reconstruction. The discrepancy between the reconstructed and true values averaged over the $k$ range is $174.29\pm 702.27$ for the “with $M_{\rm halo}$ weighting” scheme and $175.05\pm 711.88$ for the “without $M_{\rm halo}$ weighting” scheme. This indicates that we can reconstruct the quadrupole of the DM density field without complete knowledge of halo mass information.

The lower panels of Fig. 14 depict a similar comparison of a higher-order multipole, the hexadecapole, $P_{4}$ . As a consequence of its higher order, the statistical uncertainty for halos in redshift space appears to exceed that of $P_{2}$ . Nevertheless, the UNet is still capable of successfully reconstructing $P_{4}$ of the DM density field in real space, with the associated statistical uncertainty being approximately comparable to that of $P_{2}$ . The discrepancy between the reconstructed and true values falls within the $2\sigma$ level. Furthermore, the use of a weighting scheme based on halo mass can result in a slight improvement in reconstruction accuracy, with the average deviation of $-60.03\pm 543.83$ reduced by $27\%$ compared to the “without $M_{\rm halo}$ weighting” case.

The proposed UNet is capable of automatically applying RSD corrections. Furthermore, the reconstructed multipoles of both the quadrupole and hexadecapole have been found to agree with the true values at the $2\sigma$ level, within the range of $k\in[0.06,0.3]~{}h/{\rm Mpc}$ . Moreover, the “with $M_{\rm halo}$ weighting” scheme has been observed to yield higher reconstruction accuracy and smaller statistical uncertainty.

In conclusion, while the RSD effects can significantly contribute to anisotropic clustering, distorting $P_{2}(k)$ and $P_{4}(k)$ , our proposed UNet effectively corrects for these RSD effects and yields accurate predictions for the real-space statistical measurements of density. The reconstructed results align well with the true values at the $2\sigma$ level for $k\in[0.03,0.4]~{}h/{\rm Mpc}$ . Furthermore, the “with $M_{\rm halo}$ weighting” scheme provides additional information, resulting in a slightly more accurate reconstruction.

4 Conclusions

The construction of three-dimensional velocity (and momentum) fields from galaxies and clusters is of great significance in cosmology. These fields provide a wealth of information that is not available from the density field alone, and they can be used to improve and correct various cosmological measurements. High-fidelity reconstruction may even lead to unexpected discoveries. However, the reconstruction of the cosmic volume-weighted velocity is susceptible to significant sampling artifacts, which presents a challenge for traditional reconstruction methods. Furthermore, traditional reconstruction methods frequently rely on numerous assumptions and approximations.

The objective of this study is to propose a deep learning approach based on the UNet neural network for the reconstruction of three-dimensional velocity/momentum fields in real space. This approach may provide a potential solution to the long-standing problem. This study has demonstrated the effectiveness of UNet in reconstructing fields directly from sparse samples of DM halo (and subhalo) spatial distribution in redshift space. The UNet architecture, with its sophisticated design, is capable of capturing diverse field characteristics and transforming high-dimensional, structured inputs, making it a valuable tool for such reconstructions.

The UNet was trained to transform the sparse halo density fields into velocity/momentum fields. To include the halo mass information and assess the sensitivity of the reconstruction performance to this knowledge, we adopted two weighting schemes: one that includes halo mass and one that does not. A comprehensive validation was conducted through a series of statistical tests, and the reconstructed velocity/momentum fields were found to exhibit a high degree of agreement with the ground truth. Moreover, the UNet-inferred DM velocity fields in real space essentially provide effective RSD corrections. Compared to the true values, we have demonstrated that the proposed UNet can reconstruct real-space velocity and momentum fields with a relative deviation of about $R<0.13$ in highly non-linear regimes at $k<0.3~{}h/{\rm Mpc}$ .

It is also worth noting that the reconstruction of our UNet model is effective even in the absence of precise DM halo mass information. A comparison of the pixel-by-pixel values and power spectra reveals that the UNet model is still capable of accurately reconstructing not only the DM density field and relevant power spectrum, but also the velocity/momentum fields and the corresponding power spectra. The reconstruction accuracy is not significantly different from those obtained when the accurate DM halo mass value is used. The findings indicate that the model is capable of performing reconstruction in a manner that is more consistent with real observations, which is a highly encouraging indication for the reconstruction of velocity/momentum fields using current and future real data.

The Stage IV galaxy surveys will yield more detailed measurements of LSS of the Universe than ever before. In consequence, novel computing technologies are required to analyze these high-dimensional, massive data sets. It is therefore anticipated that UNet-based neural networks will prove an invaluable tool in addressing the challenges inherent in traditional methodologies, thereby facilitating the extraction of cosmological information in a more profound and comprehensive manner.

Acknowledgments

This work is supported by National SKA Program of China (2020SKA0110401, 2020SKA0110402, 2020SKA0110100), the National Key R&D Program of China (2020YFC2201600), the National Science Foundation of China (12203107, 12073088, 12373005), the China Manned Space Project with No. CMS-CSST-2021 (A02, A03, B01), the Guangdong Basic and Applied Basic Research Foundation (2024A1515012309), and the 111 project of the Ministry of Education No. B20019. We also wish to acknowledge the Beijing Super Cloud Center (BSCC) and Beijing Beilong Super Cloud Computing Co., Ltd (http://www.blsc.cn/) for providing HPC resources that have significantly contributed to the research results presented in this paper.

References

Akiba et al. (2019) Akiba, T., Sano, S., Yanase, T., Ohta, T., & Koyama, M. 2019, arXiv e-prints, arXiv:1907.10902, doi: 10.48550/arXiv.1907.10902
Berger & Stein (2019) Berger, P., & Stein, G. 2019, Mon. Not. Roy. Astron. Soc., 482, 2861, doi: 10.1093/mnras/sty2949
Bergstra et al. (2011) Bergstra, J., Bardenet, R., Bengio, Y., & Kégl, B. 2011, in Advances in Neural Information Processing Systems, ed. J. Shawe-Taylor, R. Zemel, P. Bartlett, F. Pereira, & K. Weinberger, Vol. 24 (Curran Associates, Inc.). https://proceedings.neurips.cc/paper_files/paper/2011/file/86e8f7ab32cfd12577bc2619bc635690-Paper.pdf
Bernardeau & van de Weygaert (1996) Bernardeau, F., & van de Weygaert, R. 1996, Mon. Not. Roy. Astron. Soc., 279, 693, doi: 10.1093/mnras/279.2.693
Bernardeau et al. (1997) Bernardeau, F., van de Weygaert, R., Hivon, E., & Bouchet, F. R. 1997, Mon. Not. Roy. Astron. Soc., 290, 566, doi: 10.1093/mnras/290.3.566
Caldeira et al. (2018) Caldeira, J., Wu, W. L. K., Nord, B., et al. 2018, doi: 10.1016/j.ascom.2019.100307
Carleo et al. (2019) Carleo, G., Cirac, I., Cranmer, K., et al. 2019. https://arxiv.org/abs/1903.10563
Chardin et al. (2019) Chardin, J., Uhlrich, G., Aubert, D., et al. 2019. https://arxiv.org/abs/1905.06958
Chen et al. (2018) Chen, J., Zhang, P., Zheng, Y., Yu, Y., & Jing, Y. 2018, Astrophys. J., 861, 58, doi: 10.3847/1538-4357/aaca2f
Clifton et al. (2012) Clifton, T., Ferreira, P. G., Padilla, A., & Skordis, C. 2012, Phys. Rept., 513, 1, doi: 10.1016/j.physrep.2012.01.001
Dreissigacker et al. (2019) Dreissigacker, C., Sharma, R., Messenger, C., Zhao, R., & Prix, R. 2019, Phys. Rev., D100, 044009, doi: 10.1103/PhysRevD.100.044009
Fluri et al. (2019) Fluri, J., Kacprzak, T., Lucchi, A., et al. 2019. https://arxiv.org/abs/1906.03156
Ganeshaiah Veena et al. (2023) Ganeshaiah Veena, P., Lilow, R., & Nusser, A. 2023, MNRAS, 522, 5291, doi: 10.1093/mnras/stad1222
Gillet et al. (2019) Gillet, N., Mesinger, A., Greig, B., Liu, A., & Ucci, G. 2019, Mon. Not. Roy. Astron. Soc., 484, 282, doi: 10.1093/mnras/stz010
Gupta et al. (2018) Gupta, A., Matilla, J. M. Z., Hsu, D., & Haiman, Z. 2018, Phys. Rev., D97, 103515, doi: 10.1103/PhysRevD.97.103515
Guzzo et al. (2008) Guzzo, L., et al. 2008, Nature, 451, 541, doi: 10.1038/nature06555
Hassan et al. (2019a) Hassan, S., Andrianomena, S., & Doughty, C. 2019a. https://arxiv.org/abs/1907.07787
Hassan et al. (2019b) Hassan, S., Liu, A., Kohn, S., & La Plante, P. 2019b, Mon. Not. Roy. Astron. Soc., 483, 2524, doi: 10.1093/mnras/sty3282
He et al. (2019) He, S., Li, Y., Feng, Y., et al. 2019, Proc. Nat. Acad. Sci., 116, 13825, doi: 10.1073/pnas.1821458116
Hinshaw et al. (2013) Hinshaw, G., Larson, D., Komatsu, E., et al. 2013, The Astrophysical Journal Supplement Series, 208, 19, doi: 10.1088/0067-0049/208/2/19
Hong et al. (2021) Hong, S. E., Jeong, D., Hwang, H. S., & Kim, J. 2021, ApJ, 913, 76, doi: 10.3847/1538-4357/abf040
Ishida et al. (2019) Ishida, E. E. O., et al. 2019, Mon. Not. Roy. Astron. Soc., 483, 2, doi: 10.1093/mnras/sty3015
Jain & Zhang (2008) Jain, B., & Zhang, P. 2008, Phys. Rev. D, 78, 063503, doi: 10.1103/PhysRevD.78.063503
Jeffrey et al. (2019) Jeffrey, N., Lanusse, F., Lahav, O., & Starck, J.-L. 2019. https://arxiv.org/abs/1908.00543
Jennings et al. (2015) Jennings, E., Baugh, C. M., & Hatt, D. 2015, Mon. Not. Roy. Astron. Soc., 446, 793, doi: 10.1093/mnras/stu2043
Jennings et al. (2019) Jennings, W. D., Watkinson, C. A., Abdalla, F. B., & McEwen, J. D. 2019, Mon. Not. Roy. Astron. Soc., 483, 2907, doi: 10.1093/mnras/sty3168
Jing (2018) Jing, Y. 2018, Science China Physics, Mechanics & Astronomy, 62, 19511, doi: 10.1007/s11433-018-9286-x
Joyce et al. (2015) Joyce, A., Jain, B., Khoury, J., & Trodden, M. 2015, Phys. Rept., 568, 1, doi: 10.1016/j.physrep.2014.12.002
Kaiser (1987) Kaiser, N. 1987, Mon. Not. Roy. Astron. Soc., 227, 1, doi: 10.1093/mnras/227.1.1
Kingma & Ba (2014) Kingma, D. P., & Ba, J. 2014, arXiv e-prints, arXiv:1412.6980. https://arxiv.org/abs/1412.6980
Koda et al. (2016) Koda, J., Blake, C., Beutler, F., Kazin, E., & Marin, F. 2016, Mon. Not. Roy. Astron. Soc., 459, 2118, doi: 10.1093/mnras/stw763
Komatsu et al. (2011) Komatsu, E., Smith, K. M., Dunkley, J., et al. 2011, The Astrophysical Journal Supplement Series, 192, 18, doi: 10.1088/0067-0049/192/2/18
Koyama (2016) Koyama, K. 2016, Rept. Prog. Phys., 79, 046902, doi: 10.1088/0034-4885/79/4/046902
La Plante & Ntampaka (2018) La Plante, P., & Ntampaka, M. 2018, Astrophys. J., 810, 110, doi: 10.3847/1538-4357/ab2983
Laureijs et al. (2011) Laureijs, R., Amiaux, J., Arduini, S., et al. 2011, arXiv preprint arXiv:1110.3193
Lazanu (2021) Lazanu, A. 2021, J. Cosmology Astropart. Phys, 2021, 039, doi: 10.1088/1475-7516/2021/09/039
Levi et al. (2013) Levi, M., Bebek, C., Beers, T., et al. 2013, arXiv preprint arXiv:1308.0847
Li et al. (2011) Li, M., Li, X.-D., Wang, S., & Wang, Y. 2011, Commun. Theor. Phys., 56, 525, doi: 10.1088/0253-6102/56/3/24
Li et al. (2019) Li, S.-Y., Li, Y.-L., & Zhang, T.-J. 2019. https://arxiv.org/abs/1907.00568
Li et al. (2021) Li, Y., Ni, Y., Croft, R. A. C., et al. 2021, Proceedings of the National Academy of Sciences, 118, doi: 10.1073/pnas.2022038118
Linder (2005) Linder, E. V. 2005, Phys. Rev. D, 72, 043529, doi: 10.1103/PhysRevD.72.043529
Linder & Jenkins (2003) Linder, E. V., & Jenkins, A. 2003, Mon. Not. Roy. Astron. Soc., 346, 573, doi: 10.1046/j.1365-2966.2003.07112.x
Lochner et al. (2016) Lochner, M., McEwen, J. D., Peiris, H. V., Lahav, O., & Winter, M. K. 2016, Astrophys. J. Suppl., 225, 31, doi: 10.3847/0067-0049/225/2/31
LSST Science Collaboration et al. (2009) LSST Science Collaboration, Abell, P. A., Allison, J., et al. 2009, arXiv e-prints, arXiv:0912.0201. https://arxiv.org/abs/0912.0201
Lucie-Smith et al. (2019) Lucie-Smith, L., Peiris, H. V., & Pontzen, A. 2019. https://arxiv.org/abs/1906.06339
Lucie-Smith et al. (2018) Lucie-Smith, L., Peiris, H. V., Pontzen, A., & Lochner, M. 2018, Mon. Not. Roy. Astron. Soc., 479, 3405, doi: 10.1093/mnras/sty1719
Makinen et al. (2021) Makinen, T. L., Lancaster, L., Villaescusa-Navarro, F., et al. 2021, JCAP, 04, 081, doi: 10.1088/1475-7516/2021/04/081
Mao et al. (2020) Mao, T.-X., Wang, J., Li, B., et al. 2020, arXiv e-prints, arXiv:2002.10218. https://arxiv.org/abs/2002.10218
Mehta et al. (2019) Mehta, P., Bukov, M., Wang, C.-H., et al. 2019, Phys. Rept., 810, 1, doi: 10.1016/j.physrep.2019.03.001
Merten et al. (2019) Merten, J., Giocoli, C., Baldi, M., et al. 2019, Mon. Not. Roy. Astron. Soc., 487, 104, doi: 10.1093/mnras/stz972
Mishra et al. (2019) Mishra, A., Reddy, P., & Nigam, R. 2019. https://arxiv.org/abs/1908.04682
Modi et al. (2018) Modi, C., Feng, Y., & Seljak, U. 2018, JCAP, 1810, 028, doi: 10.1088/1475-7516/2018/10/028
Moss (2018) Moss, A. 2018. https://arxiv.org/abs/1810.06441
Muthukrishna et al. (2019) Muthukrishna, D., Parkinson, D., & Tucker, B. 2019. https://arxiv.org/abs/1903.02557
Münchmeyer & Smith (2019) Münchmeyer, M., & Smith, K. M. 2019. https://arxiv.org/abs/1905.05846
Ni et al. (2021) Ni, Y., Li, Y., Lachance, P., et al. 2021, MNRAS, 507, 1021, doi: 10.1093/mnras/stab2113
Ntampaka et al. (2019) Ntampaka, M., et al. 2019. https://arxiv.org/abs/1902.10159
Okumura et al. (2012) Okumura, T., Seljak, U., & Desjacques, V. 2012, J. Cosmology Astropart. Phys, 2012, 014, doi: 10.1088/1475-7516/2012/11/014
Pan et al. (2020) Pan, S., Liu, M., Forero-Romero, J., et al. 2020, Science China Physics, Mechanics, and Astronomy, 63, 110412, doi: 10.1007/s11433-020-1586-3
Peacock et al. (2001) Peacock, J. A., et al. 2001, Nature, 410, 169, doi: 10.1038/35065528
Peebles (1980) Peebles, P. J. E. 1980, The large-scale structure of the universe
Peel et al. (2019) Peel, A., Lalande, F., Starck, J.-L., et al. 2019, Phys. Rev., D100, 023508, doi: 10.1103/PhysRevD.100.023508
Perraudin et al. (2019) Perraudin, N., Defferrard, M., Kacprzak, T., & Sgier, R. 2019, Astron. Comput., 27, 130, doi: 10.1016/j.ascom.2019.03.004
Pfeffer et al. (2019) Pfeffer, D. N., Breysse, P. C., & Stein, G. 2019. https://arxiv.org/abs/1905.10376
Ramanah et al. (2019) Ramanah, D. K., Charnock, T., & Lavaux, G. 2019, Phys. Rev., D100, 043515, doi: 10.1103/PhysRevD.100.043515
Ravanbakhsh et al. (2017) Ravanbakhsh, S., Oliva, J., Fromenteau, S., et al. 2017. https://arxiv.org/abs/1711.02033
Reid et al. (2012) Reid, B. A., et al. 2012, Mon. Not. Roy. Astron. Soc., 426, 2719, doi: 10.1111/j.1365-2966.2012.21779.x
Reyes et al. (2010) Reyes, R., Mandelbaum, R., Seljak, U., et al. 2010, Nature, 464, 256, doi: 10.1038/nature08857
Rodriguez et al. (2018) Rodriguez, A. C., Kacprzak, T., Lucchi, A., et al. 2018, Comput. Astrophys. Cosmol., 5, 4, doi: 10.1186/s40668-018-0026-4
Schaap & van de Weygaert (2000) Schaap, W. E., & van de Weygaert, R. 2000, Astron. Astrophys., 363, L29. https://arxiv.org/abs/astro-ph/0011007
Schmelzle et al. (2017) Schmelzle, J., Lucchi, A., Kacprzak, T., et al. 2017. https://arxiv.org/abs/1707.05167
Scoccimarro (2004) Scoccimarro, R. 2004, Phys. Rev. D, 70, 083007, doi: 10.1103/PhysRevD.70.083007
Seljak & McDonald (2011) Seljak, U., & McDonald, P. 2011, J. Cosmology Astropart. Phys, 2011, 039, doi: 10.1088/1475-7516/2011/11/039
Shallue & Eisenstein (2023) Shallue, C. J., & Eisenstein, D. J. 2023, MNRAS, 520, 6256, doi: 10.1093/mnras/stad528
Spergel et al. (2015) Spergel, D., Gehrels, N., Baltay, C., et al. 2015, arXiv e-prints, arXiv:1503.03757. https://arxiv.org/abs/1503.03757
Springer et al. (2018) Springer, O. M., Ofek, E. O., Weiss, Y., & Merten, J. 2018. https://arxiv.org/abs/1808.07491
Taruya et al. (2010) Taruya, A., Nishimichi, T., & Saito, S. 2010, Phys. Rev. D, 82, 063522, doi: 10.1103/PhysRevD.82.063522
Tewes et al. (2019) Tewes, M., Kuntzer, T., Nakajima, R., et al. 2019, Astron. Astrophys., 621, A36, doi: 10.1051/0004-6361/201833775
Tojeiro et al. (2012) Tojeiro, R., et al. 2012, Mon. Not. Roy. Astron. Soc., 424, 2339, doi: 10.1111/j.1365-2966.2012.21404.x
Tröster et al. (2019) Tröster, T., Ferguson, C., Harnois-Déraps, J., & McCarthy, I. G. 2019, Mon. Not. Roy. Astron. Soc., 487, L24, doi: 10.1093/mnrasl/slz075
Wang (2008) Wang, Y. 2008, JCAP, 05, 021, doi: 10.1088/1475-7516/2008/05/021
Wang et al. (2024) Wang, Z., Shi, F., Yang, X., et al. 2024, Sci. China Phys. Mech. Astron., 67, 219513, doi: 10.1007/s11433-023-2192-9
Weinberg et al. (2013) Weinberg, D. H., Mortonson, M. J., Eisenstein, D. J., et al. 2013, Phys. Rept., 530, 87, doi: 10.1016/j.physrep.2013.05.001
White et al. (2015) White, M., Reid, B., Chuang, C.-H., et al. 2015, Mon. Not. Roy. Astron. Soc., 447, 234, doi: 10.1093/mnras/stu2460
Wu et al. (2021) Wu, Z., Zhang, Z., Pan, S., et al. 2021, Astrophys. J., 913, 2, doi: 10.3847/1538-4357/abf3bb
Wu et al. (2023) Wu, Z., Xiao, L., Xiao, X., et al. 2023, MNRAS, 522, 4748, doi: 10.1093/mnras/stad1290
Yu et al. (2015) Yu, Y., Zhang, J., Jing, Y., & Zhang, P. 2015, Phys. Rev. D, 92, 083527, doi: 10.1103/PhysRevD.92.083527
Yu et al. (2017) —. 2017, Phys. Rev. D, 95, 043536, doi: 10.1103/PhysRevD.95.043536
Zhan (2011) Zhan, H. 2011, Scientia Sinica Physica, Mechanica & Astronomica, 41, 1441, doi: 10.1360/132011-961
Zhang & Johnson (2015) Zhang, P., & Johnson, M. C. 2015, JCAP, 06, 046, doi: 10.1088/1475-7516/2015/06/046
Zhang et al. (2007) Zhang, P., Liguori, M., Bean, R., & Dodelson, S. 2007, Phys. Rev. Lett., 99, 141302, doi: 10.1103/PhysRevLett.99.141302
Zhang et al. (2013) Zhang, P., Pan, J., & Zheng, Y. 2013, Phys. Rev. D, 87, 063526, doi: 10.1103/PhysRevD.87.063526
Zhang & Stebbins (2011) Zhang, P., & Stebbins, A. 2011, Phys. Rev. Lett., 107, 041301, doi: 10.1103/PhysRevLett.107.041301
Zhang et al. (2015) Zhang, P., Zheng, Y., & Jing, Y. 2015, Phys. Rev. D, 91, 043522, doi: 10.1103/PhysRevD.91.043522
Zhang et al. (2019) Zhang, X., Wang, Y., Zhang, W., et al. 2019. https://arxiv.org/abs/1902.05965
Zheng et al. (2015a) Zheng, Y., Zhang, P., & Jing, Y. 2015a, Phys. Rev. D, 91, 123512, doi: 10.1103/PhysRevD.91.123512
Zheng et al. (2015b) —. 2015b, Phys. Rev. D, 91, 043523, doi: 10.1103/PhysRevD.91.043523
Zheng et al. (2013) Zheng, Y., Zhang, P., Jing, Y., Lin, W., & Pan, J. 2013, Phys. Rev. D, 88, 103510, doi: 10.1103/PhysRevD.88.103510