Impact of Large-Scale Anisotropies on Galaxy Clustering and Cosmological Constraints
Abstract
We critically assess the impact of significant dipole and large-scale anisotropies on galaxy clustering signals, with a focus on radio continuum surveys. Our study reveals that these anisotropies—resulting from intrinsic cosmological effects and/or observational systematics—profoundly influence the two-point correlation function (2PCF) and angular power spectrum (). Notably, large-scale anisotropies can obscure or simulate non-Gaussianity signals, complicating the extraction of precise cosmological information. The results emphasize that it is crucial to address systematics and rigorously mask the dipole and its surrounding multipoles to obtain accurate cosmological constraints. This approach is essential for extracting cosmological results from clustering signals, particularly for future surveys such as SKA, DESI, and LSST, to ensure the precision and reliability of cosmological analyses.
1 Introduction
Galaxy clustering is a key tool for probing cosmological parameters, offering critical insights into the matter distribution and evolution of the Universe (Peebles, 1980). By comparing observed clustering patterns with theoretical models, we impose constraints on fundamental parameters such as matter density, fluctuation amplitude, and structure growth. This process refines our understanding of the Universe’s composition and the mechanisms behind galaxy and large-scale structure formation. In the era of precision cosmology, where achieving exceptional accuracy in parameter measurements is crucial, we rely on advanced observational techniques and data analysis methods. Next-generation surveys, such as the Square Kilometre Array (SKA; Wilkinson 1991; Bacon et al. 2020), the Dark Energy Spectroscopic Instrument (DESI; DESI Collaboration et al. 2016, 2022), and the Large Synoptic Survey Telescope (LSST; Ivezić et al. 2019), are set to dramatically enhance our ability to map the Universe and explore phenomena like dark energy, modified gravity (Clifton et al., 2012), and primordial non-Gaussianities (Komatsu & Spergel, 2001). These advancements will critically evaluate the current cosmological model and explore alternative cosmological models.
However, these analyses face challenges due to large-scale anisotropies in galaxy distributions. Whether arising from intrinsic cosmological effects or observational systematics, these anisotropies can distort clustering signals and complicate accurate cosmological inference. A prime example is the observed dipole in radio and infrared galaxy counts, first identified in the NRAO VLA Sky Survey (NVSS) (Blake & Wall, 2002; Singal, 2011). This dipole has generated debate over its origin-whether it reflects our motion relative to the cosmic rest frame, an intrinsic large-scale structure feature, or uncorrected systematics (Gibelyou & Huterer, 2012; Rubart & Schwarz, 2013; Tiwari et al., 2015; Tiwari & Nusser, 2016; Colin et al., 2017; Dolfi et al., 2019; Tiwari et al., 2019; Secrest et al., 2022). The impact of such anisotropies is profound. The dipole and other large-scale modes can obscure the isotropic Gaussian assumption commonly used in standard analyses, complicating the detection of subtle effects such as primordial non-Gaussianity or scale-dependent biases from inflationary models (Matarrese et al., 2000; Dalal et al., 2008; Bruni et al., 2012).
Most leading cosmological clustering studies have thus far overlooked these effects, operating under the assumption of large-scale isotropy and homogeneity—commonly known as the Cosmological Principle, a fundamental basis for standard cosmological analyses. Yet recent observations, especially from radio and infrared surveys (Tiwari et al., 2015; Tiwari & Nusser, 2016; Colin et al., 2017; Secrest et al., 2021, 2022; Singal, 2024; von Hausegger, 2024), reveal a prominent dipole signal in galaxy distributions, making it crucial to consider this signal and underscoring the urgency of addressing this issue.
In this letter, we investigate the effect of large-scale anisotropies on galaxy clustering measurements, focusing on the two-point correlation function (2PCF) and the angular power spectrum (). Using NVSS data as a case study, we assess implications for upcoming surveys like SKA, DESI, and LSST. We highlight the necessity of aggressive masking of large-scale anisotropies and rigorous systematic treatment to mitigate the dipole and adjacent multipoles, which can otherwise introduce significant biases in cosmological parameter estimates. We demonstrate how anisotropies can mimic or obscure key cosmological signals, such as non-Gaussianities and scale-dependent biases. We need for robust data processing techniques and careful management of large-scale systematics to fully exploit the capabilities of next-generation surveys. We conclude that effectively addressing large-scale anisotropies is essential for achieving precise cosmological inferences.
2 Galaxies as Cosmological Tracers
A typical galaxy catalog used to explore cosmology consists of the two-dimensional or three-dimensional positions of galaxies with specific properties, mapped over a fraction of the sky and across a range of radial distances—referred to as redshifts in cosmology. These catalogs are carefully prepared to ensure completeness above a certain flux or luminosity threshold, resulting in what is known as a flux-limited catalog. Alternatively, one can select a subsample over a specific range of redshifts where the catalog is considered complete in terms of containing all galaxies within that range. These are known as volume-limited catalogs, which are somewhat challenging to achieve with our current observational capabilities.
Regardless of the galaxy sample used, covering a particular range of dark matter halo masses is crucial, as it ensures that the derived galaxy bias for the population in the catalog is well-defined. This is important because galaxies serve as biased tracers of the underlying matter distribution (Kaiser, 1984; Cole & Kaiser, 1989; Dekel & Rees, 1987; Peacock & Smith, 2000). To relate the background matter distribution—essentially the cosmological structure—using galaxy positions, a halo-galaxy biasing model is required (Press & Schechter, 1974; Sheth & Tormen, 1999). This model connects the observed distribution of galaxies to that of dark matter halos, which are linked to the underlying matter density field. By understanding and applying the appropriate biasing model, one can infer the dark matter distribution from the observed galaxy distribution, thereby gaining essential insights into the large-scale structure of the Universe and advancing our understanding of cosmology.
A real galaxy survey covers only a fraction of the sky. Combining different catalogs can extend the sky area covered (Colin et al., 2017; Tiwari & Aluri, 2019); however, the galactic plane remains inaccessible due to high dust extinction, stellar crowding, and galactic contamination. Additionally, there are other masked regions where data is either contaminated by bright extended sources or where the survey’s RMS noise is high. At best, large-scale clustering signals are typically studied with 70-90% sky coverage. Nonetheless, the point remains that methods must be employed to recover the full-sky clustering signal from surveys available only over a partial sky region. This recovery is particularly challenging if the dipole or any other multipole is significantly larger in comparison to its neighboring multipoles. In such cases, the power from a multipole with high amplitude leaks significantly into its neighboring multipoles, leading to notably higher clustering in those neighboring multipoles as well.
3 Clustering Signal in the Presence of a Large Dipole
Radio continuum surveys lack redshift information, so clustering analysis is typically performed using only the angular positions of galaxies. Conventional approaches involve calculating the two-point correlation function and the angular power spectrum to constrain cosmological parameters. These basic tests can be supplemented with higher-order correlations to achieve a more detailed understanding of the clustering signal and the underlying cosmology.
It is well-established that galaxy catalogs from radio continuum and infrared surveys, and likely from visible surveys as well, exhibit a significant dipole (Secrest et al., 2022; Aluri et al., 2023; Peebles, 2022). This dipole introduces complexities that can affect the reliability of clustering analyses. An excess dipole, whether arising from intrinsic cosmological effects or observational systematics, can contaminate clustering measurements and complicate the extraction of accurate cosmological information. In addition to a significant dipole, non-Gaussian initial perturbations can significantly contribute to large-scale clustering. A simple model for non-Gaussianity can be followed from (Komatsu & Spergel, 2001), where a local type of non-Gaussianity is described by the “simplest weak nonlinear coupling” case, given as:
(1) |
where represents the primordial curvature perturbations, is the Gaussian perturbation, and is the non-Gaussianity parameter. This leads to a scale-dependent galaxy bias (Matarrese et al., 2000; Dalal et al., 2008; Seljak, 2009; Desjacques & Seljak, 2010; Xia et al., 2010; Bruni et al., 2012; Maartens et al., 2013):
(2) |
where is the scale- and redshift-dependent bias, with representing the wave number that corresponds to the inverse of the spatial scale of perturbations, and is the redshift, which denotes the epoch of evaluation. Here, is the scale-independent Gaussian bias, is the critical matter overdensity for ellipsoidal collapse, taken as (Xia et al., 2010; Maartens et al., 2013), is the transfer function, is the growth factor, is the speed of light, and is the Hubble constant. In this letter, while we have specifically considered the local-type primordial non-Gaussianity model for simplicity, it is to note that other inflationary-motivated non-Gaussianity models can similarly lead to scale-dependent biases in galaxy clustering (Matarrese & Verde, 2008).
The most widely used method to study the clustering signal from radio continuum surveys is conventionally through the 2-point correlation functions and the angular power spectrum (). The theoretical formulation for the angular power spectrum is given by (Peebles, 1980):
(3) |
where is the matter power spectrum, is the radial distribution function, representing the probability of observing a galaxy between redshift and , and is the spherical Bessel function of the first kind for integer . The galaxy bias becomes scale-dependent in the presence of non-Gaussianity, i.e., when . For Gaussian primordial perturbations, the galaxy bias is typically assumed to be scale-independent. Subsequently, the theoretical estimate of the 2-PCF, , derived from the angular power spectrum :
(4) |
where represents the Legendre polynomials.
When recovering the full-sky clustering signal from surveys that cover only a partial sky, the dipole signal leaks into other nearby multipoles. This leakage can overlap with the non-Gaussianity signal, which is also dominant at large scales, making it challenging to distinguish between the two effects. To illustrate this issue, we present a typical case for the NRAO VLA Sky Survey (NVSS; Condon et al. 1998). We assume the and bias values for NVSS from Nusser & Tiwari (2015), i.e. and . Here, we focus on qualitative plots; similar results are anticipated for other surveys, such as the upcoming SKA, DESI, and LSST.

In Figure 1, we present the angular power spectrum, which is widely used to study the clustering signal from radio continuum surveys. We consider the parameter up to 150, which is the maximum expected value for this parameter (Becker et al., 2012), although the current limit from CMB temperature and E-mode polarization maps is . For NVSS, Xia et al. (2010) analyzed the clustering signal and found . From the figure, it is evident that the non-Gaussianity signals can cause a significant increase in s up to . However, at this scale, systematic effects may also produce similar increases in clustering. For instance, the TIFR GMRT Sky Survey (TGSS) ADR1 catalog exhibits increased clustering up to , likely due to systematics (Tiwari et al., 2019). If these systematics are not properly accounted for, they may be misinterpreted as significant non-Gaussianity. Additionally, a prominent dipole signal-whether from systematic effects or cosmic origins-can further complicate the analysis. When recovering s from partial sky coverage, which is always the case, the dipole signal can leak into nearby multipoles and mimic non-Gaussianity signals.

To more clearly demonstrate how both the dipole signal and non-Gaussianity can produce similar clustering results, we present the angular 2PCF results in Figure 2. The 2PCF beyond scales of approximately 0.5 degrees is significantly influenced by non-Gaussianity. Interestingly, the presence of a dipole signal closely resembles the non-Gaussianity signal. Notably, the observed NVSS dipole, about (Singal, 2011; Gibelyou & Huterer, 2012; Rubart & Schwarz, 2013; Tiwari et al., 2015; Tiwari & Nusser, 2016; Dolfi et al., 2019; Siewert et al., 2021; Secrest et al., 2022), corresponds approximately to an value of 50, which is consistent with the observed value within one sigma as reported by Xia et al. (2010). It appears that the dipole signal present in the data is being misinterpreted as a non-Gaussianity signal. Furthermore, since large-scale anisotropy and/or the presence of a non-Gaussianity signal affects the 2PCF over a significant range of angular scales, 2PCF is less favored for fitting cosmological models and obtaining cosmological results, particularly when dealing with non-Gaussianity signals. Before fitting any cosmological model to the 2PCF, the dipole signal should be removed. The simplest way to achieve this could be by fitting a cosine function with a constant to the observed 2PCF 111Note from equation 4 that the dipole term corresponds to and is proportional to the cosine.. The s could be a better choice; however, it is essential to ensure that the data is not affected by large-scale systematics, and the dipole and its neighboring multipoles are masked before fitting cosmology. The results we presented are specific to NVSS, but as other radio continuum surveys have similar redshift profiles and galaxy biasing, we expect similar results.
For spectroscopic and photometric surveys where redshift information is available, cosmology is studied using three-dimensional correlation functions, power spectrum, and bispectrum. However, as the dipole signal and non-Gaussianity signals both affect large scales, they remain ambiguous to distinguish even with these estimators. There are also general relativistic (GR) corrections that become larger at higher redshifts and are again more significant at large scales (Bruni et al., 2012). However, since the galaxy population from most existing and planned surveys peaks below or around redshift 1, and considering that is small as suggested by Planck observations, the GR and non-Gaussianity effects remain at the sub-percentage level. The dipole, other large-scale multipoles, and systematics remain the main challenges and must not be misinterpreted as non-Gaussianity signals. These large-scale signals need to be carefully removed before drawing any cosmological conclusions.
We conclude that extracting cosmology from galaxy surveys in the presence of large-scale anisotropies is non-trivial and requires careful consideration. Firstly, systematics should be thoroughly studied and accounted for, as they can lead to an increase in the clustering signal at large scales (Tiwari et al., 2019). Systematic effects present in the data may closely resemble non-Gaussianity signals in both the 2PCF and s and three-dimensional power spectrum and other estimators. Not only non-Gaussianity, but even a dipole signal—whether due to systematics or a true cosmological origin—is difficult to distinguish when using 2PCF as a clustering measure. It may be somewhat distinguishable with s if the survey data covers a sufficiently large sky area and the dipole signal does not leak significantly into nearby multipoles. General relativistic effects, non-Gaussianity, and possibly the cosmic dipole also exhibit some dependence on redshift (Bruni et al., 2012; Tiwari et al., 2022). Studying large-scale clustering signals in redshift slices may potentially help. However, even in this case, the signals could still be misinterpreted as mild non-Gaussianity or vice versa. It is advisable to use the CMB-extracted values of non-Gaussianity as a strong prior when interpreting large-scale anisotropy signals. Furthermore, when possible, the remaining cosmological parameters can be reliably extracted from galaxy catalogs by analyzing the clustering signal while avoiding the largest scales.
References
- Aluri et al. (2023) Aluri, P. K., Cea, P., Chingangbam, P., et al. 2023, Classical and Quantum Gravity, 40, 094001, doi: 10.1088/1361-6382/acbefc
- Bacon et al. (2020) Bacon, D. J., Battye, R. A., Bull, P., et al. 2020, PASA, 37, e007, doi: 10.1017/pasa.2019.51
- Becker et al. (2012) Becker, A., Huterer, D., & Kadota, K. 2012, JCAP, 12, 034, doi: 10.1088/1475-7516/2012/12/034
- Blake & Wall (2002) Blake, C., & Wall, J. 2002, Nature, 416, 150. https://arxiv.org/abs/astro-ph/0203385
- Bruni et al. (2012) Bruni, M., Crittenden, R., Koyama, K., et al. 2012, Phys. Rev. D, 85, 041301, doi: 10.1103/PhysRevD.85.041301
- Clifton et al. (2012) Clifton, T., Ferreira, P. G., Padilla, A., & Skordis, C. 2012, Phys. Rept., 513, 1, doi: 10.1016/j.physrep.2012.01.001
- Cole & Kaiser (1989) Cole, S., & Kaiser, N. 1989, MNRAS, 237, 1127, doi: 10.1093/mnras/237.4.1127
- Colin et al. (2017) Colin, J., Mohayaee, R., Rameez, M., & Sarkar, S. 2017, Mon. Not. Roy. Astron. Soc., 471, 1045, doi: 10.1093/mnras/stx1631
- Condon et al. (1998) Condon, J. J., Cotton, W. D., Greisen, E. W., et al. 1998, AJ, 115, 1693, doi: 10.1086/300337
- Dalal et al. (2008) Dalal, N., Dore, O., Huterer, D., & Shirokov, A. 2008, Phys. Rev. D, 77, 123514, doi: 10.1103/PhysRevD.77.123514
- Dekel & Rees (1987) Dekel, A., & Rees, M. J. 1987, Nature, 326, 455, doi: 10.1038/326455a0
- DESI Collaboration et al. (2016) DESI Collaboration, Aghamousa, A., Aguilar, J., et al. 2016, arXiv e-prints, arXiv:1611.00036, doi: 10.48550/arXiv.1611.00036
- DESI Collaboration et al. (2022) DESI Collaboration, Abareshi, B., Aguilar, J., et al. 2022, AJ, 164, 207, doi: 10.3847/1538-3881/ac882b
- Desjacques & Seljak (2010) Desjacques, V., & Seljak, U. 2010, Class. Quant. Grav., 27, 124011, doi: 10.1088/0264-9381/27/12/124011
- Dolfi et al. (2019) Dolfi, A., Branchini, E., Bilicki, M., et al. 2019, A&A, 623, A148, doi: 10.1051/0004-6361/201834317
- Gibelyou & Huterer (2012) Gibelyou, C., & Huterer, D. 2012, MNRAS, 427, 1994, doi: 10.1111/j.1365-2966.2012.22032.x
- Ivezić et al. (2019) Ivezić, Ž., Kahn, S. M., Tyson, J. A., et al. 2019, ApJ, 873, 111, doi: 10.3847/1538-4357/ab042c
- Kaiser (1984) Kaiser, N. 1984, ApJ, 284, L9, doi: 10.1086/184341
- Komatsu & Spergel (2001) Komatsu, E., & Spergel, D. N. 2001, Phys. Rev. D, 63, 063002, doi: 10.1103/PhysRevD.63.063002
- Maartens et al. (2013) Maartens, R., Zhao, G.-B., Bacon, D., Koyama, K., & Raccanelli, A. 2013, JCAP, 02, 044, doi: 10.1088/1475-7516/2013/02/044
- Matarrese & Verde (2008) Matarrese, S., & Verde, L. 2008, Astrophys. J. Lett., 677, L77, doi: 10.1086/587840
- Matarrese et al. (2000) Matarrese, S., Verde, L., & Jimenez, R. 2000, Astrophys. J., 541, 10, doi: 10.1086/309412
- Nusser & Tiwari (2015) Nusser, A., & Tiwari, P. 2015, ApJ, 812, 85, doi: 10.1088/0004-637X/812/1/85
- Peacock & Smith (2000) Peacock, J. A., & Smith, R. E. 2000, MNRAS, 318, 1144, doi: 10.1046/j.1365-8711.2000.03779.x
- Peebles (1980) Peebles, P. J. E. 1980, The large-scale structure of the universe (Princeton University Press)
- Peebles (2022) Peebles, P. J. E. 2022, Annals Phys., 447, 169159, doi: 10.1016/j.aop.2022.169159
- Press & Schechter (1974) Press, W. H., & Schechter, P. 1974, ApJ, 187, 425, doi: 10.1086/152650
- Rubart & Schwarz (2013) Rubart, M., & Schwarz, D. J. 2013, A&A, 555, doi: 10.1051/0004-6361/201321215
- Secrest et al. (2022) Secrest, N. J., von Hausegger, S., Rameez, M., Mohayaee, R., & Sarkar, S. 2022, Astrophys. J. Lett., 937, L31, doi: 10.3847/2041-8213/ac88c0
- Secrest et al. (2021) Secrest, N. J., von Hausegger, S., Rameez, M., et al. 2021, ApJ, 908, L51, doi: 10.3847/2041-8213/abdd40
- Seljak (2009) Seljak, U. 2009, Phys. Rev. Lett., 102, 021302, doi: 10.1103/PhysRevLett.102.021302
- Sheth & Tormen (1999) Sheth, R. K., & Tormen, G. 1999, Mon. Not. Roy. Astron. Soc., 308, 119, doi: 10.1046/j.1365-8711.1999.02692.x
- Siewert et al. (2021) Siewert, T. M., Schmidt-Rubart, M., & Schwarz, D. J. 2021, A&A, 653, A9, doi: 10.1051/0004-6361/202039840
- Singal (2011) Singal, A. K. 2011, ApJL, 742, L23, doi: 10.1088/2041-8205/742/2/L23
- Singal (2024) Singal, A. K. 2024, Mon. Not. Roy. Astron. Soc., 532, L1, doi: 10.1093/mnrasl/slae039
- Tiwari & Aluri (2019) Tiwari, P., & Aluri, P. K. 2019, Astrophys. J., 878, 32, doi: 10.3847/1538-4357/ab1d58
- Tiwari et al. (2019) Tiwari, P., Ghosh, S., & Jain, P. 2019, Astrophys. J., 887, 175, doi: 10.3847/1538-4357/ab54c8
- Tiwari et al. (2022) Tiwari, P., Kothari, R., & Jain, P. 2022, Astrophys. J. Lett., 924, L36, doi: 10.3847/2041-8213/ac447a
- Tiwari et al. (2015) Tiwari, P., Kothari, R., Naskar, A., Nadkarni-Ghosh, S., & Jain, P. 2015, Astroparticle Physics, 61, 1, doi: 10.1016/j.astropartphys.2014.06.004
- Tiwari & Nusser (2016) Tiwari, P., & Nusser, A. 2016, Journal of Cosmology and Astroparticle Physics, 2016, 062, doi: 10.1088/1475-7516/2016/03/062
- von Hausegger (2024) von Hausegger, S. 2024, Mon. Not. Roy. Astron. Soc., 535, L49, doi: 10.1093/mnrasl/slae092
- Wilkinson (1991) Wilkinson, P. N. 1991, in Astronomical Society of the Pacific Conference Series, Vol. 19, IAU Colloq. 131: Radio Interferometry. Theory, Techniques, and Applications, ed. T. J. Cornwell & R. A. Perley, 428–432
- Xia et al. (2010) Xia, J.-Q., Viel, M., Baccigalupi, C., et al. 2010, Astrophys. J. Lett., 717, L17, doi: 10.1088/2041-8205/717/1/L17