This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

On the possibility of Baryon Acoustic Oscillation measurements at redshift z>7.6z>7.6 with the Roman Space Telescope

Siddharth Satpathy,1,2 Zhaozhou An,1,2 Rupert A. C. Croft,1,2,3,4 Tiziana Di Matteo,1,2,3,4 Ananth Tenneti,1,2 Yu Feng,5 Katrin Heitmann6 & Graziano Rossi7
1Department of Physics, Carnegie Mellon University, 5000 Forbes Ave., Pittsburgh, PA 15213, USA
2The McWilliams Center for Cosmology, Carnegie Mellon University, 5000 Forbes Ave., Pittsburgh, PA 15213, USA
3 School of Physics, The University of Melbourne, VIC 3010, Australia
4 ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
5Berkeley Center for Cosmological Physics, Department of Physics, University of California Berkeley, Berkeley, CA 94720, USA
6HEP and MCS Divisions, Argonne National Laboratory, Lemont, IL 60439, USA
7Department of Physics and Astronomy, Sejong University, Seoul, 143-747, Korea
E-mail: [email protected]
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract

The Nancy Grace Roman Space Telescope (RST), with its field of view and high sensitivity will make surveys of cosmological large-scale structure possible at high redshifts. We investigate the possibility of detecting Baryon Acoustic Oscillations (BAO) at redshifts z>7.6z>7.6 for use as a standard ruler. We use data from the hydrodynamic simulation BlueTides in conjunction with the gigaparsec-scale Outer Rim simulation and a model for patchy reionization to create mock RST High Latitude Survey grism data for Lyman-α\alpha emission line selected galaxies at redshifts z=7.4z=7.4 to z=10z=10, covering 2280 square degrees. We measure the monopoles of galaxies in the mock catalogues and fit the BAO features. We find that for a line flux of L=7×1017erg/s/cm2L=7\times 10^{-17}\ {\rm erg/s/cm}^{2}, the 5σ5\sigma detection limit for the current design, the BAO feature is partially detectable (measured in three out of four survey quadrants analysed independently). The resulting root mean square error on the angular diameter distance to z=7.7z=7.7 is 7.9%\%. If we improve the detection sensitivity by a factor of two (i.e. L=3.5×1017erg/s/cm2L=3.5\times 10^{-17}\ {\rm erg/s/cm}^{2}), the distance error reduces to 1.4%1.4\%. We caution that many more factors are yet to be modelled, including dust obscuration, the damping wing due to the intergalactic medium, and low redshift interlopers. If these issues do not strongly affect the results, or different observational techniques (such as use of multiple lines) can mitigate them, RST or similar instruments may be able to constrain the angular diameter distance to the high redshift Universe.

keywords:
Cosmology: dark energy - Cosmology: large scale structure of Universe - Galaxies: statistics.
pubyear: 2019pagerange: On the possibility of Baryon Acoustic Oscillation measurements at redshift z>7.6z>7.6 with the Roman Space TelescopeOn the possibility of Baryon Acoustic Oscillation measurements at redshift z>7.6z>7.6 with the Roman Space Telescope

1 Introduction

Baryon Acoustic Oscillations (BAO) have become very useful as a standard ruler to constrain cosmological models (see e.g., the review by Weinberg et al. (2013)). Due to the distinctive nature of their signature, the BAO scale constitutes one of the most effective observables in cosmology. The BAO features can be detected as peaks in two point correlation functions or wiggles in power spectra. These signatures of BAO can be used to measure the Hubble constant and the angular diameter distance as function of redshift. As such, it is used as a robust and unique probe to investigate the nature of dark energy.

The sound horizon at the time of recombination leaves its imprint on both the cosmic microwave background (CMB) and three dimensional distribution of matter. Using simulations, in this paper we explore whether it may be possible to measure the BAO scale at redshifts z>7.6z>7.6 using galaxies detected by the RST satellite (formerly the Wide Field Infrared Survey Telescope, WFIRST111 https://wfirst.gsfc.nasa.gov/ ). These measurements would represent the first BAO detection between z=2.3z=2.3 and z=1100z=1100.

The first observations made of BAO (Eisenstein et al., 2005; Cole et al., 2005)) at low redshifts (z00.4z\sim 0-0.4) used luminous red galaxies (LRGs) as tracers of structure, and these have been supplemented by even larger samples at higher redshifts (e.g., Zhai et al. (2017), Bautista et al. (2018)), up to z=0.72z=0.72. The density field at redshifts z23.5z\sim 2-3.5 is accessible using the Lyman-α\alpha forest of absorption in quasar spectra, and its measured clustering has led to the determination of the BAO scale at z2.32.4z\sim 2.3-2.4 (Busca et al., 2013; Slosar et al., 2013; du Mas des Bourboux et al., 2017; de Sainte Agathe et al., 2019). The gap between low redshift galaxy determinations and the forest was recently filled by a determination using quasar clustering at z=1.52z=1.52 (Ata et al., 2018). Emission line galaxies at z=0.85z=0.85 (Raichoor et al., 2017) and z=0.9z=0.9 (Comparat et al., 2016) from the Extended Baryon Oscillation Spectroscopic Survey (eBOSS) will also soon be adding constraints in this crucial redshift range, around the time that the Universe started to accelerate. Radio observations of 21cm emission will increase the volume covered (Vanderlinde & Chime Collaboration, 2014). At higher redshifts, galaxies have been discovered as far away as z=11.1z=11.1 (Oesch et al., 2016), in small Hubble Space Telescope (HST) fields at present. The RST satellite1 will have a much wider field of view than HST, and the planned RST grism survey holds the promise of measuring the large-scale structure of the Universe at early times with galaxy survey techniques which have been hitherto been confined to low redshifts.

Among the reasons for looking at the BAO scale in a different redshift regime are the possibility of early dark energy (Hill & Baxter, 2018), or models where dark matter decays into radiation and affects the expansion rate (Wang et al., 2014). There is also current tension between locally determined H0H_{0} measurements (Riess et al., 2018) and those determined from the CMB in the context of cold dark matter (CDM) (Planck Collaboration et al., 2018a). Using multiple redshifts to determine the consistency (or not) of BAO measurements should be useful in finding a solution to this controversy.

Cosmological surveys designed to measure the BAO feature must cover large volumes, as the comoving length scale involved is 150 megaparsecs, and cosmic variance must be suppressed. The number density of sampling objects must be large enough to overcome shot noise. When there is domination by shot-noise, one can still obtain BAO measurements and the error will just scale as 1/(N)1/\sqrt{(}N), where NN is the number of objects. For example, eBOSS quasars are shot-noise dominated (Zarrouk et al., 2018). The shot-noise dominated regime carries the implication that one can return to the same volume and sample it further to obtain more precise results. Conversely, when one is cosmic variance limited, one gains more from sampling a new volume instead of sampling the same volume.

The current number density of quasars at z3z\sim 3 in the largest survey (final SDSS-III BOSS DR09; Eftekharzadeh et al., 2015) is 1.057±0.0071.057\pm 0.007 (h1h^{-1}Mpc)3, which is too low. Galaxies at similar redshifts and higher exist in sufficient numbers, but their relatively low luminosity has made gathering samples that cover sufficient volume impossible with current technology. RST and Euclid will enable the Universe at z=6z=6 to be studied with a more representative sample of galaxies than have been possible at present (small current fields of view lead to rarer brighter galaxies being missed). So far, galaxies have been found up to redshifts z=12z=12 with dropout techniques (Ellis et al., 2013). The largest samples with photometric redshifts at z6z\geq 6 are Chaikin et al. (2018), Yan et al. (2011), Bouwens et al. (2011), Laporte et al. (2017), Eyles et al. (2005) and Mobasher et al. (2005) which contain 72, 20, 1, 1, 2 and 1 galaxies respectively. In a few cases spectral confirmation has been obtained by identifiying the Lyman-α\alpha line (refs for z>7.5z>7.5 galaxies (Finkelstein et al., 2013; Song et al., 2016). Between z=6z=6 and 77, the Candels survey (Pentericci et al., 2018) has found 260 spectroscopically confirmed galaxies. Narrow band imaging surveys searching directly for the Lyman-α\alpha line offer a route to large samples in principle. An early example of such a search includes Tilvi et al. (2010) at z=7.7z=7.7 which was not able to confirm any objects. More recently, the LAGER survey (Hu et al., 2017) has followed up candidates with further spectroscopy and found a 66% success rate at z7.0z\sim 7.0.

There are number of obstacles to the finding and use of z>7.6z>7.6 galaxies as a BAO probe. The most important is the actual visibility of lines that would enable spectroscopic identification. The Lyman-α\alpha line is subject to multiple scatterings and absorption by dust within galaxies. It is also absorbed by neutral hydrogen content of the intergalactic medium (IGM), and Lyman-α\alpha emitting galaxies at the highest redshifts have proved difficult to find. One the other hand, at z6.5z\sim 6.5, Bagley et al. (2017) have found that Lyman-α\alpha emitters are detectable and may lie mainly in ionized bubbles, sufficiently large for the Lyman-α\alpha photons to redshift out of resonance before encountering the neutral IGM gas. At lower redshifts (e.g., z23z\sim 2-3), Lyman-α\alpha emission appears to have an escape fraction of 510\sim 5-10% (Gronwall et al., 2007; Francis et al., 2012) but at higher redshifts z>6z>6 it appears to climb to near 100% (Cassata, P. et al., 2011; Sobacchi & Mesinger, 2015).

Interlopers are another important issue. For example there will be galaxies at redshifts 1.1<z<1.91.1<z<1.9 for which Hydrogen Balmer lines will fall in the spectral window for Lyman-α\alpha at redshifts 8\sim 8. Intermediate redshift (1.7<z<2.81.7<z<2.8) galaxies will also contribute [OIII] 5007 emission. How well RST grism spectra could be used to identify galaxies at the highest redshifts and remove interlopers is an open question.

In this paper we carry out the simplest analysis possible, using a combination of a large darkmatter simulation (Outer Rim; Li et al., 2019) and a hydrodynamic simulation (BlueTides; Wilkins et al., 2016a; Wilkins et al., 2016b) to see whether BAO could be detected at redshifts z8z\sim 8 with RST’s spectral coverage. We make many simplifying assumptions, each of which are likely to strongly affect the outcome, but can be tested with future work. We assume that all Lyman-α\alpha emission can be seen. In other words, we suppose that the Lyman-α\alpha emitting galaxies lie in ionized bubbles (Kakiichi et al., 2016; Yajima et al., 2018). We ignore galaxy-scale extinction of Lyman-α\alpha emission. That is, we assume that the raw star formation in our hydrodynamic simulation of galaxy formation, BlueTides (Wilkins et al., 2016a; Wilkins et al., 2016b) can be directly translated into Lyman-α\alpha luminosity. We also do not directly address interlopers and do not do a complete analysis of spectral overlap in the grism spectra. We restrict ourselves to seeing whether mock observations of a RST grism survey under these conditions can detect the BAO feature. Our analysis therefore acts as a first step - if we find that it is possible, then these restrictions should be relaxed and tested to see whether it is worth pursuing the idea further.

The plan for the paper is as follows. In Section 2 we describe the different cosmological datasets we use. In Section 3 we show how these were used to construct mock catalogues for the RST high latitude survey. We describe details results obtained from mock catalogues in Section 4. We present comprehensive discussion and analysis of our results in Section 5.

Table 1: Simulation cosmological parameters based on assumed Λ\LambdaCDM cosmological model.
Name BlueTides Outer Rim
ΩΛ\Omega_{\Lambda} 0.7186 0.7352
ΩMatter\Omega_{\text{Matter}} 0.2814 0.2648
ΩBaryon\Omega_{\text{Baryon}} 0.0464 0.04479
h 0.697 0.71
σ8\sigma_{8} 0.820 0.8
nsn_{s} 0.971 0.963

2 Datasets

2.1 The BlueTides simulation

BlueTides is a hydrodynamic simulation which was designed to cover the high redshift Universe (Feng et al., 2015, 2016). The model uses the Smoothed Particle Hydrodynamics code MP-Gadget to evolve a cubical volume with 2×704032\times 7040^{3} particles. These particles are used to model dark matter, gas, stars and supermassive black holes. This simulation evolved a (400/h577)3(400/h\approx 577)^{3} cMpc3 cube to redshift z=7.3z=7.3. The resolution, volume and number of galaxy halos in BlueTides are well matched to depth and area of current observational surveys targeting the high-redshift Universe. Additional details of the BlueTides simulation can be found in (Feng et al., 2015, 2016; Wilkins et al., 2016a; Wilkins et al., 2016b; Wilkins et al., 2017).

A Friends-of-Friends (FoF) algorithm was used to identify galaxy halos. At the high redshifts relevant here these halos are sufficiently accurate representations of the galaxies that would be found in the simulations if observational determinations were used. This was seen directly in Feng et al. (2016) by comparing FoF halos with galaxy halos extracted from BlueTides mock images and identified with the Source Extractor algorithm (Bertin & Arnouts, 1996).

Feng et al. (2016) used results from the BlueTides simulation to find good correspondence between the predicted galaxy luminosity function and observations in the observable range 18MUV22.5-18\leq M_{\rm UV}\leq-22.5 with some dust extinction required to match the abundance of brighter objects. Additionally, Feng et al. (2016) also find good agreement between the star formation rate density in BlueTides and current observations in the redshift range 8z108\leq z\leq 10.

Wilkins et al. (2017) use galaxy halos from BlueTides simulation to show that intrinsic and observed luminosity functions at z{8,9,10}z\in\{8,9,10\} exhibit the fast expected build up of the galaxy population at high-redshifts. The work presented in Wilkins et al. (2017) reveals that the intrinsic luminosity function is broadly similar to the observed luminosity function at faint luminosities (M>20M>-20). At brighter luminosities, the luminosity function is shown to have a stronger steepening which indicates the increasing strength of dust attenuation. In their work, Wilkins et al. (2017) also demonstrate that a Schechter function provides a good overall fit to the shape of a the dust attenuated far-UV luminosity function for z{8,9,10}z\in\{8,9,10\}.

In Waters et al. (2016), it was shown that the number density of galaxies that will be observable by the RST High Latitude Survey should be high enough that BAO measurement at redshift z8z\sim 8 could be possible. The BlueTides volume being too small to make direct mock HLS observations and measurements of the BAO, it was necessary to follow up with different techniques, and this is the role of the current paper.

The BlueTides simulation is used in this paper in two roles: First, to make some example RST grism fields to check the degree of spectral overlap between high redshift galaxies and others at similar redshifts. Second, we use the BlueTides galaxies to populate the dark matter halos in a much larger dark matter only simulation. For the dark matter only simultion, we use the Outer Rim catalogue. Details of the Outer Rim simulation are described in  2.2.

2.2 The Outer Rim simulation

Refer to caption
Figure 1: The variation of star formation rate with mass for BlueTides galaxies at z=8z=8.
Refer to caption
Figure 2: A histogram of star formation rates for galaxies with masses from 2×1011M/h2\times 10^{11}M_{\sun}/h to 5×1011M/h5\times 10^{11}M_{\sun}/h in redshift 8. The red bins represent galaxies that can be observed with line detection sensitivity, L=7×1017erg/s/cm2L=7\times 10^{-17}\ {\rm erg/s/cm}^{2}.

The Outer Rim simulation (Heitmann et al., 2019; Li et al., 2019) is a large-scale dark matter only simulation which we use to measure large-scale clustering and BAO. It was run using the Hardware/Hybrid Accelerated Cosmology Code (HACC; Habib et al., 2013; Habib et al., 2016), on Mira, a BG/Q system at the Argonne Leadership Computing Facility. The cosmology used is a Λ\LambdaCDM model close to the best-fit model from WMAP-7 (Komatsu et al., 2011), with parameters given in Table 1. The comoving box size of the simulation is 3000h13000\ h^{-1}Mpc, yielding a volume 400 times that of BlueTides. 1.07 trillion particles were evolved, leading to a particle mass of mp=1.85×109h1Mm_{p}=1.85\times 10^{9}\ h^{-1}M_{\sun}. The model was run to redshift z=0z=0, but in this paper we use only outputs at z=7z=7 and above.

Halos are identified with a Friends-of-Friends (Davis et al., 1985) halo finder with a linking length of b=0.168b=0.168.

2.3 Galaxy Lyman-α\alpha flux

Refer to caption
Figure 3: A simulation of a grism image with galaxies at redshift z=8z=8 from the BlueTides simulation. Each horizontal line denotes the spectrum of a galaxy. The brightness is proportional to the flux in the pixels.

At the redshifts z8z\sim 8 which concern us here, we will simulate the Lyman-α\alpha line luminosity of galaxies, which will lie within the RST grism spectral range (see section 3.1 below). An example of Lyman-α\alpha line detection from a galaxy at these redshifts is given in the work of Lehnert et al. (2010). The redshift of the galaxy UDFy-381355395 is z=8.6z=8.6, and Lehnert et al. (2010) measured a Lyman-α\alpha line flux of (6.1±1.0)×1018(6.1\pm 1.0)\times 10^{-18} erg/s/cm2, corresponding to a luminosity of 5.5±1.0±1.8×10425.5\pm 1.0\pm 1.8\times 10^{42} erg/s.

In BlueTides, the star formation rate (SFR) for each galaxy is available to us, and we translate this quantity directly into Lyman-α\alpha luminosity using the following scaling equation (Cassata, P. et al., 2011).

LLyα(ergs1)=1.1×1042×SFR(Myr1)L_{{\rm Ly}\alpha}({\rm erg\ s^{-1}})=1.1\times 10^{42}\times{\rm SFR}(M_{\sun}\ {\rm yr}^{-1}) (1)

In equation 1, LLyαL_{{\rm Ly}\alpha} denotes the Lyman-α\alpha luminosity. This conversion factor is based on a stellar population with a Salpeter initial mass function (IMF) and is accurate to within a factor of a few for a range of population ages, high mass cutoffs of stars, and metallicities (Leitherer et al., 1999). Most relevant in our case is that there is no correction for effects like dust, escape fraction, and intergalactic or circumgalactic absorption. This means that using the raw star formation rate measurements from the simulation represents a best case for the detectability of galaxies and this should be borne in mind when interpreting the results. We will return to this in Section 5. Due to the paucity of observational data at these redshifts, the magnitude of these obscuring effects is not well known, but they are likely to be significant. In the example of the galaxy mentioned in Lehnert et al. (2010), the continuum SFR was estimated to be 24Myr12-4\ M_{\sun}\ {\rm yr}^{-1}. At the same time, Lehnert et al. (2010) evaluated the SFR from the Lyman-α\alpha line to be 0.32.1Myr10.3-2.1\ M_{\sun}\ {\rm yr}^{-1}, which leads to Lyman-α\alpha overall escape fraction from 8%100%8\%-100\%.

Refer to caption
Figure 4: A grism image for a model for L=7×1017erg/s/cm2L=7\times 10^{-17}\ {\rm erg/s/cm}^{2}. Each blue block represents a grism spectrum for a single galaxy. We use this plot to estimate overlap.

The BlueTides simulation volume is not sufficiently large to make mock RST survey catalogues that can be used to measure large-scale structure. We use the halo catalogues from the Outer Rim simulation to do this, painting SFRs from BlueTides galaxies onto correspondingly massive Outer Rim halos. Fig. 1 shows the relationship between mass and SFR for BlueTides at redshift z=8z=8, We can see that there is an approximately linear relationship between log(M)\log(M) and log(SFR)\log({\rm SFR}). Our SFR assignment scheme is as follows: for each Outer Rim halo, we randomly pick a BlueTides halo with total mass within 5×10105\times 10^{10} h1Mh^{-1}M_{\sun}, and assign the SFR for that halo to the Outer Rim object. In this way, we account for some aspects of the scatter between SFR and halo mass, although not any spatially correlated structure. For large mass galaxies we increase the range to make sure there are enough sample halos in BlueTides.

In one of the RST design documents, the 7σ7\sigma line flux sensitivity for the High Latitude Survey is 1016erg/s/cm210^{-16}\ {\rm erg/s/cm}^{2}. 222 https://wfirst.gsfc.nasa.gov/science/sdt_\_public/WFIRST-AFTA_\_SDT_\_Report_\_150310_\_Final.pdf We discuss RST and the flux limits on our mock surveys further in Section 3.1.1.

Fig. 2 shows the SFR distribution of BlueTides galaxies at z=8z=8 with mass from 2×1011h1M2\times 10^{11}\ h^{-1}M_{\sun} to 5×1011h1M5\times 10^{11}\ h^{-1}M_{\sun}. We can see that when the line sensitivity is 7×1017erg/s/cm27\times 10^{-17}\ {\rm erg/s/cm}^{2}, then the minimum detectable luminosity is 5×1043erg/s5\times 10^{43}\ {\rm erg/s} at z=8z=8, which corresponds to 40Myr140\ M_{\sun}\ {\rm yr}^{-1}. Only the most highly star-forming galaxies are therefore likely to be included in any BAO survey.

3 Methodology

Refer to caption
Figure 5: The percentage of overlapping galaxies as a function of line sensitivity. We notice a monotonically increasing trend in the variation of the percent of overlapping galaxies with line sensitivity. The solid line denotes a one dimensional interpolation between data points.

3.1 Mock grism images

Working with grism spectroscopy is complex (McCarthy et al., 1999; Fossati et al., 2017; Colbert et al., 2018), with relatively low resolution possible, mixing of spatial and spectral structure in background and stray light, and overlap of sources. We leave full simulation of mock RST grism surveys (including foregrounds and interlopers) to other work (Gehrels et al., 2015; Colbert et al., 2018; Malhotra, 2019). We will however make some extremely simple grism images using BlueTides simulation data, in order to assess the level of spectral overlap from galaxies at similar redshifts.

3.1.1 The RST High Latitude Grism Survey

The RST High Latitude Survey (HLS) is planned to have a nominal sky coverage of 2400 deg2, which will greatly increase the number of galaxies available at redshifts z8z\geq 8. This survey will be conducted by RST’s Wide Field Instrument (WFI) and is expected to be a dual imaging and grism spectroscopy survey. The imaging survey will obtain direct imaging through four filters (Y106, J129, H158, and F184) and grism spectroscopy will be done at four roll angles. The filters Y106, J129, H158, and F184 are expected to enable imaging in wavelength ranges 0.9271.192μm0.927-1.192\ \mu{\rm m}, 1.1311.454μm1.131-1.454\ \mu{\rm m}, 1.3801.774μm1.380-1.774\ \mu{\rm m} and 1.6832.000μm1.683-2.000\ \mu{\rm m} respectively. The imaging survey will therefore cover the wavelength range (0.9272μm0.927-2\ \mu{\rm m}). The grism coverage is the most relevant for our work, and the planned accessible range of wavelengths specified in the WFIRST-AFTA SDT report 2015 ((Spergel et al., 2015)) is 11.93μm1-1.93\ \mu{\rm m}. With these specifications, Lyman-α\alpha emission at 7.22<z<z=14.887.22<z<z=14.88 will be visible.

Refer to caption
Figure 6: The figure on the left is cross section of original data. The figure on the right is the cross section for same position in simulated data with L=6×1017erg/s/cm2L=6\times 10^{-17}\ {\rm erg/s/cm}^{2}.

3.1.2 Image construction

We take the z=8z=8 snapshot of BlueTides and project the positions of galaxies into two dimensions using Euclidean geometry. The simulation box comoving side length of 400h1400\ h^{-1}Mpc corresponds to the comoving radial distance between z=7.5z=7.5 and z=9.5z=9.5. As the number density of galaxies decreases rapidly with redshift, our estimate of the overlap between galaxies in our crude image simulation is likely to be conservative. At each galaxy position we plot a dispersed galaxy spectrum using the relationship between angular position and wavelength appropriate for RST (12μm1-2\ \mu{\rm m}) . We include the effect of the redshift distortions on mock grism image by shifting the wavelengths of the spectrum of each galaxy according to the peculiar velocity along the line of sight. The galaxy spectrum data corresponds to models described in Fig. 2d of Wilkins et al. (2013). Fig. 2d of Wilkins et al. (2013) illustrates the spectral energy distribution of a stellar population which has formed stars at a constant rate for 100 Myr for z=8.0z=8.0. For our present purposes identical spectra for each galaxy are sufficient and we do this for ease of computation. In future work with more realistic simulations, the spectra for each individual galaxy can be determined using population synthesis models which are available and could be used (Wilkins et al., 2019).

The angular extent of the spectra perpendicular to the dispersion direction is determined by the approximate angular size of each individual galaxy. For the angular size part, we refer to Feng et al. (2016) to get MUV{\rm M_{UV}} from SFR. After this we use relationship of galaxy half-light radii vs MUV{\rm M_{UV}} shown in Figure 4 of Feng et al. (2015) to get galaxy half life radius from MUV{\rm M_{UV}}. Using this information, we find that galaxies of mass 1011h1M10^{11}\ h^{-1}M_{\sun} have an angular size of 0.11 arcsec in the grism image.

Fig. 3 shows one of the simulated grism images. The brightness shown in the picture is proportional to the flux in the pixels.

3.1.3 Spectral overlap

It is common that there will be overlap between galaxies in the grism image because some galaxies are close to each other and the direction is close to the direction between two galaxies. This will make the process of extracting galaxy spectra harder and lead to a decrease in the number of observed galaxies with reliable position information. It is therefore necessary to consider the overlap in the grism image. Usually we can mitigate this effect by taking grism images with different dispersion direction. RST will take grism images at four angles: 00^{\circ}, 1515^{\circ}, 180180^{\circ} and 195195^{\circ}. To estimate the upper bound of overlap, we will only consider a grism image with dispersion in one direction. Fig. 4 shows the simplified model that we use to estimate overlap. We use rectangular blocks that have the same length as grism spectrum to replace the galaxies, and find the blocks that overlap with each other and mark the galaxies as overlapping galaxies. We calculate the percentage of overlapping galaxies. Fig. 5 shows the relationship between the percentage of overlapping galaxies and line sensitivity. We can see the percentage increase when line sensitivity decreases, with the percentage around 7%7\% at L=3.5×1017erg/s/cm2L=3.5\times 10^{-17}\ {\rm erg/s/cm}^{2} (we will see later that this is around the level that accurate BAO fitting is possible). We calculate the upper bound of overlap by neglecting multiple dispersion directions and regarding partial overlap as full overlap, so the real overlap should be less than this.

We note of course that here we are only treating overlap from galaxies in the same high redshift range that we are interested in. These will be greatly outnumbered by a much higher density of foreground galaxies from below z=7.4z=7.4 in an actual survey. We leave the problem of overlap from these galaxies and also non-overlapping interlopers (galaxies with other spectral lines in the same wavelength range as Lyman-α\alpha at high redshifts) to future work.

3.2 Mock survey data

Refer to caption
Figure 7: The relationship between the probability of a galaxy being observed and its mass. The solid lines are one dimensional interpolations between data points.

The data we use from the Outer Rim simulation covers snapshots from z=7.4\text{z}=7.4 to z=10\text{z}=10. Our mock RST high-latitude survey (HLS) data, will cover four 3000×3000×700h33000\times 3000\times 700\ h^{-3}Mpc3 cuboids drawn from the 30003h33000^{3}\ h^{-3}Mpc3 simulation volume. This corresponds to a sky area of 4×570=22804\times 570=2280 sq. deg, approximately equal to the total sky coverage of the RST HLS. The redshift coverage in the line of sight dimension is z=7.4\text{z}=7.4 to z=10\text{z}=10, with the survey volume covering 4×6.3=25.24\times 6.3=25.2 comoving (Gpc/h)3h)^{3}. We use the flat sky approximation for simplicity, and the conversion from distance to angle above refers to the edge of the volume, at z=10z=10. Because the whole of the Outer Rim simulation is necessary to produce a single mock RST HLS, we will analyse the four separate cuboids drawn from Outer Rim separately, as quarter-mocks. The scatter between the results from the different quarter-mocks will give us a crude estimate of cosmic variance.

In order to include redshift evolution in the mock surveys, we choose a cuboid with fixed position in different snapshots, and calculate the comoving radial distance of each redshift. We then extract the corresponding slice in the cuboid from different snapshots according to the relevant comoving radial distance. Stacking all the slices in order of redshift yields the mock survey. The redshift evolution is discrete rather than a true lightcone - we use redshifts at z=7.4, 7.7, 7.9, 8.4, 8.6, 8.8, 8.0, 9.2, 9.4, 9.6, 9.8 and 10.0. We include the effect of redshift distortions in the mock surveys using the peculiar velocities of the simulated galaxies.

As the Outer Rim simulation covers a 3000×3000×3000h33000\times 3000\times 3000\ h^{-3}Mpc3 cube, we are able to create four mock cuboids from the Outer Rim data, using different parts of the volume. Each of these covers 1/4\sim 1/4 of the RST HLS area, so that there are therefore four approximately independent quarter-mock surveys to use for baryon acoustic peak fitting.

Refer to caption
Figure 8: Here we show the distribution of galaxies along the z-axis for different line sensitivities.

As stated above, we assign BlueTides galaxy luminosities to Outer Rim halos according to their mass. We make several sets of mock surveys corresponding to different galaxy detection thresholds (different line sensitivities):

Table 2: This table shows the numbers of halos present in, and the mean redshifts of the three quarter-mock catalogues (L7, L3.5, L1).
Line detection Mock 1 Mock 2 Mock 3 Mock 4
sensitivity
Number of halos (before patchy reionization)
77 erg/s/cm2 15,484 15,344 15,280 15,144
3.53.5 erg/s/cm2 37,091 37,091 37,154 36,896
11 erg/s/cm2 194,684 194,522 195,157 194,107
Number of halos (with patchy reionization)
77 erg/s/cm2 13,999 14,040 13,989 13,910
3.53.5 erg/s/cm2 32,124 32,238 32,465 32,232
11 erg/s/cm2 147,463 148,150 148,994 147,529
Mean redshifts (before patchy reionization)
77 erg/s/cm2 7.6474 7.6498 7.6530 7.6519
3.53.5 erg/s/cm2 7.6895 7.6912 7.6962 7.6953
11 erg/s/cm2 7.7852 7.7863 7.7884 7.7867
Mean redshifts (with patchy reionization)
77 erg/s/cm2 7.6645 7.6650 7.6657 7.6711
3.53.5 erg/s/cm2 7.7156 7.7168 7.7197 7.7228
11 erg/s/cm2 7.8276 7.8280 7.8294 7.8297

(a) L7 quarter-mocks. The 2015 WFIRST technical report (Casertano et al., 2015) estimates that with current design parameters, the RST HLS grism survey will have a 7σ7\sigma Hα\alpha line detection sensitivity of 612×1017erg/s/cm26-12\times 10^{-17}\ {\rm erg/s/cm}^{2} for sources with effective radius <0.3<0.3 arcsecs. This is for Hα\alpha spectral line (λrest=6563Å\lambda_{\rm rest}=6563\ \mbox{\AA}) in the redshift range z=1.05z=1.05 to z=1.88z=1.88. In Spergel et al. (2015), a value of 1×1017erg/s/cm21\times 10^{-17}\ {\rm erg/s/cm}^{2} at 7σ7\sigma is quoted, which is in the same range. In our analysis, we use this value, but assume a 5 σ\sigma detection limit, which results in a lower limit to the detectable line flux of 7×1017erg/s/cm27\times 10^{-17}\ {\rm erg/s/cm}^{2}. As the RST survey strategies are as yet unfinalized, we assume that there could be some flexibility, both in the selection of the fiducial sensitivity, and also possibilities of deeper integrations over part of the survey area.

We assume that this sensitivity can be applied to the Lyman-α\alpha line over the same observed range of wavelengths. The line sensitivity for resolved sources is approximately a factor of two worse, but we expect the galaxies to be unresolved. For example, Feng et al. (2015) find half light radii of 0.510.5\sim 1 kpc for the most massive galaxies in BlueTides at redshift z=8z=8, which corresponds to 0.10.20.1-0.2 arcsec. For our first mocks, those closest to the RST design sensitivity, the mimimum Lyman-α\alpha line flux is 7×1017erg/s/cm27\times 10^{-17}\ {\rm erg/s/cm}^{2}. We name these the L7 quarter-mocks in what follows.
(b) L3.5 quarter-mocks. Unfortunately, even with the above parameter choices, which are in themselves relatively optimistic regarding the capabilities of RST, we shall see below that the possibility of detecting BAO is marginal. We therefore make another set of quarter-mocks with a 50% less restrictive line detection sensitivity limit, 3.5×1017erg/s/cm23.5\times 10^{-17}\ {\rm erg/s/cm}^{2}. This sensitivity limit could in principle be reached by increasing the integration time and lowering the sky area covered by the HLS. We name these the L3.5 quarter-mocks.
(c) L1 quarter-mocks. The final set of mocks we use for analysis have a line detection sensitivity of 1×1017erg/s/cm21\times 10^{-17}\ {\rm erg/s/cm}^{2} 7 times better than the planned RST design. We create these (L1) quarter-mocks in order to explore BAO measurement when there is extremely good sampling.

In Fig. 2 we show galaxies with SFR more than 44.20yr1M44.20\ {\rm yr}^{-1}M_{\sun} account for 15.9% of the total galaxy counts. We assume that if a galaxy mass falls between 2×1011h1M2\times 10^{11}\ h^{-1}M_{\sun} and 5×1011h1M5\times 10^{11}\ h^{-1}M_{\sun}, the probability of it being able to be detected is 15.9%. We calculate this probability for different mass ranges by using data from BlueTides, and match galaxies from simulated cuboid to decide which galaxies will be detected in the mock survey. Fig. 6 shows the difference after we apply this selection. Only 2% galaxies is kept in the right figure compared with left one. And, Fig. 7 shows the relationship between probability and mass. The maximum mass in redshift 9 and 10 is much smaller than other redshift, but considering the maximum mass of the Outer Rim is 1.30×1012h1M1.30\times 10^{12}\ h^{-1}M_{\sun} in redshift 9.2 and 6.7×1011h1M6.7\times 10^{11}\ h^{-1}M_{\sun} in redshift 10. Fig. 7 is enough to cover the mass range in the Outer Rim. And Fig. 8 shows the distribution of galaxies along the z-axis under different line sensitivity. There are several sudden changes in the distribution, the reason is that we use snapshots from discrete redshift to construct mock data so there is discontinuity between two redshift slices.

3.3 Measuring the correlation function

We use the Landy Szalay estimator (Landy & Szalay, 1993) to compute anisotropic two point correlation functions ξ^(s)\hat{\xi}(s) of galaxies in our datasets. We prefer the Landy Szalay estimator, since its performance has been proved to be better than other comparable two point correlation functions (Davis & Peebles, 1983; Hamilton, 1993) at large scales (Pons-Bordería et al., 1999; Kerscher et al., 2000). Equation 2 gives the formula for the calculation of the Landy Szalay estimator.

ξ^LS(s,μ)=DD(s,μ)2DR(s,μ)+RR(s,μ)RR(s,μ).\hat{\xi}_{\rm LS}(s,\mu)=\frac{DD(s,\mu)-2DR(s,\mu)+RR(s,\mu)}{RR(s,\mu)}. (2)

In equation 2, μ=cosθ\mu=cos\ \theta (where θ\theta is the angle between the line of sight and the pair separation vector for any given pair), DD(s,μ)DD(s,\mu) depicts the pair count of galaxies with pair separation distance ss and orientation μ\mu. The symbol DR(s,μ)DR(s,\mu) represents the cross-pair counts between galaxies and randoms which have pair separation ss and angle θ\theta, and RR(s,μ)RR(s,\mu) is the number of pairs for a random distribution. We have computed the correlation functions as a function of ss and μ\mu as an additional check on our calculations, to see that the clustering is behaving as expected. We do not present or use the two dimensional results further because the use of the full two-dimensional two point correlation functions in analyses is computationally expensive. One can use isotropised correlation functions (ξ~f(s)\tilde{\xi}_{f}(s)) to circumvent this. To obtain isotropised correlation functions, one uses specific kernels to marginalize out the μ\mu coordinate in the two point correlation function. In Hamilton (1993) it was shown that one can use Legendre polynomials as a kernel (f(μ)=P(μ)f(\mu)=P_{\ell}(\mu) to find the isotropic correlation functions ξ~(s)\tilde{\xi}_{\ell}(s). Different values of \ell will lead to isotropic correlation functions of different orders. In our research, we use isotropic correlation functions with Legendre polynomials and =0\ell=0 to obtain isotropized monopoles: (ξ~0(s)\tilde{\xi}_{0}(s)). For completeness, Equation 3 shows the process by which anisotropic correlationfunction ξ(s){\xi}_{\ell}(s) is obtained from the marginalization of anisotropic two correlation function (ξ^(s,μ)\hat{\xi}(s,\mu)).

ξ~(s)\displaystyle\tilde{\xi}_{\ell}(s) =2+1211ξ^LS(s,μ)P(μ)𝑑μ.\displaystyle=\frac{2\ell+1}{2}\int^{1}_{-1}\hat{\xi}_{\rm LS}(s,\mu)P_{\ell}(\mu)d\mu. (3)

In equation 3.3, we show how we can compute ξ~0(s)\tilde{\xi}_{0}(s) from equation 3.

ξ~0(s)\displaystyle\tilde{\xi}_{0}(s) =(12)11ξ^LS(s,μ)𝑑μ\displaystyle=\left(\frac{1}{2}\right)\int^{1}_{-1}\hat{\xi}_{\rm LS}(s,\mu)d\mu
(12)jΔμjξ^LS(s,μj).\displaystyle\approx\left(\frac{1}{2}\right)\displaystyle\sum_{j}\Delta\mu_{j}\hat{\xi}_{\rm LS}(s,\mu_{j}). (4)

In the work presented in this paper, we have used 100 bins in μ\mu in the computation of all two dimensional two-point correlation functions. Also, all the multipoles (ξ0,2(s)\xi_{0,2}(s)) that we compute for BlueTides and Outer Rim galaxies have evenly spaced bins of width 3h13\ h^{-1}Mpc in ss. Apart from their use as an internal check here, we reserve the presentation of results for multipoles higher than the monopole for future work.

3.4 Fitting the correlation function

Refer to caption
Figure 9: This plot shows a slice at rz=1500h1{\rm r}_{\rm z}=1500h^{-1} Mpc obtained from the three dimensional grid of cells with ionized and non-ionized regions determined from the reionization redshift field, zRE(𝐱)z_{\rm RE}(\mathbf{x}). The slice thicknesses in this three dimensional grid are 10 h1h^{-1} Mpc. The white patches correspond to regions which are ionized at z=9.4z=9.4, while the black patches denote non-ionized regions.
Refer to caption
Refer to caption
Figure 10: The plot on the left hand side shows a slice (rz=1500h1{\rm r}_{\rm z}=1500h^{-1} Mpc) of Outer Rim data at z=7.4z=7.4 before patchy reionization. In the right hand side, we show a slice (rz=1500h1{\rm r}_{\rm z}=1500h^{-1} Mpc) of Outer Rim data at z=7.4z=7.4 after reionization. The plot on the right hand side is obtained after Outer Rim data at z=7.4z=7.4 is analyzed using the grid of ionized and non-ionized regions described in section 3.5 and illustrated in Fig. 9.

For fitting of the correlation functions, we use techniques which are outlined in Anderson et al. (2012); Xu et al. (2012, 2013); Vargas-Magaña et al. (2014); Ansarinejad & Shanks (2018); Sarpa et al. (2019). We briefly describe the methods that we have used in this section.

We use the following fiducial form to fit space correlation monopoles ξ0(s)\xi_{0}(s) over the range 33h1Mpc<s<138h1Mpc33\ h^{-1}{\rm Mpc}<s<138\ h^{-1}{\rm Mpc} with bin sizes of s=3h1s=3\ h^{-1}Mpc.

ξfit(s)=B2ξm(αs)+A(s).\displaystyle\xi^{\rm fit}(s)=B^{2}\xi_{{\rm m}}(\alpha s)+A(s). (5)

Here, the symbol ‘ss’ represents the pair separation between galaxies in redshift space. The quantity BB is a multiplicative constant bias which adjusts the amplitude of the correlation functions (ξ\xi), and lets the model account for any unknown large-scale bias. The term

A(s)=a1s2+a2s+a3.\displaystyle A(s)=\frac{a_{1}}{s^{2}}+\frac{a_{2}}{s}+a_{3}. (6)

is included so that the fitting model can marginalize over broad-band effects due to scale dependent bias, redshift space distortions and errors during our assumption of the fiducial cosmology. The parameters a1,a2,a3a_{1},\ a_{2},\ a_{3} in equation 6 are nuisance parameters. The choice of number of parameters in A(s)A(s) in our work is inspired by Anderson et al. (2012); Xu et al. (2012, 2013); Vargas-Magaña et al. (2014); Ansarinejad & Shanks (2018); Sarpa et al. (2019). The inclusion of A(s)A(s) can help to mitigate the effects of assumption of a wrong cosmological model. At the same time, A(s)A(s) can be also be thought of as an effectual description of mode coupling not affecting the BAO scale but biasing its measurement if not considered properly (Crocce & Scoccimarro, 2008).

The scale dilation parameter α\alpha measures the position of the BAO peak of the data relative to the model. One can also think of the parameter α\alpha as the extent to which the acoustic peak in the data is shifted relative to the model. This isotropic shift in the positions of the BAO peak in the data and the model occurs due to non-linear structure growth. A numerical definition of α\alpha can be obtained in equation 7.

α=DV(z)rdrd,fidDV,fid(z)\displaystyle\alpha=\frac{D_{\rm V}(z)}{r_{d}}\frac{r_{d,{\rm fid}}}{D_{{\rm V},{\rm fid}}(z)} (7)

In equation 7 rdr_{d} represents the sound horizon at the drag epoch. The superscript ‘fid’ denotes the fiducial cosmology (given in Table 1.) A value of α<1\alpha<1 would represent a shift towards larger scales, whereas a value of α>1\alpha>1 denotes a shift towards smaller distance scales. The volume averaged distance DVD_{\rm V} is related to the Hubble constant HH, the angular diameter distance DAD_{\rm A}, the speed of light cc and the redshift zz by the following equation:

DV=[(1+z)2czDA2H]1/3D_{\rm V}=\left[(1+z)^{2}cz\frac{D_{\rm A}^{2}}{H}\right]^{1/3} (8)

The template power spectrum which we use is defined in the below mentioned relation.

Pm(k)=[Plin(k)Psmooth(k)]ek2ΣNL2/2+Psmooth(k)\displaystyle P_{\rm m}(k)=\left[P_{\rm lin}(k)-P_{\rm smooth}(k)\right]e^{-k^{2}\Sigma_{{\rm NL}}^{2}/2}+P_{\rm smooth}(k) (9)

In equation 9, Plin(k)P_{\rm lin}(k) represents the actual linear power spectrum at z=0z=0. The term Psmooth(k)P_{\rm smooth}(k) denotes the de-wiggled limit where the BAO feature is removed. More details of Psmooth(k)P_{\rm smooth}(k) can be found in Eisenstein & Hu (1998). The term ΣNL\Sigma_{{\rm NL}} helps to damp the BAO features in the linear power spectrum Plin(k)P_{\rm lin}(k). This parameter also helps to account for the broadening and shift of the BAO feature due to non linear structure evolution. We use ΣNL=0h1\Sigma_{{\rm NL}}=0\ h^{-1}Mpc in our work, i.e. Pm(k)=Plin(k)P_{{\rm m}}(k)=P_{\rm lin}(k). Choice of this value of ΣNL\Sigma_{{\rm NL}} is justified given the high redshifts that we consider (zOuterRim=9.4z_{\rm OuterRim}=9.4) and the correspondingly low value of the growth factor, g=0.123g=0.123. More details of this are given in Section 3.5.

We use CAMB (Lewis & Bridle, 2002) to compute the power spectrum Pm(k)P_{{\rm m}}(k) using the same cosmological model that is adopted in the Outer Rim simulation.

The model correlation function ξm\xi_{\rm m} is the Fourier transform of the the template power spectrum given in equation 9.

Xu et al. (2013) showed that variations in the bias like term BB do not affect the position of the BAO peak in monopoles. At the same time, the parameter α\alpha, which gives us a quantification of the acoustic scale, affects the position of the BAO peak. As such, α\alpha is the parameter that we are most interested in obtaining from our fits. The models that we use are non-linear in α\alpha, so we can nest a linear least-squares fitter inside a non-linear fitting routine. The best fit value of α\alpha is obtained by minimizing the χ2\chi^{2} goodness-of-fit indicator (non-linear fitter). To obtain an optimum value of α\alpha, we compute χ2\chi^{2} and shift models in the range 0.8<α<1.20.8<\alpha<1.2 with intervals of α=0.01\alpha=0.01. The choice of range within which values of α\alpha are examined, viz. [0.8,1.2]\left[0.8,1.2\right], is inspired by Vargas-Magaña et al. (2014) and Ansarinejad & Shanks (2018).

The best fit value of α\alpha is obtained by minimizing the χ2\chi^{2} goodness-of-fit indicator given in equation 10.

χ2(α)=[ξmockξfit(α)]T𝚺1[ξmockξfit(α)]\displaystyle\chi^{2}(\alpha)=[\xi^{\rm mock}-\xi^{\rm fit}(\alpha)]^{\rm T}\ \mathbf{\Sigma}^{-1}\ [\xi^{\rm mock}-\xi^{\rm fit}(\alpha)] (10)

In equation 10, ξfit(α)\xi^{\rm fit}(\alpha) represents a model vector at a given value of α\alpha, ξmock\xi^{\rm mock} denotes the correlation function measured from mock cuboids corresponding to a given line detection sensitivity, and 𝚺\mathbf{\Sigma} represents a diagonal covariance matrix which is obtained from all mock cuboids in the considered line detection sensitivity. In an ideal case, one would have prefered to use a variance-covariance matrix from a large number (preferably \sim100 or higher) of mock catalogues. But, in our work, we are dealing with high redshifts (z>7.6z>7.6) and we only have four mock cuboids for each of the three line detection sensitivities. We use these cuboids to construct separate covariance matrices for each line detection sensitivity. Hereafter, we shall use the symbol α\alpha^{\ast} to denote the best fit values of the scale dilation parameters α\alpha. To obtain robust results for best fit values of scale dilation parameters and best fit model vectors (ξfit(α)\xi^{\rm fit}(\alpha^{\ast})), we construct diagonal covariance matrices from mocks in two steps. In the first step, we compute a diagonal covariance matrix (say 𝚺0\mathbf{\Sigma}_{0}) from cuboids which correspond to the same line detection sensitivity. (Separate 𝚺0\mathbf{\Sigma}_{0} are constructed for catalogues which correspond to without patchy reionization and with patchy reionization.) In the second step, we construct a new diagonal covariance matrix (say 𝚺\mathbf{\Sigma}) by using a moving average (with a window size of three) over the diagonal elements of 𝚺0\mathbf{\Sigma}_{0}. That is to say,

𝚺{i,i}=[𝚺0{(i1),(i1)}+𝚺0{i,i}+𝚺0{(i+1),(i+1)}]3\displaystyle\mathbf{\Sigma}^{\{i,i\}}=\frac{\left[\mathbf{\Sigma}_{0}^{\{(i-1),(i-1)\}}+\mathbf{\Sigma}_{0}^{\{i,i\}}+\mathbf{\Sigma}_{0}^{\{(i+1),(i+1)\}}\right]}{3} (11)

To compute 𝚺{0,0}\mathbf{\Sigma}^{\{0,0\}}, we use equation 12

𝚺{0,0}=[𝚺0{0,0}+𝚺0{1,1}]2\displaystyle\mathbf{\Sigma}^{\{0,0\}}=\frac{\left[\mathbf{\Sigma}_{0}^{\{0,0\}}+\mathbf{\Sigma}_{0}^{\{1,1\}}\right]}{2} (12)

Similarly, to find the last diagonal element of 𝚺\mathbf{\Sigma}, i.e. 𝚺{N,N}\mathbf{\Sigma}^{\{N,N\}} we use equation 13. Here NN denotes the index of the largest distance scale (138h1Mpc138\ h^{-1}{\rm Mpc}) in our fitting range.

𝚺{N,N}=[𝚺0{(N1),(N1)}+𝚺0{N,N}]2\displaystyle\mathbf{\Sigma}^{\{N,N\}}=\frac{\left[\mathbf{\Sigma}_{0}^{\{(N-1),(N-1)\}}+\mathbf{\Sigma}_{0}^{\{N,N\}}\right]}{2} (13)

The approach highlighted in equations 1112 and 13 help in getting better agreement in positions of BAO peaks in mock and best fit model correlation functions.

Detailed description of the BAO fitting methodologies can also be found in Anderson et al. (2012); Xu et al. (2012, 2013); Vargas-Magaña et al. (2014).

3.5 Semi analytic model for patchy reionization

We follow the semi-analytic model outlined in Battaglia et al. (2013) to construct reionization fields from our density fields. The density fluctuation field is supposed to be at the midpoint of reionization, which is z=9.5z=9.5 for BlueTides. So, we use data from Outer Rim simulation which is closest to z=9.5z=9.5 (i.e. zOuterRim=9.4z_{\rm OuterRim}=9.4). The fluctuation field for halos from the Outer Rim catalogue at zOuterRim=9.4z_{\rm OuterRim}=9.4 is defined as:

δhalo(𝐱)δhalo(𝐱)δ¯haloδ¯halo\displaystyle\delta_{\rm halo}(\mathbf{x})\equiv\frac{\delta_{\rm halo}(\mathbf{x})-\bar{\delta}_{\rm halo}}{\bar{\delta}_{\rm halo}} (14)

Here, δ¯halo\bar{\delta}_{\rm halo} is the mean density of halos. The required linear bias, bb is obtained by using equation 15.

b=σ8gξ0z=9.4;OuterRimξ0z=0.0;CDM\displaystyle b=\frac{\sigma_{8}}{g}\cdot\sqrt{\frac{\xi_{0}^{z=9.4\ ;\ {\rm OuterRim}}}{\xi_{0}^{z=0.0\ ;\ {\rm CDM}}}} (15)

In equation 15, gg is the growth factor between z=9.4z=9.4 and z=0.0z=0.0, i.e. g=0.123g=0.123. The symbol ξ0z=9.4;OuterRim\xi_{0}^{z=9.4\ ;\ {\rm OuterRim}} denotes the correlation function monopole computed from Outer Rim halos at z=9.4z=9.4. The symbol ξ0z=0.0;CDM\xi_{0}^{z=0.0\ ;\ {\rm CDM}} represents the monopole from cold dark matter (CDM) at z=0.0z=0.0. We use a value of σ8=0.8\sigma_{8}=0.8. After the aforementioned steps, one can obtain the matter overdensity (δm\delta_{\rm m}) from equation 16

δm(𝐱)=δhalo(𝐱)/b\displaystyle\delta_{\rm m}(\mathbf{x})=\delta_{\rm halo}(\mathbf{x})/b (16)

Before we outline the steps for the computation of the reionization redshift field, zRE(𝐱)z_{\rm RE}(\mathbf{x}), it is important to define the fluctuation field δz(𝐱)\delta_{z}(\mathbf{x}).

δz(𝐱)[1+zRE(𝐱)][1+z¯]1+z¯\displaystyle\delta_{\rm z}(\mathbf{x})\equiv\frac{\left[1+z_{\rm RE}(\mathbf{x})\right]-\left[1+\bar{z}\right]}{1+\bar{z}} (17)

In equation 17, z¯\bar{z} is the mean value of the zREz_{\rm RE} field. At the same time, z¯\bar{z} sets the midpoint of reionization which is z=9.4z=9.4 for Outer Rim data.

From here, we can get the reionization redshift field, zRE(𝐱)z_{\rm RE}(\mathbf{x}), by following the below mentioned steps.

  1. 1.

    We Fourier transform the density fluctuation field δm\delta_{\rm m} from Outer Rim simulation at the midpoint of reionization (i.e. z=z¯=9.4z=\bar{z}=9.4) and obtain δ~m(k)\tilde{\delta}_{\rm m}(k).

  2. 2.

    We multiply the field δ~m(k)\tilde{\delta}_{\rm m}(k) with the bias factor given in equation 18 to obtain δ~z(k)\tilde{\delta}_{\rm z}(k).

    bmz(k)=b0(1+k/k0)α\displaystyle b_{\rm mz}(k)=\frac{b_{0}}{\left(1+k/k_{0}\right)^{\alpha}} (18)

    The above mentioned bias factor uses three parameters, viz. b0b_{0}, which is the bias amplitude on the largest scales, k0k_{0}, which is the scale threshold and α\alpha, which is the asymptotic exponent. For the three parameters b0b_{0}, k0k_{0}, α\alpha, we use the best-fit values which were used in Battaglia et al. (2013). More specifically, we use values of b0=0.593b_{0}=0.593 , k0=0.185Mpc1hk_{0}=0.185\ {\rm Mpc}^{-1}h and α=0.564\alpha=0.564 in our work. Since we don’t focus much on small distance scales in our work, we ignore the top hat filters that are used in Battaglia et al. (2013).

  3. 3.

    We inverse Fourier transform δ~z(k)\tilde{\delta}_{\rm z}(k) to real space to get the δz\delta_{\rm z} field.

  4. 4.

    Now, we can convert the δz\delta_{\rm z} field to zREz_{\rm RE} using equation 17.

Once we have the zRE(𝐱)z_{\rm RE}(\mathbf{x}) field, we can use that to modulate the galaxy distribution. Basically, we can use the same grid for all snapshots, and if a galaxy is in a cell that has reionized by that snapshot redshift, we treat it as being visible. If a galaxy is in a cell which hasn’t reionized (these will be the lower density cells, based on how the patchy reionization model works), then we say that it won’t be visible, and we remove it from the sample. As with the previous mock catalogues, we include the effect of redshift-space distortions in the galaxy positions: we use real space position to locate the galaxy in the reionization field, and then position the visible galaxies in redshift space.

We follow the following prescription to distinguish between lower and higher density cells. We use a Gaussian filter with σ=10h1\sigma=10\ h^{-1}Mpc to smooth the zRE(𝐱)z_{\rm RE}(\mathbf{x}) field. In the smoothed zRE(𝐱)z_{\rm RE}(\mathbf{x}) field, any cell whose value is less than the mean value of the smoothed zRE(𝐱)z_{\rm RE}(\mathbf{x}) field is considered as a low density cell. Similarly, any cell in the the smoothed zRE(𝐱)z_{\rm RE}(\mathbf{x}) field whose value is bigger than the mean is considered as a higher density cell. Fig. 9 shows a slice (rz=1500h1{\rm r}_{\rm z}=1500h^{-1} Mpc) of the three dimensional grid of cells with ionized and non-ionized regions which is obtained from the aforesaid method. We compare all snapshots (redshifts) with this grid of high and low density cells and construct a new catalogue of galaxies which correspond to scenarios after patchy reionization. In Fig. 10, we see slices (at rz=1500h1{\rm r}_{\rm z}=1500h^{-1} Mpc) from Outer Rim data at z=7.4z=7.4 with and without patchy reionization.

We measure correlation function monopoles in these galaxy catalogues which correspond the cases with and without patchy reionization. In the non-patchy case, we assume that the intervening intergalactic medium does not affect the visibility of the Lyman-α\alpha line from galaxies (i.e. the universe is optically thin at the time of Lyman-α\alpha emission).Results of monopoles and the ensuing BAO fitting are presented in Section 4.

4 Results

We show the fitting results of correlation functions from L7, L3.5 and L1 quarter mocks for scenarios with and without patchy reionization. The L7, L3.5 and L1 represent different observational sensitivities (with L7 being the fiducial RST HLS choice.) For each quarter mock (and for each scenario), we have results for the variation of χ2\chi^{2} with scale dilation parameter α\alpha for the four cuboids. At the same time, we show monopoles computed from the Landy Szalay estimator and best fit monopoles for each cuboid in each quarter mock (for each of the two aforementioned scenarios). The best fit monopoles correspond to the best fit value of α\alpha (i.e. α\alpha^{\ast}). The error on the angular diameter distance to the redshift z=7.7z=7.7 Universe is given by the error on the mean of the α\alpha^{\ast} values from the four quarter-mocks.

4.1 L7 mocks

The top four panels in Fig. 11 show plots of variation of χ2\chi^{2} with α\alpha for the four mock cuboids in L7 quarter-mock catalogue which correspond to distribution of halos without patchy reionization. The plots in the bottom four panels of Fig 11 show monopoles computed from Landy Szalay estimators and best fit monopoles (ξ(α)\xi(\alpha^{\ast})) for the aforementioned mock cuboids. For the cuboids which correspond to distribution of halos without patchy reionization, the minimum values of χ2\chi^{2} are 27.41, 31.32, 28.17, 45.86. For these cuboids, the best fit vaues of α\alpha are 1.08, 1.16, 0.86, 1.08 respectively. Consequently, for the cuboids from L7 quarter-mocks the mean value of minimum χ2\chi^{2} is 33.19, and the mean value of α\alpha^{\ast} is 1.04. We estimate the fractional error on the distance to z=7.8z=7.8 (the mean galaxy redshift) by computing the sample standard deviation of the α\alpha^{\ast} results for the four quarter-mocks, and then dividing by 4\sqrt{4} to give the error on the mean. The fractional error computed this way is 0.064, i.e., a distance error accurate to 6.4%6.4\%. We can see from the bottom panel of Fig. 11 however that one of the BAO peaks is not visibly detected.

One thing that is worth noting as this point is the fact that for a given mock catalogue (L7 or L3.5 or L1), plots at corresponding positions in the figures which illustrate results with and without patchy reionization are results gotten from the same cuboid. That is, if the leftmost and topmost subplot of Fig. 11 corresponds to Cuboid 1, then leftmost and topmost subplot of Fig. 12 also corresponds to Cuboid 1. The same holds of all mock catalogues (L7 or L3.5 or L1).

The left four panels in Fig. 12 illustrate the variations of χ2\chi^{2} with α\alpha for the four mock cuboids in L7 quarter-mock catalogue which correspond to distribution of halos after applying patchy reionization. In the right four panels in Fig. 12, we show monopoles computed from Landy Szalay estimators and best fit monopoles (ξ(α)\xi(\alpha^{\ast})) for these four mock cuboids. The cuboids from L7 quarter-mock catalogue which correspond to distribution of halos with patchy reionization have the following minimum χ2\chi^{2} values: 24.41, 26.68, 34.96, 26.59. These cuboids have best fit vaues of α\alpha as 1.04, 0.81, 1.05, 1.19 respectively. As such, the mean value of minimum χ2\chi^{2} for these cuboids is 28.16, and the mean value of α\alpha^{\ast} is 1.02. Again, as for the previous figure, one of the BAO peaks (in the top right panel) is not detected (although there is a chi-squared minimum and an estimate of α\alpha^{\ast} ). From the scatter of α\alpha^{\ast} values between quarter-mocks, we find the fractional error on the mean distance to be 0.079.

Refer to caption
Refer to caption
Figure 11: BAO fitting without patchy reionization: In the top two panels we show plots of the variations of χ2\chi^{2} with α\alpha for the four mock cuboids corresponding to L=7×1017erg/s/cm2L=7\times 10^{-17}\ {\rm erg/s/cm}^{2} (without patchy reionization). In the plots in the top four panels, the red dashed vertical lines denote the positions of the best fit α\alpha, viz. α\alpha^{\ast} for different mocks. In the figures in the bottom four panels, we show the monopoles computed from Landy Szalay estimator (ξ0mock\xi_{0}^{\rm mock}) and the best fit monopoles (ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast})) obtained after fitting of ξ0mock\xi_{0}^{\rm mock} for the four mock cuboids corresponding to L=7×1017erg/s/cm2L=7\times 10^{-17}\ {\rm erg/s/cm}^{2} (without patchy reionization). In the plots in the bottom panels, the blue dots represent ξ0mock\xi_{0}^{\rm mock} while the orange dots denote ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast}).
Refer to caption
Refer to caption
Figure 12: BAO fitting with patchy reionization: In the left four panels of this figure we show the variations of χ2\chi^{2} with α\alpha for the four mock cuboids corresponding to L=7×1017erg/s/cm2L=7\times 10^{-17}\ {\rm erg/s/cm}^{2} (after reionization). In the plots in the left four panels, the red dashed vertical lines denote the positions of the best fit α\alpha, viz. α\alpha^{\ast} for different mocks. In the right four panels , we show the monopoles computed from Landy Szalay estimator (ξ0mock\xi_{0}^{\rm mock}) and the best fit monopoles (ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast})) obtained after fitting of ξ0mock\xi_{0}^{\rm mock} for the four mock cuboids corresponding to L=7×1017erg/s/cm2L=7\times 10^{-17}\ {\rm erg/s/cm}^{2} (with patchy reionization). In all the plots in the right panels, the blue dots represent ξ0mock\xi_{0}^{\rm mock} while the orange dots denote ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast}).

4.2 L3.5 mocks

The left four panels of Fig. 13 show plots of variation of χ2\chi^{2} with α\alpha for the four mock cuboids in L3.5 quarter-mock catalogue which correspond to distribution of halos without patchy reionization. In the right four panels of Fig 13, we show monopoles computed from Landy Szalay estimators and best fit monopoles (ξ(α)\xi(\alpha^{\ast})) for the aforementioned mock cuboids. For the cuboids from L3.5 quarter-mock which correspond to the distribution of halos without patchy reionization, the values of minimum χ2\chi^{2} are 21.59, 20.03, 16.20, 26.42, and the values of α\alpha^{\ast} are 0.99, 1.08, 1.00, 1.12 respectively. Hence, the mean values of minimum χ2\chi^{2} and α\alpha^{\ast} are 21.06 and 1.05 respectively. The error on the BAO distance measurement is 0.032.

The plots in the left four panels of Fig. 14 illustrate the variations of χ2\chi^{2} with α\alpha for the four mock cuboids in L3.5 quarter-mock catalogue which correspond to distribution of halos after applying patchy reionization. In the bottom two rows of Fig. 14, we show monopoles computed from Landy Szalay estimators and best fit monopoles (ξ(α)\xi(\alpha^{\ast})) for these four mock cuboids. These cuboids have minimum values of χ2\chi^{2} of 38.80, 22.52, 43.14, 32.26 and best fit values of α\alpha as 0.95, 0.99, 1.02, 0.98. As such, the mean values of minimum χ2\chi^{2} and α\alpha^{\ast} are 34.18 and 0.98 respectively. The error on the BAO distance measurement is 0.014.

Refer to caption
Refer to caption
Figure 13: In the left four panels of this figure we show the variations of χ2\chi^{2} with α\alpha for the four mock cuboids corresponding to L=3.5×1017erg/s/cm2L=3.5\times 10^{-17}\ {\rm erg/s/cm}^{2} (without patchy reionization). In the plots in the left four panels, the red dashed vertical lines denote the positions of the best fit α\alpha, viz. α\alpha^{\ast} for different mocks. In the plots in the right four panels, we show the monopoles computed from Landy Szalay estimator (ξ0mock\xi_{0}^{\rm mock}) and the best fit monopoles (ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast})) obtained after fitting of ξ0mock\xi_{0}^{\rm mock} for the four mock cuboids corresponding to L=3.5×1017erg/s/cm2L=3.5\times 10^{-17}\ {\rm erg/s/cm}^{2} (without patchy reionization). In each plot in the right panels, the blue dots represent ξ0mock\xi_{0}^{\rm mock} while the orange dots denote ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast}).
Refer to caption
Refer to caption
Figure 14: In the left four panels of this figure we show the variations of χ2\chi^{2} with α\alpha for the four mock cuboids corresponding to L=3.5×1017erg/s/cm2L=3.5\times 10^{-17}\ {\rm erg/s/cm}^{2} (with patchy reionization). In the plots in the left four panels, the red dashed vertical lines denote the positions of the best fit α\alpha, viz. α\alpha^{\ast} for different mocks. In the right four panels, we show the monopoles computed from Landy Szalay estimator (ξ0mock\xi_{0}^{\rm mock}) and the best fit monopoles (ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast})) obtained after fitting of ξ0mock\xi_{0}^{\rm mock} for the four mock cuboids corresponding to L=3.5×1017erg/s/cm2L=3.5\times 10^{-17}\ {\rm erg/s/cm}^{2} (with patchy reionization). In the plots in the right four panels, the blue dots represent ξ0mock\xi_{0}^{\rm mock} while the orange dots denote ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast}).

4.3 L1 mocks

The left four panels of Fig. 15 show plots of variations of χ2\chi^{2} with α\alpha for the four mock cuboids in L1 quarter-mock catalogue which correspond to distribution of galaxies visible without patchy reionization. In the right four panels of Fig. 15, we show monopoles computed from Landy Szalay estimators and best fit monopoles (ξ(α)\xi(\alpha^{\ast})) for the aforementioned mock cuboids. These cuboids have the following minimum values of χ2\chi^{2}: 19.49, 18.58, 11.79, 13.34. They have the following values of α\alpha^{\ast}: 1.01, 1.03, 0.99, 0.95. Consequently the mean values of minimum χ2\chi^{2} and α\alpha^{\ast} for these mock cuboids are 15.80 and 1.00 respectively. The fractional distance error is 0.015.

The left four panels of Fig. 16 illustrate the variations of χ2\chi^{2} with α\alpha for the four mock cuboids in L1 quarter-mock catalogue which correspond to distribution of galaxies visible with patchy reionization. In the right four panels of Fig 16, we show monopoles computed from Landy Szalay estimators and best fit monopoles (ξ(α)\xi(\alpha^{\ast})) for these four mock cuboids. The cuboids L1 quarter-mock catalogue which correspond to distribution of galaxies after patchy reionization have the following values of minimum χ2\chi^{2}: 12.33, 14.26, 15.67, 14.26. The values of α\alpha^{\ast} for these cuboids are as follows: 0.96, 1.06, 1.01, 0.96. Hence, the mean values of minimum χ2\chi^{2} and α\alpha^{\ast} for these mock cuboids are 14.13 and 1.00. The fractional distance error is 0.024.

Refer to caption
Refer to caption
Figure 15: In the left four panels of this figure we show the variations of χ2\chi^{2} with α\alpha for the four mock cuboids corresponding to L=1×1017erg/s/cm2L=1\times 10^{-17}\ {\rm erg/s/cm}^{2} (without patchy reionization). In the plots in the left four panels, the red dashed vertical lines denote the positions of the best fit α\alpha, viz. α\alpha^{\ast} for different mocks. In the right panels, we show the monopoles computed from Landy Szalay estimator (ξ0mock\xi_{0}^{\rm mock}) and the best fit monopoles (ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast})) obtained after fitting of ξ0mock\xi_{0}^{\rm mock} for the four mock cuboids corresponding to L=1×1017erg/s/cm2L=1\times 10^{-17}\ {\rm erg/s/cm}^{2} (without patchy reionization). In the plots in the right panels, the blue dots represent ξ0mock\xi_{0}^{\rm mock} while the orange dots denote ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast}).
Refer to caption
Refer to caption
Figure 16: In the left four panels of this figure we show the variations of χ2\chi^{2} with α\alpha for the four mock cuboids corresponding to L=1×1017erg/s/cm2L=1\times 10^{-17}\ {\rm erg/s/cm}^{2} (with patchy reionization). In the plots on the left, the red dashed vertical lines denote the positions of the best fit α\alpha, viz. α\alpha^{\ast} for different mocks. In the right four panels, we show the monopoles computed from Landy Szalay estimator (ξ0mock\xi_{0}^{\rm mock}) and the best fit monopoles (ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast})) obtained after fitting of ξ0mock\xi_{0}^{\rm mock} for the four mock cuboids corresponding to L=1×1017erg/s/cm2L=1\times 10^{-17}\ {\rm erg/s/cm}^{2} (with patchy reionization). In the plots in the right panels, the blue dots represent ξ0mock\xi_{0}^{\rm mock} while the orange dots denote ξ0fit(α)\xi_{0}^{\rm fit}(\alpha^{\ast}).

Table 3, gives summaries of our best fit results for all mocks in all three quarter-mock catalogues.

Table 3: This table shows best fit results of scale dilation parameter (i.e. α\alpha^{\ast}) and χ2(α)\chi^{2}(\alpha^{\ast}) in the three quarter-mock catalogues (L7, L3.5, L1).
Line detection α\alpha^{\ast} α\alpha^{\ast} α\alpha^{\ast} α\alpha^{\ast}
sensitivity (Mock 1) (Mock 2) (Mock 3) (Mock 4)
Before patchy reionization
77 erg/s/cm2 1.08 1.16 0.86 1.08
3.53.5 erg/s/cm2 0.99 1.08 1.00 1.12
11 erg/s/cm2 1.01 1.03 0.99 0.95
With patchy reionization
77 erg/s/cm2 1.04 0.81 1.05 1.19
3.53.5 erg/s/cm2 0.95 0.99 1.02 0.98
11 erg/s/cm2 0.96 1.06 1.01 0.96
Line detection χ2(α)\chi^{2}(\alpha^{\ast}) χ2(α)\chi^{2}(\alpha^{\ast}) χ2(α)\chi^{2}(\alpha^{\ast}) χ2(α)\chi^{2}(\alpha^{\ast})
sensitivity (Mock 1) (Mock 2) (Mock 3) (Mock 4)
Before patchy reionization
77 erg/s/cm2 27.41 31.32 28.17 45.86
3.53.5 erg/s/cm2 21.59 20.03 16.20 26.42
11 erg/s/cm2 19.49 18.58 11.79 13.34
With patchy reionization
77 erg/s/cm2 24.41 26.68 34.96 26.59
3.53.5 erg/s/cm2 38.80 22.52 43.14 32.26
11 erg/s/cm2 12.33 14.26 15.67 14.26
Table 4: The relative clustering linear bias breionb_{\rm reion} between mocks with and without reionization in the different quarter-mock catalogues (L7, L3.5, L1).
Line detection Mock 1 Mock 2 Mock 3 Mock 4
sensitivity breionb_{\rm reion} breionb_{\rm reion} breionb_{\rm reion} breionb_{\rm reion}
77 erg/s/cm2 1.71 1.10 1.30 1.17
3.53.5 erg/s/cm2 1.13 1.38 1.41 1.05
11 erg/s/cm2 3.76 1.52 2.65 1.70

4.4 Analysis of errors on the BAO distance measurements

We now carry out an analysis of the BAO distance errors and their scatter, in order to check that they are statistically consistent. For the L7 quarter mock catalogues, we found the following values of best fit α\alpha (α\alpha^{\ast}) from the four cuboids that represent the distribution of halos without patchy reionization: 1.08, 1.16, 0.86, 1.08. These values of α\alpha^{\ast} lead to an overall fractional error of 0.064. At the same time, we observed the following values of α\alpha^{\ast} in the four cuboids in the L7 quarter mock catalogues with patchy reionization : 1.04, 0.81, 1.05, 1.19. These values of α\alpha^{\ast} yield an error of 0.079 on BAO distance measurement. To reconcile the scatter in the values of α\alpha^{\ast} that we see for cuboids with and without patchy reionization, we carry out Fisher’s exact test (Fisher, 1922; Agresti, 1992). For each category of cuboids (with reionization or without), we are working with only four cuboids. This is a classic case of data with small sample sizes. Fisher’s exact test enables us to conduct accurate statistical significance tests on small samples. In this test, our null hypothesis is that there is no significant deviation in the distributions of the values of α\alpha^{\ast} for cases with and without patchy reionization. A two-tailed Fisher exact hypothesis test for comparison of the distributions of the aforesaid values, gives us a p-value which is approximately 0.99. This p value is very high and it strongly suggests that we cannot ignore the null hypothesis.

Similarly, for L3.5 mocks, we have the following values of α\alpha^{\ast} for halos without patchy reionization: 0.99, 1.08, 1.00, 1.12. For the four cuboids with patchy reionization, we have: 0.95, 0.99, 1.02, 0.98. We again conduct Fisher’s exact test. Our null hypothesis in this test is that there is no significant deviation in distributions of values of α\alpha^{\ast}. From a two-tailed Fisher’s exact test, this time we get a p-value of approximately 1. This strongly suggests that there is significant evidence in support of our null hypothesis.

Finally, for L1 mocks, we get the following values of α\alpha^{\ast} for halos without patchy reionization: 1.01, 1.03, 0.99, 0.95. When cuboids have patchy reionization, we get the following values of α\alpha^{\ast}: 0.96, 1.06, 1.01, 0.96. A two-tailed Fisher’s exact test gives us a p-value of approximately 1, again strong evidence of the fact that there is no significant deviation in distributions of values of α\alpha^{\ast} for cases with and without patchy reionization.

The results that we get from Fisher’s exact tests carried out on the mocks also suggest that we cannot make significant inferences from comparisons of scatter in values of α\alpha^{\ast} in cuboids with patchy reionization and without patchy reionization.

4.5 Linear bias between in cases with and without reionization

The results that we see in Figures 12, 11, 14, 13, 16 and 15 suggest that the clustering amplitude changes somewhat significantly with and without reionization and also depends on the line strength. Equation 19 below defines the linear bias due to reionization compared to no reionization for a given cuboid. It is defined using a ratio of two galaxy correlation functions and is of course different from the usual linear bias of galaxies with respect to mass.

breion=ξ0reionizationξ0noreionization\displaystyle b_{\rm reion}=\sqrt{\frac{\xi_{0}^{{\rm reionization}}}{\xi_{0}^{{\rm no\ reionization}}}} (19)

We tabulate the values of breionb_{\rm reion} for cuboids in the L7, L3.5 and L1 mocks in Table 4. For the L7 mock catalogue, we obtain a mean breionb_{\rm reion} of 1.32. We estimate the fractional errors on breionb_{\rm reion} by computing the sample standard deviation of the breionb_{\rm reion} results for the four quarter-mocks, and then dividing by 4\sqrt{4} to give the error on the mean. The fractional error computed this way is 0.136. In the case of the L3.5 mock catalogue, we get a mean breionb_{\rm reion} of 1.24 and a fractional error of 0.089. Finally, for the four quarter-mocks in L1 mock catalogue, we obtain a mean breionb_{\rm reion} of 2.41 and fractional error of 0.514. It is as we might expect: in all cases the effects of patchy reionization lead to higher amplitude clustering for the observable galaxies. The fainter galaxies (the L1 mocks) are also relatively much more strongly affected than the brighter galaxies.

5 Summary and Discussion

5.1 Summary

Using mock catalogues, we have investigated the prospect of utilizing the BAO feature at redshifts z>7.6z>7.6 as a standard ruler. We have utilized four mock catalogues, each representing approximately one quarter of the volume that will be surveyed by RST with grism spectroscopy.

We see evidence for BAO peaks in three of the four mocks when we use the planned RST HLS Lyman-α\alpha line detection sensitivity of L=7×1017erg/s/cm2L=7\times 10^{-17}\ {\rm erg/s/cm}^{2}. Hence, the BAO feature is likely detectable for this line detection sensitivity (measured in three out of four survey quadrants analysed independently). When we make mocks with better line detection sensitivities, we see evidence of BAO features in all mocks. For the standard line detection sensitivity, L=7×1017erg/s/cm2L=7\times 10^{-17}\ {\rm erg/s/cm}^{2}, we find a root mean square fractional distance error of 0.068 after comparison between mocks and fitted correlation function monopoles. For a factor of two fainter line detection sensitivity, L=3.5×1017erg/s/cm2L=3.5\times 10^{-17}\ {\rm erg/s/cm}^{2}, we find a distance error of 0.013. We find a distance error of 0.021 for mocks with line detection sensitivity L=1×1017erg/s/cm2L=1\times 10^{-17}\ {\rm erg/s/cm}^{2}.

5.2 Discussion

Our analysis of the use of BAO at high redshifts has included some of the relevant features and difficulties, such as non-linear clustering, redshift distortions, intrinsic galaxy line luminosities, and the effect of reionization bubbles. The results are promising, as we have summarised in Section 5.1 above. We have however left much for future work, including the visibility of the Lyman-α\alpha line at these redshifts (including the effects of dust and the Lyman-α\alpha damping wing), and the effects of interlopers and low redshift foregrounds. These factors could easily derail the possibility of measuring large-scale galaxy clustering at redshift z>7.6z>7.6, and should be investigated in detail. On the other hand, there are examples of galaxies being detected in Lyman-α\alpha emission at these redshifts, for example the z=7.51z=7.51 object by Tilvi et al. (2016), and z=7.452z=7.452 from Larson et al. (2018). Even if the Lyman-α\alpha emission line itself cannot be seen, the Lyman-α\alpha break can also be used to spectroscopically identify galaxies, as was done using the HST grism at z=11.1z=11.1 by Oesch et al. (2016). This could also be investigated to see whether it could enable clustering measurements at the BAO scale.

We have seen that the detectability of the BAO peak is just at the level of being possible with the fiducial flux limit of the RST HLS. To do this, we have already reduced the nominal 7σ\sigma detection limit to 5σ\sigma (7×1017erg/s/cm27\times 10^{-17}\ {\rm erg/s/cm}^{2}). The exact survey design choices made when the RST mission is finalized will therefore be extremely relevant to whether the BAO measurement will be feasible. We have found that with a sensitivity a factor of two better the BAO can be seen clearly in four survey quadrants. This means that it could be advantageous to make use of smaller and deeper areas.

The analysis presented in this work was based on only 4 quarter-mocks for each of the tested line detection sensitivities. In the future, large number of mock catalogues should be generated to allow for the build up of a full covariance matrix for the correlation function. Because of the small number of mocks here, we were only able to use the diagonal elements of the covariance matrix in model fitting. As a result, the distance error estimates, which were derived from the scatter between mocks, were necessarily conservative. In a future work with large numbers of mocks, the extra information available from the off diagonal covariance matrix elements will mean that the fits to the BAO peaks will be improved and the distance errors will be somewhat reduced. The uncertainty in the distance error will also be better estimated. In the present case, the error on the error on the mean from only four quarter mocks will be large, and this is why the distance scale error on the L1 mocks increases to 2.4% from 1.4% for the L3.5 mocks, in spite of a factor of 3 increase in sensitivity.

In our treatment of reionization, we have assumed that galaxies which lie within ionized bubbles will be visible in Lyman-α\alpha (consistent with the interpretation of Bagley et al. (2017)). We have compared the effects of including this patchy reionization, or not including it (all galaxies visible), and have found that the effect on BAO peak detection is quite small. For example, for the fiducial L7 sensitivity case, the effect of patchy reionization only increases the distance error from 6.4% to 7.9 %. This is likely to be because in the patchy model for reionization employed, overdense regions reionize first, and this is where most of the galaxy clustering signal lies. The reionization bubbles are also on smaller scales, than the BAO signal, and have a range of sizes, and therefore there is no masking of the BAO peak by bubbles. We note however that we have carried out no Lyman-α\alpha radiative transfer on the scales of individual galaxies, and that the visibility or not of the line in different environments is likely to be much more complex than our simple bubble model.

One significant difference between the BAO measurements at redshifts z>7z>7 and those for z<1z<1 is the lack of non-linearities in gravitational clustering. For example we can see that in the right panels of Fig. 15 there is no obvious smoothing of the BAO peak compared to the linear CDM fit. This is an advantage of working at higher redshifts, as no reconstruction techniques (Sherwin & White, 2019) are needed to recover the maximum benefit. The high redshift galaxies do have a high bias (Bhowmick et al., 2019) and so their clustering is detectable where clustering of the density field would not be.

Measuring BAO in a new regime would at the very least be useful as a consistency check on measurements that have been made at low redshifts and in the CMB. It is important to fill in the large gap between the Lyman-α\alpha forest measurements at z2.2z\sim 2.2 (Aubourg et al., 2015) and the features seen at z1100z\sim 1100 by Planck and WMAP (Planck Collaboration et al., 2018b; Bennett et al., 2013). Such measurements might shed some light on the tension between BAO and other Hubble parameter measurements (Cuceu et al., 2019), and whether early dark energy may be acting before the recombination epoch (Verde et al., 2019; Poulin et al., 2019).

One of the targets of cosmological analysis with the RST satellite is large scale structure measurements from galaxies at redshifts z13z\sim 1-3. This will extend the successful redshift space distortion, BAO, and other techniques to significantly earlier epochs than current large scale structure measurements from ground based surveys. Although much attention has rightly been focussed on this redshift range, we have seen in this paper that it may be possible to make an even greater leap, and use the large scale structure traced out by the very first massive galaxies as a cosmological probe. Although many potential pitfalls remain to be investigated, it is worth bearing this in mind when considering the RST survey strategies and parameters.

Acknowledgements

The BlueTides simulation was run on the BlueWaters facility at the National Center for Supercomputing Applications. TDM acknowledges funding from NSF ACI-1614853, NSF AST-1517593, NSF AST-1616168 and NASA ATP 19-ATP19-0084. TDM and RACC also acknowledge funding from NASA ATP 80NSSC18K101, and NASA ATP NNX17AK56G. RACC was supported by a Lyle Fellowship from the University of Melbourne, and TDM by a Shimmins Fellowship and a Lyle Fellowship from the University of Melbourne. G.R. acknowledges support from the National Research Foundation of Korea (NRF) through Grant No. 2017077508 funded by the Korean Ministry of Education, Science and Technology (MoEST), and from the faculty research fund of Sejong University in 2018. The work by KH was supported under the U.S. Department of Energy contract DE-AC02-06CH11357. This research used resources of the Argonne Leadership Computing Facility at the Argonne National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-06CH11357.

Data availability

The data underlying this article will be shared on reasonable request to the corresponding author.

References