This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Detecting ionized bubbles around luminous sources during the reionization era using HI 21-cm signal

Arnab Mishra    Chandra Shekhar Murmu    Kanan K. Datta11footnotetext: Corresponding author    Samir Choudhuri    Suman Majumdar    Iffat Nasreen    Sk. Saiyad Ali
Abstract

Measuring the properties of the intergalactic medium (IGM) and sources during the Epoch of Reionization (EoR) is of immense importance. We explore the prospects of probing the IGM and sources through redshifted 21-cm observations of individual ionized bubbles surrounding known luminous sources during the EoR. Accordingly, we simulate HI 21-cm maps, foreground contaminants, and system noise which are specific to the uGMRT and SKA1-Low observations. Following the subtraction of the foreground from the total visibility, we employ a visibility-based matched filter technique to optimally combine the desired HI 21-cm signal while minimizing the system noise. Our analysis suggests that these ionized bubbles can be detected with more than 5σ5\sigma significance using approximately 2000\sim 2000 and 3000\sim 3000 hours of observation time with the uGMRT at redshift 7.17.1 and 8.38.3, respectively, when the mean neutral hydrogen fraction outside the targeted bubble is 0.9{\sim 0.9}. The SKA1-Low should be able to detect these bubbles with more than 8σ8\sigma significance using only 100\sim 100 hrs of observations. The total observing time increases both for the uGMRT and SKA1-Low when the mean neutral hydrogen fraction outside the targeted bubble decreases. Further, we investigate the impact of foreground subtraction on the detectability and find the signal-to-noise ratio decreases when smaller bandwidth is used. More importantly, we show that the matched filtering method can measure ionized bubble radius and constrain HI-neutral fraction reasonably well, providing deeper insights into the source properties and the intergalactic medium.

1 Introduction

It is widely accepted that ionizing UV radiation from the first galaxies and quasars began to ionize the neutral hydrogen (HI) in the intergalactic medium (IGM) during the redshift range from z20z\sim 20 to 66 [1, 2, 3, 4]. During this process, fully ionized regions, commonly referred to as ionized bubbles, began to form around these early galaxies and quasars, marking the onset of the epoch of reionization (EoR). As time progressed, these ionized bubbles grew and eventually merged, resulting in the ionization of the entire universe [5, 6, 7].

In recent times, there have been detections of several bright QSOs during the EoR [8, 9, 10, 11, 12, 13, 14, 15], and a handful of bright galaxies have also been detected at high redshifts as well [16, 17]. These bright, luminous sources will produce large ionized bubbles (HII regions) around them within the IGM. At the initial and intermediate stages of reionization, isolated or nearly isolated ionized bubbles are expected, which will remain buried inside the neutral (or partially ionized) IGM. Observations of redshifted HI 21-cm signal are a promising tool to study the epoch of reionization, including the individual ionized regions [18, 19, 20, 21, 22, 23, 24, 25, 26]. However, there are enormous challenges associated with these detections, as the signal is weaker by several orders of magnitude compared to the foreground contaminants [27, 28, 29, 30]. Moreover, the system noise is much stronger than the desired HI signal.

Previous studies have explored the detection of HII regions in HI 21-cm images [31, 18, 32, 23, 24, 25, 26]. This method enables direct estimation of the HI neutral fraction, the sizes of ionized bubbles, and the properties of the underlying sources. However, direct imaging of HI using redshifted 21-cm signal during the Epoch of Reionization (EoR) is challenging with current instruments like the uGMRT, LOFAR, MWA due to their limited sensitivities. The SKA1-Low is anticipated to produce images of individual ionized bubbles using the HI 21-cm signal, but this requires considerably long observation times.

This work investigates the prospects of individually detecting isolated ionized bubbles within the IGM, around known luminous sources using the matched filter algorithm. This technique works when the system noise of the radio interferometer dominates over the targeted HI signal. But the functional form of the targeted signal is known. The idea of applying a matched filter technique was first proposed in [33], which presents a visibility-based analytical framework to study the feasibility of detecting individual ionized bubbles during the EoR through radio-interferometric observations of redshifted HI 21-cm radiation. Later, it was shown that fluctuating intergalactic medium outside the targeted ionized bubble behaves as noise and hinders the detection of ionized bubbles of small sizes with radius 6\lesssim 6 Mpc [34]. Subsequent work such as [35] used detailed numerical simulations to study the detection prospects and associated challenges. [36] presents scaling relations, which enable us to quickly estimate detection prospects for various ionized bubble sizes, redshifts, and instruments. Further, [37] used a Bayesian framework to explore the possibility of constraining the parameters that characterize the bubbles. This approach of detecting individual ionized bubbles in HI 21-cm maps is complementary to the widely followed approach of detecting the summary statistics of the signal such as the power spectrum, bispectrum, etc. While the statistical methods probe the large scale behavior of the reionizing sources and the intergalactic medium [38, 39, 40, 41, 42, 43, 44], our approach sheds light on individual sources, ionized regions, and surrounding neutral medium. Interpreting the detection is also straightforward in our approach, unlike the statistical methods which might be complicated and model-dependent.

The majority of previous investigations have primarily focused on modeling HI 21-cm signal around ionized bubbles. In this work, we focus on targeted detection i.e., the location of the central luminous source is known from other observations. Along with simulated HI 21-cm maps which have a known source, we also simulate foreground contaminants and system noise. This is something that was not explicitly included in the previous works. Furthermore, we generated mock visibilities at baselines typical of the uGMRT/SKA1-Low observations. After subtracting the foreground from the total mock signal, we employ the matched filter technique on the residual signal. In that sense, the procedure adopted in this work resembles more with real observations. Moreover, we examine the constraints that observations with the uGMRT and SKA1-Low can impose on measuring bubble sizes and ionization fractions.

The outline of the paper is as follows: Section 2 provides a brief overview of the basics of the HI 21-cm signal around ionized bubbles. In section 3, we explain our methods to simulate the HI 21-cm signal around ionized bubbles, foreground contaminants, the generation of visibilities at baselines specific to a particular experiment, and system noise. Section 4 discusses foreground subtraction from the total visibilities and the application of the matched filter technique. Our results are presented in Section 5, while Section 6 presents a summary of the work undertaken in the paper alongside a discussion. Throughout our analysis we use the cosmological parameters h=0.7h=0.7, Ωm=0.27\Omega_{\rm m}=0.27, ΩΛ=0.73\Omega_{\Lambda}=0.73, Ωb=0.044\Omega_{\rm b}=0.044, σ8=0.83{\sigma_{8}=0.83}  and ns=0.96{n_{\rm s}=0.96} consistent with the WMAP measurements [45].

2 The HI 21-cm signal

During the EoR, the ionizing radiation from the first generation of luminous sources, such as galaxies and quasars, leads to the emergence of HII regions (ionized bubbles) around these sources, and the remaining neutral (HI) medium surrounds these ionized regions. The surrounding HI medium is expected to emit a line emission with a rest-frame wavelength of approximately 2121 cm, which arises due to the hyperfine spin-flip transition in the neutral hydrogen atoms. This redshifted HI 21-cm signal emitted from these surrounding HI mediums can be observed as a differential brightness temperature against the Cosmic Microwave Background, and one can express this differential brightness temperature as [46],

δTb(𝒏,z)=TsTγ1+z(1eτ21cm).{\rm\delta}T_{\rm b}({\bm{n}},z)=\frac{T_{\rm s}-T_{\gamma}}{1+z}\left(1-e^{-\tau_{\rm 21cm}}\right). (2.1)

TST_{\rm S}, TγT_{\gamma}, and τ21cm\tau_{\rm 21cm} are the HI spin temperature, CMB temperature, and the optical depth for the HI 21-cm line emission respectively. All these quantities depend on redshift zz and the line of sight direction 𝒏\bm{n}. Under the assumption that the HI 21-cm optical depth is small enough (τ21cm1\tau_{\rm 21cm}\ll 1) during the EoR, the above equation can be written as [47],

δTb(𝒏,z)=4mK(ρHIρ¯H)1+zΩm(Ωbh20.02)(0.7h)(1TγTs)[11+zH(z)vr].{\rm\delta}T_{\rm b}(\bm{n},z)=4{\rm mK}\bigg{(}\frac{\rho_{\rm HI}}{\overline{\rho}_{\rm H}}\bigg{)}\sqrt{\frac{1+z}{\Omega_{\rm m}}}\bigg{(}\frac{\Omega_{\rm b}h^{2}}{0.02}\bigg{)}\bigg{(}\frac{0.7}{h}\bigg{)}\bigg{(}1-\frac{T_{\gamma}}{T_{\rm s}}\bigg{)}\bigg{[}1-\frac{1+z}{H(z)}\frac{\partial v}{\partial r}\bigg{]}. (2.2)

ρHI\rho_{\rm HI} and ρ¯H\overline{\rho}_{\rm H} are the mass density of neutral hydrogen and the average mass density of total hydrogen (neutral and ionized). Ωm\Omega_{\rm m} and Ωb\Omega_{\rm b} are the matter and baryon density parameter, hh is the dimensionless Hubble parameter and H(z)H(z) is the Hubble parameter at redshift zz respectively. The term inside the square brackets arises due to the redshift space distortion, and v/r\partial v/\partial r is the divergence of the peculiar velocity along the line of sight. Since the HI spin temperature of the neutral hydrogen in the IGM is expected to be coupled to the kinetic temperature of the gas during the EoR via the Lyman-α\alpha coupling [48, 49, 50], it is expected to be much higher than the temperature of CMB (TsTγT_{\rm s}\gg T_{\gamma}). Therefore, one can usually neglect the term (1 - Tγ/TsT_{\gamma}/T_{\rm s}) in the HI 21-cm differential brightness temperature expression during the EoR. It is easily seen that the differential brightness temperature of the HI 21-cm signal δTb\delta T_{\rm b} directly probes the neutral hydrogen content (ρHI\rho_{\rm HI}) in the IGM. In the absence of neutral hydrogen in the HII regions, this differential brightness temperature is expected to be zero. Therefore, this allows us to probe the individual ionized bubbles within the neutral IGM.

In this paper, we specifically focus on individual HII regions around high redshift bright quasars and galaxies like the ones reported in [8] and [17], located at redshift z=7.1z=7.1 and z=8.3z=8.3 respectively. We expect large HII regions extending up to several tens of comoving Mpc around those bright luminous sources [8, 17]. Our ultimate objective is to investigate the detectability of such individual ionized regions using radio interferometric observations of HI 21-cm signal. The following section presents mock simulations of similar ionized bubbles around very bright quasars and galaxies during the EoR.

3 Simulating mock data

To investigate the prospects of detecting ionized bubbles individually in HI 21-cm maps, we first simulate mock data that resembles the uGMRT and SKA1-Low observations and at observational frequencies relevant to the EoR. It consists of simulations of the HII region around bright quasars/galaxies in large-scale HI maps, foreground contaminants, and system noise from radio interferometers. Subsequently, these simulated maps are converted into visibilities that mimic the uGMRT/SKA1-Low-like observations.

Our simulations are centered around two different observing frequencies, 175175 MHz and 153153 MHz, corresponding to the redshifts of z7.1z\approx 7.1 and 8.38.3, respectively, for the HI 21-cm line emission. The selection of these two frequencies is motivated by the discoveries of a bright QSO at redshift z=7.1z=7.1 [8] and a large ionized bubble at redshift z=8.3z=8.3 [17]. The angular size of our simulation cube is set to 3.3×3.3\sim 3.3^{\circ}\times 3.3^{\circ} which is closer to the uGMRT/SKA1-Low primary field of view at the observing frequencies considered here. Our simulation box has a total of 512512 grids on each side with a resolution of 1.031.03 Mpc and a total comoving length of 527527 Mpc on each side. This results in an angular resolution of 23′′\sim 23^{\prime\prime} and a total angular size of 3.33.3^{\circ}. Here, we have used up to 256256 frequency channels with a total bandwidth of 1616 MHz, resulting in a channel width of Δν=62.5\Delta\nu=62.5 kHz.

3.1 HI 21-cm signal around ionized bubbles

Here, we briefly describe simulated ionized regions surrounding a very bright quasar at z=7.1z=7.1, and around a collection of galaxies at z=8.3z=8.3 during the EoR. First, we simulated the dark-matter distribution using the NN-body code222https://github.com/rajeshmondal18/N-body developed by [51]. We simulated this within a cubic box of the comoving size of 527\sim 527 Mpc with 204832048^{3} grids and the same number of particles. The minimum halo mass resolved in this simulation is 6.26×109M{\sim 6.26\times 10^{9}M_{\odot}}, corresponding to a minimum of 1010 dark matter particles, each with a mass of 6.26×108M{\sim 6.26\times 10^{8}M_{\odot}}. We note that our simulations do not include the contributions of low-mass halos in the range 𝟏𝟎𝟖M\bm{10^{8}M_{\odot}} to 6.26×𝟏𝟎𝟗M\bm{\sim 6.26\times 10^{9}M_{\odot}}. In reality, these halos are expected to play a significant role in the reionization of the IGM, potentially giving rise to more diffuse, complex, and smaller ionized structures during the EoR. Then we use a Friends-of-Friends 333https://github.com/rajeshmondal18/FoF-Halo-finder (FoF) [52] halo finder to identify the collapsed halos that host the galaxies/bright quasars. These galaxies/quasars serve as the sources of ionizing photons during the EoR. Subsequently, the resolution of simulated cubes is degraded to 5123512^{3} grids. This resolution resembles the corresponding angular resolution of the uGMRT and SKA1-Low and has the added benefit of reduced computational requirements for simulation. We then employ a semi-numerical prescription [53, 54, 55] on these outputs to simulate ionization maps and the HI 21-cm differential brightness temperature maps.

The semi-numerical approach compares the average number of ionizing photons nγR\langle n_{\gamma}\rangle_{R} and neutral hydrogen atoms nHR\langle n_{\rm H}\rangle_{R} within a spherical region of radius RR around a grid cell. If the ionization condition nγRnHR\langle n_{\gamma}\rangle_{R}\geq\langle n_{\rm H}\rangle_{R} is met for any radius RR, then the grid cell is reckoned to be ionized, within that region. Otherwise, an ionized fraction value of xHII=nγgrid/nHgridx_{\rm HII}=\langle n_{\gamma}\rangle_{\rm grid}/\langle n_{\rm H}\rangle_{\rm grid} is assigned to that grid cell. This procedure is repeated for varying radius RR up to a maximum value RmaxR_{\rm max}, over the whole simulation box. It is assumed that the number of ionizing photons from halos, NγN_{\gamma} contributed to the ionization of the IGM, follows the relation NγMhN_{\gamma}\propto M_{\rm h}, where MhM_{\rm h} is the mass of the corresponding halo.

Here, we consider two scenarios. First, we aim to simulate an ionized region around a very bright quasar at a redshift of 7.17.1 similar to the one reported in [8]. In the second case, we simulate an ionized region around a collection of galaxies at redshift 8.38.3 as reported in [17]. In the first scenario, we assume that such a very bright quasar is hosted by the most massive halo (of mass Mh6×1012,M)M_{\rm h}\sim 6\times 10^{12},M_{\odot}) found in our simulation [20]. We further assume that the ionizing photon emission rate by the QSO is 1.3×1057s1\sim 1.3\times 10^{57}\,{\rm s}^{-1} [35, 20]. We note that along with the central quasar, many galaxies reside in and outside the QSO ionized regions. These galaxies are also contributing ionizing photons. The mass-averaged neutral Hydrogen fraction resulting from galaxies outside the QSO HII region is xHI0.88\langle x_{\rm HI}\rangle\sim 0.88 at a redshift of z=7.1z=7.1. In figure 1 (top left panel), the resulting HI 21cm map is shown with the ionized bubble around the quasar in the center of the map. The ionized region around the central bright QSO is considerably bigger (of radius around 23.523.5 comoving Mpc) than all other ionized regions produced by the galaxies.

In the second case, we consider a collection of galaxies near the most massive halo (of mass Mh3.5×1012MM_{\rm h}\sim 3.5\times 10^{12}M_{\odot}) in our simulation at z=8.3z=8.3. Simulation specifications such as the cubic box size, total number of particles used, and number of grids along a direction are the same as above. The ionizing luminosity of these galaxies, including the most massive halo in our simulation, is adjusted to create an HII bubble with a radius of 27.8\sim 27.8 Mpc, and the resulting mass-averaged neutral fraction is around xHI0.94\langle x_{\rm HI}\rangle\sim 0.94. The ionized bubble from this scenario is shown in the top right panel of figure 1. We see that the ionized bubble in this scenario is slightly aspherical in shape, mainly because it is created by a collection of sources located at different locations inside the bubble.

In the above two cases, the targeted HII bubbles are isolated and nearly spherical. This geometry arises because the ionizing radiation from other sources is negligible, and the IGM outside the targeted bubbles remains highly neutral. While these scenarios may be suitable at the onset of reionization, they are likely unrealistic for the later stages of the process. At later stages of reionization, ionized bubbles around very bright sources are surrounded by neighboring ionized regions, leading to an aspherical shape for the targeted bubble. The third scenario accounts for this complexity. The bottom panel of Fig. 1 displays a 2D map of the differential brightness temperature (δTb{\delta T_{b}}) at redshift z=7.1z=7.1, where the central targeted HII bubble appears highly aspherical. The quasar at the center is the same as the first case. Surrounding it there are numerous other HII regions, unlike the first case. This results in a much lower mean neutral fraction, which is 0.520.52 in this scenario.

We want to mention the considerable uncertainty regarding the ionizing photon emissivity rate from QSO, and therefore, the photon emissivity rate and the resulting QSO HII region may vary. The consequences of this uncertainty will be discussed later in the paper.

Refer to caption
Refer to caption
Refer to caption
Figure 1: The top left and bottom panels show differential brightness temperature maps of the HI 21-cm signal around an ionized bubble of comoving size 23.523.5 Mpc at z=7.1z=7.1 corresponding to observing frequency of 175175 MHz. The top right panel shows a similar map around an ionized bubble of comoving size 27.827.8 Mpc at z=8.3z=8.3 corresponding to observing frequency of 153153 MHz. The mass-averaged neutral hydrogen fraction resulting from the galaxies outside the central HII region is 0.880.88, 0.940.94, and, 0.520.52 for the top-left, top-right, and bottom panels respectively. The angular size of simulated maps is 3.3×3.3\sim 3.3^{\circ}\times 3.3^{\circ} with 23′′\sim 23^{\prime\prime} resolution.

3.2 Foreground simulations

The astrophysical foregrounds that contaminate the EoR HI 21-cm signal observations comprise various components [56], including diffuse galactic synchrotron emission, radio synchrotron emission from point-like sources, free-free radio emission from diffuse ionized gas, etc.

The diffuse galactic synchrotron emission (DSGE) is one of the most dominant foreground components. We model the angular power spectrum corresponding to the diffuse galactic synchrotron emission using a power-law formulation [57] as,

ClM(ν)=A150×(1000l)β(ν0ν)2α.{C_{l}}^{M}(\nu)=A_{150}\times{\left(\frac{1000}{l}\right)}^{\beta}{\left(\frac{\nu_{0}}{\nu}\right)^{2\alpha}}. (3.1)

Here, ν0=150\nu_{0}=150 MHz and ν=ν0+Δν\nu=\nu_{0}+\Delta\nu. The power-law index, β\beta, is set to 2.342.34, and the amplitude A150=513mK2A_{150}=513\,\text{mK}^{2} is based on previous work [58]. The parameter α\alpha describes how the power spectrum changes with frequency, which is set to 2.82.8 [29, 30]. This simplified model allows us to simulate DGSE maps and to study their impact on the detection of individual ionized bubbles in the HI 21-cm maps. We use the following equation to generate Fourier components of the DGSE on each gridded baseline UU, which is given by

ΔT~(U)=ΩClM2[x(U)+iy(U)].\Delta\tilde{T}(U)=\sqrt{\frac{\Omega{C_{l}}^{M}}{2}}[x(U)+iy(U)]. (3.2)

Here, Ω\Omega denotes the total solid angle corresponding to the spatial size of simulated maps. x(U)x(U) and y(U)y(U) are independent Gaussian random variables with zero mean and unit variance. Upon simulating 2D DGSE maps in Fourier space, we apply the inverse Fourier transform on them to obtain the simulated sky map δTb(𝜽)\delta T_{b}(\bm{\theta}) at a specific frequency. Subsequently, we use the frequency scaling to obtain the DGSE map at other desired frequencies. Figure 2 shows a single realization of the 2D simulated DGSE map centered at frequency ν=175\nu=175 MHz corresponding to redshift z=7.1z=7.1. As mentioned earlier, the angular resolution of these maps is 23′′23^{\prime\prime}, and the strength of temperature fluctuations changes with angular resolution. We note that the mean temperature of the DGSE map is set to zero, as the radio interferometric experiments, such as the uGMRT/SKA1-Low, are not sensitive to the mean due to the lack of the zero spacing baseline. Subsequently, we obtain mock sky maps at multiple frequency channels by combining the HI 21-cm and the foreground maps simulated above.

Refer to caption
Refer to caption
Figure 2: The left panel shows a simulated 2D image of the diffuse galactic synchrotron emission (DGSE) of angular size 3.3×3.3\sim 3.3^{\circ}\times 3.3^{\circ} with resolution of 23′′\sim 23^{\prime\prime} at frequency 175175 MHz. The right panel shows a 2D image of extragalactic radio point sources with a similar angular size, resolution and observing frequency.

3.2.1 Extragalactic radio point sources

One of the most significant foreground contaminants in HI 21-cm signal observation arises from the extragalactic radio point sources, such as Active Galactic Nuclei (AGN), quasars, and radio galaxies. At lower frequencies (150{\sim 150} MHz), these sources dominate the sky at smaller angular scales 4\lesssim 4^{\circ} [30]. Therefore, in order to study the HII regions at frequencies of 150150 and 175175 MHz, it is essential to model the point sources efficiently.

In our simulations, we model the population of extragalactic radio point sources using the following 150150 MHz differential source count model from [58]:

dNdS=103.75Jy.Sr(S1Jy)1.6,{\frac{dN}{dS}=\frac{10^{3.75}}{\rm Jy.Sr}\left(\frac{S}{1\,\rm Jy}\right)^{-1.6}}, (3.3)

where SS is the flux density of the point sources. We consider the point sources to be in the flux range between 0.10.1 mJy to 11 Jy and they are randomly distributed across an angular size of 3.3×3.3{\sim 3.3^{\circ}\times 3.3^{\circ}}. Based on these parameters, we obtained 77757775 point sources from the simulation. We model the spectral energy distribution of the point sources using a simple power law (Sνα{S\propto\nu^{-\alpha}}), where the spectral index α\alpha for each source is randomly assigned within the range of 0.70.7 to 0.80.8. The right panel of Figure 2 shows a 2D map of the point sources randomly distributed across an angular area of approximately 3.3×3.33.3^{\circ}\times 3.3^{\circ}, with an angular resolution of 23′′{\sim 23^{\prime\prime}}.

3.3 Visibilities for spherical ionized bubbles: An analytical approach

In radio interferometric observations, the measured quantity is the visibility, denoted as V(𝑼,ν)V(\bm{U},\nu), which can be written as the Fourier transform of the product of the specific intensity pattern of the sky, δIν(𝜽)\delta I_{\nu}(\bm{\theta}) and A(𝜽,ν)A(\bm{\theta},\nu). It is given as,

V(𝑼,ν)=d2θA(𝜽,ν)δIν(𝜽)e2πi𝜽.𝑼.V(\bm{U},\nu)=\int d^{2}\theta\,A(\bm{\theta},\nu)\,\delta I_{\nu}(\bm{\theta})\,e^{2\pi i\bm{\theta}.\bm{U}}. (3.4)

Here, the baseline 𝑼=𝒅/λ\bm{U}=\bm{d}/\lambda denotes the antenna separation 𝒅\bm{d} projected on the plane perpendicular to the line of sight in units of the observed wavelength (λ\lambda) of the signal. 𝜽\bm{\theta} is a two-dimensional vector on the plane of the sky with the origin at the center of the field of view (FoV), and A(𝜽,ν)A(\bm{\theta},\nu) is the beam pattern of the individual antenna. Assuming uniformly illuminated circular aperture of diameter DD, we model the antenna primary beam pattern as [57],

A(𝜽,ν)=[(2λπθD)J1(πθDλ)]2,A(\bm{\theta},\nu)=\left[\left(\frac{2\lambda}{\pi\theta D}\right)J_{1}\left(\frac{\pi\theta D}{\lambda}\right)\right]^{2}, (3.5)

where J1J_{1} is the first order Bessel function. The antenna diameter D=45D=45m and 3535m for the uGMRT and SKA1-Low respectively. The specific intensity pattern of the sky, δIν(𝜽)δTb(𝒏,z)\delta I_{\nu}(\bm{\theta})\equiv\delta T_{\rm b}({\bm{n}},z) and can be written as,

δIν(𝒏,z)=2kBν2c2δTb(𝒏,z).\delta I_{\nu}({\bm{n}},z)=\frac{2k_{B}\nu^{2}}{c^{2}}{\rm\delta}T_{\rm b}({\bm{n}},z). (3.6)

Considering a spherical ionized bubble surrounded by a uniform HI background at the center of the field of view, we can derive the analytical formula for the visibility of the HI 21-cm signal at a particular frequency channel ν\nu and baseline 𝑼\bm{U} following [33] as,

Scenter(𝑼,ν)=πIν¯xHIθν2[2J1(2πUθν)2πUθν]Θ(1|ννc|Δνb),S_{\rm center}(\bm{U},\nu)=-\pi\bar{I_{\nu}}x_{\rm HI}{\theta_{\nu}}^{2}\left[\frac{2J_{1}(2\pi U\theta_{\nu})}{2\pi U\theta_{\nu}}\right]\Theta\left(1-\frac{\left|\nu-\nu_{c}\right|}{\Delta\nu_{b}}\right), (3.7)

where

Iν¯=2.5×102Jy/sr(Ωbh20.02)(0.7h)(H0H(z))\bar{I_{\nu}}=2.5\times 10^{2}{\rm Jy/sr}\left(\frac{\Omega_{b}h^{2}}{0.02}\right)\left(\frac{0.7}{h}\right)\left(\frac{H_{0}}{H(z)}\right) (3.8)

is the specific intensity from uniform HI background at redshift zz, xHIx_{\rm HI} is the mass-averaged neutral hydrogen fraction, θR\theta_{R} is the angular size of the ionized bubble corresponding to comoving radius RR, J1J_{1} is first order Bessel function and Δνb\Delta\nu_{b} is ionized bubble size in frequency. Note that a spherical ionization bubble will appear as a circular one at a particular frequency channel. We use this analytical equation to design a filter for detecting the ionized bubbles in the mock HI 21-cm maps obtained from simulations.

3.4 Simulating visibilities

To simulate mock uGMRT and SKA1-Low observations, we consider baseline distributions corresponding to both instruments. There are currently 3030 active antennas at uGMRT [59] and the proposed number of antennas in SKA1-Low is 512512 [60]. Table 1 summarises instrument specifications. Figure 3 shows the baseline coverage for an 88-hour uGMRT and SKA1-Low observations at the 175175 MHz frequency corresponding to z=7.1z=7.1 for the HI 21-cm signal. It also shows the baseline density i.e., the total number of baselines per unit area of circular annulus corresponding to baseline UU and U+dUU+dU. For the uGMRT, baselines vary from 32λ32\lambda to 13500λ13500\lambda and for the SKA1-Low, the baseline range extends from 4λ4\lambda to 37000λ37000\lambda over the same 88 hour, 175175 MHz observations. As expected, the number density of baselines and their extent are much greater for the SKA1-Low, as it has more antennae, and some of them are placed at large distances compared to the uGMRT. The baseline distribution at 153153 MHz (corresponding to z=8.3z=8.3) is very similar to that at 175175 MHz but will be reduced (refer to Table 1).

Observation No. of antennae zz ν\nu(MHz) No. of baselines per snapshots Umax(λ)U_{max}(\lambda) Umin(λ)U_{min}(\lambda)
uGMRT 3030 7.17.1 175175 435435 1350013500 3232
uGMRT 3030 8.38.3 153153 435435 1200012000 2828
SKA1-Low 512512 7.17.1 175175 130816130816 3700037000 44
SKA1-Low 512512 8.38.3 153153 130816130816 3200032000 3.53.5
Table 1: This shows some relevant parameters for the uGMRT and SKA1-Low at different observing frequencies.

We generate the mock visibility data of the HI 21-cm signal and the foregrounds (DGSE and extragalactic point sources) at all simulated baselines at 256256 frequency channels, centered around frequencies ν=175\nu=175 MHz and 153153 MHz, using the Discrete Fourier Transformation (DFT) of the total simulated maps. For simplicity, we keep the baseline distribution the same in all frequency channels.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 3: Left panels show baseline coverage for 88 hours of the uGMRT (upper left panel ) and SKA1-Low (lower left panel) observations at 175175 MHz. Right panels show corresponding baseline density as a function of baseline.

Figure 4 shows simulated visibilities corresponding to the 2D central HI 21-cm map around the ionized bubble centered at z=7.1z=7.1. The top left and top right panels show plots of the signal as a function of baseline and frequency respectively, for the uGMRT and the bottom panels display results for the SKA1-Low. We also compare the simulated visibility with the analytical estimation assuming a perfectly spherical ionized bubble of comoving radius 2323 Mpc and surrounded by uniform and fully neutral medium (refer to equation 3.7). The IGM around the ionized bubble in simulated maps is not uniform, but traces the underlying DM distribution and is also filled with many small ionized bubbles. This causes HI density to fluctuate and consequently, the visibility signal also fluctuates as a function of baseline and frequency. We see in Figure 4 that the HI fluctuations are stronger at lower baselines which reflects the fact that HI power spectrum PHI(k)P_{\rm HI}(k) (or equivalently the visibility-visibility correlations) increases at lower kk-modes or baselines [61, 34]. We further see that the fluctuating HI outside the bubble dominates over the visibilities solely due to the HI 21-cm signal around a spherical ionized bubble which is buried in a uniform HI background. Visibilities are similar for the second scenario at z=8.3z=8.3 for both experiments and we do not show them here explicitly.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 4: Left panels show simulated visibilities as a function of baseline UU for the central frequency channel at 175175 MHz for the uGMRT (upper panel) and SKA1-Low (lower panel). Red lines show analytical predictions for a spherical ionized bubble of the same radius as in simulation and uniform HI outside the bubble. Right panels show the same as a function of observing frequency ν\nu for fixed baselines U=34λU=34\lambda (uGMRT) and U=11λU=11\lambda (SKA1-Low).

3.5 System noise

To make our study more realistic, we introduce system noise contribution from radio interferometers in our simulated mock visibility data. The system noise contribution to the measured visibility in each baseline and frequency channel is an independent Gaussian random variable with zero mean and the root mean square [33]

N2=2kBTsysAeffΔνΔt,\sqrt{\langle N^{2}\rangle}=\frac{\sqrt{2}k_{B}T_{\rm sys}}{A_{\rm eff}\sqrt{\Delta\nu\Delta t}}, (3.9)

where TsysT_{\rm sys} represents the total system temperature, AeffA_{\rm eff} is the effective collecting area of the antenna, and Δν\Delta\nu and Δt\Delta t denote the channel width and correlator integration time, respectively.

For the uGMRT, Tsys=300T_{\rm sys}=300 K and Aeff/(2kB)=0.33A_{\rm eff}/(2k_{\rm B})=0.33 K/Jy [59] at frequency ν175\nu\sim 175 MHz. The estimated root mean square noise in visibility is approximately 0.64Jy0.64\,{\rm Jy} for frequency channel width (Δν\Delta\nu) of 62.562.5 kHz and integration time (Δt\Delta t) of 1616 seconds in each baseline. Tsys=300T_{\rm sys}=300 K and Aeff/(2kB)=0.348A_{\rm eff}/(2k_{\rm B})=0.348 K/Jy for the SKA1-Low at the same frequency considering antenna efficiency factor unity. TsysT_{\rm sys} and Aeff/(2kB)A_{\rm eff}/(2k_{\rm B}) change to 400400 K and 0.3480.348 K/Jy for the SKA1-Low at 153153 MHz frequency. For the uGMRT, these numbers are 400400 K and 0.330.33 K/Jy [59] respectively at 153153 MHz frequency. We note that the rms noise will be reduced by a factor of tobs/Δt\sqrt{t_{\rm obs}/\Delta t} where tobst_{\rm obs} is the total observation time.

Our mock simulation considers 88 hrs of observations with 1616s and 320320s integration time for the uGMRT and SKA1-Low respectively. This produces a total of 783000783000 and 1177344011773440 baselines for the uGMRT and SKA1-Low respectively. Lower integration time will produce a higher number of baselines for the SKA1-Low and hence will increase the requirement of computational resources which is beyond our reach. The system noise contribution for 88 hrs of observations will be too high to detect the HI 21-cm signal around the individual ionized bubbles. Therefore, we present our results for 20482048 and 30723072 hrs of uGMRT observations at redshifts 7.17.1 and 8.38.3, respectively, assuming a total of 256256 and 384384 nights of observations, with each night lasting 88 hours. Further, for the third scenario, we present the results at redshift 7.17.1 for 51205120 hours, assuming 640640 nights of observations. The increased observing time for the third scenario is due to the lower mean neutral hydrogen fraction outside the targeted bubble. Assuming all nights are the same, we reduce the system noise corresponding to an 88 hrs observation by a factor of 1616, 19.619.6, and 25.2925.29 for the three scenarios, respectively. The resulting mock signal becomes equivalent to 20482048, 30723072, and 51205120 hours of uGMRT observations. Similarly, for the SKA1-Low we present our results for 9696 hrs of observation for the first two cases at redshift 7.17.1 and 8.38.3 (neutral hydrogen fraction 0.9{\sim 0.9}) and for 200200 hrs of observation for the third case at redshift 7.17.1 (neutral hydrogen fraction 0.5{\sim 0.5}). Eventually, we obtain mock visibility data with all the signal contributions (HI 21-cm signal, galactic foreground, extragalactic point sources, and system noise contribution) typical of the uGMRT and SKA1-Low observations.

Figure 5 shows the mock visibility data comprising of HI signal, diffused galactic foreground contribution, and system noise as a function of frequency for two baselines U=34λU=34\lambda (top left panel) and U=230λU=230\lambda (top right panel) for the uGMRT at 175175 MHz. The corresponding mock visibility for the SKA1-Low is shown in the bottom panels (bottom left panel: U=11λU=11\lambda, and bottom right panel: U=254λU=254\lambda). They also show diffused galactic foreground contributions (red dashed lines). We see that the diffused galactic foreground dominates over the HI 21-cm signal and system noise at a small baseline (left panel) and, at a large baseline (right panel) the system noise dominates over the other two components. The HI 21-cm signal remains buried under the stronger foreground and system noise contributions.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 5: Upper panels show mock visibilities (HI 21-cm signal + diffuse galactic foreground + system noise) as a function of observing frequency for the uGMRT at baselines U=34λU=34\lambda (upper left panel) and U=230λU=230\lambda (upper right) at 175175 MHz. Lower panels show the same for the SKA1-Low at baselines U=11λU=11\lambda (lower left panel) and U=254λU=254\lambda (lower right). Red dashed lines show diffuse galactic foreground contamination only.

Figure 6 shows the total mock visibility comprising of HI signal, diffused galactic synchrotron emission, radiation from extra-galactic radio point sources, and system noise as a function of frequency for two baselines U=34λ{U=34\lambda} (left panel) and U=230λ{U=230\lambda} (right panel) for the uGMRT at 175175 MHz. We note that Figure 6 shows contribution both from the extragalactic point sources, galactic diffuse synchrotron radiation along with other components unlike Fig. 5 which shows galactic diffuse synchrotron emission only as the foreground component. The radio point sources are modeled using a simplistic power-law form as discussed in subsection 3.2.1. The green lines indicate visibilities due to point sources. We see that at a small baseline (left panel) the diffuse galactic synchrotron emission dominates over the point sources; at a large baseline (right panel), the point sources dominate over the diffuse galactic synchrotron emission. The total mock visibilities are similar for the second scenario at z=8.3{z=8.3} for both experiments, except for the fact that the system noise and foreground contributions are higher. Therefore, we do not show them here explicitly. The subsequent section discusses the subtraction of the foreground contribution.

Here, we would like to note that the visibility data used in this study does not contain calibration errors, ionospheric effects, and radio-frequency interference (RFI). A comprehensive investigation of these factors will be addressed in future work.

Refer to caption
Refer to caption
Figure 6: Left panel shows the total mock visibilities (HI 21-cm signal + DGSE + radio point sources + system noise) as a function of observing frequency for the uGMRT at baseline U=34λ{U=34\lambda} and the right panel shows the same at baseline U=230λ{U=230\lambda} at 175175 MHz observing frequency. Red dashed lines show diffuse galactic foreground contamination and the green lines show the point source contribution separately.

4 Detecting individual ionized bubble in HI 21-cm maps

4.1 Foreground subtraction

The foreground contributions are anticipated to be several orders of magnitude higher than the HI 21-cm signal around ionized bubbles during the EoR. Therefore, one significant challenge is reliably subtracting the foreground signal from the observed signal. All foreground mitigation techniques use the fact that the foregrounds are smooth along the frequency [62, 63]. As discussed earlier, the foreground signal in our simulated mock data has also been assumed to be smooth along frequency. To mitigate foreground components, we fit the total mock visibility signal with a 3rd3^{rd} order polynomial of observing frequency ν\nu as follows,

f(ν)=aν3+bν2+cν+d,f(\nu)=a\nu^{3}+b\nu^{2}+c\nu+d, (4.1)

where parameters aa, bb, cc, and dd are constant. Then, we find the best-fit polynomial along each line of sight corresponding to a baseline and subsequently subtract them from the total simulated mock visibility signal.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 7: This shows recovered visibilities after foreground subtraction for the uGMRT (top panels) and SKA1-Low (bottom panels) at the same baselines and frequency considered in Figure 5.

Figure 7 presents the residual visibility at 175 MHz after the foreground contribution is subtracted from the total mock visibility for the uGMRT (top panels) and the SKA1-Low (bottom panels). Here, we have only considered the diffuse galactic synchrotron emission as the foreground contamination. However, we get a very similar residual signal when we perform similar analysis on the total mock visibilities considering the radio point source contribution, which has been shown in Figure 9. The residual signal contains the HI 21-cm signal, the system noise, and the un-subtracted foreground, if any. To assess the reliability of foreground subtraction, we compare the combined input HI 21-cm signal and system noise with the extracted signal for two different baselines U=34λ{U=34\lambda} (top left panel) and 230λ{230\lambda} (top right panel), as depicted in Figure 8. In the bottom panels of Figure 8, we show a similar comparison for SKA1-Low for two different baselines U=11λ{U=11\lambda} (bottom left panel) and U=254λ{U=254\lambda} (bottom right panel). The plots show that the residual visibility closely matches the input visibility data, with only minor deviations. These slight deviations are possibly due to some residual foreground contribution and subtraction of the HI 21-cm signal and the system noise during the foreground subtraction. Consequently, we can conclude that we have mostly subtracted the foreground data from the mock visibility data, making it ready for further analysis, such as matched filtering. We do not present similar results for the second scenario at z=8.3z=8.3 as it is similar to the first scenario.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 8: These plots compare the recovered visibility (red dashed lines) after subtracting the diffused galactic foreground contribution with the original visibility consisting of the HI 21-cm signal and noise contributions (blue lines) for the uGMRT (top panels) and SKA1-Low (bottom panels) at same baselines and observing frequency considered in Figure 5. Here, the foreground component consists of diffuse galactic radiation only and visibilities are plotted for every fourth frequency channel for visual clarity.
Refer to caption
Refer to caption
Figure 9: These plots compare the recovered visibility (red dashed lines) after subtracting the extra galactic point sources with the original visibility consisting of the HI 21-cm signal and noise contributions (blue lines) for the uGMRT at same baselines and observing frequency considered in the top panels of Figure 5. Here, the foreground component consists of extra galactic point sources only. Visibilities are plotted for every fourth frequency channel for visual clarity.

4.2 Matched filter

The residual mock visibility (see Figure 7) contains mainly the HI 21-cm signal around the ionized bubble and the system noise from radio interferometers. We see that the dominance of system noise over the HI 21-cm signal is so pronounced that the latter’s presence is barely discernible. Here, we employ a matched filter technique to maximize the signal-to-noise ratio, thereby enhancing the prospects of detecting the HI 21-cm signal around an ionized bubble. The functional form of the visibility signal corresponding to neutral hydrogen around a spherical ionized bubble is quite well known (refer to eq. 3.7). Additionally, the system noise is anticipated to follow a Gaussian random distribution. These conditions enable us to implement the matched filter-based technique effectively. Below, we briefly summarize the method proposed in [33] and consider the following estimator for detecting ionized bubbles from the HI 21-cm signal, which is given as

E^=[a,bSf(𝑼a,νb)V^(𝑼a,νb)]/[a,b1].{\hat{E}}=\left[{\sum_{a,b}{S_{f}}^{*}(\bm{U}_{a},\nu_{b})\hat{V}(\bm{U}_{a},\nu_{b})}\right]/\left[{\sum_{a,b}1}\right]. (4.2)

Here, Sf(𝑼,ν)S_{f}(\bm{U},\nu) is the filter which is a function of both baseline 𝑼\bm{U} and frequency ν\nu. V^(𝑼a,νb)\hat{V}(\bm{U}_{a},\nu_{b}) is the total observed visibility which consists of the desired HI 21-cm signal, residual foregrounds, system noise, etc. Here, 𝑼a\bm{U}_{a} and νb\nu_{b} refer to the different baselines and frequency channels in our observations. Here, we note that foreground components are subtracted from the total observed visibilities and we employ the matched filtering technique only on the residual visibility which contains only the HI 21-cm signal and system noise. Figure 3 shows a typical uGMRT and SKA1-Low baseline distribution for 88 hrs of observations. We note that the estimator is calculated by taking the sum over all baselines and frequency channels. We get one estimator for a given filter. The contribution of the system noise to the estimator is expected to be zero if it is averaged over a large number of independent realizations. However, for a single realization of the system noise, the contribution of it is unlikely to be exactly zero. It would change for different realizations of the system noise. Therefore, we estimate the variance of the estimator due to the system noise using the following equation [33],

(ΔE^)2NS=N2[a,b|Sf(𝑼a,νb)|2]/[a,b]2.\left<(\Delta{\hat{E}})^{2}\right>_{NS}={\langle N^{2}\rangle}\left[\sum_{a,b}\left|S_{f}(\bm{U}_{a},\nu_{b})\right|^{2}\right]/\left[\sum_{a,b}\right]^{2}. (4.3)

Here, N2\langle N^{2}\rangle is the variance of the system noise for single visibility and can be calculated using eq. 3.9. The symbol a,b\sum_{a,b} denotes summation over all baselines and frequency channels. We note that the variance to the estimator depends on the chosen filter, system noise, and baseline distribution but not on the desired signal. The signal-to-noise ratio (SNR) of a particular detection (or non-detection) can be calculated as,

SNR=E^(ΔE^)2NS.{\rm SNR}=\frac{\langle\hat{E}\rangle}{\sqrt{\left<(\Delta{\hat{E}})^{2}\right>_{\rm NS}}}. (4.4)

In this work, we consider SNR5{\rm SNR}\gtrsim 5 as the threshold for detection. We would like to mention that the SNR reaches its maximum when the selected filter exactly (or closely) matches with the HI 21-cm signal which is contaminated by the system noise. Therefore, the chosen filter has the same functional form as the HI 21-cm signal around an ionized bubble (refer to eq. 3.7). It also has one free parameter i.e., the radius of the ionized bubble RbR_{\rm b}. We assume that the location of the bubble is known a priori from other observations. To find a filter that matches the targeted signal, one needs to explore the entire possible ranges of all parameters (here the size of the targeted bubble). The particular set of parameters yielding the maximum SNR will reveal the size of the ionized bubble in the observed HI 21-cm signal.

5 Results

5.1 Detectability of the QSO ionized bubble at 𝒛=7.1\bm{z=7.1}

Here, we present our findings regarding the detectability of ionized bubbles around bright QSO/galaxies in HI 21-cm maps, employing the matched filter technique. We assume that the targeted ionized bubble is located at the antenna phase center. Thus, we simulate the targeted ionized bubble at the center of the simulation cube. Subsequently, we apply the matched filtering technique to the residual visibility obtained after subtracting foreground contaminants. As explained above the residual visibility contains the HI 21-cm signal, system noise, and unsubtracted foregrounds. First, we calculate the estimator E^{\hat{E}} using equation 4.2. Because the actual size of the targeted ionized bubble is not known in real observations we calculate the estimator for various filter sizes ranging from 55 to 3535 Mpc. Subsequently, we estimate the variance of the estimator (ΔE^)2NS\left<(\Delta{\hat{E}})^{2}\right>_{\rm NS} due to the system noise using equation 3.9 and the signal to noise ratio (SNR, refer to eq. 4.4) for the same range of filter sizes. Figure 10 shows the resulting SNR as a function of filter size RfR_{\rm f} for 20482048 hrs of uGMRT observation (left panel) and 9696 hrs of SKA1-Low observations (right panel) at z=7.1z=7.1.

Refer to caption
Refer to caption
Figure 10: This figure shows the signal-to-noise ratio for 1010 independent noise realizations corresponding to 20482048 hrs of observation time for the uGMRT (left panel) and 9696 hrs of SKA1-Low observations (right panel) at redshift 7.17.1.

The ten different curves in Figure 10 represent the SNR for ten independent realizations of the system noise. We see that for both the uGMRT and the SKA1-Low, the SNR peaks at filter size 23\sim 23 Mpc, with minor deviations in different noise realizations. As per the principle of the matched filter technique, the SNR peaks when the filter matches the actual targeted signal (i.e., the HI 21-cm signal around the ionized bubble) in the simulated HI maps. Therefore, we can conclude that the extracted radius of the ionized bubble in the simulated HI 21-cm map is around 23\sim 23 Mpc. We want to note that the actual radius of the ionized bubble measured directly from the simulated map is 23.523.5 Mpc, which is very close to the extracted ones using the matched filter technique. The location of the peak in the SNR curve shifts around the mean value for different realizations, which we shall discuss in detail in the next section.

Further, we see that peak SNR values in all the ten realizations are 4\gtrsim 4 for 20482048 hrs of observations with the uGMRT and 10\sim 10 for 9696 hrs of observations with the SKA1-Low. We assume a total bandwidth of 1616 MHz and 6.256.25 MHz while subtracting the foreground for the uGMRT and SKA1-Low, respectively. The choice of smaller bandwidth for the SKA1-Low is solely due to the limited computational resources available. Use of the higher bandwidth for the SKA1-Low would have subtracted the HI 21-cm signal less during the foreground subtraction process, thus increasing the SNR. Subsection 5.4 discusses the impact of foreground subtraction in more detail. The higher SNR obtained in our results indicates that it is possible to significantly detect individual ionized bubbles in the HI 21-cm maps using the uGMRT with 20482048 hrs observations and using the SKA1-Low with only around 100\sim 100 hrs of observations.

Refer to caption
Refer to caption
Figure 11: The left panel shows the mean signal-to-noise (SNR) ratio of 100100 independent noise realizations as a function of filter size for 20482048 hrs of the uGMRT observations at redshift 7.17.1. The right panel shows the SNR of 2020 independent noise realizations for 9696 hrs of the SKA1-Low observations. The shaded region shows 1σ1\sigma uncertainty estimated from independent realizations.

To assess the significance of the above result, we estimate the SNR for 100100 independent realizations of the system noise for the uGMRT and 2020 independent realizations for SKA1-Low, while keeping the HI 21-cm signal and foreground components the same. Figure 10 shows the SNR for some of these realizations for the uGMRT (left panel) and SKA1-Low (right panel) at z=7.1z=7.1. The left panel of Figure 11 shows the corresponding mean SNR and 1σ1-\sigma spread for all 100100 realizations of the uGMRT, while the right panel shows the same for the SKA1-Low with 2020 realizations. We see that all the realizations show a peak in the SNR. In principle, the peak should appear at the actual ionized bubble size of 23.523.5 Mpc. However, the peak location varies in the range of 183018-30 Mpc and 222422-24 Mpc for the uGMRT and SKA1-Low, respectively, for different realizations. It can be seen in Figure 12, which shows a histogram of peak locations for 100100 realizations for the uGMRT.

Refer to caption
Figure 12: This figure shows the histogram of filter sizes corresponding to peaks of the SNR for 100100 different noise realizations for the uGMRT at redshift 7.17.1.

We see that, although in most cases, the peak is in the range of 222622-26 Mpc for the uGMRT, some extreme realizations show peaks at higher/lower filter radii. The peak value of the SNR for the uGMRT also changes in the range of 37\sim 3-7. Due to much lower system noise and higher sensitivity, the SKA1-Low should be able to detect the ionized bubble with reduced observing time and higher SNR. The estimated bubble size is also highly accurate for the SKA1-Low, which we do not explicitly show here. We note that the above analysis considers only the DGSE as the foreground contaminant. However, we have repeated the analysis including contributions from both the DGSE and radio point sources as foregrounds. We find very similar results to those presented here. Therefore, we do not explicitly show the outcomes of the combined foreground contamination here.

5.2 Detectability of the ionized bubble at 𝒛=8.3\bm{z=8.3}

Unlike the ionized bubble at z=7.1z=7.1, mainly created by a bright QSO, the bubble at z=8.3z=8.3 is produced by a collection of galaxies as indicated in [17]. This ionized bubble is relatively more aspherical in shape than at z=7.1z=7.1. Figure 13 presents the SNR as a function of filter size RfR_{\rm f} for 3072 hrs of uGMRT (left panel) and 9696 hrs of SKA1-Low (right panel) observations at z=8.3z=8.3.

Refer to caption
Refer to caption
Figure 13: This figure shows the signal-to-noise ratio for 1010 independent noise realizations corresponding to 30723072 hrs of observation time for the uGMRT (left panel) and 9696 hrs of SKA1-Low observations (right panel) at redshift 8.38.3.

Different curves are for different independent noise realizations. The system temperature is higher at z=8.3z=8.3 compared to z=7.1z=7.1. Therefore, the SNR decreases at a higher redshift for the same observation time. We see that for both the uGMRT and the SKA1-Low, the SNR peaks at a filter size 28\sim 28 Mpc, with minor deviations in different noise realizations. It is very close to the actual radius of the ionized bubble measured directly from the simulated map, which is 27.827.8 Mpc.

Further, we see that peak SNR values in all ten realizations are 4\gtrsim 4 for the uGMRT and 8\sim 8 for the SKA1-Low. Like the previous case, we assume a total bandwidth of 1616 MHz and 6.256.25 MHz while subtracting the foreground for the uGMRT and SKA1-Low, respectively. We see that it is possible to reliably detect individual ionized bubbles in the HI 21-cm maps using the uGMRT with 30723072 hrs of observation time and the SKA1-Low with only 100\sim 100 hrs of observations at z=8.3z=8.3.

The left panel of Figure 14 shows the mean SNR along with the 1σ1-\sigma spread for 100100 noise realizations of the uGMRT, while the right panel shows the same for the SKA1-Low with 2020 realizations at z=8.3z=8.3. Further, the peak location varies between 2424 and 3636 Mpc for the uGMRT for different realizations (Figure 13), which can be seen in Figure 15, which shows a histogram of peak locations for 100100 realizations for the uGMRT. The peak value of the SNR for the uGMRT also changes in the range of 46\sim 4-6 and 6.58.56.5-8.5 for the SKA1-Low.

Refer to caption
Refer to caption
Figure 14: The left panel shows the mean signal-to-noise (SNR) ratio of 100100 independent noise realizations as a function of filter size for 30723072 hrs of the uGMRT observations at redshift 8.38.3. The right panel shows the SNR of 2020 independent noise realizations for 9696 hrs of the SKA1-Low observations. The shaded region shows 1σ1\sigma uncertainty estimated from independent realizations.
Refer to caption
Figure 15: Same as Figure 12 at redshift z=8.3z=8.3.

5.3 Detectability of the QSO ionized bubble at 𝒛=7.1\bm{z=7.1} for 𝒙𝐇𝐈=0.52\bm{x_{\rm HI}=0.52}

Here, we present our findings for the third case, where the targeted ionized bubble is surrounded by other ionized regions which result in a lower neutral hydrogen fraction of xHI=0.52{x_{\rm HI}=0.52}. Further, the targeted bubble at the center also became aspherical. A lower neutral hydrogen fraction means that the background HI 21-cm signal is weaker, which results in a lower SNR value. We use the same matched filtering technique as described earlier but increased the observation time to 51205120 hours for the uGMRT and 200200 hours for the SKA1-Low, considering the lower SNR and increased difficulty in detecting ionized regions at a lower neutral fraction. Figure 16 shows the SNR as a function of filter size Rf{R_{\rm f}} for 51205120 hours of uGMRT observation (left panel) and 200200 hours of SKA1-Low observations (right panel) at z=7.1z=7.1. As in previous cases, each curve represents a different noise realization. We see that for the uGMRT, the SNR peaks at a filter size 25{\sim 25} Mpc, and for the SKA1-Low, it peaks at 23{\sim 23} Mpc, with minor deviations in different noise realizations.

Refer to caption
Refer to caption
Figure 16: This figure shows the signal-to-noise ratio for 1010 independent noise realizations corresponding to 51205120 hours of observation time for the uGMRT (left panel) and 200200 hours of SKA1-Low observations (right panel) at redshift 7.17.1.

Further, we see that peak SNR values in all ten realizations are 3{\gtrsim 3} for the uGMRT and 11{\sim 11} for the SKA1-Low. Like the previous cases, we assume a total bandwidth of 1616 MHz and 6.256.25 MHz while subtracting the foreground for the uGMRT and SKA1-Low, respectively. We see that it is possible to reliably detect individual ionized bubbles in the HI 21-cm maps using the uGMRT with 51205120 hours of observation time and the SKA1-Low with only 200\sim 200 hours of observations at z=7.1z=7.1 with significantly much lower neutral hydrogen fraction value.

Refer to caption
Refer to caption
Figure 17: The left panel shows the mean signal-to-noise (SNR) ratio of 100100 independent noise realizations as a function of filter size for 51205120 hours of the uGMRT observations at redshift 7.17.1. The right panel shows the SNR of 2020 independent noise realizations for 200200 hours of the SKA1-Low observations. The shaded region shows 1σ{1\sigma} uncertainty estimated from independent realizations.
Refer to caption
Figure 18: Same as Figure 12 at redshift z=7.1z=7.1 with a lower neutral hydrogen fraction of 0.520.52.

The left panel of Figure 17 shows the mean SNR along with the 1σ{1-\sigma} spread for 100100 noise realizations of the uGMRT, while the right panel shows the same for the SKA1-Low with 2020 realizations at z=7.1z=7.1 for xHI=0.52{x_{\rm HI}=0.52}. Further, the peak location varies between 1616 and 3232 Mpc for the uGMRT for different realizations (Figure 16), which can be seen in Figure 18, that shows a histogram of peak locations for 100100 realizations for the uGMRT. The peak value of the SNR for the uGMRT also changes in the range of 35\sim 3-5 and 10.512.510.5-12.5 for the SKA1-Low.

5.4 Impact of foreground subtraction

Refer to caption
Figure 19: This shows the signal-to-noise ratio as a function of frequency bandwidth used during foreground subtraction for the uGMRT at redshift z=7.1z=7.1.

We noticed that during the process of foreground subtraction using a polynomial along the frequency axis, a small part of the HI 21-cm signal also gets subtracted along with the foreground. It reduces the SNR to some extent. We further notice that the amount of subtraction of the HI 21-cm signal depends on the frequency bandwidth used during the foreground subtraction process. The fraction of HI 21-cm signal subtracted is more for smaller bandwidth compared to the case when larger frequency bandwidth is used. Consequently, the drop in the SNR is more significant for smaller bandwidths. Figure 19 shows the SNR as a function of the bandwidth for 20482048 hrs of observations with uGMRT at z=7.1z=7.1. We see that the SNR, which is lower than 22 at 66 MHz bandwidth, increases to 4.54.5 when 1616 MHz bandwidth is assumed. Here, we note that the SNR reaches 6.56.5 for the above observation when no foreground is used and the process of foreground subtraction is skipped. Therefore, the SNR is reduced by 3030 percent when the bubble size is 23.5\sim 23.5 Mpc, and 1616 MHz bandwidth is assumed for foreground subtraction. Using higher bandwidth during the foreground subtraction would enhance the SNR and prospects of ionized bubble detection. However, increased bandwidth will increase the computational cost during the analysis. We find similar results for the SKA1-Low but do not explicitly show them here. The impact of foreground subtraction on the SNR will also depend on the foreground subtraction methods, and therefore, a thorough investigation is required.

5.5 Constraining the neutral hydrogen fraction

We see that the visibility corresponding to the HI 21-cm signal around a spherical ionized bubble in the IGM is proportional to the mass-averaged neutral hydrogen fraction xHIx_{\rm HI} of the IGM outside the bubble (refer to eq. 3.7). One can further show that [36],

SNR=CxHI,{\rm SNR}=Cx_{\rm HI}, (5.1)

where CC depends on the bubble size, redshift, and various observational parameters such as the TsysT_{\rm sys}, total number of antennae, observing time, antenna collecting area, and baseline distribution. All these quantities are constant at a fixed redshift. One can use this relation to place constraints on xHIx_{\rm HI}. Now, we focus on the ionized bubble around the QSO at z=7.1z=7.1 and 20482048 hrs of observation time with uGMRT. The SNR value changes for different noise realizations in Figure 10 (left panel). The average SNR value peaks at 4.64.6 (Figure 11) at the filter size 23 Mpc for the uGMRT at z=7.1z=7.1. Further, we see that 1σ1\sigma uncertainty in the SNR value at the same filter size is 0.73. Therefore, we estimate the hydrogen neutral fraction using xHI=(SNR±1σSNR)/Cx_{\rm HI}=({\rm SNR}\pm 1\sigma_{\rm SNR})/C, where the SNR and 1σSNR1\sigma_{\rm SNR} is measured at the peak location.

To calculate CC, we simulate an entirely neutral and uniform HI 21-cm signal around a spherical ionized bubble of radius 23 Mpc, i.e., xHI=1x_{\rm HI}=1 outside the ionized bubble. It allows us to estimate CC using eq. 5.1. Now, after adding the foreground component only with the HI 21-cm signal, we follow the entire analysis and calculate the SNR as a function of filter size without adding system noise from the radio interferometers. The SNR, in this case, peaks at 23 Mpc and is supposed to be equal to CC (following eq. 5.1) as the xHI=1x_{\rm HI}=1 outside the ionized bubble in this case. We use this value of CC to estimate the mean xHIx_{\rm HI} and the 1σ1\sigma uncertainty associated with it. The estimated neutral fraction is xHI=0.93±0.15x_{\rm HI}=0.93\pm 0.15 at z=7.1z=7.1 for 20482048 hrs of observations with uGMRT. It is very close to the actual xHIx_{\rm HI} in the simulation, which is 0.880.88. The estimated 1σ1\sigma upper limit of xHIx_{\rm HI} exceeds unity which is, of course, unrealistic. The SKA1-Low places much tighter constraints, which is xHI=0.92±0.055x_{\rm HI}=0.92\pm 0.055 using only 9696 hrs of observations for the same scenario. Similarly we also studied this for the third case, where the neutral hydrogen fraction is lower in the simulation, discussed in subsection 5.3. We find the estimated neutral hydrogen fraction to be xHI=0.49±0.09{x_{\rm HI}=0.49\pm 0.09} at z=7.1z=7.1 for 51205120 hours of observations with the uGMRT, which is very close to the actual neutral hydrogen fraction xHI=0.52{x_{\rm HI}=0.52}.

The above result demonstrates the prospects of constraining the mass-averaged neutral fraction using the technique discussed in this work. The SKA1-Low is expected to put much tighter constraints on the xHIx_{\rm HI}. A thorough analysis is required considering different bubble sizes and hydrogen neutral fractions at different redshifts for various instruments, and we leave this exercise for future work.

6 Summary & Discussion

Several bright QSOs/galaxies have recently been detected at z6z\gtrsim 6. We expect large ionized bubbles, with comoving sizes of tens of Mpc, around these sources, which are embedded in a neutral HI medium. We have conducted a detailed investigation into the prospects of detecting such individual ionized bubbles using observations of the HI 21-cm signal from the epoch of reionization by the uGMRT and SKA1-Low. We have mainly focused on two known sources: one very bright QSO at z=7.1z=7.1 [8] and a large ionized bubble around a collection of galaxies at redshift 8.38.3 [17].

First, we simulated the HI 21-cm signal around bright QSOs/galaxies matching the above cases and foreground contaminants in a 3D cube. Subsequently, we have generated visibilities corresponding to the HI 21-cm signal and foregrounds at baselines similar to the uGMRT and SKA1-Low experiments and added the system noise with the visibilities corresponding to the HI signal and foreground contaminants. We then employ a foreground mitigation technique based on a polynomial fitting algorithm to remove them from the total visibilities. The residual visibilities match well with the combination of the input HI signal and system noise. However, it is heavily dominated by the system noise. To minimize the effects of system noise in the estimated quantity, we employed a technique based on the matched filter algorithm. This technique is applicable in our case as the functional form of the desired HI 21-cm signal around bright sources is known. We find that the expected signal-to-noise ratio for the proposed estimator for the uGMRT is 5{\sim 5} for both 2000{\sim 2000} hours of observations at redshift 7.17.1 and 3000{\sim 3000} hours at redshift 8.38.3, when the mean HI fraction outside the targeted bubble is 0.9{\sim 0.9}. However, the required observing time increases to 5120{\sim 5120} hrs for the uGMRT at redshift 7.17.1 to achieve a similar SNR when the mean HI fraction is lower at 0.5{\sim 0.5}. The SKA1-Low will have a much higher SNR (810\sim 8-10) with just 100\sim 100 hrs of observations at same redshifts 7.17.1 and 8.38.3, when the mean HI fraction outside the targeted bubble is 0.9{\sim 0.9}. However, the required observing time increases to 200{\sim 200} hrs for the SKA1-Low at redshift 7.17.1 to achieve a similar SNR when the mean HI fraction is lower at 0.5{\sim 0.5}.

Apart from detecting the HI 21-cm signal, the technique is also able to measure the size of the ionized regions and HI neutral fraction. For the uGMRT 2000\sim 2000 hours of observations at redshift 7.17.1, we find that the estimated bubble size ranges between 222622-26 Mpc for 100100 independent noise realizations when the actual bubble size is 23.523.5 Mpc. Similar results are obtained for 5120{\sim 5120} hours of observations with the uGMRT at redshift 7.17.1 with a mean HI fraction of 0.5\sim 0.5 and for 3000{\sim 3000} hours at redshift 8.38.3 with a mean HI fraction of 0.9{\sim 0.9}. The SKA1-Low should measure the bubble size more accurately, with the uncertainty in bubble size estimation being less than 11 Mpc at both mentioned redshifts.

We have also explored the potential to constrain the neutral hydrogen fraction of the IGM outside the ionized bubble. We find that our technique can either directly measure the neutral hydrogen fraction or put an upper or lower limit on it, depending on the actual neutral fraction and the observational specifications.

The process of foreground subtraction using a polynomial along the frequency axis also removes some of the desired HI signal. We found that the fraction of subtracted HI signal increases when using smaller frequency bandwidth, significantly reducing the signal-to-noise ratio. We thoroughly investigated this effect using a wide range of bandwidths and found that the SNR increases by more than four times when the frequency bandwidth is increased from 55 MHz to 1616 MHz for bubble size 23.523.5 Mpc. We studied three particular cases here and results may vary if sizes of ionized bubbles and redshifts change. However, our methods can be applied to any similar cases.

Here, we focused on the targeted detection of ionized bubbles around sources previously detected by other telescopes. Several such sources have been observed during the reionization epoch. Additionally, the matched filter technique can detect ionized bubbles even without prior source detection. However, this unguided detection requires exploring a vast parameter space, which is computationally expensive. The underlying bubble may not be spherical, as assumed in our study. However, as long as the bubble is relatively isolated, the method remains applicable, though the SNR might be reduced. Furthermore, we note that our simulations do not account for the contributions of low-mass halos to reionization. This omission likely results in an underestimation of the diffuse and complex structure of the ionized regions. Accurately incorporating these contributions could, to some extent, degrade the predicted signal-to-noise ratio. Foregrounds are assumed to be spectrally smooth here, which may not be accurate in actual observations. A more sophisticated method for foreground mitigation may be needed in case foregrounds behave differently. We would also like to note that this study does not account for potential errors arising from improper calibration [64], ionospheric effects, or radio-frequency interference (RFI). These practical issues will be addressed in future works.

Acknowledgments

AM and CSM acknowledge financial support from Council of Scientific and Industrial Research (CSIR) via CSIR-SRF fellowships under grant no. 09/0096(13611)/2022-EMR-I and 09/1022(0080)/2019-EMR-I respectively. KKD acknowledges financial support from SERB (Govt. of India) through a project under MATRICS scheme (MTR/2021/000384). SM acknowledges financial support through the project titled “Observing the Cosmic Dawn in Multicolour using Next Generation Telescopes” funded by the Science and Engineering Research Board (SERB), Department of Science and Technology, Govt. of India through the Core Research Grant No. CRG/2021/004025. The authors acknowledge the use of computational resources available to the Inter-University Centre for Astronomy and Astrophysics (IUCAA), Pune, and to the Cosmology with Statistical Inference (CSI) research group at the Indian Institute of Technology Indore (IIT Indore).

References