This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

BP3M: Bayesian Positions, Parallaxes, and Proper Motions derived from the Hubble Space Telescope and Gaia data

Kevin A. McKinnon Department of Astronomy & Astrophysics, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA 95064, USA Andrés del Pino Centro de Estudios de Física del Cosmos de Aragón (CEFCA), Unidad Asociada al CSIC, Plaza San Juan 1, E-44001, Teruel, Spain Constance M. Rockosi Department of Astronomy & Astrophysics, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA 95064, USA Miranda Apfel Department of Astronomy & Astrophysics, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA 95064, USA Puragra Guhathakurta Department of Astronomy & Astrophysics, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA 95064, USA Roeland P. van der Marel Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA Center for Astrophysical Sciences, The William H. Miller III Department of Physics & Astronomy, Johns Hopkins University, Baltimore, MD 21218, USA Paul Bennet Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA Mark A. Fardal Eureka Scientific, 2452 Delmer Street, Suite 100, Oakland, CA 94602, U.S.A. Mattia Libralato AURA for the European Space Agency (ESA), ESA Office, Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA Sangmo Tony Sohn Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA Eduardo Vitral Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA Laura L. Watkins AURA for the European Space Agency (ESA), ESA Office, Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
(Received October 30, 2023; Revised FUTURE DATE; Accepted FUTURE DATE)
Abstract

We present a hierarchical Bayesian pipeline, BP3M, that measures positions, parallaxes, and proper motions (PMs) for cross-matched sources between Hubble Space Telescope (HST) images and Gaia – even for sparse fields (N<10N_{*}<10 per image) – expanding from the recent GaiaHub tool. This technique uses Gaia-measured astrometry as priors to predict the locations of sources in HST images, and is therefore able to put the HST images onto a global reference frame without the use of background galaxies/QSOs. Testing our publicly-available code in the Fornax and Draco dSphs, we measure accurate PMs that are a median of 8-13 times more precise than Gaia DR3 alone for 20.5<G<21mag20.5<G<21~{}\mathrm{mag}. We are able to explore the effect of observation strategies on BP3M astrometry using synthetic data, finding an optimal strategy to improve parallax and position precision at no cost to the PM uncertainty. Using 1619 HST images in the sparse COSMOS field (median 9 Gaia sources per HST image), we measure BP3M PMs for 2640 unique sources in the 16<G<21.5mag16<G<21.5~{}\mathrm{mag} range, 25% of which have no Gaia PMs; the median BP3M PM uncertainty for 20.25<G<20.75mag20.25<G<20.75~{}\mathrm{mag} sources is 0.44masyr10.44~{}\mathrm{mas}~{}\mathrm{yr}^{-1} compared to 1.03masyr11.03~{}\mathrm{mas}~{}\mathrm{yr}^{-1} from Gaia, while the median BP3M PM uncertainty for sources without Gaia-measured PMs (20.75<G<21.5mag20.75<G<21.5~{}\mathrm{mag}) is 1.16masyr11.16~{}\mathrm{mas}~{}\mathrm{yr}^{-1}. The statistics that underpin the BP3M pipeline are a generalized way of combining position measurements from different images, epochs, and telescopes, which allows information to be shared between surveys and archives to achieve higher astrometric precision than that from each catalog alone.

Proper motions (1295), Astrostatistics (1882), Milky Way stellar halo (1060), Milky Way Galaxy (1054), Stellar kinematics (1608), Dwarf galaxies (416)
journal: AAS Publishingfacilities: HST, Gaiasoftware: Astropy (Astropy Collaboration et al., 2013, 2018, 2022), corner (Foreman-Mackey, 2016), emcee (Foreman-Mackey et al., 2013), GaiaHub (del Pino et al., 2022), IPython (Pérez & Granger, 2007), jupyter (Kluyver et al., 2016), matplotlib (Hunter, 2007), numpy (Harris et al., 2020), pandas (Wes McKinney, 2010; pandas development team, 2020), scipy (Virtanen et al., 2020)

1 Introduction

High precision proper motions (PM) of individual stars have dramatically increased our understanding of kinematics in the Local Group. In particular, the Hubble Space Telescope (HST) has a rich history of providing the precise astrometry needed to make accurate motion measurements. This is thanks to detailed work over the last few decades to characterize and correct HST geometric distortions, describe point spread functions (PSFs), and define best practices for mapping images onto one another (e.g. Anderson & King, 2004, 2006; Anderson, 2007; Bellini & Bedin, 2009; Bellini et al., 2011). As a result, the PMs measured from multi-epoch HST imaging have been used to study key astronomical questions, such as the relative motion of the Milky Way (MW) and M31 (Sohn et al., 2012; van der Marel et al., 2012), the mass of the MW (Sohn et al., 2018), and MW stellar halo kinematic measurements in individual fields (Cunningham et al., 2019b).

More recently, the Gaia mission (Gaia Collaboration et al., 2018) has facilitated substantial leaps in Local Group science by providing full astrometric solutions for millions of stars. In the MW, these results include the identification of our most massive merger, the Gaia-Sausage-Enceladus (e.g Helmi et al., 2018; Belokurov et al., 2018; Mackereth et al., 2019), detailed inventories of the progenitors that built the MW (e.g. Naidu et al., 2020), and nearly-complete catalogs of the motions of nearby globular clusters (Vasiliev & Baumgardt, 2021).

While both of these telescopes have and will continue to fundamentally alter our knowledge about the local universe, they both have limitations. In HST’s case, the small field of view and relatively long observation times mean that only a small portion of the sky has been observed at multiple epochs with significant time baselines. For Gaia, their all-sky catalog necessarily does not see as faint as a standard HST observation, and the precision of their astrometric solutions decreases fairly significantly for their faintest magnitudes (G21magG\sim 21~{}\mathrm{mag}). Gaia’s fairly bright limiting magnitude for stars with precise PMs restricts the spatial resolution on which it can measure average kinematics in the stellar halo. Consequently, many of the previous studies cited above focus on stellar populations over large portions of the sky when measuring the kinematics of the MW stellar halo, but a growing body of work has shown the benefit of being able to resolve (chemo)dynamics on small spatial scales (e.g. Cunningham et al., 2019b; Iorio & Belokurov, 2021; McKinnon et al., 2023). With better sky velocities, we can improve constraints on the formation and evolution history of our Galaxy (e.g. Cunningham et al., 2022, 2023, Apfel et al. in prep).

To reduce the effect of these limitations and to increase the astrometric constraining power of either telescope alone, recent studies have been exploring how to combine information between datasets from different telescopes. Specifically, using archival HST images that have long time baselines with Gaia, PMs that are factors of 10 to 20 times more precise than Gaia alone have been measured for dwarf spheroidal galaxy and globular cluster stars (Massari et al., 2017, 2018, 2020; del Pino et al., 2022; Bennet et al., 2022), which have enabled internal velocity dispersion measurements. Techniques for cross-telescope combinations will become even more important as the field progresses further into the Big Data era of astronomy, especially as future missions come online (e.g. Nancy Grace Roman Space Telescope, Rubin, Euclid).

Regardless of where the astrometry comes from, PMs in the MW can be more challenging to measure than PMs of distant sources. This is largely because the apparent motion on the sky can be quite large and the motion from parallax can become significant. There are three main ways that a star can appear to move between successive images: (1) apparent offset because of statistical uncertainty on position, (2) offsets due to parallax, and (3) offsets due to PM. In studies of distant sources, the motion from parallax can often be ignored. Because of relatively small Gaia uncertainties for source positions, the impact that offsets between true positions and Gaia-measured positions have on PM measurements is usually small enough that it can also be ignored, especially as the time baseline increases. For extragalactic stars, then, all apparent motion is the result of PM, and the longer the time baseline between observations, the more precise and accurate the PM measurements. When the position uncertainties become large (e.g. for faint Gaia sources) and the parallaxes become substantial (e.g. D<1D<1 kpc), then detailed and simultaneous accounting of all three motion components becomes necessary. One additional difficulty in measuring MW PMs, especially in studies of the stellar halo, comes from the fact that many fields are quite sparse; this limits the number of sources that can be used to constrain the transformation parameters that align images from multiple epochs and impacts the accuracy and precision of the resulting PMs.

While the GaiaHub pipeline of del Pino et al. (2022) was developed for and performs well in populated fields, a key motivation of this work was to develop a complementary pipeline that can also handle very sparse fields (e.g. N<10N_{*}<10). To address the challenges listed previously, we create a hierarchical Bayesian pipeline, named BP3M (Bayesian Positions, Parallaxes, and Proper Motions), to simultaneously measure the positions, parallaxes, and PMs of all Gaia sources in an HST image while also finding the best mapping of HST images onto Gaia. This package is publicly available on Github111https://github.com/KevinMcK95/BayesianPMs and is designed to be used in combination with the GaiaHub code. The underlying statistics are general in that they apply to any two or more sets of position measurements separated by time, regardless of the telescope they come from. In this way, it is also a useful tool for planning future observations.

In this paper, we describe a standard approach for measuring PMs in Section 2, which then motivates the statistics and pipeline, BP3M, we present in Section 3. We examine and validate the pipeline’s outputs in Section 4 using synthetic data. We then highlight some applications of BP3M on real data in Section 5, including sparse fields in the MW and nearby dwarf spheroidals. Finally, our complete set of results are summarized in Section 6.

2 Measuring Proper Motions

To measure the motion of stars, one traditionally identifies the positions of sources at two epochs, measures the best transformation parameters between those two images, and transforms the coordinates of sources from one image into the other. The final offsets can then be used to measure the relative motion of each of the sources. When mapping one image onto another, a standard approach (Anderson, 2007) is to describe the position of a source in the first image, (X,Y)(X,Y), in coordinates of the second image (X,Y)(X^{\prime},Y^{\prime}) using:

(XY)=(abcd)[(XY)(X0Y0)]+(W0Z0)\left(\begin{matrix}X^{\prime}\\ Y^{\prime}\\ \end{matrix}\right)=\left(\begin{matrix}a&b\\ c&d\\ \end{matrix}\right)\cdot\left[\left(\begin{matrix}X\\ Y\\ \end{matrix}\right)-\left(\begin{matrix}X_{0}\\ Y_{0}\\ \end{matrix}\right)\right]+\left(\begin{matrix}W_{0}\\ Z_{0}\\ \end{matrix}\right) (1)

where the (X0,Y0)(X_{0},Y_{0}) and (W0,Z0)(W_{0},Z_{0}) vectors are the center of rotation in each coordinate system. The (a,b,c,d)(a,b,c,d) matrix accounts for differences in pixel scale, rotation, and skew as follows:

pixelscaleratio\displaystyle\mathrm{pixel~{}scale~{}ratio} =PSR=adbc,\displaystyle=\mathrm{PSR}=\sqrt{ad-bc}\ , (2a)
tanθ\displaystyle\tan\theta =(bca+d),\displaystyle=\left(\frac{b-c}{a+d}\right)\ , (2b)
onaxisskew\displaystyle\mathrm{on~{}axis~{}skew} =12(ad),\displaystyle=\frac{1}{2}(a-d)\ , (2c)
offaxisskew\displaystyle\mathrm{off~{}axis~{}skew} =12(b+c).\displaystyle=\frac{1}{2}(b+c)\ . (2d)

With this relationship, we see that there are 6 parameters that need to be fit when transforming one image onto another222We only need to fit for either (X0,Y0)(X_{0},Y_{0}) or (W0,Z0)(W_{0},Z_{0}) because we can fix one vector and the other will absorb those choices.. In general, this would only require the positions of 3 sources found in both images, but in practice, many more sources are required to measure a robust transformation solution.

To be clear, this technique measures the change in (X,Y)(X,Y) as a function of time, but these measurements are relative to the population of sources in the image. For instance, a collection of stars moving with the same PM will only have a difference in translation at two epochs; fitting for transformation parameters will then yield zero change in the relative positions of all the stars. This is still a very useful technique in the cases where the relative motion of stars is of interest, such as for measuring internal kinematics of clusters and galaxies.

To find the PM of the stars in an absolute reference frame, we require known anchor points that have no motion (e.g. background stationary sources like galaxies and QSOs). An alternative approach is to cross-match the stars in an image to an absolute-frame-calibrated survey to estimate the bulk velocity/average absolute velocity of the sources, which is the technique that the GaiaHub pipeline (del Pino et al., 2022) employs. In a high level overview, the GaiaHub pipeline steps are as follows:

  1. 1.

    Following previous techniques for HST data reduction (e.g. Bellini et al., 2017, 2018; Libralato et al., 2018, 2019, 2022), fit HST images with different point spread functions (PSFs) to identify sources and measure their positions using hst1pass (Anderson, 2022);

  2. 2.

    Cross-match HST sources with Gaia;

  3. 3.

    Find the transformation parameters that minimize the offsets between the HST source positions and Gaia positions for each image;

  4. 4.

    With the best fit transformation parameters in hand, any remaining offsets are divided by the time baseline to give the relative PM for each source in an image;

  5. 5.

    Use an average of Gaia-measured PMs to estimate the bulk velocity, giving PMs in an absolute frame;

  6. 6.

    Combine the multiple GaiaHub PM measurements of a source using an inverse-variance weighted average to get a final PM and uncertainty.

As is shown in del Pino et al. (2022), the pipeline performs well, especially in the medium density outskirts of nearby galaxies for which it was developed. Two key assumptions are required for GaiaHub results to accurately describe a population: (1) the parallax motion of the sources are insignificant, and (2) there are enough sources such that their PM distribution is close to Gaussian. The second point is particularly important because the offsets between the sources in different images, which determines the transformation parameters, are minimized when fitting for the transformation. If there are a small number of sources, the individual offsets have an out-sized impact on the resulting transformation parameters; sources with high offsets between images (such as a fast-moving foreground star), can torque the transformation solution around to try to reduce what is an intrinsically large separation. Similarly, in the limiting case where an image contains 3 sources, we can find transformation parameters such that there is no remaining offset, even if the sources truly have moved. In both cases, the transformation parameters are different from the truth, and this impacts the PM measurements of all sources in an image. GaiaHub allows users to remove the impact of outlier PMs (e.g. foreground stars) by defining a co-moving population or to “rewind” sources using the GaiaHub-measures PMs while iterating on the transformation fitting, but both of these modes don’t apply to halo populations that lack an obvious co-moving frame and have a small number of sources per HST image.

Motivated by the PM improvements that GaiaHub has shown when information from Gaia and HST is combined together, we create a tool that enables Gaia+HST PM measurements in sparse fields to study the MW stellar halo. Because these sparse fields have low stellar density (e.g. <20<20 Gaia sources per HST image) and can contain a significant fraction of nearby sources (e.g. foreground disk and nearby halo stars, which can have non-negligible parallax motion and relatively large sky velocities), our pipeline cannot use the same assumptions that go into GaiaHub. This realization is what ultimately informed our decision to model the motions and transformation parameters simultaneously, though we emphasize that the following pipeline benefits immensely from the GaiaHub project. Specifically, the PSF fitting to measure centroids in HST, the cross-matching between HST and Gaia, and the initial measurements of the transformation parameters from GaiaHub are integral components of the pipeline we present in the remainder of this work.

2.1 COSMOS Test Sample

To demonstrate the performance of GaiaHub in sparse MW halo fields, we turn to the COSMOS field (Nayyeri et al., 2017; Muzzin et al., 2013) from the Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (CANDELS; Grogin et al., 2011; Koekemoer et al., 2011, PIs: S. Faber, H. Ferguson). COSMOS is located at a high Galactic latitude and near the Galactic anti-center (l=236.8degl=236.8~{}\mathrm{deg}, b=42.1degb=42.1~{}\mathrm{deg}) and boasts HST imaging that covers a large area of sky: 2deg2\sim 2~{}\mathrm{deg}^{2} was imaged by ACS/WFC in 2003/2004, and 288 arcmin2 of that area (i.e. 4%\sim 4\%) was imaged again in 2011/2012. From simulated COSMOS-like data – presented in detail in Appendix B – PMs of halo and foreground disk stars in our magnitude range (i.e. 16<G<21.5mag16<G<21.5~{}\mathrm{mag}) can regularly be as large as 100masyr1\sim 100~{}\mathrm{mas}~{}\mathrm{yr}^{-1} in size.

We analyse all COSMOS HST images within a 0.5deg0.5~{}\mathrm{deg} radius of the field’s center, which is 39%\sim 39\% of the total COSMOS area. Figure 1 shows a histogram of the number of sources matched between the HST frames and Gaia for the 1619 HST images in our analysis. In these HST images, GaiaHub finds 2640 unique Gaia sources, with 4, 9, and 23 sources per image in the minimum, median, and maximum cases. To be clear, there are many repeat exposures at the same HST pointing as well as overlapping and multi-epoch regions of the COSMOS field, so all of the sources appear in more than one of the 1619 images.

Refer to caption
Figure 1: Histogram of the number of GaiaHub cross-matched sources between Gaia and HST for the 1619 HST images in COSMOS within 0.5deg0.5~{}\mathrm{deg} of the field’s center. There is a median of 9 Gaia sources in each HST frame, and a total of 2640 unique Gaia sources.
Refer to caption
Figure 2: Comparison of PMs measured by Gaia and GaiaHub for 1409 COSMOS sources. These sources are a subset of the 2640 unique COSMOS sources discussed in Figure 1 (1985 of which have Gaia-measured PMs) that GaiaHub determined to have well-measured Gaia PMs for rewinding to HST epochs. The PMs generally fall along or agree with the one-to-one line (dashed black diagonals in the left and middle panels) within their uncertainties, though there are also many PM measurements that disagree significantly. This is largely because the GaiaHub pipeline was not tuned for small numbers of sources per HST image or for the large PMs seen in the MW stellar halo.

A comparison of the Gaia- and GaiaHub-measured PMs for 1264 COSMOS sources is presented in Figure 2. The GaiaHub analysis was run using the --rewind_stars option, which iterates on the transformation fitting using Gaia (in the first iteration) and GaiaHub (in subsequent iterations) PMs to extract better solutions, especially when high-PM stars are present. However, the use of this option removes many sources that have poorly-measured Gaia PMs, resulting in more than half of the sources being rejected by GaiaHub in the COSMOS fields (i.e. 1659 used by GaiaHub from the 2640 possible unique sources). While more relaxed GaiaHub options produce PMs for all 2640 unique Gaia sources in the COSMOS images, these results are sensitive to small-number statistics and poorly-measured Gaia PMs that can shift the reference frame solution and yield extreme disagreements with Gaia.

In general, the GaiaHub and Gaia PMs follow the one-to-one line, and many PM measurements agree within their uncertainties. However, there are also some PMs that show significant disagreement between the two catalogs, as is evident in the rightmost panel. Using Mahalanobis distances between the PM measurements (i.e. accounting for their uncertainties) for sources with <5σ<5\sigma disagreement between Gaia and GaiaHub PMs, we find that the GaiaHub PM uncertainties would need to be inflated by a factor of 1.7\sim 1.7 to explain the PM differences. For 10%\sim 10\% these PMs, the GaiaHub measurements are more than 5σ5\sigma different from the corresponding Gaia values, whereas a sample size of 1000\sim 1000 should expect to see no deviations this large.

Because GaiaHub is not tuned for sparse fields, it is not surprising to see these disagreements. Some of this disparity comes from the fact that GaiaHub is only able to propagate realistic uncertainties on the transformation parameters to the measured PMs when there are several repeat HST observations of the same field. For much of COSMOS, there are only 4\sim 4 HST observations per field, which is not enough samples to obtain statistically robust propagation using GaiaHub’s approach. While the transformation uncertainty is usually a small component of the total error budget when there are a large number of sources, the transformation uncertainties in the sparse images of COSMOS are significant, so many of the GaiaHub PM uncertainties are underestimated. Handling these sparse field cases therefore requires a new tool.

3 Bayesian Positions, Parallaxes, and Proper Motions: The BP3M Pipeline

We build a hierarchical Bayesian tool, BP3M, that measures transformation parameters to map HST images onto Gaia while also simultaneously measuring the PMs, parallaxes, and positions for the sources in an image. While it may appear computationally prohibitive to measure PMs, parallaxes, position of all stars in an image and the transformation parameters simultaneously, a few statistical tricks – namely conjugacy between likelihood and prior distributions – make this approach feasible. The pipeline is able to consider multiple HST images concurrently in a proper Bayesian fashion, which can significantly improve precision for the different motion components of each source.

We use the Gaia-measured positions, parallaxes, and PMs and corresponding uncertainties/covariance as prior distributions to describe each source position over time. This improves the measured astrometric solution because the priors allow the measured positions of the sources to be compared to their expected positions at the epoch of each image, rather than comparing the individual position measurements at each time. This approach is quite general in that it does not require identifying a clean kinematic sample or reference stars and background sources. In many cases, bright foreground stars with well-measured Gaia astrometry can serve as anchors that help define the transformation solution.

Throughout the statistics presented in this section and Appendix A, we refer to HST and Gaia in the subscripts of the variables, but there is nothing about this math or its implementation in BP3M that restricts us to only these telescopes. The statistical statements are generally true for any collection of measurements with at least two positions of the same source – it applies to HST+HST, HST+Roman, HST+Gaia+Roman, etc – though there are a minimum of three sources per image needed to be able to measure the 6 transformation parameters. For our COSMOS analysis, all HST images have 4 or more Gaia sources.

Table 1: Definitions of fitting parameters. In general, HH refers to an HST value, GG refers to an Gaia value, an apostrophe indicates a prior measurement, jj refers to the HST image number, and ii refers to the source index.
Parameter Description
(aj,bj,cj,dj)\left(a_{j},b_{j},c_{j},d_{j}\right) Transformation matrix parameters for the jj-th HST frame to the jj-th pseudo Gaia frame
(X0,j,Y0,j)(X_{0,j},Y_{0,j}) Center coordinate in the jj-th HST frame
(W0,j,Z0,j)(W_{0,j},Z_{0,j}) Center coordinate of jj-th pseudo Gaia frame
(XH,i,j,YH,i,j)(X_{H,i,j}^{\prime},Y_{H,i,j}^{\prime}) Observed coordinate of ii-th source in the jj-th HST frame
σH,i,j\sigma_{H,i,j} GaiaHub-measured pixel position uncertainty (in both x and y) of the ii-th source in the jj-th HST frame
(XG,i,j,YG,i,j)(X_{G,i,j}^{\prime},Y_{G,i,j}^{\prime}) Observed coordinate of ii-th source in the jj-th pseudo Gaia frame
PSG,j\mathrm{PS}_{G,j} Pseudo pixel scale of jj-th pseudo Gaia frame
θiT=(αi,δi)\vec{\theta}_{i}^{\prime T}=(\alpha_{i}^{\prime},\delta_{i}^{\prime}) Gaia-measured position vector for the ii-th source
(σα,i,σδ,i,ρα,δ,i)(\sigma_{\alpha*,i}^{\prime},\sigma_{\delta,i}^{\prime},\rho_{\alpha*,\delta,i}^{\prime}) Gaia-measured position uncertainties and correlation for the ii-th source
plxi\mathrm{plx}^{\prime}_{i} Gaia-measured parallax for the ii-th source
σplx,i\sigma_{\mathrm{plx},i}^{\prime} Gaia-measured parallax uncertainty for the ii-th source
μiT=(μα,i,μδ,i)\vec{\mu}_{i}^{\prime T}=(\mu_{\alpha*,i}^{\prime},\mu_{\delta,i}^{\prime}) Gaia-measured PM vector for the ii-th source
(σμα,i,σμδ,i,ρμα,μδ,i)(\sigma_{\mu_{\alpha*},i}^{\prime},\sigma_{\mu_{\delta},i}^{\prime},\rho_{\mu_{\alpha*},\mu_{\delta},i}^{\prime}) Gaia-measured PM uncertainties and correlation for the ii-th source
ΔθT=(Δαi,Δδi)\vec{\Delta\theta}^{T}=(\Delta\alpha*_{i},\Delta\delta_{i}) True position offset vector for the ii-th source
plxi\mathrm{plx}_{i} True parallax for the ii-th source
μiT=(μα,i,μδ,i)\vec{\mu}_{i}^{\prime T}=(\mu_{\alpha*,i},\mu_{\delta,i}) True PM vector for the ii-th source
tG=J2016.0t_{G}=\mathrm{J2016.0} Time of Gaia frame observation
tH,jt_{H,j} Time of jj-th HST frame observation
Δtj=tH,jtG\Delta t_{j}=t_{H,j}-t_{G} Time between Gaia and jj-th HST frame (negative for older HST images)
Δplxi,j\vec{\Delta\mathrm{plx}}_{i,j} Offset-per-parallax vector for ii-th source between Gaia and jj-th HST frame

3.1 The Statistics

This subsection will be quite technical in detail, so readers who are less interested in the formal statistics are encouraged to skip to Section 3.2.

First, we define many of the key variables and parameters in Table 1. While we are ultimately concerned with the PMs in RA and Dec, it is convenient to work in a 2D plane projection (e.g. what a detector sees) when comparing images. Because Gaia’s measurements do not correspond to positions in a true image, we follow GaiaHub’s approach of transforming the Gaia coordinates into XY coordinates on a pseudo image using a tangent-plane projection

rG,i,j\displaystyle r_{G,i,j} =sin(δG,j,0)sin(δG,i)\displaystyle=\sin\left(\delta_{G,j,0}\right)\sin\left(\delta_{G,i}^{\prime}\right) (3a)
+cos(δG,j,0)cos(δG,j)cos(αG,iαG,j,0),\displaystyle+\cos\left(\delta_{G,j,0}\right)\cos\left(\delta_{G,j}^{\prime}\right)\cos\left(\alpha_{G,i}^{\prime}-\alpha_{G,j,0}\right)\ ,
rad2pixG,j\displaystyle\mathrm{rad2pix}_{G,j} =180π36001000PSG,j1pixelsradian,\displaystyle=\frac{180}{\pi}\cdot 3600\cdot 1000\cdot\mathrm{PS}_{G,j}^{-1}~{}\mathrm{\frac{pixels}{radian}}\ , (3b)
XG,i,j\displaystyle X_{G,i,j} =XG,j,0rad2pixG,j\displaystyle=X_{G,j,0}-\mathrm{rad2pix}_{G,j} (3c)
×cos(δG,i)sin(αG,iαG,j,0)r,\displaystyle\times\frac{\cos\left(\delta_{G,i}^{\prime}\right)\sin\left(\alpha_{G,i}^{\prime}-\alpha_{G,j,0}\right)}{r}\ ,
YG,i,j\displaystyle Y_{G,i,j} =YG,j,0\displaystyle=Y_{G,j,0} (3d)
+rad2pixG,j[cos(δG,j,0)sin(δG,i)r\displaystyle+\mathrm{rad2pix}_{G,j}~{}\left[\frac{\cos\left(\delta_{G,j,0}\right)\sin\left(\delta_{G,i}^{\prime}\right)}{r}\right.
sin(δG,j,0)cos(δG,i)cos(αG,iαG,j,0)r],\displaystyle\left.-\frac{\sin\left(\delta_{G,j,0}\right)\cos\left(\delta_{G,i}^{\prime}\right)\cos\left(\alpha_{G,i}^{\prime}-\alpha_{G,j,0}\right)}{r}\right]\ ,

where the (αG,i,δG,i)(\alpha_{G,i}^{\prime},\delta_{G,i}^{\prime}) are the Gaia-measured coordinates of source ii in radians, (αG,j,0,δG,j,0)\left(\alpha_{G,j,0},\delta_{G,j,0}\right) are the coordinates of the center of the Gaia pseudo image when considering HST image jj, (XG,j,0,YG,j,0)\left(X_{G,j,0},Y_{G,j,0}\right) are the pixel coordinates of the center of the Gaia pseudo image, and (XG,i,j,YG,i,j)(X_{G,i,j},Y_{G,i,j}) are the coordinates of source ii in the Gaia pseudo image in pixels. The (αG,j,0,δG,j,0)\left(\alpha_{G,j,0},\delta_{G,j,0}\right) and (XG,j,0,YG,j,0)\left(X_{G,j,0},Y_{G,j,0}\right) are inherited from the GaiaHub-measured values for each HST image, which come from an average of the source positions, and these values are not left as free parameters during the fitting process. The chosen pixel scale PSG,j\mathrm{PS}_{G,j} for each Gaia pseudo image is set to the nominal HST pixel scale that it is paired with (e.g. 50maspixel150~{}\mathrm{mas}\cdot\mathrm{pixel}^{-1} for HST’s ACS/WFC). Then, changes in the RA and Dec can be transformed to changes in XY using the Jacobian matrix:

𝑱i,j=(δXG,i,jδαG,iδXG,i,jδδG,iδYG,i,jδαG,iδYG,i,jδδG,i)\boldsymbol{J}_{i,j}=\left(\begin{matrix}\frac{\delta X_{G,i,j}}{\delta\alpha*_{G,i}^{\prime}}&\frac{\delta X_{G,i,j}}{\delta\delta_{G,i}^{\prime}}\\ \frac{\delta Y_{G,i,j}}{\delta\alpha*_{G,i}^{\prime}}&\frac{\delta Y_{G,i,j}}{\delta\delta_{G,i}^{\prime}}\\ \end{matrix}\right)

which is usually very close to

𝑱j=1PSG,j(1001)\boldsymbol{J}_{j}=\frac{1}{\mathrm{PS_{G,j}}}\left(\begin{matrix}-1&0\\ 0&1\\ \end{matrix}\right)

when (Δα,Δδ)(\Delta\alpha*,\Delta\delta) are in mas and (ΔXG,i,j,ΔYG,i,j)(\Delta X_{G,i,j},\Delta Y_{G,i,j}) are Gaia pseudo pixels. We note that this simplification only holds for small field-of-view detectors. In the cases of large PMs and large time baselines, the off-diagonal elements can start to become important, so we opt to use the more general version in our pipeline.

To set up the probability model, there are a few remaining key terms that we need to define:

𝑹j=(ajbjcjdj)\boldsymbol{R}_{j}=\left(\begin{matrix}a_{j}&b_{j}\\ c_{j}&d_{j}\\ \end{matrix}\right)

which is the transformation matrix for the jj-th HST frame to the jj-th pseudo Gaia frame,

𝑽H,i,j=(σH,i,j200σH,i,j2)\boldsymbol{V}_{H,i,j}=\left(\begin{matrix}\sigma_{H,i,j}^{2}&0\\ 0&\sigma_{H,i,j}^{2}\\ \end{matrix}\right)

which is the GaiaHub-measured pixel position covariance matrix for the ii-th source in the jj-th HST frame,

𝑽θ,i=(σα,i2ρα,δ,iσα,iσδ,iρα,δ,iσα,iσδ,iσδ,i2)\boldsymbol{V}_{\theta,i}=\left(\begin{matrix}\sigma_{\alpha*,i}^{\prime 2}&\rho_{\alpha*,\delta,i}^{\prime}\cdot\sigma_{\alpha*,i}^{\prime}\cdot\sigma_{\delta,i}^{\prime}\\ \rho_{\alpha*,\delta,i}^{\prime}\cdot\sigma_{\alpha*,i}^{\prime}\cdot\sigma_{\delta,i}^{\prime}&\sigma_{\delta,i}^{\prime 2}\\ \end{matrix}\right)

which is the Gaia-measured position covariance matrix for the ii-th source, and finally

𝑽μ,i=(σμα,i2ρμα,μδ,iσμα,iσμδ,iρμα,μδ,iσμα,iσμδ,iσμδ,i2)\boldsymbol{V}_{\mu,i}=\left(\begin{matrix}\sigma_{\mu_{\alpha*},i}^{\prime 2}&\rho_{\mu_{\alpha*},\mu_{\delta},i}^{\prime}\cdot\sigma_{\mu_{\alpha*},i}^{\prime}\cdot\sigma_{\mu_{\delta},i}^{\prime}\\ \rho_{\mu_{\alpha*},\mu_{\delta},i}^{\prime}\cdot\sigma_{\mu_{\alpha*},i}^{\prime}\cdot\sigma_{\mu_{\delta},i}^{\prime}&\sigma_{\mu_{\delta},i}^{\prime 2}\\ \end{matrix}\right)

which is the Gaia-measured PM covariance matrix for the ii-th source. In these equations and the ones that will follow, our convention is to show matrices (and matrices of matrices) using a bold-faced typesetting.

The fundamental relationships between the prior Gaia measurements and the true parameter for source ii are given by:

θi\displaystyle\vec{\theta}_{i}^{\prime} =θi+Δθi,\displaystyle=\vec{\theta}_{i}+\vec{\Delta\theta}_{i}\ , (4a)
p(Δθi|Δθi)\displaystyle p(\vec{\Delta\theta}_{i}^{\prime}|\vec{\Delta\theta}_{i}) =𝒩(Δθi|Δθi,𝑽θ,i),\displaystyle=\mathcal{N}\left(\vec{\Delta\theta}_{i}^{\prime}|\vec{\Delta\theta}_{i},\boldsymbol{V}_{\theta,i}\right)\ , (4b)
p(plxi|plxi)\displaystyle p(\mathrm{plx}_{i}^{\prime}|\mathrm{plx}_{i}) =𝒩(plxi|plxi,σplx,i),\displaystyle=\mathcal{N}\left(\mathrm{plx}_{i}^{\prime}|\mathrm{plx}_{i},\sigma^{\prime}_{\mathrm{plx},i}\right)\ , (4c)
p(μi|μi)\displaystyle p(\vec{\mu}_{i}^{\prime}|\vec{\mu}_{i}) =𝒩(μi|μi,𝑽μ,i),\displaystyle=\mathcal{N}\left(\vec{\mu}_{i}^{\prime}|\vec{\mu}_{i},\boldsymbol{V}_{\mu,i}\right)\ , (4d)

which says that the Gaia-measured values are offset from the true values by noise dictated by their Gaia-measured uncertainties. While we have explicitly included a mean for the Gaia-measured position offset vector Δθi\vec{\Delta\theta}_{i}^{\prime} to be as general as possible, in practice Δθi=0\vec{\Delta\theta}_{i}^{\prime}=\vec{0} because Gaia has no expected offset from the positions it reports. In the form of the probability statements above, we have assumed that there is no correlation between, for example, parallax and μα\mu_{\alpha*}, which is not exactly correct because Gaia has measurements for these correlations. Our choices, however, make the following math easier, though it comes at the cost of some additional constraining power from the Gaia priors being ignored. The Gaia-measured correlations between these parameters are indeed included in future sections of this work when we compare the BP3M distributions to Gaia’s results. Future versions of the BP3M pipeline will work to incorporate these prior correlations, which will lead to even tighter posterior distributions on position, parallax, and PMs.

While it may seem nonphysical, Gaia observed parallax measurements can be negative (Lindegren et al., 2018), but it is important to remember that the observed parallax values define the mean of a distribution whose width/uncertainty usually places a large amount of probability in positive parallaxes; in this way, we must treat the Gaia astrometric measurements as distributions and not individual points. Luri et al. (2018), for example, explain how the definitions of motion on the sky (both by Gaia as well as in this work) technically allow for negative observed parallax values, which is especially likely to occur when the position uncertainty is relatively large compared to the size of the parallax motion (e.g. see their Section 3 and Figure 2); as a result, they remind the reader that the Gaia observed parallaxes should not be thought of as a direct measurement of distance, and instead, distances need to be estimated by proper statistical modelling of the information contained in the astrometric solution distributions.

Because some of the sources in an HST image have no Gaia-measured parallaxes or PMs, we find it useful to put a population/global prior on the PMs and parallaxes:

p(plxi|plx^,σplx^)\displaystyle p(\mathrm{plx}_{i}|\hat{\mathrm{plx}},\sigma_{\hat{\mathrm{plx}}}) =𝒩(plxi|plx^,σplx^),\displaystyle=\mathcal{N}\left(\mathrm{plx}_{i}|\hat{\mathrm{plx}},\sigma_{\hat{\mathrm{plx}}}\right)\ , (5a)
p(μi|μ^,𝑽μ^)\displaystyle p(\vec{\mu}_{i}|\hat{\mu},\boldsymbol{V}_{\hat{\mu}}) =𝒩(μi|μ^,𝑽μ^),\displaystyle=\mathcal{N}\left(\vec{\mu}_{i}|\hat{\mu},\boldsymbol{V}_{\hat{\mu}}\right)\ , (5b)

which says that there are some global distributions that the true parallaxes and PMs of the sources originate from. While we are free to play with the parameters of the population distributions, we note that they do need to be Gaussian in form so that we retain the necessary conjugacy. We choose to use diffuse hyperpriors, with the goal of minimally impacting the sources with Gaia-measured parallaxes and PMs while offering some guidance to the sources without. In the future, the parallax global prior could be made more constraining in cases where the distances are better known (e.g. clean populations of extra-Galactic stars) or using a magnitude-dependent parallax prior to better incorporate our understanding of the distribution of stars in the MW. Similar changes could also be made to the PM global prior. For the current version of the pipeline, which focuses on faint stars in the MW stellar halo and extra-Galactic sources, we choose the parallax prior to have a mean of 0.5 mas and width of 10.0 mas; when generating synthetic MW thick disk and halo stars (described in Appendix B), we find that these choices contain 99.9% of the synthetic parallaxes within 1σ1\sigma and 100% of the synthetic parallaxes within 3σ3\sigma (maximum parallax of 28\sim 28 mas). For the PM prior, we use the sources with Gaia-measured PMs to estimate a mean and covariance matrix, and then we multiply that covariance matrix by a factor of 10210^{2} to guard against the possibility that some of the sources without Gaia PMs are significantly different in their PM from the other sources.

The information linking Gaia to HST image jj for source ii is then given by:

ΔdG,i,j\displaystyle\vec{\Delta d}_{G,i,j} =(XG,i,jYG,i,j)𝑹j(XH,i,jX0,jYH,i,jY0,j)(W0,jZ0,j),\displaystyle=\left(\begin{matrix}X_{G,i,j}\\ Y_{G,i,j}\\ \end{matrix}\right)-\boldsymbol{R}_{j}\cdot\left(\begin{matrix}X_{H,i,j}-X_{0,j}\\ Y_{H,i,j}-Y_{0,j}\\ \end{matrix}\right)-\left(\begin{matrix}W_{0,j}\\ Z_{0,j}\\ \end{matrix}\right)\ , (6a)
Δmi,j\displaystyle\vec{\Delta m}_{i,j} =μiΔtj+plxiΔplxi,jΔθi,\displaystyle=\vec{\mu}_{i}\cdot\Delta t_{j}+\mathrm{plx_{i}}\cdot\vec{\Delta\mathrm{plx}}_{i,j}-\vec{\Delta\theta}_{i}\ , (6b)
ΔdG,i,j\displaystyle\vec{\Delta d}_{G,i,j} 𝒩(𝑱i,jΔmi,j,𝑽d,i,j=𝑱i,j𝑽θ,i𝑱i,jT\displaystyle\sim\mathcal{N}\left(\boldsymbol{J}_{i,j}\cdot\vec{\Delta m}_{i,j},\boldsymbol{V}_{d,i,j}=\boldsymbol{J}_{i,j}\cdot\boldsymbol{V}_{\theta,i}\cdot\boldsymbol{J}_{i,j}^{T}\right.
+𝑹j𝑽H,i,j𝑹jT),\displaystyle\left.\hskip 113.81102pt+\boldsymbol{R}_{j}\cdot\boldsymbol{V}_{H,i,j}\cdot\boldsymbol{R}_{j}^{T}\right)\ , (6c)

which says that the measured offset between the Gaia and HST positions (after applying the transformation) in the pseudo Gaia image is distributed around the offset implied by the sum of the motion from PM, parallax, and uncertainty in position. As mentioned back in the description of Equation 1, we are able to hold (X0,j,Y0,j)(X_{0,j},Y_{0,j}) fixed as constants while the (W0,j,Z0,j)(W_{0,j},Z_{0,j}) are left as free parameters during the fitting. We note that the relationship in Equation 6 ignores the impact that radial motion has on changing the distance to a star between observations (e.g. see see Section 2.3.1 of van de Ven et al., 2006) – and therefore the magnitude of the PM and parallax – though this is a safe assumption because the distance change is almost always extremely small (e.g. a star with LOS velocity of 1000kms11000~{}\mathrm{km}~{}\mathrm{s}^{-1} would only experience a distance change of 0.01 pc after 10 years).

The Δplxi,j\vec{\Delta\mathrm{plx}}_{i,j} term, which we refer to as the parallax offset vector, is a 2D vector that defines the direction and magnitude of the offset between two observations as a result of parallax motion for a source with a parallax of 1 mas; in this way, the parallax offset vector can be multiplied by any parallax value (i.e. plxiΔplxi,j\mathrm{plx}_{i}\cdot\vec{\Delta\mathrm{plx}}_{i,j}) to find the appropriate offset in (Δα,Δδ)(\Delta\alpha*,\Delta\delta) between two observations as a result of parallax motion. To measure the Δplxi,j\vec{\Delta\mathrm{plx}}_{i,j}, we use the HST and Gaia observation times, (tH,j,tG)(t_{H,j},t_{G}), the position of the source in Gaia, (αi,δi)(\alpha_{i}^{\prime},\delta_{i}^{\prime}), and built-in functions of astropy (Astropy Collaboration et al., 2013, 2018, 2022). Examples of the parallax motion for different positions on the sky are shown in Figure 3, where the Gaia observation time (i.e. J2016.0) is at the origin and the orbits trace out the parallax motion over the course of a year; the parallax offset vector at any time is simply the vector that connects the corresponding point on the ellipse to the origin.

Refer to caption
Figure 3: Examples of parallax in the plane of the sky for different fields. The origin (black point) is the time that corresponds to Gaia observations (i.e. J2016.0), and each ellipse is the path that a star with a parallax of 1mas1~{}\mathrm{mas} sweeps out on the sky over the course of a year, with the direction of motion shown by the arrowhead. These ellipses demonstrate why the ability to constrain precise parallaxes is highly dependent on the exact observation times as well as position on the sky.

With the above definitions in hand, we can use the distributions on the ΔdG,i,j\vec{\Delta d}_{G,i,j} vectors for all the sources in all images we are considering to construct the following likelihood distribution:

p(𝑿𝑯,𝒀𝑯,𝑿𝑮,𝒀𝑮|a,b,c,d,W0,Z0,𝝁,𝐩𝐥𝐱,𝚫𝜽)=i=1nj=1nim𝒩(ΔdG,i,j|𝑱i,jΔmi,j,𝑽d,i,j).\begin{split}p(&\boldsymbol{X_{H}},\boldsymbol{Y_{H}},\boldsymbol{X_{G}},\boldsymbol{Y_{G}}|\vec{a},\vec{b},\vec{c},\vec{d},\vec{W_{0}},\vec{Z}_{0},\boldsymbol{\mu},\boldsymbol{\mathrm{plx}},\boldsymbol{\Delta\theta})\\ &=\prod_{i=1}^{n_{*}}\prod_{j=1}^{n_{im}}\mathcal{N}\left(\vec{\Delta d_{G,i,j}}|\boldsymbol{J}_{i,j}\cdot\vec{\Delta m}_{i,j},\boldsymbol{V}_{d,i,j}\right).\end{split} (7)

We note here that we have purposefully chosen to omit writing out the dependency on a few of the explanatory variables (e.g. Δtj\Delta t_{j}, the covariance matrices) for ease of reading the math.

From Bayes’ Law, we arrive at the following posterior:

p(a,b,c,d,W0,Z0,𝝁,plx,𝚫𝜽|𝑿𝑯,𝒀𝑯,𝑿𝑮,𝒀𝑮,plx^,σplx^,μ^,𝑽μ^)[j=1nimp(aj,bj,cj,dj,W0,j,Z0,j)]i=1n{p(μi|μi)p(μi|μ^,𝑽μ^)p(plxi|plxi)p(plxi|plx^)p(Δθi|Δθi)j=1nim𝒩(ΔdG,i,j|𝑱i,jΔmi,j,𝑽d,i,j)}\begin{split}p&\left(\vec{a},\vec{b},\vec{c},\vec{d},\vec{W_{0}},\vec{Z}_{0},\boldsymbol{\mu},\vec{\mathrm{plx}},\boldsymbol{\Delta\theta}|\right.\\ &\hskip 56.9055pt\left.\boldsymbol{X_{H}},\boldsymbol{Y_{H}},\boldsymbol{X_{G}},\boldsymbol{Y_{G}},\hat{\mathrm{plx}},\sigma_{\hat{\mathrm{plx}}},\hat{\mu},\boldsymbol{V}_{\hat{\mu}}\right)\\ \propto&\left[\prod_{j=1}^{n_{im}}p(a_{j},b_{j},c_{j},d_{j},W_{0,j},Z_{0,j})\right]\cdot\\ &\prod_{i=1}^{n_{*}}\left\{p(\vec{\mu}_{i}^{\prime}|\vec{\mu}_{i})\cdot p(\vec{\mu}_{i}|\hat{\mu},\boldsymbol{V}_{\hat{\mu}})\cdot p(\mathrm{plx}_{i}^{\prime}|\mathrm{plx}_{i})\cdot p(\mathrm{plx}_{i}|\hat{\mathrm{plx}})\cdot\right.\\ &\hskip 18.49411pt\left.p(\vec{\Delta\theta}_{i}^{\prime}|\vec{\Delta\theta}_{i})\cdot\prod_{j=1}^{n_{im}}\mathcal{N}\left(\vec{\Delta d_{G,i,j}}|\boldsymbol{J}_{i,j}\cdot\vec{\Delta m}_{i,j},\boldsymbol{V}_{d,i,j}\right)\right\}\end{split} (8)

where p(aj,bj,cj,dj,W0,j,Z0,j)p(a_{j},b_{j},c_{j},d_{j},W_{0,j},Z_{0,j}) is the prior distribution on the transformation parameters and the remaining distributions have been defined above. In many of our initial analyses, we chose to use a flat prior on the transformation parameters, which yielded reasonable results. However, we soon realized from our analysis of a few hundred HST ACS/WFC images that there are relatively tight constraints for our expectations of the pixel scale ratio and skew terms. To that end, when using HST ACS/WFC data, we choose to employ a few relatively diffuse/not-overly-constraining priors on the skews, pixel scale ratio, and HST angle, as well as the (W0,j,Z0,j)(W_{0,j},Z_{0,j}) vector:

p(skewj\displaystyle p(\mathrm{skew}_{j} |HSTACS/WFC)\displaystyle|\mathrm{\textit{HST}~{}ACS/WFC}) (9a)
=𝒩(skewj|μ=skewj,σ=5×104),\displaystyle=\mathcal{N}\left(\mathrm{skew}_{j}|\mu=\mathrm{skew}_{j}^{\prime},\sigma=5\times 10^{-4}\right)\ ,
p(PSRj\displaystyle p(\mathrm{PSR}_{j} |HSTACS/WFC)\displaystyle|\mathrm{\textit{HST}~{}ACS/WFC}) (9b)
=𝒩(PSRj|μ=PSRj,σ=5×104),\displaystyle=\mathcal{N}\left(\mathrm{PSR}_{j}|\mu=\mathrm{PSR}_{j}^{\prime},\sigma=5\times 10^{-4}\right)\ ,
p(θj)\displaystyle p(\theta_{j}) =𝒩(θj|μ=θj,σ=1),\displaystyle=\mathcal{N}\left(\theta_{j}|\mu=\theta_{j}^{\prime},\sigma=1^{\circ}\right)\ , (9c)
p(W0,j)\displaystyle p(W_{0,j}) =𝒩(W0,j|μ=W0,j,σ=10pixels),\displaystyle=\mathcal{N}\left(W_{0,j}|\mu=W_{0,j}^{\prime},\sigma=10~{}\mathrm{pixels}\right)\ , (9d)
p(Z0,j)\displaystyle p(Z_{0,j}) =𝒩(Z0,j|μ=Z0,j,σ=10pixels),\displaystyle=\mathcal{N}\left(Z_{0,j}|\mu=Z_{0,j}^{\prime},\sigma=10~{}\mathrm{pixels}\right)\ , (9e)

where we use the same prior on both the on- and off-axis skew terms. The prior means of each distribution come from the parameters measured by the previous iteration of transformation parameter fitting; in the case of the first iteration, we use the GaiaHub outputs. In this way, we tell the pipeline that the transformation solution is likely nearby the previous iteration’s solution, while the relatively large widths allows the values to change significantly if necessary. To transform from priors in angle, skews, and PSR, we need to account for a transformation Jacobian:

𝑱TR,j=(δonskewjδajδoffskewjδajδPSRjδajδθjδajδonskewjδbjδoffskewjδbjδPSRjδbjδθjδbjδonskewjδcjδoffskewjδcjδPSRjδcjδθjδcjδonskewjδdjδoffskewjδdjδPSRjδdjδθjδdj)\boldsymbol{J}_{\mathrm{TR,j}}=\left(\begin{matrix}\frac{\delta~{}\mathrm{on~{}skew}_{j}}{\delta a_{j}}&\frac{\delta~{}\mathrm{off~{}skew}_{j}}{\delta a_{j}}&\frac{\delta\mathrm{PSR}_{j}}{\delta a_{j}}&\frac{\delta\theta_{j}}{\delta a_{j}}\\ \frac{\delta~{}\mathrm{on~{}skew}_{j}}{\delta b_{j}}&\frac{\delta~{}\mathrm{off~{}skew}_{j}}{\delta b_{j}}&\frac{\delta\mathrm{PSR}_{j}}{\delta b_{j}}&\frac{\delta\theta_{j}}{\delta b_{j}}\\ \frac{\delta~{}\mathrm{on~{}skew}_{j}}{\delta c_{j}}&\frac{\delta~{}\mathrm{off~{}skew}_{j}}{\delta c_{j}}&\frac{\delta\mathrm{PSR}_{j}}{\delta c_{j}}&\frac{\delta\theta_{j}}{\delta c_{j}}\\ \frac{\delta~{}\mathrm{on~{}skew}_{j}}{\delta d_{j}}&\frac{\delta~{}\mathrm{off~{}skew}_{j}}{\delta d_{j}}&\frac{\delta\mathrm{PSR}_{j}}{\delta d_{j}}&\frac{\delta\theta_{j}}{\delta d_{j}}\\ \end{matrix}\right)

such that

p(aj,bj,cj,dj)=p(onskewj,offskewj,PSRj,θj)|𝑱TR,j|.p(a_{j},b_{j},c_{j},d_{j})=p(\mathrm{on~{}skew}_{j},\mathrm{off~{}skew}_{j},\mathrm{PSR}_{j},\theta_{j})\cdot|\boldsymbol{J}_{\mathrm{TR,j}}|.

It should be noted that these priors may not always apply to all sets of ACS/WFC data, so this may need to be adjusted for other fields/applications.

The complete forms of the posterior conditional distributions for each motion component or source ii are listed in Appendix A, defining the following important distributions:

p(plxi|a,b,c,d,W0,Z0,),\displaystyle p(\mathrm{plx}_{i}|\vec{a},\vec{b},\vec{c},\vec{d},\vec{W_{0}},\vec{Z}_{0},\dots)\ , (10a)
p(μi|plxi,a,b,c,d,W0,Z0,),\displaystyle p(\vec{\mu}_{i}|\mathrm{plx}_{i},\vec{a},\vec{b},\vec{c},\vec{d},\vec{W_{0}},\vec{Z}_{0},\dots)\ , (10b)
p(Δθi|μi,plxi,a,b,c,d,W0,Z0,),\displaystyle p(\vec{\Delta\theta_{i}}|\vec{\mu}_{i},\mathrm{plx}_{i},\vec{a},\vec{b},\vec{c},\vec{d},\vec{W_{0}},\vec{Z}_{0},\dots)\ , (10c)

where the ellipsis refers to the other variables that each distribution depends on. One key takeaway from the statistics that went into constructing the results in Appendix A is that the conjugacy between all distributions (i.e. all distributions are normally distributed/Gaussians) allows us to fairly easily combine multiple distributions to arrive at well-defined and closed-form posterior distributions.

With these posterior conditionals in hand, we can perform the following steps for each source for a given set of transformation parameters:

  1. 1.

    Draw samples of plxi\mathrm{plx}_{i} from p(plxi|)p(\mathrm{plx}_{i}|\dots);

  2. 2.

    Use those plxi\mathrm{plx}_{i} samples to draw samples of μi\vec{\mu}_{i} from p(μi|plxi,)p(\vec{\mu}_{i}|\mathrm{plx}_{i},\dots);

  3. 3.

    Use those (μi,plxi)\left(\vec{\mu}_{i},\mathrm{plx}_{i}\right) samples to draw Δθi\vec{\Delta\theta_{i}} samples from p(Δθi|μi,plxi,)p(\vec{\Delta\theta_{i}}|\vec{\mu}_{i},\mathrm{plx}_{i},\dots).

Once we have samples of the PMs, parallaxes, and position offsets for each source, we can calculate the posterior probability of a set of transformation parameters. We do this by marginalizing over the individual samples of the PMs, parallaxes, and position offsets using Bayes’ Law such that

p(a,b,c,d,W0,Z0|)=p(a,b,c,d,W0,Z0,𝝁,plx,𝚫𝜽|)i=1np(μi,plxi,Δθi|a,b,c,d,W0,Z0,).\begin{split}p(&\vec{a},\vec{b},\vec{c},\vec{d},\vec{W_{0}},\vec{Z}_{0}|\dots)\\ &=\frac{p(\vec{a},\vec{b},\vec{c},\vec{d},\vec{W_{0}},\vec{Z}_{0},\boldsymbol{\mu},\vec{\mathrm{plx}},\boldsymbol{\Delta\theta}|\dots)}{\prod_{i=1}^{n_{*}}p(\vec{\mu}_{i},\mathrm{plx}_{i},\vec{\Delta\theta_{i}}|\vec{a},\vec{b},\vec{c},\vec{d},\vec{W_{0}},\vec{Z}_{0},\dots)}.\end{split} (11)

which is independent of the particular values of the PMs, parallaxes, and positions of each source. This relationship is the key that makes the simultaneous fitting of the 6 transformation parameters per image and 5 motion parameters per source feasible; because we can quickly draw the PM, parallax, and position samples directly from a known posterior distribution given a set of transformation parameters, we can efficiently calculate the posterior probability of the transformation parameters alone. Because we can calculate posterior probabilities for a given set of transformation parameters, we are able to sample those parameters from the posterior distribution using a Metropolis-Hastings (MH) MCMC algorithm. Then, for each sample of transformation parameters, we can sample from the posterior conditional distributions of parallaxes, PMs, and position offsets for each source. In this way, the uncertainty of the transformation fitting is propagated to the resulting PM, parallaxes, and positions.

3.2 The Pipeline

In terms of implementing the statistics in the BP3M pipeline, the general steps are as follows:

  1. 1.

    Read in the position, parallax, PM data from Gaia, where it exists, and the corresponding positions in the HST frames from the GaiaHub output catalogs333To be clear, GaiaHub analyses the HST images, and BP3M operates on the outputs of the HST image analysis from GaiaHub. BP3M does not interact with the raw HST images, though we will use “HST images” as a synonym for “GaiaHub-produced HST analysis catalog” throughout the text of this work.;

  2. 2.

    Use the GaiaHub transformation parameter values as starting guesses;

  3. 3.

    Using sources with Gaia priors on PM, estimate the global PM distribution;

  4. 4.

    Draws samples for the transformation parameters using MH-MCMC, evaluating posterior probabilities using only sources with Gaia priors when measuring the transformation;

  5. 5.

    Given a set of transformation parameters, draw samples of positions, parallaxes, and PMs for all sources;

  6. 6.

    Identify outliers (e.g. bad cross-matches between Gaia and HST) as sources that have more than 2σ2\sigma disagreement between the expected and observed positions in each HST image;

  7. 7.

    Re-estimate the global PM distribution using the new posteriors;

  8. 8.

    Repeat the fitting process using the non-outlier sources, including sources without Gaia priors;

  9. 9.

    Compare the new list of outliers to a union of the new list and the most recent previous list of outliers. If the new list length is more than 10% different from the length of the union, repeat the fitting with the new list of outliers, and stop otherwise.

This process requires a minimum of two iterations to achieve good results, though it isn’t uncommon for the outlier list to change enough that a third iteration is required. In terms of processing time, running BP3M on a 2016 MacBook Pro for an HST image with 200\sim 200 sources takes approximate 15 minutes to complete its analysis, while an image with 10\sim 10 sources can be analysed in around 5 minutes. Increasing the number of HST images being analysed together effectively multiplies the computation time by the number of images; this is largely because we need to fit 6 additional transformation parameters for each additional image, and necessarily need to increase the number of MCMC walkers and number of MCMC iterations. Best practice is to analyse all images independently before combining them to save time on searching the transformation parameter space.

Because the HST images are mapped onto the global reference frame of Gaia, the transformation parameters that BP3M measures are also useful in constraining non-Gaia sources. Once the best set of transformation parameters between HST and Gaia have been measured, we can return to the list of all sources in the HST image as measured by GaiaHub, which can include sources much fainter than Gaia can see (i.e. G>21.5magG>21.5~{}\mathrm{mag}). When comparing multiple HST images at various epochs, these faint sources can be cross-matched with each other to recover their PMs and parallaxes to much fainter magnitudes. While the pipeline currently offers this feature, it is currently untested and optimized, but preliminary results suggest this approach will be fruitful. It may prove particularly useful in very sparse fields where the number of Gaia sources in each HST image is small, but the shared source list between the HST frames is large.

3.3 Caveats

One key assumption that is built in to GaiaHub – and therefore BP3M – comes from how HST sources are cross-matched with Gaia. An HST source is matched with a Gaia source if they are nearest neighbours within some angular distance of each other (e.g. GaiaHub default of 5 HST pixels, 250mas\sim 250~{}\mathrm{mas}, though this can be changed), which ultimately sets upper limits on the sizes of PMs that can be measured. This works quite well in medium to low density regions where the stars are likely far enough from each other with respect to their motion between images. In high density regions or for fast moving stars, the closest pair of sources in successive images are not necessarily the correct matching. While future versions of our pipeline may adjust the cross-matching technique to reduce this confusion (e.g. rewind the Gaia sources using Gaia-measured PMs, where they exist, before cross-matching and defining matches probabilistically), we stress that our work focuses on the low to medium density regions where the cross-matching assumptions are valid.

As a back-of-the-envelope test, the cross-matching technique is a good assumption when the average distance between stars in an HST image is twice as large as the average position change from PM:

ρ1/2>2Δtμ¯.\rho_{*}^{-1/2}>2\cdot\Delta t\cdot\bar{\mu}. (12)

where ρ\rho_{*} is the stellar number density in area on the sky. We can use this equation to determine different limits for choices of time baselines, densities, and average PM sizes. For example, with ACS/WFC’s 4096×40964096\times 4096 pixel detector with pixel scale of 50maspixel150~{}\mathrm{mas}\cdot\mathrm{pixel}^{-1}, a time baseline of 15 years, and an average PM of 100masyr1100~{}\mathrm{mas}~{}\mathrm{yr}^{-1}, we find that there would need to be greater than 46604660 stars in the image for there to be a significant amount of confusion in the cross-matching; as these numbers are similar to HST images in the COSMOS field, this implies that sparse regions in the halo have very low risk of incorrect cross-matching. In denser regions, like nearby galaxies, a time-baseline of 15 years and average PM of 5masyr15~{}\mathrm{mas}~{}\mathrm{yr}^{-1} implies that a limiting stellar number density of 1/9pixel2=4.4×105mas21/9~{}\mathrm{pixel}^{-2}=4.4\times 10^{-5}~{}\mathrm{mas}^{-2}. In practice, we would like the factor of 2 to be even larger (e.g. 5 or 10) to be safe, but this sets the threshold. Future versions of the pipeline will likely explore improvements to the cross-matching between HST and Gaia using the posterior positions and motion measurements from BP3M.

4 Validation with Synthetic Data

Refer to caption
Figure 4: Corner plot of the posterior samples (black points, lines, and histograms) of the transformation parameters when fitting an image with 200 stars with a 15 year time-baseline. The blue lines show the locations of the input parameters used to create the synthetic data, and the values of the transformation matrix (a,b,c,d(a,b,c,d) have been multiplied by 1000 for clarity. The chosen input parameters are representative of real transformation solutions measured by GaiaHub for HST ACS/WFC images.

To understand how the pipeline performs in ideal conditions where we know the input transformation parameters and stellar motions perfectly, we first generate synthetic, COSMOS-like data. This process is described in detail in Appendix B. In summary, the synthetic sources are a mix of foreground thick disk stars and halo stars covering the 16<G<21.5mag16<G<21.5~{}\mathrm{mag} range with realistic Gaia measurements and uncertainties of position, parallax, and PM. Like with real Gaia data, sources with G>21magG>21~{}\mathrm{mag} have no Gaia-measured PMs or parallaxes. We create synthetic HST observations of these sources – as well as the corresponding GaiaHub-like output catalogs that BP3M expects to use for initial guesses of the transformation parameters – while varying the numbers of sources per HST image and time baselines. Finally, the data from the synthetic catalogs are analysed by BP3M. We emphasize that creating synthetic data with different configurations (e.g. transformation parameters, time baselines, stellar velocities and distance distributions) is quite straightforward using our technique, and it is not necessarily restricted to only HST-like observations. Our method is a useful avenue for estimating the impact that future observations or telescopes can have on stellar motion measurements as well as designing best practices.

4.1 Recovering Transformation Parameters

While the comparison with real data will follow in Section 5, we begin by testing the pipeline on synthetic data to see how well we can recover the input transformation parameters and stellar motion (i.e. position, parallax, and PM) used to generate the synthetic HST image. Figure 4 shows the posterior distributions on the transformation parameters that BP3M measures for one synthetic HST image that has 200 sources and a 15 year time baseline from Gaia; the black points and histograms show the posterior draws, while the blue lines show the true values used to generate the image. The posterior distribution and the input values agree very strongly with each other. The chosen input transformation parameters are representative of real transformation solutions as measured by GaiaHub.

Refer to caption
Figure 5: 6D distance of the transformation parameters (i.e. the parameters in Figure 4) from their true values for 300 realizations of synthetic HST images with time baselines from 5 to 15 years and number of sources from 5 to 200. The blue histogram shows the data, while the orange curve is the expected distribution; the agreement between these histograms show that the BP3M pipeline does a good job of recovering the input transformation parameters within their uncertainties.

We repeat this process of generating synthetic images while changing the number of sources and the time baseline, and then measure how far the posterior distribution is from the truth. Specifically, we use the posterior samples of the transformation parameters (e.g. the black points in Figure 4) to define a 6D posterior mean vector, v\vec{v}, and corresponding 6×66\times 6 covariance matrix, 𝑽v\boldsymbol{V}_{\vec{v}}. The 6D distance between the posterior mean vector and the truth vector, v\vec{v}^{\prime}, is then defined using the Mahalanobis distance metric

D=((vv)T𝑽v1(vv))1/2.\begin{split}D=\left((\vec{v}-\vec{v}^{\prime})^{T}\cdot\boldsymbol{V}_{\vec{v}}^{-1}\cdot(\vec{v}-\vec{v}^{\prime})\right)^{1/2}.\end{split} (13)
Refer to caption
Figure 6: Comparison of Gaia, GaiaHub, and BP3M PMs for 200 synthetic COSMOS-like stars with a 15 year time baseline from Gaia. From left to right, The upper panels shows the difference of the Gaia, GaiaHub, and BP3M PMs from the true PM values, and all axes share the same limits. In the GaiaHub and BP3M panels, points in green or blue show PMs of sources that are bright enough to have synthetic Gaia parallax and PM measurements and magenta or orange points show PMs of sources that are too faint for Gaia to have measurements. The bottom two panels compare the PM vector uncertainty size between Gaia, GaiaHub, and BP3M, with the right panel showing the division of the black points by the green and blue points from the left panel.

This is analogous to the 1D case of dividing the absolute difference between a mean value and the truth by the uncertainty. Likewise, the units of DD can be thought of as the number of σ\sigma between the truth and mean, and we will use this definition (for varying dimensions of v\vec{v}) when studying residuals throughout this work. When we analyse 300 synthetic HST images with time baselines between 5 and 15 years from Gaia and 5 to 200 sources per image, we get the blue histogram in Figure 5; the orange distribution shows the expected outcome – namely, a χ\chi distribution with 6 degrees of freedom and a scale of 1.0 – which agrees well with the measured outputs. This result shows that the pipeline does a good job of recovering the input transformation parameters within the posterior uncertainties.

4.2 Recovering Motions

Refer to caption
Figure 7: Comparison of BP3M-measured posterior PM’s with the truth for 5 sets of synthetic HST images, each with 200 sources and a 15 year time baseline from Gaia. The points are colored by whether the BP3M sources have Gaia parallax and PM priors (blue), or are too faint to have Gaia parallax and PM priors (orange). The uncertainty lines are the eigenvalues of the posterior covariance matrix, scaled so that each line corresponds to 68% probability. All the points are clustered around (0,0), implying that BP3M recovers trustworthy PMs.
Refer to caption
Figure 8: Comparison of uncertainty-scaled distance of the 5D posterior vectors (i.e. 2D position, parallax, and 2D PM) from the truth. The blue histogram represents the BP3M data for the 1000 synthetic stars in Figure 7, and the orange histogram is the expected distribution (χ\chi with 5 degrees of freedom and a scale of 1.0). The agreement between these two curves is evidence that the pipeline is recovering good posterior 5D vectors with realistic uncertainties.
Refer to caption
Figure 9: Same as Figure 6, but with 10 synthetic COSMOS-like stars. The top panels share axis limits. The BP3M PM improvement factor is less significant than in the bottom right panel of Figure 6, owing to the larger uncertainty in the transformation parameter fitting because of the smaller number of sources.

Next, we analyse how well the pipeline is able to recover the true motions of the sources in each synthetic image of MW halo stars. We also analyse the synthetic data using a GaiaHub-like approach, though we cannot pass the exact synthetic data to GaiaHub because it requires raw HST images that are not currently included in the simulations. Still, the GaiaHub analysis of synthetic data is similar to what GaiaHub would produce after iteration on the transformation parameter fitting (e.g. using the --rewind_stars option), which makes it the same as the analysis mode used to analyze the real COSMOS data in Figure 2.

An example comparison of the PMs for Gaia, GaiaHub, and BP3M is shown in Figure 6 for the same synthetic image that is considered in Figure 4, where in each panel we compare the observed PMs to the known truth. When showing GaiaHub and BP3M results (top middle and right panels), we color the sources by whether there are Gaia-measured parallaxes and PMs to be used as priors; green or blue points designate sources with (“Gaia Prior”) and magenta or orange points designate sources without (“No Gaia Prior”). To be precise, all of the sources have Gaia-measured priors on their position at J2016.0, but not all sources have Gaia priors on their parallaxes and PMs.

The first key takeaway from this figure is that the GaiaHub and BP3M PMs are clustered closer to the true values (i.e. the origin in the upper panels) when compared to Gaia alone: the black points in the Gaia panel represent the same sources as the green points in the GaiaHub panel and the blue points in the BP3M panel. In the GaiaHub panel, the green points near the origin have a systematic trend along the y=xy=x line that is not apparent in the BP3M panel. Investigating this trend using different synthetic data with different PM populations, it seems that large PMs are likely the cause. When focusing on smaller-magnitude PM populations (e.g. the extragalactic populations that GaiaHub was developed for), this systematic trend disappears. As a result, the PM uncertainties in the GaiaHub panel (and the corresponding points in the lower panels) need to be 2\sim 2 times larger on average to explain the difference between the measured PMs and the truth, whereas the BP3M PM uncertainties completely capture any differences with the truth. While GaiaHub and BP3M both show improvements over Gaia alone, BP3M handling of large PMs (e.g. as expected in the MW halo) and the accuracy of the PM uncertainties help motivate why BP3M’s creation was necessary.

Next, the bottom panels show that the size of the PM uncertainty has decreased significantly when HST information is combined with Gaia using BP3M; the left panel compares the PM uncertainties, while the right panel shows how much smaller the BP3M PM uncertainties are, compared to the Gaia PM uncertainties, as a function of magnitude. As mentioned in the previous paragraph, the GaiaHub uncertainties would need to be a factor of 2 larger than the values shown in this figure to explain the offsets of the measured PMs from the truth. Overall, this figure shows that combining the HST and Gaia data with BP3M not only increases PM precision, it also increases accuracy. To be clear, BP3M is not simply shrinking the Gaia PM distribution around the same mean as a result of an increased time baseline and number of images, it is truly improving the astrometric solution of each source.

The particular pattern of PM uncertainties (i.e. a lower branch around 0.2masyr10.2~{}\mathrm{mas}~{}\mathrm{yr}^{-1} and an upper branch around 1.5masyr11.5~{}\mathrm{mas}~{}\mathrm{yr}^{-1}) is a consequence of our choices in modelling the synthetic data, which is detailed in Appendix B. As a brief summary, the HST position uncertainties are based on real GaiaHub-analysed sources as a function of magnitude in COSMOS. At a given magnitude, some of the sources have well-measured positions, while others are less constrained, which leads to the PM uncertainty bifurcation shown in Figure 6.

When considering the uncertainty on a vector, there are a few different approaches one can take. In this work, we are concerned with 2D vectors (e.g. positions, PMs), 3D vectors (e.g. parallax plus 2D PM), and 5D vectors (i.e. 2D position, parallax, and 2D PM) of the different motion components. Because the different components of these vectors can have differing units, and because there can be substantial correlations in the covariance matrices we measure, we choose not to use the standard metric of the quadrature sum of the individual uncertainties of each component. Instead, when we compare vectors that have associated covariance matrices, we are interested in how much the size of that covariance matrix has changed, which includes the effect of correlation. We define the size of a vector’s uncertainty to be the determinant of the covariance matrix to the 1/(2d)1/(2d) power, where dd is the vector’s number of dimensions:

σv=|𝑽v|1/(2d)||\sigma_{\vec{v}}||=|\boldsymbol{V}_{\vec{v}}|^{1/(2d)} (14)

where 𝑽v\boldsymbol{V}_{\vec{v}} is the covariance matrix of v\vec{v}. With this definition, the area/volume created by the vector’s covariance matrix, i.e. the determinant |𝑽v||\boldsymbol{V}_{\vec{v}}|, is equal to the area/volume of a purely diagonal covariance matrix:

𝑽σv=(σv2000σv2000σv2).\boldsymbol{V}_{||\sigma_{\vec{v}}||}=\left(\begin{matrix}||\sigma_{\vec{v}}||^{2}&0&\dots&0\\ 0&||\sigma_{\vec{v}}||^{2}&&\vdots\\ \vdots&&\ddots&0\\ 0&\dots&0&||\sigma_{\vec{v}}||^{2}\\ \end{matrix}\right).

In this case, the resulting uncertainty size allows for the correlations between parameters to impact the certainty about the vector’s position, which serve to shrink the volume defined by that covariance matrix. For the remainder of this work, where we compare the uncertainty sizes of different vectors between Gaia and BP3M, we will be using the vector uncertainty size definition of Equation 14.

As an illustrative example, a highly-correlated measurement might have large uncertainties in all of the individual vector components, but the probability distribution implied by its covariance matrix covers only a small volume of parameter space owing to the high correlation between the components. Here, the quadrature sum of individual uncertainties would yield a large result, implying we know little about the true value of the vector, whereas the vector uncertainty size in Equation 14 would return a small value. This small value tells us that the volume of possible values that the vector can occupy is quite small because of the relationship between the dimensions. For a 2D vector vT=(x,y)\vec{v}^{T}=(x,y), the vector uncertainty size is given by:

σv=(σx2σy2[1ρx,y2])1/4\langle\sigma_{\vec{v}}\rangle=\left(\sigma_{x}^{2}\cdot\sigma_{y}^{2}\cdot\left[1-\rho_{x,y}^{2}\right]\right)^{1/4} (15)

where σx\sigma_{x} and σy\sigma_{y} are the corresponding uncertainty in each of the vector components and ρx,y\rho_{x,y} is the correlation coefficient between xx and yy.

To account for differences that individual realizations might have on the posterior BP3M PMs, we repeat the measurements of Figure 6; that is, we create 5 separate synthetic HST images that have 200 sources and a time baseline of 15 years. For the different realizations, the list of randomly chosen sources all come from the same synthetic catalog of COSMOS-like data, but each realization will have a slightly different set of true PMs, parallaxes, positions, magnitudes, and numbers of sources with/without Gaia priors. The differences of the posterior PMs from the truth for these 5×200=10005\times 200=1000 sources are shown in Figure 7, where the sources with Gaia-measured parallaxes and PMs are in blue and the converse are in orange. As expected for a well-behaved pipeline, the difference distribution clusters around the origin, and we see that the sources with Gaia priors on their motion have smaller posterior uncertainties than those without.

Of course, the PMs are only 2 of the 5 dimensions of the vector measured for each source, so we also compare the 5D vector (2D position, parallax, 2D PM) to the truth, again using the distance definition of Equation 13; this distribution for the 1000 stars in Figure 7 is given as the blue histogram in Figure 8, and it shows remarkable agreement with the expected χ\chi distribution with 5 degrees of freedom and a scale of 1. These figures show that the pipeline recovers the true positions, parallaxes, and PMs of all sources within their posterior uncertainties.

Finally, we explore the impact that the number of sources in an image have on the PM improvement factor when comparing BP3M to Gaia. An example analysis of a synthetic HST image with 10 sources and a 15 year time baseline is given in Figure 9. While the BP3M pipeline again finds good agreement with the truth, the median PM improvement factor over Gaia alone is only 1.39 for the sources in this synthetic image, which is much smaller than the median factor of 8.3 seen for the sources in the 20.5<G<21mag20.5<G<21~{}\mathrm{mag} range of Figure 6. This is largely because the smaller number of sources aren’t able to place as strong a constraint on the transformation parameters’ distribution, and the larger transformation parameter distribution propagates to the PM distributions of individual sources.

Comparing the GaiaHub and BP3M panels, there is good agreement between the PM means for this particular realization of synthetic data, which should not be taken for granted considering the large GaiaHub outliers that can appear in real COSMOS data (e.g. 10%\sim 10\% of the PMs in Figure 2). When we compare the PM uncertainties, however, we see significant differences between the two pipelines. In fact, the GaiaHub uncertainties would need to be increased by a factor of 8 to explain the differences with the true PMs, while the BP3M PMs agree with the truth within their uncertainties. Some of the PM uncertainty differences come from the fact that the GaiaHub analysis is only able to propagate realistic transformation solution uncertainties to the final PMs if there are several HST images of the same field, which is not the case when considering a single synthetic image. The impact of the transformation uncertainty is likely negligible for relatively large numbers of sources (e.g. N>100N>100, as discussed in del Pino et al., 2022), but it becomes quite important in sparse fields and when there are only a handful of repeat HST observations of the same field (e.g. N=4N=4 for many fields in COSMOS).

While it is fairly straightforward to estimate how much the GaiaHub uncertainties need to be inflated by using comparisons with the Gaia-measured PMs where they exist, it is much more difficult to assess and correct for the presence of systematics (e.g. as seen in the GaiaHub PMs of Figure 6) in sparse fields. Fortunately, these sparse halo conditions are what BP3M was developed for, and the synthetic data analysis provide evidence that the pipeline provides accurate PMs and uncertainties here.

We leave out example figures exploring the effect that a changing time baseline has on the posterior PMs because it follows expectations exactly; namely, for the same position uncertainty between two images, a smaller time baselines leads to larger PM uncertainties (e.g. Equation 2 of del Pino et al., 2022).

4.3 Optimizing Observing Strategies for Positions and Parallaxes

Our synthetic analysis also allows us to measure the improvement factors in the parallaxes and 2D position vectors of the sources. These results show, not unexpectedly, that the precision of these measurements is strongly tied (but not limited) to the magnitude of the source, the time baseline between images, the number of sources per image, and the effect of the position on the sky on the shape and orientation of the parallax motion. While we cannot change the time baselines of archival HST images, we can explore how changes in the observations times would have impacted the resulting BP3M parallaxes and positions. These lessons are particularly useful for planning of future observations using any telescope.

Refer to caption
Figure 10: Comparison of the uncertainties on position, parallax, and PM for 50 synthetic HST-observed stars in the COSMOS field. The points are colored by different observation strategies (i.e. time baseline offsets from Gaia) with the legend names described in the text. Note that some of the y-axes are in log scale while others are linear. The two bottom-most panels correspond to the 3D vector of parallax and PM, and the 5D vector of position, parallax, and PM; the units of these axes are a result of our definition of vector uncertainty in Equation 14.

As before, we create synthetic HST images of COSMOS stars, but in this analysis, we do not let the HST position uncertainty of sources change as a function of magnitude based on GaiaHub examples (e.g. as discussed in Appendix B). Instead, we choose to set the HST position uncertainty to 0.01 pixels, which is the centroid uncertainty that previous studies have measured for well-behaved stars (e.g. Bellini et al., 2011). We adopt the single, well-behaved value to limit the effect that any particular choice/realization of position uncertainties may have on the conclusions we make about optimal observing strategies. Similarly, we choose to use 50 stars in each synthetic HST image to reduce the impact of transformation solution uncertainty as seen in low-source-count images.

Our initial tests suggest that there is minimal improvement on the Gaia-measured parallax or position vectors when using only one HST image (i.e. one HST in combination with Gaia). Additionally, multiple HST images taken very nearby in time (e.g. the multiple exposures taken to guard against cosmic rays) effectively act as a single epoch and are useful for improving the position uncertainty, but don’t offer much information to the parallax. When multiple HST images with common targets and substantial enough time baselines are analysed together, then the parallaxes and positions begin to show improvement. To better understand how parallax and position precision depend on observation times, we choose to simulate 3 epochs of COSMOS observations with HST, with time offsets from Gaia near 12, 8, and 4 years. Specifically, we make the observations occur at the following times:

  1. 1.

    No Gaia Offset: All HST observations at year-multiples of the Gaia observation time,
    Δt=[12.0,8.0,4.0]years\Delta t=\left[-12.0,-8.0,-4.0\right]~{}\mathrm{years};

  2. 2.

    Half Gaia Offset: All HST observations at half-a-year offset from the Gaia date, Δt=[11.5,7.5,3.5]years\Delta t=\left[-11.5,-7.5,-3.5\right]~{}\mathrm{years};

  3. 3.

    Apocenter-then-Quarter: The earliest HST observations at the parallax orbit apocenter farthest from Gaia, then offset by a quarter year for the other epochs,
    Δt=[11.61,7.36,3.86]years\Delta t=\left[-11.61,-7.36,-3.86\right]~{}\mathrm{years};

  4. 4.

    Alternating Apocenters: The earliest HST observations at the parallax orbit apocenter farthest from Gaia, then alternating offsets by half a year for the other epochs,
    Δt=[11.61,7.11,3.61]years\Delta t=\left[-11.61,-7.11,-3.61\right]~{}\mathrm{years}.

The resulting position, parallax, PM uncertainties from analysing these HST observations concurrently is given in Figure 10, where the two bottom-most panels show the 3D (i.e. parallax, PM) and 5D (i.e. position, parallax, PM) vector uncertainties. This figure reveals many important lessons about the impact that relatively small time offsets of observations within a year have on the stellar measurements.

From the top three panels, we see that different observing strategies lead to significant differences in the resulting position and parallax precisions, though there are minimal differences between the PM uncertainties. This implies that improvements in parallax and position can be achieved with no cost to the PM precision. We emphasize, however, that this is only after analysing all 3 HST epochs together; when considering a single epoch at a time, sources without Gaia priors (i.e. G>21magG>21~{}\mathrm{mag}) do indeed have significantly larger PM uncertainties if there is an offset from the Gaia date as a result of the degeneracy between PM and parallax motions. Our results suggest that completed surveys with multiple HST epochs may be able to improve position and parallax measurements for free through careful design.

Of the observing strategies we consider (including many that are not represented in Figure 10), the best choice for improving positions and parallaxes was to use an alternating-parallax-orbit-apocenters approach, which is the 4th option in the list above and the red points in the figure. This strategy suggests that the parallax is most improved – and therefore the degeneracy between PM and parallax motions most disentangled – when we consider the average parallax orbit of the sources in a given field (e.g. the blue orbit in Figure 3 for COSMOS). The parallax ellipse can be used to identify the points in time that are most distant from each other (i.e. the extremes of the semi-major axis), which are therefore the easiest to tell apart for a given position uncertainty. Then, the best approach is to first observe at the time that is most offset from the Gaia observation time, and subsequent observations alternate by half a year such that they jump back and forth between the two apocenters. This approach also appears to have a large improvement on the position uncertainty, likely caused by the repeated sampling of the source’s position at the same time in the parallax orbit.

5 Applications with Real Data

We test the pipeline using real data in four key ways: (1) using well-studied nearby dwarf spheroidal (dSph) galaxies, (2) using cross-matches with an external QSO catalog, (3) using cross-matches with a PM catalog in COSMOS derived from multi-epoch HST imaging, and (4) using the full 2000\sim 2000 unique COSMOS sources presented in Section 2.1.

5.1 Comparison with Nearby Dwarf Spheroidals

Refer to caption
Figure 11: Comparison of PMs using 6 HST images of Fornax dSph analysed concurrently; these HST images were all taken using ACS/WFC in the F775W filter at the same epoch and same RA and Dec. The exposure time of each image is 250sec\sim 250~{}\mathrm{sec} and the time offset from Gaia is 12.8 years. The panels are similar to Figure 6, except the upper panels show measured PM values instead of offsets from known PMs of synthetic data; the upper left panel shows Gaia measurements, the middle panel shows GaiaHub-measured PMs, and the right panel shows BP3M posteriors. In all of the top panels, the 55 sources identified as cluster members by del Pino et al. (2022) are plotted with more-opaque points. Sources in the GaiaHub and BP3M panels are colored by whether Gaia-measured PMs exist (i.e. 91 sources with Gaia PMs and 107 without Gaia PMs). The GaiaHub and BP3M panels cover the same range of PM values to visually compare the PM clustering, with that range shown with a red box in the Gaia panel.
Refer to caption
Figure 12: Same as Figure 12, but analyzing 10 HST images of Draco concurrently. These HST images were all taken using ACS/WFC in the F606W filter at the same RA and Dec, with an average exposure time of 215sec\sim 215~{}\mathrm{sec}. The images span three epochs, with time offsets from Gaia of 11.2, 9.2, and 2.2 years. There are 79 sources with Gaia PMs and 6 without Gaia PMs. del Pino et al. (2022) identifies 62 member stars in this sample.
Refer to caption
Figure 13: Comparison of parallax and PM improvement factor to Gaia (i.e. size of Gaia parallax or PM uncertainty divided by size of BP3M parallax or PM uncertainty) for Fornax dSph as a function of the number of HST images used. These measurements concern the same data presented in Figure 12. While the most significant improvement happens when combining a single HST image with Gaia (i.e. median factor of 8.6\sim 8.6 for 20.5<G<21mag20.5<G<21~{}\mathrm{mag}), there is still a 14%\sim 14\% improvement for the faintest sources between considering one HST image and considering six. There is also a very slight improvement on the parallax uncertainties at the faintest magnitudes (for one image, median factor of 1.01\sim 1.01 for 20.5<G<21mag20.5<G<21~{}\mathrm{mag}), though the HST observations occurring at the same approximate time make it difficult to improve the parallax precision.
Refer to caption
Figure 14: Same as Figure 13, but for Draco instead of Fornax. These measurements concern the same data presented in Figure 12. Note that the number of images jumps from 5 to 10 on the colorbar. In contrast to the Fornax results, there is a more-noticeable improvement in the parallax uncertainties at the faintest magnitudes when the number of images increases (e.g. 18%\sim 18\% median improvement in precision for 20.5<G<21mag20.5<G<21~{}\mathrm{mag} when Nim=10N_{im}=10). This is likely a result of the HST images occurring at different epochs.

We analyse HST images using BP3M in nearby dSph galaxies to see how the posterior PMs and parallaxes compare to Gaia. This serves as a test with real data, as well as more proof that BP3M is improving the astrometric solutions for the sources it measures. We also show how analysing multiple HST images together impact the resulting PM and parallax precision.

A comparison of Gaia, GaiaHub, and BP3M measured PMs is presented in Figure 12 for the Fornax dwarf spheroidal. For the BP3M PMs, 6 HST images at the same epoch (time baseline from Gaia of 12.8 years) with a total of 198 unique sources are analysed together. The GaiaHub PMs are the same as in presented in del Pino et al. (2022) from analysis of the same 6 HST images, but in this case, a co-moving sample has been defined before transformation fitting to identify likely cluster members. While this figure is similar to Figure 6, the top panels are slightly different in that they show PM measurements instead of offsets from known PMs of synthetic data. Sources without Gaia-measured parallaxes and PMs are colored orange in the BP3M panel, which show larger uncertainties than the sources with Gaia priors in blue. The axes of the GaiaHub and BP3M panels have been set to cover the same range of values for easier visual comparison, and this range is 10\sim 10 times smaller than in the Gaia panel. As before, the lower panels compare the PM uncertainties. Figure 12 shows another PM comparison, this time using 10 HST images of the Draco dwarf spheroidal, taken at 3 different epochs (baselines of 11.2, 9.2, and 2.2 years from Gaia).

First, these figures show a significant tightening of the PMs when Gaia and HST are combined using either GaiaHub or BP3M. This suggests that the increase in PM precision over Gaia alone is not at the cost of PM accuracy. In comparing the PM uncertainties, we find that the GaiaHub values are slightly smaller than those of BP3M, but this is likely because BP3M propagates transformation parameter uncertainties to the final PMs and GaiaHub uses a single set of values for the transformation solution. We also see that the GaiaHub PMs form visually-tighter PM knots than the BP3M PMs, though we would expect them to be more similar based on the size of their PM uncertainties. These differences suggest that there are systematics in the BP3M PMs that are not present in the GaiaHub PMs. In testing possible explanations, we find that the BP3M results are able to look more like the GaiaHub ones when we restrict the samples to only the cluster members from del Pino et al. (2022). Theoretically, the BP3M statistics should not allow the presence of non-member sources to impact the PMs of member stars – in fact, their presence should help improve the transformation solution – but we find that this is not the case.

Some further testing suggests the likely culprit comes from BP3M’s estimate of the source position uncertainty in the HST images. As discussed previously, BP3M uses the same HST position uncertainties as GaiaHub, and this is calculated as a fraction of a PSF quality of fit statistic. When analysing a kinematically-cleaned, co-moving population of stars (i.e. the case that GaiaHub was designed for), this estimate of uncertainty performs well because the sample is restricted to faint sources that follow a linear relation between the quality of PSF fitting and HST position uncertainty. The validity of this approximation is evident from the resulting GaiaHub PMs and the results when BP3M uses the same subsample. However, this uncertainty approximation can occasionally underestimate the true uncertainty – especially for bright-but-not-saturated sources – leading to cases where the transformation solution becomes torqued away from the truth by a few unfairly-high-weighted sources. As more HST images are analysed together (e.g. as we present here with the dSph data), the amount that HST information contributes to the posterior measurements begins to outweigh the Gaia-measured priors, and it becomes more important for the HST uncertainties to be more accurate. It will be important for future versions of BP3M to identify other methods of measuring the HST position uncertainties to be as statistically robust as possible.

For all the BP3M analyses presented in this work, we emphasize that we use all Gaia sources found in each HST image to test the general nature of the pipeline. Our goal is to retain as many sources as possible for transformation fitting, which is especially important in sparse field cases. For BP3M, foreground bright stars – which often have well-measured Gaia positions, parallaxes, and PMs – that do not belong to the distant populations of interest are often useful anchors for the transformation fitting.

We next repeat the BP3M analysis while varying the number of HST images considered together to explore the effect this has on the posterior PMs and parallaxes. The resulting improvement factors as a function of magnitude are given in Figures 13 and 14 for Fornax and Draco respectively, with the legend displaying the time baselines from Gaia for the different sets of HST images. In the case of Fornax, the most significant improvement in PM occurs when analysing a single HST image with Gaia (i.e. a median PM precision increase by a factor of 8.6\sim 8.6 for 20.5<G<21mag20.5<G<21~{}\mathrm{mag}), though there is still an additional improvement when the remaining five HST images are brought in (14%\sim 14\% improvement for for 20.5<G<21mag20.5<G<21~{}\mathrm{mag}). The parallax uncertainties, however, do not improve very significantly for any of the images, though there is a slight, 1%\sim 1\%, precision increase at the faintest magnitudes. This is likely because the HST images are taken at a single epoch and the time baseline is quite close to a one year multiple of Gaia’s observation date (i.e. only offset by 20% of a year); this means that the parallax orbit is being sampled repeatedly in a small area at the same approximate time, which does not lead to significant increases in parallax precision.

For Draco, increasing the number of HST images does substantially improve the PMs for each image added; considering 10 HST images together with Gaia improves the PM precision by a median factor of 13\sim 13 for 20.5<G<21mag20.5<G<21~{}\mathrm{mag} compared to a median factor of 7.8\sim 7.8 when only one HST image is used. The parallaxes also see a noticeable change in precision as a function of HST images analysed. While the time offsets from Gaia are again only 20%\sim 20\% of a year, the HST images are spread over three epochs. In the case of Fornax, we effectively have measurements of positions for each source at two different times (i.e. a single epoch of HST images plus Gaia) while the Draco data have measurements of positions at four different times; for the Draco stars then, there is smaller set of parallax and PM combinations that can explain those four position measurements per source than if there were only two position measurements. It is likely that the improvement in the parallax precision also contributes to the increase in PM precision because the degeneracy between the different types of apparent motion become less entangled. However, the time offsets of the HST images from Gaia and from each other are not optimally spaced in the parallax orbit (i.e. near the apocenters), resulting in a median parallax precision increase by a factor of 1.18\sim 1.18 for the faintest magnitudes (G>20.5magG>20.5~{}\mathrm{mag}).

5.2 Comparison with QSOs

Refer to caption
Figure 15: PMs of the 52 sources in the COSMOS field that are nearest to a previously-identified QSO. The PMs are colored by whether Gaia-measured PMs and parallaxes were available as priors. The PM uncertainties are such that all of these PM measurements are consistent with stationary.
Refer to caption
Figure 16: QSO PM distance from stationary, scaled by their uncertainties. The orange histogram shows the expectation if the uncertainties explain all the non-zero sizes of the PMs, with errorbars giving the 68% region on the density in each bin for a sample size of N=52N=52; the two distributions agree quite well by eye, suggesting that the posterior PM uncertainties are reasonable and trustworthy.
Refer to caption
Figure 17: Posterior BP3M PMs for 31 sources in the COSMOS field that are shared with the HALO7D survey. The “HST” designation in the axis labels refer to the PMs of Cunningham et al. (2019b), which use HST images alone to measure PMs.

As an additional method of validating the BP3M posterior PMs and their uncertainties, we cross-match the 2000\sim 2000 unique COSMOS sources mentioned in Section 2 with the MILLIQUAS QSO catalog444https://heasarc.gsfc.nasa.gov/W3Browse/all/milliquas.html (Flesch, 2023). There are 52 sources in the QSO list that have COSMOS sources within 10mas10~{}\mathrm{mas}, so we run the HST images that contain these sources through the BP3M pipeline, with the resulting posterior PMs shown in Figure 15. As expected for extremely distant sources, the PMs are closely distributed around the origin.

Next, we use the PM uncertainties to measure the 2D distances of the PMs from stationary, and then compare these measurements to the expected distribution; the result of this process is shown in Figure 16. The expected distribution (orange histogram) for an equivalent sample size agrees quite well with the measured one (blue histogram). This is additional and external evidence that the BP3M posterior PM distributions are reasonable and trustworthy.

5.3 Comparison with HST-measured PMs

Like in Section 5.2, we identify sources in our COSMOS sample that are shared with an external survey; in this case, we cross-match with the COSMOS sample of the HALO7D survey (Cunningham et al., 2019a, b; McKinnon et al., 2023). The HALO7D PMs are described in Cunningham et al. (2019b); to summarize, the PMs are measured using multiple epochs of HST imaging with the absolute reference frame being defined by registration with background galaxies. These HST+HST PMs are therefore a result of traditional PM measurement techniques.

We analyse the HST images in the multi-epoch region of COSMOS that have Gaia sources in common with the HALO7D survey, finding 31 cross-matches. In Figure 17, we see that the BP3M PMs have strong agreement with their HST-only counterparts. As in the other comparisons we have presented, the distribution of 2D differences between the PM measurements agree statistically with our expectations, providing the final piece of validation for the BP3M outputs.

5.4 Proper Motions in Sparse Fields: COSMOS

Refer to caption
Figure 18: Posterior BP3M PMs for the 2640 unique sources in the COSMOS field (i.e. same sources referred to in Figure 1). While each source may fall in multiple HST images, we only show the PMs measured from analysing a single HST image with Gaia. The PMs are colored by whether Gaia-measured PMs and parallaxes were available as priors. The PM uncertainties of the sources with Gaia priors (blue points) are, on average, much smaller than those without Gaia priors (orange points).
Refer to caption
Figure 19: Comparison of posterior PMs and corresponding uncertainties between Gaia and BP3M for the same COSMOS sources as Figure 18. The different PM values agree with each other within their uncertainties across a range of PM sizes. In the upper right panel, the errorbars come from the Gaia-measured PM covariance matrices alone instead of a combination of BP3M and Gaia covariances because the BP3M results depend on the Gaia covariances. For many of the bright sources (G<19magG<19~{}\mathrm{mag}), their HST detection has a large position uncertainty, which leads to no improvement over the Gaia-measured PMs. For fainter targets (G>19magG>19~{}\mathrm{mag}), where the Gaia PM uncertainty increases and the HST positions are better measured, there is a significant improvement in the PM uncertainty. Approximately 25% of the COSMOS sources have no Gaia-measured PM, and these sources have a median PM uncertainty of 1.16masyr1\sim 1.16~{}\mathrm{mas}~{}\mathrm{yr}^{-1}.

Using the HST images of the COSMOS field discussed in Section 2.1, we run BP3M on all images within 0.50.5^{\circ} of the field’s center. This includes all regions of COSMOS with multi-epoch HST imaging (near the center), as well as single-epoch-only HST regions. The posterior PMs we measure for the 2640 unique sources that are found in common between the HST images and Gaia are shown in Figure 18, where the sources are colored by whether or not they have Gaia-measured PMs and parallaxes. As a quick sanity check, this PM distribution visually agrees with our expectations based on the synthetic COSMOS data created using previously-measured velocity distributions for the MW thick disk and stellar halo555See Figure 23 in Appendix B, but note that our “visual comparison” here means looking at the range and spread of the PMs, not comparing the different colored populations in each figure.. There may be, however, more BP3M PMs clustered at the origin than originally expected (i.e. the orange, “No Gaia Prior” points), but many of these sources may truly be background stationary sources, which were not included as a component of the synthetic data.

While each source may appear in multiple HST frames, we only show the results from analysing each HST image separately, and the displayed PM is from the posterior distribution with the smallest uncertainty. Currently, it is very computationally expensive to analyse multiple HST concurrently, making the complete simultaneous analysis of COSMOS HST images intractable. Fortunately, as we determined from our dSph test data, analyses of single HST images do not show noticeable systematics in BP3M PMs as a result of the HST position uncertainty estimates, so the PMs we present for the COSMOS sample are robust. After increasing computation efficiency and HST position uncertainty estimates in planned updates to the pipeline, we will be able to present a complete COSMOS analysis in future work.

We also compare the BP3M PMs and uncertainties to the corresponding Gaia PMs in Figure 19, where we again see excellent agreement between the two samples (top panels). Because the sources in this figure include the same sources shown in Figure 2, we can also compare the sparse-field PM performance between the GaiaHub and BP3M pipelines. In general, BP3M returns much closer agreement with the Gaia PMs, as is expected because it uses those measurements as prior constraints. In addition, BP3M is able to measure consistent PMs for all 2640 Gaia sources in the COSMOS images, which is a large increase from the 1659 PMs catalog of GaiaHub. These results suggest that BP3M’s approach has successfully addressed the issues producing the outliers of Figure 19, indicating it is a valid method of extracting useful and trustworthy PMs in sparse fields.

The middle and bottom panels of Figure 19 show the improvement on the PM uncertainty that BP3M measures by combining HST and Gaia, especially for the 25%\sim 25\% of sources that had no Gaia-measured PMs; the median PM uncertainty for sources without Gaia priors is found to be 1.16masyr1\sim 1.16~{}\mathrm{mas}~{}\mathrm{yr}^{-1}. While many of the brightest magnitude sources – those most likely to be at or near saturation in the HST images – show no improvement on Gaia’s PMs, the faintest magnitude sources (20.25<G<20.75mag20.25<G<20.75~{}\mathrm{mag}) have a median PM improvement factor of 2.6. As mentioned for the small-source-count case in Figure 9, the less extreme improvements in PM we see in sparse fields are likely the result of larger uncertainty in the transformation parameters, which propagates to the PM uncertainties of all sources in an image.

The BP3M transformation parameters could likely be pinned down much more precisely by identifying stationary sources in each image. For example, by finding the centroids in each HST image that are clearly associated with quasars and background galaxies, then updating the priors in parallax and PM to a narrow distribution around 0\vec{0}, the transformation solution can be improved by these anchor points. This would reduce the posterior width of the transformation parameter distributions for each image, especially for particularly sparse images, which would also reduce the posterior widths in the PMs for all of the sources in an HST image. Exploring the impact that known stationary targets can have on the BP3M PMs will be the focus of future work.

The median PM uncertainties in the 20<G<21.5mag20<G<21.5~{}\mathrm{mag} range are summarized for the COSMOS sample in Table 2. First, the BP3M sample that has Gaia parallax and PM priors has a significant increase in precision compared to Gaia alone. Second, the Gaia COSMOS sample (20<G<20.75mag20<G<20.75~{}\mathrm{mag}) has a similar median PM uncertainty to the BP3M sample without Gaia priors (20.75<G<21.5mag20.75<G<21.5~{}\mathrm{mag}), but the latter contains significantly more stars. Effectively, BP3M substantially increases the number of stars in a single field with Gaia-quality PMs at the faintest magnitudes.

Table 2: Median uncertainty in PM for the COSMOS stars in Gaia, BP3M with Gaia priors, and BP3M without Gaia priors in the 20<G<21.5mag20<G<21.5~{}\mathrm{mag} range. While the median uncertainties in the Gaia sample are smaller than the medians from the BP3M without Gaia priors sample, the former has significantly fewer stars than the latter and does not reach as faint. As shown in the middle panel of Figure 19, the Gaia COSMOS data extend only to G20.75magG\sim 20.75~{}\mathrm{mag}, while the BP3M sample extends to G21.5magG\sim 21.5~{}\mathrm{mag}.
Number of Median σμi||\sigma_{\vec{\mu}_{i}}||
Sample Sources (mas/yr)
Gaia 443 1.03
BP3M, Gaia Prior 443 0.44
BP3M, No Gaia Prior 632 1.13

A typical MW stellar halo isochrone (i.e. age=12Gyr{\mathrm{age}}=12~{}\mathrm{Gyr} and [Fe/H]=1.2dex[\mathrm{Fe}/\mathrm{H}]=-1.2~{}\mathrm{dex}, see Appendix B) has a red giant branch (RGB) base around an absolute magnitude of MG3magM_{G}\approx 3~{}\mathrm{mag}. For stars at the base of the RGB with apparent magnitudes in the 20<G<20.75mag20<G<20.75~{}\mathrm{mag} range, this translates to a distance range of 2535kpc\sim 25-35~{}\mathrm{kpc}. Using the median PM uncertainties in Table 2, this would imply an uncertainty in velocity on the sky in the range of 123173kms1\sim 123-173~{}\mathrm{km}~{}\mathrm{s}^{-1} for the Gaia-alone sample versus a 5274kms1\sim 52-74~{}\mathrm{km}~{}\mathrm{s}^{-1} range for the BP3M sample with Gaia priors. For the BP3M sample without Gaia priors, the apparent magnitude range of 20.75<G<21.5mag20.75<G<21.5~{}\mathrm{mag} translates to a distance range of 3550kpc\sim 35-50~{}\mathrm{kpc}, and results in a sky velocity uncertainty range of 190268kms1\sim 190-268~{}\mathrm{km}~{}\mathrm{s}^{-1} when using the median PM uncertainty for this subsample.

Like with the dwarf spheroidal data, many of these COSMOS sources appear in multiple HST images. As a result, the BP3M PM uncertainties presented in this section are significantly larger than the possible PM uncertainty that could be achieved by analysing the images concurrently, which will be the focus on future work. This may also likely yield improvements on the parallax precision, especially for sources in the multi-epoch HST regions of the field. Finally, there are many HST sources that are too faint to have Gaia counterparts, but they nonetheless appear in multiple HST frames. For these sources, future versions of BP3M can use the HST-to-Gaia transformation parameters for each image to identify the positions of these sources in a global reference frame as a function of time, and thus determine their best fit PMs and parallaxes.

In tests of a handful of COSMOS images where we analyse all overlapping HST concurrently, we measure a median PM uncertainty of 0.064masyr10.064~{}\mathrm{mas}~{}\mathrm{yr}^{-1} for 20<G<20.7520<G<20.75 mag (sources with Gaia-measured PMs) and a median PM uncertainty of 0.25masyr10.25~{}\mathrm{mas}~{}\mathrm{yr}^{-1} for 20.75<G<21.520.75<G<21.5 mag (sources with Gaia-measured PMs). In future work, we plan to use the extreme precision PMs to investigate substructure in the MW stellar halo at large distances (D>40D>40 kpc). Other future applications including measuring internal kinematics of perturbed stellar streams – whose stars are often in sparse fields that are analogous to COSMOS, and where the 3D velocity error budget is dominated by PM terms (e.g., GD-1 Price-Whelan & Bonaca, 2018; Malhan & Ibata, 2019) – to constrain the dark matter subhalo mass function.

6 Summary

We have presented the statistics that describe, in general, how positions of sources from two or more sets of images (from any telescope or instrument) should be combined in a Bayesian hierarchical framework to measure transformation parameters between the images, as well as distributions on the positions, parallaxes, and PMs for the individual sources. We use these statistics to created BP3M – a pipeline that builds off of GaiaHub – to combine archival HST and Gaia data to measure reliable positions, parallaxes, and PMs in the absolute reference frame of Gaia, even in sparse fields (e.g. N<10N_{*}<10 per HST image). We have tested this pipeline using a combination of synthetic HST images, nearby dwarf galaxies, a catalog of QSOs, and sparse MW stellar halo fields. In all cases, we find that the BP3M PMs are both accurate and precise, especially for magnitudes where Gaia has large PM uncertainties (19<G<21mag19<G<21~{}\mathrm{mag}) or no PM measurements at all (21<G<21.5mag21<G<21.5~{}\mathrm{mag}).

Our key results include:

  1. 1.

    When trying to maximize parallax precision, the best observing strategy we have found is to alternate the observation times between the two semi-major axis extremes of the ellipse created by the parallax motion. The improvement to the parallax and position precision come at no cost to the PM uncertainty (Section 4.3, Figures 10);

  2. 2.

    For time baselines of 10 to 13 years, the BP3M pipeline is able to measure PMs of stars in nearby dwarf spheroidals with uncertainties that are, on average, 8 to 13 times more precise than Gaia alone for 20.5<G<21mag20.5<G<21~{}\mathrm{mag} (Section 5.1, Figures 12 and 12);

  3. 3.

    For the sparse fields of COSMOS (median 9 sources shared with Gaia per HST image), we measure a median PM improvement factor of 2.6 for sources with Gaia-measured PMs, and find a median PM uncertainty of 1.16masyr11.16~{}\mathrm{mas}~{}\mathrm{yr}^{-1} for the 25%\sim 25\% of sources without Gaia-measured PMs (Section 5.4, Figure 19).

KM, CMR, PG, and MA were funded by NASA through grants associated with HST Archival Program AR-16625 awarded by the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., for NASA, under contract NAS5-26555. AdP acknowledges the financial support from the European Union - NextGenerationEU and the Spanish Ministry of Science and Innovation through the Recovery and Resilience Facility project J-CAVA and the project PID2021-124918NB-C41. KM thanks the individuals and teams of researchers whose decades of work to characterize the optics of HST that have made this work possible. KM also thanks Melodie Kao for code that helped implement the motion-from-parallax calculations. This work was carried out in collaboration with the HSTPROMO (High-resolution Space Telescope PROper MOtion) Collaboration666https://www.stsci.edu/~marel/hstpromo.html, a set of projects aimed at improving our dynamical understanding of stars, clusters, and galaxies in the nearby Universe through measurement and interpretation of proper motions from HST, Gaia, and other space observatories. We thank the collaboration members for sharing their ideas and software.

References

  • Anderson (2007) Anderson, J. 2007, Variation of the Distortion Solution, Instrument Science Report ACS 2007-08, 12 pages
  • Anderson (2022) —. 2022, One-Pass HST Photometry with hst1pass, Instrument Science Report WFC3 2022-5, 55 pages
  • Anderson & King (2004) Anderson, J., & King, I. R. 2004, Multi-filter PSFs and Distortion Corrections for the HRC, Instrument Science Report ACS 2004-15, 51 pages
  • Anderson & King (2006) —. 2006, PSFs, Photometry, and Astronomy for the ACS/WFC, Instrument Science Report ACS 2006-01, 34 pages
  • Astropy Collaboration et al. (2013) Astropy Collaboration, Robitaille, T. P., Tollerud, E. J., et al. 2013, A&A, 558, A33, doi: 10.1051/0004-6361/201322068
  • Astropy Collaboration et al. (2018) Astropy Collaboration, Price-Whelan, A. M., Sipőcz, B. M., et al. 2018, AJ, 156, 123, doi: 10.3847/1538-3881/aabc4f
  • Astropy Collaboration et al. (2022) Astropy Collaboration, Price-Whelan, A. M., Lim, P. L., et al. 2022, apj, 935, 167, doi: 10.3847/1538-4357/ac7c74
  • Bellini et al. (2011) Bellini, A., Anderson, J., & Bedin, L. R. 2011, PASP, 123, 622, doi: 10.1086/659878
  • Bellini et al. (2017) Bellini, A., Anderson, J., Bedin, L. R., et al. 2017, ApJ, 842, 6, doi: 10.3847/1538-4357/aa7059
  • Bellini & Bedin (2009) Bellini, A., & Bedin, L. R. 2009, PASP, 121, 1419, doi: 10.1086/649061
  • Bellini et al. (2018) Bellini, A., Libralato, M., Bedin, L. R., et al. 2018, ApJ, 853, 86, doi: 10.3847/1538-4357/aaa3ec
  • Belokurov et al. (2018) Belokurov, V., Erkal, D., Evans, N. W., Koposov, S. E., & Deason, A. J. 2018, MNRAS, 478, 611, doi: 10.1093/mnras/sty982
  • Bennet et al. (2022) Bennet, P., Alfaro-Cuello, M., Pino, A. d., et al. 2022, ApJ, 935, 149, doi: 10.3847/1538-4357/ac81c9
  • Bonaca et al. (2020) Bonaca, A., Conroy, C., Cargile, P. A., et al. 2020, ApJ, 897, L18, doi: 10.3847/2041-8213/ab9caa
  • Choi et al. (2016) Choi, J., Dotter, A., Conroy, C., et al. 2016, ApJ, 823, 102, doi: 10.3847/0004-637X/823/2/102
  • Conroy et al. (2019) Conroy, C., Naidu, R. P., Zaritsky, D., et al. 2019, ApJ, 887, 237, doi: 10.3847/1538-4357/ab5710
  • Cunningham et al. (2019a) Cunningham, E. C., Deason, A. J., Rockosi, C. M., et al. 2019a, ApJ, 876, 124, doi: 10.3847/1538-4357/ab16cb
  • Cunningham et al. (2023) Cunningham, E. C., Hunt, J. A. S., Price-Whelan, A. M., et al. 2023, arXiv e-prints, arXiv:2307.08730, doi: 10.48550/arXiv.2307.08730
  • Cunningham et al. (2019b) Cunningham, E. C., Deason, A. J., Sanderson, R. E., et al. 2019b, ApJ, 879, 120, doi: 10.3847/1538-4357/ab24cd
  • Cunningham et al. (2022) Cunningham, E. C., Sanderson, R. E., Johnston, K. V., et al. 2022, ApJ, 934, 172, doi: 10.3847/1538-4357/ac78ea
  • del Pino et al. (2022) del Pino, A., Libralato, M., van der Marel, R. P., et al. 2022, ApJ, 933, 76, doi: 10.3847/1538-4357/ac70cf
  • Dotter (2016) Dotter, A. 2016, ApJS, 222, 8, doi: 10.3847/0067-0049/222/1/8
  • Flesch (2023) Flesch, E. W. 2023, arXiv e-prints, arXiv:2308.01505, doi: 10.48550/arXiv.2308.01505
  • Foreman-Mackey (2016) Foreman-Mackey, D. 2016, The Journal of Open Source Software, 1, 24, doi: 10.21105/joss.00024
  • Foreman-Mackey et al. (2013) Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306, doi: 10.1086/670067
  • Gaia Collaboration et al. (2018) Gaia Collaboration, Helmi, A., van Leeuwen, F., et al. 2018, A&A, 616, A12, doi: 10.1051/0004-6361/201832698
  • Grogin et al. (2011) Grogin, N. A., Kocevski, D. D., Faber, S. M., et al. 2011, ApJS, 197, 35, doi: 10.1088/0067-0049/197/2/35
  • Harris et al. (2020) Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Nature, 585, 357, doi: 10.1038/s41586-020-2649-2
  • Helmi et al. (2018) Helmi, A., Babusiaux, C., Koppelman, H. H., et al. 2018, Nature, 563, 85, doi: 10.1038/s41586-018-0625-x
  • Hunter (2007) Hunter, J. D. 2007, Computing in Science & Engineering, 9, 90, doi: 10.1109/MCSE.2007.55
  • Iorio & Belokurov (2021) Iorio, G., & Belokurov, V. 2021, MNRAS, 502, 5686, doi: 10.1093/mnras/stab005
  • Kluyver et al. (2016) Kluyver, T., Ragan-Kelley, B., Pérez, F., et al. 2016, in Positioning and Power in Academic Publishing: Players, Agents and Agendas, ed. F. Loizides & B. Schmidt, IOS Press, 87 – 90
  • Koekemoer et al. (2011) Koekemoer, A. M., Faber, S. M., Ferguson, H. C., et al. 2011, ApJS, 197, 36, doi: 10.1088/0067-0049/197/2/36
  • Kroupa (2001) Kroupa, P. 2001, MNRAS, 322, 231, doi: 10.1046/j.1365-8711.2001.04022.x
  • Libralato et al. (2019) Libralato, M., Bellini, A., Piotto, G., et al. 2019, ApJ, 873, 109, doi: 10.3847/1538-4357/ab0551
  • Libralato et al. (2018) Libralato, M., Bellini, A., van der Marel, R. P., et al. 2018, ApJ, 861, 99, doi: 10.3847/1538-4357/aac6c0
  • Libralato et al. (2022) Libralato, M., Bellini, A., Vesperini, E., et al. 2022, ApJ, 934, 150, doi: 10.3847/1538-4357/ac7727
  • Lindegren et al. (2018) Lindegren, L., Hernández, J., Bombrun, A., et al. 2018, A&A, 616, A2, doi: 10.1051/0004-6361/201832727
  • Luri et al. (2018) Luri, X., Brown, A. G. A., Sarro, L. M., et al. 2018, A&A, 616, A9, doi: 10.1051/0004-6361/201832964
  • Mackereth et al. (2019) Mackereth, J. T., Schiavon, R. P., Pfeffer, J., et al. 2019, MNRAS, 482, 3426, doi: 10.1093/mnras/sty2955
  • Malhan & Ibata (2019) Malhan, K., & Ibata, R. A. 2019, MNRAS, 486, 2995, doi: 10.1093/mnras/stz1035
  • Massari et al. (2018) Massari, D., Breddels, M. A., Helmi, A., et al. 2018, Nature Astronomy, 2, 156, doi: 10.1038/s41550-017-0322-y
  • Massari et al. (2020) Massari, D., Helmi, A., Mucciarelli, A., et al. 2020, A&A, 633, A36, doi: 10.1051/0004-6361/201935613
  • Massari et al. (2017) Massari, D., Posti, L., Helmi, A., Fiorentino, G., & Tolstoy, E. 2017, A&A, 598, L9, doi: 10.1051/0004-6361/201630174
  • McKinnon et al. (2023) McKinnon, K. A., Cunningham, E. C., Rockosi, C. M., et al. 2023, arXiv e-prints, arXiv:2302.07293, doi: 10.48550/arXiv.2302.07293
  • Muzzin et al. (2013) Muzzin, A., Marchesini, D., Stefanon, M., et al. 2013, ApJS, 206, 8, doi: 10.1088/0067-0049/206/1/8
  • Naidu et al. (2020) Naidu, R. P., Conroy, C., Bonaca, A., et al. 2020, ApJ, 901, 48, doi: 10.3847/1538-4357/abaef4
  • Nayyeri et al. (2017) Nayyeri, H., Hemmati, S., Mobasher, B., et al. 2017, ApJS, 228, 7, doi: 10.3847/1538-4365/228/1/7
  • pandas development team (2020) pandas development team, T. 2020, pandas-dev/pandas: Pandas, latest, Zenodo, doi: 10.5281/zenodo.3509134
  • Paxton et al. (2011) Paxton, B., Bildsten, L., Dotter, A., et al. 2011, ApJS, 192, 3, doi: 10.1088/0067-0049/192/1/3
  • Paxton et al. (2013) Paxton, B., Cantiello, M., Arras, P., et al. 2013, ApJS, 208, 4, doi: 10.1088/0067-0049/208/1/4
  • Paxton et al. (2015) Paxton, B., Marchant, P., Schwab, J., et al. 2015, ApJS, 220, 15, doi: 10.1088/0067-0049/220/1/15
  • Pérez & Granger (2007) Pérez, F., & Granger, B. E. 2007, Computing in Science and Engineering, 9, 21, doi: 10.1109/MCSE.2007.53
  • Price-Whelan & Bonaca (2018) Price-Whelan, A. M., & Bonaca, A. 2018, ApJ, 863, L20, doi: 10.3847/2041-8213/aad7b5
  • Robin et al. (2003) Robin, A. C., Reylé, C., Derrière, S., & Picaud, S. 2003, A&A, 409, 523, doi: 10.1051/0004-6361:20031117
  • Schönrich et al. (2011) Schönrich, R., Asplund, M., & Casagrande, L. 2011, MNRAS, 415, 3807, doi: 10.1111/j.1365-2966.2011.19003.x
  • Sohn et al. (2012) Sohn, S. T., Anderson, J., & van der Marel, R. P. 2012, ApJ, 753, 7, doi: 10.1088/0004-637X/753/1/7
  • Sohn et al. (2018) Sohn, S. T., Watkins, L. L., Fardal, M. A., et al. 2018, ApJ, 862, 52, doi: 10.3847/1538-4357/aacd0b
  • van de Ven et al. (2006) van de Ven, G., van den Bosch, R. C. E., Verolme, E. K., & de Zeeuw, P. T. 2006, A&A, 445, 513, doi: 10.1051/0004-6361:20053061
  • van der Marel et al. (2012) van der Marel, R. P., Fardal, M., Besla, G., et al. 2012, ApJ, 753, 8, doi: 10.1088/0004-637X/753/1/8
  • Vasiliev & Baumgardt (2021) Vasiliev, E., & Baumgardt, H. 2021, MNRAS, 505, 5978, doi: 10.1093/mnras/stab1475
  • Virtanen et al. (2020) Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nature Methods, 17, 261, doi: 10.1038/s41592-019-0686-2
  • Wes McKinney (2010) Wes McKinney. 2010, in Proceedings of the 9th Python in Science Conference, ed. Stéfan van der Walt & Jarrod Millman, 56 – 61, doi: 10.25080/Majora-92bf1922-00a
\onecolumngrid

Appendix A Motion Statistics

Here, we present the posterior conditional distributions in PM, parallax, and position for a given source.

Because of it’s lack of coefficients, it is easiest to solve for the posterior conditional on Δθi\vec{\Delta\theta_{i}} first:

(Δθi|μi,plxi,)𝒩(𝚺θ,i=[𝑽θ,i1+j=1nim𝑱ijT𝑽d,ij1𝑱ij]1,μθ,i=𝚺θ,i[𝑽θ,i1Δθi+j=1nim𝑱ijT𝑽d,ij1𝑱ij(𝐩𝐥𝐱𝒊Δplxij+𝚫𝒕𝒋μi𝑱ij1Δdij)])\begin{split}\left(\vec{\Delta\theta_{i}}|\vec{\mu}_{i},\mathrm{plx}_{i},\dots\right)\sim\mathcal{N}&\left(\boldsymbol{\Sigma}_{\theta,i}=\left[\boldsymbol{V}^{-1}_{\theta,i}+\sum_{j=1}^{n_{im}}\boldsymbol{J}_{ij}^{T}\cdot\boldsymbol{V}^{-1}_{d,ij}\cdot\boldsymbol{J}_{ij}\right]^{-1},\right.\\ &\hskip 14.22636pt\left.\vec{\mu}_{\theta,i}=\boldsymbol{\Sigma}_{\theta,i}\cdot\left[\boldsymbol{V}^{-1}_{\theta,i}\cdot\vec{\Delta\theta}^{\prime}_{i}+\sum_{j=1}^{n_{im}}\boldsymbol{J}_{ij}^{T}\cdot\boldsymbol{V}^{-1}_{d,ij}\cdot\boldsymbol{J}_{ij}\cdot\left(\boldsymbol{\mathrm{plx}_{i}}\cdot\vec{\Delta\mathrm{plx}_{ij}}+\boldsymbol{\Delta t_{j}}\cdot\vec{\mu}_{i}-\boldsymbol{J}^{-1}_{ij}\cdot\vec{\Delta d_{ij}}\right)\right]\right)\end{split} (A1)

where we use bold-faced versions of scalars to represent the corresponding identity matrix version of that number, such as

𝐩𝐥𝐱i=plxi𝑰2×2.\boldsymbol{\mathrm{plx}}_{i}=\mathrm{plx}_{i}\cdot\boldsymbol{I}_{2\times 2}.

We have also included a term for the Gaia-measured position offset vector Δθi\vec{\Delta\theta}_{i}^{\prime} to be as general as possible, but this vector is just 0\vec{0} because Gaia has no measured offset from the positions it reports.

Next, the posterior conditional on μi\vec{\mu}_{i} is given by:

(μi|plxi)𝒩(𝚺μ,i=[𝑽μ,i1+𝑽^𝝁1+𝚺μ,θ,i1+j=1nim𝚺μ,d,ij1]1,μμ,i=𝚺μ,i[𝑽μ,i1μi+𝑽𝝁^1μ^+𝚺μ,θ,i1μμ,θ,i+j=1nim𝚺μ,d,ij1𝑪μ,ij1Dμ,ij])\begin{split}(\vec{\mu}_{i}|\mathrm{plx}_{i}\dots)&\sim\mathcal{N}\left(\boldsymbol{\Sigma}_{\mu,i}=\left[\boldsymbol{V}_{\mu,i}^{-1}+\boldsymbol{\hat{V}_{\mu}}^{-1}+\boldsymbol{\Sigma}_{\mu,\theta,i}^{-1}+\sum_{j=1}^{n_{im}}\boldsymbol{\Sigma}_{\mu,d,ij}^{-1}\right]^{-1},\right.\\ &\hskip 34.99677pt\left.\vec{\mu}_{\mu,i}=\boldsymbol{\Sigma}_{\mu,i}\cdot\left[\boldsymbol{V}_{\mu,i}^{-1}\cdot\vec{\mu}_{i}^{\prime}+\boldsymbol{V_{\hat{\mu}}}^{-1}\cdot\hat{\mu}+\boldsymbol{\Sigma}_{\mu,\theta,i}^{-1}\cdot\vec{\mu}_{\mu,\theta,i}+\sum_{j=1}^{n_{im}}\boldsymbol{\Sigma}_{\mu,d,ij}^{-1}\cdot\boldsymbol{C}_{\mu,ij}^{-1}\cdot\vec{D}_{\mu,ij}\right]\right)\end{split} (A2)

where

𝑨μ,i=[𝚺θ,ij=1nim𝑱ijT𝑽d,ij1𝑱ijΔtj],\boldsymbol{A}_{\mu,i}=\left[\boldsymbol{\Sigma}_{\theta,i}\cdot\sum_{j=1}^{n_{im}}\boldsymbol{J}_{ij}^{T}\cdot\boldsymbol{V}_{d,ij}^{-1}\cdot\boldsymbol{J}_{ij}\cdot\Delta t_{j}\right], (A3)
Bμ,i=𝚺θ,i[j=1nim𝑱ijT𝑽d,ij1𝑱ij𝑱ij1Δdij𝑽θ,i1Δθiplxij=1nim𝑱ijT𝑽d,ij1𝑱ijΔplxij],\vec{B}_{\mu,i}=\boldsymbol{\Sigma}_{\theta,i}\cdot\left[\sum_{j=1}^{n_{im}}\boldsymbol{J}_{ij}^{T}\cdot\boldsymbol{V}_{d,ij}^{-1}\cdot\boldsymbol{J}_{ij}\cdot\boldsymbol{J}_{ij}^{-1}\cdot\vec{\Delta d_{ij}}-\boldsymbol{V}_{\theta,i}^{-1}\cdot\vec{\Delta\theta_{i}^{\prime}}-\mathrm{plx}_{i}\cdot\sum_{j=1}^{n_{im}}\boldsymbol{J}_{ij}^{T}\cdot\boldsymbol{V}_{d,ij}^{-1}\cdot\boldsymbol{J}_{ij}\cdot\vec{\Delta\mathrm{plx}_{ij}}\right], (A4)
𝑪μ,ij=𝚫𝒕𝒋𝑨μ,i,\boldsymbol{C}_{\mu,ij}=\boldsymbol{\Delta t_{j}}-\boldsymbol{A}_{\mu,i}, (A5)
Dμ,ij=𝑱ij1ΔdijBμ,i𝐩𝐥𝐱𝒊Δplxij.\vec{D_{\mu,ij}}=\boldsymbol{J}_{ij}^{-1}\cdot\vec{\Delta d_{ij}}-\vec{B}_{\mu,i}-\boldsymbol{\mathrm{plx}_{i}}\cdot\vec{\Delta\mathrm{plx}_{ij}}. (A6)

Finally, the posterior conditional on plxi\mathrm{plx}_{i} is given by:

(plxi|)𝒩(plxi|σplx,i^2=[σ^plx2+σplx,i2+σplx,θ,i2+σplx,μ,i2+σplx,μ^,i2+j=1nimσplx,d,ij2]1,μplx,i^=σplx,i^2[σ^plx2plx^+σplx,i2plxi+σplx,θ,i2μplx,θ,i+σplx,μ,i2μplx,μ,i+σplx,μ^,i2μplx,μ^,i+j=1nimσplx,d,ij2μplx,d,ij])\begin{split}(\mathrm{plx}_{i}|\dots)\sim&\mathcal{N}\left(\mathrm{plx}_{i}|\hat{\sigma_{\mathrm{plx},i}}^{2}=\left[\hat{\sigma}_{\mathrm{plx}}^{-2}+\sigma^{-2}_{\mathrm{plx},i}+\sigma^{-2}_{\mathrm{plx},\theta,i}+\sigma^{-2}_{\mathrm{plx},\mu,i}+\sigma^{-2}_{\mathrm{plx},\hat{\mu},i}+\sum_{j=1}^{n_{im}}\sigma^{-2}_{\mathrm{plx},d,ij}\right]^{-1},\right.\\ &\hskip 42.67912pt\left.\hat{\mu_{\mathrm{plx},i}}=\hat{\sigma_{\mathrm{plx},i}}^{2}\cdot\left[\hat{\sigma}_{\mathrm{plx}}^{-2}\cdot\hat{\mathrm{plx}}+\sigma^{-2}_{\mathrm{plx},i}\cdot\mathrm{plx}_{i}^{\prime}+\sigma^{-2}_{\mathrm{plx},\theta,i}\cdot\mu_{\mathrm{plx},\theta,i}+\sigma^{-2}_{\mathrm{plx},\mu,i}\cdot\mu_{\mathrm{plx},\mu,i}\right.\right.\\ &\hskip 227.62204pt\left.\left.+\sigma^{-2}_{\mathrm{plx},\hat{\mu},i}\cdot\mu_{\mathrm{plx},\hat{\mu},i}+\sum_{j=1}^{n_{im}}\sigma^{-2}_{\mathrm{plx},d,ij}\cdot\mu_{\mathrm{plx},d,ij}\right]\right)\end{split} (A7)

where

Aplx,μ,i=𝚺θ,ij=1nim𝑱ijT𝑽d,ij1𝑱ijΔplxij,\vec{A_{\mathrm{plx},\mu,i}}=-\boldsymbol{\Sigma}_{\theta,i}\cdot\sum_{j=1}^{n_{im}}\boldsymbol{J}_{ij}^{T}\cdot\boldsymbol{V}_{d,ij}^{-1}\cdot\boldsymbol{J}_{ij}\cdot\vec{\Delta\mathrm{plx}_{ij}}, (A8)
Bplx,μ,i=𝚺θ,i[𝑽θ,i1Δθij=1nim𝑱ijT𝑽d,ij1𝑱ij𝑱ij1Δdij],\vec{B_{\mathrm{plx},\mu,i}}=\boldsymbol{\Sigma}_{\theta,i}\cdot\left[\boldsymbol{V}_{\theta,i}^{-1}\cdot\vec{\Delta\theta_{i}^{\prime}}-\sum_{j=1}^{n_{im}}\boldsymbol{J}_{ij}^{T}\cdot\boldsymbol{V}_{d,ij}^{-1}\cdot\boldsymbol{J}_{ij}\cdot\boldsymbol{J}_{ij}^{-1}\cdot\vec{\Delta d_{ij}}\right], (A9)
Cplx,μ,i=𝚺μ,i[𝚺μ,θ,i1𝑨μ,i1Aplx,μ,ij=1nim𝚺μ,d,ij1𝑪μ,ij1(Aplx,μ,i+Δplxij)],\vec{C}_{\mathrm{plx},\mu,i}=\boldsymbol{\Sigma}_{\mu,i}\cdot\left[\boldsymbol{\Sigma}_{\mu,\theta,i}^{-1}\cdot\boldsymbol{A}_{\mu,i}^{-1}\cdot\vec{A_{\mathrm{plx},\mu,i}}-\sum_{j=1}^{n_{im}}\boldsymbol{\Sigma}_{\mu,d,ij}^{-1}\cdot\boldsymbol{C}_{\mu,ij}^{-1}\cdot\left(\vec{A_{\mathrm{plx},\mu,i}}+\vec{\Delta\mathrm{plx}_{ij}}\right)\right], (A10)
Dplx,μ,i=𝚺μ,i[𝑽μ,i1μi+𝑽𝝁^1μ^+𝚺μ,θ,i1𝑨μ,i1(ΔθiBplx,μ,i)+j=1nim𝚺μ,d,ij1𝑪μ,ij1(Bplx,μ,i+𝑱ij1Δdij)],\begin{split}\vec{D}_{\mathrm{plx},\mu,i}=&-\boldsymbol{\Sigma}_{\mu,i}\cdot\left[\boldsymbol{V}_{\mu,i}^{-1}\cdot\vec{\mu}_{i}^{\prime}+\boldsymbol{V_{\hat{\mu}}}^{-1}\cdot\hat{\mu}+\boldsymbol{\Sigma}_{\mu,\theta,i}^{-1}\cdot\boldsymbol{A}_{\mu,i}^{-1}\cdot\left(\vec{\Delta\theta_{i}}^{\prime}-\vec{B_{\mathrm{plx},\mu,i}}\right)\right.\\ &\hskip 142.26378pt\left.+\sum_{j=1}^{n_{im}}\boldsymbol{\Sigma}_{\mu,d,ij}^{-1}\cdot\boldsymbol{C}_{\mu,ij}^{-1}\cdot\left(\vec{B_{\mathrm{plx},\mu,i}}+\boldsymbol{J}_{ij}^{-1}\cdot\vec{\Delta d_{ij}}\right)\right],\end{split} (A11)
Gplx,d,ij=[Δplxij+ΔtjCplx,μ,iEplx,θ,i],\vec{G}_{\mathrm{plx},d,ij}=\left[\vec{\Delta\mathrm{plx}_{ij}}+\Delta t_{j}\cdot\vec{C_{\mathrm{plx},\mu,i}}-\vec{E_{\mathrm{plx},\theta,i}}\right], (A12)
Hplx,d,ij=[ΔtjDplx,μ,iFplx,θ,i+𝑱ij1Δdij].\vec{H}_{\mathrm{plx},d,ij}=\left[\Delta t_{j}\cdot\vec{D_{\mathrm{plx},\mu,i}}-\vec{F_{\mathrm{plx},\theta,i}}+\boldsymbol{J}_{ij}^{-1}\cdot\vec{\Delta d_{ij}}\right]. (A13)

Because these three posterior conditional distributions are all Gaussian, we are able to quickly sample PMs, parallaxes, and position offsets for source ii given a set of transformation parameters.

Table 3: Distributions used to generate synthetic data of halo and thick MW stars. 𝒮𝒦𝒩\mathcal{SKN} indicates a skew-normal distribution.
Component Distribution Functional Form
Halo p([Fe/H])p([\mathrm{Fe}/\mathrm{H}]) 0.406𝒮𝒦𝒩(μ=1.35dex,σ=0.5dex,a=5)0.406\cdot\mathcal{SKN}(\mu=-1.35~{}\mathrm{dex},\sigma=0.5~{}\mathrm{dex},a=5)\hskip 85.35826pt
+0.594𝒮𝒦𝒩(μ=0.9dex,σ=0.7dex,a=3)\hskip 85.35826pt+0.594\cdot\mathcal{SKN}(\mu=-0.9~{}\mathrm{dex},\sigma=0.7~{}\mathrm{dex},a=-3)
p(age)p(\mathrm{age}) 𝒩(μ=12Gyr,σ=2Gyr)\mathcal{N}(\mu=12~{}\mathrm{Gyr},\sigma=2~{}\mathrm{Gyr})
p(mass|[Fe/H],age)p(\mathrm{mass}|[\mathrm{Fe}/\mathrm{H}],\mathrm{age}) Kroupa (2001) IMF, k(massM)α\propto k\left(\frac{\mathrm{mass}}{M_{\odot}}\right)^{-\alpha} with {k=25,α=0.3,mass<0.08Mk=2,α=1.3,mass<0.5Mk=1,α=2.3,mass>0.5M\begin{cases}k=25,~{}\alpha=0.3,&\mathrm{mass}<0.08M_{\odot}\\ k=2,~{}\alpha=1.3,&\mathrm{mass}<0.5M_{\odot}\\ k=1,~{}\alpha=2.3,&\mathrm{mass}>0.5M_{\odot}\end{cases}
p(distancemoludus)p(\mathrm{distance~{}moludus}) D3(Rq27kpc)α\propto D^{3}\left(\frac{R_{q}}{27\mathrm{kpc}}\right)^{-\alpha} with {α=2.3,Rq<27kpcα=4.6,Rq27kpc\begin{cases}\alpha=2.3,&R_{q}<27\,\mathrm{kpc}\\ \alpha=4.6,&R_{q}\geq 27\,\mathrm{kpc}\end{cases}
p(vr)p(v_{r}) 𝒩(μ=0kms1,σ=130kms1)\mathcal{N}(\mu=0~{}\mathrm{km}~{}\mathrm{s}^{-1},\sigma=130~{}\mathrm{km}~{}\mathrm{s}^{-1})
p(vϕ)p(v_{\phi}) 𝒩(μ=0kms1,σ=70kms1)\mathcal{N}(\mu=0~{}\mathrm{km}~{}\mathrm{s}^{-1},\sigma=70~{}\mathrm{km}~{}\mathrm{s}^{-1})
p(vθ)p(v_{\theta}) 𝒩(μ=0kms1,σ=70kms1)\mathcal{N}(\mu=0~{}\mathrm{km}~{}\mathrm{s}^{-1},\sigma=70~{}\mathrm{km}~{}\mathrm{s}^{-1})
Thick Disk p([Fe/H])p([\mathrm{Fe}/\mathrm{H}]) 16𝒮𝒦𝒩(μ=1.05dex,σ=0.6dex,a=5)\frac{1}{6}\mathcal{SKN}(\mu=-1.05~{}\mathrm{dex},\sigma=0.6~{}\mathrm{dex},a=-5)\hskip 85.35826pt
+56𝒩(μ=0.54dex,σ=0.3dex)\hskip 85.35826pt+\frac{5}{6}\mathcal{N}(\mu=-0.54~{}\mathrm{dex},\sigma=0.3~{}\mathrm{dex})
p(age)p({\mathrm{age}}) 𝒩(μ=10Gyr,σ=2Gyr)\mathcal{N}(\mu=10~{}\mathrm{Gyr},\sigma=2~{}\mathrm{Gyr})
p(mass|[Fe/H],age)p(\mathrm{mass}|[\mathrm{Fe}/\mathrm{H}],\mathrm{age}) Kroupa (2001) IMF
p(distancemoludus)p(\mathrm{distance~{}moludus}) D3exp(RD3kpcz1kpc)\propto D^{3}\exp\left(-\frac{R_{D}}{3~{}\mathrm{kpc}}-\frac{z}{1~{}\mathrm{kpc}}\right), where RD2=x2+y2R_{D}^{2}=x^{2}+y^{2}
p(vRD)p(v_{R_{D}}) 𝒩(μ=0kms1,σ=45kms1)\mathcal{N}(\mu=0~{}\mathrm{km}~{}\mathrm{s}^{-1},\sigma=45~{}\mathrm{km}~{}\mathrm{s}^{-1})
p(vz)p(v_{z}) 𝒩(μ=0kms1,σ=20kms1)\mathcal{N}(\mu=0~{}\mathrm{km}~{}\mathrm{s}^{-1},\sigma=20~{}\mathrm{km}~{}\mathrm{s}^{-1})
p(vT)p(v_{T}) 𝒮𝒦𝒩(μ=242kms1,σ=46.2kms1,a=2)\mathcal{SKN}(\mu=242~{}\mathrm{km}~{}\mathrm{s}^{-1},\sigma=46.2~{}\mathrm{km}~{}\mathrm{s}^{-1},a=-2)
\twocolumngrid

Appendix B Generating Synthetic, COSMOS-like Data

Refer to caption
Refer to caption
Figure 20: Distributions on age (left panel) and [Fe/H][\mathrm{Fe}/\mathrm{H}] (right panel) for the MW halo (purple) and thick disk (turquoise) populations used to create synthetic stars. The halo population makes up 28% of the total population (lime green), while the thick disk is the other 72%.
Refer to caption
Figure 21: Color-magnitude diagram of synthetic COSMOS-like stars, colored by whether the stars belong to the MW halo (purple) or thick disk (turquoise). The histograms along the right and top edges include the total population (lime green).
Refer to caption
Figure 22: Distance distributions of synthetic COSMOS-like stars, colored by whether the stars belong to the MW halo (purple) or thick disk (turquoise).
Refer to caption
Figure 23: PMs of synthetic COSMOS-like stars, colored by whether the stars belong to the MW halo (purple) or thick disk (turquoise).
Refer to caption
Refer to caption
Refer to caption
Figure 24: Gaia Position, parallax, and PM uncertainties as a function of magnitude from all Gaia sources within 1 degree of the COSMOS field center. The black points are the Gaia-measured values and the red lines are a median-binning of those data, with a linear extrapolation where no Gaia measurements exist.

Following a similar approach to McKinnon et al. (2023), we define mass, age, [Fe/H][\mathrm{Fe}/\mathrm{H}], and distance distributions for both a MW thick disk and a MW halo population. We also define distributions for the Galactocentric 3D velocities for the halo and thick disk. The distributions for each parameter that go into generating a synthetic star are summarized in Table 3. The particular choices for the velocity distributions come from the MW halo velocity ellipsoid measurements of Cunningham et al. (2019b), as do the distance modulus distributions. For the age distributions, we use an analytical approximation of the age distributions presented in Bonaca et al. (2020). The [Fe/H][\mathrm{Fe}/\mathrm{H}] distributions are again analytical approximations of the results presented in Conroy et al. (2019) and Naidu et al. (2020). The age and [Fe/H][\mathrm{Fe}/\mathrm{H}] distributions are displayed in Figure 20, where the distributions have been scaled by their contribution to the total halo population. After consulting a Besançon model (Robin et al., 2003) in a COSMOS-like field in the 16<G<21.5mag16<G<21.5~{}\mathrm{mag} range, we choose to set the fraction of the number of observed stars that belong to the halo (i.e. “halo fraction”) to 28%.

Using these distributions and the halo fraction, we draw stellar parameters (i.e. mass, age, and [Fe/H][\mathrm{Fe}/\mathrm{H}]) for each synthetic star, and then interpolate to that point using the MIST isochrones777https://waps.cfa.harvard.edu/MIST/index.html (Dotter, 2016; Choi et al., 2016; Paxton et al., 2011, 2013, 2015). This interpolated point yields the color and absolute magnitude of each synthetic star. After we draw distances, we can measure the apparent magnitudes for each star, and then perform rejection sampling to get a sample of stars that fall within a particular range in magnitude and/or color. A color-magnitude diagram is shown in Figure 21 for the COSMOS-like stars we generate, with histograms along the top and right edges showing the distribution of the halo (purple), thick disk (turquoise), and total (lime green) populations. A histogram of the heliocentric distance to each synthetic star is given in Figure 22.

Next, for each synthetic star, we draw the 3D velocity components based on whether that star belongs to the halo or the thick disk. Using the stellar position on the sky and distance allow us to transform those Galactocentric velocities into observable velocities888To transform between the observer frame and a Galactocentric one, we use r=8.1r_{\odot}=8.1 kpc, assume a circular speed of 235kms1235~{}{\rm km\ s}^{-1}, and solar peculiar motion (U,V,W)=(11.1,12.24,7.25)kms1(U,V,W)=(11.1,12.24,7.25)~{}{\rm km\ s}^{-1} (Schönrich et al., 2011).. This yields the PMs shown in Figure 23.

To create outputs that the BP3M pipeline expects, the next step is to create synthetic HST images and corresponding Gaia measurements. For synthetic Gaia uncertainties on position, parallax, and PM, we look at real Gaia-measured stars within 1 degree of the COSMOS field center; the resulting uncertainties in each dimension are shown in Figure 24 as a function of magnitude (black points) with a median-binned line overlaid (red). In cases where the data do not extend as faint as needed, we linearly extrapolate from the median binning.

We use a similar approach when it comes to modeling the uncertainties in the HST image positions. Specifically, we use the 2000\sim 2000 real COSMOS stars from Section 5.4 to look at the GaiaHub-measured HST position uncertainties as a function of magnitude, and this is shown in Figure 25. When we go to assign HST position uncertainties to the synthetic stars, we identify the 10 nearest real COSMOS stars in GG magnitude and randomly select one of those stars to bequeath its position uncertainty to the synthetic one.

Refer to caption
Figure 25: Centroid position uncertainty in HST pixels as a function of magnitude as measured by GaiaHub for the 2000\sim 2000 real COSMOS stars in Figure 18, where the y-axis is in log scale. In general, the brighter magnitude sources have larger HST position uncertainties because they are more likely to be saturated in the COSMOS HST exposures. At the faintest magnitudes, there are some sources with large HST position uncertainties because their PSF-fitting was not as well-constrained as some other faint sources.

With all of the true properties of the synthetic star defined, we can create Gaia-like measurements of the position, parallax, and uncertainty by applying some noise corresponding to the uncertainties we’ve defined. Those true positions and motions can then be played forward or backward in time until reaching the correct epoch of the synthetic HST image. After choosing the transformation parameters that map the synthetic HST image onto the synthetic Gaia data, we can apply the correct HST position noise, and then save the outputs in the same .csv files that BP3M expects as inputs. The final step of the process is to run BP3M on the newly-generated synthetic HST images.