Gaia Data Release 3. Astrometric binary star processing
Abstract
Context. The Gaia Early Data Release 3 contained the positions, parallaxes and proper motions of 1.5 billion sources, among which some did not fit well the ”single star” model. Binarity is one of the causes of this.
Aims. Four million of these stars were selected and various models were tested to detect binary stars and to derive their parameters.
Methods. A preliminary treatment was used to discard the partially resolved double stars and to correct the transits for perspective acceleration. It was then investigated whether the measurements fit well with an acceleration model with or without jerk. The orbital model was tried when the fit of any acceleration model was beyond our acceptance criteria. A Variability-Induced Mover (VIM) model was also tried when the star was photometrically variable. A final selection has been made in order to keep only solutions that probably correspond to the real nature of the stars.
Results. At the end, acceleration solutions, about orbital solutions and 869 VIM solutions were retained. In addition, formulae for calculating the uncertainties of the Campbell orbital elements from orbital solutions expressed in Thiele-Innes elements are given in an appendix.
Key Words.:
binaries:general – Catalogs – Astrometry – Methods: data analysis – Astronomical instrumentation, methods and techniques1 Introduction
1.1 Context
Since the start of the Gaia satellite observations in summer 2014, several data releases have been made available to the scientific community. The astrometric parameters of more than one billion stars have been determined three times with ever increasing precision, but these data still described the stars as single. The third Data Release (DR3) and the Early third Data Release (EDR3) are the first to have a significant number of observations covering a sufficiently long time span (about 33 months) to make it worthwhile to use Gaia observations to search for binary stars.
The extension of the exploitation of Gaia observations to binary stars reveals two more difficulties than the application of the single star model. As with its predecessor Hipparcos, the double star candidates are covered by different numerical models, without being certain that any of them really corresponds to a given star. Therefore, we have to solve the problem of choosing the model that best applies to a given star, as well as the final acceptance of the solution. Before that, we have to filter the input data to retain only those stars that may be able to get a suitable solution.
1.2 Selection of stars to be processed
The astrometric processing of the Gaia Early Data Release 3 (EDR3, Gaia Collaboration, 2021) left some 36.5 million stars that were considered to be possibly double. These stars were all brighter than G=19, had a renormalized unit weight error (; see Lindegren et al., 2021) greater than 1.4, and were observed over at least 12 visibility periods. These objects included many pairs of stars that were resolved whenever the orientation of the scan direction allowed, and seen as a single photocentre otherwise. These so-called “partially resolved double stars” are likely to appear as unresolved binary stars of extremely rapid orbital motion, and should be discarded as far as possible. It is planned to apply a specific treatment to them in the next DRs, but for the moment we can only eliminate them when possible. For this purpose, two quantities published in Gaia EDR3 and coming from the astrometric binary detection processing (Lindegren et al., 2021) were used to establish two additional criteria: The percentage of CCD observations where image parameter determination (IPD) detected more than one peak, called , must be less than or equal to 2, and , which is the amplitude of the natural logarithm of the goodness-of-fit obtained in the IPD versus position angle of scan, must be less than 0.1. As a result, the number of targets to be treated was reduced to 10.9 million. In order to further reduce the number of partially resolved double stars, a final selection was based on photometric properties. As the pixels of the astrometric field are much smaller than those of the BP and RP windows, a component of a close binary at the resolution limit appears less bright in magnitude (from the astrometric field) than one would expect from its magnitude in the BP and RP fields. The corrected BP and RP flux excess factor and its uncertainty as defined in Riello et al. (2021, Eq. 6 and 18) were taken into account. The drastic condition , corresponding to a 90% confidence interval, was applied and a selection of stars was finally obtained.
2 Treatment overview

2.1 Overall organisation
The organisation of the astrometric binary star processing, as it was finally realised, is summarised in Fig. 1. The input data consist of the selected stars, for which the following information is used: the along-scan (AL) coordinates of each transit of the astrometric field of view, the partial derivatives of the single star astrometric model, the magnitudes of the transits, and the parameters of a preliminary single star solution, i.e. the coordinate offsets, the parallax and the proper motion. The production of the astrometric data is described in Lindegren et al. (2021), while the magnitude reduction is explained in Riello et al. (2021). The data are complemented by radial velocities, when available (Seabroke et al., 2021).
The calculation of the astrometric solutions is done in the so-called main processing, as explained hereafter. The data discussed above are ingested by the preprocessing, which prepares them for the calculation of different types of solutions. The solutions are calculated one after the other, starting with a new calculation of the single star solution, then the acceleration solutions, the orbital solution, and, finally, the variability-induced mover (VIM) solutions if the star is photometrically variable. As soon as a solution is considered acceptable, the processing cascade is interrupted and the star goes to post-processing. The post-processing essentially consists of re-filtering the solutions, applying more stringent criteria than those used after the solutions were calculated. When a solution is discarded by post-processing, it is replaced by the EDR3 single star solution.
In practice, the software for managing the cascade of calculation and acceptance of solutions is much more sophisticated than the scheme shown in Fig. 1. In fact, the solution that entered the post-processing was sometimes a so-called “alternative” solution (not to be confused with the ”OrbitalAlternative” solutions in Holl et al., 2022): a solution that met poor acceptance criteria, and which was retained because it was the solution that best matched the observations at the end of the cascade. More information on this aspect of the process is given at the end of Section 2.2.2, but we can already highlight two things: First, that we did not systematically try all the models to retain the one that was the most suitable; this was motivated mainly by the concern to save computational time, but also by the fear of accepting complex solutions for objects that could fall under a simpler model (for example, giving an acceleration solution for an orbital binary is a less serious error than giving a false orbital solution for a binary that should only have an acceleration solution). Despite its apparent mathematical simplicity, the VIMF model has been placed at the end because it combines data of two kinds (astrometric and photometric), which undermines the reliability of the solutions. Secondly, post-processing consists only of rejecting solutions, without trying to replace them with more complex or alternative solutions. Ideally, the filters applied in post-processing should be included in the main processing, but this was not possible in the time available.
2.2 Criteria for the acceptance of solutions
We define hereafter two quality criteria, the goodness of fit and the significance, that can distinguish acceptable solutions from those that are irrelevant or at least potentially wrong. In order to decide whether a solution is really plausible, we will also consider astrophysical criteria and other constraints, which will allow us to identify some solutions as not very plausible. These bad solutions will allow us to delimit a quality domain where they are absent or rare, and whose solutions will be kept.
2.2.1 The goodness of fit
The goodness of fit indicates if the model provides predictions which are compatible with the actual measurements. As in the preparation of the Hipparcos catalogue (ESA, 1997, vol. 1, Sect. 2.1), we use the estimator, which is given by the formula:
(1) |
where is the number of degrees of freedom, and is the sum of the squared normalised residuals. For a solution derived through an adequate model, is expected to obey the normal distribution . This property is attributed to linear models (Stuart & Ord, 1994), but we verified by simulations that it also applies to the orbital model presented hereafter, when the semi-major axis of the orbit is significantly larger than its uncertainty. A large means that the model is inadequate, or that the uncertainties used to derive the are underestimated. In practice, we know that this last possibility widely applies to the astrometric transits used in DR3. Therefore, only very large values of indicate that the model cannot be accepted. Nevertheless, assuming that the model is adequate, the uncertainties may be revised in order to obtain a zero . The simplest correction method, since it keeps the relative weights of the transits and therefore the solution of the model, consists in multiplying the uncertainties of the observations by the coefficient:
(2) |
It is worth noticing that the correction by the coefficient also applies to the uncertainties of the model parameters. In addition to the correction of uncertainties, was also taken into account for the selection of solutions. In the main processing, solutions of , if any, were rejected, but a stricter condition was applied in the post-processing: whatever the model, purely astrometric solutions with uncorrected were considered as questionable, and were finally not retained.
Although similar in principle, this approach is slightly different from that followed in the treatment of single star solutions for EDR3 (Lindegren et al., 2021): Eq. 2 shows that the coefficient is equal to , which is approximately equal to . On the other hand, we do not apply a renormalisation, but this does not matter since we only consider stars brighter than .
2.2.2 The significance
The significance is a dimensionless quantity that was already introduced in the Hipparcos catalogue (ESA, 1997, vol. 1, Sect. 2.3.3) in order to decide whether the use of a model including additional parameters is really justified. When the additional parameters are the coordinates of a two-dimensional vector, as is the case for the acceleration models in Sect. 4, the significance, , is defined as the module of this vector divided by its uncertainty. Therefore, is calculated with the equation:
(3) |
where and are the coordinates of the additional vector characterising the model (for instance, the acceleration), and are the uncertainties of and , respectively, corrected by the coefficient derived with Eq. 2, and the correlation coefficient between and . The values of , and are all taken from the variance-covariance matrix.
When the solutions were calculated, a preliminary selection was made through basic filtering: the solutions of and were considered good enough to be retained, without trying other models. Otherwise, the solutions with and were considered as “alternative” solutions (see Sect. 2.1), and the smaller alternative was provisionally accepted at the end of the cascade. The limit of 5 was adopted empirically on the basis of initial tests of the data. Thresholds of 5 and 12, which amount to as many s, seem exceptionally high and are much higher than the thresholds of 3 and 4 that were adopted on the basis of simulations for alternative solutions and for direct acceptance respectively. This is because, in reality the uncertainties of the transits of a star are certainly not all overestimated by the same coefficient; correcting the uncertainties by a single coefficient is only a stopgap measure, as it is not possible to separate transits with a correct original uncertainty from those with a very underestimated uncertainty.
The set of solutions thus selected was then subjected to a more severe filtering in order to keep only the most credible solutions, as explained below for each model. The final selection criteria are presented in Table 1. For the acceleration models and for the VIMF model, they are more severe than the direct acceptance criteria, and they lead to the rejection of alternative solutions. However, all alternative solutions contribute to the statistical properties of the main processing solutions, which are presented hereafter, and which are the basis for the choice of the final criteria.
Model | Dimension | Significance | Selection conditions | |||
significance | Other | |||||
Acceleration : | ||||||
Constant | 7 | - | ||||
Variable | 9 | - | ||||
Orbital∗ : | ||||||
eccentric | 12 | |||||
circular or | 10 | - | ||||
pseudo-circular | ||||||
VIMF | 7 | - |
*: These selection conditions do not apply to orbital solutions from main processing when they have been confirmed by a SB1 spectrocopic solution, as explained in Gosset et al. (2022).
3 The preprocessing
3.1 Rejection of outliers
Each transit of a star through the Gaia astrometric instrument comes down to the measurement of two coordinates giving its position relative to a reference point assigned to the star (see Lindegren et al., 2021). Of these, only the coordinate along the scan axis, called hereafter the AL abscissa, , is taken into account, being considerably more accurate than the other. The AL abscissae were measured along the 9 CCD transits that constitute each field-of-view (FoV) transit. As the duration of the FoV transit is only a few seconds while we are looking for binary stars with periods of at least several days, the 9 CCD transits amount to 9 measurements of the same quantity. Outliers were therefore detected by intercomparison between the CCD transits. In practice, each AL abscissa was compared to the median of the 9 values, and the transit CCD was rejected when the difference exceeded 5 times the uncertainty of the abscissa.
This rejection process was not the only one that was applied, and subsequently transits were rejected after the calculation of some astrometric solutions. The largest residual transit was rejected when it exceeded 5 times the uncertainty, and the solution was recalculated; however, this operation was subject to three restrictions: (1) it was only done for linear models, i.e. the single star model and the two acceleration models, (2) the of the solution should exceed the number of transits used multiplied by 1.41 and (3) the proportion of rejected transits was limited to 5 %.
3.2 Correction of the perspective acceleration
The distance from the Sun and the apparent position of a nearby star are varying during the time-span of the Gaia mission, inducing slight changes in astrometric parameters which are usually assumed to be constant. These so-called “perspective effects” were taken into account in the calculation of the astrometric parameters of a few single stars in the Hipparcos Catalogue (ESA, 1997). Dravins, Lindegren & Madsen (1999) and Lindegren & Dravins (2003) have shown that these effects are so closely related to radial velocity that they could be used to measure it. For Gaia, Halbwachs (2009) has shown that perspective effects do not significantly affect the astrometric orbits of binary stars; however, they do result in a “perspective acceleration” of stars that could be mistaken for an acceleration due to orbital motion.
The perspective acceleration is based on the radial proper motion, , which is defined as the radial velocity, , divided by distance. In practice, , expressed in yr-1, is related to and parallax, by the following relationship:
(4) |
where is in km.s-1, is in mas, is the duration of the year in days, and is the length of the astronomical unit (au) in meters. Perspective acceleration changes the AL abscissa of a star, , by adding the following quantity:
(5) |
where is the epoch in years, and is an epoch close to the middle of the DR3 mission (in practice, ). The partial derivatives of the abscissa with respect to the parallax and to the proper motion coordinates are those of the 5-parameter single star model. If is unknown, then the abscissae are related to the parameters by a system of non-linear equations. Fortunately, the radial velocities of the bright stars are measured by Gaia, and a calculation based on the single star model, ignoring the perspective acceleration, already gives a good approximation of the parallax and of the proper motion. Thus, may be determined from Eq. 4.
There are then two possibilities to take into account the perspective acceleration: either multiply the partial derivatives by , as they are in Eq. 5, or correct the abscissae by subtracting the perspective acceleration contribution, . Because of its simplicity, we followed the latter method: was calculated from Eq. 5, using preliminary values of , and , and was subtracted to the AL abscissa. This correction was made for transits of all stars of known radial velocity when they were closer than 200 pc, perspective effects being negligible beyond that. The astrometric binary solutions obtained for stars whose measurements have been corrected for perspective acceleration are identified by a flag in the solution tables.
3.3 New calculation of the single star solution
The above operations may have changed the transits sufficiently that the star should no longer be considered a potential binary. For this reason, the single star solutions were recalculated, and they were accepted when was zero or negative. Of the stars that entered the processing chain, only 28 left at this stage. The rest were passed through the acceleration models.
4 The acceleration solutions
Acceleration models were already introduced in the reduction of the Hipparcos catalogue in order to describe binary stars that could not be covered by the orbital model because their period was too long. Two models were considered: the constant acceleration model, where the trajectory is a parabola, and the variable acceleration model, which includes the time derivatives of the acceleration. The calculation of the solutions from these models was done with rejection of outliers, as explained at the end of Sect. 3.1.
4.1 The constant acceleration model
Under the effect of the gravitational force, the two components of a binary are accelerated towards each other. If they cannot be resolved and the photocentre does not coincide with the barycentre, the apparent motion of their photocentre is accelerated; when the orbital period is much longer than the time interval covered by the observations, the acceleration is nearly constant in intensity and direction. These physical considerations are at the origin of the constant acceleration model. Each coordinate of the photocentre varies with time according to a parabola. However, fitting a parabola to the set of observations means setting the mid-mission position of the star as given by the first two terms of the solution, so that it is tangent to the parabola, and therefore far from the mean position of the star. To provide a position closer to the mean position of the star, we have followed the method explained in the Hipparcos catalogue (ESA, 1997, vol. 1, Sect. 2.3.3), and the partial derivatives of the abscissa with respect to the acceleration, , are given by the following equations:
(6) |
where is 2016.0 as in Sect. 3.2, and is half the time interval covered by the observation of the star. For simplicity, the same value of was assumed for all stars, and we have adopted half the time between the start of the Gaia science mission and the last DR3 observation, which, according to the information available when the software was finalised, lead to d, or 1.417 yr. This is an overestimation, however: in fact, the last observation of the DR3 was made on 28 May 2017, 3 days later than we had assumed, but, on the other hand, the astrometric data of the first month of the Gaia scientific mission were not taken into account (Lindegren et al., 2021). Moreover, the actual duration covered by the observations of a star is necessarily shorter than the duration covered by all the observations. For all these reasons, should be rather of the order of 470 days. However, the resulting shift in the mean position is very small: 0.06 mas times the acceleration in mas.yr-2.
4.2 The variable acceleration model
When the gravitational force is significantly changing over the duration of the mission, but when the orbital period is still much longer than the observation time span, the time derivative of the acceleration, , is added to the parameters of the model. However, the introduction of these parameters could modify the proper motion of the star. In order to derive a proper motion corresponding to the average displacement of the photocentre during the mission, the partial derivatives of the abscissa with respect to and are given by the equations:
(7) |
where is the same as in Sect. 4.1 above. We emphasise that the only effect of introducing is to produce a position and proper motion similar to that which would result from applying the single star model. Since acceleration models cannot locate the barycentre of a binary, these parameters are still affected by the orbital motion of the photocentre relative to the barycentre.
The significance of the variable acceleration model is defined from the vector. As above, it is derived from Eq. 3, as explained in Sect. 2.2.2.
It is worth noticing that the acceleration models are both linear models, and that the solutions are found by solving a system of linear equations by singular value decomposition.
In the Hipparcos reduction, the constant acceleration solution was considered as acceptable when the acceleration was significant, but the variable acceleration solution was always tried, and, when it was significant, it was preferred to the constant acceleration solution. We have kept this strategy, and, for that reason, the variable acceleration model was tried first. The variable acceleration solution was retained when it satisfied the conditions: and . Otherwise, the constant acceleration solution was derived, and it was accepted when it satisfied the same conditions. Because of this acceptance, no other models were tried. Therefore acceleration solutions were retained for many binary stars with periods shorter than the duration of the Gaia mission. This choice was originally made to save computing time; this reason was no longer relevant at the time of processing the DR3, but the strategy remained unchanged because it was considered that the risk of producing an orbit with a wrong period was more serious than that of giving only a polynomial fit for a trajectory that could be described by an ellipse. This is a matter of judgement which may be considered questionable, and will probably be approached differently in the next data release.
4.3 Selection of the acceleration solutions
To delimit a domain of acceptance of the solutions, we consider, for both acceleration models, the true acceleration projected on the sky, called in the following. is derived from the equation:
(8) |
When the apparent acceleration is expressed in mas.yr-2 and when the parallax is in mas, is obtained in au.yr-2.
In order to estimate a maximum value for , we consider a bright star with a dark companion, so that the acceleration of the photocentre is that of the bright star. Applying the law of gravitation, and assuming that the distance between the components is approximately equal to the semi-major axis of the orbit of the bright star around the companion, we deduce from Kepler’s third law that the acceleration is then:
(9) |
where is the period, the mass of the bright star and the mass ratio, i.e. the ratio between the mass of the dark companion and .
An upper limit of is obtained when the bright star is a giant solar-mass star with a companion still on the main sequence or on the subgiant branch, with slightly less than 1, and when the period is twice the duration of the Gaia mission. Such a system is not frequent, but not exceptional in a sample bounded by a limit in magnitude. With a period of 5.7 yr, the acceleration may be as large as about 2.4 for a circular orbit; over half a period, the average acceleration can then be as high as 2.0 . This result is however a rough estimate: We have neglected the eccentricity, which will produce a larger acceleration near the periastron, and we have assumed a period much longer than the duration of the mission, whereas we know that many solutions are for shorter periods. Considering the distribution of values obtained from the solutions (see Fig. 2, (a) and (b)), we see a concentration below , which we adopt as the limit for an acceptable acceleration solution. Stars whose acceleration is really beyond this limit are certainly very rare, but they are also objects of great scientific interest, such as binaries with a massive dark component, and for this reason we do not reject them directly.




The following conditions were introduced to restrict the proportion of larger than this limit:
-
1.
As mentioned in Sect. 4.2 above, the main processing was performed with the threshold for direct acceptance and for the alternative solutions. In addition, conditions based on the significance of the parallax were introduced for both levels: for the constant acceleration model, and for the variable acceleration model. These conditions have been set because they very effectively reduce the amount of high accelerations; it is worth noticing that, in fact, they are not criteria based on the quality of the solutions: reading them from right to left, they both mean that, for a given distance, the most significant solutions are rejected. This results in rejecting the largest accelerations because many of them are highly significant, and their occurence rate is thus artificially reduced. However, it was necessary to apply these criteria to avoid too much pollution by partially resolved double stars. Another beneficial consequence of this filter is that the short period binaries whose acceleration solution was rejected in this way were able to have an orbital solution. Including alternative solutions, the main processing yielded constant acceleration solutions and variable acceleration solutions. These solutions must be filtered in the post-processing to keep only those that seem most likely to be real.
-
2.
The significance, , must be over 20 for both acceleration models.
-
3.
The goodness of fit, , must be below 22 for the constant acceleration model, and below 25 for the variable acceleration model. The limit is more severe for the constant acceleration solutions so that the sequences of the HR diagrams deduced from the parallaxes of these solutions are always thinner than those deduced from the single star model.
Conditions (2) and (3) reduced the number of solutions to for constant accelerations, and for variable accelerations.
The two lower panels of Fig. 2 show the distributions of the selected solutions in the vs. diagrams. Thanks to the parallax filtering, item (1) above, the solutions with au.yr-2 are very rare: 0.03 % of the constant acceleration solutions, and 0.3 % for the variable accelerations. If filters (2) and (3) are applied to a test sample unaffected by filter (1), the proportions of large are 3.1 and 5.6 % respectively. Unless one is prepared to sacrifice the vast majority of solutions, these high rates resist the quality criteria we have tried; this issue is still being discussed in Sect. 7, in the light of independent investigations. Several origins have been considered for the high acceleration solutions: short-period binaries, partially resolved binaries or outlier astrometric measurements. We can only conclude that the stars with acceleration solutions are most likely genuine binaries, but the physical interpretation of the accelerations is hazardous.
5 The orbital solutions
An orbital solution is calculated when the orbit has been observed either in its entirety or over a sufficient portion to extrapolate the missing part.
5.1 Calculation of the orbital solutions
The abscissa of an unresolved astrometric binary can be written as in the following equation:
(10) |
where is the abscissa of the barycentre, which is given by the single star model, , , and are the Thiele-Innes (TI, hereafter) elements, whose definition is given in Eq. 20, is the eccentricity of the orbit, and is the eccentric anomaly. depends on the observation epoch, but also on the orbital period, , on , and on the periastron epoch, , so the model is finally based on 12 unknowns: the 5 parameters of the single star solution, and , , , , , , ; the partial derivatives of with respect to the TI elements are given in Eq. 10, and those with respect to , and can be found in Goldin & Makarov (2006).
The derivation of the orbital solutions is based on the Levenberg-Marquardt algorithm, but this can only lead to the solution if it starts from a good starting point. In practice, a calculation starting from often leads to the right result if the period tested is close to the actual period, but scanning a grid covering the space of , and is still necessary to be sure to find the solution. The period range tried is from 10 days to the duration covered by the observations of the star divided by 0.6. The solutions are calculated and published in terms of TI elements, but the calculation of the elements usually used to describe an orbit, which are the so-called Campbell elements (semi-major axis, inclination of the orbit, position angle of the ascending node and periastron longitude), is presented in Appendix A, where the calculations of their uncertainties are also given. The extension of this calculation to the Thiele-Innes elements introduced by both astrometric and spectroscopic orbits is presented in Appendix B.
The significance of the orbital model is defined as the semi-major axis of the orbit, , divided by its uncertainty. This definition of the significance was adopted since a significant orbit is expected to have a large semi-major axis, when, on the contrary, an orbit with a semi-major axis close to or smaller than its uncertainty may be obtained for a single star. The semi-major axis is not explicitly a parameter of the orbital model that we have used, but it may be derived from the TI elements, thanks to Eq. 21. Its uncertainty is derived from Eq. 26 and Eq. 27.
5.2 Pseudo-circular and circular orbital solutions
Rare orbits (8 in total) have been directly calculated as circular and are given with . These calculations required special arrangements, which have been also adopted for orbits with very small eccentricities, as explained below.
Preliminary calculations showed that the eccentricities below 0.006 were all at least three times smaller than their uncertainties. For such solutions, the uncertainties of the TI elements are also often very large, as is the uncertainty of , which can be larger than the period. These anomalies arise from the indeterminacy of the position of the periastron, affecting the TI elements through their dependence on the longitude of the periastron, (see Eq. 20). They do not affect the validity of the solution, for which the Campbell elements calculated as explained in Appendix A have correct uncertainties, except for , whose uncertainty far exceeds ; however, they may confuse the user.
In order to limit these effects, orbits with eccentricity less than 0.0005 have been “pseudo-circularised” in the following way: The Campbell elements were derived, and the periastron longitude, , was set to 0 in order to place the periastron on the ascending node. Therefore, the periastron epoch, , has been advanced by subtracting the time interval corresponding to the orbit section between the ascending node and the initial periastron, which for a circular orbit is . As the initial orbit is not circular, this formula is an approximation that is only possible for very small eccentricities; this is why it was only done for . The TI elements were then recalculated, keeping the semi-major axis, orbit inclination and line-of-nodes position angle of the initial solution.
After these transformations, the solution is based on only 10 unknowns: and are fixed, and only 3 of the 4 TI elements remain, since implies that , or . Therefore, was no longer considered as an unknown provided that mas. Otherwise, it was that had to be removed, but this did not happen in practice. Thus, the solution variance-covariance matrix was recalculated for the 10 selected unknowns. This matrix was inserted into the matrix of orbital solutions, setting the unnecessary terms to 0.
In the end, this gives quite reasonable uncertainties for the TI elements and for . However, the solution will give an uncertainty and correlation terms that are zero for , which is wrong and needs to be corrected in order to deduce the uncertainties of Campbell’s elements as explained in Appendix A. The uncertainty of , , is derived from the equation:
(11) |
The correlation coefficients relating to the other TI elements are:
(12) |
These modifications must also be made to circular orbits if the uncertainties of their Campbell elements are to be calculated.
5.3 Selection of the orbital solutions






The orbital solutions were first selected according to the thesholds set out in Sect. 2.2.2, the significance being calculated as explained in the previous section. In order to identify doubtful solutions, we have taken into account the mass function, which is derived from:
(13) |
where is the period in days, and where and are in the same units; is thus obtained in solar masses. Taking into account that refers to the orbit of the photocentre, the third Kepler’s law gives the following expression:
(14) |
where and are the photometric fluxes of the brightest and of the faintest component, respectively. refers to the masses. When is negligible compared to , this equation becomes:
(15) |
where and where the index “1” refers to the brightest component.
Panel (a) of Fig. 3 shows the period vs. mass function diagram. There are two kinds of false solutions:
-
•
Solutions of . This limit looks rather large for dwarf stars, but roughly acceptable for binaries with a giant component and a component still close to the main sequence, or for a triple system with a massive pair as a secondary component.
-
•
Solutions with particular periods, which probably depend on the 63-day precession period of the satellite. These solutions have usually very large mass function,but they descend sufficiently into the small range to contaminate plausible solutions.
A large part of the last category of false solutions was discarded by using a filter based on the parallax significance, as indicated in Fig. 3b. This filter consists of selecting only the solutions that satisfy the following condition:
(16) |
and it allowed to keep astrometric orbits as a result of the main processing.
Although the filter of Eq. 16 has rejected most of the large mass function solutions, a few concentrations around particular periods still persist in the processing results (Fig. 3c and d). Most of these remaining false solutions were eliminated in the post-processing, where only solutions satisfying the following two filters were selected:
-
•
A filter based on the quality of the eccentricity: , where is in days.
-
•
A filter based on the significance: , where is again in days.
After these operations, about orbits were retained, and the concentrations along particular periods have all disappeared, as shown in Fig. 3e. Moreover, very few orbits have eccentricities beyond the 10-day circularisation limit, calculated according to Halbwachs, Mayor & Udry (2005). The proportion of is now about 1 %, but it can be seen in Fig. 3f that it increases considerably with the period: for example, it is only 0.2 % for periods of less than 1 year, but reaches 3.3 % for periods longer than 1000 days.
Subsequently, all the main processing astrometric orbits were compared to the SB1 spectroscopic orbits derived from Gaia radial velocities (Gosset et al., 2022). The matching of the orbits of the two types resulted in nearly combined orbits, of which about 2470 were recovered after having been rejected by the post-processing. At the same time, almost 3500 post-processing orbital solutions were rejected by the validation (Babusiaux et al., 2022), or discarded as redundant with solutions obtained with Markov Chain Monte Carlo and Genetic Algorithms (Holl et al., 2022). In the end, our contribution to the DR3 astrometric orbital solutions amounts to nearly 165500 orbits.
6 The variability-induced movers
The variability-induced movers (VIM hereafter) were first described by Wielen (1996). They consist in unresolved binary systems containing one photometrically variable component. The result is a photocentre which is moving between the components in accordance with fluctuations in the total brightness of the system. Simultaneously, it is following the orbital motion, if this is perceptible over the duration of the mission. Several types of VIM were expected from the Gaia astrometric and photometric measurements: when the orbital motion is not perceptible, the system is a fixed VIM, or VIMF. When the orbital motion looks like a motion of the components relative to each other, in a straight line and at constant velocity, the system is a VIM with linear motion, or VIML. This is followed by VIM with acceleration (constant or variable) or VIMA, and then VIM with orbital motion, or VIMO. Only VIMFs are described in detail below, as the others could not be included in the DR3.
6.1 The VIM with fixed components (VIMF) model
The additional parameters specific to the VIMF model are the coordinates of the photocentre, , measured from the variable component when the total photometric flux of the system is equal to a reference flux, noted . In practice, is the median flux of all transits of the system. The partial derivatives of the abscissa with respect to an are given by the following equations:
(17) |
where is the photometric flux of the transit. Therefore, the position of the system and its proper motion also apply to the photocentre when the total flux is .
The parameters of the VIMF model are thus derived from a linear system that is solved by singular value decomposition, but the uncertainty of each transit deserves a dedicated calculation, since it depends on the astrometric uncertainty of the measured abscissa, , but also on the uncertainty of the photometric flux, , on which an error from the model depends. For each transit, this error, , is calculated from the equation:
(18) |
The total uncertainty of transit is then given by the equation:
(19) |
The uncertainties are used to derive any VIMF solution, as well as and the correction coefficient as explained in Sect. 2.2.1. As depends on and , and thus on the VIMF solution, the calculation is iterative and the VIMF model cannot be considered linear in the strict sense. The significance of the solution is defined from the modulus of the vector, as explained in Sect. 2.2.2.
As far as the VIML model is concerned hereafter, suffice it to say that it has two more parameters, which are the time derivatives of and .
6.2 Selection of the VIMF solutions

The main criterion we have for estimating whether a VIMF solution is plausible is the apparent distance that separates the median photocentre from the variable component. As the resolution of Gaia is at least 180 mas (Lindegren et al., 2021) and the components of a VIM should never be separated, must remain under a limit of no more than a hundred mas; we are therefore looking here for quality criteria such that this condition is met.
The vs. and significance vs diagrams obtained after a first processing trial do not immediately show any limits in or significance that would allow almost only mas. Fortunately, a selection becomes possible when we consider the “parallax significance”, i.e. the quantity . It is clear from Fig. 4 that the selection of solutions of parallax significance greater than 30 rejects most excessive values of . Therefore, this criterion was applied in the main processing, in addition to the criteria and . As before, the criteria for selecting alternative solutions were and less than that of the alternative solution that may have already been selected.
Two thousand five hundred and eight VIMF solutions were obtained from the main processing. The vs. diagram of these stars is shown in Fig. 5a. As for the previous models, solutions with higher than 25 seem doubtful, because the proportion of large values in them increases, and, even more so, small values become increasingly rare as increases. They are therefore rejected, which leaves 1660 solutions. The significance vs. diagram of these solutions shows a large scattering of values when significance is below 20, especially below 12. However, the threshold was set at 20 as a precaution. Beyond that, increases with (Fig. 5b), which corresponds to a value of that is approximately constant and close to one tenth of mas, which seems acceptable.
The 869 VIMF solutions thus retained in the end have distances below 40 mas, with three exceptions: three solutions of significance close to 100, when is 55, 56 and 101 mas. These values are not unrealistic, and it seems possible that all VIMF solutions are valid. Unfortunately, this may also be due to the low effectiveness of using to point out dubious solutions, as the application of other VIM models has shown below.


6.3 The other VIM models
As explained at the beginning of this section, four other VIM models were tried, from VIML to VIMO. Contrarely to the VIMF model, the parameters of these models allow the estimation of a minimum value for the mass of the systems. The result was devastating: for all these models, the minimum masses are concentrated between 10 and . We therefore decided to remove these models from the mass processing cascade.
7 Discussion and conclusion
This first publication of astrometric binaries of different types observed by Gaia was made under difficult conditions, since the uncertainties of the measurements were often very underestimated. However, we were able to calculate hundreds of thousands of reliable solutions by keeping the weights of the observations and correcting the uncertainties. Nevertheless, this was done at the cost of a drastic selection, based on very high quality criteria, and it would be difficult to use the selected solutions to deduce the statistical properties of binary stars, as the sample is incomplete and biased.
Because of the cascade structure that leads to the testing of different models, we may have retained solutions by direct acceptance that would have been superseded by another model, had it been tried. This choice has the advantage of preserving the quality of the orbital solutions, by assigning acceleration solutions to partially resolved double stars that would have been processed despite the filters set up to reject them. However, it was done at the expense of the completeness of the orbital solutions, and it is likely that some acceleration solutions actually correspond to binaries with periods short enough to have an orbital solution. Moreover, the quality criteria alone could not bring the acceleration solutions to a reasonable rate of high acceleration. Ulrich Bastian studied this problem long after the post-processing was done, and he estimated that constant acceleration solutions are only highly reliable when the parallax is greater than 5 mas. This criterion leaves only solutions, but none with acceleration greater than 3 au.yr-2. When only quality criteria are applied, ie. when criterion (1) of Sect. 4.3 is not applied, the restriction to parallaxes greater than 5 mas reduces the au.yr-2 rate from 3.1 % to less than 1 %; this confirms the effectiveness of this extremely drastic selection. For future DRs, it is hoped that the treatment of partially resolved double stars, as well as testing all astrometric models and choosing the one most compatible with the data, will allow a wider selection of acceleration solutions.
The selection of orbital solutions was particularly severe to reject orbits that are concentrated in certain periods, which are likely to be produced by the satellite’s scanning law. Despite these difficulties, about astrometric orbits were finally selected in DR3. The periods of the false orbits should be different in the following data releases, as the precession spin has been reversed during the last 6 months of observations that will be part of the DR4; however, if these noise-induced orbits do not disappear at this time, they could be more difficult to detect, being distributed over a larger number of periods. Furthermore, the lack of one-year periods, which comes from a degeneracy between parallactic and orbital motion, is expected to remain.
The acceleration and orbital models apply to unresolved binaries with either components that do not vary photometrically, or one component always much brighter than the other. This limits their interest in the study of young stars, or more generally, pulsating stars. VIM models extend the scientific scope of astrometric binaries, provided that only one of the components is photometrically variable. Unfortunately, the number of VIM solutions appears to be very modest, numbering only a few hundred, limited to the VIMs with fixed components. This rarity is partly due to the fact that the model has only been tried at the end of the cascade, and we do not know how many acceptable VIM solutions were lost due to the acceptance of a solution of another type.
When both components are variable, the VIM solution can be very degraded: When the brightnesses of the two stars vary in the same direction, the displacement of the photocentre can be very small while the brightness of the system varies greatly; for the VIMF model, this is interpreted as a short D distance. Conversely, if the brightness of one component increases while that of the other decreases, the fluctuation in total brightness that may be small while the photocentre displacement is large. As the luminosities of the components do not generally vary in phase, one can expect VIMF solutions of large , and therefore large , with values that can be minimal or, on the contrary, very large. A large leads to a decrease in significance, thanks to the coefficient of Eq. 2, and, as a result, the pollution of VIMF solutions by systems whose two components are photometrically variable will be limited thanks to the high acceptance thresholds that have been adopted.
Another difference between fixed-component VIMs and other solutions is that the selection could only be adjusted on the basis of the median distance between the photocentre and the photometric variable component, excluding any dynamic criteria. Therefore, these solutions should be considered with caution. In turn, it is possible that some acceleration or orbital solutions have been assigned to objects that should have received a VIM solution. However, these cases are too rare to constitute detectable pollution: Arenou et al. (2022) have shown that, statistically, acceleration solutions have proper motions affected by orbital motion, which would not be the case for VIMFs. Similarly, if VIMFs receive an orbital solution, the inclination of the solution must be 90 degrees, and the distribution of inclinations does not show an excess for this value.
Despite these limitations raised above, the DR3 is a major advance in the field of binary stars. The hundreds of thousands of new astrometric orbits will have considerable importance in the future, compared to the less than 3000 visual binary star orbits in the Sixth Catalog of Orbits of Visual Binary Stars111https://www.usno.navy.mil/USNO/astrometry/optical-IR-prod/wds/orb6. Such a change of scale could open a new era in the statistical study of binary stars, especially if the next DRs could be less affected by selection effects. Instead of looking at the properties of pairs with a primary component of a given type, it may be possible to consider pairs with a given total mass. Such an approach would shed new light on the star formation process, provided that the many biases affecting binary selection are taken into account.
Acknowledgements.
This work is part of the reduction of the Gaia satellite observations (https://www.cosmos.esa.int/gaia). The Gaia space mission is operated by the European Space Agency, and the data are being processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium)). The Gaia archive website is https://archives.esac.esa.int/gaia. Funding for the DPAC is provided by national institutions, in particular the institutions participating in the Gaia MultiLateral Agreement (MLA). We acknowledge for the financial support of the french “Centre National d’études spatiales” (CNES) and of the BELgian federal Science Policy Office (BELSPO) through a PROgramme de Développement d’Expériences scientifiques (PRODEX). Over the two decades that it took to develop the calculation methods and write the software, we benefited from the contributions of Sylvie Jancart, Rémy Onsay, Didier Pelat and Patrick Gempin. We also received constant support from François Mignard and Alain Jorissen. The acceptability of acceleration solutions was discussed with Ulrich Bastian and Lennart Lindegren, using data produced by Sergei Klioner and Hagen Steidelmüller. Finally, we are grateful to Michael Biermann and to an anonymous referee for their careful reading of the manuscript and the relevance of their comments.References
- Arenou et al. (2022) Arenou, F., Babusiaux, C., Barstow, M. et al. 2022 A&A, accepted
- Babusiaux et al. (2022) Babusiaux, C. et al. 2022 A&A, in preparation
- Binnendijk (1960) Binnendijk, L. 1960, Properties of double stars; a survey of parallaxes and orbits, University of Pennsylvania Press, Philadelphia
- Dravins, Lindegren & Madsen (1999) Dravins, D., Lindegren, L., & Madsen, S. 1999, A&A, 348, 1040
- ESA (1997) ESA 1997, The Hipparcos and Tycho Catalogue, SP-1200
- Gaia Collaboration (2021) Gaia Collaboration, Brown, A. G. A., Vallenari, A., Prusti, T., de Bruijne, J. H. J., Babusiaux, C. & M. Biermann, M. 2021, A&A, 649, A1
- Goldin & Makarov (2006) Goldin, A. & Makarov, V.V. 2006, ApJS, 166, 341
- Gosset et al. (2022) Gosset, E. et al. 2022 A&A, in preparation
- Halbwachs, Mayor & Udry (2005) Halbwachs, J.-L., Mayor, M. & Udry, S. 2005, A&A, 431, 1129
- Halbwachs (2009) Halbwachs, J.-L. 2009, MNRAS394, 1075
- Heintz (1978) Heintz,W.D. 1978, Double stars, Reidel, Dordrecht
- Holl et al. (2022) Holl, B., Sozzetti, A., Sahlmann, J. et al. 2022 A&A, in preparation
- Lindegren & Dravins (2003) Lindegren, L. & Dravins, D. 2003, A&A, 401, 1185
- Lindegren et al. (2021) Lindegren, L., Klioner, S.A., Hernández, J., et al. 2021, A&A, 649, A2
- Riello et al. (2021) Riello, M., De Angeli, F., Evans, D. W., et al. 2021, A&A, 649, A3
- Seabroke et al. (2021) Seabroke, G., Fabricius, C., Teyssier, D., et al. 2021, A&A, submitted
- Stuart & Ord (1994) Stuart A., Ord K., 1994, Kendall’s Advanced Theory of Statistics, vol. 1. Edward Arnold, London
- Wielen (1996) Wielen, R. 1996, A&A, 314, 679
Appendix A Conversion of Thiele-Innes elements () into Campbell elements () and calculation of the uncertainties
The Thiele-Innes (TI) elements, , , and are given by the following equations (Binnendijk, 1960; Heintz, 1978):
(20) |
where , , and are the Campbell elements. The definitions of these elements and their calculations from , , , are given hereafter:
-
•
(or for a photocentric orbit) is the semi-major axis of the orbit, in the same unit as the TI elements.
-
•
is the inclination of the orbit, i.e. the angle between the orbital plane and the plane of the sky. when the orbital motion is in the direct sense, and when the photocentre is revolving around the barycentre in a clockwise direction.
-
•
is the position angle of the ascending node. For us, the ascending node is the intersection between the orbital plane and the plane of the sky in the direction where the right ascension is increasing. Therefore, .
-
•
is the periastron longitude, measured in the orbital plane from the ascending node defined above. is measured in the sense ofthe orbital motion.
Binnendijk (1960) gives the following method for converting TI elements into Campbell elements:
The semi-major axis , is derived from:
(21) |
The angles and are derived simultaneously. Preliminary estimates, between and , are derived as follows:
(22) |
The exact values of are obtained, modulo , taking into account that and have the same sign. Similarly, has the same sign as . and are then derived from the equations:
(23) |
When is negative, must still be added to both and . After that, a correction of may still be applied in order to meet the conditions and .
To derive the inclination , we first need to calculate two intermediate terms, called and hereafter:
(24) |
In order to obtain the value of , the calculation is different depending on whether is greater than , or the reverse:
(25) |
The uncertainties of , , and are derived by differentiating the equations above, and by taking into account error propagation.
We call hereafter the error on element and (X,Y) the term of the variance covariance matrix of the solution linking the elements and , ie , where is the correlation coefficient between and . The uncertainties are then given by the equations below.
To derive the uncertainty of , the following intermediate terms must be calculated from and derived in Eq. 21:
(26) |
The uncertainty of the semi-major axis is then:
(27) |
The uncertainties of and are also derived using dedicated intermediate terms. We first calculate the followings:
(28) |
These terms are used to derive other intermediate terms dedicated to the calculation of the uncertainty of :
(29) |
The uncertainty of , is then derived with the equation hereafter:
(30) |
The terms from Eq. 28 are also used to compute the following intermediate terms, which are required to derive the uncertainty of :
(31) |
The uncertainty of is then:
(32) |
Like the derivation of , the derivation of the uncertainty of the inclination depends on a comparison between and , derived in Eq. 24 above.
is derived as follows. We first calculate the terms:
(33) |
if , the continuation of the calculation is as explained hereafter. An extra term, , is added to and :
(34) |
The intermediate terms dedicated to the calculation of are then:
(35) |
and, still for , is derived from the equation:
(36) |
When , the additional term is derived:
(37) |
The intermediate terms dedicated the calculation of are now:
(38) |
and, still for , is derived from the equation:
(39) |
The four Campbell elements and their uncertainties are thus deduced from the solutions calculated in TI elements. However, it is not possible to estimate the correlation coefficients between the terms.
Appendix B Conversion of Thiele-Innes elements () into elements () and calculation of the uncertainties
The and elements are functions of the and elements that come from the spectroscopic orbit:
(40) |
from which it appears that:
(41) |
knowing that sign.
must be approximately equal to either , or . This ambiguity is due to differences in definition: is the longitude of the periastron of the photocentre orbit, which, in the absence of radial velocity, is measured from the node whose position angle is between 0 and . For its part, is the longitude of the periastron of the orbit of the spectroscopically most visible component, measured from the node where this component moves away from the Sun. Therefore, the true orientation of the orbital plane will depend on the position of the photocentre with respect to to the barycentre and the spectroscopic component: if the photocentre is assumed to be between the barycentre and the spectroscopic component, then the true ascending node and periastron of the astrometric orbit are marked by the same angles as those of the spectroscopic orbit. If and are significantly different, it is necessary to add to to make them roughly coincide, and also to add to to take into account the change of reference node.
Alternatively, the photocentre may be opposite to the spectroscopic component (this may occur, for example, when the spectral type of the brightest component is very different from the templates used for spectroscopic reduction, so that the spectroscopic orbit corresponds to the least bright binary component). When this occurs, must be approximately equal to . If it is not, must be added to and must be added to for the orientation of the astrometric orbit to be completely defined.
The calculation of the uncertainty of depends on the values of and , given by Eq. 24. When , we compute the following terms:
(42) |
where , , , and are taken from Eq. 35, and
(43) |
The uncertainty is then calculated with equation:
(44) |
When , these equations become:
(45) |
where , , , and are taken from Eq. 38, and
(46) |
The uncertainty is then calculated with the equation:
(47) |
The uncertainty of , , is derived from:
(48) |