This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Light rays in the Solar system experiments: phases and displacements

Pravin Kumar Dahal and Daniel R. Terno Department of Physics and Astronomy, Macquarie University,
Sydney, NSW 2109, Australia
E-mail: [email protected]
Abstract

Geometric optics approximation is sufficient to describe the effects in the near-Earth environment. In this framework Faraday rotation is purely a reference frame (gauge) effect. However, it cannot be simply dismissed. Establishing local reference frame with respect to some distant stars leads to the Faraday phase error between the ground station and the spacecraft of the order of 101010^{-10} in the leading post-Newtonian expansion of the Earth’s gravitational field. While the Wigner phase of special relativity is of the order 10410^{-4}10510^{-5}. Both types of errors can be simultaneously mitigated by simple encoding procedures. We also present briefly the covariant formulation of geometric optic correction up to the subleading order approximation, which is necessary for the propagation of electromagnetic/ gravitational waves of large but finite frequencies. We use this formalism to obtain a closed form of the polarization dependent correction of the light ray trajectory in the leading order in a weak spherically symmetric gravitational field.

keywords:
Geometric optics; Wigner phase; Gravitational Faraday rotation; Post-Newtonian expansion; Gravitational spin Hall effect.
\bodymatter

1 Introduction

Space deployment of quantum technology [1, 2, 3] brings it into a weakly relativistic regime. As an unintended but fortunate side effect, low-Earth orbit (LEO) quantum communication satellites provide new opportunities to test fundamental physics. Once the tiny putative physical effects fall within the sensitivity range of these devices, they may impose constraints on practical quantum communications, time-keeping, or remote sensing tasks[4, 5, 6]. A more futuristic technology, such as the proposed solar gravity telescope[7, 8, 9] actually needs the general-relativistic effects for its operation.

For flying qubits that are implemented as polarization states of photons[10] the dominant source of relativistic errors in this setting is the Wigner rotation (or phase), an effect special relativity (SR)[4, 6]. Gravitational polarization rotation, also known as the gravitational Faraday effect [11, 12] occurs in a variety of astrophysical systems, such as accretion disks around astrophysical black hole candidates [13] or gravitational lensing [14]. This effect was the subject of a large number of theoretical investigations, primarily within the geometric optics approximation [11, 12, 14, 15, 16, 17], and also from the perspective of quantum communications[18]. Interpretation of these results was until recently sometimes contradictory, as it is important to carefully analyze the relation between the reference frames of the emitter and the detector. Moreover, even if at the leading order the gravitationally induced polarization rotation in the near-Earth environment is pure gauge effect, it cannot be simply dismissed. We provide a simple estimation of this emitter- and observer-dependent phase and give its explicit form in several settings.

Already in the leading post-eikonal order trajectories of light beams are affected by polarization.[19, 20, 21]. The optical gravitational spin Hall effect has recently received a comprehensive treatment in Refs. 23, 24. Using this formalism we obtain a closed form of the leading correction in case of a weak spherically symmetric gravitational field. As expected, on the scale of the Solar system — be it the near-Earth environment or a focal plane of the solar gravity telescope at 600a.u.— the effect is negligible. However, it is interesting conceptually and its scaling indicates that it may play a much more important role in the strong gravity regions[24].

The rest of this article is organized as follows. In Sec. 2 we review the SR effects. Polarization rotation in general stationary spacetimes is described in Sec. 3, where we also evaluate the effects in communication with Earth-orbiting satellites. In Sec. 4 we briefly summarize the main techniques of Refs. 23, 24 and then obtain the polarization-dependent changes in the light ray trajectories in the solar system experiments.

We work with c==G=1c=\hbar=G=1. The constants GG and cc are restored in a small number of expressions where their presence is helpful. The spacetime metric gμν\textsl{g}_{\mu\nu} has a signature +++-+++. The four-vectors are distinguished by the sans font, 𝗄{\mathsf{k}}, kμ=(𝗄)μk^{\mu}=({\mathsf{k}})^{\mu}. The three-dimensional spatial metric is denoted as γmn\gamma_{mn}, and three-dimensional vectors are set in boldface, 𝐤{\bf{k}}, or are referred to by their explicit coordinate form, kmk^{m}. The inner product in the metric γ\gamma is denoted as 𝐤𝐟{\bf{k}}\!\cdot\!\mathbf{f}, and the unit vectors in this metric are distinguished by the caret, 𝐤^\mathbf{\hat{k}}, 𝐤^𝐤^1\mathbf{\hat{k}}\cdot\mathbf{\hat{k}}\equiv 1. Post-Newtonian calculations employ a fiducial Euclidean space. Euclidean vectors are distinguished by arrows, k{\vec{k}}. Components of the two types of vectors may coincide, (𝐯)m=(b)m(\mathbf{v})^{m}=({\vec{b}})^{m}, but bk=k=13vmkm{\vec{b}}\!\cdot\!{\vec{k}}=\sum_{k=1}^{3}v^{m}k^{m}. Accordingly, the coordinate distance is the Euclidean length of the radius vector, rxxr\equiv\sqrt{{\vec{x}}\!\cdot\!{\vec{x}}}.

Refer to caption
Figure 1: Scheme of the communication round between the ground station (GS) and the spacecraft (SC). The light ray is highlighted in blue. All the vectors are indicated in the Earth-centered inertial frame. In the flat spacetime approximation the unit tangent vectors to the ray 𝐤^GS\mathbf{\hat{k}}_{\mathrm{GS}} and 𝐤^SC\mathbf{\hat{k}}_{\mathrm{SC}} at the GS and the SC, as well as the direction to an infinitely distant guide star, 𝜻^GS\mbox{\boldmath$\hat{\zeta}$}_{\mathrm{GS}} and 𝜻^SC\mbox{\boldmath$\hat{\zeta}$}_{\mathrm{SC}}, respectively coincide. Velocity of the GS at the emission of the signal is 𝐯\mathbf{v} and velocity of the SC at the moment of detection is 𝐮\mathbf{u}.

2 Wigner rotation and special relativistic effects

Consider one round of communications between the ground station (GS) and a low energy orbit spacecraft (SC). The problem is most conveniently analyzed in the geocentric system, the origin of which coincides with the centre of the Earth at the moment of emission. If we direct the zz-axis of the system along the Earth’s angular momentum, then the velocity 𝐯\mathbf{v} of the GS lies in the xy-plane. Whereas, the velocity 𝐮\mathbf{u} of the SC and the initial propagation direction 𝐤^\mathbf{\hat{k}} are arbitrary when expressed in the global frame. This setting is depicted in Fig. 1.

Quantum states of a photon with a definite four-momentum 𝗄=(|𝐤|,𝐤){\mathsf{k}}=(|{\bf{k}}|,{\bf{k}}) can be represented either as Hilbert space vectors or as complex polarization vectors in the usual three-dimensional space,

|Ψ𝗄=f+|𝗄,++f|𝗄,𝐟^𝐤=f+𝐛^𝐤^++f𝐛^𝐤^,|\Psi_{\mathsf{k}}\rangle=f_{+}|{\mathsf{k}},+\rangle+f_{-}|{\mathsf{k}},-\rangle\quad\Leftrightarrow\quad\mathbf{\hat{f}}_{\bf{k}}=f_{+}{\mathbf{\hat{b}}}_{\mathbf{\hat{k}}}^{+}+f_{-}{\mathbf{\hat{b}}}_{\mathbf{\hat{k}}}^{-}, (1)

with the transversal vectors 𝐛^𝐤^±\mathbf{\hat{b}}^{\pm}_{\mathbf{\hat{k}}}, 𝐤𝐛^𝐤^±=0{\bf{k}}\cdot\mathbf{\hat{b}}^{\pm}_{\mathbf{\hat{k}}}=0. The correspondence is rooted in the relationship between finite-dimensional and unitary representations of the Poincaré group [26, 27].

Unitary operators UU that describe the state transformation between the Lorentz frames are obtained via the induced representation of the Poincaré group. Basis states that correspond to an arbitrary momentum (𝗄)μkμ({\mathsf{k}})^{\mu}\equiv k^{\mu} are defined with the help of standard Lorentz transformation L(kμ)L(k^{\mu}), that takes the four momentum from the standard value kSμk_{\mathrm{S}}^{\mu} to kμk^{\mu}. For massless particles, kSμ=(1,0,0,1)k_{\mathrm{S}}^{\mu}=(1,0,0,1) and

L(kμ)=R(𝐤^)Bz(ξ|𝐤|),L(k^{\mu})=R(\mathbf{\hat{k}})B_{z}(\xi_{|{\bf{k}}|}), (2)

where R(𝐤^)=Rz(ϕ)Ry(θ)R(\mathbf{\hat{k}})=R_{z}(\phi)R_{y}(\theta) rotates the zz-axis into the direction 𝐤^\mathbf{\hat{k}} by performing rotations around the yy- and zz-axis by angles θ\theta and ϕ\phi, respectively. These rotations follow the boost Bz(ξ|𝐤|)B_{z}(\xi_{|{\bf{k}}|}) along the zz-axis, that brings the magnitude of momentum to |𝐤||{\bf{k}}|.

The states of an arbitrary momentum are defined via

|𝗄,±:=U(L(𝗄))|𝗄S,±,|{\mathsf{k}},\pm\rangle\vcentcolon=U\big{(}L({\mathsf{k}})\big{)}|{\mathsf{k}}_{\mathrm{S}},\pm\rangle, (3)

while the standard right- and left-circular polarization vectors are defined as

𝐛^𝐤^±:=R(𝐤^)(𝐱^±i𝐲^)/2,\mathbf{\hat{b}}_{\mathbf{\hat{k}}}^{\pm}\vcentcolon=R(\mathbf{\hat{k}})(\hat{\mathbf{x}}\pm i\hat{\mathbf{y}})/\sqrt{2}, (4)

where the linear polarization vectors are 𝐛^1:=R(𝐤^)𝐱^\mathbf{\hat{b}}_{1}\vcentcolon=R(\mathbf{\hat{k}})\hat{\mathbf{x}} and 𝐛^2:=R(𝐤^)𝐲^\mathbf{\hat{b}}_{2}\vcentcolon=R(\mathbf{\hat{k}})\hat{\mathbf{y}}, respectively. Alternatively, these vectors can be obtained as

𝐛^2=𝐳^×𝐤^|𝐳^×𝐤^|,𝐛^1=𝐛^2×𝐤^.\mathbf{\hat{b}}_{2}=\frac{\hat{\mathbf{z}}\times\hat{\mathbf{k}}}{|\hat{\mathbf{z}}\times\mathbf{\hat{k}}|},\qquad\mathbf{\hat{b}}_{1}=\mathbf{\hat{b}}_{2}\times\mathbf{\hat{k}}. (5)

The explicit form of the polarization four-vector 𝖿𝗄\mathsf{f}_{\mathsf{k}}, 𝖿𝗄𝗄=0\mathsf{f}_{\mathsf{k}}\cdot{\mathsf{k}}=0 depends on the gauge [4, 18, 22].

Under arbitrary Lorentz transformation, states transform (apart from the normalization factor) via

U(Λ)|𝗄,±=U(ΛL(𝗄))|𝗄S,±=U(L(Λ𝗄))U(W)|𝗄s,±,U(\Lambda)|{\mathsf{k}},\pm\rangle=U(\Lambda L({\mathsf{k}}))|{\mathsf{k}}_{\mathrm{S}},\pm\rangle=U(L(\Lambda{\mathsf{k}}))U(W)|{\mathsf{k}}_{s},\pm\rangle, (6)

where the transformation W=L1(Λ𝗄)ΛL(𝗄)W=L^{-1}(\Lambda{\mathsf{k}})\Lambda L({\mathsf{k}}) to the subgroup (Wigner’s little group) group that leaves 𝗄s{\mathsf{k}}_{s} invariant,

U(W)|𝗄S,σ=±D(W)σσ|𝗄S,σ.U(W)|{\mathsf{k}}_{\mathrm{S}},\sigma\rangle=\sum_{\pm^{\prime}}D(W)_{\sigma\sigma^{\prime}}|{\mathsf{k}}_{\mathrm{S}},\sigma^{\prime}\rangle. (7)

The matrices D(W)σσD(W)_{\sigma\sigma^{\prime}} form the representation of the little group. For massless particles an element of the little group WW can be decomposed as

W=SRz(ϖ),W=SR_{z}(\varpi), (8)

where SS is a translation in the (xy)(xy)-plane and Rz(ϖ)R_{z}(\varpi) a rotation. As translations do not contribute to the physical degrees of freedom of photons, the state |𝗄,±|{\mathsf{k}},\pm\rangle transforms as

U(Λ)|𝗄,±=e±iϖ|Λ𝗄,±.U(\Lambda)|{\mathsf{k}},\pm\rangle=e^{\pm i\varpi}|\Lambda{\mathsf{k}},\pm\rangle. (9)

There are no generic explicit expressions for ϖ\varpi. Their evaluation is not considerably simpler if Λ=\Lambda={\mathscr{R}}, where {\mathscr{R}} is a rotation (as there is no risk of confusion we use the same designation for the four-dimensional matrices of spatial dimensions and for their 3×33\times 3 blocks). However, as the transformation law of 𝐟^\mathbf{\hat{f}} can be obtained from the three-dimensional form of the Lorentz transformations of the transversal electromagnetic wave, in this case [4, 28]

U()|Ψ𝗄𝐟^𝐤.U({\mathscr{R}})|\Psi_{\mathsf{k}}\rangle\Leftrightarrow{\mathscr{R}}\mathbf{\hat{f}}_{\bf{k}}. (10)

Moreover, an arbitrary rotation around the direction 𝐛^2\mathbf{\hat{b}}_{2}, R𝐛^2(α)R_{\mathbf{\hat{b}}_{2}}(\alpha), does not introduce a phase ϖ\varpi [16]. This provides the motivation for introduction of the so-called Newton gauge that we review below.

In communications with the Earth-orbiting satellites settings of Fig. 1 the Wigner phase is the dominant relativistic effect[30]. While the Wigner phase is of the order 10410^{-4}10510^{-5}, we will see below that establishing the local reference frame with respect to some distant stars leads to the Faraday phase error of the order of 101010^{-10}.

3 Faraday rotation and general relativistic effects

Here we describe the effects of gravity on polarization in the geometric optics approximation, wave it can be considered a vector that is simply affected by the geodesic motion of null particles to which it is attached. We present the gravitational Faraday effect in a way that clearly separates the gauge-independent part from the effects of the reference frame choices[16] and is convenient for the near-Earth calculations that use the post-Newtonian approximation. Then we demonstrate that at the leading order the polarization rotation is a purely gauge effect and evaluate it for a practically useful choice of the reference frames.

The equation of geometric optics are obtained by performing the short wave expansion of the wave equation in the Lorentz gauge[29, 31],

Aμ=aμeiψ,A^{\mu}=a^{\mu}e^{i{\psi}}, (11)

where aμa^{\mu} is the slowly varying complex amplitude and ψ{\psi} is the rapidly varying real phase. In later calculations, we use the wave vector kμ:=ψ,μk_{\mu}\vcentcolon={\psi}_{,\mu}, the squared amplitude a=(aμaμ)1/2a=\left(a^{\mu*}a_{\mu}\right)^{1/2} and the polarization vector fμ=aμ/af^{\mu}=a^{\mu}/a is transverse to the trajectory, fμkμ=0f^{\mu}k_{\mu}=0.

The eikonal equation

gμνψxμψxν=0,\textsl{g}^{\mu\nu}\frac{\partial\psi}{\partial x^{\mu}}\frac{\partial\psi}{\partial x^{\nu}}=0, (12)

is the leading term in the expansion of the wave equation [29, 31] is the Hamilton-Jacobi equation for a free massless particle on a given background spacetime. It allows description of light propagation in terms of fictitious massless particles.

The wave vector kμk_{\mu}, which is normal to the hypersurface of constant phase is null kμkμ=0k^{\mu}k_{\mu}=0, and is geodesic

kμμkν=0,k^{\mu}\nabla_{\mu}k^{\nu}=0, (13)

as it is the gradient of a scalar function. Similarly, the polarization vector fμf^{\mu} is parallel propagated along it

kμμfν=0.k^{\mu}\nabla_{\mu}f^{\nu}=0. (14)

A convenient three dimensional representation of the evolution of polarization vectors is possible in stationary spacetimes. Static observers follow the congruence of timelike Killing geodesics that define projection from spacetime manifold \cal M onto the three dimensional space Σ3\Sigma_{3}, Π:Σ3\Pi:{\cal M}\to\Sigma_{3}. The metric gμν\textsl{g}_{\mu\nu} on \cal M can be written in terms of three dimensional scalar hh, a vector 𝐠\mathbf{g} with component gm\textsl{g}_{m} and a metric γmn\gamma_{mn} on Σ3\Sigma_{3} as

ds2=h(dx0gmdxm)2+dl2,d\textsl{s}^{2}=-h\left(dx^{0}-\textsl{g}_{m}dx^{m}\right)^{2}+dl^{2}, (15)

where h=g00h=-\textsl{g}_{00}, gm=g0m/g00g_{m}=-\textsl{g}_{0m}/\textsl{g}_{00}, and the three dimensional distance dl2=γmndxmdxndl^{2}=\gamma_{mn}dx^{m}dx^{n}, where

γmn=gmng0mg0ng00.\gamma_{mn}=\textsl{g}_{mn}-\frac{\textsl{g}_{0m}\textsl{g}_{0n}}{\textsl{g}_{00}}. (16)

Using the relationships between the three and four dimensional covariant derivatives, the propagation equations in a stationary spacetimes result in the following three dimensional equations

D𝐤^Ds=𝛀×𝐤^,D𝐟^Ds=𝛀×𝐟^,\frac{D\mathbf{\hat{k}}}{Ds}=\mbox{\boldmath$\Omega$}\times\mathbf{\hat{k}},\qquad\frac{D\mathbf{\hat{f}}}{Ds}=\mbox{\boldmath$\Omega$}\times\mathbf{\hat{f}}, (17)

where D/DsD/Ds is the covariant derivative in three dimensional space with metric γmn\gamma_{mn}. Thus, both the polarization and propagation vectors are rigidly rotated with an angular velocity

𝛀=2𝝎(𝝎.𝐤^)𝐤^𝐄g×𝐤^,\mbox{\boldmath$\Omega$}=2\mbox{\boldmath$\omega$}-(\mbox{\boldmath$\omega$}.\mathbf{\hat{k}})\mathbf{\hat{k}}-\mathbf{E}_{g}\times\mathbf{\hat{k}}, (18)

where 𝝎\omega and 𝐄g\mathbf{E}_{g} could be interpreted as the gravitoelectric and gravitomagnetic field respectively

𝝎=12k0×𝐠,𝐄g=h2h.\mbox{\boldmath$\omega$}=-\frac{1}{2}k_{0}\nabla\times\mathbf{g},\qquad\mathbf{E}_{g}=-\frac{\nabla h}{2h}. (19)

In flat spacetimes, polarization basis is uniquely fixed by the Wigner little group construction. However, in general curved background, Wigner construction must be performed at every point. This is because, in the absence of a global reference direction, the standard polarization triad (𝐛^1,𝐛^2,𝐤^)(\mathbf{\hat{b}}_{1},\mathbf{\hat{b}}_{2},\mathbf{\hat{k}}) is different at every locations. Given such choice, the net polarization can be found by starting with the initial polarization ffin(xin)f_{\mathrm{fin}}(x_{\mathrm{in}}), parallel propagating it according to the rule Eq. (17) and then using the decomposition

ffin(xfin)=cosχ𝐛^1+sinχ𝐛^2,f_{\mathrm{fin}}(x_{\mathrm{fin}})=\cos\chi\mathbf{\hat{b}}_{1}+\sin\chi\mathbf{\hat{b}}_{2}, (20)

to read off an angle. For Schwarzschild spacetime, 𝝎=0\mbox{\boldmath$\omega$}=0, and thus differentiation of this equation gives

dχds=1𝐟^𝐛^1𝐟^D𝐛^2Ds.\frac{d\chi}{ds}=\frac{1}{\mathbf{\hat{f}}\cdot\mathbf{\hat{b}}_{1}}\mathbf{\hat{f}}\cdot\frac{D\mathbf{\hat{b}}_{2}}{Ds}. (21)

This equation implies that polarization rotation is a pure gauge effect. The phase remains zero if the standard directions are set with the help of the local free fall acceleration 𝐰\mathbf{w} of a stationary observer. At each point in the spacetime we choose the direction of the standard reference momentum, or equivalently the zz-axis of our standard polarization triad, to be 𝜻^:=𝐰\mbox{\boldmath$\hat{\zeta}$}\vcentcolon=\mathbf{w}. For a photon with momentum 𝐤\mathbf{k} we choose the linear polarization vector 𝐛^2\hat{\mathbf{b}}_{2} to point in the direction 𝜻^×𝐤^\mbox{\boldmath$\hat{\zeta}$}\times\hat{\mathbf{k}}, and finally we choose 𝐛^1:=𝐛^2×𝐤^\hat{\mathbf{b}}_{1}\vcentcolon=\mathbf{\hat{b}}_{2}\times\mathbf{\hat{k}} such that it completes the orthonormal triad (𝐛^1,𝐛^2,𝐤^)(\hat{\mathbf{b}}_{1},\hat{\mathbf{b}}_{2},\hat{\mathbf{k}}). This construction is known as the Newton gauge [16]. With this convention 𝛀=𝐄g×𝐤Ω𝐛^2\mbox{\boldmath$\Omega$}=-\mathbf{E}_{g}\times{\bf{k}}\equiv\Omega\mathbf{\hat{b}}_{2} and thus χ0\chi\equiv 0 along the trajectory. However, such choice of standard polarization direction is practically unfeasible. We will see below the consequences of setting the zz-axis with the help of a guide star.

Electromagnetic radiation and massless particles are not affected by Newtonian gravity. The post-Newtonian expansion[25] is conveniently organized in powers of ϵ2GM/c2v2/c2\epsilon^{2}\sim GM/c^{2}\ell\sim v^{2}/c^{2}, where GM/-GM/\ell is the (maximal) typical potential and vv is a typical velocity of massive particles. The parameter ϵ\epsilon helps to keep track of the orders and is set to unity at the end of the calculations. The leading post-Newtonian contributions are of order ϵ2\epsilon^{2}; to take gravitomagnetic effects into account, we need contributions up to ϵ3\epsilon^{3}.

The post-Newtonian expansion of the metric near a single slowly rotating quasirigid gravitating body, up to ϵ3\epsilon^{3}, assuming that the underlying theory of gravity is general relativity is

ds2=V2(r)c2dt2+Rdxcdt+W2(r)dxdx,d\textsl{s}^{2}=-V^{2}(r)c^{2}dt^{2}+\vec{R}\!\cdot\!d\vec{x}cdt+W^{2}(r)d\vec{x}\!\cdot\!d\vec{x}, (22)

where

V(r)=1ϵ2Uc2,W(r)=1+ϵ2Uc2.V(r)=1-\epsilon^{2}\frac{U}{c^{2}},\qquad W(r)=1+\epsilon^{2}\frac{U}{c^{2}}. (23)

The Newtonian gravitational potential U=-U= GMQ(r,θ)/r-GMQ(r,\theta)/r GM/r\simeq-GM/r depends on the mass MM and the higher order multipoles. The frame-dragging term is

R=ϵ34Gc3J×xr3,\vec{R}=-\epsilon^{3}\frac{4G}{c^{3}}\frac{\vec{J}\times\vec{x}}{r^{3}}, (24)

where J\vec{J} is the angular momentum of the rotating body. Hence, we see that the gauge invariant polarization rotation is absent in the leading order post-Newtonian expansion and the Faraday phase at order ϵ2\epsilon^{2} is a reference frame effect. We again revert to the units G=c=1G=c=1. To obtain leading order contributions to the phase and polarization, photon trajectories only need to be expanded up to ϵ2\epsilon^{2}

k=nϵ22Mr(t)nϵ22Mdd2(x(t)nr(t)x0nr0),\vec{k}=\vec{n}-\epsilon^{2}\frac{2M}{r(t)}\vec{n}-\epsilon^{2}\frac{2M\vec{d}}{d^{2}}\left(\frac{\vec{x}(t)\!\cdot\!\vec{n}}{r(t)}-\frac{\vec{x}_{0}\!\cdot\!\vec{n}}{r_{0}}\right), (25)

where x(t)=x0+(tt0)n+𝒪(ϵ2)\vec{x}(t)=\vec{x}_{0}+(t-t_{0})\vec{n}+{\cal O}(\epsilon^{2}) and d=x0(x0n)n\vec{d}=\vec{x}_{0}-(\vec{x}_{0}\!\cdot\!\vec{n})\vec{n} is the vector joining the centre of the earth and the point of closest approach of the unperturbed ray. Here, n\vec{n} is the initial propagation direction, nn=1\vec{n}\cdot\vec{n}=1.

The leading order post-Newtonian metric is spherically symmetric. Note that the initial polarization that is perpendicular to the propagation plane remains perpendicular to it. So, we select the reference frame differently from that we have done in special relativistic calculations. Here we focus only on gravitational effects and treat them separately from the effects of rotation and relative motion. We take z=0z=0, the plane where the ray from GS to SC lies, set their velocities zero and consider the polarization vector

f=(0,0,1)=const.\vec{f}=(0,0,1)=\mathrm{const.} (26)
Refer to caption
Refer to caption
Refer to caption
Figure 2: Faraday phase ϖF\varpi_{\mathrm{F}} depends on the choice of the reference direction ζ(α,β)\vec{\zeta}(\alpha,\beta) and the initial propagation direction n(ϕ)\vec{n}(\phi). The angles are defined in Eqs. (31) and (32). (left) β=0\beta=0. (center) β=7π/16\beta=7\pi/16. (right) β=π\beta=\pi.

The gauge dependent Faraday phase results from the change in the definitions of standard polarization directions along the trajectory. The reference directions 𝜻^i\mbox{\boldmath$\hat{\zeta}$}_{i} (unit vector pointing to the distant star), i=i= GS, SC are obtained from the tangents to the rays from the fixed guide star that arrive to the GS and SC respectively. We assume the reference star to be infinitely far. Approximating the differences in reference directions as arising solely form the gravitational field of the earth,

ζi=l(1ϵ22Mri)ϵ22Mdidi2(xilri1),\vec{\zeta}_{i}=\vec{l}\left(1-\epsilon^{2}\frac{2M}{r_{i}}\right)-\epsilon^{2}\frac{2M\vec{d}_{i}}{d_{i}^{2}}\left(\frac{\vec{x}_{i}\!\cdot\!\vec{l}}{r_{i}}-1\right), (27)

where ll=1\vec{l}\!\cdot\!\vec{l}=1; l-\vec{l} is the flat spacetime direction from the infinitely distant star to the observers. Standard polarization vectors at GS and SC could be defined as

𝐛^2i=ζi×k|ζi×k|,𝐛^1i=𝐛^2i×𝐤^.\mathbf{\hat{b}}_{2i}=\frac{\vec{\zeta}_{i}\times\vec{k}}{|\vec{\zeta}_{i}\times\vec{k}|},\qquad\mathbf{\hat{b}}_{1i}=\mathbf{\hat{b}}_{2i}\times\mathbf{\hat{k}}. (28)

So, from Eq. (21), we get

Δχcosχ=𝐟^SC𝐛^2SC𝐟^GS𝐛^2GS\Delta\chi\cos\chi=\mathbf{\hat{f}}_{SC}\!\cdot\!\mathbf{\hat{b}}_{2SC}-\mathbf{\hat{f}}_{GS}\!\cdot\!\mathbf{\hat{b}}_{2GS} (29)

or,

Δχ=1𝐟^GS𝐛^1GS(𝐟^SC𝐛^2SC𝐟^GS𝐛^2GS).\Delta\chi=\frac{1}{\mathbf{\hat{f}}_{GS}\!\cdot\!\mathbf{\hat{b}}_{1GS}}(\mathbf{\hat{f}}_{SC}\!\cdot\!\mathbf{\hat{b}}_{2SC}-\mathbf{\hat{f}}_{GS}\!\cdot\!\mathbf{\hat{b}}_{2GS}). (30)

If we choose the x-axis to pass through GS, then

n=(cosϕ,sinϕ,0),12π<ϕ<12π,\displaystyle\vec{n}=(\cos\phi,\sin\phi,0),\qquad-\frac{1}{2}\pi<\phi<\frac{1}{2}\pi, (31)
l=(cosα,sinαsinβ,sinαcosβ),0α12π,\displaystyle\vec{l}=(\cos\alpha,\sin\alpha\sin\beta,\sin\alpha\cos\beta),\qquad 0\leq\alpha\leq\frac{1}{2}\pi, (32)

where the altitude of the guided star is π/2α\pi/2-\alpha. When the reference direction ζ\vec{\zeta} and the propagation direction k\vec{k} are collinear, then the standard polarization directions are undefined. If ζ\vec{\zeta} lies in the plane determined by GS, SC and centre of earth, then the Faraday phase is zero. Moreover, the post-Newtonian phase fails in the limit of α=0\alpha=0 or β=π/2\beta=\pi/2, 3π/23\pi/2. The plot of the Faraday phase for different reference and propagation direction is shown in Fig. 2.

4 Polarization-dependent trajectory

Taking into account the post-eikonal terms modifies the description of light propagation along the null rays that are geodesic trajectories of massless particles. These particles are still null, but their motion is no more geodesic. It is most conveniently described by using a specially designed null tetrad[17, 32]. It is introduced as follows.

Trajectory X(s)X(s) of a null particle with the tangent vector KK can be written as

dXμds=Kμ,KμKμ=0.\frac{dX^{\mu}}{ds}=K^{\mu},\qquad K^{\mu}K_{\mu}=0. (33)

This trajectory is typically is not a geodesic[23, 24], and the acceleration vector 𝗐\mathsf{w} is defined by

wμ:=KνK;νμ,Kμwμ=0.w^{\mu}\vcentcolon=K^{\nu}K^{\mu}_{;\nu},\qquad K^{\mu}w_{\mu}=0. (34)

The second tetrad vector 𝗇{\mathsf{n}}, 𝗇2=0{\mathsf{n}}^{2}=0 is chosen to satisfy

𝖪𝗇=1.{\mathsf{K}}\cdot{\mathsf{n}}=-1. (35)

A pair of complex conjugated null vectors, 𝗆{\mathsf{m}} and 𝗆¯=𝗆\bar{\mathsf{m}}={\mathsf{m}}^{*}, are built from two spacelike vectors and satisfy

𝗆𝗆¯=1.{\mathsf{m}}\cdot\bar{{\mathsf{m}}}=1. (36)

All other inner products vanish and the metric can be expressed as

gμν=K(μnν)+m(μm¯ν).\textsl{g}_{\mu\nu}=-K_{(\mu}n_{\nu)}+m_{(\mu}\bar{m}_{\nu)}. (37)

It is possible to reparameterize the null curves and rescale the vectors of the null tetrad while preserving orthonormality relations[17, 32],

𝖪A𝖪,𝗇A1𝗇,\displaystyle{\mathsf{K}}\to A{\mathsf{K}},\qquad{\mathsf{n}}\to A^{-1}{\mathsf{n}}, (38)
𝖪𝖪,𝗆𝗆+a𝖪,𝗆¯𝗆¯+a𝖪,𝗇𝗇+a𝗆+a𝗆¯+aa𝖪,\displaystyle{\mathsf{K}}\to{\mathsf{K}},\quad{\mathsf{m}}\to{\mathsf{m}}+a{\mathsf{K}},\quad\bar{{\mathsf{m}}}\to\bar{{\mathsf{m}}}+a^{*}{\mathsf{K}},\quad{\mathsf{n}}\to{\mathsf{n}}+a^{*}{\mathsf{m}}+a\bar{{\mathsf{m}}}+aa^{*}{\mathsf{K}}, (39)
𝗆eiϕ𝗆,𝗆eiϕ𝗆¯.\displaystyle{\mathsf{m}}\to e^{i\phi}{\mathsf{m}},\qquad{\mathsf{m}}\to e^{-i\phi}\bar{{\mathsf{m}}}. (40)

Using this freedom it is possible to choose the tetrad in such a way[24] that the vector 𝗇{\mathsf{n}} is parallel propagated along the ray,

𝖪𝗇=0,\nabla_{\mathsf{K}}{\mathsf{n}}=0, (41)

while

𝖪𝗆=κ𝗇,𝖪𝗆¯=κ𝗇,\nabla_{\mathsf{K}}{\mathsf{m}}=-\kappa{\mathsf{n}},\qquad\nabla_{\mathsf{K}}\bar{\mathsf{m}}=-\kappa^{*}{\mathsf{n}}, (42)

where

κ=𝗐𝗆=mμKνKμ;ν.\kappa=-\mathsf{w}\cdot{\mathsf{m}}=-m^{\mu}K^{\nu}K_{\mu;\nu}. (43)

With this choice of the null tetrad the polarization-dependent correction to the trajectory that takes into account the frequency ω\omega is finite, and given by[24]

D2XμDs2=KνK;νμ=iσωRναβμKνmαm¯β=:Fμ,\frac{D^{2}X^{\mu}}{Ds^{2}}=K^{\nu}K^{\mu}_{;\nu}=-\frac{i\sigma}{\omega}R^{\mu}_{~{}\nu\alpha\beta}K^{\nu}m^{\alpha}\bar{m}^{\beta}=\vcentcolon F^{\mu}, (44)

where RναβμR^{\mu}_{~{}\nu\alpha\beta} is the Riemann tensor. At this stage no assumptions about the spacetime were made. Effects of this term in the solar system are adequately described by the weak field expansion of the Schwarzschild metric. We will shortly see that in the leading order the polarization-dependent acceleration is of the order of ϵ2\epsilon^{2}. We also treat Eq. (44) as a correction to the null geodesic. Thus the uperturbed motion can be confined to the equatorial plane.

The trajectory is decomposed as

Xμ(s)=xμ(s)+αxσμ(s),X^{\mu}(s)=x^{\mu}(s)+\alpha x_{\sigma}^{\mu}(s), (45)

where xμ(s)x^{\mu}(s) is a null geodesic (that we assume can be affinely parameterized by ss), xσμ(s)x_{\sigma}^{\mu}(s) is the polarization-dependent correction, and the formal arbitrary infinitesimal parameter α\alpha is introduced to ease the perturbative manipulations. The tangent vector

Kμ=kμ+αΛμ,K^{\mu}=k^{\mu}+\alpha\Lambda^{\mu}, (46)

is null so,

kμΛμ=0.k^{\mu}\Lambda_{\mu}=0. (47)

For the weak gravity regime it is convenient use the usual post-Newtonian expansion (the standard choice of the coordinates (t,x,y,z)(t,x,y,z), but without switching to the coordinate time tt as an evolution parameter). On the dimensional grounds we expect as the famous light deflection[29, 25] is the ϵ2\epsilon^{2} effect (Δϕ=2rg/\Delta\phi=2r_{\textsl{g}}/\ell, where \ell is the impact parameter and rg=2Mr_{\textsl{g}}=2Mis the gravitational radius), the leading polarization-dependent effect is of the order α=ϵ2λ/\alpha=\epsilon^{2}\lambda/\ell, where λ\lambda is the characteristic wave length. Taking this into account and writing MM explicitly, we note that at the leading order

kμ=\displaystyle k^{\mu}= k(0)μ+Mk(1)μ,\displaystyle k_{(0)}^{\mu}+Mk_{(1)}^{\mu}, (48)
Λμ=\displaystyle\Lambda^{\mu}= Mqμ,\displaystyle Mq^{\mu}, (49)

where the components k(0)μk_{(0)}^{\mu} are constant. Taking into account that q;νμ=q,νμ+𝒪(M)q^{\mu}_{;\nu}=q^{\mu}_{,\nu}+\mathcal{O}(M), we have

KνK;νμ=Mk(0)νq,νμ+𝒪(M2).K^{\nu}K^{\mu}_{;\nu}=Mk_{(0)}^{\nu}q^{\mu}_{,\nu}+\mathcal{O}(M^{2}). (50)

It is convenient to describe the unperturbed null geodesic using the full Schwarzschild solution. The symmetry allows to restrict the motion to the equatorial plane. The tangent can be taken as

kμ=(1f(r¯),±12r¯2f(r¯),0,r¯2),k^{\mu}=\left(\frac{1}{f(\bar{r})},\pm\sqrt{1-\frac{\ell^{2}}{\bar{r}^{2}}f(\bar{r})},0,\frac{\ell}{\bar{r}^{2}}\right), (51)

where f(r¯)=12M/rf(\bar{r})=1-2M/r. Here r¯\bar{r} is the Schwarzschild radial coordinate (i.e. the circumferential radius). In the weak field regime it is related to the isotropic radial coordinate r=xxr=\sqrt{{\vec{x}}\cdot{\vec{x}}} as r¯r+M\bar{r}\approx r+M.

We used the freedom of rescaling the null vectors to set the energy of a fictitious photon to E=1E=1, so the reduced angular momentum :=L/E\ell\vcentcolon=L/E equals to the impact parameter[17]. The sign of krk^{r} depends on whether the photon approaches the origin (i.e. the Sun), or moves away from it. In the equatorial plane we have 𝗄𝗄𝗄=0{\mathsf{k}}\nabla_{\mathsf{k}}{\mathsf{k}}=0.

The second vector of the null tetrad

nμ=(r¯22(r¯22f(r¯)),r¯f(r¯)2r¯22f(r¯),0,f(r¯)2(r¯22f(r¯)),n^{\mu}=\left(\frac{\bar{r}^{2}}{2\big{(}\bar{r}^{2}-\ell^{2}f(\bar{r})\big{)}},\mp\frac{\bar{r}f(\bar{r})}{2\sqrt{\bar{r}^{2}-\ell^{2}f(\bar{r})}},0,\frac{\ell f(\bar{r})}{2\big{(}\bar{r}^{2}-\ell^{2}f(\bar{r})}\right), (52)

is chosen so that 𝗄𝗇=1{\mathsf{k}}\cdot{\mathsf{n}}=-1, and the complex-conjugate pair of null vectors is given by

mμ=12(ir¯22f(r¯),0,1r,ir¯22f(r¯)),m¯μ=mμ.m^{\mu}=\frac{1}{\sqrt{2}}\left(\frac{i\ell}{\sqrt{\bar{r}^{2}-\ell^{2}f(\bar{r})}},0,\frac{1}{r},\frac{i}{\sqrt{\bar{r}^{2}-\ell^{2}f(\bar{r})}}\right),\qquad\bar{m}^{\mu}=m^{\mu*}. (53)

These three vectors are to be adjusted to satisfy the conditions required for the validity of Eq. (44). This adjustment is quite straightforward, as in our setting

𝗄𝗇=iα(𝗆𝗆¯),\nabla_{\mathsf{k}}{\mathsf{n}}=i\alpha({\mathsf{m}}-\bar{{\mathsf{m}}}), (54)

and

𝗄𝗆=iα𝗄,𝗄𝗆¯=iα𝗄,\nabla_{\mathsf{k}}{\mathsf{m}}=i\alpha{\mathsf{k}},\qquad\nabla_{\mathsf{k}}\bar{{\mathsf{m}}}=-i\alpha{\mathsf{k}}, (55)

where

α=±(r3M)2r(r¯22f(r¯)),\alpha=\pm\frac{\ell(r-3M)}{\sqrt{2}r(\bar{r}^{2}-\ell^{2}f(\bar{r}))}, (56)

where the sign of α\alpha coincides with that of krk^{r}.

In the leading order approximation the trajectories are geodesic, and the parallel transport of 𝗄{\mathsf{k}} results in κ=0\kappa=0. In turn it enables the parallel transport of the tetrad if the function aa satisfies

a,r=iα/kr=i(r¯3M)2(r¯22f(r¯))3/2,a_{,r}=-i\alpha/k^{r}=\frac{i\ell(\bar{r}-3M)}{\sqrt{2}\big{(}\bar{r}^{2}-\ell^{2}f(\bar{r})\big{)}^{3/2}}, (57)

where the sign does not change on transition from decreasing to increasing distance.

Null geodesics in the Schwarzschild spacetime are classified [17] according to the behaviour of the three roots of the equation C(r¯)=r32(r¯2M)=0C(\bar{r})=r^{3}-\ell^{2}(\bar{r}-2M)=0. Only one root is relevant for the trajectories we study,

r¯0=M+𝒪(M2).\bar{r}_{0}=\ell-M+\mathcal{O}(M^{2}). (58)

Up to the terms of the third order in MM the roots are 2M2M and r±r¯0r\pm\bar{r}_{0}. Using this we find

a,r𝑑r=i(3rMr¯02)2(r¯2r¯02)r¯02+𝒪(M3),\int a_{,r}dr=\frac{i\ell(3rM-\bar{r}_{0}^{2})}{\sqrt{2(\bar{r}^{2}-\bar{r}_{0}^{2})}\bar{r}_{0}^{2}}+\mathcal{O}(M^{3}), (59)

that is sufficient to obtain the desired a(r¯)a(\bar{r}) at any point of the trajectory apart from r=r¯0r=\bar{r}_{0}.

The same expression is valid for both increasing and decreasing rr. For r>r¯0Mr>\ell\sim\bar{r}_{0}\gg M we can approximate it as

a=i2r¯223iM2(1r¯3(r¯22)3/2)+𝒪(M2).a=-\frac{i\ell}{\sqrt{2}\sqrt{\bar{r}^{2}-\ell^{2}}}-\frac{3iM}{\sqrt{2}\ell}\left(1-\frac{\bar{r}^{3}}{(\bar{r}^{2}-\ell^{2})^{3/2}}\right)+\mathcal{O}(M^{2}). (60)

It is convenient to express the acceleration FμF^{\mu} in terms of the original tetrad vectors as

Fμ=iσωRαβγμ(kαmβm¯γ+akαkβm¯γ+akαmβkγ).F^{\mu}=-\frac{i\sigma}{\omega}R^{\mu}_{~{}\alpha\beta\gamma}(k^{\alpha}m^{\beta}\bar{m}^{\gamma}+ak^{\alpha}k^{\beta}\bar{m}^{\gamma}+a^{*}k^{\alpha}m^{\beta}k^{\gamma}). (61)

Only one component is non-zero at the leading order:

Fθ=σω3Mr¯4(1r¯22f(r¯)+2a(r¯)r¯2).F^{\theta}=-\frac{\sigma}{\omega}\frac{3\ell M}{\bar{r}^{4}}\left(\frac{1}{\sqrt{\bar{r}^{2}-\ell^{2}f(\bar{r})}}+\frac{\sqrt{2}a(\bar{r})\ell}{\bar{r}^{2}}\right). (62)

Using again the decomposition of C(r¯)C(\bar{r}) it becomes

Fθσω3Mr¯4r¯2r¯02(1+2r¯2)=σω3Mr¯6r¯22r¯2r¯02,F^{\theta}\approx\frac{\sigma}{\omega}\frac{3\ell M}{\bar{r}^{4}\sqrt{\bar{r}^{2}-\bar{r}_{0}^{2}}}\left(-1+\frac{\ell^{2}}{\bar{r}^{2}}\right)=-\frac{\sigma}{\omega}\frac{3\ell M}{\bar{r}^{6}}\frac{{\bar{r}^{2}-\ell^{2}}}{\sqrt{\bar{r}^{2}-\bar{r}_{0}^{2}}}, (63)

We evaluate the correction xσ(s)x_{\sigma}(s) using the post-Newtonian formalism that was outlined above. It is important to note that to use only the leading order ϵ2\epsilon^{2} while using the order of α\alpha polarization correction to describe a particular trajectory is unjustified, as the next post-Newtonian correction for the geometric optics term is still higher. Using the setting that is described below ϵ2=rg/=4.2×106\epsilon^{2}=r_{\textsl{g}}/\ell=4.2\times 10^{-6}, while λ/=1.4×109\lambda/\ell=1.4\times 10^{-9} for λ=1\lambda=1m. However, as the exact general relativistic calculation within the geometric optics approximation leads to polarization-independent results, calculation of the polarization-dependent shift can be done using only the ϵ2\epsilon^{2} expansion.

To obtain an estimate of the effect we neglect the difference between r¯0\bar{r}_{0} and \ell, essentially regularizing the expressions near the closest approach radius. The difference between the isotropic radial coordinate that is used in the post-Newtonian analysis and the Schwarzschild radial coordinate is of the order of 2M2M\ll\ell and affects only the higher-order terms. The unperturbed motion is confined to the zz-plane. We can assume that k=(1,0,0)\vec{k}=(-1,0,0), that corresponds to >0\ell>0. In this approximation

Fz=+3σMωr5r22+𝒪(M2),F^{z}=+\frac{3\sigma\ell M}{\omega r^{5}}\sqrt{{r}^{2}-\ell^{2}}+\mathcal{O}(M^{2}), (64)

while all other components are zero at the leading order. Then the explicit equation for the leading polarization-dependent correction is

q,xz=3σωr5r22.q^{z}_{,x}=-\frac{3\sigma\ell}{\omega r^{5}}\sqrt{{r}^{2}-\ell^{2}}. (65)

We are interested in a light ray that comes from afar (“minus infinity”), grazes the sun and is detected at some finite (but large) distance r2r_{2}\gg\ell, the initial off-plane displacement and velocity are zero. Similarly to the calculations of corrections for the classical tests of general relativity[29, 25] we use the unperturbed (i.e. flat spacetime trajectory with r0=r_{0}=\ell

y=,x=t.y=\ell,\qquad x=-t. (66)

Then the correction δz=Mqz\delta z=Mq^{z} is obtained by an elementary integration. For the far field (rr\gg\ell) the trajectory approaches a straight line with

θ=2σωM2,\theta_{\infty}=\frac{2\sigma}{\omega}\frac{M}{\ell^{2}}, (67)

giving at the distance rr from the sun the transversal shift

Δz=4Mω2r.\Delta z=\frac{4M}{\omega\ell^{2}}r. (68)

This shift coincides with the result obtained using the polarization-dependent part of the angle of deflection Δϕ\Delta\phi between in-and outgoing polarized rays that was reported in Ref. 20. On the other hand, this approximation is too crude to identify a convergence of the initially divergent trajectories that was reported in Ref. 23, and a more delicate approximations will be investigated in the follow-up work.

5 Summary

Describing propagation of electromagnetic waves in vacuum in terms of rays that follow null geodesics is a very good approximation in the high-frequency regime. Within this approximation the polarization rotation in the Schwarzschild metric, and as a result in the leading post-Newtonian approximation, is a purely gauge effect. This phase will be present as a consequence of practical methods of setting up reference frames in the Earth-to-spacecraft communications. However, for the near-Earth experiments these effects are typically about 10510^{-5} weaker than the SR effects. Observable gravity-induced deviations from the geometric optics approximation are expected only in ultra-strong gravitational fields.

References