This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: Nigel T. Bishop 22institutetext: Department of Mathematics, Rhodes University, Grahamstown 6140, South Africa.
22email: [email protected]

Introduction to gravitational wave astronomy

Nigel T. Bishop corresponding author
Abstract

This chapter provides an overview of gravitational wave (GW) astronomy, providing background material that underpins the other, more specialized chapters in this handbook. It starts with a brief historical review of the development of GW astronomy, from Einstein’s prediction of GWs in 1916 to the first direct detection in 2015. It presents the theory of linearized perturbations about Minkowski spacetime of Einstein’s equations, and shows how gauge transformations reduce the problem to the standard wave equation with two degrees of freedom, or polarizations, h+,h×h_{+},h_{\times}. We derive the quadrupole formula, which relates the motion of matter in a source region to the far GW field. It is shown that GWs carry energy, as well as linear and angular momentum, away from a source. The GW field of an orbiting circular binary is found; and properties of the evolution of the binary including rate of inspiral and time to coalescence, are calculated. A brief review is given of existing and proposed GW detectors, and of how to estimate source parameters in LIGO or Virgo data of a GW event. The contributions that GW observations have already made to physics, astrophysics and cosmology are discussed.

Keywords

Gravitational waves; Quadrupole formula; Orbital inspiral; LIGO; LISA

1 Introduction

1.1 Historical development

Soon after proposing the general theory of relativity (GR), Einstein showed, by linearizing the field equations, that the theory implies the existence of gravitational waves (GWs) Einstein1916 ; Einstein1918 , and obtained what has become known as the quadrupole formula. However, the concept of a GW actually predates GR. Newtonian gravitation theory can be expressed as an elliptic equation with the gravitational field changing instantaneously if the source changes. Since special relativity had established the speed of light as a universal speed limit, this suggests that the gravitational theory should be expressed as a hyperbolic system, so implying the existence of GWs. During the early years, i.e. from about 1920 to 1960, there was uncertainty about the nature of GWs. It was not clear whether GWs carry energy since GW energy at a given event in spacetime cannot be defined; it was not clear whether the quadrupole formul could be applied to gravitationally bound systems; and it was argued that the full nonlinear theory of GR does not permit GW solutions. For a detailed discussion of these issues, see Kennefick:1997kb .

The mathematical theory of GWs became well-established in the 1960s. The above issues were resolved by expressing the Einstein equations of the full nonlinear theory of GR in a suitable coordinate system Bondi62 ; Sachs62 , and also by the “shortwave approximation” averaging procedure Isaacson68 . Once it was clear that the emission of GWs causes a loss of energy in the emitting system, the dynamics of an orbiting binary and the resulting inspiral could be calculated using linearized theory and the quadrupole approximation Peters:1963ux ; Peters:1964 .

In this period, there were also important developments in the experimental area. The theory of stellar evolution had long indicated that the end-state of a massive star would be a neutron star or a black hole. However, the first observation of a neutron star was of a pulsar in the Crab nebula in 1969, and the first evidence of a massive (15M\approx 15M_{\odot}) compact object that could only be a black hole was Cygnus X-1 in 1971. The confirmation of the existence of these compact objects was important: their inspiral and merger would be both powerful and in the frequency range 10\approx 10Hz to 1000\approx 1000Hz which would be suitable for a terrestrial detector. The first attempt to detect gravitational waves was reported in 1969 using a bar detector Weber:1969bz . This paper actually reported the detection of a number of GW events, although subsequently it became clear that these events were not astrophysical but rather the result of experimental error. In 1979, observations of the binary pulsar system PSR 1913+16 showed that the orbit was inspiralling at a rate consistent with GW emission in GR, thus providing the first experimental evidence for the existence of GWs Taylor79 .

The most powerful GW sources, and therefore those most likley to be detected, are the merger of two compact objects, but this case does not satisfy the conditions of the quadrupole approximation and there is a need to go beyond linearized theory. This can be achieved by means of a series expansion, adding terms of quadratic, cubic and higher orders to the linearized expressions. This approach was considered in the early years of GR Lorentz17 ; Lorentz37 , and in the 1980s formalized as the Post-Newtonian method. For the actual merger, a numerical simulation of the full Einstein equations is needed, normally with the spacetime foliated into a sequence of spacelike hypersurfaces  Arnowitt62 . In 1977, the GWs from a head-on collision of two Schwarzschild black holes were computed  Smarr77 , but it was only in 2005 that codes were able to make a stable evolution of the physically realistic problem of the inspiral and merger of two black holes  Pretorius:2005gq . Since then, many other groups have successfully evolved black hole spacetimes, as well as neutron star mergers and supernovae, often using a combination of Post-Newtonian and numerical methods.

The possibility of using laser interferometry to detect GWs was suggested in the 1960s, and simple prototypes were constructed at that time. In 1980, the US National Science Foundation provided funding for the construction of certain prototypes, as well as for a study of the technical issues and the costs of building an interferometer with arms several km in length. This eventually led to the construction of LIGO (Laser Interferometry Gravitational-Wave Observatory) facilities at Hanford and Livingstone, USA, with 4km arms; scientific studies commenced in 2002. Also at this time, the much smaller detector GEO600 (600m arms) started operation, and in 2007 Virgo in Italy, which has 3km arms and is a project of the European Gravitational Observatory consortium, made its first science run. These instruments underwent a number of upgrades to improve the sensitivity, and the first direct detection of GWs was made on 14 September 2015 by LIGO Hanford and Livingstone Abbott2016a .

1.2 Content of this chapter

This chapter provides an overview of GW astronomy, and describes aspects of the basic theory of GWs that form the foundation of the field. It provides the background material that underpins the specialized chapters in this handbook. The discussion is mainly within the context of the Einstein equations linearized about Minkowski spacetime. This theory is sufficiently straightforward that it can be presented within a single chapter, yet is also widely applicable. The magnitude of GWs is dimensionless, and those propagating past the Earth that have been detected can be characterized as being 𝒪(1022){\mathcal{O}}(10^{-22}); thus linearized theory certainly applies. The later phase of compact object inspiral is driven by GW emission; this process lasts millions or billions of years and, apart from the final seconds or minutes, is accurately described by linearized theory. Even though linearized theory does not provide an accurate waveform for the actual merger of two compact objects, it remains useful as it provides a simple guide in analytic form.

The theory of linearizing Einstein’s equations about Minkowski spacetime is developed in Sec. 2. A key issue is the use of gauge transformations (which, in this context, are coordinate transformations that are almost the identity transformation) to simplify the equations to a form that is manifestly the wave equation. The simplest solution to the wave equation is a plane wave, i.e. in Cartesian coordinates (t,x,y,z)(t,x,y,z) a function f(t,z)f(t,z) that satisfies the wave equation; this solution well describes GWs in the solar system produced by a source many Mpc away. We construct the plane wave solution, and make further gauge transformations to the Transverse Traceless, or TT, gauge. In this way, the 10 components of a symmetric tensor are reduced to 2 independent components or polarizations, usually denoted as h+,h×h_{+},h_{\times}. Finally, this section investigates the effect of a plane GW on a system of test particles. In the TT gauge, the coordinates of the particles do not change, but that does not mean that there is no movement: a coordinate independent quantity such as the proper distance between two of the particles does vary with time. It is also shown that the Riemann tensor has non-zero components, so the spacetime is not flat.

The generation of GWs by the motion of matter is discussed in Sec. 3, and the quadrupole formula is derived. As an example, the formula is applied to the astrophysically important problem of a binary comprising two masses in circular orbit around each other. The two polarization modes, h+,h×h_{+},h_{\times}, are evaluated with respect to spherical polar coordinates. We next discuss the energy, as well as the linear and angular momenta, carried away by GWs. These effects are beyond the scope of linearized theory, being quadratic in the perturbations, and the calculations are only outlined with much detail omitted. This section also summarizes methods that are used for situations where the quadrupole formula is inadequate: Post-Newtonian approximations, numerical relativity, the theory of quasinormal modes of a black hole, and Extreme Mass Ratio Inspirals (EMRIs).

Now that we know what GWs carry away from a system, conservation laws are used in Sec. 4 to find the evolution of an orbiting circular binary. The orbital diameter and wave period slowly decrease, and we find an expression for the time to coalescence. We also find a relation between the chirp mass, which is a function of the individual masses of the binary, and an expression involving the wave frequency and its rate of change; this means that the chirp mass is entirely determined by observational GW data. It is also shown that GW emission causes an eccentric orbit to circularize, so the focus on circular orbits is for reasons of physics rather than for mathematical convenience.

Sec. 5 provides an outline of various existing and planned GW detection facilities, including terrestrial laser interferometers, satellite systems, and pulsar timing. We indicate the frequency range in which a detector is sensitive, and some actual or expected astrophysical sources. Then, for the LIGO and Virgo network, we outline the process of determining basic source parameters of an observed GW event representing a compact object inspiral and merger. Some contributions that GW observations have already made to physics, cosmology and astrophysics are outlined.

The chapter ends with a Conclusion, Sec. 6.

2 Propagation of gravitational waves

2.1 Linearized Einstein equations in vacuum

The essential idea is to consider spacetimes that comprise small perturbations about Minkowski spacetime. More precisely, the metric tensor gαβg_{\alpha\beta} is written

gαβ=ηαβ+hαβ,g_{\alpha\beta}=\eta_{\alpha\beta}+h_{\alpha\beta}\,, (1)

where ηαβ=\eta_{\alpha\beta}= is the metric of Minkowski spacetime which in Cartesian (t,x,y,z)(t,x,y,z) coordinates is diag(1,1,1,1)(-1,1,1,1), and hαβh_{\alpha\beta} is a small perturbation. In the linearized approximation, terms of order 𝒪((hαβ)2){\mathcal{O}}\left(\left(h_{\alpha\beta}\right)^{2}\right) are neglected. The contravariant metric is written as gαβ=ηαβhαβg^{\alpha\beta}=\eta^{\alpha\beta}-h^{\alpha\beta}, and the identities δγα=gαβgβγ=ηαβηβγ\delta^{\alpha}_{\gamma}=g^{\alpha\beta}g_{\beta\gamma}=\eta^{\alpha\beta}\eta_{\beta\gamma} imply that

hαβ=ηαγηβδhγδ.h^{\alpha\beta}=\eta^{\alpha\gamma}\eta^{\beta\delta}h_{\gamma\delta}\,. (2)

Thus, the indices of quantities of order 𝒪(hαβ){\mathcal{O}}(h_{\alpha\beta}) are raised and lowered using the background metric ηαβ\eta_{\alpha\beta} rather than the full metric gαβg_{\alpha\beta}. The first step towards constructing the Einstein equations is to determine the metric connection

Γαβμ=12ημν(βhνα+αhνβνhαβ)=12(βhαμ+αhβμμhαβ).\Gamma^{\mu}_{\ \alpha\beta}=\frac{1}{2}\eta^{\mu\nu}(\partial_{\beta}h_{\nu\alpha}+\partial_{\alpha}h_{\nu\beta}-\partial_{\nu}h_{\alpha\beta})=\frac{1}{2}(\partial_{\beta}h^{\mu}_{\ \ \alpha}+\partial_{\alpha}h^{\mu}_{\ \ \beta}-\partial^{\mu}h_{\alpha\beta})\,. (3)

Then the Ricci tensor is

Rμν=αΓμνανΓμαα=12(ανhμα+αμhναααhμνμνh),R_{\mu\nu}=\partial_{\alpha}\Gamma^{\alpha}_{\ \mu\nu}-\partial_{\nu}\Gamma^{\alpha}_{\ \mu\alpha}=\frac{1}{2}(\partial_{\alpha}\partial_{\nu}h^{\ \;\alpha}_{\mu}+\partial_{\alpha}\partial_{\mu}h^{\ \;\alpha}_{\nu}-\partial_{\alpha}\partial^{\alpha}h_{\mu\nu}-\partial_{\mu}\partial_{\nu}h)\,, (4)

where h=hαα=ηαβhαβh=h^{\alpha}_{\alpha}=\eta^{\alpha\beta}h_{\alpha\beta}. The Ricci scalar is thus

R=Rμμ=αβhαβααh,R=R^{\mu}_{\mu}=\partial_{\alpha}\partial_{\beta}h^{\alpha\beta}-\partial_{\alpha}\partial^{\alpha}h\,, (5)

and therefore Einstein’s equations Gμν=RμνRgμν/2=8πTμνG_{\mu\nu}=R_{\mu\nu}-Rg_{\mu\nu}/2=8\pi T_{\mu\nu} are

12\displaystyle\frac{1}{2} (ανhμα+αμhναααhμνμνhημν(αβhαβααh))\displaystyle\left(\partial^{\alpha}\partial_{\nu}h_{\mu\alpha}+\partial^{\alpha}\partial_{\mu}h_{\nu\alpha}-\partial_{\alpha}\partial^{\alpha}h_{\mu\nu}-\partial_{\mu}\partial_{\nu}h-\eta_{\mu\nu}(\partial^{\alpha}\partial^{\beta}h_{\alpha\beta}-\partial^{\alpha}\partial_{\alpha}h)\right)
=8πTμν.\displaystyle=8\pi T_{\mu\nu}\,. (6)

Eq. (2.1) may be somewhat simplified by changing the metric perturbation hαβh_{\alpha\beta} to its trace reversed form, that is

h¯αβ=hαβ12ηαβh so that h¯=h and hαβ=h¯αβ12ηαβh¯.\bar{h}_{\alpha\beta}=h_{\alpha\beta}-\frac{1}{2}\eta_{\alpha\beta}h\mbox{ so that }\bar{h}=-h\mbox{ and }{h}_{\alpha\beta}=\bar{h}_{\alpha\beta}-\frac{1}{2}\eta_{\alpha\beta}\bar{h}\,. (7)

Einstein’s equations are then

Gμν=12(ααh¯μν+ημναβh¯αβναh¯μαμαh¯να)=8πTμν.G_{\mu\nu}=-\frac{1}{2}\left(\partial_{\alpha}\partial^{\alpha}\bar{h}_{\mu\nu}+\eta_{\mu\nu}\partial^{\alpha}\partial^{\beta}\bar{h}_{\alpha\beta}-\partial_{\nu}\partial^{\alpha}\bar{h}_{\mu\alpha}-\partial_{\mu}\partial^{\alpha}\bar{h}_{\nu\alpha}\right)=8\pi T_{\mu\nu}. (8)

The first term ααh¯μν\partial_{\alpha}\partial^{\alpha}\bar{h}_{\mu\nu} is the wave operator applied to h¯μν\bar{h}_{\mu\nu}, and the problem would reduce to a standard wave equation if the other terms could be made to disappear. This can be achieved on applying the Lorentz gauge condition

αh¯αβ=0,\partial^{\alpha}\bar{h}_{\alpha\beta}=0\,, (9)

in Eq. (8) to obtain

12h¯μν=8πTμν,-\frac{1}{2}\Box\bar{h}_{\mu\nu}=8\pi T_{\mu\nu}\,, (10)

where the operator =αα\Box=\partial_{\alpha}\partial^{\alpha}, and clearly in vacuum h¯μν=0\Box\bar{h}_{\mu\nu}=0. Imposition of the Lorentz gauge condition is a constraint on the coordinates being used, and is not a restriction on the geometry of the spacetime. This matter is discussed further in the next section.

2.2 Gauge transformations

A gauge transformation is a coordinate transformation of the form

x(NEW)α=x(OLD)α+ξα(x(OLD)β)x^{\mbox{{\tiny(NEW)}}\alpha}=x^{\mbox{{\tiny(OLD)}}\alpha}+\xi^{\alpha}(x^{\mbox{{\tiny(OLD)}}\beta}) (11)

where βξα\partial_{\beta}\xi^{\alpha} is of the same order of smallness as hαβh_{\alpha\beta}; terms of order 𝒪(hαβ)2{\mathcal{O}}(h_{\alpha\beta})^{2}, 𝒪(hαββξα){\mathcal{O}}(h_{\alpha\beta}\partial_{\beta}\xi^{\alpha}), 𝒪(βξα)2{\mathcal{O}}(\partial_{\beta}\xi^{\alpha})^{2} are neglected. Applying the coordinate transformation (11) to the metric (1) gives

hαβ(NEW)=hαβ(OLD)βξααξβ,h^{\mbox{{\tiny(NEW)}}}_{\alpha\beta}=h^{\mbox{{\tiny(OLD)}}}_{\alpha\beta}-\partial_{\beta}\xi_{\alpha}-\partial_{\alpha}\xi_{\beta}\,, (12)

and then the change to trace-reversed form leads to

h¯αβ(NEW)=h¯αβ(OLD)βξααξβ+ηαβμξμ.\bar{h}^{\mbox{{\tiny(NEW)}}}_{\alpha\beta}=\bar{h}^{\mbox{{\tiny(OLD)}}}_{\alpha\beta}-\partial_{\beta}\xi_{\alpha}-\partial_{\alpha}\xi_{\beta}+\eta_{\alpha\beta}\partial_{\mu}\xi^{\mu}\,. (13)

Requiring h¯αβ(NEW)\bar{h}^{\mbox{{\tiny(NEW)}}}_{\alpha\beta} to satisfy the Lorentz gauge condition then implies

ξβ=αh¯αβ(OLD).\Box\xi_{\beta}=\partial^{\alpha}\bar{h}^{\mbox{{\tiny(OLD)}}}_{\alpha\beta}\,. (14)

The right hand side of Eq. (14) is regarded as given, and then Eq. (14) comprises separate wave equations for each of the four unknowns ξ0,,ξ3\xi_{0},\cdots,\xi_{3}. The existence of solutions to these four wave equations follows from the theory of partial differential equations. Note however that the solutions are not unique, and we are free to write

ξα=ξ(0)α+ζα,\xi^{\alpha}=\xi^{(0)\alpha}+\zeta^{\alpha}\,, (15)

where ξ(0)α\xi^{(0)\alpha} is a solution to Eq. (14), and where ζα\zeta^{\alpha} satisfies ζα=0\Box\zeta^{\alpha}=0.

The possibility to make gauge transformations means that a metric that appears to be wavelike may not represent a GW. For example, suppose that the spacetime is Minkowski (and so does not contain gravitational waves) with h¯αβ(OLD)=0\bar{h}^{\mbox{{\tiny(OLD)}}}_{\alpha\beta}=0, and that ξα=(0,0,0,ϵcos(tx))\xi_{\alpha}=(0,0,0,\epsilon\cos(t-x)). Then

h¯αβ=(000ϵsin(tx)000ϵsin(tx)0000ϵsin(tx)ϵsin(tx)00).\bar{h}_{\alpha\beta}=\left(\begin{array}[]{cccc}0&0&0&\epsilon\sin(t-x)\\ 0&0&0&-\epsilon\sin(t-x)\\ 0&0&0&0\\ \epsilon\sin(t-x)&-\epsilon\sin(t-x)&0&0\end{array}\right)\,. (16)

A simple test to determine whether or not a vacuum spacetime is Minkowski in unusual coordinates, is to evaluate the Riemann tensor: Rβγδα=0R^{\alpha}_{\phantom{\alpha}\beta\gamma\delta}=0 if and only if the spacetime is Minkowskian.

2.3 Plane wave solutions

Let us now make the ansatz

h¯αβ=(Aαβexp(ikαxα)),\bar{h}^{\alpha\beta}=\Re\left(A^{\alpha\beta}\exp(ik_{\alpha}x^{\alpha})\right)\,, (17)

where the kαk_{\alpha} are real constants and the AαβA_{\alpha\beta} are complex constants; note that a general wave may be expressed as a Fourier sum of terms with the above form. Substituting into the vacuum field equations h¯αβ=0\Box\bar{h}_{\alpha\beta}=0 leads to ηαβkαkβ=0\eta^{\alpha\beta}k_{\alpha}k_{\beta}=0, so that kαk_{\alpha} is a null vector, and usually k0k_{0} is written as ω\omega. Since AαβA^{\alpha\beta} is symmetric, it has 10 independent components. Now, the Lorentz gauge condition βh¯αβ=0\partial^{\beta}\bar{h}_{\alpha\beta}=0 implies 4 conditions

Aαβkβ=0,A^{\alpha\beta}k_{\beta}=0\,, (18)

reducing the number of independent components of AαβA^{\alpha\beta} from 10 to 6. Then a gauge transformation can be made where

ξα=(Bαexp(ikμxμ))\xi^{\alpha}=\Re(B^{\alpha}\exp(ik_{\mu}x^{\mu})) (19)

with BαB^{\alpha} constant; since ξα\xi^{\alpha} satisfies the identity ξα=0\Box\xi^{\alpha}=0, the condition βh¯αβ=0\partial_{\beta}\bar{h}^{\alpha\beta}=0 is not affected. The 4 constants BαB^{\alpha} are used to reduce the independent components of AαβA^{\alpha\beta} from 6 to 2.

In order to apply the above and construct in a transparent way an explicit form of AαβA^{\alpha\beta} with only two independent components, we align the coordinates with the direction of propagation of the wave. Let the wave be propagating in the zz-direction with frequency ω/(2π)\omega/(2\pi) so that kα=(ω,0,0,ω)k_{\alpha}=(\omega,0,0,-\omega). Then the conditions (18) imply that Aα0=Aα3A^{\alpha 0}=A^{\alpha 3}, so we can write

Aαβ=(A00A01A02A00A01A11A12A01A02A12A22A02A00A01A02A00),A^{\alpha\beta}=\left(\begin{array}[]{cccc}A^{00}&A^{01}&A^{02}&A^{00}\\ A^{01}&A^{11}&A^{12}&A^{01}\\ A^{02}&A^{12}&A^{22}&A^{02}\\ A^{00}&A^{01}&A^{02}&A^{00}\end{array}\right)\,, (20)

which has six independent components A00,A01,A02,A11,A12,A22A^{00},A^{01},A^{02},A^{11},A^{12},A^{22}. Differentiating the gauge transformation (19) gives αζβ=(iBαkβexp(ikμxμ))\partial^{\alpha}\zeta^{\beta}=\Re(iB^{\alpha}k^{\beta}\exp(ik_{\mu}x^{\mu})), so that from Eq. (13),

h¯αβ=((AαβiBαkβiBβkα+ηαβiBμkμ)exp(ikμxμ)).\bar{h}^{\prime\alpha\beta}=\Re\left((A^{\alpha\beta}-iB^{\alpha}k^{\beta}-iB^{\beta}k^{\alpha}+\eta^{\alpha\beta}iB^{\mu}k_{\mu})\exp(ik_{\mu}x^{\mu})\right)\,. (21)

Now, Aαβ,Bα,(kαkα),(xαxα)A^{\alpha\beta},B^{\alpha},(k^{\prime\alpha}-k^{\alpha}),(x^{\prime\alpha}-x^{\alpha}) are small quantities, and so the product of any pair is negligible. Thus we can write h¯αβ=((Aαβexp(ikμxμ))\bar{h}^{\prime\alpha\beta}=\Re\left((A^{\prime\alpha\beta}\exp(ik_{\mu}x^{\mu})\right), and then

Aαβ=AαβiBαkβiBβkα+ηαβiBμkμ.A^{\prime\alpha\beta}=A^{\alpha\beta}-iB^{\alpha}k^{\beta}-iB^{\beta}k^{\alpha}+\eta^{\alpha\beta}iB^{\mu}k_{\mu}\,. (22)
A00\displaystyle A^{\prime 00} =A00+iω(B0+B3),A01=A01+iωB1,A02=A02+iωB2\displaystyle=A^{00}+i\omega(B^{0}+B^{3})\,,\qquad A^{\prime 01}=A^{01}+i\omega B^{1}\,,\qquad A^{\prime 02}=A^{02}+i\omega B^{2}
A11\displaystyle A^{\prime 11} =A11+iω(B3B0),A22=A22+iω(B3B0),A12=A12.\displaystyle=A^{11}+i\omega(B^{3}-B^{0})\,,\qquad A^{\prime 22}=A^{22}+i\omega(B^{3}-B^{0})\,,\qquad A^{\prime 12}=A^{12}\,. (23)

The constants BαB^{\alpha} may be chosen so as to simplify AαβA^{\prime\alpha\beta} in any desired way, and common practice is to use them to set

A00=A01=A03=0,A22=A11,A^{\prime 00}=A^{\prime 01}=A^{\prime 03}=0\,,\;\;A^{\prime 22}=-A^{\prime 11}\,, (24)

so that AαβA^{\prime\alpha\beta} is

Aαβ=(00000A11A1200A12A1100000).A^{\prime\alpha\beta}=\left(\begin{array}[]{cccc}0&0&0&0\\ 0&A^{\prime 11}&A^{\prime 12}&0\\ 0&A^{\prime 12}&-A^{\prime 11}&0\\ 0&0&0&0\end{array}\right)\,. (25)

2.4 The TT gauge

The resulting metric h¯αβ\bar{h}_{\alpha\beta} is transverse: the wave travels in the zz-direction, and its effects are in the xx- and yy-directions. More generally, the transverse property is expressed as

h¯αβkα=0.\bar{h}^{\prime}_{\alpha\beta}k^{\alpha}=0\,. (26)

Further the metric is traceless, i.e.

h¯αα=h¯=0,\bar{h}^{\prime\alpha}_{\alpha}=\bar{h}^{\prime}=0\,, (27)

so that the metric and its trace-reverse form are the same, h¯αβ=hαβ\bar{h}^{\prime}_{\alpha\beta}=h^{\prime}_{\alpha\beta}. A metric satisfying Eqs. (26) and (27) is said to be in the Transverse Traceless or TT gauge. From now on, we will drop the to denote a quantity in this gauge, and in circumstances where there is a need to differentiate the gauge of quantities, we will use the superfix TT to denote a quantity in the TT gauge.

The GW derived above has two independent components h11,h12h^{11},h^{12}. However, there is no essential difference between the components, as can be seen by making a coordinate transformation which, geometrically, corresponds to a rotation of the (x,y)(x,y) axes about the zz-axis through an angle θ=π/4\theta=\pi/4. More precisely, (t,x,y,z)((t,x,y,z)(t,x,y,z)\rightarrow((t^{\prime},x^{\prime},y^{\prime},z^{\prime}) where

x=xcosθ+ysinθ,y=xsinθycosθ,x^{\prime}=x\cos\theta+y\sin\theta\,,\qquad y^{\prime}=x\sin\theta-y\cos\theta\,, (28)

and here cosθ=sinθ=2/2\cos\theta=\sin\theta=\sqrt{2}/2. Application to the metric gαβ=ηαβhαβg^{\alpha\beta}=\eta^{\alpha\beta}-h^{\alpha\beta} leads to

h11=h22=h12,h12=h21=h11.h^{\prime 11}=-h^{\prime 22}=h^{12}\,,\qquad h^{\prime 12}=h^{\prime 21}=-h^{11}\,. (29)

The two independent components of a GW, h11,h12h^{11},h^{12}, are usually denoted by h+,h×h_{+},h_{\times} respectively and are called the two polarization states of the GW. There is an analogy to electromagnetic waves which are also transverse to the direction of propagation and which also have polarization states, although in this case the angle between the two states is π/2\pi/2 rather than π/4\pi/4. As shown above, the distinction between h+,h×h_{+},h_{\times} is observer dependent, but in some circumstances the polarization can be described in an observer-independent way. We write

h+=(A+exp(iω(tz))),h×=(A×exp(iω(tz))).h_{+}=\Re(A_{+}\exp\left(i\omega(t-z))\right)\,,\;\;h_{\times}=\Re(A_{\times}\exp\left(i\omega(t-z))\right)\,. (30)

Now, A+,A×A_{+},A_{\times} are complex constants (i.e., independent of the coordinates xαx^{\alpha}), and consider the ratio A×/A+=RAA_{\times}/A_{+}=R_{A} which must also be constant. There are two interesting special cases: (1) Suppose that the ratio RAR_{A} has zero imaginary part, then h×/h+h_{\times}/h_{+} is constant, and so we may change coordinates by rotating the (x,y)(x,y) axes about the zz-axis through some angle θ\theta to achieve h×=0h_{\times}=0; in this case the GW is said to be linearly polarised. (2) Suppose that RA=±iR_{A}=\pm i, then from the identity cos2s+sin2s=1\cos^{2}s+\sin^{2}s=1, it follows that the magnitude of the GW h+2+h×2\sqrt{h_{+}^{2}+h_{\times}^{2}} is constant, and the wave is said to be circularly polarised. The physical properties leading to these names are discussed in the next sub-section.

2.5 The effect of a gravitational wave on a system of test particles

We now consider the physical effects of a GW in the TT gauge as described in the previous sub-section. Since hα0=0h_{\alpha 0}=0, it follows from Eq. (3) that Γ00α=0\Gamma^{\alpha}_{00}=0. The geodesic equation is

d2xαdτ2=Γμναdxμdτdxνdτ,\frac{d^{2}x^{\alpha}}{d\tau^{2}}=-\Gamma^{\alpha}_{\mu\nu}\frac{dx^{\mu}}{d\tau}\frac{dx^{\nu}}{d\tau}\,, (31)

where τ\tau is the proper time; if a particle is initially at rest, dxμ/dτ=(1,0,0,0){dx^{\mu}}/{d\tau}=(1,0,0,0) so that d2xα/dτ2=0{d^{2}x^{\alpha}/}{d\tau^{2}}=0 and the particle remains at rest. But this is just a consequence of the coordinates being used, and does not mean that the GW has no physical effect. In order to analyze this further, we use the equation of geodesic deviation. Consider two nearby particles, AA and BB, each with 4-velocity UαU^{\alpha}, and let cαc^{\alpha} be a vector connecting AA and BB and orthogonal to UαU^{\alpha}; then

d2cαdτ2+ΓβγαUβUγ=RβγδαUβUδcγ.\frac{d^{2}c^{\alpha}}{d\tau^{2}}+\Gamma^{\alpha}_{\beta\gamma}U^{\beta}U^{\gamma}=-R^{\alpha}_{\phantom{\alpha}\beta\gamma\delta}U^{\beta}U^{\delta}c^{\gamma}\,. (32)

Now suppose that we use proper coordinates with AA located at the origin; then Uα=(1,0,0,0)U^{\alpha}=(1,0,0,0), cα=(0,ci)=(0,xBi)c^{\alpha}=(0,c^{i})=(0,x_{B}^{i}) (with i=1,2,3i=1,2,3 denoting a spatial index), the metric is

ds2=(ηαβ+𝒪(|xi|2))dxαdxβ,ds^{2}=\left(\eta_{\alpha\beta}+{\mathcal{O}}(|x^{i}|^{2})\right)dx^{\alpha}dx^{\beta}\,, (33)

Γβγα=0\Gamma^{\alpha}_{\beta\gamma}=0, τ=t\tau=t, and Eq. (32) simplifies to

d2xBidt2=R0j0ixBj.\frac{d^{2}x_{B}^{i}}{dt^{2}}=-R^{i}_{\phantom{i}0j0}x_{B}^{j}\,. (34)

It is important to note that RβγδαR^{\alpha}_{\phantom{\alpha}\beta\gamma\delta} is gauge-invariant, that is if we make a gauge transformation given by Eq. (11), then in the new gauge Rβγδ(NEW)α=Rβγδ(OLD)αR^{\mbox{{\tiny(NEW)}}\alpha}_{\phantom{\mbox{{\tiny(NEW)}}\alpha}\beta\gamma\delta}=R^{\mbox{{\tiny(OLD)}}\alpha}_{\phantom{\mbox{{\tiny(OLD)}}\alpha}\beta\gamma\delta}. This result follows since Rβγδα=𝒪(hαβ)R^{\alpha}_{\phantom{\alpha}\beta\gamma\delta}={\mathcal{O}}(h_{\alpha\beta}), and the coordinate transformation matrix is

x(NEW)αx(OLD)β=δβα+𝒪(hαβ),\frac{\partial x^{\mbox{{\tiny(NEW)}}\alpha}}{\partial x^{\mbox{{\tiny(OLD)}}\beta}}=\delta^{\alpha}_{\beta}+{\mathcal{O}}(h_{\alpha\beta})\,, (35)

so to 𝒪(hαβ){\mathcal{O}}(h_{\alpha\beta}) only the δβα\delta^{\alpha}_{\beta} is used in the coordinate transformation. Thus we may use the TT gauge to evaluate the Riemann tensor in Eq. (34), and we find

R0101=R0202=12t2h+,R0201=R0102=12t2h×,R^{1}_{\phantom{1}010}=-R^{2}_{\phantom{2}020}=-\frac{1}{2}\partial^{2}_{t}h_{+}\,,\;\;R^{1}_{\phantom{1}020}=R^{2}_{\phantom{2}010}=-\frac{1}{2}\partial^{2}_{t}h_{\times}\,, (36)

with the remaining components of the form R0j0iR^{i}_{\phantom{i}0j0} being zero. Then Eq. (34) simplifies to

d2xBidt2=(12t2h+xB1+12t2h×xB2,12t2h×xB112t2h+xB2,0).\frac{d^{2}x_{B}^{i}}{dt^{2}}=\left(\frac{1}{2}\partial^{2}_{t}h_{+}x_{B}^{1}+\frac{1}{2}\partial^{2}_{t}h_{\times}x_{B}^{2},\frac{1}{2}\partial^{2}_{t}h_{\times}x_{B}^{1}-\frac{1}{2}\partial^{2}_{t}h_{+}x_{B}^{2},0\right)\,. (37)

We now write xBi(t)=xB0i+ϵi(t)x^{i}_{B}(t)=x^{i}_{B0}+\epsilon^{i}(t) with ϵi\epsilon^{i} of order 𝒪(h){\mathcal{O}}(h), and replace xBix^{i}_{B} by xB0ix^{i}_{B0} on the right hand side of Eq. (37) (as terms of order 𝒪(h2){\mathcal{O}}(h^{2}) are ignorable) so that Eq. (37) can be integrated to give

ϵi=(12h+xB01+12h×xB02,12h×xB0112h+xB02,0).\epsilon^{i}=\left(\frac{1}{2}h_{+}x_{B0}^{1}+\frac{1}{2}h_{\times}x_{B0}^{2},\frac{1}{2}h_{\times}x_{B0}^{1}-\frac{1}{2}h_{+}x_{B0}^{2},0\right)\,. (38)

In order to facilitate the interpretation of Eq. (38), let us suppose that the (x1,x2)(x^{1},x^{2}) axes have been rotated so that xB02=0x^{2}_{B0}=0, use Eq. (30), and suppose that the origin of the spacetime coordinates is such that A+A_{+} is real. Then in the case of a linearly polarised wave

ϵi=12xB01A+cosωt(1,RA,0),\epsilon^{i}=\frac{1}{2}x_{B0}^{1}A_{+}\cos\omega t\left(1,R_{A},0\right)\,, (39)

so that BB moves in a straight line; and in the case of a circularly polarized wave

ϵi=12xB01A+(cosωt,sinωt,0),\epsilon^{i}=\frac{1}{2}x_{B0}^{1}A_{+}\left(\cos\omega t,\sin\omega t,0\right)\,, (40)

so that BB moves in a circle. The effect of a GW, for both linear and circular polarization states is illustrated in Fig. 1.

Refer to caption
Figure 1: The effect on a circle of particles transverse to a GW. Upper row: circular polarization, Lower row: linear polarization. In the horizontal axis, TT is the period of the GW. The particles are in their unperturbed state at T/4T/4 and 3T/43T/4 in the lower row. In the circular polarization case it may appear that the whole ellipse is rotating, but actually particles are moved a little from the unperturbed state to give this impression; in the same way that a wave moving across the ocean is caused by water moving up and down, not horizontally.

The preceeding analysis brings out two important general points. Firstly, since Rβγδα0R^{\alpha}_{\phantom{\alpha}\beta\gamma\delta}\neq 0, the spacetime is not Minkoskian. Secondly, suppose that the particles AA and BB are massive and are connected by a damping system; then the relative motion and acceleration will mean that work is done on the damping system and therefore its temperature must increase. These considerations show that GWs are real physical effects that contain energy that must have originated in some source and which can, in principle, be extracted from the GWs.

3 Generation of gravitational waves

3.1 Linearized Einstein equations with matter: the quadrupole formula

From standard wave equation theory, the solution to Eq. (10) representing an outgoing wave is

h¯μν(t,xi)=4Tμν(t|xixi|,xj)|xixi|d3x,\bar{h}^{\mu\nu}(t,x^{i})=4\int\frac{T^{\mu\nu}(t-|x^{i}-x^{\prime i}|,x^{\prime j})}{|x^{i}-x^{\prime i}|}d^{3}x^{\prime}\,, (41)

where (t,xi)(t,x^{i}) is the event at which the metric perturbation h¯μν\bar{h}^{\mu\nu} is evaluated, xix^{\prime i} represents a spatial point inside the source, and |xixi||x^{i}-x^{\prime i}| is the Euclidean distance between xix^{i} and xix^{\prime i}. Suppose now that the source is matter localized near the origin in a region r=|xi|<r0r=|x^{i}|<r_{0}, and we are evaluating the metric perturbations where r=𝒪(rE)r={\mathcal{O}}(r_{E}) with r0rEr_{0}\ll r_{E}; then Eq. (41) simplifies to

h¯μν(t,xi)=4rTμν(tr,xj)d3x.\bar{h}^{\mu\nu}(t,x^{i})=\frac{4}{r}\int T^{\mu\nu}(t-r,x^{\prime j})d^{3}x^{\prime}\,. (42)

The discussion in Sec. 2.3 showed that the 10 components of h¯μν\bar{h}^{\mu\nu} have only 6 independent components, so it is sufficient to determine the spatial components h¯ij\bar{h}^{ij}. In the linearized approximation, the conservation condition αT0α=0\nabla_{\alpha}T^{0\alpha}=0 simplifies to tT00+iT0i=0\partial_{t}T^{00}+\partial_{i}T^{0i}=0, and applying this relation twice gives

ijTij=t2T00.\partial_{i}\partial_{j}T^{ij}=\partial^{2}_{t}T^{00}\,. (43)

Using the above together with the rules of basic calculus gives

t2(T00xixj)=\displaystyle\partial^{2}_{t}(T^{00}x^{i}x^{j})= (kTk)xixj\displaystyle(\partial_{k}\partial_{\ell}T^{k\ell})x^{i}x^{j}
=\displaystyle= k(Tkxixj)2k(Tkixj+Tkjxi)+2Tij,\displaystyle\partial_{k}\partial_{\ell}(T^{k\ell}x^{i}x^{j})-2\partial_{k}(T^{ki}x^{j}+T^{kj}x^{i})+2T^{ij}\,, (44)

from which it follows that

Tijd3x=12t2T00xixjd3x,\int T^{ij}d^{3}x=\frac{1}{2}\partial^{2}_{t}\int T^{00}x^{i}x^{j}d^{3}x\,, (45)

since the remaining terms k[(Tkxixj)2(Tkixj+Tkjxi)]\partial_{k}\left[\partial_{\ell}(T^{k\ell}x^{i}x^{j})-2(T^{ki}x^{j}+T^{kj}x^{i})\right] can be transformed by the divergence theorem into a surface integral over the boundary of the volume of integration where Tαβ=0T^{\alpha\beta}=0.

As well as the gravitational field being weak with hμν1h_{\mu\nu}\ll 1, we now assume that throughout the source region the relative speed of the matter flow is much less than that of light. Then T00T^{00} can be replaced by the matter density ρ\rho so that Eq. (41) becomes

h¯ij(t,xi)=2rd2Iij(tr)dt2,\bar{h}^{ij}(t,x^{i})=\frac{2}{r}\frac{d^{2}I^{ij}(t-r)}{dt^{2}}\,, (46)

where the 3-dimensional tensor

Iij(t)=ρ(t,xk)xixjd3xI^{ij}(t)=\int\rho(t,x^{\prime k})x^{\prime i}x^{\prime j}d^{3}x (47)

is the second moment, also called the quadrupole moment, of the mass distribution. In the case that the matter distribution is treated as a system of NN particles,

Iij(t)=a=1NMaxaixaj.I^{ij}(t)=\sum_{a=1}^{N}M_{a}x^{\prime i}_{a}x^{\prime j}_{a}\,. (48)

Example: Equal mass circular binary

Consider two particles, AA and BB, each of mass MM in circular orbit around each other with orbital diameter r0r_{0}, and suppose that the angular velocity of each particle is ω\omega. Choosing coordinates so that the orbit is in the xyx-y plane, the position vector of particle AA is

xAi=(r02cosωt,r02sinωt,0),x^{i}_{A}=\left(\frac{r_{0}}{2}\cos\omega t,\frac{r_{0}}{2}\sin\omega t,0\right)\,, (49)

and for particle BB, xBi=xAix_{B}^{i}=-x^{i}_{A}. Then

Iij=Mr022(cos2ωtcosωtsinωt0cosωtsinωtsin2ωt0000).I^{ij}=\frac{Mr_{0}^{2}}{2}\left(\begin{array}[]{ccc}\cos^{2}\omega t&\cos\omega t\sin\omega t&0\\ \cos\omega t\sin\omega t&\sin^{2}\omega t&0\\ 0&0&0\end{array}\right)\,. (50)

Then using Eq. (46), it follows that

h¯ij=2Mr02ω2r(cos2ω(tr)sin2ω(tr)0sin2ω(tr)cos2ω(tr)0000).\bar{h}^{ij}=-\frac{2Mr_{0}^{2}\omega^{2}}{r}\left(\begin{array}[]{ccc}\cos 2\omega(t-r)&\sin 2\omega(t-r)&0\\ \sin 2\omega(t-r)&-\cos 2\omega(t-r)&0\\ 0&0&0\end{array}\right)\,. (51)

For an observer on the zz-axis, Eq. (51) is already in the TT gauge, but for any other observer a transformation to the TT gauge is needed. We now construct the transformation for an observer on the xx-axis, by amending the theory developed in Sec. 2.3. The wave propagation vector becomes kμ=(2ω,2ω,0,0)k_{\mu}=(2\omega,-2\omega,0,0); then using Eq. (18) and noting the zero entries in Eq.(51), we have

Aαβ=(A11A11A120A11A11A120A12A12A2200000).A^{\alpha\beta}=\left(\begin{array}[]{cccc}A^{11}&A^{11}&A^{12}&0\\ A^{11}&A^{11}&A^{12}&0\\ A^{12}&A^{12}&A^{22}&0\\ 0&0&0&0\end{array}\right)\,. (52)

Eq. (22) is now applied, giving

A11=\displaystyle A^{\prime 11}= A11+i2ω(B0+B1),A12=A12+i2ωB2,A13=i2ωB3\displaystyle A^{11}+i2\omega(B^{0}+B^{1})\,,\qquad A^{\prime 12}=A^{12}+i2\omega B^{2}\,,\qquad A^{\prime 13}=i2\omega B^{3}
A22=\displaystyle A^{\prime 22}= A22+i2ω(B1B0),A23=0,A33=i2ω(B1B0).\displaystyle A^{22}+i2\omega(B^{1}-B^{0})\,,\qquad A^{\prime 23}=0\,,\qquad A^{\prime 33}=i2\omega(B^{1}-B^{0})\,. (53)

The TT condition A22+A33=0A^{\prime 22}+A^{\prime 33}=0 is satisfied upon setting i2ω(B1B0)=A22/2i2\omega(B^{1}-B^{0})=-A^{22}/2, and then (B0+B1),B2,B3(B^{0}+B^{1}),B^{2},B^{3} are chosen so that A11=A12=A13=0A^{\prime 11}=A^{\prime 12}=A^{\prime 13}=0. Thus,

h¯ij=Mr02ω2r(0000cos2ω(tr)000cos2ω(tr)).\bar{h}^{ij}=\frac{Mr_{0}^{2}\omega^{2}}{r}\left(\begin{array}[]{ccc}0&0&0\\ 0&\cos 2\omega(t-r)&0\\ 0&0&-\cos 2\omega(t-r)\\ \end{array}\right)\,. (54)

Finally, in order to be able to make a consistent comparison of the waveform in different directions, we need to rotate the (x,z)(x,z) coordinates about the yy-axis through an angle π/2\pi/2 so that the GW is travelling in the zz-direction. The result is

h¯ij=Mr02ω2r(cos2ω(tr)000cos2ω(tr)0000).\bar{h}^{ij}=\frac{Mr_{0}^{2}\omega^{2}}{r}\left(\begin{array}[]{ccc}-\cos 2\omega(t-r)&0&0\\ 0&\cos 2\omega(t-r)&0\\ 0&0&0\\ \end{array}\right)\,. (55)

In the general case, the observer’s position is expressed using spherical polars (r,θ,ϕ)(r,\theta,\phi); without loss of generality, we set ϕ=0\phi=0 since a non-zero ϕ\phi is equivalent to changing tt to tϕ/ωt-\phi/\omega. The wave propagation vector becomes kμ=(ω,ωsinθ,0,ωcosθ)k_{\mu}=(\omega,-\omega\sin\theta,0,-\omega\cos\theta), and the transformation to the TT gauge proceeds in a similar way to that described above, but the details are omitted. In order for the TT form to be expilicit, we must also make a coordinate transformation that represents a rotation through an angle θ\theta of the x,zx,z coordinates about the yy-axis. The result for a general circular binary with masses M1,M2M_{1},M_{2} is

h+=\displaystyle h_{+}= 2ω2μr02r(1+cos2θ)cos(2ω(tr)2ϕ)\displaystyle-\frac{2\omega^{2}\mu r_{0}^{2}}{r}(1+\cos^{2}\theta)\cos(2\omega(t-r)-2\phi)
h×=\displaystyle h_{\times}= 4ω2μr02rcosθsin(2ω(tr)2ϕ),\displaystyle-\frac{4\omega^{2}\mu r_{0}^{2}}{r}\cos\theta\sin(2\omega(t-r)-2\phi)\,, (56)

where μ=M1M2/(M1+M2)\mu=M_{1}M_{2}/(M_{1}+M_{2}) is M/2M/2 for the equal mass case, and μ\mu is called the reduced mass of the binary. The magnitudes of both GW components h+,h×h_{+},h_{\times} are maximum at θ=0,π\theta=0,\pi (i.e. for an observer seeing the binary “face-on”), and minimum for an observer at θ=π/2\theta=\pi/2 (i.e., for an observer in the plane of the binary). Note further that at θ=0,π\theta=0,\pi the GW is circularly polarized, whereas at θ=π/2\theta=\pi/2 it is linearly polarized.

3.2 Energy of gravitational waves

The energy associated with gravitational waves is a second-order effect, and so its justification goes beyond the linearized approximation considered so far. Here, we outline the calculation, but with many details omitted Isaacson68 . Eq. (1) is amended to

gαβ=gαβ[B]+hαβ[1]+hαβ[2],g_{\alpha\beta}=g^{[B]}_{\alpha\beta}+h^{[1]}_{\alpha\beta}+h^{[2]}_{\alpha\beta}\,, (57)

where: hαβ[1]h^{[1]}_{\alpha\beta} is hαβh_{\alpha\beta} in Eq. (1) and is the first-order perturbation; hαβ[2]h^{[2]}_{\alpha\beta} is the second-order perturbation; and gαβ[B]g^{[B]}_{\alpha\beta} is the background metric, and is a little different to the Minkowski metric ηαβ\eta_{\alpha\beta} because it includes the response of the background geometry to the first-order perturbations. Evaluation of the Ricci tensor for the metric of Eq. (57) gives a similar expansion

Rαβ=Rαβ[B](g[B])+Rαβ[1](h[1])+Rαβ[2](h[1])+Rαβ[1](h[2]),R_{\alpha\beta}=R^{[B]}_{\alpha\beta}(g^{[B]})+R^{[1]}_{\alpha\beta}(h^{[1]})+R^{[2]}_{\alpha\beta}(h^{[1]})+R^{[1]}_{\alpha\beta}(h^{[2]})\,, (58)

where Rαβ[1](h[1])R^{[1]}_{\alpha\beta}(h^{[1]}) is of order 𝒪(h[1]){\mathcal{O}}\left(h^{[1]}\right), and the last two terms are of order 𝒪((h[1])2){\mathcal{O}}\left(\left(h^{[1]}\right)^{2}\right). Then solving Eq. (58) to first-order in h[1]h^{[1]} gives Rαβ[1](h[1])=0R^{[1]}_{\alpha\beta}(h^{[1]})=0, from which the linearized theory discussed in the preceeding sections of this chapter is obtained.

The next step is to split the second-order part of RαβR_{\alpha\beta} into a part that varies on scales much larger than the wavelength of the GWs, and a remainder that contains the local fluctuations. Defining <x><x> to be the average of the quantity xx over a suitable region (in a sense made precise in Isaacson68 ), then

Rαβ[B](g[B])+<Rαβ[2](h[1])>\displaystyle R^{[B]}_{\alpha\beta}(g^{[B]})+<R^{[2]}_{\alpha\beta}(h^{[1]})> =0,\displaystyle=0\,, (59)
Rαβ[1](h[2])+Rαβ[2](h[1])<Rαβ[2](h[1])>\displaystyle R^{[1]}_{\alpha\beta}(h^{[2]})+R^{[2]}_{\alpha\beta}(h^{[1]})-<R^{[2]}_{\alpha\beta}(h^{[1]})> =0.\displaystyle=0\,. (60)

We can then define

Tαβ[GW]=18π(<Rαβ[2](h[1])>12gαβ[B]<R[2](h[1])>),T^{[GW]}_{\alpha\beta}=-\frac{1}{8\pi}\left(<R^{[2]}_{\alpha\beta}(h^{[1]})>-\frac{1}{2}g^{[B]}_{\alpha\beta}<R^{[2]}(h^{[1]})>\right)\,, (61)

since, from Eq. (59), the Einstein tensor for the background metric can be written as

Gαβ[B]=Rαβ[B]12gαβ[B]R[B]=(<Rαβ[2](h[1])>12gαβ[B]<R[2](h[1])>).G^{[B]}_{\alpha\beta}=R^{[B]}_{\alpha\beta}-\frac{1}{2}g^{[B]}_{\alpha\beta}R^{[B]}=-\left(<R^{[2]}_{\alpha\beta}(h^{[1]})>-\frac{1}{2}g^{[B]}_{\alpha\beta}<R^{[2]}(h^{[1]})>\right)\,. (62)

Evaluation of Tαβ[GW]T^{[GW]}_{\alpha\beta} leads to a simple form when hαβ[1]h^{[1]}_{\alpha\beta} satisfies the Lorentz and TT gauge conditions

Tαβ[GW]=132π<(αhμν)(βhμν)>.T^{[GW]}_{\alpha\beta}=\frac{1}{32\pi}<(\partial_{\alpha}h_{\mu\nu})(\partial_{\beta}h^{\mu\nu})>\,. (63)

In the case of GWs moving radially outwards from a source which is far away, we can write

T00[GW]=T01[GW]=T11[GW]=132π<(thμν)(thμν)>=116π<(th+)2+(th×)2>.T^{[GW]}_{00}=-T^{[GW]}_{01}=T^{[GW]}_{11}=\frac{1}{32\pi}<(\partial_{t}h_{\mu\nu})(\partial_{t}h^{\mu\nu})>=\frac{1}{16\pi}<(\partial_{t}h_{+})^{2}+(\partial_{t}h_{\times})^{2}>\,. (64)

The stress-energy tensor Tαβ[GW]T^{[GW]}_{\alpha\beta} can be written in the form ρkαkβ\rho k_{\alpha}k_{\beta} with kαk_{\alpha} being the null vector (1,1,0,0)(-1,1,0,0), and ρ\rho being the right hand side of Eq. (64); thus Tαβ[GW]T^{[GW]}_{\alpha\beta} can be interpreted as being sourced by dust comprising massless particles moving radially outwards at the speed of light. The total power crossing the 2-surface SS at r=r=constant, t=t= constant is then

L[GW]=ST[GW]01r2𝑑Ω=r216πS(th+)2+(th×)2dΩ,L^{[GW]}=\int_{S}T^{[GW]01}r^{2}d\Omega=\frac{r^{2}}{16\pi}\int_{S}(\partial_{t}h_{+})^{2}+(\partial_{t}h_{\times})^{2}d\Omega\,, (65)

with the averaging <><> removed since h+,h×h_{+},h_{\times} vary slowly in the angular directions. For example, in the case of a circular binary with h+,h×h_{+},h_{\times} given by Eq. (56),

L[GW]=325μ2r04ω6.L^{[GW]}=\frac{32}{5}\mu^{2}r_{0}^{4}\omega^{6}\,. (66)

Eq. (65) describes a power output, and conservation of energy means that it should be balanced by an energy loss somewhere else. Indeed, there is a precise result to this effect Bondi62 ; Sachs62 , which we briefly outline. The mathematical framework used is coordinates based on outgoing null cones. For example, the metric of Minkowski spacetime is

ds2=du22dudr+r2(dθ2+sin2θdϕ2)ds^{2}=-du^{2}-2du\,dr+r^{2}(d\theta^{2}+\sin^{2}\theta d\phi^{2}) (67)

in which the coordinate transformation tu=trt\rightarrow u=t-r has been applied to the usual Minkowski metric in spherical polar coordinates (ds2=dt2+dr2+r2(dθ2+sin2θdϕ2)ds^{2}=-dt^{2}+dr^{2}+r^{2}(d\theta^{2}+\sin^{2}\theta d\phi^{2})). For a general spacetime, the metric is of the form Eq. (67) plus additional terms (which may not be small). However, the key assumption made is that the geometry is asymptotically flat, which means that the additional terms 0\rightarrow 0 at least as fast as 1/r1/r as rr\rightarrow\infty. In the general case, the Einstein equations are very complicated, but if we consider only the leading-order term (in an asymptotic expansion in 1/r1/r), then they simplify enormously and it can be shown that

dmdu=L[GW].\frac{dm}{du}=-L^{[GW]}\,. (68)

In general, the physical meaning of mm is ambiguous, but in the case that the spacetime is static the physical meaning is clear: it is the Schwarzschild mass of the system. We can therefore consider a system that is static until time u1u_{1}, then emits GWs until time u2u_{2} from when it is again static:

m(u2)m(u1)=u1u2L[GW]𝑑u.m(u_{2})-m(u_{1})=-\int_{u_{1}}^{u_{2}}L^{[GW]}du\,. (69)

If the dimensionless quantity dm/du1dm/du\ll 1 then the system is regarded as quasi-static, and mm in Eq. (68) is treated as the Schwarzschild mass. What does it mean to say that the total mass of a system, for example an orbiting binary, is decreasing? In general relativity, all forms of energy contribute to the total mass. If the system is approximately Newtonian and satisfies the conditions for the quadrupole formula to be valid, then the total mass is the sum of the rest masses of the orbiting objects (which does not change), together with the Newtonian kinetic plus potential energies. This matter is pursued further in Sec. 4.1.

As well as energy, GWs may also carry angular momentum and linear momentum. The formula for the linear momentum PiP^{i} is

dPidt=r216πSni((th+)2+(th×)2)𝑑Ω,\frac{dP^{i}}{dt}=-\frac{r^{2}}{16\pi}\int_{S}n^{i}\left((\partial_{t}h_{+})^{2}+(\partial_{t}h_{\times})^{2}\right)d\Omega\,, (70)

where nin^{i} is the unit vector normal to SS. When the motion of the source is in the xyx-y plane, then the angular momentum JiJ_{i} is in the zz-direction, and is

Jz=r216πSth+ϕh++th×ϕh×dΩ.J_{z}=\frac{r^{2}}{16\pi}\int_{S}\partial_{t}h_{+}\partial_{\phi}h_{+}+\partial_{t}h_{\times}\partial_{\phi}h_{\times}d\Omega\,. (71)

These formulas are derived, for example, in Alcubierre:2008 . Further, even when the source is not weak, sufficiently far from the source h+,h×h_{+},h_{\times} are small and so in the limit as rr\rightarrow\infty Eqs (65), (70), and (71) still apply.

3.3 Beyond the quadrupole formula

The derivation of the quadrupole formula assumed that velocities are small, but that does not apply for compact objects near merger. In this case, an accurate GW calculation requires going beyond the quadrupole formula. The various options available are summarized here.

The post-Newtonian (PN) method gives a formula for the GWs as a series with the first term in the series being the quadrupole result; thus, the method is an extension of the quadrupole formula. The series can be regarded as being in terms of powers of vv; however, the formalism used in PN work normally uses SI (with G,c1G,c\neq 1) rather than geometric units, and a term in the series expansion of the form 1/c2n1/c^{2n} is said to be of PN order nn. The quadrupole term is PN order 1, and other terms are PN order 32\frac{3}{2} or higher. PN methods accurately describe the inspiral of two compact objects; further, results are obtained simply by evaluating analytic formulas, and so the computation of the evolution of the system is very quick. However, as merger approaches the gravitational field becomes highly nonlinear and velocities may approach that of light, and the PN approximation ceases to be accurate. The PN method is discussed further in chapter 22 in this book; see also the review article Blanchet06 .

In numerical relativity, the full nonlinear Einstein equations are solved. The spacetime is foliated into a sequence of 3D hypersurfaces, and data on an initial hypersurface is evolved to the next hypersurface; the process is then repeated until the desired region of the spacetime has been covered. While numerical relativity is accurate even when the fields are highly nonlinear, it can require substantial computational resources. Thus, the GWs emitted by a compact binary system are normally calculated using the PN method during the inspiral, followed by numerical relativity for the final stages of the inspiral and the merger. As with PN methods, numerical relativity is a huge field, but it is not discussed further here. It is covered in chapter 23 in this book, and also in the text-books Alcubierre:2008 ; Baumgarte2010 .

Quasinormal modes are perturbations on a Kerr background that satisfy regularity conditions at the black hole horizon and infinity. After merger, and assuming a black hole is formed, a system will emit GWs at a frequency and decay rate that depend only on the final mass and angular momentum of the black hole. Details can be found in Chandrasekhar98 ; Kokkotas99a . This process is called the “ring-down”, and is quite short compared to the inspiral and merger phases. Physically, it represents a distorted black hole at merger relaxing to the static Kerr solution. The ring-down phase is normally included in a numerical simulation with the properties compared to the analytic solution as a check.

The search for GWs in detector data requires a large number (𝒪(104){\mathcal{O}}(10^{4})) of wavform templates to cover the parameter space of a binary merger. It is not practical to generate these templates using numerical relativity since each run takes several weeks. Thus, methods have been developed that use the limited numerical relativity results availableto set certain parameters in formulas that are either algebraic or the solution of an ordinary differential equation, thus enabling the rapid generation of the required waveform templates. The most commonly used such methods are the Effective One Body (EOB) approach (see, e.g., Damour2011 ), which can be regarded as being mainly an analytical modification of post-Newtonian theory; and Phenomenological waveform models (e.g., Santamaria2010 ), in which the waveform during the merger phase is constructed as a best fit to the numerical relativity data.

An Extreme Mass Ratio Inspiral (EMRI) is a compact binary inspiral and merger when the mass of one companion is much lower than that of the other Poisson11 . Astrophysically, such events occur when a supermassive black hole captures an object with a mass of order a few MM_{\odot}. Such events are not in the LIGO/Virgo frequency band, but are expected to be observed by LISA, see Sec. 5.

The result of GW calculations is often reported in terms of ψ4\psi_{4} (rather than h+,h×h_{+},h_{\times}), and the result decomposed into functions denoted by Y,ms{}_{s}Y^{\ell,m}. What are these quantities? The Newman-Penrose quantity ψ4\psi_{4} is a complex quantity defined in terms of the Weyl tensor and a null tetrad, and is related to h+,h×h_{+},h_{\times} by

ψ4=t2(h+ih×).\psi_{4}=\partial^{2}_{t}(h_{+}-ih_{\times})\,. (72)

The Y,ms{}_{s}Y^{\ell,m} are spin-weighted spherical harmonics that can be regarded as generalizations of the familiar spherical harmonics Y,mY^{\ell,m} to be able to represent vector and tensor quantities over the unit sphere. In order to describe quadrupolar GWs, only a limited set of the Y,ms{}_{s}Y^{\ell,m} will be needed, specifically

Y2,22\displaystyle{}_{2}Y^{2,2} =58πexp(2iϕ)(1cosθ)2,Y2,22=58πexp(2iϕ)(1+cosθ)2\displaystyle=\frac{\sqrt{5}}{8\sqrt{\pi}}\exp(2i\phi)(1-\cos\theta)^{2}\,,\;{}_{2}Y^{2,-2}=\frac{\sqrt{5}}{8\sqrt{\pi}}\exp(-2i\phi)(1+\cos\theta)^{2}\,
Y2,22\displaystyle{}_{-2}Y^{2,2} =58πexp(2iϕ)(1+cosθ)2,Y2,22=58πexp(2iϕ)(1cosθ)2.\displaystyle=\frac{\sqrt{5}}{8\sqrt{\pi}}\exp(2i\phi)(1+\cos\theta)^{2}\,,\;{}_{-2}Y^{2,-2}=\frac{\sqrt{5}}{8\sqrt{\pi}}\exp(-2i\phi)(1-\cos\theta)^{2}\,. (73)

Then, for example, Eq. (56), which gives the GW emission of a circular binary, can be re-expressed as

ψ4\displaystyle\psi_{4} =32μr02ω45π5r×\displaystyle=\frac{32\mu r_{0}^{2}\omega^{4}\sqrt{5\pi}}{5r}\times
[cos(2ωt)(Y2,22+Y2,22)+sin(2ωt)(iY2,22iY2,22)].\displaystyle\left[\cos(2\omega t)\left({}_{-2}Y^{2,-2}+{}_{-2}Y^{2,2}\right)+\sin(2\omega t)\left(i\,{}_{-2}Y^{2,-2}-i\,{}_{-2}Y^{2,2}\right)\right]\,. (74)

We refer to the literature for a fuller discussion of the definition and properties of the Newman-Penrose scalars, and of spin-weighted spherical harmonics, e.g. Alcubierre:2008 ; Baumgarte2010 ; Bishop16 .

4 The gravitational wave field of an orbiting binary

4.1 Inspiral rate and time to merger

Refer to caption
Figure 2: Variables used in the orbiting circular binary calculations

We now consider the long-term behaviour of a binary system in circular orbit, with the GW field given by Eq. (56) and consequently the energy loss LGWL_{GW} given by Eq. (66). The variables used are illustrated in Fig. 2: there are two masses m1,m2m_{1},m_{2} with velocities v1,v2v_{1},v_{2} respectively in circular orbit of diameter r0r_{0} and angular velocity ω\omega, and let M=m1+m2,μ=m1m2/MM=m_{1}+m_{2},\mu=m_{1}m_{2}/M. Let r1,r2r_{1},r_{2} be the distances from m1,m2m_{1},m_{2} respectively to the centre of mass of the system CC so that r1+r2=r0r_{1}+r_{2}=r_{0} and r1m1=r2m2r_{1}m_{1}=r_{2}m_{2}; solving these two equations leads to r1=r0m2/M,r2=r0m1/Mr_{1}=r_{0}m_{2}/M,r_{2}=r_{0}m_{1}/M. In Newtonian theory, gravitational attraction is balanced by the orbital centripetal acceleration, which leads to

ω2=Mr03,\omega^{2}=\frac{M}{r_{0}^{3}}\,, (75)

then the Newtonian kinetic plus potential energy of the system is

E=μMr0+12ω2r02μ=μMr0+μM2r0=μM2r0.E=-\frac{\mu M}{r_{0}}+\frac{1}{2}\omega^{2}r_{0}^{2}\mu=-\frac{\mu M}{r_{0}}+\frac{\mu M}{2r_{0}}=-\frac{\mu M}{2r_{0}}\,. (76)

Now, conservation of energy gives tE+L[GW]=0\partial_{t}E+L^{[GW]}=0 with L[GW]L^{[GW]} given by Eq. (66), so that

tr0=64M2μ5r03,\partial_{t}r_{0}=-\frac{64M^{2}\mu}{5r_{0}^{3}}\,, (77)

which easily integrates to give a time to coalescence tct_{c} (when r0(t)=0r_{0}(t)=0)

tc=5r04256M2μ.t_{c}=\frac{5r_{0}^{4}}{256M^{2}\mu}\,. (78)

Eqs. (77) and (78) lead to important results in GW astrophysics. Consider the example of a binary system comprising two 10M10M_{\odot} black holes in circular orbit such that the time to coalescence tct_{c} is 10910^{9} years: what is the initial diameter of the orbit r0r_{0}? The value of r0r_{0} for which tct_{c} is 10910^{9} years is astrophysically important, since if the initial separation were larger (by a factor of 2), then tct_{c} would be greater than the age of the universe and so GW emission alone would not cause the binary to coalesce. Inverting Eq. (78) gives

r0=4(109years×2000M35)14=7.5×106km,r_{0}=4\left(\frac{10^{9}\mbox{years}\times 2000M_{\odot}^{3}}{5}\right)^{\frac{1}{4}}=7.5\times 10^{6}\;\mbox{km}\,, (79)

where we have used the conversion factors 11\;year = 0.95×10130.95\times 10^{13}\;km, and 1M=1.481M_{\odot}=1.48\;km. By astronomical standards, the distance 7.5×1067.5\times 10^{6}\;km is very small – it is about 5 solar diameters. Thus, the formation of coalescing black hole binaries must involve astrophysical processes other than GW emission.

The derivation of Eqs. (77) and (78) assumed that the system is quasi-static, but that is not the case at, and just before, coalescence. From the quadrupole formula the magnitude of a GW scales as v2v^{2} (where vv is the orbital velocity), and the error is 𝒪(v4){\mathcal{O}}(v^{4}). The problem being considered determines an acceptable magnitude for the error. Suppose that we take vmax=0.1v_{\mbox{\small max}}=0.1 (so that the error in the GW is 𝒪(104){\mathcal{O}}(10^{-4})), and we find that for the equal mass binary example the orbital diameter when v=vmaxv=v_{\mbox{\small max}} is r0=50Mr_{0}=50M. Working out the time to coalescence gives 12s; of course, this figure is not accurate, but if the time to coalescence is 10910^{9} years any error occuring during the last few seconds of the inspiral is negligible.

Differentiating Eq. (75) gives

tωω=32tr0r0,\frac{\partial_{t}\omega}{\omega}=-\frac{3}{2}\frac{\partial_{t}r_{0}}{r_{0}}\,, (80)

and we can then replace tr0,r0\partial_{t}r_{0},r_{0} in Eq. (77) by tω,ω\partial_{t}\omega,\omega to obtain the chirp mass formula

(m1m2)3/5M1/5=μ3/5M2/5=(596π8/3f11/3(tf))3/5,{\mathcal{M}}\equiv\frac{(m_{1}m_{2})^{3/5}}{M^{1/5}}=\mu^{3/5}M^{2/5}=\left(\frac{5}{96}\pi^{-8/3}f^{-11/3}(\partial_{t}f)\right)^{3/5}\,, (81)

where ff is the wave frequency, related to the orbital angular velocity by ω=πf\omega=\pi f. Eq. (81) will be discussed further in Sec. 5.2: the key point is that both ff and tf\partial_{t}f are determinable entirely from LIGO/Virgo data, and thus so is the chirp mass {\mathcal{M}}. Observed changes to the orbital frequency of the Hulse-Taylor binary pulsar Taylor79 , and other systems, have provided confirmation of this formula since the 1970s.

The decrease in the Newtonian orbital energy of the binary (Eq. (76)) can be regarded as being caused by a radiation reaction force (also called the “self-force”). A straightforward Newtonian calculation shows that the radiation reaction is of magnitude

F=325μ2r03ω5,F=\frac{32}{5}\mu^{2}r_{0}^{3}\omega^{5}\,, (82)

and is applied to each object in the binary in the opposite direction to that of its orbital motion. Since the radiation reaction comprises two equal and opposite forces a distance r0r_{0} apart, it imposes a torque of magnitude r0Fr_{0}F which is equal to the rate of change of angular momentum of the system. Since angular momentum is conserved, the GWs must be carrying angular momentum away at a rate

r0F=325μ2r04ω5=32μ2M5/25r07/2.r_{0}F=\frac{32}{5}\mu^{2}r_{0}^{4}\omega^{5}=\frac{32\mu^{2}M^{5/2}}{5r_{0}^{7/2}}\,. (83)

Now the angular momentum JJ of the system is

J=ω(m1r12+m2r22)=r02ωμ=μMr0,J=\omega(m_{1}r_{1}^{2}+m_{2}r_{2}^{2})=r_{0}^{2}\omega\mu=\mu\sqrt{Mr_{0}}\,, (84)

so that

tJ=μMtr02r0=32μ2M5/25r07/2\partial_{t}J=\mu\frac{\sqrt{M}\partial_{t}r_{0}}{2\sqrt{r_{0}}}=-\frac{32\mu^{2}M^{5/2}}{5r_{0}^{7/2}} (85)

which is consistent with Eq. (83). The rate at which GWs transport angular momentum may be expressed in a form similar to Eq. (65), giving the value found in Eq. (85) for a circular binary. This also applies to the transport of linear momentum; although it is zero for the binary being considered here, it can be substantial for two black holes with misaligned spins. These results are not discussed here, but are given, for example, in Baumgarte2010 .

4.2 Eccentricity reduction by gravitational waves

In general binary orbits are elliptical rather than circular, but an effect of GWs is to circularize an elliptic orbit. Thus close to merger (when the GW emission is strongest and most likely to be detected), binary orbits have a very low eccentricity and can be treated as circular; the only exception would be the unlikely scenario that the binary is located in a sytem containing other massive objects that increase the eccentricity of the binary. The eccentricity reduction formula was first derived in Peters:1964 , and here we just outline the calculation.

An elliptical orbit is described by its semi-major axis aa and its eccentricity ee. In Newtonian theory, these are constants of the motion, and here they are treated as slowly varying functions of time. The orbital equation is

r=2a(1e2)1+ecosϕ,r=\frac{2a(1-e^{2})}{1+e\cos\phi}\,, (86)

where rr is the distance between the orbiting objects. A circular orbit is recovered on setting e=0,a=r0/2e=0,a=r_{0}/2 so that r=2a=r0r=2a=r_{0}. The Newtonian energy and Newtonian angular momentum are

E=μM4a,J=μ2Ma(1e2),E=-\frac{\mu M}{4a}\,,\;\;J=\mu\sqrt{2Ma(1-e^{2})}\,, (87)

and the rates of energy and angular momentum emission by GWs are

L[GW]\displaystyle\left<L^{[GW]}\right> =μ2M35a5(1e2)7/2(1+73e224+37e496),\displaystyle=-\frac{\mu^{2}M^{3}}{5a^{5}(1-e^{2})^{7/2}}\left(1+\frac{73e^{2}}{24}+\frac{37e^{4}}{96}\right)\,, (88)
dJdt\displaystyle\left<\frac{dJ}{dt}\right> =22μ2M5/25a7/2(1e2)2(1+7e28),\displaystyle=-\frac{2\sqrt{2}\mu^{2}M^{5/2}}{5a^{7/2}(1-e^{2})^{2}}\left(1+\frac{7e^{2}}{8}\right)\,, (89)

where\left<\;\right> denotes an average over one orbit. The above formulas reduce to those for a binary in circular orbit on seting e=0,a=r0/2e=0,a=r_{0}/2. Substituting Eqs. (87) into Eqs. (89) leads to equations for ta,te\partial_{t}a,\partial_{t}e, which are combined to give

dade=12a19e(1+𝒪(e2)),\left<\frac{da}{de}\right>=\frac{12a}{19e}\left(1+{\mathcal{O}}(e^{2})\right)\,, (90)

from which it follows that

ee0(aa0)19/12.e\approx e_{0}\left(\frac{a}{a_{0}}\right)^{19/12}\,. (91)

During the time that GW emission starts to drive an inspiral until the GWs become detectable, the size of the orbit (i.e. aa) decreases by a factor of about 10410^{4}, so that e106e0<106e\approx 10^{-6}e_{0}<10^{-6}.

5 Detection of gravitational waves

GW detectors are sensitive only in a specific frequency range, and so key factors for the detection of a GW are its magnitude and frequency, and it is useful to have simple order of magnitude estimates of these quantities. From Eqs. (56) and (75), it follows that

|h+,h×|=𝒪(2μMr0r),|h_{+},h_{\times}|={\mathcal{O}}\left(\frac{2\mu M}{r_{0}r}\right)\,, (92)

which is maximized just before merger when r02Mr_{0}\sim 2M so that

|h+,h×|=𝒪(μr)0.5×1019μ/1Mr/1Mpc.|h_{+},h_{\times}|={\mathcal{O}}\left(\frac{\mu}{r}\right)\approx 0.5\times 10^{-19}\frac{\mu/1M_{\odot}}{r/1\mbox{Mpc}}\,. (93)

The GW frequency is estimated from Eq. (75) as ω=2/(4M)\omega=\sqrt{2}/(4M) so that

f24πM2×104MMHz.f\sim\frac{\sqrt{2}}{4\pi M}\approx 2\times 10^{4}\frac{M_{\odot}}{M}\,\mbox{Hz}\,. (94)

The GW from a compact binary inspiral slowly increases in magnitude and frequency until reaching the maximum values indicated above.

Refer to caption
Figure 3: LIGO, LISA and Pulsar Timing frequency ranges and target astrophysical events

The various options for measuring GWs are described in detail in chapters 2 to 7 in this book, and here only an outline is presented. The frequency ranges in which detection systems are sensitive, and the corresponding astrophysical processes, are summarized in Fig. 3.

5.1 GW detection facilities

Refer to caption
Figure 4: Schematic representation of a terrestrial laser interferometer detector. The arm length LL is 4km for LIGO and 3km for Virgo.

Ground-based laser interferometer detectors detector comprise two perpendicular arms each of length several km, and laser light travels along each arm and back again, see Fig. 4; then the difference in travel time between the two arms is measured as an interference fringe. These detectors are sensitive in the frequency range 20\approx 20Hz to 2000\approx 2000Hz, and have observed compact binary mergers (with the compact objects being neutron stars or stellar-mass black holes) since September 2015 Abbott2016a . Other potential GW sources that may be detected are discussed in Sec. 5.3. Detector sensitivity has been continually improved since the first science run in 2001. This is important, since a 10-fold increase in sensitivity means a 10-fold increase in the distance at which a source may be detected (see Eq. 46), which increases the volume searched and therefore the expected detection rate by a factor of 10001000. Currently the network comprises operational detectors GEO600 in Germany (this is the least sensitive detector), KAGRA in Japan, LIGO Livingstone in USA, LIGO Hanford in USA and VIRGO in Italy, with LIGO-India scheduled to be operational in 2024. In order for there to be a high probability that an observation of a GW event is real rather than an instrument glitch, it is important that it be observed in at least two detectors. Further, the accuracy of sky localization increases with the number of (well-separated) detectors in which the event is observed; accurate sky localization improves the chance that an EM counterpart can be identified, leading to a fuller picture of the astrophysics of the event.

The Laser Interferometer Space Antenna (LISA) is a mission of the European Space Agency and is scheduled for launch in 2034. It comprises three satellites at the vertices of an equilateral triangle with sides 2.5×1062.5\times 10^{6}km in an Earth-like orbit around the Sun. It will be sensitive to GWs in the frequency band 10510^{-5}Hz to 11Hz. The GW events that are expected to be observed include the merger of supermassive black holes and EMRIs. LISA should also detect GWs from close binaries comprising stellar-mass black holes, neutron stars or white dwarfs, and should also be able to predict the end-point of an inspiral, i.e. the time of a compact binary merger event detectable by a gound based facility.

Millisecond pulsars can be regarded as highly accurate clocks. Pulsar timing projects measure accurately the time of arrival of each pulse from a particular pulsar, and compare it to the expected time of arrival assuming a uniform time between the pulses. This difference is recorded as the timing residual. (In practice, the process is rather more complicated, because it is only when averaged over a number of pulses that uniform pulse emission occurs, and further there may be glitches in pulse emission; thus the data analysis is statistical rather than direct). These measurements are repeated for a number of pulsars distributed over the sky. If a GW passes the Earth, then the timing residuals will oscillate according to the form of the GW. Pulsar timing is sensitive to GWs at very low frequencies, about 10710^{-7}Hz to 10910^{-9}Hz corresponding to periods of a few months up to a decade, generated, for example, by super-massive black hole binaries well before merger. Data collection has been ongoing since 2004, but to date has only been able to set upper limits.

5.2 Data interpretation and determination of source parameters

A major aspect of laser interferometer data analysis is to identify GW signals that are hidden by noisy data. The methods used vary according to the type of signal being searched for; these matters are not pursued here, but are discussed in detail in chapters 30 to 36. The presentation here is limited to an outline of the determination of the astrophysical parameters from GW data of a binary black hole merger.

Suppose that a GW signal is identified in detectors AA and BB, and that it is observed in detector BB a time tABt_{AB} after it is observed in detector AA. Then if ΔAB\Delta_{AB} is the light travel time from AA to BB, the direction to the GW source must be at an angle

α=arccos(tABΔAB)\alpha=\arccos\left(\frac{t_{AB}}{\Delta_{AB}}\right) (95)

to the line ABAB. In practice, there is an error involved in the measurement of tABt_{AB}, which means that the source is located within an annulus, rather than on a circle, on the sky. Further, as well as the timing of the GW signal, we also know its magnitude in each detector, and when combined with a model of the source this may lead to a further restriction on the direction to the source; i.e., the annulus may be reduced to a partial annulus. In the case that the GW event is seen in 3 or more detectors, the sky localization reduces to a point if there are no measurement errors, and in practice to a small region of the sky.

Refer to caption
Figure 5: Detector arms projected into plane N

For the next steps, we assume that the quadrupolar formulas remain valid. The chirp mass {\mathcal{M}} is determined using Eq. (81). Then, using Eq. (75) and ω=πf\omega=\pi f in Eq. (56) gives

(h+,h×)\displaystyle(h_{+},h_{\times}) =25/3(πf)2/3r\displaystyle=-\frac{2{\mathcal{M}}^{5/3}(\pi f)^{2/3}}{r}
×((1+cos2θ)cos(2ω(tr)2ϕ),2cosθsin(2ω(tr)2ϕ)).\displaystyle\times\left((1+\cos^{2}\theta)\cos(2\omega(t-r)-2\phi),2\cos\theta\sin(2\omega(t-r)-2\phi)\right)\,. (96)

Thus, a measurement of the polarization modes h+,h×h_{+},h_{\times} would lead to the ratio h×/h+h_{\times}/h_{+} which can be solved for the angle θ\theta (which is the polar coordinate of the Earth in the reference frame of the source), and then Eq. (96) may be solved to find the distance rr between source and Earth. If the event is well-localized on the sky, then the plane NN transverse to the propagation direction is known. Within NN, let 𝒆θ,𝒆ϕ\bm{e}_{\theta},\bm{e}_{\phi} be unit vectors in the θ,ϕ\theta,\phi directions of the source frame, with respect to which h+,h×h_{+},h_{\times} of Eq (96) are defined. Let 𝒆1x,𝒆1y\bm{e}_{1x},\bm{e}_{1y} be unit vectors representing the directions of the arms of a detector D1D_{1} projected into NN, and let the angle between 𝒆θ\bm{e}_{\theta} and 𝒆1x\bm{e}_{1x} be ψ\psi. The angle between 𝒆1x\bm{e}_{1x} and 𝒆1y\bm{e}_{1y} is known, so that between 𝒆θ\bm{e}_{\theta} and 𝒆1y\bm{e}_{1y} is known in terms of ψ\psi. These quantities are illustrated in Fig. 5. Now, given h+,h×h_{+},h_{\times} and ψ\psi, we can use Eq. (28) to rotate the coordinate axes and find the change in length of each arm of D1D_{1} due to the GW, and the difference between these changes is the detector response. Thus the measured detector response leads to one equation involving the three unknowns h+,h×h_{+},h_{\times} and ψ\psi. Measurements in detectors D2,D3D_{2},D_{3} (and if possible, more detectors so as to reduce error bars) then provide sufficient information to close the system and solve for the unknowns.

The remaining astrophysical parameters are determined using features of the waveform beyond the quadrupole formula. A template bank of merger waveforms over the parameter space to be explored is needed. For black hole mergers, the frequency of the GWs scales as the inverse of the total mass MM, and the magnitude scales as MM; then the parameter space comprises the mass ratio q=m1/m2q=m_{1}/m_{2}, and the spins 𝑺1,𝑺2\bm{S}_{1},\bm{S}_{2}. The mismatch between the calculated waveform (hch_{c}) and the observed waveform (hoh_{o}) is defined as

(hc)=1hc,ho{\mathcal{M}}(h_{c})=1-\langle h_{c},h_{o}\rangle (97)

where the inner product is evaluated in the Fourier domain, and includes the detector response function of each detector in which a signal is observed. It is normalized so that it satisfies the conditions 0h1,h210\leq\langle h_{1},h_{2}\rangle\leq 1 with h1,h2=1\langle h_{1},h_{2}\rangle=1 iff h1=h_{1}= constant ×h2\times h_{2}. Then (hc){\mathcal{M}}(h_{c}) is minimized over the template bank. Since hoh_{o} is subject to measurement error, the values found for the various astrophysical parameters are also subject to error.

5.3 GW observations: Contributions to Physics and Astrophysics

Fundamental physics

The inspiral and merger of a binary neutron star system was observed by LIGO/Virgo as GW170817 Abbott2017f ; Abbott2017c . It was followed, 1.71.7s after merger in the GW signal, by a gamma-ray burst GRB 170817A. The distance to the system is 4040Mpc, so we can deduce that the speed of GWs (cGWc_{\tiny{\mbox{GW}}}) is very close to being exactly that of light (cEMc_{\tiny{\mbox{EM}}})  Abbott2017GRB

13×1015cGWcEM1+7×1016.1-3\times 10^{-15}\leq\frac{c_{\tiny{\mbox{GW}}}}{c_{\tiny{\mbox{EM}}}}\leq 1+7\times 10^{-16}\,. (98)

In general relativity, the speed of GW propagation is exactly the same as that of light so this result is a strong test of the theory. Further, all GW waveforms observed to date have been consistent with the predictions of general relativity. These waveforms have been produced by compact object mergers and include the strong field regime. (Previously, experiments have only been able to test the theory in the regime of small deviations from Newtonian theory). Again, this amounts to a strong test of general relativity.

Cosmology

Data for the binary neutron star event GW170817 was taken by three detectors enabling the angle θ\theta to be constrained; then Eq. (96) is used to provide an independent distance estimate. This is an example of a compact binary merger being a standard siren Schutz86b . Since an optical counterpart was identified, the red-shift of the system is known to good precision, and thus the expansion rate, i.e. the Hubble constant, can be measured. The result is 70.08.0+12.070.0^{+12.0}_{-8.0}km/s/Mpc. The uncertainty in the value should be reduced in future as more events are discovered. The value obtained is consistent with estimates from both cosmic microwave background data and supernovae (SN1a), and does not yet shed light on the tension between these values.

Astrophysics

The first GW detection, GW150914Abbott2016a , was the merger of two black holes of mass approximately 36M36M_{\odot} and 29M29M_{\odot} to give a 62M62M_{\odot} black hole. Until then, the most massive black hole that had been observed was 25M\approx 25M_{\odot}. As more and more such events are detected, the mass distribution of black holes in the local universe, as well as the rate at which mergers occur, will be well-constrained, and this information will enable improved modeling of black hole, and black hole merger, formation channels. Similar remarks apply to neutron star mergers, although to date only two such events have been reported. Event GW190814 Abbott2020 was particularly interesting because it involved the merger of a 23M23M_{\odot} black hole with a 2.6M2.6M_{\odot} compact object, which could be either the most massive neutron star, or the smallest black hole, to have been observed.

Besides compact object mergers, current detectors are expected to observe GWs from supernovae, as well as the continuous signal from any asymmetry in a millisecond pulsar (which is a rapidly rotating neutron star). Such signals have not yet been detected, but their observation should lead to significant improvement to the modeling of these systems.

6 Conclusion

GWs in general relativity were first predicted in 1916, but it was nearly 100 years until the first direct detection of a GW event in 2015. Detections of new events are becoming more and more frequent, and the results to date have already had an impact on astrophysics, cosmology and physics. This impact will grow as detector sensitivities are improved and new facilities come online, so that more events, and of different types, are detected. The future of GW astronomy is, of course, unknown, but we can expect that it will bring some fascinating results, and probably some surprises.

References

  • (1) A. Einstein, “Naherungsweise Integration der Feldgleichungen der Gravitation,” Sitzungsberichte der Königlich Preussischen Akademie der Wissenschaften (Berlin), vol. 1916, pp. 688–696, 1916.
  • (2) A. Einstein, “Über Gravitationswellen,” Sitzungsberichte der Königlich Preussischen Akademie der Wissenschaften (Berlin), vol. 1918, pp. 154–167, 1918.
  • (3) D. Kennefick, “Controversies in the History of the Radiation Reaction problem in General Relativity,” 1997.
  • (4) H. Bondi, M. G. J. van der Burg, and A. W. K. Metzner, “Gravitational waves in general relativity VII. Waves from axi-symmetric isolated systems,” Proc. R. Soc. London, vol. A269, pp. 21–52, 1962.
  • (5) R. Sachs, “Gravitational waves in general relativity VIII. Waves in asymptotically flat space-time,” Proc. Roy. Soc. London, vol. A270, pp. 103–126, 1962.
  • (6) R. Isaacson, “Gravitational radiation in the limit of high frequency. II. nonlinear terms and the effective stress tensor,” Phys. Rev., vol. 166, pp. 1272–1280, 1968.
  • (7) P. C. Peters and J. Mathews, “Gravitational radiation from point masses in a Keplerian orbit,” Phys. Rev., vol. 131, pp. 435–440, 1963.
  • (8) P. C. Peters, “Gravitational radiation and the motion of two point masses,” Phys. Rev., vol. 136, pp. B1224–B1232, 1964.
  • (9) J. Weber, “Evidence for discovery of gravitational radiation,” Phys. Rev. Lett., vol. 22, pp. 1320–1324, 1969.
  • (10) J. Taylor, L. Fowler, and P. McCulloch, “Measurements of general relativistic effects in the binary pulsar psr1913 + 16,” Nature, vol. 277, pp. 437–440, 1979.
  • (11) H. Lorentz and J. Droste, “The motion of a system of bodies under the influence of their mutual attraction, according to Einstein’s theory,” Versl. K. Akad. Wet. Amsterdam, vol. 26, pp. 392, 649, 1917.
  • (12) H. Lorentz and J. Droste, “The motion of a system of bodies under the influence of their mutual attraction, according to Einstein’s theory,” in The Collected Papers of H.A. Lorentz, Vol. 5, (H. Lorentz, ed.), pp. 330–355, The Hague, Netherlands: Nijhoff, 1937.
  • (13) R. Arnowitt, S. Deser, and C. W. Misner, “Republication of: The dynamics of general relativity,” General Relativity and Gravitation, vol. 40, pp. 1997–2027, Sept. 2008.
  • (14) L. Smarr, “Spacetimes generated by computers: Black holes with gravitational radiation,” Ann. N. Y. Acad. Sci., vol. 302, pp. 569–604, 1977.
  • (15) F. Pretorius, “Evolution of binary black hole spacetimes,” Phys. Rev. Lett., vol. 95, p. 121101, 2005.
  • (16) B. P. Abbott et. al., “Observation of Gravitational Waves from a Binary Black Hole Merger,” Phys. Rev. Lett., vol. 116, p. 061102, 2016.
  • (17) M. Alcubierre, Introduction to 3+13+1 Numerical Relativity. Oxford, UK: Oxford University Press, 2008.
  • (18) L. Blanchet, “Gravitational radiation from post-Newtonian sources and inspiralling compact binaries,” Living Rev. Relativ., vol. 9, p. 4, 2006.
  • (19) T. W. Baumgarte and S. L. Shapiro, Numerical Relativity: Solving Einstein’s Equations on the Computer. Cambridge University Press, Cambridge UK, 2010.
  • (20) S. Chandrasekhar, The mathematical theory of black holes. New York: Oxford University Press., 1998.
  • (21) K. Kokkotas and B. Schmidt, “Quasi-Normal Modes of Stars and Black Holes,” Living Reviews in Relativity, vol. 2, p. 2, Sept. 1999.
  • (22) T. Damour and A. Nagar, The Effective One-Body Description of the Two-Body Problem, pp. 211–252. Dordrecht: Springer Netherlands, 2011.
  • (23) L. Santamaría et al., “Matching post-Newtonian and numerical relativity waveforms: Systematic errors and a new phenomenological model for nonprecessing black hole binaries,” Phys. Rev. D, vol. 82, p. 064016, Sept. 2010.
  • (24) E. Poisson, A. Pound, and I. Vega, “The motion of point particles in curved spacetime,” Living Reviews in Relativity, vol. 14, 2011.
  • (25) N. T. Bishop and L. Rezzolla, “Extraction of gravitational waves in numerical relativity,” Living Reviews in Relativity, vol. 19, 2016.
  • (26) B. P. Abbott et. al., “GW170817: Observation of Gravitational Waves from a Binary Neutron Star Inspiral,” Phys. Rev. Lett., vol. 119, p. 161101, 2017.
  • (27) B. P. Abbott et. al., “Multi-messenger Observations of a Binary Neutron Star Merger,” Astrophys. J. Lett., vol. 848, p. L12, 2017.
  • (28) B. P. Abbott et. al., “Gravitational Waves and Gamma-Rays from a Binary Neutron Star Merger: GW170817 and GRB 170817A,” Astrophys. J. Lett., vol. 848, p. L13, 2017.
  • (29) B. F. Schutz, “Determining the hubble constant from gravitational wave observations,” Nature, vol. 323, pp. 310–311, 1986.
  • (30) B. P. Abbott et. al., “GW190814: Gravitational Waves from the Coalescence of a 23 Solar Mass Black Hole with a 2.6 Solar Mass Compact Object,” Astrophys. J. Lett., vol. 896, p. L44, 2020.

Index

  • Angular momentum of gravitational waves, \hyperpage17
  • Bar-detector, \hyperpage2
  • Chirp mass {\mathcal{M}}, \hyperpage21, \hyperpage26
  • Eccentricity, \hyperpage22
  • Effective One Body (EOB), \hyperpage18
  • Energy of gravitational waves, \hyperpage14
  • Extreme Mass Ratio Inspiral (EMRI), \hyperpage18, \hyperpage24
  • Gauge transformation, \hyperpage6, 7
  • Gauge-invariant, \hyperpage10
  • GEO600, \hyperpage3, \hyperpage23
  • Hubble constant, \hyperpage28
  • KAGRA, \hyperpage23
  • Laser interferometer, \hyperpage3, \hyperpage23
  • LIGO, \hyperpage3, \hyperpage18, \hyperpage21, \hyperpage23–25, \hyperpage27
  • Linearized Einstein equations, \hyperpage5, \hyperpage12
  • LISA, \hyperpage18, \hyperpage24
  • Lorentz gauge, \hyperpage6, 7, \hyperpage15
  • Mismatch between waveforms, \hyperpage27
  • Momentum of gravitational waves, \hyperpage17
  • Newman-Penrose quantity ψ4\psi_{4}, \hyperpage18
  • Numerical relativity, \hyperpage3, \hyperpage17, 18
  • Orbiting binary, \hyperpage2, \hyperpage13, \hyperpage16, \hyperpage18, 19, \hyperpage23, \hyperpage25, \hyperpage28
  • Phenomenological waveform models, \hyperpage18
  • Plane wave solution, \hyperpage7
  • Polarization modes h+,h×h_{+},h_{\times}, \hyperpage9, \hyperpage11, \hyperpage14, \hyperpage27
  • Post-Newtonian, \hyperpage3, \hyperpage17
  • Pulsar timing, \hyperpage24
  • Quadrupole formula, \hyperpage2, \hyperpage12, \hyperpage16, 17, \hyperpage26
  • Quadrupole moment, \hyperpage13
  • Quasinormal modes, \hyperpage18
  • Self-force, \hyperpage21
  • Source parameter determination, \hyperpage25
  • Speed of gravitational waves, \hyperpage27
  • Spin-weighted spherical harmonic Y,ms{}_{s}Y^{\ell,m}, \hyperpage18
  • Time to coalescence, \hyperpage20
  • Trace reversed, \hyperpage5
  • TT gauge, \hyperpage8, 9, \hyperpage13–15
  • Virgo, \hyperpage3, \hyperpage23, \hyperpage27