Degenerate behavior in nonlinear vacuum electrodynamics

C. A. Escobar [email protected] Instituto de Física, Universidad Nacional Autónoma de México, Apdo. Postal 20-364, Ciudad de México 01000, México R. Potting [email protected] Departamento de Física, Universidade do Algarve, FCT, 8005-139 Faro, Portugal CENTRA, Instituto Superior Técnico, Universidade de Lisboa, Avenida Rovisco Pais, Lisboa, Portugal

Abstract

We study nonlinear vacuum electrodynamics in the first-order formulation proposed by Plebański. We analyze in detail the equations of motion, and identify conditions for which a singularity can occur for the time derivative of one of the field components. The resulting degenerate behavior can give rise to a shock wave with a reduction of the local number of degrees of freedom. We use an example model to illustrate the occurrence of superluminal propagation for field values approaching the singularity.

I Introduction

In 1934 Born and Infeld BornInfeld proposed a nonlinear modification of Maxwell theory with the objective to eliminate the infinite self-energy of a point charge. The model follows from a Lorentz-invariant Lagrangian and it gained renewed attention when it was shown that it can be derived as an effective action from quantized string theory FradkinTseytlin . Another nonlinear, Lorentz-invariant modification of Maxwell theory was first derived from quantum electrodynamics by Heisenberg and Euler EulerHeisenberg ; Berestetskii . Plebański Plebanski showed these models are two examples of a large class of nonlinear electrodynamics (NLED) theories that are defined by a gauge-invariant as well as Lorentz-invariant first-order Lagrangian. The equations of motion of these models can be expressed like the usual Maxwell equations in material media, where now the permitivities and permeabilities are Lorentz-invariant functions of the electromagnetic field variables. In recent decades models of nonlinear electrodynamics have been proposed in the context of QCD (the Pagels-Tomboulis model Arodz ), as arising from Kaluza-Klein compactification Lemos ; Gibbons , in order to explain accelerated expansion of the universe Novello , in the context of galactic magnetic fields Campanelli , etc. There is also an extensive literature of solutions of nonlinear electrodynamics in a curved space background, such as black holes AyonBeato ; Bronnikov .

Wave propagation in the presence of background fields in NLED theories has been studied by analyzing the so-called Fresnel equation, which amounts to a dispersion relation for the wave vectors. It can be derived by studying either the propagation of surfaces of discontinuities nonlinear-propagation1 ; nonlinear-propagation1a ; nonlinear-propagation2 , or by assuming an approximate a plane-wave ansatz Lammerzahl3 . It is found that subluminal as well as superluminal propagation is possible in general Lammerzahl ; causal1 , and that birefringence effects can occur birefringence1 ; Lammerzahl2 . In this work we will use a different approach and analyze the NLED equations of motion directly for arbitrary potentials.

As we will show in this work, for certain NLED potentials there are hypersurfaces in field space on which the equations of motion develop a singularity in the sense that the time derivative of one of the field components blows up. Moreover, the equation of motion for this field component contains an term which is multiplied by an undetermined coefficient, and at the same time a certain (nonlinear) combination of the field variables is forced to vanish. This indicates a loss of two phase space degrees of freedom in the dynamical system, from the usual four (just like in Maxwell theory) to two. We also consider a simple example in which we analyze wave propagation in a background of nonzero fields, and analyze what happens on points at which the background field lies on a singular hypersurface.

This paper is organized as follows. In section II we review the Plebański formulation of NLED. In section III we analyze the equations of motion, with an emphasis on the degenerate behavior on certain hypersurfaces in field space. In section IV we analyze the linearized equations of motion for fluctuations on a background of constant magnetic field, for a particular NLED potential, highlighting the behavior near the points on the degenerate hypersurface. Finally, we present our conclusions, as well as an outlook, in section V.

II First-order formulation of nonlinear electrodynamics

In this section we will review the first-order framework for nonlinear electrodynamics that we will use in this work, introducing notation and fixing our conventions. Starting point is the action

S=\int d^{4}x\,\mathcal{L}

(1)

in Minkowski space, with a Lagrangian density EscobarUrrutia

\mathcal{L}=-P^{\mu\nu}\partial_{\mu}A_{\nu}-V(P,Q)-A_{\mu}J^{\mu}

(2)

that depends on the vector potential $A_{\mu}$ and on the antisymmetric tensor $P^{\mu\nu}$ , which are treated as independent fields in (2). The potential $V$ is taken to depend on $P^{\mu\nu}$ through the Lorentz scalars

P=\frac{1}{4}P_{\mu\nu}P^{\mu\nu}\>\qquad\mbox{and}\qquad Q=\frac{1}{4}P_{\mu\nu}\tilde{P}^{\mu\nu}\>

(3)

where the dual to $P^{\mu\nu}$ is defined by

\tilde{P}^{\mu\nu}=\frac{1}{2}\epsilon^{\mu\nu\rho\sigma}P_{\rho\sigma}\>.

(4)

The Levi-Civita symbol is defined with the convention $\epsilon^{0123}=-\epsilon_{0123}=1$ . Note that in this work we assume the metric convention $(+,-,-,-)$ and use natural, Heaviside-Lorentz units (with $c=\hbar=1$ ). From (4) we find the inverse relation

P^{\mu\nu}=-\frac{1}{2}\epsilon^{\mu\nu\rho\sigma}\tilde{P}_{\rho\sigma}\>.

(5)

The last term in the Lagrangian density (2) defines a minimal coupling to the external current density $J^{\mu}$ , which is assumed to be conserved:

\partial_{\mu}J^{\mu}=0\>.

(6)

The equations of motion of (1) are

	$\displaystyle\frac{\delta S}{\delta A_{\mu}}$	$\displaystyle=-\partial_{\nu}P^{\mu\nu}-J^{\mu}=0\>,$		(7)
	$\displaystyle\frac{\delta S}{\delta P^{\mu\nu}}$	$\displaystyle=-\frac{1}{2}F_{\mu\nu}-\frac{1}{2}\left(V_{P}P_{\mu\nu}+V_{Q}\tilde{P}_{\mu\nu}\right)=0\>,$		(8)

where the lower indices on $V$ indicate the partial derivatives $V_{P}={\partial V}/{\partial P}$ and $V_{Q}={\partial V}/{\partial Q}$ , and

F_{\mu\nu}\equiv\partial_{\mu}A_{\nu}-\partial_{\nu}A_{\mu}\>.

(9)

Eq. (8) becomes the constitutive relation

F_{\mu\nu}=-2\frac{\partial V}{\partial P^{\mu\nu}}=-V_{P}P_{\mu\nu}-V_{Q}\tilde{P}_{\mu\nu}\>.

(10)

From definition (9) it follows that $F_{\mu\nu}$ satisfies the consistency condition (Bianchi identity)

\partial_{\mu}\tilde{F}^{\mu\nu}=\frac{1}{2}\partial_{\mu}\epsilon^{\mu\nu\rho\sigma}F_{\rho\sigma}=0\>.

(11)

The constitutive relation (10) can be inverted by considering $\mathcal{L}$ to be a function of $F^{\mu\nu}$ (as well as $A_{\mu}$ and $J^{\mu}$ ). By Lorentz invariance, $\mathcal{L}$ should then be a function of the invariants invariants

F=\frac{1}{4}F_{\mu\nu}F^{\mu\nu}\>\qquad\mbox{and}\qquad G=\frac{1}{4}F_{\mu\nu}\tilde{F}^{\mu\nu}\>.

(12)

Taking the variation of the action $S=\int\mathcal{L}(F,G)d^{4}x$ with respect to $F_{\mu\nu}$ and comparing it with the variation of the action in its original form (1) it follows that

P^{\mu\nu}=-\mathcal{L}_{F}F^{\mu\nu}-\mathcal{L}_{G}\tilde{F}^{\mu\nu}

(13)

which expresses the inverse of the constitutive relation (10).

It will be useful in the following to express the above relations in terms of the usual vector fields $\vec{D}$ , $\vec{E}$ , $\vec{H}$ and $\vec{B}$ by defining

P^{\mu\nu}=\begin{pmatrix}0&-D_{x}&-D_{y}&-D_{z}\\ D_{x}&0&-H_{z}&H_{y}\\ D_{y}&H_{z}&0&-H_{x}\\ D_{z}&-H_{y}&H_{x}&0\end{pmatrix}

(14)

and

F^{\mu\nu}=\begin{pmatrix}0&-E_{x}&-E_{y}&-E_{z}\\ E_{x}&0&-B_{z}&B_{y}\\ E_{y}&B_{z}&0&-B_{x}\\ E_{z}&-B_{y}&B_{x}&0\end{pmatrix}\>.

(15)

The invariants $P$ , $Q$ , $F$ and $G$ can then be written as

	$\displaystyle P$	$\displaystyle=\frac{1}{2}(\vec{H}^{2}-\vec{D}^{2})\qquad\mbox{and}\qquad Q=-\vec{H}\cdot\vec{D}\>,$		(16)
	$\displaystyle F$	$\displaystyle=\frac{1}{2}(\vec{B}^{2}-\vec{E}^{2})\qquad\mbox{and}\qquad G=-\vec{B}\cdot\vec{E}\,.$		(17)

while the constitutive relations (10) and (13) can be expressed in matrix form:

\begin{pmatrix}\vec{E}\\ \vec{B}\end{pmatrix}=\begin{pmatrix}-V_{P}&-V_{Q}\\ V_{Q}&-V_{P}\end{pmatrix}\begin{pmatrix}\vec{D}\\ \vec{H}\end{pmatrix}

(18)

and

\begin{pmatrix}\vec{D}\\ \vec{H}\end{pmatrix}=\begin{pmatrix}-\mathcal{L}_{F}&-\mathcal{L}_{G}\\ \mathcal{L}_{G}&-\mathcal{L}_{F}\end{pmatrix}\begin{pmatrix}\vec{E}\\ \vec{B}\end{pmatrix}\>.

(19)

Equations (7) and (11) then take the familiar form of the Maxwell equations in a material medium

$\displaystyle\vec{\nabla}\cdot\vec{D}$	$\displaystyle=J^{0}\>,$	(20)
$\displaystyle\vec{\nabla}\times\vec{H}-\frac{\partial\vec{D}}{\partial t}$	$\displaystyle=\vec{J}\>,$	(21)
$\displaystyle\vec{\nabla}\cdot\vec{B}$	$\displaystyle=0\>,$	(22)
$\displaystyle\vec{\nabla}\times\vec{E}+\frac{\partial\vec{B}}{\partial t}$	$\displaystyle=0\>.$	(23)

The relations (18) and (19) yield, by consistency,

\begin{pmatrix}-V_{P}&-V_{Q}\\ V_{Q}&-V_{P}\end{pmatrix}=\begin{pmatrix}-\mathcal{L}_{F}&-\mathcal{L}_{G}\\ \mathcal{L}_{G}&-\mathcal{L}_{F}\end{pmatrix}^{-1}

(24)

from which it follows that

\mathcal{L}_{F}=\frac{V_{P}}{V_{P}^{2}+V_{Q}^{2}}\qquad\mbox{and}\qquad\mathcal{L}_{G}=\frac{-V_{Q}}{V_{P}^{2}+V_{Q}^{2}}\>.

(25)

For the energy-momentum tensor we get from Eq. (2)

	$\displaystyle T_{\mu\nu}$	$\displaystyle=P_{\mu}{}^{\lambda}F_{\nu\lambda}-\eta_{\mu\nu}\mathcal{L}$
		$\displaystyle=V_{P}P_{\mu}{}^{\lambda}P_{\nu\lambda}-V_{Q}P_{\mu}{}^{\lambda}\tilde{P}_{\nu\lambda}+2\eta_{\mu\nu}\bigl{(}V_{P}P+V_{Q}Q\bigr{)}$		(26)

where in the second identity we used the constitutive relation (10). For the energy density we obtain

U=T_{00}=V-V_{P}|\vec{H}|^{2}-V_{Q}Q

(27)

where we made use of the relations (16) and the explicit form (14) of $P_{\mu\nu}$ .

It is instructive to consider a few well-known special cases.

The usual Maxwell equations in vacuum follow by taking $\mathcal{L}(F,G)=-F$ , so that $\mathcal{L}_{F}=-1$ and $\mathcal{L}_{G}=0$ . It then follows that $P^{\mu\nu}=-\mathcal{L}_{F}F^{\mu\nu}=F^{\mu\nu}$ , so that the constitutive relations are trivial: $\vec{D}=\vec{E}$ and $\vec{H}=\vec{B}$ . From relations (25) we find $V_{P}=-1$ , $V_{Q}=0$ , so that $V(P,Q)=-P$ .

Another example is given by the Born-Infeld action, which is defined by the potential

V(P,Q)=b^{2}\sqrt{1-\dfrac{2P}{b^{2}}-\dfrac{Q^{2}}{b^{4}}}-b^{2}\>.

(28)

where the Born-Infeld parameter $b$ has mass dimension 2. From its partial derivatives $V_{P}$ and $V_{Q}$

V_{P}=\dfrac{-1}{\sqrt{1-\dfrac{2P}{b^{2}}-\dfrac{Q^{2}}{b^{4}}}}\qquad\mbox{and}\qquad V_{Q}=\dfrac{\dfrac{Q}{b^{2}}}{\sqrt{1-\dfrac{2P}{b^{2}}-\dfrac{Q^{2}}{b^{4}}}}

(29)

we can find expressions for $\mathcal{L}_{F}$ and $\mathcal{L}_{G}$ in terms of the invariants $P$ and $Q$ by using relations (25). It is straightforward to obtain explicit relations for $P$ and $Q$ in terms of $F$ and $G$ (and vice-versa) by using the constitutive relation (10) together with the expressions (29). It follows that

\mathcal{L}_{F}=\dfrac{-1}{\sqrt{1+\dfrac{2F}{b^{2}}-\dfrac{G^{2}}{b^{4}}}}\qquad\mbox{and}\qquad\mathcal{L}_{G}=\dfrac{\dfrac{G}{b^{2}}}{\sqrt{1+\dfrac{2F}{b^{2}}-\dfrac{G^{2}}{b^{4}}}}\>.

(30)

which can be integrated to yield the explicit form of the Born-Infeld Lagrangian

\mathcal{L}=b^{2}-b^{2}\sqrt{1+\frac{2F}{b^{2}}-\frac{G^{2}}{b^{4}}}

(31)

in terms of the invariants $F$ and $G$ . For a detailed treatment of the Born-Infeld model in the context of the first-order formalism see Plebanski .

While for the Born-Infeld potential it is relatively straightforward to obtain the dual expression for the associated Lagrangian as a function of the invariants $F$ and $G$ in explicit form, this is not always the case for general Plebański models. In fact, the relations for $P$ and $Q$ in terms of $F$ and $G$ are not even always invertible, giving rise to branch points Bronnikov . In this case the physics described by the dual Lagrangian formulation only coincides with the original one for a restricted set of field values. We will see that this issue is particularly important for the range of field values that give rise to the singular behavior we will study below.

As we will use the fields $\vec{D}$ and $\vec{H}$ to describe the dynamics of the theory, we need their time development. The time development of the $\vec{D}$ field is directly determined by Ampère’s law (21). In order to extract the time development of $\vec{H}$ from Faraday’s law (23), it is necessary to use the constitutive relations (18). They depend on the quantities $V_{P}$ and $V_{Q}$ , which in turn are functions of $\vec{H}$ and $\vec{D}$ . This makes determining the time development of $\vec{H}$ less than trivial. Explicitly, from Eqs. (23) and (18) one finds

\frac{\partial}{\partial t}(V_{Q}\vec{D}-V_{P}\vec{H})=\vec{\nabla}\times(V_{P}\vec{D}+V_{Q}\vec{H})\>.

(32)

For the time derivatives of $V_{P}$ and $V_{Q}$ we have

	$\displaystyle\frac{\partial V_{P}}{\partial t}$	$\displaystyle=V_{PP}\frac{\partial P}{\partial t}+V_{PQ}\frac{\partial Q}{\partial t}$
		$\displaystyle=(V_{PP}\vec{H}-V_{PQ}\vec{D})\cdot\frac{\partial\vec{H}}{\partial t}-(V_{PP}\vec{D}+V_{PQ}\vec{H})\cdot\frac{\partial\vec{D}}{\partial t}$		(33)

and

	$\displaystyle\frac{\partial V_{Q}}{\partial t}$	$\displaystyle=V_{PQ}\frac{\partial P}{\partial t}+V_{QQ}\frac{\partial Q}{\partial t}$
		$\displaystyle=(V_{PQ}\vec{H}-V_{QQ}\vec{D})\cdot\frac{\partial\vec{H}}{\partial t}-(V_{PQ}\vec{D}+V_{QQ}\vec{H})\cdot\frac{\partial\vec{D}}{\partial t}\>.$		(34)

One can now work out the time derivatives on the left-hand side of Eq. (32), substitute the expressions (33) and (34). The resulting equation can be written as

$\displaystyle\frac{\partial\vec{H}}{\partial t}=\frac{1}{V_{P}}\Bigl{[}$	$\displaystyle V_{Q}\frac{\partial\vec{D}}{\partial t}-\vec{\nabla}\times(V_{P}\vec{D}+V_{Q}\vec{H})$
	$\displaystyle{}-\Bigl{(}(V_{PP}\vec{H}-V_{PQ}\vec{D})\cdot\frac{\partial\vec{H}}{\partial t}-(V_{PP}\vec{D}+V_{PQ}\vec{H})\cdot\frac{\partial\vec{D}}{\partial t}\Bigl{)}\,\vec{H}$
	$\displaystyle{}+\Bigl{(}(V_{PQ}\vec{H}-V_{QQ}\vec{D})\cdot\frac{\partial\vec{H}}{\partial t}-(V_{PQ}\vec{D}+V_{QQ}\vec{H})\cdot\frac{\partial\vec{D}}{\partial t}\Bigl{)}\,\vec{D}\Bigr{]}$	(35)

The scalar products $\vec{D}\cdot(\partial\vec{H}/\partial t)$ and $\vec{H}\cdot(\partial\vec{H}/\partial t)$ on the right-hand side of the equation can be obtained by taking the scalar product of Eq. (35) with $\vec{D}$ and with $\vec{H}$ , respectively. We then obtain the following expressions for $\vec{H}\cdot(\partial\vec{H}/\partial t)$ and $\vec{D}\cdot(\partial\vec{H}/\partial t)$ :

	$\displaystyle\vec{H}\cdot\frac{\partial\vec{H}}{\partial t}$	$\displaystyle=-S^{-1}\bigl{(}(V_{P}+QV_{PQ}+D^{2}V_{QQ})A_{1}+(H^{2}V_{PQ}+QV_{QQ})A_{2}\bigr{)}$		(36)
	$\displaystyle\vec{D}\cdot\frac{\partial\vec{H}}{\partial t}$	$\displaystyle=-S^{-1}\bigl{(}(QV_{PP}+D^{2}V_{PQ})A_{1}+(V_{P}+H^{2}V_{PP}+QV_{PQ})A_{2}\bigr{)}\>,$		(37)

with

$\displaystyle A_{1}$	$\displaystyle=\vec{H}\cdot\vec{\nabla}\times(V_{P}\vec{D}+V_{Q}\vec{H})$
	$\displaystyle\qquad{}-\left((V_{Q}+H^{2}V_{PQ}+QV_{QQ})\vec{H}+(H^{2}V_{PP}+QV_{PQ})\vec{D}\right)\cdot\frac{\partial\vec{D}}{\partial t}$	(38)
$\displaystyle A_{2}$	$\displaystyle=\vec{D}\cdot\vec{\nabla}\times(V_{P}\vec{D}+V_{Q}\vec{H})$
	$\displaystyle\qquad{}+\left((QV_{PQ}+D^{2}V_{QQ})\vec{H}+(-V_{Q}+QV_{PP}+D^{2}V_{PQ})\vec{D}\right)\cdot\frac{\partial\vec{D}}{\partial t}\>$	(39)

and

S=\bigl{(}V_{PP}V_{QQ}-V_{PQ}^{2}\bigr{)}\bigl{(}H^{2}D^{2}-Q^{2}\bigr{)}+V_{P}\bigl{(}H^{2}V_{PP}+2QV_{PQ}+D^{2}V_{QQ}\bigr{)}+V_{P}^{2}\>,

(40)

where we introduced the notation $D=|\vec{D}|$ and $H=|\vec{H}|$ .

It is evident from the (quite complicated) Eq. (35) that the time derivative of $\vec{H}$ is well defined and non-singular as long as $V_{P}\neq 0$ and $S\neq 0$ . However, when $V_{P}\to 0$ or $S\to 0$ , $\partial\vec{H}/\partial t$ can diverge.

III Global dynamics and degeneracy surfaces

We already saw in section II that degenerate behavior can occur if either of the conditions

	$\displaystyle S$	$\displaystyle=0$		(41)
	$\displaystyle V_{P}$	$\displaystyle=0$		(42)

(where $S$ is given by Eq. (40)) is satisfied. In particular, the time derivative of $\vec{H}$ given by Eq. (35) can diverge if either $V_{P}$ or $S$ tends to zero.

Let us first consider the degenerate dynamics of the $\vec{H}$ field close to the hypersurface $S=0$ , $V_{P}\neq 0$ . For simplicity, we will take the potential $V$ only to depend on $P$ , not on $Q$ . Note that in that case $S=V_{P}(V_{P}+H^{2}V_{PP})$ and thus the hypersurface $S=0$ then becomes equivalent to the condition $V_{P}+H^{2}V_{PP}=0$ .

Now as long as $V_{P}+H^{2}V_{PP}$ is small but nonzero, we can use expression (35) which simplifies substantially because we can put to zero all terms involving $V_{Q}$ , $V_{PQ}$ and $V_{QQ}$ . It then follows that

\dot{\vec{H}}=\dfrac{1}{V_{P}}\left(-\vec{\nabla}\times(V_{P}\vec{D})+\dfrac{NV_{PP}\vec{H}}{V_{P}+H^{2}V_{PP}}\right)\>,

(43)

where we defined

N\equiv\vec{H}\cdot(\vec{\nabla}\times V_{P}\vec{D})+V_{P}\vec{D}\cdot(\vec{\nabla}\times\vec{H}-\vec{J})\>.

(44)

Note that the right-hand-side of (43) is (potentially) singular in the limit $V_{P}+H^{2}V_{PP}\to 0$ .

However, if $V_{P}+H^{2}V_{PP}$ is exactly zero, Eqs. (32) and (21) imply that

\dot{\vec{H}}=-\dfrac{\vec{\nabla}\times(V_{P}\vec{D})}{V_{P}}+\beta\vec{H}

(45)

where the parameter $\beta$ is free, together with the condition

N=0\>.

(46)

Thus we see that on the $V_{P}+H^{2}V_{PP}=0$ surface the time derivative of the longitudinal component of $\vec{B}$ (that is, fluctuations of $\vec{B}$ that modify its modulus, not its direction) are undetermined and thus could be arbitrarily large. Condition (46) represents an extra constraint that is forced on the field configuration. The way this can be interpreted is that the freedom in the value of $\beta$ allows for the modulus of $\vec{H}$ to be adapted such that condition (46) is satisfied.

Now suppose that we are in a region of spacetime with non-constant values of $V_{P}+H^{2}V_{PP}$ , including the value zero. The points in which $V_{P}+H^{2}V_{PP}=0$ can then be expected to form a two-dimensional surface $\Sigma$ in space. We see then from Eq. (43) that, if on those points $N$ takes nonzero values, the time derivatives of $\vec{H}$ will then diverge as we approach $\Sigma$ . To understand better what happens on the points close to $\Sigma$ , it is instructive to evaluate the time derivative of the square of $V_{P}+H^{2}V_{PP}$ . One finds

	$\displaystyle\frac{1}{2}\frac{\partial}{\partial t}\left((V_{P}+H^{2}V_{PP})^{2}\right)$	$\displaystyle=-(3V_{PP}+H^{2}V_{PPP})N$
		$\displaystyle\qquad{}+2(V_{P}+H^{2}V_{PP})V_{PP}\vec{D}\cdot(\vec{\nabla}\times\vec{H}-\vec{J}\,)$		(47)

where we assumed that $V_{P}+H^{2}V_{PP}\neq 0$ . In the limit $V_{P}+H^{2}V_{PP}\to 0$ the second term tends to zero, leaving us with a finite, generally nonzero, value (given by the first term).

There are three possibilities for the values of $V_{P}+H^{2}V_{PP}$ on points close to $\Sigma$ .

•

If $(3V_{PP}+H^{2}V_{PPP})N<0$ , the absolute value of $V_{P}+H^{2}V_{PP}$ will grow in time. In other words $V_{P}+H^{2}V_{PP}$ will be driven away from zero, in opposite directions depending on the value (positive or negative).
•

If $(3V_{PP}+H^{2}V_{PPP})N>0$ , the absolute value of $V_{P}+H^{2}V_{PP}$ will decrease in time. In fact, as they approach zero, their time derivative will diverge, indicating a possible discontinuity in the field values. Thus $V_{P}+H^{2}V_{PP}$ will be driven to zero, whether its value is positive or negative. There even appears to be a paradox if the time derivative of the non-negative quantity $(V_{P}+H^{2}V_{PP})^{2}$ stays negative in the limit $V_{P}+H^{2}V_{PP}\to 0$ . Note, however, that as soon as $V_{P}+H^{2}V_{PP}$ becomes zero the time derivative of $\vec{H}$ is no longer given by Eq. (43), but by Eq. (45). It is then easy to check that $\frac{\partial}{\partial t}\bigl{(}(V_{P}+H^{2}V_{PP})^{2}\bigr{)}=0$ , eliminating the apparent inconsistency encountered above. Moreover, note that, by condition (46), the quantity $N$ is forced to become zero.
•

The case $(3V_{PP}+H^{2}V_{PPP})N=0$ interpolates between the two above situations.

Thus we find the following picture. Whenever there is a spatial surface $\Sigma$ on which $V_{P}+H^{2}V_{PP}$ vanishes, while the quantity $(3V_{PP}+H^{2}V_{PPP})N$ is positive, points on which $V_{P}+H^{2}V_{PP}$ is close to zero are driven to zero. Thus the surface $\Sigma$ can be expected to grow to a region of finite volume, and will continue to grow as long as the value of the quantity $(3V_{PP}+H^{2}V_{PPP})N$ on the points bordering (but outside) $\Sigma$ is positive. Inside the region $\Sigma$ there are two conditions that have to be satisfied. First of all, the quantity $N$ has to remain zero. Therefore, its time derivative has to vanish. This yields the condition

$\displaystyle\dot{N}$	$\displaystyle=2\vec{\nabla}\beta\cdot\bigl{(}\vec{E}\times\vec{H}\bigr{)}-\beta\vec{J}\cdot\vec{E}+V_{P}\bigl{\|}\vec{\nabla}\times\vec{H}-\vec{J}\bigr{\|}^{2}-\frac{1}{V_{P}}\bigl{\|}\vec{\nabla}\times\vec{E}\bigr{\|}^{2}$
	$\displaystyle\qquad{}-\vec{E}\cdot\left(\vec{\nabla}\times\bigl{(}\frac{1}{V_{P}}\vec{\nabla}\times\vec{E}\bigr{)}-\dot{\vec{J}}\right)+\vec{H}\cdot\vec{\nabla}\times\left(V_{P}\bigl{(}\vec{\nabla}\times\vec{H}-\vec{J}\bigr{)}\right)$
	$\displaystyle=0\>.$	(48)

Thus the parameter $\beta$ has to satisfy a first-order partial differential equation. The latter allows for boundary conditions on $\beta$ to be chosen on a two-dimensional surface, which can be taken to coincide with the boundary of $\Sigma$ . A second condition is that the value of $V_{P}+H^{2}V_{PP}$ has to vanish. For its time derivative we find

\frac{\partial}{\partial t}(V_{P}+H^{2}V_{PP})=2V_{PP}\vec{D}\cdot(\vec{\nabla}\times\vec{H}-\vec{J})+\beta H^{2}(3V_{PP}+H^{2}V_{PPP})\>.

(49)

As long as the system remains on $\Sigma$ , this has to vanish for consistency, fixing the value of $\beta$ . Conditions (49) and (48) can be satisfied jointly at least on the two-dimensional boundary of $\Sigma$ . For the points inside the volume of $\Sigma$ to remain on the degenerate surface it is necessary that conditions (49) and (48) remain both satisfied. If not, these points may be forced off, after which they will either fall back on the surface, or are driven away, depending on the sign of the expression on the right-hand side of Eq. (47).

The dynamical surface defined this way behaves very much like a shock wave separating two regions on which the expression $V_{P}+H^{2}V_{PP}$ is either zero on nonzero. The shock wave moves toward the region in which $V_{P}+H^{2}V_{PP}$ is nonzero whenever the quantity $(3V_{PP}+H^{2}V_{PPP})N$ is positive, thereby increasing the size of the region $\Sigma$ , while the opposite happens when $(3V_{PP}+H^{2}V_{PPP})N$ is negative.

That such shock waves turn up should perhaps not come as a surprise. It is well known that shock waves can be produced out of a continuous initial state in nonlinear electrodynamics LutzkyToll , except in certain “exceptional” models including Born-Infeld electrodynamics Boillat-1970 ; Gibbons . At a shock wave the characteristics associated with the partial differential equations describing the field dynamics intersect, at which point the field equations cease to determine uniquely the time development of the associated field components. A similar failure of the equations of motions to uniquely determine the time development of the fields is evident from Eq. (45) due to the presence of the free parameter $\beta$ . It occurs together with the appearance of the extra constraint (46), reducing the number of local phase space degrees of freedom from four to two.

As an aside, we note that the points on which $V_{P}+H^{2}V_{PP}=0$ have a particular significance. It follows from the constitutive relation $\vec{B}=-V_{P}\vec{H}$ and the fact that $V_{P}$ only depends on $P=(H^{2}-D^{2})/2$ that $B=|\vec{B}|$ is a function of $H$ and $D$ . One readily verifies that

\frac{\partial B}{\partial H}=V_{P}+H^{2}V_{PP}\>,

(50)

so that the condition $V_{P}+H^{2}V_{PP}=0$ corresponds to stationary points of the modulus of $\vec{B}$ as a function of $H$ , taking $D$ constant. In the cases of interest $B$ actually has a local minimum.

Next, we consider the degenerate dynamics of the $\vec{H}$ field close to the hypersurface $S\neq 0$ , $V_{P}=0$ . Here we will take the potential to be an arbitrary function of the quantities $P$ and $Q$ . For the time derivative of $V_{P}$ it follows that

\frac{\partial V_{P}}{\partial t}=V_{PP}\left(\vec{H}\cdot\frac{\partial\vec{H}}{\partial t}-\vec{D}\cdot\frac{\partial\vec{D}}{\partial t}\right)-V_{PQ}\left(\vec{D}\cdot\frac{\partial\vec{H}}{\partial t}+\vec{H}\cdot\frac{\partial\vec{D}}{\partial t}\right)\>.

(51)

Using Eqs. (36), (37), (II) and (II), we can evaluate the right-hand side of Eq. (52). Unlike the situation close to the hypersurface $S=0$ described above, it turns out that in the limit $V_{P}\to 0$ , the time derivative of $V_{P}$ is finite:

\lim_{V_{P}\to 0}\frac{\partial V_{P}}{\partial t}=\frac{-Q}{H^{2}D^{2}-Q^{2}}\vec{\nabla}V_{Q}\cdot(\vec{H}\times\vec{D})\>.

(52)

Therefore

\lim_{V_{P}\to 0}\frac{\partial\bigl{(}(V_{P})^{2}\bigr{)}}{\partial t}=\lim_{V_{P}\to 0}2V_{P}\frac{\partial V_{P}}{\partial t}=0

(53)

and thus the dynamics does not suffer the same kind of singular behaviour at the $V_{P}=0$ surface as we encountered for the case $S=0$ . The only possible exception can occur whenever the quantity $H^{2}D^{2}-Q^{2}$ turns equal to zero (which happens when $\vec{H}$ and $\vec{D}$ are parallel). However we will not investigate this possibility in detail in this work. In particular, $V_{P}$ just evolves regularly when passing through the surface, as described by Eq. (52).

IV Analysis of the linearized equations of motion away from the vacuum

In this section we will consider the case of a potential $V(P)$ which, for simplicity, is taken independent of $Q$ . The equations of motion then reduce to

	$\displaystyle\partial_{t}\vec{D}$	$\displaystyle=\vec{\nabla}\times\vec{H}$		(54)
	$\displaystyle\partial_{t}(V_{P}\vec{H})$	$\displaystyle=-\vec{\nabla}\times(V_{P}\vec{D})\>.$		(55)

We now write the fields $\vec{H}$ and $\vec{D}$ as fluctuations around a constant field configuration $P_{0}=\{\vec{H}_{0},\vec{D}_{0}\}$ , where we will choose $\vec{D}_{0}=0$ . Thus

	$\displaystyle\vec{H}$	$\displaystyle=\vec{H}_{0}+\vec{h}$		(56)
	$\displaystyle\vec{D}$	$\displaystyle=\vec{d}\>.$		(57)

The equations of motion (54) and (55) then become, up to linear order

$\displaystyle\partial_{t}\vec{d}$	$\displaystyle=\vec{\nabla}\times\vec{h}$	(58)
$\displaystyle(V_{P}+H_{0}^{2}V_{PP})\partial_{t}h_{\\|}$	$\displaystyle=-V_{P}(\vec{\nabla}\times\vec{d})_{\\|}$	(59)
$\displaystyle\partial_{t}\vec{h}_{\bot}$	$\displaystyle=-(\vec{\nabla}\times\vec{d})_{\bot}$	(60)

(here we have dropped the dependence of $V_{P}$ and $V_{PP}$ on $P_{0}$ ). The $\|$ and $\bot$ indices indicate the components parallel and perpendicular to $\vec{H}_{0}$ .

It is easy to check that there is a massless transverse mode with the polarization of the $\vec{h}_{\bot}$ vector perpendicular to $\vec{k}$ (as well as to $\vec{H}_{0}$ ) and $\vec{d}$ perpendicular to $\vec{h}_{\bot}$ and $\vec{k}$ that has the usual phase (or group) velocity 1.

To analyze the remaining mode, we choose Cartesian coordinates such that $\vec{H}_{0}=H_{0}\vec{e}_{x}$ . Eqs. (58) and (59) then yield

\partial_{t}^{2}h_{x}=\left(\partial_{x}^{2}+\frac{V_{P}}{V_{P}+|\vec{H}_{0}|^{2}V_{PP}}(\partial_{y}^{2}+\partial_{z}^{2})\right)h_{x}\>.

(61)

(where $h_{x}=h_{\|}$ ). Let us suppose first, for simplicity, that $h_{x}$ is independent of $x$ . It then follows that, if $1+|\vec{H}_{0}|^{2}\frac{V_{PP}}{V_{P}}>0$ , this represents a massless mode with phase (as well as group) velocity equal to $v_{ph}=\sqrt{\frac{V_{P}}{V_{P}+|\vec{H}_{0}|^{2}V_{PP}}}$ . However, if $1+|\vec{H}_{0}|^{2}\frac{V_{PP}}{V_{P}}<0$ , there is no plane-wave type propagation. In Fig. 1 we plot, as an example, the form of the energy density (27) for the potential

V(P)=-P+P^{3}-P^{5}

(62)

and, in Fig. 2, the corresponding dependence of the square of the propagation velocity.

Refer to caption — Figure 1: Form of the energy density as a function of $P=|\vec{H}_{0}|^{2}/2$ for the potential $V(P)=-P+P^{3}-P^{5}$ .

We see that latter is negative in the interval between the local maximum and minimum of the energy density. Moreover, when approaching the local minimum from the right, the phase/group velocity tends to infinity!

At the points where $V_{P}+|\vec{H}_{0}|^{2}V_{PP}=0$ , the equation of motion degenerates into the constraint equation $(\partial_{y}^{2}+\partial_{z}^{2})h_{x}=0$ . Thus, the fluctuations parallel to $\vec{H}_{0}$ stop being physical, together with the appearance of a constraint. This parallels the appearance of the free parameter $\beta$ in the equation of motion (45) for the $\vec{H}$ field on the degenerate surface together with the appearance of the condition (46).

V Discussion

In this work, we considered a large class of nonlinear vacuum electrodynamics models using a first-order approach introduced by Plebański Plebanski . We analyzed in detail the equations of motion, focusing on situations in which the equations of motion develop a singularity. In particular, this can happen for the equation of motion for the $\vec{H}$ field. At the space-time points close to a singularity, the time derivative of $\vec{H}$ can diverge, while on the space-time points with the singularity, the time derivative of $\vec{H}$ (more precisely, its modulus) acquires an indeterminacy. When this happens, the equations of motion imply also that an extra constraint is turned on, thus reducing the number of local degrees of freedom.

As we have shown, the dynamics of the region (surface) of degeneracy has the behavior of a shock wave. Shock waves have been shown to appear in general models of nonlinear electrodynamics in the context of the propagation of linear disturbances in an electromagnetic field background due to the possibility of the formation of caustics Boillat-1970 ; Gibbons . They have also been shown to arise in a study of the Euler-Heisenberg Lagrangian Zheleznyakov . Of course it is no surprise that we have encountered them as well in our rather different approach.

In section IV we analyzed, as an illustration, the linearized field fluctuations in a constant background magnetic field for a particular Plebański model. We showed that one of the propagation modes is superluminal for a range of background field strengths, while a singularity is encountered in the equation of motion as the field approaches its critical value. At that point, the equation of motion turns into a constraint equation, confirming the breakdown in the number of degrees of freedom. From this example it is reasonable to expect also for general Plebański models, that as the singularity is approached, there is a range of field values for which either the phase velocity of perturbations is larger than one (the speed of light), or the field equation does not permit wave-like solutions.

In the literature studies exist Shabad ; Lammerzahl ; causal1 that take as a starting point consistency conditions such as absence of superluminal propagation and/or causality, unitarity and stability. In particular, in an approach based on the analysis of the photon propagator in a constant electromagnetic field background Shabad , the requirements that the group velocity is less than one and that residue of the propagator be positive (a requirement for unitarity) can be shown to yield a number of conditions on the first and second partial derivatives of the Lagrangian density with respect to the Lorentz invariants $F$ and $G$ . In a completely different approach, already mentioned in the introduction Lammerzahl ; causal1 , the Fresnel equation for wave propagation in the presence of background fields was studied. It was shown that the absence of superluminal propagation implies conditions on the first and second partial derivatives of the Lagrangian density, or, equivalently, on the potential $V(P,Q)$ , partially overlapping with the results in Shabad . In this work we took a different philosophy, analyzing the NLED equations of motion directly for arbitrary potentials, in particular focusing on singular properties of the equations of motion. Therefore, it can be regarded as complementary to the earlier studies Shabad ; Lammerzahl ; causal1 .

We showed that a necessary condition for a singularity in the field equations to occur is that either of the values of $V_{P}$ , or $S$ as defined in Eq. (40) becomes zero. While this condition can hold in a large class of Plebański models, there are certainly models for which the condition can never be satisfied. A rather obvious example is Maxwell theory, for which $V_{P}=-1$ and $S=1$ have nonzero constant values. A less trivial example is Born-Infeld theory: we see from Eq. (29) that $V_{P}$ is negative definite, while the same can be shown to be true for the quantity $S$ . Therefore, for such models the degenerate behavior we studied in this paper can never occur.

For generic Plebański models, however, the quantities $V_{P}$ and/or $S$ could become zero for certain ranges of field values. For the Euler-Heisenberg Lagrangian the onset of degenerate behavior translates in a breakdown of the model itself, as it derives from a causal, unitary quantum field theory. For instance, electron-positron pair production will occur when the electric field is beyond the Schwinger limit of $1.3\times 10^{18}\,$ V/m. This is several orders of magnitude out of reach of even the strongest laser fields currently available ¹¹1However, experiments are underway to detect QED effects with colliding petawatt laser pulses (see Ref. lasers ).. However, the conditions necessary for reaching singular behavior could well be relevant in the context of some of the astrophysical or cosmological scenarios mentioned in section I, as well as others that have been considered in the literature, in which Plebański models have been proposed.

Acknowledgements.

R. P. thanks the kind hospitality of the Center for Applied Space Technology and Microgravity (ZARM) in Bremen, Germany, where part of this work was carried out. C. A. E. was supported by a UNAM-DGAPA postdoctoral fellowship and the project PAPIIT No. IN111518. R. P. acknowledges financial support by the Fundação para a Ciência e a Tecnologia of Portugal (FCT) through grant SFRH/BSAB/150324/2019.

References

(1) Born M and Infeld L 1934 Proc. Roy. Soc. Lond. A 144 425
(2) Fradkin E S and Tseytlin A A 1985 Phys. Lett. 158B 316
(3) Heisenberg W and Euler H 1936 Z. Phys. 98 714 (arXiv:physics/0605038)
(4) Berestetskii V B, Lifshitz E M and Pitaevskii E M 1982 Quantum Electrodynamics, 2nd Ed. (Butterworth-Heinemann)
(5) Plebański J 1970 Lectures on non-linear electrodynamics, (Copenhagen: Nordita)
(6) Arodz H, Slusarczyk M and Wereszczynski A 2001 Acta Phys. Polon. B 32 2155
(7) Lemos J P S and Kerner R 2000 Grav. Cosmol. 6 49
(8) Novello M, Perez Bergliaffa S E and Salim J 2004 Phys. Rev. D 69 127301
(9) Campanelli L, Cea P, Fogli G L and L. Tedesco L 2008 Phys. Rev. D 77 043001
(10) Ayon-Beato E and Garcia A 1998 Phys. Rev. Lett. 80 5056
(11) Bronnikov K A 2001 Phys. Rev. D 63 044005
(12) De Lorenci V A, Klippert R, Novello M and Salim J M 2000 Phys. Lett. B 482 134
(13) Novello M, De Lorenci V A, Salim J M and Klippert R 2000 Phys. Rev. D 61 045001
(14) Obukhov Y N and Rubilar G F 2002 Phys. Rev. D 66 024042
(15) Schellstede G O, Perlick V and Lämmerzahl C 2015 Phys. Rev. D 92 025039
(16) Schellstede G O, Perlick V and Lämmerzahl C 2016 Annalen Phys. 528 738
(17) Abalos F, Carrasco F, Goulart E and Reula O 2015 Phys. Rev. D 92 084024
(18) Perlick V, Lämmerzahl C and Macias A 2018 Phys. Rev. D 98 105014
(19) de Melo C A M, Medeiros L G and Pompeia P J 2015 Mod. Phys. Lett. A 30 1550025
(20) Escobar C A and Urrutia L F 2014 EPL 106 31002
(21) Escobar C A and Urrutia L F 2014 J. Math. Phys. 55 032902
(22) Lutzky M and Toll J S 1959 Phys. Rev. 113 1649
(23) Boillat G 1970 J. Math. Phys. 11 941
(24) Gibbons G W and Herdeiro C A R 2001 Phys. Rev. D 63 064006 (see, especially, section 2.4)
(25) Zheleznyakov V V and Fabrikant A L 1982 JETP 55 794
(26) Shabad A E and Usov V V 2011 Phys. Rev. D 83 105006
(27) Turcu I C E et al 2019 High Power Laser Sci. Engin. 7 e10