This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Particle on the sphere:
group-theoretic quantization
in the presence of a magnetic monopole

Rodrigo Andrade e Silva   and   Ted Jacobson
[email protected]@umd.edu
   Center for Fundamental Physics, University of Maryland
College Park, MD 20742, USA

The problem of quantizing a particle on a 2-sphere has been treated by numerous approaches, including Isham’s global method based on unitary representations of a symplectic symmetry group that acts transitively on the phase space. Here we reconsider this simple model using Isham’s scheme, enriched by a magnetic flux through the sphere via a modification of the symplectic form. To maintain complete generality we construct the Hilbert space directly from the symmetry algebra, which is manifestly gauge-invariant, using ladder operators. In this way, we recover algebraically the complete classification of quantizations, and the corresponding energy spectra for the particle. The famous Dirac quantization condition for the monopole charge follows from the requirement that the classical and quantum Casimir invariants match. In an appendix we explain the relation between this approach and the more common one that assumes from the outset a Hilbert space of wave functions that are sections of a nontrivial line bundle over the sphere, and show how the Casimir invariants of the algebra determine the bundle topology.

1 Introduction

Canonical quantization is a magic wand, discovered by Dirac, that transmogrifies a classical dynamical theory into a corresponding quantum theory, often in perfect agreement with observations. However, for most classical theories Dirac’s procedure depends on the choice of phase space coordinates over which to wave the wand, so the resulting quantum theory is ambiguous. Moreover, a generic phase space has nontrivial topology, and does not even admit a global coordinate chart. In complete generality, the only recourse is to accept the ambiguity, and to explore all quantizations. But some classical dynamical systems possess symmetries that can be used to identify a restricted class of quantizations which preserve these symmetries in the quantum theory. Such quantizations would obviously yield the best guess, if indeed the original classical theory is the classical limit of some quantum theory.

Isham provided a generalization of Dirac’s canonical quantization that is designed to preserve a chosen transitive group of phase space symmetries, and can be applied to topologically nontrivial phase spaces [1, 2]. Our primary interest in this, as was Isham’s, is ultimately to restrict the possibilities for nonperturbative quantization of general relativity. But to develop understanding of how the scheme works, the ambiguities that remain, and the relation to other quantization schemes, it is useful to consider simpler systems. A particle on a 2-sphere is one of the simplest such systems, and has already been treated by many different approaches, including Isham’s [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20]. Here we consider this simple model, enriched by the inclusion of a magnetic flux through the sphere, with the aim of implementing Isham’s quantization scheme without making any choices other than that of the group of canonical symmetries.111A similar approach appears to have been considered in [11], however we could not obtain access to the relevant part of the document. Related work is also mentioned in [21] by the same author, but it refers to a preprint that we could not find. In order to maintain complete generality for the unitary representations of the quantized algebra, we construct the Hilbert space directly from the algebra, rather than adopting the framework of wave functions. In this way, we recover algebraically the complete classification of quantizations, as well as the famous Dirac quantization condition for the monopole charge and the corresponding energy spectra. We also explain the relation between this approach and the more common one that assumes from the outset a Hilbert space of generalized wave functions that are sections of a nontrivial line bundle over the sphere.

Isham applied his quantization scheme to the phase space 𝒫=TS2\mathcal{P}=T^{*}S^{2}, the cotangent bundle of the 2-sphere, with the canonical symplectic form ω=dpidqi\omega=dp_{i}\wedge dq^{i}. He found that the Hilbert space must carry some unitary irreducible (projective) representation of the 3-dimensional Euclidean group, E3=3SO(3)E_{3}=\mathbb{R}^{3}\rtimes SO(3). From Mackey’s theory of induced representations, one obtains that each irreducible unitary representation 𝒰(n)\mathscr{U}^{\!(n)} of U(1)U(1), labeled by an integer nn, yields a Hilbert space represented by sections of a certain bundle, 𝒰(n)\mathscr{U}^{\!(n)}-associated to the Hopf bundle SU(2)SU(2)/U(1)S2SU(2)\rightarrow SU(2)/U(1)\sim S^{2}. (See Appendix C for details.) These sections can be seen as “twisted” wavefunctions over the sphere, with the “twisting” described by the first Chern number of the bundle (which is nn for the 𝒰(n)\mathscr{U}^{\!(n)}-associated bundle). The choice of this integer nn is equivalent to the assignment of an intrinsic spin to the particle, and its value is fixed to be zero if one imposes, as a correspondence principle, that classical Casimir invariants of the Poisson algebra are preserved upon quantization. When the particle is electrically charged, and a magnetic monopole field is included, we find again the same classification of Hilbert spaces, but the correspondence principle for Casimir invariants determines nn in terms of the product of the monopole charge with the electric charge. The inclusion of the magnetic monopole field is thus equivalent to the assignment of an intrinsic spin to to the particle.

The usual prescription to include coupling to a magnetic field is to make the replacement ppeAp\rightarrow p-eA in the Hamiltonian, where pp is the momentum, ee is the electric charge of the particle and AA is the magnetic potential 1-form in some local gauge. If AA is defined globally on the sphere then the magnetic field B=dAB=dA is an exact 2-form, so the net magnetic flux through the sphere must vanish. In the presence of a nonzero net magnetic flux, therefore, AA cannot be defined globally on the sphere, and hence the Hamiltonian is not globally defined. This can be accommodated by defining the Hamiltonian in local gauge patches, and accompanying a gauge transformation AA+dλA\rightarrow A+d\lambda with a corresponding symplectic (canonical) transformation pp+edλp\rightarrow p+ed\lambda. In this way, however, the canonical momentum ceases to be an observable, and a global description is lacking.

Instead, we shall maintain manifest gauge invariance and a globally defined Hamiltonian by incorporating the magnetic field into the symplectic structure [22, 23]. That is, we replace pp by p+eAp+eA in the symplectic form dpidqidp_{i}\wedge dq^{i}. This results in d(pi+eAi)dqi=d(pidqi)+ed(Aidqi)d(p_{i}+eA_{i})\wedge dq^{i}=d(p_{i}dq^{i})+ed(A_{i}dq^{i}), which can be written covariantly as

ω=dθ+eπB.\omega=d\theta+e\pi^{*}B\,. (1.1)

Here θ\theta is the canonical symplectic potential, defined by θ(X)=p(πX)\theta(X)=p(\pi_{*}X), where XX is a tangent vector on the cotangent bundle TS2T^{*}S^{2}, and π:TS2S2\pi:T^{*}S^{2}\rightarrow S^{2} is the bundle projection map (with π\pi^{*} and π\pi_{*} the pull-back and push-forward of π\pi), and BB is the magnetic 2-form on the sphere (whose integral over any region gives the magnetic flux through it). Our problem is thus to quantize a phase space with topology TS2T^{*}S^{2} and symplectic form given by (1.1).

This paper is organized as follows. In section 2, we begin with a brief review of Isham’s quantization scheme. In section 3, we study the phase space TS2T^{*}S^{2} with the charged symplectic form (1.1), identifying the appropriate quantizing group. In section 4.1, we proceed to the quantization, establishing the correspondence between classical observables and quantum self-adjoint operators. In section 4.2, we show that the Casimir invariants of the algebra play an important role in linking the classical and quantum worlds. In particular, this is how the magnetic monopole makes its way into the quantum theory. Next, in section 4.3, we study the representations of the group by constructing ladder operators for J2J^{2}, the angular momentum squared. From the assumption that the theory is free of negative-norm states, in section 4.4 we recover Dirac’s charge quantization condition. Finally, in section 4.5, we compute the energy spectrum for a (non-relativistic) particle on a geometric sphere. In Appendix A we establish the absence of nontrivial central extensions of the Euclidean algebra 3{\cal E}_{3}, and in Appendix B we give some details omitted in the derivation of section 4.3. In Appendix C we show how the representation in terms of “twisted” wavefunctions can be recovered and, in particular, how the magnetic monopole is related to their twisting, and in Appendix D we consider non-uniform magnetic fields.

2 Isham’s quantization scheme

In the founding days of quantum mechanics, Dirac remarked that “The correspondence between the quantum and classical theories lies not so much the limiting agreement when 0\hbar\rightarrow 0 as in the fact that the mathematical operations on the two theories obey in many cases the same laws.” [24]. This observation led him to postulate the general canonical quantization scheme which replaces Poisson brackets of classical functions on phase space by quantum commutators of quantum operators, i.e., [f^,g^]=i{f,g}^[\hat{f},\hat{g}]=i\hbar\widehat{\{f,g\}}. More precisely, one seeks a linear homomorphism from the algebra of real functions on the phase space, with product defined by the Poisson bracket {f,g}\{f,g\}, to an algebra of self-adjoint operators on some Hilbert space, with product defined by the commutator 1i[f^,g^]\frac{1}{i\hbar}[\hat{f},\hat{g}], satisfying certain conditions, such as mapping the constant function f=1f=1 to the identity 1^\hat{1} and, for all functions ϕ\phi, mapping ϕ(f)\phi(f) to ϕ(f^)\phi(\hat{f}).

It turns out that no such map exists in general, as a consequence of Groenewold–Van Hove obstructions.222Strictly speaking, the Van Hove no-go theorem applies only for trivial phase spaces, 𝒫=2n\mathcal{P}=\mathbb{R}^{2n}. The result has been extended to other cases, and it is expected that this kind of obstruction is generic [25]. However, some examples have been found where a full, unobstructed quantization is possible [26, 27]. Hence one must be careful to select a relatively small set of observables that can be consistently quantized, but which is still large enough to allow for the construction of quantized versions of all other classical observables. The trivial example is that of 𝒫=2n\mathcal{P}=\mathbb{R}^{2n}, where one can canonically quantize the global coordinates qiq^{i} and pip_{i}, and then carry along all other observables f(q,p)f(q^,p^)f(q,p)\rightarrow f(\hat{q},\hat{p}) to the quantum theory (modulo operator-ordering issues). Isham’s proposal is to generalize this by identifying a transitive group of symplectic symmetries of the phase space and using it to generate both a special set of classical observables and their associated quantum self-adjoint operators. We call this group the quantizing group. (It is sometimes referred to as the “canonical group”.) In the trivial case just mentioned, for example, it could be the group of coordinate translations, (q,p)(q+a,p+b)(q,p)\rightarrow(q+a,p+b), where qq and pp are any global canonical coordinates. Typically, the dynamical system possesses other structures that would select a preferred canonical group, such as a metric on the configuration space that appears in the Hamiltonian. We shall briefly review Isham’s scheme in this section. For more details, see [1, 2].

Consider a phase space 𝒫\mathcal{P} with symplectic 2-form ω\omega. Assume that the phase space is a homogeneous space for some Lie group GG of symplectic symmetries. That is, there is a transitive left action δg:𝒫𝒫\delta_{g}:\mathcal{P}\rightarrow\mathcal{P} of GG on 𝒫\mathcal{P} such that δgω=ω\delta_{g}^{*}\omega=\omega for all gGg\in G. Each element ξ\xi in the Lie algebra 𝔤T1G\mathfrak{g}\sim T_{1}G of GG induces a vector field XξX_{\xi} on 𝒫\mathcal{P} defined by Xξ|p=ϕp(ξ)X_{\xi}|_{p}=\phi_{p*}(\xi), where ϕp:G𝒫\phi_{p}:G\rightarrow\mathcal{P} is defined by ϕp(g)=δg(p)\phi_{p}(g)=\delta_{g}(p). This map is an antihomomorphism from 𝔤\mathfrak{g} into the algebra of vector fields on 𝒫\mathcal{P}, i.e., [Xξ,Xη]=X[η,ξ][X_{\xi},X_{\eta}]=X_{[\eta,\xi]}. Because δg\delta_{g} preserves ω\omega, XξX_{\xi} is a (locally) Hamiltonian field, i.e., £Xξω=0\pounds_{X_{\xi}}\omega=0. We therefore have d(ıXξω)=£XξωıXξdω=0d(\imath_{X_{\xi}}\omega)=\pounds_{X_{\xi}}\omega-\imath_{X_{\xi}}d\omega=0, where ı\imath denotes the interior product. That is, ıXξω\imath_{X_{\xi}}\omega is closed and thus locally exact, so that dQξ=ıXξωdQ_{\xi}=-\imath_{X_{\xi}}\omega admits local solutions QξQ_{\xi}, defined up to addition of a constant function on 𝒫\mathcal{P}. Since we want these charges333We call charge any function generated in this way by the group of symplectic symmetries, regardless if they are (also) dynamical symmetries in the sense of Noether’s theorem. QξQ_{\xi} to play the role of the canonical observables, we require that GG generates only globally Hamiltonian fields on 𝒫\mathcal{P}, meaning that the associated charges are all defined globally on 𝒫\mathcal{P}.

The symplectic form endows the space of functions on the phase space with an algebraic structure, 𝒜C\mathcal{A}_{C}, where the product is given by the Poisson bracket444Because ω\omega is non-degenerate, any function ff on 𝒫\mathcal{P} can be associated with a unique vector field XfX_{f} on 𝒫\mathcal{P} via the relation df=ıXfωdf=-\imath_{X_{f}}\omega. The Poisson bracket between two functions, ff and ff^{\prime}, is defined by {f,f}:=ω(Xf,Xf)\{f,f^{\prime}\}:=-\omega(X_{f},X_{f^{\prime}}).. In particular, when the functions are taken to be the charges, we have {Qξ,Qη}=ω(Xξ,Xη)\{Q_{\xi},Q_{\eta}\}=-\omega(X_{\xi},X_{\eta}), and it happens that the map ξQξ\xi\mapsto Q_{\xi} is a homomorphism from 𝔤\mathfrak{g} into 𝒜C\mathcal{A}_{C} up to central charges, that is, {Qξ,Qη}=Q[ξ,η]+z(ξ,η)\{Q_{\xi},Q_{\eta}\}=Q_{[\xi,\eta]}+z(\xi,\eta), where z(ξ,η)z(\xi,\eta) is constant on 𝒫\mathcal{P}. In practice, we can assume that this is a true homomorphism, i.e., z=0z=0, since the group can always be extended by a central element to make that so.555If the central charge z(ξ,η)z(\xi,\eta) is not trivial (i.e., it cannot be removed by a redefinition of the charges QξQξ+f(ξ)Q_{\xi}\rightarrow Q_{\xi}+f(\xi), for some f:𝔤f:\mathfrak{g}\rightarrow\mathbb{R}), one can always extend the group by a central element so that the extended algebra, 𝔤S\mathfrak{g}\oplus_{S}\mathbb{R}, has product law [(ξ,a),(η,b)]=([ξ,η],z(ξ,η))[(\xi,a),(\eta,b)]=([\xi,\eta],z(\xi,\eta)). The new group has a natural action on 𝒫\mathcal{P} (where the central element acts trivially), and the new charges are related to the old ones simply by Q(ξ,a)=Qξ+aQ_{(\xi,a)}=Q_{\xi}+a. Consequently, the map (ξ,a)Q(ξ,a)(\xi,a)\mapsto Q_{(\xi,a)} is a true homomorphism. Hence, we can always assume that GG is already the extension of whatever group we started with.

Next we use GG to construct the quantum theory. Let U:GAut()U:G\rightarrow\text{Aut}(\mathcal{H}) be an irreducible unitary representation of GG on a Hilbert space \mathcal{H}. Each element of the algebra ξ𝔤\xi\in\mathfrak{g} can be exponentiated to a one-parameter subgroup of GG, exp(tξ)\exp(t\xi), and the corresponding one-parameter unitary group is generated by a self-adjoint operator Q^ξ\widehat{Q}_{\xi} on \mathcal{H}, as U(exptξ)=etQ^ξ/iU(\exp t\xi)=e^{t\widehat{Q}_{\xi}/i\hbar}. From the definition of a representation, the map ξQ^ξ\xi\mapsto\widehat{Q}_{\xi} is a homomorphism from 𝔤\mathfrak{g} into 𝒜Q\mathcal{A}_{Q}, where 𝒜Q\mathcal{A}_{Q} is the algebra of self-adjoint operators on \mathcal{H} with product given by 1i[,]\frac{1}{i\hbar}[\cdot,\cdot]. It follows that the quantization map, which associates to each classical charge QξQ_{\xi} the corresponding generator Q^ξ\widehat{Q}_{\xi} of the unitary representation,

QξQ^ξ,Q_{\xi}\mapsto\widehat{Q}_{\xi}\,, (2.1)

is a homomorphism from 𝒜C\mathcal{A}_{C} into 𝒜Q\mathcal{A}_{Q}. The logic of this quantization scheme is summarized in Fig. 1.

ξ𝔤{\xi\in\mathfrak{g}}Xξ{X_{\xi}}U(expξ){U(\exp\xi)}Qξ{Q_{\xi}}Q^ξ{\widehat{Q}_{\xi}}
Figure 1: On the classical side, each element ξ\xi of the Lie algebra of the quantizing group induces a Hamiltonian vector field XξX_{\xi} (on the phase space), which in turn defines a Hamiltonian charge QξQ_{\xi}. On the quantum side, the group element expξ\exp\xi is represented by a unitary transformation (on a Hilbert space), whose self-adjoint generator is Q^ξ\widehat{Q}_{\xi}.

Since the space of physical states is actually the ray space, :=/U(1)\mathcal{R}:=\mathcal{H}/U(1), corresponding to the quotient of the Hilbert space \mathcal{H} by phases eiθU(1)e^{i\theta}\in U(1), it is natural to consider also projective representations of the group GG. A projective representation is a homomorphism from GG into the group of projective unitary operators on \mathcal{R}, P𝒰()P\mathscr{U}(\mathcal{H}), consisting of equivalence classes UeiθUU\sim e^{i\theta}U of unitary operators on \mathcal{H}. In essence, including projective irreducible unitary representations of GG amounts to constructing the quantum theory based on irreducible unitary representations666Here “unitary” means that the observables QξQ_{\xi} are represented by self-adjoint operators Q^ξ\widehat{Q}_{\xi} on the Hilbert space. of the algebra of observables QξQ_{\xi}.777Technically, there would also be projective representations of GG associated with non-trivial central extensions (by 2-cocycles) of its algebra, if those exist. However, unless such a central extension appears already in the classical Poisson algebra, it will not be of interest to us here, because of the Casimir correspondence principle introduced below.

The condition that the group acts transitively on the phase space ensures that any function on 𝒫\mathcal{P} can (locally) be expressed in terms of the canonical observables QξQ_{\xi}’s. To see this, consider the momentum map, JJ, which is a function from 𝒫\mathcal{P} to 𝔤\mathfrak{g}^{*} (the dual algebra of GG) defined by J(p)(ξ):=Qξ(p)J(p)(\xi):=Q_{\xi}(p), where p𝒫p\in\mathcal{P} and ξ𝔤\xi\in\mathfrak{g}. If this map were an embedding, then all real functions on 𝒫\mathcal{P} could be written as functions of QξQ_{\xi}’s. More concretely, note that a basis {ξi}\{\xi_{i}\} of 𝔤\mathfrak{g} induces a coordinate system in 𝔤\mathfrak{g}^{*} defined by coordinate functions wi(σ):=σ(ξi)w_{i}(\sigma):=\sigma(\xi_{i}), where σ𝔤\sigma\in\mathfrak{g}^{*}, and these have the property that Qξi=JwiQ_{\xi_{i}}=J^{*}w_{i}. Since any smooth function f:𝒫f:\mathcal{P}\rightarrow\mathbb{R} could, in this case, be seen as the pull-back under JJ of some function F:𝔤F:\mathfrak{g}^{*}\rightarrow\mathbb{R}, then ff could be written as function of the QξiQ_{\xi_{i}}’s. In general, although JJ may not be an embedding, transitivity of the group action guarantees that it is an immersion of 𝒫\mathcal{P} into 𝔤\mathfrak{g}^{*}. Transitivity implies that, at any p𝒫p\in\mathcal{P}, any tangent vector VTp𝒫V\in T_{p}\mathcal{P} is equal to XηX_{\eta} for some η𝔤\eta\in\mathfrak{g}. The non-degeneracy of ω\omega then implies that dQξ(V)=ω(Xξ,Xη)dQ_{\xi}(V)=-\omega(X_{\xi},X_{\eta}) is nonvanishing for at least one ξ\xi. In other words, there is no direction VV along which all charges are (locally) constant, implying that any function on 𝒫\mathcal{P} can be locally written in terms of the charges888To say there is no direction VV along which all charges are (locally) constant is equivalent to saying that the derivative JJ_{*} is injective, so that JJ is an immersion. A further analysis [1] reveals that if the immersion fails to be an embedding, it is at worst a covering map, i.e., 𝒫\mathcal{P} is a covering space for its image under JJ. This limits the extent to which functions on 𝒫\mathcal{P} can fail to be globally expressible in terms of the charges.. Therefore, the special set of observables generated in this way is indeed not too “small”.

This completes our review of Isham’s quantization scheme. In pursuit of generality, the scheme refers only to minimal structure required to define a “canonical quantization”, which associates to a certain chosen classical Poisson algebra of observables a corresponding quantum algebra of observables. But in order to fully define a physical quantum theory, a particular representation of the algebra must be chosen, and the dynamics must be implemented via a quantization of the Hamiltonian. This may require additional physical ingredients to be introduced in the quantization. In many cases the choice of a representation is restricted by what we shall call a Casimir correspondence principle. A classical Casimir invariant is an observable that Poisson commutes with the entire Poisson algebra. If that observable admits a quantization (i.e., a choice of operator ordering) that commutes with the entire quantum algebra, then it is a quantum Casimir invariant. Such a quantum Casimir is a multiple of the identity in each irreducible representation of the quantum algebra, so it takes the same value in all states of such a representation. For the classical system to arise from a classical limit of the quantum system, the eigenvalue of the quantum Casimir observable should match the corresponding classical value. This Casimir correspondence principle plays an important role in selecting which irreducible representation corresponds to the quantization of a given classical system, and it sometimes restricts which classical systems can arise from the classical limit of a quantum system. In the present paper, for instance, the possible values of the magnetic monopole charge will be restricted by this principle to those allowed by the famous Dirac condition.

3 The phase space for a particle on the sphere

In this section we address the classical part of Isham’s quantization scheme for the case of a particle on a 2-sphere in the presence of a magnetic monopole. The phase space is 𝒫=TS2\mathcal{P}=T^{*}S^{2} but, as explained in the introduction, the symplectic form must be given by (1.1) if the Hamiltonian is to be a globally well-defined function. Our goal is to identify a suitable group of symplectic symmetries of this phase space and then compute the associated Poisson charges.

3.1 A transitive group of symplectic symmetries

A magnetic field 2-form BB on S2S^{2} admits an infinite dimensional symmetry group that acts transitively on S2S^{2}, provided that BB is nowhere vanishing. This is the group of “area” preserving diffeomorphisms, where BB defines the area element. We are interested in quantizing an SO(3)SO(3) subgroup of this group, which not only is the smallest group that can act transitively on S2S^{2} but is also the group of isometries of a round metric. There are infinitely many such subgroups, which all lead to equivalent phase space quantizations. We write the magnetic field as

B=gϵ,B=g\epsilon\,, (3.1)

where the 2-form ϵ\epsilon is scaled so that S2ϵ=4π\int_{S^{2}}\epsilon=4\pi, and gg is a dimensionful coefficient. The total flux through the sphere is simply 4πg4\pi g, so gg can be interpreted as the magnetic charge of a monopole “inside”999If the magnetic field were not nowhere vanishing, we could still separate it as B=gϵ+dAB=g\epsilon+dA, where AA is a globally defined potential 1-form that could be included in the Hamiltonian, while the gϵg\epsilon term could be included in the symplectic form.. The kinetic energy term in the Hamiltonian for a charged particle on S2S^{2} involves a particular metric on the S2S^{2}. In order to naturally quantize this Hamiltonian, we will ultimately base the quantization on the SO(3)SO(3) group of isometries of this metric, however that choice plays no role until we come to quantizing the Hamiltonian.

We denote by lR(x)l_{R}(x) the action of a rotation RSO(3)R\in SO(3) on a point xx on the sphere. As with any action on the configuration space, there is a natural lifted action to the cotangent bundle, defined by

LR(p)=lR1p,L_{R}(p)=l_{R}^{-1*}p\,, (3.2)

which maps the fiber over xx to that over lR(x)l_{R}(x), i.e. it satisfies πLR=lRπ\pi\circ L_{R}=l_{R}\circ\pi. The canonical potential 1-form θ\theta is invariant under the lift of any point transformation (diffeomorphism of the configuration space).101010Let ϕ\phi be a diffeomorphism of the configuration space, and let Φ\Phi be its lift to the phase space, i.e., Φ(p):=ϕ1p\Phi(p):=\phi^{-1*}p. If VTp𝒫V\in T_{p}\mathcal{P}, then Φθ(V)=θ(ΦV)=(Φ(p))(πΦV)=(ϕ1p)(πΦV)=p(ϕ1πΦV)=p(πV)=θ(V)\Phi^{*}\theta(V)=\theta(\Phi_{*}V)=(\Phi(p))(\pi_{*}\Phi_{*}V)=(\phi^{-1*}p)(\pi_{*}\Phi_{*}V)=p(\phi_{*}^{-1}\pi_{*}\Phi_{*}V)=p(\pi_{*}V)=\theta(V). Alternatively, using coordinates, pidxi=pjxjxixixkdxk=pkdxkp^{\prime}_{i}dx^{\prime i}=p_{j}\frac{\partial x^{j}}{\partial x^{\prime i}}\frac{\partial x^{\prime i}}{\partial x^{k}}dx^{k}=p_{k}dx^{k}. In particular, we have LRθ=θL_{R}^{*}\theta=\theta, and therefore LRdθ=dLRθ=dθL_{R}^{*}d\theta=dL_{R}^{*}\theta=d\theta. Moreover πB\pi^{*}B is invariant under rotations: LR(πB)=(πLR)B=(lRπ)B=πBL_{R}^{*}(\pi^{*}B)=(\pi\circ L_{R})^{*}B=(l_{R}\circ\pi)^{*}B=\pi^{*}B. Therefore the symplectic form (1.1) is invariant,

LRω=ω.L_{R}^{*}\omega=\omega\,. (3.3)

That is, for all RR, LRL_{R} is a symmetry of the symplectic form.

The quantizing group should be larger than just SO(3)SO(3), since the rotations act only “horizontally” on the phase space. For the quantizing group to act transitively, it should include elements that move points along the fibers of the cotangent bundle. The simplest “vertical” action is a translation of momentum,

Fα(p)=pα,F_{\alpha}(p)=p-\alpha\,, (3.4)

where α\alpha is a 1-form field on S2S^{2}. (For notational simplicity we leave implicit the point π(p)\pi(p) at which α\alpha is evaluated.) This acts on the symplectic form (1.1) as

Fαω=d(Fαθ)+e(πFα)B.F^{*}_{\alpha}\omega=d(F^{*}_{\alpha}\theta)+e(\pi\circ F_{\alpha})^{*}B\,. (3.5)

The term πB\pi^{*}B is invariant since πFα=π\pi\circ F_{\alpha}=\pi. The symplectic potential, however, transforms non-trivially. For any VTp𝒫V\in T_{p}\mathcal{P}, we have

Fαθ(V)=θ(FαV)=(Fαp)(πFαV)\displaystyle F^{*}_{\alpha}\theta(V)=\theta(F_{\alpha*}V)=(F_{\alpha}p)(\pi_{*}F_{\alpha*}V)
=(pα)(πV)=(θπα)(V),\displaystyle=(p-\alpha)(\pi_{*}V)=(\theta-\pi^{*}\alpha)(V)\,, (3.6)

so that

Fαω=ωπdα.F^{*}_{\alpha}\omega=\omega-\pi^{*}d\alpha\,. (3.7)

In order for FαF_{\alpha} to be a symplectic symmetry, we must require α\alpha to be closed. We will restrict further to exact 1-forms, α=df\alpha=df, with ff globally defined on S2S^{2} to ensure that the associated charges will be globally defined.

The (infinite-dimensional) space of all exact 1-form fields is unnecessarily large, so we look for a “minimal” set of momentum translations that act transitively along the fibers and are consistent with the spherical symmetry, in the sense that they extend the chosen SO(3)SO(3) into a larger group. As observed by Isham (in a more general setting), a suitable set can be generated by realizing the configuration space S2S^{2} as an orbit of a representation of SO(3)SO(3) in a vector space, and defining the momentum translations as the pullback to the orbit of the “constant” 1-forms on that vector space. In particular, we can choose the fundamental representation on 3\mathbb{R}^{3}, and identify the S2S^{2} with the orbit passing through u=(0,0,1)u=(0,0,1) in 3\mathbb{R}^{3}, that is, with the set of unit vectors x3x\in\mathbb{R}^{3} such that x=Rux=Ru for some RSO(3)R\in SO(3).111111This identification of the configuration space with the unit sphere in the abstract 3\mathbb{R}^{3} should not be confused with the physical sphere, which may have its own geometry. The orbit is a “unit sphere” with respect to the inner product v,u=i=13viui\langle v,u\rangle=\sum_{i=1}^{3}v_{i}u_{i} on the abstract 3\mathbb{R}^{3}. (The notation “xx” for these vectors coincides with that which we used already to label the points in S2S^{2}.) Any dual vector α3\alpha\in\mathbb{R}^{3*}, can naturally be seen as a 1-form field on 3\mathbb{R}^{3}. Moreover, this 1-form field is exact, for it can be written as dfαdf_{\alpha}, where the function fα:3f_{\alpha}:\mathbb{R}^{3}\rightarrow\mathbb{R} is defined by

fα(x):=α(x).f_{\alpha}(x):=\alpha(x)\,. (3.8)

These 1-form fields can be pulled-back to S2S^{2} to define the corresponding action along the fibers of 𝒫\mathcal{P}.

Combining the LRL_{R} and FαF_{\alpha} transformations, we get a transitive group of symplectic symmetries of the phase space: the semidirect product G=3SO(3)G=\mathbb{R}^{3*}\rtimes SO(3), acting on 𝒫\mathcal{P} as

Λ(α,R)(p)=lR1pα,\Lambda_{(\alpha,R)}(p)=l_{R}^{-1*}p-\alpha\,, (3.9)

which satisfies the product rule

(α,R)(α,R)=(α+lR1α,RR).(\alpha,R)(\alpha^{\prime},R^{\prime})=(\alpha+l_{R}^{-1*}\alpha^{\prime},RR^{\prime})\,. (3.10)

Since 33\mathbb{R}^{3*}\sim\mathbb{R}^{3} and the co-representation of SO(3)SO(3) in 3\mathbb{R}^{3*} is equivalent to its representation in 3\mathbb{R}^{3},121212In a matrix realization, R1α=(R1)Tα=RαR^{-1*}\alpha=(R^{-1})^{T}\alpha=R\alpha. this group is isomorphic to the Euclidean group, E3=3SO(3)E_{3}=\mathbb{R}^{3}\rtimes SO(3).

We take this group, GG, to be the quantizing group. Note that it is independent of the magnetic term in the symplectic form. Since its algebra does not admit any non-trivial central extension by 2-cocycles z(ξ,η)z(\xi,\eta),131313Since we found no explicit demonstration of this statement in the literature, we include one in Appendix A for completeness. This formal proof is not really necessary for our purposes, however, as in the next section we explicitly compute the Poisson algebra and show that no central charges arise. no (non-trivial) central charge can appear in the associated Poisson algebra. The quantizing algebra is thus the same as in the uncharged case. However, as we shall see in Sec. 4.2, the magnetic term makes itself felt through the value of a Casimir invariant of the corresponding Poisson bracket algebra, which carries over to the quantum theory.

3.2 Classical canonical observables

We next compute the classical charges associated with the quantizing group GG, beginning with the SO(3)SO(3) generators. Let nn be an element of the algebra 𝔰𝔬(3)\mathfrak{so}(3), and denote its exponential by Rn=exp(n)R_{n}=\exp(n). (Here exp:𝔤G\exp:\mathfrak{g}\rightarrow G is the usual Lie group exponential map.) Let X¯n\overline{X}_{n} be the vector field (on S2S^{2}) induced by nn through the action of SO(3)SO(3) on S2S^{2}, and let XnX_{n} be the vector field (on 𝒫\mathcal{P}) induced by nn through the lifted action of SO(3)SO(3) on 𝒫\mathcal{P} defined in (3.2). Since the group action on 𝒫=TS2\mathcal{P}=T^{*}S^{2} maps fibers into fibers, we note that X¯n\overline{X}_{n} is just the projection of XnX_{n} to the sphere, i.e., X¯n=πXn\overline{X}_{n}=\pi_{*}X_{n}. The corresponding charge PnP_{n} is defined by

dPn\displaystyle dP_{n} =ıXnω\displaystyle=-\imath_{X_{n}}\omega
=ıXn(dθ+egπϵ)\displaystyle=-\imath_{X_{n}}(d\theta+eg\,\pi^{*}\epsilon)
=d[θ(Xn)]egπıX¯nϵ.\displaystyle=d[\theta(X_{n})]-eg\,\pi^{*}\imath_{\overline{X}_{n}}\epsilon\,. (3.11)

The first term in the last line follows from 0=£Xnθ=ıXndθ+dıXnθ0=\pounds_{X_{n}}\theta=\imath_{X_{n}}d\theta+d\imath_{X_{n}}\theta, and θ\theta is invariant under any point transformation, as discussed in the paragraph leading to (3.3). Like the first term, the second term is also an exact 1-form: it is closed since d(πıX¯nϵ)=πdıX¯nϵ=πX¯nϵ=0d(\pi^{*}\imath_{\overline{X}_{n}}\epsilon)=\pi^{*}d\imath_{\overline{X}_{n}}\epsilon=\pi^{*}{\cal L}_{\overline{X}_{n}}\epsilon=0, and since S2S^{2} is simply connected it is therefore also exact, i.e., ıX¯nϵ=dΓn\imath_{\overline{X}_{n}}\epsilon=d\Gamma_{n}, for some Γn:S2\Gamma_{n}:S^{2}\rightarrow\mathbb{R}. The charge PnP_{n} is thus given by

Pn=p(X¯n)egΓnπP_{n}=p(\overline{X}_{n})-eg\,\Gamma_{n}\circ\pi (3.12)

up to an additive constant. The use of the symbol “PP” is motivated by the fact that the charges associated with spatial transformations are the analogue of momentum coordinates. The term p(X¯n)p(\overline{X}_{n}) alone is the usual orbital angular momentum associated with the rotation Killing vector field X¯n\overline{X}_{n}, while PnP_{n} is the canonical angular momentum, i.e., the charge that generates rotations on the phase space with symplectic form ω\omega (1.1). On a phase space with the “usual” symplectic form dθd\theta, the canonical angular momentum would have been simply p(X¯n)p(\overline{X}_{n}).

Next we compute the charges associated with the 3\mathbb{R}^{3*} part of the group. Since the group 3\mathbb{R}^{3*} is a vector space, it can be naturally identified with its Lie algebra. Let YαY_{\alpha} be the momentum translation vector field on 𝒫\mathcal{P} induced by an element α\alpha of the Lie algebra of 3\mathbb{R}^{3*} . The corresponding charge QαQ_{\alpha} is defined by

dQα\displaystyle dQ_{\alpha} =ıYαω\displaystyle=-\imath_{Y_{\alpha}}\omega
=ıYαdθ\displaystyle=-\imath_{Y_{\alpha}}d\theta
=£Yαθ\displaystyle=-\pounds_{Y_{\alpha}}\theta
=ddtFtαθ|t=0\displaystyle=-\left.\frac{d}{dt}F_{t\alpha}^{*}\theta\right|_{t=0}
=πα\displaystyle=\pi^{*}\alpha
=πdfα\displaystyle=\pi^{*}df_{\alpha}
=d(fαπ),\displaystyle=d(f_{\alpha}\circ\pi)\,, (3.13)

where in the second line we used that ıYαϵ=0\imath_{Y_{\alpha}}\epsilon=0; in the third line that ıYαθ=0\imath_{Y_{\alpha}}\theta=0; in the fourth line that the flow induced by α\alpha is pFtα(p)p\mapsto F_{t\alpha}(p); in the fifth line we used (3.6); and in the sixth line we used that dfα=αdf_{\alpha}=\alpha, as defined in (3.8).141414Alternatively, in local coordinates adapted to the bundle structure of TS2T^{*}S^{2}, the flow generated by α=αidqi\alpha=\alpha_{i}dq^{i} is given by Ftα(qi,pi)=(qi,pitαi)F_{t\alpha}(q^{i},p_{i})=(q^{i},p_{i}-t\alpha_{i}), so Yα=αipiY_{\alpha}=-\alpha_{i}\frac{\partial}{\partial p_{i}}, and the charge differential is dQα=iYα(dpidqi+egϵijdqidqj)=αidqi=παdQ_{\alpha}=-i_{Y_{\alpha}}(dp_{i}\wedge dq^{i}+eg\,\epsilon_{ij}dq^{i}\wedge dq^{j})=\alpha_{i}dq^{i}=\pi^{*}\alpha. In the last step the π\pi^{*} appears because, in this equation, qiq^{i} are coordinates on TS2T^{*}S^{2}, while in the definition of α\alpha they are coordinates on S2S^{2}. The charge associated with α\alpha is therefore

Qα=fαπQ_{\alpha}=f_{\alpha}\circ\pi\, (3.14)

up to an additive constant. The use of the symbol “QQ” here is motivated by the fact that the charges associated with momentum translations are the analogue of position coordinates.

To be more concrete, it is convenient to use the realization of S2S^{2} as the unit sphere in the abstract 3\mathbb{R}^{3}, which was introduced in the previous subsection. We identify n𝔰𝔬(3)n\in\mathfrak{so}(3) with the vector in 3\mathbb{R}^{3} whose direction is the corresponding axis of rotation and whose magnitude |n||n| gives the angle of rotation of exp(n)\exp(n), according to the right-hand rule. Then, using adapted spherical coordinates in which nn is aligned with θ=0\theta=0, we have X¯n=|n|ϕ\overline{X}_{n}=|n|\partial_{\phi}, hence ıX¯nϵ=|n|ıϕsinθdθdϕ=|n|d(cosθ)=d(nx)\imath_{\overline{X}_{n}}\epsilon=|n|\,\imath_{\partial_{\phi}}\!\sin\theta\,d\theta\wedge d\phi=|n|\,d(\cos\theta)=d(n\cdot x), so we can choose Γn=nx\Gamma_{n}=n\cdot x. Therefore, the canonical charges are given, up to an additive constant, by

Pn=p(X¯n)egnx\displaystyle P_{n}=p(\overline{X}_{n})-eg\,n\cdot x (3.15)
Qα=α(x).\displaystyle Q_{\alpha}=\alpha(x)\,. (3.16)

The notation is somewhat abbreviated here. Strictly speaking, PnP_{n} and QαQ_{\alpha} are functions on the phase space 𝒫{\cal P}, which is specified above by giving their values at a point p𝒫p\in{\cal P}. The vector field X¯n\overline{X}_{n} is implicitly evaluated at π(p)\pi(p), and xx is the unit vector representative of π(p)\pi(p) in the embedded realization of S23S^{2}\subset\mathbb{R}^{3}.

We next consider the Poisson bracket algebra of the charges. By construction, this algebra matches the Lie algebra of the canonical group that defined the charges, up to a possible central extension. If a central extension appears in such an algebra, in general it may or may not be removable using the freedom to shift the charges by addition of constants. As mentioned above, the Euclidean algebra in itself (i.e. apart from any canonical realization) does not admit any non-trivial central extension (by 2-cocycles), so that it must be possible to choose the additive constants such that the Poisson algebra matches the Lie algebra. In fact, the choices we have made in (3.15) and (3.16) satisfy this criterion, and the Poisson algebra takes the form

{Pn,Pn}=P[n,n]\displaystyle\{P_{n},P_{n^{\prime}}\}=P_{[n,n^{\prime}]}
{Qα,Pn}=QXnα\displaystyle\{Q_{\alpha},P_{n}\}=Q_{{\cal L}_{X_{n}}\alpha}
{Qα,Qα}=0,\displaystyle\{Q_{\alpha},Q_{\alpha^{\prime}}\}=0\,, (3.17)

which matches the semi-direct product structure of the algebra 𝔤=3S𝔰𝔬(3)\mathfrak{g}=\mathbb{R}^{3*}\oplus_{S}\mathfrak{so}(3) of GG without central charges. That is, denoting elements of 𝔤\mathfrak{g} by (α,n)3S𝔰𝔬(3)(\alpha,n)\in\mathbb{R}^{3*}\oplus_{S}\mathfrak{so}(3), the product rule reads

[(0,n),(0,n)]=(0,[n,n])\displaystyle[(0,n),(0,n^{\prime})]=(0,[n,n^{\prime}])
[(α,0),(0,n)]=(Xnα,0)\displaystyle[(\alpha,0),(0,n)]=(\mathcal{L}_{X_{n}}\alpha,0)
[(α,0),(α,0)]=0,\displaystyle[(\alpha,0),(\alpha^{\prime},0)]=0\,, (3.18)

revealing how the linear association (α,n)Pn+Qα(\alpha,n)\mapsto P_{n}+Q_{\alpha} is a (true) homomorphism.

To verify that the choices (3.15) and (3.16) lead to no central charges, and for later purposes, it is convenient to introduce a basis for 𝔤\mathfrak{g}. Using the identification 𝔰𝔬(3)3\mathfrak{so}(3)\sim\mathbb{R}^{3}, choose an orthonormal basis {ei}\{e_{i}\} (i=1,2,3i=1,2,3) in 3\mathbb{R}^{3}. (Note that exp(ei)\exp(e_{i}) implements a right-handed rotation by the angle 11 around the ii-axis.) It is straightforward to check that [ei,ej]=εijkek[e_{i},e_{j}]=\varepsilon_{ijk}e_{k}, where εijk\varepsilon_{ijk} is the Levi-Civita symbol.151515 As SO(3)SO(3) acts on 3\mathbb{R}^{3} from the left, the algebra element n3n\in\mathbb{R}^{3} induces the vector field X¯n|x=n×x\left.\overline{X}_{n}\right|_{x}=n\times x, where x3x\in\mathbb{R}^{3}. The Lie bracket of two such vector fields is given by [X¯n,X¯n]=X¯n×n[\overline{X}_{n},\overline{X}_{n^{\prime}}]=-\overline{X}_{n\times n^{\prime}}. Together with [X¯ξ,X¯η]=X¯[η,ξ][\overline{X}_{\xi},\overline{X}_{\eta}]=\overline{X}_{[\eta,\xi]} (see third paragraph of section 2) this yields [n,n]=n×n[n,n^{\prime}]=n\times n^{\prime}. Let {ei}\{e^{i}\} denote the dual basis, satisfying ei(ej)=δjie^{i}(e_{j})=\delta^{i}_{\,\,j}. We define

Ji:=Pei\displaystyle J_{i}:=P_{e_{i}}
Ni:=Qei,\displaystyle N_{i}:=Q_{e^{i}}\,, (3.19)

which satisfy the algebra

{Ji,Jj}=εijkJk\displaystyle\{J_{i},J_{j}\}=\varepsilon_{ijk}J_{k}
{Ji,Nj}=εijkNk\displaystyle\{J_{i},N_{j}\}=\varepsilon_{ijk}N_{k}
{Ni,Nj}=0.\displaystyle\{N_{i},N_{j}\}=0\,. (3.20)

This is the algebra of the Euclidean group, presented in terms of a basis of generators of rotation and translation.

We can express (3.15) and (3.16) in this basis. If xS23x\in S^{2}\subset\mathbb{R}^{3}, we can write X¯ei=ei×x\overline{X}_{e_{i}}=e_{i}\times x. Also, if pp is a co-vector on S2S^{2} at xx, we can (abusing the notation) associate it with a vector pp in 3\mathbb{R}^{3}, tangent to S2S^{2} at xx, such that pv=p(v)p\cdot v=p(v), where vv is any vector on 3\mathbb{R}^{3} tangent to S2S^{2} at xx. In this way, we have p(X¯ei)=p(ei×x)=ei(x×p)p(\overline{X}_{e_{i}})=p\cdot(e_{i}\times x)=e_{i}\cdot(x\times p), which is the familiar orbital angular momentum about the axis eie_{i} in 3\mathbb{R}^{3}. Thus, Ji=ei(x×pegx)J_{i}=e_{i}\cdot(x\times p-eg\,x). Also, NiN_{i}, evaluated at any point in the fiber over xx, can be written as Ni=ei(x)=eixN_{i}=e^{i}(x)=e_{i}\cdot x. In a 3-vector notation,

J\displaystyle J =x×pegx\displaystyle=x\times p-eg\,x (3.21)
N\displaystyle N =x,\displaystyle=x\,, (3.22)

so we have Ji=eiJJ_{i}=e_{i}\cdot J and Ni=eiNN_{i}=e_{i}\cdot N.

To establish (3.20), i.e., to verify that indeed there are no missing central terms, we may evaluate the brackets at points in the phase space where both sides of the equation vanish. For example, recall that {J1,J2}=ω(Xe1,Xe2)\{J_{1},J_{2}\}=-\omega(X_{e_{1}},X_{e_{2}}). The vector field Xe1X_{e_{1}} vanishes at the points in phase space with zero momentum and located at the rotation axis e1e_{1} on the S2S^{2} (i.e., the two points in the intersection of the zero section with the fibers over x=±e1x=\pm e_{1}). Hence {J1,J2}\{J_{1},J_{2}\} vanishes there. On the other hand, according to (3.21), at the same point the function J3J_{3} is equal to ege3e1=0-eg\,e_{3}\cdot e_{1}=0. Any constant added to J3J_{3} would spoil this agreement. This argument works for all of the JiJ_{i} brackets, so we conclude that no central term need be added on the right hand side of the first bracket in (3.20). The argument just given also implies that {J1,N2}=ω(Xe1,Ye2)\{J_{1},N_{2}\}=-\omega(X_{e_{1}},Y_{e^{2}}) vanishes at the axis point e1e_{1}, while N3N_{3} equals e3(e1)=0e^{3}(e_{1})=0 at that same point. Hence no central term appears in the second bracket either. As for the last brackets in (3.20), since the right hand side vanishes, it is unaffected by addition of a constant to any charge. In conclusion, as we claimed, the charges defined in (3.15) and (3.16) provide a realization of the quantizing algebra without central charges.

4 The quantum theory

Quantization of the theory amounts to constructing a unitary irreducible (UI) projective representation of the canonical group GE3G\sim E_{3}, in which the value of all classical Casimir invariants carry over to the quantum theory, modulo possible operator ordering ambiguity that might arise in quantizing the Casimir invariant. Since the Euclidean algebra does not admit (non-trivial) central extensions, the UI projective representations of E3E_{3} are in correspondence with true UI representations of its universal cover, E^33SU(2)\widehat{E}_{3}\sim\mathbb{R}^{3}\rtimes SU(2), which in turn are in correspondence with UI representations of the Euclidean algebra 𝔤=3S𝔰𝔬(3)\mathfrak{g}=\mathbb{R}^{3}\oplus_{S}\mathfrak{so}(3).161616See [28]; or, for an informal discussion, [29].

4.1 Quantum canonical observables

The quantum version of the classical canonical observables JiJ_{i} and NiN_{i} are the self-adjoint generators of the corresponding unitary transformations in some Hilbert space \mathcal{H}. That is, given some unitary irreducible representation UU of GG (or its universal cover), and denoting elements of 𝔤\mathfrak{g} by (α,n)(\alpha,n), we define the operators J^i\widehat{J}_{i} and N^i\widehat{N}_{i} by

U[exp(0,λei)]=:eiλJ^i/\displaystyle U[\exp(0,\lambda e_{i})]=:e^{-i\lambda\widehat{J}_{i}/\hbar}
U[exp(λei,0)]=:eiλN^i/,\displaystyle U[\exp(\lambda e^{i},0)]=:e^{-i\lambda\widehat{N}_{i}/\hbar}\,, (4.1)

where λ\lambda is an arbitrary real parameter. It follows from the group structure that the quantized algebra satisfies

[J^i,J^j]=iεijkJ^k\displaystyle[\widehat{J}_{i},\widehat{J}_{j}]=i\hbar\,\varepsilon_{ijk}\widehat{J}_{k}
[J^i,N^j]=iεijkN^k\displaystyle[\widehat{J}_{i},\widehat{N}_{j}]=i\hbar\,\varepsilon_{ijk}\widehat{N}_{k}
[N^i,N^j]=0.\displaystyle[\widehat{N}_{i},\widehat{N}_{j}]=0\,. (4.2)

The quantization map is JiJ^iJ_{i}\rightarrow\widehat{J}_{i} and NiN^iN_{i}\rightarrow\widehat{N}_{i}, and (4.2) is the quantization of the Poisson algebra (3.20). For notational simplicity we shall henceforth omit the “hat” symbol over quantum operators, since it should be clear from the context whether we are referring to the classical or the quantum observables.

4.2 Casimir invariants

Casimir operators, by definition, commute with all elements of the algebra, and their eigenvalues can therefore be used to label its irreducible representations171717If an operator commutes with all other operators in an irreducible representation, then according to Schur’s lemma it must be proportional to the identity operator.. There are two independent Casimir operators associated with the algebra 𝔤\mathfrak{g},

N2=i(Ni)2\displaystyle N^{2}=\sum_{i}(N_{i})^{2}
NJ=iNiJi.\displaystyle N\cdot J=\sum_{i}N_{i}J_{i}\,. (4.3)

Their classical correspondents Poisson-commute with everything in the classical algebra and, using (3.22) and (3.21), we have

N2=x2=1\displaystyle N^{2}=x^{2}=1
NJ=egx2=eg,\displaystyle N\cdot J=-eg\,x^{2}=-eg\,, (4.4)

revealing that these two quantities are constant classical observables. Since no operator ordering ambiguities arise in quantizing N2N^{2} and NJN\cdot J (=JN=J\cdot N), we take as part of the quantization prescription that the quantum theory must carry the representation for which the values of the two Casimirs (4.3) are given by precisely the corresponding classical values (4.4). Note that the value of NJN\cdot J is the only way the presence of the magnetic monopole is felt in the quantum theory.

4.3 Representations of the algebra

While the representation theory of the Euclidean group is well known from Mackey’s theory of induced representations carried by a space of wavefunctions [30, 31], we will present it here using a rather simpler abstract ladder-operator approach.181818After completing this work we found that a similar realization of the representation appears in [12], although no explicit derivation is presented there. The method is reminiscent of the usual derivation of the unitary irreducible representations of SU(2)SU(2). Some of the details are left to Appendix B. In Appendix C we discuss an alternative derivation based on Mackey theory, which provides a construction of the Hilbert space based on wave functions on S2S^{2}.

Note first that the Euclidean algebra (4.2) is invariant under rescaling of NN. Thus, without loss of generality, we can particularize to the representation with N2=1N^{2}=1. The value of the other Casimir is left arbitrary,

NJ=s,N\cdot J=s\hbar\,, (4.5)

with ss some real parameter.

Let us start with a basis of simultaneous eigenvectors of J2J^{2} and J3J_{3}, denoted as |j,m|j,m\rangle, defined by

J2|j,m=j(j+1)2|j,m\displaystyle J^{2}|j,m\rangle=j(j+1)\hbar^{2}|j,m\rangle
J3|j,m=m|j,m.\displaystyle J_{3}|j,m\rangle=m\hbar|j,m\rangle\,. (4.6)

At this point it is not clear which values of jj are allowed, for a given ss, nor if there is more than one state with a given value of jj and mm. Note that J±=J1±iJ2J_{\pm}=J_{1}\pm iJ_{2} act as raising and lowering operators for mm and, from the standard analysis of the angular momentum algebra, we know that, for a given jj, mm varies from j-j to jj in integer steps. Also, jj can only be a non-negative integer or half-integer (i.e., j120+j\in\frac{1}{2}\mathbb{Z}^{0+}), but it may be that only a subset of that is included.

Before systematically deriving the properties of the irreducible representations, it is enlightening to guess, by a simple but non-rigorous reasoning, which values of ss and jj are included. Supposing that there is (in a limiting sense) a state localized at the north pole of the sphere, the operator NJN\cdot J acts on such a state as J3J_{3} (see (4.18) for details). This implies that 2s2s must be integer valued in order for the representation to be non-trivial. Since it has the J3J_{3} eigenvalue ss\hbar, such a state must be constructed from states with jsj\geq s. By virtue of rotational symmetry, the same can be said about states localized at any other point on the S2S^{2}, and one would expect an irreducible representation to be constructed from the span of these states with jsj\geq s. Moreover, since NN is a vector operator (in the way it transforms under commutation with the JiJ_{i}), its action can change the value of jj by plus or minus unity, which suggests that repeated action of NN will both raise the jj values without bound and lower them until they reach a floor, which presumably lies at j=sj=s, since jsj\geq s is the only apparent constraint. That is, the representation must include all values j=s+nj=s+n, for non-negative integers nn. Indeed this is the correct spectrum, as we now show by explicit construction of the representation.191919The earliest mention of this spectrum that we have found appears in [32], although no derivation was given there.

We now present the rigorous derivation of the irreducible representations, analyzing how certain operators in the algebra act as “ladder operators” to shift the values jj and mm. From the Wigner-Eckart theorem we know that when the vector operator NiN_{i} acts on a state, it can only change the value of jj by 1-1, 0 or 11. And, since acting on a state with N±=N1±iN2N_{\pm}=N_{1}\pm iN_{2} changes the mm value by ±1\pm 1, it follows that N+|j,j|j+1,j+1N_{+}|j,j\rangle\propto|j+1,j+1\rangle. Thus N+N_{+} acts as a raising operator for edge states |j,j|j,j\rangle, i.e. states with the maximal m=jm=j for a given jj. For now, let us assume that if |j,j|j,j\rangle is in the Hilbert space then so is N+|j,jN_{+}|j,j\rangle, i.e., that its norm is positive. This assumption will be justified later.

Now let |j0,j0|j_{0},j_{0}\rangle be the ground state, in the sense that there are no states with j<j0j<j_{0} in the representation being constructed. (We know that there must be such a lowest jj state, since jj is non-negative.) Since we are constructing an irreducible representation, the whole Hilbert space \mathcal{H} must be generated by acting with all elements of the algebra on any given state, in particular the ground state. Consider then the set of states

|j,m:=(J)jm(N+)jj0|j0,j0,|j,m\rangle:=(J_{-})^{j-m}(N_{+})^{j-j_{0}}|j_{0},j_{0}\rangle\,, (4.7)

where jj00+j-j_{0}\in\mathbb{Z}^{0+} and jmj-j\leq m\leq j. Since these states have distinct eigenvalues for the the self-adjoint operators J2J^{2} and/or J3J_{3}, they are necessarily orthogonal (although not normalized). We will show that a representation exists only if s12s\in\frac{1}{2}\mathbb{Z}, in which case there is a unique irreducible representation, spanned by the states (4.7) with j0=|s|j_{0}=|s|. To establish this, we first prove that these states are closed under the action of the entire algebra, and then we show that that they all have positive norm, provided the ss quantization condition holds and j0j_{0} has the required value.

Refer to caption
Refer to caption
Figure 2: The diagram on the left depicts an arbitrary edge state |j,j|j,j\rangle and the action of the operators N+N_{+}, N3N_{3}, NN_{-} and JJ_{-} on it — the image of |j,j|j,j\rangle is in a linear superposition of states at the endpoints of the corresponding arrows; note that J3J_{3} keeps |j,j|j,j\rangle fixed and J+J_{+} annihilates it. The diagram on the right shows the example of the representation with s=3/2s=3/2, indicating the states that are present in it.

It is clear how to define the action of JiJ_{i} on this basis, using just the angular momentum algebra. That is, Ji|j,mJ_{i}|j,m\rangle can be written as a linear combination of |j,m1|j,m-1\rangle, |j,m|j,m\rangle and |j,m+1|j,m+1\rangle with coefficients determined by the algebra. So let us focus on defining the action of the NN’s. It is easy to see that, since the commutator of an NN with a JJ gives an NN, the action of NiN_{i} on any state is well-defined provided that the NN’s have a well-defined action on the edge states |j,j|j,j\rangle. Similarly, if the edge states have positive norm then so do the rest of the states.

By the definition of the basis states (4.7) we have

N+|j,j=|j+1,j+1.N_{+}|j,j\rangle=|j+1,j+1\rangle\,. (4.8)

Next, using only the algebra and the Casimir invariants, we show in Appendix B that

N3|j,j=s(j+1)|j,j12(j+1)|j+1,jN_{3}|j,j\rangle=\frac{s}{(j+1)}|j,j\rangle-\frac{1}{2(j+1)}|j+1,j\rangle (4.9)

and, for j>j0j>j_{0},

N|j,j=\displaystyle N_{-}|j,j\rangle= 2j2j+1(1s2j2)|j1,j1+\displaystyle\frac{2j}{2j+1}\left(1-\frac{s^{2}}{j^{2}}\right)|j-1,j-1\rangle+
+s|j,j1j(j+1)|j+1,j12(2j+1)(j+1).\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ +s\frac{|j,j-1\rangle}{j(j+1)}-\frac{|j+1,j-1\rangle}{2(2j+1)(j+1)}\,. (4.10)

(The action of the algebra on the edge state |j,j|j,j\rangle is depicted on the left in Fig. 2.) It then remains to determine N|j0,j0N_{-}|j_{0},j_{0}\rangle, to establish that the states |j,j|j,j\rangle with jj0j\geq j_{0} have positive norm, and to show that the algebra is consistent with the assumption that jj cannot be lowered below j0j_{0}.

Observe that the form of (4.10) indicates that the operator

L(j)=Nsj(j+1)J+12(2j+1)(j+1)(J)2N+L^{(j)}=N_{-}-\frac{s}{j(j+1)}J_{-}+\frac{1}{2(2j+1)(j+1)}(J_{-})^{2}N_{+} (4.11)

acts as a jj-lowering operator for |j,j|j,j\rangle, i.e. , L(j)|j,j|j1,j1L^{(j)}|j,j\rangle\propto|j-1,j-1\rangle, modulo the assumption that j>j0j>j_{0}. In fact, using only the algebra it can be established for any jj, including j0j_{0}, that

[J2,L(j)]|j,j=2j2L(j)|j,j,[J^{2},L^{(j)}]|j,j\rangle=-2j\hbar^{2}L^{(j)}|j,j\rangle\,, (4.12)

hence J2L(j)|j,j=j(j1)2L(j)|j,jJ^{2}L^{(j)}|j,j\rangle=j(j-1)\hbar^{2}L^{(j)}|j,j\rangle.

The squared norms of the raised and lowered edge states can be computed using only the algebra and Casimir invariants. Doing so, we find

N+|j,j2=2(j+1)2j+3(1s2(j+1)2)j,j|j,j\|N_{+}|j,j\rangle\|^{2}=\frac{2(j+1)}{2j+3}\left(1-\frac{s^{2}}{(j+1)^{2}}\right)\langle j,j|j,j\rangle (4.13)
L(j)|j,j2=2j2j+1(1s2j2)j,j|j,j.\|L^{(j)}|j,j\rangle\|^{2}=\frac{2j}{2j+1}\left(1-\frac{s^{2}}{j^{2}}\right)\langle j,j|j,j\rangle\,. (4.14)

(See Appendix B for more details.) Starting with a state |j,j|j,j\rangle, we can apply a sequence of L(j)L^{(j)} operators to lower jj successively. This process stops, preventing jj from becoming negative, only if j0j_{0} differs from jj by an integer and L(j0)|j0,j0=0L^{(j_{0})}|j_{0},j_{0}\rangle=0. It thus follows from (4.14), and the non-degeneracy of the Hilbert space, that j0=|s|j_{0}=|s|. Moreover, the condition L(j0)|j0,j0=0L^{(j_{0})}|j_{0},j_{0}\rangle=0, together with (4.11), defines the action of NN_{-} on the ground state,

N|j0,j0=s|j0,j01j0(j0+1)|j0+1,j012(2j0+1)(j0+1).N_{-}|j_{0},j_{0}\rangle={s}\frac{|j_{0},j_{0}-1\rangle}{j_{0}(j_{0}+1)}-\frac{|j_{0}+1,j_{0}-1\rangle}{2(2j_{0}+1)(j_{0}+1)}\,. (4.15)

This happens to agree with (4.10) particularized to j=j0=|s|j=j_{0}=|s| (i.e., although the derivation of (4.10) applied only for j>j0j>j_{0}, the result actually extends to j=j0j=j_{0}). Finally, (4.13) shows that, since j|s|j\geq|s|, N+N_{+} always creates a positive-norm state. This establishes that, if |j0,j0|j_{0},j_{0}\rangle has positive (squared) norm, then so do all of the other edge states. It follows that all states defined in (4.7) have positive norm and thus must be in \mathcal{H}. (This array of states is illustrated on the right in Fig. 2, for the case j0=3/2j_{0}=3/2.)

We conclude that the Hilbert space, in the representation with (N2,NJ)=(1,s)(N^{2},N\cdot J)=(1,s\hbar), is indeed spanned by the states defined in (4.7), with j0=|s|j_{0}=|s|. As a consequence, we see that the Hilbert space is non-trivial if and only if

s12,s\in\frac{1}{2}\mathbb{Z}\,, (4.16)

since jj can only take values in 120+\frac{1}{2}\mathbb{Z}^{0+}.

4.4 Dirac’s condition and intrinsic spin

According to the matching condition for the Casimir (4.4), and the definition of ss (4.5), the quantum theory is based on the representation with s=eg/s=-eg/\hbar. The requirement for the theory to be non-trivial, (4.16), thus implies

eg=n2,n,eg=\frac{n}{2}\hbar\,,\quad n\in\mathbb{Z}\,, (4.17)

which is precisely Dirac’s charge quantization condition.

It is interesting to note how the more restrictive Schwinger condition [33, 34, 35], eg=neg=n\hbar, would appear in this approach. In the previous section we derived the constraint on ss by analyzing the representations of the algebra generated by JJ’s and NN’s. However, not all of these can be “integrated” to representations of the group G=3SO(3)G=\mathbb{R}^{3*}\rtimes SO(3). As mentioned before, the representations of the algebra are in correspondence with representations of the universal cover of the group, G~=3SU(2)\widetilde{G}=\mathbb{R}^{3*}\rtimes SU(2). In order to have a true representation of SO(3)SO(3), rather than just a projective representation (i.e. a representation up to a phase), one must impose that a rotation by 2π2\pi corresponds to the identity operator, which implies that only integer spins, j0+j\in\mathbb{Z}^{0+}, are allowed. Were one to insist that the quantum theory be based on true representations of the quantizing group, GG, this would thus impose the constraint ss\in\mathbb{Z}, which leads to Schwinger’s condition. However, there is no fundamental reason within quantum mechanics to exclude projective representations as realizations of a symmetry.

Let us now make a comment about intrinsic spin. In addition to the interpretation given in Section 4.2 for the Casimir NJN\cdot J, as eg-eg, there is another interpretation, as a measure of the intrinsic spin of the particle. To see this, consider a basis of simultaneous eigenvectors of N=(N1,N2,N3)N=(N_{1},N_{2},N_{3}), denoted by |n1,n2,n3|n_{1},n_{2},n_{3}\rangle. In the representation with N2=1N^{2}=1 we must have n12+n22+n32=1n_{1}^{2}+n_{2}^{2}+n_{3}^{2}=1. This can be interpreted as a (non-normalizable) state localized at the point n=(n1,n2,n3)n=(n_{1},n_{2},n_{3}) on the sphere. Without loss of generality, consider such a state localized at the north pole |u=|0,0,1|u\rangle=|0,0,1\rangle. We have

J3|u=JN|u=s|u,J_{3}|u\rangle=J\cdot N|u\rangle=s\hbar|u\rangle\,, (4.18)

so that |u|u\rangle is an eigenstate of J3J_{3} with eigenvalue s12s\in\frac{1}{2}\mathbb{Z}. But |u|u\rangle is localized at the north pole, so this angular momentum must be an intrinsic spin of the particle. Hence, there is an equivalence between a spinless particle with electric charge ee in the presence of a magnetic monopole of charge gg inside the sphere, and a particle with spin s=egs\hbar=-eg, with no magnetic monopole [36, 37]. Interestingly, the same equivalence occurs in classical physics: a free particle with spin would precess on a circle smaller than the great circle, conserving angular momentum, exactly as if there were a magnetic field curving the orbit of a spinless charged particle. In fact, if it is not possible to turn off the magnetic monopole (or modify its charge), and the particle is truly living on a sphere (without access to higher dimensions), then there is no way to distinguish a magnetic monopole from an intrinsic spin.

4.5 Energy spectrum

In this section we compute the energy spectrum for a non-relativistic particle of mass mm living on a round sphere of radius rr, in the presence of a magnetic field given by (3.1). We assume here that the magnetic field is uniform with respect to the metric on the sphere. The classical time-evolution Hamiltonian is

H=12mhabpapb,H=\frac{1}{2m}h^{ab}p_{a}p_{b}\,, (4.19)

where pap_{a} is the canonical momentum—a cotangent vector on the sphere—and habh_{ab} is the metric on the sphere (using abstract tensor index notation). Note that the magnetic field does not appear in the Hamiltonian for, in our approach, it is fully encoded in the symplectic form. From Hamilton’s equations, we see that the kinematical velocity is simply related to the momentum by

x˙α={xα,H}=1mhαβpβ,\dot{x}^{\alpha}=\{x^{\alpha},H\}=\frac{1}{m}h^{\alpha\beta}p_{\beta}\,, (4.20)

where xα=(x1,x2)x^{\alpha}=(x^{1},x^{2}) are any coordinates for the sphere. Thus, on a solution, the value of the Hamiltonian is the kinetic energy, 12mhαβx˙αx˙β\frac{1}{2}mh_{\alpha\beta}\dot{x}^{\alpha}\dot{x}^{\beta}.

Up to this point only the topology of the sphere, and the choice of an SO(3)SO(3) subgroup of the diffeomorphisms preserving BB, has played a role in identifying and quantizing the canonical group. Now we identify this subgroup with the symmetry group of the metric habh_{ab}, and use that identification to express the Hamiltonian in terms of the canonical observables JJ and NN. To this end, we first express the inverse metric of the sphere in terms of the rotation Killing vector fields X¯i:=X¯n=ei\overline{X}_{i}:=\overline{X}_{n=e_{i}} (cf. footnote 15) and the geometrical radius rr,

hab=1r2X¯iaX¯ib,h^{ab}=\frac{1}{r^{2}}\,\overline{X}_{i}^{a}\,\overline{X}_{i}^{b}\,, (4.21)

with implicit summation over the repeated index ii.202020To verify that this sum yields the inverse metric, note first that it is invariant under rotations. The normalization constant can be determined by looking at a specific point, namely (θ,ϕ)=(π/2,0)(\theta,\phi)=(\pi/2,0). There we have X¯1=0\overline{X}_{1}=0, X¯2=θ\overline{X}_{2}=\partial_{\theta} and X¯3=ϕ\overline{X}_{3}=\partial_{\phi}, and the inverse metric thus reads r2(θ2+ϕ2)r^{-2}(\partial_{\theta}^{2}+\partial_{\phi}^{2}), as desired. Inserting (4.21) for habh^{ab} in the Hamiltonian (4.19) yields

H\displaystyle H =12mr2(X¯iapa)(X¯ibpb)\displaystyle=\frac{1}{2mr^{2}}(\overline{X}_{i}^{a}p_{a})(\overline{X}_{i}^{b}p_{b})
=12mr2(Ji+egNi)(Ji+egNi)\displaystyle=\frac{1}{2mr^{2}}(J_{i}+egN_{i})(J_{i}+egN_{i})
=12mr2[J2(eg)2],\displaystyle=\frac{1}{2mr^{2}}[J^{2}-(eg)^{2}]\,, (4.22)

where we used (3.15), (3.16), and (3.2) in the second line, and (4.4) in the third line. To quantize, we just replace the function JJ by the quantum operator JJ. The Hamiltonian is clearly diagonal in the basis |j,m|j,m\rangle, and the energy eigenvalues are defined by

H|j,m=22mr2[j(j+1)j02]|j,m,H|j,m\rangle=\frac{\hbar^{2}}{2mr^{2}}\left[j(j+1)-j_{0}^{2}\right]|j,m\rangle\,, (4.23)

where j0=|s|=|eg|/j_{0}=|s|=|eg|/\hbar, and jj00+j-j_{0}\in\mathbb{Z}^{0+}. Equivalently, in terms of the non-negative integer k=jj0k=j-j_{0}, the energies can be enumerated as

Ekm=22mr2[k(k+1)+|s|(2k+1)].E_{km}=\frac{\hbar^{2}}{2mr^{2}}\left[k(k+1)+|s|\left(2k+1\right)\right]\,. (4.24)

The degeneracy of these “Landau levels” arises only from mm, so there are 2j+1=2(k+|s|)+12j+1=2(k+|s|)+1 states at level jj. In particular, the ground state has degeneracy 2|s|+12|s|+1, corresponding to one additional state for each magnetic flux quantum. This agrees with other derivations of the energy spectrum [18, 38, 32, 39].

The case where the magnetic field is not uniform with respect to the metric is slightly more subtle. In principle, we could include it in the symplectic form and proceed as before. However, unless BB is uniform, no SO(3)SO(3) subgroup of the BB-preserving diffeomorphisms of the sphere would be an isometry of the metric. Consequently, there would be no preferred form for the Hamiltonian when written in terms of the canonical observables, leading to operator-ordering ambiguities in the quantization. Instead we can split the magnetic field into its monopole and higher multipole parts, continuing to include the monopole part in the symplectic structure, but incorporating the higher multipole part into the Hamiltonian via the usual minimal coupling. By doing so, we obtain a globally defined Hamiltonian, which admits a standard application of canonical quantization. We discuss this case in Appendix D, where we also address the issue of gauge invariance in Isham’s group theoretic quantization scheme.

5 Discussion

In this paper we have studied the problem of quantizing a particle on a 2-sphere in the presence of a magnetic monopole using Isham’s group-theoretic scheme. Based on the principles of canonical quantization, the quantum theory is constructed from unitary irreducible (projective) representations of a transitive group of symplectic symmetries of the phase space. Our goal was to analyze the problem in a rigorous manner, emphasising the role of Casimir invariants in connecting the classical and the quantum worlds. To ensure robustness and generality of the quantization, we referred only to intrinsic properties of the system, adopted a gauge-invariant approach, and did not make any a priori assumptions about the Hilbert space.

In order to formulate the problem in a gauge-invariant language, with globally defined objects, we described the magnetic monopole as a “flux term” in the symplectic form on the phase space. A natural transitive group of symmetries of the phase space is the Euclidean group, E3=3SO(3)E_{3}=\mathbb{R}^{3*}\rtimes SO(3), where SO(3)SO(3) implements spatial rotations of the sphere and 3\mathbb{R}^{3*} corresponds to momentum translations (i.e., boosts). The canonical observables consist of three angular momentum “coordinates”, JJ, generating the spatial rotations, and three position “coordinates”, NN, generating the momentum translations. Although the group structure is independent of the magnetic monopole, the presence of the latter affects the expressions of the canonical observables, JJ and NN, as functions on the phase space, and it also affects the value of the classical Casimir JN=egJ\cdot N=-eg. To construct the Hilbert space, we employed an algebraic method involving ladder operators that raise and lower the eigenvalue of the J2J^{2} operator. This method for deriving the unitary irreducible representations of the Euclidean group resembles the usual approach for SU(2)SU(2). By imposing that the theory is free of negative-norm states, one gets a constraint on the possible values of the Casimir invariants N2N^{2} and JNJ\cdot N. If we set N2=1N^{2}=1 (which can always be done without loss of generality), then JNJ\cdot N must be an integer multiple of /2\hbar/2. Insisting that the values of these quantum invariants must be the same as their classical counterparts, we obtain Dirac’s charge quantization condition, eg=n/2eg=n\hbar/2.

Although Isham’s scheme provides a well-founded algorithm for quantization, it still suffers from ambiguities. One ambiguity appears in the choice of the particular SO(3)SO(3) subgroup of the (infinite-dimensional) group of volume-preserving diffeomorphisms of the sphere, whose lift to the phase space was used to construct the quantizing group. Although we argued that this choice does not affect the Hilbert space, it is relevant for the dynamics of the quantum theory. Indeed, the form of the Hamiltonian as a function of the canonical observables depends on which SO(3)SO(3) subgroup is chosen. For a generic metric, no such a choice is preferred and thus the Hamiltonian would have no preferred form, leading to operator-ordering ambiguities. We contrast this with the case of the round sphere, where the group of isometries of the metric provides a preferred SO(3)SO(3) subgroup of symmetries, and leads to a simple (free) Hamiltonian that can be quantized unambiguously. Save for a few other highly symmetric cases (like the ellipsoid), some additional principle would be needed to resolve the ambiguities associated with a generic geometry. Another, more fundamental, ambiguity that is potentially resolved by the metric is the choice of the quantizing group itself. In fact, there are many finite-dimensional groups that act transitively on the sphere (e.g., the Lorentz group), so it may be that the underlying justification for the choice of SO(3)SO(3) comes down to the symmetries of the metric. However the example of a particle on S3S^{3} reveals that the metric may not be sufficient to cure this ambiguity, as in this case both SU(2)SU(2) and SO(4)SO(4) act transitively and isometrically on a round sphere. For the case of a particle on S3S^{3}, see for example [40, 41, 42, 43].

It is important to stress that this procedure is predicated on the assumption that the classical theory, and in particular all of the observables, are defined globally on phase space, since the quantum description is inherently global. To illustrate this point, suppose that, instead of incorporating the magnetic monopole field in the symplectic form, we took the more common route of using the canonical sympletic form and including the vector potential AA in the Hamiltonian, H=12m(peA)2H=\frac{1}{2m}(p-eA)^{2}. It would still be natural to choose the Euclidean group for the quantizing group. If we did so, the Casimir JNJ\cdot N would vanish classically and, if the Casimir matching principle were to apply, this would mean that JNJ\cdot N would also vanish quantum mechanically. But the quantum theory should not depend on the particular way one decides to describe the classical system, and we have already established that in the quantum theory JNJ\cdot N should be equal to eg-eg. This failure of the Casimir matching principle can be attributed to the failure of the classical phase space description to be global. Since no globally defined vector potential AA can describe the monopole, one must cover the sphere in at least two overlapping gauge patches. Classically, one can focus on phase space trajectories that are temporarily confined to one or the other patch, switching descriptions in the overlap region, but that precludes a global map between the classical and quantum observables. This is not to say that one cannot describe the quantum theory using the vector potential for the monopole, but only that it cannot be done according to the standard framework of global canonical quantization. Instead, one can ensure that the quantum description itself is globally well defined. As explained in Appendix C, in such a global description the wave functions are sections of a complex line bundle over the sphere carrying a representation of the Euclidean group. In such a representation, peAp-eA can be realized as a covariant derivative provided that the product of the charges satisfies the Dirac quantization condition eg=n/2eg=n/2 for some integer nn, and that the Chern number of the bundle is nn. In this formulation, the quantum Casimir JNJ\cdot N takes the correct value, eg-eg.

As the previous paragraph makes clear, when quantizing a classical system the determination of the correct Hilbert space can depend not only on the intrinsic structure of the phase space, but on the dynamics as well. For the particle on the sphere in a monopole field, the impact of the dynamics on the Hilbert space is felt either from the Casimir matching principle (when the dynamical effects of the magnetic monopole are encoded in the symplectic form), or from the Chern class selection (when the monopole is encoded in the Hamiltonian via the gauge potential). This role of the dynamics is absent in quantum mechanics on n\mathbb{R}^{n} since (by the Stone-von Neumann theorem) the canonical algebra of position and momentum coordinates has a unique representation, so the Hamiltonian plays no role in selecting the Hilbert space. On the other hand, it is ubiquitous in quantum field theory, where the infinite dimensionality of the algebra allows for many representations of the canonical algebra, and the selection of a representation depends on other physical observables, such as the Hamiltonian. For example, as shown by Haag’s theorem [44, 45], the representations containing a translation invariant vacuum state in infinite volume differ even for a free scalar field with different masses [46]. The problem of a particle on a sphere provides a simple example where the non-uniqueness of the representation, resolved only by the dynamics, comes from the non-trivial topology of the phase space, rather than from the infinite dimensionality of the algebra.

The difference between the quantizations on a plane and on a sphere can be understood from a group-theoretic perspective, in terms of the “planar limit” of the sphere, as follows. We expect that a particle that remains near the north pole of the sphere at all times should not be able to “feel” the global structure of the sphere. Thus, in some limit, the quantum mechanics on a sphere must reduce to the usual one on a plane. To see how this works, consider a “sector” of the Hilbert space in which X1:=N1θX_{1}:=N_{1}\sim\theta, X2:=N2θX_{2}:=N_{2}\sim\theta and N31N_{3}\approx 1, where θ1\theta\ll 1 is the angle around the north pole where the particle is localized. Note that the “vertical momentum” P3θ|P|P_{3}\sim\theta|P| is small in this sector, so we can approximate J1P2egX1J_{1}\approx-P_{2}-egX_{1}, J2P1egX2J_{2}\approx P_{1}-egX_{2} and J3X1P2X2P1egJ_{3}\approx X_{1}P_{2}-X_{2}P_{1}-eg, where terms of order θ2\theta^{2} were neglected. In this sector, the algebra reduces to [Xi,Pj]=iδijN3[X_{i},P_{j}]=i\hbar\delta_{ij}N_{3}, J3J_{3} behaves as the generator of rotations for XiX_{i} and PiP_{i}, and N3N_{3} becomes a central element (taking the value 11 in the relevant representation). This deformation of the algebra is known as the Inönü-Wigner contraction. At the group level, this corresponds to a deformation of the Euclidean group E3=3SO(3)E_{3}=\mathbb{R}^{3}\rtimes SO(3) into (2×)(2SO(2))(\mathbb{R}^{2}\times\mathbb{R})\rtimes(\mathbb{R}^{2}\rtimes SO(2)), where the first factor is generated by (X1,X2;N3)(X_{1},X_{2};N_{3}) and the second by (P1,P2;J3)(P_{1},P_{2};J_{3}). This contracted group can be reexpressed as H(2)SO(2)H(2)\rtimes SO(2), where H(2)H(2) is the Heisenberg group in two spatial dimensions, generated by (X1,P1,X2,P2;N3)(X_{1},P_{1},X_{2},P_{2};N_{3}), and SO(2)SO(2) is the rotation group around the origin, generated by J3J_{3}. Note that S:=J3(X1P2X2P1)=egS:=J_{3}-(X_{1}P_{2}-X_{2}P_{1})=-eg is a Casimir operator, interpreted as the intrinsic spin of the particle. Since J3J_{3} differs from X1P2X2P1X_{1}P_{2}-X_{2}P_{1} only by a Casimir operator, it follows that the irreducible representations of H(2)SO(2)H(2)\rtimes SO(2) are also irreducible when restricted to H(2)H(2). As H(2)H(2) has a unique irreducible unitary representation, this confirms that we do in fact recover the quantum mechanics on a plane (for any value of the intrinsic spin Casimir SS). It is interesting to note an important difference between the plane and the sphere: the subgroup SO(2)SO(2) of E3E_{3} was “pulled out” of SO(3)SO(3) during the deformation, so it appears as a factor in H(2)SO(2)H(2)\rtimes SO(2) rather than as a subgroup of SO(3)SO(3). The spin is therefore not quantized on a plane, because the SO(2)SO(2) gets “unwrapped” to \mathbb{R} when considering projective representations (that is, when considering the universal cover of the group). Thus the quantization of the spin on a sphere is a truly topological effect.

In conclusion, we have seen that the problem of quantizing a particle on a sphere is quite effective in revealing some of the subtleties associated with a non-trivial phase space topology. It also serves to illustrate how Isham’s scheme, which provides a general class of quantum theories compatible with the classical kinematics, must be paired with additional principles (e.g., Casimir matching) and considerations about dynamics in order to single out a preferred quantum theory. The fact that non-trivial phase space topology affects global aspects of quantization applies as well to quantum field theories, for which the implications are not yet fully known. In particular, the longstanding problem of quantizing general relativity, as a candidate theory of quantum gravity, is a prime example: even before imposing the Hamiltonian constraints, if the configuration space is given by metrics (or co-frame fields) on a spatial slice, the condition that these are non-degenerate (and of positive signature) leads to a phase space with non-trivial topology. This suggests that standard canonical commutation relations are not appropriate, and that instead the quantization should be based on an affine algebra [47, 48, 49, 50, 51], a conclusion that is also reached by the application of Isham’s global, group theoretic quantization scheme [52, 53].

Acknowledgements

We are grateful to M. Greiter for helpful correspondence. This research was supported in part by the National Science Foundation under Grants PHY-1708139 at UMD and PHY-1748958 at KITP.

Appendix A No central extensions for the 3\mathcal{E}_{3} algebra

In this section we offer a simple derivation of the fact that the algebra 3\mathcal{E}_{3} (3.20) of the Euclidean group E3E_{3} does not admit non-trivial central extensions by 2-cocycles. Although this result is presumably well-known, we have not found a reference proving it explicitly.212121There are some general theorems that can be applied to the Euclidean algebra, such as Proposition 1 of [54] or Proposition 14.1 of [55]. These theorems imply that n\mathcal{E}_{n} does not admit non-trivial central extensions except for n=2n=2.

Let 𝔤\mathfrak{g} be a Lie algebra and let 𝔤~=𝔤S\widetilde{\mathfrak{g}}=\mathfrak{g}\oplus_{S}\mathbb{R} be a central extension. If ξ𝔤\xi\in\mathfrak{g} and rr\in\mathbb{R}, we generically define the product on 𝔤~\widetilde{\mathfrak{g}} as

[(ξ,r),(ξ,r)]=([ξ,ξ],θ(ξ,ξ)),\left[(\xi,r),(\xi^{\prime},r^{\prime})\right]=\left([\xi,\xi^{\prime}],\theta(\xi,\xi^{\prime})\right)\,, (A.1)

for some function θ:𝔤2\theta:\mathfrak{g}^{2}\rightarrow\mathbb{R}. If this is to form a Lie algebra, θ\theta must be linear, antisymmetric and, due to Jacobi identity, satisfy

θ(ξ,[ξ,ξ′′])+θ(ξ,[ξ′′,ξ])+θ(ξ′′,[ξ,ξ])=0,\theta(\xi,[\xi^{\prime},\xi^{\prime\prime}])+\theta(\xi^{\prime},[\xi^{\prime\prime},\xi])+\theta(\xi^{\prime\prime},[\xi,\xi^{\prime}])=0\,, (A.2)

which is called the 2-cocycle condition. The extension is said to be trivial if θ(ξ,ξ)=f([ξ,ξ])\theta(\xi,\xi^{\prime})=f([\xi,\xi^{\prime}]) for some linear f:𝔤f:\mathfrak{g}\rightarrow\mathbb{R}. (In this case, note that ξ(ξ,f(ξ))\xi\mapsto(\xi,f(\xi)) is a homomorphism from 𝔤\mathfrak{g} to 𝔤~\widetilde{\mathfrak{g}}.)

Consider now the 3\mathcal{E}_{3} algebra. Define

θij=θ(Ji,Jj)\displaystyle\theta_{ij}=\theta(J_{i},J_{j})
θαβ=θ(Nα,Nβ)\displaystyle\theta_{\alpha\beta}=\theta(N_{\alpha},N_{\beta})
θαi=θ(Nα,Ji),\displaystyle\theta_{\alpha i}=\theta(N_{\alpha},J_{i})\,, (A.3)

where Latin and Greek indices are used to distinguish these components (i.e., θij\theta_{ij} represents a different set of numbers than θαβ\theta_{\alpha\beta}). These components can be rewritten as

θij=ϵijkwk\displaystyle\theta_{ij}=\epsilon_{ijk}w_{k}
θαβ=ϵαβγhγ\displaystyle\theta_{\alpha\beta}=\epsilon_{\alpha\beta\gamma}h_{\gamma}
θαi=ϵαiβwβ,\displaystyle\theta_{\alpha i}=\epsilon_{\alpha i\beta}w_{\beta}\,, (A.4)

for the nine numbers wiw_{i}, hαh_{\alpha} and wαw_{\alpha}.

For a trivial extension, we must have

θij=θ(Ji,Jj)=f([Ji,Jj])=f(ϵijkJk)=ϵijkfk\displaystyle\theta_{ij}=\theta(J_{i},J_{j})=f([J_{i},J_{j}])=f(\epsilon_{ijk}J_{k})=\epsilon_{ijk}f_{k}
θαβ=θ(Nα,Nβ)=f([Nα,Nβ])=0\displaystyle\theta_{\alpha\beta}=\theta(N_{\alpha},N_{\beta})=f([N_{\alpha},N_{\beta}])=0
θαi=θ(Nα,Ji)=f([Nα,Ji])=f(ϵαiβNβ)=ϵαiβfβ,\displaystyle\theta_{\alpha i}=\theta(N_{\alpha},J_{i})=f([N_{\alpha},J_{i}])=f(\epsilon_{\alpha i\beta}N_{\beta})=\epsilon_{\alpha i\beta}f_{\beta}\,, (A.5)

where fi=f(Ji)f_{i}=f(J_{i}) and fα=f(Nα)f_{\alpha}=f(N_{\alpha}). We conclude that the extension is trivial if and only if hα=0h_{\alpha}=0, for in this case we can always define fi=wif_{i}=w_{i} and fα=wαf_{\alpha}=w_{\alpha}.

Consider the cocycle condition for two NN’s and one JJ,

θ(Nα,[Nβ,Ji])+θ(Nβ,[Ji,Nα])+θ(Ji,[Nα,Nβ])=0,\theta(N_{\alpha},[N_{\beta},J_{i}])+\theta(N_{\beta},[J_{i},N_{\alpha}])+\theta(J_{i},[N_{\alpha},N_{\beta}])=0\,, (A.6)

which gives ϵβiγθαγ+ϵiαγθβγ=0\epsilon_{\beta i\gamma}\theta_{\alpha\gamma}+\epsilon_{i\alpha\gamma}\theta_{\beta\gamma}=0. Or, in terms of hαh_{\alpha}, δiαhβδiβhα=0\delta_{i\alpha}h_{\beta}-\delta_{i\beta}h_{\alpha}=0. Contracting ii and α\alpha, we get hβ=0h_{\beta}=0, proving that the 3\mathcal{E}_{3} algebra admits only trivial central extensions.

Appendix B Details on the construction of the Hilbert space

In this appendix we explain some of the details involved in the construction of the Hilbert space of the theory, presented in section 4.3. In particular, we want to derive equations (4.9)-(4.15). For simplicity, we use =1\hbar=1 in this section.

Note first that the angular momentum algebra,

[J3,J±]=±J±,[J+,J]=2J3,[J_{3},J_{\pm}]=\pm J_{\pm}\,,\qquad[J_{+},J_{-}]=2J_{3}\,, (B.1)

alone yields the action of JiJ_{i} on the basis states (4.7):

J3|j,m\displaystyle J_{3}|j,m\rangle =m|j,m,\displaystyle=m|j,m\rangle\,, (B.2)
J|j,m\displaystyle J_{-}|j,m\rangle =|j,m1,\displaystyle=|j,m-1\rangle\,, (B.3)
J+|j,m\displaystyle J_{+}|j,m\rangle =[j(j+1)m(m+1)]|j,m+1.\displaystyle=\left[j(j+1)-m(m+1)\right]|j,m+1\rangle\,. (B.4)

Moreover, the norms of all the states |j,m|j,m\rangle are related to the norm of |j,j|j,j\rangle recursively, via

j,\displaystyle\langle j, m1|j,m1=2mj,m|j,m+\displaystyle m-1|j,m-1\rangle=2m\langle j,m|j,m\rangle\,+
+[j(j+1)m(m+1)]2j,m+1|j,m+1,\displaystyle+[j(j+1)-m(m+1)]^{2}\langle j,m+1|j,m+1\rangle\,, (B.5)

hence they all have positive norm provided the edge states |j,j|j,j\rangle do. Thus we need only consider the action on and norms of the edge states.

For the rest, we use the algebra relations

[J3,N±]=±N3,[J±,N3]=N±,[J±,N]=±2N3,[J_{3},N_{\pm}]=\pm N_{3}\,,\quad[J_{\pm},N_{3}]=\mp N_{\pm}\,,\quad[J_{\pm},N_{\mp}]=\pm 2N_{3}\,, (B.6)

(and [J±,N±]=0[J_{\pm},N_{\pm}]=0). Using the last of these, the Casimir NJ=sN\cdot J=s can be written as

NJ=N3(J3+1)+JN++NJ+2,N\cdot J=N_{3}(J_{3}+1)+\frac{J_{-}N_{+}+N_{-}J_{+}}{2}\,, (B.7)

and applying this on |j,j|j,j\rangle yields

N3|j,j=sj+1|j,j12(j+1)|j+1,j.N_{3}|j,j\rangle=\frac{s}{j+1}|j,j\rangle-\frac{1}{2(j+1)}|j+1,j\rangle\,. (B.8)

To find N|j,jN_{-}|j,j\rangle for j>j0j>j_{0}, we can write

N|j,j\displaystyle N_{-}|j,j\rangle =NN+|j1,j1\displaystyle=N_{-}N_{+}|j-1,j-1\rangle
=(1N32)|j1,j1,\displaystyle=\left(1-N_{3}^{2}\right)|j-1,j-1\rangle\,, (B.9)

in which we set N2=1N^{2}=1. Using formula (B.8) we have

N32\displaystyle N_{3}^{2} |j1,j1=N3(sj|j1,j112jJ|j,j)\displaystyle|j-1,j-1\rangle=N_{3}\left(\frac{s}{j}|j-1,j-1\rangle-\frac{1}{2j}J_{-}|j,j\rangle\right)
=sjN3|j1,j112j(JN3N)|j,j,\displaystyle=\frac{s}{j}N_{3}|j-1,j-1\rangle-\frac{1}{2j}(J_{-}N_{3}-N_{-})|j,j\rangle\,, (B.10)

which together with (B.8) and (B) yields

N|j,j\displaystyle N_{-}|j,j\rangle =2j2j+1(1s2j2)|j1,j1+\displaystyle=\frac{2j}{2j+1}\left(1-\frac{s^{2}}{j^{2}}\right)|j-1,j-1\rangle\,+
+\displaystyle+\, sj(j+1)|j,j1|j+1,j12(2j+1)(j+1),\displaystyle\frac{s}{j(j+1)}|j,j-1\rangle-\frac{|j+1,j-1\rangle}{2(2j+1)(j+1)}\,, (B.11)

provided that j>j0j>j_{0}.

Next we compute the norms N+|j,j2\|N_{+}|j,j\rangle\|^{2} and L(j)|j,j2\|L^{(j)}|j,j\rangle\|^{2}, where L(j)L^{(j)} is the jj-lowering operator defined in (4.11),

L(j)=Nsj(j+1)J+12(2j+1)(j+1)(J)2N+.L^{(j)}=N_{-}-\frac{s}{j(j+1)}J_{-}+\frac{1}{2(2j+1)(j+1)}(J_{-})^{2}N_{+}\,. (B.12)

In what follows, we use \approx to denote operator identities that are valid only within j,j||j,j\langle j,j|\cdots|j,j\rangle. For the raised state we compute

NN+=1N32\displaystyle N_{-}N_{+}=1-N_{3}^{2}\approx
1(sj+112(j+1)NJ+)(sj+112(j+1)JN+)\displaystyle 1-\left(\frac{s}{j+1}-\frac{1}{2(j+1)}N_{-}J_{+}\right)\left(\frac{s}{j+1}-\frac{1}{2(j+1)}J_{-}N_{+}\right)
1s2(j+1)212(j+1)NN+,\displaystyle\qquad\approx 1-\frac{s^{2}}{(j+1)^{2}}-\frac{1}{2(j+1)}N_{-}N_{+}\,, (B.13)

and, solving for NN+N_{-}N_{+},

NN+2(j+1)2j+3(1s2(j+1)2),N_{-}N_{+}\approx\frac{2(j+1)}{2j+3}\left(1-\frac{s^{2}}{(j+1)^{2}}\right)\,, (B.14)

which yields (4.13) for the squared norm. For the lowered state we have

L(j)L(j)j(2j+3)(j+1)(2j+1)NN+2s2j(j+1)2,L^{(j){\dagger}}L^{(j)}\approx\frac{j(2j+3)}{(j+1)(2j+1)}N_{-}N_{+}-\frac{2s^{2}}{j(j+1)^{2}}\,, (B.15)

and using the result for NN+N_{-}N_{+} we get

L(j)L(j)2j2j+1(1s2j2),L^{(j){\dagger}}L^{(j)}\approx\frac{2j}{2j+1}\left(1-\frac{s^{2}}{j^{2}}\right)\,, (B.16)

which yields (4.14) for the squared norm.

Appendix C Wavefunctions and Chern numbers

In this appendix we review Mackey’s approach, and apply it to the construction of the irreducible unitary representations of the Euclidean group E3E_{3} (and of its universal cover), which recovers the usual concept of wavefunctions as sections of complex vector bundles. We show in particular that the value of the Casimir NJN\cdot J, which is related to the magnetic charge, determines the first Chern number of the bundle via the Casimir matching requirement. Interestingly, in geometric quantization [9, 18], where quantum states are also given by sections of a line bundle, the same relation between the magnetic charge and the bundle topology arises, albeit in a different manner: by construction, the line bundle carries a connection whose curvature is required to coincide with the symplectic form (1.1). Here we follow closely Isham’s presentation [1] of Mackey’s theory, using the language of fiber bundles. For a more technically complete presentation, framed in the language of measure theory, see for example [56] or [31].

As motivation, let us first consider only the SO(3)SO(3) part of the group. Given its natural action on S2S^{2}, lR(x)=:Rxl_{R}(x)=:Rx, we can construct a representation with wavefunctions ψ:S2d\psi:S^{2}\rightarrow\mathbb{C}^{d} defined by

(U(R)ψ)(x)=ψ(R1x),\left(U(R)\psi\right)(x)=\psi(R^{-1}x)\,, (C.1)

where RSO(3)R\in SO(3) and xS2x\in S^{2}. This representation is unitary with respect to the inner product ψ,ϕ=𝑑μψϕ\langle\psi,\phi\rangle=\int\!d\mu\,\psi^{*}\phi, where dμd\mu is the Euclidean measure on the sphere, but it is not irreducible.222222For example, in the case d=1d=1, the space of complex functions on the sphere can be expanded in spherical harmonics, but since each ll-subspace is invariant the representation is not irreducible. Also, it is not exhaustive (i.e., not all unitary representations have this form).

To generalize this, consider a Hermitian vector bundle over the sphere, dBS2\mathbb{C}^{d}\rightarrow B\rightarrow S^{2}.232323In the notation F(fiber)E(total space)M(base space)F\text{(fiber)}\rightarrow E\text{(total space)}\rightarrow M\text{(base space)}, the second arrow represents the bundle projection map from the total space to the base space, while the first arrow merely indicates that each fiber of the bundle is an embedded copy of FF. We want to take sections of this bundle, Ψ:S2B\Psi:S^{2}\rightarrow B, as the vectors of the representation. Since the analogue of (C.1) would involve a mapping between distinct fibers, we must require that the bundle admits a lift of the group action, LR:BBL_{R}:B\rightarrow B, satisfying τLR=lRτ\tau\circ L_{R}=l_{R}\circ\tau (compatibility with fiber structure) and LRLR=LRRL_{R}\circ L_{R^{\prime}}=L_{RR^{\prime}} (compatibility with group structure). In that case, we define

(U(R)Ψ)(x)=LR(Ψ(R1x)).\left(U(R)\Psi\right)(x)=L_{R}(\Psi(R^{-1}x))\,. (C.2)

Note that this makes sense because LRL_{R} maps the point Ψ(R1x)\Psi(R^{-1}x), on the fiber over R1xR^{-1}x, to a point on the fiber over xx. The Hermitian structure of BB, with inner product on each fiber denoted by (,)(\,,), gives rise to an inner product on the space of sections defined by

Ψ,Φ=S2𝑑μ(x)(Ψ(x),Φ(x)).\langle\Psi,\Phi\rangle=\int_{S^{2}}\!d\mu(x)\,\left(\Psi(x),\Phi(x)\right)\,. (C.3)

In order for the representation to be unitary with respect to this inner product, we must require that the group lift is compatible with the hermitian structure of the bundle, i.e., (LRz,LRz)=(z,z)\left(L_{R}z,L_{R}z^{\prime}\right)=(z,z^{\prime}), where zz and zz^{\prime} are points on the same fiber of BB. Note that (C.1) is the special case where both the bundle and the group lift are trivial.

To find the most general wavefunction representation of SO(3)SO(3) on S2S^{2}, we must classify all the bundles dBS2\mathbb{C}^{d}\rightarrow B\rightarrow S^{2} that admit such a lift of SO(3)SO(3). It is possible to show [2] that any such a bundle must be associated to the “master bundle” SO(2)SO(3)SO(3)/SO(2)S2SO(2)\rightarrow SO(3)\rightarrow SO(3)/SO(2)\sim S^{2}, with projection map equal to the quotient q:SO(3)SO(3)/SO(2)q:SO(3)\rightarrow SO(3)/SO(2), via some homomorphism 𝒰:SO(2)U(d)\mathscr{U}:SO(2)\rightarrow U(d). More precisely, BB is necessarily isomorphic to SO(3)×𝒰dSO(3)\times_{\mathscr{U}}\mathbb{C}^{d}, the bundle defined by the equivalence classes [R,z]=[Rh,𝒰(h1)z][R,z]=[Rh,\mathscr{U}(h^{-1})z], for RSO(3)R\in SO(3), zdz\in\mathbb{C}^{d} and hSO(2)SO(3)h\in SO(2)\subset SO(3), with projection map q𝒰([R,z])=q(R)q_{\mathscr{U}}([R,z])=q(R). The group action on this bundle is given by LR[R,z]:=[RR,z]L_{R}[R^{\prime},z]:=[RR^{\prime},z]. Note that 𝒰\mathscr{U} is a unitary representation of SO(2)SO(2), and we will later be interested in the irreducible ones. Since SO(2)SO(2) is abelian, its irreducible representations are one-dimensional, so only the case d=1d=1 is relevant. These representations are given by

𝒰(n)(θ)=einθ,\mathscr{U}^{\!(n)}(\theta)=e^{-in\theta}\,, (C.4)

where nn\in\mathbb{Z}. The choice of this representation is the only discrete choice that enters the construction of the general representation of E3E_{3} using Mackey theory, hence one can anticipate that nn must be related to the magnetic monopole index in (4.17).

We are now ready to consider the full group, 3SO(3)\mathbb{R}^{3*}\rtimes SO(3). For more transparency, let us consider first a generic group of the form VKV\rtimes K, where VV is a vector space and KK is a Lie group.242424Mackey’s theory also applies, with a minor modification, if VV is abelian and KK is a separable, locally compact group. When VV is not a vector space, we just need to replace below the dual space VV^{*} by the space of unitary characters Char(V)\text{Char}(V). The product rule is given by

(v,k)(v,k)=(v+ρkv,kk),(v,k)(v^{\prime},k^{\prime})=(v+\rho_{k}v^{\prime},kk^{\prime})\,, (C.5)

where vVv\in V, kKk\in K and ρ:KAut(V)\rho:K\rightarrow\text{Aut}(V) is a left KK-action on VV. Later we shall particularize to K=SO(3)K=SO(3) and V=3V=\mathbb{R}^{3*}, in the dual representation ρR=lR1\rho_{R}=l^{*}_{R^{-1}} (see (3.10)). Since a generic element (v,k)(v,k) can be decomposed as (v,e)(0,k)(v,e)(0,k), where ee is the identity element of KK, the operators representing VKV\rtimes K on a Hilbert space \mathcal{H} will factorize accordingly, U(v,k)=U(v,e)U(0,k)U(v,k)=U(v,e)U(0,k). We can define A(v):=U(v,e)A(v):=U(v,e) and D(k):=U(0,k)D(k):=U(0,k), so that

U(v,k)=A(v)D(k).U(v,k)=A(v)D(k)\,. (C.6)

Thus, in classifying the representations of VKV\rtimes K, we can study the representations of VV and KK separately, in the following manner.

Starting with VV, define the self-adjoint generators N(v)N(v) by

A(λv)=eiλN(v),A(\lambda v)=e^{-i\lambda N(v)}\,, (C.7)

where λ\lambda\in\mathbb{R}. Since VV is abelian, we have [N(v),N(v)]=0[N(v),N(v^{\prime})]=0, and N(v+λv)=N(v)+λN(v)N(v+\lambda v^{\prime})=N(v)+\lambda N(v^{\prime}), meaning that NN is a linear map from VV into a set of commuting, self-adjoint operators on \mathcal{H}. Accordingly, a simultaneous eigenvector |χ|\chi\rangle of N(v)N(v), for all vv, determines an element wVw\in V^{*}, such that

N(v)|χ=w(v)|χ.N(v)|\chi\rangle=w(v)|\chi\rangle\,. (C.8)

Nothing requires the eigenvalues of N(v)N(v) to be non-degenerate, so each ww may label a Hilbert (sub)space 𝒮w\mathcal{S}_{w}\subset\mathcal{H}. It follows from the group structure that the operator D(k)D(k) maps 𝒮w\mathcal{S}_{w} unitarily onto 𝒮ρ~kw\mathcal{S}_{\widetilde{\rho}_{k}w}, where ρ~k\widetilde{\rho}_{k} is the dual action of KK on VV^{*}, defined as ρ~kw(v)=w(ρk1v)\widetilde{\rho}_{k}w(v)=w(\rho_{k^{-1}}v) for all vVv\in V. The Hilbert space \mathcal{H} will be given by a “direct sum” (or rather, “direct integral”) of 𝒮w\mathcal{S}_{w} over ww’s in some region of VV^{*}. If D(k)D(k) is to act in a closed fashion, such a region must consist of one or more orbits of KK. To ensure irreducibility of the VKV\rtimes K representation, we must take this region to be a single KK-orbit 𝒪\mathcal{O} (or its closure) in VV^{*}. Roughly speaking,

``w𝒪𝒮w".\mathcal{H}\sim``\oplus_{w\in{\mathcal{O}}}\mathcal{S}_{w}"\,. (C.9)

More precisely, \mathcal{H} will be the space of sections of a vector bundle over 𝒪\mathcal{O}, with fibers 𝒮wd\mathcal{S}_{w}\sim\mathbb{C}^{d} (for some dimension dd). To classify these representations we must therefore classify the corresponding vector bundles.

As explained before for the case K=SO(3)K=SO(3), in order for a vector bundle dB𝒪\mathbb{C}^{d}\rightarrow B\rightarrow\mathcal{O} over 𝒪K/H\mathcal{O}\sim K/H (where HH is the little group corresponding to the orbit 𝒪\mathcal{O}) to carry a representation of KK, it must admit a lift of the KK-action, and thus must be associated to the master bundle HKK/HH\rightarrow K\rightarrow K/H via a unitary irreducible representation 𝒰:HU(d)\mathscr{U}:H\rightarrow U(d) of HH. Cross sections of this bundle, Ψ:𝒪B=K×𝒰d\Psi:\mathcal{O}\rightarrow B=K\times_{\mathscr{U}}\mathbb{C}^{d}, form a linear space which carries a representation of VKV\rtimes K. The element (v,k)(v,k) is represented by

(U(v,k)Ψ)(w)=eiw(v)dμkdμ(w)Lk(Ψ(ρ~k1w)),(U(v,k)\Psi)(w)=e^{-iw(v)}\sqrt{\frac{d\mu_{k}}{d\mu}(w)}\,L_{k}\left(\Psi(\widetilde{\rho}_{k^{-1}}w)\right)\,, (C.10)

where w𝒪w\in\mathcal{O} and LkL_{k} is the lift of KK to K×𝒰dK\times_{\mathscr{U}}\mathbb{C}^{d} defined by Lk[k,z]=[kk,z]L_{k}[k^{\prime},z]=[kk^{\prime},z]. The phase factor on the right-hand side is A(v)A(v), and the rest is the factor D(k)D(k), as in (C.6). Note that D(k)D(k) is analogous to (C.2), except for the Jacobian-like factor dμk/dμd\mu_{k}/d\mu, which deserves a few words. In the case of SO(3)SO(3), it is possible to define the inner product (C.3) using the Euclidean measure on the spherical orbit, and the invariance of this measure under SO(3)SO(3) implies that U(R)U(R) in (C.2) is unitary, so this Jacobian-like factor is not needed. In general, however, the orbit 𝒪\mathcal{O} may not admit an invariant measure,252525In many cases, such as when KK is locally compact and HH is compact, the Haar measure on KK can be pushed down to K/HK/H, defining an invariant measure on 𝒪\mathcal{O}. but fortunately it always admits a measure μ\mu that is quasi-invariant under KK. That is, μ\mu and its push-forward μk:=ρ~kμ\mu_{k}:=\widetilde{\rho}_{k*}\mu through ρ~k\widetilde{\rho}_{k} have the same sets of measure zero.262626Given a measurable map f:XYf:X\rightarrow Y and a measure μ\mu on XX, its push-forward to YY is defined as fμ[B]=μ[f1(B)]f_{*}\mu[B]=\mu[f^{-1}(B)], where BB is any Borel subset of YY and f1f^{-1} denotes the pre-image under ff. In order to make U(v,k)U(v,k) unitary under such a measure, we must introduce the Jacobian-like factor dμk/dμ{d\mu_{k}}/{d\mu}, called the Radon-Nikodym derivative of μk\mu_{k} with respect to μ\mu, which is a positive (μ\mu-almost-everywhere) continuous function on 𝒪\mathcal{O} satisfying μk[B]=Bdμkdμ𝑑μ\mu_{k}[B]=\int_{B}\frac{d\mu_{k}}{d\mu}d\mu for all Borel sets B𝒪B\subset\mathcal{O}. Representations defined for equivalent measures (i.e., having the same sets of measure zero) are unitarily equivalent, and since there is only one quasi-invariant measure on K/HK/H (up to equivalence), the measure in (C.10) is determined by 𝒪\mathcal{O} (up to equivalence).

Note that these representations are labeled by the choice of the orbit 𝒪\mathcal{O} and the little group representation 𝒰\mathscr{U}. These representations are irreducible as long as 𝒰\mathscr{U} is irreducible. If VV^{*} decomposes into regular KK-orbits, meaning that there exists a Borel map ζ:V/KV\zeta:V^{*}/K\rightarrow V^{*} that associates a dual vector to each orbit, then all unitary irreducible representations are generated in this way. This is Mackey’s main result.

In the case of interest, 3SO(3)\mathbb{R}^{3*}\rtimes SO(3), the space where the orbits live is (3)(\mathbb{R}^{3*})^{*}, which can be naturally identified with 3\mathbb{R}^{3}. Thus ρ~R\widetilde{\rho}_{R}, which acts on w3w\in\mathbb{R}^{3**} as wρR1w=lRww\mapsto\rho^{*}_{R^{-1}}w=l^{**}_{R}w, acts on x3x\in\mathbb{R}^{3} as xlRx=Rxx\mapsto l_{R}x=Rx. That is, SO(3)SO(3) acts on (3)3(\mathbb{R}^{3*})^{*}\sim\mathbb{R}^{3} in just the standard way. The orbits 𝒪\mathcal{O} decompose into two classes: spheres (with any radius) and a point (at the origin). The little group for the first kind is SO(2)SO(2), while for the second it is SO(3)SO(3). Since the orbits are regular, the irreducible unitary representations are labeled by the radius a+{0}a\in\mathbb{R}^{+}\cup\{0\} of the orbit and, for a>0a>0 (which is the case of interest), the integer nn\in\mathbb{Z} specifying the irreducible unitary representation 𝒰(n)(θ)=einθ\mathscr{U}^{\!(n)}(\theta)=e^{-in\theta} of the little group SO(2)SO(2). For a given value of nn, the Hilbert space consists of sections of the line bundle SO(3)×𝒰(n)SO(3)\times_{\mathscr{U}^{\!(n)}}\mathbb{C}.

How are the basic operators, NN and JJ, realized on this Hilbert space? Since the Euclidean measure on S2S^{2} is invariant under SO(3)SO(3), we do not have the Jacobian factor in (C.10), which simplifies to

(U(α,R)Ψ)(x)=eiα(x)/LRΨ(R1x),(U(\alpha,R)\Psi)(x)=e^{-i\alpha(x)/\hbar}L_{R}\Psi(R^{-1}x)\,, (C.11)

Note that we have introduced an \hbar in the phase factor, for notational convenience. In analogy with (4.1), we define more generally the generating operators JηJ_{\eta} and JαJ_{\alpha} via

U(exp(0,λη))=:eiλJη/\displaystyle U(\exp(0,\lambda\eta))=:e^{-i\lambda J_{\eta}/\hbar}
U(exp(λα,0))=:eiλNα/,\displaystyle U(\exp(\lambda\alpha,0))=:e^{-i\lambda N_{\alpha}/\hbar}\,, (C.12)

where η𝔰𝔬(3)3\eta\in\mathfrak{so}(3)\sim\mathbb{R}^{3} and α3\alpha\in\mathbb{R}^{3*}. It follows that

Jη=iddλU(exp(0,λη))|λ=0\displaystyle J_{\eta}=i\hbar\left.\frac{d}{d\lambda}U(\exp(0,\lambda\eta))\right|_{\lambda=0}
Nα=iddλU(exp(λα,0))|λ=0.\displaystyle N_{\alpha}=i\hbar\left.\frac{d}{d\lambda}U(\exp(\lambda\alpha,0))\right|_{\lambda=0}\,. (C.13)

Hence,

(JηΨ)(x)=i𝔇ηΨ(x)\displaystyle(J_{\eta}\Psi)(x)=-i\hbar\,\mathfrak{D}_{\eta}\Psi(x)
(NαΨ)(x)=α(x)Ψ(x),\displaystyle(N_{\alpha}\Psi)(x)=\alpha(x)\Psi(x)\,, (C.14)

where 𝔇\mathfrak{D} is a derivative operator defined by

𝔇ηΨ(x)=ddλLRληΨ(Rλη1x)|λ=0.\mathfrak{D}_{\eta}\Psi(x)=-\left.\frac{d}{d\lambda}L_{R_{\lambda\eta}}\Psi\left(R_{\lambda\eta}^{-1}x\right)\right|_{\lambda=0}\,. (C.15)

It satisfies 𝔇η(fΨ)=f𝔇ηΨ+Xη(f)Ψ\mathfrak{D}_{\eta}(f\Psi)=f\mathfrak{D}_{\eta}\Psi+X_{\eta}(f)\Psi, where f:𝒪f:\mathcal{O}\rightarrow\mathbb{C} and XηX_{\eta} is the vector field (tangent to 𝒪S2\mathcal{O}\sim S^{2}) generated by η\eta.

We now wish to relate the Casimirs N2N^{2} and NJN\cdot J with the labels aa and nn of the wavefunction representations. In an orthonormal basis eie_{i} for 3\mathbb{R}^{3}, and dual basis eie^{i} for 3\mathbb{R}^{3*}, we have N2=i=13(Nei)2N^{2}=\sum_{i=1}^{3}(N_{e^{i}})^{2}, and NeiΨ(x)=ei(x)Ψ(x)=xiΨ(x)N_{e^{i}}\Psi(x)=e^{i}(x)\Psi(x)=x^{i}\Psi(x), so

N2Ψ(x)=x2Ψ(x)=a2Ψ(x).N^{2}\Psi(x)=x^{2}\Psi(x)=a^{2}\Psi(x)\,. (C.16)

Thus, not surprisingly, N2N^{2} corresponds to the radius squared, a2a^{2}, of the sphere. The representation scales trivially with N2N^{2}, and the choice we made previously was N2=1N^{2}=1, so we here consider also the case a=1a=1.

Next we consider NJN\cdot J, which as we now show is related with the index nn of the bundle SO(3)×𝒰(n)SO(3)\times_{\mathscr{U}^{\!(n)}}\mathbb{C}. We establish this in two independent ways. In the first way, we show how NJN\cdot J is related to the little group representation (C.4), which is labeled by nn. In the second way, we show how NJN\cdot J can be directly related to the integrated curvature of a suitably constructed connection, which directly yields the Chern number of the bundle, and thus the index nn. In this second way, we will not need to invoke Schur’s lemma, but rather it will be a consequence of the construction that NJN\cdot J is proportional to the identity, similarly to what happened with N2N^{2} above.

In the first way, we note that since NJN\cdot J is a Casimir in the algebra, it must according to Schur’s lemma be proportional to the identity, so it suffices to evaluate its action on a single state Ψ(x)\Psi(x) at any given point xx. Let us take xx to be the north pole u=(0,0,1)u=(0,0,1), and construct the bundle SO(3)×𝒰(n)SO(3)\times_{\mathscr{U}^{\!(n)}}\mathbb{C} using the SO(2)SO(2) little group of uu. Then we have NJΨ(u)=J3Ψ(u)N\cdot J\Psi(u)=J_{3}\Psi(u) and, since uu is a fixed point of Rλe3R_{\lambda e_{3}}, the value of J3Ψ(u)J_{3}\Psi(u) depends only on the value of Ψ\Psi at uu (as opposed to in a neighborhood of uu). Denoting Ψ(u)=[1,z]SO(3)×𝒰(n)\Psi(u)=[1,z]\in SO(3)\times_{\mathscr{U}^{\!(n)}}\mathbb{C}, where 11 is the identity element of SO(3)SO(3) and zz\in\mathbb{C}, we have

J3Ψ(u)\displaystyle J_{3}\Psi(u) =i𝔇e3Ψ(u)\displaystyle=-i\hbar\,\mathfrak{D}_{e_{3}}\Psi(u)
=iddλLRλe3Ψ(u)\displaystyle=i\hbar\frac{d}{d\lambda}L_{R_{\lambda e_{3}}}\Psi(u)
=iddλ[Rλe3,z]\displaystyle=i\hbar\frac{d}{d\lambda}[R_{\lambda e_{3}},z]
=iddλ[1,𝒰(Rλe3)z]\displaystyle=i\hbar\frac{d}{d\lambda}[1,\mathscr{U}(R_{\lambda e_{3}})z]
=iddλeinλ[1,z]\displaystyle=i\hbar\frac{d}{d\lambda}e^{-in\lambda}[1,z]
=nΨ(u),\displaystyle=\hbar n\Psi(u)\,, (C.17)

where d/dλd/d\lambda is evaluated at λ=0\lambda=0 at every step. In the second line we used that Rλe31u=uR^{-1}_{\lambda e_{3}}u=u; in the third line we used the definition of the group lift to the associated bundle, LR[R,z]=[RR,z]L_{R}[R^{\prime},z]=[RR^{\prime},z]; in the fourth line we used the defining property of the associated bundle SO(3)×𝒰(n)SO(3)\times_{\mathscr{U}^{\!(n)}}\mathbb{C}, [R,z]=[Rh,𝒰(h1)z][R,z]=[Rh,\mathscr{U}(h^{-1})z], with h=Rλe31h=R^{-1}_{\lambda e_{3}}; and in the fifth line we used (C.4). Therefore JNΨ(x)=nΨ(x)J\cdot N\Psi(x)=\hbar n\Psi(x), so we conclude that the value of the Casimir invariant JNJ\cdot N is determined by the little group representation 𝒰(n)\mathscr{U}^{\!(n)}. Given the identification JN=egJ\cdot N=-eg, we obtain the Schwinger condition eg=neg=-n\hbar, which is more restrictive than Dirac’s condition (4.17).

In order to include also projective representations of the quantizing group, we must extend SO(3)SO(3) to SU(2)SU(2) and repeat the same analysis. The relevant master principal bundle is then the Hopf bundle U(1)SU(2)SU(2)/U(1)S2U(1)\rightarrow SU(2)\rightarrow SU(2)/U(1)\sim S^{2}. Because the little group is U(1)U(1), the associated bundles are again constructed with the representations 𝒰(n)\mathscr{U}^{\!(n)}, so the quantum states are represented by sections of SU(2)×𝒰(n)SU(2)\times_{\mathscr{U}^{\!(n)}}\mathbb{C}. The group SU(2)SU(2) acts on the sphere as

eivσ(xσ)eivσ=(R2vx)σ,e^{iv\cdot\sigma}(x\cdot\sigma)e^{-iv\cdot\sigma}=(R_{2v}x)\cdot\sigma\,, (C.18)

where v3v\in\mathbb{R}^{3}, xS23x\in S^{2}\subset\mathbb{R}^{3} and σ=(σ1,σ2,σ3)\sigma=(\sigma_{1},\sigma_{2},\sigma_{3}) are the Pauli matrices. To obtain the usual normalization for the 𝔰𝔲(2)\mathfrak{su}(2) algebra (i.e., with structure constants fijk=εijkf_{ijk}=\varepsilon_{ijk}) we must take the basis {σ1/2,σ2/2,σ3/2}\{\sigma_{1}/2,\sigma_{2}/2,\sigma_{3}/2\}, so that e3=σ3/2e_{3}=\sigma_{3}/2, and thus J3=Jη=σ3/2J_{3}=J_{\eta=\sigma_{3}/2}. Repeating the steps of the previous derivation, the only difference is that the little group phase factor is 𝒰(ei(σ3/2)λ)=einλ/2\mathscr{U}(e^{i(\sigma_{3}/2)\lambda})=e^{-in\lambda/2}, which leads to J3Ψ(x)=n/2J_{3}\Psi(x)=\hbar n/2. That is, s=n/2s=n/2, matching Dirac’s condition.

In the second way of evaluating NJN\cdot J, we note that the Chern number is a topological property of the bundle, as it does not depend on the connection used to evaluate it, so we shall just use the structures available to construct some arbitrary connection. The natural ingredient to use is the derivative operator 𝔇\mathfrak{D}, which is defined in (C.15) in terms of the group lift LRL_{R}. A possible definition for a covariant derivative is

VΨ(x)=𝔇x×VΨ(x),\nabla_{V}\Psi(x)=\mathfrak{D}_{x\times V}\Psi(x)\,, (C.19)

where VTxS2Tx3V\in T_{x}S^{2}\subset T_{x}\mathbb{R}^{3}. One can check that this \nabla satisfies all properties of a covariant derivative, so it defines a connection on the bundle. Using (C.14) we can write it as272727From (3.21) we see that iV-i\hbar\nabla_{V} is nothing more than the “natural” quantization of the classical linear momentum along VV, since Vp=V(N×J)V\cdot p=-V\cdot(N\times J), recovering the picture that pp acts as a derivative on wave functions.

VΨ=iV(N×J)Ψ.\nabla_{V}\Psi=-\frac{i}{\hbar}\,V\cdot(N\times J)\Psi\,. (C.20)

The curvature FF of this connection is defined by

(XYYX[X,Y])Ψ=F(X,Y)Ψ,(\nabla_{X}\nabla_{Y}-\nabla_{Y}\nabla_{X}-\nabla_{[X,Y]})\Psi=F(X,Y)\Psi\,, (C.21)

for vector fields XX and YY on S2S^{2}. Treating XX and YY as vectors on 3\mathbb{R}^{3}, tangent to the sphere, and using (C.20), we obtain

F(X,Y)=12XiYj[(N×J)i,(N×J)j].F(X,Y)=-\frac{1}{\hbar^{2}}X^{i}Y^{j}[(N\times J)_{i},(N\times J)_{j}]\,. (C.22)

Using the algebra (4.2) of NN and JJ we get

F(X,Y)=1iN2(X×Y)J.F(X,Y)=\frac{1}{i\hbar}N^{2}(X\times Y)\cdot J\,. (C.23)

Since X×YX\times Y is normal to the sphere, it acts on wavefunctions (in the representation with N2=1N^{2}=1) as ϵ(X,Y)N\epsilon(X,Y)N, where ϵ=sinθdθdϕ\epsilon=\sin\theta d\theta\wedge d\phi is the area form on the unit sphere. Thus the curvature 2-form can be expressed as

F=1iϵNJ.F=\frac{1}{i\hbar}\,\epsilon\,N\cdot J\,. (C.24)

We see that NJN\cdot J must act on sections as a function, i.e., NJΨ(x)=s(x)Ψ(x)N\cdot J\,\Psi(x)=\hbar s(x)\Psi(x), for some s:S2s:S^{2}\rightarrow\mathbb{R}. Because the connection was constructed in a rotation invariant way, the curvature must also be rotation invariant. This implies that ss is actually a constant, which is consistent with the fact that NJN\cdot J is a Casimir operator in an irreducible representation. The corresponding first Chern number is then

C1=S2i2πF=s2πS2ϵ=2s,C_{1}=\int_{S^{2}}\frac{i}{2\pi}F=\frac{s}{2\pi}\int_{S^{2}}\epsilon={2s}\,, (C.25)

which shows that the Casimir NJN\cdot J is directly related to this topological number, as anticipated. The possible values of ss are quantized, since the bundle SO(3)×𝒰(n)SO(3)\times_{\mathscr{U}^{\!(n)}}\mathbb{C} has first Chern number 2n2n, while the extended bundle SU(2)×𝒰(n)SU(2)\times_{\mathscr{U}^{\!(n)}}\mathbb{C} has first Chern number nn. Note that this is consistent with our previous result, where s=NJ/s=N\cdot J/\hbar was shown to be related to the bundle index nn.

Appendix D Non-uniform magnetic fields

In this appendix we discuss the case where the magnetic field is not uniform with respect to the metric on the sphere. In principle, we could proceed as before, including the magnetic field in the symplectic form, so maintaining gauge invariance explicitly. However, unless BB is uniform, no SO(3)SO(3) subgroup of the BB-preserving diffeomorphisms of the sphere would be an isometry of the (round) metric. Consequently, there would be no preferred form for the Hamiltonian, leading to operator-ordering ambiguities in the quantization. Thus, it is convenient to split BB into its monopole and higher-pole parts, B=gϵ+dAB=g\epsilon+dA, where ϵ\epsilon is the normalized volume form (i.e., whose integral is 4π4\pi) invariant under the isometries of the metric and AA is a globally-defined potential 1-form, and then include only the monopole term, gϵg\epsilon, in the symplectic form, as in (1.1), while including the higher-pole part, dAdA, in the Hamiltonian via the usual minimal coupling. In this way, the quantizing group and the associated canonical observables, JJ and NN, depend only on the monopole term, while the higher-pole term affects only the Hamiltonian.

In a given global gauge the Hamiltonian reads

H=12m(peA)2,H=\frac{1}{2m}(p-eA)^{2}\,, (D.1)

in which (peA)2=hab(peA)a(peA)b(p-eA)^{2}=h^{ab}(p-eA)_{a}(p-eA)_{b}, where hh is the metric on the sphere. Assuming a round metric, and using expression (4.21) in terms of the Killing vector fields XiX_{i} (generated by JiJ_{i}), the Hamiltonian becomes

H=12mr2[JiegNieA(Xi)]2,H=\frac{1}{2mr^{2}}\left[J_{i}-egN_{i}-eA(X_{i})\right]^{2}\,, (D.2)

where a summation over i{1,2,3}i\in\{1,2,3\} is implicit. Since A(Xi)A(X_{i}) is a function of the position only, it can be written in terms of NN unambiguously. In the wavefunction realization of Appendix C, it acts simply by multiplication.

Now we must worry about gauge-invariance. In particular, we must ensure that the same theory is obtained if one uses another choice of (global) gauge AA^{\prime}. Since the sphere is simply connected, we have

A=A+dσ,A^{\prime}=A+d\sigma\,, (D.3)

for some function σ:S2\sigma:S^{2}\rightarrow\mathbb{R}. Note that

H=12mr2(peA)2=12mr2(pedσeA)2,H^{\prime}=\frac{1}{2mr^{2}}(p-eA^{\prime})^{2}=\frac{1}{2mr^{2}}(p-ed\sigma-eA)^{2}\,, (D.4)

which has the same form as the original Hamiltonian if we define a new momentum variable p=pd(eσ)p^{\prime}=p-d(e\sigma). This corresponds to a momentum translation, defined as

Kσ(p)=pd(eσ),K_{\sigma}(p)=p-d(e\sigma)\,, (D.5)

which is a symplectomorphism of the phase space satisfying H=KσHH^{\prime}=K_{\sigma}^{*}H. Note that it maps the symplectic flow of HH^{\prime} into that of HH and, being vertical on the phase space, it leaves unchanged the projection of the dynamical trajectories to the configuration space. Therefore the two gauge-related Hamiltonians produce equivalent classical dynamics. As to the quantization, note first that H=KσHH^{\prime}=K_{\sigma}^{*}H implies that HH^{\prime} has the same functional form when written in terms of transformed charges Q=KσQQ^{\prime}=K_{\sigma}^{*}Q as HH written in terms of QQ. That is, if H=f(Q)H=f(Q) then H=f(Q)H^{\prime}=f(Q^{\prime}). Since the Poisson brackets is defined from the symplectic structure, which is invariant under KσK_{\sigma}, we have that the algebra of charges is preserved under such a transformation, i.e., {KσQi,KσQj}=Kσ{Qi,Qj}\{K_{\sigma}^{*}Q_{i},K_{\sigma}^{*}Q_{j}\}=K_{\sigma}^{*}\{Q_{i},Q_{j}\}. Thus the charges QQ^{\prime} satisfy the same algebra as QQ. Moreover the Casimirs are functionals of the charges, with form depending only on the algebra, so it should be that C=KCC^{\prime}=K^{*}C. But since CC is constant on the phase space, we have C=CC^{\prime}=C. Consequently, as the charges satisfy the same algebra, with the same Casimir values, the quantizations are equivalent (provided the same ordering prescription is applied to the Hamiltonian).

At the group level, the charges QQ and QQ^{\prime} generate the same group GG, but realized differently on the phase space. Namely, if the charges QQ generate a realization Λ:GDiff(𝒫)\Lambda:G\rightarrow\text{Diff}(\mathcal{P}) of GG as symplectomorphisms of 𝒫\mathcal{P}, then it can be shown that QQ^{\prime} generate the transformed realization, Λ\Lambda^{\prime}, defined by

Λg=Kσ1ΛgKσ.\Lambda^{\prime}_{g}=K_{\sigma}^{-1}\circ\Lambda_{g}\circ K_{\sigma}\,. (D.6)

Therefore, we see that a change of gauge in the Hamiltonian can be “reversed” by simply changing the way that the quantizing group acts on the phase space. Since it is the same group (with the same Casimir values) which is undergoing quantization, the same quantum theory is obtained. At the quantum level, this change of realization corresponds to a unitary transformation on the Hilbert space. Mirroring (D.6), the KK-transformation is implemented as

U(Λg)=T(Kσ)U(Λg)T(Kσ),U(\Lambda^{\prime}_{g})=T(K_{\sigma})^{\dagger}U(\Lambda_{g})T(K_{\sigma})\,, (D.7)

where T(Kσ)T(K_{\sigma}) is a unitary transformation, ensuring that the representation U(Λg)U(\Lambda^{\prime}_{g}) is equivalent to U(Λg)U(\Lambda_{g}). Since KσK_{\sigma} is simply a generalized type of momentum translation, the action of TT is defined, on the wave functions of Appendix C, analogously to NN in (C.14), as

T(Kσ)Ψ(x)=eieσ(x)/Ψ(x).T(K_{\sigma})\Psi(x)=e^{-ie\sigma(x)/\hbar}\Psi(x)\,. (D.8)

To verify that this is the desired transformation, define the “momentum operator” by

p(V)=V(N×J)=iV,p(V)=V\cdot(-N\times J)=-i\hbar\nabla_{V}\,, (D.9)

where VV is a vector field on the sphere and \nabla is the covariant derivative defined in (C.19). Conjugating by T(K)T(K) we get

T(Kσ)p(V)T(Kσ)Ψ(x)=(pedσ)(V)Ψ(x)=p(V)Ψ(x),T(K_{\sigma})^{\dagger}p(V)T(K_{\sigma})\Psi(x)=(p-ed\sigma)(V)\Psi(x)=p^{\prime}(V)\Psi(x)\,, (D.10)

where pp^{\prime} is precisely the momentum operator defined from the modified charges, i.e., p(V)=V(N×J)p^{\prime}(V)=V\cdot(-N^{\prime}\times J^{\prime}). Therefore, since this implies that H=T(Kσ)HT(Kσ)H^{\prime}=T(K_{\sigma})^{\dagger}HT(K_{\sigma}), the standard picture where a change of gauge corresponds to a phase transformation on wave functions is recovered.

References

  • [1] C. J. Isham, “Topological and global aspects of quantum theory,” in Relativity, groups and topology. 2 (B. S. DeWitt and R. Stora, eds.), North-Holland Physics Pub., 1984.
  • [2] C. Isham, “Canonical groups and the quantization of general relativity,” Nuclear Physics B-Proceedings Supplements, vol. 6, pp. 349–356, 1989.
  • [3] H. Kleinert, “Path integral on spherical surfaces in DD dimensions and on group spaces,” Physics Letters B, vol. 236, no. 3, pp. 315–320, 1990.
  • [4] H. Kleinert and S. V. Shabanov, “Proper Dirac quantization of free particle on DD-dimensional sphere,” Phys. Lett. A, vol. 232, pp. 327–332, 1997.
  • [5] N. P. Landsman and N. Linden, “The geometry of inequivalent quantizations,” Nuclear Physics B, vol. 365, no. 1, pp. 121–160, 1991.
  • [6] Y. Ohnuki and S. Kitakado, “Fundamental algebra for quantum mechanics on SDS^{D} and gauge potentials,” Journal of mathematical physics, vol. 34, no. 7, pp. 2827–2851, 1993.
  • [7] D. McMullan and I. Tsutsui, “On the emergence of gauge structures and generalized spin when quantizing on a coset space,” Annals of Physics, vol. 237, no. 2, pp. 269–321, 1995.
  • [8] P. Diţă, “Quantization of the motion of a particle on an NN-dimensional sphere,” Physical Review A, vol. 56, no. 4, p. 2574, 1997.
  • [9] N. M. J. Woodhouse, Geometric quantization. Oxford University Press, 1997.
  • [10] C. Neves and C. Wotzasek, “Stückelberg field-shifting quantization of a free particle on a DD-dimensional sphere,” Journal of Physics A: Mathematical and General, vol. 33, no. 36, p. 6447, 2000.
  • [11] A. Bouketir, “Group theoretic quantisation on spheres and quantum Hall effect.” Ph.D. Thesis, Universiti Putra Malaysia, (2000). http://psasir.upm.edu.my/id/eprint/9053.
  • [12] K. Kowalski and J. Rembielinski, “Quantum mechanics on a sphere and coherent states,” Journal of Physics A: Mathematical and General, vol. 33, no. 34, p. 6035, 2000.
  • [13] S.-T. Hong, W. Tae Kim, and Y.-J. Park, “Improved Dirac quantization of a free particle,” Modern Physics Letters A, vol. 15, no. 31, pp. 1915–1922, 2000.
  • [14] E. Abdalla and R. Banerjee, “Quantisation of the multidimensional rotor,” Brazilian Journal of Physics, vol. 31, no. 1, pp. 80–83, 2001.
  • [15] S.-T. Hong and K. D. Rothe, “The gauged O(3)O(3) sigma model: Schrödinger representation and Hamilton–Jacobi formulation,” Annals of Physics, vol. 311, no. 2, pp. 417–430, 2004.
  • [16] Q. Liu, L. Tang, and D. Xun, “Geometric momentum: The proper momentum for a free particle on a two-dimensional sphere,” Physical Review A, vol. 84, no. 4, p. 042101, 2011.
  • [17] B. C. Hall and J. J. Mitchell, “Coherent states for a 22-sphere with a magnetic field,” Journal of Physics A: Mathematical and Theoretical, vol. 45, no. 24, p. 244025, 2012.
  • [18] G. M. Kemp and A. P. Veselov, “On geometric quantization of the Dirac magnetic monopole,” Journal of Nonlinear Mathematical Physics, vol. 21, no. 1, pp. 34–42, 2014.
  • [19] Z. Zhong-Shuai, X. Shi-Fa, X. Da-Mao, and L. Quan-Hui, “An enlarged canonical quantization scheme and quantization of a free particle on two-dimensional sphere,” Communications in Theoretical Physics, vol. 63, no. 1, p. 19, 2015.
  • [20] S. Ouvry and A. P. Polychronakos, “Anyons on the sphere: analytic states and spectrum,” Nuclear Physics B, vol. 949, p. 114797, 2019.
  • [21] A. Bouketir and H. Zainuddin, “Quantization and Hall effect: necessities and difficulties,” 1999. https://inis.iaea.org/search/search.aspx?orig_q=RN:32027462.
  • [22] J. Śniatycki, “Prequantization of charge,” Journal of Mathematical Physics, vol. 15, no. 5, pp. 619–620, 1974.
  • [23] S. Sternberg, “Minimal coupling and the symplectic mechanics of a classical particle in the presence of a Yang-Mills field,” Proceedings of the National Academy of Sciences, vol. 74, no. 12, pp. 5253–5254, 1977.
  • [24] P. A. M. Dirac, “The fundamental equations of quantum mechanics,” Proceedings of the Royal Society of London. Series A, vol. 109, no. 752, pp. 642–653, 1925.
  • [25] M. J. Gotay, “Obstructions to quantization,” in Mechanics: from theory to computation, pp. 171–216, Springer, 2000.
  • [26] M. J. Gotay, “On a full quantization of the torus,” in Quantization, coherent states, and complex structures, pp. 55–62, Springer, 1995.
  • [27] M. J. Gotay and J. Grabowski, “On quantizing nilpotent and solvable basic algebras,” Canadian Mathematical Bulletin, vol. 44, no. 2, pp. 140–149, 2001.
  • [28] V. Bargmann, “On unitary ray representations of continuous groups,” Annals of Mathematics, pp. 1–46, 1954.
  • [29] ACuriousMind, “Why exactly do sometimes universal covers, and sometimes central extensions feature in the application of a symmetry group to quantum physics?.” Physics Stack Exchange. https://physics.stackexchange.com/q/203945 (version: 2020-06-11).
  • [30] U. Niederer and L. O’Raifeartaigh, “Realizations of the Unitary Representations of the Inhomogeneous Space-Time Groups I. General Structure,” Fortschritte der Physik, vol. 22, no. 3, pp. 111–129, 1974.
  • [31] R. Raczka and A. O. Barut, Theory of group representations and applications. World Scientific Publishing Company, 1986.
  • [32] F. D. M. Haldane, “Fractional quantization of the Hall effect: a hierarchy of incompressible quantum fluid states,” Physical Review Letters, vol. 51, no. 7, p. 605, 1983.
  • [33] J. Schwinger, “Magnetic charge and quantum field theory,” Physical Review, vol. 144, no. 4, p. 1087, 1966.
  • [34] J. Schwinger, “Magnetic charge and the charge quantization condition,” Physical Review D, vol. 12, no. 10, p. 3105, 1975.
  • [35] A. Peres, “Rotational invariance of magnetic monopoles,” Physical Review, vol. 167, no. 5, p. 1449, 1968.
  • [36] E. Fradkin, Field theories of condensed matter physics. Cambridge University Press, 2013.
  • [37] T. Kikuchi and G. Tatara, “Spin dynamics with inertia in metallic ferromagnets,” Physical Review B, vol. 92, no. 18, p. 184410, 2015.
  • [38] T. T. Wu and C. N. Yang, “Dirac monopole without strings: monopole harmonics,” Nuclear Physics B, vol. 107, no. 3, pp. 365–380, 1976.
  • [39] M. Greiter, “Landau level quantization on the sphere,” Physical Review B, vol. 83, no. 11, p. 115129, 2011.
  • [40] V. Aldaya, M. Calixto, J. Guerrero, and F. López-Ruiz, “Group-quantization of nonlinear sigma models: Particle on S2S^{2} revisited,” Reports on Mathematical Physics, vol. 64, no. 1-2, pp. 49–58, 2009.
  • [41] V. Aldaya, M. Calixto, J. Guerrero, and F. López-Ruiz, “Quantum integrability of the dynamics on a group manifold,” Journal of Nonlinear Mathematical Physics, vol. 15, no. sup3, pp. 1–12, 2008.
  • [42] V. Aldaya, J. Guerrero, F. López-Ruiz, and F. Cossío, “SU(2)SU(2) particle sigma model: the role of contact symmetries in global quantization,” Journal of Physics A: Mathematical and Theoretical, vol. 49, no. 50, p. 505201, 2016.
  • [43] J. Guerrero, F. F. López-Ruiz, and V. Aldaya, “SU(2)SU(2)-particle sigma model: momentum-space quantization of a particle on the sphere S3S^{3},” Journal of Physics A: Mathematical and Theoretical, vol. 53, no. 14, p. 145301, 2020.
  • [44] F. Coester and R. Haag, “Representation of States in a Field Theory with Canonical Variables,” Phys. Rev., vol. 117, no. 4, p. 1137, 1960.
  • [45] R. Haag, Local quantum physics: Fields, particles, algebras. Springer Science & Business Media, 2012.
  • [46] A. Duncan, The conceptual framework of quantum field theory. Oxford University Press, 2012.
  • [47] J. Klauder and E. Aslaksen, “Elementary model for quantum gravity,” Phys. Rev. D, vol. 2, pp. 272–276, 1970.
  • [48] M. Pilati, “Strong-coupling quantum gravity. I. Solution in a particular gauge,” Physical Review D, vol. 26, no. 10, p. 2645, 1982.
  • [49] M. Pilati, “Strong-coupling quantum gravity. II. Solution without gauge fixing,” Physical Review D, vol. 28, no. 4, p. 729, 1983.
  • [50] J. R. Klauder, “Noncanonical quantization of gravity. I. Foundations of affine quantum gravity,” Journal of Mathematical Physics, vol. 40, no. 11, pp. 5860–5882, 1999.
  • [51] J. R. Klauder, “The affine quantum gravity programme,” Classical and Quantum Gravity, vol. 19, no. 4, p. 817, 2002.
  • [52] C. J. Isham and A. C. Kakas, “A group theoretical approach to the canonical quantisation of gravity. i. construction of the canonical group,” Classical and Quantum Gravity, vol. 1, no. 6, p. 621, 1984.
  • [53] C. Isham and A. C. Kakas, “A group theoretical approach to the canonical quantisation of gravity. ii. unitary representations of the canonical group,” Classical and Quantum Gravity, vol. 1, no. 6, p. 633, 1984.
  • [54] C. Vizman, “Central extensions of semidirect products and geodesic equations,” Physics Letters A, vol. 330, no. 6, pp. 460–469, 2004.
  • [55] E. van den Ban, “Representation theory and applications in classical quantum mechanics.” Lectures for the MRI Spring School “Lie groups in Analysis, Geometry and Mechanics”, Utrecht, June 2004. https://webspace.science.uu.nl/~ban00101/lecnotes/repq.pdf (version: v4, 20/6).
  • [56] G. W. Mackey, Induced representations of groups and quantum mechanics. New York, NY: Benjamin, 1968.