This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: Department of Physics, North Carolina State University, Raleigh, NC 27607, USA

Strongly coupled QFT dynamics via TQFT coupling

Mithat Ünsal [email protected]
Abstract

We consider a class of quantum field theories and quantum mechanics, which we couple to N{\mathbb{Z}}_{N} topological QFTs, in order to classify non-perturbative effects in the original theory. The N{\mathbb{Z}}_{N} TQFT structure arises naturally from turning on a classical background field for a N{\mathbb{Z}}_{N} 0- or 1-form global symmetry. In SU(N)SU(N) Yang-Mills theory coupled to N{\mathbb{Z}}_{N} TQFT, the non-perturbative expansion parameter is exp[SI/N]=exp[8π2/g2N]\exp[-S_{I}/N]=\exp[-{8\pi^{2}}/{g^{2}N}] both in the semi-classical weak coupling domain and strong coupling domain, corresponding to a fractional topological charge configurations. To classify the non-perturbative effects in original SU(N)SU(N) theory, we must use PSU(N)PSU(N) bundle and lift configurations (critical points at infinity) for which there is no obstruction back to SU(N)SU(N). These provide a refinement of instanton sums: integer topological charge, but crucially fractional action configurations contribute, providing a TQFT protected generalization of resurgent semi-classical expansion to strong coupling. Monopole-instantons (or fractional instantons) on T3×SL1T^{3}\times S^{1}_{L} can be interpreted as tunneling events in the ’t Hooft flux background in the PSU(N)PSU(N) bundle. The construction provides a new perspective to the strong coupling regime of QFTs and resolves a number of old standing issues, especially, fixes the conflicts between the large-NN and instanton analysis. We derive the mass gap at θ=0\theta=0 and gaplessness at θ=π\theta=\pi in 1\mathbb{CP}^{1} model, and mass gap for arbitrary θ\theta in N1,N3\mathbb{CP}^{N-1},N\geq 3 on 2{\mathbb{R}}^{2}.

1 Introduction

The applications of coupling a topological quantum field theory (TQFT) to quantum field theory (QFT) received recent interest in the discussion of mixed anomalies Gukov:2013zka ; Kapustin:2014gua ; Gaiotto:2017yup . In this context, it is used to extract kinematic constraints imposed by symmetry on a QFT. This is an exact, albeit non-dynamical information about quantum theory.

Here, we would like to explore the implications of coupling a TQFT to QFT in the study of non-perturbative dynamics, a territory that we cannot expect to be exact, in search of reliable approximations. We want to take advantage of robustness of TQFT in parts of the story. One of our physical goals is to understand what controls the strength of non-perturbative effects in an asymptotically free QFT on large dd-dimensional manifold MdM_{d}, where MdM_{d} serves as a regularization for d{\mathbb{R}}^{d}. More practically, we would like to understand the role of topological defects such as instantons, monopole instantons and fractional instantons in the dynamics much more precisely both in weak coupling semi-classical domain and in strong coupling domain by using the restrictions that follows from TQFT.

Let us denote a theory by 𝖳\mathsf{T} and its global symmetry by GG. To be less abstract, we develop three (extremely) parallel stories. Quantum mechanics of a particle on a circle in the presence of a potential with NN degenerate minima, that we call TNT_{N} model for short, 2d2d N1\mathbb{CP}^{N-1} model, and 4d4d SU(N)SU(N) Yang-Mills theory. TNT_{N} quantum mechanics and Yang-Mills theory possess a N{\mathbb{Z}}_{N} global symmetry, 0-form or 1-form, respectively. N1\mathbb{CP}^{N-1} has a SU(N)/NSU(N)/{\mathbb{Z}}_{N} global symmetry. One can turn on a classical background field for these symmetries (also called background gauging).111We use background gauging for turning on a classical background field for a global symmetry and coupling it in a gauge invariant manner to the theory. Note that with a background field turned on, it is still meaningful to look at correlators that transform under background gauging. These are non-trivial and transform covariantly. There is no path integration at this stage. If we choose to sum over all possible backgrounds, i.e, perform a path integral over the background field by making it dynamical, we refer to it as gauging. In this case, the correlators that transform under gauge transformations are identically zero by Elitzur’s theorem Elitzur:1975im . Both of these ideas are used throughout this work. In doing so, coupling QFT 𝖳\mathsf{T} to a N{\mathbb{Z}}_{N} TQFT becomes strictly necessary, we call the resulting theory as 𝖳/N\mathsf{T}/{\mathbb{Z}}_{N} theory.222Strictly speaking, 𝖳/N\mathsf{T}/{\mathbb{Z}}_{N} is appropriate description for QM and Yang-Mills. In N1\mathbb{CP}^{N-1}, it is slight abuse of language. To turn on SU(N)/NSU(N)/{\mathbb{Z}}_{N} background, we first turn on an SU(N)SU(N) background field, and then, couple the theory to a N{\mathbb{Z}}_{N} TQFT. This is what we mean by 𝖳/N\mathsf{T}/{\mathbb{Z}}_{N} model in N1\mathbb{CP}^{N-1}. As a result of this procedure, we learn the existence of new non-perturbative effects and scales relevant to the original theory, see Fig. 1.

A recurring theme can be described simply in quantum mechanics. Consider TNT_{N} model, with N{\mathbb{Z}}_{N} shift symmetry, Fig.2. The partition function is described as a sum over periodic paths on S1S^{1}, it receives contribution only from integer winding number WW\in{\mathbb{Z}} configurations. Refer to the action of W=1W=1 instanton as SIS_{I}. There are clearly fractional instantons IjI_{j} with winding number W1NW\in\frac{1}{N}{\mathbb{Z}} and action SI/NS_{I}/N in the theory, which contribute at leading order to the spectrum, but forbidden to contribute to partition function, because they are not periodic paths in S1S^{1}. Now, if we couple the theory to N{\mathbb{Z}}_{N} TQFT and gauge N{\mathbb{Z}}_{N}, these configurations become closed paths in S1/NS^{1}/{\mathbb{Z}}_{N}, but there is an obstruction to lift them to closed paths in original S1S^{1}. Yet, a pair of loops with winding numbers +11=0+1-1=0 in S1/NS^{1}/{\mathbb{Z}}_{N} lift to S1S^{1} as 1N1N=0\frac{1}{N}-\frac{1}{N}=0 and contribute to the partition function with action 2SI/N2S_{I}/N. The precise semi-classical description of this saddle here is critical point at infinity Behtash:2018voa which we explain in QM discussion. So, there are zero winding, and fractional action non-BPS configuration contributing to partition function in TNT_{N} quantum mechanics.

Refer to caption
Figure 1: By coupling a QFT to a TQFT, the topological charge and action become fractional. Such configurations do not lift to the partition function of the theory 𝖳{\mathsf{T}}, but their integer topological charge and fractional action composites do. Even the W=1NW=\frac{1}{N} fractional instantons contribute to some observables in 𝖳{\mathsf{T}}, but not to partition function. We demonstrate that the non-perturbative expansion parameter in theory is exp[SI/N]\exp[-S_{I}/N] instead of exp[SI]\exp[-S_{I}]. This statement was already known to be true in semi-classical domain. The TQFT coupling tells us that it is also true in strong coupling domain.

We formalize this construction as follows in QM and QFT.

  • In d=1,2,4d=1,2,4, we describe a N{\mathbb{Z}}_{N} TQFT and couple it to the theory 𝖳\mathsf{T}: quantum mechanics, N1\mathbb{CP}^{N-1} and Yang-Mills, respectively.

  • By turning on a classical N{\mathbb{Z}}_{N} background field or by gauging it (summing over all possible backgrounds, corresponding to 𝖳/N\mathsf{T}/{\mathbb{Z}}_{N}), we find a set of saddles which do not lift to the original theory 𝖳\mathsf{T}. But fractional action non-BPS composites of such objects lift to 𝖳\mathsf{T} and contribute to partition function.

  • We can view the (fixed) classical N{\mathbb{Z}}_{N} TQFT backgrounds as (fixed) ’t Hooft fluxes or twisted boundary conditions in d=2,4d=2,4 tHooft:1979rtg ; tHooft:1977nqb .

  • In Yang-Mills theory, the non-trivial configurations in the SU(N)/NSU(N)/{\mathbb{Z}}_{N} bundle do not uplift to SU(N)SU(N) bundle, but certain integer topological charge and fractional action configurations do. The same statement is also true in N1\mathbb{CP}^{N-1}, where U(1)/NU(1)/{\mathbb{Z}}_{N} vs. U(1)U(1) are appropriate bundles. Both of these statements are independent of strength of coupling, in particular, valid at both weak and strong coupling due to TQFT coupling.

  • Since in order to find the saddles in TNT_{N} model, we need to directly look at either TNT_{N} with classical N{\mathbb{Z}}_{N} background (A(1),A(0))(A^{(1)},A^{(0)}) or equivalently, N{\mathbb{Z}}_{N} twisted boundary conditions or (TN/N)p(T_{N}/{\mathbb{Z}}_{N})_{p} models where N{\mathbb{Z}}_{N} is gauged, we claim that in order to find the relevant saddles of SU(N)SU(N) gauge theories, we should first consider either SU(N)SU(N) with background gauge field (B(2),B(1))(B^{(2)},B^{(1)}) for the N[1]{\mathbb{Z}}_{N}^{[1]} 1-form symmetry or (SU(N)/N)p(SU(N)/{\mathbb{Z}}_{N})_{p} theory where N[1]{\mathbb{Z}}_{N}^{[1]} is gauged. After finding these configurations, we can patch them up to find the ones that can be lifted to SU(N)SU(N) theory. These are fractional action (e.g. 2SI/N2S_{I}/N) configurations that contribute to the partition function of SU(N)SU(N) theory.

  • These configurations are TQFT-protected, and exist both at weak and strong coupling. In compactified theories, they exist both at Tlarged1×Ssmall1T^{d-1}_{\rm large}\times S^{1}_{\rm small} as well as TlargedT^{d}_{\rm large} where the latter is a place holder for d{\mathbb{R}}^{d}.

  • This construction shows that, in d=2d=2 N1\mathbb{CP}^{N-1} and d=4d=4 Yang-Mills theory, the non-perturbative effects are not controlled by the (BPST) instanton factor eSI+iθ\mathrm{e}^{-S_{I}+\mathrm{i}\theta} Polyakov:1975yp ; Belavin:1975fg , but instead by exponentially larger eSI/N+iθ/N\mathrm{e}^{-S_{I}/N+\mathrm{i}\theta/N} fractional instanton factor, due to TQFT-protection.

  • This extends our earlier result in flavor ΩF\Omega_{F}-twisted N1\mathbb{CP}^{N-1} model on semi-classical small ×SL1{\mathbb{R}}\times S^{1}_{L} Dunne:2016nmc ; Dunne:2012ae to large ×SL1{\mathbb{R}}\times S^{1}_{L}, to the strong coupling domain, by augmenting flavor twisted theory with a N{\mathbb{Z}}_{N} TQFT background (i.e. with ’t Hooft twist.) It also extends some results in semi-classical deformed Yang-Mills theory, 𝒩=1{\cal N}=1 SYM, QCD(adj) and some other QCD-like theories (see Unsal:2007jx ; Unsal:2007vu ; Unsal:2008ch ; Shifman:2008ja ) from small 3×SL1{\mathbb{R}}^{3}\times S^{1}_{L} to large 3×SL1{\mathbb{R}}^{3}\times S^{1}_{L}, to the strong coupling domain by coupling these systems to N{\mathbb{Z}}_{N} TQFTs.

  • Based on matching weak coupling semi-classical descriptions, matching of global symmetries and mixed ’t Hooft anomalies, and improving an old work of Fateev, Frolov, Schwarz, and Berg, Lüscher Berg:1979uq ; Fateev:1979dc in crucial ways, we show that low-energy spectrum of the N1\mathbb{CP}^{N-1} model is described by NN-flavor massive Schwinger model or its non-abelian bosonization, the mass deformation of SU(N)1SU(N)_{1} WZW model with an extra scalar.

  • The mass parameter in Schwinger model, or mass deformed WZW models, due to matchings of semi-classical descriptions, map to a fractional instanton effect eSI/N\mathrm{e}^{-S_{I}/N} in N1\mathbb{CP}^{N-1}, which is nothing but the strong scale of the theory. By studying renormalization group for topological defect operators Kosterlitz:1974sm (derived originally to explain the Kosterlitz-Thouless phase transition Kosterlitz:1973xp ), we describe the low energy spectrum of the theory.

  • We determine the mass gap and low energy spectrum for arbitrary θ\theta in N1\mathbb{CP}^{N-1} on 2{\mathbb{R}}^{2} via this procedure. For N3N\geq 3, the spectrum is gapped for any θ\theta, and composed of an adjoint and singlet. For the N=2N=2 theory, the theory is gapped at θπ\theta\neq\pi. We show the existence of a gapless triplet at θ=π\theta=\pi. Our formalism give a prediction for the mass gap in the 1\mathbb{CP}^{1} model as mgap(θ)=Λ|cosθ2|2/3m_{\rm gap}(\theta)=\Lambda\left|\cos\frac{\theta}{2}\right|^{{2}/{3}}, which we hope can be tested by other means.

Our construction resolves many puzzles concerning conflicts between large-NN vs. instantons, instantons vs. multi-branch structure of vacua, incorrect θ\theta-angle dependence of vacuum energy due to instantons, instantons vs. η\eta^{\prime} puzzle, and many others mentioned in books and reviews Coleman198802 ; Vainshtein:1981wh ; Schafer:1996wv ; Marino:2015yie . In retrospect, our current construction proves why recent studies on d1×S1{\mathbb{R}}^{d-1}\times S^{1} in the context of adiabatic continuity and resurgent semi-classical analysis Dunne:2016nmc ; Dunne:2012ae ; Unsal:2008ch ; Shifman:2008ja capture interesting non-perturbative dynamics of QFT on d{\mathbb{R}}^{d}. In the light of our construction, it may be worthwhile to revisit the meron idea of Gross et. al. Callan:1977gz ; Gross:1977wu ; Callan:1977qs on d{\mathbb{R}}^{d}. This idea requires some fixing which can easily be done, for the N=2N=2 case, see Sec.B. It also needs correct generalization to N3N\geq 3.

1.1 Earlier works on Yang-Mills on T3×T^{3}\times{\mathbb{R}}: Not all QM reductions are the same.

We finalize the introduction by describing historical precedents to this work333We thank anonymous referee for encouraging us to write this part which clarifies the relation between the present work and earlier discussions of Yang-Mills theory in a quantum mechanical reduction down to small T3×T^{3}\times{\mathbb{R}}., especially concerning the different class of constructions in a quantum mechanical reduction on T3×T^{3}\times\mathbb{R}. This will also give us an opportunity to highlight where our work resembles or differs from the earlier works.

An exact solution to self-duality equation with topological charge Q=1/NQ=1/N and action 8π2g2N\frac{8\pi^{2}}{g^{2}N} on T4T^{4} is first found by ’t Hooft tHooft:1981nnx , and generalized by van Baal vanBaal:1982ag . ’t Hooft original physical motivation was to find configurations which survive in the large-NN limit, and which are exponentially more important than the usual instantons even at finite-NN. (The usual instantons are suppressed as eNe^{-N} in the large-NN limit, and are irrelevant.) These solutions can be interpreted as configurations in SU(N)SU(N) with discrete flux background turned on, or in PSU(N)PSU(N) bundle. However, these are constant solutions, and the geometry of T4T^{4} must satisfy a certain aspect ratio to have a 8π2g2N\frac{8\pi^{2}}{g^{2}N} action. Unlike on 3×S1{\mathbb{R}}^{3}\times S^{1} where it is easy to find non-trivial (space-time dependent) fractional instanton solutions, the monopole-instantons, where dynamics abelianize at long distances due to adjoint-Higgsing sourced by the vev of gauge holonomy, it proved to be extremely difficult to find non-trivial solutions in systems which do not abelianize at long distance.

Discrete flux background and twisted boundary conditions are used to reduce 𝒩=1{\cal N}=1 SU(N)SU(N) SYM to quantum mechanics on T3×T^{3}\times{\mathbb{R}} and employed to calculate supersymmetric index in Witten’s work Witten:1982df . The work closest in spirit to our construction, working with non-supersymmetric theories, and aiming to preserve the vacuum structure and fractional instantons in a quantum mechanical limit, are the works of Gonzalez-Arroyo et. al. In their construction, dynamics do not abelianize, and they have found by numerical simulations on latticized T3×T^{3}\times{\mathbb{R}} that time-dependent fractional instanton solutions with action 8π2g2N\frac{8\pi^{2}}{g^{2}N} exist in the presence of ’t Hooft flux GarciaPerez:1989gt ; GarciaPerez:1992fj ; GarciaPerez:1993jw ; GonzalezArroyo:1995zy ; GonzalezArroyo:1995ex ; GonzalezArroyo:1996jp ; Montero:2000mv ; GonzalezArroyo:1997uj ; Gonzalez-Arroyo:2019wpu . The analytic form of these solutions is still an open and important problem. The difference between the works of Witten and Gonzalez-Arroyo, and our work is following. We reduce deformed YM on 3×Sβ1{\mathbb{R}}^{3}\times S^{1}_{\beta} (which is adiabatically connected to pure YM on 4{\mathbb{R}}^{4}) down to T3×Sβ1T^{3}\times S^{1}_{\beta} by using magnetic GNO flux in the co-weight lattice of SU(N)SU(N). This is more refined data than discrete ’t Hooft flux. Magnetic NN-ality determines the discrete magnetic flux. The fact that we can turn on 𝝁1\bm{\mu}_{1} GNO flux and this is degenerate in energy with 𝝁𝟏𝜶𝟏,,𝝁1j=1N1𝜶j\bm{\mu_{1}-\alpha_{1}},\ldots,\bm{\mu}_{1}-\sum_{j=1}^{N-1}\bm{\alpha}_{j} (all with discrete magnetic flux one) helps us to identify the tunneling in QM with the monopole-instantons on 3×Sβ1{\mathbb{R}}^{3}\times S^{1}_{\beta}, which carry magnetic charges in the extended simple root system of SU(N)SU(N). In both construction, there are NN-classical vacua (sourced by turning on a classical background) and NN- fractional instantons with topological charge Q=1/NQ=1/N and action 8π2g2N\frac{8\pi^{2}}{g^{2}N}, an extremely desirable aspect.

A frequently asked question is the relation between Luscher Luscher:1982ma and van Baal’s works vanBaal:1986ag ; vanBaal:2000zc , which employs T3×T^{3}\times{\mathbb{R}}, without incorporating discrete flux, and our present work.444van Baal also have important works on the discrete flux backgrounds and classification of the configurations in PSU(N)PSU(N) bundle. These works would like to benefit from the absence of phase transitions in finite volume. However, unlike our construction, in these compactification, vacuum structure is not persistent. In particular, in SU(N)SU(N) theory, there are N3N^{3} vacua related to perturbative breaking of (N[0])3({\mathbb{Z}}_{N}^{[0]})^{3} center symmetry to all orders in perturbation theory, instead of NN classical vacua that emerge from classical flux insertion. However, 0-form symmetries cannot break spontaneously in QM unless enforced by a mixed anomaly.555In fact, d1d-1 form symmetries cannot break spontaneously in dd dimensions unless enforced by a mixed anomaly. Indeed, there are instantons in the theory leading to a unique vacuum. These instantons between N3N^{3} perturbative vacua are not classical fractional instantons with action 8π2g2N\frac{8\pi^{2}}{g^{2}N} or instantons with action 8π2g2\frac{8\pi^{2}}{g^{2}}. Rather, these instantons arise from balancing of the classical action with one-loop potential for the gauge holonomy, i.e, they are “quantum instantons”, with action of order O(1/g)O(1/g), i.e, they are not related to fractional instantons that we are talking about, that exist in thermo-dynamic limit on 3×S1{\mathbb{R}}^{3}\times S^{1} or the ones that appear in PSU(N)PSU(N) bundle classification.

Furthermore, if we consider the large-NN limit of the Luscher Luscher:1982ma and van Baal’s compactification vanBaal:1986ag ; vanBaal:2000zc , the tunneling amplitude between N3N^{3} perturbative vacua has an NN dependence of the form eNe^{-\sqrt{N}}, and the tunneling becomes forbidden in the NN\rightarrow\infty limit. That is, even in finite volume, there is a phase transition in the large-NN limit (which is also a thermo-dynamic limit) of their construction, between the small T3×T^{3}\times{\mathbb{R}} theory and large T3×T^{3}\times{\mathbb{R}} theory according to (N[0])3({\mathbb{Z}}_{N}^{[0]})^{3} center-symmetry realization. (This phase transition is actually the reason why Eguchi-Kawai reduction Eguchi:1982nm in its original form without discrete flux fails Bhanot:1982sh ; Kiskis:2003rd .) In sharp contrast, in our and Gonzalez-Arroyo’s constructions, there are NN-classical vacua, induced by classical background, and the tunneling between them is a classical fractional instanton effect with action 8π2g2N\frac{8\pi^{2}}{g^{2}N} and does not disappear in the large-NN limit. (This is actually the reason why Twisted-Eguchi-Kawai reduction with judiciously chosen discrete flux satisfies large-NN volume independence GonzalezArroyo:2010ss .) Very importantly, in our set-up, the fractional instantons induce effects such as exp[8π2g2N+iθN]\exp[-\frac{8\pi^{2}}{g^{2}N}+i\frac{\theta}{N}]. One of the most interesting non-perturbative phenomena in infinite volume gauge theory is that observables are NN-branched functions, where each branch is 2πN2\pi N periodic, yet, the observables are 2π2\pi periodic, with cusps at θ=π\theta=\pi Witten:1980sp . The set of fractional instantons (including critical points at infinity) are compatible with a complete Fourier basis, which can describe such non-trivial observables with multi-branched structure, and this compatibility of saddles with a complete basis for Fourier expansion of observables is far from trivial. Indeed, we have the same intentions with Luscher and van Baal, but thanks to the GNO and discrete flux backgrounds we use, our quantum mechanical constructions have a remembrance of fractional topological configurations on 3×S1{\mathbb{R}}^{3}\times S^{1} and the configurations in PSU(N)PSU(N) bundle even on large-T4T^{4}. Therefore, the constructions with flux are more powerful strategies to understand non-perturbative Yang-Mills dynamics across different length scales.

Finally, perhaps, a point that is not sufficiently understood (or emphasized) in the past, is the fact that the flux backgrounds have a re-interpretation as coupling the QFTs to TQFT (backgrounds) Kapustin:2014gua . These theories are globally different, but locally the same. For example, mass spectrum are completely identical on large T4T^{4}. The TQFT coupling instructs us that the existence of fractional topological charge configurations is independent of T4T^{4} size, and is valid at both weak and strong coupling. This is explored in detail in present work and in a companion paper Unsal:2021cch .

Refer to caption
Figure 2: TNT_{N} model is quantum mechanics with periodic potential cos(Nq)-\cos(Nq) on a circle with fundamental domain q[0,2π]q\in[0,2\pi]. In the Born-Oppenheimer approximation, it is equivalent to NN-site lattice model. In partition function, we sum only over periodic paths, corresponding to integer winding number topologies, but obviously, there are other saddles with fractional winding number.

2 Quantum mechanical TNT_{N} and (TN/N)p(T_{N}/{\mathbb{Z}}_{N})_{p} models

Consider quantum mechanics of a particle on a circle with a potential of the form

V(q)\displaystyle V(q) =cos(Nq),qq+2π\displaystyle=-\cos(Nq),\qquad q\sim q+2\pi (1)

qq is physically identified with q+2πq+2\pi. The theory has a N{\mathbb{Z}}_{N} global shift symmetry,

N:qq+2πN\displaystyle{\mathbb{Z}}_{N}:q\mapsto q+\frac{2\pi}{N} (2)

The system has NN-degenerate minima to all orders in perturbation theory. The partition function, Z(β)=tr[exp(βH)]Z(\beta)=\text{tr}[\exp(-\beta H)], in the β\beta\rightarrow\infty limit, is dominated by the lowest-NN states. Within the Born-Oppenheimer approximation, one can forget about higher states in the spectrum, as non-perturbative instanton induced splittings are exponentially smaller than splittings between harmonic oscillator levels. In this limit, the system may be viewed as NN-site lattice model with nearest neighbor hopping interaction, described by a tight-binding approximation. For convenience, we call this system TNT_{N} model. See Fig.2. Some semi-classical and anomaly aspects of this model are studied in Unsal:2012zj ; Kikuchi:2017pcp ; Gaiotto:2017yup .

This simple system provides a playground for some ideas that we will employ in diverse non-trivial QFTs. Hence, we explore in detail the connection between various ways of looking to this system and interpret them in simple physical terms. The connection between the N{\mathbb{Z}}_{N} twisted boundary conditions, and turning on N{\mathbb{Z}}_{N} background field is well known. These constructions have a precise interpretation as coupling QM to N{\mathbb{Z}}_{N} TQFT, which is fairly useful in QFT. Later, we may also choose to gauge N{\mathbb{Z}}_{N} symmetry. This is equivalent to summing over all possible background fields or twisted boundary conditions, or performing a path integral over the TQFT. Gauging, of course, changes the theory, from TNT_{N} to TN/NT_{N}/{\mathbb{Z}}_{N}. In the latter, qq and q+2πNq+\frac{2\pi}{N} are physically identified. i.e, we end up with the particle on a circle with 1-minimum on the circle. However, there are different versions of this 1-site theory that we call (TN/N)p(T_{N}/{\mathbb{Z}}_{N})_{p}, depending how we gauge N{\mathbb{Z}}_{N}. The pp in the (TN/N)p(T_{N}/{\mathbb{Z}}_{N})_{p} has three equivalent, but somehow different sounding interpretations, as discrete theta angle θp\theta_{p}, or level pp Chern-Simons term that is present in TQFT, or picking Bloch state with quasi-momentum pp from the Hilbert space of the original theory, which we describe in detail, as it translates verbatim to QFT.

Basics: The path-integral representation of the partition function is

Z(β)\displaystyle Z(\beta) =tr[eβH]=q(β)=q(0)𝒟qexp(S[q]),where\displaystyle=\text{tr}[\mathrm{e}^{-\beta H}]=\int_{q(\beta)=q(0)}{\mathcal{D}}q\;\exp(-S[q]),\qquad{\rm where} (3)
S[q]\displaystyle S[q] =1g𝑑τ(12q˙2+V(q))+iθ2πdq\displaystyle=\frac{1}{g}\int d\tau\left({\textstyle{\frac{1}{2}}}\dot{q}^{2}+V(q)\right)+\frac{\mathrm{i}\theta}{2\pi}\int\mathrm{d}q (4)

The trace tr[]\text{tr}[\cdot] in the operator formalism translates in the path integral formalism to integration over all paths obeying the periodic boundary condition, q(β)=q(0)q(\beta)=q(0). The periodic paths are classified according to the homotopy group of the maps

q(τ):Sβ1S1,π1(S1)=\displaystyle q(\tau):S^{1}_{\beta}\rightarrow S^{1},\;\;\pi_{1}(S^{1})=\mathbb{Z} (5)

which is nothing but winding number (corresponding to topological charge):

W=12πdq.\displaystyle W=\frac{1}{2\pi}\int\mathrm{d}q\in{\mathbb{Z}}. (6)

which is integer valued.

Despite the fact that the partition function is given by summing over integer winding number WW\in\mathbb{Z} configurations, it is clear that there are classical solutions with fractional topological charge, called fractional instantons, see Fig.2.666These are obvious in quantum mechanics, but not always so obvious in QFT. This is the reason why we lay out a somewhat abstract formalism in QM, which carries over to QFT. Obvious facts in QM will map to non-obvious statements in QFT both at weak and strong coupling on dd-manifolds, MdM_{d} including d{\mathbb{R}}^{d} limits. The configuration with W=1W=1 is an instanton, and its action is SW=1SIS_{W=1}\equiv S_{I}. This is our yard-stick throughout the problem. The configurations with W=1/NW=1/N are referred to as fractional instantons, and their action is SI/NS_{I}/N. These are tunneling configurations connecting neighboring minima jj+1j\rightarrow j+1. Clearly, a single fractional instanton j{\cal I}_{j} cannot contribute to the partition function, as it does not obey the periodic boundary condition in (4). Equivalently, these are periodic paths in S1/NS^{1}/{\mathbb{Z}}_{N} which cannot be lifted to S1S^{1}. However, an j{\cal I}_{j}, ¯j\overline{\cal I}_{j} pair, which has W=0W=0, but action 2SI/N2S_{I}/N, must contribute to the partition function since it is a periodic path in S1S^{1}.

Digression on critical points at infinity and resurgent cancellations: The j{\cal I}_{j}, ¯j\overline{\cal I}_{j} configurations can be viewed as critical points at infinity. Here, we will go over this concept fairly quickly. See Behtash:2018voa for details and Dunne:2016nmc for broad review. The fact that the combination of j{\cal I}_{j} and ¯j\overline{\cal I}_{j} configuration is often called “unstable” in old literature is a red-herring. In semi-classics, the critical point at infinity has a non-Gaussian Lefschetz thimble (steepest descent cycle) that one must integrate over. This contribution maps to second order terms in cluster expansion as

β22![j][¯j]+β1![j¯j]±\displaystyle\frac{\beta^{2}}{2!}[{\cal I}_{j}][\overline{\cal I}_{j}]+\frac{\beta}{1!}[{\cal I}_{j}\overline{\cal I}_{j}]_{\pm} (7)

where the first, maximally extensive term in β\beta is dilute (non-interacting) instanton gas contribution and sub-extensive term capture the correlated instanton-anti-instanton event [a¯a]±[{\cal I}_{a}\overline{\cal I}_{a}]_{\pm} which is two-fold ambiguous. Remarkable fact that comes from resurgent analysis is that this ambiguity cancels against the ambiguity of lateral Borel resummation of perturbation theory for vacuum energy, 𝔹E0,±\mathbb{B}E_{0,\pm}. At second order in semi-classics, we find the cancellation, and

Im𝔹E0,±+Im[j¯j]±=0\displaystyle\text{Im}\mathbb{B}E_{0,\pm}+\text{Im}[{\cal I}_{j}\overline{\cal I}_{j}]_{\pm}=0 (8)

The combination is unambiguous and meaningful within resurgent semi-classical analysis framework Dunne:2016nmc .

Strictly speaking, action 2SI/N2S_{I}/N and W=0W=0 configurations can equivalently be viewed as uncorrelated events [j][{\cal I}_{j}] and [¯j][\overline{\cal I}_{j}] and correlated bion events [j¯j]±[{\cal I}_{j}\overline{\cal I}_{j}]_{\pm}, for which its real unambiguous part Re[j¯j]\text{Re}[{\cal I}_{j}\overline{\cal I}_{j}] contribute to the partition function, as both correspond to periodic paths in S1S^{1}.777Throughout the paper, we will not bother much with the resurgent structure, but it is often at the back of our minds. It is important that most of the critical points that contribute to actions in QFTs and QM are “critical points at infinity”, and is decomposable to BPS and non-BPS configurations. We encourage the reader to go over Behtash:2018voa if necessary. This background knowledge allows us to take some steps swiftly in the course of this work. Similarly, configuration such as an ordered product j=1Nj=12N\prod_{j=1}^{N}{\cal I}_{j}={\cal I}_{1}{\cal I}_{2}\ldots{\cal I}_{N} with integer topological charge W=1W=1, and integer action 1×SI1\times S_{I} must contribute to the partition function since it is also a periodic path. More generally, generically non-BPS configurations with action (W+2n¯N)SI(W+\frac{2\overline{n}}{N})S_{I} and topological charge WW\in{\mathbb{Z}} contribute to partition function.

Let us now derive the fractional instanton decomposition of the path integral. Since we already know the energy spectrum, we can take a short cut and use it to derive the fractional instanton sum and see explicitly the paths that contribute at leading semi-classics.

Within the Born-Oppenheimer approximation, we can write a tight-binding Hamiltonian to describe low energy physics (We shifted zero point energy ω/2\hbar\omega/2 to zero, and ξ=KeS/N\xi=K\mathrm{e}^{-S/N} is the non-perturbatively small hopping parameter.)

H=j=1Nξeiθ/N|j+1j|+h.c.\displaystyle H=-\sum_{j=1}^{N}\xi\mathrm{e}^{i\theta/N}|j+1\rangle\langle j|+{\rm h.c.} (9)

The energy spectrum is given by diagonalizing the Hamiltonian:

Ek(θ)=2KeS/Ncosθ+2πkN\displaystyle E_{k}(\theta)=-2K\mathrm{e}^{-S/N}\cos\frac{\theta+2\pi k}{N} (10)

The ground state depends on what the theta angle is. The ground state energy is EG.S.(θ)=minkEk(θ)E_{\mathrm{G.S.}}(\theta)=\min_{k}E_{k}(\theta), exhibiting a two-fold degeneracy at odd-integer multiples of θ=π\theta=\pi.

Let us now turn the partition function into a fractional instanton sum:

Z(β,θ)\displaystyle Z(\beta,\theta) =k=0N1e2βKeSNcosθ+2πkN\displaystyle=\sum_{k=0}^{N-1}\mathrm{e}^{2\beta K\mathrm{e}^{-\frac{S}{N}}\cos\frac{\theta+2\pi k}{N}} (11)
=k=0N1n=0n¯=01n!1n¯!(βKeSN+iθ+2πkN)n(βKeSNiθ+2πkN)n¯\displaystyle=\sum_{k=0}^{N-1}\sum_{n=0}^{\infty}\sum_{\overline{n}=0}^{\infty}\frac{1}{n!}\frac{1}{\overline{n}!}\left(\beta K\mathrm{e}^{-\frac{S}{N}+\mathrm{i}\frac{\theta+2\pi k}{N}}\right)^{n}\left(\beta K\mathrm{e}^{-\frac{S}{N}-\mathrm{i}\frac{\theta+2\pi k}{N}}\right)^{\overline{n}} (12)

We can perform the summation over kk by using the identity

k=0N1ei2πk(nn¯)/N=NWδnn¯WN\displaystyle\sum_{k=0}^{N-1}\mathrm{e}^{\mathrm{i}2\pi k(n-\overline{n})/N}=N\sum_{W\in{\mathbb{Z}}}\delta_{n-\overline{n}-WN} (13)

converting the sum over kk, which takes place in the space of representations of N{\mathbb{Z}}_{N} translation symmetry into a sum over WW\in{\mathbb{Z}} which we will interpret as winding number. This gives the path integral in the leading order of dilute gas approximation.

Z(β,θ)\displaystyle Z(\beta,\theta) =NWn=0n¯=01n!1n¯!(βKeS/N+iθ/N)n(βKeS/Niθ/N)n¯δnn¯WN,0\displaystyle=N\sum_{W\in{\mathbb{Z}}}\sum_{n=0}^{\infty}\sum_{\overline{n}=0}^{\infty}\frac{1}{n!}\frac{1}{\overline{n}!}\left(\beta K\mathrm{e}^{-S/N+\mathrm{i}\theta/N}\right)^{n}\left(\beta K\mathrm{e}^{-S/N-\mathrm{i}\theta/N}\right)^{\overline{n}}\delta_{n-\overline{n}-WN,0} (14)

where delta function gives a constraint on the fractional instanton sum. (14) can be interpreted as a grand canonical ensemble of fractional instantons. For a term in the sum with nn fractional instanton, and n¯{\overline{n}} fractional anti-instantons, the periodic orbits correspond to:

nn¯WN=0,i.e.,nn¯=0modN\displaystyle n-\overline{n}-WN=0,\qquad{\rm{i.e.,}}\;\;n-\overline{n}=0\;\;{\rm mod}\;N (15)

These configurations which contribute to Z(β,θ)Z(\beta,\theta) comes with fugacity

eSIN(n+n¯)eiθN(nn¯)=e(W+2n¯N)SIeiWθ\displaystyle\mathrm{e}^{-\frac{S_{I}}{N}(n+\overline{n})}\;\mathrm{e}^{\mathrm{i}\frac{\theta}{N}(n-\overline{n})}=\mathrm{e}^{-\left(W+\frac{2\overline{n}}{N}\right)S_{I}}\;\mathrm{e}^{\mathrm{i}W\theta} (16)

i.e, they possess fractional action, but integer topological charge.

The simple observation, that saddles with fractional action, but integer topological charge contributes to the path integral (see Fig. 3), is at the heart of all reliable semi-classical analysis of gauge theories, and sigma models, such as 𝒩=1{\cal N}=1 SYM, Yang-Mills with double-trace deformations, and PN1\mathbb{C}P^{N-1} models, and is responsible for the multi-branched structure of the vacuum energy. This structure naturally arise in the context of resurgence and sometimes called graded resurgence triangle Dunne:2012ae ; Cherman:2014ofa .

In (14), KK is the instanton prefactor that is not important for this particular discussion, and we will ignore it throughout the paper. β\beta is there due to position zero mode of a single fractional instanton. βnn!\frac{\beta^{n}}{n!} arises due to the integration over the positions of nn fractional instantons, see e.g. Coleman198802 . The overall factor NN is due to NN distinct classical minima in the q[0,2π)q\in[0,2\pi) fundamental domain.

The constrained sum can be simplified further. Solving the constraint,

Z(β,θ)\displaystyle Z(\beta,\theta) =NW[n¯=0(βKeSIN)n¯n¯!n¯fractionalantiinst.(βKeSIN)n¯+WN(n¯+WN)!(n¯+NW)fractionalinst.]eiWθ\displaystyle=N\sum_{W\in{\mathbb{Z}}}\left[\sum_{\overline{n}=0}^{\infty}\underbrace{\frac{(\beta K\mathrm{e}^{-\frac{S_{I}}{N}})^{\overline{n}}}{{\overline{n}}!}}_{\overline{n}-\rm fractional\;anti-inst.}\underbrace{\frac{(\beta K\mathrm{e}^{-\frac{S_{I}}{N}})^{\overline{n}+WN}}{(\overline{n}+WN)!}}_{(\overline{n}+NW)-\rm fractional\;inst.}\right]\mathrm{e}^{\mathrm{i}W\theta} (17)
=NW[INW(2βKeSIN)]eiWθ\displaystyle=N\sum_{W\in{\mathbb{Z}}}\left[I_{NW}(2\beta K\mathrm{e}^{-\frac{S_{I}}{N}})\right]\mathrm{e}^{\mathrm{i}W\theta} (18)

where INWI_{NW} is the modified Bessel function of order NWNW. We learn that the series expansion of the modified Bessel function is actually an instanton sum. Since Z(θ+2π)=Z(θ)Z(\theta+2\pi)=Z(\theta), (18) is nothing but the Fourier series expansion of the partition function:

Z(θ)=WZWeiWθ,ZW=NINW(2βKeSIN)\displaystyle Z(\theta)=\sum_{W\in{\mathbb{Z}}}Z_{W}\mathrm{e}^{\mathrm{i}W\theta},\qquad\qquad Z_{W}=NI_{NW}(2\beta K\mathrm{e}^{-\frac{S_{I}}{N}}) (19)

Despite the fact that only periodic orbits with the theta angle dependence eiWθe^{\mathrm{i}W\theta} contribute, the non-trivial theta-angle dependence which leads to multi-branched structure and θ\theta dependence in the observables in the form θ+2πkN\frac{\theta+2\pi k}{N} naturally arises! This fact will also help us in gauge theories on 4{\mathbb{R}}^{4} and 3×S1{\mathbb{R}}^{3}\times S^{1}.

Refer to caption
Figure 3: Only integer winding (topological charge) configurations corresponding to periodic paths contribute to partition function. To detect fractional instantons in QM is visually trivial (See Fig.1). Formally, it requires either twisted boundary condition by using N{\mathbb{Z}}_{N} shift symmetry, or coupling to N{\mathbb{Z}}_{N} topological gauge theory. The latter more abstract formalism is more useful in gauge theories in four-dimensions, to understand the true nature of saddles contributing to Yang-Mills theory. Despite the fact that W=1N,S=SI/NW=\frac{1}{N},S=S_{I}/N fractional topological charge and fractional configurations are not contributing to partition function, W=0,S=2SI/NW=0,S=2S_{I}/N fractional action configurations do. This is responsible for the emergence of multi-branch structure as a function of theta angle. This construction tells us that trying to understand the saddles in SU(N)SU(N) gauge theory by working directly with SU(N)SU(N) bundle is not correct. It neglects most important contributions.

2.1 Coupling to N{\mathbb{Z}}_{N} TQFT background

The quantum mechanical model we are studying has a N{\mathbb{Z}}_{N} global 0-form shift symmetry. First, we describe coupling it to a N{\mathbb{Z}}_{N} background gauge field. This stage will help us to reveal the existence of fractional instantons that actually exist in the TNT_{N} model, in a more abstract way. But this somehow abstract formalism help us greatly in the QFT applications. We can later choose to sum over all possible backgrounds, which is equivalent to gauging N{\mathbb{Z}}_{N} completely, reducing the model to TN/NT_{N}/{\mathbb{Z}}_{N}, which is a 1-site model. In our discussion, we aim to make the details fairly explicit and conceptual. This will provide important insights that we will use in QFT on dd-manifolds MdM_{d}.

Coupling QM to TQFT: We first couple our quantum mechanical model to a classical N{\mathbb{Z}}_{N} topological gauge theory background Gukov:2013zka ; Kapustin:2014gua . The quantum topological theory is given by the partition function

Ztop,p\displaystyle Z_{{\rm top},p} =𝒟A(1)𝒟A(0)𝒟F(0)eiF(0)(NA(1)dA(0))+ipA(1)\displaystyle=\int{\mathcal{D}}A^{(1)}{\mathcal{D}}A^{(0)}{\mathcal{D}}F^{(0)}\;\mathrm{e}^{\mathrm{i}\int F^{(0)}\wedge(NA^{(1)}-\mathrm{d}A^{(0)})+\mathrm{i}p\int A^{(1)}} (20)

Here, F(0)F^{(0)} is a Lagrange multiplier zero-form field that we integrate over, and ipA(1)ip\int A^{(1)} term is Chern-Simons term in 1d1d, and pp must be an integer for the theory to be gauge invariant.888This term will provide different possibilities in gauging. It will turn out to be related to discrete theta angle, which in turn, determines which Bloch state is picked from the lowest NN state after gauging. For now, pp term is not very important and can be set to zero if desired.

Integrating F(0)F^{(0)}, we end up with a pair, (A(1),A(0))(A^{(1)},A^{(0)}) which obeys

NA(1)=dA(0),NA(1)=dA(0)=2π\displaystyle NA^{(1)}=\mathrm{d}A^{(0)},\qquad N\int A^{(1)}=\int\mathrm{d}A^{(0)}=2\pi{\mathbb{Z}} (21)

and is properly quantized. The path integration of TQFT is fairly trivial, and yields:

Ztop,p\displaystyle Z_{{\rm top},p} =𝒟A(1)𝒟A(0)δ((NA(1)dA(0))eipA(1)\displaystyle=\int{\mathcal{D}}A^{(1)}{\mathcal{D}}A^{(0)}\delta((NA^{(1)}-\mathrm{d}A^{(0)})\;\;\mathrm{e}^{\mathrm{i}p\int A^{(1)}} (22)
=1N𝒟A(0)eipN𝑑A(0)\displaystyle=\frac{1}{N}\int{\mathcal{D}}A^{(0)}\;\;\mathrm{e}^{\mathrm{i}\frac{p}{N}\int dA^{(0)}} (23)
=1Nk=0N1ei2πpkN=δp=0modN\displaystyle=\frac{1}{N}\sum_{k=0}^{N-1}\mathrm{e}^{\mathrm{i}\frac{2\pi pk}{N}}=\delta_{p=0\;{\rm mod}\;N} (24)

(A(1),A(0))(A^{(1)},A^{(0)}) pair describe a N{\mathbb{Z}}_{N} gauge field that can be turned on in quantum mechanical TNT_{N} model to probe saddles, in particular, to probe the fractional instantons.

The 0-form gauge invariance of the action (20) is given by

A(1)A(1)+dλ(0),A(0)A(0)+Nλ(0),F(0)F(0)\displaystyle A^{(1)}\mapsto A^{(1)}+\mathrm{d}\lambda^{(0)},\qquad A^{(0)}\mapsto A^{(0)}+N\lambda^{(0)},\qquad F^{(0)}\mapsto F^{(0)} (25)

To couple a classical N{\mathbb{Z}}_{N} background field to the qq-field, we declare

qqλ(0),\displaystyle q\mapsto q-\lambda^{(0)}, (26)

Gauge invariant combinations of our quantum field aa and classical fields (A(1),A(0))(A^{(1)},A^{(0)}) are:

Nq+A(0),dq+A(1)=(q˙+Aτ)dτ\displaystyle Nq+A^{(0)},\qquad dq+A^{(1)}=(\dot{q}+A_{\tau})\mathrm{d}\tau (27)

Hence, the partition function in the classical (A(1),A(0))(A^{(1)},A^{(0)}) background can be found by promoting dqdq and qq to these gauge invariant combinations:

Z\displaystyle Z [(A(1),A(0)),p]=𝒟F(0)q(β)=q(0)𝒟qeiF(0)(NA(1)dA(0))+ipA(1)\displaystyle[(A^{(1)},A^{(0)}),p]=\int{\mathcal{D}}F^{(0)}\;\int_{q(\beta)=q(0)}{\mathcal{D}}q\;\mathrm{e}^{\mathrm{i}\int F^{(0)}\wedge(NA^{(1)}-\mathrm{d}A^{(0)})+\mathrm{i}p\int A^{(1)}} (28)
×exp(1g𝑑τ(12(q˙+Aτ)2cos(Nq+A(0)))+iθ2π(dq+A(1)))\displaystyle\times\exp\left(-\frac{1}{g}\int d\tau\left({\textstyle{\frac{1}{2}}}(\dot{q}+A_{\tau})^{2}-\cos(Nq+A^{(0)})\right)+\frac{\mathrm{i}\theta}{2\pi}\int(\mathrm{d}q+A^{(1)})\right)\qquad\ (29)

Note that at this stage, we do not perform a path integral over (A(1),A(0))(A^{(1)},A^{(0)}). These are truly classical fields that we turn on according to what we want to do. Moreover, the path integral 𝒟q\int{\mathcal{D}}q is over periodic paths.

Equivalence of symmetry twisted boundary condition to coupling QM to TQFT: The N{\mathbb{Z}}_{N} background gauge field is equivalent to the twisted boundary condition in path integral. Let us show this explicitly. Let 𝖴\mathsf{U} denote the translation operator for N{\mathbb{Z}}_{N} shift symmetry in the TNT_{N} model. Then, the sum over transition amplitude in quantum mechanics for sites \ell units apart is given by:

Z\displaystyle Z_{\ell} =tr[eβH𝖴]=j=1Nj+|eβH|j=y(β)=y(0)+2πN𝒟yexp(S[y]),\displaystyle=\text{tr}[\mathrm{e}^{-\beta H}\mathsf{U}^{\ell}]=\sum_{j=1}^{N}\langle j+\ell|\mathrm{e}^{-\beta H}|j\rangle=\int_{y(\beta)=y(0)+\frac{2\pi}{N}\ell}\;{\mathcal{D}}y\exp(-S[y]), (30)

where y(τ)y(\tau) are paths obeying twisted boundary condition. Now, let us undo the twisted boundary condition in favor of a periodic boundary condition, and insert its effects into the action. Define

q(τ)=y(τ)2πNβτ,henceq(β)=q(0)mod 2π.\displaystyle q(\tau)=y(\tau)-\frac{2\pi\ell}{N\beta}\tau,\qquad{\rm hence}\;\;q(\beta)=q(0)\;\;{\rm mod}\;2\pi. (31)

Plugging this into original action S[y]S[y], we find

S[q,]=1g𝑑τ[12(q˙+2πNβ)2cos(Nq+2πβτ)]+iθ2π(dq+2πNβdτ)\displaystyle S[q,\ell]=\frac{1}{g}\int d\tau\left[{\textstyle{\frac{1}{2}}}\Big{(}\dot{q}+\frac{2\pi\ell}{N\beta}\Big{)}^{2}-\cos\Big{(}Nq+\frac{2\pi\ell}{\beta}\tau\Big{)}\right]+\frac{\mathrm{i}\theta}{2\pi}\int\Big{(}\mathrm{d}q+\frac{2\pi\ell}{N\beta}\mathrm{d}\tau\Big{)} (32)

which agrees exactly with (29), with the identification:

A(1)=2πNβdτ,A(0)=2πβτ\displaystyle A^{(1)}=\frac{2\pi\ell}{N\beta}\mathrm{d}\tau,\qquad A^{(0)}=\frac{2\pi\ell}{\beta}\tau (33)

Happily, these obey the conditions for a background N{\mathbb{Z}}_{N} gauge field (21).

So, the twisted boundary conditions in (30) are traded for a background field, and the partition function can equivalently be written as:

Z\displaystyle Z_{\ell} =q(β)=q(0)𝒟qexp(S[q,]),\displaystyle=\int_{q(\beta)=q(0)}{\mathcal{D}}q\;\exp(-S[q,\ell]), (34)

Note the explicit time dependence in the action (32). This is not a fluke, it is in fact exactly as it should. This is the classical background that we are dealing with.

One final remark concerns instanton equations in the N{\mathbb{Z}}_{N} background field. Using Bogomolny factorization with action (29), we find

(dq+A(1))=±[2V(Nq+A(0))]1/2,q(β)=q(0)mod 2π.\displaystyle(\mathrm{d}q+A^{(1)})=\pm[2V(Nq+A^{(0)})]^{1/2},\qquad\;\;q(\beta)=q(0)\;\;{\rm mod}\;2\pi. (35)

which is nothing but

y˙=±[2V(Ny)]1/2,y(β)=y(0)+2πNmod 2π.\displaystyle\dot{y}=\pm[2V(Ny)]^{1/2},\qquad\;\;y(\beta)=y(0)+\frac{2\pi}{N}\;\;{\rm mod}\;2\pi. (36)

in the yy parametrization. So, to find the fractional instanton solution, we can work with either parametrization. Either request periodic path solutions with background fields or paths solving (36) and satisfying twisted boundary conditions. These two simply related interpretation will help us to figure out a non-trivial solution in PSU(N)PSU(N) bundle in gauge theories on 3×S1{\mathbb{R}}^{3}\times S^{1}.

In the formulation where q(τ)q(\tau) is a periodic path in the (A(1),A(0))(A^{(1)},A^{(0)}) background (29), the topological term is

W=12π(dq+A(1)))=12πdq+12πNdA(0)1N1N\displaystyle W=\frac{1}{2\pi}\int(\mathrm{d}q+A^{(1)}))=\underbrace{\frac{1}{2\pi}\int\mathrm{d}q}_{\in{\mathbb{Z}}}+\underbrace{\frac{1}{2\pi N}\int\mathrm{d}A^{(0)}}_{\in\frac{1}{N}{\mathbb{Z}}}\in\frac{1}{N}{\mathbb{Z}} (37)

where the first term is integer valued as it is over periodic paths, and the winding number is fractional due to background. In the formulation where yy is a path satisfying twisted boundary condition (30),

W=12πdy1N\displaystyle W=\frac{1}{2\pi}\int\mathrm{d}y\in\frac{1}{N}{\mathbb{Z}} (38)

This is now fractional due to the twisted boundary condition of y(τ)y(\tau) field. The action corresponding to these fractional winding configurations are also fractional S=1NSIS=\frac{1}{N}S_{I} due to BPS bound.

Revealing fractional instantons formally: The existence of fractional instantons with fractional winding number is obvious from Fig.2. Expressing the partition function Z[(A(1),A(0))]Z(β,θ)Z[(A^{(1)},A^{(0)})]\equiv Z_{\ell}(\beta,\theta) as an fractional instanton sum.

Z(β,θ)\displaystyle Z_{\ell}(\beta,\theta) =tr[eβH𝖴]\displaystyle=\text{tr}[\mathrm{e}^{-\beta H}\mathsf{U}^{\ell}] (39)
=k=0N1ei2πkNeξcosθ+2πkN\displaystyle=\sum_{k=0}^{N-1}\mathrm{e}^{\mathrm{i}\frac{2\pi\ell k}{N}}\mathrm{e}^{\xi\cos\frac{\theta+2\pi k}{N}} (40)
=[NWn¯=0ξ2n¯++WN(n¯++WN)!n¯!eiWθ]eiNθ\displaystyle=\left[N\sum_{W\in{\mathbb{Z}}}\sum_{\overline{n}=0}^{\infty}\frac{\xi^{2\overline{n}+\ell+WN}}{(\overline{n}+\ell+WN)!\;\overline{n}!}\mathrm{e}^{\mathrm{i}W\theta}\right]\mathrm{e}^{\mathrm{i}\frac{\ell}{N}\theta} (41)
=[NWINW+(2ξ)eiWθ]eiNθ\displaystyle=\left[N\sum_{W\in{\mathbb{Z}}}I_{NW+\ell}(2\xi)\mathrm{e}^{\mathrm{i}W\theta}\right]\mathrm{e}^{\mathrm{i}\frac{\ell}{N}\theta} (42)

The second line immediately emerges from our knowledge of energy spectrum of the theory (10) and the fact that eigenstates of the Hamiltonian are just Bloch states, and [𝖴,H]=0[\mathsf{U},H]=0. To obtain the third line, we can expand the exponentials and sum over kk explicitly. Not surprisingly, this is equivalent to the dilute gas of fractional instantons in the path integral picture. The fourth line is a nice mathematical identity which tells us that series expansion of all modified Bessel functions are actually instanton sums. The sum is over n¯\overline{n}, and configurations that contribute are actually n¯\overline{n} fractional anti-instantons, and (n¯++WN)(\overline{n}+\ell+WN) fractional instantons. The winding number of such a configuration is W+NW+\frac{\ell}{N}, and the sum has an overall eiNθ\mathrm{e}^{\mathrm{i}\frac{\ell}{N}\theta} dependence that arise from the fractional part of winding number. These fractional instantons contribute to transition amplitudes in quantum mechanics.

Note that by turning on (A(1),A(0))(A^{(1)},A^{(0)})\equiv\ell background, we are not changing the theory or its Hilbert space, which, within Born-Oppenheimer approximation is just NN dimensional, that we denote as N{\cal H}_{N}. In simple words, we are summing over just transition amplitudes in quantum mechanics for states which are \ell units apart. The construction merely reveals the topological configurations which play important role in the full theory.

Probably, one of the counter-intuitive sounding aspects of this construction (yet widely familiar from the symmetric double-well potential, see ZinnJustin:2002ru ) is following. We keep repeating that single fractional instanton j{\cal I}_{j} is not contributing to the partition function, as it is not a periodic path. Yet, the energy gap in the theory is sourced by single fractional instanton effect. The reason is clear. Single fractional instanton gives rise to the transition matrix element between nearest neighbor sites, which in turn determine the splittings in the spectrum of the theory. Say, at θ=0\theta=0, the gap in the spectrum is given by

Egap=Ek=1Ek=0=2KeSI/N(cos2πN1),\displaystyle E_{\rm gap}=E_{k=1}-E_{k=0}=-2K\mathrm{e}^{-S_{I}/N}\Big{(}\cos\ \frac{2\pi}{N}-1\Big{)}, (43)

a single fractional instanton effect, similar to the gap in the symmetric double-well.

2.2 Uses of gauging N{\mathbb{Z}}_{N}

Discrete theta angle θp\theta_{p} = level pp Chern-Simons = picking Bloch state with momentum pp Now, we can gauge N{\mathbb{Z}}_{N} symmetry. Physically, gauging declares that instead of qq being physically equivalent to q+2πq+2\pi, it is identified with q+2πNq+\frac{2\pi}{N}. With this procedure, the target-space circle size is effectively reduced to 2πN\frac{2\pi}{N}. In other words, NN-site model is reduced to 11-site, and out of NN states in the lowest band, only one state survives. Gauging, unlike turning a background as described above, changes the Hilbert space, by diluting it. It reduces N{\cal H}_{N} down to one state Hilbert space 1{\cal H}_{1}.

Gauging N{\mathbb{Z}}_{N} symmetry amounts to summing over \ell, =0N1Z\sum_{\ell=0}^{N-1}Z_{\ell}. However, this is not the only choice in summation. In fact, the sum over Z(β)Z_{\ell}(\beta) can be done in NN-different ways, corresponding to NN discrete θ\theta angles, θp=2πpN\theta_{p}=\frac{2\pi p}{N}. This θp\theta_{p} is nothing but the 1d Chern-Simons term in the original topological gauge theory. As a result,

Z(TN/N)p\displaystyle Z_{(T_{N}/{\mathbb{Z}}_{N})_{p}} =𝒟A(1)𝒟A(0)Z[(A(1),A(0)),p]δ(NA(1)dA(0))\displaystyle=\int{\mathcal{D}}A^{(1)}{\mathcal{D}}A^{(0)}\;Z[(A^{(1)},A^{(0)}),p]\;\delta(NA^{(1)}-\mathrm{d}A^{(0)}) (44)
1N=0N1ei2πpNZ\displaystyle\equiv\frac{1}{N}\sum_{\ell=0}^{N-1}\mathrm{e}^{-\mathrm{i}\frac{2\pi\ell p}{N}}\;Z_{\ell} (45)

As already stated, as a result of gauging, the NN-site model reduce to 1-site model, and Hilbert space gets diluted by a factor of NN. Which state survives gauging is dictated by the choice of θp\theta_{p}, or equivalently, Chern-Simons level pp. Using (42)

Z(TN/N)p\displaystyle Z_{(T_{N}/{\mathbb{Z}}_{N})_{p}} =1N=0N1ei2πpNZ(β)\displaystyle=\frac{1}{N}\sum_{\ell=0}^{N-1}\mathrm{e}^{-\mathrm{i}\frac{2\pi\ell p}{N}}Z_{\ell}(\beta) (46)
=1N=0N1ei2πpNk=0N1ei2πkNeξcosθ+2πkN\displaystyle=\frac{1}{N}\sum_{\ell=0}^{N-1}\mathrm{e}^{-\mathrm{i}\frac{2\pi\ell p}{N}}\sum_{k=0}^{N-1}\mathrm{e}^{\mathrm{i}\frac{2\pi\ell k}{N}}\mathrm{e}^{\xi\cos\frac{\theta+2\pi k}{N}} (47)
=k=0N1δpkeξcosθ+2πkN\displaystyle=\sum_{k=0}^{N-1}\delta_{pk}\mathrm{e}^{\xi\cos\frac{\theta+2\pi k}{N}} (48)
=eξcosθ+2πpN\displaystyle=\;\mathrm{e}^{\xi\cos\frac{\theta+2\pi p}{N}} (49)

Only the Bloch state with momentum 2πpN\frac{2\pi p}{N} in the original Brillouin zone of the NN-site model survives in the Born-Oppenheimer approximation.

If we relax Born-Oppenheimer approximation and consider the whole Hilbert space, then, the gauging procedure will pick just one-state with Bloch momentum pp from each band. In this sense, the whole Hilbert space gets diluted by a factor of NN.

Finally, one can formalize the connection between the topological configurations in the (TN/N)p(T_{N}/{\mathbb{Z}}_{N})_{p} model, the TNT_{N} model with background (A(1),A(0))(A^{(1)},A^{(0)})\equiv\ell or the original theory without background fields. This is one of the crux of the matter that is most important in gauge theory where things are a bit more involved. The object that we would identify as instanton with winding number 11 and action S0S_{0} in the (TN/N)(T_{N}/{\mathbb{Z}}_{N}) model is the fractional instanton with winding number 1N\frac{1}{N} and action S0=SINS_{0}=\frac{S_{I}}{N} of the TNT_{N} model. Therefore, (TN/N)p(T_{N}/{\mathbb{Z}}_{N})_{p} theory has non-perturbative data that is relevant to the original TNT_{N} theory.

Let us make this a bit more precise. When we gauge N{\mathbb{Z}}_{N}, and sum over all (A(1),A(0))(A^{(1)},A^{(0)}), (instead of just turning on a fixed classical N{\mathbb{Z}}_{N} background), the distinction between isolated minima and fractional instantons j{\cal I}_{j} disappear. We can in fact call

j1,jbygaugingN\displaystyle{\cal I}_{j}\mapsto{\cal I}_{1},\;\forall j\qquad{\rm by\;gauging}\;{\mathbb{Z}}_{N} (50)

Then, we can make an obstruction vs. admission list:

  • Instantons of the form [1]k,k0modN[{\cal I}_{1}]^{k},k\neq 0\;{\rm mod}\;N, which are closed paths in S1/NS^{1}/{\mathbb{Z}}_{N} cannot be lifted to S1S^{1}.

  • Instanton anti-instanton pairs of the form [1]k[¯1]k[{\cal I}_{1}]^{k}[\overline{\cal I}_{1}]^{k} are closed paths in S1/NS^{1}/{\mathbb{Z}}_{N}, and they can be lifted to S1S^{1} with W=0W=0. These configurations possess fractional quantized action in the original (ungauged) formulation.

  • Instantons in multiples of NN [1]Nq,q[{\cal I}_{1}]^{Nq},\;q\in{\mathbb{Z}} can also be lifted to S1S^{1}, with W=qW=q. These configurations possess integer quantized action.

These facts are by no means surprising, in fact, rather simple. But this leads us to the following intriguing situation.

  • Since in order to find the saddles in TNT_{N} model, we need to directly look at either

    • TNT_{N} with classical N{\mathbb{Z}}_{N} background (A(1),A(0))(A^{(1)},A^{(0)}) or equivalently, N{\mathbb{Z}}_{N} twisted boundary conditions

    • (TN/N)p(T_{N}/{\mathbb{Z}}_{N})_{p} models where N{\mathbb{Z}}_{N} is gauged

    we claim that in order to find the relevant saddles of SU(N)SU(N) gauge theories, we should first consider either SU(N)SU(N) with background gauge field (B(2),B(1))(B^{(2)},B^{(1)}) for the N[1]{\mathbb{Z}}_{N}^{[1]} 1-form symmetry or (SU(N)/N)p(SU(N)/{\mathbb{Z}}_{N})_{p} theory where N[1]{\mathbb{Z}}_{N}^{[1]} is gauged.999This is one reason we parallel the notation of quantum mechanics with gauge theory, and emphasize the correspondence: TNSU(N)T_{N}\Leftrightarrow SU(N) and (TN/N)p(SU(N)/N)p(T_{N}/{\mathbb{Z}}_{N})_{p}\Leftrightarrow(SU(N)/{\mathbb{Z}}_{N})_{p}. The latter systems has non-perturbative data that is relevant to the original systems, where it is interpreted as fractional. After finding these configurations, we can patch them up to find the ones that can be lifted to SU(N)SU(N) theory. These are fractional action (e.g. 2SI/N2S_{I}/N) configurations that contribute to the partition function of SU(N)SU(N) theory.

3 N1\mathbb{CP}^{N-1} in 2d and N{\mathbb{Z}}_{N} TQFT

The N1\mathbb{CP}^{N-1} model may be expressed as a U(1)U(1) gauge theory with an NN-component elementary field zi(x)z_{i}(x), obeying the constraint i=1N|zi(x)|2=1\sum_{i=1}^{N}|z_{i}(x)|^{2}=1. The theory is invariant under the action of a SU(N)SU(N) rotation on the zi(x)z_{i}(x) field. However, SU(N)SU(N) does not act faithfully, since zi(x)z_{i}(x) is not a gauge invariant operator. The global symmetry that acts faithfully on states in Hilbert space is

G=SU(N)/N=PSU(N)\displaystyle G=SU(N)/\mathbb{Z}_{N}=PSU(N) (51)

In other words, the N\mathbb{Z}_{N} center of the SU(N)SU(N) is also part of the U(1)U(1) gauge structure, and should not be counted in the global symmetry. All gauge invariant local operators of the N1\mathbb{CP}^{N-1} model are in PSU(N)PSU(N) representations.

We would like to turn on an SU(N)/N{SU(N)/{\mathbb{Z}}_{N}} background. To do this, we first introduce a background SU(N)SU(N) one-form gauge field A(1)A^{(1)} Gaiotto:2017yup ; Tanizaki:2017qhf . Further, to gauge the N\mathbb{Z}_{N} part, we turn on a pair of U(1)U(1) 2-form and 1-form gauge fields (B(2),B(1))(B^{(2)},B^{(1)}), and a Lagrange multiplier F(0)F^{(0)}:

Ztop,p\displaystyle Z_{{\rm top},p} =𝒟B(2)𝒟B(1)𝒟F(0)eiF(0)(NB(2)dB(1))+ipB(2)\displaystyle=\int{\mathcal{D}}B^{(2)}{\mathcal{D}}B^{(1)}{\mathcal{D}}F^{(0)}\;\mathrm{e}^{\mathrm{i}\int F^{(0)}\wedge(NB^{(2)}-\mathrm{d}B^{(1)})+\mathrm{i}p\int B^{(2)}} (52)

Therefore,

NB(2)=dB(1),NB(2)=dB(1)=2π\displaystyle NB^{(2)}=\mathrm{d}B^{(1)},\qquad N\int B^{(2)}=\int\mathrm{d}B^{(1)}=2\pi{\mathbb{Z}} (53)

and also promote the SU(N)SU(N) background gauge field A(1)A^{(1)} to a U(N)U(N) background gauge field A~(1)\tilde{A}^{(1)}:

A~(1)=A(1)+𝟙NB(1)(locally)\displaystyle\tilde{A}^{(1)}=A^{(1)}+\frac{{\mathbbm{1}}}{N}B^{(1)}\qquad({\rm locally}) (54)

The covariant derivative in the A~(1)\tilde{A}^{(1)} background takes the form

D(a,A~(1))zdziaz+iA~(1)z\displaystyle D(a,\widetilde{A}^{(1)})z\equiv\mathrm{d}z-\mathrm{i}az+\mathrm{i}\widetilde{A}^{(1)}z (55)

where aa is the original dynamical U(1)U(1) gauge field and A~(1)\tilde{A}^{(1)} is classical. This couples the N1\mathbb{CP}^{N-1} model to an SU(N)/N{SU(N)/{\mathbb{Z}}_{N}} background field, and the theory now has a 1-form (background) gauge invariance

B(2)B(2)+dλ(1),B(1)B(1)+Nλ(1),A~(1)A~(1)+λ(1),aa+λ(1).\displaystyle B^{(2)}\mapsto B^{(2)}+\mathrm{d}\lambda^{(1)},\;\;B^{(1)}\mapsto B^{(1)}+N\lambda^{(1)},\;\;\widetilde{A}^{(1)}\mapsto\widetilde{A}^{(1)}+\lambda^{(1)},\;\;a\mapsto a+\lambda^{(1)}. (56)

which couples the dynamical field with the classical background. The gauge invariant combination of dynamical and classical fields are given by

(a𝟙A~(1)),(daB(2))\displaystyle(a\mathbbm{1}-\tilde{A}^{(1)}),\qquad(\mathrm{d}a-B^{(2)}) (57)

Therefore, the action of the theory minimally coupled to the background field is given by

Sbgrd=2g2[D(a,A~(1))zD(a,A~(1))z]iθ2π[daB(2)],S_{\mathrm{bgrd}}={2\over g^{2}}\int[D(a,\widetilde{A}^{(1)})z^{\dagger}\wedge*D(a,\widetilde{A}^{(1)})z]-\mathrm{i}{\theta\over 2\pi}\int[\mathrm{d}a-B^{(2)}], (58)

BPS instantons can be found by using Bogomolnyi factorization:

D(a,A~(1))z=±iD(a,A~(1))z\displaystyle D(a,\widetilde{A}^{(1)})z=\pm\mathrm{i}*D(a,\widetilde{A}^{(1)})z (59)

As a result of classical PSU(N)PSU(N) background, the topological charge and action are no longer quantized in integer units, just like our simple QM problem. Rather, we have, as a perfect counterpart of (37),

Q=12π(daB(2))=da2π+1NdB(1)2π1N1N\displaystyle Q={1\over 2\pi}\int(\mathrm{d}a-B^{(2)})=\underbrace{\int\frac{\mathrm{d}a}{2\pi}}_{\in\mathbb{Z}}+\underbrace{{1\over N}\int\frac{\mathrm{d}B^{(1)}}{2\pi}}_{\in\frac{1}{N}\mathbb{Z}}\in\frac{1}{N}\mathbb{Z} (60)

The first term is integer valued because aa is in U(1)U(1) bundle. The second term arise from proper quantization of dB(1)2π\frac{\mathrm{d}B^{(1)}}{2\pi}. Due to BPS nature of these configurations, their action is fractional S=SIN=4πg2NS=\frac{S_{I}}{N}=\frac{4\pi}{g^{2}N}.

Despite the fact that these saddles do not contribute to the partition function of the N1\mathbb{CP}^{N-1} model without background fields, certain pairs of such configurations, such as

(a,¯a)forwhichQ=1N1N=0,S=(1N+1N)SI=2NSI\displaystyle({\cal I}_{a},\overline{\cal I}_{a})\qquad{\rm for\;which}\;Q=\frac{1}{N}-\frac{1}{N}=0,\qquad\qquad S=\Big{(}\frac{1}{N}+\frac{1}{N}\Big{)}S_{I}=\frac{2}{N}S_{I} (61)

do. There is no obstruction for them to contribute. Similarly,

a=1NaforwhichQ=N×1N=1,S=N×1NSI=SI\displaystyle\prod_{a=1}^{N}{\cal I}_{a}\qquad{\rm for\;which}\;Q=N\times\frac{1}{N}=1\in{\mathbb{Z}},\qquad\qquad S=N\times\frac{1}{N}S_{I}=S_{I} (62)

also contributes.

More technically, Q=1NQ=\frac{1}{N} configurations live in a PSU(N)PSU(N) bundle which cannot be uplifted to an SU(N)SU(N) bundle. Yet, composite configurations such as (a,¯a)({\cal I}_{a},\overline{\cal I}_{a}) pair and a=1Na\prod_{a=1}^{N}{\cal I}_{a} cary integer topological charge and can be lifted to an SU(N)SU(N) bundle.

This argument proves the existence of non-BPS fractional action configurations in the vacuum of the N1\mathbb{CP}^{N-1} model on arbitrarily large-T2T^{2}, whose non-perturbative significance is controlled by

Sa=SN\displaystyle S_{a}=\frac{S_{\cal I}}{N} =4πg2N,a,2de4πg2(μ)N+iθN,Λ=μa,2d,\displaystyle=\frac{4\pi}{g^{2}N},\qquad{\cal I}_{a,2d}\sim\mathrm{e}^{-\frac{4\pi}{g^{2}(\mu)N}+i\frac{\theta}{N}},\qquad\Lambda=\mu{\cal I}_{a,2d}, (63)

where Λ\Lambda is the strong scale of the theory.

The main point is following: Even when we consider N1\mathbb{CP}^{N-1} sigma model without any insertion of B(2)B^{(2)} flux, we should still consider the field space in the U(1)/NU(1)/{\mathbb{Z}}_{N} bundle, where both topological charge and action are fractional. Then, we should sum over configurations that can be lifted to U(1)U(1) bundle. This is a well-defined mathematical prescription. This, inevitably, leads us to the statement that the non-perturbative expansion parameter in the theory is eSI/N=e4πg2(μ)N\mathrm{e}^{-S_{I}/N}=\mathrm{e}^{-\frac{4\pi}{g^{2}(\mu)N}} and it is exponentially more important than BPST instanton:

eSINe2SINe3SINeSIinstanton\displaystyle\mathrm{e}^{-\frac{S_{I}}{N}}\gg\mathrm{e}^{-2\frac{S_{I}}{N}}\gg\mathrm{e}^{-3\frac{S_{I}}{N}}\gg\ldots\gg\underbrace{\mathrm{e}^{-{S_{I}}}}_{\rm instanton} (64)

Note that \gg indicates here exponential hierarchies. In particular, at large-NN limit, 2d instantons scale as e4πg2(μ)eO(N1)\mathrm{e}^{-\frac{4\pi}{g^{2}(\mu)}}\sim\mathrm{e}^{-O(N^{1})} and is completely suppressed, while these configurations persist as e4πg2(μ)NeO(N0)\mathrm{e}^{-\frac{4\pi}{g^{2}(\mu)N}}\sim\mathrm{e}^{-O(N^{0})}.

3.1 ’t Hooft flux using (SU(N)×U(1))N\frac{(SU(N)\times U(1))}{\mathbb{Z}_{N}}

In N1\mathbb{CP}^{N-1} model, by just using U(1)U(1) gauge structure, it is impossible to impose ’t Hooft’s twisted boundary conditions. If we turn on an SU(N)SU(N) background field, this also allows us to turn on a non-trivial ’t Hooft flux corresponding to twisted boundary conditions, associated with

SU(N)×U(1)N\displaystyle\frac{SU(N)\times U(1)}{\mathbb{Z}_{N}} (65)

Let Ωμ,ωμ\Omega_{\mu},\omega_{\mu} denote the transition functions for the non-abelian and abelian parts. Ω1(x2)\Omega_{1}(x_{2}) is the transition function between (x1+L1,x2)(x1,x2)(x_{1}+L_{1},x_{2})\sim(x_{1},x_{2}). Ωμ\Omega_{\mu} function is independent of xμx_{\mu}, but depends on other coordinate. For N1\mathbb{CP}^{N-1} fields, we impose

z(x1+L1,x2)\displaystyle z(x_{1}+L_{1},x_{2}) =Ω1(x2)z(x1,x2)ω11(x2),\displaystyle=\Omega_{1}(x_{2})z(x_{1},x_{2})\omega_{1}^{-1}(x_{2}),\qquad (66)
z(x1,x2+L2)\displaystyle z(x_{1},x_{2}+L_{2}) =Ω2(x1)z(x1,x2)ω21(x1)\displaystyle=\Omega_{2}(x_{1})z(x_{1},x_{2})\omega_{2}^{-1}(x_{1}) (67)

where ω1(x2)eiα1(x2)\omega_{1}(x_{2})\equiv\mathrm{e}^{\mathrm{i}\alpha_{1}(x_{2})} is pure phase. We can connect z(x1+L1,x2+L2)z(x_{1}+L_{1},x_{2}+L_{2}) with z(x1,x2)z(x_{1},x_{2}) in two different ways. For consistency at the corners, the transition functions must satisfy

Ω1(L2)Ω2(0)\displaystyle\Omega_{1}(L_{2})\Omega_{2}(0) =Ω2(L1)Ω1(0)ei2πN\displaystyle=\Omega_{2}(L_{1})\Omega_{1}(0)\mathrm{e}^{\mathrm{i}\frac{2\pi\ell}{N}} (68)
ω1(L2)ω2(0)\displaystyle\omega_{1}(L_{2})\omega_{2}(0) =ω2(L1)ω1(0)ei2πN\displaystyle=\omega_{2}(L_{1})\omega_{1}(0)\mathrm{e}^{\mathrm{i}\frac{2\pi\ell}{N}} (69)

corresponding to =0,1,,N1\ell=0,1,\ldots,N-1-units of ’t Hooft flux. This is the gauge invariant data in (67). The gauge covariance of the flux condition tells us that under a gauge transformation, e.g., z(0,x2)g(0,x2)z(0,x2)h1(0,x2)z(0,x_{2})\rightarrow g(0,x_{2})z(0,x_{2})h^{-1}(0,x_{2}), the transition matrices transform as Ω1(x2)g(L1,x2)Ω1(x2)g1(0,x2)\Omega_{1}(x_{2})\rightarrow g(L_{1},x_{2})\Omega_{1}(x_{2})g^{-1}(0,x_{2}) etc. So, there is some gauge freedom in the choice of transition matrices.

As an example, we can satisfy relations (69) with the choices Ω1(x2)=ei2πTNx2L2\Omega_{1}({x_{2}})=\mathrm{e}^{\mathrm{i}\frac{2\pi T\ell}{N}\frac{x_{2}}{L_{2}}}, Ω2=1\Omega_{2}=1, ω1(x2)=ei2πNx2L2\omega_{1}({x_{2}})=\mathrm{e}^{\mathrm{i}\frac{2\pi\ell}{N}\frac{x_{2}}{L_{2}}}, ω2=1\omega_{2}=1 where T=diag(1,,1,(N1))T={\rm diag}(1,\ldots,1,-(N-1)). As a result,

Ω1(L2)=Ω1(0)ei2πN\displaystyle\Omega_{1}(L_{2})=\Omega_{1}(0)\mathrm{e}^{\mathrm{i}\frac{2\pi\ell}{N}} (70)
ω1(L2)=ω1(0)ei2πN\displaystyle\omega_{1}(L_{2})=\omega_{1}(0)\mathrm{e}^{\mathrm{i}\frac{2\pi\ell}{N}} (71)

In the U(1)U(1) gauge theory, only =0\ell=0 is allowed. If we turn on just SU(N)SU(N) background without turning on a background N{\mathbb{Z}}_{N} gauge field, \ell is still zero. (This does not mean it is uninteresting, see Section 3.3.) However, turning on a PSU(N)PSU(N) background field, we can absorb the aperiodicity of the transition function of one part into the other.

3.2 Fractional instantons in N1\mathbb{CP}^{N-1} on large M2M_{2}

Below, we review the standard textbook construction of instantons and demonstrate how it is modified once a PSU(N)PSU(N) background is turned on.

Instantons in 2d are topologically non-trivial configurations with integer winding number. To see this, note that finite action demands Dμz=0D_{\mu}z=0 as |x||x|\rightarrow\infty, a covariantly constant configuration.

zi(x)=nieiα(x),nin¯i=1\displaystyle z_{i}(x)=n_{i}e^{i\alpha(x)},\;\;n_{i}\overline{n}_{i}=1 (72)

where nin_{i} is a constant vector with unit norm. Therefore, in the perturbative vacuum,

a=dα\displaystyle a=\mathrm{d}\alpha (73)

which are pure gauge configurations. Let ϕ\phi parametrize S1S^{1}_{\infty} boundary of 2{\mathbb{R}}^{2}. Pure gauge configurations are of the form α(ϕ)=Wϕ,W\alpha(\phi)=W\phi,W\in{\mathbb{Z}}. This gives a mapping from the boundary of the physical space to the disjoint classes of gauge transformations, which correspond to homotopy classes π1(S1)=\pi_{1}(S^{1})={\mathbb{Z}}. This is the instanton number or topological charge

Q=12πda=12πa=12πdα.\displaystyle Q=\frac{1}{2\pi}\int\mathrm{d}a=\frac{1}{2\pi}\oint a=\frac{1}{2\pi}\oint\mathrm{d}\alpha\in{\mathbb{Z}}\;. (74)

Assume now we turn on a PSU(N)PSU(N) background field. Locally, the U(N)U(N) gauge field can be written as:

𝒜(1)=``A~(1)+a𝟙"locally\displaystyle{{\cal A}^{(1)}}=``\tilde{A}^{(1)}+a{\mathbbm{1}}"\qquad\rm locally (75)

where the gauge connection 𝒜(1){{\cal A}^{(1)}} is in U(N)(U(1)×SU(N))/NU(N)\cong(U(1)\times SU(N))/{\mathbb{Z}}_{N}.

Example: The simplest example of a U(N)U(N) field of this form is following. Consider F12=2π12NL1L2TF_{12}=\frac{2\pi\ell_{12}}{NL_{1}L_{2}}T, with T=diag(1,1,,1,(N1))T=\text{diag}(1,1,\dots,1,-(N-1)) is a Cartan generator, and f12=2π12NL1L2𝟙f_{12}=-\frac{2\pi\ell_{12}}{NL_{1}L_{2}}\mathbbm{1}. Hence, F𝒜(1)=2π12L1L2diag(0,0,,0,1)F_{{{\cal A}^{(1)}}}=-\frac{2\pi\ell_{12}}{L_{1}L_{2}}\text{diag}(0,0,\dots,0,1) is a U(N)U(N) gauge field strength which cannot be decomposed into SU(N)SU(N) and U(1)U(1) parts, but can be decomposed to PSU(N)PSU(N) and U(1)/NU(1)/{\mathbb{Z}}_{N} parts. The corresponding gauge fields are (setting A1=0,a1=0A_{1}=0,a_{1}=0), A2(x1)=2π12NL1L2Tx1,a2(x1)=2π12NL1L2𝟙x1A_{2}(x_{1})=\frac{2\pi\ell_{12}}{NL_{1}L_{2}}Tx_{1},a_{2}(x_{1})=-\frac{2\pi\ell_{12}}{NL_{1}L_{2}}\mathbbm{1}x_{1} and the transition functions are given by Ω1(x2)=ei2π12NL2Tx2\Omega_{1}(x_{2})=\mathrm{e}^{-\mathrm{i}\frac{2\pi\ell_{12}}{NL_{2}}Tx_{2}}, ω1(x2)=ei2π12NL2𝟙x2\omega_{1}(x_{2})=\mathrm{e}^{-\mathrm{i}\frac{2\pi\ell_{12}}{NL_{2}}{\mathbbm{1}}x_{2}}, Ω2=1,ω2=1\Omega_{2}=1,\omega_{2}=1. Hence, the transition functions obey (69), corresponding to the insertion of 12\ell_{12} units of ’t Hooft flux.

Now, the topological term can be written as

12πNtr[F𝒜(1)]\displaystyle\frac{1}{2\pi N}\int\text{tr}[F_{{{\cal A}^{(1)}}}] (76)

This reduces to (74) if A~(1){{\tilde{A}}^{(1)}} is in SU(N)SU(N). But if A~(1){{\tilde{A}}^{(1)}} is a connection in a PSU(N)PSU(N) bundle which cannot be lifted to an SU(N)SU(N) bundle, the topological charge in (76) is no longer quantized in integer units, rather it can assume fractional values, multiples of 1/N1/N. In this case, the dynamical field aa is not a simple U(1)U(1) gauge field, but a U(1)/NU(1)/{\mathbb{Z}}_{N} field.

Let us make this abstract argument concrete. For U(N)U(N) gauge structure, the covariantly constant configurations as |x||x|\rightarrow\infty takes the form:

zi(|x|)=eiT(ϕ)eiβ(ϕ)ni,\displaystyle z_{i}(|x|\rightarrow\infty)=e^{iT(\phi)}e^{i\beta(\phi)}n_{i}, (77)

instead of (72). We choose

T(ϕ)+β(ϕ)=ϕN(11(N1))+ϕN𝟙=(00ϕ)\displaystyle T(\phi)+\beta(\phi)=\frac{\phi}{N}\left(\begin{array}[]{cccc}-1&&&\\ &-1&&\\ &&\ddots&\\ &&&(N-1)\\ \end{array}\right)+\frac{\phi}{N}{\mathbbm{1}}=\left(\begin{array}[]{cccc}0&&&\\ &0&&\\ &&\ddots&\\ &&&\phi\\ \end{array}\right) (86)

such that eiT(ϕ)eiβ(ϕ)e^{iT(\phi)}e^{i\beta(\phi)} is a proper U(N)U(N) gauge transformation. However, eiT(ϕ)\mathrm{e}^{\mathrm{i}T(\phi)} does not live in an SU(n)SU(n) bundle, rather it is in the PSU(n)PSU(n) bundle. Similarly, eiβ(ϕ)e^{i\beta(\phi)} can no longer be characterized by a proper U(1)U(1) gauge transformation, instead lives in U(1)/NU(1)/{\mathbb{Z}}_{N}.

eiT(ϕ+2π)\displaystyle\mathrm{e}^{\mathrm{i}T(\phi+2\pi)} =eiT(ϕ)ei2πN\displaystyle=\mathrm{e}^{\mathrm{i}T(\phi)}\mathrm{e}^{-\mathrm{i}\frac{2\pi}{N}} (87)
eiβ(ϕ+2π)\displaystyle\mathrm{e}^{\mathrm{i}\beta(\phi+2\pi)} =eiβ(ϕ)e+i2πN\displaystyle=\mathrm{e}^{\mathrm{i}\beta(\phi)}\mathrm{e}^{+\mathrm{i}\frac{2\pi}{N}} (88)

This is an example of A~(1)\tilde{A}^{(1)} being a connection in PSU(n)PSU(n) bundle which cannot be lifted to SU(n)SU(n) bundle.

As a result, the dynamical gauge connection aa can no longer be characterized by a U(1)U(1) gauge field, and 𝑑a\int da is not a multiple of 2π2\pi. Instead, the dynamical aa and background A~(1)\widetilde{A}^{(1)} are now intertwined in an inseparable way. Denoting g(ϕ)=eiT(ϕ)eiβ(ϕ)=Diag(1,,1,eiNβ(ϕ))g(\phi)=\mathrm{e}^{iT(\phi)}\mathrm{e}^{i\beta(\phi)}={\rm Diag}\left(1,\ldots,1,\mathrm{e}^{\mathrm{i}N\beta(\phi)}\right), the topological charge (76) can now be written as

1N12πtr[ig1dg]=12πdβ=1N\displaystyle\frac{1}{N}\frac{1}{2\pi}\int\text{tr}[-ig^{-1}\mathrm{d}g]=\frac{1}{2\pi}\int\mathrm{d}\beta=\frac{1}{N}{\mathbb{Z}} (89)

and it is quantized in units of 1N\frac{1}{N}, same as (60). Note that this is the counterpart of (38) in quantum mechanics. It is clear that Q=1NQ=\frac{1}{N} is in a U(1)/NU(1)/{\mathbb{Z}}_{N} bundle, and cannot be lifted to an U(1)U(1) bundle. Yet, composite configurations such as (a,¯a)({\cal I}_{a},\overline{\cal I}_{a}) pair and a=1Na\prod_{a=1}^{N}{\cal I}_{a} can be lifted to U(1)U(1) bundle. As in our quantum mechanics example, these composites are the non-trivial fractional action configurations contributing to the partition function of original N1\mathbb{CP}^{N-1} even in the absence of PSU(N)PSU(N) background.

Refer to caption
Figure 4: Mapping from S1S^{1}_{\infty} to field space. Q=12πa=12πdβ1NQ=\frac{1}{2\pi}\oint a=\frac{1}{2\pi}\oint\mathrm{d}\beta\in\frac{1}{N}{\mathbb{Z}}, because eiβ(ϕ)\mathrm{e}^{\mathrm{i}\beta(\phi)} is in U(1)/NU(1)/{\mathbb{Z}}_{N} bundle. Figure is for 3\mathbb{CP}^{3} model.

3.3 Flavor twist and ’t Hooft flux, and relation to ×S1{\mathbb{R}}\times S^{1}

In a joint work with Dunne Dunne:2012ae , we proved that if N1\mathbb{CP}^{N-1} model is compactified on a cylinder with an SU(N)SU(N) flavor twisted boundary condition, and if the SU(N)SU(N) background is symmetric under N{\mathbb{Z}}_{N} shifts up to permutations, i.e,

z~(x1+L1,x2)=ΩFz~(x1,x2),\displaystyle\tilde{z}(x_{1}+L_{1},x_{2})=\Omega_{F}\tilde{z}(x_{1},x_{2}),\qquad (90)
ΩFDiag(ei2πμ1,ei2πμN)=Diag(1,ei2πN,,ei2π(N1)N)\displaystyle\Omega_{F}\sim{\rm Diag}\left(\mathrm{e}^{\mathrm{i}2\pi\mu_{1}}\ldots,\mathrm{e}^{\mathrm{i}2\pi\mu_{N}}\right)={\rm Diag}\left(1,\mathrm{e}^{\mathrm{i}\frac{2\pi}{N}},\ldots,\mathrm{e}^{\mathrm{i}\frac{2\pi(N-1)}{N}}\right) (91)

then there exists instantons with fractional topological charge and action,

W=1N,S=1N4πg2(weakcoupling,noB(2)flux)\displaystyle W=\frac{1}{N},\qquad S=\frac{1}{N}\frac{4\pi}{g^{2}}\qquad({\rm weak\;coupling,\;no\;}B^{(2)}{\;\rm flux}) (92)

in the semi-classical domain on ×SL1{\mathbb{R}}\times S^{1}_{L}. See also Fujimori:2017oab ; Fujimori:2018kqp ; Ishikawa:2019tnw ; Misumi:2014jua ; Misumi:2019upg . Fractional instantons are also described in earlier work on N1\mathbb{CP}^{N-1} in Eto:2004rz ; Eto:2006pg ; Eto:2006mz ; Brendel:2009mp ; Bruckmann:2007zh .

Refer to caption
Figure 5: When the moduli parameter of instanton is smaller than the scale set by the ΩF\Omega_{F} background, instanton looks like a single lump (first figure), and when the moduli parameter is large, it fractionates to NN-pieces (second figure). Corresponding gauge holonomies for small and large instantons are also depicted, demonstrating fractionalization. For these solutions, ’t Hooft flux is set to zero. TQFT argument tells us that even when the instanton does not physically split to parts, it can still be viewed as a composite of NN W=1/NW=1/N fractional instantons. Figure borrowed from Dunne:2012ae .

It is conjectured that this regime is continuously connected to the theory at large ×SL1{\mathbb{R}}\times S^{1}_{L} and 2{\mathbb{R}}^{2}. The rationale behind this is following. The above construction, in the operator formalism translates to

ZΩF\displaystyle Z_{\Omega_{F}} =tr[eL1HΩ^F]\displaystyle=\text{tr}[\mathrm{e}^{-L_{1}H}\widehat{\Omega}_{F}] (93)

where Ω^F=k=1Nei2πkNQ^k=eik=1N2πkNQ^k\widehat{\Omega}_{F}=\prod_{k=1}^{N}\mathrm{e}^{\mathrm{i}\frac{2\pi k}{N}\widehat{Q}_{k}}=\mathrm{e}^{\mathrm{i}\sum_{k=1}^{N}\frac{2\pi k}{N}\widehat{Q}_{k}} where Q^k\widehat{Q}_{k} is the number operator associated with zkz_{k} quanta. The Hilbert space of N1\mathbb{CP}^{N-1} model fills in representation of PSU(N)PSU(N), singlet, adjoint etc. For example, eL1HΩ^F|(z¯izj)=eL1Madjei(ij)2πN|(z¯izj)\mathrm{e}^{-L_{1}H}\widehat{\Omega}_{F}|(\overline{z}_{i}z^{j})\rangle=\mathrm{e}^{-L_{1}M_{\rm adj}}\mathrm{e}^{\mathrm{i}(i-j)\frac{2\pi}{N}}|(\overline{z}_{i}z^{j})\rangle and the state sum of these states leads to (1)eL1Madj(-1)\mathrm{e}^{-L_{1}M_{\rm adj}} instead of (N21)eL1Madj(N^{2}-1)\mathrm{e}^{-L_{1}M_{\rm adj}} that would occur in thermal state sum. There is also a singlet limxyz¯i(x)eixyazi(y)\lim_{x\rightarrow y}\overline{z}_{i}(x)\mathrm{e}^{\mathrm{i}\int_{x}^{y}a}z^{i}(y) which is almost degenerate with the adjoint. (The difference is O(1/N)O(1/N) Witten:1978bc ). As a result, there are extremely powerful spectral cancellation in the Hilbert space \cal H of N1\mathbb{CP}^{N-1}. In particular, quite remarkably,

limNZΩF(L)=eLEground\lim_{N\rightarrow\infty}Z_{\Omega_{F}}(L)=\mathrm{e}^{-LE_{\rm ground}} (94)

which means no state apart from the ground state contributes to the twisted partition function. This is the idea of quantum distillation, which tells us that the ZΩF(L)Z_{\Omega_{F}}(L) must be an analytic function of LL Dunne:2018hog , similar to cancellation that takes place in index calculations in supersymmetric theories Witten:1982df . Note that the cancellation in the index calculation is a result of supersymmetry, in our case, the theory is bosonic and we engineered the spectral cancellation by using its global symmetry. The fact that this is possible is a remarkable aspect of the theory.

Let us now see if we can learn something deeper from this construction concerning strongly coupled QFT, and non-perturbative configurations that contribute to path integrals. Since anomalies are independent of the strength of the coupling, we can try to take advantage of that. The first hint in this direction is that a mixed anomaly between PSU(N)PSU(N) and 𝖢\mathsf{C} on 2{\mathbb{R}}^{2} survives upon compactification on ×S1{\mathbb{R}}\times S^{1} Tanizaki:2017qhf if and only if ΩF\Omega_{F}- twisted boundary conditions (91) are used, and appropriate B(2)=A1d(1)L1dx2B^{(2)}=A^{(1)}_{\rm 1d}\wedge L^{-1}\mathrm{d}x_{2} background is turned on.

Now, in the presence of B(2)B^{(2)} on a general 𝐓2\mathbf{T}^{2}, we showed that fractional topological charge and fractional action configurations exist. Importantly, this statement is not restricted to only Slarge1×Ssmall1S^{1}_{\rm large}\times S^{1}_{\rm small}, but also valid at for Slarge1×Slarge1S^{1}_{\rm large}\times S^{1}_{\rm large}, where the long distance theory is necessarily strongly coupled.

To connect weak and strong coupling regimes, we reconsider B(2)B^{(2)} flux along with ΩF\Omega_{F} background. We use

z~(x1+L1,x2)\displaystyle\tilde{z}(x_{1}+L_{1},x_{2}) =Ω1F(x2)z~(x1,x2)ω11(x2),\displaystyle=\Omega_{1}^{F}(x_{2})\tilde{z}(x_{1},x_{2})\omega_{1}^{-1}(x_{2}),\qquad (95)
z~(x1,x2+L2)\displaystyle\tilde{z}(x_{1},x_{2}+L_{2}) =Ω2(x1)z~(x1,x2)ω21(x1)\displaystyle=\Omega_{2}(x_{1})\tilde{z}(x_{1},x_{2})\omega_{2}^{-1}(x_{1}) (96)

where we embedded both B(2)B^{(2)} flux and ΩF\Omega_{F} background into transition matrix:

Ω1F(x2)\displaystyle\Omega_{1}^{F}({x_{2}}) =ei2πTNx2L2ΩF,Ω2(x1)=1,\displaystyle=\mathrm{e}^{\mathrm{i}\frac{2\pi T\ell}{N}\frac{x_{2}}{L_{2}}}\Omega_{F},\qquad\Omega_{2}(x_{1})=1, (97)
ω1(x2)\displaystyle\omega_{1}({x_{2}}) =ei2πNx2L2,ω2(x1)=1\displaystyle=\mathrm{e}^{\mathrm{i}\frac{2\pi\ell}{N}\frac{x_{2}}{L_{2}}},\qquad\qquad\omega_{2}(x_{1})=1 (98)

This construction, in the operator formalism, amounts to

ZΩF,\displaystyle Z_{\Omega_{F},\ell} =tr[eL1HΩ^F𝖴^]\displaystyle=\text{tr}[\mathrm{e}^{-L_{1}H}\widehat{\Omega}_{F}\widehat{\mathsf{U}}^{\ell}] (99)

where we separated Ω^F\widehat{\Omega}_{F}, and 𝖴^\widehat{\mathsf{U}}^{\ell} for clarity. In the path integral formalism, we can express it either as twisted boundary conditions, or coupling to a background N{\mathbb{Z}}_{N} TQFT:

ZΩF,\displaystyle Z_{\Omega_{F},\ell} =z~(L1,x2)=Ω1F(x2)z~(0,x2)ω11(x2)𝒟z~exp(2g2[D(a)z~D(a))z~]iθ2π[da])\displaystyle=\int_{\tilde{z}(L_{1},x_{2})=\Omega_{1}^{F}(x_{2})\tilde{z}(0,x_{2})\omega_{1}^{-1}(x_{2})}{\mathcal{D}}\tilde{z}\;\exp{-\left({2\over g^{2}}\int[D(a)\tilde{z}^{\dagger}\wedge*D(a))\tilde{z}]-\mathrm{i}{\theta\over 2\pi}\int[\mathrm{d}a]\right)} (100)
=𝒟F(0)z(β)=z(0)𝒟zeiF(0)(NB(2)dB(1))\displaystyle=\int{\mathcal{D}}F^{(0)}\;\int_{z(\beta)=z(0)}{\mathcal{D}}z\;\;\;\mathrm{e}^{\mathrm{i}\int F^{(0)}\wedge(NB^{(2)}-\mathrm{d}B^{(1)})} (102)
×exp(2g2[D(a,A~(1))zD(a,A~(1))z]iθ2π[daB(2)])\displaystyle\times\exp-\left({2\over g^{2}}\int[D(a,\widetilde{A}^{(1)})z^{\dagger}\wedge*D(a,\widetilde{A}^{(1)})z]-\mathrm{i}{\theta\over 2\pi}\int[\mathrm{d}a-B^{(2)}]\right) (103)

where \ell is ’t Hooft flux, and we set discrete theta angle to p=0p=0.

The topological charge in N1\mathbb{CP}^{N-1} model with ΩF\Omega_{F} twist is fractionally quantized W=1NW=\frac{1}{N}{\mathbb{Z}} for two, at first sight, seemingly different reasons. These are

Q=12πtbc𝑑a\displaystyle Q=\frac{1}{2\pi}\int_{\rm tbc}da =μi+1μi2π|μi=2πN(i1)=1N\displaystyle=\frac{\mu_{i+1}-\mu_{i}}{2\pi}\Big{|}_{\mu_{i}=\frac{2\pi}{N}(i-1)}=\frac{1}{N} (104)
=1N|=1=1N\displaystyle=\frac{1}{N}\ell\Big{|}_{\ell=1}=\frac{1}{N} (105)

The first equality is the property of N{\mathbb{Z}}_{N} symmetric (up to permutation) background. It does not need the insertion of ’t Hooft flux, but without that insertion, it seems to be deformable from 1N\frac{1}{N}{\mathbb{Z}} topological charge. In what sense, is it robust then? The point is, if we wish to preserve a mixed anomaly that exists on 2{\mathbb{R}}^{2} between PSU(N)PSU(N) and 𝖢\mathsf{C}, this particular background is the unique one which achieves thatTanizaki:2017qhf . The boundary condition (91) remains invariant under the intertwined combination of a center transformation along with N{\mathbb{Z}}_{N} cyclic permutation of fields, zizi+1z_{i}\rightarrow z_{i+1} Cherman:2017tey .

The second equality is telling us that in the ’t Hooft flux background, topological charge is 1N\frac{1}{N} quantized, regardless of what the value of ΩF\Omega_{F} is. Both of these statements are independent of coupling, and also persist at strongly coupled regime on arbitrary size T2T^{2}. Next, we exhibit that this is the case in two examples, one is large-NN limit of N1\mathbb{CP}^{N-1} and the other is supersymmetric version.

Reminder of tunnelings in N1\mathbb{CP}^{N-1} model: Here, we would like to detail few more information about tunneling events in N1\mathbb{CP}^{N-1} model with ΩF\Omega_{F} twisted background on small 1×S1{\mathbb{R}}^{1}\times S^{1} that we will use later. Ref. Dunne:2012ae also has an obvious reinterpretation on 2=+×S1{\mathbb{R}}^{2}={\mathbb{R}}^{+}\times S^{1} with ΩF\Omega_{F} and B(2)B^{(2)} flux background. In these backgrounds, the classical minima of the model are

z(|x|)=𝒆a,𝒆a=(,0,1ath,0,),a=1,,N\displaystyle z(|x|\rightarrow\infty)=\bm{e}_{a},\qquad{\bm{e}}_{a}=\Big{(}\ldots,0,\underbrace{1}_{a^{\rm th}},0,\ldots\Big{)},\qquad a=1,\ldots,N (106)

Let τ\tau denotes coordinate on SL1S^{1}_{L} where we take LL\rightarrow\infty. Then, the minimal tunneling events correspond to

Δz=z(τ=L)z(τ=0)=𝒆a𝒆a+1=𝜶a,a=1,,N\displaystyle\Delta z=z(\tau=L)-z(\tau=0)=\bm{e}_{a}-\bm{e}_{a+1}=\bm{\alpha}_{a},\qquad a=1,\ldots,N (107)

where 𝒆N+1𝒆1\bm{e}_{N+1}\equiv\bm{e}_{1} and 𝜶N=a=1N1𝜶a\bm{\alpha}_{N}=-\sum_{a=1}^{N-1}\bm{\alpha}_{a} is the affine root of SU(N)SU(N) Lie algebra. These fractional instanton events a{\cal I}_{a} have topological charge W=1/NW=1/N and action S=SIN=4πg2NS=\frac{S_{I}}{N}=\frac{4\pi}{g^{2}N} and are the semi-classical ingredients of our analysis. The fugacity associated with these events are

ae4πg2(μ)N+iθN=Λμ1\displaystyle{\cal I}_{a}\sim\mathrm{e}^{-\frac{4\pi}{g^{2}(\mu)N}+\mathrm{i}\frac{\theta}{N}}=\Lambda\mu^{-1} (108)

These are minimal tunneling events associated with 𝜶a\bm{\alpha}_{a}. More generally, the set of tunneling events obeying BPS-type equation are N2NN^{2}-N types, and associated with charges

𝜶ab=𝒆a𝒆b\displaystyle\bm{\alpha}_{ab}=\bm{e}_{a}-\bm{e}_{b} (109)

with topological charge and action

W=(ab)/N,S=|ab|SIN\displaystyle W=(a-b)/N,\qquad S=|a-b|\frac{S_{I}}{N} (110)

The usual instanton in 2d Polyakov:1975yp corresponds to =a=1Na{\cal I}=\prod_{a=1}^{N}{\cal I}_{a} with charge 𝜶1++𝜶N=0\bm{\alpha}_{1}+\ldots+\bm{\alpha}_{N}=0 configurations. We can also construct infinite towers of the fractional instantons with charge 𝜶a\bm{\alpha}_{a} by merging it with 2d instantons, a()q,q{\cal I}_{a}({\cal I})^{q},\;q\in{\mathbb{Z}}.

As we emphasized, there are also important bion configurations, minimal versions of which are [a¯a+1][{\cal I}_{a}\overline{\cal I}_{a+1}]. These have ”Coulomb” charges 𝜶a𝜶a+1\bm{\alpha}_{a}-{\bm{\alpha}}_{a+1}, topological charge W=0W=0, and action S=2SI/NS=2S_{I}/N. There are also neutral bions [a¯a]±[{\cal I}_{a}\overline{\cal I}_{a}]_{\pm} which have zero ”Coulomb” and topological charge, and S=2SI/NS=2S_{I}/N action.

3.4 Interpolation between weak and strong coupling holonomy potentials

Small LL holonomy potential: In the N1\mathbb{CP}^{N-1} model, the gauge field aa does not have a kinetic term in the the UV-Lagrangian, and is not an independent degree of freedom. It is related to the dynamical fields z(x1,x2)z(x_{1},x_{2}) through equations of motions,

a=iz¯dz\displaystyle a=\mathrm{i}\overline{z}\mathrm{d}z (111)

In the small LΛ1L\Lambda\lesssim 1 regime, we can write down the potential for the holonomy field eiS1a\mathrm{e}^{\mathrm{i}\oint_{S^{1}}a}, both with and without the insertion of ΩF\Omega_{F}. They exhibit a striking difference:

Vthermal(a)\displaystyle V_{\rm thermal}(a) =2NπL2n=11n2cos(aLn)2NπL2cos(aL)\displaystyle=-\frac{2N}{\pi L^{2}}\sum_{n=1}^{\infty}\frac{1}{n^{2}}\cos(aLn)\sim-\frac{2N}{\pi L^{2}}\cos(aL) (112)
VΩF(a)\displaystyle V_{\rm\Omega_{F}}(a) =1πL2n=11n2(tr(ΩFn)eiaLn+c.c.)\displaystyle=-\frac{1}{\pi L^{2}}\sum_{n=1}^{\infty}\frac{1}{n^{2}}\left(\text{tr}(\Omega_{F}^{n})\mathrm{e}^{\mathrm{i}aLn}+{\rm c.c.}\right) (113)
=2πL21Nn=11n2cos(NaLn)2πL21Ncos(NaL)\displaystyle=-\frac{2}{\pi L^{2}}\frac{1}{N}\sum_{n=1}^{\infty}\frac{1}{n^{2}}\cos{(NaLn)}\sim-\frac{2}{\pi L^{2}}\frac{1}{N}\cos(NaL) (114)

where aLaL is periodic by 2π2\pi. In thermal case, there is a unique minimum in the fundamental domain, and a potential barrier as aa+2πa\rightarrow a+2\pi corresponding precisely to tunneling events with W=1W=1.

In the presence of the ΩF\Omega_{F} twist, we obtain a potential cos(NLa)\sim\cos(NLa) just like our TNT_{N} quantum mechanics, which has NN-minima at La=2πNj,j=0,1,,N1La=\frac{2\pi}{N}j,\;j=0,1,\ldots,N-1. The interpolation between these configurations correspond to W=1/NW=1/N fractional instanton events.

It is important to note that, since aa is not an independent degree of freedom, these minima are some configurations of zz-field, and tunneling is between zz field configurations, which translates to aa through (111). The profiles of these fields are shown in Fig.5 borrowed from Dunne:2012ae .

Note that the ΩF\Omega_{F} twist is an SU(N)SU(N) background. We can also turn an SU(N)/NSU(N)/{\mathbb{Z}}_{N} background by coupling to B(2)=A1d(1)L1dx2B^{(2)}=A^{(1)}_{\rm 1d}\wedge L^{-1}\mathrm{d}x_{2}. Then \ell units of B(2)B^{(2)} flux amounts to considering tr[eL1HΩ^F𝖴]\text{tr}[\mathrm{e}^{-L_{1}H}\widehat{\Omega}_{F}\mathsf{U}^{\ell}] which strictly enforces that a transition between jj and j+j+\ell takes place with topological charge W=NW=\frac{\ell}{N} and action S=N4πg2S=\frac{\ell}{N}\frac{4\pi}{g^{2}}.

Large LL holonomy potential: Now, let us consider large-LL limit, where the asymptotically free N1\mathbb{CP}^{N-1} coupling g2(μ)g^{2}(\mu) necessarily becomes large at distances |x|Λ1|x|\geq\Lambda^{-1}. A useful handle in this regime is to consider large-NN limit. At large-NN, the theory is solvable Polyakov:1987ez . It can be shown to acquire a mass gap dynamically,

M=Λ=μe4π/Ng2(μ)\displaystyle M=\Lambda=\mu\,e^{-4\pi/Ng^{2}(\mu)} (115)

which very much looks like a fractional instanton effect. But the large-NN solution is very shy about giving us any insights concerning the microscopic origins of mass.

The saddle of the large-NN solution on 2{\mathbb{R}}^{2} is a=0,λ=M2a=0,\lambda=M^{2}, where λ\lambda is Lagrange multiplier that appears in the action as λ(z¯z1)\int\lambda(\overline{z}z-1) imposing the constraint |z|2=1|z|^{2}=1. A remarkable fact about N1\mathbb{CP}^{N-1} model is large-NN volume independence in the ΩF\Omega_{F} twisted background. The large-NN saddle point equation on 2{\mathbb{R}}^{2} may be derived precisely at any size ×S1{\mathbb{R}}\times S^{1} if and only if one also takes the background ΩF\Omega_{F} Sulejmanpasic:2016llc ; Dunne:2012ae .

At large-NN, we may say that zz particles become massive. Ref. Witten:1978bc showed that these particles generate a kinetic term for the gauge field at one-loop level on 2{\mathbb{R}}^{2}. The effective Lagrangian is

Leff=1e2(da)2+|Dz|2+M2|z|2,L_{\text{eff}}=\frac{1}{e^{2}}(\mathrm{d}a)^{2}+|Dz|^{2}+M^{2}|z|^{2}, (116)

where e2=48πM2Ne^{2}=\frac{48\pi M^{2}}{N}, and the effective theory is just NN- component scalar QED. Consider this theory on ×SL1{\mathbb{R}}\times S^{1}_{L}. At low temperature (L)(L\rightarrow\infty) for thermal theory, and ΩF\Omega_{F}-twisted (non-thermal) theory, we obtain the holonomy potentials as

Vthermal(a,M)\displaystyle V_{\rm thermal}(a,M) =2NπL2n=1fnn2cos(aLn)2NL2(ML2π)1/2eMLcos(aL)\displaystyle=-\frac{2N}{\pi L^{2}}\sum_{n=1}^{\infty}\frac{f_{n}}{n^{2}}\cos(aLn)\approx-\frac{2N}{L^{2}}\left(\frac{ML}{2\pi}\right)^{1/2}\mathrm{e}^{-ML}\cos(aL) (117)
VΩF(a,M)\displaystyle V_{\rm\Omega_{F}}(a,M) =1πL2n=1fnn2(tr(ΩFn)eiaLn+c.c.)\displaystyle=-\frac{1}{\pi L^{2}}\sum_{n=1}^{\infty}\frac{f_{n}}{n^{2}}\left(\text{tr}(\Omega_{F}^{n})\mathrm{e}^{\mathrm{i}aLn}+{\rm c.c.}\right) (118)
=2πL21Nn=1fNnn2cos(NaLn)2NL2(MLN2π)1/2eMLNcos(NaL)\displaystyle=-\frac{2}{\pi L^{2}}\frac{1}{N}\sum_{n=1}^{\infty}\frac{f_{Nn}}{n^{2}}\cos{(NaLn)}\approx-\frac{2}{NL^{2}}\left(\frac{MLN}{2\pi}\right)^{1/2}\mathrm{e}^{-MLN}\cos(NaL)\qquad\qquad (119)

where fn=(nLM)K1(LMn)f_{n}=(nLM)K_{1}(LMn) where K1K_{1} is the modified Bessel function of the first kind, and in the simplified form, we used asymptotic behavior K1(z)π2zez,z.K_{1}(z)\sim\sqrt{\frac{\pi}{2z}}e^{-z},z\rightarrow\infty.

The untwisted case is derived by Affleck, who showed the existence of a zero temperature (L=)(L=\infty) phase transition, as can be seen by the non-commutativity of LL\rightarrow\infty, NN\rightarrow\infty limits. In the ΩF\Omega_{F}-twisted case, Sulejmanpasic showed the commutativity of the two limits Sulejmanpasic:2016llc . In the ΩF\Omega_{F} twisted case, we again end up with our TNT_{N} model, with NN inequivalent minima, and instantons again split up to fractional instantons with winding number W=1/NW=1/N.

The whole discussion in the LΛ1L\Lambda\lesssim 1 domain and LMN1LMN\gg 1 domains are extremely similar. Note that if we interpret the instanton in Vthermal(a)V_{\rm thermal}(a) in (119) as quantum instantons in the effective theory according to Affleck with SN1MLeML/2S\sim N\frac{1}{ML}e^{-ML/2}, we should view the instantons in the VΩF(a)V_{\rm\Omega_{F}}(a) as fractional quantum instantons with action S1MLeMLNS\sim\frac{1}{ML}\mathrm{e}^{-MLN}. However, neither detail is actually too important for our purpose. The aa-field in this description is to a certain extend a place holder, because e2M2Ne^{2}\sim\frac{M^{2}}{N} and electric coupling tend to zero in the large-N limit. Therefore, aa is actually a mnemonic of what is going on in the world of zz-field through the relation (111).

The main point is, on a spacetime manifold 𝐓𝟐{\bf T^{2}}, viewed as finite-volume regularization of 2{\mathbb{R}}^{2}, (93) tells us that there are configurations with fractional winding number 1N\frac{1}{N}{\mathbb{Z}} and action S=4πg2NS=\frac{4\pi}{g^{2}N}, for which aa lives in the U(1)/NU(1)/{\mathbb{Z}}_{N} bundle. To see the existence of these configurations without the insertion of (B(2),B(1))(B^{(2)},B^{(1)}) flux, by just using the SU(N)SU(N) twist is also possible, and already shown in Dunne:2012ae in semi-classical domain. However, (B(2),B(1))(B^{(2)},B^{(1)}) flux tells us that these configurations are there both at weak or strong coupling with certainty. This is just a statment concerning field configuration topology. As we have shown in TNT_{N} quantum mechanics, we can use these configurations to built fractional action and integer winding configurations which live in the U(1)U(1) bundle. Next, we show that a description based on this idea is actually taking place in 2d softly broken 𝒩=(2,2){\cal N}=(2,2) N1\mathbb{CP}^{N-1} model.

3.5 𝒩=(2,2){\cal N}=(2,2) N1\mathbb{CP}^{N-1} model on 2{\mathbb{R}}^{2} with soft mass

Consider the 𝒩=(2,2){\cal N}=(2,2) supersymmetric N1\mathbb{CP}^{N-1} model. This model has a 2N{\mathbb{Z}}_{2N} chiral symmetry which is broken dynamically to 2{\mathbb{Z}}_{2} by the formation of fermion-bilinear condensate.

k|ψψ+|k=NΛei2πkN,k=0,1,,N1\langle k|\psi_{-}\psi_{+}|k\rangle=N\Lambda\,e^{i\frac{2\pi k}{N}},\qquad k=0,1,\ldots,N-1 (120)

leading to NN isolated vacua, consistent with the index IW=tr((1)F)=NI_{W}=\text{tr}((-1)^{F})=N Witten:1982df .

Consider adding a soft mass term to the Lagrangian, mψψ++h.c.m\psi_{-}\psi_{+}+{\rm h.c.}. In this case, the kthk^{\rm th} vacuum energy density is modified into

k=mψψ+k+c.c.=2mNΛcosθ+2πkN\displaystyle{\cal E}_{k}=-m\langle\psi_{-}\psi_{+}\rangle_{k}+{\rm c.c.}=-2mN\Lambda\cos\frac{\theta+2\pi k}{N} (121)

at leading order in mm. Here Λ=μeSI/N=μe4πg2(μ)N\Lambda=\mu\mathrm{e}^{-S_{I}/N}=\mu\mathrm{e}^{-\frac{4\pi}{g^{2}(\mu)N}} is the renormalization group invariant strong scale of the theory (at one-loop order), and μ\mu is the (Pauli-Villars) renormalization scale. The graded partition function for the mass deformed theory is

Z(m,θ)=tr[(1)FeLH],\displaystyle Z(m,\theta)=\text{tr}[(-1)^{F}\mathrm{e}^{-LH}]\;, (122)

and on a 2-manifold, it is modified as

Z(m,θ)=k=0N1e2mNΛV2cosθ+2πkN\displaystyle Z(m,\theta)=\sum_{k=0}^{N-1}\mathrm{e}^{2mN\Lambda V_{2}\cos\frac{\theta+2\pi k}{N}} (123)

where V2V_{2} is the volume of the 2-manifold and mm is small. (Assume V2V_{2} is large compared to strong length scale.) Based on our quantum mechanical example, we can rewrite the partition function as

Z(m,θ)\displaystyle Z(m,\theta) =NWn=0n¯=01n!1n¯!(mNμV2eSIN+iθN)n(mNμV2eSINiθN)n¯δnn¯WN,0\displaystyle=N\sum_{W\in{\mathbb{Z}}}\sum_{n=0}^{\infty}\sum_{\overline{n}=0}^{\infty}\frac{1}{n!}\frac{1}{\overline{n}!}\left(mN\mu V_{2}\mathrm{e}^{-\frac{S_{I}}{N}+\mathrm{i}\frac{\theta}{N}}\right)^{n}\left(mN\mu V_{2}\mathrm{e}^{-\frac{S_{I}}{N}-\mathrm{i}\frac{\theta}{N}}\right)^{\overline{n}}\delta_{n-\overline{n}-WN,0} (124)
=NW[n¯=0(mNμV2eSIN)2n¯+WNn¯!(n¯+WN)!]eiWθ\displaystyle=N\sum_{W\in{\mathbb{Z}}}\left[\sum_{\overline{n}=0}^{\infty}\frac{(mN\mu V_{2}\mathrm{e}^{-\frac{S_{I}}{N}})^{2\overline{n}+WN}}{{\overline{n}}!(\overline{n}+WN)!}\right]\mathrm{e}^{\mathrm{i}W\theta} (125)
=NW[INW(2mNμV2eSIN)]eiWθ(strongcoupling,noB(2)flux)\displaystyle=N\sum_{W\in{\mathbb{Z}}}\left[I_{NW}(2mN\mu V_{2}\mathrm{e}^{-\frac{S_{I}}{N}})\right]\mathrm{e}^{\mathrm{i}W\theta}\qquad({\rm strong\;coupling,\;no\;}B^{(2)}{\;\rm flux}) (126)

The interpretation of this formula in the light of our discussion of N1\mathbb{CP}^{N-1} model is following.

  • Terms in the sum are sourced by the solution of self-duality equation in the N1\mathbb{CP}^{N-1} theory with PSU(N)PSU(N) background. However, these cannot directly contribute to the partition function as they live in a part of PSU(N)PSU(N) bundle that cannot be lifted to SU(N)SU(N) bundle. Happily, the constraint tells us that they do not.

  • The constraint δnn¯WN,0\delta_{n-\overline{n}-WN,0} guarantees that non-BPS configurations in the PSU(N)PSU(N) bundle that can be uplifted to SU(N)SU(N) bundle contribute to the sum. The sum consistently reduces to a sum over integer topological charge configurations WW\in{\mathbb{Z}}, but there are fractional action configurations S=2NSIS=\frac{2}{N}S_{I} contributing to it.

  • In the massless theory, the chiral symmetry is 2N{\mathbb{Z}}_{2N}. In the PSU(N)PSU(N) background, each of these objects carries 22 zero modes. This is not inconsistent with ABJ anomaly thanks to the constraint δnn¯WN,0\delta_{n-\overline{n}-WN,0}. The 22 zero modes is also a natural reflection of mixed anomaly between PSU(N)PSU(N) and 2N{\mathbb{Z}}_{2N}. If we were to gauge PSU(N)PSU(N), we would indeed end up with 22 zero modes with each of these configurations. These zero modes are lifted by the insertion of mass term mm.

  • The solutions in the PSU(N)PSU(N) background have 2 bosonic zero modes, and altogether, they should have 2N2N bosonic zero modes as a 2d2d BPST instanton. V2V_{2} may be viewed as the volume of the bosonic moduli space. μ\mu is Pauli-Villars regularization scale. It appears with the power μnbnf2=μ1\mu^{n_{b}-\frac{n_{f}}{2}}=\mu^{1} where nb=2,nf=2n_{b}=2,n_{f}=2 are the numbers of bosonic and fermionic moduli.

  • The factor NN in the argument and order of Bessel function arises because there are NN different types of fractional instantons in the PSU(N)PSU(N) background. Note that at m=0m=0, supersymmetric point, modified Bessel obeys Ij(0)=δj,0I_{j}(0)=\delta_{j,0} and Z(m=0,θ)=NZ(m=0,\theta)=N which is just supersymmetric index Witten:1982df . At m=0m=0, the only sector contributing to graded partition function is the zero topological charge sector.

Again, we reach to the same conclusion in 𝒩=(2,2){\cal N}=(2,2) N1\mathbb{CP}^{N-1}, as in the bosonic model. Even when we consider N1\mathbb{CP}^{N-1} model without B(2)B^{(2)} flux, we should still consider the field space in the U(1)/NU(1)/{\mathbb{Z}}_{N} bundle, where both topological charge and action are fractional, and sum up over configurations that can be lifted to U(1)U(1) bundle. This tells us that the non-perturbative expansion parameter in the theory is eSI/N\mathrm{e}^{-S_{I}/N} which is exponentially more important than 2d instantons Polyakov:1975yp .

Topological susceptibility and how to rescue naive instanton analysis: Despite the fact that (126) is a sum over WW\in{\mathbb{Z}} configurations, it leads to to topological susceptibility that is in qualitative agreement with the purely bosonic theory

χtop.soft=2θ2|θ=0=2mΛNvs.χtop.bos.=3Λ2πN\displaystyle\chi_{\rm top.}^{\rm soft}=\frac{\partial^{2}{\cal E}}{\partial\theta^{2}}\Big{|}_{\theta=0}=\frac{2m\Lambda}{N}\qquad{\rm vs.}\qquad\chi_{\rm top.}^{\rm bos.}=\frac{3\Lambda^{2}}{\pi N} (127)

The result in bosonic theory is based on large-NN solution (see DAdda:1978vbw , and Marino:2015yie for a pedagogical description) and lattice simulations Vicari:2008jw . We expect, once, mΛm\geq\Lambda, χtop.soft\chi_{\rm top.}^{\rm soft} to saturate to χtop.bos.\chi_{\rm top.}^{\rm bos.}. The important things is both are of order 1/N1/N as they should, as opposed to eO(N)\mathrm{e}^{-O(N)} as predicted by naive instanton analysis.

In the standard instanton analysis in 2d that we learn from textbooks, one is instructed to sum over all integer topological charge WW\in{\mathbb{Z}} configurations with action S=|W|SIS=|W|S_{I} Coleman198802 ; Schafer:1996wv :

Zins.gas(θ)\displaystyle Z_{\rm ins.\;gas}(\theta) =n=0n¯=01n!1n¯!(V2KeSI+iθ)n(V2KeSIiθ)n¯\displaystyle=\sum_{n=0}^{\infty}\sum_{\overline{n}=0}^{\infty}\frac{1}{n!}\frac{1}{\overline{n}!}\left(V_{2}K\mathrm{e}^{-S_{I}+\mathrm{i}\theta}\right)^{n}\left(V_{2}K\mathrm{e}^{-S_{I}-\mathrm{i}\theta}\right)^{\overline{n}} (128)
=WZ[n¯=01(n¯+W)!1n¯!(V2KeSI)2n¯+W]eiWθ\displaystyle=\sum_{W\in Z}\left[\sum_{\overline{n}=0}^{\infty}\frac{1}{(\overline{n}+W)!}\frac{1}{\overline{n}!}\left(V_{2}K\mathrm{e}^{-S_{I}}\right)^{2\overline{n}+W}\right]\mathrm{e}^{\mathrm{i}W\theta} (129)
=WZIW(V2KeSI)eiWθ\displaystyle=\sum_{W\in Z}I_{W}\left(V_{2}K\mathrm{e}^{-S_{I}}\right)\mathrm{e}^{\mathrm{i}W\theta} (130)
=eV2KeSIcosθ\displaystyle=\mathrm{e}^{V_{2}K\mathrm{e}^{-S_{I}}\cos\theta} (131)

leading to vacuum energy density and topological susceptibility

(θ)KeSIcosθKeO(N)cosθ,χtop.naiveeO(N)\displaystyle{\cal E}(\theta)\sim-K\mathrm{e}^{-S_{I}}\cos\theta\sim-K\mathrm{e}^{-O(N)}\cos\theta,\qquad\;\;\chi_{\rm top.}^{\rm naive}\sim\mathrm{e}^{-O(N)} (132)

which are in clear contradiction with the correct result (127).

In principle, in both (131) and (126), we are summing over only WW\in{\mathbb{Z}} configurations. What went so wrong in the naive instanton analysis? Of course, the answer is clear. In (126), we used the configurations with fractional action and topological charge, but imposed a global constraint on topological charge which restricts the sum to WW\in{\mathbb{Z}}. This is equivalent to the statement that we used saddles in the U(1)/NU(1)/{\mathbb{Z}}_{N} bundle thanks to TQFT coupling and then, lifted the configurations that obey (126) constraint to the U(1)U(1) theory. In this way, we obtain the θ\theta angle dependence and topological susceptibility correctly. This settles the issue raised in DAdda:1978vbw concerning the role of fractional instantons in determination of topological susceptibility.

4 Continuation of N1\mathbb{CP}^{N-1} semiclassics to strong coupling

The discussion of this section is the heart of our idea of connecting weakly coupled calculable domain to strongly coupled domain. Hence, it is important to summarize what we did so far for N1\mathbb{CP}^{N-1}, and state what we will do next.

1) We showed the existence of smooth field configurations with fractional W=1/NW=1/N topological charge in the presence of background gauge fields for global symmetries. Their existence follows from the usual Bogomolny argument showing they have minimum action proportional to their (fractional) topological charge.

2) The backgrounds in which you see this fractionalization are topological in the sense that they are related to background gauge fields for discrete N\mathbb{Z}_{N} symmetries. One can also gauge the discrete symmetry in question by summing over their background fields weighted by a topological phase called discrete theta angle which effectively projects out all but one fractional topological charge sector, and so gives the same information.

3) There are then configurations which are approximately the sum of many widely-separated fractional instantons (ie, governed by “saddle points at infinity”) in the semi-classical domain Behtash:2018voa . Below, we will present the strong coupling realization of this concept.

4) These configurations can then contribute to the path integral with fractional-instanton action even if background and boundary conditions only allow integer topological charge.

5) It is a matter of dynamics whether these fractional-instanton-action configurations do or do not contribute in an important way to the path integral. But analytic continuation from weak coupling semiclassical expansion (familiar from resurgent transseries expansion Dunne:2012ae ) suggests that at strong coupling there might be an effective description in terms of fractional-instanton local fields (ie, fields which create local excitations with the quantum numbers of the fractional-instanton bosonic and fermionic zero modes). This is familiar from Polyakov model Polyakov:1975rs ; Polyakov:1987ez and deformed Yang-Mills theory on 3×S1{\mathbb{R}}^{3}\times S^{1} Unsal:2008ch ; Unsal:2007jx where dual theory is constructed based on monopole-instantons. However, although this information is extremely crucial for us, this is not the approach we will pursue. 101010In particular, weak coupling semi-classical expansions is always hierarchical. On the other hand, strong coupling descriptions are egalitarian. We will comment on this, in retrospect, after our construction is over. Hopefully we will do something more powerful.

6) In Refs.Berg:1979uq ; Fateev:1979dc , Fateev, Frolov, Schwarz, and Berg, Lüscher, considered multi-instantons in 1\mathbb{CP}^{1} on 2{\mathbb{R}}^{2}, and parametrized the moduli space of nn-instanton, which is 4n4n dimensional, as 2n2n complex coordinates. Gross uses this parametrization earlier in Gross:1977wu to show that the interaction between the instanton and anti-instanton is a dipole-dipole interaction in 2d2d, where each dipole is as if it is composed of vortex-anti-vortex pair. There is no explicit physical splitting in this parametrization, (let us call it X-parametrization for brevity where X is short-hand for position moduli in terms of fractional vortex instantons) for example, a single instanton is still a single lump and not two lumps! The magical thing of this parametrization is that the determinant of the fluctuation operator can be calculated exactly, and muti-instanton gas behaves as if it is a Coulomb gas of vortices interacting each other depending on their charges and position moduli. We will give a review of these works, and close its shortcomings. Of course, one of our main motivation is the consistency between fractional quantization of action and topological charge induced by TQFT coupling, fractional quantization that emerges in weak coupling resurgent expansion and the analysis of determinant of the fluctuation operator in X-parametrization admitting an interpretation as if the theory on 2{\mathbb{R}}^{2} can be described in terms of these fractional defects. Our goal is to make this correspondence precise.

7) Inspired from the analysis of Refs.Berg:1979uq ; Fateev:1979dc , consistency in weak coupling semi-classical limits and resurgent expansions, matching of the global symmetries and mixed ’t Hooft anomalies, and by the consistency (stability) under RG flow of these descriptions, we propose

  • massive NN-flavor Schwinger = Abelian bosonization = mass-deformed SU(N)1SU(N)_{1} WZW

models as low-energy description of asymptotically free N1\mathbb{CP}^{N-1} model on 2{\mathbb{R}}^{2}.

Remarks: The fractional vortex instantons in all cases are classified with the roots of SU(N)SU(N) algebra. In semi-classical limits in each case, the elementary tunneling events are given by:

Δaffine(1)={𝜶1,,𝜶N}\displaystyle\Delta_{\rm affine}^{(1)}=\left\{\bm{\alpha}_{1},\ldots,\bm{\alpha}_{N}\right\} (133)
𝜶aΓrofSU(N),\displaystyle{\bm{\alpha}}_{a}\in\Gamma_{r}\;{\rm of}\;SU(N), =a=1NV~a\displaystyle\qquad{\cal I}=\prod_{a=1}^{N}\widetilde{V}_{a} (134)

where 𝜶N=a=1N1𝜶a\bm{\alpha}_{N}=-\sum_{a=1}^{N-1}\bm{\alpha}_{a} is the affine root. We can in principle write a dilute gas based on these charges, and also include their correlated events such as [V~aV~b][\widetilde{V}_{a}{\widetilde{V}}_{b}] or [V~aV~¯b][\widetilde{V}_{a}\overline{\widetilde{V}}_{b}] etc, but instead we do something more powerful. We write theories for which all orders non-perturbative expansions are the same, according to our understanding of resurgence. We can call the operators V~a\widetilde{V}_{a} associated with tunneling the dual vertex operators.

The action of the abelian bosonization of Schwinger model will be written in terms of vertex operators (not dual vertex operators), Va{V}_{a}, associated with the 𝝂aΓw{\bm{\nu}}_{a}\in\Gamma_{w} weights of SU(N)SU(N) Lie algebra. As asserted, tunnelings in the abelian bosonized description are associated with the 𝜶aΓr{\bm{\alpha}}_{a}\in\Gamma_{r}. In particular, abelian bosonization, its non-abelian version, and massive Schwinger model are not effective theories describing proliferations of fractional vortex instantons. The fractioal vortex instantons in these theories are same as the one in N1\mathbb{CP}^{N-1} model in weak coupling limit.

There is a necessary matching of scales between different theories. For example, the non-perturbative expansion parameter in N1\mathbb{CP}^{N-1} is eSI/N\mathrm{e}^{-S_{I}/N}, which controls the fugacity of vortex instantons with charges 𝜶aΓr\bm{\alpha}_{a}\in\Gamma_{r} in semi-classical domain. This maps to mass of the fermion in the Schwinger model, mψeSI/Nm_{\psi}\sim\mathrm{e}^{-S_{I}/N}, which in turn becomes mass of the rebosonized field in bosonization construction, or mass deformation of the WZW model.

The above description is self-consistent, because the mass gap cuts off the IR divergence in the fractional-instanton size modulus, so justifies being analytically connected to a dilute fractional-instanton gas (ie, semiclassical) picture. The renormalization group analysis is independent of θ\theta-angle. In particular, it gives a prediction of gaplessness for 1\mathbb{CP}^{1} at θ=π\theta=\pi as we show in detail. As a result, gap, gaplessness and the θ\theta-angle dependence of mass gap that we find in our formalism are reliable qualitative predictions.

4.1 Fateev, Frolov, Schwarz, and Berg, Lüscher, in retrospect

In 1979, Refs.Berg:1979uq ; Fateev:1979dc came up with a very clever construction in the 1\mathbb{CP}^{1} model, though as they point out themselves, their method has some issues. (See Polyakov’s textbook for an assessment Polyakov:1987ez .) Our construction partially justifies their work on rigorous grounds, and improves it. We feel it is extremely important to understand this discussion in order to perform a similar analysis in gauge theories on 4{\mathbb{R}}^{4}. So, below, we will explain and improve the ideas of Refs.Berg:1979uq ; Fateev:1979dc ; Polyakov:1987ez and bring our own perspective.

Let us outline Refs.Berg:1979uq ; Fateev:1979dc from our perspective. They consider a qq-instanton configurations in 1\mathbb{CP}^{1} model. For q=1q=1, an instanton on 2{\mathbb{R}}^{2} has 4 bosonic zero modes, which are usually parametrized as as 2 position moduli for the center of instanton aca^{\rm c}, one size moduli ρ\rho, and a U(1)U(1) angular moduli ϕ\phi. The qq multi-instanton solution in 1\mathbb{CP}^{1} model can be written in terms of a holomorphic function

w(𝗓)=i=1q𝗓ai1𝗓ai2\displaystyle w(\mathsf{z})=\prod_{i=1}^{q}\frac{\mathsf{z}-a^{1}_{i}}{\mathsf{z}-a^{2}_{i}} (135)

where

{a11,,aq1},{a12,,aq2}\displaystyle\{a^{1}_{1},\ldots,a^{1}_{q}\},\qquad\{a^{2}_{1},\ldots,a^{2}_{q}\} (136)

are just 4q4q complex moduli. We will refer to this as X-parametrization, for short. It turns out that this is a very useful parametrization for the combination of the following two reasons. reasons that will become clear.

1) Despite the fact that, what one may think at first sight, the exact solution for instanton (q=1q=1) does not correspond to physical splitting of an instanton into two distinct lumps. If one inspects the topological charge density (integrand in (137)) for the q=1q=1 solution, it is clear that this is a one-lump configuration:

Q=1πd2x|a1a2|2(|𝗓12((a1+a2)|2+|a1a2|2)2=1πd2xρ2(|𝗓ac|2+ρ2)2=1\displaystyle Q=\frac{1}{\pi}\int{d^{2}x}\frac{|a^{1}-a^{2}|^{2}}{(|\mathsf{z}-{\textstyle{\frac{1}{2}}}((a^{1}+a^{2})|^{2}+|a^{1}-a^{2}|^{2})^{2}}=\frac{1}{\pi}\int{d^{2}x}\frac{\rho^{2}}{(|\mathsf{z}-a^{\rm c}|^{2}+\rho^{2})^{2}}=1 (137)

centered at aca^{\rm c}, with size moduli ρ\rho, and not two lumps located at a1a^{1} and a2a^{2}:

ac=12(a1+a2),ar=ρeiϕ=(a1a2)\displaystyle a^{\rm c}={\textstyle{\frac{1}{2}}}(a^{1}+a^{2}),\qquad a^{\rm r}=\rho\mathrm{e}^{\mathrm{i}\phi}=(a^{1}-a^{2}) (138)

Compare this with the configurations found on ×S1{\mathbb{R}}\times S^{1} in the ΩF\Omega_{F} background. There, when the size moduli ρ\rho is smaller than the length scale set by ΩF\Omega_{F}, there is a single smooth lump. When the size moduli ρ\rho is larger than the scale set by ΩF\Omega_{F}, the instanton fractionates to NN distinct smooth lumps. See Fig. 5 and Ref.Dunne:2012ae .

2) Despite the fact that q=1q=1 configuration in (135) is a single lump, when the quantum fluctuation determinant around multi-instanton configuration is computed exactly at one-loop order, it admits an elegant interpretation in terms of a Coulomb gas in 2d, as if each instanton is a dipole of two fractional vortex instantons. The partition function (counting only instantons) can be written as Berg:1979uq ; Fateev:1979dc ; Polyakov:1987ez :

Zinst\displaystyle Z_{\rm inst} =q=0(KeSI/2)2q(q!)2j=1qd2aj1d2aj2\displaystyle=\sum_{q=0}^{\infty}\frac{(K\mathrm{e}^{-S_{I}/2})^{2q}}{(q!)^{2}}\int\prod_{j=1}^{q}d^{2}a^{1}_{j}\;d^{2}a^{2}_{j} (139)
×exp[i<jlog|ai1aj1|2+i<jlog|ai2aj2|2i,jlog|ai1aj2|2]\displaystyle\times\exp\left[\sum_{i<j}\log|a^{1}_{i}-a^{1}_{j}|^{2}+\sum_{i<j}\log|a^{2}_{i}-a^{2}_{j}|^{2}-\sum_{i,j}\log|a^{1}_{i}-a^{2}_{j}|^{2}\right] (140)

This suggests the following picture: It is as if we are supposed to think of instantons {\cal I} as the composites of fractional vortex instantons on 2{\mathbb{R}}^{2}, V1{V}^{1}, and V2{V}^{2}. In some sense, the formalism presented in Refs.Berg:1979uq ; Fateev:1979dc can be viewed as “fractionalization without fractionalization”. One can formally write V1V2{\cal I}\sim{V}^{1}{V}^{2}, where we assign V1{V}^{1} Coulomb charge +1 , and V2{V}^{2} charge 1-1. We will learn that V2{V}^{2} cannot be interpreted as anti-vortex, because if we incorporate theta angle, both V1{V}^{1} and V2{V}^{2} acquires phases eiθ2\mathrm{e}^{\mathrm{i}\frac{\theta}{2}}.111111The sum which takes only instantons into account (without anti-instantons) is actually meaningful here, but not always meaningful. For example, in 3d Polyakov model, it would not be meaningful because one cannot impose global charge neutrality with only charge +1 monopoles. (Coulomb system with only + charges is not stable.) Here, one can guarantee global charge neutrality thanks to the fact that there are two types of fractional vortex instantons with opposite electric charges, and one can have charge neutrality without taking anti-instantons into account. On 3×S1{\mathbb{R}}^{3}\times S^{1} deformed Yang-Mills theory (unlike Polyakov model on 3{\mathbb{R}}^{3}), taking only monopole-instantons into account (without anti-monopoles) is also meaningful for the same reason. There are two kinds of monopoles with charges +1+1 and 1-1, and one does not need anti-monopoles to have global charge neutrality. In fact, the SU(N)SU(N) theory with only monopoles is integrable. It maps to complex affine Toda theory, which happens to have a real spectrum (at θ=0\theta=0) and can be viewed as a Hirota bilinear form. See Unsal:2008ch on these subtle issues. Once, one includes both monopoles and anti-monopoles, one obtains (225) that we discuss in the next section which is no longer integrable. There are actually V¯1\overline{V}^{1} and V¯2\overline{V}^{2} in the game, that are not included at this stage, but easy to incorporate (see below.) So, the system behaves as a Coulomb gas with ±\pm charges, with fugacities eSI/2e^{-S_{I}/2} for fractional vortex-instantons. The Coulomb systems in 2d have two phases, a gapless molecular phase and gapped plasma phase. 1\mathbb{CP}^{1} (at θ=0\theta=0) happens to be in the gapped plasma phase. The fugacity of the fractional instantons emerges as the inverse correlation length of the system, eSI/2Λ\mathrm{e}^{-S_{I}/2}\sim\Lambda, the strong scale of the theory. Now, interestingly, despite the fact that these arise from smooth instanton configurations with X-parametrization, the final form of the action coincide with the one obtained by Gross in Gross:1977wu obtained by using merons and intuition.

The instanton partition function ZinstZ_{\rm inst} can be reproduced by using a free massless Dirac field ψ=(ψLψR)\psi={\psi_{L}\choose\psi_{R}}, and bosonization formulas in 2d2d (see Sec. 32 of ZinnJustin:2002ru )

σ+(x)=ψ¯LψR(x),σ(x)=ψ¯RψL(x),ψ¯ψ=σ++σ\displaystyle\sigma_{+}(x)=\overline{\psi}_{L}\psi_{R}(x),\qquad\sigma_{-}(x)=\overline{\psi}_{R}\psi_{L}(x),\qquad\qquad\overline{\psi}\psi=\sigma_{+}+\sigma_{-} (141)

and observing that

σ+(a11)σ+(aq1)σ(a12)σ(ap2)=i<j|ai1aj1|2|ai2aj2|2i,j|ai1aj2|2δpqeVCoulombδpq\displaystyle\Big{\langle}\sigma_{+}(a^{1}_{1})\ldots\sigma_{+}(a^{1}_{q})\sigma_{-}(a^{2}_{1})\ldots\sigma_{-}(a^{2}_{p})\Big{\rangle}=\prod_{i<j}|a^{1}_{i}-a^{1}_{j}|^{2}|a^{2}_{i}-a^{2}_{j}|^{2}\prod_{i,j}|a^{1}_{i}-a^{2}_{j}|^{-2}\delta_{pq}\;\equiv\mathrm{e}^{-V_{\rm Coulomb}}\delta_{pq} (142)
=δpqexp[i<jlog|ai1aj1|2+i<jlog|ai2aj2|2i,jlog|ai1aj2|2]\displaystyle=\delta_{pq}\exp{\left[\sum_{i<j}\log|a^{1}_{i}-a^{1}_{j}|^{2}+\sum_{i<j}\log|a^{2}_{i}-a^{2}_{j}|^{2}-\sum_{i,j}\log|a^{1}_{i}-a^{2}_{j}|^{2}\right]} (143)

where \langle\cdot\rangle is evaluated by using massless Dirac action. The δpq\delta_{pq} is due to conservation of chirality. As a result, the instanton partition function (140) can be written as

Zinst=𝒟ψ¯𝒟ψexp[(ψ¯iγμμψ+KeSI2ψ¯ψ)]\displaystyle Z_{\rm inst}=\int{\mathcal{D}}\overline{\psi}{\mathcal{D}}\psi\;\;\exp\left[-\int\left(\overline{\psi}\mathrm{i}\gamma_{\mu}\partial_{\mu}\psi+K\mathrm{e}^{-{S_{I}\over 2}}\overline{\psi}\psi\right)\right] (144)

Despite the fact that the inclusion of anti-instantons is presented in Refs.Berg:1979uq ; Fateev:1979dc ; Polyakov:1987ez as a conceptual difficulty, it is actually a relatively simple matter now, especially after understanding the role of fractional instanton-antiinstantons in resurgence Dunne:2012ae , and we describe their inclusion below. There are few issues with the above formula.

  • The reliability (or at least self-consistency) of semi-classical expansion.

  • The fermionic theory (at least naively) has a global U(1)U(1) symmetry that does not match with the SU(2)/2SU(2)/{\mathbb{Z}}_{2} symmetry of the original 1\mathbb{CP}^{1} theory.121212Thanks to Aleksey Cherman for discussions about this point. Effective field theories must respect the global symmetries of microscopic theories. On the other hand, there is actually hope that this point may be fixable, because abelian bosonization (as opposed to non-abelian bosonization) makes some symmetries extremely non-obvious Coleman:1976uz . There is clearly something to be figured out.

  • If we incorporate θ\theta, it will not produce θ\theta angle physics correctly. But this is obvious, since we did not include anti-instantons.

What FFS-BL did not know and resurgence at help. The coupling a ΩF\Omega_{F}-twist and N{\mathbb{Z}}_{N} TQFT makes it manifest that the topological defects with fractional charge W=1/NW=1/N and action SI/NS_{I}/N exist both in the weak coupling semi-classical domain on ×S1{\mathbb{R}}\times S^{1} and on arbitrary T2T^{2} at strong coupling. Furthermore, unlike the past times Berg:1979uq ; Fateev:1979dc , we no longer feel uncomfortable about (fractional or not) instanton anti-instanton pairs because resurgence theory tells us what their effects are, and how they are incorporated correctly and cancel ambiguities in perturbation theory Dunne:2012ae . In particular, when we do semi-classics, we are forced (this is not a choice) to consider the complexification of field space. In the old days, it was thought that the instanton-antiinstanton configurations could not be distinguished from perturbative vacuum, and the quasi-zero mode integration between a fractional instanton-antiinstanton would be ill-defined. Now, we know two facts. Fractional instanton antiinstanton configurations are critical points at infinity (see around (7) for a reminder), and in order to calculate their contribution to path integral, we need to do path intergal over the Lefschetz thimble, which is meaningful Dunne:2012ae ; Behtash:2018voa . Lefschetz thimble integration at second order in semi-classic gives two types of contributions, uncorrelated and correlated pairs β22![a][¯a]+β[a¯a]±\frac{\beta^{2}}{2!}[{\cal I}_{a}][\overline{\cal I}_{a}]+\beta[{\cal I}_{a}\overline{\cal I}_{a}]_{\pm}. In other words, thimble integration is a sophisticated version of cluster expansion, which gives the effect of both uncorrelated pairs as well as correlated pairs. The effect of correlated pair is two-fold ambiguous Im[a¯a]±±e2SI/N\text{Im}[{\cal I}_{a}\overline{\cal I}_{a}]_{\pm}\sim\pm\mathrm{e}^{-2S_{I}/N}. But there is another ambiguity in the theory that comes upon lateral (left/right) Borel resummation of perturbation theory, which we denote as 𝔹±\mathbb{B}_{\pm}. Remarkable fact that comes from resurgent analysis of N1\mathbb{CP}^{N-1} is that

Im𝔹±+Im[a¯a]±=0\displaystyle\text{Im}\mathbb{B}_{\pm}+\text{Im}[{\cal I}_{a}\overline{\cal I}_{a}]_{\pm}=0 (145)

i.e., the imaginary ambiguities cancel Dunne:2012ae . This ambiguity cancellation is at order e2SI/NΛ2\mathrm{e}^{-2S_{I}/N}\sim\Lambda^{2} and is the semi-classical realization of ’t Hooft’s famous IR renormalon puzzle tHooft:1977xjm (see also Munster:1981zn ) in N1\mathbb{CP}^{N-1} model. It should be noted in the analysis below, we do not use resurgence, in fact, we do not even use second order correlated pairs in semi-classics except one occasion where it is strictly necessary to determine the IR-dynamics in 1\mathbb{CP}^{1} at θ=π\theta=\pi around (209). But the advantage that resurgence provides is clarity it provides in knowing what various non-BPS configurations are, and what the corresponding operators and their physical effects are, and we do not need to need all thanks to renormalization group. Therefore, in the light of these progress progress in resurgent semi-classical analysis, we feel encouraged to reconsider the FFS-BL analysis Berg:1979uq ; Fateev:1979dc on 2{\mathbb{R}}^{2}, and improve it.

4.2 Incorporating θ\theta and fractional anti-instantons into FFS-BL

Based on (140), we view the instanton on 2{\mathbb{R}}^{2} as a composite of the two fractional (vortex) instantons. But recall that the topological charge density or action density (137) are not a two lump configuration. They can be smoothly be split to two-lumps either using ΩF\Omega_{F} twist background, or in the background of B(2)B^{(2)} 2{\mathbb{Z}}_{2}-flux TQFT coupling. Both of these backgrounds introduce a scale and if the moduli parameter becomes larger than a certain scale set by the background, we see fractionalization explicitly. The remarkable fact about the Berg:1979uq ; Fateev:1979dc analysis is that it reveals something deep about the inner structure of 2d instanton via a parametrization and exact computation of the fluctuation determinant.

Due to its extreme importance in the story, we will also incorporate θ\theta angle dependence from now on. First, note that incorporating θ\theta angle in Berg:1979uq ; Fateev:1979dc modifies (144) as eSI2eSI2+iθ2\mathrm{e}^{-\frac{S_{I}}{2}}\rightarrow\mathrm{e}^{-\frac{S_{I}}{2}+\mathrm{i}\frac{\theta}{2}}. Expanding the exponentials, we can perform the path integral by using (143) which tells us that when the number of σ+\sigma_{+} and σ\sigma_{-} insertions are not equal, the integration vanishes due to excess chirality. As a result, the fractional vortex instanton sum (140) is modified into q=0()eiqθ\sum_{q=0}^{\infty}(\ldots)\mathrm{e}^{\mathrm{i}q\theta}, where q=0,1,2,q=0,1,2,\ldots which indicates that the sum is only over non-negative integer winding instantons (without anti-instantons). The inclusion of θ\theta modifies the action of Refs Berg:1979uq ; Fateev:1979dc into

S=ψ¯iγμμψ+KeSI2eiθ2ψ¯ψ\displaystyle S=\overline{\psi}\mathrm{i}\gamma_{\mu}\partial_{\mu}\psi+K\mathrm{e}^{-{S_{I}\over 2}}\mathrm{e}^{\mathrm{i}\frac{\theta}{2}}\overline{\psi}\psi (146)

which looks a bit awkward, because eiθ2\mathrm{e}^{\mathrm{i}\frac{\theta}{2}} appears in front of the Dirac mass term.

If we were to incorporate our sum based on proliferation of V1{V}_{1}, V¯1\overline{V}_{1}, (instead of V1{V}_{1}, V2{V}_{2} pair), the effective action would be replaced by ψ¯iγμμψ+eiθ2ψ¯LψR+eiθ2ψ¯RψL\overline{\psi}\mathrm{i}\gamma_{\mu}\partial_{\mu}\psi+\mathrm{e}^{\mathrm{i}\frac{\theta}{2}}\overline{\psi}_{\mathrm{L}}\psi_{\mathrm{R}}+\mathrm{e}^{-\mathrm{i}\frac{\theta}{2}}\overline{\psi}_{\mathrm{R}}\psi_{\mathrm{L}}. This can only describe topological charge Q=0Q=0 sector of the theory.

Let us know take into account both instantons and anti-instantons. We view V1V2{\cal I}\sim{V}_{1}{V}_{2} where Va{V}_{a} stands for fractional vortex instanton:

V1V2,V1eSI2+iθ2σ+,V2eSI2+iθ2σ\displaystyle{\cal I}\sim{V}_{1}{V}_{2},\qquad{V}_{1}\sim\mathrm{e}^{-\frac{S_{I}}{2}+\mathrm{i}\frac{\theta}{2}}\sigma_{+},\qquad{V}_{2}\sim\mathrm{e}^{-\frac{S_{I}}{2}+\mathrm{i}\frac{\theta}{2}}\sigma_{-} (147)
¯V¯1V¯2,V¯1eSI2iθ2σ,V¯2eSI2iθ2σ+\displaystyle\overline{\cal I}\sim\overline{V}_{1}\overline{V}_{2},\qquad\overline{V}_{1}\sim\mathrm{e}^{-\frac{S_{I}}{2}-\mathrm{i}\frac{\theta}{2}}\sigma_{-},\qquad\overline{V}_{2}\sim\mathrm{e}^{-\frac{S_{I}}{2}-\mathrm{i}\frac{\theta}{2}}\sigma_{+} (148)

The topological and Coulomb charges of these configurations are:

V1:(+12,+1),V2:(+12,1)\displaystyle{V}_{1}:\left(+{\textstyle{\frac{1}{2}}},+1\right),\qquad{V}_{2}:\left(+{\textstyle{\frac{1}{2}}},-1\right) (149)
V¯1:(12,1),V¯2:(12,+1)\displaystyle\overline{V}_{1}:\left(-{\textstyle{\frac{1}{2}}},-1\right),\qquad\overline{V}_{2}:\left(-{\textstyle{\frac{1}{2}}},+1\right) (150)

Note that Coulomb charges of V1{V}_{1} and V¯2\overline{V}_{2} are same, but their topological charge differ in sign. They interact with each other according to Coulomb’s law in 2d2d. The proliferations of these events on 2{\mathbb{R}}^{2} produce a modified mass term: V1+V2+V¯1+V¯2{V}_{1}+{V}_{2}+\overline{V}_{1}+\overline{V}_{2}. As a result, we obtain the generalization of (146),

Zinst/antiinst\displaystyle Z_{\rm inst/anti-inst} =𝒟ψ¯𝒟ψexp[(ψ¯iγμμψ+2KeSI2cosθ2(ψ¯ψ))]\displaystyle=\int{\mathcal{D}}\overline{\psi}{\mathcal{D}}\psi\;\;\exp\left[-\int\left(\overline{\psi}\mathrm{i}\gamma_{\mu}\partial_{\mu}\psi+2K\mathrm{e}^{-{S_{I}\over 2}}\cos\frac{\theta}{2}(\overline{\psi}\psi)\right)\right] (151)

which is now invariant under θθ\theta\rightarrow-\theta. This path integration does reproduce the proliferation of both vortices and anti-vortices with the correct θ\theta angle dependence, and incorporate all WW\in{\mathbb{Z}} sectors. However, it is not only a sum over configurations of the form n,¯n{\cal I}^{n},\overline{\cal I}^{n}, it also include contribution of configurations such as V1V¯1{V}_{1}\overline{V}_{1} and V2V¯2{V}_{2}\overline{V}_{2}.

Let us now investigate the configurations contributing to the full partition function carefully.

Z\displaystyle Z =𝒟ψ¯𝒟ψe(ψ¯iγμμψ+λ(eiθ2ψ¯LψR+eiθ2ψ¯RψL+eiθ2ψ¯RψL+eiθ2ψ¯LψR))\displaystyle=\int{\mathcal{D}}\overline{\psi}{\mathcal{D}}\psi\;\;\mathrm{e}^{-\int\left(\overline{\psi}\mathrm{i}\gamma_{\mu}\partial_{\mu}\psi+\lambda(\mathrm{e}^{\mathrm{i}\frac{\theta}{2}}\overline{\psi}_{L}\psi_{R}+\mathrm{e}^{\mathrm{i}\frac{\theta}{2}}\overline{\psi}_{R}\psi_{L}+\mathrm{e}^{-\mathrm{i}\frac{\theta}{2}}\overline{\psi}_{R}\psi_{L}+\mathrm{e}^{-\mathrm{i}\frac{\theta}{2}}\overline{\psi}_{L}\psi_{R})\right)} (152)
=𝒟ψ¯𝒟ψe(ψ¯iγμμψ)\displaystyle=\int{\mathcal{D}}\overline{\psi}{\mathcal{D}}\psi\;\;\mathrm{e}^{-\int\left(\overline{\psi}\mathrm{i}\gamma_{\mu}\partial_{\mu}\psi\right)} (153)
×n1=0(eiθ2λ)n1n1!(j=1n1d2aj1)σ+(a11)σ+(an11)\displaystyle\times\sum_{n_{1}=0}^{\infty}\frac{\left(\mathrm{e}^{\mathrm{i}\frac{\theta}{2}}\lambda\right)^{n_{1}}}{n_{1}!}\int\left(\prod_{j=1}^{n_{1}}d^{2}a^{1}_{j}\right)\sigma_{+}(a^{1}_{1})\ldots\sigma_{+}(a^{1}_{n_{1}}) (154)
×n2=0(eiθ2λ)n2n2!(j=1n2d2aj2)σ(a12)σ(an22)\displaystyle\times\sum_{n_{2}=0}^{\infty}\frac{\left(\mathrm{e}^{\mathrm{i}\frac{\theta}{2}}\lambda\right)^{n_{2}}}{n_{2}!}\int\left(\prod_{j=1}^{n_{2}}d^{2}a^{2}_{j}\right)\sigma_{-}(a^{2}_{1})\ldots\sigma_{-}(a^{2}_{n_{2}}) (155)
×n¯1=0(eiθ2λ)n¯1n¯1!(j=1n¯1d2bj1)σ(b11)σ(bn¯11)\displaystyle\times\sum_{\overline{n}_{1}=0}^{\infty}\frac{\left(\mathrm{e}^{-\mathrm{i}\frac{\theta}{2}}\lambda\right)^{\overline{n}_{1}}}{\overline{n}_{1}!}\left(\prod_{j=1}^{\overline{n}_{1}}d^{2}b^{1}_{j}\right)\sigma_{-}(b^{1}_{1})\ldots\sigma_{-}(b^{1}_{\overline{n}_{1}}) (156)
×n¯2=0(eiθ2λ)n¯2n¯2!(j=1n¯2d2bj2)σ+(b12)σ+(bn¯22)\displaystyle\times\sum_{\overline{n}_{2}=0}^{\infty}\frac{\left(\mathrm{e}^{-\mathrm{i}\frac{\theta}{2}}\lambda\right)^{\overline{n}_{2}}}{\overline{n}_{2}!}\left(\prod_{j=1}^{\overline{n}_{2}}d^{2}b^{2}_{j}\right)\sigma_{+}(b^{2}_{1})\ldots\sigma_{+}(b^{2}_{\overline{n}_{2}}) (157)

A typical term in the sum has (n1,n2,n¯1,n¯2)(n_{1},n_{2},\overline{n}_{1},\overline{n}_{2}) many (V1,V2,V¯1,V¯2)({V}_{1},{V}_{2},\overline{V}_{1},\overline{V}_{2}) insertions, respectively. In (157), the number of σ+\sigma_{+} insertions is n1+n¯2n_{1}+\overline{n}_{2} and the number of σ\sigma_{-} insertions is n2+n¯1n_{2}+\overline{n}_{1}. Therefore, using (143), we observe that the sum is non-zero if and only if

n1n¯1=n2n¯2=W\displaystyle n_{1}-\overline{n}_{1}=n_{2}-\overline{n}_{2}=W\in{\mathbb{Z}} (158)

constraint is obeyed. We can solve this one constraint to turn four sums into triple sum one of which can be identified as a topological charge WW\in{\mathbb{Z}}:

Z\displaystyle Z =W[n¯1=0n¯2=0λ2n¯1+2n¯2+2W(n¯1+W)!n¯1!(n¯2+W)!n¯2!j=1n1d2aj1j=1n2d2aj2j=1n¯1d2bj1j=1n¯2d2bj2eSint]na=n¯a+WeiWθ\displaystyle=\sum_{W\in\mathbb{Z}}\left[\sum_{\overline{n}_{1}=0}^{\infty}\sum_{\overline{n}_{2}=0}^{\infty}\frac{\lambda^{2\overline{n}_{1}+2\overline{n}_{2}+2W}}{(\overline{n}_{1}+W)!\overline{n}_{1}!(\overline{n}_{2}+W)!\overline{n}_{2}!}\int\prod_{j=1}^{n_{1}}d^{2}a^{1}_{j}\prod_{j=1}^{n_{2}}d^{2}a^{2}_{j}\prod_{j=1}^{\overline{n}_{1}}d^{2}b^{1}_{j}\prod_{j=1}^{\overline{n}_{2}}d^{2}b^{2}_{j}\;\mathrm{e}^{-S_{\rm int}}\right]_{n_{a}=\overline{n}_{a}+W}\mathrm{e}^{iW\theta} (159)

where the interaction action is:

Sint=4πi=12qiqj[k=1nil=1njG(akialj)+k=1n¯il=1n¯jG(bkiblj)2k=1nil=1n¯jG(akiblj)]ni=n¯i+W\displaystyle S_{\rm int}=4\pi\sum_{i=1}^{2}q_{i}\cdot q_{j}\left[\sum_{k=1}^{n_{i}}\sum_{l=1}^{n_{j}}G(a^{i}_{k}-a^{j}_{l})+\sum_{k=1}^{\overline{n}_{i}}\sum_{l=1}^{\overline{n}_{j}}G(b^{i}_{k}-b^{j}_{l})-2\sum_{k=1}^{n_{i}}\sum_{l=1}^{\overline{n}_{j}}G(a^{i}_{k}-b^{j}_{l})\right]_{n_{i}=\overline{n}_{i}+W} (160)

where G(a)=14πlog|a|2G(a)=-\frac{1}{4\pi}\log|a|^{2} is 2d Green’s function.

Now, if we take (151) seriously, assuming the semi-classics that leads to it is reliable (we will question this later), it tells us that the 1\mathbb{CP}^{1} theory on 2{\mathbb{R}}^{2} develops a finite correlation length for θ0\theta\neq 0, a mass gap, of the form:

mgap(θ)=?Λ|cosθ2|\displaystyle m_{\rm gap}(\theta)\;\underbrace{=}_{?}\;\Lambda\left|\cos\frac{\theta}{2}\right| (161)

This is, of course, a beautiful result at first sight, compliant with all of our educated guesses, a non-perturbative mass gap at θπ\theta\neq\pi and gaplessness at θ=π\theta=\pi, and a wonderful confirmation of the formalism described in Refs Berg:1979uq ; Fateev:1979dc , and Polyakov:1987ez . But this formula (not its consequences) (161) is actually not quite correct, because the formalism we set-up to this point, has some flaws.

First, let us describe why we got an “correct looking” result. The partition function has the form Z=WZWeiWθZ=\sum_{W\in\mathbb{Z}}Z_{W}\mathrm{e}^{iW\theta} where ZWZ_{W} is the partition function of a fixed WW\in{\mathbb{Z}} sector. But contributing configurations is richer than just naive instanton sums. Clearly, there are configurations of the form V1V¯1{V}_{1}\overline{V}_{1} with action e2×SIN\mathrm{e}^{-2\times\frac{S_{I}}{N}} which happens to be just instanton action for N=2N=2, but for general NN, fractional action configurations. At any rate, even for N=2N=2, this configuration is distinct from 2d instanton Polyakov:1975yp , which is V1V2{V}_{1}{V}_{2}. The fact that configurations such as V1{V}_{1} here, and V¯1\overline{V}_{1} there contribute to path integral is responsible for the appearance of cosθ2\cos\frac{\theta}{2} in the effective action. With the usual instantons, we could never get such a factor. This is a good part of the truth.

Polyakov, in his book Polyakov:1987ez , following a proposal of Ref.Bukhvostov:1980sn , argues that to incorporate anti-instantons, one should include a second Dirac fermion and couple the two in a special way. This is by no means necessary. The Coulomb charge of V1{V}_{1} is same as the Coulomb charge of V¯2\overline{V}_{2}. At θ=0\theta=0, there is no distinction between the corresponding defects and the corresponding operators. Only when we turn on θ\theta, we see their difference due to topology V1eSI2+iθ2σ+,V¯2eSI2iθ2σ+{V}_{1}\sim\mathrm{e}^{-\frac{S_{I}}{2}+\mathrm{i}\frac{\theta}{2}}\sigma_{+},\;\overline{V}_{2}\sim\mathrm{e}^{-\frac{S_{I}}{2}-\mathrm{i}\frac{\theta}{2}}\sigma_{+}, but this still does not demand introduction of a second fermion, since the Coulomb charges are same. Hence, to produce the correct sum over vortex instantons and anti-instantons, (151) is perfectly sufficient as we showed.

A possible issue with (144) (or even (151)) is that, at least naively, it does not look consistent with the global symmetry of the original 1\mathbb{CP}^{1} model, which is SU(2)/2SU(2)/{\mathbb{Z}}_{2}. Rather, it has an obvious U(1)U(1) symmetry. But effective field theories must respect the global symmetries of microscopic theories, and there is no negotiation room there. However, similar to abelian bosonization, sometimes symmetries may be obscured. Still, we want to make global symmetry manifest.

Furthermore, the summation over fractional vortex instantons VaV_{a}, V¯a\overline{V}_{a} is a good approximation if there exists a weak coupling semi-classical domain of the theory, similar to small S1×S^{1}\times{\mathbb{R}} as in Dunne:2012ae . But here, we are in the strongly coupled domain of the P1\mathbb{C}P^{1}. Is it possible in some way to take into account correlated events such as [VaV¯1]±,[VaV¯b],[V1V1V1],[{V}_{a}\overline{V}_{1}]_{\pm},[{V}_{a}\overline{V}_{b}],[{V}_{1}{V}_{1}{V}_{1}],\ldots etc. Note that this is not a quantitative, but qualitative issue. For example, does the gaplessness (161) persist once higher order effects are taken into account? Or can the theta dependence include terms of the form cos3θ2\cos\frac{3\theta}{2}, or include fractional powers such as (cosθ2)1/3\left(\cos\frac{\theta}{2}\right)^{1/3}? If one can achieve this, there is a hope to go beyond semi-classical ideas via such construction.

4.3 Fermionization, Abelian rebosonization and mass gap at arbitrary NN

We are largely inspired from the fermionic description of the fractional vortex instanton gas Berg:1979uq ; Fateev:1979dc in 1\mathbb{CP}^{1} model. However, this construction comes with a number of deficits, a number of which we already fixed. However, probably the most important issue is that original 1\mathbb{CP}^{1} has a SU(2)/2SU(2)/{\mathbb{Z}}_{2} symmetry, which is not manifest in (144) (or (151)). In more general N1\mathbb{CP}^{N-1} case, this symmetry is SU(N)/NSU(N)/{\mathbb{Z}}_{N}, and it is important to capture the mixed anomaly structure of these theories correctly. Therefore, we introduce NN massive fermions and gauge over-all U(1)U(1) in the general case. Therefore, we land on the NN-flavor Schwinger model with an SU(N)SU(N) invariant mass term.

S=12e2M2|da|2+iθ2πM2da+M2d2x𝝍¯γμ(μ+iaμ)𝝍+m𝝍¯𝝍S={1\over 2e^{2}}\int_{M_{2}}|\mathrm{d}a|^{2}+{\mathrm{i}\theta\over 2\pi}\int_{M_{2}}\mathrm{d}a+\int_{M_{2}}\mathrm{d}^{2}x\,\overline{\bm{\psi}}\gamma^{\mu}(\partial_{\mu}+\mathrm{i}a_{\mu}){\bm{\psi}}+m\overline{\bm{\psi}}{\bm{\psi}} (162)

The global symmetry, mixed anomalies and global inconsistencies, weak coupling semi-classical description of the NN-flavor Schwinger model match exactly with the PN1{\mathbb{C}P}^{N-1} model. Furthermore, 22-flavor Schwinger model provides the correct fix for the result of Refs Berg:1979uq ; Fateev:1979dc on 2{\mathbb{R}}^{2} based on the X-parametrization of moduli space. Because of these matching, it is natural that the low energy theory for the N1\mathbb{CP}^{N-1} model is described by massive Schwinger model, or equivalently, its abelian bosonization or its non-abelian bosonization given by mass deformed SU(N)1SU(N)_{1} WZW model with an extra scalar. The global symmetry in all cases is G=SU(N)/NG=SU(N)/{\mathbb{Z}}_{N} These theories also have charge conjugation symmetry 𝖢{\mathsf{C}} at θ=0,π\theta=0,\pi. The mixed anomaly and global inconsistency are exactly the same in these theories Dunne:2018hog ; Misumi:2019dwq .

We will now show that, upon bosonization, the tunneling events in this theory are described by the same set as in N1\mathbb{CP}^{N-1} model, by (107) and (109). Later, when we describe semi-classics, we will observe that the mass of the fermion mm and the coupling e2e^{2} in Schwinger model are controlled by strong scale Λ=μe4πg2(μ)N\Lambda=\mu\mathrm{e}^{-\frac{4\pi}{g^{2}(\mu)N}} of the N1\mathbb{CP}^{N-1} model. In particular, mΛm\sim\Lambda and e2=48πΛ2Ne^{2}=\frac{48\pi\Lambda^{2}}{N}.

First, let us review Coleman’s celebrated result, by following his footsteps in Coleman:1976uz , with some minor remarks sprinkled here and there. For NN-Fermi fields, NN-Bose fields are introduced. The bosonization dictionary is:

ψ¯aγμψaπ1/2ϵμννϕa\displaystyle\overline{\psi}_{a}\gamma^{\mu}\psi_{a}\leftrightarrow\pi^{-1/2}\epsilon_{\mu\nu}\partial_{\nu}\phi_{a} (163)
ψ¯aψacmNmcos(4πϕa)\displaystyle\overline{\psi}_{a}\psi_{a}\leftrightarrow-cmN_{m}\cos\left(\sqrt{4\pi}\phi_{a}\right) (164)

where NmN_{m} denotes normal ordering with respect to mass mm, and cc is numerical constant. There are two useful identities proven in Coleman:1974bu and used in Coleman:1976uz . Nm[Π2+(1ϕ)2]=Nμ[Π2+(1ϕ)2+constant]N_{m}[\Pi^{2}+(\partial_{1}\phi)^{2}]=N_{\mu}[\Pi^{2}+(\partial_{1}\phi)^{2}+\rm constant] (just an additive shift to ground state energy, which we can be ignorant about in this work), and more importantly

Nmcos(βϕ)=(μm)β2/4πNμcos(βϕ)\displaystyle N_{m}\cos(\beta\phi)=\left(\frac{\mu}{m}\right)^{\beta^{2}/4\pi}N_{\mu}\cos(\beta\phi) (165)

This relation encodes multiplicative remormalization of the cos(βϕ)\cos(\beta\phi) operator, to account for the change of the normal ordering scale.

To obtain the bosonized action, we use the dictionary and integrate out the gauge field.

L=12a=1N(dϕa)2cm2a=1Ncos(4πϕa)+e22πmink(a=1Nϕa14π(θ+2πk))2L={\textstyle{\frac{1}{2}}}\sum_{a=1}^{N}(\mathrm{d}\phi_{a})^{2}-cm^{2}\sum_{a=1}^{N}\cos\left(\sqrt{4\pi}\phi_{a}\right)+\frac{e^{2}}{2\pi}\min_{k}\Big{(}\sum_{a=1}^{N}\phi_{a}-\frac{1}{\sqrt{4\pi}}(\theta+2\pi k)\Big{)}^{2} (166)

This is same as Coleman’s original Coleman:1976uz for N=2N=2, except that we took into account the global aspects of gauge field in integrating it out, hence, mink\min_{k} structure in the potential. As Coleman instructs us, the SU(N)SU(N) invariance of the theory is obscured in this presentation, but not lost. If we wanted to keep SU(N)SU(N) invariance manifest throughout the discussion, we have to use non-abelian bosonization and mass perturbation of the WZW action. Because of its utility, we will return to this perspective as well.

Define the following transformation:

ϕ~\displaystyle\widetilde{\phi} =1Na=1Nϕa1N𝒆0ϕ\displaystyle=\frac{1}{\sqrt{N}}\sum_{a=1}^{N}\phi_{a}\equiv\frac{1}{\sqrt{N}}{\bm{e}}_{0}\cdot\bm{\phi} (167)
ϕ~a\displaystyle\widetilde{\phi}_{a} =ϕa1Na=1Nϕa=𝝂aϕa=1,,Na=1Nϕ~a=0\displaystyle=\phi_{a}-\frac{1}{N}\sum_{a=1}^{N}\phi_{a}={\bm{\nu}}_{a}\cdot\bm{\phi}\qquad a=1,\ldots,N\qquad\sum_{a=1}^{N}\widetilde{\phi}_{a}=0 (168)

where now NN-fields ϕ~a\widetilde{\phi}_{a} obeys a constraint, hence, representing N1N-1 fields, we can write the kinetic term as a=1N(dϕ~a)2+12(dϕ~)2=12a=1N(dϕa)2\sum_{a=1}^{N}(\mathrm{d}\widetilde{\phi}_{a})^{2}+\frac{1}{2}(\mathrm{d}\widetilde{\phi})^{2}=\frac{1}{2}\sum_{a=1}^{N}(\mathrm{d}\phi_{a})^{2}. As a result, the lagrangian can be written as

L\displaystyle L =12(dϕ~)2+12Ne2πmink(ϕ~14πN(θ+2πk))2+\displaystyle=\frac{1}{2}(\mathrm{d}\widetilde{\phi})^{2}+\frac{1}{2}\frac{Ne^{2}}{\pi}\min_{k}\left(\widetilde{\phi}-\sqrt{\frac{1}{4\pi N}}(\theta+2\pi k)\right)^{2}+ (169)
+12a=1N(dϕ~a)2cm2a=1Ncos(4πNϕ~+4πϕ~a)\displaystyle+\frac{1}{2}\sum_{a=1}^{N}(\mathrm{d}\widetilde{\phi}_{a})^{2}-cm^{2}\sum_{a=1}^{N}\cos\left(\sqrt{\frac{4\pi}{N}}\widetilde{\phi}+\sqrt{4\pi}\widetilde{\phi}_{a}\right) (170)

ϕ~\widetilde{\phi} particle has a mass square M2=Ne2π=48Λ2M^{2}=\frac{Ne^{2}}{\pi}=48\Lambda^{2}. In Schwinger model, this mode can be made parametrically heavy compared to other modes. However, since we are performing a matching between N1\mathbb{CP}^{N-1} and Schwinger model to match their semi-classical description in terms of tunneling events, we know that this mode cannot be made parametrically heavier than the lower lying modes. This mode is likely the SU(N)SU(N) singlet mode in the spectrum of N1\mathbb{CP}^{N-1}, limxyz¯i(x)eixyazi(y)\lim_{x\rightarrow y}\overline{z}_{i}(x)\mathrm{e}^{\mathrm{i}\int_{x}^{y}a}z^{i}(y), which differs from the adjoint by an order O(1/N)O(1/N) splitting Witten:1978bc . So, this splitting is more pronounced for N=2,3N=2,3, and not so important at large-NN where indeed, the adjoint and singlet become degenerate. With this in mind, let us integrate out the singlet field. Using the renormal ordering (or matching) prescription (165) of Ref.Coleman:1976uz , the low-energy theory takes the form:

L\displaystyle L =12a=1N(dϕ~a)2cmN+1NMN1Na=1Ncos(θ+2πkN+4πϕ~a),ϕ~N=a=1N1ϕ~a\displaystyle=\frac{1}{2}\sum_{a=1}^{N}(\mathrm{d}\widetilde{\phi}_{a})^{2}-cm^{\frac{N+1}{N}}M^{\frac{N-1}{N}}\sum_{a=1}^{N}\cos\left(\frac{\theta+2\pi k}{N}+\sqrt{4\pi}\;\widetilde{\phi}_{a}\right),\qquad\widetilde{\phi}_{N}=-\sum_{a=1}^{N-1}\widetilde{\phi}_{a} (171)

Two mass parameters in this action can be further merged to one, by re-using a renormal-ordering with respect to mass mm^{\prime}.

L\displaystyle L =12(dϕ)2(m)2a=1Ncos(4π𝝂aϕ)+\displaystyle={\textstyle{\frac{1}{2}}}(\mathrm{d}\bm{\phi})^{2}-(m^{\prime})^{2}\sum_{a=1}^{N}\ \cos\left(\sqrt{4\pi}{\bm{\nu}}_{a}\cdot\bm{\phi}\right)+\ldots (172)
m(θ)\displaystyle m^{\prime}(\theta) =(cm2NMN1Ncosθ+2πkN)NN+1\displaystyle=\left(cm^{\frac{2}{N}}M^{\frac{N-1}{N}}\cos\frac{\theta+2\pi k}{N}\right)^{\frac{N}{N+1}} (173)

This result agrees with Refs.Coleman:1976uz for N=2N=2 and Smilga:1992hx ; Hetrick:1995wq ; Hosotani:1998za for general NN. Here, we can regard ϕ\bm{\phi} as an NN-component vector. The mode associated with 1N𝒆0ϕ{1\over{\sqrt{N}}}\bm{e}_{0}\cdot\bm{\phi} decouples from the rest, making (173) particularly easy formulation to work with. 131313 In the process of completing this paper, we became aware of Ref.Wamer:2020hmv , which gives a description of flag manifold sigma models, where fractional vortex instantons associated with 𝝂aΓw{\bm{\nu}}_{a}\in\Gamma_{w} gaps out the theory. It is easy to make a connection between that work and ours by using SU(N)1SU(N)_{1} WZW formulation (196), and using double-trace deformations Unsal:2008ch to connect WZW to flag manifold, generalizing Tanizaki:2018xto .

Tunneling events in abelian bosonization description: The elementary tunneling events in (173) can be described similar to (107), and they correspond to

Δϕ=ϕ(τ=L)ϕ(τ=0)=π𝜶a,a=1,,N\displaystyle\Delta\bm{\phi}=\bm{\phi}(\tau=L)-\bm{\phi}(\tau=0)=\sqrt{\pi}\bm{\alpha}_{a},\qquad a=1,\ldots,N (174)

the simple roots and affine root of SU(N)SU(N) algebra living in the affine root system.

Δaffine(1)={𝜶1,,𝜶N}\displaystyle\Delta_{\rm affine}^{(1)}=\left\{\bm{\alpha}_{1},\ldots,\bm{\alpha}_{N}\right\} (175)

The crucial point is that the action of these tunneling events is controlled by mm^{\prime}. Using the matching conditions with N1\mathbb{CP}^{N-1} for mm and MM, we observe that the tunneling amplitude is controlled by eSI/N\mathrm{e}^{-S_{I}/N} in the bosonized version of the Schwinger model.

We can also define tunneling events associated with higher roots, and define the orbits as:

Δaffine(k)\displaystyle\Delta_{\rm affine}^{(k)} ={𝜶a+𝜶a+1+𝜶a+k1|a=1,N},k=1,,N1\displaystyle=\left\{\bm{\alpha}_{a}+\bm{\alpha}_{a+1}+\ldots\bm{\alpha}_{a+k-1}|a=1,\ldots N\right\},\qquad k=1,\ldots,N-1 (176)

corresponding to deomposition of N2NN^{2}-N roots in total to N1N-1 orbits. Note that Δaffine(N)=𝜶a+𝜶a+1+𝜶a+N1=0.\Delta_{\rm affine}^{(N)}=\bm{\alpha}_{a}+\bm{\alpha}_{a+1}+\ldots\bm{\alpha}_{a+N-1}=0. The events in the kthk^{\rm th} orbit have fugacity ekSI/N\mathrm{e}^{-kS_{I}/N}

The proliferation of the vortices describe a classical two-dimensional Coulomb gas of N2NN^{2}-N types of charges associated with charges 𝜶ab=𝝂a𝝂b=𝒆a𝒆b\bm{\alpha}_{ab}=\bm{\nu}_{a}-\bm{\nu}_{b}=\bm{e}_{a}-\bm{e}_{b} interacting with each other via via the 2d Coulomb’s law,

±𝜶𝜷14πlog|𝒂𝜶𝒂𝜷|2\displaystyle\pm\bm{\alpha}\cdot\bm{\beta}\frac{1}{4\pi}\log|\bm{a}_{\bm{\alpha}}-\bm{a}_{\bm{\beta}}|^{2} (177)

In order to define the operators associated with tunneling events, we need to use abelian duality in 2d. Using d𝝈=dϕ\mathrm{d}\bm{\sigma}=\star\mathrm{d}\bm{\phi}, we can write the tunneling events as:

V~𝜶ab=e|ab|SINeiπ𝜶ab𝝈\displaystyle\widetilde{V}_{\bm{\alpha}_{ab}}=\mathrm{e}^{-\frac{|a-b|S_{I}}{N}}\mathrm{e}^{\mathrm{i}\sqrt{\pi}\bm{\alpha}_{ab}\cdot\bm{\sigma}} (178)

Note that these events are hierarchical in a semi-classical domain, for example, once we introduce the ΩF\Omega_{F} background.

Since both semi-classical expansion and the mixed anomalies of the massive Schwinger model are same with the N1\mathbb{CP}^{N-1}, the massive Schwinger model will provide an qualitatively accurate description of the infrared physics of N1\mathbb{CP}^{N-1}.

4.4 Fractional instanton renormalization group (FIRG)

Since ϕ~a\widetilde{\phi}_{a} is a constrained field, we need to be careful in reading off the dimension of the cos()\cos(\ldots) operator. For example, setting N=2N=2 on (171), case examined in Coleman:1976uz , ϕ~2=ϕ~1\widetilde{\phi}_{2}=-\widetilde{\phi}_{1}, where ϕ~1\widetilde{\phi}_{1} is unconstrained. Then, properly normalizing the kinetic term by a field redefinition,

L\displaystyle L =12(dϕ~1)22(m)2cos(2πϕ~1),\displaystyle=\frac{1}{2}(\mathrm{d}\widetilde{\phi}_{1})^{2}-2(m^{\prime})^{2}\cos\left(\sqrt{2\pi}\;\widetilde{\phi}_{1}\right), (179)

and dimension of the cos()\cos(\ldots) operator is Δ[cos(2πϕ~1)]=β24π=12\Delta[\cos(\sqrt{2\pi}\;\widetilde{\phi}_{1})]=\frac{\beta^{2}}{4\pi}={\textstyle{\frac{1}{2}}}, in agreement with Coleman:1976uz . For general NN, it is slightly tricky to read off the dimension of cos()\cos(\ldots) operator and for this, the way we parametrized things in (173) is useful. In this formula, ϕ\bm{\phi} field is unconstrained, but there is an extra redundant mode in it that decouples from the dynamics. As a result, the dimension of vertex operators are easy to read:

dim(Va)=Δ[ei4π𝝂aϕ]=β2𝝂a24π=N1N\displaystyle{\rm dim}({V}_{a})=\Delta\left[\mathrm{e}^{\mathrm{i}\sqrt{4\pi}\;{\bm{\nu}_{a}\cdot\bm{\phi}}}\right]=\frac{\beta^{2}{\bm{\nu}_{a}}^{2}}{4\pi}=\frac{N-1}{N} (180)

This is consistent with non-abelian bosonization in which the identification is:

ψ¯L,aψR,bUab=ei4πNϕ~gab\displaystyle\overline{\psi}_{\mathrm{L},a}\psi_{\mathrm{R},b}\sim U_{ab}=\mathrm{e}^{\mathrm{i}\sqrt{\frac{4\pi}{N}}\widetilde{\phi}}g_{ab} (181)

and the dimensions of the corresponding operators are Δ[ei4πNϕ~]=1N\Delta\Big{[}\mathrm{e}^{\mathrm{i}\sqrt{\frac{4\pi}{N}}\widetilde{\phi}}\Big{]}=\frac{1}{N} and Δ[gab]=N1N\Delta\Big{[}g_{ab}\Big{]}=\frac{N-1}{N} where gg is SU(N)SU(N) valued.

Note that IR scaling in the vertex operator in the large-NN limit goes to 1, same as free field or UV of a fermion bilinear in 2d. The most dramatic change in the scaling dimension happens for N=2N=2, in that case, the scaling of fermion bilinear drops from 11 in the UV to 12\frac{1}{2} in the IR. This has an interesting effect that we discuss separately for N=2N=2 case with θ\theta angle.

The mass matrix for the ϕ\bm{\phi} field can be found by diagonalizing a(𝝂aϕ)2\sum_{a}({\bm{\nu}}_{a}\cdot\bm{\phi})^{2} matrix. The mass square matrix components are Mij2=𝝂ai𝝂aj=δij1NM_{ij}^{2}={\bm{\nu}}_{a}^{i}{\bm{\nu}}_{a}^{j}=\delta_{ij}-\frac{1}{N}

M2=𝟏N1N𝑱N\displaystyle M^{2}=\bm{1}_{N}-\frac{1}{N}{\bm{J}}_{N} (182)

where 𝑱N\bm{J}_{N} is all-one matrix and 𝟏N\bm{1}_{N} is identity matrix. M2M^{2} has N1N-1 eigenvalues which are equal to 11, for which the eigenvectors are 𝜶a,a=1,N1\bm{\alpha}_{a},a=1,\ldots N-1 and 11 eigenvalue equal to 0 with eigenvector 𝒆0\bm{e}_{0}.

Relevants and irrelevants in infrared dynamics (IR scaling dimensions): Let us list the scaling dimension of various vertex operators.141414The vertex operators in the effective action (198) can be viewed as tunneling events of the effective action based on proliferation of the vortices with charges (174) in the 𝜶Γr\bm{\alpha}\in\Gamma_{r}, based on the dual field 𝝈\bm{\sigma} in (178). This will help us to identify which of these operators are relevant in the infrared. It will also help us ”tower” of interesting operators which cary same Coulomb charge (scaling dimension), but higher topological charge.

dim(1)\displaystyle{\rm dim}(1) =0\displaystyle=0 (183)
dim(Va)\displaystyle{\rm dim}({V}_{a}) =Δ[ei4π𝝂aϕ]=(𝝂a)2=(11N)\displaystyle=\Delta\left[\mathrm{e}^{\mathrm{i}\sqrt{4\pi}\;{\bm{\nu}_{a}\cdot\bm{\phi}}}\right]=(\bm{\nu}_{a})^{2}=\Big{(}1-\frac{1}{N}\Big{)} (184)
dim(VaVb)\displaystyle{\rm dim}({V}_{a}{V}_{b}) =Δ[ei4π(𝝂a+𝝂b)ϕ]=(𝝂a+𝝂b)2={2(12N)ab4(11N)a=b\displaystyle=\Delta\left[\mathrm{e}^{\mathrm{i}\sqrt{4\pi}\;{(\bm{\nu}_{a}+\bm{\nu}_{b})\cdot\bm{\phi}}}\right]=(\bm{\nu}_{a}+\bm{\nu}_{b})^{2}=\left\{\begin{array}[]{ll}2\Big{(}1-\frac{2}{N}\Big{)}&\qquad a\neq b\cr 4\Big{(}1-\frac{1}{N}\Big{)}&\qquad a=b\end{array}\right. (187)
dim(VaV¯b)\displaystyle{\rm dim}({V}_{a}\overline{V}_{b}) =Δ[ei4π(𝝂a𝝂b)ϕ]=(𝝂a𝝂b)2=2\displaystyle=\Delta\left[\mathrm{e}^{\mathrm{i}\sqrt{4\pi}\;{(\bm{\nu}_{a}-\bm{\nu}_{b})\cdot\bm{\phi}}}\right]=(\bm{\nu}_{a}-\bm{\nu}_{b})^{2}=2 (188)
dim(VaVbVc)\displaystyle{\rm dim}({V}_{a}{V}_{b}{V}_{c}) =Δ[ei4π(𝝂a+𝝂b+𝝂c)ϕ]=(𝝂a+𝝂b+𝝂c)23(13N)\displaystyle=\Delta\left[\mathrm{e}^{\mathrm{i}\sqrt{4\pi}\;{(\bm{\nu}_{a}+\bm{\nu}_{b}+\bm{\nu}_{c})\cdot\bm{\phi}}}\right]=(\bm{\nu}_{a}+\bm{\nu}_{b}+\bm{\nu}_{c})^{2}\geq 3\Big{(}1-\frac{3}{N}\Big{)} (189)
dim(Va1Vak)\displaystyle{\rm dim}({V}_{a_{1}}\ldots{V}_{a_{k}}) =Δ[ei4π(𝝂a1++𝝂ak)ϕ]=(𝝂a1++𝝂ak)2k(1kN)\displaystyle=\Delta\left[\mathrm{e}^{\mathrm{i}\sqrt{4\pi}\;{(\bm{\nu}_{a_{1}}+\ldots+\bm{\nu}_{a_{k}})\cdot\bm{\phi}}}\right]=(\bm{\nu}_{a_{1}}+\ldots+\bm{\nu}_{a_{k}})^{2}\geq k\Big{(}1-\frac{k}{N}\Big{)} (190)
dim(a=1NVa)\displaystyle{\rm dim}\left(\prod_{a=1}^{N}{V}_{a}\right) =Δ[ei4π(𝝂a1++𝝂aN)ϕ]=0\displaystyle=\Delta\left[\mathrm{e}^{\mathrm{i}\sqrt{4\pi}\;{(\bm{\nu}_{a_{1}}+\ldots+\bm{\nu}_{a_{N}})\cdot\bm{\phi}}}\right]=0 (191)
dim((a=1NVa)qVb)\displaystyle{\rm dim}\left(\left(\prod_{a=1}^{N}{V}_{a}\right)^{q}{V}_{b}\right) =Δ[ei4π𝝂bϕ]=(11N)\displaystyle=\Delta\left[\mathrm{e}^{\mathrm{i}\sqrt{4\pi}\;{\bm{\nu}_{b}\cdot\bm{\phi}}}\right]=\Big{(}1-\frac{1}{N}\Big{)} (192)
dim(Va1Vak)\displaystyle{\rm dim}({V}_{a_{1}}\ldots{V}_{a_{k}}) =dim(Vak+1VaN)\displaystyle={\rm dim}({V}_{a_{k+1}}\ldots{V}_{a_{N}}) (193)

To be relevant in the renormalization group sense Kosterlitz:1973xp , an operator must have Δ<2\Delta<2. These are vertex operators Va{V}_{a} with (Δ1)(\Delta\sim 1) and two-vertex operators [VaVb][{V}_{a}{V}_{b}] and [VaV¯b][{V}_{a}\overline{V}_{b}] for which (Δ2)(\Delta\sim 2). We can ignore others (Δ3)(\Delta\gtrsim 3) for IR physics, but still need to be careful because of some subtleties, such as dim(Va1)=dim(Va1VaN1)dim(VaN){\rm dim}({V}_{a_{1}})={\rm dim}({V}_{a_{1}}\ldots{V}_{a_{N-1}})\equiv{\rm dim}({V}_{a_{N}}).

Finally, in the large-NN limit, only level k=1k=1 (and k=N1k=N-1) operators are relevant. k=2k=2 and k=N2k=N-2 are marginal, and the rest is irrelevant. This will allow us to determine the IR physics of the theory sufficiently robustly.

4.5 Non-abelian bosonization: Mass deformed SU(N)1SU(N)_{1} WZW

The abelian bosonization reveals crucial amount of information about dynamics, but makes the non-abelian global symmetry of the theory non-obvious. On the other hand, if we use non-Abelian bosonization, the matching of the global symmetries become manifest. Let us describe this briefly.

Non-abelian bosonization maps NN-flavors of free massless Dirac fermions to U(N)1U(N)_{1} WZW model Witten:1983ar ; Polyakov:1983tt ; Polyakov:1984et . The correspondence between operators is

ψLψ¯RU,\psi_{\mathrm{L}}\overline{\psi}_{\mathrm{R}}\sim U, (194)

where UU is the U(N)U(N)-group valued scalar field. The 0-form symmetry of the massless Schwinger model and U(N)1U(N)_{1} WZW models are SU(N)L×SU(N)R(N)V,{SU(N)_{\mathrm{L}}\times SU(N)_{\mathrm{R}}}\over(\mathbb{Z}_{N})_{\mathrm{V}}, acting on the corresponding operators by conjugation ()VL()VR(\cdot)\mapsto V_{\mathrm{L}}(\cdot)V_{\mathrm{R}}^{\dagger}. Turning on a mass term for fermion reduce the symmetry to the vectorlike subgroup,

PSU(N)=SU(N)V/(N)V\displaystyle PSU(N)={SU(N)_{\mathrm{V}}}/(\mathbb{Z}_{N})_{\mathrm{V}} (195)

which is the global symmetry of three theories: N1\mathbb{CP}^{N-1} model, massive NN-flavor Schwinger model, and mass deformation of SU(N)1SU(N)_{1} WZW model.

The bosonized action of the massive Schwinger model is given by

S\displaystyle S =12e2M2|da|2+18πM2tr(|dU|2)c2m2(trU+trU)\displaystyle={1\over 2e^{2}}\int_{M_{2}}|\mathrm{d}a|^{2}+{1\over 8\pi}\int_{M_{2}}\text{tr}\left(|\mathrm{d}U|^{2}\right)-\frac{c}{2}m^{2}(\text{tr}U+\text{tr}U^{\dagger})
+i12πM3tr((UdU)3)+12πM2da(lndet(U)+iθ),\displaystyle+{\mathrm{i}\over 12\pi}\int_{M_{3}}\text{tr}\left((U^{\dagger}\mathrm{d}U)^{3}\right)+{1\over 2\pi}\int_{M_{2}}\mathrm{d}a\wedge(\ln\det(U)+\mathrm{i}\theta), (196)

The first two terms are kinetic terms, and the third term is mass deformation, the fourth term is the level-11 Wess-Zumino term. The last term can enforces the ABJ anomaly U(1)R(N)RU(1)_{\mathrm{R}}\to(\mathbb{Z}_{N})_{\mathrm{R}} of the massless limit, and incorporates theta dependence away from massless point.

Integrating out the gauge field as in the abelian bosonization discussion, we obtain

S\displaystyle S =18πM2tr(|dU|2)c2m2(trU+trU)\displaystyle={1\over 8\pi}\int_{M_{2}}\text{tr}\left(|\mathrm{d}U|^{2}\right)-\frac{c}{2}m^{2}(\text{tr}U+\text{tr}U^{\dagger})
+i12πM3tr((UdU)3)+e22πM2(lndet(U)+iθ)2,\displaystyle+{\mathrm{i}\over 12\pi}\int_{M_{3}}\text{tr}\left((U^{\dagger}\mathrm{d}U)^{3}\right)+{e^{2}\over 2\pi}\int_{M_{2}}(\ln\det(U)+\mathrm{i}\theta)^{2}, (197)

This is the non-abelian version of (166) and now, the full PSU(N)PSU(N) symmetry is manifest. We can further (locally) integrate out mode associated with det(U)\det(U) to obtain counterpart of (171).

According to (193), there are very few relevant or marginally relevant deformations of level-1 SU(N)SU(N) WZW. These are trU\text{tr}U with Δ1\Delta\sim 1 and tr(U2)\text{tr}(U^{2}) with Δ2\Delta\sim 2 in large-NN limit, and, |trU|2,(trU)2|\text{tr}U|^{2},(\text{tr}U)^{2} for which Δ2\Delta\sim 2. In particular, double-trace deformations of the form |trUn|2,(trUn)2|\text{tr}U^{n}|^{2},(\text{tr}U^{n})^{2} for which Δ2n2\Delta\sim 2n^{2}, are irrelevant in general for n2n\geq 2.

4.6 Secrets of Coleman’s formula: Relation of RG with all orders semi-classics

In our construction, (173) arises as low energy description of the N1\mathbb{CP}^{N-1} model. For that purpose, we can identify parameters of fermionized low energy limit of N1\mathbb{CP}^{N-1} as mΛeSI/Nm\sim\Lambda\sim\mathrm{e}^{-S_{I}/N} and μ2Ne2Λ2\mu^{2}\sim Ne^{2}\sim\Lambda^{2} and hence, mass gap is given by

m(θ)=Λmaxk(cosθ+2πkN)NN+1\displaystyle m^{\prime}(\theta)=\Lambda\max_{k}\left(\cos\frac{\theta+2\pi k}{N}\right)^{\frac{N}{N+1}} (198)

We would like to discuss rich physics associated with this result, which will teach us new things concerning extension of semi-classical ideas to 2{\mathbb{R}}^{2}. We believe all the major lessons extend to gauge theory on 4{\mathbb{R}}^{4} and 3×S1{\mathbb{R}}^{3}\times S^{1} and has non-trivial implications.

  • Conceptually, the most interesting aspect of Coleman’s result Coleman:1976uz is following: Naively speaking, if we obtain (171) from semi-classical gas of vortices in S[𝝈]S[\bm{\sigma}], we are performing a first order in semi-classic analysis. The Lagrangian has a cos(θN)cos(4π𝝂aϕ)\cos\left(\frac{\theta}{N}\right)\cos\left(\sqrt{4\pi}\;{\bm{\nu}_{a}\cdot\bm{\phi}}\right) factor in it which happily arise from the leading order fractional instantons, Va{V}_{a}. However, in (173), we obtain mass gap (cosθ+2πkN)NN+1\left(\cos\frac{\theta+2\pi k}{N}\right)^{\frac{N}{N+1}} which is impossible to obtain at any finite order in semi-classical expansion. It requires infinite order in semi-classics. What happened in between?

  • What Coleman did Coleman:1976uz , as he also states, is renormalization group for topological defects Kosterlitz:1974sm ; Kosterlitz:1973xp . But renormalization group is smart, even if we just write down few operators that are permitted by symmetries, it will induce all the other operators that are permitted by symmetries. In (198), it is actually inducing infinitely many relevant operators and resumming over them!

  • What are these infinitely many relevant operators and why did we forget about them at our starting point? The operator (cosθ+2πkN)NN+1cos(4π𝝂aϕ)\left(\cos\frac{\theta+2\pi k}{N}\right)^{\frac{N}{N+1}}\cos\left(\sqrt{4\pi}{\bm{\nu}_{a}\cdot\bm{\phi}}\right) is actually telling us that, apart from the ei4π𝝂aϕ+iθN\mathrm{e}^{\mathrm{i}\sqrt{4\pi}{\bm{\nu}_{a}\cdot\bm{\phi}}+i\frac{\theta}{N}}, there are infinitely many other operators of the form ei4π𝝂aϕ+iθ(q+1N)\mathrm{e}^{\mathrm{i}\sqrt{4\pi}{\bm{\nu}_{a}\cdot\bm{\phi}}+i\theta(q+\frac{1}{N})}, the scaling dimension (or Coulomb charges) of which are the same, but topological charges are distinct, associated with topological defects with winding number W=1N(1+Nq),q=0,1,2,W=\frac{1}{N}(1+Nq),\;q=0,1,2,\ldots in the semi-classical language.151515For gauge theory on 3×S1{\mathbb{R}}^{3}\times S^{1}, each monopole instanton has an infinite KK tower associated with it. If we wish to go beyond semi-classics, the above discussion instructs us that we should some over the whole tower.

Let us consider the N=2N=2 case, where the effects is most pronounced. The mass gap as a function of θ\theta angle in this case is given by: 161616In the vicinity of θ=π\theta=\pi, our construction implies that the mass gap in the 1\mathbb{CP}^{1} model vanishes in a particular way: mgap(θ)=Λ|θπ2|23,θπ\displaystyle m_{\rm gap}(\theta)=\Lambda\left|\frac{\theta-\pi}{2}\right|^{\frac{2}{3}},\qquad\theta\rightarrow\pi (199) If one can overcome the sign problem due to θ\theta angle, it would be nice to check this prediction of our formalism.

mgap(θ)=Λ|cosθ2|23\displaystyle m_{\rm gap}(\theta)=\Lambda\left|\cos\frac{\theta}{2}\right|^{\frac{2}{3}} (200)

Let us express mgap(θ)m_{\rm gap}(\theta) as a Fourier series:

mgap(θ)Λ1=q=0aqcos((q+12)θ)\displaystyle m_{\rm gap}(\theta)\Lambda^{-1}=\sum_{q=0}^{\infty}a_{q}\cos\left((q+{\textstyle{\frac{1}{2}}})\theta\right) (201)

where the Fourier coefficients are given by:

aq=1πππdθmgap(θ)Λ1cos((q+12)θ)=243π3Γ(23)Γ(56q)Γ(q+116)\displaystyle a_{q}=\frac{1}{\pi}\int_{-\pi}^{\pi}\mathrm{d}\theta\;m_{\rm gap}(\theta)\Lambda^{-1}\;\cos\left((q+{\textstyle{\frac{1}{2}}})\theta\right)=-\frac{2^{\frac{4}{3}}\pi}{\sqrt{3}\Gamma\left(-\frac{2}{3}\right)\Gamma\left(\frac{5}{6}-q\right)\Gamma\left(q+\frac{11}{6}\right)}\qquad\qquad (202)

Numerically, we find

{a1,a2,a3,a4,a5,}={1.07119,0.09738,0.04009,0.02266,0.01484,}\displaystyle\{a_{1},a_{2},a_{3},a_{4},a_{5},\ldots\}=\{1.07119,-0.09738,0.04009,-0.02266,0.01484,\ldots\} (203)

which suggest that even in the case where the leading order semi-classics is supposed to be at its worst (N=2)(N=2), the numerical error it makes is quite small. In large-NN, the leading order semi-classics capture the whole result (198).

Our intermediate level Lagrangian (198) is based on leading vertex operators:

V1,V2,V¯1,V¯2,\displaystyle{V}_{1},\;{V}_{2},\;\overline{V}_{1},\;\overline{V}_{2}, (204)

But renormalization group generates all other (relevant and irrelevant) operators permitted by symmetries. Therefore, we ended up having a theory which describes proliferation of

V1,V2,V¯1,V¯2,[(V1V2)V1],[(V1V2)V2],[(V1V2)V¯1],[(V1V2)V¯2],\displaystyle{V}_{1},\;{V}_{2},\;\overline{V}_{1},\;\overline{V}_{2},\;[({V}_{1}{V}_{2}){V}_{1}],\;[({V}_{1}{V}_{2}){V}_{2}],\;[({V}_{1}{V}_{2})\overline{V}_{1}],\;[({V}_{1}{V}_{2})\overline{V}_{2}],\ldots (205)

topological defects with Coulomb charge ±1\pm 1 (which determines the relevance), but arbitrary half integer topological charge, W12+W\in{\textstyle{\frac{1}{2}}}+{\mathbb{Z}}, which tells us that these are sourced by topologically distinct defects:

[(V1V2)qVa]ei4πϕ~aei12(q+12)θ\displaystyle[({V}_{1}{V}_{2})^{q}{V}_{a}]\mapsto\mathrm{e}^{\mathrm{i}\sqrt{4\pi}\widetilde{\phi}_{a}}\mathrm{e}^{\mathrm{i}\frac{1}{2}(q+{\textstyle{\frac{1}{2}}})\theta} (206)

There are infinitely many correlated defects, which are relevant in the RG sense. It is natural that these defects generate the non-analytic structure that we see in Coleman’s theta angle dependence (173). These contribution arise from the different columns in the resurgence triangle, see Fig.3 or Ref.Dunne:2012ae associated with W12+W\in{\textstyle{\frac{1}{2}}}+{\mathbb{Z}}.

4.7 1\mathbb{CP}^{1} at θ=π\theta=\pi

The semi-classical approximation must be reliable provided e2πg2(~)=Λ~1\mathrm{e}^{-\frac{2\pi}{g^{2}(\tilde{\ell})}}=\Lambda\tilde{\ell}\lesssim 1 where ~\tilde{\ell} is the characteristic distance between two fractional vortex instanton position (~|ai1aj1|)(\tilde{\ell}\sim|a^{1}_{i}-a^{1}_{j}|). So, close fractional vortex instanton positions in Euclidean set-up benefit from asymptotic freedom, just like close quarks do. In principle, we should not use this formula for ~Λ1\tilde{\ell}\gtrsim\Lambda^{-1}.

The good news is, the correlation length of the system (at θ=0\theta=0) is given by Λ1\Lambda^{-1}. After summation over fractional vortex instantons and anti-instantons, the would be long distance divergences are cut-off by the Debye length, and the mass gap can be obtained self-consistently.

Now, this may receive an objection as we change θ\theta. Is it possible that (201) is quantitatively and qualitatively correct for any θ\theta even when correlation length diverges at θ=π\theta=\pi? How seriously should we take the vanishing of the gap at θ=π\theta=\pi?

We encounter exactly the same situation in the leading order semi-classics of the deformed Yang-Mills on small 3×S1{\mathbb{R}}^{3}\times S^{1}, but there, the gaplessness is lifted at second order (see Unsal:2012zj for details). Here, we will reach to a different conclusion due to renormalization group construction applied to defect theory (173). The crucial difference of Coulomb gas in 3d and 2d is that in the former Coulomb gas is always in the plasma phase (with only one known exception Cherman:2017dwt ) while in 2d Coulomb gas, the system has a Kosterlitz-Thouless transition, so both gaplessness and gap are possibilities. See fradkin_2013 , page 320.

This problem is particularly important for anti-ferromagnetic spin chains in 1-dimensional spatial lattice. These systems at low energies are described by 2d2d 1\mathbb{CP}^{1} non-linear sigma model. Integer spin is described by a sigma model without theta term and half integer spin is described by 1\mathbb{CP}^{1} with θ=π\theta=\pi, θ=2πS\theta=2\pi S. Haldane’s conjecture Haldane:1983ru states that integer and half-integer spin chains falls into distinct universality classes, and half-integer spin systems are gapless (θ=π\theta=\pi), while the integer spin systems are gapped (θ=0\theta=0). See also Affleck:1987ch .

Since leading order vertex operators Va{V}_{a} produce zero mass gap at θ=π\theta=\pi, we would like to understand the role of the second order effect of the form:

[V1V¯2]+[V2V¯1]+[V1V¯1]±+[V2V¯2]±\displaystyle[{V}_{1}\overline{V}_{2}]+[{V}_{2}\overline{V}_{1}]+[{V}_{1}\overline{V}_{1}]_{\pm}+[{V}_{2}\overline{V}_{2}]_{\pm} (207)

As we know from resurgence theory, [VaV¯a]±[{V}_{a}\overline{V}_{a}]_{\pm} type effects are are two-fold ambiguous, and the ambiguous part cancels the renormalon ambiguity in perturbation theory. Their non-ambiguous part leads to something extremely interesting. It leads to an anomalous dimension for various important operators.

We would like to determine the amplitude of these events, importantly, their signs compared to vertex operators Va{V}_{a}. Consider 2{\mathbb{R}}^{2} as +×S1{\mathbb{R}}^{+}\times S^{1}. In the presence of ΩF\Omega_{F} background, we have Va{V}_{a} and V¯a\overline{V}_{a} events. To write the correlated amplitude, we write an integral over the quasi-zero mode Lefschetz thimble. This gives us:

[V1V¯2]\displaystyle[{V}_{1}\overline{V}_{2}] =(γlog(Ag))[V1][V¯2]\displaystyle=\Big{(}-\gamma-\log\Big{(}\frac{A}{g}\Big{)}\Big{)}[{V}_{1}][\overline{V}_{2}] (208)
[V1V¯1]±\displaystyle[{V}_{1}\overline{V}_{1}]_{\pm} =(γlog(Ag)±iπ)[V1][V¯1]\displaystyle=\Big{(}-\gamma-\log\Big{(}\frac{A}{g}\Big{)}\pm\mathrm{i}\pi\Big{)}[{V}_{1}][\overline{V}_{1}] (209)

Again, by resurgent structure Dunne:2012ae ; Behtash:2018voa , ambiguous imaginary parts cancel between Borel resummed perturbation theory and non-perturbative effects. Now, we can write combinations of non-perturbative effects as

2Υ\displaystyle 2\Upsilon [(ψ¯LψR)2+(ψ¯RψL)2+2(ψ¯LψR)(ψ¯RψL)],where\displaystyle\left[(\overline{\psi}_{\mathrm{L}}\psi_{\mathrm{R}})^{2}+(\overline{\psi}_{\mathrm{R}}\psi_{\mathrm{L}})^{2}+2(\overline{\psi}_{\mathrm{L}}\psi_{\mathrm{R}})(\overline{\psi}_{\mathrm{R}}\psi_{\mathrm{L}})\right],\qquad{\rm where}\qquad (210)
2Υ\displaystyle 2\Upsilon =K2(γlog(Ag))eSI<0\displaystyle=K^{2}\left(-\gamma-\log\left(\frac{A}{g}\right)\right)\mathrm{e}^{-S_{I}}<0 (211)

The crucial point is that there is a sign difference between the leading order fractional instanton effects and second order correlated fractional instanton effects.171717This is similar to double-well quantum mechanics. E0E_{0} is shifted down by eSIe^{-S_{I}} and E1E_{1} is shifted up by instanton effects, but E0+E1E_{0}+E_{1} remains the same at leading order in semi-classics. However, at second order, [II¯]e2SI[I\overline{I}]\sim e^{-2S_{I}} correlated events shifts both E0E_{0} and E1E_{1} up by the same amount. In particle on a circle with unique minimum at θ=0\theta=0, instantons shift the ground state energy down, but both [II¯][I\overline{I}] and [II][II] shift ground state energy up. So, there is a relative sign between instanton effects and correlated instanton effect in purely bosonic theories. In general, however, the amplitude of the two-events is a subtle and deep issue. If we were to consider 1\mathbb{CP}^{1} with 𝗇\mathsf{n} Dirac fermions (𝗇=1\mathsf{n}=1 is supersymmetric theory), it is possible to show that Fugacity[V1V¯1]±=Fugacity[V1V¯2]e±i𝗇π{\rm Fugacity}[{V}_{1}\overline{V}_{1}]_{\pm}={\rm Fugacity}[{V}_{1}\overline{V}_{2}]\mathrm{e}^{\pm\mathrm{i}\mathsf{n}\pi} where ei𝗇π\mathrm{e}^{\mathrm{i}\mathsf{n}\pi} is called hidden topological angle Behtash:2015kna that arise from Lefschetz thimble integration. This extra phase is super-important, and is responsible for the vanishing of the ground state energy in supersymmetric theories, see Behtash:2015kna for details. Therefore, we can write the combinations of bion contributions as: 2Υ[(ψ¯LψR)2+(ψ¯RψL)212jμ2]2\Upsilon\left[(\overline{\psi}_{\mathrm{L}}\psi_{\mathrm{R}})^{2}+(\overline{\psi}_{\mathrm{R}}\psi_{\mathrm{L}})^{2}-\frac{1}{2}j_{\mu}^{2}\right] where jμ=ψ¯γμψj_{\mu}=\overline{\psi}\gamma_{\mu}\psi, and Υ<0\Upsilon<0. After understanding these non-trivial facts that arise from resurgence theory, we can proceed with abelian bosonization similar to (173). Following almost identical steps with Fradkin’s book fradkin_2013 , we obtain

L\displaystyle L =12(dϕ~)2+Υπ(dϕ~)2(m)2(θ)cos(2πϕ~)4Υcos(22πϕ~)\displaystyle=\frac{1}{2}(\mathrm{d}\widetilde{\phi})^{2}+\frac{\Upsilon}{\pi}(\mathrm{d}\widetilde{\phi})^{2}-(m^{\prime})^{2}(\theta)\cos(\sqrt{2\pi}\widetilde{\phi})-4\Upsilon\cos(2\sqrt{2\pi}\widetilde{\phi}) (212)

where the first part of the kinetic term is the image of fermion kinetic term, the second part is the image of jμ2j_{\mu}^{2}, which is sourced by neutral bions [V1V¯1]+[V2V¯2][{V}_{1}\overline{V}_{1}]+[{V}_{2}\overline{V}_{2}], and interaction term arise from charged bions [V1V¯2]+[V2V¯1][{V}_{1}\overline{V}_{2}]+[{V}_{2}\overline{V}_{1}]. Note that neutral bion contribution produce a non-linearity which effects the dimensions of the charged bion operator, and this effect will be important.

Setting θ=π\theta=\pi, and defining φ=(1+2Υπ)1/2ϕ~\varphi=\left({1+\frac{2\Upsilon}{\pi}}\right)^{1/2}\widetilde{\phi} turns the kinetic term to canonical form. Using β2=2π1+2Υπ\beta^{2}=\frac{2\pi}{1+\frac{2\Upsilon}{\pi}}, we can write the action as

L=12(dφ)24Υcos(2βφ)\displaystyle L=\frac{1}{2}(\mathrm{d}\varphi)^{2}-4{\Upsilon}\cos(2\beta\varphi) (213)

Since

Δ[cos(2βφ)]=21+2Υπ\displaystyle\Delta\Big{[}\cos(2\beta\varphi)\Big{]}=\frac{2}{1+\frac{2\Upsilon}{\pi}} (214)

the charged bion (doubly charged vertex operator), which was exactly marginal at leading order in semi-classical expansion where Υ=0\Upsilon=0, is driven towards irrelevance at second order in semi-classics where Υ<0\Upsilon<0. As a result, the theory at θ=π\theta=\pi is gapless.

4.8 Why is this working? (Improving Polyakov’s optimism with RG)

A source of quantitative error in the formalism is following. The instanton, as we know, has a size moduli, ρ\rho. In the classical theory, which is scale invariant, to determine the single instanton contribution, we end up with the integral

Z1V2dρρ3e4πg2(ρ)V2dρρ3(Λρ)N=V2ΛN𝑑ρρN3\displaystyle Z_{1}\sim V_{2}\int\frac{d\rho}{\rho^{3}}\mathrm{e}^{-\frac{4\pi}{g^{2}(\rho)}}\sim V_{2}\int\frac{d\rho}{\rho^{3}}\;(\Lambda\rho)^{N}=V_{2}\Lambda^{N}\int d\rho\rho^{N-3} (215)

which is IR divergent for all NN. Furthermore, one should not use the one-loop result for the strong scale for ρ>ρcΛ1\rho>\rho_{\rm c}\sim\Lambda^{-1} since g2(ρ)g^{2}(\rho) becomes large, but at least, at large-NN limit, the one-loop result for β\beta function becomes exact.

If we go ahead and do a summation over all fractional vortex instanton effects, and construct a grand canonical ensemble of topological defect configurations, we learn that the system induce a finite correlation length ξ=m1Λ1\xi=m^{-1}\sim\Lambda^{-1}. This immediately tells us that one should not extrapolate (215) naively. Instead, we interpret the Debye length as a cut-off, replacing

𝑑ρρN3𝑑ρρN3emρ.\displaystyle\int d\rho\rho^{N-3}\mapsto\int d\rho\rho^{N-3}\mathrm{e}^{-m\rho}. (216)

This removes the IR divergent bad domain ρ>ξ\rho>\xi from integration, see Fig.6. Since ρcξ\rho_{c}\sim\xi, it is impossible to do qualitative error in this way, and the infrared theory for N1\mathbb{CP}^{N-1} which captures low lying modes is described by the mass deformed SU(N)1SU(N)_{1} WZW (197), where mass deformation is governed by mμeSI/N=Λm\sim\mu\mathrm{e}^{-S_{I}/N}=\Lambda.

The logic of the previous paragraph follows Polyakov, Polyakov:1987ez , Section 6.1. However, in the important case of N=2N=2, the rationale of Polyakov breaks down around θπ\theta\sim\pi where correlation length diverges as ξ(θ)=Λ11|cos(θ/2)|2/3\xi(\theta)=\Lambda^{-1}\frac{1}{|\cos(\theta/2)|^{2/3}}.

Should we not trust this result as the bad domain ρ[Λ1,)\rho\in[\Lambda^{-1},\infty) resurfaces in the integration (216)? Is this prediction unreliable? Our claim is otherwise. We should trust it as much as we trust the θ=0\theta=0 results. We will prove this by improving Polyakov’s logic with renormalization group.

Refer to caption
Figure 6: This diagram depicts a source of quantitative error. We use one-loop result for strong scale, which should be reliable for l~ρcΛ1\tilde{l}\lesssim\rho_{c}\sim\Lambda^{-1} and unreliable for l~ρc\tilde{l}\gtrsim\rho_{c}, where it leads to an IR divergence. At the same time, the proliferation of fractional vortex instantons with fugacity eSI/N\mathrm{e}^{-S_{I}/N} generates a finite correlation length ξΛ1\xi\sim\Lambda^{-1}. Because of the Debye screening, this cuts-off the interaction between fractional vortex instantons at distances larger than ξρc\xi\sim\rho_{c}, essentially removing the bad zone from integration. Depending on the numerical values of ξρc\xi\sim\rho_{c}, there may be a band which induce quantitative error. We then use RG arguments to find relevant operators at θ=0\theta=0. Since RG is independent of θ\theta, we can extrapolate our effective theory to arbitrary θ\theta. In 1\mathbb{CP}^{1}, this gives us a gapless theory at θ=π\theta=\pi. Despite the fact that “naive” error band becomes infinite, this result is reliable because the determiner of error is θ=0\theta=0. Therefore, both gap and gaplessness are robust qualitative predictions of the formalism.

The θ=0\theta=0 and θ0\theta\neq 0 are not independent theories, we are just adding a topological theta angle to the theory. If we did everything right at θ=0\theta=0 and constructed the correct long distance theory based on renormalization group of topological defects, why should it fail by turning on a topological angle at θ=π\theta=\pi?

Indeed, at θ=0\theta=0, we showed that the vertex operators eilβφ\mathrm{e}^{\mathrm{i}l\beta\varphi} have dimensions

ΔlΔ[eilβφ]=l2β24π=12(1+2Υπ)l2,\displaystyle\Delta_{l}\equiv\Delta[\mathrm{e}^{\mathrm{i}l\beta\varphi}]=\frac{l^{2}\beta^{2}}{4\pi}=\frac{1}{2(1+\frac{2\Upsilon}{\pi})}l^{2}, (217)

Therefore, unit charge vertex operator has dimension 12\sim{\textstyle{\frac{1}{2}}} and is highly relevant, l3l\geq 3 are irrelevant, and l=2l=2 vertex operator is marginally irrelevant. This construction is certainly valid at θ=0\theta=0 where our EFT is self-consistent. The scaling dimensions are independent of θ\theta angle. If we construct effective theory correctly at θ=0\theta=0, turning on θ\theta will not change that fact, and despite the fact that ξ(θ)ρc\xi(\theta)\gg\rho_{c}, we must obtain the correct results at arbitrary θ\theta.

The theory does not become gapless at θ=π\theta=\pi because eiβφ\mathrm{e}^{\mathrm{i}\beta\varphi} becomes irrelevant, to the contrary, its is extremely relevant. The theory becomes gapless because of the destructive topological interference between V1=eSI2eiβφ+iθ2{V}_{1}=\mathrm{e}^{-\frac{S_{I}}{2}}\mathrm{e}^{\mathrm{i}\beta\varphi+\mathrm{i}\frac{\theta}{2}} and V¯2=eSI2eiβφiθ2\overline{V}_{2}=\mathrm{e}^{-\frac{S_{I}}{2}}\mathrm{e}^{\mathrm{i}\beta\varphi-\mathrm{i}\frac{\theta}{2}}, and the fact that the l=2l=2 vertex operator is marginally irrelevant.

For general N1,N3\mathbb{CP}^{N-1},N\geq 3 theories, we are even safer in our construction. There are only three types of relevant operators in the renormalization group of vertex opetators, (see (193) for a full list)

dim(Va)=(11N)\displaystyle{\rm dim}({V}_{a})=\Big{(}1-\frac{1}{N}\Big{)} (218)
dim(VaVb)=2(12N),dim(VaV¯b)\displaystyle{\rm dim}({V}_{a}{V}_{b})=2\Big{(}1-\frac{2}{N}\Big{)},\qquad{\rm dim}({V}_{a}\overline{V}_{b}) =2ab,\displaystyle=2\qquad a\neq b, (219)

and all the rest is irrelevant. Since dim2{\rm dim}\sim 2 for [VaV¯b][{V}_{a}\overline{V}_{b}] and [VaVb][{V}_{a}{V}_{b}] operators, they do not alter the conclusions of dim1{\rm dim}\sim 1 Va{V}_{a} operators, we can ignore their effects accepting quantitative error. Since we are after the more modest goal of determining the existence of gap, we hope this is acceptable. Therefore, our demonstration of mass gap for N3N\geq 3 theories on 2{\mathbb{R}}^{2} is on a robust footing.

Mass deformed WZW interpretation: Of course, the above statements translate to non-abelian bosonization language nicely (197). The massive UU field in the mass deformed SU(N)1SU(N)_{1} WZW describe the massive adjoint z¯a(x)zb(x)\overline{z}_{a}(x)z^{b}(x) field and singlet (slightly heavier) z¯a(x)eixyaza(y)\overline{z}_{a}(x)\mathrm{e}^{\mathrm{i}\int_{x}^{y}a}z^{a}(y) field in the spectrum, (N21)1(N^{2}-1)\oplus 1. In the large-NN limit, trU\text{tr}U mass deformation has dimension Δ1\Delta\sim 1. Double trace operators |trU|2,(trU)2|\text{tr}U|^{2},(\text{tr}U)^{2} have Δ2\Delta\sim 2. These describe the low lying spectrum of the N1,N3\mathbb{CP}^{N-1},N\geq 3 model, an adjoint and a singlet whose masses are at the strong scale of the theory.

For the N=2N=2 case, we need to pay more attention. Let us write

Uabei4πNϕ~gab\displaystyle U_{ab}\sim\mathrm{e}^{\mathrm{i}\sqrt{\frac{4\pi}{N}}\widetilde{\phi}}g_{ab} (220)

in (197). In this case, at θ=0\theta=0, we have a massive triplet and slightly heavier singlet. In fact, as θ\theta goes from 0 to π\pi, the gap between lowest lying triplets and singlet widens, and it becomes fully justified to integrate out the heavy ϕ~\widetilde{\phi} mode. In this case, in the IR, Δ[trg]=12\Delta[\text{tr}g]={\textstyle{\frac{1}{2}}}, but this operator disappears from deformed WZW action due to topological interference induced by θ\theta term. A possible mass term may be generated by the double trace operator, |trg|2|\text{tr}g|^{2}. But this operator has Δ[|trg|2]>2\Delta[|\text{tr}g|^{2}]>2 (214) and is marginally irrelevant. Therefore, at θ=π\theta=\pi, we obtain a gapless triplet and a massive singlet. (Same conclusion for N=2N=2 theory are also reached by Affleck in Ref.Affleck:1988wz by different means.)

5 What are we summing over in semiclassics in dYM theory on 3×S1\mathbb{R}^{3}\times S^{1}?

Deformed Yang-Mills theory is center-symmetry preserving double-trace deformation of the Yang-Mills action on small 3×S1\mathbb{R}^{3}\times S^{1} Unsal:2008ch ; Shifman:2008ja . In the sense of gauge invariant order parameters, it is continuously connected to pure YM on 4{\mathbb{R}}^{4}. The deformation allows a calculable (semi-classical) regime in which one can study non-perturbative properties such as confinement, θ\theta angle dependence, mass gap generation. See Unsal:2007jx ; Unsal:2007vu ; Unsal:2008ch ; Poppitz:2011wy ; Poppitz:2012sw ; Poppitz:2012nz ; Argyres:2012vv ; Argyres:2012ka ; Anber:2011gn ; Anber:2015wha ; Misumi:2014raa ; Cherman:2014ofa ; Poppitz:2008hr ; Aitken:2017ayq ; Thomas:2011ee ; Zhitnitsky:2013hs ; Anber:2017rch ; Itou:2019gtg ; Anber:2013sga ; Kitano:2017jng ; Kan:2019rsz .181818Modeling of Yang-Mills vacuum outside its region of validity of semi-classics is described in Liu:2015jsa ; Liu:2015ufa ; Larsen:2018crg . The construction in this paper provides some retrospective rationale for this study. In particular, the coupling of YM to N{\mathbb{Z}}_{N} TQFT proves that fractional topological charge W=1/NW=1/N configurations is there even at strong coupling of pure YM theory. There is by now compelling evidence that deformed YM, QCD(F) with ΩF\Omega_{F} twist, , QCD(adj) and a number of other theories on small 3×SL1\mathbb{R}^{3}\times S^{1}_{L} regime are continuously connected to the corresponding theories on 4\mathbb{R}^{4}. Recently, Bonati:2018rfg ; Bonati:2019kmf showed by numerical lattice simulations that topological susceptibility at small 3×SL1{\mathbb{R}}^{3}\times S^{1}_{L} dYM theory is identical to the one of pure YM theory on large T44T^{4}\sim{\mathbb{R}}^{4}. It is important to recall that in these theories, the SL1S^{1}_{L} circle never has a thermal interpretation in the standard sense Unsal:2007jx .

It should be noted that there also had been important body of work on T3×T^{3}\times{\mathbb{R}} by using ’t Hooft twisted boundary conditions, pioneered by Gonzalez-Arroyo, Garcia-Perez and van Baal GarciaPerez:1993jw ; GonzalezArroyo:1995zy ; GonzalezArroyo:1995ex ; GonzalezArroyo:1996jp ; Zhitnitsky:1991eg ; GarciaPerez:1992fj ; Bruckmann:2003ag ; GarciaPerez:1999bc ; GarciaPerez:2007ne ; vanBaal:2000zc ; Perez:2010jx ; GarciaPerez:1993ab ; Lebedev:1988wd ; GarciaPerez:1999hs . (See also Witten:1982df ; Witten:2000nv ; Cohen:1983fd for other uses of twisted boundary conditions.) These works, in the small T3T^{3} regime, aimed to construct a semi-classical description of the vacuum and confinement mechanism based on ’t Hooft’s fractional topological charge W=1/NW=1/N and action S=SI/NS=S_{I}/N configurations. As we emphasized multiple times, ’t Hooft’s original solutions on T4T^{4} are constant (space-time independent) and to a certain degree, (apart from its high value in showing that action S=SI/NS=S_{I}/N configuration do exist), are fairly uninteresting. The biggest stumbling block in this program had been the lack of exact analytic non-trivial (spacetime dependent) solution with S=SI/NS=S_{I}/N. However, these authors were able to demonstrate the existence of non-trivial BPS S=SI/NS=S_{I}/N (time dependent) configurations by using lattice techniques, leaving no doubt for their existence, though these works are not sufficiently appreciated. Some semi-classical dynamics due to fractional instantons is understood. (See GonzalezArroyo:1995zy and also vanBaal:2000zc for a review.) It should be noted that these works are prior to the explicit analytical solutions of W=1W=1 non-trivial holonomy calorons and their fractionization to W=1/NW=1/N constituents. And possible connections between fractional instantons and monopole instantons are partly discussed in GarciaPerez:1999hs ; GarciaPerez:1999bc , but it is not concluded that monopole-instantons live in PSU(N)PSU(N) bundle, a missing link between the two that we filled up in the present work.

Our analysis of Yang-Mills theory on M4M_{4} can be viewed in some sense putting the analysis of gauge theories on T3×T^{3}\times{\mathbb{R}} and 3×S1{\mathbb{R}}^{3}\times S^{1} in complete agreement. In particular, we are able to do so by proving that the monopole-instanton are actually exact solutions in the PSU(N)PSU(N) bundle. This fulfills a (numerically substantiated) key assumption in the analysis of Gonzalez-Arroyo et. al. Then, we show that by coupling a TQFT to gauge theory, that the configurations with action 1/N1/N are also present in the strong coupling domain.

Returning back to 3×S1{\mathbb{R}}^{3}\times S^{1} studies, despite the fact that the long distance theory of the theories on 3×S1{\mathbb{R}}^{3}\times S^{1} reduce to 3d theory, the microscopic theory is 4d. The deformed YM and QCD theories on 3×SL1{\mathbb{R}}^{3}\times S^{1}_{L} are closer in all respect to their decompactifcation limit on 4{\mathbb{R}}^{4} than the theories on 3{\mathbb{R}}^{3} limit.191919In fact, even at small-LL, where LΛO(N0)L\Lambda\sim O(N^{0}), taking NN\rightarrow\infty, these theories are equivalent to the theories on 4{\mathbb{R}}^{4}. In certain sense, effective circle size is Leff=NLL_{\rm eff}=NL and at N=N=\infty, for the neutral sector observables (this is the sector singlet under zero-form part of center symmetry), it is fair to interpret this as if there is no compactification. This is called large-NN volume independence. For example, these theories have theta angle parameters, multi-branch structure, mass gap determined by 4d strong scale, global ABJ anomalies. Very often, they also have the same global and mixed anomalies as the 4d theories Tanizaki:2017qhf ; Cherman:2017dwt ; Sulejmanpasic:2020zfs .

Below, we would like to determine in detail the configurations that contribute to partition function of deformed Yang-Mills in the semi-classical domain in the same spirit as our simple quantum mechanical TNT_{N} example. But before that, we would like to provide a quick review of the deformed theory and set the notation.

Quick overview of deformed Yang-Mills: Remembrance of things past

  • In pure Yang-Mills theory, center symmetry is broken at small circle size (high-temperature) on 3×S1{\mathbb{R}}^{3}\times S^{1} Gross:1980br . Center-symmetry can be stabilized at small-LL by the addition of double-trace operator Δ=k=1N2|trU3k|2\Delta{\cal L}=\sum_{k=1}^{\lfloor{N\over 2}\rfloor}|\text{tr}U_{3}^{k}|^{2} where U3U_{3} denotes Polyakov loop along compact x3x3+Lx_{3}\sim x_{3}+L circle. 202020 We refer to non-compact directions as x1,x2,x4x_{1},x_{2},x_{4}. This convention is more useful when we describe the same set-up with the insertion of ’t Hooft flux. Despite the fact that this term looks non-local, it can arise from local action, e.g. Yang-Mills + (massless or massive) adjoint fermions at small S1S^{1} with periodic boundary conditions induce the center stabilization dynamically. See Unsal:2007jx ; Unsal:2008ch ; Shifman:2008ja for details.

  • The minimum of the holonomy potential is given by eiϕ=(1,ω,ω2,,ωN1)\mathrm{e}^{\mathrm{i}\bm{\phi}_{\star}}=(1,\omega,\omega^{2},\ldots,\omega^{N-1}) up to Weyl permutations, where (eiϕ)i=(U3)ii(\mathrm{e}^{\mathrm{i}\bm{\phi}_{\star}})_{i}=(U_{3})_{ii}. This forces the dynamics to abelianize at small-LL, and at long distances, the theory is described by maximal abelian sub-group SU(N)U(1)N1SU(N)\to U(1)^{N-1}, just in terms of photons. Off-diagonal gluons gets masses 2π/NL\geq{2\pi/NL}.

  • There are multiple useful basis to describe the IR physics. The diagonal components of gauge fields and compact scalar can be written as 𝒂=i=1N1ai𝜶i\bm{a}=\sum_{i=1}^{N-1}a_{i}\bm{\alpha}_{i} where aia_{i} are canonically normalized gauge fields and ϕ=i=1N1ϕi𝜶i=(ϕ1,ϕ2ϕ1,,ϕN1)\bm{\phi}=\sum_{i=1}^{N-1}\phi_{i}\bm{\alpha}_{i}=(\phi_{1},\phi_{2}-\phi_{1},\ldots,-\phi_{N-1}). The periodicity of ϕ\bm{\phi} is governed by the root lattice, ϕϕ+2π𝜶i,𝜶iΓr\bm{\phi}\sim\bm{\phi}+2\pi\bm{\alpha}_{i},\bm{\alpha}_{i}\in\Gamma_{r},

    ϕN12πΓr\displaystyle\bm{\phi}\in\frac{{\mathbb{R}}^{N-1}}{2\pi\Gamma_{r}} (221)

    hence, ϕi\phi_{i} are periodic variable with period 2π2\pi. ϕ\bm{\phi}_{\star} is the center of Weyl chamber where dynamics completely abelianize.

  • We can perform abelian duality Polyakov:1987ez and express the free 3d action as

    Seff\displaystyle S_{\mathrm{eff}} =(12g2L|dϕ|2+g28π2L|d𝝈θ2πdϕ|2)\displaystyle=\int\left({1\over 2g^{2}L}\Bigl{|}\mathrm{d}\bm{\phi}\Bigr{|}^{2}+{g^{2}\over 8\pi^{2}L}\Bigl{|}\mathrm{d}\bm{\sigma}-{\theta\over 2\pi}\mathrm{d}\bm{\phi}\Bigr{|}^{2}\right) (222)

    In most theories, we can also forget about ϕ\bm{\phi} as it is gapped by holonomy potential, and can be dropped in long distance effective field theory. The dual photon field 𝝈\bm{\sigma} has periodicity determined by weight lattice, 𝝈𝝈+2π𝝁i,𝝁iΓw\bm{\sigma}\sim\bm{\sigma}+2\pi\bm{\mu}_{i},\;\bm{\mu}_{i}\in\Gamma_{w}.

    𝝈N12πΓw\displaystyle\bm{\sigma}\in\frac{{\mathbb{R}}^{N-1}}{2\pi\Gamma_{w}} (223)

    A useful basis for our purpose is 𝝈=i=1N1σi𝝁i\bm{\sigma}=\sum_{i=1}^{N-1}\sigma_{i}\bm{\mu}_{i}, hence σiσi+2π\sigma_{i}\sim\sigma_{i}+2\pi, and σi\sigma_{i} are 2π2\pi-periodic scalars.

  • There are NN types of the fundamental monopole instantons, and the monopole operators are given by

    a(x)=eSI/Nei𝜶a𝝈(x)eiθ/N(a=1,,N).{\cal M}_{a}(x)=\mathrm{e}^{-S_{I}/N}\mathrm{e}^{\mathrm{i}\bm{\alpha}_{a}\cdot\bm{\sigma}(x)}\mathrm{e}^{\mathrm{i}\theta/N}\quad(a=1,\ldots,N). (224)

    N1N-1 of these monopoles are the regular ones associated with SU(N)U(1)N1SU(N)\to U(1)^{N-1} adjoint Higgsing. The monopole associated with the affine root 𝜶N=a=1N1𝜶a\bm{\alpha}_{N}=-\sum_{a=1}^{N-1}\bm{\alpha}_{a} Lee:1997vp ; Lee:1998bb ; Kraan:1998kp ; Kraan:1998pm ; Kraan:1998sn has the same action with the rest of the simple roots due to compactness of the adjoint scalar or equivalently, due to the fact that Polyakov loop acts as a group valued field at a center-symmetric background. See also Collie:2009iz .

  • The sum over all monopole instantons generates an effective potential. The grand canonical ensemble of the monopole gives the partition function:

    Z=𝒟𝝈exp[(g28π2L|d𝝈|22KeSI/Na=1Ncos(𝜶a𝝈+θN))]\displaystyle Z=\int{\mathcal{D}}\bm{\sigma}\;\exp\Big{[}-\Big{(}\int{g^{2}\over 8\pi^{2}L}|\mathrm{d}\bm{\sigma}|^{2}-2K\mathrm{e}^{-S_{I}/N}\sum_{a=1}^{N}\cos\left(\bm{\alpha}_{a}\cdot\bm{\sigma}+{\theta\over N}\right)\Big{)}\Big{]} (225)

    which leads to a non-perturbative mass gap, finite string tension with N-ality kN/2k\lesssim N/2, NN-branched vacua, and CP breaking at θ=π\theta=\pi.

  • The effective field theory description in in terms monopole-instanton and bion local fields is based on the parametric scale separation between the dual photon and WW-boson, and is valid provided (ΛLN)1(\Lambda LN)\lesssim 1. However, EFTs can be used to determine some observables, but not all observables. For example, despite the fact that EFT (225) can produce mass gap, string tension with NN-ality kN/2k\lesssim N/2 Unsal:2007jx ; Unsal:2008ch ; Polyakov:1975rs , exact value of chiral condensate in 𝒩=1{\cal N}=1 SYM Davies:2000nw , it cannot produce string tensions with NN-ality kN/2k\gtrsim N/2 correctly. Reproducing those require the inclusion of WW-bosons, which are present in microscopic theory, but not in EFT. We will dwell onto these subtleties a bit more precisely in the conclusion section.

5.1 Mini-space formalism: Global constraints in monopole-instanton sums

A number of issues come to mind with this construction and has been raised over the years.

Configurations contributing to partition function: One is the appearance of θ/N\theta/N in the monopole operator (224) and effective action (225) that arise from a monopole-instanton with topological charge Q=1NQ=\frac{1}{N}. Yet, if we consider the SU(N)SU(N) theory on orientable 4-manifolds, it is well-known that the topological charge is quantized in integer units WW\in{\mathbb{Z}}. Therefore, the configurations that contribute to the partition function must have a theta dependence of the form eiWθ\mathrm{e}^{iW\theta}. How is this compatible with the appearance of θ/N\theta/N in the monopole operators and effective action?

If we compactify the theory on small 3×SL1{\mathbb{R}}^{3}\times S^{1}_{L} down to quantum mechanics, T2×Sβ1×SL1T^{2}\times S^{1}_{\beta}\times S^{1}_{L}, then, in the path integral representation of the partition function Z(β)=trexp[βHT2×SL1]Z(\beta)=\text{tr}\;\exp[{-\beta H_{T^{2}\times S^{1}_{L}}}], we are supposed to sum over fields that obey periodic boundary conditions. Yet, the presence of a monopole implies that the magnetic flux piercing T2T^{2} 𝚽mag(τ)=T2𝑩\bm{\Phi}_{\rm mag}(\tau)=\int_{T^{2}}\bm{B} changes by

𝚽mag(β)𝚽mag(0)=(T2𝑩)|τ=0τ=β=2πg𝜶a,a=1,,N.\displaystyle\bm{\Phi}_{\rm mag}(\beta)-\bm{\Phi}_{\rm mag}(0)=\left(\int_{T^{2}}{\bm{B}}\right)\Big{|}_{\tau=0}^{\tau=\beta}=\frac{2\pi}{g}\bm{\alpha}_{a},\qquad a=1,\ldots,N. (226)

Such configurations are present in the theory, but just like our quantum mechanical example, they should not contribute to partition function, which is a sum over periodic paths. But of course, despite the fact that these contribution are absent in the sum, most of the non-perturbative low energy phenomena are sourced by them, again just like our QM example.

The image of the issue raised on T2×Sβ1×SL1T^{2}\times S^{1}_{\beta}\times S^{1}_{L} for the infinite volume theory can be simply described as follows. The magnetic flux at infinity on 3×S1\mathbb{R}^{3}\times S^{1} is a choice of boundary condition. If we choose the flux at infinity to be zero, the net flux from all sources must vanish by Gauss’s law, the only configuration that can contribute to the partition function must satisfy magnetic neutrality at infinity S2𝑩total=0\int_{S^{2}_{\infty}}\bm{B}_{\rm total}=0.

Below, we show that the master partition function (225) is aware of these issues and cleverly takes these constraints into account, as mentioned in passing in Tanizaki:2019rbk . We prove in detail that magnetic charge neutrality comes naturally from (225). Furthermore, as a bonus, once magnetic charge neutrality emerges, the topological charge of corresponding configurations become integer quantized. (Opposite is not true, configurations with integer winding number need not be magnetically neutral.) We also note that the points about global neutrality are also emphasized in Diakonov:2007nv ; Diakonov:2010qg , though we disagree with the naming of monopole-instanton as dyon, since it is not an eigenstate of electric charge operator.

Minispace formalism: The path integration over the fields 𝒟𝝈eS\int{\mathcal{D}}\bm{\sigma}\;\mathrm{e}^{-S} has a zero mode part, the integration over the space of constant (space-time) independent 𝝈\bm{\sigma}. In this subspace, the measure reduce to an ordinary integral over the fundamental cell of 𝝈{\bm{\sigma}} field given in (223),

σN12πΓw[d𝝈](i=1N102π𝑑σi)\displaystyle\int_{\sigma\in\frac{{\mathbb{R}}^{N-1}}{2\pi\Gamma_{w}}}[d{\bm{\sigma}}]\equiv\Big{(}\prod_{i=1}^{N-1}\int_{0}^{2\pi}d\sigma_{i}\Big{)} (227)

The action over the constant modes reduces to s=2V3KeSI/Na=1Ncos(𝜶a𝝈+θN)s=-2V_{3}K\mathrm{e}^{-S_{I}/N}\sum_{a=1}^{N}\cos\left(\bm{\alpha}_{a}\cdot\bm{\sigma}+{\theta\over N}\right). So, the zero mode integration becomes (set ξ=V3KeSI/N\xi=V_{3}K\mathrm{e}^{-S_{I}/N})

z(θ)\displaystyle z(\theta) =cell[d𝝈]es\displaystyle=\int_{\rm cell}[d{\bm{\sigma}}]\;\mathrm{e}^{-s} (228)
=(i=1N102π𝑑σi)a=1Neξei(𝜶a𝝈+θN)eξei(𝜶a𝝈+θN)\displaystyle=\Big{(}\prod_{i=1}^{N-1}\int_{0}^{2\pi}d\sigma_{i}\Big{)}\prod_{a=1}^{N}\mathrm{e}^{\xi\mathrm{e}^{\mathrm{i}\left(\bm{\alpha}_{a}\cdot\bm{\sigma}+{\theta\over N}\right)}}\mathrm{e}^{\xi\mathrm{e}^{-\mathrm{i}\left(\bm{\alpha}_{a}\cdot\bm{\sigma}+{\theta\over N}\right)}} (229)

We call s(𝝈)s(\bm{\sigma}) mini-space action and z(θ)z(\theta) mini-space partition function. Using the good basis 𝝈=i=1N1σi𝝁i\bm{\sigma}=\sum_{i=1}^{N-1}\sigma_{i}{\bm{\mu}_{i}}, we have 𝜶i𝝈=σi,n=1,,N1\bm{\alpha}_{i}\cdot\bm{\sigma}=\sigma_{i},\;n=1,\ldots,N-1 and 𝜶N𝝈=(σ1++σN1)\bm{\alpha}_{N}\cdot\bm{\sigma}=-(\sigma_{1}+\ldots+\sigma_{N-1}), which allows us to easily perform the integration.

z(θ)=a=1N(na=0n¯a=0)\displaystyle z(\theta)=\prod_{a=1}^{N}\left(\sum_{n_{a}=0}^{\infty}\sum_{\overline{n}_{a}=0}^{\infty}\right) δn1n¯1,nNn¯NδnN1n¯N1,nNn¯Nn1!n¯1!nN!n¯N!\displaystyle\frac{\delta_{n_{1}-\overline{n}_{1},n_{N}-\overline{n}_{N}}\ldots\delta_{n_{N-1}-\overline{n}_{N-1},n_{N}-\overline{n}_{N}}}{n_{1}!\overline{n}_{1}!\ldots n_{N}!\overline{n}_{N}!} (230)
ξn1++nN+n¯1++n¯NeiθN(n1++nN(n¯1++n¯N))\displaystyle\xi^{n_{1}+\ldots+n_{N}+\overline{n}_{1}+\ldots+\overline{n}_{N}}\mathrm{e}^{i\frac{\theta}{N}(n_{1}+\ldots+n_{N}-(\overline{n}_{1}+\ldots+\overline{n}_{N}))} (231)

where N1N-1 constraints comes from the integration over the N1N-1 independent σi\sigma_{i} field. The solution of the constraints is

n1n¯1=n2n¯2==nNn¯N=W\displaystyle n_{1}-\overline{n}_{1}=n_{2}-\overline{n}_{2}=\ldots=n_{N}-\overline{n}_{N}=W (232)

where WW\in{\mathbb{Z}}. This enforces magnetic neutrality, i.e, magnetic flux at infinity is zero. This constraint tells us that nan¯an_{a}-\overline{n}_{a} is independent of aa. Clearly, if nan¯an_{a}-\overline{n}_{a} is zero, both magnetic neutrality and topological neutrality are guaranteed. But what if nan¯a=W,an_{a}-\overline{n}_{a}=W,\;\forall a, an aa-independent excess of monopoles over anti-monopoles? In that case, magnetic neutrality is guaranteed thanks to the fact that a=1N𝜶a=0\sum_{a=1}^{N}\bm{\alpha}_{a}=0. Furthermore, when this condition is satisfied, the corresponding configuration has an integer topological charge given by WW\in{\mathbb{Z}}! Therefore, the mini-space partition function can be written as

z(θ)=W(n¯1=0n¯N=0)1(n¯1+W)!n¯1!(n¯N+W)!n¯N!ξ2n¯1++2n¯N+NWeiWθ\displaystyle z(\theta)=\sum_{W\in\mathbb{Z}}\left(\sum_{\overline{n}_{1}=0}^{\infty}\ldots\sum_{\overline{n}_{N}=0}^{\infty}\right)\frac{1}{(\overline{n}_{1}+W)!\overline{n}_{1}!\ldots(\overline{n}_{N}+W)!\overline{n}_{N}!}\xi^{2\overline{n}_{1}+\ldots+2\overline{n}_{N}+NW}\mathrm{e}^{iW\theta} (233)

This elegant formula tells us many interesting things:

  • The configurations that contribute to the partition function have fractional actions and integer topological charges:

    S\displaystyle S =SIN(2n¯1++2n¯N)+SI|W|SI(2N|k|+|W|),k,W\displaystyle=\frac{S_{I}}{N}(2\overline{n}_{1}+\ldots+2\overline{n}_{N})+S_{I}|W|\in S_{I}\left(\frac{2}{N}|k|+|W|\right),\qquad k,W\in{\mathbb{Z}} (234)
    Q\displaystyle Q =W\displaystyle=W\in{\mathbb{Z}} (235)

    just like our simple quantum mechanical TNT_{N} model.

  • Despite the fact that the monopole instantons have fractional topological charge, only their magnetically neutral combinations contribute to the sum. Magnetic neutrality enforces the quantization of topological charge as well. This satisfies the boundary condition at infinity on S2S^{2}_{\infty}.

  • This construction also holds for any sufficiently large M3M_{3} times small SL1S^{1}_{L}. We can think 3{\mathbb{R}}^{3} as the infinite volume limit of M3M_{3}.

We can decompose mini-space partition function into its Fourier modes z(θ)=WzWeiWθz(\theta)=\sum_{W\in{\mathbb{Z}}}z_{W}\mathrm{e}^{\mathrm{i}W\theta}, and comparing with (233); we learn that

zW\displaystyle z_{W} =(n¯1=0n¯N=0)1(n¯1+W)!n¯1!(nN+W)!nN!ξ2n¯1++2n¯N+NW\displaystyle=\left(\sum_{\overline{n}_{1}=0}^{\infty}\ldots\sum_{\overline{n}_{N}=0}^{\infty}\right)\frac{1}{(\overline{n}_{1}+W)!\overline{n}_{1}!\ldots(n_{N}+W)!n_{N}!}\xi^{2\overline{n}_{1}+\ldots+2\overline{n}_{N}+NW} (236)
=[IW(2ξ)]N\displaystyle=[I_{W}(2\xi)]^{N} (237)

where IW(2ξ)I_{W}(2\xi) is modified Bessel function. zWz_{W} carries the information of all configurations (within the dilute monopole gas approximation) contributing to the topological sector with charge WW\in{\mathbb{Z}}.

Tunneling events that contribute to the partition function are the transitions between pure gauge configurations which differ in the winding number WW only, just like regular instantons on 4{\mathbb{R}}^{4}. Net magnetic flux is zero at infinity, but this does not mean that the corresponding configurations are boring. Instead, each contributing configuration has sub-structure, nan¯a=W,an_{a}-\overline{n}_{a}=W,\;\forall a monopole-instantons.

5.2 Vacuum energy density and multi-branch structure

A final remark is on the NN-branched vacuum structure of deformed Yang-Mills theory. Monopole induced potential has NN extrema within the fundamental domain of 𝝈\bm{\sigma} field given in (223). These are:

𝝈k=2πNk𝝆2πNki=1N1𝝁i,\bm{\sigma}_{k}={2\pi\over N}k\bm{\rho}\equiv{2\pi\over N}k\sum_{i=1}^{N-1}\bm{\mu}_{i}, (238)

with k=0,1,,N1k=0,1,\ldots,N-1 and 𝝆\bm{\rho} is the Weyl vector. Approximately N2\frac{N}{2} of these branches are meta-stable for a given value of θ\theta Aitken:2018mbb . As θ\theta is varied, the set of meta-stable vacua changes. From this, we can deduce the vacuum energy densities as

k=2NKeSI/Ncos(θ+2πkN).\mathcal{E}_{k}=-2NK\mathrm{e}^{-S_{I}/N}\cos\left({\theta+2\pi k\over N}\right). (239)

As a result, we can write the partition function for the theory defined on a 33-manifold as

Z(θ)=k=0N1e2NV3KeSI/Ncos(θ+2πkN)k=0N1e2Nξcos(θ+2πkN)Z(\theta)=\sum_{k=0}^{N-1}\mathrm{e}^{2NV_{3}K\mathrm{e}^{-S_{I}/N}\cos\left({\theta+2\pi k\over N}\right)}\equiv\sum_{k=0}^{N-1}\mathrm{e}^{2N\xi\cos\left({\theta+2\pi k\over N}\right)} (240)

which can be converted into a fractional instanton sum:

Z(θ)\displaystyle Z(\theta) =NWn=0n¯=01n!1n¯!(NξeiθN)n(NξeiθN)n¯δnn¯WN,0\displaystyle=N\sum_{W\in{\mathbb{Z}}}\sum_{n=0}^{\infty}\sum_{\overline{n}=0}^{\infty}\frac{1}{n!}\frac{1}{\overline{n}!}\left(N\xi\mathrm{e}^{\mathrm{i}\frac{\theta}{N}}\right)^{n}\left(N\xi\mathrm{e}^{-\mathrm{i}\frac{\theta}{N}}\right)^{\overline{n}}\delta_{n-\overline{n}-WN,0} (241)
=NWn¯=01(n+WN)!1n¯!(Nξ)2n¯+WNeiWθ\displaystyle=N\sum_{W\in{\mathbb{Z}}}\sum_{\overline{n}=0}^{\infty}\frac{1}{(n+WN)!}\frac{1}{\overline{n}!}(N\xi)^{2\overline{n}+WN}\mathrm{e}^{\mathrm{i}W\theta} (242)
=NWIWN(2ξN)eiWθ\displaystyle=N\sum_{W\in{\mathbb{Z}}}I_{WN}(2\xi N)\mathrm{e}^{\mathrm{i}W\theta} (243)

In (243), quantization of topological charge emerges as a result of the constraint, δnn¯WN,0\delta_{n-\overline{n}-WN,0}. In this formula, there is no information about magnetic charges of the individual monopole instantons. That information is washed away by the choice of the vacuum branch in (240). At the kthk^{\rm th} branch, all monopole operators with non-zero adjoint charge (also called GNO charge) but zero ’t Hooft charge acquire the same vev, ei𝜶a𝝈=ei2πkN\langle\mathrm{e}^{\mathrm{i}\bm{\alpha}_{a}\cdot\bm{\sigma}}\rangle=\mathrm{e}^{\mathrm{i}2\pi k\over N} Aitken:2018mbb and the information about individual magnetic charges 𝜶a\bm{\alpha}_{a} of NN types of monopoles is lost. In this sense, (233) has more information than (243). It knows that nan¯a=Wan_{a}-\overline{n}_{a}=W\;\;\forall a. This naturally implies a=1N(nan¯a)nn¯=WN\sum_{a=1}^{N}(n_{a}-\overline{n}_{a})\equiv n-\overline{n}=WN.

6 Coupling Yang-Mills theory to N{\mathbb{Z}}_{N} TQFT

The Lagrangian of SU(N)SU(N) Yang-Mills theory is

YM=12gYM2tr[FF]+iθYM8π2tr[FF]{\cal L}_{\mathrm{YM}}={1\over 2g^{2}_{\mathrm{YM}}}\int\text{tr}[F\wedge\star F]+{\mathrm{i}\,\theta_{\mathrm{YM}}\over 8\pi^{2}}\int\text{tr}[F\wedge F] (244)

where the second term is properly quantized topological term:

Q=18π2tr[FF]\displaystyle Q=\frac{1}{8\pi^{2}}\int\text{tr}[F\wedge F]\in{\mathbb{Z}} (245)

SU(N)SU(N) YM theory possess a N[1]{\mathbb{Z}}_{N}^{[1]} electric one-form symmetry, which acts on Wilson line operators.

To probe the topological configurations in YM theory, we follow the same method as in our TNT_{N} quantum mechanics example. We turn on a background gauge field for the N[1]{\mathbb{Z}}_{N}^{[1]} 1-form symmetry Kapustin:2014gua ; Gaiotto:2014kfa , introducing pair of U(1)U(1) 2-form and 1-form gauge fields (B(2),B(1))(B^{(2)},B^{(1)}) satisfying

NB(2)=dB(1),NB(2)=dB(1)=2π\displaystyle NB^{(2)}=\mathrm{d}B^{(1)},\qquad N\int B^{(2)}=\int\mathrm{d}B^{(1)}=2\pi{\mathbb{Z}} (246)

The action corresponding to the N{\mathbb{Z}}_{N} topological gauge theory is given by

Ztop,p\displaystyle Z_{{\rm top},p} =𝒟B(2)𝒟B(1)𝒟C(2)eiC(2)(NB(2)dB(1))+ipN4πB(2)B(2)\displaystyle=\int{\mathcal{D}}B^{(2)}{\mathcal{D}}B^{(1)}{\mathcal{D}}C^{(2)}\;\mathrm{e}^{\mathrm{i}\int C^{(2)}\wedge(NB^{(2)}-\mathrm{d}B^{(1)})+\mathrm{i}p{N\over 4\pi}\int B^{(2)}\wedge B^{(2)}} (247)

where C(2)C^{(2)} is Lagrange multiplier, and pp is the discrete theta angle.

To couple the SU(N)SU(N) YM theory to the background gauge field B(2)B^{(2)}, we promote the SU(N)SU(N) gauge field aa into a U(N)U(N) gauge field a~\tilde{a}. The U(N)U(N) gauge field is related to dynamical SU(N)SU(N) gauge field aa locally as

a~=a+1NB(1).\widetilde{a}=a+{1\over N}B^{(1)}. (248)

To gauge N[1]\mathbb{Z}_{N}^{[1]}, we introduce gauge invariance under the one-form gauge transformation,

B(2)B(2)+dΛ(1),B(1)B(1)+NΛ(1),B^{(2)}\mapsto B^{(2)}+\mathrm{d}\Lambda^{(1)},\qquad B^{(1)}\mapsto B^{(1)}+N\Lambda^{(1)}, (249)

and hence,

a~a~+Λ(1),F~F~+dΛ(1)\widetilde{a}\mapsto\widetilde{a}+\Lambda^{(1)},\qquad\widetilde{F}\mapsto\widetilde{F}+\mathrm{d}\Lambda^{(1)} (250)

In writing SU(N)SU(N) theory in a N{\mathbb{Z}}_{N} TQFT (equivalently ’t Hooft flux) background, we have to replace SU(N)SU(N) field strength with the gauge-invariant combination of the U(N)U(N) field strength F~=F(a~)\widetilde{F}=F(\widetilde{a}) and B(2)B^{(2)}:

F(a)F~B(2).F(a)\Rightarrow\widetilde{F}-B^{(2)}. (251)

The action of the SU(N)SU(N) theory in the B(2)B^{(2)} background can be written as:

S[B(2),B(1),a~]=12gYM2tr[(F~B(2))(F~B(2))]+iθYM8π2tr[(F~B(2))(F~B(2))]{S}[B^{(2)},B^{(1)},\widetilde{a}]={1\over 2g^{2}_{\mathrm{YM}}}\int\text{tr}[(\widetilde{F}-B^{(2)})\wedge\star(\widetilde{F}-B^{(2)})]+{\mathrm{i}\,\theta_{\mathrm{YM}}\over 8\pi^{2}}\int\text{tr}[(\widetilde{F}-B^{(2)})\wedge(\widetilde{F}-B^{(2)})] (252)

If we further wish to obtain the partition function of the (SU(N)/N)p(SU(N)/{\mathbb{Z}}_{N})_{p} gauge theory, then, we have to sum over all B(2)B^{(2)}, N{\mathbb{Z}}_{N} valued 2-form gauge fields. Namely,

Z(SU(N)/N)p=𝒟B(2)𝒟B(1)𝒟C(2)𝒟a~eiC(2)(NB(2)dB(1))+ipN4πB(2)B(2)eS[B(2),B(1),a~]\displaystyle Z_{(SU(N)/{\mathbb{Z}}_{N})_{p}}=\int{\mathcal{D}}B^{(2)}{\mathcal{D}}B^{(1)}{\mathcal{D}}C^{(2)}{\mathcal{D}}\widetilde{a}\;\mathrm{e}^{\mathrm{i}\int C^{(2)}\wedge(NB^{(2)}-\mathrm{d}B^{(1)})+\mathrm{i}p{N\over 4\pi}\int B^{(2)}\wedge B^{(2)}}\mathrm{e}^{-S[B^{(2)},B^{(1)},\widetilde{a}]} (253)

where

N8π2B(2)B(2)1N\displaystyle{N\over 8\pi^{2}}\int B^{(2)}\wedge B^{(2)}\in\frac{1}{N}\mathbb{Z} (254)

is the N{\mathbb{Z}}_{N} ’t Hooft flux. This partition function is the counter-part of our simple quantum mechanical example (45). For other background fields that one can turn on in Yang-Mills theory, see Wan:2019oyr .

In SU(N)SU(N) theory, topological charge is quantized in integer units (245). When we turn on B(2)B^{(2)} background, the topological term becomes

Q\displaystyle Q =18π2tr[(F~B(2))(F~B(2))]\displaystyle={1\over 8\pi^{2}}\int\text{tr}[(\widetilde{F}-B^{(2)})\wedge(\widetilde{F}-B^{(2)})] (255)
=18π2tr[(F~F~)N8π2B(2)B(2))]1N1N.\displaystyle=\underbrace{{1\over 8\pi^{2}}\int\text{tr}[(\widetilde{F}\wedge\widetilde{F})}_{\in{\mathbb{Z}}}-\underbrace{\frac{N}{8\pi^{2}}\int B^{(2)}\wedge B^{(2)})]}_{\in\frac{1}{N}{\mathbb{Z}}}\in\frac{1}{N}\mathbb{Z}. (256)

fractionally quantized, just like our simple quantum mechanics and N1\mathbb{CP}^{N-1} examples. In the second line, the first term is integer quantized because F~\widetilde{F} is in the U(N)U(N) bundle. The second term corresponds to N{\mathbb{Z}}_{N} ’t Hooft flux background.

Let us now determine the saddles in the theory with non-trivial B(2)B^{(2)} background, and their actions. The action functional for θ=0\theta=0 can be manipulated by using Bogomolny factorization:

S\displaystyle S =1gYM2tr[(F~B(2))(F~B(2))]\displaystyle={1\over g^{2}_{\mathrm{YM}}}\int\text{tr}[(\widetilde{F}-B^{(2)})\wedge\star(\widetilde{F}-B^{(2)})] (257)
=12gYM2tr[((F~B(2))±(F~B(2)))((F~B(2))±(F~B(2)))]\displaystyle={1\over 2g^{2}_{\mathrm{YM}}}\int\text{tr}\left[\left((\widetilde{F}-B^{(2)})\pm\star(\widetilde{F}-B^{(2)})\right)\wedge\star\left((\widetilde{F}-B^{(2)})\pm\star(\widetilde{F}-B^{(2)})\right)\right] (258)
8π2g218π2tr[(F~B(2))(F~B(2))]\displaystyle\mp\frac{8\pi^{2}}{g^{2}}\frac{1}{8\pi^{2}}\int\text{tr}[(\widetilde{F}-B^{(2)})\wedge(\widetilde{F}-B^{(2)})] (259)

The BPS bound is saturated by modified version of the instanton equation in the B(2)B^{(2)} background:

(F~B(2))=(F~B(2))\displaystyle(\widetilde{F}-B^{(2)})=\mp\star(\widetilde{F}-B^{(2)}) (260)

This equation is the counterpart of (35) in QM and (59) in N1\mathbb{CP}^{N-1} in appropriate N{\mathbb{Z}}_{N} backgrounds. Using (256), we see that the action is bounded from below by

S=8π2g218π2tr[(F~B(2))(F~B(2))]=SINS=\mp\frac{8\pi^{2}}{g^{2}}\frac{1}{8\pi^{2}}\int\text{tr}[(\widetilde{F}-B^{(2)})\wedge(\widetilde{F}-B^{(2)})]=\frac{S_{I}}{N} (261)

When we turn on a B(2)B^{(2)} flux, we are still in SU(N)SU(N) theory, just like in our QM example, when we consider the tr[𝖴eβH]\text{tr}[\mathsf{U}^{\ell}e^{-\beta H}], we are still in the TNT_{N} model. If we sum over all backgrounds, we move to (SU(N)/N)p(SU(N)/{\mathbb{Z}}_{N})_{p} gauge theory just like we moved to (TN/N)p(T_{N}/{\mathbb{Z}}_{N})_{p} model in QM.

Formally, in quantum mechanics, it takes the insertion of 𝖴\mathsf{U}^{\ell} into the state sum to see explicitly the existence of fractional topological charge saddles. (Intuitively, just looking to the Figures, this is of course obvious.) This enforces the following interpretation. We have to accept that in the SU(N)SU(N) theory, saddles with fractional charge 1N\frac{1}{N}{\mathbb{Z}} and action SIN=8π2g2N\frac{S_{I}}{N}=\frac{8\pi^{2}}{g^{2}N} exist, but they are configuration in the SU(N)/NSU(N)/{\mathbb{Z}}_{N} bundle. As such, all it takes is the insertion of a ’t Hooft flux to see these configurations in the PSU(N)PSU(N) bundle in the original SU(N)SU(N) theory. Then, we can build configurations in the SU(N)SU(N) theory which live in the SU(N)SU(N) bundle, which are non-BPS composites of objects in the PSU(N)PSU(N) bundle. Therefore, in exact analogy with our quantum mechanical example, there are topological configurations with fractional action, 2|n|NSI+|W|SI\frac{2|n|}{N}S_{I}+|W|S_{I} and integer topological charge. This strongly suggests that the non-perturbative expansion parameter in the theory is eSI/N=exp[8π2g2(μ)N]\mathrm{e}^{-S_{I}/N}=\exp[-\frac{8\pi^{2}}{g^{2}(\mu)N}], which is exponentially more important than BPST instanton Belavin:1975fg :

eSINe2SINe3SINeSIinstanton\displaystyle\mathrm{e}^{-\frac{S_{I}}{N}}\gg\mathrm{e}^{-2\frac{S_{I}}{N}}\gg\mathrm{e}^{-3\frac{S_{I}}{N}}\gg\ldots\gg\underbrace{\mathrm{e}^{-{S_{I}}}}_{\rm instanton} (262)

Remarkably, the picture that emerges in the large 4-manifold M4M_{4} is almost identical to our construction of dilute gas of monopole instantons on deformed YM and 𝒩=1{\cal N}=1 SYM theory on 3×S1{\mathbb{R}}^{3}\times S^{1}.

6.1 T3×SL1T^{3}\times S^{1}_{L} with ’t Hooft flux and monopole-instantons

We can use the idea of double-trace deformation both in SU(N)SU(N) and PSU(N)PSU(N) theory, because the double-trace operator can be viewed as a trace in adjoint representation which belongs to both groups. Therefore, regardless of the existence of center symmetry, in both case, holonomy potential has a minimum at eiϕ=(1,ω,ω2,,ωN1)\mathrm{e}^{\mathrm{i}\bm{\phi}_{\star}}=(1,\omega,\omega^{2},\ldots,\omega^{N-1}) at the center of Weyl chamber. Hence, in both case, dynamics abelianize to maximal abelian subgroup.

The magnetic monopoles that participate to non-perturbative dynamics have magnetic charges in root lattice, 𝜶Γr{\bm{\alpha}}\in\Gamma_{r}, both for SU(N)SU(N) and PSU(N)PSU(N), corresponding to monopole operators a(x)ei𝜶a𝝈(x){\cal M}_{a}(x)\sim\mathrm{e}^{\mathrm{i}\bm{\alpha}_{a}\cdot\bm{\sigma}(x)} (224) in the effective field theory. The distinction between the two is that PSU(N)PSU(N) admits test magnetic charges 𝝁Γw{\bm{\mu}}\in\Gamma_{w}, which are forbidden in SU(N)SU(N), corresponding to probe operators ei𝝁a𝝈(x)\mathrm{e}^{\mathrm{i}\bm{\mu}_{a}\cdot\bm{\sigma}(x)} where 𝝁a\bm{\mu}_{a} are fundamental weights. The magnetically charged operators which are non-trivial under Γw/Γr\Gamma_{w}/\Gamma_{r} are immediately in PSU(N)PSU(N) bundle, but how about dynamical monopoles 𝜶Γr{\bm{\alpha}}\in\Gamma_{r} which carry Q=1/NQ=1/N? What is the precise role they play?

Reminder about monopoles: Consider a monopole with charge 𝝁Γw\bm{\mu}\in\Gamma_{w}. Its magnetic field at asymptotic large distances is:

𝗕=𝒓^r2(𝝁𝑯)\displaystyle\mathsf{\bm{B}}=\frac{\bm{\widehat{r}}}{r^{2}}(\bm{\mu}\cdot\bm{H}) (263)

where 𝑯=(H1,,HN1)\bm{H}=(H^{1},\ldots,H^{N-1}) denote the Cartan generators of SU(N)SU(N) algebra. For 𝝁\bm{\mu} which is a weight of fundamental representation, (𝝁𝑯)=T2N(\bm{\mu}\cdot\bm{H})=\frac{T}{2N} up to permutations, where T=diag(1,,1,(N1))T={\rm diag}(1,\ldots,1,-(N-1)). If 𝝁\bm{\mu} is in NN-index representation or adjoint representation, (𝝁N𝑯)(𝜶𝑯)T2(\bm{\mu}_{N}\cdot\bm{H})\sim(\bm{\alpha}\cdot\bm{H})\sim\frac{T}{2}.

Now, consider a sphere S2S^{2} surrounding the monopole. The non-abelian gauge connection on the north and south hemisphere patches, corresponding to the above magnetic flux, is given by:

AϕN\displaystyle A^{N}_{\phi} =1rT2N(1cosθ)sinθ\displaystyle=\frac{1}{r}\frac{T}{2N}\frac{(1-\cos\theta)}{\sin\theta} (264)
AϕS\displaystyle A^{S}_{\phi} =1rT2N(1cosθ)sinθ\displaystyle=\frac{1}{r}\frac{T}{2N}\frac{(-1-\cos\theta)}{\sin\theta} (265)

The two gauge connections are same up to gauge transformation. On a small strip along the equator where the patches overlap, we have

AϕNAϕS=1rsinθϕT(ϕ)\displaystyle A^{N}_{\phi}-A^{S}_{\phi}=\frac{1}{r\sin\theta}\partial_{\phi}T(\phi) (266)

where

T(ϕ)=ϕNTandeiT(ϕ+2π)=eiT(ϕ)ei2πN\displaystyle T(\phi)=\frac{\phi}{N}T\qquad{\rm and}\;\;\mathrm{e}^{\mathrm{i}T(\phi+2\pi)}=\mathrm{e}^{\mathrm{i}T(\phi)}\mathrm{e}^{-\mathrm{i}\frac{2\pi}{N}} (268)

i.e. charge 𝝁\bm{\mu} configuration lives in PSU(n)PSU(n) bundle. Concerning charge 𝜶{\bm{\alpha}} monopole, which is dynamical in the theory, we have the transition matrix (eiT(ϕ))N(\mathrm{e}^{\mathrm{i}T(\phi)})^{N}, which is an element of PSU(N)PSU(N) bundle which can lift to SU(N)SU(N) bundle. But still, its topological charge is 1/N1/N. We discuss its importance below.

Refer to caption
Figure 7: Consider YM in the background of N2πB12(2)=12=1\frac{N}{2\pi}\int B^{(2)}_{12}=\ell_{12}=1 unit of ’t Hooft flux, Z12=tr[eβH12]Z_{\ell_{12}}=\text{tr}[\mathrm{e}^{-\beta H_{\ell_{12}}}]. The existence of 12=1\ell_{12}=1, in finite volume, leads to NN-perturbatively degenerate minima, just like TNT_{N} QM model. a) A non-trivial periodic (in Euclidean time) configuration with topological charge W=1W=-1 contributing to Z12Z_{\ell_{12}} is a collection of NN dynamical monopole instantons, each of which have zero ’t Hooft charge, but non-zero magnetic charge. b) A periodic configuration with topological charge W=0W=0 contributing to Z12Z_{\ell_{12}}. c) This configuration does not contribute to Z12Z_{\ell_{12}} as it is not periodic in Euclidean time, but it contributes to the sum over transition amplitudes, Z1234=tr[eβH12(𝖴c)34],34=1Z_{\ell_{12}\ell_{34}}=\text{tr}[\mathrm{e}^{-\beta H_{\ell_{12}}}(\mathsf{U}_{\mathrm{c}})^{\ell_{34}}],\;\ell_{34}=1. The aperiodicity in the euclidean time direction can also be undone, and replaced with an N2πB34(2)=34=1\frac{N}{2\pi}\int B^{(2)}_{34}=\ell_{34}=1 background.

Monopole-instantons in the ’t Hooft flux background: In this part, we would like to see the role of monopole-instantons in the non-trivial ’t Hooft flux background which is in the PSU(N)PSU(N) bundle, and determine an analytic solution to (260) when the theory is compactified on large T3T^{3} (serving as regularization of 3{\mathbb{R}}^{3}) times a small SL1S^{1}_{L}. First of all, following our construction in quantum mechanics and N1\mathbb{CP}^{N-1} model, we know that we can undo the background and substitute it with boundary conditions.

Then, we can check if our monopole-instantons are sensible solutions within PSU(N)PSU(N), fitting with the boundary conditions. We will come to the conclusion that our monopole-instantons with topological charge Qa=1/NQ_{a}=1/N, action Sa=SI/NS_{a}=S_{I}/N and magnetic charge 𝜶a{\bm{\alpha}}_{a} have an interpretation as tunneling events between configurations in the PSU(N)PSU(N) bundle in the semi-classical domain of the theory.

We can think turning on kk unit of ’t Hooft magnetic flux in 12-plane, (in 33-direction), in multiple ways. The simplest way is to start with twisted boundary conditions on T2×2T^{2}\times{\mathbb{R}}^{2}, and undo the twist in favor of background B12(2)B^{(2)}_{12} fields. Gray band in Fig. 7 is a depiction of such classical background. This flux background exists without any reference to magnetic monopoles. On each T2T^{2} slice, we have

N2πB12(2)=k12modN\displaystyle\frac{N}{2\pi}\int B^{(2)}_{12}=k\equiv\ell_{12}\;\;\;{\rm mod}\;\;N (269)

flux. A useful way to think about it is as follows. Such ’t Hooft flux lines can be viewed as the field line created between magnetic monopoles with charges 𝝁k{\bm{\mu}}_{k} and 𝝁k-{\bm{\mu}}_{k}, associated with magnetic NN-ality kk and k-k. (But recall that such charges are not present in the SU(N)SU(N) theory, neither as dynamical object nor as probes. Even if when we gauge the 1-form center symmetry, and move to say PSU(N)0PSU(N)_{0} theory which is locally equivalent to SU(N)SU(N) theory, such monopoles are not present dynamically. They are only allowed as probe charges.) These magnetic flux lines carry a magnetic flux through the 12-surface as well, given by

12𝑩=2πg𝝁k\displaystyle\int_{12}\bm{B}=\frac{2\pi}{g}\bm{\mu}_{k} (270)

So, we have to think that these flux lines are present and upper and lower plane in 7 are identified despite the fact that charges are not present in the theory.212121In certain sense, we can think of the ’t Hooft magnetic flux 12=k\ell_{12}=k sourced by 𝝁k{\bm{\mu}}_{k} and 𝝁k-{\bm{\mu}}_{k} pair similar to the mythical symbol ouroboros, a snake swallowing its own tail. We can forget about source and sink, and just take the flux lines into consideration. Another useful way to think about the 12=k\ell_{12}=k flux line is a non-dynamical center-vortex, see Greensite:2003bk about center-vortices. Indeed, a center-vortex passing through a fundamental Wilson loop in 12-plane contributes to W(C)=eiaW(C)=\mathrm{e}^{\mathrm{i}\oint a} as ei12𝑩=ei2πkN\mathrm{e}^{\mathrm{i}\int_{12}\bm{B}}=\mathrm{e}^{\mathrm{i}\frac{2\pi k}{N}}. All these perspectives are useful once we think about tunneling events in flux backgrounds.

Now, we can give an interpretation of the tunneling events corresponding to dynamical monopole-instantons with charge 𝜶aΓr{\bm{\alpha}}_{a}\in\Gamma_{r}, in the SU(N)SU(N) theory on T2×Sβ1×SL1T^{2}\times S^{1}_{\beta}\times S^{1}_{L} with 12=1\ell_{12}=1 unit of magnetic ’t Hooft flux background, see Fig. 7. First, note that the configurations in the 12=1\ell_{12}=1 can be attached different magnetic flux, corresponding to any representative of the fundamental (defining) representation:

𝝂1𝝁1,𝝂2𝝁1𝜶1,𝝂3𝝁1𝜶1𝜶2,𝝂N𝝁1a=1N1𝜶a,\displaystyle{\bm{\nu}}_{1}\equiv{\bm{\mu}}_{1},\qquad{\bm{\nu}}_{2}\equiv{\bm{\mu}}_{1}-{\bm{\alpha}}_{1},\qquad{\bm{\nu}}_{3}\equiv{\bm{\mu}}_{1}-{\bm{\alpha}}_{1}-{\bm{\alpha}}_{2},\qquad\ldots\qquad{\bm{\nu}}_{N}\equiv{\bm{\mu}}_{1}-\sum_{a=1}^{N-1}{\bm{\alpha}}_{a}, (271)

The magnetic flux passing through the T2T^{2} for these configurations are:

𝚽=T2𝑩=2πg𝝂a,a=1,,N.\displaystyle\bm{\Phi}=\int_{T^{2}}{\bm{B}}=\frac{2\pi}{g}{\bm{\nu}}_{a},\qquad a=1,\ldots,N. (272)

Assume that the flux is uniform through the surface of torus T2T^{2}.222222One does not need to make this assumption, but it makes the discussion a bit simpler, without sacrificing correctness. The ’t Hooft flux insertion cost an energy, which depends only on 12=k\ell_{12}=k. For example, for 12=1\ell_{12}=1, it does not matter what magnetic flux out of the list (271) is associated with it. Then for these NN-configurations, the energy of the corresponding states are exactly degenerate even at finite volume. If 𝑩a=1A2πg𝝂a{\bm{B}}_{a}=\frac{1}{A}\frac{2\pi}{g}{\bm{\nu}}_{a}, their energies are

Ea\displaystyle E_{a} =12T2𝑩a2=12A(2πg)2𝝂a2=12A(2πg)2(11N),a=1,,N.\displaystyle={\textstyle{\frac{1}{2}}}\int_{T^{2}}{\bm{B}_{a}}^{2}=\frac{1}{2A}\Big{(}\frac{2\pi}{g}\Big{)}^{2}{\bm{\nu}}_{a}^{2}=\frac{1}{2A}\Big{(}\frac{2\pi}{g}\Big{)}^{2}\Big{(}1-\frac{1}{N}\Big{)},\qquad a=1,\ldots,N. (273)

The tunneling between the flux configurations |𝝂a|{\bm{\nu}}_{a}\rangle and |𝝂a+1|{\bm{\nu}}_{a+1}\rangle changes magnetic flux by simple and affine rooot 𝜶a,a=1,,N{\bm{\alpha}}_{a},\;a=1,\ldots,N, but does not change ’t Hooft flux, which is fixed, 12=1\ell_{12}=1.

Δ\displaystyle\Delta T2𝑩=2πg(𝝂a𝝂a+1)=2πg𝜶a,a=1,,N.\displaystyle\int_{T^{2}}{\bm{B}}=\frac{2\pi}{g}\left({\bm{\nu}}_{a}-{\bm{\nu}}_{a+1}\right)=-\frac{2\pi}{g}{\bm{\alpha}}_{a},\qquad a=1,\ldots,N. (274)
Δ\displaystyle\Delta (N2πB12(2))=0\displaystyle\left(\frac{N}{2\pi}\int B^{(2)}_{12}\right)=0\;\; (275)

This change in the magnetic flux is associated with the magnetic charge of the dynamical monopole instantons present in the theory.

6.2 Two types of monopole events, Born-Oppenheimer and TNT_{N} QM

One needs to be careful concerning two distinct class of tunneling events in compactified set-up on T2×SL1×S1T^{2}\times S^{1}_{L}\times S^{1}. These are between

  • a)a)

    Monopole tunneling events between exactly degenerate minima as described above, and

  • b)b)

    Tunnelings that become degenerate only in Area(T2){\rm Area}(T^{2})\rightarrow\infty limit.

The discussion above implies the existence of is different from the discussion of monopole-instantons in the absence of ’t Hooft flux background, for example, in the Polyakov model, see page 226 of Ref.Banks:2008tpa (which is one of the rare textbooks which discusses Polyakov model and Hamiltonian interpretation of the tunneling.) A monopole-instanton in the case of Polyakov model always changes the energy of vacuum state at finite Area(T2){{\rm Area}({T^{2}})}. If 𝚽=T2𝑩=2πg𝜶ana\bm{\Phi}=\int_{T^{2}}{\bm{B}}=\frac{2\pi}{g}{\bm{\alpha}}_{a}n_{a} is magnetic flux (no summation over aa), then the change in energy at finite volume between the zero-magnetic flux state and 𝚽\bm{\Phi} flux state is:

ΔE=T212𝑩2=12(2πg)2na2Area(T2)>0\displaystyle\Delta E=\int_{T^{2}}{\textstyle{\frac{1}{2}}}{\bm{B}}^{2}={\textstyle{\frac{1}{2}}}\frac{\left(\frac{2\pi}{g}\right)^{2}n_{a}^{2}}{{\rm Area}({T^{2}})}>0 (276)

Therefore, at finite volume, these states are not degenerate. Only when the area tends to infinity, the cost of energy becomes vanishingly small.

limArea(T2)ΔE=0\displaystyle\lim_{{\rm Area}({T^{2}})\rightarrow\infty}\Delta E=0 (277)

and these states become degenerate with the zero magnetic flux state. These are the perturbative vacua, which only emerge in the Area(T2){\rm Area}({T^{2}})\rightarrow\infty limit. The tunneling events between them is the Hamiltonian interpretation of monopole instantons in Polyakov model.

In the presence of ’t Hooft flux background, say 12=1\ell_{12}=1, there is a genuine NN-fold vacuum degeneracy at finite Area(T2){{\rm Area}({T^{2}})} associated with monopoles in the affine root system. The classical degenerate vacua at finite volume are the states |𝝂a,a=1,,N|{\bm{\nu}}_{a}\rangle,a=1,\ldots,N. These are the counterpart of our perturbative |j,j=1,,N|j\rangle,j=1,\ldots,N in our quantum mechanical TNT_{N} model. Therefore, in some respect, there are two distinct class of tunneling events in the case of 12=1\ell_{12}=1 background. For example, starting with the state |𝝂1|{\bm{\nu}}_{1}\rangle, tunneling with 𝜶1-{\bm{\alpha}}_{1} and +𝜶1+{\bm{\alpha}}_{1} takes us to states |𝝂1𝜶1=|𝝂2|\bm{\nu}_{1}-\bm{\alpha}_{1}\rangle=|{\bm{\nu}}_{2}\rangle, and |𝝂1+𝜶1=|2𝝂1𝝂2|{\bm{\nu}}_{1}+\bm{\alpha}_{1}\rangle=|2{\bm{\nu}}_{1}-{\bm{\nu}}_{2}\rangle.

E|𝝂1𝜶1E|𝝂1=0\displaystyle E_{|{\bm{\nu}}_{1}-\bm{\alpha}_{1}\rangle}-E_{|{\bm{\nu}}_{1}\rangle}=0 (278)
E|𝝂1+𝜶1E|𝝂1=12(2πg)24Area(T2)>0Area(T2)0\displaystyle E_{|{\bm{\nu}}_{1}+\bm{\alpha}_{1}\rangle}-E_{|{\bm{\nu}}_{1}\rangle}={\textstyle{\frac{1}{2}}}\frac{\left(\frac{2\pi}{g}\right)^{2}4}{{\rm Area}({T^{2}})}>0\underbrace{\longrightarrow}_{{\rm Area}({T^{2}})\rightarrow\infty}0 (279)

where the first type is associated with the exact NN-fold perturbative degeneracy at finite volume and the latter is associated with the emergent degeneracy as in the Polyakov model.

Since all magnetic flux states except for |𝝂a,a=1,,N|{\bm{\nu}}_{a}\rangle,\;a=1,\ldots,N have high energy in the small T2××SL1T^{2}\times{\mathbb{R}}\times S^{1}_{L} limit, within Born-Oppenheimer approximation, the Yang-Mills theory with 12=1\ell_{12}=1 flux actually reduces to our simple TNT_{N} model. The role of the fractional instantons a{\cal I}_{a} in the QM system is played by monopole-instantons with charge 𝜶a-\bm{\alpha}_{a} in the Yang-Mills quantum mechanics. This is a concrete and nice correspondence with quantum mechanics and Yang-Mills theory. For example, it allows us to derive the vacuum energy densities for the NN-branched vacua (240) and fractional theta angle dependence by simple quantum mechanics. However, we are more interested in the large-T4T^{4} limit in general.

6.3 What does N{\mathbb{Z}}_{N} TQFT background has anything to do with monopoles?

Now, we can consider the partition function just in the background of ’t Hooft flux 12\ell_{12}.

Z12\displaystyle Z_{\ell_{12}} =tr[eβH12]=pbc𝒟aeS(a,B12(2))\displaystyle=\text{tr}[\mathrm{e}^{-\beta H_{\ell_{12}}}]=\int_{\rm pbc}{\mathcal{D}}a\;\mathrm{e}^{-S(a,B^{(2)}_{12})} (280)

This is the exact counter-part of the regular partition function in our TNT_{N} model with NN-degenerate minima. The existence of potential in the QM model is a substitute for the effect of 12{\ell_{12}} flux in Yang-Mills theory. To determine (280), we have to sum over all periodic configurations, in particular,

Φ(β)=Φ(0)\displaystyle\Phi(\beta)=\Phi(0) (281)

Therefore, exactly the same constraint that we obtained in the zero flux sector (232) is still operative, nan¯a=W,a=1,,Nn_{a}-\overline{n}_{a}=W,a=1,\ldots,N and independent of aa for dynamical monopoles of type 𝜶a\bm{\alpha}_{a}. The configurations contributing to the sum must satisfy magnetic neutrality, which automatically impose quantization of the topological charge WW\in{\mathbb{Z}}.

Exactly as in our QM example, where in order to see what contributes to the state sum, we need to look to a transition matrix element, between the states |𝝂a|{\bm{\nu}}_{a}\rangle and |𝝂a+1|{\bm{\nu}}_{a+1}\rangle. This is what is called an electric ’t Hooft flux insertion. Let us make this more explicit.

The N[1]{\mathbb{Z}}_{N}^{[1]} form center symmetry of the theory on 4{\mathbb{R}}^{4}, upon compactification on T4T2×SL1×Sβ1T^{4}\equiv T^{2}\times S^{1}_{L}\times S^{1}_{\beta}, becomes (N[0])4({\mathbb{Z}}_{N}^{[0]})^{4}. The states |𝝂a|{\bm{\nu}}_{a}\rangle described above are related to each other by a center-transformation, 𝖴c|𝝂a=|𝝂a+1\mathsf{U}_{\mathrm{c}}|{\bm{\nu}}_{a}\rangle=|{\bm{\nu}}_{a+1}\rangle. Insertion of this operator in partition function corresponds to 34=1\ell_{34}=1 units of electric ’t Hooft flux.

Z1234\displaystyle Z_{\ell_{12}\ell_{34}} =tr[eβH12(𝖴c)34]\displaystyle=\text{tr}[\mathrm{e}^{-\beta H_{\ell_{12}}}(\mathsf{U}_{\mathrm{c}})^{\ell_{34}}] (282)
=a=1N𝝂a|eβH12(𝖴c)34|𝝂a\displaystyle=\sum_{a=1}^{N}\;\;\langle\bm{\nu}_{a}|\mathrm{e}^{-\beta H_{\ell_{12}}}(\mathsf{U}_{\mathrm{c}})^{\ell_{34}}|{\bm{\nu}}_{a}\rangle (283)
=a=1N𝝂a|eβH12|𝝂a+34\displaystyle=\sum_{a=1}^{N}\;\;\langle\bm{\nu}_{a}|\mathrm{e}^{-\beta H_{\ell_{12}}}|{\bm{\nu}}_{a+\ell_{34}}\rangle (284)
=Φ(β)=Φ(0)+2πg𝜶a,a+34𝒟aeS(a,B12(2))\displaystyle=\int_{\rm\Phi(\beta)=\Phi(0)+\frac{2\pi}{g}\bm{\alpha}_{a,a+\ell_{34}}}{\mathcal{D}}a\;\mathrm{e}^{-S(a,B^{(2)}_{12})} (285)
=pbc𝒟aeS(a,B12(2),B34(2))\displaystyle=\int_{\rm pbc}{\mathcal{D}}a\;\mathrm{e}^{-S(a,B^{(2)}_{12},B^{(2)}_{34})} (286)

In the fourth step, we have the boundary conditions corresponding to a transition amplitude:

Φ(β)=Φ(0)+2πg𝜶a,a+34Φ(0)+2πg(𝝂a𝝂a+34)\displaystyle\Phi(\beta)=\Phi(0)+\frac{2\pi}{g}\bm{\alpha}_{a,a+\ell_{34}}\equiv\Phi(0)+\frac{2\pi}{g}(\bm{\nu}_{a}-\bm{\nu}_{a+\ell_{34}}) (287)

which is the change in the magnetic flux, corresponding to a dynamically allowed monopole event in the root lattice Γr\Gamma_{r}. This is the twisted boundary condition in terms of magnetic flux description. In the last line of (286), we undo the twisted boundary condition into a periodic boundary condition, just like in the simple QM example, and transmute the effect to the background B34(2)B^{(2)}_{34} flux. Note that we can also think twisted boundary condition in different ways. In particular, we could have written it more conventionally as U3(β=L4)=ei2πN34U3(0)U_{3}(\beta=L_{4})=\mathrm{e}^{\mathrm{i}\frac{2\pi}{N}{\ell_{34}}}U_{3}(0). This is equivalent to above. This is because the center transformation acts on the center-symmetric holonomy field eiϕ=(1,ω,ω2,,ωN1)\mathrm{e}^{\mathrm{i}\bm{\phi}_{\star}}=(1,\omega,\omega^{2},\ldots,\omega^{N-1}) as eiϕei2πNeiϕ\mathrm{e}^{\mathrm{i}\bm{\phi}_{\star}}\rightarrow\mathrm{e}^{\mathrm{i}\frac{2\pi}{N}}\mathrm{e}^{\mathrm{i}\bm{\phi}_{\star}}. The action is invariant under this transformation if we act on the fields by appropriate cyclic permutation, e.g, it takes σkσk1\sigma_{k}\rightarrow\sigma_{k-1} Cherman:2016jtu . This cyclically permutes background magnetic flux, and corresponds to the twisted boundary condition on fourth line. 232323The gauge invariant definition of the U(1)NU(1)^{N} photons are Fμν,k=1Np=0N1ei2πkpNtr(U3pFμν)\displaystyle F_{\mu\nu,k}=\frac{1}{N}\sum_{p=0}^{N-1}\;\mathrm{e}^{-\mathrm{i}\frac{2\pi kp}{N}}\;\text{tr}(U_{3}^{p}F_{\mu\nu}) (288) where FμνF_{\mu\nu} inside trace is the non-abelian gauge field strength. Crucially, counter-part of the adjoint Higgs field is the Polyakov loop. Under a zero-form center transformation, Polyakov loop transforms as U3pei2πpNU3pU_{3}^{p}\rightarrow\mathrm{e}^{\mathrm{i}\frac{2\pi p}{N}}U_{3}^{p}. Hence, the photon in the 3d theory, transforms under the center transformation as Fμν,kFμν,k1F_{\mu\nu,k}\rightarrow F_{\mu\nu,k-1}. The same is also true for the dual photon. Clearly, the monopole operator ei𝜶k𝝈ei𝜶k1𝝈\mathrm{e}^{\mathrm{i}\bm{\alpha}_{k}\cdot\bm{\sigma}}\rightarrow\mathrm{e}^{\mathrm{i}\bm{\alpha}_{k-1}\cdot\bm{\sigma}} and the flux states are cyclically shifted, |𝝂k|𝝂k1|\bm{\nu}_{k}\rangle\rightarrow|\bm{\nu}_{k-1}\rangle. In other words, the zero-form part of center-symmetry acts like N{\mathbb{Z}}_{N} discrete translation symmetry in the TNT_{N} model.

The oddity of this result: On 3×S1{\mathbb{R}}^{3}\times S^{1}, the action and topological charge of the monopole-instantons with magnetic charge Qm=2πg𝜶aQ_{m}=\frac{2\pi}{g}\bm{\alpha}_{a} is determined by center-symmetric gauge holonomy:

Diag(U3)=eiϕ=(1,ω,ω2,,ωN1)\displaystyle{\rm Diag}(U_{3})=\mathrm{e}^{\mathrm{i}\bm{\phi}_{\star}}=(1,\omega,\omega^{2},\ldots,\omega^{N-1}) (289)

This background is quantum mechanically stable in QCD(adj) and can be made stable in Yang-Mills by using the double trace deformation. Minimal action and topological charge in this background is given by

Sa=4πg2(𝜶a.ϕ)=8π2g2N,Q\displaystyle S_{a}=\frac{4\pi}{g^{2}}(\bm{\alpha}_{a}.\bm{\phi}_{\star})=\frac{8\pi^{2}}{g^{2}N},\qquad\qquad Q =12π(𝜶a.ϕ)=1N\displaystyle=\frac{1}{2\pi}(\bm{\alpha}_{a}.\bm{\phi}_{\star})=\frac{1}{N} (290)

On the other hand, it is also well-known that in the non-vanishing ’t Hooft flux background, or equivalently, by coupling YM to N{\mathbb{Z}}_{N} TQFT, minimal action and topological charge are also 1N\frac{1}{N} quantized.

S=8π2g218π2B(2)B(2)=8π2g2N,Q=N8π2B(2)B(2)=1234N=1N\displaystyle S=\frac{8\pi^{2}}{g^{2}}\frac{1}{8\pi^{2}}\int B^{(2)}\wedge B^{(2)}=\frac{8\pi^{2}}{g^{2}N},\qquad Q={N\over 8\pi^{2}}\int B^{(2)}\wedge B^{(2)}=\frac{\ell_{12}\ell_{34}}{N}=\frac{1}{N} (291)

Clearly, except for the results, these two constructions do not look anything alike. One is the action of a dynamical monopole instanton on the zero ’t Hooft flux background and the other seems to be a property of the classical B(2)B^{(2)} background associated with 1-form N[1]{\mathbb{Z}}_{N}^{[1]} center symmetry. In the latter, there seems to be naively no data of the monopole-instanton, and it is quite tempting that, despite the agreement of the final results, these two construction has nothing to do with one another.

But again, truth is subtler. Let us recap what we did in this section again because it is interesting. On T2×SL1×Sβ1T^{2}\times S^{1}_{L}\times S^{1}_{\beta}, assume first that center-symmetric background (289) is stabilized. Turning on 12=1\ell_{12}=1 on T2T^{2} part gives us NN- exactly degenerate minima at the classical level. There are NN-configurations which carry 1-unit of ’t Hooft flux, but they also carry an orientation, that we labeled by 𝝂a,a=1,2,\bm{\nu}_{a},a=1,2,\ldots in (271). 34=1\ell_{34}=1 background is equivalent to a twisted boundary condition U3(β=L4)=ei2πN34U3(0)U_{3}(\beta=L_{4})=\mathrm{e}^{\mathrm{i}\frac{2\pi}{N}{\ell_{34}}}U_{3}(0). But as we explained above, this amounts to the transition amplitude between 𝝂a\bm{\nu}_{a} and 𝝂a+1\bm{\nu}_{a+1}, meaning that the difference of the magnetic flux on the τ=β\tau=\beta and τ=0\tau=0 is just the charge Qm=ΔT2𝑩=2πg(𝝂a𝝂a+1)=2πg𝜶aQ_{m}=\Delta\int_{T^{2}}{\bm{B}}=\frac{2\pi}{g}\left({\bm{\nu}}_{a}-{\bm{\nu}}_{a+1}\right)=-\frac{2\pi}{g}{\bm{\alpha}}_{a}. This is the charge of our dynamical monopole-instanton event. 242424 If the minima for U3U_{3} is classically center-broken configurations U3=ei2πNj𝟙U_{3}=\mathrm{e}^{\mathrm{i}\frac{2\pi}{N}j}{\mathbbm{1}} given in (318), then, the twisted boundary conditions U3(β=L4)=ei2πN34U3(0)U_{3}(\beta=L_{4})=\mathrm{e}^{\mathrm{i}\frac{2\pi}{N}{\ell_{34}}}U_{3}(0) takes |j|j\rangle to |j+34|j+\ell_{34}\rangle. This gives a dynamics which is different from above. See Appendix A. It would be interesting to understand the connection between these two constructions more precisely.

Now, we realized our promise. To find the relevant saddles of SU(N)SU(N) gauge theories, we should first consider either SU(N)SU(N) with background gauge field (B(2),B(1))(B^{(2)},B^{(1)}) for the N[1]{\mathbb{Z}}_{N}^{[1]} 1-form symmetry or (SU(N)/N)p(SU(N)/{\mathbb{Z}}_{N})_{p} theory where N[1]{\mathbb{Z}}_{N}^{[1]} is gauged. After finding the configurations in the (SU(N)/N)p(SU(N)/{\mathbb{Z}}_{N})_{p}, we can patch them up to find the ones that can be lifted to SU(N)SU(N) theory. These are fractional action (e.g. 2SI/N2S_{I}/N) configurations that contribute to the partition function of SU(N)SU(N) theory. Indeed, the mini-space formalism that we derived in Sec.5.1 reveals this fact. The construction and effective field theory based on monopole-instantons and bions works in semi-classical weak coupling domain. The fact that non-perturbative effects are controlled by action SIN\frac{S_{I}}{N} and topological charge W=1/NW=1/N defects is true both in strong coupling domain and weak coupling domain.

We also note that N1\mathbb{CP}^{N-1} model with an ΩF\Omega_{F} and 12\ell_{12} flux background and Yang-Mills theory on the center-symmetric background with ’t Hooft fluxes 12,34\ell_{12},\ell_{34} maps to identical problems from the point of view of mixed anomalies Yamazaki:2017ulc ; Yamazaki:2017dra . For a general discussion of background field coupling to detect ’t Hooft anomalies of 4d Yang-Mills, and their connection to 2d N?1\mathbb{CP}^{N?1}, see also Wan:2018zql . It is not a coincidence that the combination of (290) and (291) in d=4d=4 is same as (105) in d=2d=2.

Monopole-instantons in B(2)B^{(2)} background: In order to determine the fractional instantons on arbitrary M4M_{4}, we are supposed to solve in 4d the BPS equation (260) in the B(2)B^{(2)} background. If one of the directions is compactified, in one-lower dimension, the background field decomposes as B4d(2)=B3d(2)+B3d(1)dx3L3B^{(2)}_{\rm 4d}=B^{(2)}_{\rm 3d}+B^{(1)}_{\rm 3d}\wedge\frac{\mathrm{d}x_{3}}{L_{3}} and the monopole instanton equations become

(F~B3d(2))=(daa3~B3d(1))\displaystyle(\widetilde{F}-B^{(2)}_{\rm 3d})=\mp\star(\widetilde{\mathrm{d}_{a}a_{3}}-B^{(1)}_{\rm 3d}) (292)

However, as we did in quantum mechanics (compare (35) with (36)) and PN1{\mathbb{C}P}^{N-1} model, we can undo the background and convert it to boundary conditions that the monopole instanton has to satisfy. This is nothing but the regular monopole instanton equations tHooft:1974kcl ; Polyakov:1974ek

F=daa3\displaystyle F=\mp\star{\mathrm{d}_{a}a_{3}} (293)

Our construction tells us that these well-known solutions fits with the ’t Hooft flux boundary conditions and are indeed non-trivial solutions in PSU(N)PSU(N) bundle.

Remarks: Although we phrased the fact that monopole instantons with topological charge Q=1/NQ=1/N and action 8π2g2N\frac{8\pi^{2}}{g^{2}N} fits perfectly with the SU(N)SU(N) with twisted boundary conditions or PSU(N)PSU(N) bundle, the exact solutions within PSU(N)PSU(N) bundle had historically been a difficult endeavor. In 1981, ’t Hooft showed that some constant abelian solutions that had topological charge 1/N1/N and were easy to find, but had total action which never descends below 8π2(N1)/(g2N)8\pi^{2}(N-1)/(g^{2}N). Then, he found some non-abelian constant solutions that has action 8π2/(g2N)8\pi^{2}/(g^{2}N), a desired property. Historically, however, it was not easy to determine the time dependent or space-time dependent solutions. Gonzalez-Arroyo et. al. found by simulations on latticized T3×T^{3}\times{\mathbb{R}} that time-dependent fractional instanton solutions with action 1/N{1}/{N} exist in the presence of ’t Hooft flux GarciaPerez:1989gt ; GarciaPerez:1992fj ; Montero:2000mv ; GonzalezArroyo:1997uj , see also Gonzalez-Arroyo:2019wpu for recent work. One of our aim in this work was to determine these solutions analytically, and we were able to interpret the well-known monopole-instanton solutions on 3×SL1{\mathbb{R}}^{3}\times S^{1}_{L} as non-trivial solutions in the flux background.

6.4 Fractionalization in 𝒩=1{\cal N}=1 SU(N)SU(N) theory on M4M_{4}

The evidence we gather suggest that the fundamentally important configurations in SU(N)SU(N) gauge theory are fractional topological charge and fractional action instantons. The fact that such configurations exist is a mathematical statement about classification of the classical configurations, determined by the properties by the SU(N)/NSU(N)/{\mathbb{Z}}_{N} bundle. However, one may ask, why such a statement, is not in apparent contradiction with the global chiral symmetry, ABJ anomaly or quantization of topological charge in SU(N)SU(N) theories on orientable 4-manifolds M4M_{4}?

Let us consider 𝒩=1{\cal N}=1 supersymmetric Yang-Mills theory with G=SU(N)G=SU(N) gauge group. This model has a 2N{\mathbb{Z}}_{2N} chiral symmetry which is broken dynamically to 2{\mathbb{Z}}_{2} by the formation of fermion-bilinear condensate.

k|trλλ|k=NΛ3ei2πk+θN,k=0,1,,N1\langle k|\text{tr}\lambda\lambda|k\rangle=N\Lambda^{3}\,\mathrm{e}^{\mathrm{i}\frac{2\pi k+\theta}{N}},\qquad k=0,1,\ldots,N-1 (294)

leading to NN isolated supersymmetric vacua Witten:1982df . However, we claim that the classical configurations in this theory should be determined by using SU(N)/NSU(N)/{\mathbb{Z}}_{N} bundle. But in a non-trivial flux background (254), the chiral symmetry is reduced to 2{\mathbb{Z}}_{2}, instead of 2N{\mathbb{Z}}_{2N}. Each of these configurations has two exact fermion zero modes. One may ask why the existence of objects with just 22 fermion zero modes is not in contradiction with the ABJ anomaly Bell:1969ts in the SU(N)SU(N) theory (with no background turned on). In particular, we expect

aeSINtrλλ\displaystyle{\cal I}_{a}\sim\mathrm{e}^{-\frac{S_{I}}{N}}\text{tr}\lambda\lambda (295)

to be present in the Euclidean vacuum, even when the description of the vacuum is non-semiclassical. Of course, the answer is tied with the fact that only integer topological charge configurations are in the SU(N)SU(N) bundle, hence, only such configurations are part of the SU(N)SU(N) theory. However, the key point is we can construct SU(N)SU(N) configurations out of PSU(N)PSU(N) bundle in different ways. For example,

IW=0\displaystyle I_{W=0} =a(xa)¯a(x¯a)\displaystyle={\cal I}_{a}(x_{a})\overline{\cal I}_{a}(\overline{x}_{a}) (296)
IW=1\displaystyle I_{W=1} =a=1Na(xa)eSI+iθa=1N(trλλ)(xa)\displaystyle=\prod_{a=1}^{N}{\cal I}_{a}(x_{a})\sim\mathrm{e}^{-S_{I}+i\theta}\prod_{a=1}^{N}(\text{tr}\lambda\lambda)(x_{a}) (297)

are both in the SU(N)SU(N) bundle, but not a(xa){\cal I}_{a}(x_{a}) itself. As long as one fractional instanton here (at this spacetime point) has a friend fractional anti-instanton on the dark side of the Moon, the combination lives happily in the SU(N)SU(N) bundle. The right hand side in (297) is the first configuration to reduce the chiral symmetry down to 2N{\mathbb{Z}}_{2N} in SU(N)SU(N) gauge theory. The proliferation of fractional instantons with the global constraint (232) is also capable of breaking the discrete chiral symmetry Davies:2000nw ; Unsal:2007jx at weak coupling.

To see the presence of the fractional events on 4{\mathbb{R}}^{4}, let us reverse engineer the fractional instanton sum. Consider a soft mass deformation, Δm=mg2trλλ+h.c.\Delta{\cal L}_{m}=\frac{m}{g^{2}}\text{tr}\lambda\lambda+{\rm h.c.} of 𝒩=1{\cal N}=1 SYM. In this case, the vacuum energy density is modified into k=mg2k|trλλ|k+c.c.{\cal E}_{k}=-\frac{m}{g^{2}}\langle k|\text{tr}\lambda\lambda|k\rangle+{\rm c.c.} at leading order in mm. The partition function on a four manifold can be written as:

Z(θ)=k=0N1e2mN2Λ3V4cosθ+2πkN\displaystyle Z(\theta)=\sum_{k=0}^{N-1}\mathrm{e}^{2mN^{2}\Lambda^{3}V_{4}\cos\frac{\theta+2\pi k}{N}} (298)

where Λ3=μ3eSI/N=μ3e8π2g2(μ)N\Lambda^{3}=\mu^{3}\mathrm{e}^{-S_{I}/N}=\mu^{3}\mathrm{e}^{-\frac{8\pi^{2}}{g^{2}(\mu)N}} is the strong scale and V4V_{4} is the volume of the 4-manifold that the theory is defined. Based on our quantum mechanical example, we can rewrite the partition function as

Z(θ)\displaystyle Z(\theta) =NWn=0n¯=01n!1n¯!(mN2μ3V4eSIN+iθN)n(mN2μ3V4eSINiθN)n¯δnn¯WN,0\displaystyle=N\sum_{W\in{\mathbb{Z}}}\sum_{n=0}^{\infty}\sum_{\overline{n}=0}^{\infty}\frac{1}{n!}\frac{1}{\overline{n}!}\left(mN^{2}\mu^{3}V_{4}\mathrm{e}^{-\frac{S_{I}}{N}+\mathrm{i}\frac{\theta}{N}}\right)^{n}\left(mN^{2}\mu^{3}V_{4}\mathrm{e}^{-\frac{S_{I}}{N}-\mathrm{i}\frac{\theta}{N}}\right)^{\overline{n}}\delta_{n-\overline{n}-WN,0} (299)
=NW[INW(2mN2μ3V4eSIN)]eiWθ(strongcoupling,notHooftflux)\displaystyle=N\sum_{W\in{\mathbb{Z}}}\left[I_{NW}(2mN^{2}\mu^{3}V_{4}\mathrm{e}^{-\frac{S_{I}}{N}})\right]\mathrm{e}^{\mathrm{i}W\theta}\qquad({\rm strong\;coupling,\;no\;}\;{\rm{}^{\prime}t\;Hooft}{\;\rm flux})\qquad (300)

A few remarks are in order:

  • Individual terms in the sum are sourced by the solution of self-duality equation (260) in the SU(N)/NSU(N)/{\mathbb{Z}}_{N} bundle. Minimal configurations have action SI/NS_{I}/N and topological charge 1N\frac{1}{N}, and they do not contribute to the partition function, but they contribute to physical observables.

  • The constraint δnn¯WN,0\delta_{n-\overline{n}-WN,0} guarantees that the sum is over integer topological charge configurations WW\in{\mathbb{Z}} which belong to SU(N)SU(N). This is ultimately the reason why the proliferation of fractional instantons with the global constraint (300) is in agreement with ABJ anomaly, and integer quantization of topological charge on 4-manifolds, and is also capable of breaking chiral symmetry dynamically.

  • The solutions in PSU(N)PSU(N) bundle has 4 bosonic zero modes, which may perhaps interpreted as the position moduli. V4V_{4} may be viewed as the volume of the bosonic moduli. μ\mu us Pauli-Villars renormalization scale. It appears with the combination μnbnf/2=μ3\mu^{n_{b}-n_{f}/2}=\mu^{3} where nb=4,nf=2n_{b}=4,n_{f}=2 are the numbers of bosonic and fermionic zero modes. The moduli space of an instanton in SU(N)SU(N) theory, which has NN constituents, can be parametrized in terms of NN 4-position, ai4,i=1,,Na_{i}\in{\mathbb{R}}^{4},\;i=1,\ldots,N.

7 Prospects and comments

Theory NN-fold classical degeneracy Partially twisted partition function Twisted partition function Minimal topological charge in TQFT background
TNT_{N} V(Nq)V(Nq) Z=tr[eβH]Z=\text{tr}[\mathrm{e}^{-\beta H}] Z=tr[eβH𝖴]Z_{\ell}=\text{tr}[\mathrm{e}^{-\beta H}\mathsf{U}^{\ell}] 12πA(1)=1N\frac{1}{2\pi}\int A^{(1)}=\frac{1}{N}
N1\mathbb{CP}^{N-1} ΩF\Omega_{F}-twist ZΩF=tr[eβHΩF]Z_{\Omega_{F}}=\text{tr}[\mathrm{e}^{-\beta H_{\Omega_{F}}}] ZΩF,=tr[eβHΩF𝖴]Z_{\Omega_{F},\ell}=\text{tr}[\mathrm{e}^{-\beta H_{\Omega_{F}}}\mathsf{U}^{\ell}] 12πB(2)=1N\frac{1}{2\pi}\int B^{(2)}=\frac{1}{N}
SU(N)SU(N) YM 12\ell_{12} flux Z12=tr[eβH12]Z_{\ell_{12}}=\text{tr}[\mathrm{e}^{-\beta H_{\ell_{12}}}] Z1234=tr[eβH12𝖴34]Z_{\ell_{12}\ell_{34}}=\text{tr}[\mathrm{e}^{-\beta H_{\ell_{12}}}\mathsf{U}^{\ell_{34}}] N8π2B(2)B(2)=1N\frac{N}{8\pi^{2}}\int B^{(2)}\wedge B^{(2)}=\frac{1}{N}

We developed three parallel constructions, in d=1,2,4d=1,2,4 QM and QFTs. The analogous quantities are listed above. The classical NN-fold degenerate minima in QM is a consequence of the potential V(Nq)V(Nq). In N1\mathbb{CP}^{N-1} it is induced by turning on ΩF\Omega_{F} background for a global symmetry, and in the YM theory, it is induced by turning on B(2)=12B^{(2)}=\ell_{12} background in the 12-plane. The simplest way to realize that these NN-fold classical degeneracy is not a fiction is to recall that in N1\mathbb{CP}^{N-1} and YM theory, the vacua at the quantum level is NN-branched. The classical (as well as all orders perturbative minima) that emerges in the second column is the precursor of that well-know fact. In Yang-Mills, in the 3-direction associated with circle SL1S^{1}_{L}, we may or may not have a center-symmetric holonomy field U3U_{3}. Our main application is in center-symmetric background, but the application in the Appendix A is in classically center-broken background. The NN-fold perturbative degeneracy is independent of that. The generalized partition function in third column is periodic in β\beta and it receives contribution only from WW\in{\mathbb{Z}} integer topological charge configurations. The important point is that these configurations can and do have fractional action, which starts with 2SIN\frac{2S_{I}}{N}, in the W=0W=0 sector. Observables are controlled by SIN\frac{S_{I}}{N}. To see these configurations more explicitly, we inspect the fully twisted partition functions, also involving a twist associated with Sβ1S^{1}_{\beta} circle (say, the ’t Hooft flux in 34-plane in gauge theory and 12-plane in N1\mathbb{CP}^{N-1}). We are free to put all twists as boundary conditions. Alternatively, we can use periodic boundary conditions everywhere, and turn on classical background fields. This is equivalent to coupling the QFT to N{\mathbb{Z}}_{N} TQFT, and reveals the true nature of non-perturbative configurations in the original theory without any backgrounds.

Pure Yang-Mills theory on 4{\mathbb{R}}^{4}: Coupling YM theory to a N{\mathbb{Z}}_{N} TQFT tells us that we need to use PSU(N)PSU(N) bundle to build up WW\in{\mathbb{Z}} configurations in the SU(N)SU(N), which generically possess fractional action. The construction instructs us that non-perturbative expansion parameter in pure SU(N)SU(N) Yang-Mills theory is

eSIN+iθN=e8π2g2(μ)N+iθN\displaystyle\mathrm{e}^{-\frac{S_{I}}{N}+\mathrm{i}\frac{\theta}{N}}=\mathrm{e}^{-\frac{8\pi^{2}}{g^{2}(\mu)N}+\mathrm{i}\frac{\theta}{N}} (301)

This is exactly like the monopole-instanton factor in deformed Yang-Mills theory on 3×S1{\mathbb{R}}^{3}\times S^{1} Unsal:2008ch where μ\mu is Pauli-Villars scale. This parameter is exponentially more important than the 4d instanton amplitude Belavin:1975fg .

QCD and general SU(N)SU(N) gauge theory, NP expansion parameter: In his original works, ’t Hooft argues that the twisted boundary conditions cannot be applied to QCD with fundamental quarks as there is no longer a center symmetry. Though the absence of center symmetry is a correct statement, his negative conclusion can can be avoided in two different ways.

  • If gcd(Nf,N)1{\rm gcd}(N_{f},N)\neq 1, we can impose ’t Hooft twisted boundary conditions by turning on first an SU(Nf)SU(N_{f}) background. This becomes most efficient for Nf=NN_{f}=N, where we can show that expansion parameter becomes exp[SI/N+iθ/N]\exp\left[-{S_{I}}/{N}+\mathrm{i}{\theta}/{N}\right].

  • If we turn on U(1)V/NU(1)_{V}/{\mathbb{Z}}_{N} background Roberge:1986mm (which is part of genuine vector-like symmetry of the theory), then, we can impose twisted boundary conditions even for Nf=1N_{f}=1. In fact, our construction in N1\mathbb{CP}^{N-1} is the dual of this idea, where we had U(1)U(1) gauge structure, and SU(N)/NSU(N)/{\mathbb{Z}}_{N} global symmetry, see Sec. 3.1. In 1-flavor QCD, we have SU(N)SU(N) gauge structure and U(1)/NU(1)/{\mathbb{Z}}_{N} global symmetry. In either case, we can impose the same twists. As a result, we were able to turn on ’t Hooft flux in a theory with quarks. Therefore, the expansion parameter is always exp[8π2/g2N]\exp[-{8\pi^{2}}/{g^{2}N}] regardless of matter content of SU(N)SU(N) gauge theory.

Renormalization group for topological defects: If exp[8π2/g2N]\exp[-{8\pi^{2}}/{g^{2}N}] is expansion parameter in all SU(N)SU(N) QCD-like theories, what distinguishes an asymptotically free IR-CFT from a confining theory which exhibit chiral symmetry breaking and confinement? We need to formulate a renormalization group, similar to Kosterlitz:1974sm ; fradkin_2013 , which tells us relevance vs. irrelevance of defect operators. Indeed, we used such a method at θ=π\theta=\pi of 1\mathbb{CP}^{1} model to exhibit the irrelevance of doubly-charged vertex operators and show conformality, and to prove the relevance of vertex operators at θ=0\theta=0, and mass gap.

Rethinking deformed Yang-Mills theory on 3×S1{\mathbb{R}}^{3}\times S^{1}: On 3×S1{\mathbb{R}}^{3}\times S^{1}, a monopole-instanton possesses four bosonic zero mode. This introduces the Pauli-Villars μ\mu dependence of the form μ4\mu^{4}. The fluctuation operator over the non-zero modes in the background of a monopole gives [Det(D2)]1=(μrmon)1/3[{\rm Det}(-D^{2})]^{-1}=(\mu r_{\rm mon})^{-1/3} where rmon=mW1=LN2πr_{\rm mon}=m_{W}^{-1}=\frac{LN}{2\pi} is the characteristic size of monopole-instantons, which is the inverse WW-boson mass, determined by the adjoint Higgsing. The combination of the zero and non-zero modes just gives the expected result, μ4(1/3)=μ11/3\mu^{4-(1/3)}=\mu^{11/3} which combines with the exponential to give the 1-loop renormalization group invariant scale,

μ11/3e8π2g2(μ)N+iθN=Λ11/3eiθN\displaystyle\mu^{11/3}\mathrm{e}^{-\frac{8\pi^{2}}{g^{2}(\mu)N}+\mathrm{i}\frac{\theta}{N}}=\Lambda^{11/3}\mathrm{e}^{\mathrm{i}\frac{\theta}{N}} (302)

The mass gap on semi-classical domain on 3×S1{\mathbb{R}}^{3}\times S^{1} is given by:

m2(θ)=Λ2(ΛLN)5/3[Maxkcos(θ+2πkN)],LNΛ1,semiclassicaldomain\displaystyle m^{2}(\theta)=\Lambda^{2}(\Lambda LN)^{5/3}\;\Big{[}{\rm Max}_{k}\cos\Big{(}{\theta+2\pi k\over N}\Big{)}\Big{]},\qquad LN\Lambda\ll 1,\;{\rm semiclassical\;domain} (303)

This result is based on a justified effective field theory based on proliferation of monopole-instantons. The merit of this formula is that it captures the θ\theta angle dependence and existence of mass gap correctly. As mentioned above, LNLN is the monopole-instanton size rmonr_{\rm mon}, and semi-classics is reliable provided rmonΛ1r_{\rm mon}\Lambda\lesssim 1.

One of the new insights of this work is to recognize that the monopole-instantons can also be interpreted as tunneling events in the PSU(N)PSU(N) bundle, in a fixed ’t Hooft flux sector. See Fig.(7). This may seem a trivial observation, but this interpretation is important, because ’t Hooft himself, in 1981, was only capable of finding constant solutions on T4T^{4} on PSU(N)PSU(N) bundle and these solutions required certain ratio of scales in T4T^{4} geometry to achieve BPS bound. (He says in his paper, “Considering the difficulty we had in finding them it looked worth-while to publish the result.” tHooft:1981nnx ). After this work, the only non-trivial solutions that has been obtained were via numerical lattice simulations. See Gonzalez-Arroyo:2019wpu for an up to date report.

The fact that monopole-instantons are in PSU(N)PSU(N) bundle makes one wonder if they may be weak coupling reincarnation of configurations with action SI/NS_{I}/N on arbitrarily large Tlarge44T^{4}_{\rm large}\sim{\mathbb{R}}^{4} down to Tlarge3×Ssmall13×Ssmall1T^{3}_{\rm large}\times S^{1}_{\rm small}\sim{\mathbb{R}}^{3}\times S^{1}_{\rm small}. It is tempting to speculate that on 4{\mathbb{R}}^{4}, the fractional instantons can produce a gap of the form m2(θ)=Λ11/6(ρ)5/6[Maxkf(θ+2πkN)]p+m^{2}(\theta)=\Lambda^{11/6}(\rho)^{5/6}\left[{\rm Max}_{k}f\left({\theta+2\pi k\over N}\right)\right]^{p}+\ldots where ρ\rho is a size moduli of instanton, and pp can be a real number (not necessarily one as in semi-classical domain) of order one. (Recall that in 1\mathbb{CP}^{1}, we showed that mass gap is of the form m(θ)=Λ|cos(θ/2)|2/3m(\theta)=\Lambda|\cos(\theta/2)|^{2/3}.) Very likely, similar to N1\mathbb{CP}^{N-1} where Debye length provides a self-consistent cut-off over the instanton size moduli (216), there may be a similar mechanism on 4{\mathbb{R}}^{4} where ρ\rho is cut-off at order Λ1\Lambda^{-1}.

k-strings and center-vortices: In pure Yang-Mills, there is only one fundamental string, and kk-string tensions are determined by NN-ality of representation of sources in the strong coupling domain. However, in Polyakov model on 3{\mathbb{R}}^{3} Polyakov:1987ez and Seiberg-Witten theory on 4{\mathbb{R}}^{4} Seiberg:1994rs , there are N1N-1 types of fundamental strings, with N1N-1 different string tensions, astray from the pure Yang-Mills theory (but an unavoidable property of these theories, see e.g. Douglas:1995nw ). In deformed YM, 𝒩=1{\cal N}=1 SYM or QCD(adj) on 3×S1{\mathbb{R}}^{3}\times S^{1}, there is again only one fundamental string tension, just like pure Yang-Mills, thanks to unbroken N{\mathbb{Z}}_{N} center symmetry and the absence of elementary adjoint Higgs field Poppitz:2017ivi .

Despite this remarkable fact, it is important to notice that the infrared physics of deformed Yang-Mills theory (as well as Polyakov model Polyakov:1975rs ) cannot be described by just writing an EFT based on proliferation of monopole-instantons, one should keep in mind the existence of WW-bosons as emphasized Greensite:2016pfc ; Greensite:2003bk . If one forgets WW-bosons, this would lead to non-zero tension for adjoint or k=Nk=N string, which is incorrect, as well as quantitatively incorrect string tensions for k>N/2k>N/2.

But fortunately, dynamical WW-bosons are in the full microscopic theory, and they can easily screen NN-ality zero sources. There is no doubt that:

  • i)

    In the weak coupling semiclassical regime on 3×S1{\mathbb{R}}^{3}\times S^{1} center-symmetric background, there exists an EFT description in terms monopole-instanton and bion local fields (ie, fields which create local excitations with the quantum numbers of the fractional-instanton along with their fermionic zero modes, and local bion fields). A number of low-energy observables, mass gap, topological susceptibilities, chiral condensates, and NN-ality kN/2k\lesssim N/2 (single winding) Wilson loop expectation values are accurately (and sometimes exactly) described by the local EFT based on these topological defects in the semi-classical domain. This is as explicit as it can ever be Aharony:1997bx ; Unsal:2007jx ; Unsal:2008ch ; Davies:2000nw , and there is no doubt of its validity either in supersymmetric or non-supersymmetric theories.

  • ii)

    However, to screen the adjoint probes, and to get the right string tensions for kN/2k\gtrsim N/2, one must keep WW-bosons in the description, (which we should not call EFT anymore), otherwise these observables will come out to be wrong. In other words, Wilson loops with kN/2k\gtrsim N/2 is not in the set of low-enegy observable that EFT can address correctly.

  • iii)

    We do not know a semi-classical effective field theory describing the role of center-vortices, even when a large subset of non-perturbative observables are completely describable in terms of semi-classics and EFT, as in 𝒩=1{\cal N}=1 SYM or QCD(adj) on 3×S1{\mathbb{R}}^{3}\times S^{1}, or Polyakov model on 3{\mathbb{R}}^{3}.

The EFT for deformed YM on small 3×S1{\mathbb{R}}^{3}\times S^{1} center-symmetric regime is based on the parametric separation of scales between the non-perturbative dual photon mass mγΛ(ΛLN)5/6m_{\gamma}\sim\Lambda(\Lambda LN)^{5/6} and the WW-boson mass, 2πLNΛ(ΛLN)1\frac{2\pi}{LN}\sim\Lambda(\Lambda LN)^{-1}, and EFT is valid provided

mγmW(ΛLN)11/61\displaystyle\frac{m_{\gamma}}{m_{W}}\sim(\Lambda LN)^{11/6}\lesssim 1 (304)

But as we emphasized above, the EFT can be used to determine some observables, but not all observables. For example, it is not capable of producing kN/2k\gtrsim N/2 string tensions or multi-winding loops for some kN/2k\lesssim N/2 probes. To produce those correctly, we must incorporate into EFT WW-bosons, the heavy degrees of freedom, which are not part of EFT. Once this is done, one cannot call the combined theory as EFT, better call it microscopic theory. But it has an EFT subsector.

Perhaps, a reasonable proposal is to aim to demonstrate that {\{monopole-instanton + bions + WW bosons}\} system is equivalent to center-vortices in weak coupling domain. In some ways, it seems like one needs to incorporate WW-bosons to the description and then, integrate it out. But then, very likely, there will not be a local description. In other words, we suspect that there may not be a local semi-classical EFT description for center-vortices at all.

But we still think that center vortices is a major part of the story, very likely not more or less important than fractional instantons, and the two are possibly intimately connected. There are some other interesting facts about the center-vortices that support this perspective. ’t Hooft magnetic flux background itself is actually a (non-dynamical) center vortex, and dynamical monopole-instanton tunneling events can be viewed as junction on them (See Fig.(7)) and do not change the charge of ’t Hooft background as it has zero ’t Hooft charge. On T4T^{4}, the intersection point of center-vortices acts as sourcing topological charge W=1/NW=1/N Engelhardt:2010ft , which naturally lives in PSU(N)PSU(N) bundle, and smoothed center-vortex configurations are related to instantons Trewartha:2015ida , which we proved to be composites of fractional instantons with W=1/NW=1/N even on large T4T^{4} strong coupling domain.

Finally, probably most importantly, when we couple SU(N)SU(N) Wilson lattice gauge theory Wilson:1974sk to N{\mathbb{Z}}_{N} TQFT, and gauge it, vortices become local minima of the action for any NN, obviating various restrictions concerning stability of center-vortex configurations Bachas:1982ep ; Greensite:2016pfc . However, one should be careful with the fact that there are multiple lattice PSU(N)0PSU(N)_{0} theories (with or without magnetic matter), and the one which is locally same as SU(N)SU(N) theory should not have magnetic matter charged under magnetic center NN-ality. We are currently investigating PSU(N)0PSU(N)_{0} lattice gauge theories (with or without magnetic matter) in order to reveal these connections precisely.

The importance of reparametrization of instanton moduli space: The instanton in SU(N)SU(N) gauge theory on 4{\mathbb{R}}^{4} has 4N4N bosonic zero modes and qq-instantons have 4Nq4Nq moduli parameters. The analogy with the analysis of Refs.Berg:1979uq ; Fateev:1979dc suggest that perhaps, we should parametrize the moduli space of instanton as

{ajm},m=1,q,j=1,,N\displaystyle\{a^{m}_{j}\},\qquad m=1,\ldots q,\;j=1,\ldots,N (305)

These are NqNq 44-positions (or quaternions) that we may think as fractional instanton positions.

An intriguing fact that we learned in the course of this work is that the monopole-instantons on 3×S1{\mathbb{R}}^{3}\times S^{1} are in PSU(N)PSU(N) bundle and come exactly as μ11/3exp[8π2g2N+iθN]\mu^{11/3}\exp[-\frac{8\pi^{2}}{g^{2}N}+\mathrm{i}\frac{\theta}{N}] as we already knew from Unsal:2008ch .

  • The non-perturbative expansion parameter in all center symmetric theories in weak coupling domain on 3×S1{\mathbb{R}}^{3}\times S^{1} is this factor.

  • The non-perturbative expansion parameter in the strongly coupled domain on 4{\mathbb{R}}^{4} for all theories that admit a coupling to N{\mathbb{Z}}_{N} TQFT is the same non-perturbative factor.

It is hard to believe that this is an accident.

The picture we obtain from 2{\mathbb{R}}^{2} and ×S1{\mathbb{R}}\times S^{1} suggests something deeper, compliant with the previous paragraph. The instanton solutions of Berg:1979uq ; Fateev:1979dc on 2{\mathbb{R}}^{2} do not physically break an instanton into pieces. (Action density and topological density are single lumps for one instanton). It is a reparametrization of the usual solution. The brilliant thing is that the fluctuation determinant in the background of parametrization (136) admits an interpretation as if we are summing over fractional vortex instantons, which carry fractional charge and fractional action, as it appears in (140). To see the fractional instantons explicitly, we need to do an extra work, for example, couple the theory to ΩF\Omega_{F} background (see Dunne:2016nmc for a review), or ’t Hooft flux background, or consider it in non-trivial holonomy background Lee:1997vp ; Lee:1998bb ; Kraan:1998kp ; Kraan:1998pm ; Kraan:1998sn , and then, dial the moduli parameters. For small moduli, one still does not see physical separation, and only for sufficiently large moduli parameters, one sees fractionalization explicitly (see Fig. 5). May be, BPST instanton is far more clever than us, and it hides its deepest secrets inside in its moduli parametrization.

QCD(F) on 4{\mathbb{R}}^{4} vs. 3×S1{\mathbb{R}}^{3}\times S^{1}: Think of QCD with fundamental quarks, on M4M_{4}. Turn on a background for the global symmetry U(1)V/NU(1)_{V}/{\mathbb{Z}}_{N}, upon which we can couple the system to N{\mathbb{Z}}_{N} TQFT. Our claim is that this system should admit topological excitations that are controlled by the action SI/NS_{I}/N, and these are relevant to real QCD on 4{\mathbb{R}}^{4}.

Consider, for example, QCD with NfN_{f} flavors on 3×S1{\mathbb{R}}^{3}\times S^{1}. For concreteness, let us take NfNN_{f}\leq N, but NfN_{f} can be made arbitrary. Then, in a center-symmetric background for gauge holonomy, we can also turn on an ΩFSU(Nf)\Omega_{F}\in SU(N_{f}) as in (91). Then, we have NN monopoles with action SI/NS_{I}/N. NNfN-N_{f} of the monopoles do not have a fermi zero mode (call this set 𝒮1{\cal S}_{1}) and NfN_{f} of them have 2 fermi zero modes each (call this set 𝒮2{\cal S}_{2}). This is the set-up introduced in Cherman:2016hcd ; Cherman:2017dwt to explain the transmutation of θ+2πkNf\frac{\theta+2\pi k}{N_{f}} dependence in NfN_{f} flavor QCD to the θ+2πkNc\frac{\theta+2\pi k}{N_{c}} in pure Yang-Mills as the fermions are decoupled by increasing their masses.

Monopole instantons on 𝒮1{\cal S}_{1} each has 4 bosonic zero modes, 3 position and one angular. Perhaps, in the light of our intentions, it is wise to denote d3adϕL1d4a\mathrm{d}^{3}a\mathrm{d}\phi\equiv L^{-1}\mathrm{d}^{4}a. Monopole instantons on 𝒮2{\cal S}_{2} each has 4 bosonic zero modes, 3 position and one angular, and 2 fermi zero modes.

Then, the measure, in the background of the monopole instantons, which amounts to the calculation of pre-exponential factor, takes the form:

dμmon,j{μ4d4ae8π2g2N+iθN(det[D2](μ))1(rmonμ)1/3,=μ413e8π2g2N+iθNd4aj𝒮1μ4d4aμ1d2ξfe8π2g2N+iθN(det[D2](μ))1(rmonμ)1/3(det[γμDμ](μ))(rmonμ)1/3,=μ413μ1+13e8π2g2N+iθNd4ad2ξfj𝒮2\displaystyle{{\rm d}\mu_{{\rm mon},j}}\sim\left\{\begin{array}[]{ll}\mu^{4}d^{4}a\;\mathrm{e}^{-\frac{8\pi^{2}}{g^{2}N}+\mathrm{i}\frac{\theta}{N}}\;\underbrace{\left({\det}^{{}^{\prime}}[-D^{2}](\mu)\right)^{-1}}_{(r_{\rm mon}\mu)^{-1/3}},&\cr=\mu^{4-\frac{1}{3}}\mathrm{e}^{-\frac{8\pi^{2}}{g^{2}N}+\mathrm{i}\frac{\theta}{N}}d^{4}a\ldots&{j\in{\cal S}_{1}}\cr\cr\mu^{4}d^{4}a\;\mu^{-1}d^{2}\xi_{f}\mathrm{e}^{-\frac{8\pi^{2}}{g^{2}N}+\mathrm{i}\frac{\theta}{N}}\;\underbrace{\left({\det}^{{}^{\prime}}[-D^{2}](\mu)\right)^{-1}}_{(r_{\rm mon}\mu)^{-1/3}}\underbrace{\left({\det}^{{}^{\prime}}_{\cal R}[\gamma_{\mu}D_{\mu}](\mu)\right)}_{(r_{\rm mon}\mu)^{1/3}},&\cr=\mu^{4-\frac{1}{3}}\mu^{-1+\frac{1}{3}}\mathrm{e}^{-\frac{8\pi^{2}}{g^{2}N}+\mathrm{i}\frac{\theta}{N}}d^{4}a\;d^{2}\xi_{f}\ldots&{j\in{\cal S}_{2}}\end{array}\right. (311)

where det{\det}^{{}^{\prime}} is the fluctuation operator over non-zero modes, which also induces μ\mu-scale dependence.

BPSTj𝒮1jj𝒮2j\displaystyle{\cal I}_{\rm BPST}\sim\prod_{{j\in{\cal S}_{1}}}{\cal M}_{j}\prod_{{j\in{\cal S}_{2}}}{\cal M}_{j} (μ413e8π2g2N+iθN)NNf(μ413μ1+13e8π2g2N+iθN)Nf\displaystyle\sim\left(\mu^{4-\frac{1}{3}}\mathrm{e}^{-\frac{8\pi^{2}}{g^{2}N}+\mathrm{i}\frac{\theta}{N}}\right)^{N-N_{f}}\left(\mu^{4-\frac{1}{3}}\mu^{-1+\frac{1}{3}}\mathrm{e}^{-\frac{8\pi^{2}}{g^{2}N}+\mathrm{i}\frac{\theta}{N}}\right)^{N_{f}} (312)
=μ113N23Nfe8π2g2+iθ\displaystyle=\mu^{\frac{11}{3}N-\frac{2}{3}N_{f}}\mathrm{e}^{-\frac{8\pi^{2}}{g^{2}}+\mathrm{i}\theta} (313)

The part of this expression involving Pauli-Villars scales and how it combines to produce renormalization group invariant strong scale by combining NN constituent looks like an elegant conspiracy. The standard calculations of the pre-exponential factor for a BPST instanton (in the standard parametrization) can be found in NSVZ review Vainshtein:1981wh .

It is very tempting that the structure above arises in 4{\mathbb{R}}^{4} when QCD(F) is considered on U(1)V/NU(1)_{V}/{\mathbb{Z}}_{N} background and the theory is coupled to a N{\mathbb{Z}}_{N} TQFT. We suspect that in the decompactification to 4{\mathbb{R}}^{4}, (rmonμ)(r_{\rm mon}\mu) type factors on 3×S1{\mathbb{R}}^{3}\times S^{1} will be replaced by combinations such as (|ajmakn|μ)(|a^{m}_{j}-a^{n}_{k}|\mu), (|a¯jma¯kn|μ)(|\overline{a}^{m}_{j}-\overline{a}^{n}_{k}|\mu), or (|ajma¯kn|μ)(|a^{m}_{j}-\overline{a}^{n}_{k}|\mu). Once we consider original QCD on 4{\mathbb{R}}^{4}, we should probably inspect proliferation of the fractional instantons which should appear in a way similar to monopole-instantons do on 3×S1{\mathbb{R}}^{3}\times S^{1}. They must obey the global constraints similar to monopole-instantons, and we should see if one can obtain a self-consistent construction of QCD vacuum for general NfN_{f}.

Acknowledgements.
I am deeply grateful to Aleksey Cherman, Yuya Tanizaki, Tin Sulejmanpasic for many discussions at odd hours. I am also thankful to Margarita Garcia Perez, Thomas Schäfer, Patrick Draper, Mohamed Anber, Gerald Dunne, Tony Gonzalez-Arroyo, Mikhail Shifman, David Gross, Philip Argyres, Rob Pisarski, Pierre van Baal for past discussions. Although I have not met with Polyakov in person, I am thankful to him for his unpretentious beautiful book. My work is supported by the U.S. Department of Energy, Office of Science, Division of Nuclear Physics under Award DE-SC0013036.

Appendix A Yang-Mills with flux on 13×Sβ11^{3}\times S^{1}_{\beta} lattice vs. TNT_{N} model

Let us consider a 4d lattice gauge theory with discrete space and continuous time, Ns3×N_{s}^{3}\times{\mathbb{R}}. Then, to reduce the model to simple quantum mechanics, we take Ns=1N_{s}=1 in the same spirit with the twisted Eguchi-Kawai model GonzalezArroyo:1982ub ; GonzalezArroyo:1982hz ; GonzalezArroyo:2010ss , by employing twisted boundary conditions, but we do not take the large-NN limit.

We consider 11-site theory with Lagrangian

g2=tr|dU~i|2+tr|[U~i,U~j]|2\displaystyle g^{2}{\cal L}=\text{tr}|\mathrm{d}\widetilde{U}_{i}|^{2}+\text{tr}|[\widetilde{U}_{i},\widetilde{U}_{j}]|^{2} (314)

in the A0=0A_{0}=0 gauge. In path integral, we assume that U~i\widetilde{U}_{i} obey ’t Hooft twisted boundary conditions on the 131^{3} lattice.

U~i,𝐧+Nj𝐞𝐣\displaystyle\widetilde{U}_{i,{\bf n}+N_{j}{\bf e_{j}}} =ΩjU~i,𝐧Ωj\displaystyle=\Omega_{j}\widetilde{U}_{i,{\bf n}}\Omega^{\dagger}_{j} (315)

Consistency at, e.g. N1𝐞𝟏+N2𝐞𝟐N_{1}{\bf e_{1}}+N_{2}{\bf e_{2}} requires (recall N1=N2=1N_{1}=N_{2}=1):

Ω1Ω2=Ω2Ω1ei2πN12\displaystyle\Omega_{1}\Omega_{2}=\Omega_{2}\Omega_{1}\mathrm{e}^{\mathrm{i}\frac{2\pi}{N}\ell_{12}} (316)

where 12\ell_{12} is B(2)B^{(2)} flux mod NN. Since lattice is only 1-site, this forces the transitions matrices to be independent of space coordinates. This actually leads to subtle and important differences between an arbitrarily small continuum T3×Sβ1T^{3}\times S^{1}_{\beta} theory and 13×Sβ11^{3}\times S^{1}_{\beta} theory. In the former, the transition matrices are allowed to depend on spacetime, but not in the latter, and this has consequences.

Undoing the twisted boundary conditions in favor of a classical background field, we end up with the flux 12\ell_{12} in the 1212 face and the matrix model becomes

g2=tr|dUi|2+tr|[U1U2ei2πN12U2U1]|2+tr|[U1,U3]|2+tr|[U2,U3]|2\displaystyle g^{2}{\cal L}=\text{tr}|\mathrm{d}U_{i}|^{2}+\text{tr}|[U_{1}U_{2}-\mathrm{e}^{\mathrm{i}\frac{2\pi}{N}\ell_{12}}U_{2}U_{1}]|^{2}+\text{tr}|[U_{1},U_{3}]|^{2}+\text{tr}|[U_{2},U_{3}]|^{2} (317)

where path integral over the Ui(τ)U_{i}(\tau) field is periodic both in time and space direction (which is reduced to 1-site).

Let us describe the classical minima of this system. The minimum of the modified commutator (take 12=1\ell_{12}=1) is U1=CU_{1}=C, U2=SU_{2}=S, where CC and SS are clock and shift matrices. The minima of the action correspond to the values of U3U_{3} which commutes both with CC and SS. Since CC and SS are generators of GL(N,)GL(N,\mathbb{C}) algebra, the only matrix that commutes with both is proportional to identity. Since U3U_{3} lives in SU(N)SU(N), the minima are

U3=ei2πNj𝟙,j=0,,N1\displaystyle U_{3}=\mathrm{e}^{\mathrm{i}\frac{2\pi}{N}j}{\mathbbm{1}},\qquad j=0,\ldots,N-1 (318)

Therefore, the classical system for the 12=1\ell_{12}=1 sector has NN-minima, corresponding to NN center broken configurations. This is analogous to our discussion on large- T3×Sβ1T^{3}\times S^{1}_{\beta} where we showed that the 12=1\ell_{12}=1 sector was also NN-fold degenerate, see (273). Of course, in quantum mechanics, center symmetry cannot break spontaneously due to tunneling effects at any finite-NN.

How do we describe the tunneling between these minima? Let us take U3(τ)U_{3}(\tau) of the form

U3=Diag(eiΘ1,,eiΘN),whereΘN=(Θ1++ΘN1)\displaystyle U_{3}={\rm Diag}(\mathrm{e}^{\mathrm{i}\Theta_{1}},\ldots,\mathrm{e}^{\mathrm{i}\Theta_{N}}),\qquad{\rm where}\qquad\Theta_{N}=-(\Theta_{1}+\ldots+\Theta_{N-1}) (319)

Then, the (classical) potential can be expressed as

V(Θi)=i=1Ncos(ΘiΘi+1)V(\Theta_{i})=-\sum_{i=1}^{N}\cos(\Theta_{i}-\Theta_{i+1}) (320)

Despite the fact that this potential looks like Gross-Pisarski-Yaffe (GPY) type potentials for gauge holonomy Gross:1980br , this is superficial and they are fundamentally different. The GPY type-potentials is a quantum effect, starting at 1-loop order as O(g2)O(g^{2}). However, (320) is classical, not quantum. It is induced by the presence of classical ’t Hooft flux background. As a result, for example, if one examines instantons in the theory with flux, their action is 1g2\sim\frac{1}{g^{2}} since they arise from classical action. In contrast, if one examines tunneling on small T3×T^{3}\times{\mathbb{R}} without flux, the action is of the form 1g\sim\frac{1}{g}, because these configurations arise from the balancing of classical actions against quantum induced potential, see eg. vanBaal:1986ag ; Luscher:1982ma .

To describe these tunnelings, one can take simple ansatz for U3(τ)U_{3}(\tau)

U3(τ)=Diag(eiΘ,,eiΘ,ei(N1)Θ)U_{3}(\tau)={\rm Diag}(\mathrm{e}^{\mathrm{i}\Theta},\ldots,\mathrm{e}^{\mathrm{i}\Theta},\mathrm{e}^{-\mathrm{i}(N-1)\Theta}) (321)

Then, the Lagrangian becomes

g2=(N1)N(dΘ)2cos(NΘ)\displaystyle g^{2}{\cal L}=(N-1)N(\mathrm{d}\Theta)^{2}-\cos(N\Theta) (322)

where ΘΘ+2π\Theta\sim\Theta+2\pi. This is nothing but our TNT_{N} model, with the distinction that the mass of the particle, instead of being set to one, is now (N1)N(N-1)N.

The remnant of the center-symmetry (which becomes (N[0])4({\mathbb{Z}}_{N}^{[0]})^{4} 0-form symmetry on T3×Sβ1T^{3}\times S^{1}_{\beta}) is just U3ei2πNU3U_{3}\rightarrow\mathrm{e}^{\mathrm{i}\frac{2\pi}{N}}U_{3} or equivalently, acting on coordinate Θ\Theta as a shift symmetry ΘΘ+2πN\Theta\rightarrow\Theta+\frac{2\pi}{N}, i.e., the part of center-symmetry which acts on U3U_{3} non-trivially becomes the N{\mathbb{Z}}_{N} shift symmetry of the TNT_{N} model. To reveal the fractional instantons in the background of ’t Hooft flux, we can proceed exactly as in our QM example.

We can build the fractional instanton events by considering the transition amplitudes between the minima which are 34{\ell_{34}} units apart. Hamiltonian now is the one corresponding to (322) or more microscopically, (317), where HH already has the magnetic flux data, so we call it H12H_{\ell_{12}}. So, the quantity we are calculating is

Z1234\displaystyle Z_{\ell_{12}\ell_{34}} =tr[eβH12(𝖴c)34]\displaystyle=\text{tr}[\mathrm{e}^{-\beta H_{\ell_{12}}}(\mathsf{U}_{\mathrm{c}})^{\ell_{34}}] (323)

which forces twisted boundary condition in U3U_{3} by 34\ell_{34} unit, U3(β=L4)=ei2πN34U3(0)U_{3}(\beta=L_{4})=\mathrm{e}^{\mathrm{i}\frac{2\pi}{N}{\ell_{34}}}U_{3}(0).

These fractional instantons are the ones in the theory with ’t Hooft flux, (12,34)({\ell_{12},\ell_{34}}). They describe tunneling events in our matrix model. Roughly, θ˙E3(τ)\dot{\theta}\sim E_{3}(\tau) and cos(Nθ(τ))B3(τ)\cos(N\theta(\tau))\sim B_{3}(\tau). Despite the fact that these are reasonable fractional instanton events in the 1-site model, they do not provide a description of the fractional instantons in continuum theory on small T3×T^{3}\times{\mathbb{R}} in the presence of ’t Hooft flux. For once, their action is Sa(N(N1)g2NS_{a}\sim\frac{\sqrt{(N(N-1)}}{g^{2}N}. In fact, at this stage, we are facing a problem that ’t Hooft faced in a time independent continuum version of this problem tHooft:1981nnx where the simple solutions he found did have an action which did not descend below Sa=(N1)N8π2g2S_{a}=\frac{(N-1)}{N}\frac{8\pi^{2}}{g^{2}}. (The problem in tHooft:1981nnx is fixed by introducing space-time dependent transition matrices.) In our case as well, this is an artifact of 1-site model, and the fact that we are restricting to space-independent transition functions. Our continuum formulations tells us that even at arbitrarily small volume T3×Sβ1T^{3}\times S^{1}_{\beta}, there exist configurations which saturate BPS bound, and this is consistent with lattice simulations on Ns3×Sβ1N_{s}^{3}\times S^{1}_{\beta} where Ns2N_{s}\geq 2 Gonzalez-Arroyo:2019wpu ; Montero:2000mv .

Appendix B Aside: How to fix merons in 1\mathbb{CP}^{1}?

Both instanton Polyakov:1975yp and smooth fractional instanton solutions (in the ΩF\Omega_{F} or B(2)B^{(2)} flux background) Bruckmann:2007zh ; Dunne:2012ae have aspects in sharp contrast with meron configurations Gross:1977wu . But clearly they look quite alike as shown in Figure below, where horizontal axis is the angular coordinate on S1S^{1}_{\infty} in Fig.4. Let us try to make some sense out of this situation for N=2N=2.

Refer to caption
Figure 8: Horizontal axis is the angular coordinate on S1S^{1}_{\infty} in Fig.4. a) Meron data is piecewise continuous or discontinuous. Action is asserted to be log divergent Gross:1977wu , but that needs a re-interpretation, see text. b) Instanton solution in the ΩF\Omega_{F} background. A fractional instanton pair is a critical point at infinity. Above configuration is on the Lefschetz thimble of this exact critical point. Insertion of B(2)B^{(2)} ’t Hooft flux, one can isolate each smooth configuration with topological charge 12{\textstyle{\frac{1}{2}}}.

1) For merons, configurations, action densities, and topological charge densities are either piece-wise continuous or discontinuous, and they possess logarithmically divergent action Gross:1977wu ; Callan:1977gz ; Callan:1977qs .

2) Instantons are smooth single lump solutions Polyakov:1975yp . On the other hand, the parametrization of Refs.Berg:1979uq ; Fateev:1979dc , the determinant of fluctuation operator gives a precise mathematical description as if these single lumps ought to be interpreted as composites of fractional vortex instantons, despite the fact that an instanton does not physically fractionate to two pieces. Gross uses this parametrization even earlier and shows that the interaction between the instanton and anti-instanton is a dipole-dipole interaction in 2d2d in Gross:1977wu . The two are consistent.

3) The solutions in Bruckmann:2007zh ; Dunne:2012ae are generalization of Ref. Polyakov:1975yp to the ΩF\Omega_{F} background, and it physically fractionates an instanton into pieces. If we want to single out a fractional instanton, we need to insert a B(2)B^{(2)} flux, and this will guarantee a W=1/NW=1/N configuration with S=SI/NS=S_{I}/N. These configurations are smooth and finite action, unlike merons.

If we superficially smoothen the meron, as suggested by Gross Gross:1977wu , that is close enough to exact analytic solutions in the ΩF\Omega_{F} background Bruckmann:2007zh ; Dunne:2012ae . What is called the logarithmically divergent action of meron pair is the logarithmic Coulomb interaction between the fractional vortex instantons. This also arise from exact computation of the determinant of the fluctuation operator Berg:1979uq ; Fateev:1979dc .

Meron idea is not generalized correctly to SU(N)SU(N) gauge theories and N1\mathbb{CP}^{N-1} with N3N\geq 3 Gross:1977wu ; Callan:1977gz ; Callan:1977qs . The difficulty is in the motto “meron= half-instanton”. For a given NN, the fractionalization (in center-symmetric holonomy background, or ΩF\Omega_{F} or N{\mathbb{Z}}_{N} TQFT background) is always into NN constituents, not more, not less! Given this fact, it seems very worthwhile to revisit the idea.

’t Hooft long time ago came up with a PSU(N)PSU(N) bundle construction, to figure out configurations with topological charge and action 1N\frac{1}{N}. He was only able to find some exact, but uninteresting constant configurations on T4T^{4}. To be fair to him, one should state that no one was able to find non-trivial (space-time dependent) solutions in PSU(N)PSU(N) bundle up to date analytically! Numerically, they are proven to exist in GarciaPerez:1989gt ; GarciaPerez:1992fj ; Montero:2000mv . In this work, we interpreted monopole-instanton solutions as configurations in the PSU(N)PSU(N) bundle with action SI/NS_{I}/N on 3×S1{\mathbb{R}}^{3}\times S^{1} and this is a concrete realization of (another) long sought dream of ’t Hooft.

’t Hooft asserts that his idea of configurations with N{\mathbb{Z}}_{N} topological charge are important for the confinement and mass gap problem, but not merons or instantons tHooft:1977nqb . But at least for SU(2)SU(2) or 1\mathbb{CP}^{1}, these two ideas (with enough cosmetic changes) are actually the same. (I suspect both of them would think otherwise, hopefully up until today.) The correct part of the meron idea in 1\mathbb{CP}^{1} is the “constituents” that interact with each other via long range logarithmic interactions.

Probably, the strangest insight that our work brings is following. Despite the naming “fractionalization”, a physical fractionalization of action density may or may not take place. Physical fractionalization of a lump is only achieved by turning on appropriate background field. Yet, the clever parametrization of moduli space (136) tells us that mathematically, the moduli parameters acts as if they are the positions of fractional vortex instantons. This is in some sense, can be viewed as fractionalization without fractionalization.

Sometimes, it takes long time to figure out truth in a wild dream Gross:1977wu , and utility of a formal construction tHooft:1977nqb , perhaps, because the truth lives somewhere in between.

References