This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Fermi Machine
— Quantum Many-Body Solver Derived from Correspondence
between Noninteracting and Strongly Correlated Fermions

Masatoshi Imada Physics DivisionPhysics Division Sophia University Sophia University Chiyoda Chiyoda Tokyo 102-8554 Tokyo 102-8554 Japan
Faculty of Engineering Japan
Faculty of Engineering University of Tokyo University of Tokyo 7-3-1 Hongo 7-3-1 Hongo Bunkyo-ku Bunkyo-ku Tokyo 113-8656 Tokyo 113-8656 Japan Japan
Abstract

Stimulated by the successful descriptions of strongly correlated electron systems by fractionalized fermions, correspondence between interacting fermions and non-interacting multi-component fermions is formulated in examples of the Hubbard model. The formalism enables constructions of the neural network for quantum many-body solvers represented by coupled noninteracting fermions. After showing the exact correspondences of 1- and 2-site Hubbard models to two-component noninteracting fermions, a numerical algorithm of the quantum machine learning for the Hubbard model is proposed. Benchmark for the 4-site systems is successfully presented and promising future directions as well as implications are discussed.

1 Introduction

Fractionalization of electrons in strongly correlated electron systems has been proposed in several different phenomena. One dimensional itinerant electron systems exhibit the separation of spin and charge degrees of freedom [1, 2]. Models of polyacetylene accommodate spin and charge solitons with fractionalized charge as elementary excitations [3]. Fractional quantum Hall states under the stong magnetic fields [4] as well as under the emergent chiral symmetry breaking [5] show the fractionalization of electronic charge and fractional quantizations. Slave bosons and fermions were proposed to approximately describe the 2D systems as models of cuprate high-TcT_{c} superconductors [6, 7]. Recently, the fractionalization of an electron into two or multiple fermion components has been examined and has successfully accounted for otherwise puzzling spectroscopic experimental results of the cuprates [8, 9, 10, 11, 12, 13].

Meanwhile, numerical method to construct ground state wave functions of strongly interacting quantum many-body systems is one of the grand challenges in physics. Various algorithms such as a variety of auxiliary-field quantum Monte Carlo [14, 15, 16], variational Monte Carlo [17, 18, 19, 20, 21, 22], density matrix renormalization group [23], tensor network [24, 25, 26], density matrix embedding [27] and dynamical mean-field theory [28, 29] have been proposed. Recently, neural network with Boltzmann machine has established an accurate way to approximate ground states of quantum spin systems [30] as well as itinerant fermion systems [31]. Vision transformer, another architecture of the neural network, has also shown the state-of-the-art accuracy for quantum spin models [32, 33]. We note that these neural networks are constructed by introducing hidden variables that are described by classical degrees of freedom.

In this paper, we discuss mapping of the Hubbard model (or in more general, interacting lattice fermions) to noninteracting multi-component fermion models in the ground states that materializes the fractionalization, from which we propose a quantum machine learning algorithm aiming at an efficient quantum many-body solver. Here, we introduce a fermion system as the hidden part coupled to the physical system, instead of the classical one such as the Ising spins in the previous neural networks.

In Secs. I and II, the equivalence between the Hubbard model and the two-component fermion model (TCFM) is shown for the atomic limit and for the 2-site systems, respectively. Based on this equivalence, we propose a quantum machine learning algorithm in Sec. III, where the Hubbard model is approximated by a deep-layer representation of noninteracting multi-component fermion models. The couplings between the layers are represented by the hybridization of fermions belonging to neighboring layers. This method is viewed as an extension of restricted Boltzmann-machine representation of the neural quantum states proposed by Carleo and Troyer [30] to a quantum neural network, where the Ising hidden variable in the Boltzmann machine is replaced by the fermionic operators that allows hybridization to generate quantum entanglement. Furthermore, it is also regarded as an embodiment of the electron fractionalization [9, 10], which has succeeded in solving a number of experimental puzzles in the cuprate high-TcT_{c} superconductors [8, 12, 10, 13]. This quantum neural network can also be viewed as a tractable realization of the ground states by a systematic expansion to take account of correlation effects similarly to the formulation by the equation of motion method or equivalently the continued fraction expansion of the Green’s function for the excited states [34, 35, 36]. The present method does not seem to have difficulty of exponentially increasing terms encountered in the conventional equation of motion method.

2 Hubbard model in the atomic limit

In this section, the result shown in Appendix A of Ref. \citenImada2019 is briefly summarized for the self-contained presentation. We consider the Hubbard UU term

U\displaystyle{\cal H}_{U} =\displaystyle= Unn,\displaystyle Un_{\uparrow}n_{\downarrow}, (1)

with nσ=cσcσn_{\sigma}=c^{\dagger}_{\sigma}c_{\sigma} for the creation (annihilation) operators cσc_{\sigma}^{\dagger} (cσc_{\sigma}) of the spin σ\sigma, and introduce an auxiliary fermion

d~σ=cσ(12nσ),\displaystyle\tilde{d}_{\sigma}=c_{\sigma}(1-2n_{-\sigma}), (2)

together with

c~σ=cσ.\displaystyle\tilde{c}_{\sigma}=c_{\sigma}. (3)

Equation (1) can then be rewritten in a form of the TCFM as

TCFM\displaystyle{\cal H}_{\rm TCFM} =\displaystyle= (c~)+(d~)+(c~d~),\displaystyle{\cal H}^{(\tilde{c})}+{\cal H}^{(\tilde{d})}+{\cal H}^{(\tilde{c}\tilde{d})}, (4)
(c~)\displaystyle{\cal H}^{(\tilde{c})} =\displaystyle= μc~c~σc~σ,\displaystyle\mu_{\tilde{c}}\tilde{c}_{\sigma}^{\dagger}\tilde{c}_{\sigma}, (5)
(d~)\displaystyle{\cal H}^{(\tilde{d})} =\displaystyle= μd~d~σd~σ,\displaystyle\mu_{\tilde{d}}\tilde{d}_{\sigma}^{\dagger}\tilde{d}_{\sigma}, (6)
(c~d~)\displaystyle{\cal H}^{(\tilde{c}\tilde{d})} =\displaystyle= Λ(c~σd~σ+H.c),\displaystyle\Lambda(\tilde{c}_{\sigma}^{\dagger}\tilde{d}_{\sigma}+{\rm H.c}), (7)

with the mapping

μc~=μd~=Λ=U2.\displaystyle\mu_{\tilde{c}}=\mu_{\tilde{d}}=-\Lambda=\frac{U}{2}. (8)

For the derivation of Eq.(4) from Eq. (1), see below.

Note first that d~\tilde{d} and c~\tilde{c} satisfy the exact anticommutation relation in the ground state average,

c~σc~σ+c~σc~σ=1,\displaystyle\langle\tilde{c}_{\sigma}\tilde{c}_{\sigma}^{\dagger}+\tilde{c}_{\sigma}^{\dagger}\tilde{c}_{\sigma}\rangle=1, (9)
d~σd~σ+d~σd~σ=1,\displaystyle\langle\tilde{d}_{\sigma}\tilde{d}_{\sigma}^{\dagger}+\tilde{d}_{\sigma}^{\dagger}\tilde{d}_{\sigma}\rangle=1, (10)
c~σd~σ+d~σc~σ=0,\displaystyle\langle\tilde{c}_{\sigma}\tilde{d}_{\sigma}^{\dagger}+\tilde{d}_{\sigma}^{\dagger}\tilde{c}_{\sigma}\rangle=0, (11)

where \langle\cdots\rangle is defined by

=Φ0(1)||Φ0(1)+Φ0(1)||Φ0(1)Φ0(1)|Φ0(1)+Φ0(1)|Φ0(1),\displaystyle\langle\cdots\rangle=\frac{\langle\Phi_{0\uparrow}^{(1)}|\cdots|\Phi_{0\uparrow}^{(1)}\rangle+\langle\Phi_{0\downarrow}^{(1)}|\cdots|\Phi_{0\downarrow}^{(1)}\rangle}{\langle\Phi_{0\uparrow}^{(1)}|\Phi_{0\uparrow}^{(1)}\rangle+\langle\Phi_{0\downarrow}^{(1)}|\Phi_{0\downarrow}^{(1)}\rangle}, (12)

because of the ground state degeneracy of Kramers doublet, when the one-particle ground state is described as we prove later as

|Φ0σ(1)=(αcc~σ+αdd~σ)|0|\Phi_{0\sigma}^{(1)}\rangle=(\alpha_{c}\tilde{c}_{\sigma}^{\dagger}+\alpha_{d}\tilde{d}_{\sigma}^{\dagger})|0\rangle (13)

with σ\sigma being either \uparrow or \downarrow. (Note that the one-particle state must be degenerate between |Φ0(1)|\Phi_{0\uparrow}^{(1)}\rangle and |Φ0(1)|\Phi_{0\downarrow}^{(1)}\rangle.) In this sense, c~\tilde{c} and d~\tilde{d} behave as orthogonal fermions for single-particle excitations from the ground state, which is the degenerate ensemble of |Φ0(1)|\Phi_{0\uparrow}^{(1)}\rangle and |Φ0(1)|\Phi_{0\downarrow}^{(1)}\rangle.

By diagonalizing Eq.(4) for the case μc~=μd~\mu_{\tilde{c}}=\mu_{\tilde{d}}, one obtains for the spin σ\sigma part

DTCFM\displaystyle{\cal H}_{\rm DTCFM} =\displaystyle= μaaσaσ+μbbσbσ,\displaystyle\mu_{a}a_{\sigma}^{\dagger}a_{\sigma}+\mu_{b}b_{\sigma}^{\dagger}b_{\sigma}, (14)
μb\displaystyle\mu_{b} =\displaystyle= 12(μc~+μd~)+Λ,\displaystyle\frac{1}{2}(\mu_{\tilde{c}}+\mu_{\tilde{d}})+\Lambda, (15)
μa\displaystyle\mu_{a} =\displaystyle= 12(μc~+μd~)Λ\displaystyle\frac{1}{2}(\mu_{\tilde{c}}+\mu_{\tilde{d}})-\Lambda (16)

with

bσ\displaystyle b_{\sigma} =\displaystyle= 12(c~σ+d~σ)=2cσ(1nσ),\displaystyle\frac{1}{\sqrt{2}}(\tilde{c}_{\sigma}+\tilde{d}_{\sigma})=\sqrt{2}c_{\sigma}(1-n_{-\sigma}), (17)
aσ\displaystyle a_{\sigma} =\displaystyle= 12(c~σd~σ)=2cσnσ.\displaystyle\frac{1}{\sqrt{2}}(\tilde{c}_{\sigma}-\tilde{d}_{\sigma})=\sqrt{2}c_{\sigma}n_{-\sigma}. (18)

The bonding and antibonding states are given by Eqs. (17) and (18), respectively, which are nothing but the operators for the lower and upper Hubbard levels, respectively, in the Hubbard model whose averaged energies are given by E=0E=0 and UU. Namely, aside from the normalization factor, bσ{b}_{\sigma}^{\dagger} creates an electron with the spin σ\sigma when it is not occupied by the opposite spin electron (namely it fills the lower Hubbard state in the corresponding Hubbard model) as is apparent from the last equation of Eq.(17). On the other hand, aσa_{\sigma}^{\dagger}, creates an electron with the spin σ\sigma, when the opposite spin electron already exists, namely it creates a doublon at the upper Hubbard state in the Hubbard model. It helps an intuitive interpretation of the correspondence between the Hubbard model and the TCFM. With the choice Eq.(8), μa=0\mu_{a}=0 and μb=U\mu_{b}=U are obtained and the Hubbard gap is reproduced. In addition, the ground state bσ|0b_{\sigma}^{\dagger}|0\rangle indeed satisfies the form of Eq.(13). Then the mapping between Eqs.(1) and (4) becomes exact. In this correspondence, the Mott gap in the Hubbard model is interpreted as the hybridization gap in the TCFM.

In general, one can show that the single-particle Green’s function for the TCFM is given by

Gc~σ,c~σ(ω)\displaystyle G_{\tilde{c}_{\sigma},\tilde{c}_{\sigma}^{\dagger}}(\omega) =\displaystyle= 1ωμc~Λ2ωμd~,\displaystyle\frac{1}{\omega-\mu_{\tilde{c}}-\frac{\Lambda^{2}}{\omega-\mu_{\tilde{d}}}},
Gc~σ,d~σ(ω)\displaystyle G_{\tilde{c}_{\sigma},\tilde{d}_{\sigma}^{\dagger}}(\omega) =\displaystyle= Gd~σ,c~σ(ω)=Λ(ωμc~)(ωμd~)Λ2,\displaystyle G_{\tilde{d}_{\sigma},\tilde{c}_{\sigma}^{\dagger}}(\omega)=\frac{-\Lambda}{(\omega-\mu_{\tilde{c}})(\omega-\mu_{\tilde{d}})-\Lambda^{2}},
Gd~σ,d~σ(ω)\displaystyle G_{\tilde{d}_{\sigma},\tilde{d}_{\sigma}^{\dagger}}(\omega) =\displaystyle= 1ωμd~Λ2ωμc~.\displaystyle\frac{1}{\omega-\mu_{\tilde{d}}-\frac{\Lambda^{2}}{\omega-\mu_{\tilde{c}}}}. (19)

Then from Eq.(8), we obtain

Gc~σ,c~σ(ω)\displaystyle G_{\tilde{c}_{\sigma},\tilde{c}_{\sigma}^{\dagger}}(\omega) =\displaystyle= 1ωU2U24ωU2\displaystyle\frac{1}{\omega-\frac{U}{2}-\frac{\frac{U^{2}}{4}}{\omega-\frac{U}{2}}} (20)
=\displaystyle= 12[1ω+1ωU].\displaystyle\frac{1}{2}\left[\frac{1}{\omega}+\frac{1}{\omega-U}\right]. (21)

This is equivalent to the Green’s function of the atomic Hubbard, Eq.(1). Then the self-energy has the correct form as well:

Σ(ω)\displaystyle{\Sigma}(\omega) =\displaystyle= U24ωU2.\displaystyle\frac{\frac{U^{2}}{4}}{\omega-\frac{U}{2}}. (22)

In this way, exact correspondence is established between the TCFM (4) and the Hubbard model (1) in the atomic limit for the half-filled ground state as well as for single-particle excitations from it. Namely, the full Hilbert space of the Hubbard model in the atomic limit is equivalent to that of the TCFM.

3 Two-site Hubbard model

In this section, the mapping of the atomic limit already established and summarized in the last section is extended and a mapping between the 2-site Hubbard model and the 2-site TCFM is shown.

The 2-site Hubbard Hamiltonian reads

\displaystyle{\cal H} =\displaystyle= t+U,\displaystyle{\cal H}_{t}+{\cal H}_{U}, (23)
t\displaystyle{\cal H}_{t} =\displaystyle= σ[t(c1σc2σ+c2σc1σ)+μi=1,2ni,σ],\displaystyle\sum_{\sigma}[-t(c_{1\sigma}^{\dagger}c_{2\sigma}+c_{2\sigma}^{\dagger}c_{1\sigma})+\mu\sum_{i=1,2}n_{i,\sigma}], (24)
U\displaystyle{\cal H}_{U} =\displaystyle= Ui=1,2nini.\displaystyle U\sum_{i=1,2}n_{i\uparrow}n_{i\downarrow}. (25)

with ni,σ=ci,σci,σn_{i,\sigma}=c^{\dagger}_{i,\sigma}c_{i,\sigma}. We take μ=0\mu=0, because in the canonical ensemble, spatially uniform chemical potential does not change physical properties except for a trivial energy shift.

We first analyze the half-filled case with one \uparrow and one \downarrow electron and use the basis of the full Hilbert space expanded by |,,|,,|,0,|0,|\uparrow,\downarrow\rangle,|\downarrow,\uparrow\rangle,|\uparrow\downarrow,0\rangle,|0,\uparrow\downarrow\rangle in the notation |n1n1,n2n2|n_{1\uparrow}n_{1\downarrow},n_{2\uparrow}n_{2\downarrow}\rangle, where niσn_{i\sigma} is the number of spin σ\sigma fermion at the site ii and is denoted as \uparrow if ni=1n_{i\uparrow}=1 and \downarrow if ni=1n_{i\downarrow}=1 while 0 if ni=ni=0n_{i\uparrow}=n_{i\downarrow}=0 at the site ii. For instance, |,|\uparrow,\downarrow\rangle represents c1c2|0c_{1\uparrow}^{\dagger}c_{2\downarrow}^{\dagger}|0\rangle and |,=c2c1|0|\downarrow,\uparrow\rangle=c_{2\uparrow}^{\dagger}c_{1\downarrow}^{\dagger}|0\rangle such that up-spin creation operators are ordered in the left-hand side and spatial sites are ordered in the site-number order from 1 to 2 for the same spin. Here |0|0\rangle is the vacuum. In this notation, the Hamiltonian matrix is written as

=(|,|,|,0|0,,|00tt,|00tt,0|ttU00,|tt0U).\displaystyle{\cal H}=\left(\begin{array}[]{c|cccc}&|\uparrow,\downarrow\rangle&|\downarrow,\uparrow\rangle&|\uparrow\downarrow,0\rangle&|0,\uparrow\downarrow\rangle\\ \hline\cr\langle\uparrow,\downarrow|&0&0&-t&-t\\ \langle\downarrow,\uparrow|&0&0&-t&-t\\ \langle\uparrow\downarrow,0|&-t&-t&U&0\\ \langle 0,\uparrow\downarrow|&-t&-t&0&U\end{array}\right). (31)

Here and hereafter, we have explicitly written the choice of the basis as the row and column vectors to make the definition of the components of the 4×44\times 4 matrix clearer.

The eigenvalues are

E0\displaystyle E_{0} =\displaystyle= U2Q,\displaystyle\frac{U}{2}Q,
E1\displaystyle E_{1} =\displaystyle= 0,\displaystyle 0,
E2\displaystyle E_{2} =\displaystyle= U,\displaystyle U,
E3\displaystyle E_{3} =\displaystyle= U2P,\displaystyle\frac{U}{2}P, (32)

where P=1+1+R2P=1+\sqrt{1+R^{2}}, Q=11+R2Q=1-\sqrt{1+R^{2}} and R=4t/UR=4t/U. The normalized eigenfunction of the ground state with the energy E0E_{0} is given by

(R,R,Q,Q)/2(R2+Q2),(R,R,-Q,-Q)/\sqrt{2(R^{2}+Q^{2})}, (33)

which is the coefficients in the basis of (|,,|,,|,0,|0,)(|\uparrow,\downarrow\rangle,|\downarrow,\uparrow\rangle,|\uparrow\downarrow,0\rangle,|0,\uparrow\downarrow\rangle). Namely, the normalized ground state is

12(R2+Q2)(R|,+R|,Q|,0Q|0,).\frac{1}{\sqrt{2(R^{2}+Q^{2})}}(R|\uparrow,\downarrow\rangle+R|\downarrow,\uparrow\rangle-Q|\uparrow\downarrow,0\rangle-Q|0,\uparrow\downarrow\rangle). (34)

Now we introduce the same auxiliary fermions Eqs.(2) and (3) as the atomic limit. By using c~\tilde{c} and d~\tilde{d}, the Hubbard Hamiltonian (23) can be mapped to a noninteracting 2-site TCFM as

TCFM\displaystyle{\cal H}_{\rm TCFM} =\displaystyle= (c~)+(d~)+(c~d~),\displaystyle{\cal H}^{(\tilde{c})}+{\cal H}^{(\tilde{d})}+{\cal H}^{(\tilde{c}\tilde{d})}, (35)
(c~)\displaystyle{\cal H}^{(\tilde{c})} =\displaystyle= σ[tc~(c~1,σc~2,σ+H.c.)+μc~i=1,2nc~,i,σ],\displaystyle\sum_{\sigma}[-t_{\tilde{c}}(\tilde{c}_{1,\sigma}^{\dagger}\tilde{c}_{2,\sigma}+{\rm H.c.})+\mu_{\tilde{c}}\sum_{i=1,2}n_{\tilde{c},i,\sigma}],\ \ \ \ (36)
(d~)\displaystyle{\cal H}^{(\tilde{d})} =\displaystyle= σ[td~(d~1,σd~2,σ+H.c.)+μd~i=1,2nd~,i,σ],\displaystyle\sum_{\sigma}[-t_{\tilde{d}}(\tilde{d}_{1,\sigma}^{\dagger}\tilde{d}_{2,\sigma}+{\rm H.c.})+\mu_{\tilde{d}}\sum_{i=1,2}n_{\tilde{d},i,\sigma}],\ \ \ \ (37)
(c~d~)\displaystyle{\cal H}^{(\tilde{c}\tilde{d})} =\displaystyle= Λσi=1,2(c~i,σd~i,σ+H.c),\displaystyle\Lambda\sum_{\sigma}\sum_{i=1,2}(\tilde{c}_{i,\sigma}^{\dagger}\tilde{d}_{i,\sigma}+{\rm H.c}), (38)

where nc~,i,σ=c~i,σc~i,σn_{\tilde{c},i,\sigma}=\tilde{c}_{i,\sigma}^{\dagger}\tilde{c}_{i,\sigma} and nd~,i,σ=d~i,σd~i,σn_{\tilde{d},i,\sigma}=\tilde{d}_{i,\sigma}^{\dagger}\tilde{d}_{i,\sigma} again with the same mapping Eq. (8) together with

t~c~=t,t~d~=t.\displaystyle\tilde{t}_{\tilde{c}}=t,\tilde{t}_{\tilde{d}}=-t. (39)

The Hilbert space of this TCFM wave function may be expanded in the basis of (c~1,σ+d~1,σ)/2,(c~2,σ+d~2,σ)/2,(c~1,σd~1,σ)/2,(c~2,σd~2,σ)/2))|0,(\tilde{c}_{1,\sigma}^{\dagger}+\tilde{d}_{1,\sigma}^{\dagger})/\sqrt{2},(\tilde{c}_{2,\sigma}^{\dagger}+\tilde{d}_{2,\sigma}^{\dagger})/\sqrt{2},(\tilde{c}_{1,\sigma}^{\dagger}-\tilde{d}_{1,\sigma}^{\dagger})/\sqrt{2},(\tilde{c}_{2,\sigma}^{\dagger}-\tilde{d}_{2,\sigma}^{\dagger})/\sqrt{2}))|0\rangle, where |0|0\rangle is the vacuum of c~\tilde{c} and d~\tilde{d}. Since the spin degeneracy exists and the correlation between opposite spin fermions does not exist, we use an abbreviated notation of this basis as b~i=(c~i+d~i)/2,a~i=(c~id~i)/2\tilde{b}_{i}=(\tilde{c}_{i}+\tilde{d}_{i})/\sqrt{2},\tilde{a}_{i}=(\tilde{c}_{i}-\tilde{d}_{i})/\sqrt{2} for i=1,2i=1,2. The Hamiltonian matrix in this representation is

TCFM=(b~1b~2a~1a~2b~100ttb~200tta~1ttU0a~2tt0U),\displaystyle{\cal H}_{\rm TCFM}=\left(\begin{array}[]{c|cccc}&\tilde{b}_{1}&\tilde{b}_{2}&\tilde{a}_{1}&\tilde{a}_{2}\\ \hline\cr\tilde{b}_{1}^{\dagger}&0&0&-t&-t\\ \tilde{b}_{2}^{\dagger}&0&0&-t&-t\\ \tilde{a}_{1}^{\dagger}&-t&-t&U&0\\ \tilde{a}_{2}^{\dagger}&-t&-t&0&U\end{array}\right), (45)

for both of up and down spin sectors, which is the same as Eq.(31). Then the eigenvalues and eigenfunctions are of course the same as Eqs.(32) and (34), respectively. The ground-state wave function is given by

|Φ0TCFM\displaystyle|\Phi_{0}^{\rm TCFM}\rangle =\displaystyle= 12(R2+Q2)(R(b~1,+b~2,)+Q(a~1,+a~2,))\displaystyle\frac{1}{2(R^{2}+Q^{2})}(R(\tilde{b}_{1,\uparrow}^{\dagger}+\tilde{b}_{2,\uparrow}^{\dagger})+Q(\tilde{a}_{1,\uparrow}^{\dagger}+\tilde{a}_{2,\uparrow}^{\dagger})) (46)
×\displaystyle\times (R(b~1,+b~2,)+Q(a~1,+a~2,))|0,\displaystyle(R(\tilde{b}_{1,\downarrow}^{\dagger}+\tilde{b}_{2,\downarrow}^{\dagger})+Q(\tilde{a}_{1,\downarrow}^{\dagger}+\tilde{a}_{2,\downarrow}^{\dagger}))|0\rangle,

which has the symmetries of spin singlet and spatial parity even. The first excited state is given by

|Φ1TCFM\displaystyle|\Phi_{1}^{\rm TCFM}\rangle =\displaystyle= 24(R2+Q2)\displaystyle\frac{\sqrt{2}}{4(R^{2}+Q^{2})}
×\displaystyle\times ((R(b~1,b~2,)+Q(a~1,a~2,))\displaystyle((R(\tilde{b}_{1,\uparrow}^{\dagger}-\tilde{b}_{2,\uparrow}^{\dagger})+Q(\tilde{a}_{1,\uparrow}^{\dagger}-\tilde{a}_{2,\uparrow}^{\dagger}))
×(R(b~1,+b~2,)+Q(a~1,+a~2,))\displaystyle\times(R(\tilde{b}_{1,\downarrow}^{\dagger}+\tilde{b}_{2,\downarrow}^{\dagger})+Q(\tilde{a}_{1,\uparrow}^{\dagger}+\tilde{a}_{2,\uparrow}^{\dagger}))
\displaystyle- (R(b~1,+b~2,)+Q(a~1,+a~2,))\displaystyle(R(\tilde{b}_{1,\uparrow}^{\dagger}+\tilde{b}_{2,\uparrow}^{\dagger})+Q(\tilde{a}_{1,\uparrow}^{\dagger}+\tilde{a}_{2,\uparrow}^{\dagger}))
×(R(b~1,b~2,)+Q(a~1,a~2,)))|0,\displaystyle\times(R(\tilde{b}_{1,\downarrow}^{\dagger}-\tilde{b}_{2,\downarrow}^{\dagger})+Q(\tilde{a}_{1,\downarrow}^{\dagger}-\tilde{a}_{2,\downarrow}^{\dagger})))|0\rangle,

which is spin triplet and spatial parity odd. Other two higher excited states can be given similarly.

With this correspondence, the TCFM ground state (Eq.(46)) is exactly mapped to the Hubbard terminology as

|Φ0Hub\displaystyle|\Phi_{0}^{\rm Hub}\rangle =\displaystyle= 12(R2+Q2)(R(b~1,b~2,+b~2,b~1,)\displaystyle\frac{1}{2(R^{2}+Q^{2})}(R(\tilde{b}_{1,\uparrow}^{\dagger}\tilde{b}_{2,\downarrow}^{\dagger}+\tilde{b}_{2,\uparrow}^{\dagger}\tilde{b}_{1,\downarrow}^{\dagger}) (48)
+Q(a~1,b~1,+a~2,b~2,))|0\displaystyle+Q(\tilde{a}_{1,\uparrow}^{\dagger}\tilde{b}_{1,\downarrow}^{\dagger}+\tilde{a}_{2,\uparrow}^{\dagger}\tilde{b}_{2,\downarrow}^{\dagger}))|0\rangle
\displaystyle\Leftrightarrow Eq.(29).\displaystyle{\rm Eq.(29)}.

Here, in the Hubbard model, we used the relations (c~i,σd~i,σ)|0=0(\tilde{c}_{i,\sigma}^{\dagger}-\tilde{d}_{i,\sigma}^{\dagger})|0\rangle=0, (c~i,σ+d~i,σ)(c~i,σ+d~i,σ)|0=0(\tilde{c}_{i,\sigma}^{\dagger}+\tilde{d}_{i,\sigma}^{\dagger})(\tilde{c}_{i,-\sigma}^{\dagger}+\tilde{d}_{i,-\sigma}^{\dagger})|0\rangle=0 and (c~1i,σd~i,σ)(c~j,σ+d~j,σ)|0=0(\tilde{c}_{1i,\sigma}^{\dagger}-\tilde{d}_{i,\sigma}^{\dagger})(\tilde{c}_{j,-\sigma}^{\dagger}+\tilde{d}_{j,-\sigma}^{\dagger})|0\rangle=0 for iji\neq j, because c~i,σd~i,σ\tilde{c}_{i,\sigma}^{\dagger}-\tilde{d}_{i,\sigma}^{\dagger} creates a doublon at the iith site singly occupied by a spin σ-\sigma electron and c~i,σ+d~i,σ\tilde{c}_{i,\sigma}^{\dagger}+\tilde{d}_{i,\sigma}^{\dagger} creates an electron at the empty iith site. Mapping for the excited states is similarly shown. Therefore, the exact mapping of not only the ground state but also the whole structure of the spectra between the TCFM and Hubbard model is proven for the two-site system as well.

Here, we remark on the spin degeneracy in the TCFM. Because of the spin degeneracy, the total ground state of the TCFM with one up and one down spin sector is given by the product state |Φ0=|Φ0|Φ0|\Phi_{0}\rangle=|\Phi_{0\uparrow}\rangle|\Phi_{0\downarrow}\rangle. Then the total ground-state energy with one up and one down spin fermions is twice of the ground state energy E0E_{0} in Eq.(32). However, as in the single-site problem, the ground state average in the TCFM \langle\cdots\rangle should be taken as Eq.(12) and the factor 2 is canceled.

For the filling doped with one hole or one electron from the half filling has also trivially the same mapping. For instance, the energy of a hole doped ground state is

|Φ0TCFM\displaystyle|\Phi_{0}^{\rm TCFM}\rangle =\displaystyle= 12(b~1,+b~2,)|0,\displaystyle\frac{1}{2}(\tilde{b}_{1,\uparrow}^{\dagger}+\tilde{b}_{2,\uparrow}^{\dagger})|0\rangle, (49)

which has the energy E0=0E_{0}=0 and is the same as the one-hole doped Hubbard model.

4 Fermi Machine as a Quantum Neural Network

We have shown that the Hubbard model has an exact mapping to the TCFM for one- and two-site problems and the ground-state wave function of the Hubbard model can be constructed from the ground state of the TCFM. It demonstrates that the TCFM is able to describe the energy scales of the Mott gap separating the lower and upper Hubbard bands scaled by E2E0UE_{2}-E_{0}\propto U and the singlet-triplet gap (E1E0E_{1}-E_{0} and E3E2E_{3}-E_{2}) associated with the superexchange interaction JJ, which scales to 4t2/U4t^{2}/U in the strong coupling limit. Up to the 2 sites, the exact ground and excited states of the Hubbard model are represented by the corresponding ground and excited states of the TCFM, respectively by optimizing the variational parameters given by Eqs.(8) and (39), namely, μc~,μd~,Λ,tc~\mu_{\tilde{c}},\mu_{\tilde{d}},\Lambda,t_{\tilde{c}} and td~t_{\tilde{d}}.

This suggests a potential to represent NsN_{s}-site Hubbard models defined by

\displaystyle{\cal H} =\displaystyle= t+U,\displaystyle{\cal H}_{t}+{\cal H}_{U}, (50)
t\displaystyle{\cal H}_{t} =\displaystyle= i,j,σ[t(ciσcjσ+H.c.+μiNsni,σ],\displaystyle\sum_{\langle i,j\rangle,\sigma}[-t(c_{i\sigma}^{\dagger}c_{j\sigma}+{\rm H.c.}+\mu\sum_{i}^{N_{s}}n_{i,\sigma}], (51)
U\displaystyle{\cal H}_{U} =\displaystyle= UiNsnini\displaystyle U\sum_{i}^{N_{s}}n_{i\uparrow}n_{i\downarrow} (52)

for any system size NsN_{s} by a quantum neural network along this line. We will discuss the representability of the wave function later.

In principle, by introducing the self-energies of the visible electron cc through the effect of the hidden fermions dd, it was shown that the spectral function and energy spectra of any interacting system can be represented formally but exactly by the hierarchy of the self-energy structure, which represents the energy eigenvalues of the Hamiltonian by poles and zeros of Green’s function through the continued fraction expansion [34, 35, 36] as

G(q,ω)\displaystyle G(q,\omega) =\displaystyle= 1ωϵc(q)Σ1(q,ω),\displaystyle\frac{1}{\omega-\epsilon_{c}(q)-\Sigma_{1}(q,\omega)}, (53)
Σ1(q,ω)\displaystyle\Sigma_{1}(q,\omega) =\displaystyle= η1(q)ωϵ1(q)Σ2(q,ω),\displaystyle\frac{\eta_{1}(q)}{\omega-\epsilon_{1}(q)-\Sigma_{2}(q,\omega)},
Σ2(q,ω)\displaystyle\Sigma_{2}(q,\omega) =\displaystyle= η2(q)ωϵ2(q)Σ3(q,ω).\displaystyle\frac{\eta_{2}(q)}{\omega-\epsilon_{2}(q)-\Sigma_{3}(q,\omega)}.
\displaystyle\cdots

Here we assume the translational symmetry of the Hamiltonian parameters to allow the momentum representation. This can be achieved by formally tracing the equation of motion for the cc operator in the Heisenberg representation as

idc(t)dt\displaystyle i\hbar\frac{dc(t)}{dt} =\displaystyle= [c,],\displaystyle[c,{\cal H}], (54)

for any Hamiltonian \cal H and [A,B]ABBA[A,B]\equiv AB-BA.

Therefore, one can expect that the ground state of the Hubbard model (or any interacting lattice fermions) can be mapped to an optimized non-interacting multi-component fermion model (MCFM) represented by

MCFM\displaystyle{\cal H}_{\rm MCFM} =\displaystyle= (c~)+(c~d~)+(d~),\displaystyle{\cal H}^{(\tilde{c})}+{\cal H}^{(\tilde{c}\tilde{d})}+{\cal H}^{(\tilde{d})}, (55)
(c~)\displaystyle{\cal H}^{(\tilde{c})} =\displaystyle= i,j,σ(tc~,i,j)(c~i,σc~j,σ+H.c.)+i,σμc~,inc~,i,σ,\displaystyle\sum_{\langle i,j\rangle,\sigma}(-t_{\tilde{c},i,j})(\tilde{c}_{i,\sigma}^{\dagger}\tilde{c}_{j,\sigma}+{\rm H.c.})+\sum_{i,\sigma}\mu_{\tilde{c},i}n_{\tilde{c},i,\sigma},
(c~d~)\displaystyle{\cal H}^{(\tilde{c}\tilde{d})} =\displaystyle= Λσi(c~i,σd~i,σ(1)+H.c),\displaystyle\Lambda\sum_{\sigma}\sum_{i}(\tilde{c}_{i,\sigma}^{\dagger}\tilde{d}_{i,\sigma}^{(1)}+{\rm H.c}), (57)
(d~)\displaystyle{\cal H}^{(\tilde{d})} =\displaystyle= m=1M[i,jσ(td~,i,j(m))(d~i,σ(m)d~j,σ(m)+H.c.)\displaystyle\sum_{m=1}^{M}[\sum_{\langle i,j\rangle\sigma}(-t_{\tilde{d},i,j}^{(m)})(\tilde{d}_{i,\sigma}^{(m)\dagger}\tilde{d}_{j,\sigma}^{(m)}+{\rm H.c.}) (58)
+i,σμd~,i(m)d~i,σ(m)d~i,σ(m)]\displaystyle+\sum_{i,\sigma}\mu_{\tilde{d},i}^{(m)}\tilde{d}_{i,\sigma}^{(m)\dagger}\tilde{d}_{i,\sigma}^{(m)}]
+\displaystyle+ m=1M1Λ(m)i,σ(d~i,σ(m)d~i,σ(m+1)+H.c.),\displaystyle\sum_{m=1}^{M-1}\Lambda^{(m)}\sum_{i,\sigma}(\tilde{d}_{i,\sigma}^{(m)\dagger}\tilde{d}_{i,\sigma}^{(m+1)}+{\rm H.c.}),

as is illustrated in Fig. 1, because the Green’s function of Eq. (55) is given by the hierarchical structure given by Eq.(53) with the correspondence

ϵc(q)\displaystyle\epsilon_{c}(q) =\displaystyle= 1Ns,jeiq(j)(tc~,,j)+μc~,\displaystyle\frac{1}{N_{s}}\sum_{\langle\ell,j\rangle}e^{iq(\ell-j)}(-t_{\tilde{c},\ell,j})+\mu_{\tilde{c}},
η1(q)\displaystyle\eta_{1}(q) =\displaystyle= Λ,\displaystyle\Lambda,
ϵm(q)\displaystyle\epsilon_{m}(q) =\displaystyle= 1Ns,jeiqj)(td~,,j(m))+μd~,i(m),\displaystyle\frac{1}{N_{s}}\sum_{\langle\ell,j\rangle}e^{iq\ell-j)}(-t_{\tilde{d},\ell,j}^{(m)})+\mu_{\tilde{d},i}^{(m)},
ηm(q)\displaystyle\eta_{m}(q) =\displaystyle= Λ(m),\displaystyle\Lambda^{(m)}, (59)

if the chemical potential is spatially uniform.

Refer to caption
Figure 1: (Color online): Illustration of the architecture of Fermi machine network based on the MCFM compared to the Hubbard model.

The algorithm of the present quantum neural network to represent the ground state of the Hubbard model is the following:

  • We diagonalize the noninteracting MCFM Hamiltonian Eq. (55) and obtain the ground state by filling the low-energy fermions up to the Fermi level for a given set of parameters in Eqs. (LABEL:eq:MCFM_Hc)-(58).

  • Next we replace the hidden fermions d~\tilde{d} by using the rule Eq.(2) together with similar correspondence such as d~i,σ(m)=d~i,σ(m1)(12nd~(m1),i,σ)\tilde{d}_{i,\sigma}^{(m)}=\tilde{d}_{i,\sigma}^{(m-1)}(1-2n_{\tilde{d}^{(m-1)},i,-\sigma}) where nd~(m),i,σ=d~i,σ(m)d~i,σ(m)n_{\tilde{d}^{(m)},i,-\sigma}=\tilde{d}_{i,-\sigma}^{(m)\dagger}\tilde{d}_{i,-\sigma}^{(m)}.

  • Then by using the relations such as those below Eq. (48) used to derive Eq. (48) and its extension to deep-layer hidden operators, one can represent the wave function for the Hubbard model |Φ0Hub|\Phi_{0}^{\rm Hub}\rangle as in the derivation of Eq. (48), from which one can easily calculate the matrix elements of the Hubbard model and the energy expectation value of the Hubbard model by

    E0==Φ0Hub||Φ0HubΦ0Hub|Φ0Hub\displaystyle\langle E_{0}\rangle=\langle{\cal H}\rangle=\frac{\langle\Phi_{0}^{\rm Hub}|{\cal H}|\Phi_{0}^{\rm Hub}\rangle}{\langle\Phi_{0}^{\rm Hub}|\Phi_{0}^{\rm Hub}\rangle} (60)

    using Monte Carlo sampling of the real space configuration of cc fermion occupation. Here \cal H is given by Eqs. (50)-(52). Note that t|Φ0Hub{\cal H}_{t}|\Phi_{0}^{\rm Hub}\rangle is not a single Slater determinant even when |Φ0Hub|\Phi_{0}^{\rm Hub}\rangle is so. An easy way to obtain the average is

    E0t=t=limτ01τlog[Φ0Hub|exp[τt]|Φ0HubΦ0Hub|Φ0Hub]\displaystyle\langle E_{0t}\rangle=\langle{\cal H}_{t}\rangle=\lim_{\tau\rightarrow 0}\frac{1}{\tau}\log\left[\frac{\langle\Phi_{0}^{\rm Hub}|\exp[\tau{\cal H}_{t}]|\Phi_{0}^{\rm Hub}\rangle}{\langle\Phi_{0}^{\rm Hub}|\Phi_{0}^{\rm Hub}\rangle}\right] (61)

    because exp[τt]|Ψ\exp[\tau{\cal H}_{t}]|\Psi\rangle remains a single Slater determinant when |Ψ|\Psi\rangle is a single Slater determinant [16].

  • Finally, by using the variational principle, the parameters in the MCFM Hamiltonian Eqs. (55)-(58) are regarded as the variational parameters and are optimized to lower the energy Eq. (60) by following the variational principle.

There exist a few useful relations to simplify the MCFM state:

d~i,σ(m)\displaystyle\tilde{d}_{i,\sigma}^{(m)} =\displaystyle= d~i,σ(m1)(12nd~(m1),i,σ)=\displaystyle\tilde{d}_{i,\sigma}^{(m-1)}(1-2n_{\tilde{d}^{(m-1)},i,-\sigma})=\cdots (62)
=\displaystyle= ci,σ(12nci,σ)m,\displaystyle c_{i,\sigma}(1-2n_{c_{i,-\sigma}})^{m},
nd~i,σ(m)\displaystyle n_{\tilde{d}_{i,\sigma}^{(m)}} =\displaystyle= nd~i,σ(m1)==nci,σ\displaystyle n_{\tilde{d}^{(m-1)}_{i,\sigma}}=\cdots=n_{c_{i,\sigma}} (63)

The ground state of the MCFM can be represented generally as

|Ψ0MCFM\displaystyle|\Psi_{0}^{\rm MCFM}\rangle =\displaystyle= (|Ψ0MCFM00|Ψ0MCFM),\displaystyle\begin{pmatrix}|\Psi_{0\ \uparrow}^{\rm MCFM}\rangle&0\\ 0&|\Psi_{0\ \downarrow}^{\rm MCFM}\rangle\end{pmatrix}, (64)
|Ψ0σMCFM\displaystyle|\Psi_{0\ \sigma}^{\rm MCFM}\rangle =\displaystyle= qkF|ΨσMCFM(q),\displaystyle\prod_{q}^{k_{\rm F}}|\Psi_{\sigma}^{\rm MCFM}(q)\rangle, (65)
|ΨσMCFM(q)\displaystyle|\Psi_{\sigma}^{\rm MCFM}(q)\rangle =\displaystyle= [α0,qc~q,σ+m=1Mαm,qd~m,q,σ]|0,\displaystyle[\alpha_{0,q}\tilde{c}_{q,\sigma}^{\dagger}+\sum_{m=1}^{M}\alpha_{m,q}\tilde{d}_{m,q,\sigma}^{\dagger}]|0\rangle, (66)

where qq is the momentum. Here, in the ground state of the MCFM, fermions occupy up to the fermi momentum kFk_{\rm F}. Since the MCFM is diagonal in the momentum space, the ground-state wave function in the kk space is trivial if the parameters in Eqs. (LABEL:eq:MCFM_Hc)-(58) are given. The coefficient αm,q\alpha_{m,q} (m=0,Mm=0,\cdots M) is determined from the eigenvector of the ground state, which can be easily calculated numerically.

This MCFM ground state is mapped to the variational form of the ground state of the Hubbard model as

|Ψ0Hub\displaystyle|\Psi_{0}^{\rm Hub}\rangle =\displaystyle= (|Ψ0Hub00|Ψ0Hub),\displaystyle\begin{pmatrix}|\Psi_{0\ \uparrow}^{\rm Hub}\rangle&0\\ 0&|\Psi_{0\ \downarrow}^{\rm Hub}\rangle\end{pmatrix}, (67)
|Ψ0σHub=qkF[m=0Mαm,qeiqNsc,σ(12n,σ)m]|0.|\Psi_{0\ \sigma}^{\rm Hub}\rangle=\prod_{q}^{k_{\rm F}}[\sum_{m=0}^{M}\alpha_{m,q}\sum_{\ell}\frac{e^{iq\ell}}{\sqrt{N_{s}}}c_{\ell,\sigma}^{\dagger}(1-2n_{\ell,-\sigma})^{m}]|0\rangle. (68)

It can be rewritten as

|Ψ0σHub\displaystyle|\Psi_{0\ \sigma}^{\rm Hub}\rangle =\displaystyle= qkF|ΨσHub(q),\displaystyle\prod_{q}^{k_{\rm F}}|\Psi_{\sigma}^{\rm Hub}(q)\rangle, (69)
|ΨσHub(q)=[m=0Mαm,qeiqNsc,σ()mn,σ]|0.|\Psi_{\sigma}^{\rm Hub}(q)\rangle=[\sum_{m=0}^{M}\alpha_{m,q}\sum_{\ell}\frac{e^{iq\ell}}{\sqrt{N_{s}}}c_{\ell,\sigma}^{\dagger}(-)^{mn_{\ell,-\sigma}}]|0\rangle. (70)

It should be noted that |ΨσHub(q)|\Psi_{\sigma}^{\rm Hub}(q)\rangle contains electron operators cp,σc^{\dagger}_{p,\sigma} with not only the momentum qq but also all the momentum pp in the Brillouin zone in the momentum representation, which is a consequence of the interference with the spin σ-\sigma electron at the \ell-th site in Eq.(70) reflecting the scattering of σ\sigma and σ-\sigma electrons at the \ell-th site in the original Hubbard model. Indeed, intuitively, the minus factor for the odd mm in Eq. (70) represents the reduction of the weight of c,σc_{\ell,\sigma}^{\dagger} electron if σ-\sigma electron exists at the \ell-th site to punish the double occupation leading to the entanglement of spin up and down electrons. For instance, in the large UU limit, the corresponding Λ\Lambda\rightarrow-\infty makes the perfect exclusion of the double occupation through Eq. (70). The representability of this wave function is discussed in Appendix.

To numerically optimize the variational parameters in the MCFM (namely the parameters in Eqs. (55)-(58)), we need to estimate the ground-state energy using the obtained wave function through Eq. (60), by the average of the inserted sample |x|x\rangle as

E0\displaystyle E_{0} =\displaystyle= =xρ(x)F(x),\displaystyle\langle{\cal H}\rangle=\sum_{x}\rho(x)F(x), (71)
ρ(x)\displaystyle\rho(x) =\displaystyle= |x|Ψ0Hub|2/Ψ0Hub|Ψ0Hub,\displaystyle|\langle x|\Psi_{0}^{\rm Hub}\rangle|^{2}/\langle\Psi_{0}^{\rm Hub}|\Psi_{0}^{\rm Hub}\rangle, (72)
F(x)\displaystyle F(x) =\displaystyle= xΨ0Hub|xΨ0Hub|xx||x.\displaystyle\sum_{x^{\prime}}\frac{\langle\Psi_{0}^{\rm Hub}|x^{\prime}\rangle}{\langle\Psi_{0}^{\rm Hub}|x\rangle}\langle x^{\prime}|{\cal H}|x\rangle. (73)

Here, x|t|x\langle x^{\prime}|{\cal H}_{t}|x\rangle can be calculated in the same way as Eq. (61).

Hereafter, let us consider a NsN_{s}-site system with N/2N/2 up spin and down spin electrons each. (Extension to the case of an arbitrary number of up and down spin electrons, NN_{\uparrow}, and NN_{\downarrow}, respectively, is easy and straightforward.) The Monte Carlo sampling can be performed by taking ρ(x)\rho(x) as the probability to generate samples by following the importance sampling and the simple average of F(x)F(x) over sampling gives us the estimate of E0E_{0}. Here, |x|x\rangle is a sample of real space configuration (simple product state), such as (0,,,,,0,)(0,\uparrow,\uparrow,\uparrow\downarrow,\downarrow,0,\cdots), namely

|x\displaystyle|x\rangle =\displaystyle= N/2c,N/2c,|0,\displaystyle\prod_{\ell}^{N/2}c_{\ell,\uparrow}^{\dagger}\prod_{\ell^{\prime}}^{N/2}c_{\ell^{\prime},\downarrow}^{\dagger}|0\rangle, (74)

where the sets of the real space coordinate {}\{\ell\} and {}\{\ell^{\prime}\} specify the positions of the up and down spin electrons, respectively in the given sample. Since \cal H is a sparse matrix by assuming the system with the short-ranged hoppings and interactions, and has nonzero matrix element only in their ranges, the summation over |x|x^{\prime}\rangle can be explicitly taken for each |x|x\rangle.

To enhance the representability of the wave function, one can optionally introduce the “Fermi distribution factor” ff on top of Eq.(68) (or Eqs.(69) and (70)) similarly to the finite-temperature Boltzmann factor in the classical Boltzmann machine. Namely, in this scheme, |Ψ0σHub|\Psi_{0\ \sigma}^{\rm Hub}\rangle is modified and replaced by

|Ψ0σHub\displaystyle|\Psi_{0\ \sigma}^{\rm Hub}\rangle =\displaystyle= qf(β,q)|ΨσHub(q),\displaystyle\prod_{q}f(\beta,q)|\Psi_{\sigma}^{\rm Hub}(q)\rangle, (75)
f(β,q)\displaystyle f(\beta,q) =\displaystyle= 11+eβ(q),\displaystyle\frac{1}{1+e^{\beta{\cal E}(q)}}, (76)
(q)\displaystyle{\cal E}(q) =\displaystyle= σΨσMCFM(q)|MCFM|ΨσMCFM(q)ΨσMCFM(q)|ΨσMCFM(q),\displaystyle\sum_{\sigma}\frac{\langle\Psi_{\sigma}^{\rm MCFM}(q)|{\cal H}^{\rm MCFM}|\Psi_{\sigma}^{\rm MCFM}(q)\rangle}{\langle\Psi_{\sigma}^{\rm MCFM}(q)|\Psi_{\sigma}^{\rm MCFM}(q)\rangle},
|ΨσMCFM(q)\displaystyle|\Psi_{\sigma}^{\rm MCFM}(q)\rangle =\displaystyle= σ[α0,qc~q,σ+m=1Mαm,qd~m,q,σ]|0,\displaystyle\prod_{\sigma}[\alpha_{0,q}\tilde{c}_{q,\sigma}^{\dagger}+\sum_{m=1}^{M}\alpha_{m,q}\tilde{d}_{m,q,\sigma}^{\dagger}]|0\rangle, (77)

where β\beta is an additional variational parameter, |ΨσMCFM(q)|\Psi_{\sigma}^{\rm MCFM}(q)\rangle is the eigenstate of Eq.(55) with the momentum qq and |ΨσHub(q)|\Psi_{\sigma}^{\rm Hub}(q)\rangle is the same as Eq.(70). The product with respect to qq is over the full Brillouin zone for Eq. (75). If we take β\beta\rightarrow\infty, it is reduced to the original algorithm using Eqs.(69) and (70). Therefore this Fermi machine extension certainly enhances the representability with a strategy conceptually similar to the Boltzmann machine [30, 31] but by using a substantially different quantum algorithm.

In the practical calculation, it is useful to represent in the matrix form: |Ψ0σHub|\Psi_{0\ \sigma}^{\rm Hub}\rangle is represented by a Ns×N/2N_{s}\times N/2 matrix 𝒫σ{\cal P}_{\sigma}, where from Eq.(70), the matrix element is given by

(𝒫σ),k\displaystyle({\cal P}_{\sigma})_{\ell,k} =\displaystyle= m=0Mαm,q(k,σ)eiq(k,σ)()mn,σ.\displaystyle\sum_{m=0}^{M}\alpha_{m,q(k,\sigma)}e^{iq(k,\sigma)\ell}(-)^{mn_{\ell,-\sigma}}. (78)

Here, q(k,σ)q(k,\sigma) is the momentum of the kkth occupied electron with spin σ\sigma. The matrix representation of the sample |x𝒳|x\rangle\rightarrow{\cal X} is

(𝒳σ),=nNsiN/2δ,nδ,i\displaystyle({\cal X}_{\sigma})_{\ell,\ell^{\prime}}=\sum_{n}^{N_{s}}\sum_{i}^{N/2}\delta_{\ell,n}\delta_{\ell^{\prime},i} (79)

if the iith spin-σ{\sigma} electron is at the site nn in the given sample.

Since x||x\langle x^{\prime}|{\cal H}|x\rangle is easily calculated in Eq.(73), here we discuss how one can calculate Ψ0Hub|x\langle\Psi_{0}^{\rm Hub}|x^{\prime}\rangle and x|Ψ0Hub\langle x|\Psi_{0}^{\rm Hub}\rangle in the matrix representation. By defining 2Ns×N2N_{s}\times N matrices

𝒫\displaystyle{\cal P} =\displaystyle= (𝒫00𝒫)\displaystyle\begin{pmatrix}{\cal P}_{\uparrow}&0\\ 0&{\cal P}_{\downarrow}\end{pmatrix} (80)

and

𝒳\displaystyle{\cal X} =\displaystyle= (𝒳00𝒳),\displaystyle\begin{pmatrix}{\cal X}_{\uparrow}&0\\ 0&{\cal X}_{\downarrow}\end{pmatrix}, (81)

we can calculate x|Ψ0Hub\langle x|\Psi_{0}^{\rm Hub}\rangle from the determinant of the matrix product 𝒳T𝒫{}^{T}{\cal X}{\cal P} represented by a N×NN\times N matrix, where 𝒳T{}^{T}{\cal X} is the transpose of 𝒳\cal X. In Eq.(78), we need to be careful about the dependence on n,σn_{\ell,-\sigma}. Since n,σ=k(𝒳σ),kn_{\ell,-\sigma}=\sum_{k}({\cal X}_{-\sigma})_{\ell,k}, 𝒫σ{\cal P}_{\sigma} depends on 𝒳σ{\cal X}_{-\sigma}, but it can be easily numerically calculated, because practically, |Ψ0Hub|\Psi_{0}^{\rm Hub}\rangle appears only in the inner product with x|\langle x|.

In this way, E0E_{0} can be calculated by Monte Carlo sampling of |x|x\rangle as xF(x)/Nsample\sum_{x}F(x)/N_{\rm sample} for the important sampling generated with the probability w|x|Ψ0Hub|2w\propto|\langle x|\Psi_{0}^{\rm Hub}\rangle|^{2}. Here, NsampleN_{\rm sample} is the number of the samples and

F(x)=xdet[T𝒫𝒳T𝒳𝒳]det[T𝒫𝒳]F(x)=\sum_{x^{\prime}}\frac{{\rm det}[^{T}{\cal PX^{\prime}}^{T}{\cal X^{\prime}H}{\cal X}]}{{\rm det}[^{T}{\cal PX}]} (82)

is estimated from the ratio of two determinants of N×NN\times N matrices. Then the variational parameters in Eqs. (55)-(58) are optimized to lower E0E_{0}.

In the matrix representation, |ΨσMCFM(q)|\Psi_{\sigma}^{\rm MCFM}(q)\rangle and |ΨσHub(q)|\Psi_{\sigma}^{\rm Hub}(q)\rangle are NsN_{s}-component vectors and for instance, ΨσMCFM(q)|MCFM|ΨσMCFM(q)\langle\Psi_{\sigma}^{\rm MCFM}(q)|{\cal H}^{\rm MCFM}|\Psi_{\sigma}^{\rm MCFM}(q)\rangle is a scalar for diagonalized Ns×NsN_{s}\times N_{s} matrix of MCFM{\cal H}^{\rm MCFM} in the momentum representation of Eqs. (55)-(58).

As a further extension for more accurate variational form, long-range part of quantum entanglement may be more efficiently incorporated by adding layers connected by nonlocal hybridization such as

nonlocal\displaystyle{\cal H}_{\rm nonlocal} =\displaystyle= nonlocal(d~)+nonlocal(c~d~),\displaystyle{\cal H}^{(\tilde{d})}_{\rm nonlocal}+{\cal H}^{(\tilde{c}\tilde{d})}_{\rm nonlocal}, (83)
nonlocal(d~)\displaystyle{\cal H}^{(\tilde{d})}_{\rm nonlocal} =\displaystyle= i,jσ(td~,i,j(M+1))(d~i,σ(M+1)d~j,σ(M+1)+H.c.)\displaystyle\sum_{\langle i,j\rangle\sigma}(-t_{\tilde{d},i,j}^{(M+1)})(\tilde{d}_{i,\sigma}^{(M+1)\dagger}\tilde{d}_{j,\sigma}^{(M+1)}+{\rm H.c.})
+\displaystyle+ i,σμd~,i(M+1)nd~,i,σ(M+1),\displaystyle\sum_{i,\sigma}\mu_{\tilde{d},i}^{(M+1)}n_{\tilde{d},i,\sigma}^{(M+1)},
nonlocal(c~d~)\displaystyle{\cal H}^{(\tilde{c}\tilde{d})}_{\rm nonlocal} =\displaystyle= σ,σi,j(Λσ,σ(M)(ji)c~i,σd~j,σ(M+1)+H.c)\displaystyle\sum_{\sigma,\sigma^{\prime}}\sum_{i,j}(\Lambda^{(M)}_{\sigma,\sigma^{\prime}}(j-i)\tilde{c}_{i,\sigma}^{\dagger}\tilde{d}_{j,\sigma^{\prime}}^{(M+1)}+{\rm H.c}) (84)
=\displaystyle= σ,σk(Λσ,σ(M)(k)c~k,σd~k,σ(M+1))\displaystyle\sum_{\sigma,\sigma^{\prime}}\sum_{k}(\Lambda^{(M)}_{\sigma,\sigma^{\prime}}(k)\tilde{c}_{k,\sigma}^{\dagger}\tilde{d}_{k,\sigma^{\prime}}^{(M+1)})

as is illustrated in Fig. 2. Here, the spin dependence is taken to satisfy the spin rotational symmetry. To make the correspondence between d~(M+1)\tilde{d}^{(M+1)} and cc, one may generalize Eq. (2) or more concretely, d~j,σ(M+1)i,σΘσ,σ(ji)ci,σ(12nj,σ)\tilde{d}_{j,\sigma^{\prime}}^{(M+1)}\leftrightarrow\sum_{i,\sigma}\Theta_{\sigma^{\prime},\sigma}(j-i)c_{i,\sigma}(1-2n_{j,\sigma^{\prime}}), which leads to the replacement of Eq. (70) with

|ΨσHub(q)\displaystyle|\Psi_{\sigma}^{\rm Hub}(q)\rangle =\displaystyle= [m=0Mαm,qeiqNsc,σ()mn,σ\displaystyle[\sum_{m=0}^{M}\alpha_{m,q}\sum_{\ell}\frac{e^{iq\ell}}{\sqrt{N_{s}}}c_{\ell,\sigma}^{\dagger}(-)^{mn_{\ell,-\sigma}}
+\displaystyle+ αM+1,q,,σeiqNsΘσ,σ()c,σ()n,σ]|0,\displaystyle\alpha_{M+1,q}\sum_{\ell,\ell^{\prime},\sigma^{\prime}}\frac{e^{iq\ell^{\prime}}}{\sqrt{N_{s}}}\Theta_{\sigma^{\prime},\sigma}(\ell^{\prime}-\ell)c_{\ell,\sigma}^{\dagger}(-)^{n_{\ell^{\prime},\sigma^{\prime}}}]|0\rangle,
(85)

which means that the spin dependent Θσ,σ()\Theta_{\sigma,\sigma^{\prime}}(\ell^{\prime}-\ell) contains the spin parallel (σ=σ\sigma=\sigma^{\prime}) and antiparallel (σ=σ\sigma=-\sigma^{\prime}) components. Θσ,σ()\Theta_{\sigma,\sigma^{\prime}}(\ell^{\prime}-\ell) is regarded as additional variational parameters. The present extension explicitly allows dependence of the weight of the wave function at the \ellth site with spin σ\sigma on the occupation of parallel (σ\sigma) or antiparallel (σ-\sigma) component of the fermion at the \ell^{\prime}th site, which plays a role similar to Jastrow factor in the conventional variational Monte Carlo or nonlocal coupling between the visible (physical) and hidden variables in the Boltzmann machine. However, the present nonlocal hybridization explicitly introduces spatially extended quantum entanglement between up- and down-spin electrons. In addition, the present scheme does not break the SU(2) symmetry in contrast to the spin Jastrow factor in the variational Monte Carlo and the Boltzmann machine. This is because the amplitude of |ΨσHub(q)|\Psi_{\sigma}^{\rm Hub}(q)\rangle for the component of the up spin at the iith site and the down spin at the jjth site has the same value with the opposite configuration and the matrix element of spin diagonal component ΨσHub(q)|SizSjz|ΨσHub(q)\langle\Psi_{\sigma}^{\rm Hub}(q)|S_{i}^{z}S_{j}^{z}|\Psi_{\sigma}^{\rm Hub}(q)\rangle becomes the same as the off-diagonal one ΨσHub(q)|Si+Sj|ΨσHub(q)=ΨσHub(q)|SiSj+|ΨσHub(q)\langle\Psi_{\sigma}^{\rm Hub}(q)|S_{i}^{+}S_{j}^{-}|\Psi_{\sigma}^{\rm Hub}(q)\rangle=\langle\Psi_{\sigma}^{\rm Hub}(q)|S_{i}^{-}S_{j}^{+}|\Psi_{\sigma}^{\rm Hub}(q)\rangle. This nonlocal hybridization part may also be extended to deeper layers of M+3M+3rd, M+4M+4th \cdots up to M+MM+M^{\prime}th layer if necessary.

In this extension, Eqs. (62) and (63) become more complicated, because of the nonlocality but may enable the mutual orthogonality of d~i,σ(m)\tilde{d}_{i,\sigma}^{(m)} in the Hubbard terminology expected for Eq.(53). Then, if we include the kk-independent component of Λ\Lambda in Eq. (84), the local part represented by the first term of the r.h.s. of Eq. (85) can be absorbed, but still only the antiparallel spin combination is relevant for that kk-independent component. In this sense, further generalized simpler architecture would be represented by the following unified form instead of Eqs. (70) and (85):

|ΨσHub(q)\displaystyle|\Psi_{\sigma}^{\rm Hub}(q)\rangle =\displaystyle= m=0Mαm,q,,σeiqNsΘσ,σ(m)()c,σ\displaystyle\sum_{m=0}^{M}\alpha_{m,q}\sum_{\ell,\ell^{\prime},\sigma^{\prime}}\frac{e^{iq\ell^{\prime}}}{\sqrt{N_{s}}}\Theta_{\sigma^{\prime},\sigma}^{(m)}(\ell^{\prime}-\ell)c_{\ell,\sigma}^{\dagger} (86)
×()mn,σ|0.\displaystyle\times(-)^{mn_{\ell^{\prime},\sigma^{\prime}}}|0\rangle.

Alternatively, one can extend in the form

eiqNsd~,σ(m)\displaystyle\frac{e^{iq\ell}}{\sqrt{N_{s}}}\tilde{d}^{(m)\dagger}_{\ell,\sigma} \displaystyle\rightarrow ,σeiqNsΘσ,σ(m)(q,)d,σ(m1)\displaystyle\sum_{\ell^{\prime},\sigma^{\prime}}\frac{e^{iq\ell^{\prime}}}{\sqrt{N_{s}}}\Theta_{\sigma^{\prime},\sigma}^{(m)}(q,\ell^{\prime}-\ell)d^{(m-1)\dagger}_{\ell,\sigma}
×()nd~(m1),,σ\displaystyle\times(-)^{n_{\tilde{d}^{(m-1)},\ell^{\prime},\sigma^{\prime}}} (87)
\displaystyle\rightarrow Ξσ(m)(q,)c,σ,\displaystyle\Xi^{(m)}_{\sigma}(q,\ell)c_{\ell,\sigma}^{\dagger}, (88)
Ξσ(m)(q,)\displaystyle\Xi^{(m)}_{\sigma}(q,\ell) =\displaystyle= n=1m[n,σneiqnNsΘσn,σ(mn+1)(q,n)\displaystyle\prod_{n=1}^{m}\large[\sum_{\ell_{n},\sigma_{n}}\frac{e^{iq\ell_{n}}}{\sqrt{N_{s}}}\Theta_{\sigma_{n},\sigma}^{(m-n+1)}(q,\ell_{n}-\ell) (89)
×()nn,σn],\displaystyle\times(-)^{n_{\ell_{n},\sigma_{n}}}\large],
Ξσ(0)()\displaystyle\Xi^{(0)}_{\sigma}(\ell) =\displaystyle= eiqNs,\displaystyle\frac{e^{iq\ell}}{\sqrt{N_{s}}}, (90)

which leads to

|ΨσHub(q)\displaystyle|\Psi_{\sigma}^{\rm Hub}(q)\rangle =\displaystyle= m=0Mαm,qΞσ(m)(q,)c,σ|0.\displaystyle\sum_{m=0}^{M}\alpha_{m,q}\sum_{\ell}\Xi_{\sigma}^{(m)}(q,\ell)c_{\ell,\sigma}^{\dagger}|0\rangle. (91)
Refer to caption
Figure 2: (Color online): Illustration of the architecture of Fermi machine network extended from Fig. 1 by introducing momentum dependent (spatially extended) hybridization.

The variational optimization procedure is summarized as

  • For given parameters in Eqs. (55)-(58), determine αm,k\alpha_{m,k} for the ground state of the MCFM and then |Ψ0Hub|\Psi_{0}^{\rm Hub}\rangle in the form of Eqs. (67) and (69) with Eqs. (70), (85), (86) or (91).

  • Calculate x|Ψ0Hub\langle x|\Psi_{0}^{\rm Hub}\rangle for a Monte Carlo sample given by a real space configuration |x|x\rangle in the matrix representation detT𝒳𝒫{\rm det}^{T}{\cal X}{\cal P}.

  • Continue the Monte Carlo sampling with the importance sampling weight ρ(x)=|[x|Ψ0Hub]|2/Ψ0Hub|Ψ0Hub\rho(x)=|[\langle x|\Psi_{0}^{\rm Hub}\rangle]|^{2}/\langle\Psi_{0}^{\rm Hub}|\Psi_{0}^{\rm Hub}\rangle proportional to |det[T𝒳𝒫]|2|{\rm det}[^{T}{\cal XP}]|^{2} and calculate the energy expectation value

    E0\displaystyle E_{0} =\displaystyle= 1NsamplexF(x)\displaystyle\frac{1}{N_{\rm sample}}\sum_{x}F(x) (92)
    =\displaystyle= 1Nsamplex,xdet[T𝒫𝒳T𝒳𝒳]det[T𝒫𝒳].\displaystyle\frac{1}{N_{\rm sample}}\sum_{x,x^{\prime}}\frac{{\rm det}[^{T}{\cal PX^{\prime}}^{T}{\cal X^{\prime}H}{\cal X}]}{{\rm det}[^{T}{\cal PX}]}.
  • Optimize the variational parameters in Eqs. (LABEL:eq:MCFM_Hc)-(58) to lower E0E_{0} until convergence. For the optimization, the natural gradient [37, 38] (stochastic reconfiguration [39]) can be used.

So far, we have restricted the formulation to systems with translational invariance so that the momentum is a good quantum number. However, it can be easily extended to non-invariant cases such as systems under site-dependent random potential, interaction and/or bond-random hopping. In these cases, Eq. (65) is replaced by

|Ψ0σMCFM\displaystyle|\Psi_{0\ \sigma}^{\rm MCFM}\rangle =\displaystyle= kkF|ΨσMCFM(k),\displaystyle\prod_{k}^{k_{\rm F}}|\Psi_{\sigma}^{\rm MCFM}(k)\rangle, (93)

where kk is not the momentum any more and represents the kk-th lowest eigenstate of the MCFM Hamiltonian containing randomness, which can be obtained from the diagonalization of the MCFM Hamiltonian giving the amplitude of the eigenfunction represented by the component αm,,k\alpha_{m,\ell,k} for the site \ell in the mm-th layer. Then, Eq. (66) is replaced with

|ΨσMCFM(k)=[α0,,kc~,σ+m=1Mαm,,kd~m,,σ]|0,|\Psi_{\sigma}^{\rm MCFM}(k)\rangle=[\alpha_{0,\ell,k}\tilde{c}_{\ell,\sigma}^{\dagger}+\sum_{m=1}^{M}\alpha_{m,\ell,k}\tilde{d}_{m,\ell,\sigma}^{\dagger}]|0\rangle, (95)

and the corresponding Hubbard state (68) is modified to

|Ψ0σHub=kkF[m=0Mαm,,kc,σ(12n,σ)m]|0.\displaystyle|\Psi_{0\ \sigma}^{\rm Hub}\rangle=\prod_{k}^{k_{\rm F}}[\sum_{m=0}^{M}\alpha_{m,\ell,k}c_{\ell,\sigma}^{\dagger}(1-2n_{\ell,-\sigma})^{m}]|0\rangle. (96)

Further refinements discussed above may also be applied as well. As a simple starting point one may impose a constraint of the “quenched randomness” for the construction of the MCFM, which means that the MCFM paramters are taken not to depend on the layer index for the same site or bond.

5 Benchmark for 4-site Hubbard model

Here, the validity of the present formalism is tested in the 4-site Hubbard model at half filling and with hole doping. The MCFM results are compared with the exact ground state given by the numerical diagonalization. To make the parameters as simple as possible, we do not introduce the deep layers beyond the first hidden layer and the interlayer coupling Λ(m)\Lambda^{(m)} with m2m\geq 2 are switched off in this study. However, the nonlocal hybridization is taken into account in Sec. 5.2. The parameter β\beta in Eq. (76) is also taken to be \infty so that the Fermi distribution is taken at zero temperature.

Table 1: Comparison of estimated ground-state energies per site E0E_{0} of the half-filled 4-site Hubbard model between the exact diagonalization and Fermi machine with list of examples of the optimized variational parameters. Common (hyper)parameters are M=1M=1, M=0M^{\prime}=0, and tc~=1t_{\tilde{c}}=1.
exact E0E_{0} E0E_{0} by Fermi machine td~t_{\tilde{d}} Λ\Lambda μc~\mu_{\tilde{c}} μd~\mu_{\tilde{d}}
U=4U=4 -2.10275 -2.10275 1.0 -1.002 1.002 4.644
U=8U=8 -1.32023 -1.32023 1.0 -2.238 2.238 7.229
-1.32023 -1.32023 2.0 -3.330 3.330 10.731
Table 2: Comparison of estimated ground-state energies per site E0E_{0} of the hole doped 4-site Hubbard model between the exact diagonalization and Fermi machine with list of an example of optimized variational parameters for U=8,t=1U=8,t=1 with 1 up and down doped hole each. Parameters not listed in the table are M=1M=1, M=1M^{\prime}=1, tc~=td~=1t_{\tilde{c}}=t_{\tilde{d}}=1, td~,i,j,σ(M+1)=0t_{\tilde{d},i,j,\sigma}^{(M+1)}=0, and μd~,i(M+1)=μd~,i(M+1)=8.0\mu_{\tilde{d},i}^{(M+1)}=\mu_{\tilde{d},i}^{(M+1)}=8.0. The nonlocal hybridization is taken in a simple form as Λσ,σ(M)[k]=λσ,σ(M)cos[k]\Lambda^{(M)}_{\sigma,\sigma^{\prime}}[k]=\lambda^{(M)}_{\sigma,\sigma^{\prime}}\cos[k] so that the interlayer hybridization is extended simply to the nearest neighbor pair of {i,j}\{i,j\} in Eq. (84). Accordngly, we take Θσ,σ(ij)=1\Theta_{\sigma,\sigma^{\prime}}(i-j)=1 only for the nearest neighbor pair of i,j{i,j} for any combination of σ\sigma and σ\sigma^{\prime} and zero otherwise in Eq. (85).
exact E0E_{0} E0E_{0} by Fermi machine Λ\Lambda μc~\mu_{\tilde{c}} μd~\mu_{\tilde{d}} λ,(M)=λ,(M)\lambda^{(M)}_{\uparrow,\uparrow}=\lambda^{(M)}_{\uparrow,\uparrow} λ,(M)=λ,(M)\lambda^{(M)}_{\uparrow,\downarrow}=\lambda^{(M)}_{\downarrow,\uparrow}
U=8U=8 -3.20775 -3.20775 -2.71408 2.71408 4.73336 -0.35912 -0.71824

5.1 Half-filled 4-site Hubbard model

The ground-state energies of the half-filled 4-site chain with the periodic boundary condition at t=1t=1 are compared between the present Fermi machine using the level of Eq. (70) and the exact energies of the Hubbard model at U=4U=4 and 8 in Table 1. The optimized parameters are not unique and are redundant, namely, many optimized parameters can equally reproduce the exact results. In Table 1, two examples are shown for U=8U=8. Here simple parameters similar to the 2-site case are chosen. This redundancy indicates the flexibility and representability of the present framework.

5.2 Doped Case

Table 2 shows the doped case (1 up and 1 down holes doped) for U=8U=8 and t=1t=1. Here, the nonlocal hybridization term is necessary to reproduce the exact result. Still small numbers of parameters are enough.

We note that the applicability of the present method does not have the limitation for the range of the Hamiltonian parameters including 0U/t0\leq U/t\leq\infty as is clarified in the atomic limit, the two-site system and the numerical 4-site results.

6 Discussion, Summary and Outlook

In the conventional variational Monte Carlo method, the fermion Slater determinant or Pfaffian is used as the starting point [21]. A crucial difference of the present Fermi machine is the entanglement with the hidden fermion degrees of freedom hybridizing with the physical (visible) fermions in the physical Hubbard model, which allows the formation of the gap structure in the spectra without spontaneous symmetry breaking, generating zeros of the Green’s functions as well as poles as in the case of the genuine Mott insulator. This hybridization enables the flexible restructuring of the nodes of the wave function beyond the “single-particle electronic structure” representable by the visible (physical) electrons for electronic systems. Except for the starting trial state represented by the Slater determinant or Pfaffian, the deeper level of the architecture of the conventional variational wave functions in the literature has mostly classical nature such as the Gutzwiller and Jastrow factors, which are in marked contrast with the present approach, where the deeper levels of variables have fermionic nature and entangle with the physical (visible) fermions, efficiently optimizing the node structure. The Boltzmann machine has, in this regard, the same contrast because the hidden variable is the Ising classical spin and does not quantum mechanically entangle with the visible quantum variables. Recently proposed neural network [40] introduces an unusually constrained type of fermions as the hidden variables, whose wave function is determined solely from the real-space particle configuration of the visible fermions.

In the present work, the correspondence between strongly correlated fermions and multi-component noninteracting fermions is formulated. It offers an algorithm of quantum many-body solver. This Fermi machine variationaly approaches the ground state of correlated electrons by introducing dark (hidden) fermions hybridizing with the physical fermions, which substantiate the fractionalization of fermions emerging from the strong correlation effect, which has indeed been supported by the analyses of experimental results as well as the model studies of Mott insulators and high TcT_{c} superconductors in the literature [8, 12, 10, 13, 41].

The present correspondence establishes the relation between the “bulk” noninteracting multi-component fermions and the “edge” strongly correlated fermions. This has a conceptual similarity in other correspondence in physics such as the holographic correspondence in the AdS-CFT [42, 43, 44], the bulk-edge correspondence in topological materials [45, 46] and the mapping of dd-dimentional quantum systems to d+1d+1-dimensional classical ones in the path integral, though physical contents are substantially different. Further deep connections to these three frontiers are interesting future issues.

The efficiency of the present formalism for larger system sizes for a practical use is a future important subject to be pursued. For this purpose, efficient optimization of the variational parameters must be examined and tested, which is left for future studies.

In this paper, we have examined the ground state of the noninteracting MCFM represented by a Slater determinant. However, in the conventional variational Monte Carlo, pair-product wave functions instead of Slater determinants show higher accuracy [21]. It is an intriguing future issue, whether the mapping from ground states of the Hartree-Fock-Bogoiubov Hamiltonian, namely, a mean-field ground state of the symmetry broken Hamiltonian such as superconducting or magnetic mean-field order shows higher accuracy.

Acknowledgements The author is grateful to Wei-Lin Tu for useful comments. The author also thanks discussions with Filippo Vicentini, Ryui Kaneko and Shiro Sakai. This work is financially supported by MEXT KAKENHI, Grant-in-Aid for Transformative Research Area (GrantNo. JP22H05111 and No.JP22H05114). This work is also supported by MEXT as “Program for Promoting Researches on the Super computer Fugaku” (Simulation for basic science: approaching the new quantum era, Grant No. JPMXP1020230411).

Appendix A Repsentability of the ground state

In this Appendix we discuss the representability of the wave function by the present scheme. When we exchange the order of the product with respect to qq and the summation over mm in Eqs. (69) and (70), we obtain

|Ψ0σHub\displaystyle|\Psi_{0\ \sigma}^{\rm Hub}\rangle =\displaystyle= m=0M|Ψσ,mHub,\displaystyle\sum_{m=0}^{M}|\Psi_{\sigma,m}^{\rm Hub}\rangle, (97)
|Ψσ,mHub\displaystyle|\Psi_{\sigma,m}^{\rm Hub}\rangle =\displaystyle= qkF[αm,qeiqNsc,σ()mn,σ]|0.\displaystyle\prod_{q}^{k_{\rm F}}[\alpha_{m,q}\sum_{\ell}\frac{e^{iq\ell}}{\sqrt{N_{s}}}c_{\ell,\sigma}^{\dagger}(-)^{mn_{\ell,-\sigma}}]|0\rangle.

This can be interpreted as the representation by a linear combination of MM Slater determinants, where the mm-th one is |Ψσ,mHub|\Psi_{\sigma,m}^{\rm Hub}\rangle, aside from the entanglement of σ\sigma and σ-\sigma electrons contained in the odd mm terms. This exchange becomes justified in the following setup: By taking αm,q=0\alpha_{m,q}=0 for the odd mm, which is the case of μd~,i(m)=\mu_{\tilde{d},i}^{(m)}=\infty for odd mm, the even mm layers are indeed decoupled each other and each multi-particle state becomes a simple Slater determinant consisting of the single-layer component with even mm. Then if the Hamiltonian parameters of each even mmth layer is taken in such a way that the ground-state energy of each single-layer Slater determinant is degenerate with those of different even mmth layers, we may take any linear combination of them, generating a multi-Slater determinant. (Note that in this case, the weights of the linear combination are the variational parameters as well.) The construction of the MCFM Hamiltonian parameters to satisfy the degeneracy is of course possible, because the MCFM energy and occupied momenta at each layer can be freely tuned by taking appropriate dispersion at each layer. Since the complete set of the Hilbert space can be expanded by linear combinations of fermion determinants, this is a formal proof of the representability of the Hubbard model (more generally, any interacting lattice fermion system).

Of course, the number of Slater determinants for the complete representability increases rapidly with increasing system size if one follows this naive muli-determinant representation and this proof is just formal, while practically, the increase can be suppressed by a proper choice of the odd mm contribution as well as by the spatially off-diagonal hybridization, as is discussed from Eq.(83) through Eq. (91).

More importantly, Eq. (70) allows the sign change of the wave function dynamically when the opposite spin is occupied. This flexibly adjusts the nodal structure of the fermion wave function required for interacting systems.

References

  • [1] S.-i. Tomonaga, Prog. Theor. Phys. 5, 544 (1950).
  • [2] J. M. Luttinger, J. Math. Phys. 4, 1154 (1963).
  • [3] A. Heeger, S. Kivelson, J. R. Schrieffer, and W. P. Su, Rev. Mod. Phys. 89, 781 (1988).
  • [4] R. B. Laughlin, Phys. Rev. Lett. 50, 1395 (1983).
  • [5] F. D. M. Haldane, Phys. Rev. Lett. 61, 2015 (1988).
  • [6] S. Sachdev, Ann. Phys. (NY) 303, 226 (2003).
  • [7] G. Kotliar and A. E. Ruckenstein, Phys. Rev. Lett. 57, 1362 (1986).
  • [8] S. Sakai, M. Civelli, and M. Imada, Phys. Rev. Lett. 116, 057003 (2016).
  • [9] M. Imada and T. J. Suzuki, J. Phys. Soc. Jpn. 88, 024701 (2019).
  • [10] M. Imada, J. Phys. Soc. Jpn. 90, 074702 (2021).
  • [11] M. Imada, J. Phys. Soc. Jpn. 90, 111009 (2021).
  • [12] Y. Yamaji, T. Yoshida, A. Fujimori, and M. Imada, Phys. Rev. Res. 3, 043099 (2021).
  • [13] A. Singh, H. Y. Huang, J. D. Xie, J. Okamoto, C. T. Chen, T. Watanabe, A. Fujimori, M. Imada, and D. J. Huang, Nat. Commun. 13, 7906 (2022).
  • [14] R. Blankenbecler, D. J. Scalapino, and R. L. Sugar, Phys. Rev. D 24, 2278 (1981).
  • [15] S. Sorella, S. Baroni, R. Car, and M. Parrinello, Europhys. Lett. 8, 663 (1989).
  • [16] M. Imada and Y. Hatsugai, J. Phys. Soc. Jpn. 58, 3752 (1989).
  • [17] C. Gros, Phys. Rev. B 38, 931 (1988).
  • [18] H. Yokoyama and H. Shiba, J. Phys. Soc. Jpn. 57, 2482 (1988).
  • [19] S. Sorella, Phys. Rev. B 71, 241103 (2005).
  • [20] D. Tahara and M. Imada, J. Phys. Soc. Jpn. 77, 114701 (2008).
  • [21] T. Misawa, S. Morita, K. Yoshimi, M. Kawamura, Y. Motoyama, K. Ido, T. Ohgoe, M. Imada, and T. Kato, Computer Physics Communications 235, 447 (2019).
  • [22] S. Zhang, J. Carlson, and J. E. Gubernatis, Phys. Rev. B 55, 7464 (1997).
  • [23] S. R. White, Phys. Rev. Lett. 69, 2863 (1992).
  • [24] F. Verstraete, V. Murg, and J. Cirac, Advances in Physics 57, 143 (2008).
  • [25] R. Orús, Ann. Phys. 349, 117 (2014).
  • [26] Z. Y. Xie, J. Chen, M. P. Qin, J. W. Zhu, L. P. Yang, and T. Xiang, Phys. Rev. B 86, 045139 (2012).
  • [27] G. Knizia and G. K.-L. Chan, Phys. Rev. Lett. 109, 186404 (2012).
  • [28] W. Metzner and D. Vollhardt, Phys. Rev. Lett. 62, 324 (1989).
  • [29] A. Georges, G. Kotliar, W. Krauth, and M. J. Rozenberg, Rev. Mod. Phys. 68, 13 (1996).
  • [30] G. Carleo and M. Troyer, Science 355, 602 (2017).
  • [31] Y. Nomura, A. S. Darmawan, Y. Yamaji, and M. Imada, Phys. Rev. B 96, 205152 (2017).
  • [32] L. L. Viteritti, R. Rende, and F. Becca, Phys. Rev. Lett. 130, 236401 (2023).
  • [33] R. Rende, L. Viteritti, F. Becca, and S. Goldt, arXiv:2310.05715 .
  • [34] L. M. Roth, Phys. Rev. 184, 451 (1969).
  • [35] S. Onoda and M. Imada, Phys. Rev. B 67, 161102 (2003).
  • [36] S. Sakai, M. Civelli, and M. Imada, Phys. Rev. B 94, 115130 (2016).
  • [37] S.-i. Amari, Advances in Neural Information Processing Systems, 9, 127 ed. by M. C. Mozer, M. I. Jordan, Th. Petsche (Cambridge, MA: MIT Press. 1996)
  • [38] S.-i. Amari, Neural Computation 10, 251 (1998).
  • [39] S. Sorella, Phys. Rev. Lett. 80, 4558 (1998).
  • [40] J. R. Moreno, G. Carleo, A. Georges, and J. Stokes, Proceedings of the National Academy of Sciences 119, e2122059119 (2022).
  • [41] M. T. Schmid, J.-B. Morée, R. Kaneko, Y. Yamaji, and M. Imada, Phys. Rev. X 13, 041036 (2023).
  • [42] J. Maldacena, Int. J. Theor. Phys. 38, 1113 (1999).
  • [43] S. Hartnoll, A. Lucas, and S. Sachdev, Holographic Quantum Matter (MIT Press, The MIT Press, 2018).
  • [44] K. Hashimoto, S. Sugishita, A. Tanaka, and A. Tomiya, Phys. Rev. D 98, 046019 (2018).
  • [45] M. Z. Hasan and C. L. Kane, Rev. Mod. Phys. 82, 3045 (2010).
  • [46] Y. Ando, J. Phys. Soc. Jpn. 82, 102001 (2013).