Fermi Machine
— Quantum Many-Body Solver Derived from Correspondence
between Noninteracting and Strongly Correlated Fermions

Masatoshi Imada
Physics Division, Sophia University, Chiyoda, Tokyo 102-8554, Japan
Faculty of Engineering, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan

Abstract

Stimulated by the successful descriptions of strongly correlated electron systems by fractionalized fermions, correspondence between interacting fermions and non-interacting multi-component fermions is formulated in examples of the Hubbard model. The formalism enables constructions of the neural network for quantum many-body solvers represented by coupled noninteracting fermions. After showing the exact correspondences of 1- and 2-site Hubbard models to two-component noninteracting fermions, a numerical algorithm of the quantum machine learning for the Hubbard model is proposed. Benchmark for the 4-site systems is successfully presented and promising future directions as well as implications are discussed.

1 Introduction

Fractionalization of electrons in strongly correlated electron systems has been proposed in several different phenomena. One-dimensional itinerant electron systems exhibit the separation of spin and charge degrees of freedom [1, 2]. Models of polyacetylene accommodate spin and charge solitons with fractionalized charge as elementary excitations [3]. Fractional quantum Hall states under strong magnetic fields [4] as well as under emergent chiral symmetry breaking [5] show the fractionalization of electronic charge and fractional quantization. Slave bosons and fermions were proposed to approximately describe 2D systems as models of the cuprate high-$T_{c}$ superconductors [6, 7]. Recently, the fractionalization of an electron into two or multiple fermion components has been examined and has successfully accounted for otherwise puzzling spectroscopic experimental results of the cuprates [8, 9, 10, 11, 12, 13].

Meanwhile, constructing numerical methods that yield ground-state wave functions of strongly interacting quantum many-body systems is one of the grand challenges in physics. Various algorithms such as a variety of auxiliary-field quantum Monte Carlo [14, 15, 16], variational Monte Carlo [17, 18, 19, 20, 21, 22], density matrix renormalization group [23], tensor network [24, 25, 26], density matrix embedding [27] and dynamical mean-field theory [28, 29] have been proposed. Recently, neural networks based on the Boltzmann machine have established an accurate way to approximate ground states of quantum spin systems [30] as well as itinerant fermion systems [31]. The vision transformer, another neural-network architecture, has also shown state-of-the-art accuracy for quantum spin models [32, 33]. We note that these neural networks are constructed by introducing hidden variables that are described by classical degrees of freedom.

In this paper, we discuss mapping of the Hubbard model (or in more general, interacting lattice fermions) to noninteracting multi-component fermion models in the ground states that materializes the fractionalization, from which we propose a quantum machine learning algorithm aiming at an efficient quantum many-body solver. Here, we introduce a fermion system as the hidden part coupled to the physical system, instead of the classical one such as the Ising spins in the previous neural networks.

In Secs. 2 and 3, the equivalence between the Hubbard model and the two-component fermion model (TCFM) is shown for the atomic limit and for the 2-site systems, respectively. Based on this equivalence, we propose a quantum machine learning algorithm in Sec. 4, where the Hubbard model is approximated by a deep-layer representation of noninteracting multi-component fermion models. The couplings between the layers are represented by the hybridization of fermions belonging to neighboring layers. This method is viewed as an extension of the restricted Boltzmann-machine representation of the neural quantum states proposed by Carleo and Troyer [30] to a quantum neural network, where the Ising hidden variables in the Boltzmann machine are replaced by fermionic operators that allow hybridization to generate quantum entanglement. Furthermore, it is also regarded as an embodiment of the electron fractionalization [9, 10], which has succeeded in solving a number of experimental puzzles in the cuprate high-$T_{c}$ superconductors [8, 12, 10, 13]. This quantum neural network can also be viewed as a tractable realization of the ground states by a systematic expansion that takes account of correlation effects, similarly to the formulation by the equation of motion method or, equivalently, the continued fraction expansion of the Green's function for the excited states [34, 35, 36]. The present method does not seem to suffer from the exponentially increasing number of terms encountered in the conventional equation of motion method.

2 Hubbard model in the atomic limit

In this section, the result shown in Appendix A of Ref. \citen{Imada2019} is briefly summarized to keep the presentation self-contained. We consider the Hubbard $U$ term

$${\cal H}_{U}=Un_{\uparrow}n_{\downarrow}, \tag{1}$$

with $n_{\sigma}=c^{\dagger}_{\sigma}c_{\sigma}$ for the creation (annihilation) operators $c_{\sigma}^{\dagger}$ ( $c_{\sigma}$ ) of the spin $\sigma$ , and introduce an auxiliary fermion

$$\tilde{d}_{\sigma}=c_{\sigma}(1-2n_{-\sigma}), \tag{2}$$

together with

$$\tilde{c}_{\sigma}=c_{\sigma}. \tag{3}$$

Equation (1) can then be rewritten in a form of the TCFM as

$${\cal H}_{\rm TCFM}={\cal H}^{(\tilde{c})}+{\cal H}^{(\tilde{d})}+{\cal H}^{(\tilde{c}\tilde{d})}, \tag{4}$$
$${\cal H}^{(\tilde{c})}=\mu_{\tilde{c}}\tilde{c}_{\sigma}^{\dagger}\tilde{c}_{\sigma}, \tag{5}$$
$${\cal H}^{(\tilde{d})}=\mu_{\tilde{d}}\tilde{d}_{\sigma}^{\dagger}\tilde{d}_{\sigma}, \tag{6}$$
$${\cal H}^{(\tilde{c}\tilde{d})}=\Lambda(\tilde{c}_{\sigma}^{\dagger}\tilde{d}_{\sigma}+{\rm H.c.}), \tag{7}$$

with the mapping

$$\mu_{\tilde{c}}=\mu_{\tilde{d}}=-\Lambda=\frac{U}{2}. \tag{8}$$

For the derivation of Eq.(4) from Eq. (1), see below.

Note first that $\tilde{d}$ and $\tilde{c}$ satisfy the exact anticommutation relation in the ground state average,

$$\langle\tilde{c}_{\sigma}\tilde{c}_{\sigma}^{\dagger}+\tilde{c}_{\sigma}^{\dagger}\tilde{c}_{\sigma}\rangle=1, \tag{9}$$
$$\langle\tilde{d}_{\sigma}\tilde{d}_{\sigma}^{\dagger}+\tilde{d}_{\sigma}^{\dagger}\tilde{d}_{\sigma}\rangle=1, \tag{10}$$
$$\langle\tilde{c}_{\sigma}\tilde{d}_{\sigma}^{\dagger}+\tilde{d}_{\sigma}^{\dagger}\tilde{c}_{\sigma}\rangle=0, \tag{11}$$

where $\langle\cdots\rangle$ is defined by

$$\langle\cdots\rangle=\frac{\langle\Phi_{0\uparrow}^{(1)}|\cdots|\Phi_{0\uparrow}^{(1)}\rangle+\langle\Phi_{0\downarrow}^{(1)}|\cdots|\Phi_{0\downarrow}^{(1)}\rangle}{\langle\Phi_{0\uparrow}^{(1)}|\Phi_{0\uparrow}^{(1)}\rangle+\langle\Phi_{0\downarrow}^{(1)}|\Phi_{0\downarrow}^{(1)}\rangle}, \tag{12}$$

because of the ground-state degeneracy of the Kramers doublet, where the one-particle ground state is written, as we prove later, as

$$|\Phi_{0\sigma}^{(1)}\rangle=(\alpha_{c}\tilde{c}_{\sigma}^{\dagger}+\alpha_{d}\tilde{d}_{\sigma}^{\dagger})|0\rangle \tag{13}$$

with $\sigma$ being either $\uparrow$ or $\downarrow$ . (Note that the one-particle state must be degenerate between $|\Phi_{0\uparrow}^{(1)}\rangle$ and $|\Phi_{0\downarrow}^{(1)}\rangle$ .) In this sense, $\tilde{c}$ and $\tilde{d}$ behave as orthogonal fermions for single-particle excitations from the ground state, which is the degenerate ensemble of $|\Phi_{0\uparrow}^{(1)}\rangle$ and $|\Phi_{0\downarrow}^{(1)}\rangle$ .

By diagonalizing Eq.(4) for the case $\mu_{\tilde{c}}=\mu_{\tilde{d}}$ , one obtains for the spin $\sigma$ part

$${\cal H}_{\rm DTCFM}=\mu_{a}a_{\sigma}^{\dagger}a_{\sigma}+\mu_{b}b_{\sigma}^{\dagger}b_{\sigma}, \tag{14}$$
$$\mu_{b}=\frac{1}{2}(\mu_{\tilde{c}}+\mu_{\tilde{d}})+\Lambda, \tag{15}$$
$$\mu_{a}=\frac{1}{2}(\mu_{\tilde{c}}+\mu_{\tilde{d}})-\Lambda \tag{16}$$

with

$$b_{\sigma}=\frac{1}{\sqrt{2}}(\tilde{c}_{\sigma}+\tilde{d}_{\sigma})=\sqrt{2}c_{\sigma}(1-n_{-\sigma}), \tag{17}$$
$$a_{\sigma}=\frac{1}{\sqrt{2}}(\tilde{c}_{\sigma}-\tilde{d}_{\sigma})=\sqrt{2}c_{\sigma}n_{-\sigma}. \tag{18}$$

The bonding and antibonding operators are given by Eqs. (17) and (18), respectively. They are nothing but the operators for the lower and upper Hubbard levels of the Hubbard model, whose averaged energies are $E=0$ and $U$, respectively. Namely, aside from the normalization factor, $b_{\sigma}^{\dagger}$ creates an electron with spin $\sigma$ when the site is not occupied by an opposite-spin electron (it fills the lower Hubbard state in the corresponding Hubbard model), as is apparent from the last expression in Eq. (17). On the other hand, $a_{\sigma}^{\dagger}$ creates an electron with spin $\sigma$ when the opposite-spin electron already exists; namely, it creates a doublon at the upper Hubbard state of the Hubbard model. This gives an intuitive interpretation of the correspondence between the Hubbard model and the TCFM. With the choice of Eq. (8), Eqs. (15) and (16) give $\mu_{b}=0$ and $\mu_{a}=U$, so that the Hubbard gap is reproduced. In addition, the ground state $b_{\sigma}^{\dagger}|0\rangle$ indeed has the form of Eq. (13). The mapping between Eqs. (1) and (4) then becomes exact. In this correspondence, the Mott gap of the Hubbard model is interpreted as the hybridization gap of the TCFM.

In general, one can show that the single-particle Green’s function for the TCFM is given by

$$\begin{aligned}G_{\tilde{c}_{\sigma},\tilde{c}_{\sigma}^{\dagger}}(\omega)&=\frac{1}{\omega-\mu_{\tilde{c}}-\frac{\Lambda^{2}}{\omega-\mu_{\tilde{d}}}},\\ G_{\tilde{c}_{\sigma},\tilde{d}_{\sigma}^{\dagger}}(\omega)&=G_{\tilde{d}_{\sigma},\tilde{c}_{\sigma}^{\dagger}}(\omega)=\frac{-\Lambda}{(\omega-\mu_{\tilde{c}})(\omega-\mu_{\tilde{d}})-\Lambda^{2}},\\ G_{\tilde{d}_{\sigma},\tilde{d}_{\sigma}^{\dagger}}(\omega)&=\frac{1}{\omega-\mu_{\tilde{d}}-\frac{\Lambda^{2}}{\omega-\mu_{\tilde{c}}}}.\end{aligned} \tag{19}$$

Then from Eq.(8), we obtain

$$G_{\tilde{c}_{\sigma},\tilde{c}_{\sigma}^{\dagger}}(\omega)=\frac{1}{\omega-\frac{U}{2}-\frac{U^{2}/4}{\omega-\frac{U}{2}}} \tag{20}$$
$$\phantom{G_{\tilde{c}_{\sigma},\tilde{c}_{\sigma}^{\dagger}}(\omega)}=\frac{1}{2}\left[\frac{1}{\omega}+\frac{1}{\omega-U}\right]. \tag{21}$$

This is equivalent to the Green’s function of the atomic Hubbard, Eq.(1). Then the self-energy has the correct form as well:

$$\Sigma(\omega)=\frac{U^{2}/4}{\omega-\frac{U}{2}}. \tag{22}$$

In this way, exact correspondence is established between the TCFM (4) and the Hubbard model (1) in the atomic limit for the half-filled ground state as well as for single-particle excitations from it. Namely, the full Hilbert space of the Hubbard model in the atomic limit is equivalent to that of the TCFM.
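The algebra of this section is easy to check numerically. The following is a minimal sketch, not part of the original derivation: it evaluates the two diagonal levels of Eqs. (15) and (16) for the parameters of Eq. (8) and compares the continued-fraction form of Eq. (20) with the two-pole form of Eq. (21) at a few complex frequencies. The value of $U$, the test frequencies and the function names are illustrative assumptions.

```python
# Numerical check of the atomic-limit mapping, Eqs. (8), (15)-(16), (20)-(21).
# U, the test frequencies and all function names are illustrative assumptions.

def tcfm_levels(mu_c, mu_d, lam):
    """Levels (mu_b, mu_a) of Eqs. (15) and (16), valid for mu_c == mu_d."""
    mean = 0.5 * (mu_c + mu_d)
    return mean + lam, mean - lam


def g_cc_tcfm(omega, mu_c, mu_d, lam):
    """G_{c,c^dagger}(omega) of the TCFM, first line of Eq. (19)."""
    return 1.0 / (omega - mu_c - lam**2 / (omega - mu_d))


def g_atomic_hubbard(omega, U):
    """Two-pole form of Eq. (21): equal weights at omega = 0 and omega = U."""
    return 0.5 * (1.0 / omega + 1.0 / (omega - U))


U = 4.0
mu_c = mu_d = U / 2.0   # Eq. (8)
lam = -U / 2.0          # Eq. (8)

mu_b, mu_a = tcfm_levels(mu_c, mu_d, lam)

# A small imaginary part keeps the evaluation away from the poles on the real axis.
omegas = [complex(x, 0.1) for x in (-3.0, -0.5, 1.0, 2.5, 6.0)]
max_diff = max(
    abs(g_cc_tcfm(w, mu_c, mu_d, lam) - g_atomic_hubbard(w, U)) for w in omegas
)

print("levels:", mu_b, mu_a)
print("max |G_TCFM - G_Hubbard|:", max_diff)
```

The two levels come out separated by $U$, and the two forms of the Green's function agree to rounding error, which is all the sketch is meant to confirm.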

3 Two-site Hubbard model

In this section, the mapping of the atomic limit already established and summarized in the last section is extended and a mapping between the 2-site Hubbard model and the 2-site TCFM is shown.

The 2-site Hubbard Hamiltonian reads

$${\cal H}={\cal H}_{t}+{\cal H}_{U}, \tag{23}$$
$${\cal H}_{t}=\sum_{\sigma}\Big[-t(c_{1\sigma}^{\dagger}c_{2\sigma}+c_{2\sigma}^{\dagger}c_{1\sigma})+\mu\sum_{i=1,2}n_{i,\sigma}\Big], \tag{24}$$
$${\cal H}_{U}=U\sum_{i=1,2}n_{i\uparrow}n_{i\downarrow}, \tag{25}$$

with $n_{i,\sigma}=c^{\dagger}_{i,\sigma}c_{i,\sigma}$ . We take $\mu=0$ , because in the canonical ensemble, spatially uniform chemical potential does not change physical properties except for a trivial energy shift.

We first analyze the half-filled case with one $\uparrow$ and one $\downarrow$ electron and use the basis of the full Hilbert space expanded by $|\uparrow,\downarrow\rangle,|\downarrow,\uparrow\rangle,|\uparrow\downarrow,0\rangle,|0,\uparrow\downarrow\rangle$ in the notation $|n_{1\uparrow}n_{1\downarrow},n_{2\uparrow}n_{2\downarrow}\rangle$ , where $n_{i\sigma}$ is the number of spin $\sigma$ fermion at the site $i$ and is denoted as $\uparrow$ if $n_{i\uparrow}=1$ and $\downarrow$ if $n_{i\downarrow}=1$ while $0$ if $n_{i\uparrow}=n_{i\downarrow}=0$ at the site $i$ . For instance, $|\uparrow,\downarrow\rangle$ represents $c_{1\uparrow}^{\dagger}c_{2\downarrow}^{\dagger}|0\rangle$ and $|\downarrow,\uparrow\rangle=c_{2\uparrow}^{\dagger}c_{1\downarrow}^{\dagger}|0\rangle$ such that up-spin creation operators are ordered in the left-hand side and spatial sites are ordered in the site-number order from 1 to 2 for the same spin. Here $|0\rangle$ is the vacuum. In this notation, the Hamiltonian matrix is written as

$${\cal H}=\left(\begin{array}{c|cccc}&|\uparrow,\downarrow\rangle&|\downarrow,\uparrow\rangle&|\uparrow\downarrow,0\rangle&|0,\uparrow\downarrow\rangle\\ \hline \langle\uparrow,\downarrow|&0&0&-t&-t\\ \langle\downarrow,\uparrow|&0&0&-t&-t\\ \langle\uparrow\downarrow,0|&-t&-t&U&0\\ \langle 0,\uparrow\downarrow|&-t&-t&0&U\end{array}\right). \tag{31}$$

Here and hereafter, we have explicitly written the choice of the basis as the row and column vectors to make the definition of the components of the $4\times 4$ matrix clearer.

The eigenvalues are

$$E_{0}=\frac{U}{2}Q,\qquad E_{1}=0,\qquad E_{2}=U,\qquad E_{3}=\frac{U}{2}P, \tag{32}$$

where $P=1+\sqrt{1+R^{2}}$ , $Q=1-\sqrt{1+R^{2}}$ and $R=4t/U$ . The normalized eigenfunction of the ground state with the energy $E_{0}$ is given by

$$(R,R,-Q,-Q)/\sqrt{2(R^{2}+Q^{2})}, \tag{33}$$

which is the coefficients in the basis of $(|\uparrow,\downarrow\rangle,|\downarrow,\uparrow\rangle,|\uparrow\downarrow,0\rangle,|0,\uparrow\downarrow\rangle)$ . Namely, the normalized ground state is

$$\frac{1}{\sqrt{2(R^{2}+Q^{2})}}(R|\uparrow,\downarrow\rangle+R|\downarrow,\uparrow\rangle-Q|\uparrow\downarrow,0\rangle-Q|0,\uparrow\downarrow\rangle). \tag{34}$$
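As a quick cross-check of Eqs. (31)-(34), the sketch below applies the $4\times 4$ matrix of Eq. (31) to candidate eigenvectors and reports the residual $\max|{\cal H}v-Ev|$ for each eigenvalue of Eq. (32). The values $t=1$, $U=4$ and the helper names are illustrative assumptions; the eigenvectors for $E_{1}$, $E_{2}$ and $E_{3}$ are written down here by symmetry and are not quoted from the text.

```python
import math


def hubbard_2site_matrix(t, U):
    """Matrix of Eq. (31) in the basis (|up,dn>, |dn,up>, |updn,0>, |0,updn>)."""
    return [
        [0.0, 0.0, -t, -t],
        [0.0, 0.0, -t, -t],
        [-t, -t, U, 0.0],
        [-t, -t, 0.0, U],
    ]


def matvec(m, v):
    """Dense matrix-vector product for small lists of lists."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def residual(m, v, e):
    """Largest component of (m v - e v); vanishes for an exact eigenpair."""
    return max(abs(a - e * b) for a, b in zip(matvec(m, v), v))


t, U = 1.0, 4.0                       # illustrative values
R = 4.0 * t / U
P = 1.0 + math.sqrt(1.0 + R * R)
Q = 1.0 - math.sqrt(1.0 + R * R)
H = hubbard_2site_matrix(t, U)

norm0 = math.sqrt(2.0 * (R * R + Q * Q))
ground = [R / norm0, R / norm0, -Q / norm0, -Q / norm0]   # Eq. (33)

s = 1.0 / math.sqrt(2.0)
pairs = [
    (0.5 * U * Q, ground),              # E_0
    (0.0, [s, -s, 0.0, 0.0]),           # E_1, odd combination of singly occupied states
    (U, [0.0, 0.0, s, -s]),             # E_2, odd combination of doublon states
    (0.5 * U * P, [R, R, -P, -P]),      # E_3, unnormalized
]

residuals = [residual(H, v, e) for e, v in pairs]
norm_sq = sum(x * x for x in ground)

print("residuals:", residuals)
print("norm of Eq. (33) squared:", norm_sq)
```

All four residuals vanish to rounding error and the vector of Eq. (33) has unit norm, so the stated spectrum and ground state are consistent with the matrix.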

Now we introduce the same auxiliary fermions Eqs.(2) and (3) as the atomic limit. By using $\tilde{c}$ and $\tilde{d}$ , the Hubbard Hamiltonian (23) can be mapped to a noninteracting 2-site TCFM as

$${\cal H}_{\rm TCFM}={\cal H}^{(\tilde{c})}+{\cal H}^{(\tilde{d})}+{\cal H}^{(\tilde{c}\tilde{d})}, \tag{35}$$
$${\cal H}^{(\tilde{c})}=\sum_{\sigma}\Big[-t_{\tilde{c}}(\tilde{c}_{1,\sigma}^{\dagger}\tilde{c}_{2,\sigma}+{\rm H.c.})+\mu_{\tilde{c}}\sum_{i=1,2}n_{\tilde{c},i,\sigma}\Big], \tag{36}$$
$${\cal H}^{(\tilde{d})}=\sum_{\sigma}\Big[-t_{\tilde{d}}(\tilde{d}_{1,\sigma}^{\dagger}\tilde{d}_{2,\sigma}+{\rm H.c.})+\mu_{\tilde{d}}\sum_{i=1,2}n_{\tilde{d},i,\sigma}\Big], \tag{37}$$
$${\cal H}^{(\tilde{c}\tilde{d})}=\Lambda\sum_{\sigma}\sum_{i=1,2}(\tilde{c}_{i,\sigma}^{\dagger}\tilde{d}_{i,\sigma}+{\rm H.c.}), \tag{38}$$

where $n_{\tilde{c},i,\sigma}=\tilde{c}_{i,\sigma}^{\dagger}\tilde{c}_{i,\sigma}$ and $n_{\tilde{d},i,\sigma}=\tilde{d}_{i,\sigma}^{\dagger}\tilde{d}_{i,\sigma}$, again with the same mapping, Eq. (8), together with

$$t_{\tilde{c}}=t,\qquad t_{\tilde{d}}=-t. \tag{39}$$

The Hilbert space of this TCFM wave function may be expanded in the basis $\{(\tilde{c}_{1,\sigma}^{\dagger}+\tilde{d}_{1,\sigma}^{\dagger})/\sqrt{2},\ (\tilde{c}_{2,\sigma}^{\dagger}+\tilde{d}_{2,\sigma}^{\dagger})/\sqrt{2},\ (\tilde{c}_{1,\sigma}^{\dagger}-\tilde{d}_{1,\sigma}^{\dagger})/\sqrt{2},\ (\tilde{c}_{2,\sigma}^{\dagger}-\tilde{d}_{2,\sigma}^{\dagger})/\sqrt{2}\}|0\rangle$, where $|0\rangle$ is the vacuum of $\tilde{c}$ and $\tilde{d}$. Since the spin degeneracy exists and there is no correlation between opposite-spin fermions, we use an abbreviated notation for this basis, $\tilde{b}_{i}=(\tilde{c}_{i}+\tilde{d}_{i})/\sqrt{2}$ and $\tilde{a}_{i}=(\tilde{c}_{i}-\tilde{d}_{i})/\sqrt{2}$ for $i=1,2$. The Hamiltonian matrix in this representation is

$${\cal H}_{\rm TCFM}=\left(\begin{array}{c|cccc}&\tilde{b}_{1}&\tilde{b}_{2}&\tilde{a}_{1}&\tilde{a}_{2}\\ \hline \tilde{b}_{1}^{\dagger}&0&0&-t&-t\\ \tilde{b}_{2}^{\dagger}&0&0&-t&-t\\ \tilde{a}_{1}^{\dagger}&-t&-t&U&0\\ \tilde{a}_{2}^{\dagger}&-t&-t&0&U\end{array}\right), \tag{45}$$

for both of up and down spin sectors, which is the same as Eq.(31). Then the eigenvalues and eigenfunctions are of course the same as Eqs.(32) and (34), respectively. The ground-state wave function is given by

$$\begin{aligned}|\Phi_{0}^{\rm TCFM}\rangle={}&\frac{1}{2(R^{2}+Q^{2})}\big(R(\tilde{b}_{1,\uparrow}^{\dagger}+\tilde{b}_{2,\uparrow}^{\dagger})+Q(\tilde{a}_{1,\uparrow}^{\dagger}+\tilde{a}_{2,\uparrow}^{\dagger})\big)\\ &\times\big(R(\tilde{b}_{1,\downarrow}^{\dagger}+\tilde{b}_{2,\downarrow}^{\dagger})+Q(\tilde{a}_{1,\downarrow}^{\dagger}+\tilde{a}_{2,\downarrow}^{\dagger})\big)|0\rangle,\end{aligned} \tag{46}$$

which has the symmetries of spin singlet and spatial parity even. The first excited state is given by

$$\begin{aligned}|\Phi_{1}^{\rm TCFM}\rangle={}&\frac{\sqrt{2}}{4(R^{2}+Q^{2})}\Big[\big(R(\tilde{b}_{1,\uparrow}^{\dagger}-\tilde{b}_{2,\uparrow}^{\dagger})+Q(\tilde{a}_{1,\uparrow}^{\dagger}-\tilde{a}_{2,\uparrow}^{\dagger})\big)\big(R(\tilde{b}_{1,\downarrow}^{\dagger}+\tilde{b}_{2,\downarrow}^{\dagger})+Q(\tilde{a}_{1,\downarrow}^{\dagger}+\tilde{a}_{2,\downarrow}^{\dagger})\big)\\ &-\big(R(\tilde{b}_{1,\uparrow}^{\dagger}+\tilde{b}_{2,\uparrow}^{\dagger})+Q(\tilde{a}_{1,\uparrow}^{\dagger}+\tilde{a}_{2,\uparrow}^{\dagger})\big)\big(R(\tilde{b}_{1,\downarrow}^{\dagger}-\tilde{b}_{2,\downarrow}^{\dagger})+Q(\tilde{a}_{1,\downarrow}^{\dagger}-\tilde{a}_{2,\downarrow}^{\dagger})\big)\Big]|0\rangle,\end{aligned} \tag{47}$$

which is spin triplet and spatial parity odd. Other two higher excited states can be given similarly.

With this correspondence, the TCFM ground state (Eq.(46)) is exactly mapped to the Hubbard terminology as

$$\begin{aligned}|\Phi_{0}^{\rm Hub}\rangle&=\frac{1}{2(R^{2}+Q^{2})}\big(R(\tilde{b}_{1,\uparrow}^{\dagger}\tilde{b}_{2,\downarrow}^{\dagger}+\tilde{b}_{2,\uparrow}^{\dagger}\tilde{b}_{1,\downarrow}^{\dagger})+Q(\tilde{a}_{1,\uparrow}^{\dagger}\tilde{b}_{1,\downarrow}^{\dagger}+\tilde{a}_{2,\uparrow}^{\dagger}\tilde{b}_{2,\downarrow}^{\dagger})\big)|0\rangle\\ &\Leftrightarrow{\rm Eq.~(34)}.\end{aligned} \tag{48}$$

Here, in the Hubbard model, we used the relations $(\tilde{c}_{i,\sigma}^{\dagger}-\tilde{d}_{i,\sigma}^{\dagger})|0\rangle=0$, $(\tilde{c}_{i,\sigma}^{\dagger}+\tilde{d}_{i,\sigma}^{\dagger})(\tilde{c}_{i,-\sigma}^{\dagger}+\tilde{d}_{i,-\sigma}^{\dagger})|0\rangle=0$ and $(\tilde{c}_{i,\sigma}^{\dagger}-\tilde{d}_{i,\sigma}^{\dagger})(\tilde{c}_{j,-\sigma}^{\dagger}+\tilde{d}_{j,-\sigma}^{\dagger})|0\rangle=0$ for $i\neq j$, because $\tilde{c}_{i,\sigma}^{\dagger}-\tilde{d}_{i,\sigma}^{\dagger}$ creates a doublon at the $i$th site singly occupied by a spin $-\sigma$ electron and $\tilde{c}_{i,\sigma}^{\dagger}+\tilde{d}_{i,\sigma}^{\dagger}$ creates an electron at the empty $i$th site. The mapping for the excited states is shown similarly. Therefore, the exact mapping of not only the ground state but also the whole structure of the spectra between the TCFM and the Hubbard model is proven for the two-site system as well.

Here, we remark on the spin degeneracy in the TCFM. Because of the spin degeneracy, the total ground state of the TCFM with one up-spin and one down-spin fermion is given by the product state $|\Phi_{0}\rangle=|\Phi_{0\uparrow}\rangle|\Phi_{0\downarrow}\rangle$. The total ground-state energy with one up-spin and one down-spin fermion is then twice the ground-state energy $E_{0}$ in Eq. (32). However, as in the single-site problem, the ground-state average in the TCFM, $\langle\cdots\rangle$, should be taken as in Eq. (12), and the factor 2 is canceled.

The fillings doped with one hole or one electron away from half filling trivially have the same mapping. For instance, a hole-doped ground state is

$$|\Phi_{0}^{\rm TCFM}\rangle=\frac{1}{2}(\tilde{b}_{1,\uparrow}^{\dagger}+\tilde{b}_{2,\uparrow}^{\dagger})|0\rangle, \tag{49}$$

which has the energy $E_{0}=0$ and is the same as the one-hole doped Hubbard model.

4 Fermi Machine as a Quantum Neural Network

We have shown that the Hubbard model has an exact mapping to the TCFM for one- and two-site problems and that the ground-state wave function of the Hubbard model can be constructed from the ground state of the TCFM. This demonstrates that the TCFM is able to describe both the energy scale of the Mott gap separating the lower and upper Hubbard bands, $E_{2}-E_{0}\propto U$, and the singlet-triplet gap ($E_{1}-E_{0}$ and $E_{3}-E_{2}$) associated with the superexchange interaction $J$, which scales as $4t^{2}/U$ in the strong coupling limit. Up to 2 sites, the exact ground and excited states of the Hubbard model are represented by the corresponding ground and excited states of the TCFM, respectively, by optimizing the variational parameters given by Eqs. (8) and (39), namely, $\mu_{\tilde{c}},\mu_{\tilde{d}},\Lambda,t_{\tilde{c}}$ and $t_{\tilde{d}}$.

This suggests a potential to represent $N_{s}$ -site Hubbard models defined by

$${\cal H}={\cal H}_{t}+{\cal H}_{U}, \tag{50}$$
$${\cal H}_{t}=-t\sum_{\langle i,j\rangle,\sigma}(c_{i\sigma}^{\dagger}c_{j\sigma}+{\rm H.c.})+\mu\sum_{i,\sigma}^{N_{s}}n_{i,\sigma}, \tag{51}$$
$${\cal H}_{U}=U\sum_{i}^{N_{s}}n_{i\uparrow}n_{i\downarrow} \tag{52}$$

for any system size $N_{s}$ by a quantum neural network along this line. We will discuss the representability of the wave function later.

In principle, by introducing the self-energies of the visible electron $c$ through the effect of the hidden fermions $d$ , it was shown that the spectral function and energy spectra of any interacting system can be represented formally but exactly by the hierarchy of the self-energy structure, which represents the energy eigenvalues of the Hamiltonian by poles and zeros of Green’s function through the continued fraction expansion [34, 35, 36] as

$$G(q,\omega)=\frac{1}{\omega-\epsilon_{c}(q)-\Sigma_{1}(q,\omega)}, \tag{53}$$
$$\Sigma_{1}(q,\omega)=\frac{\eta_{1}(q)}{\omega-\epsilon_{1}(q)-\Sigma_{2}(q,\omega)},\qquad \Sigma_{2}(q,\omega)=\frac{\eta_{2}(q)}{\omega-\epsilon_{2}(q)-\Sigma_{3}(q,\omega)},\qquad\cdots$$

Here we assume the translational symmetry of the Hamiltonian parameters to allow the momentum representation. This can be achieved by formally tracing the equation of motion for the $c$ operator in the Heisenberg representation as

$$i\hbar\frac{dc(t)}{dt}=[c,{\cal H}], \tag{54}$$

for any Hamiltonian $\cal H$ and $[A,B]\equiv AB-BA$ .

Therefore, one can expect that the ground state of the Hubbard model (or any interacting lattice fermions) can be mapped to an optimized non-interacting multi-component fermion model (MCFM) represented by

$${\cal H}_{\rm MCFM}={\cal H}^{(\tilde{c})}+{\cal H}^{(\tilde{c}\tilde{d})}+{\cal H}^{(\tilde{d})}, \tag{55}$$
$${\cal H}^{(\tilde{c})}=\sum_{\langle i,j\rangle,\sigma}(-t_{\tilde{c},i,j})(\tilde{c}_{i,\sigma}^{\dagger}\tilde{c}_{j,\sigma}+{\rm H.c.})+\sum_{i,\sigma}\mu_{\tilde{c},i}n_{\tilde{c},i,\sigma}, \tag{56}$$
$${\cal H}^{(\tilde{c}\tilde{d})}=\Lambda\sum_{\sigma}\sum_{i}(\tilde{c}_{i,\sigma}^{\dagger}\tilde{d}_{i,\sigma}^{(1)}+{\rm H.c.}), \tag{57}$$
$$\begin{aligned}{\cal H}^{(\tilde{d})}={}&\sum_{m=1}^{M}\Big[\sum_{\langle i,j\rangle,\sigma}(-t_{\tilde{d},i,j}^{(m)})(\tilde{d}_{i,\sigma}^{(m)\dagger}\tilde{d}_{j,\sigma}^{(m)}+{\rm H.c.})+\sum_{i,\sigma}\mu_{\tilde{d},i}^{(m)}\tilde{d}_{i,\sigma}^{(m)\dagger}\tilde{d}_{i,\sigma}^{(m)}\Big]\\ &+\sum_{m=1}^{M-1}\Lambda^{(m)}\sum_{i,\sigma}(\tilde{d}_{i,\sigma}^{(m)\dagger}\tilde{d}_{i,\sigma}^{(m+1)}+{\rm H.c.}),\end{aligned} \tag{58}$$

as is illustrated in Fig. 1, because the Green’s function of Eq. (55) is given by the hierarchical structure given by Eq.(53) with the correspondence

$$\begin{aligned}\epsilon_{c}(q)&=\frac{1}{N_{s}}\sum_{\langle\ell,j\rangle}e^{iq(\ell-j)}(-t_{\tilde{c},\ell,j})+\mu_{\tilde{c}},\\ \eta_{1}(q)&=\Lambda,\\ \epsilon_{m}(q)&=\frac{1}{N_{s}}\sum_{\langle\ell,j\rangle}e^{iq(\ell-j)}(-t_{\tilde{d},\ell,j}^{(m)})+\mu_{\tilde{d}}^{(m)},\\ \eta_{m}(q)&=\Lambda^{(m)},\end{aligned} \tag{59}$$

if the chemical potential is spatially uniform.
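The hierarchical structure of Eq. (53) can be illustrated at a single momentum $q$, where the MCFM reduces to a chain of levels $\epsilon_{c},\epsilon_{1},\dots,\epsilon_{M}$ coupled to their neighbors. The sketch below is an illustration under stated assumptions, not the author's implementation: it takes the numerators of the continued fraction to be the squared couplings, as in Eq. (19), and compares the result with an independent evaluation of the same Green's function as a ratio of tridiagonal determinants. The level energies, couplings and function names are arbitrary illustrative choices.

```python
def g_continued_fraction(omega, eps, hyb):
    """G_00(omega) of a chain with levels eps[0..M] and couplings hyb[m]
    between levels m and m+1, built from the innermost level outward.
    Assumption: each numerator is the squared coupling, as in Eq. (19)."""
    sigma = 0.0
    for m in range(len(eps) - 1, 0, -1):
        sigma = hyb[m - 1] ** 2 / (omega - eps[m] - sigma)
    return 1.0 / (omega - eps[0] - sigma)


def tridiagonal_det(omega, eps, hyb):
    """det(omega - H) for the same chain, by the three-term recurrence."""
    d_prev, d_cur = 0.0, 1.0          # determinant of the empty matrix is 1
    for k in range(len(eps)):
        coupling_sq = hyb[k - 1] ** 2 if k > 0 else 0.0
        d_prev, d_cur = d_cur, (omega - eps[k]) * d_cur - coupling_sq * d_prev
    return d_cur


def g_determinant_ratio(omega, eps, hyb):
    """G_00 = det(omega - H with level 0 removed) / det(omega - H)."""
    return tridiagonal_det(omega, eps[1:], hyb[1:]) / tridiagonal_det(omega, eps, hyb)


# Visible level plus M = 3 hidden layers at one momentum (illustrative numbers).
eps = [0.3, -0.5, 1.2, 0.7]
hyb = [0.8, 0.4, 0.6]
omega = complex(0.1, 0.2)

diff_chain = abs(g_continued_fraction(omega, eps, hyb)
                 - g_determinant_ratio(omega, eps, hyb))

# With a single hidden level and the parameters of Eq. (8), the atomic-limit
# result of Eq. (21) is recovered.
U = 4.0
g_two_level = g_continued_fraction(omega, [U / 2, U / 2], [-U / 2])
diff_atomic = abs(g_two_level - 0.5 * (1.0 / omega + 1.0 / (omega - U)))

print(diff_chain, diff_atomic)
```

Each additional hidden layer adds one level to the continued fraction, which is the sense in which deeper layers supply further poles and zeros of the Green's function.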

Figure 1: (Color online) Illustration of the architecture of the Fermi machine network based on the MCFM, compared with the Hubbard model.

The algorithm of the present quantum neural network to represent the ground state of the Hubbard model is the following:

• We diagonalize the noninteracting MCFM Hamiltonian, Eq. (55), and obtain the ground state by filling the low-energy fermions up to the Fermi level for a given set of parameters in Eqs. (56)-(58).
• Next, we replace the hidden fermions $\tilde{d}$ using the rule of Eq. (2), together with the analogous correspondence $\tilde{d}_{i,\sigma}^{(m)}=\tilde{d}_{i,\sigma}^{(m-1)}(1-2n_{\tilde{d}^{(m-1)},i,-\sigma})$, where $n_{\tilde{d}^{(m)},i,-\sigma}=\tilde{d}_{i,-\sigma}^{(m)\dagger}\tilde{d}_{i,-\sigma}^{(m)}$.

• Then, using relations such as those given below Eq. (48) and their extension to the deep-layer hidden operators, one can represent the wave function of the Hubbard model, $|\Phi_{0}^{\rm Hub}\rangle$, as in the derivation of Eq. (48). From it, one can calculate the matrix elements of the Hubbard Hamiltonian and its energy expectation value

$$\langle E_{0}\rangle=\langle{\cal H}\rangle=\frac{\langle\Phi_{0}^{\rm Hub}|{\cal H}|\Phi_{0}^{\rm Hub}\rangle}{\langle\Phi_{0}^{\rm Hub}|\Phi_{0}^{\rm Hub}\rangle} \tag{60}$$

using Monte Carlo sampling of the real space configuration of $c$ fermion occupation. Here $\cal H$ is given by Eqs. (50)-(52). Note that ${\cal H}_{t}|\Phi_{0}^{\rm Hub}\rangle$ is not a single Slater determinant even when $|\Phi_{0}^{\rm Hub}\rangle$ is so. An easy way to obtain the average is

$$\langle E_{0t}\rangle=\langle{\cal H}_{t}\rangle=\lim_{\tau\rightarrow 0}\frac{1}{\tau}\log\left[\frac{\langle\Phi_{0}^{\rm Hub}|\exp[\tau{\cal H}_{t}]|\Phi_{0}^{\rm Hub}\rangle}{\langle\Phi_{0}^{\rm Hub}|\Phi_{0}^{\rm Hub}\rangle}\right] \tag{61}$$

because $\exp[\tau{\cal H}_{t}]|\Psi\rangle$ remains a single Slater determinant when $|\Psi\rangle$ is a single Slater determinant [16].

• Finally, the parameters in the MCFM Hamiltonian, Eqs. (55)-(58), are regarded as variational parameters and are optimized to lower the energy of Eq. (60), following the variational principle.
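The limit formula of Eq. (61) used in the third step can be checked on a small example. The sketch below is only a toy illustration: instead of Slater determinants it uses a dense four-component vector in the two-site basis of Eq. (31), with ${\cal H}_{t}$ taken as the $U=0$ part of that matrix, and it applies $\exp[\tau{\cal H}_{t}]$ by a truncated Taylor series. The trial amplitudes, the values of $\tau$ and the function names are assumptions made for the example.

```python
import math


def matvec(m, v):
    """Dense matrix-vector product for small lists of lists."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def dot(u, v):
    """Inner product of two real vectors."""
    return sum(a * b for a, b in zip(u, v))


def expm_apply(m, v, tau, order=20):
    """exp(tau * m) applied to v by a truncated Taylor series (small tau only)."""
    result = list(v)
    term = list(v)
    for k in range(1, order + 1):
        term = [tau * x / k for x in matvec(m, term)]
        result = [r + x for r, x in zip(result, term)]
    return result


def log_estimate(m, v, tau):
    """Finite-tau version of Eq. (61): (1/tau) log(<v|exp(tau m)|v> / <v|v>)."""
    ratio = dot(v, expm_apply(m, v, tau)) / dot(v, v)
    return math.log(ratio) / tau


t = 1.0
# Hopping part of Eq. (31), i.e. the same matrix with U = 0.
H_t = [
    [0.0, 0.0, -t, -t],
    [0.0, 0.0, -t, -t],
    [-t, -t, 0.0, 0.0],
    [-t, -t, 0.0, 0.0],
]
psi = [0.65, 0.65, 0.27, 0.27]   # arbitrary real trial amplitudes, not normalized

exact = dot(psi, matvec(H_t, psi)) / dot(psi, psi)
taus = [1e-1, 1e-2, 1e-3]
errors = [abs(log_estimate(H_t, psi, tau) - exact) for tau in taus]

print("direct <H_t>:", exact)
print("errors of the log estimate:", errors)
```

The deviation from the direct expectation value shrinks roughly in proportion to $\tau$, since the leading correction is $\tau$ times half the variance of ${\cal H}_{t}$ in the trial state.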

There exist a few useful relations to simplify the MCFM state:

$$\tilde{d}_{i,\sigma}^{(m)}=\tilde{d}_{i,\sigma}^{(m-1)}(1-2n_{\tilde{d}^{(m-1)},i,-\sigma})=\cdots=c_{i,\sigma}(1-2n_{c_{i,-\sigma}})^{m}, \tag{62}$$
$$n_{\tilde{d}_{i,\sigma}^{(m)}}=n_{\tilde{d}^{(m-1)}_{i,\sigma}}=\cdots=n_{c_{i,\sigma}}. \tag{63}$$

The ground state of the MCFM can be represented generally as

$$|\Psi_{0}^{\rm MCFM}\rangle=\begin{pmatrix}|\Psi_{0\ \uparrow}^{\rm MCFM}\rangle&0\\ 0&|\Psi_{0\ \downarrow}^{\rm MCFM}\rangle\end{pmatrix}, \tag{64}$$
$$|\Psi_{0\ \sigma}^{\rm MCFM}\rangle=\prod_{q}^{k_{\rm F}}|\Psi_{\sigma}^{\rm MCFM}(q)\rangle, \tag{65}$$
$$|\Psi_{\sigma}^{\rm MCFM}(q)\rangle=\Big[\alpha_{0,q}\tilde{c}_{q,\sigma}^{\dagger}+\sum_{m=1}^{M}\alpha_{m,q}\tilde{d}_{m,q,\sigma}^{\dagger}\Big]|0\rangle, \tag{66}$$

where $q$ is the momentum. Here, in the ground state of the MCFM, fermions occupy the states up to the Fermi momentum $k_{\rm F}$. Since the MCFM is diagonal in momentum space, the ground-state wave function in momentum space is trivial once the parameters in Eqs. (56)-(58) are given. The coefficients $\alpha_{m,q}$ ($m=0,\cdots,M$) are determined from the eigenvector of the ground state, which can easily be calculated numerically.

This MCFM ground state is mapped to the variational form of the ground state of the Hubbard model as

$$|\Psi_{0}^{\rm Hub}\rangle=\begin{pmatrix}|\Psi_{0\ \uparrow}^{\rm Hub}\rangle&0\\ 0&|\Psi_{0\ \downarrow}^{\rm Hub}\rangle\end{pmatrix}, \tag{67}$$
$$|\Psi_{0\ \sigma}^{\rm Hub}\rangle=\prod_{q}^{k_{\rm F}}\Big[\sum_{m=0}^{M}\alpha_{m,q}\sum_{\ell}\frac{e^{iq\ell}}{\sqrt{N_{s}}}c_{\ell,\sigma}^{\dagger}(1-2n_{\ell,-\sigma})^{m}\Big]|0\rangle. \tag{68}$$

It can be rewritten as

$$|\Psi_{0\ \sigma}^{\rm Hub}\rangle=\prod_{q}^{k_{\rm F}}|\Psi_{\sigma}^{\rm Hub}(q)\rangle, \tag{69}$$
$$|\Psi_{\sigma}^{\rm Hub}(q)\rangle=\Big[\sum_{m=0}^{M}\alpha_{m,q}\sum_{\ell}\frac{e^{iq\ell}}{\sqrt{N_{s}}}c_{\ell,\sigma}^{\dagger}(-)^{mn_{\ell,-\sigma}}\Big]|0\rangle. \tag{70}$$

It should be noted that $|\Psi_{\sigma}^{\rm Hub}(q)\rangle$ contains electron operators $c^{\dagger}_{p,\sigma}$ with not only the momentum $q$ but also all the momentum $p$ in the Brillouin zone in the momentum representation, which is a consequence of the interference with the spin $-\sigma$ electron at the $\ell$ -th site in Eq.(70) reflecting the scattering of $\sigma$ and $-\sigma$ electrons at the $\ell$ -th site in the original Hubbard model. Indeed, intuitively, the minus factor for the odd $m$ in Eq. (70) represents the reduction of the weight of $c_{\ell,\sigma}^{\dagger}$ electron if $-\sigma$ electron exists at the $\ell$ -th site to punish the double occupation leading to the entanglement of spin up and down electrons. For instance, in the large $U$ limit, the corresponding $\Lambda\rightarrow-\infty$ makes the perfect exclusion of the double occupation through Eq. (70). The representability of this wave function is discussed in Appendix.

To numerically optimize the variational parameters in the MCFM (namely the parameters in Eqs. (55)-(58)), we need to estimate the ground-state energy using the obtained wave function through Eq. (60), by the average of the inserted sample $|x\rangle$ as

$$E_{0}=\langle{\cal H}\rangle=\sum_{x}\rho(x)F(x), \tag{71}$$
$$\rho(x)=|\langle x|\Psi_{0}^{\rm Hub}\rangle|^{2}/\langle\Psi_{0}^{\rm Hub}|\Psi_{0}^{\rm Hub}\rangle, \tag{72}$$
$$F(x)=\sum_{x^{\prime}}\frac{\langle\Psi_{0}^{\rm Hub}|x^{\prime}\rangle}{\langle\Psi_{0}^{\rm Hub}|x\rangle}\langle x^{\prime}|{\cal H}|x\rangle. \tag{73}$$

Here, $\langle x^{\prime}|{\cal H}_{t}|x\rangle$ can be calculated in the same way as Eq. (61).
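The decomposition in Eqs. (71)-(73) can be checked on a small example. The following Python sketch uses a toy three-state Hermitian matrix and a toy amplitude vector, both illustrative assumptions unrelated to the Hubbard model. It sums over all $x$ exactly instead of sampling, so that the weighted sum of $F(x)$ can be compared with the direct expectation value.

```python
def local_energy(x, psi, H):
    """F(x) of Eq. (73): sum over x' of <Psi|x'>/<Psi|x> * <x'|H|x>."""
    bra_x = psi[x].conjugate()  # <Psi|x> = conj(<x|Psi>)
    return sum(psi[xp].conjugate() / bra_x * H[xp][x]
               for xp in range(len(psi)) if H[xp][x] != 0)  # skip zeros of sparse H


def energy_by_weighted_sum(psi, H):
    """E_0 = sum_x rho(x) F(x), Eqs. (71) and (72), summed exactly over all x."""
    norm = sum(abs(a) ** 2 for a in psi)
    return sum(abs(psi[x]) ** 2 / norm * local_energy(x, psi, H)
               for x in range(len(psi)))


def energy_direct(psi, H):
    """<Psi|H|Psi> / <Psi|Psi> as a plain double sum, for comparison."""
    n = len(psi)
    num = sum(psi[i].conjugate() * H[i][j] * psi[j]
              for i in range(n) for j in range(n))
    return num / sum(abs(a) ** 2 for a in psi)


# Toy Hermitian matrix on three basis states (illustrative values only).
H = [[0.0, -1.0, 0.0],
     [-1.0, 2.0, -1.0],
     [0.0, -1.0, 0.0]]
psi = [1.0 + 0.0j, 0.5 + 0.5j, 1.0 + 0.0j]  # amplitudes <x|Psi>, all nonzero

e_weighted = energy_by_weighted_sum(psi, H)
e_direct = energy_direct(psi, H)
```

Both routes give the same number because the factor $\langle\Psi|x\rangle$ in the denominator of $F(x)$ cancels against one factor in $\rho(x)$. The sketch assumes that all amplitudes are nonzero, which is also required for the ratio in Eq. (73) to be defined.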

Hereafter, let us consider an $N_{s}$-site system with $N/2$ up-spin and $N/2$ down-spin electrons. (The extension to arbitrary numbers of up- and down-spin electrons, $N_{\uparrow}$ and $N_{\downarrow}$, respectively, is straightforward.) The Monte Carlo sampling can be performed by importance sampling, in which samples are generated with the probability $\rho(x)$; the simple average of $F(x)$ over the samples then gives the estimate of $E_{0}$. Here, $|x\rangle$ is a sample of a real-space configuration (a simple product state) such as $(0,\uparrow,\uparrow,\uparrow\downarrow,\downarrow,0,\cdots)$, namely

|x\rangle=\prod_{\ell}^{N/2}c_{\ell,\uparrow}^{\dagger}\prod_{\ell^{\prime}}^{N/2}c_{\ell^{\prime},\downarrow}^{\dagger}|0\rangle,

(74)

where the sets of real-space coordinates $\{\ell\}$ and $\{\ell^{\prime}\}$ specify the positions of the up- and down-spin electrons, respectively, in the given sample. Since ${\cal H}$ is a sparse matrix for systems with short-ranged hoppings and interactions, with nonzero matrix elements only within their ranges, the summation over $|x^{\prime}\rangle$ can be taken explicitly for each $|x\rangle$.

To enhance the representability of the wave function, one can optionally introduce the “Fermi distribution factor” $f$ on top of Eq.(68) (or Eqs.(69) and (70)) similarly to the finite-temperature Boltzmann factor in the classical Boltzmann machine. Namely, in this scheme, $|\Psi_{0\ \sigma}^{\rm Hub}\rangle$ is modified and replaced by

|\Psi_{0\ \sigma}^{\rm Hub}\rangle=\prod_{q}f(\beta,q)|\Psi_{\sigma}^{\rm Hub}(q)\rangle,

(75)

f(\beta,q)=\frac{1}{1+e^{\beta{\cal E}(q)}},

(76)

{\cal E}(q)=\sum_{\sigma}\frac{\langle\Psi_{\sigma}^{\rm MCFM}(q)|{\cal H}^{\rm MCFM}|\Psi_{\sigma}^{\rm MCFM}(q)\rangle}{\langle\Psi_{\sigma}^{\rm MCFM}(q)|\Psi_{\sigma}^{\rm MCFM}(q)\rangle},

|\Psi_{\sigma}^{\rm MCFM}(q)\rangle=\prod_{\sigma}[\alpha_{0,q}\tilde{c}_{q,\sigma}^{\dagger}+\sum_{m=1}^{M}\alpha_{m,q}\tilde{d}_{m,q,\sigma}^{\dagger}]|0\rangle,

(77)

where $\beta$ is an additional variational parameter, $|\Psi_{\sigma}^{\rm MCFM}(q)\rangle$ is the eigenstate of Eq. (55) with the momentum $q$, and $|\Psi_{\sigma}^{\rm Hub}(q)\rangle$ is the same as in Eq. (70). The product with respect to $q$ in Eq. (75) is over the full Brillouin zone. In the limit $\beta\rightarrow\infty$, it reduces to the original algorithm using Eqs. (69) and (70). Since the original form is thus contained as a special case, this Fermi machine extension can only enhance the representability, with a strategy conceptually similar to the Boltzmann machine [30, 31] but using a substantially different quantum algorithm.
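The behavior of the factor in Eq. (76) as a function of $\beta$ can be illustrated with a short Python sketch. The energies used below are illustrative placeholders for ${\cal E}(q)$ and are not computed from the MCFM. The overflow-safe evaluation is a choice made for this sketch.

```python
import math


def fermi_factor(beta, energy):
    """f = 1 / (1 + exp(beta * energy)) of Eq. (76), evaluated without overflow."""
    z = beta * energy
    if z > 0:
        e = math.exp(-z)  # small for large positive z, never overflows
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(z))


energies = [-2.0, -0.5, 0.5, 2.0]  # illustrative E(q), measured from the Fermi level

soft = [fermi_factor(1.0, e) for e in energies]     # finite beta: smooth weights
sharp = [fermi_factor(1.0e3, e) for e in energies]  # large beta: step-like weights
```

For large $\beta$ the factor approaches 1 for negative ${\cal E}(q)$ and 0 for positive ${\cal E}(q)$, so that only the momenta below the Fermi level keep a finite weight, while a finite $\beta$ gives every momentum a nonzero weight that is optimized through the single parameter $\beta$.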

In practical calculations, it is useful to work in the matrix form: $|\Psi_{0\ \sigma}^{\rm Hub}\rangle$ is represented by an $N_{s}\times N/2$ matrix ${\cal P}_{\sigma}$, whose matrix elements are given from Eq. (70) by

({\cal P}_{\sigma})_{\ell,k}=\sum_{m=0}^{M}\alpha_{m,q(k,\sigma)}e^{iq(k,\sigma)\ell}(-)^{mn_{\ell,-\sigma}}.

(78)

Here, $q(k,\sigma)$ is the momentum of the $k$ th occupied electron with spin $\sigma$ . The matrix representation of the sample $|x\rangle\rightarrow{\cal X}$ is

\displaystyle({\cal X}_{\sigma})_{\ell,\ell^{\prime}}=\sum_{n}^{N_{s}}\sum_{i}^{N/2}\delta_{\ell,n}\delta_{\ell^{\prime},i}

(79)

if the $i$ th spin- ${\sigma}$ electron is at the site $n$ in the given sample.

Since $\langle x^{\prime}|{\cal H}|x\rangle$ is easily calculated in Eq.(73), here we discuss how one can calculate $\langle\Psi_{0}^{\rm Hub}|x^{\prime}\rangle$ and $\langle x|\Psi_{0}^{\rm Hub}\rangle$ in the matrix representation. By defining $2N_{s}\times N$ matrices

{\cal P}=\begin{pmatrix}{\cal P}_{\uparrow}&0\\ 0&{\cal P}_{\downarrow}\end{pmatrix}

(80)

and

{\cal X}=\begin{pmatrix}{\cal X}_{\uparrow}&0\\ 0&{\cal X}_{\downarrow}\end{pmatrix},

(81)

we can calculate $\langle x|\Psi_{0}^{\rm Hub}\rangle$ from the determinant of the matrix product ${}^{T}{\cal X}{\cal P}$, which is an $N\times N$ matrix, where ${}^{T}{\cal X}$ is the transpose of ${\cal X}$. In Eq. (78), we need to be careful about the dependence on $n_{\ell,-\sigma}$. Since $n_{\ell,-\sigma}=\sum_{k}({\cal X}_{-\sigma})_{\ell,k}$, ${\cal P}_{\sigma}$ depends on ${\cal X}_{-\sigma}$. This dependence is nevertheless easy to handle numerically, because in practice $|\Psi_{0}^{\rm Hub}\rangle$ appears only in the inner product with $\langle x|$.
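The evaluation of $\langle x|\Psi_{0}^{\rm Hub}\rangle$ through Eqs. (78)-(81) can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the function names (`det`, `p_matrix`, `amplitude`) are hypothetical, the same amplitudes and occupied momenta are assumed for both spins, and the two-site example values are chosen by hand rather than obtained from an MCFM ground state. The sketch uses the facts that ${}^{T}{\cal X}_{\sigma}{\cal P}_{\sigma}$ simply selects the rows of ${\cal P}_{\sigma}$ at the occupied sites and that the determinant of a block-diagonal matrix is the product of the determinants of its blocks.

```python
import cmath


def det(a):
    """Determinant of a small square matrix by Gaussian elimination with pivoting."""
    a = [row[:] for row in a]
    n, d = len(a), 1.0 + 0j
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(a[r][c]))
        if abs(a[p][c]) < 1e-14:
            return 0j
        if p != c:
            a[c], a[p] = a[p], a[c]
            d = -d  # a row exchange flips the sign
        d *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            for k in range(c, n):
                a[r][k] -= f * a[c][k]
    return d


def p_matrix(momenta, alphas, n_opp, n_sites):
    """(P_sigma)_{l,k} of Eq. (78); n_opp[l] is the opposite-spin occupation at site l."""
    return [[sum(a * cmath.exp(1j * q * l) * (-1) ** (m * n_opp[l])
                 for m, a in enumerate(alphas[k]))
             for k, q in enumerate(momenta)]
            for l in range(n_sites)]


def amplitude(up_sites, dn_sites, momenta, alphas, n_sites):
    """<x|Psi_0^Hub> as det(X^T P), factorized into the up and down blocks."""
    n_up = [1 if l in up_sites else 0 for l in range(n_sites)]
    n_dn = [1 if l in dn_sites else 0 for l in range(n_sites)]
    p_up = p_matrix(momenta, alphas, n_dn, n_sites)  # P_up depends on X_down
    p_dn = p_matrix(momenta, alphas, n_up, n_sites)  # P_down depends on X_up
    return det([p_up[l] for l in up_sites]) * det([p_dn[l] for l in dn_sites])


# Two sites, one electron per spin in q = 0, M = 1 with equal amplitudes (assumed).
momenta = [0.0]
alphas = [[0.5 ** 0.5, 0.5 ** 0.5]]
amp_separate = amplitude([0], [1], momenta, alphas, 2)  # electrons on different sites
amp_double = amplitude([0], [0], momenta, alphas, 2)    # doubly occupied site
```

In this example the doubly occupied configuration has zero amplitude, while the configuration with the two electrons on different sites has a finite one. Because ${\cal P}_{\sigma}$ is rebuilt for each sample from the opposite-spin occupations, the dependence of Eq. (78) on $n_{\ell,-\sigma}$ costs only a sign lookup per matrix element.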

In this way, $E_{0}$ can be calculated by Monte Carlo sampling of $|x\rangle$ as $\sum_{x}F(x)/N_{\rm sample}$ with importance sampling, in which the samples are generated with the probability $w\propto|\langle x|\Psi_{0}^{\rm Hub}\rangle|^{2}$. Here, $N_{\rm sample}$ is the number of samples and

F(x)=\sum_{x^{\prime}}\frac{{\rm det}[{}^{T}{\cal P}{\cal X}^{\prime}\,{}^{T}{\cal X}^{\prime}{\cal H}{\cal X}]}{{\rm det}[{}^{T}{\cal P}{\cal X}]}

(82)

is estimated from the ratio of two determinants of $N\times N$ matrices. Then the variational parameters in Eqs. (55)-(58) are optimized to lower $E_{0}$ .

In the matrix representation, $|\Psi_{\sigma}^{\rm MCFM}(q)\rangle$ and $|\Psi_{\sigma}^{\rm Hub}(q)\rangle$ are $N_{s}$ -component vectors and for instance, $\langle\Psi_{\sigma}^{\rm MCFM}(q)|{\cal H}^{\rm MCFM}|\Psi_{\sigma}^{\rm MCFM}(q)\rangle$ is a scalar for diagonalized $N_{s}\times N_{s}$ matrix of ${\cal H}^{\rm MCFM}$ in the momentum representation of Eqs. (55)-(58).

As a further extension toward a more accurate variational form, the long-range part of quantum entanglement may be incorporated more efficiently by adding layers connected by nonlocal hybridization such as

{\cal H}_{\rm nonlocal}={\cal H}^{(\tilde{d})}_{\rm nonlocal}+{\cal H}^{(\tilde{c}\tilde{d})}_{\rm nonlocal},

(83)

{\cal H}^{(\tilde{d})}_{\rm nonlocal}=\sum_{\langle i,j\rangle\sigma}(-t_{\tilde{d},i,j}^{(M+1)})(\tilde{d}_{i,\sigma}^{(M+1)\dagger}\tilde{d}_{j,\sigma}^{(M+1)}+{\rm H.c.})+\sum_{i,\sigma}\mu_{\tilde{d},i}^{(M+1)}n_{\tilde{d},i,\sigma}^{(M+1)},

{\cal H}^{(\tilde{c}\tilde{d})}_{\rm nonlocal}=\sum_{\sigma,\sigma^{\prime}}\sum_{i,j}(\Lambda^{(M)}_{\sigma,\sigma^{\prime}}(j-i)\tilde{c}_{i,\sigma}^{\dagger}\tilde{d}_{j,\sigma^{\prime}}^{(M+1)}+{\rm H.c.})=\sum_{\sigma,\sigma^{\prime}}\sum_{k}(\Lambda^{(M)}_{\sigma,\sigma^{\prime}}(k)\tilde{c}_{k,\sigma}^{\dagger}\tilde{d}_{k,\sigma^{\prime}}^{(M+1)}+{\rm H.c.})

(84)

as is illustrated in Fig. 2. Here, the spin dependence is taken to satisfy the spin rotational symmetry. To make the correspondence between $\tilde{d}^{(M+1)}$ and $c$ , one may generalize Eq. (2) or more concretely, $\tilde{d}_{j,\sigma^{\prime}}^{(M+1)}\leftrightarrow\sum_{i,\sigma}\Theta_{\sigma^{\prime},\sigma}(j-i)c_{i,\sigma}(1-2n_{j,\sigma^{\prime}})$ , which leads to the replacement of Eq. (70) with

|\Psi_{\sigma}^{\rm Hub}(q)\rangle=[\sum_{m=0}^{M}\alpha_{m,q}\sum_{\ell}\frac{e^{iq\ell}}{\sqrt{N_{s}}}c_{\ell,\sigma}^{\dagger}(-)^{mn_{\ell,-\sigma}}+\alpha_{M+1,q}\sum_{\ell,\ell^{\prime},\sigma^{\prime}}\frac{e^{iq\ell^{\prime}}}{\sqrt{N_{s}}}\Theta_{\sigma^{\prime},\sigma}(\ell^{\prime}-\ell)c_{\ell,\sigma}^{\dagger}(-)^{n_{\ell^{\prime},\sigma^{\prime}}}]|0\rangle,

(85)

which means that the spin-dependent $\Theta_{\sigma,\sigma^{\prime}}(\ell^{\prime}-\ell)$ contains the spin-parallel ($\sigma=\sigma^{\prime}$) and antiparallel ($\sigma=-\sigma^{\prime}$) components. $\Theta_{\sigma,\sigma^{\prime}}(\ell^{\prime}-\ell)$ is regarded as a set of additional variational parameters. The present extension explicitly allows the weight of the wave function at the $\ell$-th site with spin $\sigma$ to depend on the occupation of the parallel ($\sigma$) or antiparallel ($-\sigma$) component of the fermion at the $\ell^{\prime}$-th site. This plays a role similar to the Jastrow factor in the conventional variational Monte Carlo or to the nonlocal coupling between the visible (physical) and hidden variables in the Boltzmann machine. However, the present nonlocal hybridization explicitly introduces spatially extended quantum entanglement between up- and down-spin electrons. In addition, the present scheme does not break the SU(2) symmetry, in contrast to the spin Jastrow factor in the variational Monte Carlo and the Boltzmann machine. This is because the amplitude of $|\Psi_{\sigma}^{\rm Hub}(q)\rangle$ for the component with the up spin at the $i$-th site and the down spin at the $j$-th site has the same value as that for the opposite configuration, and the matrix element of the spin-diagonal component $\langle\Psi_{\sigma}^{\rm Hub}(q)|S_{i}^{z}S_{j}^{z}|\Psi_{\sigma}^{\rm Hub}(q)\rangle$ becomes the same as the off-diagonal ones $\langle\Psi_{\sigma}^{\rm Hub}(q)|S_{i}^{+}S_{j}^{-}|\Psi_{\sigma}^{\rm Hub}(q)\rangle=\langle\Psi_{\sigma}^{\rm Hub}(q)|S_{i}^{-}S_{j}^{+}|\Psi_{\sigma}^{\rm Hub}(q)\rangle$. This nonlocal hybridization part may also be extended to deeper layers, the $M+3$ rd, $M+4$ th, $\cdots$ up to the $M+M^{\prime}$ th layer, if necessary.

In this extension, Eqs. (62) and (63) become more complicated because of the nonlocality, but the extension may enable the mutual orthogonality of $\tilde{d}_{i,\sigma}^{(m)}$ in the Hubbard terminology expected for Eq. (53). If we include the $k$-independent component of $\Lambda$ in Eq. (84), the local part represented by the first term on the r.h.s. of Eq. (85) can be absorbed, although only the antiparallel spin combination is relevant for that $k$-independent component. In this sense, a further generalized and simpler architecture is represented by the following unified form instead of Eqs. (70) and (85):

|\Psi_{\sigma}^{\rm Hub}(q)\rangle=\sum_{m=0}^{M}\alpha_{m,q}\sum_{\ell,\ell^{\prime},\sigma^{\prime}}\frac{e^{iq\ell^{\prime}}}{\sqrt{N_{s}}}\Theta_{\sigma^{\prime},\sigma}^{(m)}(\ell^{\prime}-\ell)c_{\ell,\sigma}^{\dagger}(-)^{mn_{\ell^{\prime},\sigma^{\prime}}}|0\rangle.

(86)

Alternatively, one can extend in the form

\frac{e^{iq\ell}}{\sqrt{N_{s}}}\tilde{d}^{(m)\dagger}_{\ell,\sigma}\rightarrow\sum_{\ell^{\prime},\sigma^{\prime}}\frac{e^{iq\ell^{\prime}}}{\sqrt{N_{s}}}\Theta_{\sigma^{\prime},\sigma}^{(m)}(q,\ell^{\prime}-\ell)d^{(m-1)\dagger}_{\ell,\sigma}(-)^{n_{\tilde{d}^{(m-1)},\ell^{\prime},\sigma^{\prime}}}

(87)

\rightarrow\Xi^{(m)}_{\sigma}(q,\ell)c_{\ell,\sigma}^{\dagger},

(88)

\Xi^{(m)}_{\sigma}(q,\ell)=\prod_{n=1}^{m}\left[\sum_{\ell_{n},\sigma_{n}}\frac{e^{iq\ell_{n}}}{\sqrt{N_{s}}}\Theta_{\sigma_{n},\sigma}^{(m-n+1)}(q,\ell_{n}-\ell)(-)^{n_{\ell_{n},\sigma_{n}}}\right],

(89)

\Xi^{(0)}_{\sigma}(q,\ell)=\frac{e^{iq\ell}}{\sqrt{N_{s}}},

(90)

which leads to

|\Psi_{\sigma}^{\rm Hub}(q)\rangle=\sum_{m=0}^{M}\alpha_{m,q}\sum_{\ell}\Xi_{\sigma}^{(m)}(q,\ell)c_{\ell,\sigma}^{\dagger}|0\rangle.

(91)

The variational optimization procedure is summarized as

• For given parameters in Eqs. (55)-(58), determine $\alpha_{m,k}$ for the ground state of the MCFM and then $|\Psi_{0}^{\rm Hub}\rangle$ in the form of Eqs. (67) and (69) with Eq. (70), (85), (86) or (91).

• Calculate $\langle x|\Psi_{0}^{\rm Hub}\rangle$ for a Monte Carlo sample given by a real-space configuration $|x\rangle$ in the matrix representation ${\rm det}[{}^{T}{\cal X}{\cal P}]$.

• Continue the Monte Carlo sampling with the importance sampling weight $\rho(x)=|\langle x|\Psi_{0}^{\rm Hub}\rangle|^{2}/\langle\Psi_{0}^{\rm Hub}|\Psi_{0}^{\rm Hub}\rangle$, which is proportional to $|{\rm det}[{}^{T}{\cal X}{\cal P}]|^{2}$, and calculate the energy expectation value

E_{0}=\frac{1}{N_{\rm sample}}\sum_{x}F(x)=\frac{1}{N_{\rm sample}}\sum_{x,x^{\prime}}\frac{{\rm det}[{}^{T}{\cal P}{\cal X}^{\prime}\,{}^{T}{\cal X}^{\prime}{\cal H}{\cal X}]}{{\rm det}[{}^{T}{\cal P}{\cal X}]}.

(92)

• Optimize the variational parameters in Eqs. (55)-(58) to lower $E_{0}$ until convergence. For the optimization, the natural gradient [37, 38] (stochastic reconfiguration [39]) can be used.
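The sampling and averaging steps of the procedure above can be sketched in Python on a toy problem. Everything in this sketch is an illustrative assumption: the configuration space is a single particle on a 4-site ring instead of Hubbard configurations, the amplitude is a hand-chosen real vector instead of ${\rm det}[{}^{T}{\cal X}{\cal P}]$, and the Metropolis rule with a symmetric proposal is one common way to realize importance sampling with the weight $\rho(x)$. The parameter optimization step is not included.

```python
import random

N_STATES = 4  # toy configuration space: positions on a 4-site ring


def metropolis_energy(psi, local_energy, propose, x0, n_samples, rng):
    """Average F(x) over samples drawn with probability proportional to |psi(x)|**2."""
    x, total = x0, 0.0
    for _ in range(n_samples):
        x_new = propose(x, rng)
        # Metropolis acceptance for a symmetric proposal
        if rng.random() < abs(psi(x_new)) ** 2 / abs(psi(x)) ** 2:
            x = x_new
        total += local_energy(x)
    return total / n_samples


def make_toy(amps):
    """Real amplitudes psi(x) and the local energy for H = -(nearest-neighbor hopping)."""
    def psi(x):
        return amps[x % N_STATES]

    def local_energy(x):
        # F(x) = sum_x' psi(x') / psi(x) * <x'|H|x> with <x'|H|x> = -1 for neighbors
        return -(psi(x - 1) + psi(x + 1)) / psi(x)

    return psi, local_energy


def propose(x, rng):
    """Move to one of the two neighboring states with equal probability."""
    return (x + rng.choice((-1, 1))) % N_STATES


# An exact eigenstate gives F(x) = -2 for every sample (zero variance).
psi_u, f_u = make_toy([1.0, 1.0, 1.0, 1.0])
e_uniform = metropolis_energy(psi_u, f_u, propose, 0, 1000, random.Random(0))

# A non-eigenstate trial function gives a noisy estimate of <H>.
psi_t, f_t = make_toy([1.0, 0.8, 1.0, 0.8])
e_trial = metropolis_energy(psi_t, f_t, propose, 0, 20000, random.Random(1))
e_exact = -6.4 / 3.28  # <psi|H|psi>/<psi|psi> for the trial amplitudes above
```

In the actual method, `psi` would be replaced by the determinant of the second step and `local_energy` by Eq. (82), and the estimate would be fed to an optimizer such as the natural gradient mentioned in the last step. The zero-variance behavior for an exact eigenstate is a general property of the estimator $F(x)$ and is a convenient check of an implementation.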

So far, we have restricted the formulation to systems with translational invariance, so that the momentum is a good quantum number. However, it can be easily extended to cases without translational invariance, such as systems with site-dependent random potentials or interactions and/or bond-random hoppings. In these cases, Eq. (65) is replaced by

|\Psi_{0\ \sigma}^{\rm MCFM}\rangle=\prod_{k}^{k_{\rm F}}|\Psi_{\sigma}^{\rm MCFM}(k)\rangle,

(93)

where $k$ is not the momentum any more and represents the $k$ -th lowest eigenstate of the MCFM Hamiltonian containing randomness, which can be obtained from the diagonalization of the MCFM Hamiltonian giving the amplitude of the eigenfunction represented by the component $\alpha_{m,\ell,k}$ for the site $\ell$ in the $m$ -th layer. Then, Eq. (66) is replaced with

|\Psi_{\sigma}^{\rm MCFM}(k)\rangle=[\alpha_{0,\ell,k}\tilde{c}_{\ell,\sigma}^{\dagger}+\sum_{m=1}^{M}\alpha_{m,\ell,k}\tilde{d}_{m,\ell,\sigma}^{\dagger}]|0\rangle,

(95)

and the corresponding Hubbard state (68) is modified to

\displaystyle|\Psi_{0\ \sigma}^{\rm Hub}\rangle=\prod_{k}^{k_{\rm F}}[\sum_{m=0}^{M}\alpha_{m,\ell,k}c_{\ell,\sigma}^{\dagger}(1-2n_{\ell,-\sigma})^{m}]|0\rangle.

(96)

The further refinements discussed above may also be applied. As a simple starting point, one may impose a constraint of “quenched randomness” on the construction of the MCFM, which means that the MCFM parameters are taken not to depend on the layer index for the same site or bond.

5 Benchmark for 4-site Hubbard model

Here, the validity of the present formalism is tested in the 4-site Hubbard model at half filling and with hole doping. The MCFM results are compared with the exact ground state obtained by numerical diagonalization. To keep the parameters as simple as possible, we do not introduce deep layers beyond the first hidden layer, and the interlayer couplings $\Lambda^{(m)}$ with $m\geq 2$ are switched off in this study. However, the nonlocal hybridization is taken into account in Sec. 5.2. The parameter $\beta$ in Eq. (76) is also taken to be $\infty$, so that the Fermi distribution is taken at zero temperature.

Table 1: Comparison of estimated ground-state energies per site $E_{0}$ of the half-filled 4-site Hubbard model between the exact diagonalization and the Fermi machine, with a list of examples of the optimized variational parameters. Common (hyper)parameters are $M=1$, $M^{\prime}=0$, and $t_{\tilde{c}}=1$.

	exact $E_{0}$	$E_{0}$ by Fermi machine	$t_{\tilde{d}}$	$\Lambda$	$\mu_{\tilde{c}}$	$\mu_{\tilde{d}}$
$U=4$	-2.10275	-2.10275	1.0	-1.002	1.002	4.644
$U=8$	-1.32023	-1.32023	1.0	-2.238	2.238	7.229
	-1.32023	-1.32023	2.0	-3.330	3.330	10.731

Table 2: Comparison of estimated ground-state energies per site $E_{0}$ of the hole-doped 4-site Hubbard model between the exact diagonalization and the Fermi machine, with an example of optimized variational parameters for $U=8,t=1$ with one up-spin and one down-spin hole doped. Parameters not listed in the table are $M=1$, $M^{\prime}=1$, $t_{\tilde{c}}=t_{\tilde{d}}=1$, $t_{\tilde{d},i,j,\sigma}^{(M+1)}=0$, and $\mu_{\tilde{d},i}^{(M+1)}=8.0$. The nonlocal hybridization is taken in a simple form as $\Lambda^{(M)}_{\sigma,\sigma^{\prime}}[k]=\lambda^{(M)}_{\sigma,\sigma^{\prime}}\cos[k]$, so that the interlayer hybridization is extended simply to the nearest-neighbor pair of $\{i,j\}$ in Eq. (84). Accordingly, we take $\Theta_{\sigma,\sigma^{\prime}}(i-j)=1$ only for the nearest-neighbor pair of $\{i,j\}$ for any combination of $\sigma$ and $\sigma^{\prime}$, and zero otherwise in Eq. (85).

	exact $E_{0}$	$E_{0}$ by Fermi machine	$\Lambda$	$\mu_{\tilde{c}}$	$\mu_{\tilde{d}}$	$\lambda^{(M)}_{\uparrow,\uparrow}=\lambda^{(M)}_{\downarrow,\downarrow}$	$\lambda^{(M)}_{\uparrow,\downarrow}=\lambda^{(M)}_{\downarrow,\uparrow}$
$U=8$	-3.20775	-3.20775	-2.71408	2.71408	4.73336	-0.35912	-0.71824

5.1 Half-filled 4-site Hubbard model

The ground-state energies of the half-filled 4-site chain with the periodic boundary condition at $t=1$ are compared between the present Fermi machine using the level of Eq. (70) and the exact energies of the Hubbard model at $U=4$ and 8 in Table 1. The optimized parameters are not unique and are redundant, namely, many optimized parameters can equally reproduce the exact results. In Table 1, two examples are shown for $U=8$ . Here simple parameters similar to the 2-site case are chosen. This redundancy indicates the flexibility and representability of the present framework.

5.2 Doped Case

Table 2 shows the doped case (one up-spin and one down-spin hole doped) for $U=8$ and $t=1$. Here, the nonlocal hybridization term is necessary to reproduce the exact result. Still, a small number of parameters is sufficient.

We note that the applicability of the present method is not limited to a particular range of the Hamiltonian parameters and covers $0\leq U/t\leq\infty$, as is clarified in the atomic limit, the two-site system and the numerical 4-site results.

6 Discussion, Summary and Outlook

In the conventional variational Monte Carlo method, the fermion Slater determinant or Pfaffian is used as the starting point [21]. A crucial difference of the present Fermi machine is the entanglement with the hidden fermion degrees of freedom hybridizing with the physical (visible) fermions in the physical Hubbard model. This allows the formation of the gap structure in the spectra without spontaneous symmetry breaking, generating zeros of the Green’s functions as well as poles, as in the case of the genuine Mott insulator. This hybridization enables the flexible restructuring of the nodes of the wave function beyond the “single-particle electronic structure” representable by the visible (physical) electrons for electronic systems. Except for the starting trial state represented by the Slater determinant or Pfaffian, the deeper levels of the architecture of the conventional variational wave functions in the literature mostly have a classical nature, such as the Gutzwiller and Jastrow factors. This is in marked contrast with the present approach, where the deeper levels of variables have a fermionic nature and entangle with the physical (visible) fermions, efficiently optimizing the node structure. The Boltzmann machine shows the same contrast in this regard, because the hidden variable is a classical Ising spin and does not quantum mechanically entangle with the visible quantum variables. A recently proposed neural network [40] introduces an unusually constrained type of fermions as the hidden variables, whose wave function is determined solely from the real-space particle configuration of the visible fermions.

In the present work, the correspondence between strongly correlated fermions and multi-component noninteracting fermions is formulated. It offers an algorithm for a quantum many-body solver. This Fermi machine variationally approaches the ground state of correlated electrons by introducing dark (hidden) fermions hybridizing with the physical fermions. This construction substantiates the fractionalization of fermions emerging from the strong correlation effect, which has indeed been supported by the analyses of experimental results as well as the model studies of Mott insulators and high-$T_{c}$ superconductors in the literature [8, 12, 10, 13, 41].

The present correspondence establishes the relation between the “bulk” noninteracting multi-component fermions and the “edge” strongly correlated fermions. This has a conceptual similarity to other correspondences in physics such as the holographic correspondence in AdS/CFT [42, 43, 44], the bulk-edge correspondence in topological materials [45, 46] and the mapping of $d$-dimensional quantum systems to $(d+1)$-dimensional classical ones in the path integral, though the physical contents are substantially different. Deeper connections to these three frontiers are interesting future issues.

The efficiency of the present formalism for larger system sizes in practical use is an important subject to be pursued in the future. For this purpose, efficient optimization of the variational parameters must be examined and tested, which is left for future studies.

In this paper, we have examined the ground state of the noninteracting MCFM represented by a Slater determinant. However, in the conventional variational Monte Carlo, pair-product wave functions instead of Slater determinants show higher accuracy [21]. It is an intriguing future issue whether the mapping from ground states of the Hartree-Fock-Bogoliubov Hamiltonian, namely, mean-field ground states of a symmetry-broken Hamiltonian with, for example, superconducting or magnetic mean-field order, shows higher accuracy.

Acknowledgements The author is grateful to Wei-Lin Tu for useful comments. The author also thanks Filippo Vicentini, Ryui Kaneko and Shiro Sakai for discussions. This work is financially supported by MEXT KAKENHI, Grant-in-Aid for Transformative Research Area (Grant No. JP22H05111 and No. JP22H05114). This work is also supported by MEXT as “Program for Promoting Researches on the Supercomputer Fugaku” (Simulation for basic science: approaching the new quantum era, Grant No. JPMXP1020230411).

Appendix A Representability of the ground state

In this Appendix we discuss the representability of the wave function by the present scheme. When we exchange the order of the product with respect to $q$ and the summation over $m$ in Eqs. (69) and (70), we obtain

|\Psi_{0\ \sigma}^{\rm Hub}\rangle=\sum_{m=0}^{M}|\Psi_{\sigma,m}^{\rm Hub}\rangle,

(97)

|\Psi_{\sigma,m}^{\rm Hub}\rangle=\prod_{q}^{k_{\rm F}}[\alpha_{m,q}\sum_{\ell}\frac{e^{iq\ell}}{\sqrt{N_{s}}}c_{\ell,\sigma}^{\dagger}(-)^{mn_{\ell,-\sigma}}]|0\rangle.

This can be interpreted as the representation by a linear combination of $M$ Slater determinants, where the $m$-th one is $|\Psi_{\sigma,m}^{\rm Hub}\rangle$, aside from the entanglement of $\sigma$ and $-\sigma$ electrons contained in the odd-$m$ terms. This exchange is justified in the following setup: by taking $\alpha_{m,q}=0$ for odd $m$, which is the case for $\mu_{\tilde{d},i}^{(m)}=\infty$ for odd $m$, the even-$m$ layers are indeed decoupled from each other, and each multi-particle state becomes a simple Slater determinant consisting of the single-layer component with even $m$. Then, if the Hamiltonian parameters of each even-$m$ layer are taken in such a way that the ground-state energy of each single-layer Slater determinant is degenerate with those of the other even-$m$ layers, we may take any linear combination of them, generating a multi-Slater determinant. (Note that in this case, the weights of the linear combination are variational parameters as well.) The construction of the MCFM Hamiltonian parameters to satisfy the degeneracy is of course possible, because the MCFM energy and occupied momenta at each layer can be freely tuned by taking an appropriate dispersion at each layer. Since the full Hilbert space can be spanned by linear combinations of fermion determinants, this is a formal proof of the representability for the Hubbard model (more generally, for any interacting lattice fermion system).

Of course, the number of Slater determinants needed for the complete representability increases rapidly with increasing system size if one follows this naive multi-determinant representation, and this proof is just formal. In practice, the increase can be suppressed by a proper choice of the odd-$m$ contribution as well as by the spatially off-diagonal hybridization, as is discussed from Eq. (83) through Eq. (91).

More importantly, Eq. (70) allows the sign change of the wave function dynamically when the opposite spin is occupied. This flexibly adjusts the nodal structure of the fermion wave function required for interacting systems.

References

[1] S.-i. Tomonaga, Prog. Theor. Phys. 5, 544 (1950).
[2] J. M. Luttinger, J. Math. Phys. 4, 1154 (1963).
[3] A. Heeger, S. Kivelson, J. R. Schrieffer, and W. P. Su, Rev. Mod. Phys. 60, 781 (1988).
[4] R. B. Laughlin, Phys. Rev. Lett. 50, 1395 (1983).
[5] F. D. M. Haldane, Phys. Rev. Lett. 61, 2015 (1988).
[6] S. Sachdev, Ann. Phys. (NY) 303, 226 (2003).
[7] G. Kotliar and A. E. Ruckenstein, Phys. Rev. Lett. 57, 1362 (1986).
[8] S. Sakai, M. Civelli, and M. Imada, Phys. Rev. Lett. 116, 057003 (2016).
[9] M. Imada and T. J. Suzuki, J. Phys. Soc. Jpn. 88, 024701 (2019).
[10] M. Imada, J. Phys. Soc. Jpn. 90, 074702 (2021).
[11] M. Imada, J. Phys. Soc. Jpn. 90, 111009 (2021).
[12] Y. Yamaji, T. Yoshida, A. Fujimori, and M. Imada, Phys. Rev. Res. 3, 043099 (2021).
[13] A. Singh, H. Y. Huang, J. D. Xie, J. Okamoto, C. T. Chen, T. Watanabe, A. Fujimori, M. Imada, and D. J. Huang, Nat. Commun. 13, 7906 (2022).
[14] R. Blankenbecler, D. J. Scalapino, and R. L. Sugar, Phys. Rev. D 24, 2278 (1981).
[15] S. Sorella, S. Baroni, R. Car, and M. Parrinello, Europhys. Lett. 8, 663 (1989).
[16] M. Imada and Y. Hatsugai, J. Phys. Soc. Jpn. 58, 3752 (1989).
[17] C. Gros, Phys. Rev. B 38, 931 (1988).
[18] H. Yokoyama and H. Shiba, J. Phys. Soc. Jpn. 57, 2482 (1988).
[19] S. Sorella, Phys. Rev. B 71, 241103 (2005).
[20] D. Tahara and M. Imada, J. Phys. Soc. Jpn. 77, 114701 (2008).
[21] T. Misawa, S. Morita, K. Yoshimi, M. Kawamura, Y. Motoyama, K. Ido, T. Ohgoe, M. Imada, and T. Kato, Computer Physics Communications 235, 447 (2019).
[22] S. Zhang, J. Carlson, and J. E. Gubernatis, Phys. Rev. B 55, 7464 (1997).
[23] S. R. White, Phys. Rev. Lett. 69, 2863 (1992).
[24] F. Verstraete, V. Murg, and J. Cirac, Advances in Physics 57, 143 (2008).
[25] R. Orús, Ann. Phys. 349, 117 (2014).
[26] Z. Y. Xie, J. Chen, M. P. Qin, J. W. Zhu, L. P. Yang, and T. Xiang, Phys. Rev. B 86, 045139 (2012).
[27] G. Knizia and G. K.-L. Chan, Phys. Rev. Lett. 109, 186404 (2012).
[28] W. Metzner and D. Vollhardt, Phys. Rev. Lett. 62, 324 (1989).
[29] A. Georges, G. Kotliar, W. Krauth, and M. J. Rozenberg, Rev. Mod. Phys. 68, 13 (1996).
[30] G. Carleo and M. Troyer, Science 355, 602 (2017).
[31] Y. Nomura, A. S. Darmawan, Y. Yamaji, and M. Imada, Phys. Rev. B 96, 205152 (2017).
[32] L. L. Viteritti, R. Rende, and F. Becca, Phys. Rev. Lett. 130, 236401 (2023).
[33] R. Rende, L. Viteritti, F. Becca, and S. Goldt, arXiv:2310.05715 .
[34] L. M. Roth, Phys. Rev. 184, 451 (1969).
[35] S. Onoda and M. Imada, Phys. Rev. B 67, 161102 (2003).
[36] S. Sakai, M. Civelli, and M. Imada, Phys. Rev. B 94, 115130 (2016).
[37] S.-i. Amari, in Advances in Neural Information Processing Systems 9, ed. by M. C. Mozer, M. I. Jordan, and T. Petsche (MIT Press, Cambridge, MA, 1996), p. 127.
[38] S.-i. Amari, Neural Computation 10, 251 (1998).
[39] S. Sorella, Phys. Rev. Lett. 80, 4558 (1998).
[40] J. R. Moreno, G. Carleo, A. Georges, and J. Stokes, Proceedings of the National Academy of Sciences 119, e2122059119 (2022).
[41] M. T. Schmid, J.-B. Morée, R. Kaneko, Y. Yamaji, and M. Imada, Phys. Rev. X 13, 041036 (2023).
[42] J. Maldacena, Int. J. Theor. Phys. 38, 1113 (1999).
[43] S. Hartnoll, A. Lucas, and S. Sachdev, Holographic Quantum Matter (MIT Press, 2018).
[44] K. Hashimoto, S. Sugishita, A. Tanaka, and A. Tomiya, Phys. Rev. D 98, 046019 (2018).
[45] M. Z. Hasan and C. L. Kane, Rev. Mod. Phys. 82, 3045 (2010).
[46] Y. Ando, J. Phys. Soc. Jpn. 82, 102001 (2013).
