This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Single-particle-exact density functional theory

Martin-Isbjörn Trappe [email protected] Jun Hao Hue [email protected] Jonah Huang Zi Chao [email protected] Mikołaj Paraniak [email protected] Djamila Hiller [email protected] Jerzy Ciosłowski [email protected] Berthold-Georg Englert [email protected]
Abstract

We introduce ‘single-particle-exact density functional theory’ (1pEx-DFT), a novel density functional approach that represents all single-particle contributions to the energy with exact functionals. Here, we parameterize interaction energy functionals by utilizing two new schemes for constructing density matrices from ‘participation numbers’ of the single-particle states of quantum many-body systems. These participation numbers play the role of the variational variables akin to the particle densities in standard orbital-free density functional theory. We minimize the total energies with the help of evolutionary algorithms and obtain ground-state energies that are typically accurate at the one-percent level for our proof-of-principle simulations that comprise interacting Fermi gases as well as the electronic structure of atoms and ions, with and without relativistic corrections. We thereby illustrate the ingredients and practical features of 1pEx-DFT and reveal its potential of becoming an accurate, scalable, and transferable technology for simulating mesoscopic quantum many-body systems.

keywords:
orbital-free density functional theory , Levy–Lieb constrained search , reduced density matrices , fermion gases , relativistic electronic structure , evolutionary algorithms
journal: Annals of Physics
\affiliation

[CQT]organization=Centre for Quantum Technologies, National University of Singapore,addressline=3 Science Drive 2, postcode=Singapore 117543, country=Singapore

\affiliation

[LMU]organization=Fakultät für Physik, Ludwig-Maximilians-Universität München,addressline=Geschwister-Scholl-Platz 1, postcode=80539 München, country=Germany

\affiliation

[USZ]organization=Institute of Physics, University of Szczecin,addressline=Wielkopolska 15, postcode=70-451 Szczecin, country=Poland

\affiliation

[MPI]organization=Max-Planck-Institut für Physik komplexer Systeme,addressline=Nöthnitzer Straße 38, postcode=01187 Dresden, country=Germany

\affiliation

[Beijing]organization=Key Laboratory of Advanced Optoelectronic Quantum Architecture and Measurement of Ministry of Education, School of Physics, Beijing Institute of Technology,postcode=Beijing 100081, country=China

\affiliation

[DoP]organization=Department of Physics, National University of Singapore,addressline=2 Science Drive 3, postcode=Singapore 117542, country=Singapore

1 Introduction

The many variants of density functional theory (DFT) have been developed predominantly for calculating observables in position space—fueled by the decades-long success story of DFT applications in chemistry and materials science, see Refs. [1, 2, 3] and references therein—alongside a mere handful of scholarly articles devoted to density functionals in momentum space [4, 5], which target, for instance, Compton profiles [6] or momentum distributions in ultracold quantum gases [7]. The ever increasing demand of high-quality solutions to specific quantum many-body problems from across scientific disciplines has been inciting DFT developers to refine the established methods and codes as well as to develop approaches that differ distinctly from prevailing ones in the hope of discovering a methodology that is superior at least for a subset of problems—typically by sacrificing one of the competing objectives of accuracy, scalability, and transferability (in the sense of universal applicability), see Fig. 1. While the Schrödinger equation is, by definition, entirely accurate and transferable, its numerical solution is typically limited to a few interacting particles. By neglecting inter-particle correlations beyond the exchange energy, Hartree–Fock (HF) theory offers much better scalability at the expense of accuracy, and quantum chemistry methods like the coupled cluster technique fall in between HF and the Schrödinger equation. In contrast to these Hamiltonian-based approaches, the interaction energy for typical formulations of DFT like Thomas–Fermi-DFT (TF-DFT) and Kohn–Sham-DFT (KS-DFT) has to be constructed explicitly for any given interaction. This reduces transferability in practice. The TF kinetic energy functional offers unsurpassed scalability but is not accurate enough for predicting even the existence of molecular bonds. The most widely used KS-DFT often comes close to chemical accuracy, at the expense of transferability, but is typically limited to hundreds of particles.

Refer to caption
Figure 1: The trade-off between accuracy, scalability, and transferability is inherent to all quantum many-body methods and also provides a means of classifying them, with the Schrödinger equation (SE) at one end of this scale. Though orbital-free in spirit, our proposed ‘single-particle-exact-DFT’ (1pEx-DFT) sits at the intersection of wave-function-based methods and density-based methods. While the current implementation of 1pEx-DFT (orange solid-line triangle) is completely transferable like HF, it is not yet as accurate and scalable as other established many-body methods. However, we discuss remedies that would let 1pEx-DFT complement orbital-free DFT as well as KS-DFT in significant ways (dash-dotted line). This is no small feat, but even if only partially successful, such a program could eventually supersede HF. We emphasize, however, that this enterprise is speculative at present. Square brackets indicate objectives that are typically out of reach for the respective methods with common computational resources.

Historically, the search for alternatives to KS-DFT produced successful approaches like density matrix functional theory [8, 9, 10, 11, 12, 13], which aims at chemical accuracy and transferability across electronic structure problems—and whose technical aspects are very much related to our agenda in the present article—as well as modern orbital-free DFT in all its variety, e.g., density-potential functional theory (DPFT) [14, 15, 16, 17, 18], which aims at scalability toward large particle numbers and transferability across interactions and dimensionality. Still, these alternative methods are currently implemented in position space. But the Levy–Lieb constrained search [8, 19], which constitutes the fundamental justification of modern DFT, invites us to consider density functionals beyond the familiar configuration- and momentum-space representations. Here, we may hope to find new and promising many-body methods off the beaten path.

Reference [20] delivers a unified view of these possibilities through a second-quantized perspective and proposes one particular DFT formulation based on ‘participation numbers’ (referred to as ‘occupation numbers’ in Ref. [20]) of the eigenstates of the single-particle part of quantum many-body systems. This ‘single-particle-exact-DFT’ (1pEx-DFT) presents a stark departure from the established formulations of DFT. Here, we introduce the details of the general 1pEx-DFT formalism and showcase applications to atomic gases and electronic structure that communicate the practical aspects of 1pEx-DFT. Although 1pEx-DFT is universally applicable, its current implementation may not capture beyond-HF correlations efficiently enough, see Ref. [21]. While Hartree–Fock-level accuracy is usually considered insufficient for most chemistry applications, it is adequate for describing mesoscopic quantum gases [22], for which we believe 1pEx-DFT could prove particularly useful.

In Sec. 2, we derive the explicit formulation of the Levy–Lieb constrained search over the single-particle participation numbers (Sec. 2.1). For the proof-of-principle examples studied in this work, we parameterize the interaction energy in Dirac approximation [23] (or Hartree’s and Fock’s [24, 25, 26], if you like): we omit correlations beyond the HF exchange energy (see also A) and develop two new constructions of one-body reduced density matrices from participation numbers (Sec. 2.3); details are provided in B. In Sec. 2.4 we discuss our explicit implementation of the Levy–Lieb constrained search. To minimize the total energy, we deploy stochastic evolutionary algorithms, specifically, particle swarm optimization [27, 28, 29] and a genetic algorithm [30], see C for details. The required interaction tensor elements (known as two-electron integrals in the case of Coulomb interactions) are derived in D. Our results on energies, participation numbers, and single-particle densities of Fermi gases and atomic systems in Sec. 3 demonstrate the numerical feasibility of 1pEx-DFT. We benchmark our 1pEx-DFT predictions against HF results. In Sec. 4, we discuss fundamental as well as technical challenges of 1pEx-DFT—regarding accuracy, scalability, and transferability—that ought to be overcome for realizing its prospective advantages over existing many-body methods.

2 𝟏𝐩𝐄𝐱\mathbf{1pEx}-DFT

In this section we derive the exact ground-state energy functional in terms of the single-particle participation numbers. Aiming at a first illustration of 1pEx-DFT, we also develop an explicit scheme that encodes the interaction energy in Dirac approximation and carries out the Levy–Lieb constrained search. We then show how to construct the required density matrices from an iterative algorithm and, alternatively, from a TF-inspired derivation in rotor phase space.

2.1 General formalism

We denote the position and momentum operators of the jjth particle by \mathboldRj\mathbold{R}_{j} and \mathboldPj\mathbold{P}_{j}, respectively, and consider many-particle Hamilton operators of the generic form

Hmp=j=1N12m\mathboldPj2+j=1NVext(\mathboldRj)+12j,k=1N(jk)Vint(\mathboldRj\mathboldRk)=j=1NH1p(\mathboldPj,\mathboldRj)+Hint,\displaystyle H_{\mathrm{mp}}=\sum_{j=1}^{N}\frac{1}{2m}\mathbold{P}_{j}^{2}+\sum_{j=1}^{N}V_{\mathrm{ext}}\big{(}\mathbold{R}_{j}\big{)}+\frac{1}{2}\mathop{\sum_{j,k=1}^{N}}_{(j\neq k)}V_{\mathrm{int}}\big{(}\mathbold{R}_{j}-\mathbold{R}_{k}\big{)}=\sum_{j=1}^{N}H_{\mathrm{1p}}\big{(}\mathbold{P}_{j},\mathbold{R}_{j}\big{)}+H_{\mathrm{int}}\,, (1)

with a sum of single-particle contributions and a sum over pairs for the interaction contribution. The one-particle Hamilton operator H1pH_{\mathrm{1p}} (the ‘core Hamiltonian’) consists of the kinetic energy of a particle with mass mm and the potential energy in the external potential VextV_{\mathrm{ext}} that traps the particles,

H1p(\mathboldP,\mathboldR)=12m\mathboldP2+Vext(\mathboldR).\displaystyle H_{\mathrm{1p}}(\mathbold{P},\mathbold{R})=\frac{1}{2m}\mathbold{P}^{2}+V_{\mathrm{ext}}(\mathbold{R})\,. (2)

The system is composed of NN identical spin-12\frac{1}{2} particles, with symmetric pair interaction energy Vint(\mathboldRj\mathboldRk)=Vint(\mathboldRk\mathboldRj){V_{\mathrm{int}}\big{(}\mathbold{R}_{j}-\mathbold{R}_{k}\big{)}=V_{\mathrm{int}}\big{(}\mathbold{R}_{k}-\mathbold{R}_{j}\big{)}}. More complicated situations are thinkable, such as an external electric or magnetic field and a corresponding change in H1pH_{\mathrm{1p}}, a spin-dependent interaction, or alternative dispersion relations.

The eigenstates of H1pH_{\mathrm{1p}} constitute a complete orthonormal set of orbital states (the ‘1pEx-basis’),

H1p(\mathboldP,\mathboldR)|a=|aEa,a|b=δa,b,a|aa|=1,\displaystyle H_{\mathrm{1p}}(\mathbold{P},\mathbold{R}){\left|{a}\right\rangle}={\left|{a}\right\rangle}E_{a}\,,\quad{\langle{a}|{b}\rangle}=\delta_{a,b}\,,\quad\sum_{a}{\left|{a}\right\rangle}{\left\langle{a}\right|}=1\,, (3)

here written for discrete quantum numbers aa. More generally, there could also be a continuous part of the spectrum (scattering states) with corresponding modifications of the orthonormality and completeness relations. For the sake of notational simplicity, we will employ the conventions of Eq. (3) while keeping in mind that there could be adjustments if H1pH_{\mathrm{1p}} has scattering states, which are commonly disregarded in DFT calculations.

The H1pH_{\mathrm{1p}}-induced single-particle density 𝒏=(n1,n2,){\bm{n}=(n_{1},n_{2},\dots)} is the list of ‘participation numbers’

na=j=1N(|aa|)j,\displaystyle n_{a}={\left\langle{\sum_{j=1}^{N}\Bigl{(}{\left|{a}\right\rangle}{\left\langle{a}\right|}\Bigr{)}_{j}}\right\rangle}\,, (4)

evaluated in the applicable many-particle state. The jjth term (|aa|)j{\left\langle{\bigl{(}{\left|{a}\right\rangle}{\left\langle{a}\right|}\bigr{)}_{j}}\right\rangle} is the probability of finding the jjth particle in the aath orbital state. For simplicity we restrict ourselves in the following to unpolarized systems with even NN. Hence,

0na2\displaystyle 0\leq n_{a}\leq 2 (5)

as there can be at most two spin-12\frac{1}{2} particles in the same orbital state. The completeness of the orbital states for each particle ensures that the sum of all nan_{a} is equal to the particle count,

ana=j=1N1=N.\displaystyle\sum_{a}n_{a}={\left\langle{\sum_{j=1}^{N}1}\right\rangle}=N\,. (6)

Since we have

H1p(\mathboldP,\mathboldR)=a|aEaa|,\displaystyle H_{\mathrm{1p}}(\mathbold{P},\mathbold{R})=\sum_{a}{\left|{a}\right\rangle}E_{a}{\left\langle{a}\right|}\,, (7)

it follows that

E1p=j=1NH1p(\mathboldPj,\mathboldRj)=anaEa,\displaystyle E_{\mathrm{1p}}={\left\langle{\sum_{j=1}^{N}H_{\mathrm{1p}}(\mathbold{P}_{j},\mathbold{R}_{j})}\right\rangle}=\sum_{a}n_{a}E_{a}\,, (8)

the sum of the single-particle energies EaE_{a} weighted by the participation numbers nan_{a}. Equation (8) will typically entail that a large part of the total energy is treated exactly and that characteristic consequences of the external potential, such as the electron cusp in atoms, are automatically built in through the 1pEx-basis.

The ground-state energy EgsE_{\mathrm{gs}} of the many-particle Hamilton operator in Eq. (1) is the minimum of all expectation values of HmpH_{\mathrm{mp}},

Egs=Minρmp{tr(ρmpHmp)},\displaystyle E_{\mathrm{gs}}=\mathop{\mathrm{Min}}_{\rho^{\ }_{\mathrm{mp}}}{\left\{\mathrm{tr}{\left(\rho_{\mathrm{mp}}H_{\mathrm{mp}}\right)}\rule{0.0pt}{10.0pt}\right\}}\,, (9)

where all NN-particle statistical operators ρmp\rho^{\ }_{\mathrm{mp}} participate in the competition. While it would be sufficient to consider pure-state ρmp\rho_{\mathrm{mp}}s, there is no need for this restriction. In the spirit of the Levy–Lieb constrained search in standard DFT, we regard this minimization as a two-step process: first we minimize over the ρmp\rho_{\mathrm{mp}}s that yield a prescribed single-particle density 𝒏\bm{n}, then we minimize over all permissible 𝒏 \bm{n}\rule{1.00006pt}{0.0pt}s,

Egs=Min𝒏{Emp[𝒏]}\displaystyle E_{\mathrm{gs}}=\mathop{\mathrm{Min}}_{\bm{n}}{\left\{E_{\mathrm{mp}}[\bm{n}]\rule{0.0pt}{10.0pt}\right\}} (10)

with

Emp[𝒏]=Minρmp𝒏{tr(ρmpHmp)},\displaystyle E_{\mathrm{mp}}[\bm{n}]=\mathop{\mathrm{Min}}_{\rho^{\ }_{\mathrm{mp}}\to\bm{n}}{\left\{\mathrm{tr}{\left(\rho_{\mathrm{mp}}H_{\mathrm{mp}}\right)}\rule{0.0pt}{10.0pt}\right\}}\,, (11)

where, as a consequence of Eq. (8),

Emp[𝒏]=anaEa+Minρmp𝒏{tr(ρmpHint)}=anaEa+Eint[𝒏]\displaystyle E_{\mathrm{mp}}[\bm{n}]=\sum_{a}n_{a}E_{a}+\mathop{\mathrm{Min}}_{\rho^{\ }_{\mathrm{mp}}\to\bm{n}}{\left\{\mathrm{tr}{\left(\rho_{\mathrm{mp}}H_{\mathrm{int}}\right)}\rule{0.0pt}{10.0pt}\right\}}=\sum_{a}n_{a}E_{a}+E_{\mathrm{int}}[\bm{n}] (12)

is the single-particle-exact density functional.

Indeed, the contribution from the sum of single-particle energies is exact and simple in Emp[𝒏]E_{\mathrm{mp}}[\bm{n}], whereas the contribution from the pair interactions requires a minimization over all permissible ρmp\rho_{\mathrm{mp}}s. Usually, we cannot find this minimum and have to be content with a suitable approximation. We will work with one such approximation in Sec. 2.2 below.

For a given ρmp\rho_{\mathrm{mp}}, we have the single-particle-orbital density matrix in position space

n(1)(\mathboldr;\mathboldr)=tr(ρmpj=1N( \mathboldr><\mathboldr )j),\displaystyle n^{(1)}(\mathbold{r};\mathbold{r}^{\prime})=\mathrm{tr}{\left(\rho_{\mathrm{mp}}\sum_{j=1}^{N}\Bigl{(}\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 1.07639pt\mathbold{r}^{\prime}\big{>}\big{<}\mathbold{r}\hskip 1.07639pt\rule[-2.65pt]{0.65pt}{10.75pt}\Bigr{)}_{j}\right)}\,, (13)

here expressed in orbital states and normalized to the particle count (d\mathboldr)n(1)(\mathboldr;\mathboldr)=N{\int(\mathrm{d}\mathbold{r})\,n^{(1)}(\mathbold{r};\mathbold{r})=N}. We also have the two-particle orbital-density matrix

n(2)(\mathboldr1,\mathboldr2;\mathboldr1,\mathboldr2)=12tr(ρmpj,k=1N(jk)( \mathboldr1><\mathboldr1 )j( \mathboldr2><\mathboldr2 )k),\displaystyle n^{(2)}(\mathbold{r}_{1}^{\ },\mathbold{r}_{2}^{\ };\mathbold{r}_{1}^{\prime},\mathbold{r}_{2}^{\prime})=\frac{1}{2}\mathrm{tr}{\left(\rho_{\mathrm{mp}}\mathop{\sum_{j,k=1}^{N}}_{(j\neq k)}\Bigl{(}\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 1.07639pt\mathbold{r}_{1}^{\prime}\big{>}\big{<}\mathbold{r}_{1}^{\ }\hskip 1.07639pt\rule[-2.65pt]{0.65pt}{10.75pt}\Bigr{)}_{j}\otimes\Bigl{(}\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 1.07639pt\mathbold{r}_{2}^{\prime}\big{>}\big{<}\mathbold{r}_{2}^{\ }\hskip 1.07639pt\rule[-2.65pt]{0.65pt}{10.75pt}\Bigr{)}_{k}\right)}\,, (14)

which is symmetric under particle exchange, n(2)(\mathboldr1,\mathboldr2;\mathboldr1,\mathboldr2)=n(2)(\mathboldr2,\mathboldr1;\mathboldr2,\mathboldr1){n^{(2)}(\mathbold{r}_{1}^{\ },\mathbold{r}_{2}^{\ };\mathbold{r}_{1}^{\prime},\mathbold{r}_{2}^{\prime})=n^{(2)}(\mathbold{r}_{2}^{\ },\mathbold{r}_{1}^{\ };\mathbold{r}_{2}^{\prime},\mathbold{r}_{1}^{\prime})}, and related to the single-particle density by

(d\mathboldr2)n(2)(\mathboldr,\mathboldr2;\mathboldr,\mathboldr2)=12(N1)n(1)(\mathboldr;\mathboldr).\displaystyle\int(\mathrm{d}\mathbold{r}_{2}^{\ })\,n^{(2)}(\mathbold{r},\mathbold{r}_{2}^{\ };\mathbold{r}^{\prime},\mathbold{r}_{2}^{\ })=\frac{1}{2}(N-1)\,n^{(1)}(\mathbold{r};\mathbold{r}^{\prime})\,. (15)

It follows that the full trace of n(2)n^{(2)} is the count of pairs,

(d\mathboldr)(d\mathboldr)n(2)(\mathboldr,\mathboldr;\mathboldr,\mathboldr)=12N(N1).\displaystyle\int(\mathrm{d}\mathbold{r})\,(\mathrm{d}\mathbold{r}^{\prime})\,n^{(2)}(\mathbold{r},\mathbold{r}^{\prime};\mathbold{r},\mathbold{r}^{\prime})=\frac{1}{2}N(N-1)\,. (16)

As indicated these are orbital densities, that is, the spin variables are traced out. This is fine under the given circumstances as we have no spin dependence in H1pH_{\mathrm{1p}} and HintH_{\mathrm{int}}.

The constraint ‘ρmp𝒏\rho_{\mathrm{mp}}\to\bm{n}’ in Eqs. (10)–(12) means

na=tr(ρmpj=1N(|aa|)j)=(d\mathboldr)(d\mathboldr)ψa(\mathboldr)n(1)(\mathboldr;\mathboldr)ψa(\mathboldr),\displaystyle n_{a}^{\ }=\mathrm{tr}{\left(\rho_{\mathrm{mp}}\sum_{j=1}^{N}\Bigl{(}{\left|{a}\right\rangle}{\left\langle{a}\right|}\Bigr{)}_{j}\right)}=\int(\mathrm{d}\mathbold{r})\,(\mathrm{d}\mathbold{r}^{\prime})\,\psi_{a}(\mathbold{r})^{*}\,n^{(1)}(\mathbold{r};\mathbold{r}^{\prime})\,\psi_{a}(\mathbold{r}^{\prime})\,, (17)

where ψa(\mathboldr)=\mathboldr|a\psi_{a}(\mathbold{r})={\langle{\mathbold{r}}|{a}\rangle} is the wave function of the aath eigenstate of H1pH_{\mathrm{1p}}. We introduce the effective (orbital) single-particle statistical operator ρ\rho through

n(1)(\mathboldr;\mathboldr)=<\mathboldr ρ \mathboldr>orρ=(d\mathboldr)(d\mathboldr)|\mathboldrn(1)(\mathboldr;\mathboldr)\mathboldr|,withρ0,tr(ρ)=N,\displaystyle n^{(1)}(\mathbold{r};\mathbold{r}^{\prime})=\big{<}\mathbold{r}\hskip 1.07639pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 2.15277pt\rho\hskip 1.72218pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 1.07639pt\mathbold{r}^{\prime}\big{>}\quad\mbox{or}\quad\rho=\int(\mathrm{d}\mathbold{r})\,(\mathrm{d}\mathbold{r}^{\prime})\,{\left|{\mathbold{r}}\right\rangle}\,n^{(1)}(\mathbold{r};\mathbold{r}^{\prime})\,\langle\mathbold{r}^{\prime}|\,,\quad\mbox{with}\quad\rho\geq 0\,,\ \mathrm{tr}{\left(\rho\right)}=N\,, (18)

and then Eq. (17) reads

na=a|ρ|a.\displaystyle n_{a}^{\ }=\left<\right.{a}\left.\right|{\rho}\left|\right.{a}\left.\right>\,. (19)

Since na2n_{a}\leq 2, the eigenvalues of ρ\rho cannot exceed 22, ρ2{\rho\leq 2}. It follows that the rank of ρ\rho is 12N\frac{1}{2}N or larger.

Since

tr(ρmpHint)=(d\mathboldr)(d\mathboldr)Vint(\mathboldr\mathboldr)n(2)(\mathboldr,\mathboldr;\mathboldr,\mathboldr),\displaystyle{\mathrm{tr}{\left(\rho_{\mathrm{mp}}H_{\mathrm{int}}\right)}\rule{0.0pt}{10.0pt}}=\int(\mathrm{d}\mathbold{r})\,(\mathrm{d}\mathbold{r}^{\prime})\,V_{\mathrm{int}}(\mathbold{r}-\mathbold{r^{\prime}})\,n^{(2)}(\mathbold{r},\mathbold{r}^{\prime};\mathbold{r},\mathbold{r}^{\prime})\,, (20)

the minimization in Eq. (12) requires us to consider all n(2)n^{(2)}s that yield, via Eq. (15) and Eq. (18), a ρ\rho that obeys Eq. (19) for the given nan_{a}^{\ }s. We do not have a generic parameterization of n(2)n^{(2)} that is suitable for this purpose and, therefore, resort to approximations, that is, rather than minimizing over all permissible n(2)n^{(2)}s, we minimize over a smaller set and then work with the resulting approximation for Eint[𝒏]E_{\mathrm{int}}[\bm{n}].

2.2 Implementation of an exchange-only 1pEx-DFT

Dirac’s approximation (that is, the Hartree–Fock approximation [24, 25, 26]) for the two-particle density matrix in terms of the single-particle density matrix [23],

n(2)(\mathboldr1,\mathboldr2;\mathboldr1,\mathboldr2)=12n(1)(\mathboldr1;\mathboldr1)n(1)(\mathboldr2;\mathboldr2)14n(1)(\mathboldr1;\mathboldr2)n(1)(\mathboldr2;\mathboldr1),\displaystyle n^{(2)}(\mathbold{r}_{1}^{\ },\mathbold{r}_{2}^{\ };\mathbold{r}_{1}^{\prime},\mathbold{r}_{2}^{\prime})=\frac{1}{2}n^{(1)}(\mathbold{r}_{1}^{\ };\mathbold{r}_{1}^{\prime})\,n^{(1)}(\mathbold{r}_{2}^{\ };\mathbold{r}_{2}^{\prime})-\frac{1}{4}n^{(1)}(\mathbold{r}_{1}^{\ };\mathbold{r}_{2}^{\prime})\,n^{(1)}(\mathbold{r}_{2}^{\ };\mathbold{r}_{1}^{\prime})\,, (21)

has a very good track record, and we employ it. This is exact for pure-state ρmp\rho_{\mathrm{mp}}s that are single Slater determinants and yield single-particle density matrices with a 2×2{2\times 2} spin matrix that is a multiple of the identity; the particle number NN has to be even for that, hardly a restriction as we are mostly interested in situations with very many particles. For Dirac’s n(2)n^{(2)}, the integral in Eq. (15) states

12Nn(1)(\mathboldr;\mathboldr)14(d\mathboldr2)n(1)(\mathboldr;\mathboldr2)n(1)(\mathboldr2;\mathboldr)=12(N1)n(1)(\mathboldr;\mathboldr),\displaystyle\frac{1}{2}N\,n^{(1)}(\mathbold{r};\mathbold{r}^{\prime})-\frac{1}{4}\int(\mathrm{d}\mathbold{r}_{2}^{\ })\,n^{(1)}(\mathbold{r};\mathbold{r}_{2}^{\ })\,n^{(1)}(\mathbold{r}_{2}^{\ };\mathbold{r}^{\prime})=\frac{1}{2}(N-1)\,n^{(1)}(\mathbold{r};\mathbold{r}^{\prime})\,, (22)

That is,

(d\mathboldr2)n(1)(\mathboldr;\mathboldr2)n(1)(\mathboldr2;\mathboldr)=2n(1)(\mathboldr;\mathboldr)\displaystyle\int(\mathrm{d}\mathbold{r}_{2}^{\ })\,n^{(1)}(\mathbold{r};\mathbold{r}_{2}^{\ })\,n^{(1)}(\mathbold{r}_{2}^{\ };\mathbold{r}^{\prime})=2n^{(1)}(\mathbold{r};\mathbold{r}^{\prime}) (23)

or

ρ2=2ρ,\displaystyle\rho^{2}=2\rho\,, (24)

when expressed as a property of the statistical operator. Accordingly, in Dirac’s approximation, ρ\rho has 12N\frac{1}{2}N eigenvalues equal to 22 and all other eigenvalues are zero. Any two such ρ\rhos are related to each other by a unitary transformation, and the constraints in Eq. (19) require that the subspaces specified by an nan_{a}^{\ } value are invariant under the unitary transformation. In other words, the minimization in Eq. (12) needs to be carried out over all unitary transformations that leave the diagonal elements a|ρ|a\left<\right.{a}\left.\right|{\rho}\left|\right.{a}\left.\right> of the single-particle statistical operator

ρ=a,b|aϱabb|\displaystyle\rho=\sum_{a,b}{\left|{a}\right\rangle}\,\varrho^{\ }_{ab}\,{\left\langle{b}\right|} (25)

unchanged. Here, with the emphasis on the dependence on 𝒏\bm{n}, it is natural to use the 1pEx-basis {|a}\{{\left|{a}\right\rangle}\} for the parameterization of ρ\rho.

One family of suitable unitary transformations multiplies each |a{\left|{a}\right\rangle} by a phase factor

|a|aeiϕa.\displaystyle{\left|{a}\right\rangle}\to{\left|{a}\right\rangle}\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\phi_{a}$}}\,. (26)

Taking into account all admissible transformations that preserve individually the diagonal elements in 2×2{2\times 2} sectors of ϱ\varrho, that is, more general transformations than those in Eq. (26), we find no further improvement in the ground-state energies of Eq. (10), see B for details. For the purpose of this work we are thus content with optimizing the phases ϕa\phi_{a} in Eq. (26) and will attend elsewhere to the expansion of the search space toward all permissible density matrices. We want to emphasize, however, that we deem the use of density matrices an auxiliary measure that is straightforward but comes with an inflated number of parameterization variables. Here, these are the phases ϕa\phi_{a} on top of the participation numbers nan_{a}. Ultimately, we hope to construct more efficient functionals Eint[𝒏]E_{\mathrm{int}}[\bm{n}] without introducing an excessive number of parameterization variables (not to be confused with free, adjustable parameters).

Upon using Dirac’s approximation, viz., Eq. (21), in Eq. (20) and recalling Eq. (18), we have

tr(ρmpHint)\displaystyle{\mathrm{tr}{\left(\rho_{\mathrm{mp}}H_{\mathrm{int}}\right)}\rule{0.0pt}{10.0pt}} =12(d\mathboldr)(d\mathboldr)Vint(\mathboldr\mathboldr)(n(1)(\mathboldr;\mathboldr)n(1)(\mathboldr;\mathboldr)12n(1)(\mathboldr;\mathboldr)n(1)(\mathboldr;\mathboldr))\displaystyle=\frac{1}{2}\int(\mathrm{d}\mathbold{r})\,(\mathrm{d}\mathbold{r}^{\prime})\,V_{\mathrm{int}}(\mathbold{r}-\mathbold{r^{\prime}})\,\left(n^{(1)}(\mathbold{r};\mathbold{r})\,n^{(1)}(\mathbold{r}^{\prime};\mathbold{r}^{\prime})-\frac{1}{2}n^{(1)}(\mathbold{r};\mathbold{r}^{\prime})\,n^{(1)}(\mathbold{r}^{\prime};\mathbold{r})\right)
=12(d\mathboldr)(d\mathboldr)Vint(\mathboldr\mathboldr)(\mathboldr|ρ|\mathboldr\mathboldr|ρ|\mathboldr12\mathboldr|ρ|\mathboldr\mathboldr|ρ|\mathboldr),\displaystyle=\frac{1}{2}\int(\mathrm{d}\mathbold{r})\,(\mathrm{d}\mathbold{r}^{\prime})\,V_{\mathrm{int}}(\mathbold{r}-\mathbold{r^{\prime}})\,\left(\left<\right.{\mathbold{r}}\left.\right|{\rho}\left|\right.{\mathbold{r}}\left.\right>\,\left<\right.{\mathbold{r}^{\prime}}\left.\right|{\rho}\left|\right.{\mathbold{r}^{\prime}}\left.\right>-\frac{1}{2}\left<\right.{\mathbold{r}}\left.\right|{\rho}\left|\right.{\mathbold{r}^{\prime}}\left.\right>\,\left<\right.{\mathbold{r}^{\prime}}\left.\right|{\rho}\left|\right.{\mathbold{r}}\left.\right>\right)\,,\qquad\quad (27)

where we exploit the Fourier integral for

Vint(\mathboldr\mathboldr)=(d\mathboldk)(2π)Du(\mathboldk)ei\mathboldk\mathboldrei\mathboldk\mathboldr\displaystyle V_{\mathrm{int}}(\mathbold{r}-\mathbold{r^{\prime}})=\int\frac{(\mathrm{d}\mathbold{k})}{(2\pi)^{D}}\,u(\mathbold{k})\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\mathbold{k}\cdot\mathbold{r}$}}\mathrm{e}^{\mbox{\footnotesize$-\mathrm{i}\mathbold{k}\cdot\mathbold{r}^{\prime}$}} (28)

for DD spatial dimensions to factorize the \mathboldr\mathbold{r} and \mathboldr\mathbold{r^{\prime}} dependence; since Vint(\mathboldr)V_{\mathrm{int}}(\mathbold{r}) is real and even, so is u(\mathboldk)u(\mathbold{k}). Then, with

(d\mathboldr)ei\mathboldk\mathboldr\mathboldr|ρ|\mathboldr=tr(ρei\mathboldk\mathboldR),\displaystyle\int(\mathrm{d}\mathbold{r})\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\mathbold{k}\cdot\mathbold{r}$}}\left<\right.{\mathbold{r}}\left.\right|{\rho}\left|\right.{\mathbold{r}}\left.\right>=\mathrm{tr}{\left(\rho\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\right)}\,, (29)

for example, we arrive at

tr(ρmpHint)=12(d\mathboldk)(2π)Du(\mathboldk)(|tr(ρei\mathboldk\mathboldR)|212tr(ρei\mathboldk\mathboldRρei\mathboldk\mathboldR)). \displaystyle{\mathrm{tr}{\left(\rho_{\mathrm{mp}}H_{\mathrm{int}}\right)}\rule{0.0pt}{10.0pt}}=\frac{1}{2}\int\frac{(\mathrm{d}\mathbold{k})}{(2\pi)^{D}}\,u(\mathbold{k})\,\left(\;\biggl{|}\mathrm{tr}{\left(\rho\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\right)}\biggr{|}^{2}-\frac{1}{2}\mathrm{tr}{\left(\rho\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\,\rho\,\mathrm{e}^{\mbox{\footnotesize$-\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\right)}\right)\,.\rule{30.0pt}{0.0pt} (30)

In Eq. (12), we minimize this over all ρ\rho that are permitted by Eqs. (19) and (24).

With Eq. (25) we have

tr(ρei\mathboldk\mathboldR)\displaystyle\mathrm{tr}{\left(\rho\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\right)} =a,bϱab<b ei\mathboldk\mathboldR a>\displaystyle=\sum_{a,b}\,\varrho^{\ }_{ab}\big{<}b\hskip 1.07639pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 2.15277pt\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\hskip 1.72218pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 1.07639pta\big{>} (31)
and
tr(ρei\mathboldk\mathboldRρei\mathboldk\mathboldR)\displaystyle\mathrm{tr}{\left(\rho\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\,\rho\,\mathrm{e}^{\mbox{\footnotesize$-\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\right)} =a,bc,dϱab<b ei\mathboldk\mathboldR c>ϱcd<d ei\mathboldk\mathboldR a>.\displaystyle=\sum_{a,b}\sum_{c,d}\varrho^{\ }_{ab}\big{<}b\hskip 1.07639pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 2.15277pt\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\hskip 1.72218pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 1.07639ptc\big{>}\,\varrho^{\ }_{cd}\big{<}d\hskip 1.07639pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 2.15277pt\mathrm{e}^{\mbox{\footnotesize$-\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\hskip 1.72218pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 1.07639pta\big{>}\,. (32)

Then,

tr(ρmpHint)=12a,bc,dϱabab,cdϱcd\displaystyle\mathrm{tr}{\left(\rho_{\mathrm{mp}}H_{\mathrm{int}}\right)}=\frac{1}{2}\sum_{a,b}\sum_{c,d}\varrho^{\ }_{ab}\,\mathcal{H}^{\ }_{ab,cd}\,\varrho^{\ }_{cd} (33)

with ab,cd=Iabcd12Iadcb\mathcal{H}^{\ }_{ab,cd}=I_{abcd}-\frac{1}{2}I_{adcb}, where

Iabcd\displaystyle I_{abcd} =(d\mathboldr)(d\mathboldr)Vint(\mathboldr\mathboldr)ψa(\mathboldr)ψb(\mathboldr)ψc(\mathboldr)ψd(\mathboldr)\displaystyle=\int(\mathrm{d}\mathbold{r})(\mathrm{d}\mathbold{r}^{\prime})\,V_{\mathrm{int}}(\mathbold{r}-\mathbold{r}^{\prime})\,\psi_{a}(\mathbold{r})\psi_{b}^{*}(\mathbold{r})\psi_{c}(\mathbold{r}^{\prime})\psi_{d}^{*}(\mathbold{r}^{\prime})
=1(2π)Dd\mathboldku(\mathboldk)<a ei\mathboldk\mathboldR b><c ei\mathboldk\mathboldR d>\displaystyle=\frac{1}{(2\pi)^{D}}\int\mathrm{d}\mathbold{k}\,u(\mathbold{k})\,\big{<}a\hskip 1.07639pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 2.15277pt\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\hskip 1.72218pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 1.07639ptb\big{>}\,\big{<}c\hskip 1.07639pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 2.15277pt\mathrm{e}^{\mbox{\footnotesize$-\mathrm{i}\mathbold{k}\cdot\mathbold{R}$}}\hskip 1.72218pt\rule[-2.65pt]{0.65pt}{10.75pt}\hskip 1.07639ptd\big{>} (34)

is a set of interaction-energy tensor elements (commonly known as two-electron integrals in the case of coulombic electron–electron interactions), which are completely determined by the interaction potential VintV_{\mathrm{int}} and the orbital eigenstates of the single-particle Hamilton operator H1pH_{\mathrm{1p}}.

We have thus explicitly formulated the constrained search in Eq. (11) over ρmp\rho_{\mathrm{mp}} as a constrained search over density matrices ϱ\varrho in the 1pEx-basis, where ϱ\varrho derives from ρmp\rho_{\mathrm{mp}} through Eqs. (25), (18), and (13). In any particular application, we need

(38)

The resulting density functional Emp[𝒏]E_{\mathrm{mp}}[\bm{n}] can then be minimized over all permissible 𝒏\bm{n}. In practice, we minimize over 𝒏\bm{n} and ϕ\bm{\phi} simultaneously—the concrete implementation of Eq. (38) is described in the next Secs. 2.3 and 2.4.

2.3 Interaction tensor elements and density matrices

For any practical computation, the program outlined in Eq. (38) comes with two approximations. First, the Dirac approximation of the interaction energy in terms of the one-body density matrix ϱ\varrho—we leave for future work the inclusion of correlation energy expressed in terms of ϱ\varrho. Second, the incomplete set of matrices ϱ\varrho that enters the competition in Eq. (11), which can be improved by (i) going beyond the transformations with phase factors eiϕa\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\phi_{a}$}} (toward all admissible ϱ\varrho, that is, toward full HF), (ii) considering different seed matrices ϱ(0)\varrho^{(0)}, and (iii) increasing the dimension LL of ϱ\varrho, which has to be finite in practice. Additional approximations may stem from the interaction tensor elements IabcdI_{abcd} in Eq. (34) if the orbital eigenstates and energies of H1pH_{\mathrm{1p}} can only be obtained numerically or approximately. For the systems considered in the present work, however, these ingredients are analytical or quasi-exact, see D for details. Hence, for the purpose of this article, we are content with approximating the ground-state energy assembled from Eqs. (10)–(12) and Eq. (33) by

EgsMin𝚯,ϕ{a=1L(1+cosθa)Ea+12a,b,c,d=1Lϱab(0)ϱcd(0)(Iabcd12Iadcb)ei(ϕaϕb+ϕcϕd)},\displaystyle E_{\mathrm{gs}}\approx\underset{\bm{\Theta},\bm{\phi}}{\mathrm{Min}}\left\{\sum_{a=1}^{L}(1+\cos\theta_{a})\,E_{a}+\frac{1}{2}\sum_{a,b,c,d=1}^{L}\varrho^{(0)}_{ab}\,\varrho^{(0)}_{cd}\,\left(I_{abcd}-\frac{1}{2}I_{adcb}\right)\,\mathrm{e}^{\mathrm{i}\,(\phi_{a}-\phi_{b}+\phi_{c}-\phi_{d})}\right\}, (39)

where we incorporate the Dirac approximation of Eq. (21) and take into account the finite number LL of 1pEx-basis states. We write na=1+cosθa{n_{a}=1+\cos\theta_{a}} with angles Θ=(θ1,,θL){\Theta=(\theta_{1},\dots,\theta_{L})}, such that we satisfy 0na2{0\leq n_{a}\leq 2} automatically and only need to enforce the particle count of Eq. (6). Finally, in general we have Iabcd=Ibadc{I_{abcd}=I_{badc}^{*}} and ϱab=ϱba{\varrho_{ab}=\varrho_{ba}^{*}}, while for the case studies in this work, Vint(\mathboldr\mathboldr)=Vint(\mathboldr\mathboldr){V_{\mathrm{int}}(\mathbold{r}-\mathbold{r}^{\prime})=V_{\mathrm{int}}(\mathbold{r}^{\prime}-\mathbold{r})}, and both IabcdI_{abcd} and ϱab(0)\varrho_{ab}^{(0)} are real. Therefore, we may replace the phase factor in Eq. (39) by cos(ϕaϕb+ϕcϕd){\cos(\phi_{a}-\phi_{b}+\phi_{c}-\phi_{d})}.

We obtain a seed matrix ϱ(0)=ϱit{\varrho^{(0)}=\varrho^{\mathrm{it}}} with admissible entries by iteratively transforming the diagonal matrix with diagonal (2,2,,2,0,0,,0)(2,2,\dots,2,0,0,\dots,0) and trace NN. In each step of this matrix mixer algorithm introduced in Ref. [31] we apply a unitary operation that produces one target diagonal element ϱaait=na{\varrho^{\mathrm{it}}_{aa}=n_{a}}. That is, after at most L1{L-1} steps, we arrive at a proper density matrix (obeying ϱ2=2ϱ{\varrho^{2}=2\varrho}, cf. Eq. (24)) with prescribed diagonal elements and non-zero off-diagonal elements, see B for details.

If we take into account enough 1pEx-basis states, then any discrepancies between 1pEx-DFT results (obtained using ϱit\varrho^{\mathrm{it}}) and those from HF reported in this work stem from the incomplete parameterization of the density matrices. In such cases, improvements of the 1pEx-DFT energies toward the HF energies have to come from transformations beyond those that individually preserve the diagonal elements in 2×2{2\times 2} sectors of the density matrices. It is an open problem to determine for which systems such more general transformations would make our current 1pEx-DFT implementation equivalent to HF.

As an alternative for 1D systems, we use the approximate density matrix

ϱabtf=gsin((ab)σ)π(ab),\displaystyle\varrho^{\mathrm{tf}}_{ab}=\frac{g\sin\left((a-b)\,\sigma\right)}{\pi(a-b)}\,, (40)

with degeneracy factor gg (for unpolarized spin-12\frac{1}{2} fermions, g=2{g=2}) and cotσ=12[cot(π2na)+cot(π2nb)]\cot\sigma=\frac{1}{2}\Big{[}\cot\left(\frac{\pi}{2}n_{a}\right)+\cot\left(\frac{\pi}{2}n_{b}\right)\Big{]}. We obtain Eq. (40) from an approximate Wigner function in rotor phase space, inspired by the classical-phase-space argument that yields the TF-approximated spatial density matrix ϱTF(x;x){\varrho_{\mathrm{TF}}(x;x^{\prime})}. That is, ϱ(0)=ϱ(0)(𝒏){\varrho^{(0)}=\varrho^{(0)}(\bm{n})} in Eq. (39) stands for either ϱit\varrho^{\mathrm{it}} or ϱtf\varrho^{\mathrm{tf}}, which are derived in B.

2.4 Energy minimization

The angles θa\theta_{a} and phases ϕa\phi_{a} form the search space for the minimization of EgsE_{\mathrm{gs}} in Eq. (39) and are only constrained by Eq. (6), which spans an (L1)(L-1)-dimensional hypersurface within the LL-dimensional space of participation numbers  𝒏\bm{n}. Suitable choices of global optimizers for constrained non-convex functions in high-dimensional spaces are problem-dependent. We opt for stochastic algorithms, especially evolutionary algorithms, which are efficient in optimizing our constrained high-dimensional non-convex energy functions for two reasons: first, stochastic optimization allows us to project (improper) randomly requested vectors 𝒏\bm{n} onto the constraining hypersurface before we evaluate the energy. Second, evolutionary algorithms can be easily tuned to escape even deep local optima efficiently and can optimize highly deceptive objective functions such as Eq. (39).

We consulted two evolutionary algorithms: a particle swarm optimization (PSO) [27, 28, 29] and a genetic-algorithm optimizer (GAO) [30], see C for the details of our implementations, which outperformed—on average, for the test cases we considered—coupled simulated annealers, adapted from Ref. [32]. As an alternative to genuinely stochastic optimizers, we used multi-start linearly-constrained conjugate gradient optimization, based on the C++ implementation of the ALGLIB project at www.alglib.net. In our test cases, all four aforementioned algorithms produced virtually the same numerical values for the ground-state participation numbers 𝒏\bm{n}. However, PSO proved superior among our implementations of optimizers regarding the convergence speed toward the optima of our case studies; all results of optimizations reported in this work are obtained with PSO, unless explicitly stated otherwise.

3 Results on energies, participation numbers, and spatial densities

We applied the 1pEx-DFT program entailed in Eq. (38) and Secs. 2.3 and 2.4 to harmonically confined fermions in 1D, subjected to contact interaction (Sec. 3.1) and harmonic interaction (Sec. 3.2). Moreover, in Sec. 3.3 we extract the electronic structure of atoms and ions from 1pEx-DFT, both with and without relativistic corrections from spin-free exact two-component Dirac theory [33, 34, 35, 36]. With our choice of the Dirac approximation for the interaction energy Eint[𝒏]E_{\mathrm{int}}[\bm{n}], HF theory is the natural choice for benchmarking our simulations: Like in HF theory, the Dirac approximation is the only physical approximation that enters our current 1pEx-DFT implementation, such that both methods must produce the same results if the energy minimization covers all admissible density matrices, which is the case when minimizing Eq. (39) with ϱ(0)=ϱit{\varrho^{(0)}=\varrho^{\mathrm{it}}} for N=2{N=2}, cf. Tables I and II below. Future implementations of 1pEx-DFT along the same lines of the present work may exceed HF accuracy if we use approximations of the interaction energy that include at least some correlation but can still be expressed in terms of the one-body density matrix. While HF theory is reasonably accurate and transferable, it is orbital-based and, hence, of limited scalability. Therefore, keeping in mind 1pEx-DFT applications to mesoscopic systems, we also compared with a variant of orbital-free DPFT that is scalable and transferable. While this semiclassical method tends to become (relatively) accurate only for larger particle numbers, it comes with an established track record across systems and disciplines [14, 15, 16, 17, 37, 18, 22, 38, 39, 40].

3.1 Harmonically trapped contact-interacting fermions

We begin with NN spin-12\frac{1}{2} fermions in 1D harmonic confinement and pair interaction energy

Vint(xx)=𝒸δ(𝓍𝓍).\displaystyle V_{\mathrm{int}}(x-x^{\prime})=\mathpzc{c}\,\delta(x-x^{\prime})\,. (41)

The corresponding interaction tensor elements IabcdI_{abcd} are derived in D.1. All numerical values in Secs. 3.1 and 3.2 are given in harmonic oscillator units of energy ω\hbar\omega and length /(mω)\sqrt{\hbar/(m\omega)}, with particle mass mm and oscillator frequency ω\omega. That is, for example, an interaction strength 𝒸=20{\mathpzc{c}=20} in Eq. (41) is implicit for 𝒸=20ω/(𝓂ω){\mathpzc{c}=20\,\hbar\omega\sqrt{\hbar/(m\omega)}}.

As expected, the energies E1pExtfE_{\mathrm{1pEx}}^{\mathrm{tf}} based on the TF inspired ϱtf\varrho^{\mathrm{tf}} are hardly accurate for small NN, but Table 1 suggests that they become relatively more accurate with increasing NN. The energies E1pExitE_{\mathrm{1pEx}}^{\mathrm{it}} are more accurate for the cases considered here and approach the HF energies to within one or two percent. By comparison, the DPFT energies are quite accurate when using the exact HF interaction energy EintHF=(𝒸/4)d𝓍(𝓃(𝓍))2{E_{\mathrm{int}}^{\mathrm{HF}}=(\mathpzc{c}/4)\int\mathrm{d}x\,\big{(}n(x)\big{)}^{2}}, see B. However, unlike HF theory, the approximate DPFT energy functionals are not variational and do not guarantee to deliver upper bounds to EgsE_{\mathrm{gs}}—here, the TF density nTFn_{\mathrm{TF}} incidentally delivers an even lower energy than its quantum-corrected successor n3n_{3^{\prime}}—and systematic improvements beyond n3n_{3^{\prime}} can incur high computational cost [17, 38].

If we assume that the number LL of single-particle levels required to converge the energy in Eq. (39) scales like the particle number NN, then the computational cost of the current implementation of 1pEx-DFT scales like 𝒪(N4)\mathcal{O}\big{(}N^{4}\big{)}, compared with the generic scaling of 𝒪(N3)\mathcal{O}\big{(}N^{3}\big{)} for HF and KS–DFT. The cost of the DPFT densities nTFn_{\mathrm{TF}} and n3n_{3^{\prime}} scales like 𝒪(N)\mathcal{O}\big{(}N\big{)} and 𝒪(N2)\mathcal{O}\big{(}N^{2}\big{)}, respectively. Of course, these scaling behaviors are really informative only in the limit of large NN, and the same scaling does not imply the same cost in practice, with HF and KS–DFT presenting a point in case. Furthermore, the actually realized computational cost much depends on the system. For example, we found the energies E1pExitE^{\mathrm{it}}_{\mathrm{1pEx}} in Table I with 1 CPU-hour (for c=1{c=1}, N=4{N=4}, L=20{L=20}), 16 CPU-hours (for c=1{c=1}, N=10{N=10}, L=20{L=20}), and 59 CPU-days (for c=20{c=20}, N=10{N=10}, L=30{L=30}), respectively. We list these timings only for completeness, bearing in mind that our focus here is to illustrate the concepts behind a new approach, not to optimize code and CPU time.

𝒸\mathpzc{c} NN E1pExitE^{\mathrm{it}}_{\mathrm{1pEx}} EHFE_{\mathrm{HF}} (E1pExitEHF)/EHF\big{(}E_{\mathrm{1pEx}}^{\mathrm{it}}-E_{\mathrm{HF}}\big{)}/E_{\mathrm{HF}} E1pExtfE_{\mathrm{1pEx}}^{\mathrm{tf}} (E1pExtfEHF)/EHF\big{(}E_{\mathrm{1pEx}}^{\mathrm{tf}}-E_{\mathrm{HF}}\big{)}/E_{\mathrm{HF}} EDPFT(nTF)E_{\mathrm{DPFT}}(n_{\mathrm{TF}}) EDPFT(n3)E_{\mathrm{DPFT}}(n_{3^{\prime}})
1 2 1.3790 1.3790 0.00% 1.3243 -3.97% 1.3648 1.4526
4 5.0642 5.0590 0.10% 4.9773 -1.62% 5.0451 5.2420
10 29.218 29.193 0.09% 29.053 -0.48% 29.181 29.353
20 111.97 111.91 0.05% 111.75 -0.14% 111.90 112.12
20 2 5.9695 5.9695 0.00% 6.3862 6.98% 5.9214 6.0391
4 19.416 19.083 1.75% 19.999 4.80% 19.031 19.211
10 90.572 90.031 0.60% 93.486 3.84% 89.974 90.155
20 298.60 294.75 1.31% 304.45 3.29% 294.69 294.89
Table 1: Comparison of E1pExE_{\mathrm{1pEx}} and HF energies EHFE_{\mathrm{HF}} for NN spin-12\frac{1}{2} fermions in a 1D harmonic trap at contact-interaction strengths 𝒸=1{\mathpzc{c}=1} and 𝒸=20{\mathpzc{c}=20}. Since both 1pEx-DFT and HF are exact for noninteracting systems, it is not surprising that the two methods agree better for weaker interactions. In contrast to ϱit\varrho^{\mathrm{it}}, the TF-inspired density matrix ϱtf\varrho^{\mathrm{tf}} breaks the variational character of Eq. (39) and can therefore yield energies smaller than EHFE_{\mathrm{HF}}, as we see here for 𝒸=1{\mathpzc{c}=1}. Also the density approximations nTFn_{\mathrm{TF}} and n3n_{3^{\prime}} breach the variational property of the general DPFT framework. Incidentally, energies below EHFE_{\mathrm{HF}} materialize here with the TF approximation, but not with the generically superior approximation associated with the semiclassical density n3n_{3^{\prime}}. Efficient implementations of n3n_{3^{\prime}} from Ref. [17] are provided in Refs. [22, 39]. Here and in Table 2 below, we report HF energies calculated from the Roothaan equations for closed-shell systems in the 1pEx-basis, with the level-shifting procedure of Ref. [41] to aid convergence in the case of strong interactions.

For N=2{N=2}, the density matrix ϱit\varrho^{\mathrm{it}} obtained from the iterative algorithm of Sec. 2.3 is identical to the HF density matrix ϱHF\varrho^{\mathrm{HF}}—up to phases that are optimized anyway during the energy minimization—see B, such that 1pEx-DFT recovers HF exactly. However, HF is inadequate for modeling this particular system: with the exact ground-state wave function Ψ\Psi for two spin-12\frac{1}{2} fermions [42, 43] and

n(1)(x;x)=x|ρ|x=2dx2Ψ(x,x2)Ψ(x,x2),\displaystyle n^{(1)}(x;x^{\prime})=\left<\right.{x}\left.\right|{\rho}\left|\right.{x^{\prime}}\left.\right>=2\int\mathrm{d}x_{2}\,\Psi(x,x_{2})\,\Psi(x^{\prime},x_{2})^{*}\,, (42)

we calculate the exact density matrix in the 1pEx-basis, ϱab=dxdxa|xx|ρ|xx|b{\varrho_{ab}=\int\mathrm{d}{x}\mathrm{d}{x^{\prime}}\,\left<\right.{a}\left.\right|{x}\left.\right>\left<\right.{x}\left.\right|{\rho}\left|\right.{x^{\prime}}\left.\right>\left<\right.{x^{\prime}}\left.\right|{b}\left.\right>}, and obtain Egs1.92{E_{\mathrm{gs}}\approx 1.92} for the true ground-state energy at 𝒸=20{\mathpzc{c}=20}, which deviates from EHFE_{\mathrm{HF}} by a factor of three—in other words, the correlation energy dominates the total energy—and explains the large discrepancies between the exact and the HF/1pEx participation numbers shown in Fig. 5 in B.

Figure 2 shows the converged participation numbers for N=10{N=10} obtained with PSO and GAO, respectively. We also present spatial densities extracted from the converged density matrix ϱ^abit=eiϕ^aϱ^ab(0)eiϕ^b\hat{\varrho}^{\mathrm{it}}_{ab}=\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\hat{\phi}_{a}$}}\hat{\varrho}^{(0)}_{ab}\mathrm{e}^{\mbox{\footnotesize$-\mathrm{i}\hat{\phi}_{b}$}}, which comprises both the eventually successful seed matrix ϱ^(0)\hat{\varrho}^{(0)} and the phases ϕ\bm{\phi} that are optimal for ϱ^(0)\hat{\varrho}^{(0)}; see B for details. The participation numbers provide intuitively accessible information, since we usually have painless access to (and a solid understanding of) the noninteracting basis states (the 1pEx-basis)—we may imagine how unintuitive chemistry would be if we could not refer to the noninteracting hydrogenic energy levels 1s, 2s, 2p, etc., when talking about the real (interacting) electrons in atoms. For noninteracting NN-particle systems the lowest N/2N/2 levels of the 1pEx-basis are fully occupied (with participation numbers n1aN/2=2{n_{1\leq a\leq N/2}=2}), and all other levels are unoccupied (na>N/2=0{n_{a>N/2}=0}), such that the deviations from this distribution of participation numbers in the case of interacting systems are a measure of the interaction strength. The participation numbers can also reveal nontrivial structures like the irrelevance of the odd-parity states in Fig. 5 in B (if ρit\rho^{\mathrm{it}} were taken to be the truth).

Refer to caption
Figure 2: Participation numbers nan_{a} for single-particle levels a=1,,20{a=1,\dots,20} (top) and spatial densities of N=10{N=10} spin-12\frac{1}{2} fermions in a 1D harmonic trap at contact-interaction strength 𝒸=20{\mathpzc{c}=20} (bottom). The participation numbers obtained from PSO (and GAO, as a cross-check of our numerics) follow the same trend as those that originate in ϱtf\varrho^{\mathrm{tf}}. We compute the spatial densities labeled ‘ϱit/tf\varrho^{\mathrm{it/tf}}’ from the converged density matrices, i.e., the seed matrices ϱ(0)\varrho^{(0)} combined with the optimized phases ϕ\bm{\phi}, and get the characteristic quantum-mechanical oscillations in the center of the trap, which are difficult to obtain with semiclassical orbital-free DFT methods like the DPFT implementation based on Refs. [15, 16, 17, 22], which yields, for example, the TF density nTFn_{\mathrm{TF}} and the quantum-corrected density n3n_{3^{\prime}} shown here.

3.2 Harmonically trapped fermions with harmonic interaction

Next, we consider harmonically trapped fermions in 1D with the pair energy

Vint(xjxk)=m2N(Ω2ω2)(xjxk)2=α12N(xjxk)2,\displaystyle V_{\mathrm{int}}(x_{j}-x_{k})=\frac{m}{2N}(\Omega^{2}-\omega^{2})\,(x_{j}-x_{k})^{2}=\frac{\alpha-1}{2N}(x_{j}-x_{k})^{2}\,, (43)

where α=Ω2/ω2{\alpha=\Omega^{2}/\omega^{2}} defines a dimensionless interaction strength. Equation (43) is expressed in the oscillator units ω\hbar\omega and /(mω)\sqrt{\hbar/(m\omega)} of the noninteracting system (α=1{\alpha=1}). In D.2 we derive the interaction tensor elements for Eq. (43) via Eq. (34).

In Table 2, we benchmark the energies predicted by 1pEx-DFT for N=2{N=2} and N=20{N=20} with α=3/2{\alpha=3/2} against HF results and compare with the exact energies as well as with DPFT energies. The energies labeled ‘direct’ include only the direct (Hartree) part of the interaction energy, i.e., ab,cd=Iabcd{\mathcal{H}^{\ }_{ab,cd}=I_{abcd}}. Then, the exact energy is given by Ed(α)=αN2/4{E_{\mathrm{d}}^{(\alpha)}=\sqrt{\alpha}N^{2}/4} for even NN, see Eq. (121) in D.2—the energy of an effectively noninteracting harmonic oscillator with frequency ωα\omega\sqrt{\alpha}. We also note that Eq. (121) is a density functional and can be directly employed in DFT variants such as DPFT—for example, in TF approximation, which is known to deliver the exact energy for the noninteracting harmonic oscillator.

N Ed(α)E_{\mathrm{d}}^{(\alpha)}\;\; E1pExitE^{\mathrm{it}}_{\mathrm{1pEx}} EHFE_{\mathrm{HF}} E1pExitEHF1\frac{E_{\mathrm{1pEx}}^{\mathrm{it}}}{E_{\mathrm{HF}}}-1 E1pExtfE_{\mathrm{1pEx}}^{\mathrm{tf}} E1pExtfEHF1\frac{E_{\mathrm{1pEx}}^{\mathrm{tf}}}{E_{\mathrm{HF}}}-1 EDPFT(nTF)E_{\mathrm{DPFT}}(n_{\mathrm{TF}}) EDPFT(n3)E_{\mathrm{DPFT}}(n_{3^{\prime}})
direct 2 1.22474 1.22474 1.22474 0.00% 1.13746 -7.13% 1.22474 1.51236
20 122.474 123.865 122.474 1.14% 124.156 1.37% 122.474 122.752
direct & exchange 2 1.11803 1.11803 0.00% 0.99835 -10.7%
20 123.605 122.368 1.01% 123.907 1.26%
Table 2: Like in Table 1, we compare the optimized energies E1pExE_{\mathrm{1pEx}} with the self-consistent energies EHFE_{\mathrm{HF}} for NN spin-12\frac{1}{2} fermions in a 1D harmonic trap, but here for a harmonic interaction of strength α=3/2{\alpha=3/2}. The exact energies for N=2{N=2} and N=20{N=20} at α=3/2{\alpha=3/2} are 1.112371.11237 and 122.362122.362, respectively [44]. The direct (direct & exchange) contributions to the interaction energy comprise about 20%20\% (10%10\%) of the total energy.

3.3 (Non-)relativistic atoms and ions

We have emphasized transferability across quantum-mechanical systems as one advantageous feature that sets 1pEx-DFT apart from other DFT variants. While we believe that simulations of ultracold quantum gases will dominate the applications of 1pEx-DFT, single atoms and ions comprise an important point of departure for any novel quantum many-body method whose scope includes, in principle, atomic systems from molecules to nanoparticles to materials. As our last case study we therefore calculate the electronic structure of atoms and ions. We also extract binding energies of highly charged ions by using the (numerical) eigenfunctions of a relativistic core Hamiltonian. The latter is an example of the most general situation for the usage of 1pEx-DFT, where eigenstates and eigenenergies of the core Hamiltonian are only available in numerical form.

The Coulomb interaction energy for a pair of electrons at positions \mathboldr\mathbold{r} and \mathboldr\mathbold{r}^{\prime} is Vint(\mathboldr\mathboldr)=1/|\mathboldr\mathboldr|V_{\mathrm{int}}(\mathbold{r}-\mathbold{r}^{\prime})=1/|\mathbold{r}-\mathbold{r}^{\prime}|. Accordingly, all numerical values in Sec. 3.3 are given in units of Hartree (Ha) and Bohr radius (a0a_{0}). The electrons are subjected to the external potential Vext(\mathboldr)=Z/|\mathboldr|{V_{\mathrm{ext}}(\mathbold{r})=-Z/|\mathbold{r}|} of the nucleus with nuclear charge ZZ at the origin of the spatial coordinate system, which makes the hydrogenic states our 1pEx-basis; details are provided in D.3.

In Fig. 3 we benchmark the total binding energies of atoms and ions. We obtained the energies in Fig. 3 by composing the 1pEx-basis solely of the hydrogenic bound states, see D.3.1. When constrained to these states, HF indeed delivers the 1pEx energies for the two-electron systems reported in Fig. 3. However, the scattering states associated with the continuous part of the spectrum of the nuclear Coulomb potential need to be taken into account to recover the exact HF energies in the case of atomic systems with two electrons. In fact, with the scattering states completing the 1pEx-basis, 1pEx-DFT results should gain accuracy for electronic structure calculations in general; we leave this potentially fruitful enterprise for future work. In its current implementation with (i) the Dirac approximation and (ii) the incomplete search over density matrices, 1pEx-DFT yields energies that are accurate at the 12%1\mbox{--}2\% level when compared with HF, which itself provides the same level of accuracy when compared with the NIST Atomic Spectra Database [45]. As expected, the accuracy of E1pExE_{\mathrm{1pEx}} improves as we increase the number LL of states of the 1pEx-basis that enter the competition in Eq. (39); as an illustration, we report binding energies for carbon with L=7{L=7}, L=15{L=15}, and L=31{L=31}. Furthermore, E1pExE_{\mathrm{1pEx}} becomes relatively more accurate as the ion-electron interaction intensifies. Then, deviations of the participation numbers 𝒏\bm{n} from the Fermi–Dirac distribution incur a penalty that increases, for example, along the helium isoelectronic sequence. In other words, the contributions of H1pH_{\mathrm{1p}} in highly charged ions dominate over the interaction energy, such that both the single Slater-determinant of HF theory and the exact single-particle treatment of 1pEx-DFT provide an increasingly accurate description, if relativistic effects are included when called for. This observation is also in line with our results for the carbon-like Xe48+ and the neon-like Xe44+. Also the diminishing influence of the scattering states on the binding energies along the helium isoelectronic series helps align our 1pEx energies with the HF benchmark in Fig. 3, see D.3.1 for details. By design, however, EHFE_{\mathrm{HF}} is a lower bound to the currently implemented E1pExE_{\mathrm{1pEx}}—which is also true when accounting for relativistic effects. They become more important with increasing nuclear charge; see Ref. [46] for a review on the theory of complex atoms. Since 1pEx-DFT handles the nuclear singularity exactly for any ZZ by adopting the 1pEx-basis, 1pEx-DFT can be directly applied to large-ZZ atoms and ions—provided that we supply the relativistic hydrogenic states as 1pEx-basis. These states are known analytically from four-component Dirac theory [47], but for simplicity we will stay within the confines of the nonrelativistic algebra that underpins the specific 1pEx-framework laid out in Sec. 2.1; see Ref. [20] for the more general perspective on DFT from a second-quantized point of view. Since electron–positron pair production is irrelevant for chemistry, we can fully take into account the relativistic corrections using the Schrödinger-like exact two-component quasi-relativistic method (X2C) [33, 34]. In fact, its spin-free approximation (sf-X2C) is accurate enough for our purposes, see D.3.2 for details.

We use the sf-X2C Hamiltonian for calculating the relativistic HF energies that provide the benchmarks for our 1pEx(sf-X2C) energies in Fig. 3. The sf-X2C Hamiltonian also determines our relativistic 1pEx-basis and the associated interaction tensor elements; see D.3.2 for an outline of our numerical procedures. This application of 1pEx-DFT is an example of the generic situation, in which the eigenstates of H1pH_{\mathrm{1p}} can only be determined numerically.

Refer to caption
Figure 3: Total binding energies EBE_{\mathrm{B}} of atoms and ions relative to the nonrelativistic HF energies EHFE_{\mathrm{HF}}. The latter are an upper bound to the exact nonrelativistic energies and a lower bound to (our current implementation of) the nonrelativistic E1pExE_{\mathrm{1pEx}}. Smaller ordinates correspond to lower total ground-state energies (viz., negative binding energies). The connecting lines guide the eye. The absolute values of the energies displayed here are listed in Table 4 in D.3.2.

Since Fig. 3 displays relative deviations to the nonrelativistic EHFE_{\mathrm{HF}}, the relativistic effects we extract from 1pEx(sf-X2C) are visible (to the eye) only for the highly charged ions. We also compare our results with a (nonrelativistic) approximation for the binding energy of neutral atoms: the ‘statistical atom’ denotes a semiclassical approximation that includes the Scott correction as well as quantum corrections upon the TF approximation and becomes relatively more accurate as the atomic number increases [48, 49, 18]. The binding energies calculated with 1pEx(sf-X2C) tend to approach the NIST data for highly charged ions, where relativistic energy corrections can dominate over correlation energy and other effects (such as QED effects) that are not taken into account here. In fact, both 1pEx(sf-X2C) and HF(sf-X2C) deliver essentially the exact reference value for helium-like Ar16+. This contrasts with the cases of neutral helium and the hydrogen anion. For the latter, HF underestimates the experimental binding energy by more than 7% [50].

4 Discussion and conclusions

We introduced 1pEx-DFT, a novel generic quantum many-body method in the spirit of orbital-free DFT, where the energy functional depends on the particle density (in whichever representation), viz., the diagonal of the density matrix. This contrasts with density matrix functional theory, where the single-particle part is also exact but depends on the entire density matrix. 1pEx-DFT is as transferable as other explicitly Hamiltonian-based methods, for example, HF theory or the Schrödinger equation itself. Here, we gave first illustrations of the broad scope of 1pEx-DFT by simulating interacting Fermi gases in one-dimensional confinement and by extracting relativistic corrections in the electronic structure of atoms and ions.

The proof-of-principle implementation of 1pEx-DFT in this work can reach HF accuracy by design, although full equivalence with HF in all cases will require an extension of the currently implemented constrained search toward all one-body density matrices compatible with the HF approximation. And since our analysis in A, see also Ref. [21], indicates that the evaluation of cumulant-based corrections to the HF approximation in the 1pEx-basis is costly, the accuracy of 1pEx-DFT could often be limited in practice to that of HF. This restriction is, however, minor for an important class of systems that we think should be targeted by 1pEx-DFT, namely ultracold atomic gases of mesoscopic size, for which the cost even of HF calculations is usually prohibitive. Figure 1 illustrates our take on the trinity of transferability, accuracy, and scalability of 1pEx-DFT, and we argue that the latter poses the primary challenge for 1pEx-DFT in becoming a complementing approach to today’s workhorses among quantum many-body methods. In the following we propose several routes to attacking this issue of scalability.

The figure of merit for judging the efficacy of traditional optimizers for minimizing the energy in Eq. (39) is the number of function (or gradient) evaluations required to reach a targeted threshold for the function value. As a rule, this number grows sharply with the dimension DD of the search space. In addition, the computational cost of a single evaluation of Eq. (39), where D=2L{D=2L}, scales like 𝒪(D4)\mathcal{O}\big{(}D^{4}\big{)}, such that our currently implemented objective-function-based optimizers, including PSO, are too costly for systems that require us to consider hundreds of single-particle levels. As an alternative to PSO, we may employ stochastic gradient descent (SGD) or its refinements [51], which forfeit objective function evaluations and can escape local optima by descending along approximate gradients. The prototypical application of SGD is the minimization of functions that are sums like Eq. (39), such that the computational cost for a single gradient evaluation can be reduced from 𝒪(D4)\mathcal{O}\big{(}D^{4}\big{)} to 𝒪(1)\mathcal{O}\big{(}1\big{)}—at the expense of a potentially large number of descent steps, but with the promise of the same efficiency gains that also enable large-scale machine learning applications based on automatic differentiation. In particular, SGD on graphical processing units could prove promising since the interaction tensor elements IabcdI_{abcd} of Eq. (34) cannot all be stored in memory and have to be (re-)calculated during runtime anyway if, say, D1000{D\gtrsim 1000}. This would also circumvent another caveat of 1pEx-DFT, namely that IabcdI_{abcd} has to be recalculated when the external potential changes, for example, during the search for the ground-state geometry of molecules. A third and somewhat counterintuitive type of optimizer disregards both gradients and function values: ‘novelty search’ contrasts with traditional optimizers as it escapes local optima by guiding the optimizer into unexplored regions of the search space—a heuristic that can optimize highly deceptive objective functions [52]. We acknowledge that no strategy can guarantee finding the global optimum gg of a black-box function and that gg can only be found if the optimizer happens to arrive—ultimately by sheer luck, but hopefully accelerated via tried-and-tested heuristics—in the optimizer-specific attractor region of gg. Hence, we can only hypothesize that combining a pool of optimizers will give us the means to minimize Eq. (39) across physical systems even in high-dimensional (D1000{D\gtrsim 1000}) spaces. We also leave for future work the embedding of a suitable optimizer into a divide-and-conquer strategy for large-scale global optimization [53, 54, 55].

Maybe the greatest potential for scalability lies in explicit closed-form functionals Eint[𝒏]E_{\mathrm{int}}[\bm{n}] with computational cost 𝒪(D){\sim\mathcal{O}\big{(}D\big{)}}, similar to E1pE_{\mathrm{1p}} in Eq. (8), which could make all the optimization strategies mentioned above feasible for systems with thousands of particles. As the most important quest in this regard we deem the search for a TF-type approximate functional that works at least for trapped atomic gases. Furthermore, when approaching a continuum of single-particle levels as NN increases, it may be possible to approximate the four-dimensional sum of Eq. (33) by interaction-specific integrals that can be evaluated more efficiently than the sum. Independent of the functional form of the interaction energy or choice of optimizer, we may also fully occupy the low-lying levels and optimize in the remaining ‘active space’ of partially occupied levels—we already implicitly impose vanishing participation numbers of levels beyond LL, just like any other method that imposes energy cutoffs. This reduces the optimization cost, especially for weakly interacting systems and, for example, for heavier atoms. Such a ‘frozen-core approximation’ of 1pEx-DFT does not require ad-hoc assumptions or fits to data and is thus a natural ab-initio alternative to the use of pseudopotentials that approximate the Coulomb potential in traditional DFT methods. Of course, 1pEx-DFT calculations could also benefit directly from the use of pseudopotentials, if the implied uncertainties are tolerable, but the appeal of 1pEx-DFT comes in particular from the exact and unproblematic treatment of external potentials. For example, 1pEx-DFT naturally accounts for (and supersedes) the Scott correction [48, 18], the leading correction to the Thomas–Fermi model of atoms with singular nuclear potential.

We are confident that the list of aforementioned issues—and their remedies—is by no means complete. And all our suggestions for increasing the efficiency of 1pEx-DFT are uncharted territory at present. But, taking the cue from the history of the quantum many-body problem in general and the history of DFT in particular, we may hope to resolve over time many of the technical challenges that initially accompany a novel method like 1pEx-DFT.

5 Acknowledgments

We thank Leonardo dos Anjos Cunha, Min Lin, and Giovanni Vignale for valuable input. This work has been supported by the National Research Foundation, Singapore and A*STAR under its CQT Bridging Grant and its Quantum Engineering Programme (grant NRF2022-QEP2-02-P16 supports J.H.H.). One of the authors (J. C.) acknowledges the support of the Max-Planck-Institut für Physik komplexer Systeme, Dresden, Germany.

Appendix A Perturbation theory for the correlation energy in the 1pEx-basis

In this appendix we derive a perturbative approximation of the correlation contribution to the exact interaction energy. To that end, we calculate the contribution of the 2-cumulant—the difference between the exact and the HF two-body density matrix—to second-order perturbation theory along the lines of Ref. [56]. This allows us, in the limit of weak interactions, to gauge how fast we approach the exact energies and participation numbers as we increase the number LL of single-particle states that determine the perturbative energies and participation numbers. Unfortunately, we find that the energies converge rather slowly with LL, which suggests that the 1pEx-basis can be inefficient, compared with natural orbitals—at least as far as correlations beyond HF exchange are concerned, see also Ref. [21].

The expectation value of the many-particle Hamilton operator, cf. Eq. (9), corresponding to some (not necessarily ground-state) wave function Ψ\Psi reads

E=Ψ|Hmp|Ψ=pqΓpq1hqp+pqrsΓpqrs2grspq,\displaystyle E=\langle\Psi|H_{\mathrm{mp}}|\Psi\rangle=\sum_{pq}{{}^{1}\Gamma}_{pq}\,h_{qp}+\sum_{pqrs}{{}^{2}\Gamma}_{pqrs}\,g_{rspq}\,, (44)

where hpq=p(1)|H1p|q(1)h_{pq}=\left<\right.{p(1)}\left.\right|{H_{\mathrm{1p}}}\left|\right.{q(1)}\left.\right> and gpqrs=p(1)|q(2)|Vint(\mathboldR1\mathboldR2)|r(1)|s(2)g_{pqrs}={\left\langle{p(1)}\right|}\left<\right.{q(2)}\left.\right|{V_{\mathrm{int}}(\mathbold{R}_{1}-\mathbold{R}_{2})}\left|\right.{r(1)}\left.\right>{\left|{s(2)}\right\rangle}, with |q(1){\left|{q(1)}\right\rangle} referring to the spin-orbital ϕq(1)\phi_{q}(1) for a particle labeled ‘(1)’. That is, in what follows, the participation numbers are constrained by 0nq1{0\leq n_{q}\leq 1}, and the spin-orbitals {ϕq()}\big{\{}\phi_{q}(\,)\big{\}} define the creation and annihilation operators {ϕp+}\big{\{}\phi_{p}^{+}\big{\}} and {ϕp}\big{\{}\phi_{p}^{-}\big{\}} that enter the one-body reduced density matrix (‘1-matrix’) with elements

Γpq1=Ψ|ϕq+ϕp|Ψ\displaystyle{{}^{1}\Gamma_{pq}}=\langle\Psi|\phi_{q}^{+}\,\phi_{p}^{-}|\Psi\rangle\quad (45)

and the 2-matrix with elements

Γpqrs2=12Ψ|ϕr+ϕs+ϕqϕp|Ψ.\displaystyle{{}^{2}\Gamma_{pqrs}}=\frac{1}{2}\,\langle\Psi|\phi_{r}^{+}\,\phi_{s}^{+}\,\phi_{q}^{-}\,\phi_{p}^{-}|\Psi\rangle\,. (46)

Their normalizations are

rΓprqr2=N12Γpq1andpΓpp1=N.\displaystyle\sum_{r}\,{{}^{2}\Gamma_{prqr}}=\frac{N-1}{2}\;{{}^{1}\Gamma_{pq}}\quad\mbox{and}\quad\sum_{p}\,{{}^{1}\Gamma_{pp}}=N\,. (47)

In conventional approaches to DFT, the total energy in Eq. (44) is the functional

E[ρ(1)]=min{Γpqrs2}ρ(1)pqrs(2hrpδqsN1+grspq)Γpqrs2,\displaystyle E[\rho(1)]=\min_{\{{{}^{2}\Gamma_{pqrs}}\}\to\rho(1)}\;\sum_{pqrs}\,\left(\frac{2\,h_{rp}\,\delta_{qs}}{N-1}+g_{rspq}\right)\,{{}^{2}\Gamma_{pqrs}}\,, (48)

where {Γpqrs2}ρ(1)\{{{}^{2}\Gamma_{pqrs}}\}\to\rho(1), with the single-particle density ρ(1)\rho(1), denotes both the constraint

pqΓpq1ϕq(1)ϕp(1)=ρ(1)\displaystyle\sum_{pq}{{}^{1}\Gamma}_{pq}\,\phi^{*}_{q}(1)\,\phi_{p}(1)=\rho(1)\quad (49)

and the NN-representability of {Γpqrs2}\{{{}^{2}\Gamma_{pqrs}}\}. It is instructive to compare Eq. (48) with its analog

E({1Γpq})=pqΓpq1hqp+min{Γpqrs2}{1Γpq}pqrsΓpqrs2grspq\displaystyle E\big{(}\{^{1}\Gamma_{pq}\}\big{)}=\sum_{pq}{{}^{1}\Gamma}_{pq}\,h_{qp}+\min_{\{{{}^{2}\Gamma_{pqrs}}\}\to\{^{1}\Gamma_{pq}\}}\;\sum_{pqrs}{{}^{2}\Gamma}_{pqrs}\,g_{rspq} (50)

in density matrix functional theory, where {Γpqrs2}{1Γpq}\{{{}^{2}\Gamma_{pqrs}}\}\to\{^{1}\Gamma_{pq}\} denotes both the constraints in Eq. (47) and the NN-representability of {Γpqrs2}\{{{}^{2}\Gamma_{pqrs}}\}. There are two advantages of density matrix functional theory over conventional DFT. First, the obvious one, which is shared with 1pEx-DFT, namely the exclusion of the single-particle part from the constrained minimization. Second, the less obious one, which is only revealed upon the introduction of the 2-cumulant with the elements

2𝔊pqrs=Γpqrs212(Γpr1Γqs1Γps1Γqr1),\displaystyle^{2}\mathfrak{G}_{pqrs}={{}^{2}\Gamma_{pqrs}}-\frac{1}{2}\left({{}^{1}\Gamma_{pr}}{{}^{1}\Gamma_{qs}}-{{}^{1}\Gamma_{ps}}{{}^{1}\Gamma_{qr}}\right)\,, (51)

which obey 𝔊pqrs2=𝔊rspq2=2𝔊pqsr=𝔊qpsr2=2𝔊qprs{{}^{2}\mathfrak{G}_{pqrs}={{}^{2}}\mathfrak{G}_{rspq}^{*}=-^{2}\mathfrak{G}_{pqsr}={{}^{2}\mathfrak{G}_{qpsr}}=-^{2}\mathfrak{G}_{qprs}} and

r𝔊prqr2=12(kΓpk1Γkq1Γpq1).\displaystyle\sum_{r}\,{{}^{2}\mathfrak{G}_{prqr}}=\frac{1}{2}\left(\sum_{k}{{}^{1}\Gamma}_{pk}{{}^{1}\Gamma}_{kq}-{{}^{1}\Gamma}_{pq}\right)\,. (52)

Then, Eq. (50) reads

E({1Γpq})=pqΓpq1hqp+12pqrs(Γpr1Γqs1Γps1Γqr1)grspq+min{𝔊pqrs2}{1Γpq}pqrs𝔊pqrs2grspq,\displaystyle E(\{^{1}\Gamma_{pq}\})=\sum_{pq}{{}^{1}\Gamma}_{pq}\,h_{qp}+\frac{1}{2}\;\sum_{pqrs}\left({{}^{1}\Gamma_{pr}}{{}^{1}\Gamma_{qs}}-{{}^{1}\Gamma_{ps}}{{}^{1}\Gamma_{qr}}\right)g_{rspq}+\min_{\{{{}^{2}\mathfrak{G}_{pqrs}}\}\to\{^{1}\Gamma_{pq}\}}\;\sum_{pqrs}{{}^{2}\mathfrak{G}_{pqrs}}\,g_{rspq}\,, (53)

where {𝔊pqrs2}{1Γpq}\{{{}^{2}\mathfrak{G}_{pqrs}}\}\to\{^{1}\Gamma_{pq}\} denotes both the constraint (52) and the NN-representability of {𝔊pqrs2}\{{{}^{2}\mathfrak{G}_{pqrs}}\}. As a result, the minimization is now over a contribution to the total energy that vanishes for uncorrelated systems and is small in general.

In 1pEx-DFT we choose the spin-orbitals {ϕp()}\big{\{}\phi_{p}(\,)\big{\}} specifically to diagonalize H1pH_{\mathrm{1p}}. That is, each 1pEx mode introduced in Eq. (3) is effectively made up of two spin-orbitals ϕp()\phi_{p}(\,), and the total energy functional reads

E({np})=pnphpp+min{Γpqrs2}{np}pqrsΓpqrs2grspq,\displaystyle E(\{n_{p}\})=\sum_{p}n_{p}\,h_{pp}+\min_{\{{{}^{2}\Gamma_{pqrs}}\}\to\{n_{p}\}}\;\sum_{pqrs}{{}^{2}\Gamma}_{pqrs}\,g_{rspq}\,, (54)

where np=Γpp1n_{p}={{}^{1}\Gamma}_{pp}, ϵp=hpp\epsilon_{p}=h_{pp} (in the case of a spin-unpolarized system, ϵ1=ϵ2=E1{\epsilon_{1}=\epsilon_{2}=E_{1}}, ϵ3=ϵ4=E2{\epsilon_{3}=\epsilon_{4}=E_{2}}, and so forth, with the energies EaE_{a} of Eq. (3)), and {Γpqrs2}{np}\{{{}^{2}\Gamma_{pqrs}}\}\to\{n_{p}\} denotes both the constraint 2N1rΓprpr2=np{\frac{2}{N-1}\;\sum_{r}\,{{}^{2}\Gamma_{prpr}}=n_{p}} for all pp, according to Eq. (47), and the NN-representability of {Γpqrs2}\{{{}^{2}\Gamma_{pqrs}}\}.

A perturbative treatment of the interaction energy in Eq. (54) yields

Γpq1=Γpq(0)1+λΓpq(1)1+λ2Γpq(2)1+\displaystyle{{}^{1}{\Gamma}_{pq}}={{}^{1}{\Gamma}_{pq}^{(0)}}+\lambda\,{{}^{1}{\Gamma}_{pq}^{(1)}}+\lambda^{2}\,{{}^{1}{\Gamma}_{pq}^{(2)}}+...\, (55)

with parameter λ\lambda, as done in Ref. [56], where

Γpq(0)1\displaystyle{{}^{1}\Gamma_{pq}^{(0)}} =νpδpq,\displaystyle=\nu_{p}\,\delta_{pq}\,, (56)
Γpq(1)1\displaystyle{{}^{1}\Gamma_{pq}^{(1)}} =νpνqϵpϵqGpq,\displaystyle=\frac{\nu_{p}-\nu_{q}}{\epsilon_{p}-\epsilon_{q}}\;G_{pq}\,, (57)

and

Γpq(2)1\displaystyle{{}^{1}{\Gamma}_{pq}^{(2)}} =iηpηqνiνpνqηi(ϵpϵi)(ϵqϵi)GpiGiq+𝒫^pqiηqνpνiνqηpηiϵqϵiGpiGiqϵpϵq\displaystyle=\sum_{i}\,\frac{\eta_{p}\,\eta_{q}\,\nu_{i}-\nu_{p}\,\nu_{q}\,\eta_{i}}{(\epsilon_{p}-\epsilon_{i})\,(\epsilon_{q}-\epsilon_{i})}\;G_{pi}\;G_{iq}+\mathcal{\hat{P}}^{\dagger}_{pq}\sum_{i}\,\frac{\eta_{q}\,\nu_{p}\,\nu_{i}-\nu_{q}\,\eta_{p}\,\eta_{i}}{\epsilon_{q}-\epsilon_{i}}\;\frac{G_{pi}\;G_{iq}}{\epsilon_{p}-\epsilon_{q}}
+ijνpνqϵpϵqνiνjϵiϵjGij𝔤jpiq+12𝒫^pqijkηqνpνiνjηkνqηpηiηjνkϵq+ϵkϵiϵj𝔤kpij𝔤ijkqϵpϵq\displaystyle\quad+\sum_{ij}\,\frac{\nu_{p}-\nu_{q}}{\epsilon_{p}-\epsilon_{q}}\;\frac{\nu_{i}-\nu_{j}}{\epsilon_{i}-\epsilon_{j}}\;G_{ij}\;\mathfrak{g}_{jpiq}+\frac{1}{2}\mathcal{\hat{P}}^{\dagger}_{pq}\sum_{ijk}\,\frac{\eta_{q}\,\nu_{p}\,\nu_{i}\,\nu_{j}\,\eta_{k}-\nu_{q}\,\eta_{p}\,\eta_{i}\,\eta_{j}\,\nu_{k}}{\epsilon_{q}+\epsilon_{k}-\epsilon_{i}-\epsilon_{j}}\;\frac{\mathfrak{g}_{kpij}\;\mathfrak{g}_{ijkq}}{\epsilon_{p}-\epsilon_{q}}
+ 2(ηpηqνpνq)ijk𝔊ipjk(1)2𝔊jkiq(1)2.\displaystyle\quad+\,2\,(\eta_{p}\,\eta_{q}-\nu_{p}\,\nu_{q})\sum_{ijk}\,{{}^{2}{\mathfrak{G}}_{ipjk}^{(1)}}\,{{}^{2}{\mathfrak{G}}_{jkiq}^{(1)}}\,. (58)

Here, νp\nu_{p} and ηp=1νp{\eta_{p}=1-\nu_{p}} are the indicator functions for the core ({νp}\{\nu_{p}\}) and virtual ({ηp}\{\eta_{p}\}) subsets of the spin-orbitals {ϕp(1)}\{\phi_{p}(1)\}, i.e., νp=1{\nu_{p}=1} if npn_{p} is close to one and νp=0{\nu_{p}=0} otherwise. For the purpose of the present work, we set νp=1{\nu_{p}=1} for p=1,,N{p=1,...,N} in the case of NN particles. Fractions with vanishing numerators in Eqs. (57) and (58) are assumed to be zero, even if ϵp=ϵq{\epsilon_{p}=\epsilon_{q}}. We define 𝔤pqrs=gpqrsgpqsr{\mathfrak{g}_{pqrs}=g_{pqrs}-g_{pqsr}} and Gpq=iνi𝔤piqi{G_{pq}=\sum_{i}\,\nu_{i}\,\mathfrak{g}_{piqi}}, and the operator 𝒫pq\mathcal{P}^{\dagger}_{pq} acts on two-index quantities according to the prescription 𝒫pqApq=Apq+Aqp{\mathcal{P}^{\dagger}_{pq}\,A_{pq}=A_{pq}+A_{qp}^{*}}.

The 2-cumulant has no zeroth-order term and we shall approximate it by its first-order term

λ2𝔊pqrs(1)=12νpνqηrηsηpηqνrνsϵp+ϵqϵrϵs𝔤pqrs,\displaystyle\lambda\;^{2}\mathfrak{G}_{pqrs}^{(1)}=\frac{1}{2}\;\frac{\nu_{p}\,\nu_{q}\,\eta_{r}\,\eta_{s}-\eta_{p}\,\eta_{q}\,\nu_{r}\,\nu_{s}}{\epsilon_{p}+\epsilon_{q}-\epsilon_{r}-\epsilon_{s}}\;\mathfrak{g}_{pqrs}\,, (59)

see Ref. [56]. With Eqs. (51), (55), and (59) the first two terms in the expansion

Vee({np})=min{Γpqrs2}{np}pqrsΓpqrs2grspq=λVee(1)({np})+λ2Vee(2)({np})+\displaystyle V_{\mathrm{ee}}(\{n_{p}\})=\min_{\{{{}^{2}\Gamma_{pqrs}}\}\to\{n_{p}\}}\;\sum_{pqrs}{{}^{2}\Gamma}_{pqrs}\,g_{rspq}=\lambda\,V^{(1)}_{\mathrm{ee}}(\{n_{p}\})+\lambda^{2}\,V_{\mathrm{ee}}^{(2)}(\{n_{p}\})+... (60)

are readily available:

Vee(1)({np})\displaystyle V_{\mathrm{ee}}^{(1)}(\{n_{p}\}) =pqrsΓpqrs(0)2grspq=12pqrs(Γpr(0)1Γqs(0)1Γps(0)1Γqr(0)1)grspq=12pqνpνq𝔤pqpq\displaystyle=\sum_{pqrs}{{}^{2}\Gamma}^{(0)}_{pqrs}\,g_{rspq}=\frac{1}{2}\;\sum_{pqrs}\left({{}^{1}\Gamma^{(0)}_{pr}}\,{{}^{1}\Gamma^{(0)}_{qs}}-{{}^{1}\Gamma^{(0)}_{ps}}\,{{}^{1}\Gamma^{(0)}_{qr}}\right)\,g_{rspq}=\frac{1}{2}\;\sum_{pq}\,\nu_{p}\,\nu_{q}\,\mathfrak{g}_{pqpq} (61)
and
Vee(2)({np})\displaystyle V_{\mathrm{ee}}^{(2)}(\{n_{p}\}) =pqrsΓpqrs(1)2grspq\displaystyle=\sum_{pqrs}{{}^{2}\Gamma}^{(1)}_{pqrs}\,g_{rspq}
=12pqrs(Γpr(1)1Γqs(0)1Γps(1)1Γqr(0)1+Γpr(0)1Γqs(1)1Γps(0)1Γqr(1)1)grspq+pqrs𝔊pqrs(1)2grspq\displaystyle=\frac{1}{2}\;\sum_{pqrs}\left({{}^{1}\Gamma^{(1)}_{pr}}\,{{}^{1}\Gamma^{(0)}_{qs}}-{{}^{1}\Gamma^{(1)}_{ps}}\,{{}^{1}\Gamma^{(0)}_{qr}}+{{}^{1}\Gamma^{(0)}_{pr}}\,{{}^{1}\Gamma^{(1)}_{qs}}-{{}^{1}\Gamma^{(0)}_{ps}}\,{{}^{1}\Gamma^{(1)}_{qr}}\right)\,g_{rspq}+\,\sum_{pqrs}{{}^{2}\mathfrak{G}_{pqrs}^{(1)}}\,g_{rspq}
=pqνpνqϵpϵq|Gpq|2+12pqrsνpνqηrηsηpηqνrνsϵp+ϵqϵrϵs|𝔤pqrs|2,\displaystyle=\,\sum_{pq}\frac{\nu_{p}-\nu_{q}}{\epsilon_{p}-\epsilon_{q}}\;|G_{pq}|^{2}+\frac{1}{2}\,\sum_{pqrs}\frac{\nu_{p}\,\nu_{q}\,\eta_{r}\,\eta_{s}-\eta_{p}\,\eta_{q}\,\nu_{r}\,\nu_{s}}{\epsilon_{p}+\epsilon_{q}-\epsilon_{r}-\epsilon_{s}}\;|\mathfrak{g}_{pqrs}|^{2}\,, (62)

such that (λ\lambda is now dropped)

Vee({np})=12pqνpνq𝔤pqpq+pqνpνqϵpϵq|Gpq|2+12pqrsνpνqηrηsηpηqνrνsϵp+ϵqϵrϵs|𝔤pqrs|2\displaystyle V_{\mathrm{ee}}(\{n_{p}\})=\frac{1}{2}\;\sum_{pq}\,\nu_{p}\,\nu_{q}\,\mathfrak{g}_{pqpq}+\,\sum_{pq}\frac{\nu_{p}-\nu_{q}}{\epsilon_{p}-\epsilon_{q}}\;|G_{pq}|^{2}+\frac{1}{2}\,\sum_{pqrs}\frac{\nu_{p}\,\nu_{q}\,\eta_{r}\,\eta_{s}-\eta_{p}\,\eta_{q}\,\nu_{r}\,\nu_{s}}{\epsilon_{p}+\epsilon_{q}-\epsilon_{r}-\epsilon_{s}}\;|\mathfrak{g}_{pqrs}|^{2} (63)

is correct up to second order in the pair interaction strength. Accordingly, we determine the participation numbers by setting p=q{p=q} in Eqs. (56)–(58):

np=Γpp1νp+iηpνiνpηi(ϵpϵi)2|Gpi|2+12(ηpνp)ijkηpηiνjνk+νpνiηjηk(ϵi+ϵpϵjϵk)2|𝔤ipjk|2.\displaystyle n_{p}={{}^{1}\Gamma}_{pp}\approx\nu_{p}+\sum_{i}\,\frac{\eta_{p}\,\nu_{i}-\nu_{p}\,\eta_{i}}{(\epsilon_{p}-\epsilon_{i})^{2}}\;|G_{pi}|^{2}+\,\frac{1}{2}\;(\eta_{p}-\nu_{p})\,\sum_{ijk}\frac{\eta_{p}\,\eta_{i}\,\nu_{j}\,\nu_{k}+\nu_{p}\,\nu_{i}\,\eta_{j}\,\eta_{k}}{(\epsilon_{i}+\epsilon_{p}-\epsilon_{j}-\epsilon_{k})^{2}}\;|\mathfrak{g}_{ipjk}|^{2}\,. (64)
Refer to caption
Figure 4: Energies and participation numbers for N=2{N=2} contact-interacting unpolarized spin-12\frac{1}{2} particles at three different interaction strengths 𝒸{\mathpzc{c}}. The connecting lines guide the eye. Left: we compare the quasi-exact energy EexE_{\mathrm{ex}} (where ‘quasi-exact’ refers to a numerically approximate evaluation of the exact expression that is accurate at the level of machine precision, here achieved for L=60{L=60}) with the total energy E1p+Vee{E_{\mathrm{1p}}+V_{\mathrm{ee}}} (as a function of LL) to second-order perturbation theory, viz., Eq. (63), with 2L2L spin-orbitals taken into account. For illustrative purposes we rescale the ordinate for 𝒸=0.1{\mathpzc{c}=0.1} (𝒸=0.001{\mathpzc{c}=0.001}) by a factor of 10210^{2} (10610^{6}). Right: the participation numbers {na}\{n_{a}\} of (doubly occupied) 1pEx modes labeled a=1,2,,L{a=1,2,...,L} according to Eq. (3). We compare the quasi-exact participation numbers with the participation numbers derived from Eq. (64), where the summation indices span {1,2,,2L}\{1,2,...,2L\}.

Figure 4 illustrates for N=2{N=2} contact-interacting unpolarized spin-12\frac{1}{2} particles that the total energy in second-order perturbation theory converges rather slowly as the number LL of single particle orbitals increases (Fig. 4, left panel). As expected, the perturbative treatment becomes very accurate as the contact interaction strength 𝒸\mathpzc{c} decreases from 𝒸=1{\mathpzc{c}=1} to 𝒸=0.001{\mathpzc{c}=0.001}. But the convergence behavior for 𝒸=0.001{\mathpzc{c}=0.001} still matches that for 𝒸=1{\mathpzc{c}=1}. Similarly, we find a slow decay of the 1pEx participation numbers {na}\{n_{a}\} toward zero as aa increases (Fig. 4, right panel). If this example is prototypical, then a high energy cutoff in the 1pEx-basis is required for very precise calculations that include correlations beyond HF exchange.

Appendix B Approximate density matrices

In this appendix we (i) obtain the seed matrices for the 1pEx-DFT program in Eq. (38) from the matrix mixer algorithm introduced in Ref. [31] and from a TF-inspired Wigner function, respectively, (ii) derive the Hartree–Fock density matrix in the 1pEx-basis for N=2{N=2} particles, (iii) express the spatial and momental densities through the converged seed matrix and phases, (iv) write the exact HF interaction energy for the contact-interaction as a density functional, and (v) discuss transformations of the seed matrix beyond the phase transformations in Eq. (26).

Seed matrix from a matrix mixer algorithm. To generate a valid density matrix ϱ\varrho, which obeys

ϱaa\displaystyle\varrho_{aa} =na\displaystyle=n_{a} (65)
and
ϱ2\displaystyle\varrho^{2} =2ϱ\displaystyle=2\varrho (66)

for a prescribed vector 𝒏\bm{n} of participation numbers, we start from a matrix ϱ0\varrho_{0} with diagonal entries (2,2,,2,0,0,,0)(2,2,\dots,2,0,0,\dots,0) that add up to NN. Note that ϱ0\varrho_{0} is itself a proper density matrix and obeys the constraint of Eq. (66). We aim at an iterative transformation of ϱ0\varrho_{0} by unitary transformations (leaving both the trace and the spectrum unchanged), such that the diagonal of the final matrix is 𝒏\bm{n}. This is accomplished by a sequence of unitary transformations

M=(cosθsinθsinθcosθ),M=\begin{pmatrix}\cos\theta&\sin\theta\\ \sin\theta&-\cos\theta\\ \end{pmatrix}\,, (67)

which transform diagonal 2×22\times 2 matrices into

M(η)(a00b)M(η)=(ηa+(1η)bη(1η)(ab)η(1η)(ab)(1η)a+ηb).M(\eta)\,\begin{pmatrix}a&0\\ 0&b\end{pmatrix}\,M(\eta)^{\dagger}=\begin{pmatrix}\eta a+(1-\eta)b&\sqrt{\eta(1-\eta)}(a-b)\\ \sqrt{\eta(1-\eta)}(a-b)&(1-\eta)a+\eta b\end{pmatrix}. (68)

Here, the parameter η=cos2θ{\eta=\cos^{2}\theta} can be adjusted for the first diagonal element of the matrix in Eq. (68) to equal any prescribed value between aa and bb. In particular, since the participation numbers are all bounded between zero and two, we can apply a sequence of suitable transformations MM on the initial matrix ϱ0\varrho_{0}, such that the final density matrix ϱit\varrho^{\mathrm{it}} obeys the constraint in Eq. (65).

The matrix mixer algorithm has been proven to work in all cases [31]. The proof consists of inductively showing that, for a given nan_{a}, there is always a 2×22\times 2 diagonal sub-matrix, associated with indices (i,j)(i,j), such that ϱiinaϱjj{\varrho_{ii}\geq n_{a}\geq\varrho_{jj}}, hence permitting the application of the matrix mixing operation MM of Eq. (68). As an illustration, we consider N=4{N=4} particles on L=4{L=4} levels and target 𝒏=(64,54,34,24){\bm{n}=\left(\frac{6}{4},\frac{5}{4},\frac{3}{4},\frac{2}{4}\right)}. The sequence

ϱ0=(2000020000000000)(1,3)η1=3/4ϱ1=(322120)(2,3)η2=1/2ϱ2=(3254540)(3,4)η3=3/5ϱ3=(32543412)\varrho_{0}=\begin{pmatrix}2&0&0&0\\ 0&2&0&0\\ 0&0&0&0\\ 0&0&0&0\\ \end{pmatrix}\overset{\eta_{1}=3/4}{\underset{(1,3)}{\longrightarrow}}\;\varrho_{1}=\begin{pmatrix}\frac{3}{2}&&*&\\ &2&&\\ *&&\frac{1}{2}&\\ &&&0\\ \end{pmatrix}\overset{\eta_{2}=1/2}{\underset{(2,3)}{\longrightarrow}}\;\varrho_{2}=\begin{pmatrix}\frac{3}{2}&*&*&\\ *&\frac{5}{4}&*&\\ *&*&\frac{5}{4}&\\ &&&0\\ \end{pmatrix}\overset{\eta_{3}=3/5}{\underset{(3,4)}{\longrightarrow}}\;\varrho_{3}=\begin{pmatrix}\frac{3}{2}&*&*&*\\ *&\frac{5}{4}&*&*\\ *&*&\frac{3}{4}&*\\ *&*&*&\frac{1}{2}\\ \end{pmatrix} (69)

of matrix mixing operations M(ηi)M(\eta_{i}), each operating in the 2×22\times 2 sector of indices (a,b)(a,b) as indicated, yields one target entry on the diagonal in each step ii. Off-diagonal elements, here summarily denoted by an asterisk (*), are thereby introduced and modified. The final density matrix ϱ3\varrho_{3} then satisfies both conditions of Eqs. (65) and (66). The exploration of alternative mixer algorithms is left for future study.

In the simplest case of N=2{N=2} particles on L=2{L=2} levels, i.e., na+nb=2{n_{a}+n_{b}=2} (without loss of generality, nanb{n_{a}\geq n_{b}}), we begin with ϱ0=(2000){\varrho_{0}=\begin{pmatrix}2&0\\ 0&0\\ \end{pmatrix}} and apply Eq. (68) with weight η=na2\eta=\frac{n_{a}}{2} to obtain

ϱit=(nananbnanbnb).\varrho^{\mathrm{it}}=\begin{pmatrix}n_{a}&\sqrt{n_{a}n_{b}}\\ \sqrt{n_{a}n_{b}}&n_{b}\end{pmatrix}. (70)

That is, for N=2{N=2} particles, the matrix mixer algorithm reproduces the Hartree–Fock density matrix in the 1pEx-basis: the spin-orbital density matrix γHFN=2(r;r)\gamma_{\mathrm{HF}}^{N=2}(r;r^{\prime}) of the Hartree–Fock spin-singlet ground-state wave function

ψHFN=2(r1=(\mathboldr1,σ1),r2=(\mathboldr2,σ2))=12ϕ(\mathboldr1)ϕ(\mathboldr2)σ1σ2|(||)\displaystyle\psi_{\mathrm{HF}}^{N=2}\big{(}r_{1}=(\mathbold{r}_{1},\sigma_{1}),r_{2}=(\mathbold{r}_{2},\sigma_{2})\big{)}=\frac{1}{\sqrt{2}}\phi(\mathbold{r}_{1})\,\phi(\mathbold{r}_{2})\,{\left\langle{\sigma_{1}\sigma_{2}}\right|}\,\big{(}{\left|{\uparrow\downarrow}\right\rangle}-{\left|{\downarrow\uparrow}\right\rangle}\big{)} (71)

for two particles in the normalized spatial orbital ϕ()\phi(\,) is

γHFN=2(r;r)=2dr2ψHFN=2(r,r2)ψHFN=2(r,r2)=ϕ(\mathboldr)ϕ(\mathboldr)(σ||σ+σ||σ).\displaystyle\gamma_{\mathrm{HF}}^{N=2}(r;r^{\prime})=2\int\mathrm{d}r_{2}\,\psi_{\mathrm{HF}}^{N=2}(r,r_{2})\,\psi_{\mathrm{HF}}^{N=2}(r^{\prime},r_{2})^{*}=\phi(\mathbold{r})\,\phi(\mathbold{r}^{\prime})^{*}\,\big{(}\left<\right.{\sigma}\left.\right|{\uparrow}\left.\right>\left<\right.{\uparrow}\left.\right|{\sigma^{\prime}}\left.\right>+\left<\right.{\sigma}\left.\right|{\downarrow}\left.\right>\left<\right.{\downarrow}\left.\right|{\sigma^{\prime}}\left.\right>\big{)}\,. (72)

Upon integrating out the spin degrees of freedom, we obtain the orbital density matrix

γHFN=2(\mathboldr;\mathboldr)=σ,σ{,}γHFN=2(r;r)=\mathboldr|ρHFN=2|\mathboldr,\displaystyle\gamma_{\mathrm{HF}}^{N=2}(\mathbold{r};\mathbold{r}^{\prime})=\sum_{\sigma,\sigma^{\prime}\in\{\uparrow,\downarrow\}}\gamma_{\mathrm{HF}}^{N=2}(r;r^{\prime})=\left<\right.{\mathbold{r}}\left.\right|{\rho_{\mathrm{HF}}^{N=2}}\left|\right.{\mathbold{r}^{\prime}}\left.\right>, (73)

with ρHFN=2=2|ϕϕ|{\rho_{\mathrm{HF}}^{N=2}=2{\left|{\phi}\right\rangle}{\left\langle{\phi}\right|}}, such that ϱabHF,N=2=a|ρHFN=2|b=2cacb\varrho^{\mathrm{HF},N=2}_{ab}=\left<\right.{a}\left.\right|{\rho_{\mathrm{HF}}^{N=2}}\left|\right.{b}\left.\right>=2\,c_{a}\,c_{b}^{*}, with coefficients ca=|ca|eiϕa{c_{a}=|c_{a}|\,\mathrm{e}^{\mathrm{i}\phi_{a}}} of the expansion ϕ(\mathboldr)=acaψa(\mathboldr){\phi(\mathbold{r})=\sum_{a}c_{a}\psi_{a}(\mathbold{r})} in the orthonormal 1pEx-basis. Since ϱaaHF,N=2=2|ca|2=na{\varrho^{\mathrm{HF},N=2}_{aa}=2|c_{a}|^{2}=n_{a}}, we get

ϱabHF,N=2=nanbei(ϕaϕb)ϱabit,N=2=eiϕaϱab(0),it,N=2eiϕb,\displaystyle\varrho^{\mathrm{HF},N=2}_{ab}=\sqrt{n_{a}\,n_{b}}\,\mathrm{e}^{\mathrm{i}\,\big{(}\phi_{a}-\phi_{b}\big{)}}\equiv\varrho^{\mathrm{it},N=2}_{ab}=\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\phi_{a}$}}\varrho^{(0),\mathrm{it},N=2}_{ab}\mathrm{e}^{\mbox{\footnotesize$-\mathrm{i}\phi_{b}$}}, (74)

with optimal phases {ϕ^a}\{\hat{\phi}_{a}\} to be found, see Eq. (38).

In Fig. 5 we compare the optimized participation numbers for this case with the exact results and display the spatial densities calculated from the converged density matrix: if the 1pEx-basis states can be chosen real, as is the case for the harmonic oscillator eigenstates, the spatial (momental) densities are

n()=a,bcos(ϕ^aϕ^b)ψa()ϱ^ab(0)ψb(),\displaystyle n(\cdot)=\sum_{a,b}\cos\big{(}\hat{\phi}_{a}-\hat{\phi}_{b}\big{)}\,\psi_{a}(\cdot)\,\hat{\varrho}_{ab}^{(0)}\,\psi_{b}(\cdot)\,, (75)

where ()(\cdot) stands for position \mathboldr\mathbold{r} (momentum \mathboldp\mathbold{p}). We can also use Eq. (75) in the case of the hydrogenic states after separating from {ψa}\{\psi_{a}\} the ϕ\phi-dependence (with corresponding quantum number mam_{a}) of the spherical harmonics in Eq. (124), such that the remainder of {ψa}\{\psi_{a}\} is real, and adding ϕ(mamb)\phi\,(m_{a}-m_{b}) to the argument of the cosine in Eq. (75), with ϕ\phi the azimuthal angle of \mathboldr\mathbold{r} (or \mathboldp\mathbold{p}).

Using the density matrix in Eq. (72) and the pair potential VintV_{\mathrm{int}} in Eq. (41), we reduce the exact HF interaction energy

EintHF[γ]=12(d\mathboldr1)(d\mathboldr2)Vint(\mathboldr1\mathboldr2)(n(\mathboldr1)n(\mathboldr2)σ1,σ2γHFN=2(r1;r2)γHFN=2(r2;r1))\displaystyle E_{\mathrm{int}}^{\mathrm{HF}}[\gamma]=\frac{1}{2}\int(\mathrm{d}\mathbold{r}_{1})(\mathrm{d}\mathbold{r}_{2})\,V_{\mathrm{int}}(\mathbold{r}_{1}-\mathbold{r}_{2})\left(n(\mathbold{r}_{1})n(\mathbold{r}_{2})-\sum_{\sigma_{1},\sigma_{2}}\gamma_{\mathrm{HF}}^{N=2}(r_{1};r_{2})\,\gamma_{\mathrm{HF}}^{N=2}(r_{2};r_{1})\right) (76)

to

Eint,1D-contactHF[n]=𝒸2dx((n(x))212(n(x))2)=𝒸4dx(n(x))2,\displaystyle E_{\text{int,1D-contact}}^{\mathrm{HF}}[n]=\frac{\mathpzc{c}}{2}\int\mathrm{d}x\,\left(\big{(}n(x)\big{)}^{2}-\frac{1}{2}\big{(}n(x)\big{)}^{2}\right)=\frac{\mathpzc{c}}{4}\int\mathrm{d}x\,\big{(}n(x)\big{)}^{2}\,, (77)

which is actually the exact expression of the HF interaction energy for any even NN (and generalizes to contact-interaction in 2D and 3D). We use Eq. (77) as the interaction functional for producing the DPFT energies in Table 1 and the DPFT densities in Figs. 2 and 5.

Seed matrix from a Thomas–Fermi inspired density matrix. With the ingredients listed in Table 3, we produce the density matrix ϱtf\varrho^{\mathrm{tf}} in Eq. (40) in analogy to the TF-approximated spatial density matrix through an ansatz for an approximate Wigner function [F]W[F]_{W} of an operator FF in the rotor phase space. The basic observables of the rotor are (i) the unitary ε=eiΦ{\mbox{\leavevmode\resizebox{6.99997pt}{}{$\varepsilon$}}=\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\Phi$}}} for the azimuth with bras φ|=φ+2π|{{\left\langle{\varphi}\right|}={\left\langle{\varphi+2\pi}\right|}} and ε|φ=eiφ|φ\mbox{\leavevmode\resizebox{6.99997pt}{}{$\varepsilon$}}{\left|{\varphi}\right\rangle}=\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\varphi$}}{\left|{\varphi}\right\rangle}, and (ii) the hermitian 𝒜\mathcal{A} for the angular momentum with kets |a{\left|{a}\right\rangle}. The inversion operator is I=a=|aa|=(2π)dφ2π|φφ|{I=\sum_{a=-\infty}^{\infty}{\left|{-a}\right\rangle}{\left\langle{a}\right|}=\int_{(2\pi)}\frac{\mathrm{d}\varphi}{2\pi}{\left|{-\varphi}\right\rangle}{\left\langle{\varphi}\right|}}, η()\eta(\,) is the step function, and ν=2m2[μV(x+x2)]+{\nu=\sqrt{\frac{2m}{\hbar^{2}}\left[\mu-V\left(\frac{x+x^{\prime}}{2}\right)\right]_{+}}}, with chemical potential μ\mu, potential energy V()V(\,), and [z]+=zη(z){[z]_{+}=z\,\eta(z)}. We express F(𝒜,ε)F(\mathcal{A},\mbox{\leavevmode\resizebox{6.99997pt}{}{$\varepsilon$}}) with the help of the Fourier representation of η()\eta(\,), where the contour integration crosses the imaginary axis in the lower half-plane, and define f(𝒜)|a=f(a)|a=cot(π2na)|af(\mathcal{A}){\left|{a}\right\rangle}=f(a){\left|{a}\right\rangle}=\cot\left(\frac{\pi}{2}n_{a}\right){\left|{a}\right\rangle}. The operators PP and XX that appear in W(x,p;X,P)W(x,p;X,P) as powers of (P;X)(P;X) are ordered such that all PP stand to the left of XX.

Refer to caption
Figure 5: Participation numbers nan_{a} for single-particle levels a=1,,L=10{a=1,\dots,L=10} (top) and spatial densities (bottom) for N=2{N=2} spin-12\frac{1}{2} fermions in a 1D harmonic trap at contact-interaction strength 𝒸=20{\mathpzc{c}=20}, analogous to Fig. 2. We obtain the participation numbers from 1pEx-DFT using the HF density matrix, which coincides with ϱit\varrho^{\mathrm{it}} for N=2{N=2}. For this system, HF overestimates the exact ground-state energy by a factor of three. Hence, the quantitative differences between the (Dirac-approximated) participation numbers and their exact counterparts obtained from Refs. [42, 43] are not surprising. The same holds for the spatial 1pEx-densities labeled ‘ϱit/tf\varrho^{\mathrm{it/tf}}’, which align more with the DPFT densities nTFn_{\mathrm{TF}} and n3n_{3^{\prime}} than with the exact density. As any approximate density, also n3n_{3^{\prime}} does not have all the properties of the exact density. The oscillations of n3n_{3^{\prime}} into negative numbers occur in the classically forbidden region and become less pronounced for larger NN, see Fig. 2 and Ref. [38]. For applications that strictly rely on positive densities everywhere, positive n3n_{3^{\prime}} can be enforced in an ad hoc fashion at the level of the self-consistent calculation that yields n3n_{3^{\prime}}, see Ref. [22]. A similar measure is standard in the TF approach, where the spatial density is manually made to vanish in the classically forbidden region. We report exact results for completeness, but our actual objective is to validate our 1pEx-DFT implementation by comparing with HF results.
classical phase space {x,p}{\left\{x\in\mathbb{R},p\in\mathbb{R}\right\}} rotor phase space {a,φ[π,π)}{\left\{a\in\mathbb{Z},\varphi\in[-\pi,\pi)\right\}}
operator basis of Wigner type W(x,p;X,P)=2e2i(Pp);(Xx)W(x,p;X,P)=2\,\mathrm{e}^{\mbox{\footnotesize$\frac{2\mathrm{i}}{\hbar}(P-p);(X-x)$}} W(a,φ;𝒜,ε)=εaeiφ𝒜I(1+ε)eiφ𝒜εaW(a,\varphi;\mathcal{A},\mbox{\leavevmode\resizebox{6.99997pt}{}{$\varepsilon$}})=\mbox{\leavevmode\resizebox{6.99997pt}{}{$\varepsilon$}}^{a}\,\mathrm{e}^{\mbox{\footnotesize$-\mathrm{i}\varphi\mathcal{A}$}}\,I\,(1+\mbox{\leavevmode\resizebox{6.99997pt}{}{$\varepsilon$}})\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\varphi\mathcal{A}$}}\,\mbox{\leavevmode\resizebox{6.99997pt}{}{$\varepsilon$}}^{-a}
operator representation F(X,P)=dXdP2π[F]W(x,p)W(x,p;X,P)F(X,P)=\int\frac{\mathrm{d}X\mathrm{d}P}{2\pi\hbar}[F]_{W}(x,p)\,W(x,p;X,P) F(a,φ)=a(2π)dφ2π[F]W(a,φ)W(a,φ;𝒜,ε)F(a,\varphi)=\sum_{a}\int_{(2\pi)}\frac{\mathrm{d}\varphi}{2\pi}[F]_{W}(a,\varphi)\,W(a,\varphi;\mathcal{A},\mbox{\leavevmode\resizebox{6.99997pt}{}{$\varepsilon$}})
ansatz [F]W(x,p)=gη(μ12mp2V(x)){[F]_{W}(x,p)=g\,\eta\left(\mu-\frac{1}{2m}p^{2}-V(x)\right)} [F]W(a,φ)=gη(cotφf(a)){[F]_{W}(a,\varphi)=g\,\eta\big{(}\cot\varphi-f(a)\big{)}}
operator corresponding to [F]W[F]_{W} F(X,P)=gη(μ12mP2V(X)){F(X,P)=g\,\eta\left(\mu-\frac{1}{2m}P^{2}-V(X)\right)} F(𝒜,ε)=g\curve(-3,0,-8,0)\curve(3,0,8,0)\curve(0,0,-1.5,1.5)\curve(0,0,-1.5,-1.5)\arc(-3,0)180dt2πiteit[cotΦf(𝒜)]{F(\mathcal{A},\mbox{\leavevmode\resizebox{6.99997pt}{}{$\varepsilon$}})=g\,\int\limits_{\begin{picture}(16.0,3.0)(-8.0,-3.0)\put(0.0,0.0){\curve(-3,0,-8,0)\curve(3,0,8,0)}\put(8.0,0.0){\curve(0,0,-1.5,1.5)\curve(0,0,-1.5,-1.5)}\put(0.0,0.0){\arc(-3,0){180}}\put(0.0,0.0){\makebox(0.0,0.0)[]{$\cdot$}}\end{picture}}\frac{\mathrm{d}t}{2\pi\mathrm{i}t}\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}t\,[\cot\Phi-f(\mathcal{A})]$}}}
approximate density matrix ϱTF(x;x)=r|F|r=gsin[2(xx)ν]2π(xx){\varrho_{\mathrm{TF}}(x;x^{\prime})=\left<\right.{r}\left.\right|{F}\left|\right.{r^{\prime}}\left.\right>=\frac{g\sin\left[2(x-x^{\prime})\,\nu\right]}{2\pi(x-x^{\prime})}} ϱabtf=a|F|b=gsin[(ab)σ]π(ab)=Eq. (40){\varrho^{\mathrm{tf}}_{ab}=\left<\right.{a}\left.\right|{F}\left|\right.{b}\left.\right>=\frac{g\sin\left[(a-b)\,\sigma\right]}{\pi(a-b)}=\mbox{Eq.~{}(\ref{rhotf})}}
Table 3: Key ingredients for the derivation of the density matrix ϱtf\varrho^{\mathrm{tf}} in rotor phase space—in analogy to the TF-approximated density matrix ϱTF\varrho_{\mathrm{TF}} in classical phase space, see also Refs. [15, 16, 17].

As an example, we give ϱit\varrho^{\mathrm{it}} and ϱtf\varrho^{\mathrm{tf}} for 𝒏=(2,1.5,0.3,0.2){\bm{n}=(2,1.5,0.3,0.2)}:

ϱit(200001.50.670.5500.670.30.2400.550.240.2),ϱtf(200001.50.570.3200.570.30.2300.320.230.2).\displaystyle\varrho^{\mathrm{it}}\approx\left(\begin{array}[]{cccc}2&0&0&0\\ 0&1.5&0.67&0.55\\ 0&0.67&0.3&0.24\\ 0&0.55&0.24&0.2\end{array}\right)\quad,\qquad\varrho^{\mathrm{tf}}\approx\left(\begin{array}[]{cccc}2&0&0&0\\ 0&1.5&0.57&0.32\\ 0&0.57&0.3&0.23\\ 0&0.32&0.23&0.2\end{array}\right)\ . (86)

Alternative transformations of the seed matrix. We consider a 2×22\times 2 sector at the diagonal of ϱ\varrho (in general, the 2×22\times 2 sector associated with an index pair (a,b)(a,b)), of the form

(xγγy),\displaystyle{\left(\begin{array}[]{cccc}\ddots&\vdots&\vdots&\\ \cdots&x&\gamma&\cdots\\ \cdots&\gamma^{*}&y&\cdots\\ &\vdots&\vdots&\ddots\end{array}\right)}\,, (91)

where the unitary transformation afforded by

U=[(xy)2+(γeiφab+γeiφab)2]12((xy)eiξa(γ+γe2iφab)eiξb(γ+γe2iφab)eiξa(yx)eiξb)\displaystyle U=\left[(x-y)^{2}+\left(\gamma^{*}\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\varphi_{ab}$}}+\gamma\mathrm{e}^{\mbox{\footnotesize$-\mathrm{i}\varphi_{ab}$}}\right)^{2}\right]^{-\frac{1}{2}}{\left(\begin{array}[]{ccc}(x-y)\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\xi_{a}$}}&&\big{(}\gamma+\gamma^{*}\mathrm{e}^{\mbox{\footnotesize$2\mathrm{i}\varphi_{ab}$}}\big{)}\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\xi_{b}$}}\\ \big{(}\gamma^{*}+\gamma\mathrm{e}^{\mbox{\footnotesize$-2\mathrm{i}\varphi_{ab}$}}\big{)}\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\xi_{a}$}}&&(y-x)\,\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}\xi_{b}$}}\end{array}\right)} (94)

yields

U(xγγy)U=(xγei(2φabξa+ξb)γei(2φabξa+ξb)y),\displaystyle U^{\dagger}\,{\left(\begin{array}[]{cc}x&\gamma\\ \gamma^{*}&y\end{array}\right)}\,U={\left(\begin{array}[]{cc}x&\gamma^{*}\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}(2\varphi_{ab}-\xi_{a}+\xi_{b})$}}\\ \gamma\mathrm{e}^{\mbox{\footnotesize$-\mathrm{i}(2\varphi_{ab}-\xi_{a}+\xi_{b})$}}&y\end{array}\right)}\,, (99)

so that the diagonal entries are unchanged, while the off-diagonal entries are affected—in fact, Eq. (94) parameterizes all 2×22\times 2 unitary transformations with this property. Our numerical analyses show, however, that the inclusion of the 3L(L1)/23L(L-1)/2 parameters {φab,ξa,ξb}\{\varphi_{ab},\xi_{a},\xi_{b}\}, three for each index pair (a,b)(a,b), does not yield lower energies in the global minimization of Eq. (10) compared with optimizing over the LL phases of Eq. (26) only. Although every unitary transformation that leaves the diagonal elements of a matrix unchanged can be decomposed into a sequence of L(L1)/2L(L-1)/2 transformations in 2×22\times 2 sectors, we have so far only covered those cases in which each of these transformations in 2×22\times 2 sectors individually preserve the diagonal elements. We leave the exploration of more general unitary transformations for future study.

Appendix C Evolutionary algorithms

C.1 Particle swarm optimization

Particle swarm optimization (PSO) is an evolutionary algorithm that draws inspiration from how an ‘intelligent’ swarm, such as a flock of birds, moves toward beneficial conditions [27, 28, 29]. Aiming at the global optimizer \mathboldx^=argmin\mathboldxXf(\mathboldx){\hat{\mathbold{x}}=\mathrm{arg}\,\underset{\mathbold{x}\in X}{\mathrm{min}}f(\mathbold{x})} of the objective function ff, each swarm particle s{1,,S}{s\in\{1,\dots,S\}} updates its coordinate vector \mathboldxs=(xs,1,,xs,d,,xs,D)X{\mathbold{x}_{s}=\left(x_{s,1},\dots,x_{s,d},\dots,x_{s,D}\right)\in X} in an iterative random walk through the DD-dimensional search space XX. The core principle of PSO is the stochastic guidance of this random walk by (i) the so far encountered personal best position \mathboldps\mathbold{p}_{s} of ss and (ii) the global personal best position \mathboldpg\mathbold{p}_{g}—or, alternatively and more generally, the personal best position \mathboldpgs\mathbold{p}_{g_{s}} from a randomly selected group GsG_{s} of particles that are intermittently linked to ss.

Figure 6a illustrates the work flow of our PSO implementation. We start each PSO run by uniformly drawing SS random particle ‘positions’ \mathboldxs(0)\mathbold{x}_{s}^{(0)} and particle ‘velocities’ \mathboldvs(0)\mathbold{v}_{s}^{(0)} from XX. Then, in the iith iteration of the run, each position coordinate xs,d(i1)x_{s,d}^{(i-1)} receives the update

xs,d(i1)xs,d(i)=xs,d(i1)+vs,d(i)\displaystyle x_{s,d}^{(i-1)}\longrightarrow x_{s,d}^{(i)}=x_{s,d}^{(i-1)}+v_{s,d}^{(i)} (100)

according to the velocity update

vs,d(i)=w(i)vs,d(i1)+c1(i)(ps,d(i1)xs,d(i1))+c2(i)(pgs,d(i1)xs,d(i1))\displaystyle v_{s,d}^{(i)}=w^{(i)}\,v_{s,d}^{(i-1)}+c_{1}^{(i)}\,\left(p_{s,d}^{(i-1)}-x_{s,d}^{(i-1)}\right)+c_{2}^{(i)}\,\left(p_{g_{s},d}^{(i-1)}-x_{s,d}^{(i-1)}\right) (101)

with inertia w(i)w^{(i)} and random coefficients c1/2(i)[0,C1/2(i)]c_{1/2}^{(i)}\in[0,C_{1/2}^{(i)}]. We initialize these dynamic parameters with w(1)=0.42{w^{(1)}=0.42} and C1/2(1)=1.55{C_{1/2}^{(1)}=1.55}, respectively, as recommended in Ref. [57], see also Ref. [28], and optionally modify them according to an adaptive schedule during the course of optimization. Once we have moved the whole swarm, we enforce all constraints, which poses no problem since the position update is random anyway, and then make the velocities consistent with the constraints. For example, in the case of Eq. (39), we enforce the constraint i=1Lni=N{\sum_{i=1}^{L}n_{i}=N} of Eq. (6) as illustrated with Fig. 6b. There, 𝒏req\bm{n}^{\mathrm{req}} is first rescaled to 𝒏~=NNreq𝒏req{\tilde{\bm{n}}=\frac{N}{N^{\mathrm{req}}}\bm{n}^{\mathrm{req}}} if Nreq=i=1LnireqN{N^{\mathrm{req}}=\sum_{i=1}^{L}n_{i}^{\mathrm{req}}\not=N}. Then, we obtain 𝒏\bm{n} by adding a rescaled \mathboldσ=𝒏~𝒏c{\mathbold{\sigma}=\tilde{\bm{n}}-\bm{n}_{\mathrm{c}}}, which is parallel to the constraining line, to 𝒏c=NL(1,1){\bm{n}_{\mathrm{c}}=\frac{N}{L}(1,1)}, which points to the center of the intersection of the constraining line with the square that encodes 0ni2{0\leq n_{i}\leq 2}. We rescale \mathboldσ\mathbold{\sigma} using the maximal excess |δσ|=maxi{max{ni2,ni}}{|\delta\sigma|=\mathrm{max}_{i}\big{\{}\mathrm{max}\{n_{i}-2,-n_{i}\}\big{\}}}, here incidentally realized by |δσ2||\delta\sigma_{2}|, among all participation numbers beyond their allowed values in [0,2][0,2]. The generalization to larger LL is straightforward. As an alternative to rescaling 𝒏req\bm{n}^{\mathrm{req}} to 𝒏~\tilde{\bm{n}} on the constraining (hyper-)plane defined by i=1Lni=N{\sum_{i=1}^{L}n_{i}=N}, we may project onto the plane, which generally results in a different 𝒏~\tilde{\bm{n}} on the plane. In fact, we alternate between both options, rescaling and projection, in a random fashion. Note that the PSO never requests participation numbers outside [0,2][0,2] since the participation numbers are constructed as ni=1+cos(θi){n_{i}=1+\cos\big{(}\theta_{i}\big{)}}, but some nin_{i} can exceed 22 after rescaling if Nreq<N{N^{\mathrm{req}}<N}, and the projection can even result in negative excess δσ\delta\sigma. After the constraints are enforced, we calculate f(\mathboldxs(i))f\left(\mathbold{x}_{s}^{(i)}\right) for all ss and prepare to move the swarm in the next iteration. We monitor the swarm by collecting the best function values \mathboldfg=(fg(i),,fg(iλ+1)){\mathbold{f}_{g}=\left(f_{g}^{(i)},\dots,f_{g}^{(i-\lambda+1)}\right)} of the λ\lambda most recent iterations. We declare convergence and terminate the PSO run once the current variance C(i)C(i) of \mathboldfg\mathbold{f}_{g} falls below a chosen target variance TT; our default settings are λ=min(30,D){\lambda=\mathrm{min}(30,D)} and T=1012D{T=10^{-12}D}. Since PSO is stochastic and can, despite all our efforts, get stuck in local optima, it is prudent to judge the quality of an alleged global optimum with the aid of a histogram of the optima from several (e.g., 10 to 1000{10\mbox{ to }1000}) PSO runs, see Fig. 6c.

Refer to caption
Figure 6: a, Schematics of our implementation of PSO. b, An illustration of how to enforce the particle number constraint i=1Lni=N{\sum_{i=1}^{L}n_{i}=N} in the case of L=2{L=2} levels by replacing an improper 𝒏req\bm{n}^{\mathrm{req}}, the participation numbers for which the optimizer requests a function value, with 𝒏=𝒏c+(𝒏~𝒏c)||σ2||δσ2||/|σ2|{\bm{n}=\bm{n}_{\mathrm{c}}+\big{(}\tilde{\bm{n}}-\bm{n}_{\mathrm{c}}\big{)}\big{|}|\sigma_{2}|-|\delta\sigma_{2}|\big{|}/|\sigma_{2}|}, which obeys all constraints and is a proper vector of participation numbers close to 𝒏req\bm{n}^{\mathrm{req}}. c, Histogram of 1000 PSO runs (totaling 108{\sim 10^{8}} energy evaluations) for N=4{N=4} harmonically confined spin-1/2 fermions with contact interaction strength 𝒸=20{\mathpzc{c}=20} on 20 single-particle levels (hence, the dimension of the search space is D=40{D=40}), cf. Sec. 3.1 and Table 1. The lowest (highest) converged energy found by PSO is 19.41619.416 (21.56421.564). About 36%36\% of the runs yield energies below the red dashed line, which marks two percent excess over the HF energy of 19.154 (green dashed line).

We designed several adaptations to this general scheme to aid the exploratory capability of the swarm in the initial phase of each run, the convergence in the final phase, and the swarm’s ability to avoid premature convergence to local optima in the intermediate phase. We choose the number of elements in each group GsG_{s} to about S/4S/4 and randomly re-initialize this grouping, together with the associated inter-particle links, if the current iteration does not improve upon the best function value encountered during the current PSO run. In other words, there are effectively four concurrent swarms with populations that interchange their members in an adaptive fashion. In our experience, 40S010D{40\lesssim S_{0}\lesssim 10D} is a suitable range for the initial swarm size S=S0{S=S_{0}}, with a problem-dependent trade-off between swarm size and required iterations/runs. Our PSO code is parallelized with openMP, and each swarm particle is processed in a separate thread. With decreasing difference between current and target variance, we decrease SS down to max(10,t)\mathrm{max}(10,t), where tt is the number of available parallel-processing threads. The worst particles (in terms of their personal best) are thereby discarded according to the schedule S(i)=min{S0,max[t,t+S0(1log10C(i)/log10T)]}{S(i)=\mathrm{min}\left\{S_{0},\mathrm{max}\left[t,t+S_{0}\,\left(1-\mathrm{log}_{10}C(i)/\mathrm{log}_{10}T\right)\right]\right\}} at iteration ii. This procedure removes inferior and/or superfluous particles as long as the variance drops and hastens the convergence in the later optimization stages.

Furthermore, much has been said about the curse of dimensionality, viz., the exponential growth of the search space with DD, which lies at the heart of so many real-world problems, not least the quantum many-body problem. But there is also a blessing of dimensionality, namely that reducing the search interval in a single dimension by 50%50\% removes half of the total search space, irrespective of DD. We therefore implemented an adaptive search space, where search space intervals IdI_{d} shrink in all those dimensions dd, for which the coordinate of a newly encountered global best lies close to the center of IdI_{d}. In turn, we shift IdI_{d}, to the extent the constraints allow, whenever the ddth coordinate of a new global best comes close to a boundary of IdI_{d}. Optionally, the so adapted search space can be inherited by subsequent runs. If active, this procedure eases the computational load by focusing on more promising regions—at the expense of potentially sacrificing superior regions that can be reached only after sustained uphill exploration; but in practice, we find essentially the same high-quality optima with or without an adaptive search space, just more efficiently in the former case.

By default, we make the dynamic PSO parameters ww and C1/2C_{1/2} self-adapting: every time a significantly better global best is found, we draw a new, say, ww from a Gaussian distribution with variance of 0.00250.0025, centered at wcw_{\mathrm{c}}, which is itself updated according to wcIwc+(1I)ww_{\mathrm{c}}\rightarrow I*w_{\mathrm{c}}+(1-I)*\langle w\rangle, with inertia I=0.8{I=0.8} and the mean w\langle w\rangle of previously successful (in finding a significantly better global best) parameters ww. Each particle holds its own parameters ww and C1/2C_{1/2}, independent from the other particles. Hence, at any given iteration, the swarm’s individuals span a self-adapting distribution of search capabilities.

Finally, we note our unsuccessful attempts to improve the PSO performance through self-adaption of the PSO hyperparameters (ww, C1C_{1}, C2C_{2}, SS, λ\lambda, TT, etc.) by adding them to the search space XX or through ‘fuzzy self-tuning’ [58]. Also of little effect was our attempt to counter premature convergence by maintaining diversity among the swarm particles in the intermediate optimization stage: we replaced the worst fraction σ\sigma of particles (five times per run on average) with randomly re-initialized ones. We reckoned this to have little effect in the very early exploratory stages, which are reminiscent of random search anyway, or in the final stages, where the pull of many near-optimal particles dominates the swarm behavior. Convergence can also be accelerated through elitism, where the global best particle informs all others with a finite probability. However, we found it expedient to deactivate elitism by default to better escape local optima.

C.2 Genetic algorithm

Genetic algorithms (GAs) are a class of evolutionary algorithms that resemble Darwinian evolution. The principles of natural selection are mimicked through crossover, mutation, and selection operators that act on a population of ‘chromosomes’, each being a vector of variables (‘genes’) from the search space of the objective function. While many different flavors of GAs have been studied and countless heuristics proposed for augmenting these GAs [30], the following procedure is typically iterated: given a number of ‘parents’ whose fitness (viz., objective function value) has been determined, two parents are chosen for breeding a ‘child’ by exchanging genes according to a crossover rate, followed by randomly altering genes of the child according to a mutation rate and enforcing problem-specific constraints on the genes. In a subsequent selection process, children (or random invaders according to an invasion rate) may then replace parents based on a fitness comparison. For multi-modal deceptive objective functions, this selection pressure commonly homogenizes the population too quickly [59, 60], with a large part of the population representing the same local optimum—although mutations and invasions allow, in principle, the exploration of the whole search space. We thus built our genetic algorithm optimization (GAO), which we used to validate the results obtained with PSO, by augmenting an otherwise prototypical GA with three heuristics. First, we select new generations of parents akin to the fitness uniform selection scheme [59], which guarantees a high diversity among chromosomes throughout the entire evolution, though we skew the selection toward fitter parents and always preserve the fittest chromosome. Second, we let many small subpopulations evolve in parallel, with inter-population breeding that begins with neighboring populations according to a dispersal parameter and extends to all populations toward the end of a chosen maximum number of generations. Finally, we also shrink the population sizes to accelerate the convergence toward the end of the optimization, analogous to our procedure for PSO that is controlled by the history of encountered objective function values.

Appendix D Interaction tensor elements

The interaction tensor elements IabcdI_{abcd} of Eq. (34) for the specific physical systems simulated in this work are available in analytical form (except for the results on relativistic atoms presented in Sec. 3.3), see D.1D.3 below. For the small-scale systems that are addressed in this work with objective-function-based optimizers, it is expedient to precompute the elements IabcdI_{abcd} and store them in compact form by making use of their system-dependent symmetries.

D.1 Contact interaction (1D)

We compute the interaction tensor elements IabcdI_{abcd} of Eq. (34) for unpolarized contact-interacting spin-12\frac{1}{2} fermions of mass mm in a one-dimensional harmonic oscillator of frequency ω\omega with the aid of the generating function

fK(x,y)=a=0b=0xaa!ybb!a|eiK(𝔞+𝔞)|b=exyeiK(x+y)eK22,\displaystyle f_{K}(x,y)=\sum_{a=0}^{\infty}\sum_{b=0}^{\infty}\frac{x^{a}}{\sqrt{a!}}\frac{y^{b}}{\sqrt{b!}}\langle a|\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}K(\mathfrak{a}+\mathfrak{a}^{\dagger})$}}|b\rangle=\mathrm{e}^{\mbox{\footnotesize$xy$}}\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}K(x+y)$}}\mathrm{e}^{\mbox{\footnotesize$-\frac{K^{2}}{2}$}}\,, (102)

where K=k2mω{K=k\sqrt{\frac{\hbar}{2m\omega}}} and 𝔞+𝔞=2mωR{\mathfrak{a}+\mathfrak{a}^{\dagger}=\sqrt{\frac{2m\omega}{\hbar}}R}, such that

a|eikR|b=a|eiK(𝔞+𝔞)|b=1a!1b!(x)a(y)bfK(x,y)|x=y=0=eK22a!b!(iK)baLa(ba)(K2),\displaystyle\langle a|\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}kR$}}|b\rangle=\langle a|\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}K(\mathfrak{a}+\mathfrak{a}^{\dagger})$}}|b\rangle=\frac{1}{\sqrt{a!}}\frac{1}{\sqrt{b!}}\Big{(}\frac{\partial}{\partial x}\Big{)}^{a}\Big{(}\frac{\partial}{\partial y}\Big{)}^{b}f_{K}(x,y)\Bigg{|}_{x=y=0}=\mathrm{e}^{\mbox{\footnotesize$\frac{-K^{2}}{2}$}}\frac{\sqrt{a!}}{\sqrt{b!}}(\mathrm{i}K)^{b-a}L_{a}^{(b-a)}(K^{2})\,, (103)

which is symmetric in the nonnegative integers aa and bb; hence, bab\geq a without loss of generality. For the contact interaction in Eq. (41), we have u(k)=𝒸{u(k)=\mathpzc{c}} in Eq. (34). Then, expressing the generalized Laguerre polynomials as Lnα(x)=j=0n(n+αnj)(x)jj!{L_{n}^{\alpha}(x)=\sum_{j=0}^{n}\binom{n+\alpha}{n-j}\frac{(-x)^{j}}{j!}}, while setting n=a{n=a} (n=d{n=d}) and α=ba{\alpha=b-a} (α=cd{\alpha^{\prime}=c-d}), we write

2mωπIabcd\displaystyle\sqrt{\frac{2\hbar}{m\omega}}\,\pi\,I_{abcd} =dK𝒸𝒶|eiK(𝔞+𝔞)|𝒷𝒸|eiK(𝔞+𝔞)|𝒹\displaystyle=\int\mathrm{d}K\,\mathpzc{c}\,\langle a|\mathrm{e}^{\mbox{\footnotesize$\mathrm{i}K(\mathfrak{a}+\mathfrak{a}^{\dagger})$}}|b\rangle\langle c|\mathrm{e}^{\mbox{\footnotesize$-\mathrm{i}K(\mathfrak{a}+\mathfrak{a}^{\dagger})$}}|d\rangle
=dK𝒸eK2𝒶!𝒹!𝒷!𝒸!(i𝒦)𝒷𝒶(i𝒦)𝒸𝒹𝒶α(𝒦2)𝒹α(𝒦2)\displaystyle=\int\mathrm{d}K\,\mathpzc{c}\,\mathrm{e}^{\mbox{\footnotesize$-K^{2}$}}\sqrt{\frac{a!d!}{b!c!}}(\mathrm{i}K)^{b-a}(-\mathrm{i}K)^{c-d}L_{a}^{\alpha}(K^{2})L_{d}^{\alpha^{\prime}}(K^{2})
=dK𝒸eK2𝒶!𝒹!𝒷!𝒸!𝒿=0𝒶𝒿=0𝒹(𝒷𝒶𝒿)(𝒸𝒹𝒿)(𝒦2)𝒢𝒿!𝒿!(1)𝒷𝒶,\displaystyle=\int\mathrm{d}K\,\mathpzc{c}\,\mathrm{e}^{\mbox{\footnotesize$-K^{2}$}}\sqrt{\frac{a!d!}{b!c!}}\sum_{j=0}^{a}\sum_{j^{\prime}=0}^{d}\binom{b}{a-j}\binom{c}{d-j^{\prime}}\frac{(-K^{2})^{G}}{j!j^{\prime}!}(-1)^{b-a}\,, (104)

where G=12(ba+cd)+j+j{G=\frac{1}{2}(b-a+c-d)+j+j^{\prime}}. Since Iabcd=Ibadc=Idcba=Icdba{I_{abcd}=I_{badc}=I_{dcba}=I_{cdba}}, it suffices to compute IabcdI_{abcd} for index sets with abca\leq b\leq c and cdc\geq d, for which Eq. (104) reduces to

Iabcd={𝒸𝓂ω𝒿=0𝒶𝒿=0𝒹12π(1)𝒷𝒶𝒶!𝒷!𝒸!𝒹!(14)𝒢(2𝒢)!(𝒷𝒶+𝒿)!(𝒶𝒿)!(𝒸𝒹+𝒿)!(𝒹𝒿)!𝒿!𝒿!𝒢!,a+b+c+d even0,a+b+c+d odd.\displaystyle I_{abcd}=\begin{cases}\mathpzc{c}\sqrt{\frac{m\omega}{\hbar}}\sum_{j=0}^{a}\sum_{j^{\prime}=0}^{d}\frac{1}{\sqrt{2\pi}}\frac{(-1)^{b-a}\sqrt{a!b!c!d!}(-\frac{1}{4})^{G}(2G)!}{(b-a+j)!(a-j)!(c-d+j^{\prime})!(d-j^{\prime})!j!j^{\prime}!G!}&,\;a+b+c+d\;\mbox{ even}\\ 0&,\;a+b+c+d\;\mbox{ odd}\end{cases}\,. (105)

As an alternative to Eq. (105), we derive a recursion relation for

abcd=𝒸8πJ(a,b,c,d)2aa!2bb!2cc!2dd!\displaystyle\mathcal{H}_{abcd}=\frac{\mathpzc{c}}{\sqrt{8\pi}\mathcal{L}}\frac{J{\left(a,\,b,\,c,\,d\right)}}{\sqrt{2^{a}a!2^{b}b!2^{c}c!2^{d}d!}} (106)

in Eq. (33). Here,

J(a,b,c,d)=2πdxe2x2Ha(x)Hb(x)Hc(x)Hd(x),\displaystyle J{\left(a,\,b,\,c,\,d\right)}=\sqrt{\frac{2}{\pi}}\int\mathrm{d}{x}\,\mathrm{e}^{\mbox{\footnotesize$-2x^{2}$}}\,\mathrm{H}_{a}{\left({x}\right)}\,\mathrm{H}_{b}{\left({x}\right)}\,\mathrm{H}_{c}{\left({x}\right)}\,\mathrm{H}_{d}{\left({x}\right)}\,, (107)

with the Hermite polynomial Hn(x)\mathrm{H}_{n}{\left({x}\right)} of order n{n}, obeys the relation

a,b,c,dz1aa!z2bb!z3cc!z4dd!J(a,b,c,d)=exp[12(z1+z2+z3+z4)2(z12+z22+z32+z42)],\displaystyle\sum_{a,\,b,\,c,\,d}\frac{z_{1}^{a}}{a!}\frac{z_{2}^{b}}{b!}\frac{z_{3}^{c}}{c!}\frac{z_{4}^{d}}{d!}J{\left(a,\,b,\,c,\,d\right)}=\exp\left[\frac{1}{2}{\left(z_{1}+z_{2}+z_{3}+z_{4}\right)}^{2}-{\left(z_{1}^{2}+z_{2}^{2}+z_{3}^{2}+z_{4}^{2}\right)}\right]\,, (108)

obtained from applying the generating function of the Hermite polynomials four times. Operating with z1z2{z_{1}\frac{\partial}{\partial z_{2}}} on Eq. (108), we find

J(a1,b+1,c,d)={ 0,a+b+c+d odd(a1)J(a2,b,c,d)bJ(a1,b1,c,d)+cJ(a1,b,c1,d)+dJ(a1,b,c,d1),a+b+c+d even\displaystyle J{\left(a-1,\,b+1,\,c,\,d\right)}=\begin{cases}\;0&,\;a+b+c+d\;\text{ odd}\\ \;\mbox{\parbox{200.0003pt}{${\left(a-1\right)}\,J{\left(a-2,\,b,\,c,\,d\right)}-b\,J{\left(a-1,\,b-1,\,c,\,d\right)}\\ +c\,J{\left(a-1,\,b,\,c-1,\,d\right)}+d\,J{\left(a-1,\,b,\,c,\,d-1\right)}$ }}&,\;a+b+c+d\;\text{ even}\\ \end{cases} (109)

with initial values J(a, 0, 0, 0)=(12)a/2a!(a/2)![(a+1) mod 2]J{\left(a,\,0,\,0,\,0\right)}={\left(-\frac{1}{2}\right)}^{a/2}\frac{a!}{(a/2)!}\,[(a+1)\mbox{ mod }2]. Following Ref. [61], we may also write

J(a,b,c,d)=q=0Min{a+b,c+d}δ(a+b),(c+d)δ(a+b),(q mod 2)αq(a,b)αq(c,d)2qq!,J{\left(a,\,b,\,c,\,d\right)}=\sum_{q=0}^{\mathop{\mathrm{Min}}{\left\{a+b,c+d\right\}}}\delta_{(a+b),(c+d)}\,\delta_{(a+b),(q\text{ mod }2)}\,\alpha_{q}\left(a,\,b\right)\alpha_{q}\left(c,\,d\right)2^{q}q!\,, (110)

where αq(a,b)=n=Max{0,qa}Min{b,q}(1)bn+a+bq2[2a+b2q!(a+bq2)!]1(qn)(a+bqbn)\alpha_{q}\left(a,\,b\right)=\sum_{n=\mathop{\mathrm{Max}}{\left\{0,q-a\right\}}}^{\mathop{\mathrm{Min}}{\left\{b,q\right\}}}\left(-1\right)^{b-n+\frac{a+b-q}{2}}\left[2^{\frac{a+b}{2}}q!{\left(\frac{a+b-q}{2}\right)}!\right]^{-1}\binom{q}{n}\binom{a+b-q}{b-n}.

The accurate numerical evaluation of the closed formulae in Eqs. (105) and (106) beyond L15L\approx 15 energy levels require high-precision arithmetic. Alternatively and as a means of validation, the tensor elements IabcdI_{abcd} can be tabulated using double-precision arithmetic by directly evaluating the integral in Eq. (104)—we produced and confirmed all tensor elements with indices up to L=100{L=100} using an adaptive bisection algorithm with Boole quadrature.

D.2 Harmonic interaction (1D)

With the oscillator eigenfunctions x|a=ψa(x)\left<\right.{x}\left.\right|{a}\left.\right>=\psi_{a}(x) and Vint(xjxk)=2β~(xjxk)2=β(xjxk)2{V_{\mathrm{int}}(x_{j}-x_{k})=\frac{\mathcal{E}}{\mathcal{L}^{2}}\tilde{\beta}\,(x_{j}-x_{k})^{2}=\beta\,(x_{j}-x_{k})^{2}} from Eq. (43), we get

Iabcd=β~dx~dx~(x~x~)22ψa(x)ψb(x)ψc(x)ψd(x),\displaystyle I_{abcd}=\mathcal{E}\tilde{\beta}\,\int\mathrm{d}\tilde{x}\,\mathrm{d}\tilde{x}^{\prime}\,(\tilde{x}-\tilde{x}^{\prime})^{2}\,\mathcal{L}^{2}\psi_{a}(x)\,\psi_{b}(x)\,\psi_{c}(x^{\prime})\,\psi_{d}(x^{\prime})\,, (111)

with x~=x/\tilde{x}=x/\mathcal{L}, for tr(ρmpHint)\mathrm{tr}\left(\rho_{\mathrm{mp}}H_{\mathrm{int}}\right) in Eq. (33). Here, we explicitly exhibit the harmonic oscillator units of energy =ω{\mathcal{E}=\hbar\omega} and length =/(mω){\mathcal{L}=\sqrt{\hbar/(m\omega)}}, with particle mass mm and oscillator frequency ω\omega, i.e., the units of the noninteracting system (α=1{\alpha=1})). Then, we write

1β~Iabcd\displaystyle\frac{1}{\mathcal{E}\tilde{\beta}}I_{abcd} =[Jab(2)δcdJab(1)Jcd(1)]+[ad&bc],\displaystyle=\left[J_{ab}^{(2)}\delta_{cd}-J_{ab}^{(1)}J_{cd}^{(1)}\right]+[a\leftrightarrow d\;\&\;b\leftrightarrow c]\,, (112)

where Jab(ν)=dx~x~νψ~a(x~)ψ~b(x~)J_{ab}^{(\nu)}=\int\mathrm{d}\tilde{x}\,\tilde{x}^{\nu}\,\tilde{\psi}_{a}(\tilde{x})\tilde{\psi}_{b}(\tilde{x}), with ψ~j(x~)=ψj(x)=ψj(x~)\tilde{\psi}_{j}(\tilde{x})=\sqrt{\mathcal{L}}\psi_{j}(x)=\sqrt{\mathcal{L}}\psi_{j}(\mathcal{L}\tilde{x}).

The recurrence relation

Hj+1(x~)=2x~Hj(x~)2jHj1(x~)\displaystyle H_{j+1}(\tilde{x})=2\tilde{x}\,H_{j}(\tilde{x})-2j\,H_{j-1}(\tilde{x}) (113)

for the Hermite polynomials implies the recurrence relation

x~ψ~j(x)=j+12ψ~j+1(x~)+j2ψ~j1(x~)\displaystyle\tilde{x}\,\tilde{\psi}_{j}(x)=\sqrt{\frac{j+1}{2}}\tilde{\psi}_{j+1}(\tilde{x})+\sqrt{\frac{j}{2}}\tilde{\psi}_{j-1}(\tilde{x}) (114)

for the Hermite functions ψ~j(x~)=(2jj!)1/2π1/4exp(x~2/2)Hj(x~)\tilde{\psi}_{j}(\tilde{x})=(2^{j}\,j!)^{-1/2}\pi^{-1/4}\exp\left(-\tilde{x}^{2}/2\right)H_{j}(\tilde{x}). Hence, with M=max(a,b){M=\mbox{max}(a,b)} and m=min(a,b){m=\mbox{min}(a,b)}, we get

Jab(1)={0,both a&b even or odd (‘same parity’)dx~x~ψ~M(x~)ψ~m(x~)=M2δm,M1,a&b different parity\displaystyle J_{ab}^{(1)}=\begin{cases}0&,\;\mbox{both }a\;\&\;b\mbox{ even or odd (`same parity')}\\ \int\mathrm{d}\tilde{x}\,\tilde{x}\,\tilde{\psi}_{M}(\tilde{x})\tilde{\psi}_{m}(\tilde{x})=\sqrt{\frac{M}{2}}\delta_{m,M-1}&,\;a\;\&\;b\mbox{ different parity }\end{cases} (115)

and

Jab(2)={0,a&b different parity1/2,a=b=03/2,a=b=112(2a+1),a=b>112M(M1)δM2,m,M&m same parity and Mm+2,\displaystyle J_{ab}^{(2)}=\begin{cases}0&,\;a\;\&\;b\mbox{ different parity}\\ 1/2&,\;a=b=0\\ 3/2&,\;a=b=1\\ \frac{1}{2}(2a+1)&,\;a=b>1\\ \frac{1}{2}\sqrt{M(M-1)}\,\delta_{M-2,m}&,\;M\;\&\;m\mbox{ same parity and }M\geq m+2\end{cases}, (116)

where the last two cases of (116) follow from

x~2ψ~M(x~)=12((M+1)(M+2)ψ~M+2(x~)+(2M+1)ψ~M(x~)+M(M1)ψ~M2(x~)),\displaystyle\tilde{x}^{2}\tilde{\psi}_{M}(\tilde{x})=\frac{1}{2}\left(\sqrt{(M+1)(M+2)}\tilde{\psi}_{M+2}(\tilde{x})+(2M+1)\tilde{\psi}_{M}(\tilde{x})+\sqrt{M(M-1)}\tilde{\psi}_{M-2}(\tilde{x})\right), (117)

and the three central cases of (116) are summarized by 12(2a+1)\frac{1}{2}(2a+1) for a=b{a=b}.

Since n(x)=n(x)n(-x)=n(x) for the ground-state density n(x)n(x), we write

tr(ρmpHint)|direct part=12dxdxβ(xx)2n(x)n(x)=dx12 2βNx2n(x),\displaystyle\left.\mathrm{tr}\left(\rho_{\mathrm{mp}}H_{\mathrm{int}}\right)\right|_{\mbox{direct part}}=\frac{1}{2}\int\mathrm{d}x\,\mathrm{d}x^{\prime}\,\beta\,(x-x^{\prime})^{2}\,n(x)\,n(x^{\prime})=\int\mathrm{d}x\,\frac{1}{2}\,2\beta N\,x^{2}\,n(x)\,, (118)

such that the total energy (with only the direct part of the interaction) for any α\alpha is

Ed(α)=Ekin+dx12(mω2+2βN)x2n(x).\displaystyle E_{\mathrm{d}}^{(\alpha)}=E_{\mathrm{kin}}+\int\mathrm{d}x\,\frac{1}{2}(m\omega^{2}+2\beta N)\,x^{2}\,n(x)\,. (119)

With β=2β~\beta=\frac{\mathcal{E}}{\mathcal{L}^{2}}\tilde{\beta}, we have

mω2+2βN=ωmω+22β~N=2(1+2β~N)=α=ωαmωα.\displaystyle m\omega^{2}+2\beta N=\hbar\omega\frac{m\omega}{\hbar}+2\frac{\mathcal{E}}{\mathcal{L}^{2}}\tilde{\beta}N=\frac{\mathcal{E}}{\mathcal{L}^{2}}\underset{=\alpha}{\underbrace{(1+2\tilde{\beta}N)}}=\hbar\omega\sqrt{\alpha}\frac{m\omega\sqrt{\alpha}}{\hbar}\,. (120)

Hence,

Ed(α)=Ekin+dx~12αx~2n~(x~),\displaystyle E_{\mathrm{d}}^{(\alpha)}=E_{\mathrm{kin}}+\mathcal{E}\int\mathrm{d}\tilde{x}\,\frac{1}{2}\alpha\,\tilde{x}^{2}\,\tilde{n}(\tilde{x})\,, (121)

where n~(x~)=n(x~)\tilde{n}(\tilde{x})=\mathcal{L}n(\tilde{x}). That is, (121) is the energy of a noninteracting harmonic oscillator with frequency ωα\omega\sqrt{\alpha} (in units of the oscillator with frequency ω\omega) and can be directly utilized in DFT calculations—e.g., for comparison between 1pEx-DFT and DPFT. The virial theorem then implies Ed(α)=αEd(1)E_{\mathrm{d}}^{(\alpha)}=\alpha\,E^{(1)}_{\mathrm{d}}, and the α\alpha-dependent dimensionless eigenfunctions

ψ~j(α)(x~)=(2jj!)1/2π1/4α1/8exp(α2x~2)Hj(α1/4x~)\displaystyle\tilde{\psi}_{j}^{(\alpha)}(\tilde{x})=(2^{j}\,j!)^{-1/2}\pi^{-1/4}\alpha^{1/8}\exp\left(-\frac{\sqrt{\alpha}}{2}\tilde{x}^{2}\right)H_{j}\left(\alpha^{1/4}\tilde{x}\right) (122)

yield (for even NN) the exact dimensionless ground-state density

n~(x~)=2j=0N21|ψ~j(α)(x~)|2\displaystyle\tilde{n}(\tilde{x})=2\sum_{j=0}^{\frac{N}{2}-1}|\tilde{\psi}_{j}^{(\alpha)}(\tilde{x})|^{2} (123)

of the interacting system.

D.3 Coulomb interaction (3D)

D.3.1 Nonrelativistic atoms

We evaluate the interaction energy in Eq. (33) for atoms by calculating the tensor element IabcdI_{abcd} of Eq. (34). First, we consider nonrelativistic atoms as a blueprint for and a comparison with the relativistic case. Therefore, IabcdI_{abcd} takes the nonrelativistic hydrogenic wave functions

ψa(\mathboldr)=ψna,la,ma(r,θ,ϕ)=γaRa(r)Yla,ma(θ,ϕ)\displaystyle\psi_{a}(\mathbold{r})=\psi_{n_{a},l_{a},m_{a}}(r,\theta,\phi)=\gamma_{a}\,R_{a}(r)\,Y_{l_{a},m_{a}}(\theta,\phi) (124)

(and accordingly for the orbital indices b,c,db,c,d), where the orbital index a=1,2,{a=1,2,\dots} combines the three hydrogenic quantum numbers {na=1,2,;la=0,1,,na1;ma=la,la+1,,la}{\{n_{a}=1,2,\dots;l_{a}=0,1,\dots,n_{a}-1;m_{a}=-l_{a},-l_{a}+1,\dots,l_{a}\}} in row-major order (,a=5{na=2;la=1;ma=1},a=6{na=3;la=0;ma=0},{\dots,\,a=5\leftrightarrow\{n_{a}=2;l_{a}=1;m_{a}=1\},\,a=6\leftrightarrow\{n_{a}=3;l_{a}=0;m_{a}=0\},\dots}). We choose the phase factors γa=1\gamma_{a}=1. The radial wave functions are

Ra(r)=(2Zna)3(nala1)!2na(na+la)!(2Zrna)laexp(Zrna)Lnala12la+1(2Zrna),\displaystyle R_{a}(r)=\sqrt{\left(\frac{2Z}{n_{a}}\right)^{3}\frac{(n_{a}-l_{a}-1)!}{2n_{a}\,(n_{a}+l_{a})!}}\,\left(\frac{2Zr}{n_{a}}\right)^{l_{a}}\exp\left(\frac{-Zr}{n_{a}}\right)\,L_{n_{a}-l_{a}-1}^{2l_{a}+1}\left(\frac{2Zr}{n_{a}}\right)\,, (125)

with the generalized Laguerre polynomials

Lnα(x)=l=0n(α+l+1)nl(nl)!l!(x)l,\displaystyle L_{n}^{\alpha}(x)=\sum_{l=0}^{n}\frac{(\alpha+l+1)_{n-l}}{(n-l)!\,l!}(-x)^{l}\,, (126)

where (z)n=z(z+1)(z+n1){(z)_{n}=z\,(z+1)\dots(z+n-1)}.

Then, we proceed along the lines of Ref. [62] with the help of the Laplace expansion

1|\mathboldr\mathboldr|=k=04π2k+1m=kk(1)mr<kr>k+1Yk,m(θ,ϕ)Yk,m(θ,ϕ),\displaystyle\frac{1}{|\mathbold{r}-\mathbold{r}^{\prime}|}=\sum_{k=0}^{\infty}\frac{4\pi}{2k+1}\sum_{m=-k}^{k}(-1)^{m}\frac{r_{<}^{k}}{r_{>}^{k+1}}\,Y_{k,-m}(\theta,\phi)\,Y_{k,m}(\theta^{\prime},\phi^{\prime}), (127)

where {r,θ,ϕ}\{r,\theta,\phi\} ({r,θ,ϕ})\big{(}\{r^{\prime},\theta^{\prime},\phi^{\prime}\}\big{)} are the spherical coordinates of \mathboldr\mathbold{r} (\mathboldr\mathbold{r}^{\prime}), r<=Min{r,r}{r_{<}=\mathop{\mathrm{Min}}{\left\{r,r^{\prime}\right\}}}, r>=Max{r,r}{r_{>}=\mathop{\mathrm{Max}}{\left\{r,r^{\prime}\right\}}}, and Yk,m(θ,ϕ)Y_{k,m}(\theta,\phi) are the spherical harmonics with the phase convention used in [63]. Using the relation Yk,m(θ,ϕ)=(1)mYk,m(θ,ϕ){Y_{k,m}^{*}(\theta,\phi)=(-1)^{m}\,Y_{k,-m}(\theta,\phi)}, we write

Iabcd=γaγbγcγdk=04π2k+1m=kk(1)m+mb+mdRk(ac,bd)S~ma,mb,mla,lb,kS~mc,md,mlc,ld,k\displaystyle I_{abcd}=\gamma_{a}\gamma_{b}^{*}\gamma_{c}\gamma_{d}^{*}\sum_{k=0}^{\infty}\frac{4\pi}{2k+1}\sum_{m=-k}^{k}(-1)^{m+m_{b}+m_{d}}\,R^{k}(ac,bd)\,\tilde{S}^{l_{a},l_{b},k}_{m_{a},-m_{b},-m}\,\tilde{S}^{l_{c},l_{d},k}_{m_{c},-m_{d},m} (128)

with the dimensionless radial integrals

Rk(ab,cd)=ZRHk(ab,cd)=0drdrr2r2r<kr>k+1Ra(r)Rb(r)Rc(r)Rd(r),\displaystyle R^{k}(ab,cd)=Z\,R_{H}^{k}(ab,cd)=\int_{0}^{\infty}\mathrm{d}r\mathrm{d}r^{\prime}\,r^{2}\,r^{\prime 2}\frac{r_{<}^{k}}{r_{>}^{k+1}}\,R_{a}(r)\,R_{b}(r^{\prime})\,R_{c}(r)\,R_{d}(r^{\prime})\,, (129)

where RHk(ab,cd)R_{H}^{k}(ab,cd) is independent of ZZ, and the Gaunt coefficients [63]

S~m1,m2,m3j1,j2,j3=(2j1+1)(2j2+1)(2j3+1)4π(j1j2j3000)(j1j2j3m1m2m3)=:(2j3+1)4πSm1,m2,m3j1,j2,j3\displaystyle\tilde{S}^{j_{1},j_{2},j_{3}}_{m_{1},m_{2},m_{3}}=\sqrt{\frac{(2j_{1}+1)(2j_{2}+1)(2j_{3}+1)}{4\pi}}\,\left(\begin{array}[]{ccc}j_{1}&j_{2}&j_{3}\\ 0&0&0\end{array}\right)\,\left(\begin{array}[]{ccc}j_{1}&j_{2}&j_{3}\\ m_{1}&m_{2}&m_{3}\end{array}\right)=:\sqrt{\frac{(2j_{3}+1)}{4\pi}}S^{j_{1},j_{2},j_{3}}_{m_{1},m_{2},m_{3}} (134)

that include a product of two 3j-symbols and, hence, can be nonzero only if |j1j2|j3j1+j2{|j_{1}-j_{2}|\leq j_{3}\leq j_{1}+j_{2}}, m1+m2+m3=0{m_{1}+m_{2}+m_{3}=0}, and j1+j2+j3{j_{1}+j_{2}+j_{3}} even. That is, Eq. (128) reduces to

Iabcd=k=kminkmaxγabcd(k)RHk(ac,bd)Sma,mb,mbmala,lb,kSmc,md,mdmclc,ld,k\displaystyle I_{abcd}=\sum_{k=k_{\mathrm{min}}}^{k_{\mathrm{max}}}\gamma_{abcd}(k)\,R_{H}^{k}(ac,bd)\,S^{l_{a},l_{b},k}_{m_{a},-m_{b},m_{b}-m_{a}}\,S^{l_{c},l_{d},k}_{m_{c},-m_{d},m_{d}-m_{c}} (135)

with kmin=Max{|lalb|,|lcld|}{k_{\mathrm{min}}=\mathop{\mathrm{Max}}{\left\{|l_{a}-l_{b}|,|l_{c}-l_{d}|\right\}}}, kmax=Min{la+lb,lc+ld}{k_{\mathrm{max}}=\mathop{\mathrm{Min}}{\left\{l_{a}+l_{b},l_{c}+l_{d}\right\}}}, and

γabcd(k)=Zγaγbγcγd(1)ma+mdδmamb,mdmc[(la+lb+k+1) mod 2][(lc+ld+k+1) mod 2].\displaystyle\gamma_{abcd}(k)=\,Z\,\gamma_{a}\gamma_{b}^{*}\gamma_{c}\gamma_{d}^{*}(-1)^{m_{a}+m_{d}}\,\delta_{m_{a}-m_{b},m_{d}-m_{c}}\,[(l_{a}+l_{b}+k+1)\mbox{ mod }2]\,[(l_{c}+l_{d}+k+1)\mbox{ mod }2]\,. (136)

We precompute the radial integrals

RHk(ab,cd)\displaystyle R_{H}^{k}(ab,cd) =GHk(ac,bd)+GHk(bd,ac)\displaystyle=G_{H}^{k}(ac,bd)+G_{H}^{k}(bd,ac) (137)

after expanding the generalized Laguerre polynomials according to Eq. (126), with

GHk(ac,bd)\displaystyle G_{H}^{k}(ac,bd) =1Z0drr2k1Ra(r)Rc(r)0rdrr2+kRb(r)Rd(r)\displaystyle=\frac{1}{Z}\int_{0}^{\infty}\mathrm{d}r\,r^{2-k-1}R_{a}(r)R_{c}(r)\int_{0}^{r}\mathrm{d}r^{\prime}\,r^{\prime 2+k}R_{b}(r^{\prime})R_{d}(r^{\prime})
=τaτbτcτdaa=0nala1σa(aa)ab=0nblb1σb(ab)ac=0nclc1σc(ac)ad=0ndld1σd(ad)\displaystyle=\tau_{a}\tau_{b}\tau_{c}\tau_{d}\sum_{a_{a}=0}^{n_{a}-l_{a}-1}\sigma_{a}(a_{a})\sum_{a_{b}=0}^{n_{b}-l_{b}-1}\sigma_{b}(a_{b})\sum_{a_{c}=0}^{n_{c}-l_{c}-1}\sigma_{c}(a_{c})\sum_{a_{d}=0}^{n_{d}-l_{d}-1}\sigma_{d}(a_{d})
×gk(1k+aa+la+ac+lc,1na+1nc;2+k+ab+lb+ad+ld,1nb+1nd),\displaystyle\quad\times g^{k}(1-k+a_{a}+l_{a}+a_{c}+l_{c},\frac{1}{n_{a}}+\frac{1}{n_{c}};2+k+a_{b}+l_{b}+a_{d}+l_{d},\frac{1}{n_{b}}+\frac{1}{n_{d}})\,, (138)
τa\displaystyle\tau_{a} =(2na)3(nala1)!2na(na+la)!(2na)la,\displaystyle=\sqrt{\left(\frac{2}{n_{a}}\right)^{3}\frac{(n_{a}-l_{a}-1)!}{2n_{a}\,(n_{a}+l_{a})!}}\,\left(\frac{2}{n_{a}}\right)^{l_{a}}\,, (139)
σa(aa)\displaystyle\sigma_{a}(a_{a}) =(2la+2+aa)nala1aa(nala1aa)!aa!(2na)aa,\displaystyle=\frac{(2l_{a}+2+a_{a})_{n_{a}-l_{a}-1-a_{a}}}{(n_{a}-l_{a}-1-a_{a})!\,a_{a}!}\,\left(\frac{-2}{n_{a}}\right)^{a_{a}}\,, (140)
gk(p,α;q,β)\displaystyle g^{k}(p,\alpha;q,\beta) =0dζζpeαζ0ζdζζqeβζ\displaystyle=\int_{0}^{\infty}\mathrm{d}\zeta\,\zeta^{p}\,\mathrm{e}^{-\alpha\zeta}\int_{0}^{\zeta}\mathrm{d}\zeta^{\prime}\,\zeta^{\prime q}\,\mathrm{e}^{-\beta\zeta^{\prime}}
=p!βq+2[βq!αp+1(p+q+1)!βp(p+1)!F12(p+1,p+q+2p+2;αβ)],\displaystyle=\frac{p!}{\beta^{q+2}}\left[\frac{\beta\,q!}{\alpha^{p+1}}-\frac{(p+q+1)!}{\beta^{p}\,(p+1)!}\,{{}_{2}F_{1}}\left(\begin{array}[]{c}p+1,p+q+2\\ p+2\end{array};\,-\frac{\alpha}{\beta}\right)\right]\,, (143)

and the generalized hypergeometric function F12(u,vw;z)=k=0(u)k(v)k(w)kzkk!{{{}_{2}F_{1}}\left(\begin{array}[]{c}u,v\\ w\end{array};\,z\right)=\sum_{k=0}^{\infty}\frac{(u)_{k}\,(v)_{k}}{(w)_{k}}\frac{z^{k}}{k!}}. Equation (143) derives from changing the integration variable rr in Eq. (138) to ζ=Zr{\zeta=Zr}. The numerical values we obtain for Eq. (137) coincide with those reported in Refs. [64, 65, 66].

The core Hamiltonian H1pH_{\mathrm{1p}} of atoms and ions has scattering states, which we disregarded when using only the bound states defined in Eq. (124) for generating the 1pEx binding energies in Sec. 3.3; cf. the discussion around Eq. (3). This omission is responsible for the discrepancies between the energies from 1pEx-DFT and HF in the case of N=2{N=2} electrons, see Fig. 3. Indeed, the accumulated overlap s=1|ϕ|s|2\sum_{s=1}^{\infty}|\left<\right.{\phi}\left.\right|{s}\left.\right>|^{2} between the HF ground-state orbital |ϕ{\left|{\phi}\right\rangle}, taken from Ref. [67], and the hydrogenic s-states |s{\left|{s}\right\rangle} is less than one: approximately 0.994365 for H-, 0.994945 for He, 0.999156 for C4+, and 0.999898 for Ar16+. That is, in these cases a small part of |ϕ{\left|{\phi}\right\rangle} is composed of scattering states.

D.3.2 Relativistic atoms

We reduce the program of the exact two-component quasi-relativistic theory (X2C) to a one-component (viz., spin-free) quasi-relativistic theory (sf-X2C) along the lines of Ref. [33]. In summary, the resulting relativistic hydrogenic states ψk(\mathboldr)=\mathboldr|k=μ=1MCμkgμ(\mathboldr)\psi_{k}(\mathbold{r})=\left<\right.{\mathbold{r}}\left.\right|{k}\left.\right>=\sum_{\mu=1}^{M}C_{\mu k}\,g_{\mu}(\mathbold{r}), here expanded in the nonrelativistic hydrogenic states {gμ}\{g_{\mu}\} as given in Eq. (124), yield the associated relativistic interaction tensor elements

Iabcd=κ,λ,μ,ν=1MCκaCλbCμcCνdIκλμνNR.\displaystyle I_{abcd}=\sum_{\kappa,\lambda,\mu,\nu=1}^{M}C_{\kappa a}\,C_{\lambda b}^{*}\,C_{\mu c}\,C_{\nu d}^{*}\,I^{\mathrm{NR}}_{\kappa\lambda\mu\nu}\,. (144)

Here, IκλμνNRI^{\mathrm{NR}}_{\kappa\lambda\mu\nu} are the nonrelativistic interaction tensor elements given in Eq. (135), and MM controls the quality of this expansion; we choose M=L{M=L} for simplicity. Although other bases such as Slater-orbitals are thinkable, we opt for the nonrelativistic hydrogenic states in order to keep MM small, because the values {IκλμνNR}\left\{I^{\mathrm{NR}}_{\kappa\lambda\mu\nu}\right\} are already available in this case, and for convenient consistency checks with light atoms, where IabcdIabcdNR{I_{abcd}\approx I^{\mathrm{NR}}_{abcd}}.

Defining the M×M{M\times M}-dimensional (unitless) matrices

Sμν\displaystyle S_{\mu\nu} =gμ|gν=δμν,\displaystyle=\left<\right.{g_{\mu}}\left.\right|{g_{\nu}}\left.\right>=\delta_{\mu\nu}\,, (145)
Tμν\displaystyle T_{\mu\nu} =1Hagμ|\mathboldP22me|gν,\displaystyle=\frac{1}{\mbox{Ha}}\left<\right.{g_{\mu}}\left.\right|{\frac{\mathbold{P}^{2}}{2m_{\mathrm{e}}}}\left|\right.{g_{\nu}}\left.\right>\,, (146)
Vμν\displaystyle V_{\mu\nu} =1Hagμ|Vext(\mathboldR)|gν,\displaystyle=\frac{1}{\mbox{Ha}}\left<\right.{g_{\mu}}\left.\right|{V_{\mathrm{ext}}(\mathbold{R})}\left|\right.{g_{\nu}}\left.\right>\,, (147)
and
Uμν\displaystyle U_{\mu\nu} =1meHa2gμ|j=13PjVext(\mathboldR)Pj|gν,\displaystyle=\frac{1}{m_{\mathrm{e}}\,\mbox{Ha}^{2}}\left<\right.{g_{\mu}}\left.\right|{\sum_{j=1}^{3}P_{j}\,V_{\mathrm{ext}}(\mathbold{R})\,P_{j}}\left|\right.{g_{\nu}}\left.\right>\,, (148)

we find the coefficients CμkC_{\mu k} as the matrix elements of C=S~12A{C=\tilde{S}^{\frac{1}{2}}\,A}. Here, S~=S+12meXT1X{\tilde{S}=S+\frac{1}{2m_{\mathrm{e}}}X^{\dagger}T^{-1}X} is a modified metric, and we adopt atomic units of Hartree (Ha), Bohr radius (a0a_{0}), and electron rest mass (mem_{\mathrm{e}}), see also Sec. 3.3. The matrix X=2αmeHaTBA1{X=2\alpha\sqrt{m_{\mathrm{e}}\,\mbox{Ha}}\,TBA^{-1}}, where α\alpha is the fine structure constant, is built from the nonrelativistic kinetic energy matrix TT as given in Eq. (146) and from the matrices AA and BB, whose elements are found by solving the generalized eigenvalue problem that is presented by the one-particle Dirac equation in (block-)matrix form, with energies shifted by the electron rest-mass energy:

(V2T2Tα2U4T)(\mathboldak\mathboldbk)=EkHa(S002α2T)(\mathboldak\mathboldbk), for all k=1,,M.\displaystyle\left(\begin{array}[]{cc}V&2T\\ 2T&\alpha^{2}U-4T\end{array}\right)\left(\begin{array}[]{c}\mathbold{a}_{k}\\ \mathbold{b}_{k}\end{array}\right)=\frac{E_{k}}{\mbox{Ha}}\left(\begin{array}[]{cc}S&0\\ 0&2\alpha^{2}T\end{array}\right)\left(\begin{array}[]{c}\mathbold{a}_{k}\\ \mathbold{b}_{k}\end{array}\right),\mbox{ for all }k=1,\dots,M\,. (157)

Here, the (unitless) vector \mathboldak=(A1k,A2k,,AMk){\mathbold{a}_{k}=(A_{1k},A_{2k},...,A_{Mk})}, i.e., the kkth column of AA, defines the {gμ}\{g_{\mu}\}-expansion of the so-called large component φk(\mathboldr)=μ=1MAμkgμ(\mathboldr)\varphi_{k}(\mathbold{r})=\sum_{\mu=1}^{M}A_{\mu k}\,g_{\mu}(\mathbold{r}) of the Dirac spinor that solves the Dirac equation for eigenenergy Ek/HaE_{k}/\mbox{Ha}. While φk\varphi_{k} is a two-component spinor in full four-component Dirac theory, it is a scalar quantity in the sf-X2C employed here. The MM electronic eigenstates among the 2M2M eigenstates of Eq. (157) are associated with energies Ek>mec2{E_{k}>-m_{\mathrm{e}}c^{2}} [33], where cc is the speed of light, i.e., Ek/Ha>1/α2{E_{k}/\mbox{Ha}>-1/\alpha^{2}}. The key element of X2C is the matrix XX: it transforms between the small component χk\chi_{k} of the Dirac spinor and φk\varphi_{k}, such that the contributions of χk\chi_{k} to the electronic eigenstate, as encoded in the (unitless) vector \mathboldbk=(B1k,B2k,,BMk){\mathbold{b}_{k}=(B_{1k},B_{2k},...,B_{Mk})}, can be merged with those of φk\varphi_{k} with the help of S~\tilde{S}. Upon normalizing each of the so-obtained vectors (C1k,C2k,,CMk){(C_{1k},C_{2k},...,C_{Mk})}, we obtain the Schrödinger-like orthonormal 1pEx-basis {ψk}\{\psi_{k}\} and the corresponding interaction tensor elements in Eq. (144).

Since spin-dependent contributions are not included in the matrix UU of Eq. (148), the sf-X2C employed here does not recover the (exactly known) hydrogenic eigenenergies of the one-particle Dirac equation [47]. Typically, however, the deviations are small: for the example of nuclear charge Z=20{Z=20} the lowest energy levels are 201.077Ha-201.077\,\mbox{Ha} (from four-component Dirac theory), 201.055Ha-201.055\,\mbox{Ha} (from sf-X2C), and 200Ha-200\,\mbox{Ha} (from nonrelativistic quantum mechanics). Hence, in this case, sf-X2C produces about 98%98\% of the relativistic corrections of the exact Dirac theory.

atomic system ZZ NN 1pEx-DFT HF 1pEx-DFT (sf-X2C) HF (sf-X2C) statistical atom reference value
H- 1 2 0.48187 0.48793 0.48188 0.44883 0.527731(3)
He 2 2 2.83474 2.85583 2.83489 2.85583 2.73111 2.90339
C4+ 6 2 32.3146 32.3565 32.3305 32.3735 32.416
Ar16+ 18 2 312.809 312.828 314.166 314.143 314.092
Be 4 4 14.5096 14.5617 14.5129 14.5748 14.2453 14.6684
CL=7 6 6 37.3007 37.5646 37.3198 37.6109 37.6356 37.8558
CL=15 6 6 37.5097 37.5646 37.5268 37.6109 37.6356 37.8558
CL=31 6 6 37.5197 37.5646 37.5368 37.6109 37.6356 37.8558
Xe48+ 54 6 4197.74 4199.22 4348.14 4379.7±0.4{\pm 0.4}
O 8 8 74.1110 74.5883 74.1259 74.7173 75.0362 75.1098
Ne 10 10 126.064 128.353 126.140 128.625 128.149 129.053
Xe44+ 54 10 5361.70 5373.85 5538.51 5558.4±1.0{\pm 1.0}
Table 4: The binding energies (in Hartree) as displayed in Fig. 3 relative to the (nonrelativistic) HF binding energies. The nonrelativistic HF binding energy for H- is reported in Ref. [67]. We used the quantum chemistry package PSI4 [68, 69] to compute all other HF energies displayed in Fig. 3, with default PSI4 settings, except: unrestricted HF and ‘dgauss-dzvp-autoabs-decon’ basis set for nonrelativistic HF; ‘cc-pvdz-dk’ basis set for relativistic HF; and we omitted the relativistic HF simulations of the xenon ions for lack of a built-in basis set. The reference value for H- is the experimental binding energy given in Ref. [50]. All other reference values are taken from Ref. [45]; where uncertainties are omitted, they are negligible for the number of digits shown.

References

  • [1] A. D. Becke, Perspective: Fifty years of density-functional theory in chemical physics, J. Chem. Phys. 140, 18A301 (2014).
  • [2] P. J. Hasnip, K. Refson, M. I. J. Probert, J. R. Yates, S. J. Clark, and C. J. Pickard, Density functional theory in the solid state, Phil. Trans. R. Soc. A 372, 20130270 (2014).
  • [3] P. Okun and K. Burke, Semiclassics: The hidden theory behind the success of DFT, arXiv:2106.07839, pp. 179–249 in: Density Functionals for Many-Particle Systems: Mathematical Theory and Physical Applications of Effective Equations; B.-G. Englert, H. Siedentop, and M.-I. Trappe (eds.); Lecture Notes Series, IMS, World Scientific, Singapore (2023).
  • [4] G. A. Henderson, Variational theorems for the single-particle probability density and density matrix in momentum space, Phys. Rev. A 23, 19 (1981).
  • [5] M. Cinal and B.-G. Englert, Energy functionals in momentum space: Exchange energy, quantum corrections, and the Kohn-Sham scheme, Phys. Rev. A 48, 1893 (1993).
  • [6] Y. Sakurai, Y. Tanaka, A. Bansil, S. Kaprzyk, A. T. Stewart, Y. Nagashima, T. Hyodo, S. Nanao, H. Kawata, and N. Shiotani, High-Resolution Compton Scattering Study of Li: Asphericity of the Fermi Surface and Electron Correlation Effects, Phys. Rev. Lett. 74, 2252 (1995).
  • [7] K. Hueck, N. Luick, L. Sobirey, J. Siegl, T. Lompe, and H. Moritz, Two-Dimensional Homogeneous Fermi Gases, Phys. Rev. Lett. 120, 060402 (2018).
  • [8] M. Levy, Universal variational functionals of electron densities, first-order density matrices, and natural spin-orbitals and solution of the v-representability problem, Proc. Natl. Acad. Sci. 76, 6062 (1979).
  • [9] M. Levy and A. Görling, Correlation-energy density-functional formulas from correlating first-order density matrices, Phys. Rev. A 52, R1808 (1995).
  • [10] A. Savin, Expression of the exact electron-correlation-energy density functional in terms of first-order density matrices, Phys. Rev. A 52, R1805 (1995).
  • [11] S. Goedecker and C. J. Umrigar, Natural Orbital Functional for the Many-Electron Problem, Phys. Rev. Lett. 81, 866 (1998).
  • [12] J. Ciosłowski (ed.), Many-Electron Densities and Reduced Density Matrices, Springer, New York (2000).
  • [13] K. J. H. Giesbertz and M. Ruggenthaler, One-body reduced density-matrix functional theory in finite basis sets at elevated temperatures, Phys. Rep. 806, 1 (2019).
  • [14] B.-G. Englert and J. Schwinger, Thomas–Fermi revisited: The outer regions of the atom, Phys. Rev. A 26, 2322 (1982).
  • [15] M.-I. Trappe, Y. L. Len, H. K. Ng, C. A. Müller, and B.-G. Englert, Leading gradient correction to the kinetic energy for two-dimensional fermion gases, Phys. Rev. A 93, 042510 (2016).
  • [16] M. I. Trappe, Y. L. Len, H. K. Ng, and B. G. Englert, Airy-averaged gradient corrections for two-dimensional fermion gases, Ann. Phys. (N. Y.) 385, 136 (2017).
  • [17] T. T. Chau, J. H. Hue, M.-I. Trappe, and B.-G. Englert, Systematic corrections to the Thomas–Fermi approximation without a gradient expansion, New J. Phys. 20, 073003 (2018).
  • [18] B.-G. Englert, Julian Schwinger and the Semiclassical Atom, arXiv:1907.04751, Chapter 17, pp. 261-269 in: Proceedings of the Julian Schwinger Centennial Conference; B.-G. Englert (ed.); World Scientific (2019).
  • [19] E. H. Lieb, Density functionals for coulomb systems, Int. J. Quantum Chem. 24, 243 (1983).
  • [20] B.-G. Englert, J. H. Hue, Z. C. Huang, M. M. Paraniak, and M.-I. Trappe, Energy functionals of single-particle densities: A unified view, arXiv:2206.10097, pp. 287–308 in: Density Functionals for Many-Particle Systems: Mathematical Theory and Physical Applications of Effective Equations; B.-G. Englert, H. Siedentop, and M.-I. Trappe (eds.); Lecture Notes Series, IMS, World Scientific, Singapore (2023).
  • [21] J. Cioslowski, B.-G. Englert, M.-I. Trappe, and J. H. Hue, Contactium: A strongly correlated model system, J. Chem. Phys. 158, 184110 (2023).
  • [22] M.-I. Trappe, P. T. Grochowski, J. H. Hue, T. Karpiuk, and K. Rząz˙\dot{\mbox{z}}ewski, Phase Transitions of Repulsive Two-Component Fermi Gases in Two Dimensions, New J. Phys. 23, 103042 (2021).
  • [23] P. A. M. Dirac, Note on the exchange phenomena in the Thomas Atom, Math. Proc. Camb. Philos. Soc. 26, 376 (1930).
  • [24] D. R. Hartree, The Wave Mechanics of an Atom with a Non-Coulomb Central Field. Part I. Theory and Methods, Math. Proc. Camb. Philos. Soc. 24, 89 (1928).
  • [25] J. C. Slater, Note on Hartree’s Method, Phys. Rev. 35, 210 (1930).
  • [26] V. Fock, Näherungsmethode zur Lösung des quantenmechanischen Mehrkörperproblems, Z. Phys. 61, 126 (1930).
  • [27] J. Kennedy and R. Eberhart, Particle swarm optimization, pp. 1942–1948 in: Proceedings of ICNN’95 - International Conference on Neural Networks; IEEE (1995).
  • [28] R. Bonyadi, M and Z. Michalewicz, Particle Swarm Optimization for Single Objective Continuous Space Problems: A Review, Evol. Comput. 25, 1 (2017).
  • [29] J. Tang, G. Liu, and Q. Pan, A review on representative swarm intelligence algorithms for solving optimization problems: Applications and trends, IEEE/CAA J. Autom. Sin. 8, 1627 (2021).
  • [30] A. Slowik and H. Kwasnicka, Evolutionary algorithms and their applications to engineering problems, Neural. Comput. Appl. 32, 12363 (2020).
  • [31] M. M. Paraniak, (A) Reduced Density Matrix Generation Algorithms in Single-Particle-Exact Density Functional Theory, (B) Quantum Dynamical Simulation of a Transversal Stern-Gerlach Interferometer, (C) Open Quantum System Process Tomography, Ph.D. thesis, National University of Singapore, Singapore (2022).
  • [32] S. Xavier-de Souza, J. A. K. Suykens, J. Vandewalle, and D. Bolle, Coupled Simulated Annealing, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 40, 320 (2010).
  • [33] W. Kutzelnigg and W. Liu, Quasirelativistic theory equivalent to fully relativistic theory, J. Chem. Phys. 123, 241102 (2005).
  • [34] W. Liu and D. Peng, Exact two-component Hamiltonians revisited, J. Chem. Phys. 131, 031104 (2009).
  • [35] L. Cheng and J. Gauss, Analytic energy gradients for the spin-free exact two-component theory using an exact block diagonalization for the one-electron Dirac Hamiltonian, J. Chem. Phys. 135, 084114 (2011).
  • [36] L. A. Cunha, D. Hait, R. Kang, Y. Mao, and M. Head-Gordon, Relativistic Orbital-Optimized Density Functional Theory for Accurate Core-Level Spectroscopy, J. Phys. Chem. Lett. 13, 3438 (2022).
  • [37] M.-I. Trappe, D. Y. H. Ho, and S. Adam, First-principles quantum corrections for carrier correlations in double-layer two-dimensional heterostructures, Phys. Rev. B 99, 235415 (2019).
  • [38] M.-I. Trappe, J. H. Hue, and B.-G. Englert, Density-potential functional theory for fermions in one dimension, arXiv:2106.07839, pp. 251–267 in: Density Functionals for Many-Particle Systems: Mathematical Theory and Physical Applications of Effective Equations; B.-G. Englert, H. Siedentop, and M.-I. Trappe (eds.); Lecture Notes Series, IMS, World Scientific, Singapore (2023).
  • [39] M.-I. Trappe, C. Witt, and S. Manzhos, Atoms, dimers, and nanoparticles from orbital-free density-potential functional theory (in preparation).
  • [40] M.-I. Trappe and R. A. Chisholm, A density functional theory for ecology across scales, Nat. Commun. 14, 1089 (2023).
  • [41] V. R. Saunders and I. H. Hillier, A "Level-Shifting" method for converging closed shell Hartree-Fock wave functions, Int. J. Quantum Chem. 7, 699 (1973).
  • [42] T. Busch, B.-G. Englert, K. Rzążewski, and M. Wilkens, Two Cold Atoms in a Harmonic Trap, Found. Phys. 28, 549 (1998).
  • [43] J. Viana-Gomes and N. M. R. Peres, Solution of the quantum harmonic oscillator plus a delta-function potential at the origin: The oddness of its even-parity solutions, Eur. J. Phys. 32, 1377 (2011).
  • [44] C. L. Benavides-Riveros, I. V. Toranzo, and J. S. Dehesa, Entanglement in N-harmonium: bosons and fermions, J. Phys. B: At. Mol. Opt. Phys. 47, 195503 (2014).
  • [45] A. Kramida, Y. Ralchenko, J. Reader, and NIST ASD Team, NIST Atomic Spectra Database (ver. 5.9), [Online], available: https://physics.nist.gov/asd, National Institute of Standards and Technology, Gaithersburg, MD (2021).
  • [46] C. F. Fischer, M. Godefroid, T. Brage, P. Jönsson, and G. Gaigalas, Advanced multiconfiguration methods for complex atoms: I. Energies and wave functions, J. Phys. B: At. Mol. Opt. Phys. 49, 182004 (2016).
  • [47] H. Bethe and E. Salpeter, Quantum Mechanics of One- and Two-Electron Systems, Springer, Berlin, Heidelberg (1957).
  • [48] J. Scott, LXXXII. The binding energy of the Thomas-Fermi Atom, Lond. Edinb. Dublin philos. mag. j. sci. 43, 859 (1952).
  • [49] B.-G. Englert, Lecture Notes in Physics: Semiclassical Theory of Atoms, Springer, Berlin, Heidelberg (1988).
  • [50] K. R. Lykke, K. K. Murray, and W. C. Lineberger, Threshold photodetachment of HH^{-}, Phys. Rev. A 43, 6104 (1991).
  • [51] D. P. Kingma and J. Ba, Adam: A Method for Stochastic Optimization, arXiv:1412.6980v9 [cs.LG] (2017).
  • [52] J. Lehman and K. O. Stanley, Exploiting Open-Endedness to Solve Problems Through the Search for Novelty, in: Proceedings of the Eleventh International Conference on Artificial Life (ALIFE XI), MIT Press, Cambridge, MA (2008).
  • [53] A. LaTorre, S. Muelas, and J.-M. Pen~\tilde{\mbox{n}}a, A comprehensive comparison of large scale global optimizers, Inf. Sci. 316 (2015).
  • [54] E. Glorieux, B. Svensson, F. Danielsson, and B. Lennartson, Improved Constructive Cooperative Coevolutionary Differential Evolution for Large-Scale Optimisation, pp. 1703–1710 in: IEEE Symposium Series on Computational Intelligence (2015).
  • [55] Y. Sun, M. Kirley, and S. K. Halgamuge, A Recursive Decomposition Method for Large Scale Continuous Optimization, IEEE Trans. Evol. Comput. 22, 647 (2018).
  • [56] J. Cioslowski, Z. E. Mihalka, and A. Szabados, Bilinear constraints upon the correlation contribution to the electron–electron repulsion energy as a functional of the one-electron reduced density matrix, J. Chem. Theory Comput. 15, 4862 (2019).
  • [57] Q. Liu, Order-2 Stability Analysis of Particle Swarm Optimization, Evol. Comput. 23, 187 (2014).
  • [58] M. S. Nobile., P. Cazzaniga, D. Besozzi, R. Colombo, G. Mauri, and G. Pasi, Fuzzy Self-Tuning PSO: A settings-free algorithm for global optimization, Swarm Evol. Comput. 39, 70 (2018).
  • [59] M. Hutter and S. Legg, Fitness uniform optimization, IEEE Trans. Evol. Comput. 10, 568 (2006).
  • [60] T. Park and K. R. Ryu, A dual-population genetic algorithm for adaptive diversity control, IEEE Trans. Evol. Comput. 14, 865 (2010).
  • [61] M. Na and F. Marsuglio, Two and three particles interacting in a one-dimensional trap, Am. J. Phys. 85, 769 (2017).
  • [62] E. U. Condon and G. H. Shortley, The Theory of Atomic Spectra, Cambridge University Press, New York (1935).
  • [63] F. W. J. Olver, D. W. Lozier, R. F. Boisvert, and C. W. Clark, NIST Handbook of Mathematical Functions, Cambridge University Press, New York (2010).
  • [64] B. D. Joshi, Generalized Form of Atomic Two Electron Integrals over Hydrogenic Functions, J. Comp. Phys. 8, 300 (1971).
  • [65] P. H. Butler, P. E. H. Minchin, and B. G. Wybourne, Tables of Hydrogenic Slater Radial Integrals, At. Data Nucl. Data Tables 3, 153 (1971).
  • [66] P. H. Butler, P. E. H. Minchin, and B. G. Wybourne, Tables of Hydrogenic Slater Radial Integrals, At. Data Nucl. Data Tables 27, 145 (1982), erratum: At. Data Nucl. Data Tables 3, 153 (1971).
  • [67] A. W. King, A. L. Baskerville, and H. Cox, Hartree–Fock implementation using a Laguerre-based wave function for the ground state and correlation energies of two-electron atoms, Philos. Trans. R. Soc. A 376, 20170153 (2018).
  • [68] P. Verma, W. D. Derricotte, and F. A. Evangelista, Predicting Near Edge X-ray Absorption Spectra with the Spin-Free Exact-Two-Component Hamiltonian and Orthogonality Constrained Density Functional Theory, J. Chem. Theory Comput. 12, 144 (2016).
  • [69] D. G. A. Smith et al., PSI4 1.4: Open-source software for high-throughput quantum chemistry, J. Chem. Phys. 152, 184108 (2020).