
Measuring energy by measuring any other observable

Dominik Šafránek [email protected] Center for Theoretical Physics of Complex Systems, Institute for Basic Science (IBS), Daejeon - 34126, Korea    Dario Rosa [email protected] Center for Theoretical Physics of Complex Systems, Institute for Basic Science (IBS), Daejeon - 34126, Korea Basic Science Program, Korea University of Science and Technology (UST), Daejeon - 34113, Korea
Abstract

We present a method to estimate the probabilities of outcomes of a quantum observable, its mean value, and higher moments by measuring any other observable. This method is general and can be applied to any quantum system. In the case of estimating the mean energy of an isolated system, the estimate can be further improved by measuring the other observable at different times. Intuitively, this method uses interplay and correlations between the measured observable, the estimated observable, and the state of the system. We provide two bounds: one that is looser but analytically computable and one that is tighter but requires solving a non-convex optimization problem. The method can be used to estimate expectation values and related quantities such as temperature and work in setups where performing measurements in a highly entangled basis is difficult, finding use in state-of-the-art quantum simulators. As a demonstration, we show that in Heisenberg and Ising models of ten sites in the localized phase, performing two-qubit measurements excludes 97.5% and 96.7% of the possible range of energies, respectively, when estimating the ground state energy.

I Introduction

Expectation values are ubiquitous in quantum physics, characterizing different types of behaviors of quantum systems. They are used both as descriptive and predictive tools. To name several: mean values of generic local observables classify many-body systems according to how well they thermalize D’Alessio et al. (2016); Deutsch (2018). Vanishing total magnetization identifies a quantum phase transition Vojta (2003); Sun et al. (2014); Tian et al. (2020). The mean value of homodyne measurement Yuen and Chan (1983); Tyc and Sanders (2004); Shaked et al. (2018); Raffaelli et al. (2018) is evaluated in magnetic resonance imaging Noll et al. (1991) and quantum cryptography protocols Voss (2009), while its variance is used to prove squeezing Davidovich (1996); Takeno et al. (2007) — an essential resource for quantum sensors Lawrie et al. (2019). Variances also appear in Heisenberg’s uncertainty principle Heisenberg (1985); Robertson (1929); Busch et al. (2007). Expectation values are the object of interest in quantum field theory Wightman (1956); Srednicki (2007) and in nuclear physics Bunge et al. (1993); Ikot et al. (2019).

Moments of energy are somewhat special due to their wide range of applications. The mean energy determines the thermodynamic entropy of the system Deutsch (2010); Swendsen (2015); Santos et al. (2011); Šafránek et al. (2021) and its temperature Hovhannisyan and Correa (2018); Mukherjee et al. (2019); Cenni et al. (2021). Its change may represent heat and work Engel and Nolte (2007); Alipour et al. (2016); Modak and Rigol (2017); Goold et al. (2018); De Chiara et al. (2018); Varizi et al. (2020) and its difference defines a measure of extractable work called ergotropy Allahverdyan et al. (2004); Alicki and Fannes (2013); Šafránek et al. (2022). Variance in energy determines the precision in estimating both the time Paris (2008) and temperature Correa et al. (2015) parameters. Both moments, when combined, provide a tight bound on the characteristic time scale of a quantum system Mandelstam and Tamm (1991); Margolus and Levitin (1998); Deffner and Campbell (2017).

Given the breadth of applications, it is clear that measuring and estimating expectation values is of essential importance. However, this may be challenging. For example, in quantum many-body systems, the mean energy is considerably difficult to measure, with only a few architecture-specific proposals Villa and De Chiara (2017) and experiments Jiménez et al. (2021) known. This is because energy eigenstates are typically highly entangled. In quantum simulators, measuring in an entangled basis is performed by combining several elementary gates. Each gate has a fixed fidelity, and when many such gates are combined, the overall fidelity diminishes, making such measurements unreliable Nielsen and Chuang (2002); Reich et al. (2013); Harper and Flammia (2017); Huang et al. (2019). Additionally, experimental setups may allow measurement only of an observable close to, but not exactly, the one we are interested in. This is the case, for example, in the aforementioned homodyne detection with a finite, instead of infinite, oscillator field strength Tyc and Sanders (2004); Combes and Lund (2022).

In this paper, we show that performing any measurement bounds the probabilities of outcomes, the mean value, and higher moments of any other observable. This means that, quite unintuitively, measurements carry more information than previously known: any observable yields some information on any other observable. The method uses correlations between the measured observable, the estimated observable, and the state of the system. It is precisely this interplay that allows us to bound the probabilities of outcomes of the estimated observable and, from those, its mean value and higher moments.

These results immediately ameliorate the issue mentioned above: even in experimental systems in which we have only a limited ability to measure, we can perform the best possible measurement, and this is enough to estimate the probability distribution of outcomes and the mean value of an observable that we are truly interested in measuring.

The derived bounds are further tightened by measuring in different bases and, in the case of estimating the mean energy, by measuring at different times. After some preliminaries, we show how measurement in any basis bounds the probabilities of the system having a certain energy. From these, we derive two bounds on the mean value of energy: an analytic one that is easy to compute, and a tighter one that requires solving an optimization problem. We discuss situations in which the analytic bound becomes relatively tight. Then we describe a few differences that arise when bounding the mean values of observables other than energy. We illustrate this method on several experimentally relevant models. Finally, we discuss the advantages and drawbacks of this method, possible applications, and future directions.

Figure 1: Bounds on energy probabilities and the mean energy. Each energy probability $p_{E_{l}}$ (purple lines) is bounded by $a_{l}^{\max}$ from below (blue bar) and by $b_{l}^{\min}$ from above (red bar), see Eq. (3). The analytical bound on the mean energy, $E_{\min}^{\mathrm{lin}}\leq\langle E\rangle\leq E_{\max}^{\mathrm{lin}}$, Eq. (8), is computed as follows: imagine a bottle of probabilities with volume one. To obtain the lower bound on the mean energy (left figure), we pour the probabilities from the bottle onto the graph and fill the minimal probability of each energy given by $a_{l}^{\max}$ (blue solid bar). Some probability will be left in the bottle, so we pour the rest and top the red bars up to their maximum $b_{l}^{\min}$ (red striped bar), starting from the lowest energy and going to the highest, until the bottle is empty, i.e., until all the probabilities on the graph sum to one. Taking the mean value of such a distribution yields the lower bound, see Eq. (11). The upper bound (right figure) is obtained by the same method but going from the highest to the lowest energy, see Eq. (13). The mean energy $\langle E\rangle$ lies somewhere in the shaded region.

II Results

Setup. Consider any quantum system and a measurement given by the measurement basis ${\mathcal{C}}=\{\ket{i}\}$. Label $i$ is the outcome of the measurement, and the probability of obtaining this outcome at time $t$ is $p_{i}=\absolutevalue{\innerproduct{i}{\psi_{t}}}^{2}$, where $\ket{\psi_{t}}$ is the state of the system at time $t$. If we create many realizations of the same experiment by repeating the prepare-and-measure sequence, we can build the statistics of outcomes and determine the probability distribution $\{p_{i}\}$. Thus, these probabilities are experimentally accessible.

Next, we consider the Hamiltonian of the system, with spectral decomposition ${\hat{H}}=\sum_{E}E\outerproduct{E}{E}$ in terms of its eigenvalues and eigenvectors. The probability of finding the system with energy $E$ is given by $p_{E}=\absolutevalue{\innerproduct{E}{\psi_{t}}}^{2}$. We assume that we know the Hamiltonian and its spectral decomposition but that we are unable to measure it experimentally. In other words, we cannot perform the measurement in the energy eigenbasis. As we will show, this does not stop us from estimating its probabilities of outcomes, and from those also the mean value of energy. Proofs of the following bounds can be found in Appendix A.

Bounds on energy probabilities. The key result of this paper is that the probability of the state having energy $E$ lies between two bounds,

aEpEbE,a_{E}\ \leq\ p_{E}\ \leq\ b_{E}, (1)

where we defined

\begin{split}a_{E}&=\max\Big\{2\sum_{i}c_{iE}^{2}-b_{E},\ 0\Big\},\\ b_{E}&=\big(\sum_{i}{c_{iE}}\big)^{2},\end{split} (2)

and $c_{iE}=\sqrt{p_{i}}\absolutevalue{\innerproduct{i}{E}}$, see Fig. 1. The last element contains both the probability of an outcome, $p_{i}$, and the correlation between the measured and the estimated observable, given by the overlap $\absolutevalue{\innerproduct{i}{E}}$. Thus, the above inequality connects the probability of the estimated observable to the probability of the measured observable through the correlations between their eigenbases. From the Cauchy-Schwarz inequality, we can easily derive that $b_{E}\leq 1$. Thus, the upper bound on the energy probability is always non-trivial.
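The bounds of Eqs. (1) and (2) are straightforward to evaluate numerically. Below is a minimal sketch (our illustration, not code from the paper) for a hypothetical four-level system with randomly drawn eigenbases; all variable names and the use of `numpy` are our own assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
N = 4
# Toy four-level system: random unitary eigenbases for the Hamiltonian
# (columns of U are |E>) and the measurement (columns of M are |i>),
# together with a random pure state.
U = np.linalg.qr(rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N)))[0]
M = np.linalg.qr(rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N)))[0]
psi = rng.normal(size=N) + 1j * rng.normal(size=N)
psi /= np.linalg.norm(psi)

p_i = np.abs(M.conj().T @ psi) ** 2                # measured outcome probabilities
overlap = np.abs(M.conj().T @ U)                   # |<i|E>|, shape (outcomes, energies)

c = np.sqrt(p_i)[:, None] * overlap                # c_{iE} = sqrt(p_i) |<i|E>|
b = c.sum(axis=0) ** 2                             # b_E of Eq. (2)
a = np.maximum(2 * (c ** 2).sum(axis=0) - b, 0.0)  # a_E of Eq. (2)

p_E = np.abs(U.conj().T @ psi) ** 2                # true (unmeasured) energy probabilities
assert np.all(a <= p_E + 1e-9) and np.all(p_E <= b + 1e-9)  # Eq. (1) holds
```

The final assertion checks Eq. (1) directly against the exact energy probabilities, which are available here only because this is a simulation.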

If the system is isolated, it evolves unitarily with the time-independent Hamiltonian ${\hat{H}}$, and the energy probabilities $p_{E}$ are also time-independent. In contrast, the probabilities $p_{i}$ are time-dependent if the measurement basis does not commute with the Hamiltonian, and so are the bounds $a_{E}$ and $b_{E}$. This leads to an interesting observation: measuring at different times can make the bound tighter. Quantitatively, we have

a_{E}^{\max}\leq p_{E}\leq b_{E}^{\min}, (3)

where $a_{E}^{\max}=\max_{t\in[0,T]}a_{E}$ and $b_{E}^{\min}=\min_{t\in[0,T]}b_{E}$.
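As a sketch of the time optimization in Eq. (3), the following hypothetical example (our own toy model, not the paper's code) evolves a random initial state under a random Hamiltonian, accumulates $a_E^{\max}$ and $b_E^{\min}$ over a grid of times, and checks that the tightened bounds still sandwich the time-independent $p_E$:

```python
import numpy as np

rng = np.random.default_rng(1)
N = 4
# Hypothetical model: random Hamiltonian eigenbasis U (columns |E>) with
# spectrum E_vals, measurement basis M (columns |i>), random initial state.
U = np.linalg.qr(rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N)))[0]
E_vals = np.sort(rng.normal(size=N))
M = np.linalg.qr(rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N)))[0]
psi0 = rng.normal(size=N) + 1j * rng.normal(size=N)
psi0 /= np.linalg.norm(psi0)

overlap = np.abs(M.conj().T @ U)                   # |<i|E>|
p_E = np.abs(U.conj().T @ psi0) ** 2               # time-independent energy probabilities

a_max, b_min = np.zeros(N), np.ones(N)             # b_E <= 1 always holds
for t in np.linspace(0.0, 20.0, 200):
    # |psi_t> = e^{-iHt}|psi_0>, computed in the energy eigenbasis
    psi_t = U @ (np.exp(-1j * E_vals * t) * (U.conj().T @ psi0))
    c = np.abs(M.conj().T @ psi_t)[:, None] * overlap      # c_{iE}(t)
    b_t = c.sum(axis=0) ** 2
    a_t = np.maximum(2 * (c ** 2).sum(axis=0) - b_t, 0.0)
    a_max = np.maximum(a_max, a_t)                 # a_E^max = max_t a_E(t)
    b_min = np.minimum(b_min, b_t)                 # b_E^min = min_t b_E(t)

# Eq. (3): the time-optimized bounds still sandwich the true p_E.
assert np.all(a_max <= p_E + 1e-9) and np.all(p_E <= b_min + 1e-9)
```

The time grid and total time are arbitrary choices for the demonstration; in practice they would follow the heuristics discussed later in the text.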

Let us discuss situations in which the bound becomes tight. First, assume that one of the measurement basis vectors is an energy eigenvector, i.e., $\ket{i}=\ket{E}$ for some $i$ and $E$. Then the bound gives $p_{E}=p_{i}$ for this specific $E$, as intuitively expected. Second, consider a situation in which we always obtain a single outcome when measuring at a specific time, i.e., $p_{i}=1$ for some $i$ at some time $t$. This is akin to identifying the state of the system as being equal to $\ket{i}$. As a result, the bound gives an identity $p_{E}=\absolutevalue{\innerproduct{i}{E}}^{2}$ for all energies $E$, so the entire energy distribution is determined exactly.

The two extreme cases just discussed suggest two possible scenarios in which the bounds perform well. The bounds are relatively tight when either the measurement basis resembles the eigenbasis of the estimated observable (in this case, the Hamiltonian), or when the state of the system comes close to one of the measurement basis vectors during its time evolution.

In addition to optimization over time, the inequalities can be further tightened by performing measurements in different bases. Defining a set of performed measurements, $\mathcal{M}=\{{\mathcal{C}}_{m}\}$, each measurement bounds $p_{E}$ independently, so we can take

a_{E}^{\max}=\max_{{\mathcal{C}}_{m}\in\mathcal{M},\,t\in[0,T]}a_{E}^{m},\quad b_{E}^{\min}=\min_{{\mathcal{C}}_{m}\in\mathcal{M},\,t\in[0,T]}b_{E}^{m}, (4)

for the bound (3). Here, $a_{E}^{m}(t)$ and $b_{E}^{m}(t)$ are defined by Eqs. (2) for each measurement ${\mathcal{C}}_{m}$. This may be helpful when there are limits on the types of measurements we can perform. For example, we may be experimentally limited to using only one- and two-qubit gates because many-qubit gates have low fidelity.

Bounds on collections of energy probabilities. Additionally, we derive the following collective bounds on the energy probabilities,

{\sqrt{\boldsymbol{p}}}^{T}\boldsymbol{A}_{i}{\sqrt{\boldsymbol{p}}}\ \leq\ p_{i}\ \leq\ {\sqrt{\boldsymbol{p}}}^{T}\boldsymbol{B}_{i}{\sqrt{\boldsymbol{p}}}. (5)

The left- and right-hand sides are time-independent quadratic forms, defined by their elements as

\begin{split}(\boldsymbol{A}_{i})_{EE^{\prime}}&=(-1)^{1+\delta_{EE^{\prime}}}\absolutevalue{\innerproduct{E}{i}\innerproduct{i}{E^{\prime}}},\\ (\boldsymbol{B}_{i})_{EE^{\prime}}&=\absolutevalue{\innerproduct{E}{i}\innerproduct{i}{E^{\prime}}}.\end{split} (6)

They act on the vector of square roots of the energy probabilities, ${\sqrt{\boldsymbol{p}}}_{E}=\sqrt{p_{E}}$, which are the quantities we would like to estimate.
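The structure of Eqs. (5) and (6) can be made concrete with a short numerical sketch (a hypothetical random four-level system of our own, assuming `numpy`): build $\boldsymbol{A}_i$ and $\boldsymbol{B}_i$ from the overlaps and verify that the quadratic forms in $\sqrt{p_E}$ bracket each outcome probability $p_i$.

```python
import numpy as np

rng = np.random.default_rng(2)
N = 4
U = np.linalg.qr(rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N)))[0]  # columns |E>
M = np.linalg.qr(rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N)))[0]  # columns |i>
psi = rng.normal(size=N) + 1j * rng.normal(size=N)
psi /= np.linalg.norm(psi)

p_i = np.abs(M.conj().T @ psi) ** 2                # outcome probabilities
sqrt_pE = np.abs(U.conj().T @ psi)                 # vector of sqrt(p_E)

V = np.abs(M.conj().T @ U)                         # row i holds |<i|E>| over energies E
for i in range(N):
    B_i = np.outer(V[i], V[i])                     # (B_i)_{EE'} = |<E|i><i|E'>|
    A_i = (2 * np.eye(N) - 1) * B_i                # (-1)^{1+delta}: flips sign off-diagonal
    assert sqrt_pE @ A_i @ sqrt_pE <= p_i[i] + 1e-9   # lower bound of Eq. (5)
    assert p_i[i] <= sqrt_pE @ B_i @ sqrt_pE + 1e-9   # upper bound of Eq. (5)
```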

For Hamiltonian evolution, extremizing over the times of measurement yields tighter bounds

{\sqrt{\boldsymbol{p}}}^{T}\boldsymbol{A}_{i}{\sqrt{\boldsymbol{p}}}\ \leq\ p_{i}^{\min}\ \leq\ p_{i}^{\max}\ \leq\ {\sqrt{\boldsymbol{p}}}^{T}\boldsymbol{B}_{i}{\sqrt{\boldsymbol{p}}}, (7)

where $p_{i}^{\min}=\min_{t\in[0,T]}p_{i}$ and $p_{i}^{\max}=\max_{t\in[0,T]}p_{i}$.

These collective inequalities are generally non-linear in $p_{E}$ and neither convex nor concave. There are as many of them as there are measurement outcomes. While they do not bound each energy probability separately, they provide relationships between their respective sizes. For example, one can derive quantitative statements of the type: if $p_{E_{1}}$ is high, then $p_{E_{2}}$ must be low. Due to their non-linearity, they may require numerical methods to work with. However, in some cases they provide a robust improvement in estimating energy. See Appendix B for such an example with coarse-grained energy measurements.

Similar to Eq. (4), one can employ measurements in different bases. These generate more conditions of type (7) for the probabilities $p_{E}$, thus making the quantitative relations between them stricter.

Figure 2: Heisenberg model in the delocalized ($W=0.5$) and the localized ($W=10$) phases, 3 particles on 6 sites. Estimating energy by measuring in the local number basis (corresponding to $k=0$ in Fig. 3). The initial state is either a ground state (G), a cold state (C), Eq. (19), or a hot state (H). The graphs show the true mean energy $\langle E\rangle$ (single symbols: star, diamond, and disc for G, C, and H, respectively), intervals of analytic bounds $[E_{\min}^{\mathrm{lin}},E_{\max}^{\mathrm{lin}}]$ (full lines), and numerical bounds $[E_{\min},E_{\max}]$ (dashed lines) for each state. We also plot the list of energy eigenvalues at the very bottom (E). In the localized phase, energy eigenstates have a large overlap with the local number basis, making the energy estimation significantly more precise.

Bounds on the mean energy. Given the derived bounds on the probability distribution of energy, we can bound the mean energy of the system as follows,

E_{\min}^{\mathrm{lin}}\,\leq\,E_{\min}\,\leq\,\langle E\rangle\,\leq\,E_{\max}\,\leq\,E_{\max}^{\mathrm{lin}}. (8)

The inner bound is tighter but may be challenging to compute. The outer bound is looser, but it is analytically computable.

The inner bound is computed by optimizing the mean value of energy, $\langle E\rangle=\sum_{E}E\,p_{E}$, as

E_{\min}=\min_{\{p_{E}\}\in S}\langle E\rangle,\quad E_{\max}=\max_{\{p_{E}\}\in S}\langle E\rangle, (9)

over the set of probability distributions consistent with our observations, i.e., over the set that satisfies all the required inequalities

S=\left\{\{p_{E}\}\,\bigg|\,\sum_{E}p_{E}=1,\ \mathrm{Eq.}~(3),\text{ and }\mathrm{Eq.}~(7)\right\}. (10)

The mean energy itself is a linear function. While $\sum_{E}p_{E}=1$ and Eq. (3) are linear constraints, Eq. (7) is in general non-linear. Computing $E_{\min}$ and $E_{\max}$ is, therefore, a non-linear constrained optimization problem. Such problems are considered computationally demanding, although they are difficult to characterize within computational complexity theory Hochbaum (2007). They can be solved only approximately by various methods, for example, the Nelder-Mead algorithm Luersen and Le Riche (2004), random search Price (1983), differential evolution Storn (1996), machine learning methods Sivanandam and Deepa (2008), simulated Bertsimas and Tsitsiklis (1993) or quantum Das and Chakrabarti (2005) annealing, and modified linear programming methods Powell (1998). The time to find an exact solution typically scales exponentially with the number of variables, in our case, the dimension of the system.
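To make the optimization of Eqs. (9) and (10) concrete: keeping only the linear constraints (normalization and Eq. (3)) turns the problem into a linear program. The hypothetical sketch below (our own toy system; SciPy is assumed to be available) solves this relaxation with `scipy.optimize.linprog` and checks that the true mean energy falls inside the resulting interval.

```python
import numpy as np
from scipy.optimize import linprog  # SciPy assumed available

rng = np.random.default_rng(5)
N = 4
U = np.linalg.qr(rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N)))[0]  # columns |E>
M = np.linalg.qr(rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N)))[0]  # columns |i>
E_vals = np.sort(rng.normal(size=N))
psi = rng.normal(size=N) + 1j * rng.normal(size=N)
psi /= np.linalg.norm(psi)

# Per-level bounds of Eq. (2) at a single time.
c = np.abs(M.conj().T @ psi)[:, None] * np.abs(M.conj().T @ U)
b = c.sum(axis=0) ** 2
a = np.maximum(2 * (c ** 2).sum(axis=0) - b, 0.0)

# Linear program: extremize <E> = sum_E E p_E subject to
# sum_E p_E = 1 and a_E <= p_E <= b_E.
ones = np.ones((1, N))
lo = linprog(E_vals, A_eq=ones, b_eq=[1.0], bounds=list(zip(a, b)))
hi = linprog(-E_vals, A_eq=ones, b_eq=[1.0], bounds=list(zip(a, b)))  # maximize

E_mean = float(E_vals @ (np.abs(U.conj().T @ psi) ** 2))  # true mean energy
assert lo.success and hi.success
assert lo.fun - 1e-9 <= E_mean <= -hi.fun + 1e-9
```

Adding the non-convex constraints of Eq. (7) on top of this would require a general non-linear solver, as discussed in the text.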

However, we can solve an easier problem by including only the linear constraints in the optimization, i.e., by removing the requirement of satisfying Eqs. (7). This makes the bound looser but allows the optimization problem to be solved analytically. The reasoning behind the following derivation is explained in Fig. 1 and performed in detail in Appendix A. We assume that the energy eigenvalues are ordered in increasing order, $E_{j}\leq E_{j+1}$, with $E_{1}$ representing the ground state energy. We have

E_{\min}^{\mathrm{lin}}=\sum_{j=1}^{N}E_{j}u_{j}, (11)

where the probabilities $u_{j}\equiv u_{E_{j}}$ are computed recursively, starting from the ground state, as

u_{1}=\min\left\{b_{1}^{\min},\ 1-\sum_{l=2}^{N}a_{l}^{\max}\right\}, (12)
u_{j}=\min\left\{b_{j}^{\min},\ 1-\sum_{l=1}^{j-1}u_{l}-\sum_{l=j+1}^{N}a_{l}^{\max}\right\},\quad 2\leq j\leq N.

(We simplified the lower indices as $l\equiv E_{l}$; $N$ denotes the dimension of the system.) Similarly, we obtain

E_{\max}^{\mathrm{lin}}=\sum_{j=1}^{N}E_{j}w_{j}, (13)

where, starting from the highest energy state, we have

w_{N}=\min\left\{b_{N}^{\min},\ 1-\sum_{l=1}^{N-1}a_{l}^{\max}\right\}, (14)
w_{j}=\min\left\{b_{j}^{\min},\ 1-\sum_{l=j+1}^{N}w_{l}-\sum_{l=1}^{j-1}a_{l}^{\max}\right\},\quad 1\leq j\leq N-1.

Bounds on the higher moments are computed by replacing the eigenvalues $E$ with $E^{k}$ in Eq. (9).
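The recursions in Eqs. (11)-(14) amount to the greedy "pouring" procedure of Fig. 1 and translate directly into code. Below is a minimal sketch of our own (assuming `numpy`; the function name is hypothetical):

```python
import numpy as np

def linear_energy_bounds(E, a_max, b_min):
    """Analytic bounds E_min^lin, E_max^lin of Eqs. (11)-(14).

    E must be sorted in increasing order; a_max and b_min are the
    per-level bounds of Eq. (3).  Probability is poured greedily
    starting from the lowest (for E_min^lin) or the highest (for
    E_max^lin) energy, as illustrated in Fig. 1.
    """
    E, a_max, b_min = (np.asarray(x, dtype=float) for x in (E, a_max, b_min))
    N = len(E)
    u = np.zeros(N)
    for j in range(N):                             # Eq. (12): lowest energy first
        u[j] = min(b_min[j], 1.0 - u[:j].sum() - a_max[j + 1:].sum())
    w = np.zeros(N)
    for j in reversed(range(N)):                   # Eq. (14): highest energy first
        w[j] = min(b_min[j], 1.0 - w[j + 1:].sum() - a_max[:j].sum())
    return float(E @ u), float(E @ w)              # Eqs. (11) and (13)

# Trivial bounds (a_E = 0, b_E = 1) recover the full spectral range.
lo, hi = linear_energy_bounds([-1.0, 0.0, 2.0], [0, 0, 0], [1, 1, 1])
assert (lo, hi) == (-1.0, 2.0)
```

As a sanity check, when the per-level bounds pin the distribution exactly ($a_E^{\max} = b_E^{\min} = p_E$), both bounds collapse onto the true mean energy.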

Tightness of the analytic bound on the mean value of energy. Below Eq. (3), we discussed two cases where the bound on the energy probability distribution becomes tight. Now we show that the same arguments can also be extended to discuss the tightness of the bound on the mean energy, Eq. (8).

The first case is when the state of the system comes close to one of the measurement basis vectors during its time evolution. If the state becomes exactly one of the basis vectors, i.e., $p_{i}(t)=1$ at some time $t$, we can identify the state exactly as $\ket{i}$, and with it all of its properties, energy included. In this case, bounds (3) become tight and we obtain $\langle E\rangle=E_{\max}^{\mathrm{lin}}=E_{\min}^{\mathrm{lin}}$. The most informative times of measurement are those with a low value of observational entropy von Neumann (2010); Šafránek et al. (2019a, b); Strasberg and Winter (2021); Šafránek et al. (2021); Buscemi et al. (2022), due to the state wandering into a small subspace of the Hilbert space Šafránek et al. (2021); Šafránek and Thingna (2020) recognizable by the measurement. This can be advantageous for energy estimation in systems exhibiting recurrences and Loschmidt echoes Usaj et al. (1998); Sánchez et al. (2016); Pastawski et al. (1995); Levstein et al. (2004); Rauer et al. (2018), which return close to their original state after some time.

The second case is when the measurement is close to the energy measurement itself. This happens, for example, in the localized phase of many-body localized systems, in which the energy eigenvectors tend to localize in small portions of the Fock space Abanin et al. (2019). Thus, measuring local particle numbers is almost as good as measuring the energy itself. This is mathematically justified below Eq. (3).

Choosing the time interval and times of measurement. In experimental settings, the system can be evolved only for a finite time, and within this time interval, a measurement can be performed only at a finite number of times. Thus, it is useful to specify criteria for the time $T$ until which the system should be evolved, together with the corresponding times of measurement, for the time optimization, Eq. (4), to work at its best.

We can address this heuristically using the points introduced in the previous section. Generally, the ideal number and times of measurement depend on the initial state: if the state does not evolve much, or at all, which is the case for any energy eigenstate (such as the ground state), then only a single measurement is required. Additional measurements will not yield any improvement.

On the other hand, if a nontrivial evolution occurs, then more measurement times might be advantageous. The rule of thumb is to measure for as long as the observational entropy related to the measurement grows, until it reaches its equilibrium value. This is because bigger dips in observational entropy give more information, while small dips do not provide as much. The same criterion can be applied to identify the times of measurement within this interval: there should be as many as needed to resolve the medium-sized dips in the observational entropy evolution.

Bounding the mean values of observables other than energy. The derivations and results above can be repeated as they are for any observable that commutes with the Hamiltonian; in that context, $E$ denotes an eigenvalue of the observable in question. For observables that do not commute with the Hamiltonian, the expectation value itself changes in time, so the procedure must be performed at a fixed time $t$: extremization over time, Eqs. (3) and (7), is not possible, while extremization over different measurements at a fixed time, Eq. (4), can still be employed. Everything else proceeds identically to the energy estimation.

Demonstration on experimentally relevant many-body systems. We numerically demonstrate this method on the paradigmatic example of the one-dimensional disordered Heisenberg model Porras and Cirac (2004); Pal and Huse (2010); Luitz et al. (2015). Numerical experiments for other experimentally achieved models, Ising Smith et al. (2016); Jurcevic et al. (2017); Zhang et al. (2017); Bingham et al. (2021), XY Lanyon et al. (2017); Friis et al. (2018); Brydges et al. (2019); Maier et al. (2019), and PXP models Bernien et al. (2017); Turner et al. (2018); Su et al. (2022) are presented in Appendix E. A simple analytical example is presented in Appendix D.

The Hamiltonian is given by

{\hat{H}}=\sum_{i}\big(\hat{\sigma}_{i}^{x}\hat{\sigma}_{i+1}^{x}+\hat{\sigma}_{i}^{y}\hat{\sigma}_{i+1}^{y}+\hat{\sigma}_{i}^{z}\hat{\sigma}_{i+1}^{z}\big)+\sum_{i}h_{i}\hat{\sigma}_{i}^{z}, (15)

where $\hat{\sigma}_{i}^{a}$, $a=x,y,z$, are the Pauli operators acting on site $i$. The constants $h_{i}$ are drawn uniformly at random from the interval $[-W,W]$, with $W$ being the disorder strength. We show the case $W=0.5$ for the chaotic (delocalized) regime and $W=10$ for the localized regime Pal and Huse (2010); Luitz et al. (2015). See Appendix E for the Bethe-integrable regime $W=0$ Bethe (1931).
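The Hamiltonian of Eq. (15) is easy to build numerically for small chains via Kronecker products. The sketch below is our own illustration (the function name is hypothetical, `numpy` is assumed, and we assume open boundary conditions, which Eq. (15) leaves unspecified):

```python
import numpy as np
from functools import reduce

def heisenberg_hamiltonian(L, W, rng):
    """Dense matrix of the disordered Heisenberg chain, Eq. (15);
    open boundary conditions assumed."""
    sx = np.array([[0, 1], [1, 0]], dtype=complex)
    sy = np.array([[0, -1j], [1j, 0]], dtype=complex)
    sz = np.array([[1, 0], [0, -1]], dtype=complex)

    def site_op(op, i):                            # op on site i, identity elsewhere
        ops = [np.eye(2, dtype=complex)] * L
        ops[i] = op
        return reduce(np.kron, ops)

    H = np.zeros((2 ** L, 2 ** L), dtype=complex)
    for i in range(L - 1):                         # nearest-neighbour exchange terms
        for s in (sx, sy, sz):
            H += site_op(s, i) @ site_op(s, i + 1)
    h = rng.uniform(-W, W, size=L)                 # disorder drawn from [-W, W]
    for i in range(L):
        H += h[i] * site_op(sz, i)
    return H

H = heisenberg_hamiltonian(4, 0.5, np.random.default_rng(0))
assert np.allclose(H, H.conj().T)                  # Hermitian, as required
```

Dense construction scales as $4^L$ in memory, so for the $L=10$ systems of Fig. 3 a sparse or symmetry-resolved representation would be used instead.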

We choose a complete measurement in the local number basis,

{\mathcal{C}}=\{\ket{i_{1}}\otimes\cdots\otimes\ket{i_{L}}\}_{i_{1},\dots,i_{L}}, (16)

for small-system simulations. There, $i_{j}=0,1$ for all $j$, and $L$ is the length of the chain (such that the dimension of the Hilbert space is $N=2^{L}$). For example, for a chain of $L=3$ sites, the measurement basis is

{\mathcal{C}}=\{\ket{000},\ket{001},\ket{010},\ket{011},\ket{100},\ket{101},\ket{110},\ket{111}\}. (17)

This is an example of a one-local measurement, meaning that the measurement basis does not consist of states entangled between two or more sites. For large-system simulations, we also add optimized $k$-local measurements. $k$-local measurements are those that project onto states that are allowed to be entangled between $k$ neighboring sites. An example of a two-local measurement on a chain of $L=4$ sites is the measurement in the local Bell basis,

{\mathcal{C}}=\{\ket{\Phi^{+}}\!\ket{\Phi^{+}},\ket{\Phi^{+}}\!\ket{\Phi^{-}},\ket{\Phi^{+}}\!\ket{\Psi^{+}},\ket{\Phi^{+}}\!\ket{\Psi^{-}},\dots\}, (18)

where $\ket{\Phi^{\pm}}=(\ket{00}\pm\ket{11})/\sqrt{2}$ and $\ket{\Psi^{\pm}}=(\ket{01}\pm\ket{10})/\sqrt{2}$.

Figure 3: Heisenberg model in the delocalized ($W=0.5$) and the localized ($W=10$) phases, 5 particles on 10 sites. Estimating energy with progressively added optimized $k$-local measurements, with the same initial states as in Fig. 2 (G, C, H) and energy eigenvalues plotted at the very bottom (E). In large systems, we can no longer employ numerical methods to improve the analytic bound. Instead, we employ different measurements. (a) Sketch of allowed operations. $k=0$ (black) corresponds to measuring in the local number basis; $k=1$ (purple) allows for using single-site unitary operators before measuring in the local number basis, which leads to a general single-site measurement; $k=2$ (blue) allows using two-site local operators, etc. (b) and (d): In the ground-state-optimized method, the measurement basis is given by the eigenbasis of the $k$-local reduced state of the ground state, see Eqs. (22) and (23). Intervals of analytic bounds $[E_{\min}^{\mathrm{lin}(k)},E_{\max}^{\mathrm{lin}(k)}]$ are labeled with colors matching the allowed operations. (c) and (e): In the observable-optimized method, the experimenter measures $k$-local blocks of the Hamiltonian. The observable-optimized method achieves perfect energy estimation for all initial states ($k=10$ corresponds to measuring the Hamiltonian itself). In contrast, the ground-state-optimized method achieves that only for the ground state ($k=10$ corresponds to measuring in a basis that contains the ground state). Using two-qubit measurements ($k=2$) in the localized phase, the bound excludes 97.5% of the possible range of energies when estimating the ground state energy. See Appendix A for descriptions of the ground-state-optimized and observable-optimized methods. See Appendix E for details, and specifically Tables 2 and 3 for additional simulations of the experimentally achievable XY, PXP, and Ising models.

We consider three types of initial states. The first is the ground state (G). The second is a “pure thermal” state (C),

\ket{\psi_{\beta}}=\frac{1}{\mathcal{N}}\sum_{E}e^{-\beta E/2}\ket{E}, (19)

where $\mathcal{N}$ is the normalization factor, and the inverse temperature is chosen as $\beta=6/(E_{N}-E_{1})$. We take this choice of $\beta$ to imitate a cold state. Third, we randomly choose a pure state from the Hilbert space with the Haar measure (H), imitating an infinite-temperature state.
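Constructing the pure thermal state of Eq. (19) is a one-liner once the spectrum is known. A minimal sketch of our own (a random symmetric matrix stands in for Eq. (15); `numpy` assumed):

```python
import numpy as np

rng = np.random.default_rng(3)
# Hypothetical small Hamiltonian standing in for Eq. (15).
A = rng.normal(size=(8, 8))
H = (A + A.T) / 2
E_vals, U = np.linalg.eigh(H)                      # ascending eigenvalues, columns |E>

beta = 6.0 / (E_vals[-1] - E_vals[0])              # beta = 6/(E_N - E_1), as in the text
psi_beta = U @ np.exp(-beta * E_vals / 2.0)        # sum_E e^{-beta E/2} |E>
psi_beta /= np.linalg.norm(psi_beta)               # the 1/N normalization of Eq. (19)

p_E = np.abs(U.conj().T @ psi_beta) ** 2           # Boltzmann-like weights e^{-beta E}/Z
assert np.all(np.diff(p_E) <= 1e-12)               # lower energies dominate: a "cold" state
```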

To compute the bounds, we evolve each initial state with the Hamiltonian for a total time of $T=160$.

In Figure 2, we show estimates of energy in small systems, taking three particles on a chain of $L=6$ sites. The Heisenberg Hamiltonian is particle-conserving, so the initial state explores only a subspace of the full Hilbert space. We analytically solve for the looser bound using Eqs. (11) and (13). This solution serves as a starting point for the COBYLA optimization algorithm Powell (1994, 1998), used to compute the tighter bound, Eq. (9). In Figure 3, we plot estimates of energy in a large system, 5 particles on $L=10$ sites, in which computing the numerical bound is prohibitively difficult. Instead, we add more measurements and calculate the corresponding analytical bounds in increasing degree of non-locality.

Generalization to mixed states and POVMs. The most general quantum measurements are represented by positive operator-valued measures (POVMs), ${\mathcal{C}}=\{\hat{\Pi}_{i}\}_{i}$, satisfying the completeness relation $\sum_{i}\hat{\Pi}_{i}={\hat{I}}$. Each $\hat{\Pi}_{i}$ is a positive semi-definite operator called a POVM element. For a density matrix ${\hat{\rho}}$ representing the state of the system, the probability of obtaining measurement outcome $i$ is given by $p_{i}=\tr[{\hat{\Pi}}_{i}{\hat{\rho}}]$.

POVM elements admit a spectral decomposition $\hat{\Pi}_{i}=\sum_{k}\gamma_{i}^{k}\outerproduct{i^{k}}{i^{k}}$, where $0<\gamma_{i}^{k}\leq 1$ and the $\ket{i^{k}}$ are orthogonal to each other for different $k$'s. We define its “volume” as $V_{i}=\tr[{\hat{\Pi}}_{i}]$. We further define $x_{i}^{E}=\min_{k}\gamma_{i}^{k}\absolutevalue{\innerproduct{i^{k}}{E}}^{2}$, $y_{i}^{E}=\max_{k}\absolutevalue{\innerproduct{i^{k}}{E}}^{2}$, and $\gamma_{i}=\min_{k}\gamma_{i}^{k}$. Note that these extrema are taken only over $k$ for which $\gamma_{i}^{k}$ is positive, i.e., non-zero.

The results of this paper generalize to mixed states and general measurements by taking

a_{E}=\max\left\{\sum_{i}p_{i}\big(x_{i}^{E}+\gamma_{i}y_{i}^{E}\big)-\left(\sum_{i}\sqrt{p_{i}y_{i}^{E}V_{i}}\right)^{2},\ 0\right\},
b_{E}=\bigg(\sum_{i}\sqrt{p_{i}\bra{E}{\hat{\Pi}}_{i}\ket{E}}\bigg)^{2}, (20)

in Eq. (1), and by taking

\begin{split}(\boldsymbol{A}_{i})_{EE^{\prime}}&=(-1)^{1+\delta_{EE^{\prime}}}\absolutevalue{\bra{E}{\hat{\Pi}}_{i}\ket{E^{\prime}}},\\ (\boldsymbol{B}_{i})_{EE^{\prime}}&=\absolutevalue{\bra{E}{\hat{\Pi}}_{i}\ket{E^{\prime}}},\end{split} (21)

in Eq. (6). See Appendix A for the proofs. The results that follow do not depend on the specific form of the bounds and thus proceed identically. For a complete projective measurement, we have ${\hat{\Pi}}_{i}=\outerproduct{i}{i}$, which implies $x_{i}^{E}=y_{i}^{E}=\absolutevalue{\innerproduct{i}{E}}^{2}$ and $\gamma_{i}=V_{i}=1$, from which the initial bounds easily follow.

III Discussion and Conclusion

Quantum measurements provide more information than one would initially think. We developed a method that allows us to measure one observable and predict bounds on the distribution of outcomes and expectation values of every other observable. In this method, it is assumed that we have enough copies of the initial state so we can determine the entire probability distribution of outcomes of the measured observable. The method works well either when the measured and the estimated observables resemble each other or when the system state is close to one of the measurement basis states. In those cases, the bounds will be very tight. On the other hand, if the measurement cannot distinguish between two eigenstates with very different eigenvalues, and the system state has considerable overlap with one of them, the method naturally cannot give a good estimate. However, this can be overcome by combining measurements in different bases. Additionally, when estimating conserved quantities, better estimates are obtained by measuring at different times.

It is interesting to compare the presented method with the recent work of Huang et al. (2020). There, an algorithm is provided that approximates the mean value of an observable with high probability by applying randomly sampled measurements, known as classical shadows. This idea has been extended in subsequent literature both theoretically Zhao et al. (2021); Hadfield et al. (2022); Bu et al. (2022); Sack et al. (2022); Ippoliti (2023); Hu and You (2022); Hu et al. (2023); Gresch and Kliesch (2023); Akhtar et al. (2023); Seif et al. (2023), and experimentally Zhang et al. (2021); Struchalin et al. (2021).

In classical shadows literature, it is assumed that performing any type of measurement is possible. These measurements are sampled randomly from a tomographically complete set, meaning that with this set of measurements, quantum tomography is possible. The goal is to estimate the expectation value of an observable and achieve an error lower than ϵ\epsilon, with as few measurements as possible. In other words, it is assumed that only a limited number of copies of the state are available to be measured.

In contrast, in the method presented here, it is assumed that only a single type of measurement can be performed. This measurement has been chosen from a limited set of measurements that are experimentally available. At the same time, it is assumed that infinitely many copies of the initial state are available. Thus, we can perform as many repetitions of the same measurement as necessary to fully specify its probability distribution.

In our method, since we do not sample from a tomographically complete set, we always incur a finite error in estimating the expectation value. This error stems from the misalignment between the chosen measurement basis on one side, and the eigenbases of the estimated observable and of the density matrix on the other.

Thus, while attempting to address a very similar goal, the two approaches differ in their assumptions and outcomes. Our method shines in exactly those situations in which not every measurement can be performed. This is motivated by the experimental capabilities of current state-of-the-art quantum simulators, which allow for the application of only one- and possibly two-qubit gates. For this reason, we focused on local measurements. Naturally, the method will perform better as experimental capabilities improve. Its main strength, though, is that it yields a prediction of the mean value of an observable, together with an error estimate, even in cases where experimental capabilities are very limited and other methods cannot be used.

We argued for using this method for estimating moments of energy, which have a wide range of applications while being difficult to measure directly in many-body systems. For instance, using this method, one can bound the characteristic timescale through the Mandelstam-Tamm and Margolus-Levitin bounds Mandelstam and Tamm (1991); Margolus and Levitin (1998); Deffner and Campbell (2017), estimate the amount of extractable work from an unknown source of states Šafránek et al. (2022), or estimate temperature. The latter can be used to benchmark the cooling function of quantum annealers Benedetti et al. (2016); Pino and García-Ripoll (2020); Hauke et al. (2020); Imoto et al. (2021) and adiabatic quantum computers Mohseni et al. (2019). This could be particularly suitable for systems with area-law entanglement scaling Eisert et al. (2010); Abanin et al. (2019), in which local measurements should be more powerful given the absence of long-range correlations in the eigenstates of such systems. We confirmed this numerically in two gapped models, which are proven to have area-law ground states Hastings (2007); in these, estimating the ground state energy using only local measurements works especially well. The method applies equally well, and proceeds identically, to estimating observables other than energy, with one exception: if such an observable does not commute with the Hamiltonian, its mean value is time-dependent, so one has to pick a specific time for the analysis. (In contrast, when estimating energy, we could improve the bounds by measuring at different times, using the fact that its mean value is conserved.) The method can also be used to prove entanglement through an entanglement witness (operator $\hat{A}$) without measuring the witness itself: to prove entanglement, it is enough to show that the expectation value of this operator is negative Terhal (2000); Barbieri et al. (2003); Lewenstein et al. (2000).

On theoretical grounds, this research instigates new paths of exploration. How should one choose a measurement, given some restriction (for example, on its locality or on the number of elementary gates it consists of), so that it leads to the tightest possible bound? Is it possible to apply machine learning models to find this optimal strategy? Will the bound give an exact value when the set of measurements is tomographically complete? Can this method be modified to identify the properties of channels instead of states? Given that this method bounds the entire probability distribution of outcomes, is it possible to modify it to estimate functions of a state other than expectation values, such as entanglement entropy?

Acknowledgments. We thank Felix C. Binder for the collaboration on a related project during which some of the ideas for this paper started to surface. We thank Dung Xuan Nguyen, Sungjong Woo, and Siranuysh Badalyan for their comments and discussions. We acknowledge the support from the Institute for Basic Science in Korea (IBS-R024-D1).

Author Contributions. D.Š. Conceptualization, theory, bounds and their proofs in particular, writing, visualization of figures, development of the ground state optimized and the observable optimized (type 1) methods, and creating a single qubit example in the Appendix. D.R. Development of the software for generating numerical experiments of Heisenberg, XY, PXP, and Ising models and producing the data, editing, and development of the observable optimized (type 2) method. Both authors contributed to the style of the paper and cross-contributed to the main roles of the other through frequent discussions.

Data and code availability. Data and the code used to generate data for this paper are available from the corresponding authors upon reasonable request.

Appendix

This Appendix provides methods and proofs of the bounds, several examples, and additional numerical experiments on experimentally realized many-body systems. It contains App. A: Methods and proofs of the bounds. App. B: Example in which the bound on the collection of energy probabilities provides a much better estimate of the mean energy value. App. C: Introduction of quality factors — two measures of performance. App. D: Simple analytic example of the mean energy estimation of a qubit. App. E: Estimation of the mean energy in experimentally relevant models: Heisenberg, Ising, XY, and PXP models.

Appendix A Methods and proofs

Ground state optimized and observable optimized methods for finding appropriate $k$-local measurements. We choose a chain of length $L=10$ and two-local ($k=2$) measurements to illustrate.

Ground state optimized method: This method is inspired by the Matrix Product State ansatz Orús (2014) and by the correspondence between observational and entanglement entropy Schindler et al. (2020). We choose a pure state (in our case, the ground state) $\ket{\psi_{0}}$ for which we want to optimize. We divide the chain into $L/k$ local parts and generate the local basis as the eigenbasis of the reduced state; this corresponds to the local Schmidt basis. For example, for the first two sites, denoted as subsystem $A_{1}$, while denoting the full system as $A$, we have

{\hat{\rho}}_{A_{1}}=\tr_{A\setminus{A_{1}}}\outerproduct{\psi_{0}}{\psi_{0}}, (22)

from which we compute eigenvectors $\{\ket{\psi_{i_{1}}^{A_{1}}}\}$. Eq. (22) is what we refer to as the $k$-local reduced state of the ground state. The final, ground-state-optimized two-local measurement basis is given as a product of the local bases generated this way,

\big{\{}\ket{\psi_{i_{1}}^{A_{1}}}\otimes\ket{\psi_{i_{2}}^{A_{2}}}\otimes\ket{\psi_{i_{3}}^{A_{3}}}\otimes\ket{\psi_{i_{4}}^{A_{4}}}\otimes\ket{\psi_{i_{5}}^{A_{5}}}\big{\}}. (23)

This method ensures that for the full system ($k=L$), the ground state is one of the measurement basis states.
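This construction translates into a few lines of numpy. The following sketch (our own, with illustrative names; we use a hypothetical chain of $L=4$ sites instead of the paper's $L=10$, and a random state standing in for the ground state) builds the $k$-local Schmidt bases of Eq. (22) and their product, Eq. (23):

```python
import numpy as np

rng = np.random.default_rng(1)

L, k = 4, 2                      # hypothetical small chain; the paper uses L = 10
psi = rng.normal(size=2**L) + 1j * rng.normal(size=2**L)
psi /= np.linalg.norm(psi)       # stand-in for the ground state |psi_0>

def local_schmidt_basis(psi, block, L, k):
    """Eigenbasis of the k-local reduced state on the given block,
    cf. Eq. (22): rho_block = Tr_rest |psi><psi|."""
    left, mid, right = 2**(block * k), 2**k, 2**(L - (block + 1) * k)
    t = psi.reshape(left, mid, right)
    rho = np.einsum('abc,adc->bd', t, t.conj())   # partial trace over the rest
    _, vecs = np.linalg.eigh(rho)
    return vecs                                    # columns = local basis states

# product basis over the L/k blocks, cf. Eq. (23)
basis = np.eye(1)
for block in range(L // k):
    basis = np.kron(basis, local_schmidt_basis(psi, block, L, k))

# the result is a complete orthonormal k-local measurement basis
assert np.allclose(basis.conj().T @ basis, np.eye(2**L))
```

The tensor product of the per-block eigenbases is automatically orthonormal, so it defines a valid complete projective measurement.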

Observable optimized method: This method is a $k$-local optimization for a specific observable, in our case the Hamiltonian. The basis is generated as the eigenbasis of the Hamiltonian with the interaction terms between the local parts removed. For example, in the Heisenberg model, the two-local measurement basis is given as the eigenbasis of the Hamiltonian

{\hat{H}}_{2}=\sum_{i=1,3,5,7,9}\big{(}\hat{\sigma}_{i}^{x}\hat{\sigma}_{i+1}^{x}+\hat{\sigma}_{i}^{y}\hat{\sigma}_{i+1}^{y}+\hat{\sigma}_{i}^{z}\hat{\sigma}_{i+1}^{z}\big{)}+\sum_{i=1}^{10}h_{i}\hat{\sigma}_{i}^{z}. (24)

This method ensures that the measurement basis for the full system ($k=L$) is the same as the eigenbasis of the observable we optimize for. See Appendix E, Methods for optimizing local measurements, for details.
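For concreteness, a sketch of this construction (ours, under the assumption of a hypothetical $L=4$ chain rather than the $L=10$ of Eq. (24)): we diagonalize each decoupled two-site block and take the product of the block eigenbases, then confirm that this product basis diagonalizes the decoupled Hamiltonian $\hat{H}_{2}$:

```python
import numpy as np

sx = np.array([[0, 1], [1, 0]], dtype=complex)
sy = np.array([[0, -1j], [1j, 0]], dtype=complex)
sz = np.array([[1, 0], [0, -1]], dtype=complex)
id2 = np.eye(2, dtype=complex)

rng = np.random.default_rng(2)
L = 4                                  # hypothetical small chain
h = rng.uniform(-1, 1, size=L)         # on-site fields h_i

def block_hamiltonian(h1, h2):
    """One decoupled two-site block of the Heisenberg H_2, cf. Eq. (24)."""
    Hb = sum(np.kron(s, s) for s in (sx, sy, sz))
    Hb += h1 * np.kron(sz, id2) + h2 * np.kron(id2, sz)
    return Hb

# two-local measurement basis = product of the block eigenbases
blocks = [block_hamiltonian(h[i], h[i + 1]) for i in (0, 2)]
basis = np.eye(1, dtype=complex)
for Hb in blocks:
    basis = np.kron(basis, np.linalg.eigh(Hb)[1])

# H_2 (couplings between blocks removed) is diagonal in this basis
H2 = np.kron(blocks[0], np.eye(4)) + np.kron(np.eye(4), blocks[1])
D = basis.conj().T @ H2 @ basis
assert np.allclose(D, np.diag(np.diag(D)))
```

Because $\hat{H}_{2}$ is a sum of commuting block terms, its eigenbasis factorizes into a product over blocks, which is exactly what makes it a $k$-local measurement.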

Upper bound on energy probabilities. First, we prove an upper bound on the energy probability,

p_{E}\leq\bigg{(}\sum_{i}\sqrt{p_{i}}\sqrt{\bra{E}{\hat{\Pi}}_{i}\ket{E}}\bigg{)}^{2}, (25)

where the right-hand side defines $b_{E}$. This is the second half of Eq. (20).

Proof.

We express the spectral decomposition of the density matrix as ${\hat{\rho}}=\sum_{m}\lambda_{m}|\psi_{m}\rangle\langle\psi_{m}|$, where $\lambda_{m}$ are its eigenvalues corresponding to eigenvectors $\ket{\psi_{m}}$. The probability $p_{i}=\tr[{\hat{\Pi}}_{i}{\hat{\rho}}]$ then reads $p_{i}=\sum_{m}\lambda_{m}\bra{\psi_{m}}{\hat{\Pi}}_{i}\ket{\psi_{m}}$. A series of inequalities follows:

pE=E|ρ^|E=mλm|E|ψm|2=mλm|iE|Π^i|ψm|2mλm(i|E|Π^i|ψm|)2=mλm(i|E|Π^iΠ^i|ψm|)2mλm(iE|Π^i|Eψm|Π^i|ψm)2=mλm𝒙m12mλm𝒙m12=(imλmE|Π^i|Eψm|Π^i|ψm)2=(ipiE|Π^i|E)2.\begin{split}p_{E}&=\bra{E}{\hat{\rho}}\ket{E}=\sum_{m}\lambda_{m}\absolutevalue{\innerproduct{E}{\psi_{m}}}^{2}=\sum_{m}\lambda_{m}\Big{|}\!\sum_{i}\bra{E}{\hat{\Pi}}_{i}\ket{\psi_{m}}\!\Big{|}^{2}\\ &\leq\sum_{m}\lambda_{m}\bigg{(}\sum_{i}\absolutevalue{\bra{E}{\hat{\Pi}}_{i}\ket{\psi_{m}}}\bigg{)}^{2}\\ &=\sum_{m}\lambda_{m}\bigg{(}\sum_{i}\absolutevalue{\bra{E}\sqrt{{\hat{\Pi}}_{i}^{\dagger}}\sqrt{{\hat{\Pi}}_{i}}\ket{\psi_{m}}}\bigg{)}^{2}\\ &\leq\sum_{m}\lambda_{m}\bigg{(}\sum_{i}\sqrt{\bra{E}{\hat{\Pi}}_{i}\ket{E}\bra{\psi_{m}}{\hat{\Pi}}_{i}\ket{\psi_{m}}}\bigg{)}^{2}\\ &=\sum_{m}\lambda_{m}\left\|{\boldsymbol{x}}^{m}\right\|_{\frac{1}{2}}\\ &\leq\Big{\|}\sum_{m}\lambda_{m}{\boldsymbol{x}}^{m}\Big{\|}_{\frac{1}{2}}\\ &=\bigg{(}\sum_{i}\sqrt{\sum_{m}\lambda_{m}\bra{E}{\hat{\Pi}}_{i}\ket{E}\bra{\psi_{m}}{\hat{\Pi}}_{i}\ket{\psi_{m}}}\bigg{)}^{2}\\ &=\bigg{(}\sum_{i}\sqrt{p_{i}}\sqrt{\bra{E}{\hat{\Pi}}_{i}\ket{E}}\bigg{)}^{2}.\\ \end{split} (26)

The first inequality is the triangle inequality, the second is the Cauchy-Schwarz inequality, and the third is Jensen's theorem applied to the ($p=\frac{1}{2}$)-seminorm

𝒙12=(ixi)2.\left\|{\boldsymbol{x}}\right\|_{\frac{1}{2}}=\Big{(}\sum_{i}\sqrt{x_{i}}\Big{)}^{2}. (27)

$\boldsymbol{x}^{m}=(x_{1}^{m},x_{2}^{m},\dots)$ is a vector of non-negative entries $x_{i}^{m}=\bra{E}{\hat{\Pi}}_{i}\ket{E}\bra{\psi_{m}}{\hat{\Pi}}_{i}\ket{\psi_{m}}\geq 0$. To apply Jensen's theorem, we first need to confirm that the ($p=\frac{1}{2}$)-seminorm is a concave function. It is concave when restricted to vectors with non-negative entries, $\left\|~{}\right\|_{\frac{1}{2}}:\mathbb{R}_{+}^{N}\rightarrow\mathbb{R}_{+}$, which is indeed the case here. This follows from the reverse Minkowski inequality

𝒙+𝒚p𝒙p+𝒚p,\norm{{\boldsymbol{x}}+{\boldsymbol{y}}}_{p}\geq\norm{{\boldsymbol{x}}}_{p}+\norm{{\boldsymbol{y}}}_{p}, (28)

(where it is assumed that $x_{i}\geq 0$, $y_{i}\geq 0$), which holds for all ($p<1$)-seminorms. Taking $0\leq q\leq 1$, we have

q𝒙+(1q)𝒚pq𝒙p+(1q)𝒚p=q𝒙p+(1q)𝒚p,\begin{split}\norm{q{\boldsymbol{x}}+(1-q){\boldsymbol{y}}}_{p}&\geq\norm{q{\boldsymbol{x}}}_{p}+\norm{(1-q){\boldsymbol{y}}}_{p}\\ &=q\norm{{\boldsymbol{x}}}_{p}+(1-q)\norm{{\boldsymbol{y}}}_{p},\end{split} (29)

so $\left\|~{}\right\|_{\frac{1}{2}}$ is indeed concave. ∎
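Both properties used above, the reverse Minkowski inequality (28) and the resulting concavity (29), are easy to confirm numerically for the ($p=\frac{1}{2}$)-seminorm. A quick randomized check (our own sketch):

```python
import numpy as np

rng = np.random.default_rng(3)

def halfnorm(x):
    """(p = 1/2)-seminorm of Eq. (27): (sum_i sqrt(x_i))^2."""
    return np.sum(np.sqrt(x)) ** 2

for _ in range(1000):
    x = rng.uniform(size=6)
    y = rng.uniform(size=6)
    q = rng.uniform()
    # reverse Minkowski inequality, Eq. (28)
    assert halfnorm(x + y) >= halfnorm(x) + halfnorm(y) - 1e-9
    # concavity on the positive orthant, Eq. (29)
    assert halfnorm(q * x + (1 - q) * y) >= q * halfnorm(x) + (1 - q) * halfnorm(y) - 1e-9
```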

Lower bound on energy probabilities. Next, we prove a lower bound on the energy probabilities,

p_{E}\geq\max\left\{\sum_{i}p_{i}\big{(}x_{i}^{E}+\gamma_{i}y_{i}^{E}\big{)}-\left(\sum_{i}\sqrt{p_{i}y_{i}^{E}V_{i}}\right)^{2}\!\!\!,0\right\}. (30)

where the right-hand side defines $a_{E}$. This is the first half of Eq. (20), above which $x_{i}^{E}$, $y_{i}^{E}$, and $V_{i}$ are defined. We drop the superscripts $E$ to keep the notation cleaner, i.e., we write $x_{i}\equiv x_{i}^{E}$ and $y_{i}\equiv y_{i}^{E}$.

Proof.

It is clear that pE0p_{E}\geq 0. To derive the first inequality, we start by expressing pEp_{E} more conveniently as

pE=E|ρ^|E=mλm|iE|Π^i|ψm|2=mλm|i,kγikE|ikik|ψm|2=m,i,kλm|γikE|ikik|ψm|2+m,ii,k,kλmγikγikik|EE|ikik|ψmψm|ik+m,i,kkλmγikγikik|EE|ikik|ψmψm|ikm,i,kλm|γikE|ikik|ψm|2+m,i,kλm|γikik|ψm|2yiwm,i,kλmγik|ik|ψm|2(xi+γiyi)w=ipi(xi+γiyi)w.\begin{split}p_{E}&=\bra{E}{\hat{\rho}}\ket{E}=\sum_{m}\lambda_{m}\absolutevalue{\sum_{i}\bra{E}{\hat{\Pi}}_{i}\ket{\psi_{m}}}^{2}\\ &=\sum_{m}\lambda_{m}\absolutevalue{\sum_{i,k}\gamma_{i}^{k}\innerproduct{E}{i^{k}}\innerproduct{i^{k}}{\psi_{m}}}^{2}\\ &=\sum_{m,i,k}\lambda_{m}\absolutevalue{\gamma_{i}^{k}\innerproduct{E}{i^{k}}\innerproduct{i^{k}}{\psi_{m}}}^{2}\\ &+\sum_{m,i\neq i^{\prime},k,k^{\prime}}\lambda_{m}\gamma_{i}^{k}\gamma_{i^{\prime}}^{k^{\prime}}\innerproduct{i^{\prime k^{\prime}}}{E}\innerproduct{E}{i^{k}}\innerproduct{i^{k}}{\psi_{m}}\innerproduct{\psi_{m}}{i^{\prime k^{\prime}}}\\ &+\sum_{m,i,k\neq k^{\prime}}\lambda_{m}\gamma_{i}^{k}\gamma_{i}^{k^{\prime}}\innerproduct{i^{k^{\prime}}}{E}\innerproduct{E}{i^{k}}\innerproduct{i^{k}}{\psi_{m}}\innerproduct{\psi_{m}}{i^{k^{\prime}}}\\ &\geq\!\!\!\sum_{m,i,k}\!\!\lambda_{m}\absolutevalue{\gamma_{i}^{k}\innerproduct{E}{i^{k}}\innerproduct{i^{k}}{\psi_{m}}}^{2}\!\!+\!\!\!\sum_{m,i,k}\!\!\!\lambda_{m}\absolutevalue{\gamma_{i}^{k}\innerproduct{i^{k}}{\psi_{m}}}^{2}\!y_{i}\!-\!w\\ &\geq\sum_{m,i,k}\lambda_{m}\gamma_{i}^{k}\absolutevalue{\innerproduct{i^{k}}{\psi_{m}}}^{2}(x_{i}+\gamma_{i}y_{i})-w\\ &=\sum_{i}p_{i}(x_{i}+\gamma_{i}y_{i})-w.\end{split} (31)

The first inequality follows from the definition of absolute value and the second from definitions of xix_{i} and γi\gamma_{i}. We defined

w=m,ii,k,kλmγikγik|ik|EE|ikik|ψmψm|ik|+m,i,kkλmγikγik|ik|EE|ikik|ψmψm|ik|+m,i,kλm|γikik|ψm|2yim,i,i,k,kλmγikγik|ik|ψmψm|ik|yiyi=i,iyiyimλm(kγik|ik|ψm|)(kγik|ik|ψm|)i,iyiyimλm(kγik|ik|ψm|2kγik)×(kγik|ik|ψm|2kγik)=i,iyiyiViVimλmaim2aim2i,iyiyiViVimλmaim22mλmaim22=i,ipipiyiyiViVi=(ipiyiVi)2.\begin{split}w&=\sum_{m,i\neq i^{\prime},k,k^{\prime}}\!\!\lambda_{m}\gamma_{i}^{k}\gamma_{i^{\prime}}^{k^{\prime}}\absolutevalue{\innerproduct{i^{\prime k^{\prime}}}{E}\innerproduct{E}{i^{k}}\innerproduct{i^{k}}{\psi_{m}}\innerproduct{\psi_{m}}{i^{\prime k^{\prime}}}}\\ &+\sum_{m,i,k\neq k^{\prime}}\!\!\lambda_{m}\gamma_{i}^{k}\gamma_{i}^{k^{\prime}}\absolutevalue{\innerproduct{i^{k^{\prime}}}{E}\innerproduct{E}{i^{k}}\innerproduct{i^{k}}{\psi_{m}}\innerproduct{\psi_{m}}{i^{k^{\prime}}}}\\ &+\sum_{m,i,k}\lambda_{m}\absolutevalue{\gamma_{i}^{k}\innerproduct{i^{k}}{\psi_{m}}}^{2}y_{i}\\ &\leq\sum_{m,i,i^{\prime},k,k^{\prime}}\lambda_{m}\gamma_{i}^{k}\gamma_{i^{\prime}}^{k^{\prime}}\absolutevalue{\innerproduct{i^{k}}{\psi_{m}}\innerproduct{\psi_{m}}{i^{\prime k^{\prime}}}}\sqrt{y_{i}y_{i^{\prime}}}\\ &=\sum_{i,i^{\prime}}\!\sqrt{y_{i}y_{i^{\prime}}}\sum_{m}\!\lambda_{m}\Big{(}\sum_{k}\!\gamma_{i}^{k}\absolutevalue{\innerproduct{i^{k}}{\psi_{m}}}\Big{)}\Big{(}\sum_{k^{\prime}}\!\gamma_{i^{\prime}}^{k^{\prime}}\absolutevalue{\innerproduct{i^{\prime k^{\prime}}}{\psi_{m}}}\Big{)}\\ &\leq\sum_{i,i^{\prime}}\!\sqrt{y_{i}y_{i^{\prime}}}\sum_{m}\!\lambda_{m}\Big{(}\sqrt{\sum_{k}\gamma_{i}^{k}\absolutevalue{\innerproduct{i^{k}}{\psi_{m}}}^{2}}\sqrt{\sum_{k}\gamma_{i}^{k}}\Big{)}\\ &\times\Big{(}\sqrt{\sum_{k^{\prime}}\gamma_{i^{\prime}}^{k^{\prime}}\absolutevalue{\innerproduct{i^{\prime k^{\prime}}}{\psi_{m}}}^{2}}\sqrt{\sum_{k^{\prime}}\gamma_{i^{\prime}}^{k^{\prime}}}\Big{)}\\ &=\sum_{i,i^{\prime}}\sqrt{y_{i}y_{i^{\prime}}V_{i}V_{i^{\prime}}}\sum_{m}\lambda_{m}\norm{\vec{a}_{i}^{m}}_{2}\norm{\vec{a}_{i^{\prime}}^{m}}_{2}\\ 
&\leq\sum_{i,i^{\prime}}\sqrt{y_{i}y_{i^{\prime}}V_{i}V_{i^{\prime}}}\sqrt{\sum_{m}\lambda_{m}\norm{\vec{a}_{i}^{m}}_{2}^{2}}\sqrt{\sum_{m^{\prime}}\lambda_{m^{\prime}}\norm{\vec{a}_{i^{\prime}}^{m^{\prime}}}_{2}^{2}}\\ &=\sum_{i,i^{\prime}}\sqrt{p_{i}p_{i^{\prime}}y_{i}y_{i^{\prime}}V_{i}V_{i^{\prime}}}=\Big{(}\sum_{i}\sqrt{p_{i}y_{i}V_{i}}\Big{)}^{2}.\end{split} (32)

where $V_{i}=\sum_{k}\gamma_{i}^{k}$, $p_{i}=\sum_{m}\lambda_{m}\norm{\vec{a}_{i}^{m}}_{2}^{2}$, $\vec{a}_{i}^{m}=(\sqrt{\gamma_{i}^{1}}\absolutevalue{\innerproduct{i^{1}}{\psi_{m}}},\sqrt{\gamma_{i}^{2}}\absolutevalue{\innerproduct{i^{2}}{\psi_{m}}},\dots)$, and $\norm{~{}}_{2}$ is the two-norm. The first inequality follows from the definition of $y_{i}$. We used the Cauchy-Schwarz inequality in the form $\absolutevalue{\sum_{i}a_{i}b_{i}}\leq\sqrt{\sum_{i}a_{i}^{2}}\sqrt{\sum_{i^{\prime}}b_{i^{\prime}}^{2}}$ in the second and third inequalities.

Combining the two bounds, we obtain Eq. (30). ∎

Proof of collective bounds. Finally, we prove

𝒑T𝑨i𝒑pi,{\sqrt{\boldsymbol{p}}}^{T}\boldsymbol{A}_{i}{\sqrt{\boldsymbol{p}}}\ \leq\ p_{i}, (33)

where $({\sqrt{\boldsymbol{p}}})_{E}=\sqrt{p_{E}}$ and $(\boldsymbol{A}_{i})_{EE^{\prime}}=(-1)^{1+\delta_{EE^{\prime}}}\absolutevalue{\bra{E}{\hat{\Pi}}_{i}\ket{E^{\prime}}}$. Written out, this reads $\sum_{E,E^{\prime}}\sqrt{p_{E}}(\boldsymbol{A}_{i})_{EE^{\prime}}\sqrt{p_{E^{\prime}}}\leq p_{i}$.

Proof.
pi=tr[Π^iρ^]=E,Etr[|EE|Π^i|EE|ρ^]=E,EE|Π^i|EE|ρ^|E=EE|Π^i|EpE+EEE|Π^i|EE|ρ^|EEE|Π^i|EpEEE|E|Π^i|E||E|ρ^|E|EE|Π^i|EpEEE|E|Π^i|E|pEpE=2EE|Π^i|EpEE,E|E|Π^i|E|pEpE=E,EpE(𝑨i)EEpE,\begin{split}p_{i}&=\tr[{\hat{\Pi}}_{i}{\hat{\rho}}]=\sum_{E,E^{\prime}}\tr[|E\rangle\langle E|{\hat{\Pi}}_{i}|E^{\prime}\rangle\langle E^{\prime}|{\hat{\rho}}]\\ &=\sum_{E,E^{\prime}}\bra{E}{\hat{\Pi}}_{i}\ket{E^{\prime}}\bra{E^{\prime}}{\hat{\rho}}\ket{E}\\ &=\sum_{E}\bra{E}{\hat{\Pi}}_{i}\ket{E}p_{E}+\sum_{E\neq E^{\prime}}\bra{E}{\hat{\Pi}}_{i}\ket{E^{\prime}}\bra{E^{\prime}}{\hat{\rho}}\ket{E}\\ &\geq\sum_{E}\bra{E}{\hat{\Pi}}_{i}\ket{E}p_{E}-\sum_{E\neq E^{\prime}}\absolutevalue{\bra{E}{\hat{\Pi}}_{i}\ket{E^{\prime}}}\absolutevalue{\bra{E^{\prime}}{\hat{\rho}}\ket{E}}\\ &\geq\sum_{E}\bra{E}{\hat{\Pi}}_{i}\ket{E}p_{E}-\sum_{E\neq E^{\prime}}\absolutevalue{\bra{E}{\hat{\Pi}}_{i}\ket{E^{\prime}}}\sqrt{p_{E}p_{E^{\prime}}}\\ &=2\sum_{E}\bra{E}{\hat{\Pi}}_{i}\ket{E}p_{E}-\sum_{E,E^{\prime}}\absolutevalue{\bra{E}{\hat{\Pi}}_{i}\ket{E^{\prime}}}\sqrt{p_{E}p_{E^{\prime}}}\\ &=\sum_{E,E^{\prime}}\sqrt{p_{E}}(\boldsymbol{A}_{i})_{EE^{\prime}}\sqrt{p_{E^{\prime}}},\end{split} (34)

where $p_{E}=\bra{E}{\hat{\rho}}\ket{E}$. For the last inequality, we used that ${\hat{\rho}}$ is positive semi-definite; by Sylvester's criterion for positive semi-definite matrices Swamy (1973), all principal minors (determinants of square submatrices) must be non-negative. For $2\times 2$ submatrices, this means

E|ρ^|EE|ρ^|EE|ρ^|EE|ρ^|E0,\bra{E}{\hat{\rho}}\ket{E}\bra{E^{\prime}}{\hat{\rho}}\ket{E^{\prime}}-\bra{E}{\hat{\rho}}\ket{E^{\prime}}\bra{E^{\prime}}{\hat{\rho}}\ket{E}\geq 0, (35)

and thus $\absolutevalue{\bra{E^{\prime}}{\hat{\rho}}\ket{E}}^{2}\leq p_{E}p_{E^{\prime}}$. The second inequality, ${\sqrt{\boldsymbol{p}}}^{T}\boldsymbol{B}_{i}{\sqrt{\boldsymbol{p}}}\geq p_{i}$, is proved analogously. ∎
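The collective bounds are straightforward to verify numerically. The sketch below (our own illustration) draws a random density matrix, written in the energy eigenbasis, and a random rank-one projective measurement, and checks ${\sqrt{\boldsymbol{p}}}^{T}\boldsymbol{A}_{i}{\sqrt{\boldsymbol{p}}}\leq p_{i}\leq{\sqrt{\boldsymbol{p}}}^{T}\boldsymbol{B}_{i}{\sqrt{\boldsymbol{p}}}$ for every outcome $i$:

```python
import numpy as np

rng = np.random.default_rng(4)
d = 4

# random density matrix, expressed in the energy eigenbasis
G = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
rho = G @ G.conj().T
rho /= np.trace(rho).real
sqrt_p = np.sqrt(np.real(np.diag(rho)))        # sqrt(p_E), p_E = <E|rho|E>

# random rank-1 projective measurement {Pi_i}
U = np.linalg.qr(rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d)))[0]

for i in range(d):
    Pi = np.outer(U[:, i], U[:, i].conj())     # matrix of <E|Pi_i|E'>
    p_i = np.real(np.trace(Pi @ rho))
    M = np.abs(Pi)                              # |<E|Pi_i|E'>|
    A = 2 * np.diag(np.diag(M)) - M             # Eq. (21), sign (-1)^(1+delta)
    B = M
    assert sqrt_p @ A @ sqrt_p <= p_i + 1e-10   # Eq. (33)
    assert p_i <= sqrt_p @ B @ sqrt_p + 1e-10
```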

Derivation of the analytic bound on the mean energy. Here we derive the recurrence formula for computing the lower and upper values of the bound

E_{\min}^{\mathrm{lin}}\leq\langle E\rangle\leq E_{\max}^{\mathrm{lin}}. (36)

We do this for the lower value, $E_{\min}^{\mathrm{lin}}$; the formula for the upper value follows analogously.

The lower bound is given by

E_{\min}^{\mathrm{lin}}=\sum_{l=1}^{N}E_{l}u_{l}. (37)

(Eq. (11) in the main text), where the $u_{l}$ follow a recurrence relation that we derive next. We abbreviate the lower indices as $l\equiv E_{l}$.

The idea behind the recurrence relation is described in Fig. 1 in the main text. Having bounds

a_{l}^{\max}\equiv a_{E_{l}}^{\max}\leq p_{E_{l}}\leq b_{E_{l}}^{\min}\equiv b_{l}^{\min}, (38)

(Eq. (3) in the main text), we find the lower bound on the mean energy by filling in the minimal probability of each energy given by $a_{l}^{\max}$ and topping it up to the maximum $b_{l}^{\min}$, from the lowest to the highest energy, until the probabilities sum to one.

What does this mean mathematically? Let us think of a “bottle” of probability with total volume equal to one, $V=1$. We start with all $u_{l}$ initialized at zero. We pour the minimal required amount given by $a_{E}^{\max}\leq p_{E}$ into each probability $u_{l}$. After this, we have

u1=a1maxu2=a2maxuN=aNmax\begin{split}u_{1}&=a_{1}^{\max}\\ u_{2}&=a_{2}^{\max}\\ &\cdots\\ u_{N}&=a_{N}^{\max}\\ \end{split} (39)

and the remaining volume in the bottle is $V=1-\sum_{l=1}^{N}a_{l}^{\max}$. We start topping each $u_{l}$ up to its maximum value, from the lowest to the highest energy eigenvalue, until we run out of probability in the bottle. For $l=1$, two cases can occur: either there is enough probability in the bottle to fill $u_{1}$ up to its maximum allowed value $b_{1}^{\min}$, or not. Mathematically, this topping-up is expressed as

u1=a1max+min{b1mina1max,1l=1Nalmax}.u_{1}=a_{1}^{\max}+\min\left\{b_{1}^{\min}-a_{1}^{\max},1-\sum_{l=1}^{N}a_{l}^{\max}\right\}. (40)

The term $a_{1}^{\max}$ can be absorbed into the minimum, which gives

u1=min{b1min,1l=2Nalmax}.u_{1}=\min\left\{b_{1}^{\min},1-\sum_{l=2}^{N}a_{l}^{\max}\right\}. (41)

The remaining volume in the bottle is $V=1-u_{1}-\sum_{l=2}^{N}a_{l}^{\max}$.

Next, we top up the second probability, which gives

u2=a2max+min{b2mina2max,1u1l=2Nalmax}=min{b2min,1u1l=3Nalmax}.\begin{split}u_{2}&=a_{2}^{\max}+\min\left\{b_{2}^{\min}-a_{2}^{\max},1-u_{1}-\sum_{l=2}^{N}a_{l}^{\max}\right\}\\ &=\min\left\{b_{2}^{\min},1-u_{1}-\sum_{l=3}^{N}a_{l}^{\max}\right\}.\end{split} (42)

The remaining volume is $V=1-\sum_{l=1}^{2}u_{l}-\sum_{l=3}^{N}a_{l}^{\max}$. We continue up to the maximal index $N$, obtaining the full recursive relation, Eq. (12) in the main text.

The recursive relation for EmaxlinE_{\max}^{\mathrm{lin}} is derived analogously.
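The bottle-filling procedure above translates directly into a short greedy routine. A sketch (our own, with illustrative names), which also reproduces the numbers of the example in Appendix B:

```python
import numpy as np

def mean_energy_bounds(E, a, b):
    """Greedy evaluation of E_min^lin and E_max^lin, cf. Eqs. (36)-(37):
    start each u_l at its floor a_l, then top up to the ceiling b_l,
    lowest (resp. highest) energy first, until the u_l sum to one.
    Assumes E sorted ascending, 0 <= a_l <= b_l, sum(a) <= 1 <= sum(b)."""
    def fill(order):
        u = np.array(a, dtype=float)
        budget = 1.0 - u.sum()              # probability left in the "bottle"
        for l in order:
            top_up = min(b[l] - u[l], budget)
            u[l] += top_up
            budget -= top_up
        return float(np.dot(E, u))
    n = len(E)
    return fill(range(n)), fill(reversed(range(n)))
```

For instance, with the eight-level spectrum of Eq. (48) and the constraints of Eq. (52), this routine returns the interval $[2.25,\,3.5]$ of Eq. (53).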

Appendix B Powerful improvement from collective bounds

Here we show an example in which the collective bounds

𝒑T𝑨i𝒑piminpimax𝒑T𝑩i𝒑,{\sqrt{\boldsymbol{p}}}^{T}\boldsymbol{A}_{i}{\sqrt{\boldsymbol{p}}}\ \leq\ p_{i}^{\min}\ \leq p_{i}^{\max}\ \leq\ {\sqrt{\boldsymbol{p}}}^{T}\boldsymbol{B}_{i}{\sqrt{\boldsymbol{p}}}, (43)

(Eq. (7) in the main text) where

pimin=mint[0,T]pi(t),pimax=maxt[0,T]pi(t),p_{i}^{\min}=\min_{t\in[0,T]}p_{i}(t),\quad p_{i}^{\max}=\max_{t\in[0,T]}p_{i}(t), (44)

provide a considerable improvement in the estimation of energy probabilities in comparison with using just the linear bound

aEmaxpEbEmin.a_{E}^{\max}\leq p_{E}\leq b_{E}^{\min}. (45)

(Eq. (3) in the main text.)

Consider a coarse-grained energy measurement 𝒞={P^E~}{\mathcal{C}}=\{{\hat{P}}_{\tilde{E}}\} given by the coarse-grained energy projectors

P^E~=E[E~,E~+ΔE)|EE|.{\hat{P}}_{\tilde{E}}=\sum_{E\in[\tilde{E},\tilde{E}+\Delta E)}|E\rangle\langle E|. (46)

$\Delta E$ denotes the resolution in measuring energy. Then $\boldsymbol{A}_{\tilde{E}}=\boldsymbol{B}_{\tilde{E}}$ are diagonal, and the inequalities of Eq. (43) yield

pE~=pE~max=pE~min=E[E~,E~+ΔE)pE.p_{\tilde{E}}=p_{\tilde{E}}^{\max}=p_{\tilde{E}}^{\min}=\sum_{E\in[\tilde{E},\tilde{E}+\Delta E)}p_{E}. (47)

This fixes the sum of the energy probabilities exactly, making the determination of the mean energy much more precise. This is in stark contrast to Eq. (45), which yields only the much less restrictive $p_{E}\leq p_{\tilde{E}}$ for each $E\in[\tilde{E},\tilde{E}+\Delta E)$.

To give an example, consider a Hamiltonian ${\hat{H}}=\sum_{i=1}^{8}E_{i}|E_{i}\rangle\langle E_{i}|$ with the following spectrum

{E1,,E8}={0, 1, 2, 2.5, 3, 3.3, 3.7, 4}.\{E_{1},\dots,E_{8}\}=\{0,\,1,\,2,\,2.5,\,3,\,3.3,\,3.7,\,4\}. (48)

Fix the resolution in measuring energy to $\Delta E=1$. This results in coarse-grained energy projectors, Eq. (46), representing the coarse energy measurement ${\mathcal{C}}=\{{\hat{P}}_{\tilde{E}_{j}}\}_{j=1}^{5}$ as

P^E~1\displaystyle{\hat{P}}_{\tilde{E}_{1}} =|00|,\displaystyle=|0\rangle\langle 0|,
P^E~2\displaystyle{\hat{P}}_{\tilde{E}_{2}} =|11|,\displaystyle=|1\rangle\langle 1|,
P^E~3\displaystyle{\hat{P}}_{\tilde{E}_{3}} =|22|+|2.52.5|,\displaystyle=|2\rangle\langle 2|+|2.5\rangle\langle 2.5|, (49)
P^E~4\displaystyle{\hat{P}}_{\tilde{E}_{4}} =|33|+|3.33.3|+|3.73.7|,\displaystyle=|3\rangle\langle 3|+|3.3\rangle\langle 3.3|+|3.7\rangle\langle 3.7|,
P^E~5\displaystyle{\hat{P}}_{\tilde{E}_{5}} =|44|.\displaystyle=|4\rangle\langle 4|.

This indicates that the measurement device cannot distinguish between energy states $\ket{2}$ and $\ket{2.5}$, for example, because they are too close in energy.

Consider an initial state

|ψ=(|2+|3)/2.\ket{\psi}=(\ket{2}+\ket{3})/\sqrt{2}. (50)

Knowing this state, we can compute

p0=0,p1=0,p2=1/2,p2.5=0,p3=1/2,p3.3=0,p3.7=0,p4=0.\begin{split}p_{0}&=0,\ p_{1}=0,\ p_{2}=1/2,\ p_{2.5}=0,\\ p_{3}&=1/2,\ p_{3.3}=0,\ p_{3.7}=0,\ p_{4}=0.\end{split} (51)

However, the experimenter performing a coarse-grained measurement 𝒞{\mathcal{C}} on many copies of this initial state does not know that. Instead, using Eqs. (45), they derive

p0=0,p1=0,p21/2,p2.51/2,p31/2,p3.31/2,p3.71/2,p4=0.\begin{split}p_{0}&=0,\ p_{1}=0,\ p_{2}\leq 1/2,\ p_{2.5}\leq 1/2,\\ p_{3}&\leq 1/2,\ p_{3.3}\leq 1/2,\ p_{3.7}\leq 1/2,\ p_{4}=0.\end{split} (52)

The right-hand sides were obtained from the outcomes of the coarse-grained measurement: half of the time, they obtain outcome $\tilde{E}_{3}$, and the other half they get $\tilde{E}_{4}$. If they were to estimate the mean energy of the state from these equations alone, they would obtain

2×12+2.5×12=2.25E3.5=3.3×12+3.7×12.2\times\frac{1}{2}+2.5\times\frac{1}{2}=2.25\leq\langle E\rangle\leq 3.5=3.3\times\frac{1}{2}+3.7\times\frac{1}{2}. (53)

However, from Eqs. (47), which follow from the collective bounds, they obtain an additional set of equations,

12=p2+p2.5,12=p3+p3.3+p3.7.\frac{1}{2}=p_{2}+p_{2.5},\quad\frac{1}{2}=p_{3}+p_{3.3}+p_{3.7}. (54)

The left-hand sides were obtained from the experimental outcomes. Using this additional set of equations, the experimenter is able to derive a noticeably tighter bound on the mean energy,

2×12+3×12=2.5E3.1=2.5×12+3.7×12.2\times\frac{1}{2}+3\times\frac{1}{2}=2.5\leq\langle E\rangle\leq 3.1=2.5\times\frac{1}{2}+3.7\times\frac{1}{2}. (55)

This improvement will be dramatic in systems with many energy eigenstates, leading to much coarser projectors.
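The tighter interval of Eq. (55) follows from a one-line optimization: once each coarse bin's total probability is fixed by Eq. (54), the mean energy is extremized by pushing each bin's weight onto its lowest or highest energy. A sketch (our own illustration):

```python
# coarse bins from Eq. (54): (total probability, energies in the bin)
bins = [(0.5, [2.0, 2.5]), (0.5, [3.0, 3.3, 3.7])]

lower = sum(w * min(es) for w, es in bins)   # each bin's weight on its lowest energy
upper = sum(w * max(es) for w, es in bins)   # each bin's weight on its highest energy

# reproduces the interval [2.5, 3.1] of Eq. (55)
assert abs(lower - 2.5) < 1e-12
assert abs(upper - 3.1) < 1e-12
```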

Appendix C Quality factors

We can employ two quality factors to assess the performance of the method for bounding the mean energy: the first,

Q1=(1EmaxEminENE1)×100%,Q_{1}=\left(1-\frac{E_{\max}-E_{\min}}{E_{N}-E_{1}}\right)\times 100\%, (56)

which measures the range of excluded energy, and the second,

Q2=(1N[Emin,Emax]N)×100%,Q_{2}=\left(1-\frac{N_{[E_{\min},E_{\max}]}}{N}\right)\times 100\%, (57)

which measures the percentage of “excluded” energy eigenstates. N[Emin,Emax]N_{[E_{\min},E_{\max}]} denotes the number of energy eigenstates with energy between EminE_{\min} and EmaxE_{\max}, and NN is the dimension of the Hilbert space.

Figure 4: Illustration of the performance when estimating energy by local Pauli-$x$ measurements in a simple qubit example. We show the quality factor $Q_{1}$—the percentage of excluded energies—when estimating the energy given by the Hamiltonian ${\hat{H}}=\hat{\sigma}_{z}$ by measuring in the $\hat{\sigma}_{x}$ basis, for different initial states on the Bloch sphere. Left panel: quality factor as a function of the initial state. The value $Q_{1}=1$ (light orange; attained for the state $\ket{\psi}=\ket{+}$) corresponds to perfect identification of the energy, while $Q_{1}=0$ (dark blue; attained for the state $\ket{\psi}=\ket{0}$) corresponds to failure in identifying the energy. Right panel: the quality factor over initial states if we allow the state to evolve under the Hamiltonian and measure at different times $t\in[0,\pi/2)$, building up the statistics of outcomes at each time in this interval. The Hamiltonian rotates the state about the $z$-axis, so during the time evolution, the state picks up the best quality factor at its latitude from the left panel.

Appendix D Simple example

Consider a Hamiltonian given by the Pauli-$z$ matrix,

{\hat{H}}=\hat{\sigma}_{z}=\begin{pmatrix}1&0\\ 0&-1\end{pmatrix}, (58)

which has energy eigenvalues $E_{0}=-1$ and $E_{1}=1$, corresponding to eigenstates $\ket{0}$ and $\ket{1}$, respectively. The task is to estimate the energy of a general pure qubit state,

\ket{\psi}=\cos\frac{\theta}{2}\ket{0}+e^{i\phi}\sin\frac{\theta}{2}\ket{1}, (59)

where $0\leq\theta\leq\pi$ and $0\leq\phi\leq 2\pi$.

We consider two different two-outcome measurements to estimate energy. We will use the combined bound for the energy probabilities,

|pEipi|E|i|2|bEipi|E|i|2,\absolutevalue{p_{E}-\sum_{i}p_{i}\absolutevalue{\innerproduct{E}{i}}^{2}}\leq b_{E}-\sum_{i}p_{i}\absolutevalue{\innerproduct{E}{i}}^{2}, (60)

where $b_{E}=\big{(}\sum_{i}\sqrt{p_{i}}\absolutevalue{\innerproduct{E}{i}}\big{)}^{2}$, which is easily derived from Eq. (1) in the main text.

First, consider measuring in the $z$-basis, i.e., in the eigenbasis of the operator $\hat{M}=\hat{\sigma}_{z}$. This defines the measurement ${\mathcal{C}}=\{|0\rangle\langle 0|,|1\rangle\langle 1|\}$. Because the measurement basis coincides with the eigenbasis of the Hamiltonian, we are measuring the energy directly, so we expect the exact result. From the bound above, we have

|pE0p0|0,|pE1p1|0,\begin{split}\absolutevalue{p_{E_{0}}-p_{0}}&\leq 0,\\ \absolutevalue{p_{E_{1}}-p_{1}}&\leq 0,\end{split} (61)

independent of the initial state $\ket{\psi}$. Clearly, $p_{E_{0}}=p_{0}$, $p_{E_{1}}=p_{1}$, and $E_{\min}^{\mathrm{lin}}=\langle E\rangle=E_{\max}^{\mathrm{lin}}$, as expected.

Second, consider measuring in the $x$-basis, i.e., in the eigenbasis of the operator $\hat{M}=\hat{\sigma}_{x}$. This defines ${\mathcal{C}}=\{|+\rangle\langle+|,|-\rangle\langle-|\}$. We have

|pE012|12((p++p)21),|pE112|12((p++p)21).\begin{split}\absolutevalue{p_{E_{0}}-\tfrac{1}{2}}&\leq\tfrac{1}{2}\big{(}(\sqrt{p_{+}}+\sqrt{p_{-}})^{2}-1\big{)},\\ \absolutevalue{p_{E_{1}}-\tfrac{1}{2}}&\leq\tfrac{1}{2}\big{(}(\sqrt{p_{+}}+\sqrt{p_{-}})^{2}-1\big{)}.\end{split} (62)

This means that if the state is aligned with the $x$-axis, for example $p_{+}=1$, then $p_{E_{0}}=p_{E_{1}}=\tfrac{1}{2}$ and we can determine the energy exactly as $\langle E\rangle=0$. On the contrary, if the state is aligned with the $z$-axis, implying $p_{+}=p_{-}=\frac{1}{2}$, then $\absolutevalue{p_{E_{0}}-\tfrac{1}{2}}\leq\tfrac{1}{2}$ and $\absolutevalue{p_{E_{1}}-\tfrac{1}{2}}\leq\tfrac{1}{2}$, which we can rewrite as $0\leq p_{E_{0}}\leq 1$ and $0\leq p_{E_{1}}\leq 1$. Thus we obtain a trivial bound,

1=EminlinEEmaxlin=1.-1=E_{\min}^{\mathrm{lin}}\leq\langle E\rangle\leq E_{\max}^{\mathrm{lin}}=1. (63)

Generally, we have

p+=12(1+cosϕsinθ),p=12(1cosϕsinθ),p_{+}=\tfrac{1}{2}(1+\cos\phi\sin\theta),\quad p_{-}=\tfrac{1}{2}(1-\cos\phi\sin\theta), (64)

which gives

|pE012|121cos2ϕsin2θ,|pE112|121cos2ϕsin2θ.\begin{split}\absolutevalue{p_{E_{0}}-\tfrac{1}{2}}&\leq\tfrac{1}{2}\sqrt{1-\cos^{2}\phi\sin^{2}\theta},\\ \absolutevalue{p_{E_{1}}-\tfrac{1}{2}}&\leq\tfrac{1}{2}\sqrt{1-\cos^{2}\phi\sin^{2}\theta}.\end{split} (65)

This yields

1cos2ϕsin2θ=EminlinEEmaxlin=1cos2ϕsin2θ.-\!\sqrt{1\!-\!\cos^{2}\!\phi\sin^{2}\!\theta}\!=\!E_{\min}^{\mathrm{lin}}\!\leq\langle E\rangle\leq\!E_{\max}^{\mathrm{lin}}\!=\!\sqrt{1\!-\!\cos^{2}\!\phi\sin^{2}\!\theta}. (66)

We visualize the corresponding quality factor (the percentage of excluded energies, see Eq. (56))

Q1=11cos2ϕsin2θQ_{1}=1-\sqrt{1\!-\!\cos^{2}\!\phi\sin^{2}\!\theta} (67)

for this general case on the Bloch sphere in Fig. 4.
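These closed-form expressions are easy to verify numerically. Below is a minimal Python sketch (the angles θ and φ are arbitrary illustrative choices, not taken from the text) checking that the bound of Eq. (60) reduces to the closed form of Eq. (65) and holds for the exact energy-level probability:

```python
import numpy as np

theta, phi = np.pi / 3, np.pi / 4   # arbitrary Bloch angles (assumption)
p_plus  = 0.5 * (1 + np.cos(phi) * np.sin(theta))   # Eq. (64)
p_minus = 0.5 * (1 - np.cos(phi) * np.sin(theta))
# bound of Eq. (60): for the x-basis measurement every overlap |<E|i>|^2 = 1/2
b = 0.5 * ((np.sqrt(p_plus) + np.sqrt(p_minus))**2 - 1)
analytic = 0.5 * np.sqrt(1 - np.cos(phi)**2 * np.sin(theta)**2)  # Eq. (65)
Q1 = 1 - np.sqrt(1 - np.cos(phi)**2 * np.sin(theta)**2)          # Eq. (67)
p_E0 = np.cos(theta / 2)**2   # exact probability of the |0> energy level
assert np.isclose(b, analytic)        # the two forms of the bound agree
assert abs(p_E0 - 0.5) <= b + 1e-12   # the bound holds for the true p_E0
```

The identity used is (√p₊+√p₋)² − 1 = 2√(p₊p₋) = √(1 − cos²φ sin²θ).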

Next, we consider time evolution. We have

|ψt=eiH^t|ψ=cosθ2|0+ei(ϕ2t)sinθ2|1.\ket{\psi_{t}}=e^{-i{\hat{H}}t}\ket{\psi}=\cos\frac{\theta}{2}\ket{0}+e^{i(\phi-2t)}\sin\frac{\theta}{2}\ket{1}. (68)

Measuring σ^x\hat{\sigma}_{x} at time tt bounds the energy probabilities as

|pE012|121cos2(ϕ2t)sin2θ,|pE112|121cos2(ϕ2t)sin2θ.\begin{split}\absolutevalue{p_{E_{0}}-\tfrac{1}{2}}&\leq\tfrac{1}{2}\sqrt{1-\cos^{2}(\phi-2t)\sin^{2}\theta},\\ \absolutevalue{p_{E_{1}}-\tfrac{1}{2}}&\leq\tfrac{1}{2}\sqrt{1-\cos^{2}(\phi-2t)\sin^{2}\theta}.\end{split} (69)

By measuring at different times t[0,π/2)t\in[0,\pi/2) and keeping the tightest of the resulting bounds, we obtain

|pE012|12|cosθ|,|pE112|12|cosθ|,\begin{split}\absolutevalue{p_{E_{0}}-\tfrac{1}{2}}&\leq\tfrac{1}{2}\absolutevalue{\cos\theta},\\ \absolutevalue{p_{E_{1}}-\tfrac{1}{2}}&\leq\tfrac{1}{2}\absolutevalue{\cos\theta},\end{split} (70)

which leads to bounds on energy

|cosθ|=EminlinEEmaxlin=|cosθ|.-\absolutevalue{\cos\theta}=E_{\min}^{\mathrm{lin}}\leq\langle E\rangle\leq E_{\max}^{\mathrm{lin}}=\absolutevalue{\cos\theta}. (71)

The corresponding quality factor Q1=1|cosθ|Q_{1}=1-\absolutevalue{\cos\theta} is again plotted in Fig. 4.
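The effect of intersecting the single-time bounds of Eq. (69) can also be checked numerically; a short sketch (the Bloch angles are again arbitrary illustrative choices) recovers the tightened bound of Eq. (70):

```python
import numpy as np

theta, phi = 2 * np.pi / 5, 1.1   # arbitrary Bloch angles (assumption)
ts = np.linspace(0, np.pi / 2, 4001, endpoint=False)
# single-time bound of Eq. (69) at each measurement time t
bounds = 0.5 * np.sqrt(1 - np.cos(phi - 2 * ts)**2 * np.sin(theta)**2)
best = bounds.min()               # intersect the bounds over all times
# the intersection reproduces the tightened bound |cos(theta)|/2 of Eq. (70)
assert np.isclose(best, 0.5 * abs(np.cos(theta)), atol=1e-5)
```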

Appendix E Estimation of the mean energy in experimentally relevant models

Here we show the simulations for energy estimation using measurements in the local number basis and then using optimized kk-local measurements. This continues the numerical simulations in the main text, which presented part of the results obtained for the Heisenberg model.

We simulate a number of models, including the Heisenberg model Porras and Cirac (2004); Pal and Huse (2010); Luitz et al. (2015), a paradigmatic model for studying many-body localization, and then several other experimentally relevant models. These are the Ising model Smith et al. (2016); Jurcevic et al. (2017); Zhang et al. (2017); Bingham et al. (2021), known for its frequent use in quantum simulators and quantum annealers, the XY model Lanyon et al. (2017); Friis et al. (2018); Brydges et al. (2019); Maier et al. (2019), a non-integrable long-range model, and the PXP model Bernien et al. (2017); Turner et al. (2018); Su et al. (2022), an archetypal model for many-body quantum scars.

The Hamiltonians and the corresponding parameters are given in Table 1. The simulations of energy estimation using local particle number measurements and optimized kk-local measurements are shown in Tables 2 and 3.

Most of the explanation needed to understand these numerical experiments is given in the main text. Below, we give details on the types of Hilbert space considered in our simulations, and we discuss the methods that we designed for the optimization over the kk-local measurements to estimate energy.

Hilbert space considered. In our simulations, we choose a different type of Hilbert space for each model, depending on the conservation laws and to match the experimental setups, see Table 1: In the Heisenberg and the XY models, the full Hilbert space splits into subspaces, each characterized by a definite value of the total spin along the zz axis, i.e., S^z=iσ^iz\hat{S}^{z}=\sum_{i}\hat{\sigma}_{i}^{z}. We work in the largest subspace, characterized by the value S^z=0\hat{S}^{z}=0. This conservation is also why the actual value of BB in the XY model is irrelevant. For the Ising model, the total spin along the zz axis is not conserved; only its parity is. In this case, we work in the even-parity subspace, which contains the Néel state. For the PXP model, the situation is more intricate: the presence of the projector operators (I^σ^iz)(\hat{I}-\hat{\sigma}_{i}^{z}) in the Hamiltonian introduces a non-trivial local constraint Turner et al. (2018). Consequently, the full Hilbert space shatters into many different subspaces, which are dynamically disconnected and have various dimensions Khemani et al. (2020). Inside each subspace, the dynamics is generically chaotic, with many-body scars present Turner et al. (2018); Serbyn et al. (2021); Chandran et al. (2022). However, our goal in using this model was to study the effect of the Hilbert-space shattering on the quality of the energy estimation. Therefore, we work in the full Hilbert space.
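For concreteness, the S^z=0\hat{S}^{z}=0 sector used for the Heisenberg and XY models can be enumerated directly; a short sketch (pure illustration of the dimension counting quoted in Table 1):

```python
from itertools import combinations

L = 10
# Computational basis states of the S^z = 0 sector: exactly L/2 spins up.
sector = [set(ups) for ups in combinations(range(L), L // 2)]
print(len(sector))  # dimension of the subspace -> 252, matching Table 1
```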

Figure 5: Sketch of allowed operations—kk-local measurements. k=0k=0 corresponds to measuring in the local number basis. k=1k=1 allows for using single-site unitary operators leading to a general single-site measurement, k=2k=2 for two-site local operators, etc.

Methods for optimizing local measurements. We introduce three methods of analytical optimization for the kk-local measurements. A sketch of kk-local measurements is shown in Fig. 5, which corresponds to Fig. 3 (a) in the main text. A kk-local measurement consists of applying a kk-local unitary operator and then measuring in the computational basis.

name characteristics conservation Hamiltonian parameters
Heisenberg
(integrable,
delocalized,
localized)
many-body
localizing
total spin
conserving
H^=i(σ^ixσ^i+1x+σ^iyσ^i+1y+σ^izσ^i+1z)+ihiσ^iz\displaystyle{\hat{H}}=\sum_{i}\big{(}\hat{\sigma}_{i}^{x}\hat{\sigma}_{i+1}^{x}+\hat{\sigma}_{i}^{y}\hat{\sigma}_{i+1}^{y}+\hat{\sigma}_{i}^{z}\hat{\sigma}_{i+1}^{z}\big{)}+\sum_{i}h_{i}\hat{\sigma}_{i}^{z}
hi[W,W]h_{i}\in[-W,W] drawn randomly
(W=0W=0, W=0.5W=0.5, W=10W=10)
Ising
(delocalized,
localized)
gapped and
many-body
localizing
parity
conserving
H^=i<jJijσ^ixσ^jx+12i(B+hi)σ^iz\displaystyle{\hat{H}}\!=\!\sum_{i<j}J_{ij}\hat{\sigma}_{i}^{x}\hat{\sigma}_{j}^{x}+\frac{1}{2}\sum_{i}(B+h_{i})\hat{\sigma}_{i}^{z}
J0=1J_{0}=1, α=1.13\alpha=1.13, B=4B=4,
hi[W,W]h_{i}\in[-W,W] drawn randomly
(W=0W=0, W=8W=8)
XY
gapped and
long-range
total spin
conserving
H^=i<jJij(σ^i+σ^j+σ^iσ^j+)+Biσ^iz\displaystyle{\hat{H}}\!=\!\sum_{i<j}J_{ij}\big{(}\hat{\sigma}_{i}^{+}\hat{\sigma}_{j}^{-}+\hat{\sigma}_{i}^{-}\hat{\sigma}_{j}^{+}\big{)}+B\sum_{i}\hat{\sigma}_{i}^{z} J0=1J_{0}=1, α=1.24\alpha=1.24, B=0B=0
PXP
quantum
scars
Hilbert space
shattering
H^=Ω4i(I^σ^iz)σ^i+1x(I^σ^i+2z)\displaystyle{\hat{H}}\!=\!\tfrac{\Omega}{4}\sum_{i}\big{(}{\hat{I}}-\hat{\sigma}_{i}^{z}\big{)}\hat{\sigma}_{i+1}^{x}\big{(}{\hat{I}}-\hat{\sigma}_{i+2}^{z}\big{)} Ω=1\Omega=1
Sizes and Hilbert spaces considered in the numerical experiments
Small systems Large systems
Heisenberg 6 sites, 3 particles D=20D=20 10 sites, 5 particles D=252D=252
Ising 6 sites, even parity subspace D=32D=32 10 sites, even parity subspace D=512D=512
XY 6 sites, 3 particles D=20D=20 10 sites, 5 particles D=252D=252
PXP 5 sites, full Hilbert space D=32D=32 10 sites, full Hilbert space D=1024D=1024
Table 1: Table of models used in our simulations and the Hilbert spaces considered with dimension DD. σ^ix\hat{\sigma}_{i}^{x} denotes the Pauli-x matrix at site ii, and similarly for the Pauli-y and Pauli-z matrices. σ^i+\hat{\sigma}_{i}^{+} and σ^i\hat{\sigma}_{i}^{-} denote the spin creation and annihilation operators, respectively. The Ising and XY models have a non-local interaction of the form Jij=J0/|ij|αJ_{ij}=J_{0}/\absolutevalue{i-j}^{\alpha}. Parameters were taken to match those employed in experiments.
initial states:  \filledstar\filledstar G  \filleddiamond\filleddiamond C  \bullet H
Ham. small system large system
ground state-optimized observable-optimized
Heisenberg
(integrable)
[Uncaptioned image] [Uncaptioned image] [Uncaptioned image]
Heisenberg
(delocalized)
[Uncaptioned image] [Uncaptioned image] [Uncaptioned image]
Heisenberg
(localized)
[Uncaptioned image] [Uncaptioned image] [Uncaptioned image]
Ising
(delocalized)
[Uncaptioned image] [Uncaptioned image] [Uncaptioned image]
Ising
(localized)
[Uncaptioned image] [Uncaptioned image] [Uncaptioned image]
Table 2: Estimating energy with the local number and kk-local optimized measurements (see Fig. 5), for various Hamiltonians and sizes of Hilbert space delineated in Table 1. The initial state is either a ground state (G), a pure thermal state (C - cold), Eq. (19) in the main text, or a state drawn randomly from the Hilbert space with the Haar measure (H - hot). Small systems (left panel): the graphs show the true mean energy E\langle E\rangle (single symbol), intervals [Eminlin,Emaxlin][E_{\min}^{\mathrm{lin}},E_{\max}^{\mathrm{lin}}] (full lines) and [Emin,Emax][E_{\min},E_{\max}] (dashed lines), for each state ordered from top to bottom, and the list of energy eigenvalues at the very bottom. Large systems: the true mean energy E\langle E\rangle (single symbol), intervals [Eminlin(k),Emaxlin(k)][E_{\min}^{{\mathrm{lin}}(k)},E_{\max}^{{\mathrm{lin}}(k)}] (full lines), denoting analytic bounds computed for kk-local measurements, k=0,1,2,5,10k=0,1,2,5,10 (see Fig. 5), using the ground state-optimized method (middle panel), and the observable-optimized type 1 method (right panel). See the observable-optimized type 2 method for the PXP model in Fig. 6. Observations: 1) There is little difference between the integrable and delocalized phases of the Heisenberg model in estimating energy. Bethe integrability does not seem to play a role. 2) Estimation of energy in the localized phase of the Heisenberg model works well for both small and large systems. It managed to exclude Q1=97.5%Q_{1}=97.5\% of the range of energies when estimating the ground state energy using two-qubit (k=2k=2) measurements. This is due to a large overlap between energy eigenstates and the local number basis. 3) Estimation of the ground state energy in the Ising model works well both in the localized (Q1=96.7%Q_{1}=96.7\% for k=2k=2) and delocalized (Q1=92.9%Q_{1}=92.9\% for k=2k=2) phases. This is due to the low entanglement in the ground state. Please see the continuation of this table in Table 3.
XY
[Uncaptioned image] [Uncaptioned image] [Uncaptioned image]
PXP
[Uncaptioned image] [Uncaptioned image] [Uncaptioned image]
Table 3: The continuation of Table 2. 4) In the XY and PXP models, the local particle number measurement (k=0k=0) does not determine the ground state energy well. This is because low- and high-energy eigenstates produce similar (or, in the case of PXP, identical) distributions of outcomes. 5) Estimating the hot state energy in the PXP model works significantly better than estimating the ground state energy. This is because of the high degeneracy in the middle of the spectrum. 6) The ground state-optimized method performs better for ground states than the observable-optimized method but performs worse on average.

a. Ground state-optimized measurements. First, we introduce a method that optimizes the kk-local measurement for a specific state, in our case, the ground state. This method is inspired by the Matrix Product State ansatz Orús (2014), and by the correspondence between observational and entanglement entropy Schindler et al. (2020).

The motivation goes as follows: low observational entropy means that the system state has wandered into one of the small subspaces (macrostates) defined by the measurement Šafránek et al. (2019b); Šafránek and Thingna (2020). This means that we can estimate the maximal and the minimal value of the estimated observable in that subspace, which, in turn, translates into estimating these bounds for the system state itself. In other words, lower observational entropy means better estimates. According to Ref. Schindler et al. (2020), observational entropy minimized over local coarse-grainings leads to entanglement entropy, and the minimum is achieved when the local coarse-grainings are given by the Schmidt basis. This means that measuring in the Schmidt basis yields small observational entropy and, in turn, a better estimate of the mean value of the observable. Thus, we look for kk-local measurements that reflect the Schmidt basis, expecting them to perform well in the estimation.

We do this as follows: Assuming we have a chain of length LL (for example, L=10L=10) and some divisor kk of LL (for example, k=2k=2), we first divide the chain into two parts: one — system A1A_{1} — of length kk (i.e., sites (1,2)(1,2)) and the other — system B1B_{1} — of length LkL-k (i.e., sites (3,4,5,6,7,8,9,10)(3,4,5,6,7,8,9,10)). We define ρ^0{\hat{\rho}}_{0} as the ground state (or any other state we want to optimize for). We compute the reduced density matrix

ρ^A1=trB1ρ^0,{\hat{\rho}}_{A_{1}}=\tr_{B_{1}}{\hat{\rho}}_{0}, (72)

and diagonalize it. The eigenbasis of the reduced density matrix is, by definition, the system A1A_{1}-local part of the Schmidt basis. We denote this basis as {|ψi1A1}\{\ket{\psi_{i_{1}}^{A_{1}}}\}.

Then we move on to the next kk sites. We divide the system into A2A_{2} (sites (3,4)(3,4)) and B2B_{2} (sites (1,2,5,6,7,8,9,10)(1,2,5,6,7,8,9,10)). Again, we compute the reduced density matrix

ρ^A2=trB2ρ^0,{\hat{\rho}}_{A_{2}}=\tr_{B_{2}}{\hat{\rho}}_{0}, (73)

and find its eigenvectors, which we denote {|ψi2A2}\{\ket{\psi_{i_{2}}^{A_{2}}}\}.

We continue dividing the system until the end (in our example, we go up to A5A_{5}). The final, ground state-optimized kk-local measurement basis is then given by

{|ψi1A1|ψi2A2|ψi3A3|ψi4A4|ψi5A5}.\{\ket{\psi_{i_{1}}^{A_{1}}}\otimes\ket{\psi_{i_{2}}^{A_{2}}}\otimes\ket{\psi_{i_{3}}^{A_{3}}}\otimes\ket{\psi_{i_{4}}^{A_{4}}}\otimes\ket{\psi_{i_{5}}^{A_{5}}}\}. (74)
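This construction can be sketched in a few lines of numpy. For illustration, we take a shorter chain than in the text (L=4, k=2) and a random pure state as a stand-in for the ground state; both choices are assumptions made here for brevity:

```python
import numpy as np

rng = np.random.default_rng(0)
L, k = 4, 2                      # short chain, blocks of two qubits (assumption)
dim, dk = 2**L, 2**k
psi = rng.normal(size=dim) + 1j * rng.normal(size=dim)
psi /= np.linalg.norm(psi)       # stand-in for the ground state (assumption)

def block_eigenbasis(psi, block_start):
    """Eigenbasis of the reduced density matrix of sites [block_start, block_start + k)."""
    t = psi.reshape([2] * L)
    # move the block's axes to the front, flatten the rest (the environment)
    order = list(range(block_start, block_start + k)) + \
            [i for i in range(L) if not block_start <= i < block_start + k]
    m = np.transpose(t, order).reshape(dk, -1)
    rho = m @ m.conj().T         # reduced density matrix, as in Eqs. (72)-(73)
    _, vecs = np.linalg.eigh(rho)
    return vecs                  # columns = local part of the Schmidt basis

U_blocks = [block_eigenbasis(psi, s) for s in range(0, L, k)]
basis = np.kron(U_blocks[0], U_blocks[1])   # product basis, as in Eq. (74)
assert np.allclose(basis.conj().T @ basis, np.eye(dim))  # orthonormal basis
```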

b. Observable-optimized measurements, type 1. Second, we introduce a method of kk-local optimization for a specific observable, in our case, the Hamiltonian.

The motivation behind this optimization is that the estimation of the mean value of the observable works better the more the measurement resembles the eigenbasis of the observable. Thus, we design a procedure that generates a measurement resembling the eigenbasis of the estimated observable.

We illustrate this method on the Heisenberg model, assuming L=10L=10, which is given by the Hamiltonian (assuming hard-wall boundary conditions)

H^=i=19(σ^ixσ^i+1x+σ^iyσ^i+1y+σ^izσ^i+1z)+i=110hiσ^iz.{\hat{H}}=\sum_{i=1}^{9}\big{(}\hat{\sigma}_{i}^{x}\hat{\sigma}_{i+1}^{x}+\hat{\sigma}_{i}^{y}\hat{\sigma}_{i+1}^{y}+\hat{\sigma}_{i}^{z}\hat{\sigma}_{i+1}^{z}\big{)}+\sum_{i=1}^{10}h_{i}\hat{\sigma}_{i}^{z}. (75)

For k=1k=1-local measurement, we remove all the terms spanning more than a single site. This leads to a modified Hamiltonian

H^1=i=110hiσ^iz.{\hat{H}}_{1}=\sum_{i=1}^{10}h_{i}\hat{\sigma}_{i}^{z}. (76)

We call the eigenbasis of this Hamiltonian the k=1k=1-local observable-optimized measurement for the Hamiltonian. Incidentally, in this case, this basis is precisely the same as the computational basis.

For k=2k=2-local measurement, we divide the lattice into blocks of two sites and remove all the terms that cross those blocks. This leads to

H^2=i=1,3,5,7,9(σ^ixσ^i+1x+σ^iyσ^i+1y+σ^izσ^i+1z)+i=110hiσ^iz.{\hat{H}}_{2}=\sum_{i=1,3,5,7,9}\big{(}\hat{\sigma}_{i}^{x}\hat{\sigma}_{i+1}^{x}+\hat{\sigma}_{i}^{y}\hat{\sigma}_{i+1}^{y}+\hat{\sigma}_{i}^{z}\hat{\sigma}_{i+1}^{z}\big{)}+\sum_{i=1}^{10}h_{i}\hat{\sigma}_{i}^{z}. (77)

The eigenbasis of this Hamiltonian is the k=2k=2-local observable-optimized measurement for the Hamiltonian.

For k=5k=5-local measurement, we divide the lattice into two blocks of five sites and remove all the terms that cross those blocks. This leads to

H^5=i=1,,4,6,,9(σ^ixσ^i+1x+σ^iyσ^i+1y+σ^izσ^i+1z)+i=110hiσ^iz.{\hat{H}}_{5}=\sum_{i=1,\dots,4,6,\dots,9}\big{(}\hat{\sigma}_{i}^{x}\hat{\sigma}_{i+1}^{x}+\hat{\sigma}_{i}^{y}\hat{\sigma}_{i+1}^{y}+\hat{\sigma}_{i}^{z}\hat{\sigma}_{i+1}^{z}\big{)}+\sum_{i=1}^{10}h_{i}\hat{\sigma}_{i}^{z}. (78)

The eigenbasis of this Hamiltonian is the k=5k=5-local observable-optimized measurement for the Hamiltonian.

In the case of k=10k=10-local measurement, the corresponding Hamiltonian is the original Hamiltonian itself,

H^10=H^.{\hat{H}}_{10}={\hat{H}}. (79)

Measuring in the basis of this Hamiltonian is the same as measuring the Hamiltonian itself, which yields perfect precision in estimating its mean value.
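The truncation above can be sketched in numpy on a shorter chain than in the text (L=4, k=2, with random fields as an illustrative assumption): dropping the bonds that cross block boundaries leaves a Hamiltonian whose eigenbasis is the kk-local observable-optimized measurement.

```python
import numpy as np

# Pauli matrices
sx = np.array([[0, 1], [1, 0]], complex)
sy = np.array([[0, -1j], [1j, 0]], complex)
sz = np.array([[1, 0], [0, -1]], complex)
I2 = np.eye(2)

def op(single, site, L):
    """Embed a single-site operator at `site` in an L-site chain."""
    mats = [I2] * L
    mats[site] = single
    out = mats[0]
    for m in mats[1:]:
        out = np.kron(out, m)
    return out

rng = np.random.default_rng(1)
L, k = 4, 2
h = rng.uniform(-1, 1, size=L)   # random fields (assumption: W = 1)

def heisenberg(bonds):
    """Heisenberg chain keeping only the listed nearest-neighbor bonds."""
    H = sum(h[i] * op(sz, i, L) for i in range(L))
    for i in bonds:
        for s in (sx, sy, sz):
            H = H + op(s, i, L) @ op(s, i + 1, L)
    return H

H_full = heisenberg(range(L - 1))        # full chain, as in Eq. (75)
H_k = heisenberg(range(0, L - 1, k))     # bonds crossing blocks removed, cf. Eq. (77)
_, V = np.linalg.eigh(H_k)               # columns: k=2 observable-optimized basis
assert not np.allclose(H_full, H_k)      # a crossing bond was indeed dropped
assert np.allclose(V.conj().T @ V, np.eye(2**L))  # a valid measurement basis
```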

Figure 6: Observable-optimized type 2 method for the large system of the PXP model. The PXP model is the only one of our considered models in which type 1 and type 2 differ. Notice the worse performance for k=1k=1 compared to k=0k=0 for the cold and hot states and the better performance for k=1,2k=1,2 compared to the type 1 method for the ground state.

c. Observable-optimized measurements, type 2. Alternatively, one can consider a different way of finding observable-optimized measurements, somewhat similar to the ground-state optimization method. For k=2k=2 and L=10L=10, divide the system into system A1A_{1} (the first two sites (1,2)(1,2)) and system B1B_{1} (the last LkL-k sites (3,4,5,6,7,8,9,10)(3,4,5,6,7,8,9,10)). Then compute the “reduced Hamiltonian”

H^A1=trB1H^,{\hat{H}}^{A_{1}}=\tr_{B_{1}}{\hat{H}}, (80)

and diagonalize it, obtaining its eigenbasis {|ψi1A1}\{\ket{\psi_{i_{1}}^{A_{1}}}\}. Continue analogously as in the ground-state optimization method to generate the global observable-optimized basis for k=2k=2,

{|ψi1A1|ψi2A2|ψi3A3|ψi4A4|ψi5A5}.\{\ket{\psi_{i_{1}}^{A_{1}}}\otimes\ket{\psi_{i_{2}}^{A_{2}}}\otimes\ket{\psi_{i_{3}}^{A_{3}}}\otimes\ket{\psi_{i_{4}}^{A_{4}}}\otimes\ket{\psi_{i_{5}}^{A_{5}}}\}. (81)

In our numerical experiments, due to the specific forms of the Hamiltonian, type 1 and type 2 observable-optimized measurements differ only in the PXP model. See Fig. 6.
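The partial trace in Eq. (80) can be sketched directly; the two-site toy Hamiltonian below is an illustrative assumption, chosen so that the traced-out result is easy to verify by hand (the crossing term is traceless in the environment and drops out):

```python
import numpy as np

sx = np.array([[0, 1], [1, 0]], complex)
sz = np.array([[1, 0], [0, -1]], complex)

def partial_trace_B(H, dA, dB):
    """Trace out the last dB-dimensional factor: (H^A)_{ij} = sum_b H_{ib,jb}."""
    return H.reshape(dA, dB, dA, dB).trace(axis1=1, axis2=3)

# toy two-site Hamiltonian (assumption): H = sx (x) sx + sz (x) I
H = np.kron(sx, sx) + np.kron(sz, np.eye(2))
HA = partial_trace_B(H, 2, 2)
# the crossing term sx (x) sx is traceless in B, so HA = 2 * sz
assert np.allclose(HA, 2 * sz)
_, V = np.linalg.eigh(HA)   # eigenbasis {|psi^{A_1}>}, as in Eq. (80)
```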

d. Generating the unitary. Finally, we want to express the optimized measurement as a unitary operator applied to the system's state, followed by a measurement in the computational basis. This is illustrated in Fig. 5. Assuming our example k=2k=2 and L=10L=10 again, we derive the formula for UA1U^{A_{1}} by requiring that the probability of an outcome is the same in both situations:

j1,j2|UA1ρ^UA1|j1,j2=!ψi1A1|ρ^|ψi1A1,\bra{j_{1},j_{2}}U^{A_{1}}{\hat{\rho}}U^{A_{1}{\dagger}}\ket{j_{1},j_{2}}\overset{!}{=}\bra{\psi_{i_{1}}^{A_{1}}}{\hat{\rho}}\ket{\psi_{i_{1}}^{A_{1}}}, (82)

where |j1,j2\ket{j_{1},j_{2}} is a computational basis vector, |ψi1A1\ket{\psi_{i_{1}}^{A_{1}}} an optimized basis vector on the first two sites, and we want to match each pair j1,j2j_{1},j_{2} to one i1i_{1}. A sufficient condition is

|ψi1A1=UA1|j1,j2.\ket{\psi_{i_{1}}^{A_{1}}}=U^{A_{1}{\dagger}}\ket{j_{1},j_{2}}. (83)

Assuming that |ψi1A1\ket{\psi_{i_{1}}^{A_{1}}} is a column vector written in the computational basis and that each site is a qubit (which leads to i1=1,2,3,4i_{1}=1,2,3,4), we have

UA1=(|ψ1A1|ψ2A1|ψ3A1|ψ4A1).U^{A_{1}{\dagger}}=\begin{pmatrix}&&&\\ \ket{\psi_{1}^{A_{1}}}&\ket{\psi_{2}^{A_{1}}}&\ket{\psi_{3}^{A_{1}}}&\ket{\psi_{4}^{A_{1}}}\\ &&&\\ \end{pmatrix}. (84)

This means that

UA1=(|ψ1A1|ψ2A1|ψ3A1|ψ4A1)=(ψ1A1|ψ2A1|ψ3A1|ψ4A1|).U^{A_{1}}=\begin{pmatrix}\ket{\psi_{1}^{A_{1}}}^{\dagger}\\ \ket{\psi_{2}^{A_{1}}}^{\dagger}\\ \ket{\psi_{3}^{A_{1}}}^{\dagger}\\ \ket{\psi_{4}^{A_{1}}}^{\dagger}\end{pmatrix}=\begin{pmatrix}\bra{\psi_{1}^{A_{1}}}\\ \bra{\psi_{2}^{A_{1}}}\\ \bra{\psi_{3}^{A_{1}}}\\ \bra{\psi_{4}^{A_{1}}}\end{pmatrix}. (85)

The global optimized measurement then consists of applying the unitary operation

U=UA1UA5=(ψ1A1|ψ2A1|ψ3A1|ψ4A1|)(ψ1A5|ψ2A5|ψ3A5|ψ4A5|)U=U^{A_{1}}\otimes\cdots\otimes U^{A_{5}}=\begin{pmatrix}\bra{\psi_{1}^{A_{1}}}\\ \bra{\psi_{2}^{A_{1}}}\\ \bra{\psi_{3}^{A_{1}}}\\ \bra{\psi_{4}^{A_{1}}}\end{pmatrix}\otimes\cdots\otimes\begin{pmatrix}\bra{\psi_{1}^{A_{5}}}\\ \bra{\psi_{2}^{A_{5}}}\\ \bra{\psi_{3}^{A_{5}}}\\ \bra{\psi_{4}^{A_{5}}}\end{pmatrix} (86)

on the system and then measuring in the computational basis.
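The construction of Eqs. (82)-(85) can be checked numerically for a single block; the sketch below uses a random orthonormal basis as a stand-in for the optimized basis (an illustrative assumption) and verifies that rotating by UU and measuring in the computational basis reproduces the optimized-basis probabilities:

```python
import numpy as np

rng = np.random.default_rng(2)
d = 4   # one two-qubit block (k = 2)
# random orthonormal "optimized basis" {|psi_i^{A_1}>}: columns of Q (assumption)
Q, _ = np.linalg.qr(rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d)))
U = Q.conj().T                      # rows are the bras, as in Eq. (85)

psi = rng.normal(size=d) + 1j * rng.normal(size=d)
psi /= np.linalg.norm(psi)
rho = np.outer(psi, psi.conj())

# computational-basis probabilities after applying U to the state ...
p_rotated = np.real(np.diag(U @ rho @ U.conj().T))
# ... equal the probabilities of measuring directly in the optimized basis, Eq. (82)
p_direct = np.array([np.real(Q[:, i].conj() @ rho @ Q[:, i]) for i in range(d)])
assert np.allclose(p_rotated, p_direct)
assert np.isclose(p_rotated.sum(), 1.0)   # a valid probability distribution
```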

References