
Constructing Driver Hamiltonians for Optimization Problems with Linear Constraints

Hannes Leipold1,2, Federico M. Spedalieri1,3
1Information Sciences Institute, University of Southern California, Marina del Rey, CA 90292, USA
2Department of Computer Science, University of Southern California, Los Angeles, CA 90089, USA
3Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA 90089, USA
Abstract

Recent advances in adiabatic quantum computing and the closely related field of quantum annealing have centered on using more advanced and novel Hamiltonian representations to solve optimization problems. One such advance is the development of driver Hamiltonians that commute with the constraints of an optimization problem, offering another avenue to satisfying those constraints besides imposing a penalty term for each of them. In particular, this approach can embed several practical problems on quantum devices with sparser connectivity than the standard penalty-term approach requires. However, designing driver Hamiltonians that commute with several constraints has largely relied on strong intuition for specific problems, with no simple general algorithm for generating them for arbitrary constraints. In this work, we develop a simple and intuitive algebraic framework for reasoning about the commutation of Hamiltonians with linear constraints, one that allows us to classify the complexity of finding a driver Hamiltonian for an arbitrary set of linear constraints as NP-Complete. Because unitary operators are exponentials of Hermitian operators, these results also apply to the construction of mixers in the Quantum Alternating Operator Ansatz (QAOA) framework.

I Introduction

Quantum annealing has been proposed as a heuristic method to exploit quantum mechanical effects in order to solve discrete optimization problems. Typically, these problems require optimizing a quadratic cost function subject to a set of linear constraints. The usual approach to treating these constraints consists of adding them to the cost function as penalty terms, thereby transforming the constrained optimization into an unconstrained one. This approach has some drawbacks, including an increase in the required resources (i.e., higher connectivity and increased dynamical range for the parameters that define the instance). In Ref. [hen2016quantum], the authors introduced the idea of constrained quantum annealing (CQA), which uses specially tailored driver Hamiltonians for a specific set of constraints. These tailored Hamiltonians have several advantages, such as reducing the size of the search space of the problem and reducing the number of interactions needed to implement the annealing protocol. At the heart of the approach is the idea that a Hamiltonian which commutes with the operator embedding of the constraints, and which starts within the feasible space of configurations, will remain in it throughout the evolution.

While Ref. [hen2016quantum] looked primarily at commuting with a single global constraint, the work in Ref. [hen2016driver] focused on finding driver Hamiltonians for several constraints. Under special conditions, the authors were able to construct appropriate driver Hamiltonians for several optimization problems of practical interest. In this paper, we ask and answer the general question: given a set of arbitrary linear constraints, can we construct a driver Hamiltonian that commutes with the operator embedding of the constraints? Our main result is that this problem is NP-Complete, answering a question originally posited in Ref. [hen2016driver]. Along the way, we derive a simple formula describing the commutation relation and exploit it to understand many facets of CQA. These results apply naturally to the Quantum Alternating Operator Ansatz (QAOA) framework [hadfield2017quantum], where one of the central tasks is to find mixer operators that connect feasible solutions of a constrained optimization problem. Since every unitary operator is the exponential of a Hermitian operator (for every unitary matrix $U$, there exists a Hermitian matrix $H$ such that $U=e^{iH}$), and since $[e^{iH},\hat{C}]=\sum_{k=0}^{\infty}[(iH)^{k},\hat{C}]/k!$, the existence of a unitary operator with this commutation property implies the existence of a Hermitian operator with the same property, so our results translate directly into this setting as well.

The paper is organized as follows. In Section II we review the basic ideas behind constrained quantum annealing. In Section III, we derive a simple algebraic condition for the commutation relation of the driver Hamiltonian and the constraint operators. For many practical applications, Hamiltonians with bounded weight (local) interaction terms are often desired; in Section IV we show how to brute force the algebraic condition from Section III to find driver Hamiltonians of bounded weight. In Section V, we introduce several variations of the problem ILP-QCOMMUTE, the problem of finding a Hermitian matrix that commutes with the constraint operators. We reduce the EQUAL SUBSET SUM problem, which is known to be NP-Complete, to ILP-QCOMMUTE (and in Appendix A we do the same for the special case of binary valued linear constraints), thus proving our main result. We also define a related problem, ILP-QCOMMUTE-k-LOCAL, which asks for a Hermitian matrix that commutes with the constraints and has interaction terms of weight at most $k$; ILP-QCOMMUTE-k-LOCAL is in P by the approach detailed in Section IV. Other questions of interest, such as finding a driver Hamiltonian that has full reachability over a constrained space for a CQA protocol, are addressed through our formulation in Section VI. We conclude with a discussion of the significance of our result and open problems related to what we have shown in this work.

II Background

In the quantum annealing (QA) framework, gradually decreasing quantum fluctuations are used to traverse the barriers of an energy landscape in the search for global minima of complicated cost functions [kadowaki1998quantum; farhi2000quantum]. For an overview of these approaches, we refer the reader to Ref. [albash2018adiabatic]. Quantum annealing has gained traction for combinatorial optimization [farhi2001quantum; santoro2006optimization; bian2016mapping; hauke2020perspectives] as a way to solve hard optimization problems faster and, more recently, for machine learning [adachi2015application; mott2017solving; amin2018quantum; biamonte2017quantum; li2018quantum; kumar2018quantum] as a way to naturally sample desired probability distributions quickly. In the case of solving an optimization problem, the problem is encoded in a Hamiltonian $H_p$ such that its ground state is the optimum solution. Usually this is readily done by expressing the problem as an Ising model, a model for spin glasses [brush1967history; sherrington1975solvable; castellani2005spin; troyer2005computational].

Once the problem Hamiltonian is described, the QA framework prescribes an evolution to the final Hamiltonian from some readily preparable Hamiltonian $H_d$, usually through a linear interpolation of $H_d$ and $H_p$:

$H(s)=s\,H_{p}+(1-s)\,H_{d}$,  (1)

where $s(t)$ is a continuous, smooth function for $t\in[0,T]$ with $s(0)=0$ and $s(T)=1$. If the process is varied slowly enough, the adiabatic theorem ensures that the wavefunction of the system will be close to the instantaneous ground state of the system for any $s$, and therefore any $t$. By the adiabatic theorem, if the total evolution time $T$ is large compared to the inverse of the minimum gap squared, then the final wavefunction of the system will be close to the ground state of $H_p$. For the purposes of our presentation, we restrict our focus to binary linear optimization problems, a heavily studied optimization class. Specifically, we consider a set of linear constraints $\mathcal{C}=\{C_1,\ldots,C_m\}$, such that a solution state $x\in\{0,1\}^n$ satisfies $C_i(x)=\sum_j c_{ij}x_j=b_i$ for some $b_i$. Because each $C_i$ is a simple linear function, we can associate with it a vector $\vec{c}_i\in\mathbb{Z}^n$ such that $C_i(x)=\vec{c}_i\cdot\vec{x}$. When referring to constraints throughout this paper, we mean specifically linear constraints, to which our main results pertain.
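As a concrete illustration of this classical setup, the constraints can be collected into a matrix and feasibility of a bitstring checked by a matrix-vector product (a small sketch assuming numpy; the particular constraint matrix and values are made up for illustration):

```python
import numpy as np

# hypothetical linear constraints C_i(x) = c_i . x = b_i on n = 4 binary variables
C = np.array([[1, 1, 1, 1],    # c_1: all four variables sum to b_1
              [1, 0, 1, 0]])   # c_2: x_1 and x_3 sum to b_2
b = np.array([2, 1])

def feasible(x):
    """Check whether a bitstring satisfies every linear constraint."""
    return np.array_equal(C @ np.asarray(x), b)

print(feasible([0, 1, 1, 0]))  # True:  constraint values are (2, 1)
print(feasible([1, 0, 0, 0]))  # False: constraint values are (1, 1)
```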

II.1 Constraint Quantum Annealing for Integer Linear Programming

We use the ordinary embedding of binary variables $x_i\in\{0,1\}$ in the computational basis for quantum annealing, such that $\vec{x}\in\{0,1\}^n$ is represented by $|\vec{x}\rangle=|x_1\rangle\ldots|x_n\rangle\in\mathbb{C}^{2^n}$ (i.e. $\sigma_i^z|\vec{x}\rangle=(1-2x_i)|\vec{x}\rangle$), and the final Hamiltonian is diagonal in the computational basis so that we can read off a solution by measuring in that basis. Following the framework of CQA, given a constraint $C(x)=\vec{c}\cdot\vec{x}$, we associate with $C$ an embedded constraint operator $\hat{C}=\sum_{i=1}^n c_i\sigma_i^z$. Let us consider the case of a single constraint, $\vec{c}=(1,\ldots,1)$, over $n$ variables; this is also the first case presented in Ref. [hen2016quantum]. It is simple to check that $H_d=\sum_{i=1}^{n-1}(\sigma_i^x\sigma_{i+1}^x+\sigma_i^y\sigma_{i+1}^y)$ commutes with the embedded constraint operator $\hat{C}=\sum_{i=1}^n\sigma_i^z$. For example, this type of constraint may arise in graph partitioning, since the partitions must split the graph into equal sizes. In the Graph Partition problem, one is given a graph $G$ and asked to partition the vertices $V$ into two equal subsets such that the number of edges between them is minimized. In terms of the Ising model, we can consider a collection of $n$ qubits, such that $|0\rangle$ ($|1\rangle$) for qubit $i$ represents placing vertex $v_i$ in partition 1 (2). We then design a penalty Hamiltonian $H_p$ and a driver Hamiltonian $H_d$ such that the final state is a solution to the graph partitioning problem. Assuming the transverse field driver Hamiltonian, $H_d=\sum_{i=1}^n\sigma_i^x$, a simple penalty Hamiltonian is:

$H_{p}=\sum_{(i,j)\in E}\left(\mathbb{1}-\sigma_{i}^{z}\sigma_{j}^{z}\right)+\alpha\left(\sum_{i=1}^{n}\sigma_{i}^{z}\right)^{2}$,  (2)

where the first term assigns a positive energy penalty to each edge that connects vertices across the partitions and the second term is the constraint operator squared.

In general the penalty factor $\alpha$ must be greater than $\min(2\,d_m,n)/8$, where $d_m$ is the maximal degree of $G$ [lucas2014ising]. Note that the term $\left(\sum_{i=1}^n\sigma_i^z\right)^2$ is $\hat{C}^2$ and requires on the order of $n^2$ two-body interaction terms to implement. However, if we choose our $H_d$ such that $[H_d,\hat{C}]=0$, then we can use the simpler penalty Hamiltonian:

$H_{p}=\sum_{(i,j)\in E}\left(\mathbb{1}-\sigma_{i}^{z}\sigma_{j}^{z}\right)$.  (3)

Note that since HpH_{p} is diagonal in the spin-z basis, it trivially commutes with the constraints.

One benefit of this construction is that the driver $H_d=\sum_{i=1}^{n-1}(\sigma_i^x\sigma_{i+1}^x+\sigma_i^y\sigma_{i+1}^y)$, for example, commutes with $\hat{C}$ and requires only $n-1$ two-body interaction terms to implement. As such, the total number of two-body terms required to solve the problem can be greatly reduced by using driver Hamiltonians beyond the transverse field, provided they commute with the set of constraints. As long as the initial wavefunction is an eigenstate of $\hat{C}$ with the eigenvalue corresponding to an equal partition (eigenvalue $0$, or $\pm 1$ if $n$ is odd), the wavefunction will remain in that subspace for the entirety of the anneal. As an example of Graph Partition that we will return to later, consider a graph with 4 vertices $V=\{v_1,v_2,v_3,v_4\}$, connected into a single path by edges $E=\{e_1,e_2,e_3\}$ with $e_1=(v_1,v_2)$, $e_2=(v_2,v_3)$, $e_3=(v_3,v_4)$. In this case, $H_d=(\sigma_1^x\sigma_2^x+\sigma_1^y\sigma_2^y)+(\sigma_2^x\sigma_3^x+\sigma_2^y\sigma_3^y)+(\sigma_3^x\sigma_4^x+\sigma_3^y\sigma_4^y)$. Since we are interested in an even partition, the starting state should have an equal number of 1s and 0s; for example, $|0011\rangle$ is in the correct subspace. It is important to note that driver Hamiltonians are constructed irrespective of the value that the constraint is set to, since the eigenvalue of the constraint operator on the initial wavefunction determines what value is preserved during the anneal.
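The commutation claims above can be verified numerically for this 4-qubit example (a quick sketch assuming numpy; operators are built as Kronecker products following the conventions of this section):

```python
import numpy as np
from functools import reduce

I2 = np.eye(2, dtype=complex)
sx = np.array([[0, 1], [1, 0]], dtype=complex)
sy = np.array([[0, -1j], [1j, 0]])
sz = np.diag([1.0, -1.0]).astype(complex)

def op(single, i, n):
    """Embed a single-qubit operator at site i (0-indexed) among n qubits."""
    mats = [I2] * n
    mats[i] = single
    return reduce(np.kron, mats)

n = 4
C_hat = sum(op(sz, i, n) for i in range(n))          # \hat{C} = sum_i sigma_i^z
H_d = sum(op(sx, i, n) @ op(sx, i + 1, n)
          + op(sy, i, n) @ op(sy, i + 1, n)
          for i in range(n - 1))                     # XY-chain driver
H_tf = sum(op(sx, i, n) for i in range(n))           # transverse-field driver

comm = lambda A, B: A @ B - B @ A
print(np.allclose(comm(H_d, C_hat), 0))   # True: the XY driver preserves the constraint
print(np.allclose(comm(H_tf, C_hat), 0))  # False: the transverse field does not
```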
For a wavefunction that is not an eigenvector of a constraint operator, the commutation of the driver Hamiltonian with the constraint operator means that the projection of the wavefunction onto each eigenspace evolves independently of the rest of the wavefunction.

For the purpose of adiabatic quantum computing, the initial wavefunction of the system has to be the ground state of the initial Hamiltonian, while a general driver Hamiltonian from this construction can be highly nontrivial if there are many constraints. Therefore, specifying an alternative initial Hamiltonian for CQA is also a major area of research, since we want the initial Hamiltonian to have a ground state that is easy to prepare. One approach is to use an initial Hamiltonian that is diagonal in the computational basis and linear in the $\sigma^z$ operators, whose ground state is a specific feasible solution in the spin-z basis, and then evolve from this initial Hamiltonian to the driver Hamiltonian (whose ground state will have support on all or a subset of the feasible space). There are many hard problems for which finding a feasible solution is simple. For example, it is straightforward to find a single balanced partition for Graph Partition, but it is hard to find the best one. As such, a useful avenue for exploiting CQA is the case where linear constraints specify a nontrivial feasible space, but finding a nonoptimal element of that space is still tractable.

The work in Ref. [hen2016driver] extended the framework to cases where a driver should commute with multiple constraint operators. In particular, given a set $\mathcal{C}$, they consider finding a Hamiltonian $H_d$ such that:

$[H_{d},\hat{C}_{j}]=0,\quad C_{j}\in\mathcal{C}$.  (4)

As they note, tailoring driver Hamiltonians for a general set $\mathcal{C}$ can be difficult. In this paper, we settle the computational complexity of this task by reducing an NP-Complete problem to ILP-QCOMMUTE. We also discuss the related task of finding an $H_d$ that can reach every state in the solution space, but no state outside it, in Section VI. This result may appear intuitive in some ways: describing the feasible space of $\mathcal{C}$ is hard, and a Hamiltonian that keeps a wavefunction within this space, and only this space, should require some characterization of it. Indeed, simply knowing that a nontrivial $H_d$ exists for an NP-Complete feasibility problem allows one to conclude that the problem has at least two solutions for some set of constraint values, even if one does not have a witness to prove it.

III An Algebraic Condition for Commuting with Linear Constraints

Consider the problem of finding Hamiltonian drivers that have, as their eigenvectors, support over the possible values satisfying the given linear constraints. In the most general sense, we consider constraints of the form:

$\hat{C}=\sum_{i=1}^{n}c_{i}\,\sigma_{i}^{z},\quad\vec{c}\in\mathbb{Z}^{n}$,  (5)

with a constraint value $b$ that corresponds to one of the energy levels of $\hat{C}$. Problems of practical interest can often be captured in the restricted case where $\vec{c}\in\{0,1\}^n$ or $\vec{c}\in\{-1,0,1\}^n$.

Consider the linear transformation $M\mapsto[M,\sigma^z]$ that maps any two-by-two matrix $M$ to a new two-by-two matrix $M'$ by commuting it with $\sigma^z$. This transformation has two obvious eigenmatrices, $\mathbb{1}$ and $\sigma^z$, which span its kernel. One can easily verify that $\sigma^+$ ($|0\rangle\langle 1|$) and $\sigma^-$ ($|1\rangle\langle 0|$) are also eigenmatrices of this transformation, with eigenvalues $-2$ and $+2$ respectively. Together these four eigenmatrices and their eigenvalues describe the spectral decomposition of the transformation. We exploit this fact to find a simple algebraic formula expressing the commutation of a general Hamiltonian with a linear constraint. It is easy to verify that for $H$ over $n$ qubits, if $\mathrm{Tr}_{1,\ldots,i-1,i+1,\ldots,n}[H]=\sigma^{\pm}$, then $[H,\sigma_i^z]=\mp 2\,H$.
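These single-qubit eigenrelations can be checked directly (a small numpy sketch; here $\sigma^+=|0\rangle\langle 1|$ as above, so $[\sigma^{\pm},\sigma^z]=\mp 2\,\sigma^{\pm}$):

```python
import numpy as np

sz = np.diag([1.0, -1.0]).astype(complex)        # sigma^z, with |0> spin-up
sp = np.array([[0, 1], [0, 0]], dtype=complex)   # sigma^+ = |0><1|
sm = sp.conj().T                                 # sigma^- = |1><0|
I2 = np.eye(2, dtype=complex)

comm = lambda M: M @ sz - sz @ M                 # the map M -> [M, sigma^z]

# identity and sigma^z span the kernel of the transformation
assert np.allclose(comm(I2), 0) and np.allclose(comm(sz), 0)
# sigma^+ and sigma^- are eigenmatrices with eigenvalues -2 and +2
assert np.allclose(comm(sp), -2 * sp)
assert np.allclose(comm(sm), +2 * sm)
```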

Given any complete basis for a single-qubit system, we can extend it to a basis over $n$ qubits. Doing this with the eigenmatrices found above defines the basis $\{\mathbb{1},\sigma^z,\sigma^+,\sigma^-\}^{\otimes n}$. Note as well that $(\alpha_j\sigma_i^{\pm})^{\dagger}=\alpha_j^{\dagger}\sigma_i^{\mp}$ for $\alpha_j\in\mathbb{C}$. This suggests a simple representation in which a Hermitian matrix is defined by its nonzero terms over this basis. Any Hermitian matrix can then be written in the form:

$H=\sum_{(\vec{y}_j,\vec{v}_j,\vec{w}_j)\in\Delta(\mathscr{Y},\mathscr{V},\mathscr{W})}\alpha_j\bigotimes_{i=1}^{n}(\sigma^z)^{y_{ji}}(\sigma^+)^{v_{ji}}(\sigma^-)^{w_{ji}}+\sum_{(\vec{y}_j,\vec{v}_j,\vec{w}_j)\in\Delta(\mathscr{Y},\mathscr{V},\mathscr{W})}\alpha_j^{\dagger}\bigotimes_{i=1}^{n}(\sigma^z)^{y_{ji}}(\sigma^+)^{w_{ji}}(\sigma^-)^{v_{ji}}$,  (6)

where $\mathscr{Y}=\{\vec{y}_1,\ldots,\vec{y}_r\}$, $\mathscr{V}=\{\vec{v}_1,\ldots,\vec{v}_r\}$, and $\mathscr{W}=\{\vec{w}_1,\ldots,\vec{w}_r\}$, with $\vec{y}_i,\vec{v}_i,\vec{w}_i\in\{0,1\}^n$, are such that the corresponding $\alpha_i\neq 0$. Here $\Delta(\mathscr{Y},\mathscr{V},\mathscr{W})=\{(\vec{y}_i,\vec{v}_i,\vec{w}_i)\,|\,\vec{y}_i\in\mathscr{Y},\vec{v}_i\in\mathscr{V},\vec{w}_i\in\mathscr{W}\}$; that is, $\Delta$ simply takes the indexed sets and forms the set of index-wise tuples. A tuple $(\vec{y}_i,\vec{v}_i,\vec{w}_i)$ specifies the indices at which we chose $\sigma^z$, $\sigma^+$, or $\sigma^-$ for each nonzero term. Once that choice is made, hermiticity demands that the corresponding second term in Eq. 6 be part of the Hamiltonian as well. However, there are restrictions on which vectors can be chosen. Specifically, $\vec{y}_i\cdot\vec{w}_i=0$ and $\vec{y}_i\cdot\vec{v}_i=0$, since choosing both $\sigma^z$ and a $\sigma^{\pm}$ at the same site would simply select $\sigma^{\pm}$ with the coefficient $\pm\alpha_j$ instead (because $\sigma^z\sigma^{\pm}=\pm\sigma^{\pm}$). Likewise, $\vec{v}_i\cdot\vec{w}_i=0$, since otherwise the term would be equivalent to two terms with half the coefficient each, one taking $\sigma^z$ and the other the identity (because $\sigma^+\sigma^-=(\mathbb{1}+\sigma^z)/2$). These added constraints on $\Delta(\mathscr{Y},\mathscr{V},\mathscr{W})$ make the representation unique, which must be enforced before applying the theorem below, since the uniqueness of the basis will be actively used.
As an example, consider the driver Hamiltonian discussed in the previous section: $H_d=\sum_{i=1}^{n-1}(\sigma_i^x\sigma_{i+1}^x+\sigma_i^y\sigma_{i+1}^y)=2\sum_{i=1}^{n-1}(\sigma_i^+\sigma_{i+1}^-+\sigma_i^-\sigma_{i+1}^+)$. For this Hamiltonian, $\mathscr{Y}=\{\vec{0},\ldots,\vec{0}\}$, $\mathscr{V}=\{\vec{e}_1,\ldots,\vec{e}_{n-1}\}$, $\mathscr{W}=\{\vec{e}_2,\ldots,\vec{e}_n\}$, where $\vec{e}_i$ denotes the standard basis vectors. While the notation is somewhat cumbersome, it becomes useful for expressing our first major result:

Theorem III.1 (Algebraic Condition for Commutativity).

A Hermitian matrix $H$ commutes with the embedding $\hat{C}$ of a linear constraint $C$ if and only if $\vec{c}\cdot(\vec{v}_j-\vec{w}_j)=0$ for all $(\vec{v}_j,\vec{w}_j)\in\Delta(\mathscr{V},\mathscr{W})$.

Proof.

Using the form for HH we introduced earlier, we can see that:

$[H,\hat{C}]=\sum_{(\vec{y}_j,\vec{v}_j,\vec{w}_j)\in\Delta(\mathscr{Y},\mathscr{V},\mathscr{W})}\left[\alpha_j\bigotimes_{i=1}^{n}(\sigma^z)^{y_{ji}}(\sigma^+)^{v_{ji}}(\sigma^-)^{w_{ji}},\ \sum_{k=1}^{n}c_k\sigma_k^z\right]+\sum_{(\vec{y}_j,\vec{v}_j,\vec{w}_j)\in\Delta(\mathscr{Y},\mathscr{V},\mathscr{W})}\left[\alpha_j^{\dagger}\bigotimes_{i=1}^{n}(\sigma^z)^{y_{ji}}(\sigma^+)^{w_{ji}}(\sigma^-)^{v_{ji}},\ \sum_{k=1}^{n}c_k\sigma_k^z\right]$

$=\sum_{(\vec{y}_j,\vec{v}_j,\vec{w}_j)\in\Delta(\mathscr{Y},\mathscr{V},\mathscr{W})}2\,\alpha_j\left(\sum_{k=1}^{n}c_k(w_{jk}-v_{jk})\right)\bigotimes_{i=1}^{n}(\sigma^z)^{y_{ji}}(\sigma^+)^{v_{ji}}(\sigma^-)^{w_{ji}}+\sum_{(\vec{y}_j,\vec{v}_j,\vec{w}_j)\in\Delta(\mathscr{Y},\mathscr{V},\mathscr{W})}2\,\alpha_j^{\dagger}\left(\sum_{k=1}^{n}c_k(v_{jk}-w_{jk})\right)\bigotimes_{i=1}^{n}(\sigma^z)^{y_{ji}}(\sigma^+)^{w_{ji}}(\sigma^-)^{v_{ji}}$

$=-\sum_{(\vec{y}_j,\vec{v}_j,\vec{w}_j)\in\Delta(\mathscr{Y},\mathscr{V},\mathscr{W})}2\,\alpha_j\,\vec{c}\cdot(\vec{v}_j-\vec{w}_j)\bigotimes_{i=1}^{n}(\sigma^z)^{y_{ji}}(\sigma^+)^{v_{ji}}(\sigma^-)^{w_{ji}}+\sum_{(\vec{y}_j,\vec{v}_j,\vec{w}_j)\in\Delta(\mathscr{Y},\mathscr{V},\mathscr{W})}2\,\alpha_j^{\dagger}\,\vec{c}\cdot(\vec{v}_j-\vec{w}_j)\bigotimes_{i=1}^{n}(\sigma^z)^{y_{ji}}(\sigma^+)^{w_{ji}}(\sigma^-)^{v_{ji}}$  (7)

Since the basis elements indexed by the tuples $\{(\vec{y}_j,\vec{v}_j,\vec{w}_j)\}$ form a linearly independent set, Eq. 7 equals $0$ if and only if $\vec{c}\cdot(\vec{v}_j-\vec{w}_j)=0$ for all $j$. ∎

Consider the example discussed in Section II in the context of Theorem III.1. It is easy to see that $\vec{v}_1=(1,0,0,0)$, $\vec{w}_1=(0,1,0,0)$, $\vec{v}_2=(0,1,0,0)$, $\vec{w}_2=(0,0,1,0)$, $\vec{v}_3=(0,0,1,0)$, $\vec{w}_3=(0,0,0,1)$ satisfy the condition of Theorem III.1, defining $H_d=\sum_{i=1}^{3}(\sigma_i^+\sigma_{i+1}^-+\sigma_i^-\sigma_{i+1}^+)$. Note that further vector pairs also satisfy the condition, for example $\vec{v}_4=(0,0,0,1)$ and $\vec{w}_4=(1,0,0,0)$.
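The condition of Theorem III.1 is easy to check mechanically for this example (a short sketch assuming numpy; each pair encodes one $\sigma^+\sigma^-$ term of the driver):

```python
import numpy as np

c = np.array([1, 1, 1, 1])            # the constraint vector c = (1,...,1)
pairs = [((1, 0, 0, 0), (0, 1, 0, 0)),   # sigma_1^+ sigma_2^-  (+ h.c.)
         ((0, 1, 0, 0), (0, 0, 1, 0)),   # sigma_2^+ sigma_3^-  (+ h.c.)
         ((0, 0, 1, 0), (0, 0, 0, 1)),   # sigma_3^+ sigma_4^-  (+ h.c.)
         ((0, 0, 0, 1), (1, 0, 0, 0))]   # the extra pair (v_4, w_4)

# every pair satisfies c . (v - w) = 0, so each term commutes with C-hat
for v, w in pairs:
    assert c @ (np.array(v) - np.array(w)) == 0
print("all pairs satisfy the condition")
```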

IV Bounded weight drivers

The algebraic condition of Theorem III.1 can be used as a starting point to understand several features of CQA. One of them is motivated by the fact that actual implementations of quantum annealing will likely impose a bound on the weight of the driver operators available. Hence, given a set of linear constraints, we could restrict our search for commuting drivers to those with bounded weight.

Consider a set $\mathcal{C}=\{C_1,\ldots,C_m\}$ of linear constraints on $n$ variables, and let $C^M$ be the $m\times n$ matrix with coefficients $c_{ij}$ (recall $C_i(x)=\sum_j c_{ij}x_j$). Theorem III.1 then tells us that there is a non-diagonal driver commuting with all the constraints if and only if the linear system $C^M\vec{u}=\vec{0}$ has a nonzero solution $\vec{u}=\vec{v}-\vec{w}\in\{-1,0,1\}^n$. Furthermore, the number of nonzero components of $\vec{u}$ is a lower bound on the weight of such a driver (the weight could be higher if we allow $\sigma^z$ operators acting on the variables associated with the vanishing components of $\vec{u}$). This leads to a simple analysis of the case of bounded weight drivers.

Assume that we are only allowed weight-$k$ drivers, and further impose the condition that they are constructed from $k$ single-qubit operators that act nontrivially in the computational basis (in our case, chosen from $\{\sigma^+,\sigma^-\}$). Then the number of such operators is $2^{k-1}\binom{n}{k}$. For fixed $k$, this is polynomial in $n$, so it is tractable to check all possible vectors $\vec{u}$ and find which ones satisfy the condition $C^M\vec{u}=\vec{0}$. From those that do, we can construct the corresponding weight-$k$ driver that commutes with all the constraints by assigning $\sigma^+$ to qubit $i$ if $u_i=1$, and $\sigma^-$ if $u_i=-1$. This is a simple but very useful result in practice: brute force searching for solutions to the condition of Theorem III.1 finds all possible driver Hamiltonians, up to a given weight, that commute with the embedded constraint operators, in time polynomial in the system size.
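The brute-force search described above can be sketched as follows (assuming numpy; `weight_k_drivers` is an illustrative helper name). Each surviving $\vec{u}$ encodes a weight-$k$ term by placing $\sigma^+$ where $u_i=1$ and $\sigma^-$ where $u_i=-1$; fixing the first nonzero sign to $+1$ avoids double-counting Hermitian-conjugate pairs:

```python
import numpy as np
from itertools import combinations, product

def weight_k_drivers(CM, k):
    """All u in {-1,0,1}^n with exactly k nonzeros (up to Hermitian conjugation)
    satisfying CM @ u = 0; each yields a weight-k commuting driver term."""
    m, n = CM.shape
    hits = []
    for idx in combinations(range(n), k):
        # fix the first nonzero sign to +1: u and -u give conjugate terms
        for signs in product([1, -1], repeat=k - 1):
            u = np.zeros(n, dtype=int)
            u[idx[0]] = 1
            for j, s in zip(idx[1:], signs):
                u[j] = s
            if not CM.dot(u).any():     # C^M u = 0
                hits.append(u)
    return hits

# single constraint c = (1,1,1,1): weight-2 solutions are sigma_i^+ sigma_j^-
CM = np.array([[1, 1, 1, 1]])
sols = weight_k_drivers(CM, 2)
print(len(sols))   # 6: one pair (i < j) with u_i = +1, u_j = -1 per qubit pair
```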

The simplest and most relevant case for practical applications is that of 2-local operators, since 2-body interactions are more easily engineered than higher-order ones. In this case the condition of Theorem III.1 yields an even simpler characterization of when a set of constraints allows for commuting drivers.

Corollary IV.0.1.

Let $\mathcal{C}=\{C_1,\ldots,C_m\}$ be a set of linear constraints and $C^M$ the associated matrix of coefficients. Then a 2-local driver that commutes with all the constraints exists if and only if the matrix $C^M$ has a pair of columns that are either equal or opposite.

2-local means that $\vec{u}$ in the condition $C^M\vec{u}=\vec{0}$ has only 2 nonzero components, and we can take these to be $(1,1)$ or $(1,-1)$, since the other two possibilities would produce the corresponding Hermitian conjugates. Since multiplying a matrix by a column vector results in a linear combination of the columns of the matrix, with coefficients given by the vector components, the condition $C^M\vec{u}=\vec{0}$ states that $C^M$ has two columns that are equal (if the nonzero components of $\vec{u}$ are $(1,-1)$) or opposite (if they are $(1,1)$). To any distinct pair of columns satisfying one of these conditions we can associate a distinct weight-2 driver that commutes with the constraints, so the maximum number of such weight-2 drivers is $\binom{n}{2}$.
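In practice the corollary reduces to a scan over column pairs of $C^M$ (a sketch assuming numpy; the function name and example matrix are illustrative):

```python
import numpy as np
from itertools import combinations

def two_local_driver_pairs(CM):
    """Column pairs of CM that are equal or opposite; each gives a 2-local
    commuting driver: sigma_i^+ sigma_j^- + h.c. (equal columns, u = e_i - e_j)
    or sigma_i^+ sigma_j^+ + h.c. (opposite columns, u = e_i + e_j)."""
    n = CM.shape[1]
    pairs = []
    for i, j in combinations(range(n), 2):
        if np.array_equal(CM[:, i], CM[:, j]):
            pairs.append((i, j, 'equal'))
        elif np.array_equal(CM[:, i], -CM[:, j]):
            pairs.append((i, j, 'opposite'))
    return pairs

CM = np.array([[1, 2, 1, -1],
               [0, 1, 0,  0]])
print(two_local_driver_pairs(CM))
# [(0, 2, 'equal'), (0, 3, 'opposite'), (2, 3, 'opposite')]
```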

V The Problem ILP-QCOMMUTE

Having found a simple algebraic condition expressing the commutation of any Hermitian matrix with a linear constraint, we now exploit it to determine the complexity of deciding whether such a Hermitian matrix exists. We consider the following problem:

Definition (ILP-QCOMMUTE):

Given a set $\mathcal{C}=\{C_1,\ldots,C_m\}$ of linear constraints with $\hat{C}_i=\sum_{j=1}^n c_{ij}\sigma_j^z$ over a space $\mathbb{C}^{2^n}$, with $c_{ij}\in\mathbb{Z}$, is there a Hermitian matrix $H$, with $\mathcal{O}(\mathrm{poly}(n))$ nonzero coefficients over a basis $\{\chi_1,\chi_2,\chi_3,\chi_4\}^{\otimes n}$, such that $[H,\hat{C}_i]=0$ for all $\hat{C}_i$ and $H$ has at least one off-diagonal term in the spin-z basis?

Solving this problem would be useful for constructing Hamiltonian drivers for quantum annealing. We can also define 0-1-LP-QCOMMUTE as the binary version, where $c_{ij}\in\{0,1\}$, and {-1,0,1}-LP-QCOMMUTE, where $c_{ij}\in\{-1,0,1\}$, the type of coefficients used when representing problems like 1-in-3 3-SAT as an ILP. One of the central results of this paper is that these problems are NP-Complete [cook1971complexity; karp1975computational], which can be shown by reducing the EQUAL SUBSET SUM problem [karp1972reducibility] to them. This reduction is simple and straightforward for ILP-QCOMMUTE, and we discuss it in this section to give a sense of the connection between the two problems. However, that proof alone does not imply that 0-1-LP-QCOMMUTE and {-1,0,1}-LP-QCOMMUTE are also NP-Complete, since they could very well be easier subclasses of ILP-QCOMMUTE. That turns out not to be the case, but the proof for 0-1-LP-QCOMMUTE is more involved (and rather tedious) and is therefore presented in Appendix A.

Definition (EQUAL SUBSET SUM):

Given a set $S=\{s_1,s_2,\ldots,s_n\}$ with $s_i\in\mathbb{Z}^+$, are there two non-empty disjoint subsets $A,B\subseteq S$ such that $\sum_{a_i\in A}a_i=\sum_{b_i\in B}b_i$?

The EQUAL SUBSET SUM problem is known to be NP-Complete [woeginger1992equal]. We map an instance of EQUAL SUBSET SUM, defined over a set $S=\{s_1,s_2,\ldots,s_n\}$ with $s_i\in\mathbb{Z}^+$, to an instance of ILP-QCOMMUTE. Consider the constraint operator $\hat{C}=\sum_{i=1}^n s_i\sigma_i^z$ and the vector $\vec{s}=(s_1,\ldots,s_n)$. Suppose we can find vectors $\vec{v},\vec{w}$ with binary components such that $\vec{s}\cdot(\vec{v}-\vec{w})=0$ (the algebraic condition derived in Theorem III.1). Then the indices corresponding to the nonzero components of $\vec{v}$ and $\vec{w}$ identify the sets $A$ and $B$ (respectively) in the EQUAL SUBSET SUM problem.
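This correspondence can be exercised directly by scanning over the vectors $\vec{u}=\vec{v}-\vec{w}\in\{-1,0,1\}^n$ (an illustrative brute-force sketch in plain Python; exponential time, as expected for an NP-Complete problem):

```python
from itertools import product

def equal_subset_sum(s):
    """Brute force: find disjoint nonempty A, B with equal sums by scanning
    u in {-1,0,1}^n (u_i = +1 puts i in A, u_i = -1 puts i in B), i.e. s . u = 0."""
    n = len(s)
    for u in product([-1, 0, 1], repeat=n):
        if any(x == 1 for x in u) and any(x == -1 for x in u) \
           and sum(si * ui for si, ui in zip(s, u)) == 0:
            A = [i for i in range(n) if u[i] == 1]
            B = [i for i in range(n) if u[i] == -1]
            return A, B
    return None

print(equal_subset_sum([3, 5, 8]))   # index sets with 8 = 3 + 5
```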

Suppose there is a solution HH to ILP-QCOMMUTE. From Theorem III.1, it follows that ci(vw)=0\vec{c}_{i}\cdot\left(\vec{v}-\vec{w}\right)=0 for the vector c\vec{c} associated with constraint CC and any nonzero term in the basis {𝟙,σz,σ+,σ}n\{\mathbbm{1},\sigma^{z},\sigma^{+},\sigma^{-}\}^{\otimes n} with a term σ±\sigma^{\pm} on at least one qubit will be enough to define a new HH^{\prime} that will be associated with a solution to EQUAL SUBSET SUM. At least one such element exists for HH because HH has at least one off-diagonal term in the spin-z basis. We can associate any off-diagonal term of H with y,v,w\vec{y},\vec{v},\vec{w} such that H=αi=1n(σz)yi(σ+)vi(σ)wi+αi=1n(σz)yi(σ+)wi(σ)viH^{\prime}=\alpha\bigotimes_{i=1}^{n}\left(\sigma^{z}\right)^{y_{i}}\left(\sigma^{+}\right)^{v_{i}}\left(\sigma^{-}\right)^{w_{i}}+\alpha^{\dagger}\bigotimes_{i=1}^{n}\left(\sigma^{z}\right)^{y_{i}}\left(\sigma^{+}\right)^{w_{i}}\left(\sigma^{-}\right)^{v_{i}} is a matrix with only that off-diagonal term and its complex conjugate for some α\alpha and v0,w0\vec{v}\neq\vec{0},\vec{w}\neq\vec{0}. Then for a specific off-diagonal term, every non-zero entry in v\vec{v} between 11 and nn, call it ii, picks an integer siSs_{i}\in S for the set AA, and w\vec{w} does likewise for the set BB, providing a solution to the corresponding instance of EQUAL SUBSET SUM since s(vw)=(siAsi)(siBsi)=0\vec{s}\cdot(\vec{v}-\vec{w})=\left(\sum_{s_{i}\in A}s_{i}\right)-\left(\sum_{s_{i}\in B}s_{i}\right)=0. Suppose there is a solution A,BA,B to EQUAL SUBSET SUM, then define v\vec{v} such that vi=1v_{i}=1 (wi=1w_{i}=1) if and only if sis_{i} is in AA (BB). 
Then $H=\bigotimes_{i=1}^{n}\left(\sigma^{+}\right)^{v_{i}}\left(\sigma^{-}\right)^{w_{i}}+\bigotimes_{i=1}^{n}\left(\sigma^{+}\right)^{w_{i}}\left(\sigma^{-}\right)^{v_{i}}$ is a solution to ILP-QCOMMUTE since $\vec{s}\cdot\left(\vec{v}-\vec{w}\right)=\left(\sum_{s_{i}\in A}s_{i}\right)-\left(\sum_{s_{i}\in B}s_{i}\right)=0$. Hence, we have the following result.
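The correspondence above is easy to exercise on small instances. The following brute-force search (exponential in $n$ and for illustration only; the function name and encoding are our own) looks for binary vectors $\vec{v},\vec{w}$ with disjoint supports satisfying the algebraic condition of Theorem III.1, and hence for the sets $A$ and $B$:

```python
from itertools import product

def equal_subset_sum_to_driver(s):
    """Brute-force search (exponential in n; illustration only) for binary
    vectors v, w with disjoint, nonempty supports such that
    s . (v - w) == 0, the algebraic condition of Theorem III.1.  The
    supports of v and w give the sets A and B of EQUAL SUBSET SUM, and the
    pair (v, w) labels the driver term
    prod_i (sigma^+_i)^{v_i} (sigma^-_i)^{w_i} plus its Hermitian
    conjugate."""
    n = len(s)
    for v in product((0, 1), repeat=n):
        if not any(v):
            continue
        for w in product((0, 1), repeat=n):
            if not any(w):
                continue
            if any(vi and wi for vi, wi in zip(v, w)):
                continue  # A and B must be disjoint
            if sum(si * (vi - wi) for si, vi, wi in zip(s, v, w)) == 0:
                return v, w
    return None
```

On $S=\{1,2,3\}$ this finds $A=\{3\}$ and $B=\{1,2\}$; on $S=\{1,2,4\}$, which has no pair of disjoint subsets with equal sums, it returns None.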

Theorem V.1.

ILP-QCOMMUTE is NP-Hard.

Given this result, we can show NP-Completeness by noting that the presence of an off-diagonal term in the spin-z basis can be checked in polynomial time. Let $H$ be a proposed solution to ILP-QCOMMUTE such that there exist distinct indices $i,j$ with entry $h_{ij}\neq 0$. Checking that $H$ commutes with the constraints takes polynomial time. For $k\in\{1,\ldots,n\}$, check whether $\text{Tr}_{k}\left(H\sigma_{k}^{+}\right)\neq 0$ or $\text{Tr}_{k}\left(H\sigma_{k}^{-}\right)\neq 0$. $H$ has at least one entry $h_{ij}$ that is off-diagonal in the spin-z basis if and only if there exists a $k$ such that at least one of these traces is nonzero. Since the partial trace of a tensor product factors into the trace over the specified tensor component, this can be computed quickly.
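A minimal sketch of such a polynomial-time verifier, assuming the proposed $H$ is handed over symbolically as a list of $(\vec{v},\vec{w})$ patterns marking the $\sigma^{\pm}$ factors of its basis terms (this encoding and the function name are our own, not from the paper):

```python
def is_qcommute_certificate(terms, constraints):
    """Polynomial-time verifier sketch for ILP-QCOMMUTE.  Each term of the
    proposed H is given as a pair (v, w) of 0/1 tuples marking which qubits
    carry sigma^+ and sigma^- factors; by Theorem III.1 the term commutes
    with C_i iff c_i . (v - w) == 0.  Returns True iff every term commutes
    with every constraint and at least one term is off-diagonal in the
    spin-z basis."""
    off_diagonal = False
    for v, w in terms:
        for c in constraints:
            if sum(ci * (vi - wi) for ci, vi, wi in zip(c, v, w)) != 0:
                return False  # this term fails to commute with c
        if any(v) or any(w):
            off_diagonal = True  # carries at least one sigma^{+/-} factor
    return off_diagonal
```

For the constraint $\hat{C}=\sum_{i}\sigma_{i}^{z}$ on four qubits, a swap term between qubits 1 and 2 passes, while a lone $\sigma^{+}$ or a purely diagonal $H$ fails.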

Corollary V.1.1.

ILP-QCOMMUTE is NP-Complete.

While we have shown that this problem is NP-Hard, we note that for any specific instance the practical runtime can still be tractable, making this a useful avenue even for the hardest instances of optimization problems. Also note that since every unitary operator is the exponential of a corresponding Hermitian operator, knowing that a unitary operator commuting with the constraint operators exists is tantamount to knowing that a Hermitian matrix with the same property exists. As such, our result immediately translates to the QAOA setting, where one wishes to construct unitary operators that commute with the embedded linear constraint operators.

V.1 Bounded Weight ILP-QCOMMUTE

Despite the NP-Hardness of ILP-QCOMMUTE, Section IV discusses a simple polynomial time algorithm that finds driver terms up to some weight $k$. Consider the following modified version of ILP-QCOMMUTE, which asks about the existence of a Hermitian matrix that commutes with the constraints but consists only of interaction terms of weight at most $k$.

Definition (ILP-QCOMMUTE-k-LOCAL):

Given a set $\mathcal{C}=\{C_{1},\ldots,C_{m}\}$ of linear constraints such that $\hat{C_{i}}=\sum_{j=1}^{n}c_{ij}\sigma_{j}^{z}$ over a space $\mathbb{C}^{2^{n}}$ with $c_{ij}\in\mathbb{Z}$, is there a Hermitian matrix $H$, with $\mathcal{O}\left(\text{poly}(n)\right)$ nonzero coefficients over a basis $\{\chi_{1},\chi_{2},\chi_{3},\chi_{4}\}^{\otimes n}$ and no term of weight higher than $k$, such that $\left[H,\hat{C_{i}}\right]=0$ for all $\hat{C_{i}}$ and $H$ has at least one off-diagonal term in the spin-z basis?

Theorem V.2.

ILP-QCOMMUTE-k-LOCAL is in P for $k\in\mathcal{O}(1)$.

Proof.

Apply the brute force approach described in Section IV. Since k𝒪(1)k\in\mathcal{O}(1), the algorithm runs in time n𝒪(1)n^{\mathcal{O}(1)}. ∎

This shows that for practical applications, where the driver Hamiltonian should be local, we can tractably find such a driver. Moreover, any $H$ that is local and commutes with the constraints can be constructed by placing appropriate coefficients on the terms found by brute forcing the condition of Theorem III.1, since these terms form a basis for all Hamiltonians of weight at most $k$ that commute with the constraints.
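The brute-force procedure referenced from Section IV can be sketched as follows. This is a simplified illustration under two assumptions of ours: we enumerate only the $\sigma^{\pm}$ pattern of each term (additional $\sigma^{z}$ factors never affect commutation with spin-z constraints), and each returned pattern implicitly comes with its Hermitian conjugate:

```python
from itertools import combinations, product

def weight_k_driver_basis(constraints, n, k):
    """Enumerates basis driver terms of weight at most k: choose a support
    of at most k qubits, assign sigma^+ or sigma^- to each qubit in the
    support, and keep the (v, w) patterns satisfying c_i . (v - w) == 0
    for every constraint c_i (Theorem III.1).  Runs in roughly
    O(n^k * 2^k * m * n) time, polynomial for constant k."""
    found = []
    for size in range(1, k + 1):
        for support in combinations(range(n), size):
            for signs in product((+1, -1), repeat=size):
                v = [0] * n  # qubits carrying sigma^+
                w = [0] * n  # qubits carrying sigma^-
                for q, sgn in zip(support, signs):
                    if sgn > 0:
                        v[q] = 1
                    else:
                        w[q] = 1
                if all(sum(c[j] * (v[j] - w[j]) for j in range(n)) == 0
                       for c in constraints):
                    found.append((tuple(v), tuple(w)))
    return found
```

For $\hat{C}=\sum_{i}\sigma_{i}^{z}$ on three qubits with $k=2$, this returns exactly the six swap patterns $\sigma_{i}^{+}\sigma_{j}^{-}$ with $i\neq j$ (each conjugate pair appearing as two patterns).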

VI Reachability within the Feasible Space

In the previous section, we proved that finding a Hermitian matrix that commutes with a collection of linear spin-z constraints is NP-Complete. A related question is finding a Hermitian matrix that not only commutes with the constraints but also connects the feasible space. Note that two states $\left\lvert\,p\,\right\rangle,\left\lvert\,q\,\right\rangle$ are connected if they are in the same commutation subspace of $H$, that is, $\left\langle\,p\,\right\rvert H^{r}\left\lvert\,q\,\right\rangle\neq 0$ for some $r\in\mathbb{Z}^{+}$. As such, for any pair of solutions $i,j$ in the feasible space, their associated vectors in the computational basis $\left\lvert\,i\,\right\rangle,\left\lvert\,j\,\right\rangle$ should be in the same commutation subspace. In general, when constructing a driver Hamiltonian for an anneal meant to solve an optimization task, we wish to find a driver that satisfies this condition so that the evolution can reach the entire feasible space; commuting with the constraints alone is not enough to ensure this. Consider the graph partitioning example discussed in Section II and Section V: clearly $\sigma_{1}^{+}\sigma_{2}^{-}+\sigma_{1}^{-}\sigma_{2}^{+}$ commutes with the constraint, but fails to connect the entire feasible space. For example, the state $\left\lvert\,0011\,\right\rangle$ is disconnected from $\left\lvert\,1100\,\right\rangle$ under this driver Hamiltonian. While this driver does not mix solution states with nonsolution states, it also does not mix all solution states with each other. We introduce the problem ILP-QCOMMUTE-NONTRIVIAL, which asks for a driver term that not only commutes with the constraints but also acts nontrivially on the feasible space, in the sense that the driver maps at least one solution state to another solution state in the feasible space.
If we think of the solution states as vertices in a graph, then the transitions induced by driver terms are the edges (and a single driver term can induce more than one such edge). The problem ILP-QCOMMUTE-NONTRIVIAL then asks for a driver term that induces at least one edge in this graph of feasible solutions. For the graph partitioning example discussed above, the term $\sigma_{1}^{+}\sigma_{2}^{-}+\sigma_{1}^{-}\sigma_{2}^{+}$ is clearly a solution to ILP-QCOMMUTE-NONTRIVIAL, since it connects $\left\lvert\,1010\,\right\rangle$ to $\left\lvert\,0110\,\right\rangle$, both of which are in the feasible space for this example. Let $P_{i}^{b_{i}}$ be the projection operator corresponding to the energy eigenvalue $b_{i}$ of the constraint operator $\hat{C}_{i}$. The problem ILP-QCOMMUTE-NONTRIVIAL is then defined formally as follows:
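For small instances, the edges induced by a single driver term can be enumerated explicitly. The sketch below (exponential in $n$; the bitstring encoding, eigenvalue convention $\sigma^{z}|0\rangle=+|0\rangle$, and function name are our own assumptions) lists every feasible-to-feasible transition a term and its Hermitian conjugate induce:

```python
from itertools import product

def induced_edges(constraints, values, driver, n):
    """Enumerates the edges a single driver term induces on the graph of
    feasible bitstrings.  The driver is a pair (v, w) of 0/1 tuples: one
    branch lowers the qubits in the support of v and raises those in the
    support of w; the Hermitian conjugate does the reverse."""
    def eigenvalue(x, c):
        # convention: sigma^z|0> = +|0>, sigma^z|1> = -|1>
        return sum(ci * (1 - 2 * xi) for ci, xi in zip(c, x))

    feasible = {x for x in product((0, 1), repeat=n)
                if all(eigenvalue(x, c) == b
                       for c, b in zip(constraints, values))}
    v, w = driver
    edges = []
    for x in feasible:
        for a, b in ((v, w), (w, v)):  # the term and its conjugate
            if all((ai == 0 or xi == 1) and (bi == 0 or xi == 0)
                   for xi, ai, bi in zip(x, a, b)):
                y = tuple(xi - ai + bi for xi, ai, bi in zip(x, a, b))
                if y in feasible and y != x:
                    edges.append((x, y))
    return edges
```

For the graph partitioning example, the swap term on qubits 1 and 2 induces the edge from $\left\lvert\,1010\,\right\rangle$ to $\left\lvert\,0110\,\right\rangle$ but touches no edge at $\left\lvert\,0011\,\right\rangle$, matching the discussion above.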

Definition (ILP-QCOMMUTE-NONTRIVIAL):

Given a set $\mathcal{C}=\{C_{1},\ldots,C_{m}\}$ of linear constraints and constraint values $b=\{b_{1},\ldots,b_{m}\}$ such that $\hat{C}_{i}=\sum_{j=1}^{n}c_{ij}\sigma_{j}^{z}$ over a space $\mathbb{C}^{2^{n}}$ with $c_{ij}\in\mathbb{Z}$, is there a Hermitian matrix $H$, with $\mathcal{O}(\text{poly}(n))$ nonzero coefficients over a basis $\{\chi_{1},\chi_{2},\chi_{3},\chi_{4}\}^{\otimes n}$, such that $\left[H,\hat{C}_{i}\right]=0$ for all $\hat{C}_{i}$ and $P_{1}^{b_{1}}\,\cdots\,P_{m}^{b_{m}}HP_{m}^{b_{m}}\,\cdots\,P_{1}^{b_{1}}$ has at least one off-diagonal term in the spin-z basis?

The main difference is that while ILP-QCOMMUTE required nontrivial off-diagonal terms in the spin-z basis, ILP-QCOMMUTE-NONTRIVIAL specifically requires these terms to be nontrivial within the constraint space of interest, which in general cannot be verified in polynomial time kitaev2002classical ; kempe20033 ; kempe2006complexity (determining whether a Hamiltonian has or fails to have an eigenvector at a specific energy level would allow one to decide hard problems). We show that this problem is at least NP-Hard by reducing a problem closely related to EQUAL SUBSET SUM to ILP-QCOMMUTE-NONTRIVIAL. We begin with the famous NP-Complete SUBSET SUM problem:

Definition (SUBSET SUM):

Given a set $S=\{s_{1},\ldots,s_{n}\}$ of integers and an integer target value $T$, is there a subset $S_{1}\subseteq S$ such that $\sum_{s\in S_{1}}s=T$?

While SUBSET SUM asks about the existence of a single solution, we are interested in at least two solutions, defining the problem:

Definition (2-OR-MORE SUBSET SUM):

Given a set $S=\{s_{1},\ldots,s_{n}\}$ and a target value $T$, are there two distinct subsets $S_{1},S_{2}\subseteq S$ such that $\sum_{s\in S_{1}}s=\sum_{s\in S_{2}}s=T$?

We show that like SUBSET SUM (over positive integerscormen2009introduction ), 2-OR-MORE SUBSET SUM is also NP-Hard:

Lemma VI.1.

2-OR-MORE SUBSET SUM is NP-Hard.

Proof.

Consider an instance of the SUBSET SUM problem with a set S={s1,,sn}S=\{s_{1},\ldots,s_{n}\} and a target value TT such that si>0s_{i}>0 for all sis_{i}. We construct a new instance of the 2-OR-MORE SUBSET SUM problem with set S={s1,,sn,T}S^{\prime}=\{s_{1},\ldots,s_{n},T\} and the same target value TT.

First we show how to obtain a solution to the original SUBSET SUM instance from a solution to the constructed 2-OR-MORE SUBSET SUM instance. Let $S_{1},S_{2}$ be a solution to the new 2-OR-MORE SUBSET SUM instance; at most one of the two subsets uses the element $T$. If neither does, either one is a solution to the original SUBSET SUM instance. Without loss of generality, suppose $S_{1}$ uses the value $T$. Then $S_{1}$ cannot use any other value, since every other value is positive and the sum must equal $T$; hence $S_{1}=\{T\}$. Since $S_{2}$ could not use any value other than $T$ if $T\in S_{2}$, and $S_{1}\neq S_{2}$, it follows that $T\notin S_{2}$. Then $S_{2}\subseteq S$ and $\sum_{s\in S_{2}}s=T$.

We now show how to relate a solution to the constructed 2-OR-MORE SUBSET SUM instance given a solution to the SUBSET SUM instance. Let S1S_{1} be a solution to the SUBSET SUM problem. Then S1,{T}S_{1},\{T\} is a solution to the 2-OR-MORE SUBSET SUM instance.

Since SUBSET SUM is NP-Hard over positive integers, 2-OR-MORE SUBSET SUM is NP-Hard as well. ∎
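The reduction in the lemma is short enough to state in code. The sketch below pairs it with a brute-force decision procedure for 2-OR-MORE SUBSET SUM (exponential, for checking small instances only; both function names are ours):

```python
from itertools import combinations

def two_or_more_subset_sum(s, target):
    """Brute-force decision procedure (exponential; illustration only):
    are there two distinct subsets of s each summing to target?"""
    hits = 0
    for r in range(len(s) + 1):
        for comb in combinations(range(len(s)), r):
            if sum(s[i] for i in comb) == target:
                hits += 1
                if hits >= 2:
                    return True
    return False

def reduce_subset_sum(s, target):
    """The lemma's reduction: append T itself to S.  The resulting
    2-OR-MORE SUBSET SUM instance is a yes-instance iff the original
    SUBSET SUM instance (over positive integers) is."""
    return list(s) + [target], target
```

For example, $S=\{3,5,9\}$ with $T=8$ has the single solution $\{3,5\}$, so the 2-OR-MORE question on $S$ itself is "no"; after appending $T=8$, the subsets $\{3,5\}$ and $\{8\}$ make the reduced instance a "yes".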

2-OR-MORE SUBSET SUM is closely related to EQUAL SUBSET SUM, since both ask about the existence of two subsets with equal sums, but 2-OR-MORE SUBSET SUM adds the further requirement that these two subsets have a specific sum. We show that ILP-QCOMMUTE-NONTRIVIAL is NP-Hard through a reduction from 2-OR-MORE SUBSET SUM. Note again that ILP-QCOMMUTE-NONTRIVIAL is not verifiable in polynomial time kitaev2002classical ; kempe20033 ; kempe2006complexity , and so this reduction is only for the decision version of the problem.

Theorem VI.2.

ILP-QCOMMUTE-NONTRIVIAL is NP-Hard.

Proof.

Consider an instance of the 2-OR-MORE SUBSET SUM problem with a set $S=\{s_{1},\ldots,s_{n}\}$ and a target value $T$. Define the constraint operator $\hat{S}=\sum_{j=1}^{n}s_{j}\sigma_{j}^{z}$ and the target energy value $\left(\sum_{j=1}^{n}s_{j}\right)-2\,T$.

Suppose this instance of ILP-QCOMMUTE-NONTRIVIAL has a solution. Then there are at least two eigenvectors $\left\lvert\,\vec{v}\,\right\rangle,\left\lvert\,\vec{w}\,\right\rangle$ of $\hat{S}$ with eigenvalue $\left(\sum_{j=1}^{n}s_{j}\right)-2\,T$, where the two eigenvectors can be written in the spin-z basis with $\vec{v},\vec{w}\in\{0,1\}^{n}$. As with ILP-QCOMMUTE and EQUAL SUBSET SUM, the nonzero elements of $\vec{v}$ and $\vec{w}$ describe two sets $S_{1},S_{2}$ such that $s_{i}\in S_{1}$ ($s_{i}\in S_{2}$) if and only if $v_{i}=1$ ($w_{i}=1$). Since $\hat{S}\left\lvert\,\vec{v}\,\right\rangle=\left(\sum_{j=1}^{n}s_{j}(1-2\,v_{j})\right)\left\lvert\,\vec{v}\,\right\rangle=\left(\left(\sum_{j=1}^{n}s_{j}\right)-2\,\left(\sum_{j=1}^{n}\,s_{j}v_{j}\right)\right)\left\lvert\,\vec{v}\,\right\rangle$, it follows that $\sum_{j=1}^{n}s_{j}v_{j}=T$. The same logic applies to $\vec{w}$, and so $\sum_{s\in S_{1}}s=\sum_{s\in S_{2}}s=T$. Then 2-OR-MORE SUBSET SUM must have a solution as well, namely $S_{1},S_{2}$.

Suppose 2-OR-MORE SUBSET SUM has a solution. Then there are two distinct subsets $S_{1},S_{2}$ of $S$ such that $\sum_{s_{i}\in S_{1}}s_{i}=\sum_{s_{i}\in S_{2}}s_{i}=T$. Then let $\vec{v}=(v_{1},\ldots,v_{n})$ ($\vec{w}=(w_{1},\ldots,w_{n})$) with $v_{i}=1$ ($w_{i}=1$) if and only if $s_{i}\in S_{1}$ ($s_{i}\in S_{2}$).

Then $\hat{S}\left\lvert\,\vec{v}\,\right\rangle=\left(\sum_{j=1}^{n}s_{j}(1-2\,v_{j})\right)\left\lvert\,\vec{v}\,\right\rangle=\left(\left(\sum_{j=1}^{n}s_{j}\right)-2\,\sum_{j=1}^{n}s_{j}v_{j}\right)\left\lvert\,\vec{v}\,\right\rangle=\left(\left(\sum_{j=1}^{n}s_{j}\right)-2\,T\right)\left\lvert\,\vec{v}\,\right\rangle$. The same logic applies to $\left\lvert\,\vec{w}\,\right\rangle$, and so $\left\lvert\,\vec{v}\,\right\rangle,\left\lvert\,\vec{w}\,\right\rangle$ are both eigenvectors of $\hat{S}$ with eigenvalue $\left(\sum_{j=1}^{n}s_{j}\right)-2\,T$. Then $\left\lvert\,\vec{v}\,\right\rangle\left\langle\,\vec{w}\,\right\rvert+\left\lvert\,\vec{w}\,\right\rangle\left\langle\,\vec{v}\,\right\rvert$ is a driver term that nontrivially maps solution states of this constraint problem to one another. ∎
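The eigenvalue correspondence used in this proof can be checked directly on small instances. The sketch below (exponential enumeration, illustration only; function names and the spin-z sign convention are our own) searches for two basis states at the target energy, which together label the driver term $\left\lvert\,\vec{v}\,\right\rangle\left\langle\,\vec{w}\,\right\rvert+\left\lvert\,\vec{w}\,\right\rangle\left\langle\,\vec{v}\,\right\rvert$:

```python
from itertools import product

def spin_z_eigenvalue(s, x):
    """Eigenvalue of S_hat = sum_j s_j sigma_j^z on the basis state |x>,
    using the convention sigma^z|0> = +|0>, sigma^z|1> = -|1>."""
    return sum(sj * (1 - 2 * xj) for sj, xj in zip(s, x))

def nontrivial_driver_pair(s, target):
    """Searches for two distinct basis states with eigenvalue
    (sum s) - 2*target; such a pair |v>, |w> yields the driver term
    |v><w| + |w><v| of Theorem VI.2, and their supports are the two
    subsets of the 2-OR-MORE SUBSET SUM solution."""
    wanted = sum(s) - 2 * target
    hits = [x for x in product((0, 1), repeat=len(s))
            if spin_z_eigenvalue(s, x) == wanted]
    return (hits[0], hits[1]) if len(hits) >= 2 else None
```

For $S=\{3,5,8\}$ and $T=8$ the subsets $\{8\}$ and $\{3,5\}$ give two basis states at the same energy; for $S=\{3,5,9\}$ and $T=8$ only one subset sums to $T$, so no nontrivial driver pair exists.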

In practical applications, we are often able to quickly find some driver terms that commute with the constraints, but then need to know whether they are sufficient to connect the entire feasible space. This raises the following question: given $k$ driver terms (individual basis terms) that commute with the constraints, does some linear combination of them with nonzero coefficients connect the entire feasible space? In other words, can we guarantee that some linear combination of them has the whole feasible subspace as its smallest invariant subspace? Note that not every driver term that commutes with the constraints is necessary for this purpose. For example, in the case of the constraint $\hat{C}=\sum_{i}^{n}\sigma_{i}^{z}$, it suffices to use the driver terms $\sigma_{i}^{+}\sigma_{i+1}^{-}+\sigma_{i}^{-}\sigma_{i+1}^{+}$ for $i\in[n-1]$. Any linear combination with nonzero coefficients, $H_{d}=\sum_{i}^{n-1}\lambda_{i}\left(\sigma_{i}^{+}\sigma_{i+1}^{-}+\sigma_{i}^{-}\sigma_{i+1}^{+}\right)$, is then a valid driver Hamiltonian that connects the feasible space.
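For small instances, the claim that the nearest-neighbor swap terms connect the whole feasible space can be checked by explicit enumeration. The sketch below (exponential in $n$; the bitstring encoding and function name are our own) builds the graph of feasible states under a set of driver terms and tests connectivity with a breadth-first search:

```python
from collections import deque
from itertools import product

def drivers_connect_feasible_space(constraints, values, drivers, n):
    """Checks by enumeration whether driver terms, given as (v, w) pairs of
    0/1 tuples (lower the support of v, raise the support of w, plus the
    conjugate), connect the entire feasible space of the constraints."""
    def eigenvalue(x, c):
        # convention: sigma^z|0> = +|0>, sigma^z|1> = -|1>
        return sum(ci * (1 - 2 * xi) for ci, xi in zip(c, x))

    feasible = [x for x in product((0, 1), repeat=n)
                if all(eigenvalue(x, c) == b
                       for c, b in zip(constraints, values))]
    if len(feasible) <= 1:
        return True
    edges = {x: [] for x in feasible}
    for x in feasible:
        for v, w in drivers:
            for a, b in ((v, w), (w, v)):  # a term and its conjugate
                if all((ai == 0 or xi == 1) and (bi == 0 or xi == 0)
                       for xi, ai, bi in zip(x, a, b)):
                    y = tuple(xi - ai + bi for xi, ai, bi in zip(x, a, b))
                    if y in edges:
                        edges[x].append(y)
    seen = {feasible[0]}
    queue = deque([feasible[0]])
    while queue:  # BFS over the feasibility graph
        for y in edges[queue.popleft()]:
            if y not in seen:
                seen.add(y)
                queue.append(y)
    return len(seen) == len(feasible)
```

On four qubits with $\hat{C}=\sum_{i}\sigma_{i}^{z}$ and constraint value 0, the single swap term on qubits 1 and 2 fails, while the three nearest-neighbor swap terms together succeed.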
An extra term such as $\sigma_{1}^{+}\sigma_{3}^{-}+\sigma_{1}^{-}\sigma_{3}^{+}$ is then unnecessary, because if $\left\lvert\,\phi\,\right\rangle$ is in the constrained subspace, so are $(\sigma_{1}^{+}\sigma_{2}^{-}+\sigma_{1}^{-}\sigma_{2}^{+})\left\lvert\,\phi\,\right\rangle$, $(\sigma_{2}^{+}\sigma_{3}^{-}+\sigma_{2}^{-}\sigma_{3}^{+})\left\lvert\,\phi\,\right\rangle$, $(\sigma_{1}^{+}\sigma_{2}^{-}+\sigma_{1}^{-}\sigma_{2}^{+})(\sigma_{2}^{+}\sigma_{3}^{-}+\sigma_{2}^{-}\sigma_{3}^{+})\left\lvert\,\phi\,\right\rangle$, and $(\sigma_{2}^{+}\sigma_{3}^{-}+\sigma_{2}^{-}\sigma_{3}^{+})(\sigma_{1}^{+}\sigma_{2}^{-}+\sigma_{1}^{-}\sigma_{2}^{+})\left\lvert\,\phi\,\right\rangle$. Note that $(\sigma_{1}^{+}\sigma_{3}^{-}+\sigma_{1}^{-}\sigma_{3}^{+})=(\sigma_{1}^{+}\sigma_{2}^{-}+\sigma_{1}^{-}\sigma_{2}^{+})(\sigma_{2}^{+}\sigma_{3}^{-}+\sigma_{2}^{-}\sigma_{3}^{+})+(\sigma_{2}^{+}\sigma_{3}^{-}+\sigma_{2}^{-}\sigma_{3}^{+})(\sigma_{1}^{+}\sigma_{2}^{-}+\sigma_{1}^{-}\sigma_{2}^{+})$. More generally, if a Hermitian matrix $M$ can be decomposed into a linear combination of products of operators chosen from the set of driver terms $\{\hat{G}_{k}\}$, and $\left\lvert\,\phi\,\right\rangle$ is a state in the constrained space, then any state $\left\lvert\,\psi\,\right\rangle$ with $\left\langle\,\psi\,\right\rvert M\left\lvert\,\phi\,\right\rangle\neq 0$ (i.e., any state reachable from $\left\lvert\,\phi\,\right\rangle$ through the action of $M$) is also reachable from $\left\lvert\,\phi\,\right\rangle$ through the action of the driver terms in $\{\hat{G}_{k}\}$.
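The operator identity above can be verified numerically on three qubits with a small dense-matrix sketch (the identity is convention-independent; here we assume $\sigma^{+}=|0\rangle\langle 1|$, and the helper names are ours):

```python
def kron(a, b):
    """Kronecker product of two dense matrices given as nested lists."""
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def op_chain(ops):
    """Tensor product of a list of 2x2 single-qubit operators."""
    out = ops[0]
    for o in ops[1:]:
        out = kron(out, o)
    return out

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def matadd(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

I2 = [[1, 0], [0, 1]]
SP = [[0, 1], [0, 0]]   # sigma^+ = |0><1| (assumed convention)
SM = [[0, 0], [1, 0]]   # sigma^- = |1><0|

def swap_term(i, j, n):
    """sigma_i^+ sigma_j^- + sigma_i^- sigma_j^+ on n qubits."""
    a = op_chain([SP if k == i else SM if k == j else I2 for k in range(n)])
    b = op_chain([SM if k == i else SP if k == j else I2 for k in range(n)])
    return matadd(a, b)

# The identity from the text, with A acting on qubits (1,2) and B on (2,3)
# (0-indexed here): swap(1,3) = A B + B A.
A = swap_term(0, 1, 3)
B = swap_term(1, 2, 3)
lhs = swap_term(0, 2, 3)
rhs = matadd(matmul(A, B), matmul(B, A))
assert lhs == rhs
```

The cancellation relies on $\sigma^{+}\sigma^{+}=\sigma^{-}\sigma^{-}=0$ and $\sigma^{+}\sigma^{-}+\sigma^{-}\sigma^{+}=\mathbbm{1}$ on the shared qubit.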

The set of Hamiltonians that commute with a given set of constraints forms an algebra (known as the commutant of the set of constraints). Each of the driver terms we are considering can be seen as a generator of this commutant algebra. This leads us to define the problem ILP-QIRREDUCIBLECOMMUTE-GIVEN-k formally:

Definition (ILP-QIRREDUCIBLECOMMUTE-GIVEN-k):

Given a set $\mathcal{C}=\{C_{1},\ldots,C_{m}\}$ of linear constraints and constraint values $b=\{b_{1},\ldots,b_{m}\}$ such that $\hat{C}_{i}=\sum_{j=1}^{n}c_{ij}\sigma_{j}^{z}$ over a space $\mathbb{C}^{2^{n}}$ with $c_{ij}\in\mathbb{Z}$, and a set of basis terms $\mathcal{G}=\{\hat{G}_{1},\ldots,\hat{G}_{k}\}$ such that $\hat{G}_{i}\in\{\chi_{1},\chi_{2},\chi_{3},\chi_{4}\}^{\otimes n}$, does $\mathcal{G}$ connect the entire nonzero eigenspace of the operator $P_{1}^{b_{1}}\cdots P_{m}^{b_{m}}$?

As such, ILP-QIRREDUCIBLECOMMUTE-GIVEN-k asks whether a given set of driver terms is able to connect the entire feasible space of a set of constraints with the given constraint values. We show that this problem is also NP-Hard by reducing ILP-QCOMMUTE-NONTRIVIAL to it; that is, we map any instance of ILP-QCOMMUTE-NONTRIVIAL to an instance of ILP-QIRREDUCIBLECOMMUTE-GIVEN-k. Consider such an instance with constraints $\{C_{1},\ldots,C_{m}\}$ and constraint values $\{b_{1},\ldots,b_{m}\}$. Find an integer $a_{1}$ such that $\|\vec{c}_{i}\|_{1}<a_{1}$ for all $i\in[m]$. Then expand the variable space $\{x_{1},\ldots,x_{n}\}$ by appending $x_{n+1},x_{n+2}$. Form the constraints $F_{i}(x)=C_{i}(x_{1},\ldots,x_{n})+a_{1}(x_{n+1}+x_{n+2})$ with constraint values $b_{i}+a_{1}$. Then we can easily find a driver term $\hat{G}_{1}=\sigma_{n+1}^{+}\sigma_{n+2}^{-}+\sigma_{n+1}^{-}\sigma_{n+2}^{+}$ (over the basis $\{\mathbbm{1},\sigma^{+},\sigma^{-},\sigma^{z}\}$) such that $[\hat{G}_{1},\hat{F_{i}}]=0$ for all $i\in[m]$. Since the constraint values are $b_{i}+a_{1}$, the only way to satisfy the constraints is to have $(x_{n+1},x_{n+2})\in\{(1,0),(0,1)\}$, since $\left|\sum_{j=1}^{n}c_{ij}x_{j}\right|\leq\|\vec{c}_{i}\|_{1}<a_{1}$ by design.

We have thus altered the constraints such that the feasible space of the original problem, if non-empty, is doubled in size by the addition of the two variables $x_{n+1}$ and $x_{n+2}$. The important structure to note here is that the feasible subspace of the new constraints is the tensor product of the feasible subspace of the original constraints with the subspace spanned by the states $\left\lvert\,10\,\right\rangle$ and $\left\lvert\,01\,\right\rangle$ over qubits $x_{n+1},x_{n+2}$. It is then easy to see that the only nontrivial action induced by the chosen driver terms over $x_{n+1},x_{n+2}$ is precisely the action of $\hat{G}_{1}$.

We can apply the same procedure recursively, adding ancillas $\{x_{n+1},\ldots,x_{n+2k}\}$ and generating $k$ driver terms, such that $a_{i}>\|\vec{c}_{i}\|_{1}+\sum_{j=1}^{i-1}a_{j}$ and $\hat{G}_{i}=(\sigma_{n+2\,i-1}^{+}\sigma_{n+2\,i}^{-}+\sigma_{n+2\,i-1}^{-}\sigma_{n+2\,i}^{+})$, $1\leq i\leq k$. By this construction, when restricted to the ancilla variables, the feasible subspace is spanned by the vectors $\{\bigotimes_{i=1}^{k}|i_{1}i_{2}\rangle,i_{1}+i_{2}=1\}$. Given any element of this subspace, the action of the $\hat{G}_{i}$ driver terms is sufficient to generate every other element. To proceed with the reduction, we give the constraints $\{F_{1},\ldots,F_{m}\}$ with constraint values $\{b_{1}+\sum_{i=1}^{k}a_{i},\ldots,b_{m}+\sum_{i=1}^{k}a_{i}\}$ respectively, and the drivers $\{\hat{G}_{1},\ldots,\hat{G}_{k}\}$, to our ILP-QIRREDUCIBLECOMMUTE-GIVEN-k solver oracle. Since all $k$ driver terms act only on the added qubits $x_{n+1}$ to $x_{n+2\,k}$, they say nothing about the feasible space of the original problem over qubits $x_{1}$ to $x_{n}$. Suppose we are told that our $k$ drivers are sufficient, i.e., they can generate the whole feasible subspace by acting on any one element of that subspace. Then there are no nontrivial driver terms for the original ILP-QCOMMUTE-NONTRIVIAL decision problem, since none of the $k$ driver terms acts on the qubits associated with variables $x_{1}$ to $x_{n}$. Likewise, if we are told our $k$ drivers are not sufficient, then there must be at least one nontrivial driver for the original problem, since the drivers are enough to generate all elements of the feasible subspace when restricted to the ancillas. Note that this solves ILP-QCOMMUTE-NONTRIVIAL without giving us a token to verify it.
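The padding construction can be sketched as follows. This is a simplified illustration with assumptions of ours: we use a uniform $a_{i}$ per round derived from the largest extended constraint norm, which automatically satisfies $a_{i}>\|\vec{c}_{i}\|_{1}+\sum_{j<i}a_{j}$; the swap drivers $\hat{G}_{i}$ on each appended ancilla pair are implicit:

```python
def extend_with_ancillas(constraints, values, k):
    """For each of k rounds: pick a strictly larger than the 1-norm of every
    constraint so far, append two ancilla variables with coefficient a to
    every constraint, and raise each constraint value by a.  The driver
    G_i for round i is the swap term on that round's ancilla pair."""
    cs = [list(c) for c in constraints]
    vals = list(values)
    added = []
    for _ in range(k):
        a = max(sum(abs(x) for x in c) for c in cs) + 1
        for c in cs:
            c += [a, a]  # both ancillas of this round enter with weight a
        vals = [v + a for v in vals]
        added.append(a)
    return cs, vals, added
```

For one constraint $(1,1,1,1)$ with value $0$ and $k=2$, the rounds choose $a_{1}=5$ and $a_{2}=15$, and each $a_{i}$ indeed exceeds the original norm plus all earlier $a_{j}$.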
Because this is the most general unstructured version of the problem, it is possible that a different complexity result can be found for more structured versions of the same problem. We also note that the problem may admit a stronger complexity result, such as membership in a higher class of the polynomial hierarchymeyer1972equivalence ; stockmeyer1976polynomial ; garey1979computers , like #Pvaliant1979complexity , to which it has some natural analogues.

Given this result and the result of Section IV, we can often find drivers that satisfy the condition stated in Theorem III.1 but may not connect the entire feasible space. Still, there remain many avenues for exploiting such terms; for example, they can be used alongside the ordinary transverse field, so that universality is maintained while the dynamics is biased towards a subspace of the solution space. This gives us a knob for introducing higher-order terms when the ordinary transverse field struggles to find a solution. Such driver terms can also be beneficial for exploring new solutions using reverse annealingventurelli2019reverse ; king2018observation , especially for solutions at higher Hamming distance, which the transverse field generally struggles to find.

Another way to leverage our result is to brute force the problem for a set of constraints over a small enough subspace that it becomes polynomially tractable. Over the other variables, we apply the usual transverse field and enforce the remaining constraints as penalty terms in the final Hamiltonian. These approaches can also be adapted to constraints that are geometrically local (as in a two-dimensional grid).

VII Conclusion

In this work, we addressed the computational complexity of finding driver Hamiltonians for quantum annealing processes aimed at solving optimization or feasibility problems with several linear constraints. We developed a simple and intuitive algebraic framework for understanding whether a Hamiltonian commutes with a set of constraints. While this result is interesting mathematically in its own right, we mainly focused on the problem posed in Ref.hen2016driver of algorithmically finding driver Hamiltonians for optimization problems with several linear constraints. Most significantly, the condition allowed a reduction of the NP-Hard problem EQUAL SUBSET SUM to finding such a driver Hamiltonian, thereby allowing us to categorize the complexity of this problem.

We also showed that ILP-QCOMMUTE-NONTRIVIAL and ILP-QIRREDUCIBLECOMMUTE-GIVEN-k are at least NP-Hard. These problems could well lie in a higher complexity class of the polynomial hierarchy, like #P, to which ILP-QIRREDUCIBLECOMMUTE-GIVEN-k bears some similarity. However, for most common implementations the Hamiltonians are of bounded weight, and the relevant problem ILP-QCOMMUTE-k-LOCAL, for a small integer $k$, is in P. Hence, there is a simple brute force algorithm, as detailed in Section IV, to find a basis for all possible driver Hamiltonians of this bounded locality. However, the result on ILP-QIRREDUCIBLECOMMUTE-GIVEN-k says that, given a set of driver terms, it is intractable to know whether the found basis can sustain a Hamiltonian that connects the entire feasible space of the linear constraints. As such, we presented a polynomial time algorithm that is guaranteed to find a basis for all possible Hamiltonians that commute with a set of embedded constraint operators up to a certain weight, but with no guarantee that the resulting Hamiltonian connects the entire feasible space that the constraints specify. However, for some important problems it is possible to exploit the constraint structure to guarantee that driver terms of low weight will be sufficient to reach all feasible states. This is the case, for example, for the graph coloring problem discussed in Section II and Ref.hen2016driver .

Our result also applies to finding mixing operators for the Quantum Alternating Operator Ansatz (QAOA)farhi2014quantum ; hadfield2019quantum . To implement highly nontrivial driver Hamiltonians for an anneal, it also becomes necessary to find a new initial Hamiltonian that is then evolved slowly, with a simple linear interpolation, to the driver Hamiltonian, since thermal equilibration to the driver Hamiltonian may be difficult. It then becomes relevant how we can construct such an initial Hamiltonian for a given driver Hamiltonian, so that we can guarantee that we reach the right constrained space. This is a fundamental question for future research. While we have shown these problems to be NP-Hard, we have not shown the average hardness of this class, or the typical hardness of instances of interest for specific applications. Especially pertinent are sets of instances in which the practical runtime for finding driver Hamiltonians remains tractable, or in which the hardness of the problem comes from having to search a large feasible space for an optimum solution rather than from pinpointing a very small feasible space. It is also interesting to note that our algebraic formulation is agnostic to the stoquasticity of the terms found. In the basis presented in Section III, the stoquasticity of the individual basis terms, as written in Eq. 6, is determined by the amplitude $\alpha$ and its conjugate $\alpha^{\dagger}$. Commutation is invariant under altering $\alpha$ ($\alpha^{\dagger}$ adjusts with $\alpha$ so that the pair of terms remains Hermitian and commuting). Once we have found driver terms that are suitable for a problem, the question arises of what effect, if any, choosing coefficients that make them stoquastic or non-stoquastic has on the annealbravyi2006complexity ; bravyi2010complexity ; marvian2019computational ; crosson2020signing . This is another direction that requires further study.

VIII Acknowledgments

The research is based upon work (partially) supported by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA) and the Defense Advanced Research Projects Agency (DARPA), via the U.S. Army Research Office contract W911NF-17-C-0050. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of the ODNI, IARPA, or the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon.

References

  • (1) I. Hen and F. M. Spedalieri, “Quantum annealing for constrained optimization,” Physical Review Applied, vol. 5, no. 3, p. 034007, 2016.
  • (2) I. Hen and M. S. Sarandy, “Driver hamiltonians for constrained optimization in quantum annealing,” Physical Review A, vol. 93, no. 6, p. 062312, 2016.
  • (3) T. Kadowaki and H. Nishimori, “Quantum annealing in the transverse ising model,” Physical Review E, vol. 58, no. 5, p. 5355, 1998.
  • (4) E. Farhi, J. Goldstone, S. Gutmann, and M. Sipser, “Quantum computation by adiabatic evolution,” arXiv preprint quant-ph/0001106, 2000.
  • (5) T. Albash and D. A. Lidar, “Adiabatic quantum computation,” Rev. Mod. Phys., vol. 90, p. 015002, Jan 2018. [Online]. Available: https://link.aps.org/doi/10.1103/RevModPhys.90.015002
  • (6) E. Farhi, J. Goldstone, S. Gutmann, J. Lapan, A. Lundgren, and D. Preda, “A quantum adiabatic evolution algorithm applied to random instances of an np-complete problem,” Science, vol. 292, no. 5516, pp. 472–475, 2001.
  • (7) G. E. Santoro and E. Tosatti, “Optimization using quantum mechanics: quantum annealing through adiabatic evolution,” Journal of Physics A: Mathematical and General, vol. 39, no. 36, p. R393, 2006.
  • (8) Z. Bian, F. Chudak, R. B. Israel, B. Lackey, W. G. Macready, and A. Roy, “Mapping constrained optimization problems to quantum annealing with application to fault diagnosis,” Frontiers in ICT, vol. 3, p. 14, 2016.
  • (9) P. Hauke, H. G. Katzgraber, W. Lechner, H. Nishimori, and W. D. Oliver, “Perspectives of quantum annealing: Methods and implementations,” Reports on Progress in Physics, vol. 83, no. 5, p. 054401, 2020.
  • (10) S. H. Adachi and M. P. Henderson, “Application of quantum annealing to training of deep neural networks,” arXiv preprint arXiv:1510.06356, 2015.
  • (11) A. Mott, J. Job, J.-R. Vlimant, D. Lidar, and M. Spiropulu, “Solving a higgs optimization problem with quantum annealing for machine learning,” Nature, vol. 550, no. 7676, pp. 375–379, 2017.
  • (12) M. H. Amin, E. Andriyash, J. Rolfe, B. Kulchytskyy, and R. Melko, “Quantum boltzmann machine,” Physical Review X, vol. 8, no. 2, p. 021050, 2018.
  • (13) J. Biamonte, P. Wittek, N. Pancotti, P. Rebentrost, N. Wiebe, and S. Lloyd, “Quantum machine learning,” Nature, vol. 549, no. 7671, pp. 195–202, 2017.
  • (14) R. Y. Li, R. Di Felice, R. Rohs, and D. A. Lidar, “Quantum annealing versus classical machine learning applied to a simplified computational biology problem,” NPJ quantum information, vol. 4, no. 1, pp. 1–10, 2018.
  • (15) V. Kumar, G. Bass, C. Tomlin, and J. Dulny, “Quantum annealing for combinatorial clustering,” Quantum Information Processing, vol. 17, no. 2, pp. 1–14, 2018.
  • (16) S. G. Brush, “History of the lenz-ising model,” Reviews of modern physics, vol. 39, no. 4, p. 883, 1967.
  • (17) D. Sherrington and S. Kirkpatrick, “Solvable model of a spin-glass,” Physical review letters, vol. 35, no. 26, p. 1792, 1975.
  • (18) T. Castellani and A. Cavagna, “Spin-glass theory for pedestrians,” Journal of Statistical Mechanics: Theory and Experiment, vol. 2005, no. 05, p. P05012, 2005.
  • (19) M. Troyer and U.-J. Wiese, “Computational complexity and fundamental limitations to fermionic quantum monte carlo simulations,” Physical review letters, vol. 94, no. 17, p. 170201, 2005.
  • (20) A. Lucas, “Ising formulations of many np problems,” Frontiers in Physics, vol. 2, p. 5, 2014.
  • (21) S. A. Cook, “The complexity of theorem-proving procedures,” in Proceedings of the third annual ACM symposium on Theory of computing, 1971, pp. 151–158.
  • (22) R. M. Karp, “On the computational complexity of combinatorial problems,” Networks, vol. 5, no. 1, pp. 45–68, 1975.
  • (23) ——, “Reducibility among combinatorial problems,” in Complexity of computer computations.   Springer, 1972, pp. 85–103.
  • (24) G. J. Woeginger and Z. Yu, “On the equal-subset-sum problem,” Information Processing Letters, vol. 42, no. 6, pp. 299–302, 1992.
  • (25) T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein, Introduction to algorithms.   MIT press, 2009.
  • (26) A. Y. Kitaev, A. Shen, M. N. Vyalyi, and M. N. Vyalyi, Classical and quantum computation.   American Mathematical Soc., 2002, no. 47.
  • (27) J. Kempe and O. Regev, “3-local hamiltonian is qma-complete,” arXiv preprint quant-ph/0302079, 2003.
  • (28) J. Kempe, A. Kitaev, and O. Regev, “The complexity of the local hamiltonian problem,” SIAM Journal on Computing, vol. 35, no. 5, pp. 1070–1097, 2006.
  • (29) A. R. Meyer and L. J. Stockmeyer, “The equivalence problem for regular expressions with squaring requires exponential space,” in SWAT (FOCS), 1972, pp. 125–129.
  • (30) L. J. Stockmeyer, “The polynomial-time hierarchy,” Theoretical Computer Science, vol. 3, no. 1, pp. 1–22, 1976.
  • (31) M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness.   W. H. Freeman, 1979.
  • (32) L. G. Valiant, “The complexity of enumeration and reliability problems,” SIAM Journal on Computing, vol. 8, no. 3, pp. 410–421, 1979.
  • (33) D. Venturelli and A. Kondratyev, “Reverse quantum annealing approach to portfolio optimization problems,” Quantum Machine Intelligence, vol. 1, no. 1, pp. 17–30, 2019.
  • (34) A. D. King, J. Carrasquilla, J. Raymond, I. Ozfidan, E. Andriyash, A. Berkley, M. Reis, T. Lanting, R. Harris, F. Altomare et al., “Observation of topological phenomena in a programmable lattice of 1,800 qubits,” Nature, vol. 560, no. 7719, pp. 456–460, 2018.
  • (35) E. Farhi, J. Goldstone, and S. Gutmann, “A quantum approximate optimization algorithm,” arXiv preprint arXiv:1411.4028, 2014.
  • (36) S. Hadfield, Z. Wang, B. O’Gorman, E. G. Rieffel, D. Venturelli, and R. Biswas, “From the quantum approximate optimization algorithm to a quantum alternating operator ansatz,” Algorithms, vol. 12, no. 2, p. 34, 2019.
  • (37) S. Bravyi, D. P. Divincenzo, R. I. Oliveira, and B. M. Terhal, “The complexity of stoquastic local hamiltonian problems,” arXiv preprint quant-ph/0606140, 2006.
  • (38) S. Bravyi and B. Terhal, “Complexity of stoquastic frustration-free hamiltonians,” Siam journal on computing, vol. 39, no. 4, pp. 1462–1485, 2010.
  • (39) M. Marvian, D. A. Lidar, and I. Hen, “On the computational complexity of curing non-stoquastic hamiltonians,” Nature communications, vol. 10, no. 1, pp. 1–9, 2019.
  • (40) E. Crosson, T. Albash, I. Hen, and A. Young, “De-signing hamiltonians for quantum adiabatic optimization,” Quantum, vol. 4, p. 334, 2020.

Appendix A 0-1-LP-QCOMMUTE is NP-Hard

We reduce the EQUAL SUBSET SUM problem to the 0-1-LP-QCOMMUTE problem. We define the EQUAL SUBSET SUM problem as before:

Definition:

EQUAL SUBSET SUM: Given a set $S=\{s_{1},s_{2},\ldots,s_{n}\}$ with $s_{i}\in\mathbb{Z}^{+}$, find two non-empty disjoint subsets $A,B\subseteq S$ such that $\sum_{a_{i}\in A}a_{i}=\sum_{b_{i}\in B}b_{i}$.

The EQUAL SUBSET SUM problem is known to be NP-Complete woeginger1992equal . We map an instance of the EQUAL SUBSET SUM problem, defined over a set $S=\{s_{1},s_{2},\ldots,s_{n}\}$ with $s_{i}\in\mathbb{Z}^{+}$, to the 0-1-LP-QCOMMUTE problem. In order to connect EQUAL SUBSET SUM with solving a linear system over discrete variables (the key of Theorem III.1), we associate an assignment of the integers in $S$ to the two subsets $A$ and $B$ with a function $u$ over $S$ such that $u=\{u_{1},\ldots,u_{n}\}$ with $u_{i}\in\{-1,0,1\}$, where $u_{i}$ is the value the assignment $u$ gives to the integer $s_{i}$. Slightly abusing notation, this defines a function on any subset $M=\{s_{m_{1}},s_{m_{2}},\ldots,s_{m_{|M|}}\}$ such that $u(M)=\{u_{m_{1}},u_{m_{2}},\ldots,u_{m_{|M|}}\}$; for a subset consisting of a single element $s_{e}$, we also write $u(s_{e})=u_{e}$. We can then define the integer-valued function $E_{S}(u)=\sum_{s_{i}\in S}u_{i}s_{i}$. If we associate the integers $s_{i}$ with $u_{i}=1$ to subset $A$, and those with $u_{i}=-1$ to subset $B$, then we can rewrite it as $E_{S}(u)=\sum_{s_{i}\in A}s_{i}-\sum_{s_{i}\in B}s_{i}$ (note that $u_{i}=0$ means that the corresponding integer is not chosen for either subset). Then EQUAL SUBSET SUM has a solution if and only if there is an assignment function $u$ with a nontrivial image such that $E_{S}(u)=\sum_{i}^{n}u_{i}s_{i}=0$.
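To make the assignment picture concrete, the following sketch (the function name is ours, not from the paper) brute-forces EQUAL SUBSET SUM by searching over assignments $u\in\{-1,0,1\}^{n}$ and checking $E_{S}(u)=0$; this is for illustration only, since the problem is NP-Complete and the search is exponential:

```python
from itertools import product

def equal_subset_sum(S):
    """Brute-force EQUAL SUBSET SUM over assignments u in {-1,0,1}^n:
    u_i = 1 places s_i in A, u_i = -1 places s_i in B, u_i = 0 omits it.
    Returns (A, B) for the first nontrivial u with E_S(u) = 0, else None."""
    for u in product((-1, 0, 1), repeat=len(S)):
        if any(u) and sum(ui * si for ui, si in zip(u, S)) == 0:
            A = [s for ui, s in zip(u, S) if ui == 1]
            B = [s for ui, s in zip(u, S) if ui == -1]
            return A, B
    return None

# The improper set {1, 1, 2} used later in the text admits a solution.
solution = equal_subset_sum([1, 1, 2])
print(solution)
```

A set of distinct powers of two, such as $\{1,2,4\}$, has all subset sums distinct and hence no solution.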

However, we need a vector representation to exploit the structure of Theorem III.1. Let $s_{\max}$ be the maximum of $S$. We define $S^{M}$ as the matrix whose columns are the binary representations of the integers in $S$: given $s_{j}\in S$, the entry $s_{ij}^{M}=s_{j}^{i}$ is the $i$-th bit of the integer $s_{j}$. This defines an $m\times n$ matrix with $m=\lfloor\log_{2}(s_{\max})\rfloor+1$, the bit length of $s_{\max}$.
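As a sketch (our own helper, not from the paper), $S^{M}$ can be built by stacking the bits of each integer column-wise, lowest bit first:

```python
import numpy as np

def binary_matrix(S):
    """Build S^M: column j holds the binary representation of s_j,
    with row i holding the i-th lowest bit; m = bit length of max(S)."""
    m = max(S).bit_length()
    return np.array([[(s >> i) & 1 for s in S] for i in range(m)])

SM = binary_matrix([1, 1, 2])
print(SM)  # two rows (m = 2), columns for s_1 = 1, s_2 = 1, s_3 = 2
```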

The idea is that we wish to give each integer an associated binary vector such that multiplying a binary vector with $S^{M}$ corresponds to selecting that integer to participate in a sum. We refer to the vectorized form of $u$ as $\vec{\mu}\in\{-1,0,1\}^{n}$ with $\vec{\mu}=(u_{1},\ldots,u_{n})$. Since multiplying a matrix by a vector on the right results in a linear combination of the matrix columns, with the coefficients being the corresponding components of the vector, it would be tempting to assume that $E_{S}(u)=S^{M}\,\vec{\mu}$, since the columns of $S^{M}$ are associated with the integers in $S$. Then we would have $E_{S}(u)=0$ if and only if $S^{M}\,\vec{\mu}=\vec{0}$, providing our desired connection between Theorem III.1 and EQUAL SUBSET SUM.

Unfortunately this does not work, since the columns of $S^{M}$ contain the binary representations of the integers $s_{i}$, while the expression $E_{S}(u)$ refers to the usual addition of integers, not bit-componentwise addition. To illustrate what we mean, consider the (improper) set $S=\{1,1,2\}$, which gives:

$$S^{M}=\;\begin{pmatrix}1&1&0\\0&0&1\end{pmatrix},$$
where the columns correspond to $s_{1},s_{2},s_{3}$ and the rows to the bit entries, lowest bit first.

Even though the associated EQUAL SUBSET SUM problem has a simple solution given by the function $u=\{1,1,-1\}$ (assign the first two integers to subset $A$ and the third to subset $B$), a simple calculation shows that $S^{M}\,\vec{\mu}=(2,-1)\neq\vec{0}$. From the example above we can see that what we are missing is a way of incorporating the “bit carry” that occurs in binary addition into the operations of regular matrix-vector multiplication. The main goal of this appendix is to show how this can be accomplished by embedding these matrix operations into a larger vector space.
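The failure is easy to reproduce numerically (column order as in the text):

```python
import numpy as np

SM = np.array([[1, 1, 0],
               [0, 0, 1]])       # columns: binary forms of 1, 1, 2
mu = np.array([1, 1, -1])        # u = {1, 1, -1} solves EQUAL SUBSET SUM
print(SM @ mu)                   # [ 2 -1]: nonzero, since no carry is performed
```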

In order to resolve this issue, we introduce a mechanism to do generalized bit addition - bit addition generalized to the case where the bit values can be positive, negative, or zero. We add ancillary bits $\mathscr{A}$ such that $u^{\ast}$ is the assignment $u$ expanded to the new space $S\cup\mathscr{A}$, i.e. $u^{\ast}=\{u_{1},\ldots,u_{n},u_{n+1},\ldots,u_{n+|\mathscr{A}|}\}$. Slightly abusing notation, for any subset $M=M_{S}\cup M_{\mathscr{A}}$ with $M_{S}=\{s_{ms_{1}},\ldots,s_{ms_{|M_{S}|}}\}$ and $M_{\mathscr{A}}=\{a_{ma_{1}},\ldots,a_{ma_{|M_{\mathscr{A}}|}}\}$, we define $u^{\ast}(M)=\{u_{ms_{1}},\ldots,u_{ms_{|M_{S}|}},a_{ma_{1}},\ldots,a_{ma_{|M_{\mathscr{A}}|}}\}$. We construct new constraints $\mathscr{K}$ such that $E_{S}(u)=0\Leftrightarrow E_{\mathscr{K}}(u^{\ast})=0$. Moreover, $u^{\ast}$ will allow for a vectorized form $\vec{\mu}^{\ast}$ and $\mathscr{K}$ a matrix $K^{M}$ (see A.2) such that $E_{\mathscr{K}}(u^{\ast})=0\Leftrightarrow K^{M}\,\vec{\mu}^{\ast}=0$. Intuitively, $u^{\ast}$ picks coefficients for the values over $S$ and is subsequently forced to take values on $\mathscr{A}$ corresponding to valid bit addition; it satisfies $\mathscr{K}$ only if every bit entry of the total sum is zero. Fig. 1 gives a visual description of the steps used to create our full reduction. For a given set of integers $S$, we follow the reduction to construct a binary matrix $K^{M}$ whose row vectors define the constraint operators $\hat{K}_{i}=\sum_{j=1}^{|S\cup\mathscr{A}|}k_{ij}^{M}\sigma_{j}^{z}$.
These constraints serve as the input binary LP to the oracle solver of 0-1-LP-QCOMMUTE, which tells us whether a Hamiltonian $H$ exists with an off-diagonal term in the spin-z basis; such a term exhibits vectors $\vec{v},\vec{w}$ that describe two subsets $A$ and $B$ solving the given EQUAL SUBSET SUM problem.


Figure 1: A flow chart describing how our reduction works; we recommend motivated readers refer back to it as they read the reduction. An instance of EQUAL SUBSET SUM (box 1) is mapped into a binary constraint representation such that the sum function $E_{S}$ defined over the assignment $u$ is equivalent to $\text{sum}(A)-\text{sum}(B)$, where $u$ assigns variables to either $A$ or $B$, or leaves them unused (box 2). To exploit Theorem III.1, these constraints are mapped to constraints $\mathscr{K}$ (box 3), such that $E_{\mathscr{K}}(u^{\ast})=0\Leftrightarrow E_{S}(u)=0$. Unlike $S$, $\mathscr{K}$ allows for a simple matrix representation such that $K^{M}\,\vec{\mu}^{\ast}=0\Leftrightarrow E_{\mathscr{K}}(u^{\ast})=0$ (box 4), where $\vec{\mu}^{\ast}$ is the vectorized form of $u^{\ast}$. Note that if $u$ exists such that $E_{S}(u)=0$, then many $u^{\ast}$ exist such that $E_{\mathscr{K}}(u^{\ast})=0$, but each reduces to the same $u$. The rows of $K^{M}$ define the operators $\hat{K}_{i}=\sum_{j=1}^{|S\cup\mathscr{A}|}k_{ij}^{M}\sigma_{j}^{z}$, on which a 0-1-LP-QCOMMUTE oracle shows the existence of a driver Hamiltonian $H_{d}$, which we can interpret back to see that there must be a solution to EQUAL SUBSET SUM as well.

A.1 Generalized Full Adder

Inputs         Output
               Primary         Secondary
 a    b        s     c         s     c
-1   -1        0    -1         -     -
-1    0       -1     0         1    -1
-1    1        0     0         -     -
 0   -1       -1     0         1    -1
 0    0        0     0         -     -
 0    1        1     0        -1     1
 1   -1        0     0         -     -
 1    0        1     0        -1     1
 1    1        0     1         -     -
Table 1: The generalized full adder; if $u^{\ast}$ takes a particular value on inputs $a$ and $b$, then $u^{\ast}$ will be forced to take the corresponding sum (represented by $s$) and carry (represented by $c$) values. When $a+b$ is odd, $s$ and $c$ have two possible value pairs they can take. Here primary (secondary) operations correspond to the operations where the carry is set to zero (nonzero) when possible.

In this section we describe how to build the basis for our reduction: a matrix such that the values $u^{\ast}$ takes on the set $S$ are added bitwise over the ancillary bits $\mathscr{A}$. There will be specific ancillary bits from which the total sum that $u^{\ast}$ takes on $S$ can be deduced. Consider again the simple example we introduced in the previous section. We will add ancillary variables whose values are forced to be what is dictated by the bit addition of values in $S$. This is summarized in Table 1: if $u^{\ast}$ takes a particular value on two inputs $a$ and $b$, then the table describes what value $u^{\ast}$ will be forced to take on the new ancillary values $s$ and $c$ (representing the sum and carry bits, respectively).

Like the ordinary adder, the generalized adder accepts all values such that $u^{\ast}(a)+u^{\ast}(b)=2\,u^{\ast}(c)+u^{\ast}(s)$, except now $u^{\ast}(x)\in\{-1,0,1\}$ for any $x$, and so $u^{\ast}(a)+u^{\ast}(b)\in\{-2,-1,0,1,2\}$. Note that the carry bit and the sum bit are then not unique as in the case of the ordinary full adder. For example, if $u^{\ast}(a)=1$ and $u^{\ast}(b)=0$, then it is possible that $u^{\ast}(c)=0$ and $u^{\ast}(s)=1$ as in the ordinary adder, but also that $u^{\ast}(c)=1$ and $u^{\ast}(s)=-1$. Since $2\,u^{\ast}(c)+u^{\ast}(s)$ takes the same value in both cases, both are technically valid. The operations that agree with the ordinary full adder we refer to as primary, and those that do not as secondary. When possible, a primary operation sets the carry bit to zero, while a secondary operation sets the carry bit to either one or negative one. One may hope to force the primary mode of operation, but we could not construct a 0-1 matrix that forces the primary modes over the secondary modes: our condition for satisfaction is expressed through equivalence statements like $u^{\ast}(a)+u^{\ast}(b)=2\,u^{\ast}(c)+u^{\ast}(s)$, and no equivalence statement can state a preference in representation. While this does not affect the correctness of our result, it does mean that the number of solutions is not preserved in our reduction - many valid $u^{\ast}$ reduce to a single $u$. The reduction is therefore not parsimonious.
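A quick enumeration (illustrative only) confirms that the relation $u^{\ast}(a)+u^{\ast}(b)=2\,u^{\ast}(c)+u^{\ast}(s)$ reproduces Table 1, including the two modes when $a+b$ is odd:

```python
from itertools import product

# All (a, b, c, s) in {-1,0,1}^4 satisfying a + b == 2*c + s.
valid = [(a, b, c, s)
         for a, b, c, s in product((-1, 0, 1), repeat=4)
         if a + b == 2 * c + s]

# For a = 1, b = 0 there are two valid (c, s) pairs: the primary mode
# (0, 1) and the secondary mode (1, -1), matching Table 1.
modes = sorted((c, s) for a, b, c, s in valid if (a, b) == (1, 0))
print(modes)  # [(0, 1), (1, -1)]
```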

To enforce the generalized adder between two inputs and two outputs we need to generate the correct submatrix. Given inputs $a$ and $b$, we define the matrix on $a,b,s,c,x_{1},x_{2},x_{3}$ - with $x_{1},x_{2},x_{3}$ being intermediate ancillas - as:

$$GA^{M}=\;\begin{pmatrix}1&1&1&1&1&0&0\\0&0&1&0&1&1&1\\0&0&0&1&1&1&1\\0&0&1&0&0&1&0\\0&0&0&1&0&1&0\\0&0&0&0&1&0&1\end{pmatrix}\qquad(15)$$
where the columns correspond, in order, to $a,b,x_{1},x_{2},x_{3},c,s$.

As constraints, we can write it as:

$GA_{1}(a,b,x_{1},x_{2},x_{3})=0,$  (16)
$GA_{2}(x_{1},x_{3},s,c)=0,$  (17)
$GA_{3}(x_{2},x_{3},s,c)=0,$  (18)
$GA_{4}(x_{1},c)=0,$  (19)
$GA_{5}(x_{2},c)=0,$  (20)
$GA_{6}(x_{3},s)=0.$  (21)

For every generalized adder in Fig. 3 (as described in the protocol given in Section A.4), we have such a submatrix over the corresponding variables. We give a simple case-by-case proof that $GA^{M}$ enforces $u^{\ast}$ to be valid if and only if its entries satisfy $2\,u^{\ast}(c)+u^{\ast}(s)=u^{\ast}(a)+u^{\ast}(b)$, as seen in Fig. 1 in Appendix B.
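The case-by-case argument can also be checked exhaustively; the sketch below (our verification, not part of the paper's proof) enumerates all of $\{-1,0,1\}^{7}$ and confirms that the kernel of $GA^{M}$ realizes exactly the tuples with $a+b=2c+s$:

```python
from itertools import product
import numpy as np

# The 6x7 matrix GA^M of Eq. (15), columns ordered a, b, x1, x2, x3, c, s.
GAM = np.array([[1, 1, 1, 1, 1, 0, 0],
                [0, 0, 1, 0, 1, 1, 1],
                [0, 0, 0, 1, 1, 1, 1],
                [0, 0, 1, 0, 0, 1, 0],
                [0, 0, 0, 1, 0, 1, 0],
                [0, 0, 0, 0, 1, 0, 1]])

adder_tuples, kernel_tuples = set(), set()
for v in product((-1, 0, 1), repeat=7):
    a, b, x1, x2, x3, c, s = v
    if a + b == 2 * c + s:
        adder_tuples.add((a, b, c, s))
    if not np.any(GAM @ np.array(v)):       # v lies in the kernel of GA^M
        kernel_tuples.add((a, b, c, s))
assert kernel_tuples == adder_tuples        # GA^M enforces exactly the adder
```

The rows $GA_{4},GA_{5},GA_{6}$ pin $x_{1}=x_{2}=-c$ and $x_{3}=-s$, after which row $GA_{1}$ reads $a+b-2c-s=0$.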

A.2 The Simple Reduced Case

Before we move on to a general protocol, we consider the simple case described earlier with the (improper) integer set $S=\{1,1,2\}$. We give a slightly reduced description for this problem to show what the reductions typically look like. We implement a generalized adder for the bits $s_{1}^{1}$ and $s_{2}^{1}$, introducing the ancillary bits $k_{1}^{1},z_{1}^{1}$ that are the corresponding carry and sum bits. We then implement a generalized adder for the bits $s_{3}^{2}$ and $k_{1}^{1}$, introducing the ancillary bits $k_{2}^{1},z_{2}^{1}$ that are the corresponding carry and sum bits. As such, $E_{S}(u^{\ast})=0\Leftrightarrow u^{\ast}(z_{1}^{1})=u^{\ast}(z_{2}^{1})=u^{\ast}(k_{2}^{1})=0$, since the latter condition is equivalent to saying that the bitwise sum of the two sets is zero. This is represented in Fig. 2A; each box in the diagram refers to a generalized full adder. In words, we add the assignments $u_{1}^{\ast},u_{2}^{\ast}$ of the bits $s_{1}^{1},s_{2}^{1}$, and then add the resulting carry bit assignment to the assignment $u_{3}^{\ast}$ on the bit $s_{3}^{2}$. The resulting integer is given by $E_{\{z_{1}^{1},z_{2}^{1},k_{2}^{1}\}}\left(u^{\ast}(\{z_{1}^{1},z_{2}^{1},k_{2}^{1}\})\right)=u^{\ast}(z_{1}^{1})+2\times u^{\ast}(z_{2}^{1})+4\times u^{\ast}(k_{2}^{1})$ - the first row sum bit, the second row sum bit, and what can be considered the third row sum bit, each weighted by its respective power of two. This must be zero if $u^{\ast}$ defines subsets of equal sums, and therefore $u^{\ast}$ must be zero on each of them.

Figure 2: Subfigure A shows the reduced embedding of the EQUAL SUBSET SUM instance with the (improper) integer set $\{1,1,2\}$. Each box represents a generalized full adder, and each adder corresponds to a submatrix of the matrix $\tilde{K}^{M}$ (check Eq. 38). Subfigure B shows the full embedding of the same instance.

The resulting matrix $\tilde{K}^{M}$ - here we use a tilde to signify that we are in the reduced construction case - can be represented as:

$$\tilde{K}^{M}=\;\begin{pmatrix}
1&1&0&1&1&1&0&0&0&0&0&0&0\\
0&0&0&1&0&1&1&1&0&0&0&0&0\\
0&0&0&0&1&1&1&1&0&0&0&0&0\\
0&0&0&1&0&0&1&0&0&0&0&0&0\\
0&0&0&0&1&0&1&0&0&0&0&0&0\\
0&0&0&0&0&1&0&1&0&0&0&0&0\\
0&0&1&0&0&0&1&0&1&1&1&0&0\\
0&0&0&0&0&0&0&0&1&0&1&1&1\\
0&0&0&0&0&0&0&0&0&1&1&1&1\\
0&0&0&0&0&0&0&0&1&0&0&1&0\\
0&0&0&0&0&0&0&0&0&1&0&1&0\\
0&0&0&0&0&0&0&0&0&0&1&0&1\\
0&0&0&0&0&0&0&1&0&0&0&0&0\\
0&0&0&0&0&0&0&0&0&0&0&1&0\\
0&0&0&0&0&0&0&0&0&0&0&0&1
\end{pmatrix}\qquad(38)$$
where the columns correspond, in order, to $s_{1}^{1},s_{2}^{1},s_{3}^{2},x_{1}^{1},x_{1}^{2},x_{1}^{3},k_{1}^{1},z_{1}^{1},x_{2}^{1},x_{2}^{2},x_{2}^{3},k_{2}^{1},z_{2}^{1}$. The first six rows encode the first generalized adder, the next six the second, and the last three force $z_{1}^{1}$, $k_{2}^{1}$, and $z_{2}^{1}$ to zero.

One can check that if $\vec{\mu}^{\ast}=(1,1,-1,-1,-1,0,1,0,0,0,0,0,0)$, then $\tilde{K}^{M}\,\vec{\mu}^{\ast}=\vec{0}$. The vector $\vec{\mu}^{\ast}$ defines the assignment $u^{\ast}(S)=\{1,1,-1\}$ - since $s_{1},s_{2},s_{3}$ are the first three entries of the vectorized form. This defines the two sets $A=\{s_{1},s_{2}\}$ and $B=\{s_{3}\}$ as a solution to the EQUAL SUBSET SUM problem posed. One can check that $\vec{\mu}^{\ast}=(1,-1,0,0,0,0,0,0,0,0,0,0,0)$ is also a solution, corresponding to the sets $A=\{s_{1}\}$ and $B=\{s_{2}\}$.
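Both solution vectors can be verified directly; a sketch with the matrix of Eq. (38) written out (column order $s_{1}^{1},s_{2}^{1},s_{3}^{2},x_{1}^{1},x_{1}^{2},x_{1}^{3},k_{1}^{1},z_{1}^{1},x_{2}^{1},x_{2}^{2},x_{2}^{3},k_{2}^{1},z_{2}^{1}$):

```python
import numpy as np

KM = np.array([
    [1,1,0, 1,1,1,0,0, 0,0,0,0,0],   # adder 1: s1^1 + s2^1 + ancillas
    [0,0,0, 1,0,1,1,1, 0,0,0,0,0],
    [0,0,0, 0,1,1,1,1, 0,0,0,0,0],
    [0,0,0, 1,0,0,1,0, 0,0,0,0,0],
    [0,0,0, 0,1,0,1,0, 0,0,0,0,0],
    [0,0,0, 0,0,1,0,1, 0,0,0,0,0],
    [0,0,1, 0,0,0,1,0, 1,1,1,0,0],   # adder 2: s3^2 + k1^1 + ancillas
    [0,0,0, 0,0,0,0,0, 1,0,1,1,1],
    [0,0,0, 0,0,0,0,0, 0,1,1,1,1],
    [0,0,0, 0,0,0,0,0, 1,0,0,1,0],
    [0,0,0, 0,0,0,0,0, 0,1,0,1,0],
    [0,0,0, 0,0,0,0,0, 0,0,1,0,1],
    [0,0,0, 0,0,0,0,1, 0,0,0,0,0],   # force z1^1 = 0
    [0,0,0, 0,0,0,0,0, 0,0,0,1,0],   # force k2^1 = 0
    [0,0,0, 0,0,0,0,0, 0,0,0,0,1],   # force z2^1 = 0
])

mu1 = np.array([1, 1, -1, -1, -1, 0, 1, 0, 0, 0, 0, 0, 0])  # A={s1,s2}, B={s3}
mu2 = np.array([1, -1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0])    # A={s1}, B={s2}
assert not np.any(KM @ mu1) and not np.any(KM @ mu2)
```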

In this reduced construction, we only used the generalized full adder for the significant bits of each $s_{j}\in S$ at a given bit entry $i$. This greatly reduces the size of the resulting embedding, but hopefully still conveys the principal idea behind our reduction. While we could write a general protocol on the same principle, it would require a more involved strategy than the one we take.

A.3 The Simple Unreduced Case

To simplify the construction of the embedding at the cost of increasing its size, we follow the same logic as before, but do not prune the insignificant bits. In the unreduced construction, we “compute” the sum bit by bit. As in the reduced case, the resulting sum bit at the end of each layer corresponds to a bit entry of the sum defined by $u^{\ast}$ - recall that, in the end, the value of $\sum_{a\in A}a-\sum_{b\in B}b$ was described bitwise by the value $u^{\ast}$ took on the last sum bit in each layer plus the last layer's carry bit (e.g. $E_{S}(u^{\ast}(S))=u^{\ast}(z_{1})+2\times u^{\ast}(z_{2})+4\times u^{\ast}(k_{2})$). This remains the same in the unreduced representation. Note that nonzero sums can have multiple bit representations when the entries can be negative or positive, i.e. $1=-1\times 1+1\times 2=1\times 1+0\times 2$, while the sum $0$ has only one.

In words, we add the bits of each integer in the corresponding bit entry, as well as all carry bits from the previous layer, to obtain the total sum of all the integers as if none of them had significant bits beyond this layer. Let $u^{\ast}(z_{end}^{i})$ be the last sum bit of any row $i$. Then the total sum up to the current layer $i$ can be written as $2^{i}\left(2\times q+s\right)+\sum_{j=1}^{i}2^{j-1}u^{\ast}(z_{end}^{j})$ for some $q$, where we identify $s$ with the last sum bit of the current layer, $u^{\ast}(z_{end}^{i})$, and $q$ with the net number of carries passed from layer $i$ to $i+1$.

Consider again the simple case described earlier: the (improper) set $S=\{1,1,2\}$, with the three values labeled $s_{1},s_{2},s_{3}$. Refer to Fig. 2B for the diagram this construction gives. Then $s_{1}^{1},s_{2}^{1},s_{3}^{1}$ are the first bits of each of the three. We use generalized full adders to add - bit by bit - the values $s_{1}^{1},s_{2}^{1},s_{3}^{1}$ and feed the resulting carry bits $k_{1}^{1},k_{2}^{1}$ to the next layer, while $z_{2}^{1}$ takes the value of the lowest bit entry of the total sum of the assignment. In the second layer we add - bit by bit - the values $s_{1}^{2},s_{2}^{2},s_{3}^{2},k_{1}^{1},k_{2}^{1}$ and feed the resulting carry bits $k_{1}^{2},k_{2}^{2},k_{3}^{2},k_{4}^{2}$ to the next layer, while $z_{4}^{2}$ takes the value of the second lowest bit entry of the total sum. Since the maximum bit entry was reached in row two, row three adds only the carry bits $k_{1}^{2},k_{2}^{2},k_{3}^{2},k_{4}^{2}$, which generates the carry bits $k_{1}^{3},k_{2}^{3},k_{3}^{3}$ that are subsequently fed into layer four, while $z_{3}^{3}$ is the third lowest bit entry of the total sum. Layer four adds the carry values and generates the corresponding carry bits $k_{1}^{4},k_{2}^{4}$ as well as the sum bit $z_{2}^{4}$. Lastly, layer five adds these values and generates the corresponding carry bit $k_{1}^{5}$ as well as the last sum bit $z_{1}^{5}$. To complete our description, each layer has internal sum variables from each generalized adder. Every line in the diagram corresponds to a variable; variables between two boxes are intermediaries, such as all the carry bits except $k_{1}^{5}$ and all the sum bits except the last one in each layer. For example, layer one has $z_{1}^{1}$ - an intermediate sum bit passed from the first generalized adder to the second.
This is in contrast to $z_{2}^{1}$, which is the sum bit of the second generalized adder and the lowest bit entry of the total sum of the assignment. All carry bits except the very last one - the one from the single generalized adder in the last row - are intermediaries. Variables that are not between boxes are determined: $s_{i}^{j}$ is set to the $j$-th lowest bit of the $i$-th integer of the set $S$, while $z_{end}^{i}$ for each layer $i$ and $k_{1}^{5}$ are set to one in the corresponding matrix.

A.4 The General Unreduced Case

Before we turn our attention to a full protocol for the general unreduced case, we give a more intuitive and visual description of the reduction. Fig. 3 gives a schematic of what the general case looks like. Note that Fig. 2B fits precisely this description as well.


Figure 3: This figure shows the layout of the generalized complete adder enforcing that $u^{\ast}$ is only valid if the corresponding $u$ on $S=\{s_{1},\ldots,s_{n}\}$ is also valid. In each row, a box corresponds to a generalized adder (with the truth table given in Table 1), where the output of that whole row (labeled by the sum bit $z$) is zero if and only if $u^{\ast}$ is valid. After $m$ (the largest bit length of any $s_{i}\in S$) rows, the subsequent rows are fed only carries from the previous rows. As such, the number of generalized adders decreases by one per row, until the very last row, where both $z_{1}^{mn}$ and $k_{1}^{mn}$ must be zero for $u^{\ast}$ to be valid on the set. The final constraint matrix is a representation of this diagram, with each generalized adder contributing a submatrix that enforces the relationship shown in Table 1 (check Eq. 15).

We denote the generalized functions with the truth table corresponding to Table 1 by $GA_{s}$ and $GA_{k}$ for the sum and carry bit respectively, so the constraints we considered earlier enforce $u^{\ast}(c)=GA_{k}(u^{\ast}(a),u^{\ast}(b))$ and $u^{\ast}(s)=GA_{s}(u^{\ast}(a),u^{\ast}(b))$. We use the common convention of writing $u^{\ast}(a_{1},\ldots,a_{k})$ as a condensed form of $(u^{\ast}(a_{1}),\ldots,u^{\ast}(a_{k}))$, so that $GA_{k}(u^{\ast}(a),u^{\ast}(b))\equiv GA_{k}(u^{\ast}(a,b))$. These are not proper functions, since $GA_{s}$ and $GA_{k}$ sometimes have two valid modes of operation. We also define $GA_{s}(u^{\ast}(s_{1:k}))=GA_{s}\left(GA_{s}\left(\ldots GA_{s}\left(u^{\ast}(s_{1}),u^{\ast}(s_{2})\right),\ldots\right),u^{\ast}(s_{k})\right)$ to condense our writing.
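A minimal sketch of $GA_{k}$ and $GA_{s}$ as Python functions, with a flag (our own device, since the paper's functions are multi-valued) selecting the secondary mode when $a+b$ is odd:

```python
def GA(a, b, secondary=False):
    """Generalized full adder: return (c, s) with a + b == 2*c + s.
    When a + b is even the pair is unique; when odd, `secondary=True`
    selects the mode that propagates an extra carry."""
    total = a + b
    if abs(total) == 1:
        return (total, -total) if secondary else (0, total)
    return (total // 2, 0)  # total in {-2, 0, 2}; note -2 // 2 == -1

# Every mode satisfies the defining relation 2c + s = a + b.
assert all(2 * c + s == a + b
           for a in (-1, 0, 1) for b in (-1, 0, 1)
           for sec in (False, True)
           for c, s in [GA(a, b, sec)])
print(GA(1, 0), GA(1, 0, secondary=True))  # (0, 1) (1, -1)
```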

To enforce the right bit addition, we use the following protocol:

  • 1.

    Let $l=1$, $\mathscr{K}=\emptyset$, and $\mathscr{A}=\emptyset$.

  • 2.

    Generate $(n-1)l$ carry bits $(k_{1}^{l},\ldots,k_{(n-1)l}^{l})$ and $(n-1)l$ sum bits $(z_{1}^{l},\ldots,z_{(n-1)l}^{l})$, appending both to $\mathscr{A}$. Add $n-1$ constraints to $\mathscr{K}$ that enforce $GA$ between $s_{1}^{l},\ldots,s_{n}^{l}$ in order, such that any assignment $u^{\ast}$ is valid if and only if

    $u^{\ast}(k_{1}^{l})=GA_{k}(u^{\ast}(s_{1}^{l},s_{2}^{l}))$  (39)
    $u^{\ast}(z_{1}^{l})=GA_{s}(u^{\ast}(s_{1}^{l},s_{2}^{l}))$  (40)
    $u^{\ast}(k_{i}^{l})=GA_{k}(u^{\ast}(z_{i-1}^{l},s_{i+1}^{l}))=GA_{k}(GA_{s}(u^{\ast}(s_{1:i}^{l})))\;\;\forall i\in[2,n-1]$  (41)
    $u^{\ast}(z_{i}^{l})=GA_{s}(u^{\ast}(z_{i-1}^{l},s_{i+1}^{l}))=GA_{s}(u^{\ast}(s_{1:i+1}^{l}))\;\;\forall i\in[2,n-1]$  (42)

    Then add $nl$ constraints to $\mathscr{K}$ that enforce $GA$ between $\{k_{1}^{l-1},\ldots,k_{(n-1)(l-1)}^{l-1}\}$ (the carries in from the previous layer) and $z_{n-1}^{l}$, such that any assignment $u^{\ast}$ is valid if and only if:

    $u^{\ast}(k_{i}^{l})=GA_{k}(u^{\ast}(z_{i-1}^{l},k_{i-n}^{l-1}))=GA_{k}(GA_{s}(u^{\ast}(s_{1:n}^{l},k_{1:i-n}^{l-1})))\;\;\forall i\in[n,(n-1)l]$  (43)
    $u^{\ast}(z_{i}^{l})=GA_{s}(u^{\ast}(z_{i-1}^{l},k_{i-n}^{l-1}))=GA_{s}(u^{\ast}(s_{1:n}^{l},k_{1:i-n}^{l-1}))\;\;\forall i\in[n,(n-1)l]$  (44)
  • 3.

    Let $l=l+1$. If $l\leq m$, go to Step 2.

  • 4.

    After the last run of Step 2, we have $(n-1)m$ total carry bits. We now add layers feeding carries forward as before, but without introducing any new bits from the actual integers. As such, each layer will have one less carry bit generated than the layer before it.

  • 5.

    Let $r=1$.

  • 6.

    Generate $(n-1)m-r$ carry bits $\{k_{1}^{r+m},\ldots,k_{(n-1)m-r}^{r+m}\}$ and $(n-1)m-r$ sum bits $(z_{1}^{r+m},\ldots,z_{(n-1)m-r}^{r+m})$, appending both to $\mathscr{A}$. Add $(n-1)m-r$ constraints to $\mathscr{K}$ that enforce $GA$ on the carry bits of the previous layer. For the first such layer:

    $u^{\ast}(k_{1}^{m+1})=GA_{k}(u^{\ast}(k_{1}^{m},k_{2}^{m}))$  (45)
    $u^{\ast}(z_{1}^{m+1})=GA_{s}(u^{\ast}(k_{1}^{m},k_{2}^{m}))$  (46)
    $u^{\ast}(k_{i}^{m+1})=GA_{k}(u^{\ast}(z_{i-1}^{m+1},k_{i+1}^{m}))\;\;\forall i\in[2,m(n-1)-1]$  (47)
    $u^{\ast}(z_{i}^{m+1})=GA_{s}(u^{\ast}(z_{i-1}^{m+1},k_{i+1}^{m}))\;\;\forall i\in[2,m(n-1)-1]$  (48)

    For all the subsequent layers:

    $u^{\ast}(k_{1}^{m+r})=GA_{k}(u^{\ast}(k_{1}^{m+r-1},k_{2}^{m+r-1}))$  (49)
    $u^{\ast}(z_{1}^{m+r})=GA_{s}(u^{\ast}(k_{1}^{m+r-1},k_{2}^{m+r-1}))$  (50)
    $u^{\ast}(k_{i}^{m+r})=GA_{k}(u^{\ast}(z_{i-1}^{m+r},k_{i+1}^{m+r-1}))=GA_{k}(GA_{s}(u^{\ast}(k_{1:i+1}^{m+r-1})))\;\;\forall i\in[2,m(n-1)-r]$  (51)
    $u^{\ast}(z_{i}^{m+r})=GA_{s}(u^{\ast}(z_{i-1}^{m+r},k_{i+1}^{m+r-1}))=GA_{s}(u^{\ast}(k_{1:i+1}^{m+r-1}))\;\;\forall i\in[2,m(n-1)-r]$  (52)
  • 7.

    Let $r=r+1$. If $r\leq m(n-1)$, go to Step 6.

  • 8.

    Lastly, add constraints forcing the last sum bit in each row to be zero; those constraints are simply $\{z_{(n-1)l}^{l}\}$ for $l\in\{1,\ldots,m\}$ and $\{z_{m(n-1)-r}^{m+r}\}$ for $r\in\{1,\ldots,m(n-1)\}$ (check Fig. 3). We also add $\{k_{1}^{m(n-1)}\}$.

Theorem A.1.

Suppose there exists $u$ such that $\sum_{i=1}^{n}u_{i}s_{i}=0$; then and only then does there exist $u^{\ast}$ such that $u^{\ast}(z_{end}^{l})=u^{\ast}(k_{1}^{mn})=0$ (where $z_{end}^{l}$ refers to the last sum bit in each row, as shown in Fig. 3). Then $E_{S}(u(S))=0\Leftrightarrow E_{S\cup\mathscr{A}}(u^{\ast}(S\cup\mathscr{A}))=0\Leftrightarrow K^{M}\,\vec{\mu}^{\ast}=0$.

Proof.

We first consider the forward direction. First recognize that $\sum_{i=1}^{n}u_{i}s_{i}=\sum_{i=1}^{n}\sum_{j=1}^{m}u_{i}s_{i}^{j}2^{j-1}$. It must be that $\sum_{i=1}^{n}u_{i}s_{i}^{1}\bmod 2=0$, so $\sum_{i=1}^{n}u_{i}s_{i}^{1}=\sigma^{1}\in\{\ldots,-4,-2,0,2,4,\ldots\}$. Note that if inputs $a$ and $b$ have different signs, the generalized adder sets the carry and sum bits to zero; if $a$ and $b$ have the same sign, they pass a carry; and when one is zero, the other is simply passed on using the primary operations of $GA_{k}$ and $GA_{s}$. In the forward direction of the proof, we only need to consider the primary operations. As such, it is clear that $u^{\ast}(z_{n-1}^{1})=0$, since the number of positive and negative inputs added is zero modulo 2. It should also be straightforward to see that $\sum_{i=1}^{n-1}u^{\ast}(k_{i}^{1})=\frac{\sigma^{1}}{2}$. Now recognize that $\left(\sum_{i=1}^{n}\sum_{j=1}^{l}u_{i}s_{i}^{j}2^{j-1}\right)/2^{l-1}=\sigma^{l}\in\{\ldots,-4,-2,0,2,4,\ldots\}$. Note that $\sigma^{l}=\frac{\sigma^{l-1}}{2}+\sum_{i=1}^{n}u_{i}s_{i}^{l}$, where we can identify $\frac{\sigma^{l-1}}{2}=\sum_{i=1}^{f[l-1]}u^{\ast}(k_{i}^{l-1})$, with $f[x]=\Theta(m-x)x(n-1)+\Theta(x-m)(m(n-1)-x)$. Here $f[x]$ uses a Heaviside step function to differentiate between the indexing of rows generated by Step 2 of the protocol and those generated later by Step 6. Again it is clear that $u^{\ast}(z_{f(l)}^{l})=0$, since the number of positive and negative inputs of $u^{\ast}(s_{1:n}^{l},k_{1:f(l-1)}^{l-1})$ is zero modulo 2. Since $\sum_{i=1}^{n}|s_{i}|<n2^{m}$, we need only worry about at most $m+\log(n)$ rows, but we have $mn$ rows as zero for $u^{\ast}$.

We now consider the backward direction. The proof looks very similar to the forward direction, but now we must also consider that $u^{\ast}$ could make use of secondary operations, not just primary ones. Suppose that in a specific row we used the secondary operations, e.g. $\tilde{GA}_{k}(1,0)=1$ and $\tilde{GA}_{s}(1,0)=-1$, where the tilde alerts the reader that these are specifically the secondary operations. We know that $u^{\ast}(z_{end}^{l})=0$ for any layer $l$ by assumption, and so the number of $\tilde{GA}$ operations is even. They must be of opposite kinds, such that the total number of carries is unchanged (since they are still valid operations with $2\,c+s=a+b$) - for every operation that propagates an extra carry at the expense of reducing its sum bit, there must be a secondary operation that reduces its carry bit in favor of its sum bit. If not, then $u^{\ast}(z_{end}^{l})\neq 0$. We can therefore replace them with the primary operations, and the rest of the argument follows through as before. We have $u^{\ast}(z_{n-1}^{1})=0$, and since $\sum_{i=1}^{n}u^{\ast}(s_{i}^{1})=\sum_{i=1}^{n-1}2\times u^{\ast}(k_{i}^{1})+u^{\ast}(z_{n-1}^{1})$ with $u^{\ast}(z_{n-1}^{1})=0$, we have $\frac{\sigma^{1}}{2}=\sum_{i=1}^{n-1}u^{\ast}(k_{i}^{1})$. Again $\sigma^{l}+u^{\ast}(z_{end}^{l})=\frac{\sigma^{l-1}}{2}+\sum_{i=1}^{n}u^{\ast}(s_{i}^{l})$, and we know that $u^{\ast}(z_{end}^{l})=0$. By the same bound, we know that having zero on all the outputs after $mn$ rows is sufficient to see that $\sum_{i=1}^{n}\sum_{j=1}^{m}u^{\ast}(s_{i})s_{i}^{j}2^{j-1}=0$, and so we let $u(s_{i})=u^{\ast}(s_{i})$ for every $s_{i}\in S$. ∎

Given an input integer set $S$, this protocol outputs a constraint set $\mathscr{K}=\{K_{1},\ldots,K_{|S\cup\mathscr{A}|}\}$ (over the variables with indices such that the entry is nonzero in $K^{M}$) such that an assignment vector $\vec{\mu}^{\ast}$ has value zero for every constraint in the set if and only if there exists a valid assignment for $S$ that defines two disjoint nonempty sets $A$ and $B$ such that $\sum_{a\in A}a-\sum_{b\in B}b=0$. We can then identify the constraint operators with the row vectors of $K^{M}$, taken as coefficients of spin-z operators on each qubit, $\hat{K}_{i}=\sum_{j}^{|S\cup\mathscr{A}|}k_{ij}^{m}\sigma_{j}^{z}$, such that our solver for 0-1-LP-QCOMMUTE finds a Hermitian matrix that commutes with these constraint operators.
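As a concrete, purely illustrative sketch of this identification, the snippet below embeds a small integer row as a diagonal operator $\sum_{j}k_{j}\sigma_{j}^{z}$ and checks that a computational basis state satisfies the linear constraint exactly when it is a zero-eigenvector. The row and helper names are our own, not outputs of the reduction:

```python
import numpy as np

Z = np.diag([1.0, -1.0])   # sigma_z; bit 0 -> spin +1, bit 1 -> spin -1
I = np.eye(2)

def constraint_op(row):
    """Embed an integer row vector as sum_j row[j] * sigma_z on qubit j."""
    n = len(row)
    K = np.zeros((2**n, 2**n))
    for j, k in enumerate(row):
        ops = [Z if i == j else I for i in range(n)]
        term = ops[0]
        for op in ops[1:]:
            term = np.kron(term, op)
        K += k * term
    return K

def basis_state(bits):
    v = np.zeros(2 ** len(bits))
    v[int("".join(map(str, bits)), 2)] = 1.0
    return v

row = [1, 1, 1, -1]                     # illustrative constraint z1 + z2 + z3 - z4 = 0
K = constraint_op(row)

feasible = basis_state([0, 0, 1, 0])    # spins (+1, +1, -1, +1): 1 + 1 - 1 - 1 = 0
infeasible = basis_state([0, 0, 0, 0])  # spins all +1: 1 + 1 + 1 - 1 = 2

assert np.allclose(K @ feasible, 0)     # feasible assignment: zero eigenvalue
assert not np.allclose(K @ infeasible, 0)
```

Because each $\hat{K}_{i}$ is diagonal in the spin-z basis, feasibility of an assignment is exactly the zero-eigenvalue condition used throughout the reduction.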

A.5 Proof of Runtime

In the worst case, we construct no more than $5$ new variables and $6$ constraints for each $GA$ illustrated in Fig. 3. For row $i<m+1$, this leads to no more than $5\,(n-1)\,i$ new variables and $6\,(n-1)\,i$ constraints. Then after row $m$, we have no more than $5\,(n-1)\,m^{2}$ variables and $6\,(n-1)\,m^{2}$ constraints. Row $i>m$ has no more than $m\,n-i$ generalized adders, creating no more than $5\,(m\,n-i)$ new variables and $6\,(m\,n-i)$ constraints. In total, we have no more than $\mathscr{O}\left(m^{2}n^{2}\right)$ variables and $\mathscr{O}\left(m^{2}n^{2}\right)$ constraints. The constraint matrix therefore has size $\mathscr{O}\left(m^{2}n^{2}\right)\times\mathscr{O}\left(m^{2}n^{2}\right)$. The reduction is therefore a polynomial time algorithm.
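These tallies can be sanity-checked numerically. The sketch below reads the bounds above as row $i\le m$ contributing at most $(n-1)\,i$ adders and row $i>m$ at most $mn-i$, with 5 variables and 6 constraints per adder; the function name and the explicit constants in the final bound are our own choices:

```python
# Worst-case variable/constraint tallies for the reduction, per the
# per-row adder bounds stated in the text (an interpretation, not the
# protocol itself).
def worst_case_counts(n, m):
    adders = sum((n - 1) * i for i in range(1, m + 1))      # rows 1..m
    adders += sum(m * n - i for i in range(m + 1, m * n))   # rows m+1..mn-1
    return 5 * adders, 6 * adders                            # (variables, constraints)

# Both tallies stay within O(m^2 n^2); concretely within 5 m^2 n^2 and 6 m^2 n^2.
for n in range(2, 10):
    for m in range(1, 10):
        nvars, ncons = worst_case_counts(n, m)
        assert nvars <= 5 * m**2 * n**2
        assert ncons <= 6 * m**2 * n**2
```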

A.6 Reducing a Solution of 0-1-LP-QCOMMUTE to a Solution of EQUAL SUBSET SUM

We consider the same setup as in Section V. Using the protocol from Section A (see Fig. 1), we can reduce any instance of the EQUAL SUBSET SUM problem with polynomial overhead; if any solution to 0-1-LP-QCOMMUTE exists, there must be $v$ and $w$ (and therefore the sets $A$ and $B$) describing at least one off-diagonal term in the spin-z basis. By our construction, the ancilla bits used for forcing are the bits beyond the $n$-th bit. Similarly, if a solution to EQUAL SUBSET SUM exists, by selecting the values of $v$ and $w$ to match the indices of the chosen elements of the sets $A$ and $B$, we can set the values of the first $n$ bits and then propagate their values through the generalized adders to find $v^{\prime}$ and $w^{\prime}$ over the enlarged space. Then $H=\bigotimes_{i=1}^{|S\cup\mathscr{A}|}\left(\sigma^{+}\right)^{v_{i}^{\prime}}\left(\sigma^{-}\right)^{w_{i}^{\prime}}+\bigotimes_{i=1}^{|S\cup\mathscr{A}|}\left(\sigma^{-}\right)^{v_{i}^{\prime}}\left(\sigma^{+}\right)^{w_{i}^{\prime}}$ solves 0-1-LP-QCOMMUTE. This leads to the following theorem:
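A tiny numerical instance of this construction confirms the commutation. The 4-qubit constraint row below is illustrative (not one produced by the protocol), and $v$ and $w$ have disjoint supports covering all qubits, so each tensor factor is simply $\sigma^{+}$ or $\sigma^{-}$:

```python
import numpy as np

Z  = np.diag([1.0, -1.0])
sp = np.array([[0.0, 1.0], [0.0, 0.0]])   # sigma^+
sm = sp.T                                  # sigma^-
I2 = np.eye(2)

def kron_all(ops):
    out = ops[0]
    for op in ops[1:]:
        out = np.kron(out, op)
    return out

k_row = [1, -1, 1, -1]          # constraint K = z1 - z2 + z3 - z4
v = [1, 1, 0, 0]                # raise qubits 1 and 2 ...
w = [0, 0, 1, 1]                # ... and lower qubits 3 and 4
assert sum(k * (vi - wi) for k, vi, wi in zip(k_row, v, w)) == 0   # K (v - w) = 0

K_hat = sum(k * kron_all([Z if i == j else I2 for i in range(4)])
            for j, k in enumerate(k_row))

A = kron_all([sp if vi else sm for vi in v])   # one off-diagonal term
H = A + A.T                                     # Hermitian driver term

assert np.allclose(K_hat @ H - H @ K_hat, 0)   # [H, K_hat] = 0
```

The key condition is the first assertion: the off-diagonal term connects two basis states with equal constraint eigenvalue precisely because $v-w$ lies in the kernel of the constraint row.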

Theorem A.2.

0-1-LP-QCOMMUTE is NP-Hard.

Through the same proof showing that ILP-QCOMMUTE is polynomially verifiable, 0-1-LP-QCOMMUTE is likewise polynomially verifiable.

Theorem A.3.

0-1-LP-QCOMMUTE is NP-Complete.

It also leads to an important corollary:

Corollary A.3.1.

{1,0,1}\{-1,0,1\}-LP-QCOMMUTE is NP-Complete.

As well as another proof of the result in Section V:

Corollary A.3.2.

ILP-QCOMMUTE is NP-Complete.

Appendix B Proof of the Matrix Implementation of the Generalized Adder

Given inputs $a$ and $b$, we define the matrix on $a,b,s,c,x_{1},x_{2},x_{3}$ - with $x_{1},x_{2},x_{3}$ being intermediate ancillas - as:

With the columns ordered as $(a,\,b,\,x_{1},\,x_{2},\,x_{3},\,c,\,s)$:

$$GA^{M}=\begin{pmatrix}1&1&1&1&1&0&0\\ 0&0&1&0&1&1&1\\ 0&0&0&1&1&1&1\\ 0&0&1&0&0&1&0\\ 0&0&0&1&0&1&0\\ 0&0&0&0&1&0&1\end{pmatrix} \tag{60}$$

As constraints, we can write it as:

$$\begin{aligned}
GA_{1}(a,b,x_{1},x_{2},x_{3}) &= 0, &\text{(61)}\\
GA_{2}(x_{1},x_{3},s,c) &= 0, &\text{(62)}\\
GA_{3}(x_{2},x_{3},s,c) &= 0, &\text{(63)}\\
GA_{4}(x_{1},c) &= 0, &\text{(64)}\\
GA_{5}(x_{2},c) &= 0, &\text{(65)}\\
GA_{6}(x_{3},s) &= 0. &\text{(66)}
\end{aligned}$$

For every generalized adder in Fig. 3 (as described in the protocol we gave in Section A.1), we have a submatrix over the corresponding variables. We give a simple case-by-case proof that $GA^{M}$ enforces that $u^{\ast}$ is valid if and only if its entries satisfy $2\,u^{\ast}(c)+u^{\ast}(s)=u^{\ast}(a)+u^{\ast}(b)$, as seen in Fig. 1.

A constraint is satisfied if and only if the assignment $u^{\ast}$ over the variables of that constraint sums to zero. An assignment then satisfies all of them if and only if $u^{\ast}(GA_{i})=0$ for all $i\in[1,6]$. Although checkable through brute-force calculation, we give simple arguments showing the emulation of bit addition step by step:

If $u^{\ast}(a)=u^{\ast}(b)=1$, then and only then do we have $u^{\ast}(c)=1$ and $u^{\ast}(s)=0$. Suppose $u^{\ast}(a)=u^{\ast}(b)=1$; then two of the auxiliary bits must have an assignment of $-1$ and one must not. If $u^{\ast}(x_{1})=-1$ then $u^{\ast}(c)=1$ by $GA_{4}$, and then $u^{\ast}(x_{2})=-1$ by $GA_{5}$, and vice versa. Then $u^{\ast}(x_{3})=0$, otherwise $GA_{1}$ cannot be satisfied. Then $u^{\ast}(s)=0$ as wanted. Conversely, suppose that $u^{\ast}(s)=0$ and $u^{\ast}(c)=1$; then $GA_{4}$ and $GA_{5}$ force $u^{\ast}(x_{1})=u^{\ast}(x_{2})=-1$. From $GA_{2}$ we have that $u^{\ast}(x_{3})=0$, and so from $GA_{1}$ that $u^{\ast}(a)=u^{\ast}(b)=1$.

If $u^{\ast}(a)=1$ and $u^{\ast}(b)=0$, or $u^{\ast}(a)=0$ and $u^{\ast}(b)=1$, then and only then do we have $u^{\ast}(c)=0$ and $u^{\ast}(s)=1$, or $u^{\ast}(c)=1$ and $u^{\ast}(s)=-1$. Suppose that $u^{\ast}(a)=1$ or $u^{\ast}(b)=1$, but not both. From $GA_{1}$, we know that either exactly one of the auxiliary bits takes the value $-1$, or two take the value $-1$ and one takes the value $1$. If either $x_{1}$ or $x_{2}$ takes the value $-1$ but not the other, then $GA_{4}$ and $GA_{5}$ lead to a contradiction; so if $x_{1}=x_{2}=-1$, then $x_{3}=1$, and by $GA_{6}$ it must be that $u^{\ast}(s)=-1$. Otherwise $x_{1}=x_{2}=0$, and so $u^{\ast}(x_{3})=-1$ by $GA_{1}$; then $GA_{6}$ forces $u^{\ast}(s)=1$. Conversely, suppose that $u^{\ast}(c)=0$ and $u^{\ast}(s)=1$. Then $u^{\ast}(x_{3})=-1$ from $GA_{6}$, and $u^{\ast}(x_{1})=u^{\ast}(x_{2})=0$ from $GA_{4}$ and $GA_{5}$. Then $GA_{1}$ is only satisfied if $u^{\ast}(a)=1$ or $u^{\ast}(b)=1$, but not both. Suppose instead that $u^{\ast}(c)=1$ and $u^{\ast}(s)=-1$. Then by $GA_{6}$ it must be that $x_{3}=1$, and by $GA_{4}$ and $GA_{5}$ it must be that $x_{1}=x_{2}=-1$. Then by $GA_{1}$ it must be that $u^{\ast}(a)$ or $u^{\ast}(b)$ is $1$, but not both.

If $u^{\ast}(a)+u^{\ast}(b)=0$, then and only then do we have $u^{\ast}(c)=0$ and $u^{\ast}(s)=0$. Suppose that $u^{\ast}(a)+u^{\ast}(b)=0$. From $GA_{1}$, we know that at most two auxiliary bits are non-zero, and if two are, they have opposite signs. From $GA_{4}$ and $GA_{5}$, if one of the first two auxiliary bits is non-zero then so is the other, and they must have the same sign. As such, $GA_{1}$ can only be satisfied with $u^{\ast}(x_{1})=u^{\ast}(x_{2})=u^{\ast}(x_{3})=0$. It then follows that $u^{\ast}(c)=u^{\ast}(s)=0$. Conversely, suppose that $u^{\ast}(c)=u^{\ast}(s)=0$. From $GA_{4}$, $GA_{5}$, and $GA_{6}$, we have that $u^{\ast}(x_{1})=u^{\ast}(x_{2})=u^{\ast}(x_{3})=0$. Then $GA_{1}$ can only be satisfied if $u^{\ast}(a)+u^{\ast}(b)=0$.

The same logic works if we swap the values 11 and 1-1 everywhere in the above proof.
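The case analysis can also be confirmed exhaustively. The following sketch (with an illustrative helper name) enumerates all assignments in $\{-1,0,1\}$ and checks that $GA_{1}$ through $GA_{6}$ are simultaneously satisfiable for $(a,b,c,s)$ exactly when $2c+s=a+b$:

```python
from itertools import product

def ga_satisfiable(a, b, c, s):
    """True iff some (x1, x2, x3) in {-1,0,1}^3 satisfies GA_1..GA_6."""
    return any(
        a + b + x1 + x2 + x3 == 0       # GA_1
        and x1 + x3 + s + c == 0        # GA_2
        and x2 + x3 + s + c == 0        # GA_3
        and x1 + c == 0                 # GA_4
        and x2 + c == 0                 # GA_5
        and x3 + s == 0                 # GA_6
        for x1, x2, x3 in product((-1, 0, 1), repeat=3)
    )

# The adder relation 2c + s = a + b is enforced exactly.
for a, b, c, s in product((-1, 0, 1), repeat=4):
    assert ga_satisfiable(a, b, c, s) == (2 * c + s == a + b)
```

The loop covers all $3^{4}$ input/output combinations (with $3^{3}$ ancilla settings each), so it checks both directions of the equivalence at once.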