This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Goal Seeking Quadratic Unconstrained Binary Optimization

Amit Verma [email protected] Mark Lewis Craig School of Business, Missouri Western State University, Saint Joseph, MO, 64507, United States
Abstract

The Quadratic Unconstrained Binary Optimization (QUBO) modeling and solution framework is a requirement for quantum and digital annealers. However optimality for QUBO problems of any practical size is extremely difficult to achieve. In order to incorporate the problem-specific insights, a diverse set of solutions meeting an acceptable target metric or goal is the preference in high level decision making. In this paper, we present two alternatives for goal-seeking QUBO for minimizing the deviation from a given target as well as a range of values around a target. Experimental results illustrate the efficacy of the proposed approach over Constraint Programming for quickly finding a satisficing set of solutions.

keywords:
Quadratic Unconstrained Binary Optimization, pseudo-Boolean optimization, goal seeking, what if, goal programming, quantum computer

1 Introduction

Quadratic Unconstrained Binary Optimization (QUBO) is a popular modeling framework wherein many combinatorial optimization problems can be recast into the form minxQx;x{0,1}min\;x^{\prime}Qx;x\in\{0,1\} where QQ is a symmetric matrix of size nn with integer or real components and xx is a binary vector (see [1] for a survey of the broad applicability of this modeling structure). Optimization software such as CPLEX and Gurobi find an exact solution that optimizes the objective function value. Constraint Programming (CP) finds feasible solutions satisfying a set of constraints. Thus, CP supports a decision maker who is interested in setting goals and generating a set of solutions to consider for implementation, for example, a target time to complete a task, multi-criteria profit, risk, or time-frame targets. In the mixed integer linear programming paradigm, the objective target is modeled as lower and upper bound constraints on the objective function. However for quadratic binary objectives, these constraints are also binary quadratic and are very difficult for conventional linear solvers such as CPLEX and Gurobi. In this short paper, we present two variants of a goal-seeking or target QUBO. In the first case, the target tt is exact and we are interested in finding binary vectors xx such that xQx=tx^{\prime}Qx=t. In the second case, the target is uncertain but defined by an interval [lb,ub][lb,ub].

Target QUBO belongs to the class of multi-criteria goal programming where the objective function xQxx^{\prime}Qx is provided a goal or target to be achieved and deviation from the target may be seen as the quadratic Taguchi quality loss function ([2]), although other measures such as absolute value are valid. Our implementation utilizes the squared deviation from the goal, which is minimized in the tabu-search heuristic. However, the importance of the magnitude of deviation from the target is typically determined by the decision-maker.

Our proposed model finds utility in four scenarios. First, the settings where we are interested in finding the input parameters that achieve a desired target level benefit from this analysis which is closely related to the “goal-seeking problem” defined in [3] and exemplified in the parameter selection method of Taguchi ([4]). For instance, near-optimal solutions in RNA folding prediction are often better predictors of RNA structure than the optimal, and the near-optimal solutions occur in the neighborhood around a target energy level ([5]). Second, in sensitivity analysis of a discrete event simulation (see [3] for more details). The third application arises in the context of the Constraint Satisfaction Problem modeled as a QUBO wherein a target number of constraints are specified for satisfaction.

Fourth, target QUBO can be utilized as a tool to solve multi-objective optimization problems. For instance, when the model appears as a subproblem for lexicographic optimization and the objectives are sorted from most important to least important ([6]). Next, we find the binary variables satisfying the inequality for the ithi^{\prime}th objective xQixfi+δix^{\prime}Q_{i}x\leq f_{i}^{*}+\delta_{i} which guarantees that the search focuses on near-optimal function evaluation fif_{i}^{*} controlled by a tolerance level δi\delta_{i} which translates to xQix[fi,fi+δi]x^{\prime}Q_{i}x\in[f_{i}^{*},f_{i}^{*}+\delta_{i}] in our modeling framework. Thus, the results are useful for lexicographic optimization wherein we are looking for solutions that are close to the optimal value. Moreover, various objectives can be combined into a weighted multi-objective function. Thus, the various solution techniques proposed for multi-objective QUBO (like [7, 8, 9, 10]) benefit from our analysis.

The paper is organized as follows. The goal-seeking QUBO model is presented in Section 2. The solution technique and the results are presented in Section 3 and 4, followed by conclusions and future research directions.

2 Model

First, we present the exact goal-seeking QUBO for a deterministic target tt. An exact model with quadratic penalties minimizing the squared deviation from the target is min(xQxt)2=min(xQx)22t(xQx)+t2min\;(x^{\prime}Qx-t)^{2}=min\;(x^{\prime}Qx)^{2}-2t(x^{\prime}Qx)+t^{2} wherein the first term has fourth-degree pseudo boolean functions of the form xixjxkxlx_{i}x_{j}x_{k}x_{l} significantly increasing the computational complexity. We could iteratively use Rosenberg quadratization substituting the quadratic subterm xixjx_{i}x_{j} with an additional binary variable zijz_{ij} thereby introducing a penalty term M(xixj2xizij2xjzij+3zij)M(x_{i}x_{j}-2x_{i}z_{ij}-2x_{j}z_{ij}+3z_{ij}) for a minimization problem. In this way, we could transform the fourth degree polynomial to a quadratic polynomial and utilize existing QUBO solvers. However, this reformulation leads to a large number of auxiliary variables and penalty terms ([11]).

While the number of auxiliary variables required for the transformation of a fourth degree goal-seeking QUBO is O(n2n^{2}), calculating the impact on the achievement function (xQxt)2(x^{\prime}Qx-t)^{2} as part of a one-flip search routine is fast given that the incremental function evaluation of xQxx^{\prime}Qx could be done in O(1)O(1) ([12]). Thus, selecting the best variable out of one-flip neighborhood is O(n)O(n). Only binary vectors that meet the target are collected as part of the search routine and presented to the decision-maker.

A common feature of integer optimization problems is the presence of alternate optima ([13]) and while it may not be practical to enumerate all alternate optima, our solution approach generates a list of binary solution vectors meeting the target value or range. This feature is important to multi-criteria decision-making, wherein the decision-maker selects the preferred solution from the list of pareto optimal solution vectors.

The landscape theory provides us techniques to quantify the impact on the number of local optima due to width of the target interval. According to [14], the number of local optima MM in a combinatorial landscape is given by the following expression:

M|X||X(xo,l)|M\approx\frac{|X|}{|X(x_{o},l)|} (1)

where |X||X| denotes the total size of the solution space and X(xo,l)X(x_{o},l) represents the set of solutions accessible from the vector xox_{o} in ll or less local moves. Note that the width of the interval around xox_{o} increases as ll increases. An estimate of MM for QUBO of size nn provided by [15] is given by:

M2ni=0l(ni)M\approx\frac{2^{n}}{\sum_{i=0}^{l}{n\choose i}} (2)

Using this estimate, we could derive the following related to Lemma 2 in [16]:

Lemma 1.

As ll increases, the estimate of the number of local optima increases.

Proof.

We need to prove that the expression of MM provided in Equation (2) is increasing in ll. According to [17], the denominator is approximated by:

i=0l(ni)2n1exp(n2l2)24(1+ln)\sum_{i=0}^{l}{n\choose i}\leq 2^{n-1}exp{-\frac{(n-2l-2)^{2}}{4(1+l-n)}} (3)

Substituting this in Equation (2), we have:

M2exp(n2l2)24(nl1)M\approx 2\;exp{-\frac{(n-2l-2)^{2}}{4(n-l-1)}} (4)

MM is increasing in ll as long as log(M)log(M) is increasing in ll. The first derivative of log(M)log(M) is given by a positive multiple of [n(l+1)][n(2l+2)][n2l][n-(l+1)][n-(2l+2)][n-2l]. Hence MM is increasing in ll as long as n2l+2n\geq 2l+2. In other words, for a sufficiently large nn the lemma holds. Thus the number of local optima in the optimization problem increases as the size of the neighborhood increases. Hence, the number of combinations leading to a target also increases. ∎

Second, we describe the interval goal-seeking QUBO that seeks to find solution vectors that satisfy xQx[lb,ub]x^{\prime}Qx\in[lb,ub]. This is attained by optimizing the achievement function min(xQxlb)(xQxub)=min(xQx)2(lb+ub)(xQx)+lbubmin\;(x^{\prime}Qx-lb)(x^{\prime}Qx-ub)=min\;(x^{\prime}Qx)^{2}-(lb+ub)(x^{\prime}Qx)+lb*ub. Again we face computational difficulties with solving this exact fourth-degree pseudo boolean function involving a large number of auxiliary variables and penalty terms. Alternatively, lower and upper bound constraints on the objective function can be used, but most solvers do not work well with quadratic binary constraints. Thus we integrate the target constraint into the solution method as described in Section 3.

Because of the quadratic nature of the achievement function, the optimal value is achieved at xQx=(lb+ub)/2x^{\prime}Qx=(lb+ub)/2. However, in general the achievement function is non-positive as long as xQx[lb,ub]x^{\prime}Qx\in[lb,ub]. Moreover, the binary vectors outside this interval lead to a higher penalty term, which increases with the squared distance from the interval bounds. This incentivizes the solver to limit the search to the solution landscape around the interval. Any binary vectors with non-positive function evaluation are easily collected during the search routine. Such vectors satisfying the interval constraint are presented to the decision-maker in descending order according to their objective function.

The following simple lemma captures the relationship between satisficing vectors and achievement function for either of the variants of goal-seeking QUBO.

Lemma 2.

Achievement function AF(x)0AF(x^{*})\leq 0 if and only if xx^{*} is a satisficing solution vector.

Proof.

The deterministic goal-seeking QUBO has AF(x)=(xQxt)2AF(x)=(x^{\prime}Qx-t)^{2}. Note that this quadratic function is always positive except for the case wherein xQx=tx^{\prime}Qx=t. Thus, AF(x)=0AF(x)=0 only when the binary vector xx meets the target tt.

Any solution vector xx for the interval goal-seeking QUBO with AF(x)=(xQxlb)(xQxub)AF(x)=(x^{\prime}Qx-lb)(x^{\prime}Qx-ub) should have objective function evaluation xQxx^{\prime}Qx belonging to one of the following three intervals: (i) (,lb)(-\infty,lb) (ii) [lb,ub][lb,ub] (iii) (ub,)(ub,\infty). If xQx(,lb)x^{\prime}Qx\in(-\infty,lb), then (xQxlb)(xQxub)>0(x^{\prime}Qx-lb)(x^{\prime}Qx-ub)>0 since both of the subterms evaluate to negative values. Similarly, if xQx(ub,)x^{\prime}Qx\in(ub,\infty), then (xQxlb)(xQxub)>0(x^{\prime}Qx-lb)(x^{\prime}Qx-ub)>0 because both subterms evaluate to positive values. The only scenario wherein AF(x)AF(x) given by (xQxlb)(xQxub)0(x^{\prime}Qx-lb)(x^{\prime}Qx-ub)\leq 0 happens for xQx[lb,ub]x^{\prime}Qx\in[lb,ub] because of the conflicting signs of the two components. ∎

3 Solution Technique

Our solution method is summarized in Algorithm 1. We utilize a tabu-search (TS) heuristic for finding the binary vectors that meet the user-defined goals. A neighborhood move is defined by a one-flip i.e. setting xix_{i} to 1xi1-x_{i} for a specific variable ii. The best impact on the objective function xQxx^{\prime}Qx due to one-flip is calculated in an incremental manner building on a previous solution requiring O(n)O(n) ([12]) to select the best one-flip from the nn variables in the neighborhood. We start from the all-zero solution and iteratively choose the non-tabu move that leads to the best improvement in the achievement function (AFAF). In case an improvement in AFAF is not possible i.e., we reach a local optima, we chose the non-improving and non-tabu move having the least impact on AFAF. As discussed earlier, AFAF is defined as (xQxt)2(x^{\prime}Qx-t)^{2} for deterministic target and (xQxlb)(xQxub)(x^{\prime}Qx-lb)(x^{\prime}Qx-ub) for interval target.

Algorithm 1 Tabu search heuristic
1:procedure Tabu Search(TSTS) \triangleright Returns the set SS of satisficing solutions
2:    Input: Q,xinitialQ,x_{initial} \triangleright QQ matrix and starting solution
3:    Input: TabuListTabuList \triangleright Tabu list of fixed length
4:    Output: SS \triangleright Set containing output vectors satisfying target
5:    SS\leftarrow\emptyset
6:    xxinitialx_{*}\leftarrow x_{initial}
7:    while termination criteria is not met do
8:       bestAFbest_{AF}\leftarrow\infty
9:       for each xNeighborhood(x)x\in\;Neighborhood(x_{*}) do \triangleright For all one flip neighbors of best solution
10:          if xTabuListx\notin\;TabuList then \triangleright Variable is non-tabu
11:             if AF(x)<bestAFAF(x)<best_{AF} then \triangleright Solution vector improves achievement function
12:                xxx_{*}\leftarrow x
13:                bestAFAF(x)best_{AF}\leftarrow AF(x)                               
14:       TabuListUpdate(TabuList)TabuList\leftarrow Update(TabuList) \triangleright Remove a variable from the tabu list
15:       TabuListUpdate(TabuList,x)TabuList\leftarrow Update(TabuList,x_{*}) \triangleright Add the best choice variable to the tabu list
16:       if AF(x)0AF(x_{*})\leq 0 then \triangleright Solution vector meets the specified target
17:          SSxS\leftarrow S\bigcup x_{*} \triangleright Add the solution vector to the output set            
18:    SPostprocess(S)S\leftarrow Postprocess(S) \triangleright Remove the duplicates from output set SS
19:    return SS

To help prevent cycling to previously visited solutions, we define a tabu list consisting of the tabu tenure of the variables that were flipped. Thus, a non-tabu variable xix_{i} is added to the tabu list with maximum tabu tenure if it was the best choice for one-flip. The variable is eventually removed from the tabu list after a fixed number of iterations set by the tabu tenure parameter. Note that this parameter has an impact on the computational performance of the heuristic. In general, as the tabu tenure increases, the chances for cycling reduces while a too large tabu tenure parameter inhibits the ability to reach a target.

Compared to a traditional heuristic wherein we are interested in finding the best solution in a limited runtime, we focus on finding multiple solutions that meet the user-defined objective goal. In this way, we can provide various choices to the decision-maker. The termination criteria in our TS heuristic is the solution runtime and our algorithm might find duplicate solutions due to cycling, so the end of the routine, we run a post-processing method that identifies the unique solutions among the solution set.

The solution quality of target QUBO is assessed by the number of unique solutions obtained by the solver in a limited runtime. Because the tabu tenure parameter has an impact on the solution quality, we conducted preliminary experiments to determine a good setting. For this purpose, we utilized the 25002500 node ORLIB instances ([18]). The deterministic target was set at 80%80\% of the best known solutions of the problems (obtained from [19]). In general, smaller tabu tenures lead to quick solutions, but tend to cycle, and the solver is often stuck in local optima. On the other hand, large tabu tenure restricts the neighborhood visited by the solver. Based on preliminary experiments, we observed that tabu tenure between 1010 and 2020 provides good results on the ORLIB problems and we set the tabu tenure to 1010 for the problem set in the following section.

4 Results

For computational experiments, we use the QUBO instances with 2500,30002500,3000 and 40004000 variables (presented in [18] and [20]). The solution technique was implemented using C. We also utilize the IBM Constraint Programming (CP) Optimizer for benchmarking with the exact model. The exact non-linear model for the two variants namely xQx=tx^{\prime}Qx=t and xQx>=lb&xQx<=ubx^{\prime}Qx>=lb\;\&\;x^{\prime}Qx<=ub was input to the CP solver using Python DOCPLEX API. The target level is set as a higher percentage of best known solutions (bksbks) namely 80%,85%,90%80\%,85\%,90\% and 95%95\%. The experiments were performed on a 3.40 GHz Intel Core i7 processor with 16 GB RAM running 64 bit Windows 7 OS.

In terms of benchmarking, the IBM CP solver was unable to find any feasible solution for either of the problem variants in 100100 seconds and for bigger instances, the size of the exact model was too large in terms of memory requirements. We also tried the default portfolio of solvers provided by MiniZinc ([21]) for the non-linear models, and those also yielded no solutions. Thus, these problems are computationally challenging.

First, we present the results using our heuristic for the exact target QUBO problem xQx=tx^{\prime}Qx=t in Table 1. It is evident from the results that as we get closer to bksbks, the goal-setting problem becomes more challenging and we conclude that the solution landscape for these problems are distributed such that the number of feasible solutions decrease as the target approaches bksbks.

Table 1: Number of unique solutions for xQx=tx^{\prime}Qx=t
Instance 80% 85% 90% 95% Instance 80% 85% 90% 95%
2500.1 31 27 22 15 3000.1 28 22 19 17
2500.2 21 17 8 6 3000.2 61 52 41 30
2500.3 30 25 19 16 3000.3 60 54 45 36
2500.4 36 36 20 19 3000.4 42 33 27 16
2500.5 32 31 30 22 3000.5 31 26 18 14
2500.6 26 24 14 8 4000.1 38 30 24 14
2500.7 60 59 39 31 4000.2 29 18 14 8
2500.8 27 23 16 16 4000.3 33 28 24 12
2500.9 45 37 33 18 4000.4 29 24 19 16
2500.10 33 23 19 8 4000.5 52 50 44 27

Second, we investigate the performance of our heuristic on interval targets i.e. xQx[lb,ub]x^{\prime}Qx\in[lb,ub] in Table 2. For this case, we choose the intervals as (80%,85%)(80\%,85\%),(85%,90%)(85\%,90\%),(90%,95%)(90\%,95\%) and (95%,100%)(95\%,100\%) of bksbks. Note that (lb,ub)(lb,ub) is equivalent to [lb+1,ub1][lb+1,ub-1] for functions with integral coefficients. We again observe from the results that the intervals close to bksbks do not contain as many feasible solutions. Moreover, the number of unique feasible solutions found while satisfying an interval goal as well as the deterministic goal are dependent on the solution landscape which is problem specific.

The diversity attributes for the set of solution vectors (SS) output by the heuristic for a specific target were also analyzed. For this purpose, the hamming distance h(x,y)h(x,y) between two solution vectors xx and yy given by the number of difference bits was computed for each pair of solution vectors. Thereafter, we calculated the mean hamming distance given by x,ySxyh(x,y)/(|S|2)\sum_{x,y\in S}^{x\neq y}h(x,y)/{|S|\choose 2}. The denominator represents the number of unique solution pairs. The mean hamming distance for all output sets was between 11 and 22. The median hamming distance is 11 or 22 while the maximum hamming distance is 22 or 33 bits. Thus the solutions found were not diverse. In other words, alternate optima for a specific target near bksbks are located in close proximity in the solution landscape. Based on this finding, it is more likely to find the next satisficing binary vector near the current one.

Table 2: Number of unique solutions for xQx[lb,ub]x^{\prime}Qx\in[lb,ub]
Instance (80,85) (85,90) (90,95) (95,100) Instance (80,85) (85,90) (90,95) (95,100)
2500.1 29 22 14 10 3000.1 19 15 13 10
2500.2 15 11 7 4 3000.2 42 33 26 16
2500.3 23 23 16 10 3000.3 42 37 27 19
2500.4 32 25 20 13 3000.4 24 22 11 10
2500.5 30 29 17 13 3000.5 17 17 14 8
2500.6 19 14 9 7 4000.1 25 20 12 6
2500.7 52 46 25 16 4000.2 21 10 11 5
2500.8 18 17 15 13 4000.3 18 16 14 8
2500.9 29 24 16 6 4000.4 16 18 9 5
2500.10 22 15 14 5 4000.5 36 29 26 11

5 Conclusions and Future Research

We present a novel model for two variants of goal-seeking QUBO involving either deterministic or uncertain targets. A number of alternate satisficing solutions were obtained using a greedy one-flip tabu search routine. Testing on benchmark instances clearly established the efficacy of our approach compared to state-of-the-art CP solvers which were unable to find solutions to any of the problems in the test set.

Regarding more detailed future research, our technique is easily extendable to multiple targets. For instance, achieving target levels of t1t_{1} or t2t_{2} would require parallelized instances of the routine searching for the two target levels. These threads could operate independently. Moreover, two different objective goals t1t_{1} and t2t_{2} with priorities w1w_{1} and w2w_{2} could be handled by changing the achievement function to w1(xQ1xt1)2+w2(xQ2xt2)2w_{1}(x^{\prime}Q_{1}x-t_{1})^{2}+w_{2}(x^{\prime}Q_{2}x-t_{2})^{2}. In this way, we construct the efficient solution frontier by enumerating different choices of w1w_{1} and w2w_{2}. Herein, the weights are determined by relative importance according to the decision-maker and is closely related to weighted or non-preemptive goal programming ([22]).

References

  • [1] G. Kochenberger, J.-K. Hao, F. Glover, M. Lewis, Z. Lü, H. Wang, Y. Wang, The unconstrained binary quadratic programming problem: a survey, Journal of Combinatorial Optimization 28 (1) (2014) 58–81.
  • [2] G. Taguchi, V. Cariapa, Taguchi on robust technology development (1993).
  • [3] H. Arsham, Algorithms for sensitivity information in discrete-event systems simulation, Simulation Practice and Theory 6 (1) (1998) 1–22.
  • [4] H. Arsham, Goal-seeking problem in discrete event systems simulation, Microelectronics Reliability 37 (3) (1997) 391–395.
  • [5] M. W. Lewis, A. Verma, T. T. Eckdahl, Qfold: a new modeling paradigm for the rna folding problem, Journal of Heuristics (2021) 1–23.
  • [6] J. Marques-Silva, J. Argelich, A. Graça, I. Lynce, Boolean lexicographic optimization: algorithms & applications, Annals of Mathematics and Artificial Intelligence 62 (3) (2011) 317–343.
  • [7] A. Liefooghe, S. Verel, J.-K. Hao, A hybrid metaheuristic for multiobjective unconstrained binary quadratic programming, Applied Soft Computing 16 (2014) 10–19.
  • [8] M. Zangari, A. Pozo, R. Santana, A. Mendiburu, A decomposition-based binary aco algorithm for the multiobjective ubqp, Neurocomputing 246 (2017) 58–68.
  • [9] Y. Zhou, J. Wang, Z. Wu, K. Wu, A multi-objective tabu search algorithm based on decomposition for multi-objective unconstrained binary quadratic programming problem, Knowledge-Based Systems 141 (2018) 18–30.
  • [10] Y. Zhou, L. Kong, Z. Wu, S. Liu, Y. Cai, Y. Liu, Ensemble of multi-objective metaheuristic algorithms for multi-objective unconstrained binary quadratic programming problem, Applied Soft Computing 81 (2019) 105485.
  • [11] A. Verma, M. Lewis, Optimal quadratic reformulations of fourth degree pseudo-boolean functions, Optimization Letters 14 (6) (2020) 1557–1569.
  • [12] G. A. Kochenberger, F. Glover, B. Alidaee, C. Rego, A unified modeling and solution framework for combinatorial optimization problems, OR Spectrum 26 (2) (2004) 237–250.
  • [13] E. Hannan, Nondominance in goal programming, INFOR: Information Systems and Operational Research 18 (4) (1980) 300–309.
  • [14] P. F. Stadler, Fitness landscapes, in: Biological evolution and statistical physics, Springer, 2002, pp. 183–204.
  • [15] F. Chicano, E. Alba, Elementary landscape decomposition of the 0-1 unconstrained quadratic optimization, Journal of Heuristics 19 (4) (2013) 711–728.
  • [16] A. Verma, M. Lewis, Penalty and partitioning techniques to improve performance of qubo solvers, Discrete Optimization (2020) 100594.
  • [17] L. Lovász, J. Pelikán, K. Vesztergombi, Diskrete Mathematik, Springer-Verlag, 2006.
  • [18] J. E. Beasley, Or-library: distributing test problems by electronic mail, Journal of the operational research society 41 (11) (1990) 1069–1072.
  • [19] Y. Wang, Z. Lü, F. Glover, J.-K. Hao, Path relinking for unconstrained binary quadratic programming, European Journal of Operational Research 223 (3) (2012) 595–604.
  • [20] G. Palubeckis, Multistart tabu search strategies for the unconstrained binary quadratic optimization problem, Annals of Operations Research 131 (1) (2004) 259–282.
  • [21] N. Nethercote, P. J. Stuckey, R. Becket, S. Brand, G. J. Duck, G. Tack, Minizinc: Towards a standard cp modelling language, in: International Conference on Principles and Practice of Constraint Programming, Springer, 2007, pp. 529–543.
  • [22] H. D. Sherali, A. L. Soyster, Preemptive and nonpreemptive multi-objective programming: Relationship and counterexamples, Journal of Optimization Theory and Applications 39 (2) (1983) 173–186.