Quadratization in Discrete Optimization and Quantum Mechanics
Quadratization in Discrete Optimization and Quantum Mechanics
Nike Dattani∗
When optimizing discrete functions, it is often easier when the function is quadratic than if
it is of higher degree. But notice that the cubic and quadratic functions:
b1 b2 + b2 b3 + b3 b4 − 4b1 b2 b3 (cubic), (1)
b1 b2 + b2 b3 + b3 b4 + 4b1 − 4b1 b2 − 4b1 b3 (quadratic), (2)
arXiv:1901.04405v2 [quant-ph] 23 Sep 2019
where each bi can either be 0 or 1, both never go below the value of -2, and all minima occur
at (b1 , b2 , b3 , b4 ) = (1, 1, 1, 0). Therefore if we are interested in the ground state of a discrete
function of degree k, we may optimize either function and get exactly the same result. Part I
gives more than 40 different ways to do this, almost all of them published in the last 5 years.
The binary variables bi can be either of the eigenvalues of the matrix b below, which is
related to the Pauli z matrix by z = 2b − 11. The Pauli matrices x, y, z, 11 are listed below:
1 0 1 0 0 1 0 −i 1 0
b ≡ ( ), z ≡ ( ), x ≡ ( ), y ≡ ( ) , 11 ≡ ( ) . (3)
0 0 0 −1 1 0 i 0 0 1
Any Hermitian 2 × 2 matrix can be written as a linear combination of the Pauli matrices, so we
can therefore describe the Hamiltonian of any number of spin-1/2 particles by a function of
Pauli matrices acting on each particle, for instance:
x1 y2 z3 y4 + y1 x2 z3 y4 + x1 x2 y3 (cubic), (4)
x1 y4 + x2 y4 + x3 (quadratic), (5)
where the coefficients tell us about the strengths of couplings between these particles. The
Schrödinger equation tells us that the eigenvalues of the Hamiltonian are the allowed energy
levels and their eigenvectors (wavefunctions) are the corresponding physical states. More
generally these do not have to be spins but can be any type of qubits, and we can encode
the solution to any problem in the ground state of a Hamiltonian, then solve the problem
by finding the lowest energy state of the physical system (this is called adiabatic quantum
computing). Eqs. (5) and (4) have exactly the same energy spectra, so Eq. (5) is an example of
a type of quadratization.
Two-body physical interactions occur more naturally than many-body interactions so Parts
II-III give more than 30 different ways to quadratize general Hamiltonians (some of these
methods may use d × d matrices instead of only the 2 × 2 matrices in Eq. (3), meaning that we
can have types of qudits that are not qubits). All of these methods were published during the
last 15 years.
The optimization problems of Eqs. (1)-(2) are specific cases of the type in Eqs. (4)-(5), but
with only b matrices.
Part I
Diagonal Hamiltonians (pseudo-Boolean functions)
which has the same global minima as H4−local but one fewer quartic and one fewer cubic
term. The coefficient of b1 b2 was chosen as 6 because 6 ≥ max (4 + b3 + b3 b4 ).
Bibliography
• Original paper, with more implementation details, and application to integer
factorization: [1].
3
Summary
An Excludable Local Configuration (ELC) is a partial assignment of variables that
make it impossible to achieve the minimum. We can therefore add a term that corre-
sponds to the energy of this ELC without changing the solution to the minimization
problem. In practice we can eliminate all monomials with a variable in which a variable
is set to 0, and reduce any variable set to 1. Given a general objective function we can
try to find ELCs by enumerating solutions of a small subset of variables in the problem
[2].
Cost
• 0 auxiliary variables needed.
In both cases Eqs. (9) and (11), the only global minima occur when b1 b2 b3 = 1.
Bibliography
• Original paper and application to computerized image denoising: [2].
4
C. Groebner Bases
Summary
Given a set of polynomials, a Groebner basis is another set of polynomials that have
exactly the same zeros. The advantage of a Groebner basis is it has nicer algebraic
properties than the original equations, in particular they tend to have smaller degree
polynomials. The algorithms for calculating Groebner bases are generalizations of
Euclid’s algorithm for the polynomial greatest common divisor.
Work has been done in the field of ’Boolean Groebner bases’, but while the variables
are Boolean the coefficients of the functions are in F2 rather than Q.
Cost
• 0 auxiliary variables needed.
• O (22 ) in general, O(dn ) if the zeros of the equations form a set of discrete points,
n 2
where d is the degree of the polynomial and n is the number of variables [3].
Pros
• No auxiliary variables needed.
• General method, which can be used for other rings, fields or types of variables.
Cons
• Best algorithms for finding Groebner bases scale double exponentially in n.
b1 b2 b3 b4 + b1 b3 + b2 b4 − b3 = b1 + b1 b2 + b3 − 2 = 0. (12)
Feeding these to Mathematica’s GroebnerBasis function, along with the binarizing
b1 (b1 − 1) = . . . = b4 (b4 − 1) = 0 constraints, gives a Groebner basis:
{b4 b3 − b4 , b2 + b3 − 1, b1 − 1} . (13)
From this we can immediately read off the solutions b1 = 1, b2 = 1 − b3 and reduce
the problem to b3 b4 − b4 = 0. Solving this gives a final solution set of: (b1 , b2 , b3 , b4 ) =
(1, 0, 1, 0), (1, 0, 1, 1), (1, 1, 0, 0), which should be the same as the original 4-local prob-
lem.
Bibliography
• Reduction and embedding of factorizations of all bi-primes less than 200, 000: [4].
5
Summary
We use the formula from [5] for representing any function of three binary variables:
f (b1 , b2 , b3 ) = (f (1, 1, 1) + f (1, 0, 0) − f (1, 1, 0) − f (1, 0, 1) − f (0, 1, 1) − f (0, 0, 0) + (14)
f (0, 0, 1) + f (0, 1, 0)) b1 b2 b3 + (f (0, 1, 1) + f (0, 0, 0) − f (0, 0, 1) − f (0, 1, 0)) b2 b3 + (15)
(f (1, 0, 1) + f (0, 0, 0) − f (0, 0, 1) − f (1, 0, 0)) b1 b3 + (16)
(f (1, 1, 0) + f (0, 0, 0) − f (1, 0, 0) − f (0, 1, 0)) b1 b2 + (f (0, 1, 0) − f (0, 0, 0)) b2 (17)
+ (f (1, 0, 0) − f (0, 0, 0)) b1 + (f (0, 0, 1) − f (0, 0, 0)) b3 + f (0, 0, 0). (18)
If the cubic term is zero, then the function becomes quadratic. We can use ELM
(Energy Landscape Manipulation) to change the energy landscape without changing
the ground state [6]. In this case, we apply ELM in order to make the cubic term zero.
Cost
• No auxiliary variables.
• May require many evaluations of the cubic function, varying coefficients in order
to find the right ELM coefficients.
• For a particular value of m, we have (m
n
) different m-variable subsets of the n
variable problem, and (m)2 evaluations of the objective function to find all
n m
Bibliography
• The method was first presented in the first arXiv version of this book [7].
6
Summary
It is possible to reduce a lot of the problem by conditioning on the most connected
variables. We call each of these operations a split.
Cost
Usually slightly sub-exponential in the number of splits, as the number of problems
to solve at most doubles with every split, but often does not double (since entire cases
can get eliminated by some splits, as in the example below).
Pros
• This method can be applied to any problem and can be very effective on problems
with a few very connected variables.
Cons
• Multiple runs of the optimization procedure need to be made, and the number of
runs can often grow almost exponentially with respect to the number of splits.
Example
Consider the simple objective function
H = 1 + b1 b2 b5 + b1 b6 b7 b8 + b3 b4 b8 − b1 b3 b4 . (20)
In order to quadratize H, we first have to choose a variable over which to split. In this
case b1 is the obvious choice since it is present in the most terms and contributes to the
quartic term.
We then obtain two different problems:
H0 = 1 + b3 b4 b8 (21)
H1 = 1 + b2 b5 + b6 b7 b8 + b3 b4 b8 − b3 b4 . (22)
At this point, we could split H0 again and solve it entirely, or use a variable we saved in
the previous split to quadratize our only problem.
To solve H1 , we can split again on b8 , resulting in two quadratic problems:
H1,0 = 1 + b2 b5 − b3 b4 (23)
H1,1 = 1 + b2 b5 + b6 b7 . (24)
Now both of these problems are quadratic. Hence we have reduced our original,
hard problem into 3 easy problems, requiring only 2 extra (much easier) runs of our
minimization algorithm, and without needing any auxiliary variables.
Note that the number of quadratic problems to solve is 3, which is smaller than 22
which would be the "exponential" cost if the number of problems were (hypothetically)
to double with each split. This is a good example of the typical sub-exponential scaling
of split-reduc.
Bibliography
• Original paper and application to Ramsey number determination: [8].
7
Summary
For a negative term −b1 b2 ...bk , introduce a single auxiliary variable ba and make the
substitution:
− b1 b2 . . . bk → (k − 1)ba − ∑ bi ba . (25)
i
Cost
• 1 auxiliary variable for each k-local term.
Pros
• All resulting quadratic terms are submodular (have negative coefficients).
• Can reduce arbitrary order terms with only 1 auxiliary.
• Reproduces the full spectrum.
Cons
• Only works for negative terms.
Example
Alternate Names
• "Standard quadratization" of negative monomials [9].
• sk (b, ba ) [9]
Bibliography
• 2004: Kolmogorov and Zabih presented this for cubic terms [10].
• 2005: Generalized to arbitrary order by Freedman and Drineas [11].
• Discussion: [12], [13].
8
Summary
For a negative term −b1 b2 ...bk , introduce a single auxiliary variable ba and make the
substitution:
k−1 k−1 k
− b1 b2 . . . bk → ∑ bi − ∑ bi bk − ∑ bi ba + (k − 1)bk ba . (30)
i i i
Cost
• 1 auxiliary variable for each k-local term.
• 1 non-submodular term for each k-local termi (and it is quadratic).
Pros
• Can reduce arbitrary order terms with only 1 auxiliary.
• Reproduces the full spectrum.
Cons
• Only works for negative terms.
• Turns a symmetric term into a non-symmetric term (but only bk is asymmetric).
Example
Alternate Names
• "Extended standard quadratization" of negative monomials (see Eqs. 25-26 of [9]).
• sk (b, ba )+ [9].
Bibliography
• 2014, First presentation: [13, 14].
• Further discussion: [9].
9
Summary
For a negative term −b1 b2 ...bk , introduce a single auxiliary variable ba and make the
substitution:
Cost
• 1 auxiliary variable for each k-local term.
• Coefficients of quadratic terms are twice the size of their size in NTR-KZFD or
NTR-ABCG-1, and roughly twice the size for the linear term.
Example
1 k
−b1 b2 . . . bk = 2ba (k − − ∑ bi ) (36)
2 i=1
Summary
Cost
• 1 auxiliary variable per negative cubic term.
Pros
• Asymmetric which allows more flexibility in cancelling with other quadratics.
Cons
• Only works for negative cubic monomials.
Example
−b1 b2 b3 + b1 b3 − b2 = minba (ba − b1 ba − b3 ba + b2 ba + 2b1 b3 ) − b2 (41)
Alternate Forms
• By starting with (40), and flipping ba (i.e. setting ba → 1 − b̄a and relabelling ba → b̄a
since ba does not appear anywhere else in the function being quadratized), we
see that NTR-GBP can actually be derived from NTR-ABCG with k = 3).
Bibliography
• First introduced in: [15].
11
Summary
Using a ternary variable tq ∈ −1, 0, 1 we have:
− z1 z2 z3 → (1 + 4ta + z1 + z2 + z3 ) − 1.
2
(43)
Cost
• 1 auxiliary ternary variable
Pros
• One of the only methods designed specifically for z variables.
• Symmetric with respect to all variables.
Cons
• The auxiliary variable required is ternary (a qutrit).
• Requires all possile quadratic terms and they are all non-submodular.
• Only reproduces the ground state manifold.
Example
Alternate Forms
4 4 4
−z1 z2 z3 z4 → 16 t2a + 4 ta ∑ zi + 2 ∑ ∑ zi zj + 4 (45)
i=1 i=1 j>i
Bibliography
• Original paper where they introduce this gadget for the end points in the LHZ
lattice: [16].
12
Summary
Extra binary or ternary variable is added to ensure that the energy of the even parity
sector is zero and the energy of the odd sector is higher:
Cost
• 1 auxiliary ternary variable (see appendix for transformation to a binary variable)
Pros
• One of the only methods designed specifically for z variables.
Cons
• Only reproduces the ground state manifold, not higher excited states.
• Requires all possile quadratic terms and they are all non-submodular.
• Only reproduces the ground state manifold.
Example
−z1 z2 z3 z4 = −16 b1 b2 b3 b4 + 8 (b1 b2 b3 + b1 b2 b4 + b1 b3 b4 + b2 b3 b4 ) −
4 (b1 b2 + b1 b3 + b1 b4 + b2 b3 + b2 b4 + b3 b4 ) + 2 (b1 + b2 + b3 + b4 ) − 1
4 4 4
→ 16 t2a + 8 ta ∑ bi + 8 ∑ ∑ bi bj + 16 (46)
i=1 i=1 j>i
Alternate Forms
4 4 4
−z1 z2 z3 z4 → 16 t2a + 4 ta ∑ zi + 2 ∑ ∑ zi zj + 4 (47)
i=1 i=1 j>i
Bibliography
• Original paper: [17] (Eq. 4).
• Also given in Eq. 10 of [16] for i + 2 < j.
13
Summary
By considering the negated literals b̄i = 1 − bi , we recursively apply NTR-KZFD to
b1 b2 . . . bk = −b̄1 b2 . . . bk + b2 b3 . . . bk . The final identity is:
k−2 k
b1 b2 . . . bk → ( ∑ bai (k − i − 1 + bi − ∑ bj )) + bk−1 bk (48)
i=1 j=i+1
Cost
• k − 2 auxiliary variables for each k-local term.
Pros
• Works for positive monomials.
Cons
• k − 1 non-submodular quadratic terms.
Example
b1 b2 b3 b4 → ba1 (2 + b1 − b2 − b3 − b4 ) + ba2 (1 + b2 − b3 − b4 ) + b3 b4 (49)
Bibliography
• Summary: [18].
14
Summary
This method re-writes a positive monomial using symmetric polynomials, so all
possible quadratic terms are produced and they are all non-submodular:
nk k
b1 ...bk → (∑ bai (ci,d (− ∑ bj + 2i) − 1) + ∑ bi bj ) (50)
i=1 j=1 i<j
1, i = nd and k is odd,
where nk = ⌊ k−1
2
⌋ and ci,k = {
2, else.
Cost
• ⌊ k−1
2
⌋ auxiliary variables for each k-order term
• O(kt) for a k-local objective function with t terms.
Pros
• Works for positive monomials.
• About half as many auxiliary variables for each k-order term as the previous
method.
• Reproduces the full spectrum.
Cons
• O(k 2 ) quadratic terms are created, which may make chimerization more costly.
k(k−1)
• 2 non-submodular terms.
• Worse than the previous method for quartics, with respect to submodularity.
Example
b1 b2 b3 b4 → (3 − 2b1 − 2b2 − 2b3 − 2b4 )ba + b1 b2 + b1 b3 + b1 b4 + b2 b3 + b2 b4 + b3 b4 (51)
Alternate Forms
For even k, and equivalent expression is given in [19]:
Alternate Names
• "Ishikawa’s Symmetric Reduction" [15].
• "Ishikawa Reduction"
• "Ishikawa"
Bibliography
• Original paper and application to image denoising: [12].
• Equivalent way of writing it for even k, shown in [19].
15
Summary
This is very similar to the alternative form of Ishikawa Reduction, but works for odd
values of k, and is different from Ishikawa Reduction:
Cost
• Same number of auxiliaries as Ishikawa Reduction.
Pros
• Same as for Ishikawa Reduction.
Cons
• Same as for Ishikawa Reduction.
• Only works for odd k, but for even k we have an analogous method which is
equivalent to Ishikawa Reduction.
Alternate Forms
Bibliography
• Original paper: [19].
16
Summary
Let ⌈ k4 ⌉ ≤ m ≤ ⌈ k2 ⌉,
m−1
b1 b2 ⋯bk → αb ∑ bi + αba,1 ∑ bai + αba,2 bam + αbb ∑ bi bj + αbba,1 ∑ ∑ bi baj +
i i ij i j
m−1 m−1
(56)
α bba,2
∑ bi bam + α ba,1 ba,1
∑ bai baj + α ba,1 ba,2
∑ bai bam ,
i ij i
where:
⎛ α
b αbba,1 ⎞ ⎛ −1/2 −1 ⎞
⎜α ba,1 α bb a,2
⎟ ⎜ 1 −2 ⎟
⎜ ba,2 ba,1 ba,1 ⎟ = ⎜ 1 ⎟. (57)
⎜α α ⎟ ⎜ 2 (n − m + n2 − 2mn + m2 ) −(n − m)⎟
⎝ αbb αba,1 ba,2 ⎠ ⎝ 1/2 4(n − m) ⎠
Cost
⌈ k4 ⌉ auxiliary qubits per positive monomial.
Pros
• Smallest number of auxiliary coefficients that scales linearly with k.
• Smaller coefficients than the logarithmic reduction.
Cons
• Introduces many non-submodular terms.
Example
We quadratize a quartic term with only 1 auxiliary (half as many as in PTR-Ishikawa):
1
b1 b2 b3 b4 → (b1 + b2 + b3 + b4 − 2ba1 ) (b1 + b2 + b3 + b4 − 2ba1 − 1) (58)
2
Bibliography
• Original appears in: Theorem 7 of [19], and Theorem 10 of [20].
17
Summary
Pick m such that k < 2m+1 ,
b1 b2 . . . bk → α + αb ∑ bi + αbai ∑ 2i−1 bai + αbb ∑ bi bj + αbba ∑ bi baj + αbai baj bai baj , (59)
i i ij ij
where,
⎛ α αbb ⎞ ⎛ (2m − k)
2
1 ⎞
⎜ α α ⎟ = ⎜ 2 (2 − k) 2 ⎟ .
b bb a m j−1 (60)
⎝αba αba ba ⎠ ⎝−2 (2m − k) 2i+j−2 ⎠
Cost
⌈log k⌉ auxiliary qubits per positive monomial.
Pros
• Logarithmic number of auxiliary variables.
Cons
• Introduces all terms non-submodular except for the term linear in auxiliaries.
Example
1
b1 b2 b3 b4 → (4 + b1 + b2 + b3 + b4 − ba1 − 2ba2 ) (3 + b1 + b2 + b3 + b4 − ba1 − 2ba2 ) (61)
2
Alternate Forms
2
b1 b2 . . . bk → (2 − k + ∑ bi − ∑ 2
m i−1
bai ) (62)
i i
Bibliography
• Original paper (Theorem 4, special case of Theorem 1): [19].
18
Summary
Pick m such that k ≤ 2m+1 ,
m m
1 m+1
b1 . . . b k → (2 − k + ∑ bi − ∑ 2i bai ) (2m+1 − k + ∑ bi − ∑ 2i bai − 1) . (63)
2 i i i i
Cost
⌈log k⌉ − 1 auxiliary qubits per positive monomial.
Pros
• Logarithmic number of auxiliary variables.
Cons
• Introduces many non-submodular terms.
Example
1
b 1 b2 b3 b4 → (b1 + b2 + b3 + b4 − 2ba ) (b1 + b2 + b3 + b4 − 2ba − 1) (64)
2
Alternate Forms
Let X = ∑ bi and N = 2m+1 − k,
n n
1
b1 . . . bk = min (N + X − ∑ 2i b′i ) (N + X − ∑ 2i b′i − 1) . (65)
b1 ,...bn 2
′ ′
i=1 i=1
Bibliography
• Original paper (Theorem 5): [19], Also in (Theorem 9): [20].
19
Summary
Pick m such that k ≤ 2m+1 ,
m m
1 m+1
b1 . . . b k → (2 − k + ∑ bi − ∑ 2i bai ) (2m+1 − k + ∑ bi − ∑ 2i bai − 1) . (66)
2 i i i i
Cost
⌈log n⌉ − 1 auxiliary qubits per positive monomial.
Pros
• Logarithmic number of auxiliary variables.
1
b 1 b2 b3 b4 → (b1 + b2 + b3 + b4 − 2ba ) (b1 + b2 + b3 + b4 − 2ba − 1) (67)
2
Alternate Forms
Let X = ∑ bi and N = 2m+1 − k,
n n
1
b1 . . . bk = min (N + X − ∑ 2i b′i ) (N + X − ∑ 2i b′i − 1) . (68)
b1 ,...bn 2
′ ′
i=1 i=1
Bibliography
• Original paper (Theorem 9): [20].
20
Summary
Pick m such that k ≤ 2m+1 ,
m m
1 m+1
b1 . . . b k → (2 − k + ∑ bi − ∑ 2i bai ) (2m+1 − k + ∑ bi − ∑ 2i bai − 1) . (69)
2 i i i i
Cost
⌈log n⌉ auxiliary qubits per positive monomial.
Pros
• Logarithmic number of auxiliary variables.
1
b 1 b2 b3 b4 → (b1 + b2 + b3 + b4 − 2ba ) (b1 + b2 + b3 + b4 − 2ba − 1) (70)
2
Alternate Forms
Let X = ∑ bi and N = 2m+1 − k,
n n
1
b1 . . . bk = min (N + X − ∑ 2i b′i ) (N + X − ∑ 2i b′i − 1) . (71)
b1 ,...bn 2
′ ′
i=1 i=1
Bibliography
• Original paper (Remark 5): [20].
21
Summary
The CCG-based quadratization algorithm is an iterative algorithm. The CCG (Con-
straint composite graph) is a combinatorial structure associated with an optimization
problem posed as the weighted constraint satisfaction problem.
Cost
• Each positive monomial of degree i generates 2 auxiliary variables when it is
reduced to the sum of a quadratic polynomial and a monomial of degree i − 1,
which can then be combined with existing monomials of degree i − 1 if they are
composed of the same variables. This combination of monomials can take place in
each iteration, until the whole pseudo-Boolean function (PBF) becomes quadratic.
• Same as Ishikawa’s method, use 1 auxiliary variable for each negative monomial.
• For a k-local objective function, use a factor of k less of auxiliary variables asymp-
totically compared to Ishikawa’s method.
Pros
• Due to the recombinations of terms during its iterative reduction process, the
resulting number of auxiliary variables is less than Ishikawa’s method especially
for PBFs with many positive monomials and many terms (the difficult case).
• The higher the degree of the PBF is, the more advantageous the CCG-based
quadratization method is. It works particularly well for problems in real life such
as planning problem that requires a high-degree PBF formulation.
• Due to the nature of recombinations of terms during each iteration, the number
of quadratic terms in the finalized quadratic PBFs is less than Ishikawa’s method.
Cons
• Can lead to more auxiliary variables for sparse PBFs (the number of monomials
is much less than the maximum number the PBFs can have) and PBFs with low
degree.
Example
Each degree-d monomial in the PBF f (⃗
x) in one reduction step is substituted as:
d−1
ax1 . . . xd = min [axa + LxL + J ∑ (1 − xi )(1 − xa )
xa ,xL
i=1
(72)
+ J(1 − xd )(1 − xL ) + J(1 − xL )(1 − xa )]
where J ≥ L > a > 0. xa and xL are the two auxiliary variables introduced for positive
monomial and xa′ is the auxiliary variable introduced for negative monomial.
Bibliography
• 2019, original paper: [21].
• 2019, (Theorem 1): [21], is inspired by 2008, (Theorem 3): [22].
22
Summary
This method can be used to re-write positive or negative cubic terms in terms of 6
quadratic terms. The identity is given by:
Cost
• 1 auxiliary variable per positive or negative cubic term.
Pros
• Works on positive or negative monomials.
• Reproduces the full spectrum.
Cons
• Introduces all 6 possible non-submodular quadratic terms.
Alternate Names
• "Reduction by Minimum Selection" [12, 15].
Bibliography
• Original paper: [10].
23
Summary
The formula is almost the same as in the version of PTR-KZ on the previous page
(which is written in terms of b), but with a factor of 2, a change of sign for the linear
terms, and a slight change in the constant term. This formula can be obtained directly
from the PTR-KZ quadratization formula in terms of b, by starting with 8b1 b2 b3 on the
left-side, making the substitution bi → (1 + zi )/2, and removing all terms that appear on
both sides of the equation. The result is:
Cost
• 1 auxiliary variable per positive or negative cubic term.
Pros
• Works on positive or negative monomials.
Summary
Similar to other methods of reducing one term, this method can reduce a positive cu-
bic monomial into quadratic terms using only one auxiliary variable, while introducing
fewer non-submodular terms than the symmetric version.
The identity is given by:
b1 b2 b3 → ba − b2 ba − b3 ba + b1 ba + b2 b3 (76)
→ ba − b1 ba − b3 ba + b2 ba + b1 b3 (77)
→ ba − b1 ba − b2 b a + b 3 ba + b1 b2 . (78)
(79)
Cost
1 auxiliary variable per positive cubic term.
Pros
• Works on positive monomials.
• Fewer non-submodular terms than Ishikawa Reduction.
Cons
• Only been shown to work for cubics.
Example
b1 b2 b3 + b1 b3 − b2 → (ba − b1 ba − b3 ba + b2 ba + 2b1 b3 ) − b2 (80)
Bibliography
• Original paper and application to computer vision: [15].
25
Summary
Using a ternary variable tq ∈ −1, 0, 1 we have:
z1 z2 z3 → (1 + 4ta + z1 + z2 + z3 )2 − 1. (81)
Cost
• 1 auxiliary ternary variable
Pros
• One of the only methods designed specifically for z variables.
• Symmetric with respect to all variables.
Cons
• The auxiliary variable required is ternary (a qutrit).
• Requires all possile quadratic terms and they are all non-submodular.
• Only reproduces the ground state manifold.
Example
Alternate Forms
4 4 4
−z1 z2 z3 z4 → 16 t2a + 4 ta ∑ zi + 2 ∑ ∑ zi zj + 4 (83)
i=1 i=1 j>i
Bibliography
• Original paper where they introduce this gadget for the end points in the LHZ
lattice: [16].
26
Summary
Extra binary or ternary variable is added to ensure that the energy of the even parity
sector is zero and the energy of the odd sector is higher:
4 4 4
z1 z2 z3 z4 → 16 t2a + 4 ta ∑ zi + 2 ∑ ∑ zi zj + 4. (84)
i=1 i=1 j>i
Cost
• 1 auxiliary ternary variable
Pros
• One of the only methods designed specifically for z variables.
Cons
• Only reproduces the ground state manifold, not higher excited states.
• Requires all possile quadratic terms and they are all non-submodular.
Bibliography
• Original paper: [16].
27
Summary
Auxilliary qubits can be made to “count” the number of logical qubits in the 1
configuration. By applying single qubit terms to the auxilliary qubits, the spectrum of
any permutation symmetric objective function can be reproduced.
Cost
• For a k local coupler requires k auxilliary qubits.
Pros
• Natural flux qubit implementation [26].
• Single gadget can reproduce any permutation symmetric spectrum.
• High degree of symmetry means this method is natural for some kinds of quan-
tum simulations [27].
Cons
• Requires coupling between all logical bits and from all logical bits to all auxilliary
bits.
• Requires single body terms of increasing strength as k is increased.
Example
A 4 qubit gadget guarantees that the number of auxillary bits in the −1 state is equal
to the number of logical bits in the 1 state
4 i−1 4 4 4 4
H4−count = 4 ∑ ∑ bi bj + 4 ∑ ∑ bi baj − 15 ∑ bi − 8 ∑ bai + (5 ba1 + ba2 − 3 ba3 − 7 ba4 ) + 26 (86)
i=2 j=1 i=1 j=1 i=1 i=1
Bibliography
• Paper on flux qubit implementation: [28]
• Paper on MAX-k-SAT mapping: [28] (published in a journal earlier than [26] but
put on arXiv 1 month later).
• Talk including use in quantum simulation: [27]
28
Summary
For any variable b, we can consider the negation b̄ = 1 − b. The process of exchanging
b for b̄ is called flipping. Using bit-flipping, an arbitrary function in n variables can be
represented using at most 2(n−2) (n − 3) + 1 variables, though this is a gross overestimate.
Can be used in many different ways:
1. Flipping positive terms and using II A, recursively;
2. For α < 0, we can reduce αb̄1 b̄2 ...b̄k very efficiently to submodular form using II A.
A generalized version exists for arbitrary combinations of flips in the monomial
which makes reduction entirely submodular [12];
3. When we have quadratized we can minimize the number of non-submodular
terms by flipping.
4. We can make use of both bi and b̄i in the same objective function by adding on a
sufficiently large penalty term: λ(bi + b̄i − 1)2 = λ(1 + 2bi b̄i − bi − b̄i ). This is similar
to the ideas in reduction by substitution or deduc-reduc. In this way, given a
quadratic in n variables we can make sure it only has at most n nonsubmodular
terms if we are willing to use the extra n negation variables as well (so we have
2n variables in total).
Cost
• None, as replacing bi with it’s negation b̄i costs nothing except a trivial symbolic
expansion.
Pros
• Cheap and effective way of improving submodularity.
• Can be used to combine terms in clever ways, making other methods more
efficient.
Cons
• Unless the form of the objective function is known, spotting these ’factorizations’
using negations is difficult.
• We need an auxiliary variable for each bi for which we also want to use b¯i in the
same objective function.
Example
By bit-flipping b2 and b4 , i.e. substituting b2 = 1 − b̄2 and b4 = 1 − b̄4 , we see that:
H = 3b1 b2 + b2 b3 + 2b1 b4 − 4b2 b4 (91)
= −3b1 b̄2 − b̄2 b3 − 2b1 b̄4 − b̄2 b̄4 + 5b1 + b3 + 4b̄2 + 4b̄4 − 4. (92)
The first expression is highly non-submodular while the second is entirely submodular.
Bibliography
• Original paper: [12].
29
A symmetric function is one where if we switch any of the variable names (for
example b1 → b5 → b8 → b1 ), the function’s output is unaffected.
Summary
Any n-variable symmetric function f (b1 , b2 , . . . bn ) ≡ f (b) can be quadratized with
n − 2 auxiliaries:
1
f (b) → −α0 − α0 ∑ bi + a2 ∑ bi bj + 2 ∑ (αi − c) bai (2i − − ∑ bj ) (93)
i ij i 2 j
min (α2j ) , i ∈ even
c={ (94)
min (α2j−1 ) , i ∈ odd
a2 = Determined from Page 12 of [29] (95)
Cost
• n − 2 auxiliaries for any n-variable symmetric function.
• n2 non-submodular quadratic terms (of the non-auxiliary variables).
• n − 2 non-submodular linear terms (of the auxiliary variables).
Pros
• Quadratization is symmetric in all non-auxiliary variables (this is not always true,
for example some of the methods in the NTR section).
• Reproduces the full spectrum.
• When there’s a large number of terms, there’s fewer auxiliary variables than
quadratizing each positive monomial separately.
Cons
• Only works on a specific class of functions, although the quadratizations of
arbitrary functions can be related to the quadratizations of symmetric functions
on a larger number of variables.
• All quadratic terms of the non-auxiliary variables are non-sub-modular.
Summary
Any n-variable symmetric function f (b1 , b2 , . . . bn ) ≡ f (b) that is non-zero only when
∑ i = c where n/2 ≤ c ≤ n, can be quadratized with m = ⌈log2 c⌉ + 1 auxiliary variables:
b
where:
⎛α αbb ⎞ ⎛ (c + 1)2 1 ⎞
⎜ αb α1 ⎟
bba
⎜ −2(c + 1) −2 i
⎟
⎜ ba ⎟ ⎜ m−1 ) ⎟
⎜α α2bba ⎟ = ⎜ (c + i (1 + ⎟.
⎜ 1 ⎟ ⎜ 1)2 2 2
⎟
(97)
⎜ α ba α1ba ba ⎟ ⎜(1 + m−1 ) (2m−1 − 2c − 1) ⎟
⎜ 2 ⎟ 2 2i+j−1
⎝ ⋅ α2ba ba ⎠ ⎝ ⋅ − (1 + 2 ) 2 ⎠
m−1 i
Cost
• m = ⌈log2 c⌉ + 1 auxiliary variables.
Bibliography
• 2018, original paper (Theorem 1): [19].
31
Summary
Any n-variable symmetric function f (b1 , b2 , . . . bn ) ≡ f (b) that is non-zero only when
∑ i = c where 0 ≤ c ≤ n/2, can be quadratized with m = ⌈log2 (n − c)⌉ + 1 auxiliary
b
variables:
where:
⎛α αbb ⎞ ⎛ (c − 1)2 1 ⎞
⎜ αb bba
α1 ⎟ ⎜ ⎟ −2(c − 1) +2i ⎟
⎜ ba ⎜ m−1 ) ⎟
⎜α α2bba ⎟ = ⎜ (1 − i −2 (1 + ⎟.
⎜ 1 ⎟ ⎜ c)2 2
⎟
(102)
⎜αba ba ba ⎟ ⎜ (1 + 2 ) (2 + 2c − 1) ⎟
⎜ 2 α1 ⎟ m−1 m−1 2i+j−1
⎝ ⋅ ba ba ⎠
α2 ⎝ ⋅ − (1 + 2 ) 2 ⎠
m−1 i
Cost
• m = ⌈log2 (n − c)⌉ + 1 auxiliary variables.
• n2 + m2 non-submodular quadratic terms (all possible quadratic terms involving
only non-auxiliary or only auxiliary variables).
• m non-submodular linear terms (all possible linear terms involving auxiliaries).
Pros
• Small number of auxiliary terms
Cons
• Only works for a special class of functions
• Introduces many linear and even more quadratic non-submodular terms.
Example
For n = 4 and c = 2, we have m = 2, and
(b1 b2 + b1 b3 + b1 b4 + b2 b3 + b2 b4 + b3 b4 ) − 3(b1 b2 b3 + b1 b2 b4 + b1 b3 b4 + b2 b3 b4 ) + 6b1 b2 b3 b4(103)
→ (1 − b1 − b2 − b3 − b4 − ba1 + 3ba2 )
2
(104)
using the alternate form below.
Alternate Forms
m−1 2
f (b1 , b2 , . . . , bn ) → ((c − 1) − ∑ bi − ∑ 2 i−1
bai + (1 + 2
m−1
) bam ) (105)
i i
Bibliography
• 2018, original paper (Theorem 1): [19].
32
Summary
Any n-variable symmetric function f (b1 , b2 , . . . bn ) ≡ f (b) that is non-zero only when
∑ i = c where n/2 ≤ c ≤ n, can be quadratized with m = ⌈log2 c⌉ auxiliary variables f →:
b
where:
⎛α 2 (c + 3c + 2)
αbb ⎞ ⎛ 1 2 1
2 ⎞
⎜ αb α1 ⎟
bba
⎜ −c − 2
3
−2i ⎟
⎜ ba ⎟ ⎜ m) ⎟
⎜α α2bba ⎟ = ⎜ (3 + i−1 (1 + ⎟.
⎜ 1 ⎟ ⎜ c)2 2
⎟
(107)
⎜αba α1ba ba ⎟ ⎜(1 + m ) (2m−1 − c − 1) ⎟
⎜ 2 ⎟ 2 2i+j−1
⎝ ⋅ α2ba ba ⎠ ⎝ ⋅ − (1 + 2 ) 2 ⎠
m i
Cost
• m = ⌈log2 c⌉ auxiliary variables.
Bibliography
• 2018, original paper (Theorem 2): [19].
33
Summary
Any n-variable symmetric function f (b1 , b2 , . . . bn ) ≡ f (b) that is non-zero only when
∑ i = c where 0 ≤ c ≤ n/2, can be quadratized with m = ⌈log2 (n − c)⌉ auxiliary variables
b
f →:
where:
⎛α 2 (c − 3c + 2)
αbb ⎞ ⎛ 1 2 1
2 ⎞
⎜ αb α1 ⎟
bba
⎜ −c + 3
+2 i
⎟
⎜ ba ⎟ ⎜ 2
m) ⎟
⎜α α2bba ⎟ = ⎜ (3 − i−1 − (1 + ⎟.
⎜ 1 ⎟ ⎜ c)2 2
⎟
(112)
⎜αba ba ba ⎟ ⎜(1 + 2 ) (2 + c − 1) ⎟
⎜ 2 α1 ⎟ m m−1 2i+j−1
⎝ ⋅ ba ba ⎠
α2 ⎝ ⋅ − (1 + 2 ) 2 ⎠
m i
Cost
• m = ⌈log2 (n − c)⌉ auxiliary variables.
• n2 + m2 non-submodular quadratic terms (all possible quadratic terms involving
only non-auxiliary or only auxiliary variables).
• m non-submodular linear terms (all possible linear terms involving auxiliaries).
Pros
• Small number of auxiliary terms
Cons
• Only works for a special class of functions
• Introduces many linear and even more quadratic non-submodular terms.
Example
For n = 4 and c = 2, we have m = 1, and
(b1 b2 + b1 b3 + b1 b4 + b2 b3 + b2 b4 + b3 b4 ) − 3(b1 b2 b3 + b1 b2 b4 + b1 b3 b4 + b2 b3 b4 ) + 6b1 b2 b3 b4 (113)
1 − b1 − b2 − b3 − b4 + 3ba1
→( ) (114)
2
using the alternate form below.
Alternate Forms
(c − 1) − ∑i bi − ∑m−1 2i bai + (1 + 2m ) bam
f (b1 , b2 , . . . , bn ) → ( i
) (115)
2
Bibliography
• 2018, original paper (Theorem 2): [19].
34
Summary
For an n-variable symmetric function that is a function of the sum √ of all variables
f (b1 , b2 , . . . bn ) = f (∑ bi ), for some large value of λ > max(f ), and ⌈ n + 1⌉:
2 2
m ⎛ m m
f (∑ bi ) → ∑ f ((i − 1) (m + 1) + (j − 1)) bai bac+j + λ (1 − ∑ bai ) + (1 − ∑ bac+i ) +
ij ⎝ i i
2 2
m m m m ⎞
(∑ bi − ((m + 1) ∑(i − 1)yai + ∑(i − 1)bac+i )) + (∑ bi − ((m + 1) ∑(i − 1)yai + ∑(i − 1)bac+i ))
i i i i i i ⎠
(116)
where:
⎛ α αbb ⎞ ⎛ (c + 1)2 1 ⎞
⎜ α α
b bb a,1
⎟ ⎜ −2(c + 1) −2i ⎟
⎜ ba,1 bba,2 ⎟ = ⎜ ⎟. (117)
⎜α α ⎟ ⎜ 2(c + 1) −2 (2m − 2m−1 + 1)⎟
⎝αba,2 αba ba ⎠ ⎝2(c + 1) (2c − 2m−1 + 1) 2i+j−2 ⎠
Cost √
• m = ⌈ n + 1⌉ auxiliary variables.
• n2 + m2 non-submodular quadratic terms (all possible quadratic terms involving
only non-auxiliary or only auxiliary variables).
• m non-submodular linear terms (all possible linear terms involving auxiliaries).
Pros
• Small number of auxiliary terms
Cons
• Only works for a special class of functions
• Introduces many linear and even more quadratic non-submodular terms.
Example
By bit-flipping b2 and b4 , i.e. substituting b2 = 1 − b̄2 and b4 = 1 − b̄4 , we see that:
H = 3b1 b2 + b2 b3 + 2b1 b4 − 4b2 b4 (118)
= −3b1 b̄2 − b̄2 b3 − 2b1 b̄4 − b̄2 b̄4 + 5b1 + b3 + 4b̄2 + 4b̄4 − 4. (119)
The first expression is highly non-submodular while the second is entirely submodular.
Alternate Forms
1
((c − 1) − ∑i bi − (2(n − c) − 2m−1 + 1) bam − ∑m−1 2i−1 bai )
f (b1 , b2 , . . . , bn ) → (
2 i
) (120)
2
Bibliography
• 2018, original paper (Theorem 9): [19] (contains a typo which was corrected in
Theorem 6 of [20].
35
Summary
For an n-variable symmetric function that is a function of a weighted sum of
all variables f (b1 , b2 , . . . bn ) = f (∑ wi bi ), for some large value of λ > max(f ), and
max (f (∑ wi bi )) < (m + 1)2 :
⎛
m m m 2
f (∑ wi bi ) → ∑ αij bai bam+i + λ 1 + (∑ wi bi − (m − 1) ∑ bai + ∑ bac+i )
ij ⎝ i i i
(121)
m−1 m−1
+ ∑ (1 − bai ) bai+1 + ∑ (1 − bai+m ) bai+m+1 )
i i
where:
α β
∑ ∑ αij = f (α(m + 1) + β) (122)
i j
Cost √
• 2m auxiliary variables, where m > max (f (∑ wi bi )) − 1.
• n2 + m2 non-submodular quadratic terms (all possible quadratic terms involving
only non-auxiliary or only auxiliary variables).
• m non-submodular linear terms (all possible linear terms involving auxiliaries).
Pros
• Small number of auxiliary terms
Cons
• Only works for a special class of functions
The first expression is highly non-submodular while the second is entirely submodular.
Alternate Forms
1
((c − 1) − ∑i bi − (2(n − c) − 2m−1 + 1) bam − ∑m−1 2i−1 bai )
f (b1 , b2 , . . . , bn ) → ( 2 i
) (125)
2
Bibliography
• 2018, original paper (Theorem 10): [19].
36
Summary
For any n-variable, k-local function that is non-zero only if ∑ bi = 2m − 1, we call it
the "partity function" and it can be quadratized as follows:
n−1
f (b1 , b2 , . . . , bn ) → ∑ bi + 2 ∑ bi bj + 4 ∑ bai (2i − 1 − ∑ bj ) . (126)
i ij 2i−1 j
Cost
• m = 2⌊n/2⌋ auxiliary variables.
• ⌊1.5n⌋ non-submodular linear terms.
• n2 non-submodular quadratic terms.
Pros
• Smaller number of auxiliary variables than the most naive methods.
Cons
• Only works for a special class of functions
• Introduces many linear and even more quadratic non-submodular terms (every-
thing is non-submodular except for 0.5n2 quadratic terms involving the auxiliaries
with the non-auxiliaries).
• non-submodular terms can be rather large compared to the submodular terms
(about 4n times as big).
Example
By bit-flipping b2 and b4 , i.e. substituting b2 = 1 − b̄2 and b4 = 1 − b̄4 , we see that:
H = 3b1 b2 + b2 b3 + 2b1 b4 − 4b2 b4 (127)
= −3b1 b̄2 − b̄2 b3 − 2b1 b̄4 − b̄2 b̄4 + 5b1 + b3 + 4b̄2 + 4b̄4 − 4. (128)
The first expression is highly non-submodular while the second is entirely submodular.
Bibliography
• 2014, original paper (Theorem 4.6): [13, 14].
37
Summary
The complement of the parity function can be quadratized as follow:
n−1 n
f (b1 , b2 , . . . , bn ) → 1 + 2 ∑ bi bj − ∑ bi + 4 ∑ bai (i − ∑ bj ) (129)
ij i 2i j
Cost
2 ⌋ auxiliary variables.
• m = 2⌊ n−1
• ⌊0.5n⌋ non-submodular linear terms.
• n2 non-submodular quadratic terms.
Pros
• Smaller number of auxiliary variables than the most naive methods.
• Fewer non-submodular linear terms than in the analogous quadratization for its
complement (the parity function).
Cons
• Only works for a special class of functions
• Introduces many linear and even more quadratic non-submodular terms (every-
thing is non-submodular except for 0.5n2 quadratic terms involving the auxiliaries
with the non-auxiliaries, and all n linear terms involving only the non-auxiliaries).
Summary √
For a symmetric function such that f (∣c∣) = 0 for c > n, then with with 2⌈ n + 1⌉
auxiliary variables, we have:
n−1 n
f (b1 , b2 , . . . , bn ) → 1 + 2 ∑ bi bj − ∑ bi + 4 ∑ bai (i − ∑ bj ) (132)
ij i 2i j
Cost √
• m = ⌈ n + 1⌉ auxiliary variables.
• ⌊0.5n⌋ non-submodular linear terms.
• Introduces many linear and even more quadratic non-submodular terms (every-
thing is non-submodular except for 0.5n2 quadratic terms involving the auxiliaries
with the non-auxiliaries, and all n linear terms involving only the non-auxiliaries).
• non-submodular terms can be rather large compared to the submodular terms
(about 4n times as big).
Example
By bit-flipping b2 and b4 , i.e. substituting b2 = 1 − b̄2 and b4 = 1 − b̄4 , we see that:
H = 3b1 b2 + b2 b3 + 2b1 b4 − 4b2 b4 (133)
= −3b1 b̄2 − b̄2 b3 − 2b1 b̄4 − b̄2 b̄4 + 5b1 + b3 + 4b̄2 + 4b̄4 − 4. (134)
The first expression is highly non-submodular while the second is entirely submodular.
Bibliography
• 2018, original paper (Theorem 6): [20].
39
Summary
For a symmetric function such that f (∣c∣) = 0 for c > n, then with with max (⌈log(c)⌉, ⌈log(n − c)⌉)
auxiliary variables, we have:
n−1 n
f (b1 , b2 , . . . , bn ) → 1 + 2 ∑ bi bj − ∑ bi + 4 ∑ bai (i − ∑ bj ) (135)
ij i 2i j
Cost
• max (⌈log(c)⌉, ⌈log(n − c)⌉) auxiliary variables.
• ⌊0.5n⌋ non-submodular linear terms.
• n2 non-submodular quadratic terms.
Pros
• Smaller number of auxiliary variables than the most naive methods.
• Fewer non-submodular linear terms than in the analogous quadratization for its
complement (the parity function).
Cons
• Only works for a special class of functions
• Introduces many linear and even more quadratic non-submodular terms (every-
thing is non-submodular except for 0.5n2 quadratic terms involving the auxiliaries
with the non-auxiliaries, and all n linear terms involving only the non-auxiliaries).
Summary
For a symmetric function such that f (∣c∣) = 0 for c > n, then with with max (⌈log(c)⌉, ⌈log(n − c)⌉)
auxiliary variables, we have:
n−1 n
f (b1 , b2 , . . . , bn ) → 1 + 2 ∑ bi bj − ∑ bi + 4 ∑ bai (i − ∑ bj ) (138)
ij i 2i j
Cost
• max (⌈log(c)⌉, ⌈log(n − c)⌉) auxiliary variables.
• ⌊0.5n⌋ non-submodular linear terms.
• n2 non-submodular quadratic terms.
Pros
• Smaller number of auxiliary variables than the most naive methods.
• Fewer non-submodular linear terms than in the analogous quadratization for its
complement (the parity function).
Cons
• Only works for a special class of functions
• Introduces many linear and even more quadratic non-submodular terms (every-
thing is non-submodular except for 0.5n2 quadratic terms involving the auxiliaries
with the non-auxiliaries, and all n linear terms involving only the non-auxiliaries).
Summary
The parity function for even n can be quadratized with:
n−1 n
f (b1 , b2 , . . . , bn ) → 1 + 2 ∑ bi bj − ∑ bi + 4 ∑ bai (i − ∑ bj ) (141)
ij i 2i j
Cost
• ⌈log(n)⌉ − 1 auxiliary variables.
• ⌊0.5n⌋ non-submodular linear terms.
• n2 non-submodular quadratic terms.
Pros
• Smaller number of auxiliary variables than the most naive methods.
• Fewer non-submodular linear terms than in the analogous quadratization for its
complement (the parity function).
Cons
• Only works for a special class of functions
• Introduces many linear and even more quadratic non-submodular terms (every-
thing is non-submodular except for 0.5n2 quadratic terms involving the auxiliaries
with the non-auxiliaries, and all n linear terms involving only the non-auxiliaries).
Summary
The parity function for odd n can be quadratized with:
n−1 n
f (b1 , b2 , . . . , bn ) → 1 + 2 ∑ bi bj − ∑ bi + 4 ∑ bai (i − ∑ bj ) (144)
ij i 2i j
Cost
• ⌈log(n)⌉ − 1 auxiliary variables.
• ⌊0.5n⌋ non-submodular linear terms.
• n2 non-submodular quadratic terms.
Pros
• Smaller number of auxiliary variables than the most naive methods.
• Fewer non-submodular linear terms than in the analogous quadratization for its
complement (the parity function).
Cons
• Only works for a special class of functions
• Introduces many linear and even more quadratic non-submodular terms (every-
thing is non-submodular except for 0.5n2 quadratic terms involving the auxiliaries
with the non-auxiliaries, and all n linear terms involving only the non-auxiliaries).
• There exist ZUC (zero until c) functions such that every quadratization must in-
volve at least Ω (2n/2 ) auxiliary variables, no matter what the value of c (Theorem
2 from [20]). This is true for almost all ZUC functions because the set of ZUCs
requiring fewer auxiliary variables has Lebesgue measure zero.
• For any c ≥ 0, the number of auxiliary variables is m ≥ ⌈log(c)⌉ − 1 (Theorem 3 of
[20]).
β(q1 )
• The number of auxiliary variables m is such that 2m+1 ≥ 2 − d + 1 (Theorem 12
from [20]).
• For any c ≥ 0, the number of auxiliary variables is m ≥ ⌈log(c)⌉ − 1 (Theorem 3 of
[20]).
45
Summary
Pick a variable pair (bi , bj ) and substitute bi bj with a new auxiliary variable baij .
Enforce equality in the ground states by adding some scalar multiple of the penalty
P = bi bj − 2bi baij − 2bj baij + 3baij or similar. Since P > 0 if and only if baij ≠ bi bj , the
minimum of the new (k − 1)-local function will satisfy bai j = bi bj ), which means that at
the minimum, we have precisely the original function. Repeat (k − 2) times for each
k-local term and the resulting function will be 2-local. For an arbitrary cubic term we
have:
Cost
• 1 auxiliary variable per reduction.
• At most k t auxiliary variables for a k-local objective function of t terms, but
usually fewer.
Pros
• Variable can be used across the entire objective function, reducing many terms at
once.
• Very easy to implement.
• Reproduces not only the ground state, but the full spectrum.
Cons
• Inefficient for single terms as it introduces many auxiliary variables compared to
Ishikawa reduction, for example.
• Introduces quadratic terms with large positive coefficients, making them highly
non-submodular.
• Determining optimal substitutions can be expensive.
Example
We pick the pair (b1 , b2 ) and combine.
Bibliography
• Original paper: [30]
• Re-discovered in the context of diagonal quantum Hamiltonians: [31].
• Used in: [32, 33].
46
Summary
We consider a set C of variables which can occur in multiple terms throughout the
objective function. Each application ‘rips out’ this common component from each term
[18, 34].
∑ αH ∏ bj → ∑ αH (1 − ∏ bj − ∏ bj ) ba (149)
H j∈H H j∈C j∈H∖C
Cost
• One auxiliary variable per application.
−ba1 b2 b3 − ba1 b2 b4 ↦ 2ba2 − ba1 ba2 − ba2 b2 − ba2 b3 + 2ba2 − ba1 ba2 − ba2 b2 − ba2 b4 (152)
= 4ba2 − 2ba1 ba2 − 2ba2 b2 − ba2 b3 − ba2 b4 . (153)
Bibliography
• Original paper and application to image denoising: [34].
47
Summary
We consider a set C of variables which can occur in multiple terms throughout the
objective function. Each application ‘rips out’ this common component from each term
[18, 34]:
∑ αH ∏ bj → ∑ αH ba ∏ bj + ∑ αH (1 − ba ) ∏ bj . (154)
H j∈H H j∈C H j∈H∖C
Cost
• One auxiliary variable per application.
−ba1 b2 b3 − ba1 b2 b4 ↦ 2ba2 − ba1 ba2 − ba2 b2 − ba2 b3 + 2ba2 − ba1 ba2 − ba2 b2 − ba2 b4 (157)
= 4ba2 − 2ba1 ba2 − 2ba2 b2 − ba2 b3 − ba2 b4 . (158)
Bibliography
• Original paper and application to image denoising: [34].
48
Summary
Here we consider a set C of variables which occur in multiple monomials throughout
the objective function. Each application ’rips out’ this common component from each
term [34][18].
Let H be a set of monomials, where C ⊆ H for each H ∈ H and each monomial H
has a weight αH . The algorithm comes in 2 parts: when all αH > 0 and when all αH < 0.
Combining the 2 gives the final method:
1. αH > 0
∑ αH ∏ bj = min ( ∑ αH ) ba ∏ bj + ∑ αH (1 − ba ) ∏ bj (159)
H∈H j∈H ba H∈H j∈C H∈H j∈H∖C
2. αH < 0
∑ αH ∏ bj = min ∑ αH (1 − ∏ bj − ∏ bj ) ba (160)
H∈H j∈H ba H∈H j∈C j∈H∖C
Cost
• One auxiliary variable per application.
• αH < 0 method only works for ∣C∣ > 1, and cannot quadratize cubic terms.
Example
First let C = b1 and use the positive weight version:
−ba1 b2 b3 − ba1 b2 b4 ↦ 2ba2 − ba1 ba2 − ba2 b2 − ba2 b3 + 2ba2 − ba1 ba2 − ba2 b2 − ba2 b4 (163)
= 4ba2 − 2ba1 ba2 − 2ba2 b2 − ba2 b3 − ba2 b4 . (164)
Bibliography
• Theorem 4 of: [9].
49
Summary
This method uses gadgets to produce separate 3-SAT clauses which allow variables
which ‘flag’ the state of pairs of other variables.
Cost
• 24 auxiliary variables to quadratize z1 z1 z3 .
Pros
• Very general and therefore conducive to proofs.
Cons
• Extremely inefficient in terms of number of auxiliary variables.
Example
To create a system which maps b1 b2 b3 , we use the following gadget (note that this is
given in terms of z in the orginal work and translated to b here):
3 3 3 3
23
H1 (b1 , b2 , b3 ) = 2 ∑ bi bai + 2 ∑ bai baj − 4 ∑ bai − 2 ∑ bi + . (165)
i=1 i<j i=1 i=1 2
Implementing αH1 , creates a situation where b3 is a ‘flag’ for b1 and b2 in other words
b3 is constrained to be 1 in the low energy manifold if b1 = 0 and b2 = 0. It follows from
the universality of 3 − SAT that these ‘flag’ clauses can be combined to map any spin
Hamiltonian. To do this, we also need anti-ferromagnetic couplings to express the
‘negated’ variable, to do this, we define,
3
⎛ 3 2
z1 z2 z3 →α ∑ H2 (bi , b¬i ) + ∑ H2 (bai , b¬ai ) + H2 (ba4,1 , ba4,2 ) + H1 (ba1,1 , ba1,2 , b¬a1 ) + H1 (ba1 , ba1,3 , ba4,1 ) + (167)
⎝i=1 i=1
H1 (b1 , b2 , b¬a2 ) + H1 (ba2 , b3 , ba4,2 + H1 (b1 , b2 , b¬a3 ) + H1 (ba3 , ba1,3 , ba4,1 ) + H1 (ba1,1 , ba1,2 , b¬a4 ) + (168)
H1 (ba4 , b3 , ba4,2 ) + H1 (b1 , ba1,2 , b¬a5 ) + H1 (ba5 , b3 , ba4,1 ) + H1 (ba1,1 , b2 , b¬a6 ) + H1 (ba6 , ba1,3 , ba4,2 ) + (169)
H1 (b1 , b2 , b¬a7 ) + H1 (ba7 , ba1,3 , ba4,1 ) + H1 (ba1,1 , ba1,2 , b¬a8 ) + H1 (ba8 , b3 , ba4,2 )) − 2 ba4,1 + 1. (170)
Each of the next four lines assigns a value to the flag variable ba4,1 for a state and
ba4,2 , for instance the leftmost two terms of the second line enforce that ba4,1 = 0 if
(b1 , b2 , b3 ) = (0, 0, 0), while the right two terms enforce that ba4,1 = 1 if (b1 , b2 , b3 ) = (1, 1, 1).
Because there are 23 = 8 possible bitstrings for (b1 , b2 , b3 ), and each term to enforce a flag
state requires two instances of H1 (and two auxilliary variables), a total of 16 instances
are required as well as 16 auxilliary variables.
Bibliography
• Paper showing the universality of the Ising spin models: [35].
50
F. Lower bounds for arbitrary functions (Anthony, Boros, Crama, Gruber, 2015)
Summary
Split a k-local monomial with odd k into a (k − 1)-local term (with even degree) and
a new odd k-local term which has negative coefficient:
k−1 k−1
b1 b2 ⋯bk → ∏ bi − ∏ bi (1 − bk ) (171)
i=1 i=1
We can use any of the PTR methods for even k on the first term, and we can use any of
the NTR methods on the second term. Can be generalized to split into different-degree
factors when seeking an "optimum" quadratization. Can be generalized into more
splits.
Cost
• Depends on the methods used for the PTR and NTR procedures.
Pros
• Very flexible.
Cons
• First turns one term into two terms, so might not be preferred when we wish to
minimize the number of terms.
Bibliography
• Original paper: [19].
52
Summary
Split a any function f into a symmetric part and anti-symmetric part:
Part II
Hamiltonians quadratic in z and linear in x (Transverse Field Ising Hamiltonians)
The Ising Hamiltonian with a transverse field in the x direction is possible to implement
in hardware:
C. ZZZ-TI-CBBK: Transvese Ising from ZZZ, by Cao, Babbush, Biamonte, and Kais (2015)
There is only one reduction in the literature for reducing a Hamiltonian term to the
transverse Ising Hamiltonian, and it works on 3-local zzz terms, by introducing an
auxiliary qubit with label a:
(z)
αjz = αi
(z)
αkz = αi
2/5
αaz = 21 (∆− ( α6 ) ∆3/5 )
1/5
αax = ( α∆
6 )
4
3/5 1/5
= − 21 (( 7α ( α ) ∆2/5 ) + ( α∆
6 + 6 6 ) )
4
zz
αia
zz (zz)
αja = αia
zz (zz)
αka = αja
1/5
1 − za α∆4 /5
1
α∆4
αzi zj zk → (∆ + (zi + zj + zk )) ( )+ xa (177)
6 2 6
α /5 3 α /5 2 1 + za
2 3
7α
+ (( ) ∆ /5 − ( + ( ) ∆ /5 ) (zi + zj + zk )) ( ) (178)
6 6 6 2
The low-lying spectrum (eigenvalues and eigenvectors) of the right side of Eq. (176)
will match those of the left side to within a spectral error of as long as ∆ = O (−5 ).
Cost
• 1 auxiliary qubit
• 8 auxiliary terms not proportional to 11.
54
Part III
General Quantum Hamiltonians
Summary
For the 8-body Hamiltonian:
we define auxiliary qubits labeled by aijk , two auxiliaries for each pair ij: labeled aij1
and aij2 . Then the 8-body Hamiltonian has the same low-lying eigenspace as the 4-body
Hamiltonian:
H4-body = − ∑ α (zij1 zij2 zij3 zij4 + zi,j,−1,4 zij1 + zij2 zi,j−1,3 + zij4 zi+1,j,1 + zij3 zi,j+1,2
ij
H2-body = − ∑ (α (szz zz zz zz zz zz zz zz zz zz
ij1ij2 sij3ij4 + sij−1,4ij1 + sij2ij−1,3 + sij4i+1j1 + sij3ij+1,2 + saij 1 saij+1 2 − saij 3 sai+1j 3 )
ij
(190)
U t xz xx
+ szz
a ,1 + (saij 1 sij3ij4 + sxy xx xz xx xy xx
aij 1 sij+1,2ij+1,4 + saij 2 si+1,j+1,1i+1,j+1,2 + saij 2 si+1j1,i+1,j3 )) .
2 ij 2
Summary
Determine the k-local term, Hk−local , whose degree we wish to reduce, and factor it
into two commuting factors: Hk′ −local H(k−k′ )−local , where k ′ can be as low as 0. Separate
all terms that are at most (k −1)-local into ones that commmute with one of these factors
(it does not matter which one, but without loss of generality we assume it to be the
(k − k ′ )-local one) and ones that anti-commute with it:
commuting
H<k−local + H<k−local
anti−commuting
+ αHk′ −local H(k−k′ )−local (191)
Introduce one auxiliary qubit labeled by a and the Hamiltonian:
4z5 − 3x1 + 2z1 y2 x5 + 9x1 x2 x3 x4 − x1 y2 z3 x5 → 9xa1 + 4za2 z5 − 3za3 x1 − za3 xa2 + 2xa3 x5
(193)
Bibliography
• Original paper, and description of the choices of terms and factors used for the
given example [38].
57
Summary
Similar to NP-Nagaj-1 but instead of using qutrits, we use two qubits for each qutrit,
according to:
1 1
∣0⟩ → ∣00⟩, ∣1⟩ → √ (∣01⟩ + ∣10⟩) , ∣2⟩ → √ (∣01⟩ − ∣10⟩) . (200)
2 2
which leads to the following transformations:
1
∣01⟩⟨10∣ij + h.c. → √ (∣01⟩⟨10∣ij1 + ∣01⟩⟨10∣ij2 ) + h.c. (201)
2
1
∣02⟩⟨10∣ij + h.c. → √ (∣01⟩⟨10∣ij1 − ∣01⟩⟨10∣ij2 ) + h.c. (202)
2
∣1⟩⟨2∣j + h.c. → zj1 − zj2 , (203)
H2−local = 1/2 (za5 − za6 + za9 − za10 − za3 za5 + za3 za6 − za3 za9 + za3 za10 +
za11 − za12 + za15 − za16 + za3 za11 − za3 za12 + za3 za15 − za3 za16 ) + x4 za7 − x4 za8 + za13 − za14 +
1/√2 (x λ
1 1,a5 + y1 λ2,a5 + x1 λ1,a6 + y1 λ2,a6 + xa1 λ1,a7 + ya1 λ2,a7 + xa1 λ1,a8 + y1 λ2,a8 +
xa2 λ1,a7 + ya2 λ2,a7 + xa2 λ1,a8 + ya2 λ2,a8 + xa2 λ1,a9 + ya2 λ2,a9 + xa2 λ1,a10 +
(204)
ya2 λ2,a10 + x1 λ1,a11 + y1 λ2,a11 + xa4 λ1,a15 + ya4 λ2,a15 + xa1 λ4,a5 + ya1 λ5,a5 +
xa2 λ4,a7 + ya2 λ5,a7 + x2 λ4,a9 + y2 λ5,a9 + xa3 λ4,a11 + ya3 λ5,a11 + xa4 λ4,a13 −
ya4 λ5,a13 + xa2 λ4,a15 + ya2 λ5,a15 − xa1 λ4,a6 − ya1 λ5,a6 − xa2 λ4,a8 − ya2 λ5,a8 − x2 λ4,a10 −
−y2 λ5,a10 − xa3 λ4,a12 − ya3 λ5,a12 − xa4 λ4,a14 − ya4 λ5,a14 − x2 λ4,a16 − y2 λ5,a16 ) ,
The first gadgets for arbitrary Hamiltonians acting on some number of qubits, were
designed to reproduce the spectrum of a 3-local Hamiltonian in the low-lying spectrum
of a 2-local Hamiltonian.
Summary
For any group of 3-local terms that can be factored into a product of three 1-local
factors, we can define three auxiliary qubits (regardless of the number of qubits we
have in total) labeled by ai and make the transformation:
3 2
1
α= (206)
8∆
1
α =
ss
(207)
6∆1/3
1
αsx = − 2/3 (208)
6∆
1
α =−
zz
(209)
24∆
The result will be a 2-local Hamiltonian whose low-lying spectrum is equivalent to the
spectrum of H3−local to within as long as ∆ = Θ (−3 ).
Cost
• 1 auxiliary qubit for each group of 3-local terms that can be factored into three
1-local factors.
• ∆ = Θ (−3 )
Pros
• Very few auxiliary qubits needed
Cons
• Will not work for Hamiltonians that do not factorize appropriately.
Bibliography
• Original paper: [42]
60
Summary
For any 3-local term (product of Pauli matrices si ) in the Hamiltonian, we can define
one auxiliary qubit labeled by a and make the transformation:
3
a ∏ si → α + αs s3 + αz za + αss (s1 + s2 ) + αsz s3 za + +αsx (s1 xa + s2 xa )
2
(210)
i
α αs αz − 1 a ( 4∆12/3 − 1) a ( 4∆12/3 − 1)
( ss sz sx ) = ( 2∆
1 a 1 ) (211)
α α α ∆1/3 4∆2/3 ∆2/3
The result will be a 2-local Hamiltonian whose low-lying spectrum is equivalent to the
spectrum of H3−local to within as long as ∆ = Θ (−3 ).
Cost
• 1 auxiliary qubit for each 3-local term.
• ∆ = Θ (−3 )
Example
Bibliography
• Original paper: [42]
61
3
∏ si → α + αiss ∑ s2i + αisx ∑ si xai + αzz ∑ zai zaj (214)
i i i ij
1
α=− (215)
8∆
1
αss = − 1/3 (216)
6∆
1
αsx = (217)
6∆2/3
1
αzz = (218)
24∆
The result will be a 2-local Hamiltonian whose low-lying spectrum is equivalent to the
spectrum of H3−local to within as long as ∆ = Θ (−3 ).
Cost
• 3 auxiliary qubits for each 3-local term.
• ∆ = Ω (−3 )
Example
zx xx xx xx xx yz yx
x1 z2 y3 − 3x1 x2 y4 + z1 x2 → α + α2a z x + α12 x1 x2 + α1a x xa11 + α1a x xa21 + α2a x xa22 + α3a y3 xa13 + α4a y4 xa23
12 2 a1 2 11 1 21 1 22 2 13 23
(219)
Bibliography
• Original paper on arXiv: [43]
Summary
For any 3-local term which is a product of 1-local matrices si , we can define one
auxiliary qubit labeled by a and make the transformation:
3
a ∏ si → α + α1s s21 + α2s s22 + α3s s3 + αaz za + α12
ss
s1 s2 + α13 s1 s3 + α23
ss 2
s2 s3 + α3a
ss 2 sz
s3 za + α1a
sx
s1 xa + α2a
sx
s2 xa
i
(220)
⎛ 2/32 1/3 −∆
∆ 1/3
ss ⎞
⎛ αs α12 ⎞ ⎜− α ∆
a 1 ⎟
⎟ ⎜ ⎟
ss
⎜ α1 α13 ⎜
2 2
⎜ αs ss ⎟ a2 ⎟
2/3 ∆1/3
⎟ ⎜ ⎟
α 1
⎜ 2 α23
⎜ s sz ⎟ =⎜ ⎟
2
⎜ α3 α3a ⎟ ⎜− 2
α1/3 ∆2/3 α1/3 ∆2/3 ⎟ . (221)
⎜ z ⎟ ⎜⎜ ⎟
∆2/3 ⎟
2
⎜ αa sx ⎟
α2a ⎜ − ∆2 − α1/3
√ ⎟
⎝N/A sx ⎠ ⎜ 2 ⎟
α2a ⎝ N/A
1/3
α √∆ 2/3
⎠
2
A k-local Hamiltonian with a 3-local term replaced by this 2-local Hamiltonian will
have an equivalent low-lying spectrum to within as long as ∆ = Ω (−3 ).
Cost
• 1 auxiliary qubit for each 3-local term.
• ∆ = Ω (−3 )
Example
Bibliography
• Original paper where α = 1: [45]. For arbitrary α see the 2005 v1 from arXiv, or
[46]. Connection to improved version: [47].
63
A 1B1 gadget allows k-local terms to be quadratized one step at a time, where at
each step the term’s order is reduced by at most one. In each step, a k-local term is
reduced to (k − 1)-local, contrary to SD (sub-division) gadgets which can reduce k-local
terms to (1/2)-local in one step.
Summary
We wish to reduce the k-local term:
k
Hk−local = α ∏ sj . (222)
j
α /3 1 − za α /3 ∆r
1 1
2 2 j 4
(224)
The result will be a (k − 1)-local Hamiltonian with the same low-lying spectrum as
Hk−local to within as long as ∆ = Ω (−3 ).
Cost
• Only 1 auxiliary qubit.
• ∆ = Ω (−3 )
Example
Bibliography
• Described in: [47], based on: [45].
64
Summary
Define one auxiliary qubit labeled by a and make the transformation:
α /2 1 1 − za
3
Hk−local → (∆ + ( ) ∆ /2 sk ) ( ) (225)
2 2
α2/3 √ 1 + za
− (1 + sgn2 α) ((2α) /3 sgn2 α + α1/3 sk − 2∆1/2 ) ( )
2 3
(226)
2 2
α /3 3/4 k−2
1
√ k−1
+ ( ) ∆ (∏ sj − sgn(α)sk−1 ) xa + sgn(α) 2α /3 (∆ /2 + ∆ /2 ) ∏ sj . (227)
3 2 1 3
2 j j
The result is (k − 1)-local and its low-lying spectrum is the same as that of Hk−local when
∆ is large enough.
Cost
• Only 1 auxiliary qubit.
• ∆ = Ω (−3 )
Example
Bibliography
• Described in: [47], based on: [45].
65
Summary
We factor a k-local term into a product of three factors: operators H1 , H2 acting on
non-overlapping spaces, and scalar α. Then introduce an auxiliary qubit labelled by a
and make the transformation:
√
1 − za α 2 α 2 α∆
Hk−local → ∆ + H1 + H2 + (−H1 + H2 ) xa . (228)
2 2 2 2
The resulting Hamiltonian has a degree of 1 larger than the degree of whichever
factor H1 or H2 has a larger degree, and the low-lying spectrum is equivalent to the
original one to within O (α) for sufficiently large ∆.
Cost
• 1 auxiliary qubit for each k-local term that can be factored into two non-
overlapping subspaces, is enough to reduce the degree down to k/2 + 1.
√ 6
α(∣∣H(else) +Ω( 2)max(∣∣H1 ∣∣,∣∣H2 ∣∣)∣∣)
• ∆= 2 = Ω (α−2 ).
Pros
• Potentially very few auxiliary qubits needed.
Cons
• Requires the ability to factor k-local terms into non-overlapping subspaces that
are at most (k − 2)-local in order to reduce k-locality. This is not possible for
z1 x2 x3 + z2 z3 x4 , for example.
• ∆ needs to be rather large.
Summary
For any k-local term, we can subdivide it into a product of two (k/2) −local terms:
1 − za 1 + za √
∆ + ∣α∣ + ∣α∣∆/2 (sgn(α)H1,(k/2−local) − H2,(k/2−local) ) xa (230)
2 2
The result is a (k/2)-local Hamiltonian with the same low-lying spectrum as Hk−local for
large enough ∆. The disadvantage is that ∆ has to be larger.
Cost
• ∆ ≥ ( 2∣α∣
+ 1) (∣α∣ + + 22∣∣H(k−1)−local )
Pros
• only one qubit to reduce k to ⌈k/2⌉ + 1
Cons
• Only beneficial for k ≥ 5.
Example
Bibliography
• Original paper: [47].
67
Summary
For a sum of terms that are k-local, with each term j written as a product H1j H2j ,
introduce Ncore ‘core’ auxiliary qubits labeled by ai and Ndirect ‘direct’ auxiliary qubits
labeled by aij for each term j. Make all core auxiliary qubits couple to all others, and
make the direct auxiliary qubits couple each H1j and H2j to the core auxiliary qubits.
∑ aj H1j H2j → α ∑ij (1 − zaij zai + αj xaij (H1j − H2j )) + α ∑i (1 − zai + ∑j (1 − zai zaj ))
j
(231)
If we would like the spectrum of the RHS to be close to that of the LHS, with a
difference of O(), then for any d ∈ (0, 1) we can choose
⎧
⎪
1
⎫
1 ⎪⎞
⎛ ⎪ −2 ∥Helse ∥2 d
−2 ⎪
Ndirect ∈ Ω max ⎨ d , ( ) , (M ) ⎬ ,
3 1−d
(232)
⎝ ⎪
⎪ 2M maxj ∣aj ∣
4 ⎪
⎪ ⎠
⎩ ⎭
Ncore ∈ Ω (M 3 Ndirect
d
−1 ) , (233)
αj , α ∈ O() (234)
.
Pros
• For spectral error , uses only O() coupling between the qubits (See Equation
234).
Cons
• Uses poly(−1 ) ancilla qubits (See Equations 232 and 233).
• The construction only describes the asymptotic scaling of the parameters rather
than concrete assignments of them. More work is needed for finding tight non-
asymptotic error bounds in perturbative expansion.
Example
Bibliography
• Original paper: [48].
68
Here we do not reduce k by one order at a time (1B1 reduction) or by k/2 at a time
(SD reduction), but we directly reduce k-local terms to 2-local terms.
Summary
Express a sum of k-local terms as a sum of products of Pauli matrices sij , and define
k auxiliary qubits laelled by aij for each term i, and make the transformmation:
k
−k(−)k 1⎛ 2 k ⎞ k
∑ αi ∏ sij → ∑ k − ∑ zaij zail + (αi si1 xi1 + ∑ sij xij ) − f ()Π, (235)
i j (k − 1)! i 2 ⎝ jl ⎠ j
for some polynomial f (λ). The result is a 2-local Hamiltonian with the same low-lying
spectrum to within k+1 for sufficiently smmall .
Cost
• Number of auxiliary qubits is tk for t terms.
• Unknown requirement for .
Pros
• All done in one step, so easier to implement than 1B1 and SD gadgets.
Cons
• Requires 2 more auxiliary qubits per term than 1B1-KKR.
• Unknown polynomial f (λ)
Example
Bibliography
• Original paper [49].
69
Summary
The 4-body Hamiltonian:
H4-local = − ∑ (z4i+1,j z4i+2,j z4i+3,j z4i+4,j + x4i+3,j x4i+4,j x4i+6,j x4i+4,j+1 ) (236)
ij
H2-local = − ∑ (x8i+4,j x8i+6,j + x8i+3,j+1 x8i+5,j+1 + z8i+4,j z8i+3,j+1 + z8i+6,j z8i+5,j+1 + (237)
ij
For λ = O (−5 ), the 2-local Hamiltonian has the same low-lying spectrum as the 4-local
Hamiltonian, to within an error of .
Cost
• In total, uses four times the number of qubits of the original Hamiltonian.
• Unknown requirement for λ.
Pros
• All done in one step, so easier to implement than implementing two 1B1 gadgets.
• Very symmetric
Cons
• Ordinary 1B1 or SD followed by 3→2 gadgets would require half as many total
qubits.
• Many 2-local terms.
• Perturbative, as opposed to NR-OY which is similar but does not involve any λ
parameter.
Part IV
Appendix
In this implementation the variable 12 (z1 + z2 ) plays the role of t, assuming λ is large
and positive. For instance, when coupled to a binary variable t z3 → 12 (z1 + z2 ) z3 .
71
Example Here we show how deductions can arise naturally from the Ramsey number
problem. Consider R(4, 3) with N = 4 nodes. Consider a Hamiltonian:
H = (1 − z12 )(1 − z13 )(1 − z23 ) + . . . + (1 − z23 )(1 − z24 )(1 − z34 ) + z12 z13 z14 z23 z24 z34 . (241)
H = 2(1 − z12 )(1 − z13 )(1 − z23 ) + . . . + (1 − z23 )(1 − z24 )(1 − z34 ) + (242)
z14 z24 z34 (z12 z13 + z12 z23 + z13 z23 − z12 − z13 − z23 + 1). (243)
We could repeat this process to remove all 5- and 4-local terms without adding any
auxiliary qubits. Note in this case the error terms added by deduc-reduc already appear
in our Hamiltonian.
72
XIV. 2 → 2 GADGETS
This review has only focused on k−local to 2−local transformations where k > 2.
There is also a large number of 2−local to 2−local transformations in the literature,
which are used for various pruposes. Some of these are listed here:
• Gadgetization of any 2−local Hamiltonian into {11, z, x, zz, xx} or {11, z, x, zx} [51].
Used for the proof that xx + zz or xz is universal is enough for universal quantum
computation. In other words, any computation can be transformed into a problem
of finding the ground state of a 2−local Hamiltonian containing terms from
{11, z, x, zz, xx} or from {11, z, x, zx} with real coefficients, and the ground state
can be found by adiabatic quantum computing with only polynomial time and
space overhead over the best alternative algorithm for the problem.
• Transformation of any 2−local Hamiltonian into {11, z, x, zz, xx + yy}, without any
perturbative gadgets, and only requiring the qubits to be connected in an almost
2D lattice [52].
• "Cross gadget", "fork gadget", and "triangle gadget" described in [45].
• Minor-embedding general problems for the Chimera [53] graph [22, 54].
• Minor-embedding quadratization gadgets for the Pegasus [55] graph [56].
73
• Design of a tunable zzz coupling in which all zz couplings are cancelled, and its
experimental demonstration [76].
75
XVII. CONTRIBUTORS
Richard Tanburn
• Richard was the original creator and maintaner of the Git repository.
• Richard created the Tex commands used throughout the document, and con-
tributed majorly to the overall layout.
• Richard wrote the original versions of the following sections: (1) Deduc-Reduc,
(2) ELC Reduction, (3) Groebner Bases, (4) Split Reduction, (5) NTR-KZFD, (6)
NTR-GBP, (7) PTR, (8) PTR-Ishikawa, (9) PTR-KZ, (10) PTR-GBP, (11) Bit flipping,
(12) RBS, and (13) FGBZ.
Nicholas Chancellor
• Nick made contributions to the following sections: (1) RMS (in terms of z), (2)
PTR-RBL-(3→2), (3) PTR-RBL-(4→2), (4) SBM, (5) Flag based SAT Mapping, and
to the qutrit → qubit transformation (6).
Szilard Szalay
• Szilard re-derived Nike’s transformations for the sections: (1) SFR-BCR-1, (2)
SFR-BCR-2, (3) SFR-BCR-3, and (4) SFR-BCR-4 from the notation of the original
paper, into the format consistent with the rest of the book. In doing so he corrected
errors in Nike’s work and also fixed them in the main document.
Ka Wa Yip
• Ka Wa Yip added the page about his own method co-authored with Xu, Koenig
and Kumar.
Yudong Cao
• Yudong wrote the first version of the following section: (1) PSD-CN.
Daniel Nagaj
• Daniel provided a .tex document to Nike in May 2018 which helped Nike to write
the following sections: (1) NP-Nagaj-1, (2) NP-Nagaj-2. The document that Daniel
provided made it easier for Nike to write these sections than the original papers.
76
Aritanan Gruber
Charles Herrmann
• Charles informed us in May 2018 of the papers which contained results which
became the following sections: (1) PTR-BCR-1, (2) PTR-BCR-2, (3) PTR-BCR-3,
(4) PTR-BCR-4, (5) SFR-BCR-1, (6) SFR-BCR-2, (7) SFR-BCR-3, (8) SFR-BCR-4, (9)
SFR-BCR-5, (10) SFR-BCR-6.
Elisabeth Rodriguez-Heck
• Elisabeth also pointed us to what became the following section: (1) ABCG Reduc-
tion.
• Tin fixed a typo in the alternative forms of the following sections: SFR-BCR-3,4.
Andreas Soteriou
• Andreas found typos on the opening page in the arXiv version which surprisingly
no one else found (or pointed out), and he diligently fixed them.
• Andreas created the example involving x,y, and z presented on the opening page
in the September 2019 version (I plan to have this example further improved at a
later time).
Jacob Biamonte
ACKNOWLEDGMENTS
• We thank Mohammad Amin of D-Wave for pointing Nike Dattani to the paper
[33] on determining Ramsey numbers on the D-Wave device, which contained
what we call in this review "Reduction by substitution", later found through
Ishikawa’s paper to be from the much older 1970s paper by Rozenberg.
• We thank Aritanan Gruber and Endre Boros of Rutgers University, who in August
2015 shared with Nike Dattani and Richard Tanburn some of their wisdom about
sub-modularity, and Aritanan Gruber for pointing us to the then very recent paper
of Hiroshi Ishikawa on what we call in this review "ELC reductions", which was
also a valuable paper due to the references in it. We also thank him for helping us
in our quest to determine whether or not "deduc-reduc" was a re-discovery by
Richard, Emile, and Nike, or perhaps a novel quadratization scheme.
• We thank Toby Cathcart-Burn of Oxford University, who during the third year
of undergraduate study, worked with Nike Dattani and Richard Tanburn and in
Autumn 2015 and Winter 2016 helped us gain insights about the application of
deduc-reduc and bit flipping to the problem of determining Ramsey numbers
via discrete optimization, and for insights into the trade-offs between Ishikawa’s
symmetric reduction and reduction by substitution.
• We thank Hiroshi Ishikawa of Waseda University, who Nike Dattani had the
memorable opportunity to visit in November 2015, and through discussions
about the computer vision problem and neural network problem (two examples
of real-world discrete optimization problems that benefit from quadratization),
provided insights about the role of quadratization for calculations on classical
computers. In particular, solving the computer vision problem in which he had
experience solving on classical computers, was very different from the integer
factoring problem and Ramsey number problem which we had been attempting
78
to quadratize for D-Wave and NMR devices. He taught us that far more total
(original plus auxiliary) variables can be tolerated on classical computers than on
D-Wave machines or NMR systems, and approximate solutions to the optimiza-
tion problems are acceptable (unlike for the factorization and Ramsey number
problems in which we were interested). This gave us more insight into what
trade-offs one might wish to prioritize when quadratizing optially. We also thank
him for helping us in our quest to determine whether or not "deduc-reduc" was
a re-discovery by Richard, Emile, and Nike, or perhaps a novel quadratization
scheme.
• Last but indubitably not least, we thank Jacob Biamonte of Skolkovo Institute of
Technology, who Nike Dattani enjoyed visiting in Hangzhou in January 2017 and
meeting at Harvard University in April 2018. Jacob provided us plenty of insights
about perturbative gadgets, made valuable comments on early versions of our
manuscript, and has been a major supporter of this review paper since the idea
was presented to him in December 2016. At many points during the preparation
of this review, we had prioritized other commitments and put preparation of this
review aside. Jake’s frequent encouragement was often what got us working on
this review again. Without him, we are not certain this paper would have been
completed by this time (or ever!).
79
[1] Richard Tanburn, Emile Okada, and Nike Dattani, “Reducing multi-qubit interactions in
adiabatic quantum computation without adding auxiliary qubits. Part 1: The "deduc-reduc"
method and its application to quantum factorization of numbers,” (2015), arXiv:1508.04816.
[2] Hiroshi Ishikawa, “Higher-Order Clique Reduction without Auxiliary Variables,” in 2014
IEEE Conference on Computer Vision and Pattern Recognition (IEEE, 2014) pp. 1362–1369.
[3] Magali Bardet, “On the Complexity of a Gröbner Basis Algorithm,” Algorithms Seminar ,
85–92 (2002).
[4] Raouf Dridi and Hedayat Alghassi, “Prime factorization using quantum annealing and
computational algebraic geometry,” (2016), arXiv:1604.05796.
[5] Asem M. Ali, Aly A. Farag, and Georgy L. Gimel’farb, “Optimizing Binary MRFs with
Higher Order Cliques,” (Springer, Berlin, Heidelberg, 2008) pp. 98–111.
[6] Richard Tanburn, Oliver Lunt, and Nikesh S. Dattani, “Crushing runtimes in adiabatic
quantum computation with Energy Landscape Manipulation (ELM): Application to Quan-
tum Factoring,” Physical Review A (submitted). (2015), arXiv:1510.07420.
[7] Nike Dattani, Quadratization in discrete optimization and quantum mechanics (2019)
arXiv:1901.04405.
[8] Emile Okada, Richard Tanburn, and Nikesh S. Dattani, “Reducing multi-qubit interactions
in adiabatic quantum computation without adding auxiliary qubits. Part 2: The "split-
reduc" method and its application to quantum determination of Ramsey numbers,” , 5
(2015), arXiv:1508.07190.
[9] Martin Anthony, Endre Boros, Yves Crama, and Aritanan Gruber, “Quadratic reformu-
lations of nonlinear binary optimization problems,” Mathematical Programming 162,
115–144 (2017).
[10] V. Kolmogorov and R. Zabih, “What energy functions can be minimized via graph cuts?”
IEEE Transactions on Pattern Analysis and Machine Intelligence 26, 147–159 (2004).
[11] D. Freedman and P. Drineas, “Energy Minimization via Graph Cuts: Settling What is Pos-
sible,” in 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition
(CVPR’05), Vol. 2 (IEEE, 2005) pp. 939–946.
[13] Martin Anthony, Endre Boros, Yves Crama, and Aritanan Gruber, “Quadratization of
symmetric pseudo-Boolean functions,” Discrete Applied Mathematics 203, 1–12 (2016).
[14] Martin Anthony, Endre Boros, Yves Crama, and Aritanan Gruber, “Quadratization of
symmetric pseudo-Boolean functions,” (2014), arXiv:1404.6535.
[15] Andrew C. Gallagher, Dhruv Batra, and Devi Parikh, “Inference for order reduction in
Markov random fields,” in CVPR 2011 (IEEE, 2011) pp. 1857–1864.
80
[16] Andrea Rocchetto, Simon C Benjamin, and Ying Li, “Stabilizers as a design tool for new
forms of the Lechner-Hauke-Zoller annealer,” Science Advances 2 (2016), 10.1126/sci-
adv.1601246.
[17] Wolfgang Lechner, Philipp Hauke, and Peter Zoller, “A quantum annealing architecture
with all-to-all connectivity from local interactions,” Science Advances 1 (2015), 10.1126/sci-
adv.1500838.
[18] Endre Boros and Aritanan Gruber, “On Quadratization of Pseudo-Boolean Functions,”
(2014), arXiv:1404.6538.
[19] Endre Boros, Yves Crama, and Elisabeth Rodríguez-Heck, “Quadratizations of symmetric
pseudo-boolean functions: sub-linear bounds on the number of auxiliary variables,” in
ISAIM (2018).
[20] Endre Boros, Yves Crama, and Elisabeth Rodríguez-Heck, “Compact quadratizations for
pseudo-boolean functions,” in unpublished (2018).
[21] Ka Wa Yip, Hong Xu, T. K. Satish Kumar, and Sven Koenig, “Quadratic reformulation of
nonlinear pseudo-boolean functions via the constraint composite graph,” in the Interna-
tional Conference on Integration of Artificial Intelligence and Operations Research Techniques in
Constraint Programming (2019) pp. 643–660.
[23] Nicholas Chancellor, Stefan Zohren, and Paul A. Warburton, “Circuit design for multi-
body interactions in superconducting quantum annealing system with applications to a
scalable architecture,” (2016), 10.1038/s41534-017-0022-6, arXiv:1603.09521.
[24] Martin Leib, Peter Zoller, and Wolfgang Lechner, “A Transmon Quantum Annealer: De-
composing Many-Body Ising Constraints Into Pair Interactions,” (2016), arXiv:1604.02359.
[25] Martin Leib, Peter Zoller, and Wolfgang Lechner, “A transmon quantum annealer: de-
composing many-body Ising constraints into pair interactions,” Quantum Science and
Technology 1, 15008 (2016).
[26] N Chancellor, S Zohren, and P A Warburton, “Circuit design for multi-body interactions in
superconducting quantum annealing systems with applications to a scalable architecture,”
npj Quantum Information 3, 21 (2017).
[28] N. Chancellor, S. Zohren, P.A. Warburton, S.C. Benjamin, and S. Roberts, “A Direct
Mapping of Max k-SAT and High Order Parity Checks to a Chimera Graph,” Scientific
Reports 6 (2016), 10.1038/srep37107.
[29] Martin Anthony, Endre Boros, Yves Crama, and Aritanan Gruber, “Quadratic reformula-
tions of nonlinear binary optimization problems,” .
[32] Alejandro Perdomo, Colin Truncik, Ivan Tubert-Brohman, Geordie Rose, and Alán Aspuru-
Guzik, “Construction of model Hamiltonians for adiabatic quantum computation and its
application to finding low-energy conformations of lattice protein models,” Phys. Rev. A
78, 12320 (2008).
[33] Zhengbing Bian, Fabian Chudak, William G. Macready, Lane Clark, and Frank Gaitan,
“Experimental Determination of Ramsey Numbers,” Physical Review Letters 111, 130505
(2013).
[34] Alexander Fix, Aritanan Gruber, Endre Boros, and Ramin Zabih, “A graph cut algorithm
for higher-order Markov Random Fields,” in 2011 International Conference on Computer
Vision (IEEE, 2011) pp. 1020–1027.
[35] Gemma De las Cuevas and Toby S. Cubitt, “Simple universal models capture all classical
spin physics,” Science 351, 1180—-1183 (2016).
[36] Kahl and Strandmark, “Generalized Roof Duality for Pseudo-Boolean Optimization,”
International Conference on Computer Vision (2011).
[37] Samuel A. Ocko and Beni Yoshida, “Nonperturbative Gadget for Topological Quantum
Codes,” Physical Review Letters 107, 250502 (2011).
[38] Yigit Subasi and Christopher Jarzynski, “Nonperturbative embedding for highly nonlocal
Hamiltonians,” Physical Review A 94, 012342 (2016).
[39] Richard P. Feynman, “Quantum Mechanical Computers,” Optics News 11, 11 (1985).
[40] Daniel Nagaj, “Fast universal quantum computation with railroad-switch local Hamiltoni-
ans,” Journal of Mathematical Physics 51, 062201 (2010).
[42] Qian-Heng Duan and Ping-Xing Chen, “Realization of universal adiabatic quantum com-
putation with fewer physical resources,” Physical Review A 84, 042332 (2011).
[43] Julia Kempe, Alexei Kitaev, and Oded Regev, “The Complexity of the Local Hamiltonian
Problem,” (2004), arXiv:0406180 [quant-ph].
[44] Julia Kempe, Alexei Kitaev, and Oded Regev, “The Complexity of the Local Hamiltonian
Problem,” SIAM Journal on Computing 35, 1070–1097 (2006).
[45] Roberto Oliveira and Barbara M. Terhal, “The complexity of quantum spin systems on a
two-dimensional square lattice,” Quantum Info. Comput. 8, 900–924 (2008).
[46] Sergey Bravyi, David P. DiVincenzo, Daniel Loss, and Barbara M. Terhal, “Quantum Sim-
ulation of Many-Body Hamiltonians Using Perturbation Theory with Bounded-Strength
Interactions,” Physical Review Letters 101, 070503 (2008).
[47] Yudong Cao, Ryan Babbush, Jacob Biamonte, and Sabre Kais, “Hamiltonian gadgets with
reduced resource requirements,” Physical Review A 91, 012315 (2015).
82
[48] Yudong Cao and Daniel Nagaj, Quantum Information & Computation, Vol. 15 (Rinton Press,
2015) pp. 1197–1222.
[49] Stephen P. Jordan and Edward Farhi, “Perturbative gadgets at arbitrary orders,” Physical
Review A 77, 062329 (2008).
[50] Courtney G Brell, Steven T Flammia, Stephen D Bartlett, and Andrew C Doherty, “Toric
codes and quantum doubles from two-body Hamiltonians,” New Journal of Physics 13,
053039 (2011).
[51] Jacob D. Biamonte and Peter J. Love, “Realizable Hamiltonians for universal adiabatic
quantum computers,” Physical Review A 78, 012352 (2008).
[52] Seth Lloyd and Barbara M Terhal, “Adiabatic and Hamiltonian computing on a 2D lattice
with simple two-qubit interactions,” New Journal of Physics 18, 023042 (2016).
[53] Hartmut Neven, Vasil S Denchev, Marshall Drew-Brook, Jiayong Zhang, William G
Macready, and Geordie Rose, NIPS 2009 Demonstration: Binary Classification using Hardware
Implementation of Quantum Annealing, Tech. Rep. (2009).
[55] Nikesh S. Dattani, Szilard Szalay, and Nicholas Chancellor, “Pegasus: The second connec-
tivity graph for large-scale quantum annealing hardware,” Pegasus: The second connectiv-
ity graph for large-scale quantum annealing hardware (2019), arXiv:1901.07636.
[56] Nike Dattani and Nick Chancellor, “Embedding quadratization gadgets on Chimera and
Pegasus graphs,” (2019), arXiv:1901.07676.
[57] Ryan Babbush, Bryan O’Gorman, and Alán Aspuru-Guzik, “Resource efficient gadgets for
compiling adiabatic quantum optimization problems,” Annalen der Physik 525, 877–888
(2013).
[58] Courtney G Brell, Stephen D Bartlett, and Andrew C Doherty, “Perturbative 2-body parent
Hamiltonians for projected entangled pair states,” New Journal of Physics 16, 123056
(2014).
[59] Jianbing Shen, Jianteng Peng, Xingping Dong, Ling Shao, and Fatih Porikli, “Higher Order
Energies for Image Segmentation,” IEEE Transactions on Image Processing 26, 4911–4922
(2017).
[60] Endre Boros and Peter L. Hammer, “Pseudo-Boolean optimization,” Discrete Applied
Mathematics 123, 155–225 (2002).
[61] Gernot Schaller and Ralf Schützhold, “The role of symmetries in adiabatic quantum
algorithms,” Quantum Information & Computation 10, 109–140 (2010).
[62] Nanyang Xu, Jing Zhu, Dawei Lu, Xianyi Zhou, Xinhua Peng, and Jiangfeng Du, “Quan-
tum Factorization of 143 on a Dipolar-Coupling Nuclear Magnetic Resonance System,”
Physical Review Letters 108, 130501 (2012).
[63] William H. Cunningham, “Minimum cuts, modular functions, and matroid polyhedra,”
Networks 15, 205–215 (1985).
83
[64] Yves Crama and Elisabeth Rodriguez Heck, Short Prime Quadratizations of Cubic Negative
Monomials (HEC ULg, 2014).
[65] Endre Boros and Isabella Lari, “Cones of nonnegative quadratic pseudo-boolean func-
tions,” in International Symposium on Artificial Intelligence and Mathematics, ISAIM 2014, Fort
Lauderdale, FL, USA, January 6-8, 2014 (2014).
[66] Aron J. Cohen, Hongjun Luo, Kai Guther, Werner Dobrautz, David P. Tew, and Ali Alavi,
“Similarity transformation of the electronic Schrödinger equation via Jastrow factorization,”
The Journal of Chemical Physics 151, 061101 (2019).
[67] Joel Strand, Anthony Przybysz, David Ferguson, and Ken Zick, “ZZZ coupler for native
embedding of MAX-3SAT problem instances in quantum annealing hardware,” Bulletin of
the American Physical Society Volume 62, Number 4 (2017).
[68] David Ferguson, Anthony Przybysz, and Joel Strand, “ZZZ coupler for superconducting
qubits,” US Patent: 20180261752A1 (2017).
[69] David Ferguson, Anthony Przybysz, and Joel Strand, “ZZZ coupler for superconducting
qubits,” US Patent: 20180342663A1 (2018).
[70] Andrew Kerman, “Design and simulation of complex superconducting circuits for ad-
vanced quantum annealing hardware,” Bulletin of the American Physical Society Volume
63 (2018).
[71] Tim Menke, Cyrus Hirjibehedin, Steven Weber, Gabriel Samach, Simon Gustavsson, Alan
Aspuru-Guzik, William Oliver, and Andrew Kerman, “A many-body coupler for coherent
4-local interaction of superconducting flux qubits,” APS March Meeting 2019, abstract
id.A42.011 (2019).
[72] Andrew Kerman, “Four Spin Couplers for Quantum Information Processing,” US Patent:
20180309452A1 (2018).
[73] Andrew Kerman, “Four Spin Couplers for Quantum Information Processing,” US Patent:
10187065B2 (2019).
[74] Andrew Kerman, “Four Spin Couplers for Quantum Information Processing,” US Patent:
20190158098A1 (2019).
[76] Denis Melanson, Antonio J. Martinez, Salil Bedkihal, and Adrian Lupascu, “Tunable
three-body coupler for superconducting flux qubits,” (2019), arXiv:1909.02091.