Combinatorial Optimization Problems (COPs) are notoriously difficult even as a decision problem [1]—well-known examples include the travelling salesman problem [2] and finding the ground state of a spin glass Hamiltonian [3]. Rather than attempting to find an exact solution, one is often interested in approximate solutions. One such algorithm is the Quantum Approximate Optimization Algorithm (QAOA) introduced by Farhi et al. [4], a type of variational quantum algorithm that, given p layers, uses 2p optimization parameters.
Attempting to evaluate the expectation value of the QAOA is incredibly difficult. Naively, given a problem with size n, each parameter in the QAOA requires a sum over 2n terms. In a series of works starting with [5], algorithms to evaluate the expectation value of the QAOA on q-spin glass models with time complexity independent of n have been found with increasing performance. The best known one for evaluating q-spin glass is found in [6] with a time complexity of 𝒪(p24p) using algebraic techniques.
Another line of research with respect to the QAOA is to prove its limitation in performance at shallow depths via the Overlap Gap Property (OGP). One of the first applications to show the limitation of performance uses locality properties of the QAOA. Thus, at shallow depth, the QAOA does not explore the whole graph underlying a COP and is unable to output a solution that beats the OGP barrier for the Maximum Independent Set problem [7]. The limitation of the QAOA as a result of the OGP has been predominately limited on sparse graphs, but a breakthrough came in [8] using a dense-from-sparse relation between complete graphs and sparse graphs to show that the QAOA is also limited in performance even if it sees the whole graph. Furthermore, they showed that for any constant p in the asymptotic analysis, the QAOA is unable to surpass the OGP barrier. Assuming a stronger version of the OGP, a similar and slightly better result shows limitation at super constant depth, p ~ 𝒪(log log n), for dense graphs [9].
In this paper, we re-derive a result of [8] that for the Max-q-XORSAT problem, and equivalently the mean-field q-spin glass, the QAOA is unable to find the optimal value even if p goes to infinity for even q ≥ 4 if we swap the order of limits, the thermodynamic limit and the run time of the algorithm (i.e., taking limp→∞limn→∞ rather than limn→∞ limp→∞. More precisely, the analysis done by various authors is to study the performance of the QAOA asymptotically by taking the problem size n to infinity and studying the output of the QAOA at constant depths p. This results in the underlying graph explored by the QAOA to appear as a tree and the parameters that optimize the QAOA’s performance in this instance are known as tree-parameters that work well on any Hamiltonian instance [10,11]. While this is a re-derivation of a known proof [8], their proof is significantly more technical compared to ours. Furthermore, our theorem provides some hints that their proof can be extended beyond constant p and hold when p ≤ ϵ log n for dense graphs since we show that their method is implicitly valid at logarithmic depth for sparse graphs.
The paper is organized as follows: In Section 2 we give a brief background to random graphs, spin glass problems, the OGP, and the QAOA; in Section 3, we summarize what is known in literature about the results of the QAOA on spin glasses and their equivalence between the mean-field and dilute spin model; in Section 4 we formalize the point about the OGP in random regular hypergraphs that was mentioned in [6] and re-derive the result in [8], that the QAOA cannot find the optimal value for a spin glass COP even if the algorithm runs indefinitely under limit swapping. Following that, we outline a proof to affirm the theorem while providing some numerical evidence that the proof should extend to odd q-spin glass as well.
The main result of this work is to show that the OGP exists as a limitation for Max-q-XORSAT on a random regular hypergraph with sufficiently large degree. This is done via the following theorem
If the Overlap Gap Property limits the performance of a local algorithm on the Erdös–Rényi hypergraph at logarithmic depth p, it also limits the performance on a random regular hypergraph at logarithmic depth p.
The proof is done via contradiction. First, we show that we can trim the Erdös–Rényi hypergraph of average degree λ to a regular hypertree removing at most an 𝒪(1/λlogλ)-fraction of edges. Then, one can form a λ-regular hypergraph from the hypertree. Assuming an algorithm, that is limited by the OGP, is able to find a near-optimal solution on the regular hypergraph leads to a contradiction since such an algorithm is unable to find near-optimal solution on Erdös–Rényi hypergraph. Thus, the OGP also acts as a barrier to optimization for random regular graph.
This limitation to logarithmic depth local algorithms leads us to re-derive a recently discovered theorem and improve upon its results:
(Informal, theorems 3 and 4 of [8]). When q ≥ 4 and is even, the performance of the QAOA is limited at logarithmic depth for the Max-q-XORSAT with an underlying D-regular q-uniform hypergraph. The performance is strictly upper-bounded by the OGP since
There are several immediate corollaries of this result for the QAOA. The first of which was noted in [6] as a side-note.
Optimizing the QAOA using the algorithm in [6,8] only allows it to perform equally in performance to local classical algorithms, thus providing no quantum advantage.
The above corollary is a result of optimizing the QAOA under limit swapping of the algorithm run time p and the problem size n. This therefore results in the following corollary:
If a COP exhibits the OGP, then optimizing QAOA via limit swapping (i.e. using the tree parameters) results in sub-optimal performance.
Here, we standardize the notation we use to denote a hypergraph. A hypergraph G = (V, E) has |V| = n vertices, and |E| = m edges. A graph is q-uniform if every hyperedge connects to exactly q vertices. A graph is d-regular if every vertex has degree d. Conventionally, an instance of the Erdös–Rényi–(Gilbert) q-uniform hypergraph
Another type of random hypergraph of interest is the d-regular q-uniform hypergraph G = ℝq(n, d), where we implicitly assume that nd = qx for some integer x. Unlike Erdös–Rényi graphs that can be generated randomly, there is no easy unbiased way to generate such graphs, though one such method is known as the configuration model introduced by Bollobás [13].
In this paper, we focus on the mean field spin glass model and the related dilute spin glass model. For the mean field model, the main goal, roughly speaking, is to find the ground state energy of a spin glass Hamiltonian. A widely studied model is the Sherrington–Kirkpatrick (SK) model [14]. More generally, an Ising mean field q-spin model is given by the following
The ground state energy of the mean-field spin glass can be computed exactly [15], which we denote as the Parisi constant
The dilute spin glass model is also known as the XOR-satisfiability (XORSAT) problem. Specifically, given a q-uniform hypergraph G = (V, E) where E ⊂ Vq, and a given signed weight Ji1,…,iq ∈ {–1,+1}, Max-q-XORSAT is the problem of maximizing the following cost function
In terms of the hypergraph, zij refers to the vertices and Ji1i2…iq refers to the hyperedge.
We say that an instance of the problem is satisfied if there is an assignment of values to the bit-string z which satisfies all the clauses (i.e.
One major obstacle to finding optimal solutions for COPs is known as the Overlap Gap Property (OGP). The term was introduced in [17], though the concept was already used by various authors [18,19].
For the definition of the OGP, one can informally think that for certain choices of disorder J, there is a gap in the set of possible pairwise overlaps of near-optimal solution. Informally, for every two near-optimal solution z1, z2, it is the case that the distance between them is either extremely small, or extremely large. Formally, we define the OGP for a single instance as the following:
Definition 1 (Overlap Gap Property [20]). For a general maximization problem with random input J, the OGP holds if there exists some ϵ > 0, with 0 ≥ μ1 ≤ μ2 such that for every z1, z2 that is an ϵ-optimal solution
Rather than using the overlap, it is often easier to visualize sets of solutions as clusters using the Hamming distance between two states via the relation
The first interval is trivial as we can simply choose the overlap z1 with itself. It is the existence of the second overlap, or rather the non-existence of overlap in the interval (a, b), that is difficult to prove.
A general version of it is known as the ensemble-OGP introduced in [21] or coupled-OGP as used in [22]. This version is required to prove limitations of local algorithm for technical reasons and requires an interpolation scheme between two different instances of Erdös–Rényi graphs. For spin glasses, a branching OGP has been developed that makes use of the ultrametric structure in the Parisi solution [23].
Informally speaking, the OGP limits the performance of algorithms by first showing that the set of near-optimal solutions exhibits a strong clustering property. That is, with high probability, a gap exists in their overlaps or equivalently, a gap in the hamming distance between near-optimal solutions. Then, one proceeds to show that if an algorithm is able to find arbitrary near-optimal solution, the algorithm outputs a solution that is in the region forbidden by the OGP. In the context of the QAOA, one typically uses the concentration of measure to prove the limitation of the QAOA at shallow depth [8,22]. We refer the reader to the review papers by Garmanik [3,20] for details on how the OGP limits the performance of algorithms.
The QAOA is a local quantum algorithm [22] designed to find approximate solutions to combinatorial optimization problems [4]. The goal is to find a bit string z ∈ {–1, +1}n that maximizes the cost function C(z). Given a classical cost function C, we can define a corresponding quantum operator
For a given cost function C, the corresponding QAOA objective function is the expectation value
The first result of the QAOA on spin glasses can be found in [5] where the authors applied the QAOA on the Sherrington–Kirkpatrick (SK) model and found an algorithm to evaluate the expectation value in the infinite limit after averaging over the disorder J. More generally, for a q-spin glass with cost function
(Theorem 1 of [8]). For any p and any parameters (γ, β), we have
In [6], the authors evaluated the performance of the QAOA for Max-q-XORSAT on large-girth (D + 1)-regular graphs. By restricting to graphs that are regular and have girth (also known as the shortest Berge-cycle) greater than 2p + 1, the subgraph explored by the QAOA at depth p will appear as a regular tree. Since the optimal cut fraction is of the form
Let
(Theorem 2 of [6]). For
Furthermore, the authors also made the following conjecture based on promizing numerical evidence,
Conjecture 5 (Conjecture of [6]). Optimizing the QAOA using tree-parameters found in [6], the Parisi value for the Sherrington–Kirkpatrick model can be reached:
Note that to compute
While we are unable to prove or disprove the conjecture, we prove later that the generalized Parisi value Πq is not obtainable if the OGP is present.
One point to note is that in [7], it has been shown that at low depth p, if a problem exhibits the OGP, then the locality of the QAOA makes it such that it is prevented from getting close to the optimal value if it does not see the whole graph. Specifically, the following theorem is proven.
(modified version of Corollary 4.4 in [22]). For Max-q-XORSAT on a random Erdös–Réyi directed multihypergraph, for every even q ≥ 4, there exists a value ηOGP < ηOPT, where ηOPT is the energy of the optimal solution, and a sequence {δ(d)}d≥1 with the following property: for every ϵ > 0 there exists sufficiently large d0 such that for every d > d0, every p < δ(d) log n and an arbitrary choice of parameters γ, β with probability converging to 1 as n → ∞, the performance of the QAOA with depth p satisfies
The authors of [6] noted that assuming the OGP also holds for regular hypergraphs, then a similar argument can be used to show that the QAOA’s performance as measured by the algorithm in Theorem 4 does not converge to the Parisi value Πq for even q ≥ 4. This is because the large girth assumption implies that the graph has at least Dp vertices so p is always less than ϵ log n in this limit. For the Max-q-XORSAT, the subgraph explored at constant p has q[(q – 1)pDp + … + (q – 1)D + 1] vertices. This lays the foundation of Theorem 10 later.
The first equivalence between spin glass and MaxCUT for the QAOA was shown in [6], where the performance of the QAOA at depth p on the SK model as n → ∞ to is equal to the performance of the QAOA at depth p on MaxCUT problems on large-girth (D + 1)-regular graphs when D → ∞. In the follow-up work of [8], they generalize this result to show that the QAOA’s performance for the q-spin model is equivalent to that for Max-q-XORSAT on any large girth D-regular hypergraphs in the limit D → ∞.
(Theorem 3 of [8]). Let
The equivalence of performance of the QAOA on dense and sparse graph is also shown to hold in the case of Erdös–Rényi graph.
(modified version of theorem 2 in [8]). Let
We now have the pieces in place to state our main theorem
Given a local algorithm 𝒜 that is limited in performance up to depth p = ϵ log n on an Erdös–Rényi hypergraph with sufficiently large average degree λ, 𝒜 is also limited in performance up to depth p on a random D-regular hypergraph for sufficiently large D.
We delay the proof to Section 4.1 and note an immediate consequence for the performance of the QAOA.
For Max-q-XORSAT on a D-regular q-uniform hypergraph, for every even q > 4, there exists a value ηOGP such that it is smaller than the optimal value ηOPT with the following property. For every ϵ > 0 there exists sufficiently large d0 such that for every d > d0, every p ≤ d log n and an arbitrary choice of parameters γ, β with probability converging to 1 as n → ∞, the performance of the QAOA with depth p satisfies
Proof. The proof of the inequality follows from Theorem 6 and Theorem 9.
The proof of the equality comes from Theorem 9 which states that evaluating
As a result of this theorem, we re-derive and improve a theorem of [8]:
(Theorem 4 of [8]). From Theorem 10 the performance of the QAOA on the pure q-spin glass for even q ≥ 4 is upper bounded by ηOGP as p → ∞ and is strictly less than the optimal value, i.e. the Parisi value Πq, under the swapping of limits.
We emphasize again that while [8] proves this corollary, it does so via explicit calculation of the QAOA using a highly technical proof involving a generalized multinomial sum. This work improves upon it by providing a simpler, straightforward argument using graph properties and extending their result from constant depth to ϵ log n depth for the random regular graph. Note that both proofs require Theorem 6 to prove this corollary.
Unfortunately, the QAOA performance on mean-field spin glass has only been shown to be equivalent in performance to the dilute spin glass model via Eq. (16) at constant depth p, and we are unable to find a proof to extend their equivalence. As such, we are not able to prove that the OGP acts as a barrier on dense graphs at logarithmic depth. If one can show that an asymptotic analysis of the mean-field spin glass at constant depth and logarithmic depth result in the same solution as in the random regular graph, this could extend the algorithmic barrier for dense graphs to logarithmic depth as well.
We note that Corollary 3 shows that the QAOA will not be able to find the optimal value even when it sees the whole graph, and the algorithm runs indefinitely if one optimizes the parameters of the QAOA using the tree-parameters obtained via asymptotic analysis. Formally, the Parisi value is attainable via the QAOA with the following limits
The iteration provided in Theorem 4 swaps the limits which results in failure of the QAOA to find the optimal value. This leads us to the following corollaries.
If the OGP exists in a spin glass type problem, then the swapping of limits results in a sub-optimal solution for both random regular graphs and Erdös–Rényi graphs. In other words, a necessary condition for the validity of limit swapping is that the problem does not exhibit the OGP.
For q-spin glass, it is expected that OGP holds for all q ≥ 3, which suggests that limit swapping is not allowed for all mean-field spin glasses with the possible exception for the 2-spin glass model (i.e. the SK model) [25].
In addition, Corollary 4 also applies to the Maximum Independent Set problem since [7] shows a similar limitation at logarithmic depth. We suspect that similar results extend to all COPs rather than being limited to spin glass type COPs.
Proof. The proof is as follows: first, we need to show that for some λ ∈ ℤ+, a λ-regular q-uniform hypertree can be embedded into an Erdös–Rényi hypergraph of sufficiently high average connectivity. Then, we show that a λ-regular q-uniform hypergraph can be generated from said hypertree. Finally, we show that algorithm 𝒜 must also fail to find solutions arbitrarily close to the optimal solution in a λ-regular q-uniform hypergraph as doing so would result in a contradiction.
We note that a COP instance with m chosen edges can be converted into a regular instance changing only
Here, we show that an Erdös–Rényi hypergraph can be converted into a hypertree by changing only
Remove edges until di < γ′ for all vertices.
Add edges to all vertices until di = λ′ where each vertex is chosen with probability proportional to λ′ – di. In order to prove this, we will need the following lemmas.
In the limit n → ∞, the number of edges removed from
Proof. Note that the distribution of edges in an Erdös–Rényi follows a binomial distribution B(m, 1/n). For each vertex vi, the number of edges removed is either 0 or di – λ′ so Δi := max (λ′,0). The first moment of Δi is bounded by
Where we used the fact that the expectation value of a random variable equals the cumulative function in the second equality.
The second moment can also be bounded as
Using Chernoff’s bound for the binomial distribution, we have
Applying Eq. (25) to the second moment bound, we have
For the total number of edges removed Δ, we note that by a union bound,
Since the degree of each vertex is not independent (i.e. follows a multinomial distribution), the covariance term is negative. Therefore, as n → ∞, Δ is at most n𝒪λ(d–c log d) for some c > 0 with high probability.
The process of removing edges does not create cycles (i.e. destroy tree-like property). However, we need to ensure that the graph was initially tree-like and remains tree-like after adding edges. Rather than working with
Fix any constant λ and l ≤ ϵ log n. With high probability as n → ∞, 1-o(1) fraction of the l-local neighbourhood are treelike.
Proof. Let
By Markov’s inequality, we thus have
Now we show that the l-local neighbourhood of an arbitrary vertex has at most o(1) k-cycles. In the limit n → ∞, the degree of each vertex follows a Poisson distribution with mean λ. Let Υ denote the degree of an arbitrary vertex. The probability that any vertex v has degree at most log n is given by
From the Chernoff bound for the Poisson distribution, we have that
Thus, the l-local neighbourhood has at most (log n)ϵlogn vertices. Repeating the same argument for the number of k-cycles shows that only o(1) of the l-local neighbourhood will contain a cycle.
Fix any λ, there exists some ϵ > 0 such that for l ≤ ϵ log n, with high probability as n → ∞, adding edges preserves trees in 1-o(1) fraction of the l-local neighbourhood.
Proof. Right after removing edges, every vertex has at most degree λ′ so given some constant λ and l ≤ ϵ log n, the l-local neighbourhood BG(v, l) is upper-bound by λ′ϵlogn and is a hypertree. Then, we have to add on average n(λ log λ) ~ Θ(n) edges but since BG(v, l) is of order 𝒪(λ′ϵlogn), the probability that an added hyperedge contains at least two vertices in BG(v, l) is 𝒪(λ′ϵlogn/n).
Choose ϵ < 1/log λ′. As a result, adding clauses results in o(1) fraction of l-local neighbourhood forming a cycle.
Now, we show that a λ-regular, q-uniform hypergraph is locally also a hypertree.
Fix any λ > 1 and p ≤ ϵ log n for some ϵ > 0, with high probability as n → ∞, 1 – oλ(1) fraction of vertices in the p-local neighbourhood are treelike.
Proof. As we are interested in the large n limit, we first show that for fixed p, the dominant term in the probability that a cycle is formed in the large n limit is given by
Now we can show that the OGP is also an obstruction in random regular hypergraphs via contradiction. Assume that an algorithm 𝒜 at logarithmic depth is able to find solutions arbitrarily close to the optimal solution for the Maxq-XORSAT on a regular hypergraph. Then this would imply that 𝒜 is also able to find such solutions when performed on an Erdös–Rényi hypergraph since both graphs are p-locally the same. However, this contradicts Theorem 6 and thus, the OGP must also restrict the performance of logarithmic depth local algorithms when applied to a regular hypergraph.
It is of note that proving that the OGP exists in a problem is much easier when the underlying graph is an Erdös–Rényi hypergraph as compared to a regular hypergraph since only the former can be described by a probability distribution. This is why there is no proof that the OGP exists for the Max-q-XORSAT on regular graph as it requires the Poisson distribution found in an Erdös–Rényi hypergraph. Given Theorem 9 and that it is possible to show that the OGP exists in both Erdös–Rényi hypergraphs and regular hypergraphs in some problems [29], it is reasonable to think that if the OGP exists in the former, it also exists in the latter. Motivated by this, we make the following conjectures.
Conjecture 11. If the overlap gap property exists in a COP with an underlying Erdös–Rényi hypergraph of sufficiently high connectivity, then it also exists when the underlying hypergraph is a regular hypergraph of sufficiently high degree.
While Theorem 9 seems to support this conjecture, we have only proved this asymptotically and at logarithmic depth. It is not entirely clear if the same result applies when the depth is of 𝒪 (n) or for the finite case since the proof of OGP for max-q-XORSAT on an Erdös–Rényi holds for some constant n ∈ ℤ.
Conjecture 12 (Monotonicity of the OGP). For the Max-q-XORSAT problem, the overlap gap property is a monotonically increasing graph property.
We note that the proof of Theorem 9 is much simpler if the conjectures are true as can be seen in Appendix A.
We refer the reader to numerous numerical studies about how the performance of the QAOA is unable to surpass the OGP barrier. For instance, the authors of [6] numerically evaluated the performance of the QAOA on Max-3- XORSAT up to p = 14 and got 0.6623Π3. For the 3-spin glass, the OGP inhibits the AMS algorithm’s performance to get to 0.987Π3 [30]. A study of the QAOA on Max-q-XORSAT problem similarly finds that for n = 18, the QAOA is unable to get close to the 0.987 approximation ratio even at a depth of p = 30 for q = 3 [31].
Instead, we provide some numerical evidence that instances of the OGP can occur in random regular hypergraphs of odd degree. Our numerical simulation proceeds in the following manner. First, we define the problem size n, uniformity q, and degree d, where we implicitly assume that nd is a multiple of q. Then, randomly generate a d-regular q-uniform hypergraph so that the total number of hyperedges |E| = (nd/q). Next, we randomly generate the list J = {–1, +1}|E| for the coupling strength of the hyperedges. Finally, we perform a branch and bound algorithm and record those whose cut-fraction exceeds a certain threshold.
Once we have the list of bit-strings and their corresponding cut-fraction, we have to choose some ϵ > 0 such that the list of bit-strings that are ϵ-optimal solutions is small. By default, we limit the bit-strings that are at least 95% to the optimal solution. Finally, compute the overlap between all such ϵ-optimal bit-string and obtain the overlap spectrum.
We find that on average, when d < q, the OGP is not present. It is only when d is greater than q that instances of problems exhibiting the OGP first appear. The numerical simulations were run on q = 3 and varying n up to 30. A histogram of the distribution of solutions can be found in Figure 1 while the typical evolution of the overlap spectrum can be found in Figure 2.

Plot of a typical cluster of solutions. Y-axis indicates fraction of clauses satisfied and X-axis represents the Hamming distance from the state |1〉⊗n (e.g.the state |–1〉⊗n/2 ⊗ |1⊗n/2 hasavalue 0.5 on the X-axis.

Typical evolution of the overlap spectrum of a n = 30 Max-3-XORSAT instance with constant degree 15. Here, the plot is based on the Hamming distance rather than overlaps. Typically, the optimal solution and the first few optimal solutions do not exhibit the OGP. At some distance ϵ away from the optimal solution, the OGP occurs. Increasing ϵ further includes additional sub-optimal solution until the set of overlaps is dense.
We also ran simulations on the SK model as it is believed, though not yet proven, that the SK model does not exhibit the OGP [3,30]. We find that indeed the SK model does not exhibit the OGP at n = 45 as can be seen in Figure 3.

The overlap spectrum of an instance of the SK model with = 45. No gap was observed and including additional suboptimal solution merely monotonically increases the overlap until the set of overlap is dense.
Being a heuristic algorithm, the limitations and potential of the QAOA have not yet been fully explored. While swapping the order of limits allows us to evaluate the expectation value with a classical computer faster, it also seems to lead to sub-optimal results. This of course is expected and one can instead use the algorithm developed in [6] as a heuristic starting ansatz for (γ, β) to be further optimized for a specific problem.
Currently, the OGP has only been shown to be a limitation on dense models at super-constant depth p ~ 𝒪(log log n) [9]. Given the “dense-from-sparse” reduction performed in [8] that showed equivalence between constant depth QAOA for dense and sparse graphs, perhaps one can extend the equivalence to logarithmic depth similar to how we extended the validity of the algorithm from constant to logarithmic depth.
We note that these results suggests that at logarithmic depth, the performance of the QAOA equals that of AMS’s algorithm for the mean field spin glass [30], a type of Approximate Message Passing (AMP) algorithm. This suggests that if the QAOA is optimized correctly beyond logarithmic depth such as polynomial depth, it should outperform such classical algorithms since the QAOA is known to find exact solutions by reduction to the Quantum Adiabatic Algorithm. It is still an open question to determine at what depth p the QAOA v outperform the algorithm. Furthermore, given the similarity in performance to the AMP algorithm, this also suggests that the conjecture in [6] that the Parisi value for the Sherrington–Kirkpatrick model is obtained under limit swapping might be true as the AMP algorithm achieves the optimal value under the assumption that the OGP does not exists.