Have a personal or library account? Click to login
Selection and Improvement of Product Formulae for Best Performance of Quantum Simulation Cover

Selection and Improvement of Product Formulae for Best Performance of Quantum Simulation

Open Access
|Jan 2025

Full Article

1.
Introduction

The Lie-Trotter product formula is commonly used in quantum algorithms for Hamiltonian simulation, where one can approximate the Hamiltonian evolution of H = A + B in terms of exponentials of A and B when these operators do not commute. For short time, a standard approximation is the second-order symmetric formula S2(t) = eiAt/2eiBteiAt/2, which satisfies eiHt = S2(t) + 𝒪(t3). More generally, the error in an order k formula is 𝒪(tk+1). Longer times are simulated by breaking the evolution into many repetitions of a short time. The number of repetitions needed is reduced with the order, motivating the search for higher-order product formulae. A systematic method to produce arbitrarily high-order formulae is the fractal method [1,2,3,4], which has found applications in Hamiltonian simulation [5]. The first explicit use of product formulae for quantum simulation was given in [6], applying it for quantum evolution under local Hamiltonians. Later work considered the broader class of sparse Hamiltonians [7] and higher orders [5], as well as methods beyond product formulae [8,9,10,11].

Recent work has shown that despite its simplicity, the Lie-Trotter product formula can compete with other Hamiltonian simulation algorithms due to the low error that it achieves in practice [12]. Methods based on linear combinations of unitaries [13,14] or quantum signal processing [15] give complexity logarithmic in the inverse error, but the error is not required to be extremely small, meaning those methods do not provide as large an advantage as might be expected. Product formula error bounds can be further improved by considering particular physical systems [16,17,18] or leveraging randomisation [19,20,21]. Moreover, Trotter formulae are expected to be relevant for both noisy intermediate-scale quantum (NISQ) computation and fault-tolerant computation. It is then of great importance to seek efficient implementations of product formulae as it can have a great impact on the efficiency of Hamiltonian simulation algorithms in practice.

The downside of the fractal methods to generate higher-order formulae is that the number of exponential operators to implement it grows very rapidly. Fractal product formulae are usually assumed in quantum computing, but they can be greatly improved upon. An alternative method by Yoshida [3] can be used to obtain product formulae with a smaller number of exponentials. Similar to fractal formulae, they are given as a product of S2 for different time intervals, but in contrast to the fractal approach there is not an explicit analytic form for the higher-order formulae. Instead one needs to derive and solve a complicated set of simultaneous non-linear polynomial equations.

In classical computing, product formulae are equivalent to symplectic integrators for numerical solution of differential equations. Since the initial work of Yoshida, there have been many results on better integrators of many different orders [22,23,24,25,26,27,28,29,30,31,32]. See Ref. [33] for a comprehensive review of product formulae/integrators in the literature. In particular, Refs. [24] and [25] give what appear to be the optimal 8th order product formulae with 15 and 17 stages (the number of S2 in the product formula). Reference [29] gives improved 8th order product formulae with 19 and 21 stages. For 10th order, Ref. [29] provides solutions with 31, 33, and 35 stages. These solutions from 2005 in Ref. [29] appear to be the best in prior work for 8th and 10th order (see Ref. [33]).

Another approach is that of processed product formulae [31,34,35,36,37,38,39,40]. Instead of the product formula being symmetric, it is of the form PΣP−1 for kernel Σ and processor P . The P and P−1 cancel when using the product of many of these for evolution over a long time, so the cost is dominated by the number of stages in the kernel. Since the kernel has fewer restrictions on it, it can have fewer stages than a normal symmetric product formula. According to Ref. [33], the best 8th order product formula with processing is from 2006 [31].

The objective of this work is to find improved product formulae and compare their performance in a consistent manner. We show that the performance of product formulae is better quantified by the error in the eigenvalues, rather than the spectral-norm error as usually considered in prior work. This is because it is the eigenvalue error that dominates the error for evolution at longer times. We also derive a method to consistently compare the performance of product formulae with different orders and numbers of exponentials.

We use these methods to compare between the performance of our product formulae, as well as to compare prior product formulae in the literature. By our numerical search, we found hundreds of thousands of solutions at 8th order, including many that outperform those previously reported in the literature. Our best 8th order product formula without processing improves over the best prior work by a factor of about 100, and our best 8th order processed product formula improves over the best prior processed product formulae by a factor of nearly 300. Our best product formula without processing has slightly lower error, but our product formula with processing provides better performance due to being shorter. For 10th order, one solution from Ref. [29] provides extremely high accuracy, a factor of nearly 300 times better than any others in prior work. We provide a detailed numerical comparison of product formulae in Section 5.

When comparing product formulae of different orders, it is better to use higher-order formulae for larger values of T /ε, where T is the total evolution time and ε is the required precision. For smaller T /ε it is better to use lower-order formulae, but as it is increased there are threshold values beyond which it is optimal to use higher-order formulae. We derive methods for determining these thresholds and show that our 8th order product formula is best for T /ε from around 107 to 1016, which includes the range of typical values for quantum chemistry applications. This means that the best 8th order product formula we have found in this work will be best suited to real applications.

The upper threshold means T /ε ≳ 1016 would be needed for 10th order product formulae to outperform our 8th order product formula, which is unrealistically large. Although the 10th order solution from Ref. [29] provides high accuracy, it is not competitive for quantum simulation due to its greater length. It is more than twice as long as our processed 8th order formula, so for a similar number of exponentials the 8th order product formula can use a time step half as long, resulting in much better performance for realistic values of the parameters. The lower threshold means T /ε ≲ 107 is needed for 6th order to outperform 8th order. Accurate estimation of this threshold requires testing of the product formulae for larger time steps. It is possible to reduce this threshold by adjusting the 8th order product formula to provide better performance for larger time steps.

The organisation of this work is as follows. In Section 2 we give a pedagogical explanation of the product formulae and the method of solution. First, we define product formulae and introduce the fractal methods for generating higher-order product formulae in Section 2.1, then in Section 2.2 we give a summary of Yoshida's method, then we explain processors for product formulae in Section 2.3, more general types of product formulae in Section 2.4, and the Taylor series method in Section 2.5. In Sections 3.2 and 3.3 we present the results regarding new product formulae. In Section 4 we explain the method for comparing product formulae, and then in Section 5 we give the comparison of the product formulae based on numerical testing. We finish in Section 6 by discussing what the results mean for choosing appropriate product formulae for quantum simulation, as well as searching for better-performing product formulae.

2.
Background

In this section, we review the background and methods for product formulae. Readers who are familiar with the methods may wish to skip to Section 3 where we present new results. See also Ref. [33] for a more complete recent review of this topic. We begin by defining product formulae and the Baker-Campbell-Hausdorff formula, then we introduce fractal methods and Yoshida's method to obtain higher-order formulae, and describe the processed product formulae and the Taylor series method.

2.1.
Product Formulae

It is well known that, for any non-commuting operators X and Y , (1) exp((X+Y)t)=exp(Xt)exp(Yt)+O(t2), \exp ((X + Y)t) = \exp (Xt)\exp (Yt) + {\cal O}({t^2}), where we have absorbed the −i factor used in quantum simulation into X and Y . The above equation demonstrates that the exponential of a sum of two operators is, to first order, equal to the product of the exponential of those operators. The above equation is often referred to as a “first-order product formula.” Higher-order terms can be computed via the Baker-Campbell-Haussdorff (BCH) formula.

Theorem 1 (Baker-Campbell-Haussdorff formula [41])

Let X and Y be any operators such thatX‖ + ‖Y ‖ < ln 2. We have for an operator Z that exp(X) exp(Y) = exp(Z), with (2) Z=n=1(1)n1nr1+s1>0rn+sn>0[Xr1,Ys1,Xrn,Ysn]j=1nri+sii=1nri!si!, Z = \sum\limits_{n = 1}^\infty {{{{( - 1)}^{n - 1}}} \over n}\sum\limits_{{\scriptstyle {r_1} + {s_1} > 0 \atop {\scriptstyle \vdots \atop \scriptstyle {r_n} + {s_n} > 0}} } {{[{X^{{r_1}}},{Y^{{s_1}}}, \cdots {X^{{r_n}}},{Y^{{s_n}}}]} \over {\left( {\sum\nolimits_{j = 1}^n {{r_i} + {s_i}} } \right)\prod\nolimits_{i = 1}^n {{r_i}!{s_i}!} }}, where [Xr1,Ys1,Xrn,Ysn]=[X,[X,[Xr1,[Y,[Y,[Ys1,[X,[X,[Xrn,[Y,[Y,Ysn]]]]. [{X^{{r_1}}},{Y^{{s_1}}}, \cdots {X^{{r_n}}},{Y^{{s_n}}}] = [\underbrace {X,[X, \cdots [X}_{{r_1}},[\underbrace {Y,[Y, \cdots [Y}_{{s_1}},{\kern 1pt} \cdots {\kern 1pt} [\underbrace {X,[X, \cdots [X}_{{r_n}},[\underbrace {Y,[Y, \cdots Y}_{{s_n}}]] \cdots ]].

The standard second-order symmetric product formula is as given in the definition below.

Definition 1 (Symmetric product formula of order two)

Let X and Y be non-commuting operators and let t be a real variable. Define (3) S2(t):=exp12Xtexp(Yt)exp12Xt. {S_2}(t): = \exp \left( {{1 \over 2}Xt} \right)\exp (Yt)\exp \left( {{1 \over 2}Xt} \right).

The operators X and Y used in the definition of S2 should always be clear from context. More generally, when there is a sum of Xj, the product formula is (4) S2(t):=j=1Jexp12Xjtj=J1exp12Xjt. {S_2}(t): = \left[ {\prod\limits_{j = 1}^J \exp \left( {{1 \over 2}{X_j}t} \right)} \right]\left[ {\prod\limits_{j = J}^1 \exp \left( {{1 \over 2}{X_j}t} \right)} \right]. Products are ordered with the starting index on the right and the final one on the left, so for J = 2 terms the expression in the definition is obtained. This form of the product formula can be used for Hamiltonians that are a product of any number of terms. The higher-order product formulae considered in this work are constructed from products of S2, and due to this method of construction these product formulae are suitable for Hamiltonians that are a sum of any number of terms. The corollary below describes the form of the error terms in the symmetric product formula, and also implies that it is second order.

Corollary 1 (Symmetric BCH formula [3])

Let X and Y be any operators such thatX‖ + ‖Y ‖ < ln 2 and let t be a real variable. Define Z such that S2(t) = exp(Z). Then there is a sequence α consisting of linear combinations of ℓ-term commutators in X and Y such that (5) Z==1αt. Z = \sum\limits_{\ell = 1}^\infty {\alpha _\ell }{t^\ell }. Moreover, α ≡ 0 whenever ℓ is even.

Reference [3] also shows that even terms are zero for more general symmetric product formulae. The first three non-zero α terms from above are (6) α1=X+Y, {\alpha _1} = X + Y, (7) α3=112[Y,[Y,X]]124[X,[X,Y]], {\alpha _3} = {1 \over {12}}[Y,[Y,X]] - {1 \over {24}}[X,[X,Y]], (8) α5=75760[X,X,X,X,Y]1720[Y,Y,Y,Y,X]+1360[X,Y,Y,Y,X]+1360[Y,X,X,X,Y]1480[X,X,Y,Y,X]+1120[Y,Y,X,X,Y]. \matrix{ {{\alpha _5}} \hfill & { = {7 \over {5760}}[X,X,X,X,Y] - {1 \over {720}}[Y,Y,Y,Y,X] + {1 \over {360}}[X,Y,Y,Y,X] + {1 \over {360}}[Y,X,X,X,Y]} \hfill \cr {} \hfill & { - {1 \over {480}}[X,X,Y,Y,X] + {1 \over {120}}[Y,Y,X,X,Y].} \hfill \cr } Here the square brackets are used to indicate multicommutator expressions similar to the notation in Theorem 1, for example (9) [Y,Y,X,X,Y][Y,[Y,[X,[X,Y]]]]. [Y,Y,X,X,Y] \equiv [Y,[Y,[X,[X,Y]]]]. Expressions for each α can be derived from two applications of the usual BCH formula.

Definition 2 (Product formula)

Let X and Y be any non-commuting operators. Given a natural number n, a product formula of order n is a pair (c,d) (\vec c,\vec d) with c,d \vec{c},\vec{d}\in {{\mathbb{R}}^{\ell }} and ℓ a natural number such that for all t ∈ ℝ (10) exp((X+Y)t)=j=1exp(cjXt)exp(djYt)+O(tn+1). \exp ((X + Y)t) = \prod\limits_{j = 1}^\ell \exp ({c_j}Xt)\exp ({d_j}Yt) + {\cal O}({t^{n + 1}}). We refer to the number of non-zero coefficients in c,d \left( {\vec c,\vec d} \right) as the length of the product formula.

Hence S2 is a length-3 product formula. The number of exponentials used in the product formula is a crucial measure of its efficiency, and in quantum simulation it is proportional to the required number of gates.

2.1.1.
Fractal Methods for Generating Higher-Order Product Formulae

Here we describe two fractal methods to obtain higher-order product formulae. Starting from the symmetrised product formula in Eq. (3), fractal methods produce product formulae at all even orders. The first fractal (or triple jump) technique to generate a product formula of order k = 2κ is [1,2,3,42,43] (11) S2κ(t)=S2κ2(sκt)S2κ2((12sκ)t)S2κ2(sκt), {S_{2\kappa }}(t) = {S_{2\kappa - 2}}({s_\kappa }t){S_{2\kappa - 2}}((1 - 2{s_\kappa })t){S_{2\kappa - 2}}({s_\kappa }t), where sκ = 1/(2 − 21/(2κ−1)). This method can be used to generate even orders starting at S2, or more generally to increase the order of symmetric product formulae. A drawback to this method is that both sk and 1 − 2sκ are greater than 1, so the coefficients in the higher-order methods are large, causing greater error.

The history of this approach is discussed in detail in a review article by Forest [44]. This form for a fourth-order integrator was originally derived by Ruth about 1984, but not published until 1990 [42]. That work specifically considered a Hamiltonian that is a sum of two parts. The iterative form was given by Creutz and Gocksch in 1989 [1], but citing unpublished work by Campostrini and Rossi for the fourth-order case. That work was published in 1990 [43], and cited Creutz and Gocksch for the iterative form. Suzuki [2] and Yoshida [3] independently derived the iterative form in 1990.

The iterative form given by Creutz and Gocksch in 1989 [1] was the more general expression (12) S˜2κ2(uκt)pS˜2κ2((12puκ)t)S˜2κ2(uκt)p. {\tilde S_{2\kappa - 2}}{({u_\kappa }t)^p}{\tilde S_{2\kappa - 2}}((1 - 2p{u_\kappa })t){\tilde S_{2\kappa - 2}}{({u_\kappa }t)^p}. A case of particular interest is the second fractal method, also derived by Suzuki in 1991 [4] (13) S˜2κ(t)=S˜2κ2(uκt)2S˜2κ2((14uκ)t)S˜2κ2(uκt)2, {\tilde S_{2\kappa }}(t) = {\tilde S_{2\kappa - 2}}{({u_\kappa }t)^2}{\tilde S_{2\kappa - 2}}((1 - 4{u_\kappa })t){\tilde S_{2\kappa - 2}}{({u_\kappa }t)^2}, where uκ = 1/(4 − 41/(2κ−1)). This method has the advantage that both uκ and 1 − 4uκ are less than 1, so the coefficients of higher-order formulae are small resulting in small error. The drawback is that far more exponentials are required. Each iteration uses 5 copies of the lower-order formula, whereas the previous one uses 3 copies. The virtue of these fractal methods is that they allow one to generate arbitrarily high-order product formulae easily, albeit at the expense of a large number of exponentials.

2.1.2.
Exponential Length Scaling of Fractal Methods

For the first fractal method, the total number of exponentials for a given order 2κ = 4, 6, 8, ... in the product formula S2κ is given by (14) 2(J1)3κ1+1. 2(J - 1){3^{\kappa - 1}} + 1. For example J = 2 for X + Y and κ = 1 just corresponds to S2, and the expression gives 3 as expected. For the second fractal method S˜2κ {\tilde S_{2\kappa }} the number of exponentials is (15) 2(J1)5κ1+1. 2(J - 1){5^{\kappa - 1}} + 1. The number of exponential operators in both cases of the fractal method grows very rapidly. Thus one may be interested in alternative method to obtain product formulae with a lower count, such as the method of Yoshida in the next section.

2.2.
Yoshida's Method for Deriving 6th Order Product Formulae

Here, we give a pedagogical explanation of the method to derive product formulae.

2.2.1.
Approach

Rather than using Eq. (11) and Eq. (13), Yoshida uses the general procedure [3] (16) S(m)(t)=j=1mS2(wmj+1t)S2(w0t)j=1mS2(wjt), {S^{(m)}}(t) = \left( {\prod\limits_{j = 1}^m {S_2}({w_{m - j + 1}}t)} \right){S_2}({w_0}t)\left( {\prod\limits_{j = 1}^m {S_2}({w_j}t)} \right), where wj ∈ ℝ for j = 0, 1, ⋯ , m are parameters to be determined. Note the number of exponentials in this product is given by (17) (4m+2)(J1)+1. (4m + 2)(J - 1) + 1. Given this ansatz, the task becomes to find m and wi such that S(m) is an order k product formula. We will illustrate Yoshida's method by deriving the result for 6th order.

2.2.2.
Expand Yoshida Product Using Baker-Campbell-Haussdorf Formula

The method is to expand Eq. (16) using the BCH formula from Theorem 1 recursively as follows. First, note that from Corollary 1, S2(t)=et2XetYet2X=etα1+t3α3 {S_2}(t) = {e^{{t \over 2}X}}{e^{tY}}{e^{{t \over 2}X}} = {e^{t{\alpha _1} + {t^3}{\alpha _3} \cdots }} where α is a commutator of operators as described below.

Corollary 1.

We are for the moment considering sums of two terms X + Y . Define C=i=1t2i1w12i1α2i1 C = \sum\nolimits_{i = 1}^\infty {t^{2i - 1}}w_1^{2i - 1}{\alpha _{2i - 1}} and D=i=1t2i1w02i1α2i1 D = \sum\nolimits_{i = 1}^\infty {t^{2i - 1}}w_0^{2i - 1}{\alpha _{2i - 1}} . Then, (18) S2(w1t)S2(w0t)S2(w1t)=eCeDeC=exptw1α1+t3w13α3+t5w15α5+O(t7)exptw0α1+t3w03α3+tw05α5+O(t7)×exptw1α1+t3w13α3+t5w15α5+O(t7)=expt(2w1+w0)α1+t3(2w13+w03)α3+t5(2w15+w05)α5+16([D,D,C][C,C,D])+O(t7). \matrix{ {{S_2}({w_1}t){S_2}({w_0}t){S_2}({w_1}t)} \hfill & { = {e^C}{e^D}{e^C}} \hfill \cr {} \hfill & { = \exp \left\{ {t{w_1}{\alpha _1} + {t^3}w_1^3{\alpha _3} + {t^5}w_1^5{\alpha _5} + {\cal O}({t^7})} \right\}\exp \left\{ {t{w_0}{\alpha _1} + {t^3}w_0^3{\alpha _3} + tw_0^5{\alpha _5} + {\cal O}({t^7})} \right\}} \hfill \cr {} \hfill & {\quad \times \;\exp \left\{ {t{w_1}{\alpha _1} + {t^3}w_1^3{\alpha _3} + {t^5}w_1^5{\alpha _5} + {\cal O}({t^7})} \right\}} \hfill \cr {} \hfill & { = \exp \left\{ {t(2{w_1} + {w_0}){\alpha _1} + {t^3}(2w_1^3 + w_0^3){\alpha _3} + {t^5}(2w_1^5 + w_0^5){\alpha _5} + {1 \over 6}([D,D,C] - [C,C,D]) + {\cal O}({t^7})} \right\}.} \hfill \cr } A simple computation shows (19) [D,D,C][C,C,D]=t5(w02w13w12w03+w14w0w04w1)[α1,α1,α3]+O(t7). [D,D,C] - [C,C,D] = {t^5}(w_0^2w_1^3 - w_1^2w_0^3 + w_1^4{w_0} - w_0^4{w_1})[{\alpha _1},{\alpha _1},{\alpha _3}] + {\cal O}({t^7}). Define β5 = [α1, α1, α3] so (20) S2(w1t)S2(w0τ)S2(w1t)=expt(2w1+w0)α1+t3(2w13+w03)α3+t5(2w15+w05)α5+t516(w02w13w12w03+w0w14w04w1)β5+O(t7). \matrix{ {{S_2}({w_1}t){S_2}({w_0}\tau ){S_2}({w_1}t)} \hfill & { = \exp \left\{ {t(2{w_1} + {w_0}){\alpha _1} + {t^3}(2w_1^3 + w_0^3){\alpha _3} + {t^5}(2w_1^5 + w_0^5){\alpha _5}} \right.} \hfill \cr {} \hfill & {\quad \left. { + \;{t^5}{1 \over 6}(w_0^2w_1^3 - w_1^2w_0^3 + {w_0}w_1^4 - w_0^4{w_1}){\beta _5} + {\cal O}({t^7})} \right\}.} \hfill \cr } By an induction argument Yoshida shows that (21) S(m)(t)=exptA1,mα1+t3A3,mα3+t5(A5,mα5+B5,mβ5)+O(t7), {S^{(m)}}(t) = \exp \left\{ {t{A_{1,m}}{\alpha _1} + {t^3}{A_{3,m}}{\alpha _3} + {t^5}({A_{5,m}}{\alpha _5} + {B_{5,m}}{\beta _5}) + {\cal O}({t^7})} \right\}, where Aj,m and B5,m are polynomials on the variables w0, ... , wm.

The case m = 0 is just the symmetric BCH formula, so it is clear that Eq. (21) holds with (22) A1,0=w0,A3,0=w03,A5,0=w05,B5,0=0. \matrix{ {{A_{1,0}} = {w_0},} \hfill \cr {{A_{3,0}} = w_0^3,} \hfill \cr {{A_{5,0}} = w_0^5,} \hfill \cr {{B_{5,0}} = 0.} \hfill \cr } To prove Eq. (21) for m > 0, one needs to show that the exponential is of the form with operator α1 for first order in t, operator α3 for third order in t, and operators α5 and β5 for fifth order in t. This result may be shown using (23) S(m+1)(t)=S2(wm+1t)S(m)(t)S2(wm+1t)=exptwm+1α1+t3wm+13α3+t5wm+15α5+O(t7)×exptA1,mα1+t3A3,mα3+t5(A5,mα3+B5,mβ5)+O(t7)×exptwm+1α1+t3wm+13α3+t5wm+15α5+O(t7)=exp2twm+1α1+tA1,mα1+2t3wm+13α3+t3A3,mα3+2t5wm+15α5+t5A5,mα5+t5B5,mβ5+16t5(A1,m2wm+13A1,mA3,mwm+1wm+12A3,m+wm+14A1,m)β5+O(t7). \matrix{ {{S^{(m + 1)}}(t)} \hfill & { = {S_2}({w_{m + 1}}t){S^{(m)}}(t){S_2}({w_{m + 1}}t)} \hfill \cr {} \hfill & { = \exp \left\{ {t{w_{m + 1}}{\alpha _1} + {t^3}w_{m + 1}^3{\alpha _3} + {t^5}w_{m + 1}^5{\alpha _5} + {\cal O}({t^7})} \right\}} \hfill \cr {} \hfill & {\quad \times \;\exp \left\{ {t{A_{1,m}}{\alpha _1} + {t^3}{A_{3,m}}{\alpha _3} + {t^5}({A_{5,m}}{\alpha _3} + {B_{5,m}}{\beta _5}) + {\cal O}({t^7})} \right\}} \hfill \cr {} \hfill & {\quad \times \;\exp \left\{ {t{w_{m + 1}}{\alpha _1} + {t^3}w_{m + 1}^3{\alpha _3} + {t^5}w_{m + 1}^5{\alpha _5} + {\cal O}({t^7})} \right\}} \hfill \cr {} \hfill & { = \exp \left\{ {2t{w_{m + 1}}{\alpha _1} + t{A_{1,m}}{\alpha _1} + 2{t^3}w_{m + 1}^3{\alpha _3} + {t^3}{A_{3,m}}{\alpha _3} + 2{t^5}w_{m + 1}^5{\alpha _5} + {t^5}{A_{5,m}}{\alpha _5} + {t^5}{B_{5,m}}{\beta _5}} \right.} \hfill \cr {} \hfill & {\quad \left. { + \;{1 \over 6}{t^5}(A_{1,m}^2w_{m + 1}^3 - {A_{1,m}}{A_{3,m}}{w_{m + 1}} - w_{m + 1}^2{A_{3,m}} + w_{m + 1}^4{A_{1,m}}){\beta _5} + {\cal O}({t^7})} \right\}.} \hfill \cr } Hence, if the product formula can be expressed as in the form Eq. (21) for S(m)(t), it can again be expressed in this form for S(m+1)(t), establishing it for all m ≥ 0 by induction. This expression also shows that the scalar coefficients can be determined from the formulae (24) A1,m+1=2wm+1+A1,m,A3,m+1=2wm+13+A3,m,A5,m+1=2wm+15+A5,m,B5,m+1=B5,m+16(A1,m2wm+13A1,mA3,mwm+1wm+12A3,m+wm+14A1,m). \matrix{ {{A_{1,m + 1}} = 2{w_{m + 1}} + {A_{1,m}},} \hfill \cr {{A_{3,m + 1}} = 2w_{m + 1}^3 + {A_{3,m}},} \hfill \cr {{A_{5,m + 1}} = 2w_{m + 1}^5 + {A_{5,m}},} \hfill \cr {{B_{5,m + 1}} = {B_{5,m}} + {1 \over 6}(A_{1,m}^2w_{m + 1}^3 - {A_{1,m}}{A_{3,m}}{w_{m + 1}} - w_{m + 1}^2{A_{3,m}} + w_{m + 1}^4{A_{1,m}}).} \hfill \cr } See Appendix A for an explanation of how to extend this approach to 10th order.

2.2.3.
Constraints to Satisfy in Order to Derive 6th Order Formula

To derive a 6th order formula, the lower-order terms in the exponential in Eq. (21) must be zero (see also Eq. (5.16) of [3]), which gives the four conditions (25) A1,m=1,A3,m=0,A5,m=0,B5,m=0. {A_{1,m}} = 1,\quad {A_{3,m}} = 0,\quad {A_{5,m}} = 0,\quad {B_{5,m}} = 0. For m = 3 there are four unknowns w0 to w3, and it can be expected there are solutions because there are the same number of equations as unknowns. In practice A1,m = 1 is satisfied by taking w0 = 1 − 2 ∑j wj, so there are then three equations for three unknowns w1, w2, w3. Whereas it is possible to solve the equations using the Newton-Raphson method, the expression for the appropriate Jacobian matrix is complicated, so Yoshida instead uses the Brent method. Yoshida produced three m = 3 solutions in this way, and states “It seems that there is no other solution.” We have performed an extensive search and also found no more solutions.

The product formulae obtained through the Yoshida method also work for exponentials of sums of more operators. The S2 product formula can again be used as the building block for the product formula, and we can write (26) S2(t)=exp=0α˜t, {S_2}(t) = \exp \left( {\sum\limits_{\ell = 0}^\infty {{\tilde \alpha }_\ell }{t^\ell }} \right), where α˜ {\tilde \alpha _\ell } are now order- multicommutator expressions on the J terms. The reasoning for finding the product formula is entirely based on the construction with α, but without taking advantage of its particular form, so exactly the same reasoning holds for α˜ {\tilde \alpha _\ell } . This immediately implies that the higher-order product formulae work in general. This is an advantage of constructing product formulae as products of S2.

2.3.
Processed Product Formulae

Another technique to obtain higher-order product formulae is that of processing [31,34,35,36,37,38,39]. In this technique a product formula Sk of order k is generated by the composition of a kernel Σ conjugated by a processor operator P as (27) Sk=PΣP1. {S_k} = P\Sigma {P^{ - 1}}. The advantage of this method is that usually Σ requires fewer stages than other methods, and due to the construction, we have that Skn=PΣnP1 S_k^n = P{\Sigma ^n}{P^{ - 1}} . This means that the cost of using the product formula is effectively that of the kernel in the usual application where a long time evolution is partitioned into many repetitions of the product formula for short times.

Typically, one chooses a kernel Σ as a product formula that has a smaller order than k, but conjugating by processors gives an order k integrator. Some of the best-performing kernels are given in Ref. [31], and recently new kernels were reported in [40]. The advantage of this method is that the number of non-linear equations required to be solved to find new product formulae is reduced. One could use a shorter minimum-length product formula, but best performance is provided by increasing the length of a product formula beyond the minimum. The more the number of free parameters exceeds the number of equations, the more freedom there is to find product formulae with low error. Reducing the number of equations by solving for the kernel enables reduced error for a given length.

To be more specific, Ref. [31] gives a set of conditions for the kernels and processors in Table 4 of that work. Reference [31] uses the notation Yj rather than αj, so S2(t) = etY1+t3Y3. It then uses the notation Ej,i for multicommutator expressions, with Ej,1 = Yjαj and for example (28) E5,2=Y1,Y1,Y3β5, {E_{5,2}} = \left[ {{Y_1},{Y_1},{Y_3}} \right] \equiv {\beta _5}{\kern 1pt} , (29) E7,2=Y3,Y1,Y3γ7, {E_{7,2}} = \left[ {{Y_3},{Y_1},{Y_3}} \right] \equiv - {\gamma _7}{\kern 1pt} , where γ7 is given in Appendix A. In Ref. [31] the quantities fj,i are the coefficients of Ej,i, so the equivalent of Eq. (21) in that notation is (30) S(m)(t)=exptf1,1E1,1+t3f3,1E3,1+t5(f5,1E5,1+f5,2E5,2)+O(t7). {S^{(m)}}(t) = \exp \left\{ {t{f_{1,1}}{E_{1,1}} + {t^3}{f_{3,1}}{E_{3,1}} + {t^5}({f_{5,1}}{E_{5,1}} + {f_{5,2}}{E_{5,2}}) + {\cal O}({t^7})} \right\}{\kern 1pt} .

To obtain the conditions for a 6th order kernel from Ref. [31], one should use the conditions for fj,i up to q = 5 in Table 4 of that work, which give f1,1 = 1, f3,1 = 0, f5,1 = 0, which in the above notation are equivalent to (31) A1,m=1,A3,m=0,A5,m=0. {A_{1,m}} = 1,\quad {A_{3,m}} = 0,\quad {A_{5,m}} = 0. The conditions for the kernel are the same as for the complete product formula, but missing B5,m = 0. That is a general feature of the kernel order conditions; they are a subset of the conditions for the complete product formula, enabling shorter kernels, or more flexibility to choose kernels of the same length but lower error.

For 4th order there are the same number of conditions, so the kernel is a valid product formula of that order. For the processors, Ref. [31] uses the notation pj,i for the coefficients of Ej,i in the expansion of P . These need to be determined from the BCH formula rather than symmetric BCH, and instead of only odd-order terms there need to be only even-order terms. Some symmetries mean that low-order odd terms automatically cancel, but higher-order ones need to be made zero by the choice of P . Table 4 in Ref. [31] gives the conditions on the even-order terms. The first processor condition depending on fj,i is for q = 5, which means that for 6th order and higher the processors are dependent on the kernel. The 4th order kernel is already 4th order, and the processors will yield another 4th order product formula, but can be chosen to reduce the error (which arises from higher-order terms). See Appendix B for more detail on the method of determining processors.

2.4.
More General Product Formulae

The form of product formulae constructed from products of S2 can be generalised in a number of ways. First, one may choose different weightings in the two parts of Eq. (4), so one has (32) S(t1,t2)=j=1Jexp12Xjt1j=J1exp12Xjt2. S({t_1},{t_2}) = \left[ {\prod\limits_{j = 1}^J \exp \left( {{1 \over 2}{X_j}{t_1}} \right)} \right]\left[ {\prod\limits_{j = J}^1 \exp \left( {{1 \over 2}{X_j}{t_2}} \right)} \right]. That is, one still alternates between the order of applying the exponentials, but allows the times to be different. This form is no longer symmetric, but there are twice as many parameters for a given length. In the literature on symplectic integrators, the notation for product formulae of this type is [33] (33) χα2shχα2s1h*χα2hχα1h*, {\chi _{{\alpha _{2s}}h}} \circ \chi _{{\alpha _{2s - 1}}h}^* \circ \cdots \circ {\chi _{{\alpha _2}h}} \circ \chi _{{\alpha _1}h}^*{\kern 1pt} , where the asterisk indicates that the exponentials are performed in reverse order (rather than conjugation).

In the case where the Hamiltonian is a sum of only two parts, then a product formula may be written in the form (34) ewmXtewm1Ytew1Xtew0Ytew1Xtewm1YtewmXt, {e^{{w_m}Xt}}{e^{{w_{m - 1}}Yt}} \cdots {e^{{w_1}Xt}}{e^{{w_0}Yt}}{e^{{w_1}Xt}} \cdots {e^{{w_{m - 1}}Yt}}{e^{{w_m}Xt}}{\kern 1pt} , if m is odd, or ending in an exponential of Y if m is even. Here we have written the product formula as symmetric, but one can also consider product formulae without that constraint. Again there are twice as many parameters as for a product formula constructed as a product of S2. Product formulae of this type are often written as [33] (35) φas+1h[1]φbs+1h[2]φash[1]φa2h[1]φb1h[2]φa1h[1], \varphi _{{a_{s + 1}}h}^{[1]} \circ \varphi _{{b_{s + 1}}h}^{[2]} \circ \varphi _{{a_s}h}^{[1]} \circ \cdots \circ \varphi _{{a_2}h}^{[1]} \circ \varphi _{{b_1}h}^{[2]} \circ \varphi _{{a_1}h}^{[1]}{\kern 1pt} , where φ[1] and φ[2] are indicating the two different exponentials.

In the case where the Hamiltonian is a sum of two terms, the product of S(t1, t2) gives a form as in Eq. (34), and conversely the product formula as in Eq. (34) can be expressed as a product of S(t1, t2). In Section 5 we give results for testing some product formulae of this type, and find that they give the best results for 4th and 6th order. This method is mostly considered only up to 6th order because for higher orders the number of order conditions becomes significantly larger (see Table I of Ref. [31]).

Another class of product formulae is those constructed from products of fourth-order product formulae S4. Some product formulae have been constructed in that way; for example, see Ref. [31]. We have also tested those (using the fourth-order fractal product formulae), but do not include them in the comparison in Section 5. We found that, although they reduced the error, the much larger number of exponentials in the product meant that they were not competitive. Part of the reason is that the best gains are obtained when there are significantly more free parameters to adjust than the number of equations to satisfy. This is the reason to use product formulae with more than the minimum length. Constructing a product formula from S4 increases the length without the added advantage of more free parameters to adjust.

Another option is to construct product formulae from a different second-order product formula. For example, a more accurate second-order method is provided in Ref. [24]. In our testing, we find that yields slightly more accurate product formulae, but with twice the length, making them uncompetitive with product formulae constructed from the standard second-order symmetric product formula S2.

2.5.
Solution Using Taylor Expansion

Another solution method is based on computing the Taylor expansion of both the exact exponential and its product formula approximation with given wj. This method is not normally used because it is much more computationally intensive than the symmetric BCH approach. Here, we describe how to effectively code this method for numerical solution. The Taylor expansion is performed on both sides up to the desired order of approximation for the integrator. By imposing equality on terms of the same order one can obtain a series of equations for wj which can be solved. For higher orders, a large number of simultaneous equations are obtained, so we need an automated way of generating them.

To make precise the problem we are solving, denote as 𝒯k[⋅] the map giving the Taylor expansion in t around 0, truncated at order k, so (36) Tk[et(X+Y)]=p=0ktpp!(X+Y)p=p=0ktpp!r1,,rp=01Xr1Y1r1XrpY1rp. \matrix{ {{{\cal T}_k}[{e^{t(X + Y)}}]} \hfill & { = \sum\limits_{p = 0}^k {{{t^p}} \over {p!}}{{(X + Y)}^p}} \hfill \cr {} \hfill & { = \sum\limits_{p = 0}^k {{{t^p}} \over {p!}}\sum\limits_{{r_1}, \cdots ,{r_p} = 0}^1 {X^{{r_1}}}{Y^{1 - {r_1}}} \cdots {X^{{r_p}}}{Y^{1 - {r_p}}}.} \hfill \cr } We consider a sum of two operators X + Y , but this approach for solving for wj will also be sufficient to provide product formulae for sums of arbitrary numbers of terms. That is because the solutions must also be solutions of the equations derived using Yoshida's method, and as explained above those equations will be the same when considering sums of arbitrary numbers of terms. Now consider the ansatz operator of Yoshida from Eq. (16) (37) S(m)(t,w1,,wm)=etwmX/2etwmYet(wm+wm1)X/2etwm1Yet(wm1+wm2)X/2etw0Yet(w1+w0)X/2etw1YetwmX/2=etc1Xetc2Yetc4m+3X, \matrix{ {{S^{(m)}}(t,{w_1}, \cdots ,{w_m})} \hfill & { = {e^{t{w_m}X/2}}{e^{t{w_m}Y}}{e^{t({w_m} + {w_{m - 1}})X/2}}{e^{t{w_{m - 1}}Y}}{e^{t({w_{m - 1}} + {w_{m - 2}})X/2}}} \hfill \cr {} \hfill & {\quad \cdots {e^{t{w_0}Y}}{e^{t({w_1} + {w_0})X/2}}{e^{t{w_1}Y}} \cdots {e^{t{w_m}X/2}}} \hfill \cr {} \hfill & { = {e^{t{c_1}X}}{e^{t{c_2}Y}} \cdots {e^{t{c_{4m + 3}}X}},} \hfill \cr } where in the last line we have defined the constants c1 = wm/2, c2 = wm, c3 = (wm + wm−1)/2, ⋯ c4m+3 = wm/2. Taylor expanding this ansatz up order k gives (38) Tk[S(m)(t,w1,,wm)]=r1,,r4m+3=0r1++r4m+3kktr1++r4m+3r1!r4m+3!c1r1c4m+3r4m+3Xr1Yr2Xrk1Yr4m+3. {{\cal T}_k}[{S^{(m)}}(t,{w_1}, \ldots ,{w_m})] = \sum\limits_{{\scriptstyle {r_1}, \cdots ,{r_{4m + 3}} = 0 \atop \scriptstyle {r_1} + \ldots + {r_{4m + 3}} \le k} }^k {{{t^{{r_1} + \ldots + {r_{4m + 3}}}}} \over {{r_1}! \cdots {r_{4m + 3}}!}}c_1^{{r_1}} \ldots c_{4m + 3}^{{r_{4m + 3}}}{X^{{r_1}}}{Y^{{r_2}}} \ldots {X^{{r_{k - 1}}}}{Y^{{r_{4m + 3}}}}. Note that the total number of exponentials in Yoshida's ansatz is 4m + 3.

The product formula obtained from the solution procedure needs to be independent of the choice of X and Y , so we need to match the coefficients for each sequence of products of X and Y . Because X and Y are non-commuting, we need to record coefficients for each ordered sequence. To do this in an automated way we construct a data structure to store the coefficients.

Given operators of the form ecX=I+cX+c22!X2+c33!X3+ {e^{cX}} = I + cX + {{{c^2}} \over {2!}}{X^2} + {{{c^3}} \over {3!}}{X^3} + \cdots and edY=I+dY+d22!Y2+d33!Y3+ {e^{dY}} = I + dY + {{{d^2}} \over {2!}}{Y^2} + {{{d^3}} \over {3!}}{Y^3} + \cdots with c, d scalar coefficients and X, Y general operators, we describe this Taylor expansion up to order k using an array encoding. First, write monomials composed of X and Y operators in lexicographical order and note that these operators can be mapped to binary numbers: (39) IXYXXXYYXYYXXXXXY1101110010111011110001001123456789 \matrix{ I & X & Y & {XX} & {XY} & {YX} & {YY} & {XXX} & {XXY} & \cdots \cr 1 & {10} & {11} & {100} & {101} & {110} & {111} & {1000} & {1001} & \cdots \cr 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 & \cdots \cr }

To construct a bit string, we map each X to 0 and each Y to 1, then place a 1 on the left to flag the length of the string, as shown in the second line of Eq. (39). Then, to obtain the operator products, simply remove the leading 1 and then map 0 to X and 1 to Y . The empty string corresponds to the identity I. Now, to store the coefficients in a sum of products of X and Y , convert each product to a binary integer as above, then store the coefficient in the corresponding location in a vector. This approach enables products of Taylor expansions of exponentials to be rapidly calculated.

By way of illustration, consider the polynomial 10I + 3X + 2Y + 2X2 + Y X. This operator would be stored in an array as [10, 3, 2, 2, 0, 1, 0, ⋯]. In this way, any polynomial of X and Y can be efficiently stored in a vector. We denote this vector storing the coefficients of operators of order up to k as veck[p(X, Y)], where k denotes that the vector will only store the coefficients of the corresponding operators up to order k (so a vector of length 2k+1 − 1) and p(X, Y) is the polynomial in terms of X and Y.

3.
Product Formula Search

First we describe the numerical methods, then summarise solutions we found at 8th and 10th order.

3.1.
Numerical methods

The size of the product formula we consider as in Eq. (16) is governed by the value of m, so the number of S2 in the product is M = 2m + 1. The number of free parameters is m, and it needs to be a least a large as the number of independent equations in order for there to be a solution. Yoshida [3] finds that m = 7 is minimal for 8th order, and m = 15 is minimal for 10th order [29]. Choosing the minimal m typically yields product formulae with large error, and for that reason we focus on larger values of m.

For the numerical solution of the simultaneous non-linear equations, we found that Matlab's fsolve (using the Levenberg-Marquardt algorithm) provided rapid results using the vector of errors. The method was to repeatedly solve from random starting vectors w \vec w generated according to the normal distribution. The best solutions were those with smaller values for the coefficients; for much of the calculations we selected standard deviations of 0.6 for 8th order and 0.9 for 10th order. We filtered a large number of solutions by numerically checking the error for example Hamiltonians, and selecting those that provided the best performance. We then perform tests for larger numbers of samples, to more accurately select the best performing product formulae. We also further refined the solutions to give the results to extended precision, which enables testing of the error with smaller values of t. We were also able to further reduce the error by using an alternative solution method using the Taylor series.

The Taylor series approach could be used to solve for product formulae, but is more computationally intensive because it produces an error vector that is exponentially large in the order. However, it is very effective at further improving the performance of solutions that have already been found. If a solution is of 8th order (for example), then we can consider the error using the Taylor expansion up to and including the 9th order. This will give non-zero error for the 8th order product formula, so minimising the error gives a new product formula. That can be used as a starting point to again solve for an 8th order formula, and often it is found that the product formula adjusted in that way has reduced error.

3.2.
Improved 8th Order

Following these methods, we have searched for new product formulae and found thousands of product formulae of 8th order. Once we have obtained the potential solutions, we generate random Hamiltonians and compute the product formula errors for a chosen short time. The error for 8th order product formulae is given by δ(t) ∝ t9, but the constant will depend on the measure of error used. One could use the spectral-norm error for a single time step, but a more appropriate measure of the performance of the product formulae is the accuracy of the eigenvalues. In the following we use χ for the constant factor based on spectral-norm error and ζ for the constant based on eigenvalue error, with χ = δ/t9 or ζ = δ/t9.

For each product formula, we compute an average constant factor error; this average corresponds to the geometric mean of the constant factors computed for each random Hamiltonian. See Section 5 for more details of the numerical methods, and the comprehensive comparison of product formulae. This method of comparing the performance of product formulae through the estimation of the constant factor in the error has been used before (see for example [32]) and is considered a good approximation to the performance of the product formula in practice.

The case m = 7 yields product formulae of minimal length because the number of equations is equal to the number of unknowns. We have found over 600 solutions, and the search now finds almost only repeated solutions and very few new solutions. This indicates that we have found nearly all solutions, but it is also possible that there are many more solutions with large values of w \vec w . Indeed, the most recent new solutions we found have significantly larger values of w \vec w . We find that large values of w \vec w correspond to worse product formulae with larger error. Therefore, even if there are many more solutions with large w \vec w , they likely will not give improved performance over those we have already found.

This search for m = 7 (15 stages) verified that the best-performing product formula corresponds to that previously reported in Refs. [24] and [25]. Similarly, a numerical search with m = 8 (17 stages) yielded the best-performing product formula close to solutions s17odr8a and s17odr8b given in Ref. [25]. It is not exactly the same, but the small difference is likely due to the system of equations being underdetermined. The extensive nature of our search indicates that these product formulae reported in Refs. [24,25] are optimal for 8th order product formulae with m = 7 and m = 8.

To provide improved performance over prior work, we extended the search to m = 10 (21 stages), and found over 100,000 solutions. This size of 8th order product formula was previously studied in Ref. [29], but we find solutions with significantly lower error. Our best solutions for spectral-norm and eigenvalue error are given in Table 1. The solution selected for minimum spectral-norm error has a constant of χ = 5.8 × 10−8. Its constant factor for eigenvalue error ζ = 7.0 × 10−9 is significantly smaller, but the best performance is obtained by selecting for minimum eigenvalue error. The second solution given in Table 1 has the significantly smaller constant factor in the error ζ = 5.4 × 10−10.

Table 1.

Our best-performing 8th order solutions when setting m = 10.

Best 8th order for spectral-norm errorBest 8th order for eigenvalue error

w10.593580604008506258635140592652240.10467636532245895252340732579853
w2−0.46916012347004197296293264921328−0.57896999331780988041471955125778
w30.27435664258984679072282428781460.57503350160061785946141563279891
w40.171938794846567730599190749653770.12231011868707029786561397542663
w50.234398744825413844154305787475410.27793149999039524816733903301747
w6−0.48616424480326193899617759997914−0.37349605088056728482635987352576
w70.496173673881146603548717570449060.11575566589480463220616543972403
w8−0.326602189484391301145018153238140.1464645610975800618712569230326
w90.23271679349369857679445410270557−0.39443578322284085764474498594073
w100.0982495574147085332734719061806430.44370228726021218923197141183196

These constant factors all improve over those provided by product formulae in prior work. Our solution with smallest eigenvalue error is well over 100 times more accurate than the m = 10 product formula in Ref. [29], and about 100 times more accurate than the m = 9 product formula in that work. See Section 5 for the detailed numerical testing of the product formulae.

A further improvement in product formulae can be obtained by using processing. We have performed a search for high-accuracy 8th order product formulae using this method with m = 8, and found one particular example that provides excellent performance; see Table 2. This kernel provides a constant of ζ = 8.1 × 10−10, only slightly larger than the best length m = 10 product formula, while providing better performance due to the shorter length. This error is nearly 300 times smaller than provided by 8th order processed product formulae in prior work [31]. Therefore, we have provided a sequence of three solutions that all improve over prior work, with successively better and better performance.

Table 2.

Our best-performing 8th order kernel and processor when setting m = 8. The value of γ10 for the processor is chosen such that the sum is zero.

Best 8th order kernelProcessor for kernel

w10.21784176681731006074681969186513−0.44324901019570126590495430949294γ1
w20.19470177060539032240224563429070.25459857192003772850622377066944γ2
w30.18372413281145589944261642180363−0.73862036266779261573694538099739γ3
w4−0.37307499512657736825709230652023−0.00024139614958652134370419495289618γ4
w50.157576442575691463730336620604610.73873460354125365739379753874964γ5
w6−0.33342207567391682979227850551172−0.20285971152536085519251666906017γ6
w70.517886496829879242817871422268030.44989521689676869571827637424046γ7
w80.214564754998977669863812196217610.29538398007876871184026747505657γ8
−0.3364996155865700091428329802017γ9

To provide an accurate result in terms of the spectral norm (rather than just the eigenvalues) it is necessary to solve for the processor. For the kernel there are many solutions for the processor, but we list the one with the best performance of the solutions we have found. This one was found by solving for many (hundreds of thousands) of processors, picking the one with the best performance, and further optimising. Moreover, using the processor the constant for the spectral-norm error is χ = 5.3 × 10−8. That is even better than the best length m = 10 (nonprocessed) formula above.

3.3.
Finding 10th Order

We have also used our solution procedure to find new 10th order product formulae. For Yoshida's method applied to 10th order, there are 15 independent equations to be solved (see Appendix A). We performed searches for solutions with m = 15 (the minimal number) to m = 18. Again the larger values of m give the flexibility to adjust the solution to reduce the error. As in Section 3.2, we compare the performance of product formulae of 10th order by computing the constant factors χ and ζ for random Hamiltonians.

Out of our solutions, the best eigenvalue performance is given by a m = 17 solution (see Table 3), which yields ζ = 6.1 × 10−11. This formula is slightly outperformed by one product formula in Ref. [29], but for the purpose of quantum computing we will show that 8th order provides better performance than 10th order. For this reason we did not perform further searches for processed 10th order product formulae or proceed to any higher orders.

Table 3.

The 10th order solution with lowest eigenvalue error in our numerical search.

10th order solution with m = 17

w1−0.28371232689144296279654621726493
w20.046779504778147381605331000278223
w30.36845892382797770619657504217539
w40.19186204094674514739760408197461
w5−0.53123134392680669702873064192428
w6−0.0081253242720827266680816105600661
w7−0.16389450414378567860032917538393
w80.18514766119291405032528647881
w90.5383584694754681989174668806505
w10−0.30583981835573485697292316732177
w110.43199935609523301289295473774488
w120.1510502301631786853020124612813
w13−0.35051099204829676098801520498121
w140.1032971125844291674511513007661
w150.15043936943817152697371946806229
w160.12118469498650736511410491586846
w170.10437742779547826358296681557444

In the search for 10th order product formulae, unlike in the case of 8th order, we find that almost all new solutions found are different from those found before. That indicates there is an extremely large number of solutions, and we have only found a very small proportion of them. There is still the potential for better-performing solutions to be found in this order, but we expect that the longer length will make them unsuitable for quantum computing.

4.
Methods for Comparison of Product Formulae
4.1.
Same Order Comparison

A fair comparison between product formulae of different lengths may be made in the following way. An order k product formula for time t will have an error δ = ζtk+1 (using ζ for eigenvalue error, or δ = χtk+1 for the case of spectral-norm error). Let T be the total evolution time for a product formula of order k, and ε be the maximum allowable error. Subdivide the evolution time T into r subintervals, so t = T /r is the length of each time subinterval. We thus have ζ(T /r)k+1ε/r, which gives (40) rζTε1/kT. r \approx {\left( {{{\zeta T} \over \epsilon }} \right)^{1/k}}T. As explained above (see Eq. (17)), the number of exponentials in the product is (4m + 2)(J − 1) + 1. When applying products of these product formulae, two exponentials can be combined, so the effective number for each is (4m + 2)(J − 1). As a result, the total number of exponentials can be given as approximately (41) 2(J1)MζTε1/kT, 2(J - 1)M{\left( {{{\zeta T} \over \epsilon }} \right)^{1/k}}T, where M = 2m + 1. The only quantities that vary between different product formulae of the same order are M and ζ, so to compare product formulae, we need only compare the values of 1/k; the one with the smaller value is the more efficient product formula. Similarly, if we consider spectral-norm error then we should compare 1/k between the formulae.

As another motivation for this form, consider the case where the task is to estimate an eigenvalue of the Hamiltonian. Then one would use an ancilla register to control the time of evolution, and use phase estimation. If we are controlling between the forward and reverse time evolution as per Ref. [45], then the (root-mean square) error in estimating the phase of eiHT /r is given by (42) εphase=π2r. {\epsilon _{{\rm{phase}}}} = {\pi \over {2r}}. For the error in the simulation of eiHT /r to be no more than this phase estimation error, we should have (43) π2rζ(T/r)k+1, {\pi \over {2r}} \approx \zeta {(T/r)^{k + 1}}, or (44) r2ζTπ1/kT. r \approx {\left( {{{2\zeta T} \over \pi }} \right)^{1/k}}T. If εH is the allowed error in estimation of the eigenvalue of H, then it would correspond to π/(2T). Then, in terms of εH the choice of r is (45) rζεH1/kπ2εH. r \approx {\left( {{\zeta \over {{\epsilon _H}}}} \right)^{1/k}}{\pi \over {2{\epsilon _H}}}{\kern 1pt} . That would correspond to a total number of exponentials (46) π(J1)MζεH1/kεH. \pi (J - 1)M{\left( {{\zeta \over {{\epsilon _H}}}} \right)^{1/k}}{\epsilon _H}. Again the constant factor dependent on the product formula is 1/k, so it is this expression that should be compared between formulae. In terms of the overall complexity, this is similar to simulation of the evolution for time T = π/(2εH) with error ε = π/2.

4.2.
Thresholds for Different Order

If we wish to compare product formulae of different order, then we need to take account of the values of T and ε. Assume we have two product formulae of order k1 and k2, with corresponding constants ζ1, ζ2. Given T and ε, when the two product formulae use the same number of exponentials we have M1r1 = M2r2, thus (47) M1ζ1Tε1/k1T=M2ζ2Tε1/k2T {M_1}{\left( {{{{\zeta _1}T} \over \epsilon }} \right)^{1/{k_1}}}T = {M_2}{\left( {{{{\zeta _2}T} \over \epsilon }} \right)^{1/{k_2}}}T (48) Tε=M2ζ21/k2M1ζ11/k111k11k2. \Rightarrow \;\;\;{T \over \epsilon } = {\left( {{{{M_2}\zeta _2^{1/{k_2}}} \over {{M_1}\zeta _1^{1/{k_1}}}}} \right)^{{1 \over {{1 \over {{k_1}}} - {1 \over {{k_2}}}}}}}. This gives the threshold beyond which the higher-order product formula should be used for improved performance. Again, when considering spectral-norm error, ζ would be replaced with χ. In the case of eigenvalue estimation, the threshold is for 1/εH rather than T /ε.

A limitation of this analysis is that it is assumed the time step (ε/(ζT))1/k is small enough for the scaling law for the error to hold. To adjust the threshold for cases where the time step is large, we can consider a more general functional dependence of the error on the time interval as f(t). Then we require for time T and error ε that (49) f(t)×T/t=ε. f(t) \times T/t = \epsilon {\kern 1pt} . The cost is then proportional to M × r = MT /t. For the threshold between two product formulae, we require (50) M1T/t1=M2T/t2. {M_1}T/{t_1} = {M_2}T/{t_2}{\kern 1pt} . That implies t2 = t1M2/M1. For the total error to be equal to ε in each case, we then require (51) f1(t1)×T/t1=f2(t2)×T/t2. {f_1}({t_1}) \times T/{t_1} = {f_2}({t_2}) \times T/{t_2}{\kern 1pt} . That can be rearranged to give (52) f1(t1)=f2(t1M2/M1)×M1/M2. {f_1}({t_1}) = {f_2}({t_1}{M_2}/{M_1}) \times {M_1}/{M_2}{\kern 1pt} . This is an expression we can solve for t1, which may then be used to determine the T /ε threshold from (53) T/ε=t1/f1(t1). T/\epsilon = {t_1}/{f_1}({t_1}){\kern 1pt} . This means that the threshold is still for the ratio T /ε, rather than having separate dependence on T and ε.

4.3.
Motivation for Considering Eigenvalue Error

Here, we give further explanation of why it is important to consider eigenvalue error. If the goal is to estimate eigenvalues of the Hamiltonian (as is often the case in quantum chemistry), then the measure of the error should be that in the eigenvalues. That is, how close are the eigenvalues of the product formula to those of the exact Hamiltonian evolution. In the case that the goal is to accurately reproduce the final state after the evolution, then an appropriate measure of the error should be the spectral norm of the difference of the unitary operators. This error accounts for both error in the eigenvalues and basis.

That error will upper bound the 2-norm error in the generated quantum state, but using the triangle inequality to bound the error for long evolution times overestimates the error. This is because the error beyond that in the eigenvalues cancels when using a product of many short time intervals. Let us denote the exact evolution operator for short time U , and the approximate evolution operator provided by the product formula as U˜ \widetilde U . In the basis of the Hamiltonian's eigenstates, U is diagonal, and we can diagonalise U˜ \widetilde U in this basis as U˜=VDV \widetilde U = VD{V^\dagger } . Then the difference between D and U describes the eigenvalue error, since these are diagonal matrices with the eigenvalues on the diagonal. The matrix V describes the basis error.

It is convenient to write V=eiH˜ V = {e^{i\tilde H}} for some Hermitian matrix H˜ \tilde H . Then the error can be given as (54) U˜U=eiH˜DeiH˜U=(I+iH˜+O(H˜2))D(IiH˜+O(H˜2))U=i[H˜,D]+DU+O(H˜2)[H˜,DI]+DU+O(H˜2). \matrix{ {\left\| {\widetilde U - U} \right\|} \hfill & { = \;\left\| {{e^{i\tilde H}}D{e^{ - i\tilde H}} - U} \right\|} \hfill \cr {} \hfill & { = \left\| {(I + i\tilde H + {\cal O}({{\tilde H}^2}))D(I - i\tilde H + {\cal O}({{\tilde H}^2})) - U} \right\|} \hfill \cr {} \hfill & { = \;\left\| {i[\tilde H,D] + D - U + {\cal O}({{\tilde H}^2})} \right\|} \hfill \cr {} \hfill & { \le \left\| {[\tilde H,D - I]} \right\| + \left\| {D - U} \right\| + {\cal O}({{\tilde H}^2}){\kern 1pt} .} \hfill \cr } That is, the error can be split up between a part [H˜,DI] [\tilde H,D - I] corresponding to basis error, and a part DU corresponding to eigenvalue error. Note that DI is proportional to t, so the error in the operator due to the basis error is one order higher than the error in the basis.

Then, for the case of a large number of time steps r, we can bound the overall error as (55) U˜rUr=VDrVUr=(VDrVDrV)+(DrVUrV)+(UrVUr)VI+DrUr+VI2VI+rDU. \matrix{ {\left\| {{{\widetilde U}^r} - {U^r}} \right\|} \hfill & { = \left\| {V{D^r}{V^\dagger } - {U^r}} \right\|} \hfill \cr {} \hfill & { = \left\| {(V{D^r}{V^\dagger } - {D^r}{V^\dagger }) + ({D^r}{V^\dagger } - {U^r}{V^\dagger }) + ({U^r}{V^\dagger } - {U^r})} \right\|} \hfill \cr {} \hfill & { \le \left\| {V - I} \right\| + \left\| {{D^r} - {U^r}} \right\| + \left\| {{V^\dagger } - I} \right\|} \hfill \cr {} \hfill & { \le 2\left\| {V - I} \right\| + r\left\| {D - U} \right\|{\kern 1pt} .} \hfill \cr } That is, the error due to the eigenvalues is multiplied by a factor of r, but the error due to the basis is not. The error due to the basis can be larger than for a single time step, but only by a factor of ~ 1/t. That factor arises because for a single time step the basis error is commuted with DI which is proportional to t, so the expression here without DI can be a factor ∝ 1/t larger.

In practice we find that in many cases the error in the basis scales as 𝒪(tk), resulting in the expected contribution to the spectral-norm error 𝒪(tk+1). However, in other cases we find scaling of the error in the basis as 𝒪(tk+1), resulting in a higher-order contribution to the spectral-norm error. Let us give the error in the eigenvalues as ζtk+1, and the error in the basis as μtk+ν where ν ∈ [0, 1] to account for the various scalings found numerically.

Then the choice of r to make the error in the eigenvalues smaller than ε is (56) rζTε1/kT. r \approx {\left( {{{\zeta T} \over \epsilon }} \right)^{1/k}}T. The contribution to the total error from the basis error is (57) 2μ(T/r)k+ν2μεζT1+ν/k. 2\mu {(T/r)^{k + \nu }} \approx 2\mu {\left( {{\epsilon \over {\zeta T}}} \right)^{1 + \nu /k}}{\kern 1pt} . That gives the ratio of the basis error to the eigenvalue error as approximately (58) 2μεεζT1+ν/k=2μζTεζTν/k. {{2\mu } \over \epsilon }{\left( {{\epsilon \over {\zeta T}}} \right)^{1 + \nu /k}} = {{2\mu } \over {\zeta T}}{\left( {{\epsilon \over {\zeta T}}} \right)^{\nu /k}}{\kern 1pt} . Hence, for large T the eigenvalue error should be dominant. This means that the eigenvalue error for a single step is the relevant measure to estimate the spectral-norm error for long-time evolution.

5.
Numerical Comparison of Product Formulae
5.1.
Comparison for General Matrices

In what follows, we report on the comparison between product formulae of 4th, 6th, 8th, and 10th order based on the threshold provided above. We analyse those found in this work, and those from prior work including processed product formulae. When giving the number of stages of a processed product formula, we refer to the number of stages in the kernel (not P) because those are the ones that would be repeated for a simulation over an extended time.

We list 4th order product formulae in Table 4, 6th order in Table 5, 8th order in Table 6, and 10th order in Table 7, and give the constant factors in the error scaling χ, ζ (spectral norm or eigenvalue error). These product formulae include some constructed with even M rather than M = 2m + 1 as for our product formulae. We generated 1024 random Hamiltonians for each product formula, and then estimated the constants χ and ζ by taking the geometric mean. Each random Hamiltonian is generated as a sum of two random Hamiltonians H = A + B, where A and B are random Hermitian matrices of dimension 64 × 64 and norm 1.

Table 4.

The list of 4th order product formulae. The column labelled “processing” indicates whether the formula uses processor. The label is the name we will refer to the formula by. The constant factor in the error is denoted χ for spectral-norm error and ζ for eigenvalue error, and the corresponding quantities 1/k and 1/k are given. The column labelled S2 indicates whether the product formula is a product of S2. The best result is highlighted in green, and is from 2002 [47]. The 2006 result PPBCM4m6 is slightly higher error, but it is not evident to 2 significant figures.

labelMS2processingreferenceχ1/kζ1/k

S4m13YNfirst fractal product [2]4.9 × 10−21.412.3 × 10−21.17
S4m25YNsecond fractal product [4]3.0 × 10−31.173.3 × 10−40.67
O4M55NNOstmeyer Eq. (40) of [46]3.0 × 10−40.667.7 × 10−50.47
BM4M66NNS6 Table 2 of [47]1.6 × 10−40.672.4 × 10−50.42
PPBCM4m66NYP64 Table 5 of [31]5.7 × 10−50.522.4 × 10−50.42
BCE4m36YY ψ3[4] \psi _3^{[4]} Table 6 of [40]1.6 × 10−22.151.5 × 10−31.17
BCE4m48YY ψ4[4] \psi _4^{[4]} Table 6 of [40]3.4 × 10−41.099.8 × 10−50.80
BCE4m510YY ψ5[4] \psi _5^{[4]} Table 6 of [40]6.7 × 10−50.902.1 × 10−50.68
BCE4m612YY ψ6[4] \psi _6^{[4]} Table 6 of [40]2.6 × 10−50.857.0 × 10−60.62
BCE4m714YY ψ7[4] \psi _7^{[4]} Table 6 of [40]1.3 × 10−50.853.0 × 10−60.58
BCE4m816YY ψ8[4] \psi _8^{[4]} Table 6 of [40]7.9 × 10−50.851.5 × 10−60.56
BCE4m918YY ψ9[4] \psi _9^{[4]} Table 6 of [40]5.3 × 10−60.868.6 × 10−70.55
Table 5.

The list of 6th order product formulae with their average errors. The column labelled S2 indicates whether it is a product of S2. The best result is highlighted in green, and is from 2006 [31].

labelMS2processingreferenceχ1/kζ1/k

S6m19YNfirst fractal product [2]4.5 × 10−25.362.3 × 10−24.81
S6m225YNsecond fractal product [4]1.3 × 10−53.832.0 × 10−71.91
Y6m3a7YNSolution A in Table 1 of [3]2.0 × 10−32.491.2 × 10−32.28
KL6s9a9YNs9odr6a in Appendix A of [25]2.9 × 10−42.311.7 × 10−42.12
KL6s9b9YNs9odr6b in Appendix A of [25]2.9 × 10−42.311.7 × 10−42.11
SS6s1111YNSection 4.2 of [29]4.2 × 10−52.051.4 × 10−51.71
SS6s1313YNSection 4.2 of [29]2.1 × 10−52.163.4 × 10−61.59
BM6M1010NNS10 Table 2 of [47]6.0 × 10−61.351.4 × 10−61.06
PPBCM6m99NYP96 Table 5 of [31]5.0 × 10−61.184.3 × 10−70.78
PPBCM6m511YYP116 in Table 6 of [31]1.9 × 10−61.229.0 × 10−71.08
PPBCM6m613YYP136 in Table 6 of [31]4.8 × 10−71.152.5 × 10−71.03
BCE6m510YY ψ5[6] \psi _5^{[6]} Table 8 of [40]5.8 × 10−34.243.2 × 10−33.84
BCE6m612YY ψ6[6] \psi _6^{[6]} Table 8 of [40]7.7 × 10−52.472.0 × 10−51.98
BCE6m714YY ψ7[6] \psi _7^{[6]} Table 8 of [40]1.6 × 10−52.222.4 × 10−61.62
BCE6m816YY ψ8[6] \psi _8^{[6]} Table 8 of [40]4.6 × 10−62.063.2 × 10−71.32
BCE6m918YY ψ9[6] \psi _9^{[6]} Table 8 of [40]7.3 × 10−62.511.3 × 10−71.28
BCE6m1020YY ψ10[6] \psi _{10}^{[6]} Table 8 of [40]3.2 × 10−62.429.9 × 10−90.93
BCE6m1122YY ψ11[6] \psi _{11}^{[6]} Table 8 of [40]7.9 × 10−63.111.4 × 10−81.08
Table 6.

The list of 8th order product formulae with their average errors. This table shows our new results for product formulae that improve over those in the prior literature. The result that provides the most efficient simulations is highlighted in (dark) green, and is our processed product formula. That provides the best performance due to the shorter length, but the lowest error is provided by our product formula highlighted in (dark) blue. The lowest error results from prior work are highlighted in light blue (for the non-processed case), and light green (for the processed case). The processed product formula PP8s13 (P138) has larger error than the highlighted PP8s19, but better performance due to its shorter length.

labelMprocessingreferenceχMχ1/kζ1/k

S8m127Nfirst fractal product [2]5.6 × 10−218.81.6 × 10−216.2
S8m2125Nsecond fractal product [4]6.7 × 10−911.95.2 × 10−133.64
Y8m7d15NSolution D in Table 2 of [3]1.1 × 10−36.411.3 × 10−44.89
MC8s1515NTable 2 of [24]6.5 × 10−63.372.0 × 10−62.90
MC8s1717NTable 2 of [24]7.9 × 10−72.943.3 × 10−72.63
KL8s17a17Ns17odr8a in [25]6.1 × 10−72.842.7 × 10−72.56
KL8s17b17Ns17odr8b in [25]5.9 × 10−72.832.5 × 10−72.54
SS8s1919NSection 4.3 of [29]1.8 × 10−72.725.3 × 10−82.34
SS8s2121NSection 4.3 of [29]3.4 × 10−73.277.5 × 10−82.70
PP8s1313YP138 in Table 6 of [31]1.2 × 10−62.376.5 × 10−72.19
PP8s1919YP198 in Table 6 of [31]NANA2.4 × 10−72.83
Y8m1021NTable 1 (our new result)5.8 × 10−82.617.0 × 10−92.01
Y8m10b21NTable 1 (our new result)6.3 × 10−73.535.4 × 10−101.46
YP8m817YTable 2 (our new result)5.3 × 10−82.098.1 × 10−101.24
Table 7.

The list of 10th order product formulae with their average errors. The best result is highlighted in green, and is the product formula from 2005 [29]. That product formula provides exceptional performance for 10th order, being about 280 times better than all others from prior work, and half the error of our solution (bottom line).

labelMprocessingreferenceχMχ1/kζ1/k

S10m181Nfirst fractal product [2]9.0 × 10−263.72.7 × 10−344.8
S10m2625Nsecond fractal product [4]4.1 × 10−1336.06.1 × 10−199.44
KL10s31a31Ns31odr10a in Appendix A of [25]8.7 × 10−69.675.8 × 10−69.28
KL10s31b31Ns31odr10b in Appendix A of [25]8.4 × 10−512.14.1 × 10−511.3
Tsi10s3333NTable II of [26]3.3 × 10−69.336.4 × 10−77.93
SS10s3131NSection 4.4 of [29]4.2 × 10−85.662.4 × 10−85.36
SS10s3333NSection 4.4 of [29]1.4 × 10−85.428.8 × 10−95.17
SS10s3535NSection 4.4 of [29]1.0 × 10−94.413.1 × 10−113.11
Alberdi3131NAppendix A of [48]1.5 × 10−76.441.0 × 10−76.20
Alberdi3333NAppendix A of [48]8.4 × 10−86.475.5 × 10−86.20
Alberdi3535NAppendix A of [48]1.5 × 10−85.799.5 × 10−95.52
PP10s1919YP1910 in Table 6 of [31]NANA6.2 × 10−65.73
PP10s2323YP2310 in Table 6 of [31]1.4 × 10−65.961.6 × 10−83.82
Y10m1735NTable 31.9 × 10−85.916.1 × 10−113.33

The size of the time step used for the estimation is t = e−5/2 ≈ 0.08, which we found to be small enough for accurate estimation of the constants. An exception is the 10th order product formula PP10s23 in the second-last line of Table 7. That was evaluated for t = e−3/2 ≈ 0.2, rather than t ≈ 0.08 for the other product formulae. Estimates for t ≈ 0.08 are significantly larger, possibly because the coefficients for that product formula were not given to high precision. We provide the estimates of χ, ζ for larger t to evaluate it as fairly as possible.

To compare among product formulae of the same order, we compare 1/k for eigenvalue error, and the best results in each table are highlighted. We also provide values of 1/k for comparison, but these are not used to select the best product formulae. For 8th order it can be seen that our new processed product formula provides the best performance out of all formulae tested. Our two new product formulae of length m = 10 (M = 21) also give better performance than any prior product formulae tested, including processed product formulae. Hence we have given a sequence of three new solutions here, all of which provide improved performance over prior work. There are also other results in our database of over 100,000 solutions that outperform prior work, but we have just given a selection of the best here.

In terms of eigenvalue error, our solution Y8m10b is about 100 times more accurate than the lowest-error prior result, SS8s19. Our processed product formula YP8m8 is about a factor of 300 times more accurate than the lowest-error prior result for processed product formulae, PP8S19, which is longer. These surprisingly large improvements demonstrate that the solution space had not been fully explored in prior work. Each increase in m greatly expands the solution space to explore, and it is likely that there are further improvements still to be found. The error for our processed product formula YP8m8 has slightly larger error than our non-processed formula Y8m10b, but its shorter length means that 1/k is smaller, thus providing the best performance out of all product formulae tested.

These improvements are consistent between the samples of the Hamiltonians. In comparing our formula Y8m10b to the best from prior work SS8s19, our smallest improvement (out of 1,024 samples) was about a factor of 35. A histogram of the improvement for our new product formula is shown in Figure 1. We also find that the improvements are consistent between sizes of the matrices. We have tested sizes 6, 16, and 32 as well as 64 (which is shown in the tables), and found significant improvements at all sizes tested.

Figure 1.

A histogram of the ratio of the error of the most accurate prior product formula SS8s19 to our new solution, in the case without processing. The histogram is over the 1024 samples of 64 × 64 matrices. The average improvement is about a factor of 100, with the lowest being about 35.

A recent review [33] also provides a comparative analysis of many of the product formulae in the literature, and for 8th order their recommended methods are SS8s21 in the case without processing, and PP8s13 for the case with processing. In our testing we find that SS8s19 is slightly better than SS8s21; this difference may be because we are testing Hamiltonians here, whereas Ref. [33] considers general matrices (which are non-Hermitian). PP8s19 provides lower error than PP8s13, but PP8s13 provides better performance due to the shorter length (and correspondingly smaller value of 1/k). The three new results we have listed in Table 6 all outperform those methods recommended by Ref. [33].

As a further test, we have calculated the error for simulating the transverse-field Ising model for an 8-qubit chain. We used a time step of 1, with the two components of the Hamiltonian normalised to unit norm, so it is (59) H=17j=17ZjZj+1+18j=18Xj. H = - {1 \over 7}\sum\limits_{j = 1}^7 {Z_j}{Z_{j + 1}} + {1 \over 8}\sum\limits_{j = 1}^8 {X_j}{\kern 1pt} . We then determined the spectral-norm error for up to 1000 steps, for our minimum eigenvalue error solution Y8m10b, as well as SS8s19 and SS8s21 which provided the minimum eigenvalue error out of results presented in prior work. As seen in Figure 2, the error is far smaller for our new product formula than that provided by those from prior work. The initial error is larger as indicated by the larger spectral-norm error for a single step, but then the error for a large number of time steps is dominated by the eigenvalue error which is much smaller for this product formula. It is also surprising that the simulation is highly accurate for a time step of 1, which is not the short time step normally considered for scaling of the error. The eigenvalue error for a single time step of 1 is only 2.7 × 10−12 for our product formula.

Figure 2.

The spectral-norm error for simulating an 8-qubit transverse-field Ising Hamiltonian in Eq. (59) for our product formula Y8m10b (black), SS8s19 (blue), and SS8s21 (red) from Ref. [29]. The size of the time step is 1, and the components of the Hamiltonian are normalised to unit norm.

In the case of 10th order, we find that the best performance is provided by SS10s35, from Sofroniou and Spaletta [29] (which is recommended by Ref. [33]). The performance of that product formula is extraordinarily good, with error nearly a factor of 280 times less than any others we have tested from prior work. Our solution Y10m17 provides intermediate performance, having error twice as large as SS10s35, but a factor of 140 times better than any others. For example, it outperforms the processed product formula PP10s23, which was also recommended by Ref. [33]. The fact that there is a very exceptional solution at one particular length for 10th order suggests that there are better solutions still to be found, motivating further searches in future work.

For 4th order the best result for eigenvalue error is given by the formula BM4M6 of Blanes and Moan [47], which is not in the form of a product of S2. The processed product formula PPBCM4m6 also provides excellent performance, with a marginally larger error in our testing. A recently proposed formula O4M5 of Ostmeyer [46] also gives good performance. In the 6th order case we find that the best performance is given by PPBCM6m9, which is a processed product formula not constructed as a product of S2. The recently proposed processed product formula BCE6m10 also gives excellent performance with low error, but is much longer, resulting in a larger value of 1/k.

In all cases the fractal products provide poor performance. The second fractal product yields very low error but very large M, equal to 125 for 8th order and 625 for 10th order. This is far beyond that for any other product formulae considered here, making this product formula uncompetitive despite the small error. The first fractal product yields large error as well as large M, making its performance by far the worst out of any product formula tested. We also tested various combinations of the two formulae (by using different formulae in the iteration to increase the order) or using fractal formulae to increase the order of lower-order formulae. None of these provided competitive performance, so are not included in the tables.

To consider the threshold to use a higher-order product formula over a lower-order one, we first consider the asymptotic expression using 1/k for eigenvalue error. For 4th order the value of this quantity is 0.42 (BM4M6) and for 6th order is 0.78 (PPBCM6m9), which gives a threshold of 1,700. The value for the best 8th order is 1.24 (our result YP8m8), which gives a threshold of around 6000 when compared with 4th order and 68,000 when compared with 6th order.

At 10th order the best value is 3.11 (SS10s35), which when compared with 8th order gives a threshold of 9.3 × 1015. This is far greater than can be expected for realistic simulations. To see the reason for this, if we use Eq. (46) for the approximate number of exponentials needed for phase estimation (where the threshold is for 1/εH), it would be more than 3 × 1018. In realistic implementations of quantum chemistry it takes about 104 Toffoli gates per exponential, so this would correspond to a Toffoli complexity over 1022. For comparison, in the analysis of Shor's algorithm in Ref. [49], 3 × 109 Toffolis were estimated to take 8 hours. Even with improvements in gate speed, a simulation sufficiently large for 10th order product formulae to provide an advantage would require a computation time of many millions of years.

This threshold between 8th and 10th order can be expected to be accurate (at least for the estimate of the order of magnitude) because it corresponds to small time steps below 0.3. On the other hand, the other thresholds would correspond to large time steps, so would be inaccurate because the asymptotic expressions break down. The calculated threshold between 6th and 8th order would correspond to a time step larger than 3, which is far too large for the scaling of the error to be accurate.

In this case we find numerically that the threshold between these 6th and 8th order formulae (PPBCM6m9 and YP8m8) is increased dramatically, to about 3.2 × 107, corresponding to a time step of about 1.22. If the target ε is 0.01 for example, that would correspond to about 260,000 time steps. It is possible to optimise our kernel for the larger time step size in order to provide better performance. This optimisation provides a kernel (see Table 8) such that the threshold is reduced about a factor of twelve to 2.8 × 106. This now corresponds to 15,000 time steps with a time step size of about 1.82. This is fewer time steps than expected for simulations of classically intractable systems. Thus the optimisation provides another order of magnitude where 8th order product formulae are optimal, albeit with a product formula tailored to the time step size.

Table 8.

The 8th order kernel tailored for large time step size t = 1.82 to improve the threshold.

kernel for large time step

w10.1777372900430394
w20.2862580532195395
w30.1701306063199336
w4−0.3746748008394162
w50.1485267804844835
w6−0.3773225725485588
w70.5395886879620081
w80.2210419534887659

One must also be careful in considering thresholds based on eigenvalue error because there needs to be a large number of time steps such that the spectral-norm error in the complete evolution time is dominated by the eigenvalue error. There is a factor of about 65 between the spectral-norm error and the eigenvalue error for the 8th order product formula YP8m8. For these thresholds the number of time steps is orders of magnitude larger than this, which indicates that the eigenvalue error is an appropriate measure.

These threshold calculations are for Hamiltonians that are sums of two terms, each of which is normalised. In the case where the Hamiltonian is not normalised, the threshold should be scaled by the norm. In the simple case where the Hamiltonian is a sum of two terms with equal norms ‖A‖ = ‖B‖, then the threshold will be for ‖AT/ɛ rather than T/ɛ. More generally the Hamiltonian may be a sum of any number of terms with different norms, which would change the threshold and make it unclear what quantity the threshold should be considered in terms of. There is also the possibility that the threshold can be changed for Hamiltonians that have structure to them instead of being random, for example those for quantum chemistry.

5.2.
Comparison for Fermionic Hamiltonians

Fermionic Hamiltonians encountered in quantum chemistry often have the form (60) p,q=1dτpqapaq+p,q=1dνpqapapaqaq \sum\limits_{p,q = 1}^d {\tau _{pq}}a_p^\dagger {a_q} + \sum\limits_{p,q = 1}^d {\nu _{pq}}a_p^\dagger {a_p}a_q^\dagger {a_q} where ap a_p^\dagger and ap are the fermionic creation and annihilation operators acting on orbital p, and there are a total of d orbitals. Each entry τpq, νpq is real and there is symmetry in exchanging indices. The behaviour of the error as the size of system is changed can be predicted based on the result in Theorem 4 of [50].

Theorem 2 (Theorem 4 in [50]).

Let H=T+V=p,qdτpqapaq+p,qdνpqapapaqaq H = T + V = \sum\nolimits_{p,q}^d {\tau _{pq}}a_p^\dagger {a_q} + \sum\nolimits_{p,q}^d {\nu _{pq}}a_p^\dagger {a_p}a_q^\dagger {a_q} be an interacting-electronic Hamiltonian, and Sk(t) be a kth order product formula splitting the evolutions under T and V . Then (61) Sk(t)eitHWη=Oτ1+ν1,[η]k1τ1ν1,[η]ηtk+1, {\left\| {{S_k}(t) - {e^{ - itH}}} \right\|_{{W_\eta }}} = {\cal O}\left( {{{\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)}^{k - 1}}{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta {t^{k + 1}}} \right){\kern 1pt} , where ‖⋅‖Wη corresponds to the operator norm on the operator acting in the η-electron subspace, τ1=maxpqτpq, {\left\| \tau \right\|_1} = {\max _p}{\sum _q}\left| {{\tau _{pq}}} \right|, and ν1,[η]=maxpmaxq1<q2<<qηνpq1++νpqη. {\left\| \nu \right\|_{1,[\eta ]}} = {\max _p}{\max _{{q_1} < {q_2} < \cdots < {q_\eta }}}\left( {\left| {{\nu _{p{q_1}}}} \right| + \cdots + \left| {{\nu _{p{q_\eta }}}} \right|} \right).

To fairly compare product formulae as applied to quantum chemistry, we can define ω so that (62) ζ=ω(τ1+ν1,[η])k1τ1ν1,[η]η. \zeta = \omega {\kern 1pt} {({\left\| \tau \right\|_1} + {\left\| \nu \right\|_{1,[\eta ]}})^{k - 1}}{\left\| \tau \right\|_1}{\left\| \nu \right\|_{1,[\eta ]}}\eta {\kern 1pt} . Note that the expression in Ref. [50] was derived for error in the spectral norm, but that also provides an upper bound on the error in the eigenvalues, so it is reasonable to consider the constant defined for eigenvalue error. Then the formula for the threshold T /ε becomes (63) Tε=M2[ω2(τ1+ν1,[η])k21τ1ν1,[η]η]1/k2M1[ω1(τ1+ν1,[η])k11τ1ν1,[η]η]1/k111k11k2=M2[ω2τ1ν1,[η]η/(τ1+ν1,[η])]1/k2M1[ω1τ1ν1,[η]η/(τ1+ν1,[η])]1/k111k11k2τ1ν1,[η]ητ1+ν1,[η]Tε=M2ω21/k2M1ω11/k111k11k2. \matrix{\hfill{{T \over \smallint }} & { = {{\left( {{{{M_2}{{[{\omega _2}{{({{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}})}^{{k_2} - 1}}{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta ]}^{1/{k_2}}}} \over {{M_1}{{[{\omega _1}{{({{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}})}^{{k_1} - 1}}{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta ]}^{1/{k_1}}}}}} \right)}^{{1 \over {{1 \over {{k_1}}} - {1 \over {{k_2}}}}}}}} \hfill \cr \hfill{} & { = {{\left( {{{{M_2}{{[{\omega _2}{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta /({{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}})]}^{1/{k_2}}}} \over {{M_1}{{[{\omega _1}{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta /({{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}})]}^{1/{k_1}}}}}} \right)}^{{1 \over {{1 \over {{k_1}}} - {1 \over {{k_2}}}}}}}} \hfill \cr \hfill{{{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}{T \over \smallint }} & { = {{\left( {{{{M_2}\omega _2^{1/{k_2}}} \over {{M_1}\omega _1^{1/{k_1}}}}} \right)}^{{1 \over {{1 \over {{k_1}}} - {1 \over {{k_2}}}}}}}.} \hfill \cr } For estimation of eigenvalues (rather than simulation of time evolution) the same derivation holds with T /ε replaced by 1/εH . Thus we see that the ratio ‖τ1ν1,[η]η/(‖τ1 + ‖ν1,[η]) governs the threshold where a higher-order product formula will improve over a lower-order product formula. This is somewhat different from the factor of the norm of the Hamiltonian that might otherwise be expected for unstructured Hamiltonians.

To analyse the non-asymptotic regime, we rewrite the error to be proportional to (64) τ1ν1,[η]ητ1+ν1,[η]t×tτ1+ν1,[η]k. {{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}t \times {\left[ {t\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right]^k}. Following the derivation of Ref. [50], it is easily seen that the contributions to the error from each of the orders higher than k + 1 will be of a similar form. These higher-order contributions to the error can be used to determine the error in the non-asymptotic regime where the order k + 1 term is not dominant.

To see why that is the case, note that Lemma 2 of Ref. [50] describes the error at order k + 1 (p + 1 in the notation of that work) in terms of a sum over norms of multicommutators of k + 1 terms in the Hamiltonian. It is easily seen that the error at each of the higher orders will also be given by similar multicommutator forms. The method of proof of Theorem 4 in Ref. [50] is based on simplifying the multicommutator expressions from Lemma 2, and proceeds in exactly the same way for the higher orders.

Therefore, the sum over errors of all orders would give a total error of the form (65) τ1ν1,[η]ητ1+ν1,[η]t×j=k+1ξjtτ1+ν1,[η]j. {{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}t \times \sum\limits_{j = k + 1}^\infty {\xi _j}{\left[ {t\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right]^j}. This sum would then give the non-asymptotic form for the error (66) τ1ν1,[η]ητ1+ν1,[η]t×gtτ1+ν1,[η], {{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}t \times g\left( {t\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right){\kern 1pt} , for some function g.

Then following the same procedure to determine the threshold as before, we will have a total error given by (67) τ1ν1,[η]ητ1+ν1,[η]T×gtτ1+ν1,[η]. {{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}T \times g\left( {t\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right){\kern 1pt} . That would imply that comparing two product formulae, the threshold will be obtained for (68) g1t1τ1+ν1,[η]=g2t2τ1+ν1,[η]. {g_1}\left( {{t_1}\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right) = {g_2}\left( {{t_2}\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right){\kern 1pt} . Using t2 = t1M2/M1 and solving for t1 as before, the T /ε threshold would then be given by substituting the solution for t1 into (69) τ1ν1,[η]ητ1+ν1,[η]Tε=1/g1t1τ1+ν1,[η]. {{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}{T \over \epsilon } = 1/{g_1}\left( {{t_1}\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right){\kern 1pt} .

The threshold can be calculated for Hamiltonians with a given value of ‖τ1 + ‖ν1,[η], then for some new Hamiltonian with different values of the norms, the threshold should still be for the same value of t1(‖τ1 + ‖ν1,[η]). This implies that the right-hand side of Eq. (69) should be the same for the threshold with the new Hamiltonian. As a result, we expect that the threshold should be in terms of the left-hand side of Eq. (69), which is the same expression as found in the asymptotic regime.

In our numerical testing, the coefficients τpq, νpq were chosen uniformly from the interval [−1, 1]. We computed ω for a selection of the product formulae with the best performance for random Hamiltonians. In Table 9, Table 10, and Table 11 we give the computed result for d = 6 orbitals, assuming half-filling of the orbitals. Our numerics indicate that the error is roughly proportional to the bound in Eq. (61), independent of the number of orbitals, though the constant factors are small. In Appendix C we give the constant factors for the case d = 4, showing that the computed constant does not change much with a different d.

Table 9.

Comparison of constant factors ω for a selection of the lowest-error best product formulae for 6th order. We generate 1, 000 random Hamiltonians with d = 6 orbitals as in Eq. (60) and compute the average ω.

labelMprocessingreferenceω1/k

PPBCM6m99YP96 Table 5 of [31]2.8 × 10−90.33
PPBCM6m613YP136 in Table 6 of [31]1.2 × 10−90.42
BCE6m1020Y ψ10[6] \psi _{10}^{[6]} Table 8 of [40]3.4 × 10−110.36
Table 10.

Comparison of constant factors ω for a selection of the lowest-error product formulae for 8th order. We generate 1, 000 random Hamiltonians with d = 6 orbitals as in Eq. (60) and compute the average ω.

labelMprocessingreferenceω1/k

SS8s1919NSection 4.3 of [29]3.5 × 10−110.94
PP8s1313YP138 in Table 6 of [31]4.4 × 10−100.88
Y8m1021NTable 1 (our new result)8.5 × 10−120.87
Y8m10b21NTable 1 (our new result)1.3 × 10−120.68
YP8m817YTable 2 (our new result)1.7 × 10−120.57
Table 11.

Comparison of constant factors ω for a selection of the lowest-error product formulae for 10th order. We generate 1, 000 random Hamiltonians with d = 6 orbitals as in Eq. (60) and compute the average ω.

labelMprocessingreferenceω1/k

SS10s3535NSection 4.4 of [29]3.0 × 10−151.24
PP10s2323YP2310 in Table 6 of [31]2.3 × 10−121.58
Y10m1735NTable 32.6 × 10−141.53

From the results in these tables, we see that the relative performance for the product formulae in unchanged from the results with the random matrices. This is a useful independent verification that the assessment of the performance of the product formulae is relatively independent of the class of matrices. In particular, we find that for 8th order our three new results all outperform the best product formulae in prior work (where we have compared to the two prior formulae with best performance). In the case of 10th order the product formula SS10s35 of Ref. [29] still provides exceptional performance.

For calculating the thresholds based on the asymptotic formula in Eq. (63), we should compare our best 8th order formula YP8m8 to SS10s35 in the 10th order case, and PPBCM6m9 in the 6th order case. Then we obtain a threshold of 3 × 1013 for 8th to 10th order, and 5 × 105 for 6th to 8th order, indicating about 9 orders of magnitude for T /ε where our 8th order product formula is optimal.

Similar to the case for general matrices, we expect that the threshold for 8th to 10th order is sufficiently large that the time steps are small and the asymptotic formulae are accurate, but the threshold between 6th and 8th order may be unreliable. In that case we solve Eq. (68) for t1 with t2 = t1M2/M2, and find the threshold using Eq. (69). We expect that the 6th order product formula PPBCM6m9 can give better performance for larger time steps, and indeed we find that the threshold is further increased to 1.7 × 107. By using our 8th order kernel optimised for large time step in Table 8 we can improve the threshold between PPBCM6m6 and our 8th order formula in the non-asymptotic case. We then obtain a threshold of around 3 × 106, which is an improvement by a factor of 5 times.

Next we compare these thresholds to the parameters for specific systems. From Ref. [50] the norms can be expected to scale as (70) ν1,[η]=Oη2/3N1/3Ω1/3,τ1=ON2/3Ω2/3, {\left\| \nu \right\|_{1,[\eta ]}} = {\cal O}\left( {{{{\eta ^{2/3}}{N^{1/3}}} \over {{\Omega ^{1/3}}}}} \right),\quad \quad {\left\| \tau \right\|_1} = {\cal O}\left( {{{{N^{2/3}}} \over {{\Omega ^{2/3}}}}} \right), where N is the number of orbitals and Ω is the volume (denoted n and ω in [50]). The constants for the scaling of these norms are derived in Ref. [51] as (71) ν1,[η]π1/3(3/4)2/3η2/3N1/3Ω1/3,τ13π2N2/32Ω2/3. {\left\| \nu \right\|_{1,[\eta ]}} \approx {\pi ^{1/3}}{(3/4)^{2/3}}{{{\eta ^{2/3}}{N^{1/3}}} \over {{\Omega ^{1/3}}}},\quad \quad {\left\| \tau \right\|_1} \approx {{3{\pi ^2}{N^{2/3}}} \over {2{\Omega ^{2/3}}}}. For an initial order of magnitude estimate, the chemical accuracy required for the phase estimation is about 0.001 Hartree. As discussed above, for phase estimation we replace T /ε in the threshold with 1/εH . With estimates η ≈ 100 and N /Ω in the range 103 to 109 (corresponding to Δ between 10−1a0 and 10−3a0 in Ref. [52]), the left-hand side (LHS) of Eq. (63) would be in the range 2.2 × 107 to 2.6 × 109. That is far below the threshold between 8th and 10th order, demonstrating that our 8th order should be preferred to 10th order. It is also above the threshold between 6th and 8th order, indicating that 8th order provides the best performance for simulations of this type. For smaller simulations 6th order will be optimal, but it may be expected that the small values of N /Ω would not provide a sufficiently accurate representation of the quantum system.

For comparison, we also consider the Alpha + Hydrogen system from Ref. [51]. The parameters η, N , and Ω are given in Table 1 of that work as 218, 533, and 2420, respectively. Some of the most challenging values of T , ε are given in Figure 4 of that work, with for example T = 40 and ε = 10−4. With these parameters the LHS of Eq. (63) is approximately 8.6 × 109, which is below the threshold (between 8th and 10th order) by a few orders of magnitude. The smallest values of T /ε from Ref. [51] are obtained with T = 10 and ε = 0.01 (see Table IV). That results in a value for the LHS of Eq. (63) of 2.2 × 107, which is above the threshold between 6th and 8th order, so our 8th order product formula would still be optimal. This indicates that the new 8th order product formula found here should be optimal for the full range of parameter values used in Ref. [51]. Nonetheless, for smaller values of T /ε it may be advantageous to further customise the product formula to provide better performance at larger time steps, as described above to improve the threshold between 6th order and 8th order.

Note also that the fact that the threshold here is different than for random Hamiltonians means that the threshold will depend on the class of Hamiltonians. Selecting a different type of Hamiltonians can change the threshold. Similarly, it will be expected that choosing a particular class of fermionic Hamiltonians (such as those arising from plane waves) rather than random fermionic Hamiltonians can give a different threshold. Nevertheless, changing the class of Hamiltonians still resulted in an extremely large threshold for 10th order product formulae to be optimal. This indicates that the 8th order formula will be optimal for realistic simulations regardless of the class of Hamiltonians.

6.
Discussion

We have found 8th order product formulae with significantly lower error than in prior work. For the case of processed product formulae, our best solution is about 300 times more accurate than that in prior work, and in the case of product formulae without processing our solution is about 100 times more accurate. The size of the improvement is estimated based on testing with random 64 × 64 matrices. We have also tested on a range of different matrix sizes, fermionic Hamiltonians of different sizes, and a range of sizes of the transverse-field Ising model and find consistent improvements with well over an order of magnitude in all cases.

The size of the improvement is surprising, especially considering the best prior results are from 2005 (without processing) and 2006 (with processing). To achieve this improvement we have used distinct techniques from prior work. The processed product formulae from Ref. [31] set multiple coefficients equal to one another. Specifically, in the P198 formula, six of the coefficients are identical, following a heuristic suggested by McLachlan [27]. However, our results suggest that using distinct values for each coefficient yields better outcomes. The advantage of taking longer product formulae with distinct coefficients is that there are significantly more parameters than constraints, giving more freedom for optimisation of the product formula. The solutions which we find give the best performance have no clear pattern to the coefficients, indicating that assuming these patterns is too restrictive.

The way that product formulae were evaluated in Ref. [31] was based on the p norms of the coefficients for p = 1 and p = k + 1. Those norms are both smaller for P198 given in Ref. [31] than our lower-error solution. Similarly, these norms are both smaller for the solutions from Ref. [29] than our non-processed product formula. Also, Ref. [48] found 10th order integrators with 1-norm lower than those in Ref. [29], but in our testing their error is larger. In all these cases the solutions with lower error have larger norms. The k+1 norm corresponds to just one of the 𝒪(tk+1) error terms, and a product formula that minimises that term while giving large values for the others will not perform well. We find that the strategy of selecting product formulae with low error for examples can yield much better results than attempting to minimise norms.

In this work we compare the eigenvalue error, rather than the spectral-norm error as in prior work because it is the eigenvalue error in a single step which dominates the error over longer time evolutions, with basis error cancelling. In many applications of quantum computing the goal is to estimate eigenvalues, so it is solely the eigenvalue error which should be considered. For most product formulae tested, the spectral norm of the error (for a single step) is substantially larger than the eigenvalue error, meaning that it can be an unrealistic measure of accuracy. This also suggests that techniques for reducing error based on randomisation of the term order [21] or randomised corrections [53] may provide less improvement than anticipated because if successive steps are not identical then the basis error will not cancel.

We have shown how to fairly compare the performance of product formulae with different numbers of factors and different orders. The optimal order of integrator to use depends primarily on the ratio T /ε, the ratio of the total evolution time to the allowed error. This is because, as T /ε becomes larger, the error allowed for each individual time step becomes smaller, and the time steps must become shorter to make the error sufficiently small. That results in a larger overall number of time steps, and so a larger complexity. The higher-order integrators better reduce the error with shorter time steps, and so for large T /ε that improvement more than compensates for the larger number of exponentials for the higher-order product formulae. For eigenvalue estimation there is an equivalent threshold in 1/εH , in terms of the allowed error in the eigenvalue.

The threshold for 10th order product formulae to outperform 8th order product formulae is very large, so 10th order would only be optimal for simulations well beyond the scale considered for quantum algorithms. Similarly, it would not be useful to consider 12th or higher order product formulae for the purpose of Hamiltonian simulation, as the threshold would be even larger. For large quantum simulations that are still reasonable to perform on a quantum computer, such as those considered in Refs. [51] or [52], the best performance is provided by our 8th order product formulae (out of the extensive set of product formulae tested). For more moderately sized simulations 6th order will be optimal, and in our testing we find that the product formula S96 of Ref. [31] provides excellent performance. For smaller simulations, as in NISQ experiments, 4th order product formulae would be ideal, with S6 of Ref. [47] and P64 of [31] providing the best performance.

Care is required in determining the thresholds between the orders. Using the formula based on the asymptotic scaling can yield estimates of the T /ε threshold corresponding to large time steps where the asymptotic scaling no longer holds. When properly accounting for the error for a given time step, the T /ε threshold can change by many orders of magnitude. This can also be a problem more generally in estimating the complexity of an algorithm. Estimating the number of time steps based on the asymptotic expression may correspond to unrealistically large time steps, so the asymptotic formula is inaccurate. We find that significant performance gains can be obtained by fine-tuning a product formula for larger time steps.

A potential topic for future work is further customising product formulae for large time steps. We have customised the 8th order product formula to improve the threshold, but it would also be possible to customise the 6th and 4th order product formulae. It would also be possible to customise the product formulae for larger time steps with the fermionic Hamiltonians. This approach could be used to provide better performance for small-scale simulations suitable for NISQ devices. It could also potentially extend the parameter range where 6th order product formulae provide best performance to larger simulations on fault tolerant quantum computers.

DOI: https://doi.org/10.2478/qic-2025-0001 | Journal eISSN: 3106-0544 | Journal ISSN: 1533-7146
Language: English
Page range: 1 - 35
Submitted on: Jul 17, 2024
Accepted on: Dec 12, 2024
Published on: Jan 31, 2025
Published by: Cerebration Science Publishing Co., Limited
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year
Related subjects:

© 2025 Mauro E. S. Morales, Pedro C. S. Costa, Giacomo Pantaleoni, Daniel K. Burgarth, Yuval R. Sanders, Dominic W. Berry, published by Cerebration Science Publishing Co., Limited
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.