Selection and Improvement of Product Formulae for Best Performance of Quantum Simulation

Morales, Mauro E. S.; Costa, Pedro C. S.; Pantaleoni, Giacomo; Burgarth, Daniel K.; Sanders, Yuval R.; Berry, Dominic W.

doi:10.2478/qic-2025-0001

Full Article

1.

Introduction

The Lie-Trotter product formula is commonly used in quantum algorithms for Hamiltonian simulation, where one can approximate the Hamiltonian evolution of H = A + B in terms of exponentials of A and B when these operators do not commute. For short time, a standard approximation is the second-order symmetric formula S₂(t) = e^−iAt/2e^−iBte^−iAt/2, which satisfies e^−iHt = S₂(t) + 𝒪(t³). More generally, the error in an order k formula is 𝒪(t^k⁺¹). Longer times are simulated by breaking the evolution into many repetitions of a short time. The number of repetitions needed is reduced with the order, motivating the search for higher-order product formulae. A systematic method to produce arbitrarily high-order formulae is the fractal method [1,2,3,4], which has found applications in Hamiltonian simulation [5]. The first explicit use of product formulae for quantum simulation was given in [6], applying it for quantum evolution under local Hamiltonians. Later work considered the broader class of sparse Hamiltonians [7] and higher orders [5], as well as methods beyond product formulae [8,9,10,11].

Recent work has shown that despite its simplicity, the Lie-Trotter product formula can compete with other Hamiltonian simulation algorithms due to the low error that it achieves in practice [12]. Methods based on linear combinations of unitaries [13,14] or quantum signal processing [15] give complexity logarithmic in the inverse error, but the error is not required to be extremely small, meaning those methods do not provide as large an advantage as might be expected. Product formula error bounds can be further improved by considering particular physical systems [16,17,18] or leveraging randomisation [19,20,21]. Moreover, Trotter formulae are expected to be relevant for both noisy intermediate-scale quantum (NISQ) computation and fault-tolerant computation. It is then of great importance to seek efficient implementations of product formulae as it can have a great impact on the efficiency of Hamiltonian simulation algorithms in practice.

The downside of the fractal methods to generate higher-order formulae is that the number of exponential operators to implement it grows very rapidly. Fractal product formulae are usually assumed in quantum computing, but they can be greatly improved upon. An alternative method by Yoshida [3] can be used to obtain product formulae with a smaller number of exponentials. Similar to fractal formulae, they are given as a product of S₂ for different time intervals, but in contrast to the fractal approach there is not an explicit analytic form for the higher-order formulae. Instead one needs to derive and solve a complicated set of simultaneous non-linear polynomial equations.

In classical computing, product formulae are equivalent to symplectic integrators for numerical solution of differential equations. Since the initial work of Yoshida, there have been many results on better integrators of many different orders [22,23,24,25,26,27,28,29,30,31,32]. See Ref. [33] for a comprehensive review of product formulae/integrators in the literature. In particular, Refs. [24] and [25] give what appear to be the optimal 8th order product formulae with 15 and 17 stages (the number of S₂ in the product formula). Reference [29] gives improved 8th order product formulae with 19 and 21 stages. For 10th order, Ref. [29] provides solutions with 31, 33, and 35 stages. These solutions from 2005 in Ref. [29] appear to be the best in prior work for 8th and 10th order (see Ref. [33]).

Another approach is that of processed product formulae [31,34,35,36,37,38,39,40]. Instead of the product formula being symmetric, it is of the form PΣP⁻¹ for kernel Σ and processor P . The P and P⁻¹ cancel when using the product of many of these for evolution over a long time, so the cost is dominated by the number of stages in the kernel. Since the kernel has fewer restrictions on it, it can have fewer stages than a normal symmetric product formula. According to Ref. [33], the best 8th order product formula with processing is from 2006 [31].

The objective of this work is to find improved product formulae and compare their performance in a consistent manner. We show that the performance of product formulae is better quantified by the error in the eigenvalues, rather than the spectral-norm error as usually considered in prior work. This is because it is the eigenvalue error that dominates the error for evolution at longer times. We also derive a method to consistently compare the performance of product formulae with different orders and numbers of exponentials.

We use these methods to compare between the performance of our product formulae, as well as to compare prior product formulae in the literature. By our numerical search, we found hundreds of thousands of solutions at 8th order, including many that outperform those previously reported in the literature. Our best 8th order product formula without processing improves over the best prior work by a factor of about 100, and our best 8th order processed product formula improves over the best prior processed product formulae by a factor of nearly 300. Our best product formula without processing has slightly lower error, but our product formula with processing provides better performance due to being shorter. For 10th order, one solution from Ref. [29] provides extremely high accuracy, a factor of nearly 300 times better than any others in prior work. We provide a detailed numerical comparison of product formulae in Section 5.

When comparing product formulae of different orders, it is better to use higher-order formulae for larger values of T /ε, where T is the total evolution time and ε is the required precision. For smaller T /ε it is better to use lower-order formulae, but as it is increased there are threshold values beyond which it is optimal to use higher-order formulae. We derive methods for determining these thresholds and show that our 8th order product formula is best for T /ε from around 10⁷ to 10¹⁶, which includes the range of typical values for quantum chemistry applications. This means that the best 8th order product formula we have found in this work will be best suited to real applications.

The upper threshold means T /ε ≳ 10¹⁶ would be needed for 10th order product formulae to outperform our 8th order product formula, which is unrealistically large. Although the 10th order solution from Ref. [29] provides high accuracy, it is not competitive for quantum simulation due to its greater length. It is more than twice as long as our processed 8th order formula, so for a similar number of exponentials the 8th order product formula can use a time step half as long, resulting in much better performance for realistic values of the parameters. The lower threshold means T /ε ≲ 10⁷ is needed for 6th order to outperform 8th order. Accurate estimation of this threshold requires testing of the product formulae for larger time steps. It is possible to reduce this threshold by adjusting the 8th order product formula to provide better performance for larger time steps.

The organisation of this work is as follows. In Section 2 we give a pedagogical explanation of the product formulae and the method of solution. First, we define product formulae and introduce the fractal methods for generating higher-order product formulae in Section 2.1, then in Section 2.2 we give a summary of Yoshida's method, then we explain processors for product formulae in Section 2.3, more general types of product formulae in Section 2.4, and the Taylor series method in Section 2.5. In Sections 3.2 and 3.3 we present the results regarding new product formulae. In Section 4 we explain the method for comparing product formulae, and then in Section 5 we give the comparison of the product formulae based on numerical testing. We finish in Section 6 by discussing what the results mean for choosing appropriate product formulae for quantum simulation, as well as searching for better-performing product formulae.

2.

Background

In this section, we review the background and methods for product formulae. Readers who are familiar with the methods may wish to skip to Section 3 where we present new results. See also Ref. [33] for a more complete recent review of this topic. We begin by defining product formulae and the Baker-Campbell-Hausdorff formula, then we introduce fractal methods and Yoshida's method to obtain higher-order formulae, and describe the processed product formulae and the Taylor series method.

2.1.

Product Formulae

It is well known that, for any non-commuting operators X and Y , (1) $exp ((X + Y) t) = exp (Xt) exp (Yt) + O (t^{2}),$ \exp ((X + Y)t) = \exp (Xt)\exp (Yt) + {\cal O}({t^2}), where we have absorbed the −i factor used in quantum simulation into X and Y . The above equation demonstrates that the exponential of a sum of two operators is, to first order, equal to the product of the exponential of those operators. The above equation is often referred to as a “first-order product formula.” Higher-order terms can be computed via the Baker-Campbell-Haussdorff (BCH) formula.

Theorem 1 (Baker-Campbell-Haussdorff formula [41])

Let X and Y be any operators such that ‖X‖ + ‖Y ‖ < ln 2. We have for an operator Z that exp(X) exp(Y) = exp(Z), with (2) $Z = \sum_{n = 1}^{\infty} \frac{{(- 1)}^{n - 1}}{n} \sum_{\begin{matrix} r_{1} + s_{1} > 0 \\ ⋮ \\ r_{n} + s_{n} > 0 \end{matrix}} \frac{[X^{r_{1}}, Y^{s_{1}}, \dots X^{r_{n}}, Y^{s_{n}}]}{(\sum_{j = 1}^{n} r_{i} + s_{i}) \prod_{i = 1}^{n} r_{i}! s_{i}!},$ Z = \sum\limits_{n = 1}^\infty {{{{( - 1)}^{n - 1}}} \over n}\sum\limits_{{\scriptstyle {r_1} + {s_1} > 0 \atop {\scriptstyle \vdots \atop \scriptstyle {r_n} + {s_n} > 0}} } {{[{X^{{r_1}}},{Y^{{s_1}}}, \cdots {X^{{r_n}}},{Y^{{s_n}}}]} \over {\left( {\sum\nolimits_{j = 1}^n {{r_i} + {s_i}} } \right)\prod\nolimits_{i = 1}^n {{r_i}!{s_i}!} }}, where $[X^{r_{1}}, Y^{s_{1}}, \dots X^{r_{n}}, Y^{s_{n}}] = [\underset{r_{1}}{\underset{︸}{X, [X, \dots [X}}, [\underset{s_{1}}{\underset{︸}{Y, [Y, \dots [Y}}, \dots [\underset{r_{n}}{\underset{︸}{X, [X, \dots [X}}, [\underset{s_{n}}{\underset{︸}{Y, [Y, \dots Y}}]] \dots]] .$ [{X^{{r_1}}},{Y^{{s_1}}}, \cdots {X^{{r_n}}},{Y^{{s_n}}}] = [\underbrace {X,[X, \cdots [X}_{{r_1}},[\underbrace {Y,[Y, \cdots [Y}_{{s_1}},{\kern 1pt} \cdots {\kern 1pt} [\underbrace {X,[X, \cdots [X}_{{r_n}},[\underbrace {Y,[Y, \cdots Y}_{{s_n}}]] \cdots ]].

The standard second-order symmetric product formula is as given in the definition below.

Definition 1 (Symmetric product formula of order two)

Let X and Y be non-commuting operators and let t be a real variable. Define (3) $S_{2} (t) : = exp (\frac{1}{2} Xt) exp (Yt) exp (\frac{1}{2} Xt) .$ {S_2}(t): = \exp \left( {{1 \over 2}Xt} \right)\exp (Yt)\exp \left( {{1 \over 2}Xt} \right).

The operators X and Y used in the definition of S₂ should always be clear from context. More generally, when there is a sum of X_j, the product formula is (4) $S_{2} (t) : = \prod_{j = 1}^{J} exp (\frac{1}{2} X_{j} t)] \prod_{j = J}^{1} exp (\frac{1}{2} X_{j} t)] .$ {S_2}(t): = \left[ {\prod\limits_{j = 1}^J \exp \left( {{1 \over 2}{X_j}t} \right)} \right]\left[ {\prod\limits_{j = J}^1 \exp \left( {{1 \over 2}{X_j}t} \right)} \right]. Products are ordered with the starting index on the right and the final one on the left, so for J = 2 terms the expression in the definition is obtained. This form of the product formula can be used for Hamiltonians that are a product of any number of terms. The higher-order product formulae considered in this work are constructed from products of S₂, and due to this method of construction these product formulae are suitable for Hamiltonians that are a sum of any number of terms. The corollary below describes the form of the error terms in the symmetric product formula, and also implies that it is second order.

Corollary 1 (Symmetric BCH formula [3])

Let X and Y be any operators such that ‖X‖ + ‖Y ‖ < ln 2 and let t be a real variable. Define Z such that S₂(t) = exp(Z). Then there is a sequence α_ℓ consisting of linear combinations of ℓ-term commutators in X and Y such that (5) $Z = \sum_{ℓ = 1}^{\infty} α_{ℓ} t^{ℓ} .$ Z = \sum\limits_{\ell = 1}^\infty {\alpha _\ell }{t^\ell }. Moreover, α_ℓ ≡ 0 whenever ℓ is even.

Reference [3] also shows that even terms are zero for more general symmetric product formulae. The first three non-zero α_ℓ terms from above are (6) $α_{1} = X + Y,$ {\alpha _1} = X + Y, (7) $α_{3} = \frac{1}{12} [Y, [Y, X]] - \frac{1}{24} [X, [X, Y]],$ {\alpha _3} = {1 \over {12}}[Y,[Y,X]] - {1 \over {24}}[X,[X,Y]], (8) $\begin{array}{l} α_{5} & = \frac{7}{5760} [X, X, X, X, Y] - \frac{1}{720} [Y, Y, Y, Y, X] + \frac{1}{360} [X, Y, Y, Y, X] + \frac{1}{360} [Y, X, X, X, Y] \\ - \frac{1}{480} [X, X, Y, Y, X] + \frac{1}{120} [Y, Y, X, X, Y] . \end{array}$ \matrix{ {{\alpha _5}} \hfill & { = {7 \over {5760}}[X,X,X,X,Y] - {1 \over {720}}[Y,Y,Y,Y,X] + {1 \over {360}}[X,Y,Y,Y,X] + {1 \over {360}}[Y,X,X,X,Y]} \hfill \cr {} \hfill & { - {1 \over {480}}[X,X,Y,Y,X] + {1 \over {120}}[Y,Y,X,X,Y].} \hfill \cr } Here the square brackets are used to indicate multicommutator expressions similar to the notation in Theorem 1, for example (9) $[Y, Y, X, X, Y] \equiv [Y, [Y, [X, [X, Y]]]] .$ [Y,Y,X,X,Y] \equiv [Y,[Y,[X,[X,Y]]]]. Expressions for each α_ℓ can be derived from two applications of the usual BCH formula.

Definition 2 (Product formula)

Let X and Y be any non-commuting operators. Given a natural number n, a product formula of order n is a pair $(\vec{c}, \vec{d})$ (\vec c,\vec d) with $\vec{c}, \vec{d} \in ℝ^{ℓ}$ \vec{c},\vec{d}\in {{\mathbb{R}}^{\ell }} and ℓ a natural number such that for all t ∈ ℝ (10) $exp ((X + Y) t) = \prod_{j = 1}^{ℓ} exp (c_{j} Xt) exp (d_{j} Yt) + O (t^{n + 1}) .$ \exp ((X + Y)t) = \prod\limits_{j = 1}^\ell \exp ({c_j}Xt)\exp ({d_j}Yt) + {\cal O}({t^{n + 1}}). We refer to the number of non-zero coefficients in $(\vec{c}, \vec{d})$ \left( {\vec c,\vec d} \right) as the length of the product formula.

Hence S₂ is a length-3 product formula. The number of exponentials used in the product formula is a crucial measure of its efficiency, and in quantum simulation it is proportional to the required number of gates.

2.1.1.

Fractal Methods for Generating Higher-Order Product Formulae

Here we describe two fractal methods to obtain higher-order product formulae. Starting from the symmetrised product formula in Eq. (3), fractal methods produce product formulae at all even orders. The first fractal (or triple jump) technique to generate a product formula of order k = 2κ is [1,2,3,42,43] (11) $S_{2 κ} (t) = S_{2 κ - 2} (s_{κ} t) S_{2 κ - 2} ((1 - 2 s_{κ}) t) S_{2 κ - 2} (s_{κ} t),$ {S_{2\kappa }}(t) = {S_{2\kappa - 2}}({s_\kappa }t){S_{2\kappa - 2}}((1 - 2{s_\kappa })t){S_{2\kappa - 2}}({s_\kappa }t), where s_κ = 1/(2 − 2^1/(2κ−1)). This method can be used to generate even orders starting at S₂, or more generally to increase the order of symmetric product formulae. A drawback to this method is that both s_k and 1 − 2s_κ are greater than 1, so the coefficients in the higher-order methods are large, causing greater error.

The history of this approach is discussed in detail in a review article by Forest [44]. This form for a fourth-order integrator was originally derived by Ruth about 1984, but not published until 1990 [42]. That work specifically considered a Hamiltonian that is a sum of two parts. The iterative form was given by Creutz and Gocksch in 1989 [1], but citing unpublished work by Campostrini and Rossi for the fourth-order case. That work was published in 1990 [43], and cited Creutz and Gocksch for the iterative form. Suzuki [2] and Yoshida [3] independently derived the iterative form in 1990.

The iterative form given by Creutz and Gocksch in 1989 [1] was the more general expression (12) ${\tilde{S}}_{2 κ - 2} {(u_{κ} t)}^{p} {\tilde{S}}_{2 κ - 2} ((1 - 2 {pu}_{κ}) t) {\tilde{S}}_{2 κ - 2} {(u_{κ} t)}^{p} .$ {\tilde S_{2\kappa - 2}}{({u_\kappa }t)^p}{\tilde S_{2\kappa - 2}}((1 - 2p{u_\kappa })t){\tilde S_{2\kappa - 2}}{({u_\kappa }t)^p}. A case of particular interest is the second fractal method, also derived by Suzuki in 1991 [4] (13) ${\tilde{S}}_{2 κ} (t) = {\tilde{S}}_{2 κ - 2} {(u_{κ} t)}^{2} {\tilde{S}}_{2 κ - 2} ((1 - 4 u_{κ}) t) {\tilde{S}}_{2 κ - 2} {(u_{κ} t)}^{2},$ {\tilde S_{2\kappa }}(t) = {\tilde S_{2\kappa - 2}}{({u_\kappa }t)^2}{\tilde S_{2\kappa - 2}}((1 - 4{u_\kappa })t){\tilde S_{2\kappa - 2}}{({u_\kappa }t)^2}, where u_κ = 1/(4 − 4^1/(2κ−1)). This method has the advantage that both u_κ and 1 − 4u_κ are less than 1, so the coefficients of higher-order formulae are small resulting in small error. The drawback is that far more exponentials are required. Each iteration uses 5 copies of the lower-order formula, whereas the previous one uses 3 copies. The virtue of these fractal methods is that they allow one to generate arbitrarily high-order product formulae easily, albeit at the expense of a large number of exponentials.

2.1.2.

Exponential Length Scaling of Fractal Methods

For the first fractal method, the total number of exponentials for a given order 2κ = 4, 6, 8, ... in the product formula S_2κ is given by (14) $2 (J - 1) 3^{κ - 1} + 1 .$ 2(J - 1){3^{\kappa - 1}} + 1. For example J = 2 for X + Y and κ = 1 just corresponds to S₂, and the expression gives 3 as expected. For the second fractal method ${\tilde{S}}_{2 κ}$ {\tilde S_{2\kappa }} the number of exponentials is (15) $2 (J - 1) 5^{κ - 1} + 1 .$ 2(J - 1){5^{\kappa - 1}} + 1. The number of exponential operators in both cases of the fractal method grows very rapidly. Thus one may be interested in alternative method to obtain product formulae with a lower count, such as the method of Yoshida in the next section.

2.2.

Yoshida's Method for Deriving 6th Order Product Formulae

Here, we give a pedagogical explanation of the method to derive product formulae.

2.2.1.

Approach

Rather than using Eq. (11) and Eq. (13), Yoshida uses the general procedure [3] (16) $S^{(m)} (t) = (\prod_{j = 1}^{m} S_{2} (w_{m - j + 1} t)) S_{2} (w_{0} t) (\prod_{j = 1}^{m} S_{2} (w_{j} t)),$ {S^{(m)}}(t) = \left( {\prod\limits_{j = 1}^m {S_2}({w_{m - j + 1}}t)} \right){S_2}({w_0}t)\left( {\prod\limits_{j = 1}^m {S_2}({w_j}t)} \right), where w_j ∈ ℝ for j = 0, 1, ⋯ , m are parameters to be determined. Note the number of exponentials in this product is given by (17) $(4 m + 2) (J - 1) + 1 .$ (4m + 2)(J - 1) + 1. Given this ansatz, the task becomes to find m and w_i such that S^(m) is an order k product formula. We will illustrate Yoshida's method by deriving the result for 6th order.

2.2.2.

Expand Yoshida Product Using Baker-Campbell-Haussdorf Formula

The method is to expand Eq. (16) using the BCH formula from Theorem 1 recursively as follows. First, note that from Corollary 1, $S_{2} (t) = e^{\frac{t}{2} X} e^{tY} e^{\frac{t}{2} X} = e^{t α_{1} + t^{3} α_{3} \dots}$ {S_2}(t) = {e^{{t \over 2}X}}{e^{tY}}{e^{{t \over 2}X}} = {e^{t{\alpha _1} + {t^3}{\alpha _3} \cdots }} where α_ℓ is a commutator of ℓ operators as described below.

Corollary 1.

We are for the moment considering sums of two terms X + Y . Define $C = \sum_{i = 1}^{\infty} t^{2 i - 1} w_{1}^{2 i - 1} α_{2 i - 1}$ C = \sum\nolimits_{i = 1}^\infty {t^{2i - 1}}w_1^{2i - 1}{\alpha _{2i - 1}} and $D = \sum_{i = 1}^{\infty} t^{2 i - 1} w_{0}^{2 i - 1} α_{2 i - 1}$ D = \sum\nolimits_{i = 1}^\infty {t^{2i - 1}}w_0^{2i - 1}{\alpha _{2i - 1}} . Then, (18) $\begin{array}{l} S_{2} (w_{1} t) S_{2} (w_{0} t) S_{2} (w_{1} t) & = e^{C} e^{D} e^{C} \\ = exp {tw}_{1} α_{1} + t^{3} w_{1}^{3} α_{3} + t^{5} w_{1}^{5} α_{5} + O (t^{7})\} exp {tw}_{0} α_{1} + t^{3} w_{0}^{3} α_{3} + t w_{0}^{5} α_{5} + O (t^{7})\} \\ \times exp {tw}_{1} α_{1} + t^{3} w_{1}^{3} α_{3} + t^{5} w_{1}^{5} α_{5} + O (t^{7})\} \\ = exp t (2 w_{1} + w_{0}) α_{1} + t^{3} (2 w_{1}^{3} + w_{0}^{3}) α_{3} + t^{5} (2 w_{1}^{5} + w_{0}^{5}) α_{5} + \frac{1}{6} ([D, D, C] - [C, C, D]) + O (t^{7})\} . \end{array}$ \matrix{ {{S_2}({w_1}t){S_2}({w_0}t){S_2}({w_1}t)} \hfill & { = {e^C}{e^D}{e^C}} \hfill \cr {} \hfill & { = \exp \left\{ {t{w_1}{\alpha _1} + {t^3}w_1^3{\alpha _3} + {t^5}w_1^5{\alpha _5} + {\cal O}({t^7})} \right\}\exp \left\{ {t{w_0}{\alpha _1} + {t^3}w_0^3{\alpha _3} + tw_0^5{\alpha _5} + {\cal O}({t^7})} \right\}} \hfill \cr {} \hfill & {\quad \times \;\exp \left\{ {t{w_1}{\alpha _1} + {t^3}w_1^3{\alpha _3} + {t^5}w_1^5{\alpha _5} + {\cal O}({t^7})} \right\}} \hfill \cr {} \hfill & { = \exp \left\{ {t(2{w_1} + {w_0}){\alpha _1} + {t^3}(2w_1^3 + w_0^3){\alpha _3} + {t^5}(2w_1^5 + w_0^5){\alpha _5} + {1 \over 6}([D,D,C] - [C,C,D]) + {\cal O}({t^7})} \right\}.} \hfill \cr } A simple computation shows (19) $[D, D, C] - [C, C, D] = t^{5} (w_{0}^{2} w_{1}^{3} - w_{1}^{2} w_{0}^{3} + w_{1}^{4} w_{0} - w_{0}^{4} w_{1}) [α_{1}, α_{1}, α_{3}] + O (t^{7}) .$ [D,D,C] - [C,C,D] = {t^5}(w_0^2w_1^3 - w_1^2w_0^3 + w_1^4{w_0} - w_0^4{w_1})[{\alpha _1},{\alpha _1},{\alpha _3}] + {\cal O}({t^7}). Define β₅ = [α₁, α₁, α₃] so (20) $\begin{array}{l} S_{2} (w_{1} t) S_{2} (w_{0} τ) S_{2} (w_{1} t) & = exp t (2 w_{1} + w_{0}) α_{1} + t^{3} (2 w_{1}^{3} + w_{0}^{3}) α_{3} + t^{5} (2 w_{1}^{5} + w_{0}^{5}) α_{5} \\ + t^{5} \frac{1}{6} (w_{0}^{2} w_{1}^{3} - w_{1}^{2} w_{0}^{3} + w_{0} w_{1}^{4} - w_{0}^{4} w_{1}) β_{5} + O (t^{7})\} . \end{array}$ \matrix{ {{S_2}({w_1}t){S_2}({w_0}\tau ){S_2}({w_1}t)} \hfill & { = \exp \left\{ {t(2{w_1} + {w_0}){\alpha _1} + {t^3}(2w_1^3 + w_0^3){\alpha _3} + {t^5}(2w_1^5 + w_0^5){\alpha _5}} \right.} \hfill \cr {} \hfill & {\quad \left. { + \;{t^5}{1 \over 6}(w_0^2w_1^3 - w_1^2w_0^3 + {w_0}w_1^4 - w_0^4{w_1}){\beta _5} + {\cal O}({t^7})} \right\}.} \hfill \cr } By an induction argument Yoshida shows that (21) $S^{(m)} (t) = exp {tA}_{1, m} α_{1} + t^{3} A_{3, m} α_{3} + t^{5} (A_{5, m} α_{5} + B_{5, m} β_{5}) + O (t^{7})\},$ {S^{(m)}}(t) = \exp \left\{ {t{A_{1,m}}{\alpha _1} + {t^3}{A_{3,m}}{\alpha _3} + {t^5}({A_{5,m}}{\alpha _5} + {B_{5,m}}{\beta _5}) + {\cal O}({t^7})} \right\}, where A_j,m and B_5,m are polynomials on the variables w₀, ... , w_m.

The case m = 0 is just the symmetric BCH formula, so it is clear that Eq. (21) holds with (22) $\begin{array}{l} A_{1, 0} = w_{0}, \\ A_{3, 0} = w_{0}^{3}, \\ A_{5, 0} = w_{0}^{5}, \\ B_{5, 0} = 0 . \end{array}$ \matrix{ {{A_{1,0}} = {w_0},} \hfill \cr {{A_{3,0}} = w_0^3,} \hfill \cr {{A_{5,0}} = w_0^5,} \hfill \cr {{B_{5,0}} = 0.} \hfill \cr } To prove Eq. (21) for m > 0, one needs to show that the exponential is of the form with operator α₁ for first order in t, operator α₃ for third order in t, and operators α₅ and β₅ for fifth order in t. This result may be shown using (23) $\begin{array}{l} S^{(m + 1)} (t) & = S_{2} (w_{m + 1} t) S^{(m)} (t) S_{2} (w_{m + 1} t) \\ = exp {tw}_{m + 1} α_{1} + t^{3} w_{m + 1}^{3} α_{3} + t^{5} w_{m + 1}^{5} α_{5} + O (t^{7})\} \\ \times exp {tA}_{1, m} α_{1} + t^{3} A_{3, m} α_{3} + t^{5} (A_{5, m} α_{3} + B_{5, m} β_{5}) + O (t^{7})\} \\ \times exp {tw}_{m + 1} α_{1} + t^{3} w_{m + 1}^{3} α_{3} + t^{5} w_{m + 1}^{5} α_{5} + O (t^{7})\} \\ = exp 2 {tw}_{m + 1} α_{1} + {tA}_{1, m} α_{1} + 2 t^{3} w_{m + 1}^{3} α_{3} + t^{3} A_{3, m} α_{3} + 2 t^{5} w_{m + 1}^{5} α_{5} + t^{5} A_{5, m} α_{5} + t^{5} B_{5, m} β_{5} \\ + \frac{1}{6} t^{5} (A_{1, m}^{2} w_{m + 1}^{3} - A_{1, m} A_{3, m} w_{m + 1} - w_{m + 1}^{2} A_{3, m} + w_{m + 1}^{4} A_{1, m}) β_{5} + O (t^{7})\} . \end{array}$ \matrix{ {{S^{(m + 1)}}(t)} \hfill & { = {S_2}({w_{m + 1}}t){S^{(m)}}(t){S_2}({w_{m + 1}}t)} \hfill \cr {} \hfill & { = \exp \left\{ {t{w_{m + 1}}{\alpha _1} + {t^3}w_{m + 1}^3{\alpha _3} + {t^5}w_{m + 1}^5{\alpha _5} + {\cal O}({t^7})} \right\}} \hfill \cr {} \hfill & {\quad \times \;\exp \left\{ {t{A_{1,m}}{\alpha _1} + {t^3}{A_{3,m}}{\alpha _3} + {t^5}({A_{5,m}}{\alpha _3} + {B_{5,m}}{\beta _5}) + {\cal O}({t^7})} \right\}} \hfill \cr {} \hfill & {\quad \times \;\exp \left\{ {t{w_{m + 1}}{\alpha _1} + {t^3}w_{m + 1}^3{\alpha _3} + {t^5}w_{m + 1}^5{\alpha _5} + {\cal O}({t^7})} \right\}} \hfill \cr {} \hfill & { = \exp \left\{ {2t{w_{m + 1}}{\alpha _1} + t{A_{1,m}}{\alpha _1} + 2{t^3}w_{m + 1}^3{\alpha _3} + {t^3}{A_{3,m}}{\alpha _3} + 2{t^5}w_{m + 1}^5{\alpha _5} + {t^5}{A_{5,m}}{\alpha _5} + {t^5}{B_{5,m}}{\beta _5}} \right.} \hfill \cr {} \hfill & {\quad \left. { + \;{1 \over 6}{t^5}(A_{1,m}^2w_{m + 1}^3 - {A_{1,m}}{A_{3,m}}{w_{m + 1}} - w_{m + 1}^2{A_{3,m}} + w_{m + 1}^4{A_{1,m}}){\beta _5} + {\cal O}({t^7})} \right\}.} \hfill \cr } Hence, if the product formula can be expressed as in the form Eq. (21) for S^(m)(t), it can again be expressed in this form for S^(m+1)(t), establishing it for all m ≥ 0 by induction. This expression also shows that the scalar coefficients can be determined from the formulae (24) $\begin{array}{l} A_{1, m + 1} = 2 w_{m + 1} + A_{1, m}, \\ A_{3, m + 1} = 2 w_{m + 1}^{3} + A_{3, m}, \\ A_{5, m + 1} = 2 w_{m + 1}^{5} + A_{5, m}, \\ B_{5, m + 1} = B_{5, m} + \frac{1}{6} (A_{1, m}^{2} w_{m + 1}^{3} - A_{1, m} A_{3, m} w_{m + 1} - w_{m + 1}^{2} A_{3, m} + w_{m + 1}^{4} A_{1, m}) . \end{array}$ \matrix{ {{A_{1,m + 1}} = 2{w_{m + 1}} + {A_{1,m}},} \hfill \cr {{A_{3,m + 1}} = 2w_{m + 1}^3 + {A_{3,m}},} \hfill \cr {{A_{5,m + 1}} = 2w_{m + 1}^5 + {A_{5,m}},} \hfill \cr {{B_{5,m + 1}} = {B_{5,m}} + {1 \over 6}(A_{1,m}^2w_{m + 1}^3 - {A_{1,m}}{A_{3,m}}{w_{m + 1}} - w_{m + 1}^2{A_{3,m}} + w_{m + 1}^4{A_{1,m}}).} \hfill \cr } See Appendix A for an explanation of how to extend this approach to 10th order.

2.2.3.

Constraints to Satisfy in Order to Derive 6th Order Formula

To derive a 6th order formula, the lower-order terms in the exponential in Eq. (21) must be zero (see also Eq. (5.16) of [3]), which gives the four conditions (25) $A_{1, m} = 1, A_{3, m} = 0, A_{5, m} = 0, B_{5, m} = 0 .$ {A_{1,m}} = 1,\quad {A_{3,m}} = 0,\quad {A_{5,m}} = 0,\quad {B_{5,m}} = 0. For m = 3 there are four unknowns w₀ to w₃, and it can be expected there are solutions because there are the same number of equations as unknowns. In practice A_1,m = 1 is satisfied by taking w₀ = 1 − 2 ∑_j w_j, so there are then three equations for three unknowns w₁, w₂, w₃. Whereas it is possible to solve the equations using the Newton-Raphson method, the expression for the appropriate Jacobian matrix is complicated, so Yoshida instead uses the Brent method. Yoshida produced three m = 3 solutions in this way, and states “It seems that there is no other solution.” We have performed an extensive search and also found no more solutions.

The product formulae obtained through the Yoshida method also work for exponentials of sums of more operators. The S₂ product formula can again be used as the building block for the product formula, and we can write (26) $S_{2} (t) = exp (\sum_{ℓ = 0}^{\infty} {\tilde{α}}_{ℓ} t^{ℓ}),$ {S_2}(t) = \exp \left( {\sum\limits_{\ell = 0}^\infty {{\tilde \alpha }_\ell }{t^\ell }} \right), where ${\tilde{α}}_{ℓ}$ {\tilde \alpha _\ell } are now order-ℓ multicommutator expressions on the J terms. The reasoning for finding the product formula is entirely based on the construction with α_ℓ, but without taking advantage of its particular form, so exactly the same reasoning holds for ${\tilde{α}}_{ℓ}$ {\tilde \alpha _\ell } . This immediately implies that the higher-order product formulae work in general. This is an advantage of constructing product formulae as products of S₂.

2.3.

Processed Product Formulae

Another technique to obtain higher-order product formulae is that of processing [31,34,35,36,37,38,39]. In this technique a product formula S_k of order k is generated by the composition of a kernel Σ conjugated by a processor operator P as (27) $S_{k} = P Σ P^{- 1} .$ {S_k} = P\Sigma {P^{ - 1}}. The advantage of this method is that usually Σ requires fewer stages than other methods, and due to the construction, we have that $S_{k}^{n} = P Σ^{n} P^{- 1}$ S_k^n = P{\Sigma ^n}{P^{ - 1}} . This means that the cost of using the product formula is effectively that of the kernel in the usual application where a long time evolution is partitioned into many repetitions of the product formula for short times.

Typically, one chooses a kernel Σ as a product formula that has a smaller order than k, but conjugating by processors gives an order k integrator. Some of the best-performing kernels are given in Ref. [31], and recently new kernels were reported in [40]. The advantage of this method is that the number of non-linear equations required to be solved to find new product formulae is reduced. One could use a shorter minimum-length product formula, but best performance is provided by increasing the length of a product formula beyond the minimum. The more the number of free parameters exceeds the number of equations, the more freedom there is to find product formulae with low error. Reducing the number of equations by solving for the kernel enables reduced error for a given length.

To be more specific, Ref. [31] gives a set of conditions for the kernels and processors in Table 4 of that work. Reference [31] uses the notation Y_j rather than α_j, so S₂(t) = e^{tY₁+t³Y₃⋯}. It then uses the notation E_j,i for multicommutator expressions, with E_j,1 = Y_j ≡ α_j and for example (28) $E_{5, 2} = Y_{1}, Y_{1}, Y_{3}] \equiv β_{5},$ {E_{5,2}} = \left[ {{Y_1},{Y_1},{Y_3}} \right] \equiv {\beta _5}{\kern 1pt} , (29) $E_{7, 2} = Y_{3}, Y_{1}, Y_{3}] \equiv - γ_{7},$ {E_{7,2}} = \left[ {{Y_3},{Y_1},{Y_3}} \right] \equiv - {\gamma _7}{\kern 1pt} , where γ₇ is given in Appendix A. In Ref. [31] the quantities f_j,i are the coefficients of E_j,i, so the equivalent of Eq. (21) in that notation is (30) $S^{(m)} (t) = exp {tf}_{1, 1} E_{1, 1} + t^{3} f_{3, 1} E_{3, 1} + t^{5} (f_{5, 1} E_{5, 1} + f_{5, 2} E_{5, 2}) + O (t^{7})\} .$ {S^{(m)}}(t) = \exp \left\{ {t{f_{1,1}}{E_{1,1}} + {t^3}{f_{3,1}}{E_{3,1}} + {t^5}({f_{5,1}}{E_{5,1}} + {f_{5,2}}{E_{5,2}}) + {\cal O}({t^7})} \right\}{\kern 1pt} .

To obtain the conditions for a 6th order kernel from Ref. [31], one should use the conditions for f_j,i up to q = 5 in Table 4 of that work, which give f_1,1 = 1, f_3,1 = 0, f_5,1 = 0, which in the above notation are equivalent to (31) $A_{1, m} = 1, A_{3, m} = 0, A_{5, m} = 0 .$ {A_{1,m}} = 1,\quad {A_{3,m}} = 0,\quad {A_{5,m}} = 0. The conditions for the kernel are the same as for the complete product formula, but missing B_5,m = 0. That is a general feature of the kernel order conditions; they are a subset of the conditions for the complete product formula, enabling shorter kernels, or more flexibility to choose kernels of the same length but lower error.

For 4th order there are the same number of conditions, so the kernel is a valid product formula of that order. For the processors, Ref. [31] uses the notation p_j,i for the coefficients of E_j,i in the expansion of P . These need to be determined from the BCH formula rather than symmetric BCH, and instead of only odd-order terms there need to be only even-order terms. Some symmetries mean that low-order odd terms automatically cancel, but higher-order ones need to be made zero by the choice of P . Table 4 in Ref. [31] gives the conditions on the even-order terms. The first processor condition depending on f_j,i is for q = 5, which means that for 6th order and higher the processors are dependent on the kernel. The 4th order kernel is already 4th order, and the processors will yield another 4th order product formula, but can be chosen to reduce the error (which arises from higher-order terms). See Appendix B for more detail on the method of determining processors.

2.4.

More General Product Formulae

The form of product formulae constructed from products of S₂ can be generalised in a number of ways. First, one may choose different weightings in the two parts of Eq. (4), so one has (32) $S (t_{1}, t_{2}) = \prod_{j = 1}^{J} exp (\frac{1}{2} X_{j} t_{1})] \prod_{j = J}^{1} exp (\frac{1}{2} X_{j} t_{2})] .$ S({t_1},{t_2}) = \left[ {\prod\limits_{j = 1}^J \exp \left( {{1 \over 2}{X_j}{t_1}} \right)} \right]\left[ {\prod\limits_{j = J}^1 \exp \left( {{1 \over 2}{X_j}{t_2}} \right)} \right]. That is, one still alternates between the order of applying the exponentials, but allows the times to be different. This form is no longer symmetric, but there are twice as many parameters for a given length. In the literature on symplectic integrators, the notation for product formulae of this type is [33] (33) $χ_{α_{2 s} h} \circ χ_{α_{2 s - 1} h}^{*} \circ \dots \circ χ_{α_{2} h} \circ χ_{α_{1} h}^{*},$ {\chi _{{\alpha _{2s}}h}} \circ \chi _{{\alpha _{2s - 1}}h}^* \circ \cdots \circ {\chi _{{\alpha _2}h}} \circ \chi _{{\alpha _1}h}^*{\kern 1pt} , where the asterisk indicates that the exponentials are performed in reverse order (rather than conjugation).

In the case where the Hamiltonian is a sum of only two parts, then a product formula may be written in the form (34) $e^{w_{m} Xt} e^{w_{m - 1} Yt} \dots e^{w_{1} Xt} e^{w_{0} Yt} e^{w_{1} Xt} \dots e^{w_{m - 1} Yt} e^{w_{m} Xt},$ {e^{{w_m}Xt}}{e^{{w_{m - 1}}Yt}} \cdots {e^{{w_1}Xt}}{e^{{w_0}Yt}}{e^{{w_1}Xt}} \cdots {e^{{w_{m - 1}}Yt}}{e^{{w_m}Xt}}{\kern 1pt} , if m is odd, or ending in an exponential of Y if m is even. Here we have written the product formula as symmetric, but one can also consider product formulae without that constraint. Again there are twice as many parameters as for a product formula constructed as a product of S₂. Product formulae of this type are often written as [33] (35) $φ_{a_{s + 1} h}^{[1]} \circ φ_{b_{s + 1} h}^{[2]} \circ φ_{a_{s} h}^{[1]} \circ \dots \circ φ_{a_{2} h}^{[1]} \circ φ_{b_{1} h}^{[2]} \circ φ_{a_{1} h}^{[1]},$ \varphi _{{a_{s + 1}}h}^{[1]} \circ \varphi _{{b_{s + 1}}h}^{[2]} \circ \varphi _{{a_s}h}^{[1]} \circ \cdots \circ \varphi _{{a_2}h}^{[1]} \circ \varphi _{{b_1}h}^{[2]} \circ \varphi _{{a_1}h}^{[1]}{\kern 1pt} , where φ^[1] and φ^[2] are indicating the two different exponentials.

In the case where the Hamiltonian is a sum of two terms, the product of S(t₁, t₂) gives a form as in Eq. (34), and conversely the product formula as in Eq. (34) can be expressed as a product of S(t₁, t₂). In Section 5 we give results for testing some product formulae of this type, and find that they give the best results for 4th and 6th order. This method is mostly considered only up to 6th order because for higher orders the number of order conditions becomes significantly larger (see Table I of Ref. [31]).

Another class of product formulae is those constructed from products of fourth-order product formulae S₄. Some product formulae have been constructed in that way; for example, see Ref. [31]. We have also tested those (using the fourth-order fractal product formulae), but do not include them in the comparison in Section 5. We found that, although they reduced the error, the much larger number of exponentials in the product meant that they were not competitive. Part of the reason is that the best gains are obtained when there are significantly more free parameters to adjust than the number of equations to satisfy. This is the reason to use product formulae with more than the minimum length. Constructing a product formula from S₄ increases the length without the added advantage of more free parameters to adjust.

Another option is to construct product formulae from a different second-order product formula. For example, a more accurate second-order method is provided in Ref. [24]. In our testing, we find that yields slightly more accurate product formulae, but with twice the length, making them uncompetitive with product formulae constructed from the standard second-order symmetric product formula S₂.

2.5.

Solution Using Taylor Expansion

Another solution method is based on computing the Taylor expansion of both the exact exponential and its product formula approximation with given w_j. This method is not normally used because it is much more computationally intensive than the symmetric BCH approach. Here, we describe how to effectively code this method for numerical solution. The Taylor expansion is performed on both sides up to the desired order of approximation for the integrator. By imposing equality on terms of the same order one can obtain a series of equations for w_j which can be solved. For higher orders, a large number of simultaneous equations are obtained, so we need an automated way of generating them.

To make precise the problem we are solving, denote as 𝒯_k[⋅] the map giving the Taylor expansion in t around 0, truncated at order k, so (36) $\begin{array}{l} T_{k} [e^{t (X + Y)}] & = \sum_{p = 0}^{k} \frac{t^{p}}{p!} {(X + Y)}^{p} \\ = \sum_{p = 0}^{k} \frac{t^{p}}{p!} \sum_{r_{1}, \dots, r_{p} = 0}^{1} X^{r_{1}} Y^{1 - r_{1}} \dots X^{r_{p}} Y^{1 - r_{p}} . \end{array}$ \matrix{ {{{\cal T}_k}[{e^{t(X + Y)}}]} \hfill & { = \sum\limits_{p = 0}^k {{{t^p}} \over {p!}}{{(X + Y)}^p}} \hfill \cr {} \hfill & { = \sum\limits_{p = 0}^k {{{t^p}} \over {p!}}\sum\limits_{{r_1}, \cdots ,{r_p} = 0}^1 {X^{{r_1}}}{Y^{1 - {r_1}}} \cdots {X^{{r_p}}}{Y^{1 - {r_p}}}.} \hfill \cr } We consider a sum of two operators X + Y , but this approach for solving for w_j will also be sufficient to provide product formulae for sums of arbitrary numbers of terms. That is because the solutions must also be solutions of the equations derived using Yoshida's method, and as explained above those equations will be the same when considering sums of arbitrary numbers of terms. Now consider the ansatz operator of Yoshida from Eq. (16) (37) $\begin{array}{l} S^{(m)} (t, w_{1}, \dots, w_{m}) & = e^{{tw}_{m} X / 2} e^{{tw}_{m} Y} e^{t (w_{m} + w_{m - 1}) X / 2} e^{{tw}_{m - 1} Y} e^{t (w_{m - 1} + w_{m - 2}) X / 2} \\ \dots e^{{tw}_{0} Y} e^{t (w_{1} + w_{0}) X / 2} e^{{tw}_{1} Y} \dots e^{{tw}_{m} X / 2} \\ = e^{{tc}_{1} X} e^{{tc}_{2} Y} \dots e^{{tc}_{4 m + 3} X}, \end{array}$ \matrix{ {{S^{(m)}}(t,{w_1}, \cdots ,{w_m})} \hfill & { = {e^{t{w_m}X/2}}{e^{t{w_m}Y}}{e^{t({w_m} + {w_{m - 1}})X/2}}{e^{t{w_{m - 1}}Y}}{e^{t({w_{m - 1}} + {w_{m - 2}})X/2}}} \hfill \cr {} \hfill & {\quad \cdots {e^{t{w_0}Y}}{e^{t({w_1} + {w_0})X/2}}{e^{t{w_1}Y}} \cdots {e^{t{w_m}X/2}}} \hfill \cr {} \hfill & { = {e^{t{c_1}X}}{e^{t{c_2}Y}} \cdots {e^{t{c_{4m + 3}}X}},} \hfill \cr } where in the last line we have defined the constants c₁ = w_m/2, c₂ = w_m, c₃ = (w_m + w_m−1)/2, ⋯ c_4m+3 = w_m/2. Taylor expanding this ansatz up order k gives (38) $T_{k} [S^{(m)} (t, w_{1}, \dots, w_{m})] = \sum_{\begin{matrix} r_{1}, \dots, r_{4 m + 3} = 0 \\ r_{1} + \dots + r_{4 m + 3} \leq k \end{matrix}}^{k} \frac{t^{r_{1} + \dots + r_{4 m + 3}}}{r_{1}! \dots r_{4 m + 3}!} c_{1}^{r_{1}} \dots c_{4 m + 3}^{r_{4 m + 3}} X^{r_{1}} Y^{r_{2}} \dots X^{r_{k - 1}} Y^{r_{4 m + 3}} .$ {{\cal T}_k}[{S^{(m)}}(t,{w_1}, \ldots ,{w_m})] = \sum\limits_{{\scriptstyle {r_1}, \cdots ,{r_{4m + 3}} = 0 \atop \scriptstyle {r_1} + \ldots + {r_{4m + 3}} \le k} }^k {{{t^{{r_1} + \ldots + {r_{4m + 3}}}}} \over {{r_1}! \cdots {r_{4m + 3}}!}}c_1^{{r_1}} \ldots c_{4m + 3}^{{r_{4m + 3}}}{X^{{r_1}}}{Y^{{r_2}}} \ldots {X^{{r_{k - 1}}}}{Y^{{r_{4m + 3}}}}. Note that the total number of exponentials in Yoshida's ansatz is 4m + 3.

The product formula obtained from the solution procedure needs to be independent of the choice of X and Y , so we need to match the coefficients for each sequence of products of X and Y . Because X and Y are non-commuting, we need to record coefficients for each ordered sequence. To do this in an automated way we construct a data structure to store the coefficients.

Given operators of the form $e^{cX} = I + cX + \frac{c^{2}}{2!} X^{2} + \frac{c^{3}}{3!} X^{3} + \dots$ {e^{cX}} = I + cX + {{{c^2}} \over {2!}}{X^2} + {{{c^3}} \over {3!}}{X^3} + \cdots and $e^{dY} = I + dY + \frac{d^{2}}{2!} Y^{2} + \frac{d^{3}}{3!} Y^{3} + \dots$ {e^{dY}} = I + dY + {{{d^2}} \over {2!}}{Y^2} + {{{d^3}} \over {3!}}{Y^3} + \cdots with c, d scalar coefficients and X, Y general operators, we describe this Taylor expansion up to order k using an array encoding. First, write monomials composed of X and Y operators in lexicographical order and note that these operators can be mapped to binary numbers: (39) $\begin{matrix} I & X & Y & XX & XY & YX & YY & XXX & XXY & \dots \\ 1 & 10 & 11 & 100 & 101 & 110 & 111 & 1000 & 1001 & \dots \\ 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 & \dots \end{matrix}$ \matrix{ I & X & Y & {XX} & {XY} & {YX} & {YY} & {XXX} & {XXY} & \cdots \cr 1 & {10} & {11} & {100} & {101} & {110} & {111} & {1000} & {1001} & \cdots \cr 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 & \cdots \cr }

To construct a bit string, we map each X to 0 and each Y to 1, then place a 1 on the left to flag the length of the string, as shown in the second line of Eq. (39). Then, to obtain the operator products, simply remove the leading 1 and then map 0 to X and 1 to Y . The empty string corresponds to the identity I. Now, to store the coefficients in a sum of products of X and Y , convert each product to a binary integer as above, then store the coefficient in the corresponding location in a vector. This approach enables products of Taylor expansions of exponentials to be rapidly calculated.

By way of illustration, consider the polynomial 10I + 3X + 2Y + 2X² + Y X. This operator would be stored in an array as [10, 3, 2, 2, 0, 1, 0, ⋯]. In this way, any polynomial of X and Y can be efficiently stored in a vector. We denote this vector storing the coefficients of operators of order up to k as vec_k[p(X, Y)], where k denotes that the vector will only store the coefficients of the corresponding operators up to order k (so a vector of length 2^k+1 − 1) and p(X, Y) is the polynomial in terms of X and Y.

3.

Product Formula Search

First we describe the numerical methods, then summarise solutions we found at 8th and 10th order.

3.1.

Numerical methods

The size of the product formula we consider as in Eq. (16) is governed by the value of m, so the number of S₂ in the product is M = 2m + 1. The number of free parameters is m, and it needs to be a least a large as the number of independent equations in order for there to be a solution. Yoshida [3] finds that m = 7 is minimal for 8th order, and m = 15 is minimal for 10th order [29]. Choosing the minimal m typically yields product formulae with large error, and for that reason we focus on larger values of m.

For the numerical solution of the simultaneous non-linear equations, we found that Matlab's fsolve (using the Levenberg-Marquardt algorithm) provided rapid results using the vector of errors. The method was to repeatedly solve from random starting vectors $\vec{w}$ \vec w generated according to the normal distribution. The best solutions were those with smaller values for the coefficients; for much of the calculations we selected standard deviations of 0.6 for 8th order and 0.9 for 10th order. We filtered a large number of solutions by numerically checking the error for example Hamiltonians, and selecting those that provided the best performance. We then perform tests for larger numbers of samples, to more accurately select the best performing product formulae. We also further refined the solutions to give the results to extended precision, which enables testing of the error with smaller values of t. We were also able to further reduce the error by using an alternative solution method using the Taylor series.

The Taylor series approach could be used to solve for product formulae, but is more computationally intensive because it produces an error vector that is exponentially large in the order. However, it is very effective at further improving the performance of solutions that have already been found. If a solution is of 8th order (for example), then we can consider the error using the Taylor expansion up to and including the 9th order. This will give non-zero error for the 8th order product formula, so minimising the error gives a new product formula. That can be used as a starting point to again solve for an 8th order formula, and often it is found that the product formula adjusted in that way has reduced error.

3.2.

Improved 8th Order

Following these methods, we have searched for new product formulae and found thousands of product formulae of 8th order. Once we have obtained the potential solutions, we generate random Hamiltonians and compute the product formula errors for a chosen short time. The error for 8th order product formulae is given by δ(t) ∝ t⁹, but the constant will depend on the measure of error used. One could use the spectral-norm error for a single time step, but a more appropriate measure of the performance of the product formulae is the accuracy of the eigenvalues. In the following we use χ for the constant factor based on spectral-norm error and ζ for the constant based on eigenvalue error, with χ = δ/t⁹ or ζ = δ/t⁹.

For each product formula, we compute an average constant factor error; this average corresponds to the geometric mean of the constant factors computed for each random Hamiltonian. See Section 5 for more details of the numerical methods, and the comprehensive comparison of product formulae. This method of comparing the performance of product formulae through the estimation of the constant factor in the error has been used before (see for example [32]) and is considered a good approximation to the performance of the product formula in practice.

The case m = 7 yields product formulae of minimal length because the number of equations is equal to the number of unknowns. We have found over 600 solutions, and the search now finds almost only repeated solutions and very few new solutions. This indicates that we have found nearly all solutions, but it is also possible that there are many more solutions with large values of $\vec{w}$ \vec w . Indeed, the most recent new solutions we found have significantly larger values of $\vec{w}$ \vec w . We find that large values of $\vec{w}$ \vec w correspond to worse product formulae with larger error. Therefore, even if there are many more solutions with large $\vec{w}$ \vec w , they likely will not give improved performance over those we have already found.

This search for m = 7 (15 stages) verified that the best-performing product formula corresponds to that previously reported in Refs. [24] and [25]. Similarly, a numerical search with m = 8 (17 stages) yielded the best-performing product formula close to solutions s17odr8a and s17odr8b given in Ref. [25]. It is not exactly the same, but the small difference is likely due to the system of equations being underdetermined. The extensive nature of our search indicates that these product formulae reported in Refs. [24,25] are optimal for 8th order product formulae with m = 7 and m = 8.

To provide improved performance over prior work, we extended the search to m = 10 (21 stages), and found over 100,000 solutions. This size of 8th order product formula was previously studied in Ref. [29], but we find solutions with significantly lower error. Our best solutions for spectral-norm and eigenvalue error are given in Table 1. The solution selected for minimum spectral-norm error has a constant of χ = 5.8 × 10⁻⁸. Its constant factor for eigenvalue error ζ = 7.0 × 10⁻⁹ is significantly smaller, but the best performance is obtained by selecting for minimum eigenvalue error. The second solution given in Table 1 has the significantly smaller constant factor in the error ζ = 5.4 × 10⁻¹⁰.

Table 1.

Our best-performing 8th order solutions when setting m = 10.

	Best 8th order for spectral-norm error	Best 8th order for eigenvalue error

w₁	0.59358060400850625863514059265224	0.10467636532245895252340732579853
w₂	−0.46916012347004197296293264921328	−0.57896999331780988041471955125778
w₃	0.2743566425898467907228242878146	0.57503350160061785946141563279891
w₄	0.17193879484656773059919074965377	0.12231011868707029786561397542663
w₅	0.23439874482541384415430578747541	0.27793149999039524816733903301747
w₆	−0.48616424480326193899617759997914	−0.37349605088056728482635987352576
w₇	0.49617367388114660354871757044906	0.11575566589480463220616543972403
w₈	−0.32660218948439130114501815323814	0.1464645610975800618712569230326
w₉	0.23271679349369857679445410270557	−0.39443578322284085764474498594073
w₁₀	0.098249557414708533273471906180643	0.44370228726021218923197141183196

These constant factors all improve over those provided by product formulae in prior work. Our solution with smallest eigenvalue error is well over 100 times more accurate than the m = 10 product formula in Ref. [29], and about 100 times more accurate than the m = 9 product formula in that work. See Section 5 for the detailed numerical testing of the product formulae.

A further improvement in product formulae can be obtained by using processing. We have performed a search for high-accuracy 8th order product formulae using this method with m = 8, and found one particular example that provides excellent performance; see Table 2. This kernel provides a constant of ζ = 8.1 × 10⁻¹⁰, only slightly larger than the best length m = 10 product formula, while providing better performance due to the shorter length. This error is nearly 300 times smaller than provided by 8th order processed product formulae in prior work [31]. Therefore, we have provided a sequence of three solutions that all improve over prior work, with successively better and better performance.

Table 2.

Our best-performing 8th order kernel and processor when setting m = 8. The value of γ₁₀ for the processor is chosen such that the sum is zero.

	Best 8th order kernel	Processor for kernel

w₁	0.21784176681731006074681969186513	−0.44324901019570126590495430949294	γ₁
w₂	0.1947017706053903224022456342907	0.25459857192003772850622377066944	γ₂
w₃	0.18372413281145589944261642180363	−0.73862036266779261573694538099739	γ₃
w₄	−0.37307499512657736825709230652023	−0.00024139614958652134370419495289618	γ₄
w₅	0.15757644257569146373033662060461	0.73873460354125365739379753874964	γ₅
w₆	−0.33342207567391682979227850551172	−0.20285971152536085519251666906017	γ₆
w₇	0.51788649682987924281787142226803	0.44989521689676869571827637424046	γ₇
w₈	0.21456475499897766986381219621761	0.29538398007876871184026747505657	γ₈
		−0.3364996155865700091428329802017	γ₉

To provide an accurate result in terms of the spectral norm (rather than just the eigenvalues) it is necessary to solve for the processor. For the kernel there are many solutions for the processor, but we list the one with the best performance of the solutions we have found. This one was found by solving for many (hundreds of thousands) of processors, picking the one with the best performance, and further optimising. Moreover, using the processor the constant for the spectral-norm error is χ = 5.3 × 10⁻⁸. That is even better than the best length m = 10 (nonprocessed) formula above.

3.3.

Finding 10th Order

We have also used our solution procedure to find new 10th order product formulae. For Yoshida's method applied to 10th order, there are 15 independent equations to be solved (see Appendix A). We performed searches for solutions with m = 15 (the minimal number) to m = 18. Again the larger values of m give the flexibility to adjust the solution to reduce the error. As in Section 3.2, we compare the performance of product formulae of 10th order by computing the constant factors χ and ζ for random Hamiltonians.

Out of our solutions, the best eigenvalue performance is given by a m = 17 solution (see Table 3), which yields ζ = 6.1 × 10⁻¹¹. This formula is slightly outperformed by one product formula in Ref. [29], but for the purpose of quantum computing we will show that 8th order provides better performance than 10th order. For this reason we did not perform further searches for processed 10th order product formulae or proceed to any higher orders.

Table 3.

The 10th order solution with lowest eigenvalue error in our numerical search.

	10th order solution with m = 17

w₁	−0.28371232689144296279654621726493
w₂	0.046779504778147381605331000278223
w₃	0.36845892382797770619657504217539
w₄	0.19186204094674514739760408197461
w₅	−0.53123134392680669702873064192428
w₆	−0.0081253242720827266680816105600661
w₇	−0.16389450414378567860032917538393
w₈	0.18514766119291405032528647881
w₉	0.5383584694754681989174668806505
w₁₀	−0.30583981835573485697292316732177
w₁₁	0.43199935609523301289295473774488
w₁₂	0.1510502301631786853020124612813
w₁₃	−0.35051099204829676098801520498121
w₁₄	0.1032971125844291674511513007661
w₁₅	0.15043936943817152697371946806229
w₁₆	0.12118469498650736511410491586846
w₁₇	0.10437742779547826358296681557444

In the search for 10th order product formulae, unlike in the case of 8th order, we find that almost all new solutions found are different from those found before. That indicates there is an extremely large number of solutions, and we have only found a very small proportion of them. There is still the potential for better-performing solutions to be found in this order, but we expect that the longer length will make them unsuitable for quantum computing.

4.

Methods for Comparison of Product Formulae

4.1.

Same Order Comparison

A fair comparison between product formulae of different lengths may be made in the following way. An order k product formula for time t will have an error δ = ζt^k⁺¹ (using ζ for eigenvalue error, or δ = χt^k⁺¹ for the case of spectral-norm error). Let T be the total evolution time for a product formula of order k, and ε be the maximum allowable error. Subdivide the evolution time T into r subintervals, so t = T /r is the length of each time subinterval. We thus have ζ(T /r)^k+1 ≈ ε/r, which gives (40) $r \approx {(\frac{ζ T}{ε})}^{1 / k} T .$ r \approx {\left( {{{\zeta T} \over \epsilon }} \right)^{1/k}}T. As explained above (see Eq. (17)), the number of exponentials in the product is (4m + 2)(J − 1) + 1. When applying products of these product formulae, two exponentials can be combined, so the effective number for each is (4m + 2)(J − 1). As a result, the total number of exponentials can be given as approximately (41) $2 (J - 1) M {(\frac{ζ T}{ε})}^{1 / k} T,$ 2(J - 1)M{\left( {{{\zeta T} \over \epsilon }} \right)^{1/k}}T, where M = 2m + 1. The only quantities that vary between different product formulae of the same order are M and ζ, so to compare product formulae, we need only compare the values of Mζ^1/k; the one with the smaller value is the more efficient product formula. Similarly, if we consider spectral-norm error then we should compare Mχ^1/k between the formulae.

As another motivation for this form, consider the case where the task is to estimate an eigenvalue of the Hamiltonian. Then one would use an ancilla register to control the time of evolution, and use phase estimation. If we are controlling between the forward and reverse time evolution as per Ref. [45], then the (root-mean square) error in estimating the phase of e^{−iHT /r} is given by (42) $ε_{phase} = \frac{π}{2 r} .$ {\epsilon _{{\rm{phase}}}} = {\pi \over {2r}}. For the error in the simulation of e^{−iHT /r} to be no more than this phase estimation error, we should have (43) $\frac{π}{2 r} \approx ζ {(T / r)}^{k + 1},$ {\pi \over {2r}} \approx \zeta {(T/r)^{k + 1}}, or (44) $r \approx {(\frac{2 ζ T}{π})}^{1 / k} T .$ r \approx {\left( {{{2\zeta T} \over \pi }} \right)^{1/k}}T. If ε_H is the allowed error in estimation of the eigenvalue of H, then it would correspond to π/(2T). Then, in terms of ε_H the choice of r is (45) $r \approx {(\frac{ζ}{ε_{H}})}^{1 / k} \frac{π}{2 ε_{H}} .$ r \approx {\left( {{\zeta \over {{\epsilon _H}}}} \right)^{1/k}}{\pi \over {2{\epsilon _H}}}{\kern 1pt} . That would correspond to a total number of exponentials (46) $π (J - 1) M {(\frac{ζ}{ε_{H}})}^{1 / k} ε_{H} .$ \pi (J - 1)M{\left( {{\zeta \over {{\epsilon _H}}}} \right)^{1/k}}{\epsilon _H}. Again the constant factor dependent on the product formula is Mζ^1/k, so it is this expression that should be compared between formulae. In terms of the overall complexity, this is similar to simulation of the evolution for time T = π/(2ε_H) with error ε = π/2.

4.2.

Thresholds for Different Order

If we wish to compare product formulae of different order, then we need to take account of the values of T and ε. Assume we have two product formulae of order k₁ and k₂, with corresponding constants ζ₁, ζ₂. Given T and ε, when the two product formulae use the same number of exponentials we have M₁r₁ = M₂r₂, thus (47) $M_{1} {(\frac{ζ_{1} T}{ε})}^{1 / k_{1}} T = M_{2} {(\frac{ζ_{2} T}{ε})}^{1 / k_{2}} T$ {M_1}{\left( {{{{\zeta _1}T} \over \epsilon }} \right)^{1/{k_1}}}T = {M_2}{\left( {{{{\zeta _2}T} \over \epsilon }} \right)^{1/{k_2}}}T (48) $\Rightarrow \frac{T}{ε} = {(\frac{M_{2} ζ_{2}^{1 / k_{2}}}{M_{1} ζ_{1}^{1 / k_{1}}})}^{\frac{1}{\frac{1}{k_{1}} - \frac{1}{k_{2}}}} .$ \Rightarrow \;\;\;{T \over \epsilon } = {\left( {{{{M_2}\zeta _2^{1/{k_2}}} \over {{M_1}\zeta _1^{1/{k_1}}}}} \right)^{{1 \over {{1 \over {{k_1}}} - {1 \over {{k_2}}}}}}}. This gives the threshold beyond which the higher-order product formula should be used for improved performance. Again, when considering spectral-norm error, ζ would be replaced with χ. In the case of eigenvalue estimation, the threshold is for 1/ε_H rather than T /ε.

A limitation of this analysis is that it is assumed the time step (ε/(ζT))_1/k is small enough for the scaling law for the error to hold. To adjust the threshold for cases where the time step is large, we can consider a more general functional dependence of the error on the time interval as f(t). Then we require for time T and error ε that (49) $f (t) \times T / t = ε .$ f(t) \times T/t = \epsilon {\kern 1pt} . The cost is then proportional to M × r = MT /t. For the threshold between two product formulae, we require (50) $M_{1} T / t_{1} = M_{2} T / t_{2} .$ {M_1}T/{t_1} = {M_2}T/{t_2}{\kern 1pt} . That implies t₂ = t₁M₂/M₁. For the total error to be equal to ε in each case, we then require (51) $f_{1} (t_{1}) \times T / t_{1} = f_{2} (t_{2}) \times T / t_{2} .$ {f_1}({t_1}) \times T/{t_1} = {f_2}({t_2}) \times T/{t_2}{\kern 1pt} . That can be rearranged to give (52) $f_{1} (t_{1}) = f_{2} (t_{1} M_{2} / M_{1}) \times M_{1} / M_{2} .$ {f_1}({t_1}) = {f_2}({t_1}{M_2}/{M_1}) \times {M_1}/{M_2}{\kern 1pt} . This is an expression we can solve for t₁, which may then be used to determine the T /ε threshold from (53) $T / ε = t_{1} / f_{1} (t_{1}) .$ T/\epsilon = {t_1}/{f_1}({t_1}){\kern 1pt} . This means that the threshold is still for the ratio T /ε, rather than having separate dependence on T and ε.

4.3.

Motivation for Considering Eigenvalue Error

Here, we give further explanation of why it is important to consider eigenvalue error. If the goal is to estimate eigenvalues of the Hamiltonian (as is often the case in quantum chemistry), then the measure of the error should be that in the eigenvalues. That is, how close are the eigenvalues of the product formula to those of the exact Hamiltonian evolution. In the case that the goal is to accurately reproduce the final state after the evolution, then an appropriate measure of the error should be the spectral norm of the difference of the unitary operators. This error accounts for both error in the eigenvalues and basis.

That error will upper bound the 2-norm error in the generated quantum state, but using the triangle inequality to bound the error for long evolution times overestimates the error. This is because the error beyond that in the eigenvalues cancels when using a product of many short time intervals. Let us denote the exact evolution operator for short time U , and the approximate evolution operator provided by the product formula as $\tilde{U}$ \widetilde U . In the basis of the Hamiltonian's eigenstates, U is diagonal, and we can diagonalise $\tilde{U}$ \widetilde U in this basis as $\tilde{U} = {VDV}^{†}$ \widetilde U = VD{V^\dagger } . Then the difference between D and U describes the eigenvalue error, since these are diagonal matrices with the eigenvalues on the diagonal. The matrix V describes the basis error.

It is convenient to write $V = e^{i \tilde{H}}$ V = {e^{i\tilde H}} for some Hermitian matrix $\tilde{H}$ \tilde H . Then the error can be given as (54) $\begin{array}{l} \tilde{U} - U‖ & = e^{i \tilde{H}} {De}^{- i \tilde{H}} - U‖ \\ = (I + i \tilde{H} + O ({\tilde{H}}^{2})) D (I - i \tilde{H} + O ({\tilde{H}}^{2})) - U‖ \\ = i [\tilde{H}, D] + D - U + O ({\tilde{H}}^{2})‖ \\ \leq [\tilde{H}, D - I]‖ + D - U‖ + O ({\tilde{H}}^{2}) . \end{array}$ \matrix{ {\left\| {\widetilde U - U} \right\|} \hfill & { = \;\left\| {{e^{i\tilde H}}D{e^{ - i\tilde H}} - U} \right\|} \hfill \cr {} \hfill & { = \left\| {(I + i\tilde H + {\cal O}({{\tilde H}^2}))D(I - i\tilde H + {\cal O}({{\tilde H}^2})) - U} \right\|} \hfill \cr {} \hfill & { = \;\left\| {i[\tilde H,D] + D - U + {\cal O}({{\tilde H}^2})} \right\|} \hfill \cr {} \hfill & { \le \left\| {[\tilde H,D - I]} \right\| + \left\| {D - U} \right\| + {\cal O}({{\tilde H}^2}){\kern 1pt} .} \hfill \cr } That is, the error can be split up between a part $[\tilde{H}, D - I]$ [\tilde H,D - I] corresponding to basis error, and a part D − U corresponding to eigenvalue error. Note that D − I is proportional to t, so the error in the operator due to the basis error is one order higher than the error in the basis.

Then, for the case of a large number of time steps r, we can bound the overall error as (55) $\begin{array}{l} {\tilde{U}}^{r} - U^{r}‖ & = {VD}^{r} V^{†} - U^{r}‖ \\ = ({VD}^{r} V^{†} - D^{r} V^{†}) + (D^{r} V^{†} - U^{r} V^{†}) + (U^{r} V^{†} - U^{r})‖ \\ \leq V - I‖ + D^{r} - U^{r}‖ + V^{†} - I‖ \\ \leq 2 V - I‖ + r D - U‖ . \end{array}$ \matrix{ {\left\| {{{\widetilde U}^r} - {U^r}} \right\|} \hfill & { = \left\| {V{D^r}{V^\dagger } - {U^r}} \right\|} \hfill \cr {} \hfill & { = \left\| {(V{D^r}{V^\dagger } - {D^r}{V^\dagger }) + ({D^r}{V^\dagger } - {U^r}{V^\dagger }) + ({U^r}{V^\dagger } - {U^r})} \right\|} \hfill \cr {} \hfill & { \le \left\| {V - I} \right\| + \left\| {{D^r} - {U^r}} \right\| + \left\| {{V^\dagger } - I} \right\|} \hfill \cr {} \hfill & { \le 2\left\| {V - I} \right\| + r\left\| {D - U} \right\|{\kern 1pt} .} \hfill \cr } That is, the error due to the eigenvalues is multiplied by a factor of r, but the error due to the basis is not. The error due to the basis can be larger than for a single time step, but only by a factor of ~ 1/t. That factor arises because for a single time step the basis error is commuted with D − I which is proportional to t, so the expression here without D − I can be a factor ∝ 1/t larger.

In practice we find that in many cases the error in the basis scales as 𝒪(t^k), resulting in the expected contribution to the spectral-norm error 𝒪(t^k⁺¹). However, in other cases we find scaling of the error in the basis as 𝒪(t^k⁺¹), resulting in a higher-order contribution to the spectral-norm error. Let us give the error in the eigenvalues as ζt^k⁺¹, and the error in the basis as μt^k^+ν where ν ∈ [0, 1] to account for the various scalings found numerically.

Then the choice of r to make the error in the eigenvalues smaller than ε is (56) $r \approx {(\frac{ζ T}{ε})}^{1 / k} T .$ r \approx {\left( {{{\zeta T} \over \epsilon }} \right)^{1/k}}T. The contribution to the total error from the basis error is (57) $2 μ {(T / r)}^{k + ν} \approx 2 μ {(\frac{ε}{ζ T})}^{1 + ν / k} .$ 2\mu {(T/r)^{k + \nu }} \approx 2\mu {\left( {{\epsilon \over {\zeta T}}} \right)^{1 + \nu /k}}{\kern 1pt} . That gives the ratio of the basis error to the eigenvalue error as approximately (58) $\frac{2 μ}{ε} {(\frac{ε}{ζ T})}^{1 + ν / k} = \frac{2 μ}{ζ T} {(\frac{ε}{ζ T})}^{ν / k} .$ {{2\mu } \over \epsilon }{\left( {{\epsilon \over {\zeta T}}} \right)^{1 + \nu /k}} = {{2\mu } \over {\zeta T}}{\left( {{\epsilon \over {\zeta T}}} \right)^{\nu /k}}{\kern 1pt} . Hence, for large T the eigenvalue error should be dominant. This means that the eigenvalue error for a single step is the relevant measure to estimate the spectral-norm error for long-time evolution.

5.

Numerical Comparison of Product Formulae

5.1.

Comparison for General Matrices

In what follows, we report on the comparison between product formulae of 4th, 6th, 8th, and 10th order based on the threshold provided above. We analyse those found in this work, and those from prior work including processed product formulae. When giving the number of stages of a processed product formula, we refer to the number of stages in the kernel (not P) because those are the ones that would be repeated for a simulation over an extended time.

We list 4th order product formulae in Table 4, 6th order in Table 5, 8th order in Table 6, and 10th order in Table 7, and give the constant factors in the error scaling χ, ζ (spectral norm or eigenvalue error). These product formulae include some constructed with even M rather than M = 2m + 1 as for our product formulae. We generated 1024 random Hamiltonians for each product formula, and then estimated the constants χ and ζ by taking the geometric mean. Each random Hamiltonian is generated as a sum of two random Hamiltonians H = A + B, where A and B are random Hermitian matrices of dimension 64 × 64 and norm 1.

Table 4.

The list of 4th order product formulae. The column labelled “processing” indicates whether the formula uses processor. The label is the name we will refer to the formula by. The constant factor in the error is denoted χ for spectral-norm error and ζ for eigenvalue error, and the corresponding quantities Mχ^1/k and Mζ^1/k are given. The column labelled S₂ indicates whether the product formula is a product of S₂. The best result is highlighted in green, and is from 2002 [47]. The 2006 result PPBCM4m6 is slightly higher error, but it is not evident to 2 significant figures.

label	M	S₂	processing	reference	χ	Mχ^1/k	ζ	Mζ^1/k

S4m1	3	Y	N	first fractal product [2]	4.9 × 10⁻²	1.41	2.3 × 10⁻²	1.17
S4m2	5	Y	N	second fractal product [4]	3.0 × 10⁻³	1.17	3.3 × 10⁻⁴	0.67
O4M5	5	N	N	Ostmeyer Eq. (40) of [46]	3.0 × 10⁻⁴	0.66	7.7 × 10⁻⁵	0.47
BM4M6	6	N	N	S₆ Table 2 of [47]	1.6 × 10⁻⁴	0.67	2.4 × 10⁻⁵	0.42
PPBCM4m6	6	N	Y	P₆4 Table 5 of [31]	5.7 × 10⁻⁵	0.52	2.4 × 10⁻⁵	0.42
BCE4m3	6	Y	Y	$ψ_{3}^{[4]}$ \psi _3^{[4]} Table 6 of [40]	1.6 × 10⁻²	2.15	1.5 × 10⁻³	1.17
BCE4m4	8	Y	Y	$ψ_{4}^{[4]}$ \psi _4^{[4]} Table 6 of [40]	3.4 × 10⁻⁴	1.09	9.8 × 10⁻⁵	0.80
BCE4m5	10	Y	Y	$ψ_{5}^{[4]}$ \psi _5^{[4]} Table 6 of [40]	6.7 × 10⁻⁵	0.90	2.1 × 10⁻⁵	0.68
BCE4m6	12	Y	Y	$ψ_{6}^{[4]}$ \psi _6^{[4]} Table 6 of [40]	2.6 × 10⁻⁵	0.85	7.0 × 10⁻⁶	0.62
BCE4m7	14	Y	Y	$ψ_{7}^{[4]}$ \psi _7^{[4]} Table 6 of [40]	1.3 × 10⁻⁵	0.85	3.0 × 10⁻⁶	0.58
BCE4m8	16	Y	Y	$ψ_{8}^{[4]}$ \psi _8^{[4]} Table 6 of [40]	7.9 × 10⁻⁵	0.85	1.5 × 10⁻⁶	0.56
BCE4m9	18	Y	Y	$ψ_{9}^{[4]}$ \psi _9^{[4]} Table 6 of [40]	5.3 × 10⁻⁶	0.86	8.6 × 10⁻⁷	0.55

Table 5.

The list of 6th order product formulae with their average errors. The column labelled S₂ indicates whether it is a product of S₂. The best result is highlighted in green, and is from 2006 [31].

label	M	S₂	processing	reference	χ	Mχ^1/k	ζ	Mζ^1/k

S6m1	9	Y	N	first fractal product [2]	4.5 × 10⁻²	5.36	2.3 × 10⁻²	4.81
S6m2	25	Y	N	second fractal product [4]	1.3 × 10⁻⁵	3.83	2.0 × 10⁻⁷	1.91
Y6m3a	7	Y	N	Solution A in Table 1 of [3]	2.0 × 10⁻³	2.49	1.2 × 10⁻³	2.28
KL6s9a	9	Y	N	s9odr6a in Appendix A of [25]	2.9 × 10⁻⁴	2.31	1.7 × 10⁻⁴	2.12
KL6s9b	9	Y	N	s9odr6b in Appendix A of [25]	2.9 × 10⁻⁴	2.31	1.7 × 10⁻⁴	2.11
SS6s11	11	Y	N	Section 4.2 of [29]	4.2 × 10⁻⁵	2.05	1.4 × 10⁻⁵	1.71
SS6s13	13	Y	N	Section 4.2 of [29]	2.1 × 10⁻⁵	2.16	3.4 × 10⁻⁶	1.59
BM6M10	10	N	N	S₁₀ Table 2 of [47]	6.0 × 10⁻⁶	1.35	1.4 × 10⁻⁶	1.06
PPBCM6m9	9	N	Y	P₉6 Table 5 of [31]	5.0 × 10⁻⁶	1.18	4.3 × 10⁻⁷	0.78
PPBCM6m5	11	Y	Y	P₁₁6 in Table 6 of [31]	1.9 × 10⁻⁶	1.22	9.0 × 10⁻⁷	1.08
PPBCM6m6	13	Y	Y	P₁₃6 in Table 6 of [31]	4.8 × 10⁻⁷	1.15	2.5 × 10⁻⁷	1.03
BCE6m5	10	Y	Y	$ψ_{5}^{[6]}$ \psi _5^{[6]} Table 8 of [40]	5.8 × 10⁻³	4.24	3.2 × 10⁻³	3.84
BCE6m6	12	Y	Y	$ψ_{6}^{[6]}$ \psi _6^{[6]} Table 8 of [40]	7.7 × 10⁻⁵	2.47	2.0 × 10⁻⁵	1.98
BCE6m7	14	Y	Y	$ψ_{7}^{[6]}$ \psi _7^{[6]} Table 8 of [40]	1.6 × 10⁻⁵	2.22	2.4 × 10⁻⁶	1.62
BCE6m8	16	Y	Y	$ψ_{8}^{[6]}$ \psi _8^{[6]} Table 8 of [40]	4.6 × 10⁻⁶	2.06	3.2 × 10⁻⁷	1.32
BCE6m9	18	Y	Y	$ψ_{9}^{[6]}$ \psi _9^{[6]} Table 8 of [40]	7.3 × 10⁻⁶	2.51	1.3 × 10⁻⁷	1.28
BCE6m10	20	Y	Y	$ψ_{10}^{[6]}$ \psi _{10}^{[6]} Table 8 of [40]	3.2 × 10⁻⁶	2.42	9.9 × 10⁻⁹	0.93
BCE6m11	22	Y	Y	$ψ_{11}^{[6]}$ \psi _{11}^{[6]} Table 8 of [40]	7.9 × 10⁻⁶	3.11	1.4 × 10⁻⁸	1.08

Table 6.

The list of 8th order product formulae with their average errors. This table shows our new results for product formulae that improve over those in the prior literature. The result that provides the most efficient simulations is highlighted in (dark) green, and is our processed product formula. That provides the best performance due to the shorter length, but the lowest error is provided by our product formula highlighted in (dark) blue. The lowest error results from prior work are highlighted in light blue (for the non-processed case), and light green (for the processed case). The processed product formula PP8s13 (P₁₃8) has larger error than the highlighted PP8s19, but better performance due to its shorter length.

label	M	processing	reference	χ	Mχ^1/k	ζ	Mζ^1/k

S8m1	27	N	first fractal product [2]	5.6 × 10⁻²	18.8	1.6 × 10⁻²	16.2
S8m2	125	N	second fractal product [4]	6.7 × 10⁻⁹	11.9	5.2 × 10⁻¹³	3.64
Y8m7d	15	N	Solution D in Table 2 of [3]	1.1 × 10⁻³	6.41	1.3 × 10⁻⁴	4.89
MC8s15	15	N	Table 2 of [24]	6.5 × 10⁻⁶	3.37	2.0 × 10⁻⁶	2.90
MC8s17	17	N	Table 2 of [24]	7.9 × 10⁻⁷	2.94	3.3 × 10⁻⁷	2.63
KL8s17a	17	N	s17odr8a in [25]	6.1 × 10⁻⁷	2.84	2.7 × 10⁻⁷	2.56
KL8s17b	17	N	s17odr8b in [25]	5.9 × 10⁻⁷	2.83	2.5 × 10⁻⁷	2.54
SS8s19	19	N	Section 4.3 of [29]	1.8 × 10⁻⁷	2.72	5.3 × 10⁻⁸	2.34
SS8s21	21	N	Section 4.3 of [29]	3.4 × 10⁻⁷	3.27	7.5 × 10⁻⁸	2.70
PP8s13	13	Y	P₁₃8 in Table 6 of [31]	1.2 × 10⁻⁶	2.37	6.5 × 10⁻⁷	2.19
PP8s19	19	Y	P₁₉8 in Table 6 of [31]	NA	NA	2.4 × 10⁻⁷	2.83
Y8m10	21	N	Table 1 (our new result)	5.8 × 10⁻⁸	2.61	7.0 × 10⁻⁹	2.01
Y8m10b	21	N	Table 1 (our new result)	6.3 × 10⁻⁷	3.53	5.4 × 10⁻¹⁰	1.46
YP8m8	17	Y	Table 2 (our new result)	5.3 × 10⁻⁸	2.09	8.1 × 10⁻¹⁰	1.24

Table 7.

The list of 10th order product formulae with their average errors. The best result is highlighted in green, and is the product formula from 2005 [29]. That product formula provides exceptional performance for 10th order, being about 280 times better than all others from prior work, and half the error of our solution (bottom line).

label	M	processing	reference	χ	Mχ^1/k	ζ	Mζ^1/k

S10m1	81	N	first fractal product [2]	9.0 × 10⁻²	63.7	2.7 × 10⁻³	44.8
S10m2	625	N	second fractal product [4]	4.1 × 10⁻¹³	36.0	6.1 × 10⁻¹⁹	9.44
KL10s31a	31	N	s31odr10a in Appendix A of [25]	8.7 × 10⁻⁶	9.67	5.8 × 10⁻⁶	9.28
KL10s31b	31	N	s31odr10b in Appendix A of [25]	8.4 × 10⁻⁵	12.1	4.1 × 10⁻⁵	11.3
Tsi10s33	33	N	Table II of [26]	3.3 × 10⁻⁶	9.33	6.4 × 10⁻⁷	7.93
SS10s31	31	N	Section 4.4 of [29]	4.2 × 10⁻⁸	5.66	2.4 × 10⁻⁸	5.36
SS10s33	33	N	Section 4.4 of [29]	1.4 × 10⁻⁸	5.42	8.8 × 10⁻⁹	5.17
SS10s35	35	N	Section 4.4 of [29]	1.0 × 10⁻⁹	4.41	3.1 × 10⁻¹¹	3.11
Alberdi31	31	N	Appendix A of [48]	1.5 × 10⁻⁷	6.44	1.0 × 10⁻⁷	6.20
Alberdi33	33	N	Appendix A of [48]	8.4 × 10⁻⁸	6.47	5.5 × 10⁻⁸	6.20
Alberdi35	35	N	Appendix A of [48]	1.5 × 10⁻⁸	5.79	9.5 × 10⁻⁹	5.52
PP10s19	19	Y	P₁₉10 in Table 6 of [31]	NA	NA	6.2 × 10⁻⁶	5.73
PP10s23	23	Y	P₂₃10 in Table 6 of [31]	1.4 × 10⁻⁶	5.96	1.6 × 10⁻⁸	3.82
Y10m17	35	N	Table 3	1.9 × 10⁻⁸	5.91	6.1 × 10⁻¹¹	3.33

The size of the time step used for the estimation is t = e^−5/2 ≈ 0.08, which we found to be small enough for accurate estimation of the constants. An exception is the 10th order product formula PP10s23 in the second-last line of Table 7. That was evaluated for t = e^−3/2 ≈ 0.2, rather than t ≈ 0.08 for the other product formulae. Estimates for t ≈ 0.08 are significantly larger, possibly because the coefficients for that product formula were not given to high precision. We provide the estimates of χ, ζ for larger t to evaluate it as fairly as possible.

To compare among product formulae of the same order, we compare Mζ^1/k for eigenvalue error, and the best results in each table are highlighted. We also provide values of Mχ^1/k for comparison, but these are not used to select the best product formulae. For 8th order it can be seen that our new processed product formula provides the best performance out of all formulae tested. Our two new product formulae of length m = 10 (M = 21) also give better performance than any prior product formulae tested, including processed product formulae. Hence we have given a sequence of three new solutions here, all of which provide improved performance over prior work. There are also other results in our database of over 100,000 solutions that outperform prior work, but we have just given a selection of the best here.

In terms of eigenvalue error, our solution Y8m10b is about 100 times more accurate than the lowest-error prior result, SS8s19. Our processed product formula YP8m8 is about a factor of 300 times more accurate than the lowest-error prior result for processed product formulae, PP8S19, which is longer. These surprisingly large improvements demonstrate that the solution space had not been fully explored in prior work. Each increase in m greatly expands the solution space to explore, and it is likely that there are further improvements still to be found. The error for our processed product formula YP8m8 has slightly larger error than our non-processed formula Y8m10b, but its shorter length means that Mζ^1/k is smaller, thus providing the best performance out of all product formulae tested.

These improvements are consistent between the samples of the Hamiltonians. In comparing our formula Y8m10b to the best from prior work SS8s19, our smallest improvement (out of 1,024 samples) was about a factor of 35. A histogram of the improvement for our new product formula is shown in Figure 1. We also find that the improvements are consistent between sizes of the matrices. We have tested sizes 6, 16, and 32 as well as 64 (which is shown in the tables), and found significant improvements at all sizes tested.

A recent review [33] also provides a comparative analysis of many of the product formulae in the literature, and for 8th order their recommended methods are SS8s21 in the case without processing, and PP8s13 for the case with processing. In our testing we find that SS8s19 is slightly better than SS8s21; this difference may be because we are testing Hamiltonians here, whereas Ref. [33] considers general matrices (which are non-Hermitian). PP8s19 provides lower error than PP8s13, but PP8s13 provides better performance due to the shorter length (and correspondingly smaller value of Mζ^1/k). The three new results we have listed in Table 6 all outperform those methods recommended by Ref. [33].

As a further test, we have calculated the error for simulating the transverse-field Ising model for an 8-qubit chain. We used a time step of 1, with the two components of the Hamiltonian normalised to unit norm, so it is (59) $H = - \frac{1}{7} \sum_{j = 1}^{7} Z_{j} Z_{j + 1} + \frac{1}{8} \sum_{j = 1}^{8} X_{j} .$ H = - {1 \over 7}\sum\limits_{j = 1}^7 {Z_j}{Z_{j + 1}} + {1 \over 8}\sum\limits_{j = 1}^8 {X_j}{\kern 1pt} . We then determined the spectral-norm error for up to 1000 steps, for our minimum eigenvalue error solution Y8m10b, as well as SS8s19 and SS8s21 which provided the minimum eigenvalue error out of results presented in prior work. As seen in Figure 2, the error is far smaller for our new product formula than that provided by those from prior work. The initial error is larger as indicated by the larger spectral-norm error for a single step, but then the error for a large number of time steps is dominated by the eigenvalue error which is much smaller for this product formula. It is also surprising that the simulation is highly accurate for a time step of 1, which is not the short time step normally considered for scaling of the error. The eigenvalue error for a single time step of 1 is only 2.7 × 10⁻¹² for our product formula.

In the case of 10th order, we find that the best performance is provided by SS10s35, from Sofroniou and Spaletta [29] (which is recommended by Ref. [33]). The performance of that product formula is extraordinarily good, with error nearly a factor of 280 times less than any others we have tested from prior work. Our solution Y10m17 provides intermediate performance, having error twice as large as SS10s35, but a factor of 140 times better than any others. For example, it outperforms the processed product formula PP10s23, which was also recommended by Ref. [33]. The fact that there is a very exceptional solution at one particular length for 10th order suggests that there are better solutions still to be found, motivating further searches in future work.

For 4th order the best result for eigenvalue error is given by the formula BM4M6 of Blanes and Moan [47], which is not in the form of a product of S₂. The processed product formula PPBCM4m6 also provides excellent performance, with a marginally larger error in our testing. A recently proposed formula O4M5 of Ostmeyer [46] also gives good performance. In the 6th order case we find that the best performance is given by PPBCM6m9, which is a processed product formula not constructed as a product of S₂. The recently proposed processed product formula BCE6m10 also gives excellent performance with low error, but is much longer, resulting in a larger value of Mζ^1/k.

In all cases the fractal products provide poor performance. The second fractal product yields very low error but very large M, equal to 125 for 8th order and 625 for 10th order. This is far beyond that for any other product formulae considered here, making this product formula uncompetitive despite the small error. The first fractal product yields large error as well as large M, making its performance by far the worst out of any product formula tested. We also tested various combinations of the two formulae (by using different formulae in the iteration to increase the order) or using fractal formulae to increase the order of lower-order formulae. None of these provided competitive performance, so are not included in the tables.

To consider the threshold to use a higher-order product formula over a lower-order one, we first consider the asymptotic expression using Mζ^1/k for eigenvalue error. For 4th order the value of this quantity is 0.42 (BM4M6) and for 6th order is 0.78 (PPBCM6m9), which gives a threshold of 1,700. The value for the best 8th order is 1.24 (our result YP8m8), which gives a threshold of around 6000 when compared with 4th order and 68,000 when compared with 6th order.

At 10th order the best value is 3.11 (SS10s35), which when compared with 8th order gives a threshold of 9.3 × 10¹⁵. This is far greater than can be expected for realistic simulations. To see the reason for this, if we use Eq. (46) for the approximate number of exponentials needed for phase estimation (where the threshold is for 1/ε_H), it would be more than 3 × 10¹⁸. In realistic implementations of quantum chemistry it takes about 10⁴ Toffoli gates per exponential, so this would correspond to a Toffoli complexity over 10²². For comparison, in the analysis of Shor's algorithm in Ref. [49], 3 × 10⁹ Toffolis were estimated to take 8 hours. Even with improvements in gate speed, a simulation sufficiently large for 10th order product formulae to provide an advantage would require a computation time of many millions of years.

This threshold between 8th and 10th order can be expected to be accurate (at least for the estimate of the order of magnitude) because it corresponds to small time steps below 0.3. On the other hand, the other thresholds would correspond to large time steps, so would be inaccurate because the asymptotic expressions break down. The calculated threshold between 6th and 8th order would correspond to a time step larger than 3, which is far too large for the scaling of the error to be accurate.

In this case we find numerically that the threshold between these 6th and 8th order formulae (PPBCM6m9 and YP8m8) is increased dramatically, to about 3.2 × 10⁷, corresponding to a time step of about 1.22. If the target ε is 0.01 for example, that would correspond to about 260,000 time steps. It is possible to optimise our kernel for the larger time step size in order to provide better performance. This optimisation provides a kernel (see Table 8) such that the threshold is reduced about a factor of twelve to 2.8 × 10⁶. This now corresponds to 15,000 time steps with a time step size of about 1.82. This is fewer time steps than expected for simulations of classically intractable systems. Thus the optimisation provides another order of magnitude where 8th order product formulae are optimal, albeit with a product formula tailored to the time step size.

Table 8.

The 8th order kernel tailored for large time step size t = 1.82 to improve the threshold.

	kernel for large time step

w₁	0.1777372900430394
w₂	0.2862580532195395
w₃	0.1701306063199336
w₄	−0.3746748008394162
w₅	0.1485267804844835
w₆	−0.3773225725485588
w₇	0.5395886879620081
w₈	0.2210419534887659

One must also be careful in considering thresholds based on eigenvalue error because there needs to be a large number of time steps such that the spectral-norm error in the complete evolution time is dominated by the eigenvalue error. There is a factor of about 65 between the spectral-norm error and the eigenvalue error for the 8th order product formula YP8m8. For these thresholds the number of time steps is orders of magnitude larger than this, which indicates that the eigenvalue error is an appropriate measure.

These threshold calculations are for Hamiltonians that are sums of two terms, each of which is normalised. In the case where the Hamiltonian is not normalised, the threshold should be scaled by the norm. In the simple case where the Hamiltonian is a sum of two terms with equal norms ‖A‖ = ‖B‖, then the threshold will be for ‖A‖T/ɛ rather than T/ɛ. More generally the Hamiltonian may be a sum of any number of terms with different norms, which would change the threshold and make it unclear what quantity the threshold should be considered in terms of. There is also the possibility that the threshold can be changed for Hamiltonians that have structure to them instead of being random, for example those for quantum chemistry.

5.2.

Comparison for Fermionic Hamiltonians

Fermionic Hamiltonians encountered in quantum chemistry often have the form (60) $\sum_{p, q = 1}^{d} τ_{pq} a_{p}^{†} a_{q} + \sum_{p, q = 1}^{d} ν_{pq} a_{p}^{†} a_{p} a_{q}^{†} a_{q}$ \sum\limits_{p,q = 1}^d {\tau _{pq}}a_p^\dagger {a_q} + \sum\limits_{p,q = 1}^d {\nu _{pq}}a_p^\dagger {a_p}a_q^\dagger {a_q} where $a_{p}^{†}$ a_p^\dagger and a_p are the fermionic creation and annihilation operators acting on orbital p, and there are a total of d orbitals. Each entry τ_pq, ν_pq is real and there is symmetry in exchanging indices. The behaviour of the error as the size of system is changed can be predicted based on the result in Theorem 4 of [50].

Theorem 2 (Theorem 4 in [50]).

Let $H = T + V = \sum_{p, q}^{d} τ_{pq} a_{p}^{†} a_{q} + \sum_{p, q}^{d} ν_{pq} a_{p}^{†} a_{p} a_{q}^{†} a_{q}$ H = T + V = \sum\nolimits_{p,q}^d {\tau _{pq}}a_p^\dagger {a_q} + \sum\nolimits_{p,q}^d {\nu _{pq}}a_p^\dagger {a_p}a_q^\dagger {a_q} be an interacting-electronic Hamiltonian, and S_k(t) be a kth order product formula splitting the evolutions under T and V . Then (61) ${S_{k} (t) - e^{- itH}‖}_{W_{η}} = O ({({τ‖}_{1} + {ν‖}_{1, [η]})}^{k - 1} {τ‖}_{1} {ν‖}_{1, [η]} η t^{k + 1}),$ {\left\| {{S_k}(t) - {e^{ - itH}}} \right\|_{{W_\eta }}} = {\cal O}\left( {{{\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)}^{k - 1}}{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta {t^{k + 1}}} \right){\kern 1pt} , where ‖⋅‖_{W_η} corresponds to the operator norm on the operator acting in the η-electron subspace, ${τ‖}_{1} = {max}_{p} \sum_{q} τ_{pq}|,$ {\left\| \tau \right\|_1} = {\max _p}{\sum _q}\left| {{\tau _{pq}}} \right|, and ${ν‖}_{1, [η]} = {max}_{p} {max}_{q_{1} < q_{2} < \dots < q_{η}} (ν_{{pq}_{1}}| + \dots + ν_{{pq}_{η}}|) .$ {\left\| \nu \right\|_{1,[\eta ]}} = {\max _p}{\max _{{q_1} < {q_2} < \cdots < {q_\eta }}}\left( {\left| {{\nu _{p{q_1}}}} \right| + \cdots + \left| {{\nu _{p{q_\eta }}}} \right|} \right).

To fairly compare product formulae as applied to quantum chemistry, we can define ω so that (62) $ζ = ω {({τ‖}_{1} + {ν‖}_{1, [η]})}^{k - 1} {τ‖}_{1} {ν‖}_{1, [η]} η .$ \zeta = \omega {\kern 1pt} {({\left\| \tau \right\|_1} + {\left\| \nu \right\|_{1,[\eta ]}})^{k - 1}}{\left\| \tau \right\|_1}{\left\| \nu \right\|_{1,[\eta ]}}\eta {\kern 1pt} . Note that the expression in Ref. [50] was derived for error in the spectral norm, but that also provides an upper bound on the error in the eigenvalues, so it is reasonable to consider the constant defined for eigenvalue error. Then the formula for the threshold T /ε becomes (63) $\begin{matrix} \frac{T}{ε} & = {(\frac{M_{2} {[ω_{2} {({τ‖}_{1} + {ν‖}_{1, [η]})}^{k_{2} - 1} {τ‖}_{1} {ν‖}_{1, [η]} η]}^{1 / k_{2}}}{M_{1} {[ω_{1} {({τ‖}_{1} + {ν‖}_{1, [η]})}^{k_{1} - 1} {τ‖}_{1} {ν‖}_{1, [η]} η]}^{1 / k_{1}}})}^{\frac{1}{\frac{1}{k_{1}} - \frac{1}{k_{2}}}} \\ = {(\frac{M_{2} {[ω_{2} {τ‖}_{1} {ν‖}_{1, [η]} η / ({τ‖}_{1} + {ν‖}_{1, [η]})]}^{1 / k_{2}}}{M_{1} {[ω_{1} {τ‖}_{1} {ν‖}_{1, [η]} η / ({τ‖}_{1} + {ν‖}_{1, [η]})]}^{1 / k_{1}}})}^{\frac{1}{\frac{1}{k_{1}} - \frac{1}{k_{2}}}} \\ \frac{{τ‖}_{1} {ν‖}_{1, [η]} η}{{τ‖}_{1} + {ν‖}_{1, [η]}} \frac{T}{ε} & = {(\frac{M_{2} ω_{2}^{1 / k_{2}}}{M_{1} ω_{1}^{1 / k_{1}}})}^{\frac{1}{\frac{1}{k_{1}} - \frac{1}{k_{2}}}} . \end{matrix}$ \matrix{\hfill{{T \over \smallint }} & { = {{\left( {{{{M_2}{{[{\omega _2}{{({{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}})}^{{k_2} - 1}}{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta ]}^{1/{k_2}}}} \over {{M_1}{{[{\omega _1}{{({{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}})}^{{k_1} - 1}}{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta ]}^{1/{k_1}}}}}} \right)}^{{1 \over {{1 \over {{k_1}}} - {1 \over {{k_2}}}}}}}} \hfill \cr \hfill{} & { = {{\left( {{{{M_2}{{[{\omega _2}{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta /({{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}})]}^{1/{k_2}}}} \over {{M_1}{{[{\omega _1}{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta /({{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}})]}^{1/{k_1}}}}}} \right)}^{{1 \over {{1 \over {{k_1}}} - {1 \over {{k_2}}}}}}}} \hfill \cr \hfill{{{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}{T \over \smallint }} & { = {{\left( {{{{M_2}\omega _2^{1/{k_2}}} \over {{M_1}\omega _1^{1/{k_1}}}}} \right)}^{{1 \over {{1 \over {{k_1}}} - {1 \over {{k_2}}}}}}}.} \hfill \cr } For estimation of eigenvalues (rather than simulation of time evolution) the same derivation holds with T /ε replaced by 1/ε_H . Thus we see that the ratio ‖τ‖₁‖ν‖_1,[η]η/(‖τ‖₁ + ‖ν‖_1,[η]) governs the threshold where a higher-order product formula will improve over a lower-order product formula. This is somewhat different from the factor of the norm of the Hamiltonian that might otherwise be expected for unstructured Hamiltonians.

To analyse the non-asymptotic regime, we rewrite the error to be proportional to (64) $\frac{{τ‖}_{1} {ν‖}_{1, [η]} η}{{τ‖}_{1} + {ν‖}_{1, [η]}} t \times {t ({τ‖}_{1} + {ν‖}_{1, [η]})]}^{k} .$ {{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}t \times {\left[ {t\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right]^k}. Following the derivation of Ref. [50], it is easily seen that the contributions to the error from each of the orders higher than k + 1 will be of a similar form. These higher-order contributions to the error can be used to determine the error in the non-asymptotic regime where the order k + 1 term is not dominant.

To see why that is the case, note that Lemma 2 of Ref. [50] describes the error at order k + 1 (p + 1 in the notation of that work) in terms of a sum over norms of multicommutators of k + 1 terms in the Hamiltonian. It is easily seen that the error at each of the higher orders will also be given by similar multicommutator forms. The method of proof of Theorem 4 in Ref. [50] is based on simplifying the multicommutator expressions from Lemma 2, and proceeds in exactly the same way for the higher orders.

Therefore, the sum over errors of all orders would give a total error of the form (65) $\frac{{τ‖}_{1} {ν‖}_{1, [η]} η}{{τ‖}_{1} + {ν‖}_{1, [η]}} t \times \sum_{j = k + 1}^{\infty} ξ_{j} {t ({τ‖}_{1} + {ν‖}_{1, [η]})]}^{j} .$ {{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}t \times \sum\limits_{j = k + 1}^\infty {\xi _j}{\left[ {t\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right]^j}. This sum would then give the non-asymptotic form for the error (66) $\frac{{τ‖}_{1} {ν‖}_{1, [η]} η}{{τ‖}_{1} + {ν‖}_{1, [η]}} t \times g (t ({τ‖}_{1} + {ν‖}_{1, [η]})),$ {{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}t \times g\left( {t\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right){\kern 1pt} , for some function g.

Then following the same procedure to determine the threshold as before, we will have a total error given by (67) $\frac{{τ‖}_{1} {ν‖}_{1, [η]} η}{{τ‖}_{1} + {ν‖}_{1, [η]}} T \times g (t ({τ‖}_{1} + {ν‖}_{1, [η]})) .$ {{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}T \times g\left( {t\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right){\kern 1pt} . That would imply that comparing two product formulae, the threshold will be obtained for (68) $g_{1} (t_{1} ({τ‖}_{1} + {ν‖}_{1, [η]})) = g_{2} (t_{2} ({τ‖}_{1} + {ν‖}_{1, [η]})) .$ {g_1}\left( {{t_1}\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right) = {g_2}\left( {{t_2}\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right){\kern 1pt} . Using t₂ = t₁M₂/M₁ and solving for t₁ as before, the T /ε threshold would then be given by substituting the solution for t₁ into (69) $\frac{{τ‖}_{1} {ν‖}_{1, [η]} η}{{τ‖}_{1} + {ν‖}_{1, [η]}} \frac{T}{ε} = 1 / g_{1} (t_{1} ({τ‖}_{1} + {ν‖}_{1, [η]})) .$ {{{{\left\| \tau \right\|}_1}{{\left\| \nu \right\|}_{1,[\eta ]}}\eta } \over {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}}}{T \over \epsilon } = 1/{g_1}\left( {{t_1}\left( {{{\left\| \tau \right\|}_1} + {{\left\| \nu \right\|}_{1,[\eta ]}}} \right)} \right){\kern 1pt} .

The threshold can be calculated for Hamiltonians with a given value of ‖τ‖₁ + ‖ν‖_1,[η], then for some new Hamiltonian with different values of the norms, the threshold should still be for the same value of t₁(‖τ‖₁ + ‖ν‖_1,[η]). This implies that the right-hand side of Eq. (69) should be the same for the threshold with the new Hamiltonian. As a result, we expect that the threshold should be in terms of the left-hand side of Eq. (69), which is the same expression as found in the asymptotic regime.

In our numerical testing, the coefficients τ_pq, ν_pq were chosen uniformly from the interval [−1, 1]. We computed ω for a selection of the product formulae with the best performance for random Hamiltonians. In Table 9, Table 10, and Table 11 we give the computed result for d = 6 orbitals, assuming half-filling of the orbitals. Our numerics indicate that the error is roughly proportional to the bound in Eq. (61), independent of the number of orbitals, though the constant factors are small. In Appendix C we give the constant factors for the case d = 4, showing that the computed constant does not change much with a different d.

Table 9.

Comparison of constant factors ω for a selection of the lowest-error best product formulae for 6th order. We generate 1, 000 random Hamiltonians with d = 6 orbitals as in Eq. (60) and compute the average ω.

label	M	processing	reference	ω	Mω^1/k

PPBCM6m9	9	Y	P₉6 Table 5 of [31]	2.8 × 10⁻⁹	0.33
PPBCM6m6	13	Y	P₁₃6 in Table 6 of [31]	1.2 × 10⁻⁹	0.42
BCE6m10	20	Y	$ψ_{10}^{[6]}$ \psi _{10}^{[6]} Table 8 of [40]	3.4 × 10⁻¹¹	0.36

Table 10.

Comparison of constant factors ω for a selection of the lowest-error product formulae for 8th order. We generate 1, 000 random Hamiltonians with d = 6 orbitals as in Eq. (60) and compute the average ω.

label	M	processing	reference	ω	Mω^1/k

SS8s19	19	N	Section 4.3 of [29]	3.5 × 10⁻¹¹	0.94
PP8s13	13	Y	P₁₃8 in Table 6 of [31]	4.4 × 10⁻¹⁰	0.88
Y8m10	21	N	Table 1 (our new result)	8.5 × 10⁻¹²	0.87
Y8m10b	21	N	Table 1 (our new result)	1.3 × 10⁻¹²	0.68
YP8m8	17	Y	Table 2 (our new result)	1.7 × 10⁻¹²	0.57

Table 11.

Comparison of constant factors ω for a selection of the lowest-error product formulae for 10th order. We generate 1, 000 random Hamiltonians with d = 6 orbitals as in Eq. (60) and compute the average ω.

label	M	processing	reference	ω	Mω^1/k

SS10s35	35	N	Section 4.4 of [29]	3.0 × 10⁻¹⁵	1.24
PP10s23	23	Y	P₂₃10 in Table 6 of [31]	2.3 × 10⁻¹²	1.58
Y10m17	35	N	Table 3	2.6 × 10⁻¹⁴	1.53

From the results in these tables, we see that the relative performance for the product formulae in unchanged from the results with the random matrices. This is a useful independent verification that the assessment of the performance of the product formulae is relatively independent of the class of matrices. In particular, we find that for 8th order our three new results all outperform the best product formulae in prior work (where we have compared to the two prior formulae with best performance). In the case of 10th order the product formula SS10s35 of Ref. [29] still provides exceptional performance.

For calculating the thresholds based on the asymptotic formula in Eq. (63), we should compare our best 8th order formula YP8m8 to SS10s35 in the 10th order case, and PPBCM6m9 in the 6th order case. Then we obtain a threshold of 3 × 10¹³ for 8th to 10th order, and 5 × 10⁵ for 6th to 8th order, indicating about 9 orders of magnitude for T /ε where our 8th order product formula is optimal.

Similar to the case for general matrices, we expect that the threshold for 8th to 10th order is sufficiently large that the time steps are small and the asymptotic formulae are accurate, but the threshold between 6th and 8th order may be unreliable. In that case we solve Eq. (68) for t₁ with t₂ = t₁M₂/M₂, and find the threshold using Eq. (69). We expect that the 6th order product formula PPBCM6m9 can give better performance for larger time steps, and indeed we find that the threshold is further increased to 1.7 × 10⁷. By using our 8th order kernel optimised for large time step in Table 8 we can improve the threshold between PPBCM6m6 and our 8th order formula in the non-asymptotic case. We then obtain a threshold of around 3 × 10⁶, which is an improvement by a factor of 5 times.

Next we compare these thresholds to the parameters for specific systems. From Ref. [50] the norms can be expected to scale as (70) ${ν‖}_{1, [η]} = O (\frac{η^{2 / 3} N^{1 / 3}}{Ω^{1 / 3}}), {τ‖}_{1} = O (\frac{N^{2 / 3}}{Ω^{2 / 3}}),$ {\left\| \nu \right\|_{1,[\eta ]}} = {\cal O}\left( {{{{\eta ^{2/3}}{N^{1/3}}} \over {{\Omega ^{1/3}}}}} \right),\quad \quad {\left\| \tau \right\|_1} = {\cal O}\left( {{{{N^{2/3}}} \over {{\Omega ^{2/3}}}}} \right), where N is the number of orbitals and Ω is the volume (denoted n and ω in [50]). The constants for the scaling of these norms are derived in Ref. [51] as (71) ${ν‖}_{1, [η]} \approx π^{1 / 3} {(3 / 4)}^{2 / 3} \frac{η^{2 / 3} N^{1 / 3}}{Ω^{1 / 3}}, {τ‖}_{1} \approx \frac{3 π^{2} N^{2 / 3}}{2 Ω^{2 / 3}} .$ {\left\| \nu \right\|_{1,[\eta ]}} \approx {\pi ^{1/3}}{(3/4)^{2/3}}{{{\eta ^{2/3}}{N^{1/3}}} \over {{\Omega ^{1/3}}}},\quad \quad {\left\| \tau \right\|_1} \approx {{3{\pi ^2}{N^{2/3}}} \over {2{\Omega ^{2/3}}}}. For an initial order of magnitude estimate, the chemical accuracy required for the phase estimation is about 0.001 Hartree. As discussed above, for phase estimation we replace T /ε in the threshold with 1/ε_H . With estimates η ≈ 100 and N /Ω in the range 10³ to 10⁹ (corresponding to Δ between 10⁻¹a₀ and 10⁻³a₀ in Ref. [52]), the left-hand side (LHS) of Eq. (63) would be in the range 2.2 × 10⁷ to 2.6 × 10⁹. That is far below the threshold between 8th and 10th order, demonstrating that our 8th order should be preferred to 10th order. It is also above the threshold between 6th and 8th order, indicating that 8th order provides the best performance for simulations of this type. For smaller simulations 6th order will be optimal, but it may be expected that the small values of N /Ω would not provide a sufficiently accurate representation of the quantum system.

For comparison, we also consider the Alpha + Hydrogen system from Ref. [51]. The parameters η, N , and Ω are given in Table 1 of that work as 218, 53³, and 2420, respectively. Some of the most challenging values of T , ε are given in Figure 4 of that work, with for example T = 40 and ε = 10⁻⁴. With these parameters the LHS of Eq. (63) is approximately 8.6 × 10⁹, which is below the threshold (between 8th and 10th order) by a few orders of magnitude. The smallest values of T /ε from Ref. [51] are obtained with T = 10 and ε = 0.01 (see Table IV). That results in a value for the LHS of Eq. (63) of 2.2 × 10⁷, which is above the threshold between 6th and 8th order, so our 8th order product formula would still be optimal. This indicates that the new 8th order product formula found here should be optimal for the full range of parameter values used in Ref. [51]. Nonetheless, for smaller values of T /ε it may be advantageous to further customise the product formula to provide better performance at larger time steps, as described above to improve the threshold between 6th order and 8th order.

Note also that the fact that the threshold here is different than for random Hamiltonians means that the threshold will depend on the class of Hamiltonians. Selecting a different type of Hamiltonians can change the threshold. Similarly, it will be expected that choosing a particular class of fermionic Hamiltonians (such as those arising from plane waves) rather than random fermionic Hamiltonians can give a different threshold. Nevertheless, changing the class of Hamiltonians still resulted in an extremely large threshold for 10th order product formulae to be optimal. This indicates that the 8th order formula will be optimal for realistic simulations regardless of the class of Hamiltonians.

6.

Discussion

We have found 8th order product formulae with significantly lower error than in prior work. For the case of processed product formulae, our best solution is about 300 times more accurate than that in prior work, and in the case of product formulae without processing our solution is about 100 times more accurate. The size of the improvement is estimated based on testing with random 64 × 64 matrices. We have also tested on a range of different matrix sizes, fermionic Hamiltonians of different sizes, and a range of sizes of the transverse-field Ising model and find consistent improvements with well over an order of magnitude in all cases.

The size of the improvement is surprising, especially considering the best prior results are from 2005 (without processing) and 2006 (with processing). To achieve this improvement we have used distinct techniques from prior work. The processed product formulae from Ref. [31] set multiple coefficients equal to one another. Specifically, in the P₁₉8 formula, six of the coefficients are identical, following a heuristic suggested by McLachlan [27]. However, our results suggest that using distinct values for each coefficient yields better outcomes. The advantage of taking longer product formulae with distinct coefficients is that there are significantly more parameters than constraints, giving more freedom for optimisation of the product formula. The solutions which we find give the best performance have no clear pattern to the coefficients, indicating that assuming these patterns is too restrictive.

The way that product formulae were evaluated in Ref. [31] was based on the ℓ^p norms of the coefficients for p = 1 and p = k + 1. Those norms are both smaller for P₁₉8 given in Ref. [31] than our lower-error solution. Similarly, these norms are both smaller for the solutions from Ref. [29] than our non-processed product formula. Also, Ref. [48] found 10th order integrators with 1-norm lower than those in Ref. [29], but in our testing their error is larger. In all these cases the solutions with lower error have larger norms. The ℓ^k⁺¹ norm corresponds to just one of the 𝒪(t^k⁺¹) error terms, and a product formula that minimises that term while giving large values for the others will not perform well. We find that the strategy of selecting product formulae with low error for examples can yield much better results than attempting to minimise norms.

In this work we compare the eigenvalue error, rather than the spectral-norm error as in prior work because it is the eigenvalue error in a single step which dominates the error over longer time evolutions, with basis error cancelling. In many applications of quantum computing the goal is to estimate eigenvalues, so it is solely the eigenvalue error which should be considered. For most product formulae tested, the spectral norm of the error (for a single step) is substantially larger than the eigenvalue error, meaning that it can be an unrealistic measure of accuracy. This also suggests that techniques for reducing error based on randomisation of the term order [21] or randomised corrections [53] may provide less improvement than anticipated because if successive steps are not identical then the basis error will not cancel.

We have shown how to fairly compare the performance of product formulae with different numbers of factors and different orders. The optimal order of integrator to use depends primarily on the ratio T /ε, the ratio of the total evolution time to the allowed error. This is because, as T /ε becomes larger, the error allowed for each individual time step becomes smaller, and the time steps must become shorter to make the error sufficiently small. That results in a larger overall number of time steps, and so a larger complexity. The higher-order integrators better reduce the error with shorter time steps, and so for large T /ε that improvement more than compensates for the larger number of exponentials for the higher-order product formulae. For eigenvalue estimation there is an equivalent threshold in 1/ε_H , in terms of the allowed error in the eigenvalue.

The threshold for 10th order product formulae to outperform 8th order product formulae is very large, so 10th order would only be optimal for simulations well beyond the scale considered for quantum algorithms. Similarly, it would not be useful to consider 12th or higher order product formulae for the purpose of Hamiltonian simulation, as the threshold would be even larger. For large quantum simulations that are still reasonable to perform on a quantum computer, such as those considered in Refs. [51] or [52], the best performance is provided by our 8th order product formulae (out of the extensive set of product formulae tested). For more moderately sized simulations 6th order will be optimal, and in our testing we find that the product formula S₉6 of Ref. [31] provides excellent performance. For smaller simulations, as in NISQ experiments, 4th order product formulae would be ideal, with S₆ of Ref. [47] and P₆4 of [31] providing the best performance.

Care is required in determining the thresholds between the orders. Using the formula based on the asymptotic scaling can yield estimates of the T /ε threshold corresponding to large time steps where the asymptotic scaling no longer holds. When properly accounting for the error for a given time step, the T /ε threshold can change by many orders of magnitude. This can also be a problem more generally in estimating the complexity of an algorithm. Estimating the number of time steps based on the asymptotic expression may correspond to unrealistically large time steps, so the asymptotic formula is inaccurate. We find that significant performance gains can be obtained by fine-tuning a product formula for larger time steps.

A potential topic for future work is further customising product formulae for large time steps. We have customised the 8th order product formula to improve the threshold, but it would also be possible to customise the 6th and 4th order product formulae. It would also be possible to customise the product formulae for larger time steps with the fermionic Hamiltonians. This approach could be used to provide better performance for small-scale simulations suitable for NISQ devices. It could also potentially extend the parameter range where 6th order product formulae provide best performance to larger simulations on fault tolerant quantum computers.

Selection and Improvement of Product Formulae for Best Performance of Quantum Simulation

Full Article

Paradigm

My account