This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A critical survey on the kinetic assays of DNA polymerase fidelity from a new theoretical perspective

Qiu-Shi Li School of Physical Science, University of Chinese Academy of Sciences    Yao-Gen Shu [email protected] Wenzhou Institute, University of Chinese Academy of Sciences    Zhong-Can Ou-Yang Institute of Theoretical Physics, Chinese Academy of Sciences    Ming Li [email protected] School of Physical Science, University of Chinese Academy of Sciences
Abstract

The high fidelity of DNA polymerase is critical for the faithful replication of genomic DNA. Several approaches were proposed to quantify the fidelity of DNA polymerase. Direct measurements of the error frequency of the replication products definitely give the true fidelity but turn out very hard to implement. Two biochemical kinetic approaches, the steady-state assay and the transient-state assay, were then suggested and widely adopted. In these assays, the error frequency is indirectly estimated by using the steady-state or the transient-state kinetic theory combined with the measured kinetic rates. However, whether these indirectly estimated fidelities are equivalent to the true fidelity has never been clarified theoretically, and in particular there are different strategies to quantify the proofreading efficiency of DNAP but often lead to inconsistent results. The reason for all these confusions is that it’s mathematically challenging to formulate a rigorous and general theory of the true fidelity. Recently we have succeeded to establish such a theoretical framework. In this paper, we develop this theory to make a comprehensive examination on the theoretical foundation of the kinetic assays and the relation between fidelities obtained by different methods. We conclude that while the steady-state assay and the transient-state assay can always measure the true fidelity of exonuclease-deficient DNA polymerases, they only do so for exonuclease-efficient DNA polymerases conditionally (the proper way to use these assays to quantify the proofreading efficiency is also suggested). We thus propose a new kinetic approach, the single-molecule assay, which indirectly but precisely characterizes the true fidelity of either exonuclease-deficient or exonuclease-efficient DNA polymerases.

DNA polymerase; fidelity assay; steady state; transient state; single molecule.
pacs:
82.39.-k, 87.15.Rn, 87.16.A-

The high fidelity of DNA polymerase (DNAP) is critical for faithful replication of genomic DNA. Quantitative studies on DNAP fidelity began in 1960s and became an important issue in biochemistry and molecular biology. Intuitively, the DNAP fidelity can be roughly understood as the reciprocal of the overall mismatch (error) frequency when a given DNA template is replicated with both the matched dNTPs (denoted as dRTP or R) and the mismatched dNTPs (denoted as dWTP or W). For instance, the synthetic polymer poly-ABU¯\overline{BU} was used as the template and the replication reaction was conducted with both dRTPs (dATP and dBU¯TP\overline{BU}TP) and dWTP (dGTP). The ratio of the incorporated dRTPs to dWTPs in the final products was then determined to quantify the overall error frequencyTrautner et al. (1962). Similarly, a homopolymer poly-dC was used as the template and the total number of the incorporated dWTP (dTTP) and dRTP (dGTP) was then measured to give the error frequencyHall and Lehman (1968). Beyond such overall fidelity, the site-specific fidelity was defined as the reciprocal of the error frequency at individual template sites. In principle, the error frequency at any template site can be directly counted if a sufficient amount of full-length replication products can be collected and sequenced(this will be denoted as true fidelity fturef_{ture}), e.g. by using deep sequencing techniques Lee et al. (2016); de Paz et al. (2018). However, this type of sequencing-based approach always requires a huge workload and was rarely adopted in fidelity assay. It is also hard to specify the sequence-context influences on the fidelity. A similar but much simpler strategy is to only investigate the error frequency at the assigned template site by single-nucleotide incorporation assays. Such assays are conducted for exoexo^{-}-DNAP (exonuclease-deficient DNAP), in which dRTP and dWTP compete to be incorporated to the primer terminal only at the assigned single template site and the amount of the final reaction products containing the incorporated dRTP or dWTP are then determined by gel analysis to give the error frequency, e.g.e.g.Clayton and Branscomb (1979); Bertram et al. (2010). By designing various template sequences, one can further dissect the sequence-context dependence of the site-specific error frequency. Although the above definitions of DNAP fidelity are simple and intuitive, the direct measurements are very challenging since mismatches occur with too low frequency to be detected even when heavily-biased dNTP pools are used. Besides, the single-nucleotide incorporation assays do not apply to exo+exo^{+}-DNAP (exonuclease-deficient DNAP) because the coexistence of the polymerase activity and the exonuclease activity makes the reaction products very complicated and hard to interpret. Hence two alternative kinetic approaches were proposed.

The steady-state method was developed by A. Fersht for exoexo^{-}-DNAP, which is based on the Michaelis-Menten kinetics of the incorporation of a single dRTP or dWTP at the same assigned template siteFersht (1985).The two incorporation reactions are conducted separately under steady-state conditions to obtain the specificity constant (the quasi-first order rate constant) (kcat/Km)R(k_{cat}/K_{m})_{R} or (kcat/Km)W(k_{cat}/K_{m})_{W} respectively, kcatk_{cat} is the maximal steady-state turnover rate of dNTP incorporation and KmK_{m} is the Michaelis constant. The site-specific fidelity is then characterized as the ratio between the two incorporation velocities, i.e. (kcat/Km)R[dRTP]/(kcat/Km)W[dWTP](k_{cat}/K_{m})_{R}\textmd{[dRTP]}\big{/}(k_{cat}/K_{m})_{W}\textmd{[dWTP]} (denoted as steady-state fidelity fssf_{s\cdot s}) , which is nothing but the specificity commonly defined for multi-substrate enzymes. This assay has been widely acknowledged as the standard method in DNAP fidelity studies. Nevertheless, there is an apparent difference between the specificity and the true fidelity of exoexo^{-}-DNAP. Enzyme specificity is operationally defined and measured under the steady-state condition which is usually established in experiments by two requirements, i.e. the substrate is in large excess to the enzyme, and the enzyme can dissociate from the product after a single turnover is finished. These two requirements are often met by many reactions catalyzed by non-processive enzymes, and the enzyme specificity is indeed a good measure of the relative contents of final products of competing substrates. DNAP, however, is a processive enzyme and rarely dissociates from the template, which violates the second requirement. Additionally, DNA replication in vivo consists of only a single template DNA but many DNAPs, which violates the first requirement. Hence, no steady-state assumptions can be made a priori to single-nucleotide incorporation reactions either invivoin\ vivo or invitroin\ vitro. So, is the enzyme specificity really relevant to the true fidelity of exoexo^{-}-DNAP ? So far as we know, there was only one experiment work which did the comparison and indicated the possible equivalence of fssf_{s\cdot s} to ftruef_{true} for Klenow fragment (KF\textmd{KF}^{-})Bertram et al. (2010), but no theoretical works have ever been published to investigate the true fidelity of DNA replication and examine the equivalence of fssf_{s\cdot s} and ftruef_{true} in general.

Besides the steady-state method, the transient-state kinetic analysis was also proposed to obtain the specificity constantKati et al. (1992); Johnson (1993). Under the pre-steady-state condition or the single-turnover condition, one can obtain the parameter kpol/Kdk_{pol}/K_{d} (a substitute for kcat/Kmk_{cat}/K_{m}) for the single-nucleotide incorporation reactions with exoexo^{-}-DNAP, and define the site-specific fidelity as (kpol/Kd)R[dRTP]/(kpol/Kd)W[dWTP](k_{pol}/K_{d})_{R}\textmd{[dRTP]}\big{/}(k_{pol}/K_{d})_{W}\textmd{[dWTP]} (denoted as transient-state fidelity ftsf_{t\cdot s}). Either kpol/Kdk_{pol}/K_{d} or kcat/Kmk_{cat}/K_{m} can only be properly interpreted by kinetic models, so the relation between the two parameters is actually model-dependent. For the commonly used two-step kinetic model (including only dNTP binding and the subsequent chemical step), it can be shown that they are equalJohnson (1992). For complex models including additional steps (e.g.e.g. DNA binding to DNAP, translocation of DNAP on the template, PPi release, etc.), their equivalence can also be proved in general (details will given in later sections). But again the relevance of ftsf_{t\cdot s} to ftruef_{true} is not yet clarified. Although the experiment has indicated the possible equivalence of ftsf_{t\cdot s} and ftruef_{true} for KF\textmd{KF}^{-}Bertram et al. (2010), a general theoretical examination is still needed.

Further, these methods fail to definitely measure the site-specific fidelity of exo+exo^{+}-DNAP. For exo+exo^{+}-DNAP, the total fidelity is often assumed to consist of two multiplier factors. The first is the initial discrimination finif_{ini} contributed solely by the polymerase domain, which can be given by fssf_{s\cdot s} or ftsf_{t\cdot s}. The second factor is the additional proofreading efficiency fprof_{pro} contributed by the exonuclease domain, which is defined by the ratio of the elongation probability of the terminal R (Pel,RP_{el,R}) to that of the terminal W (Pel,WP_{el,W}). Here the elongation probability is given by Pel=kel/(kel+kex)P_{el}=k_{el}/(k_{el}+k_{ex}), kelk_{el} is the elongation rate to the next site, and kexk_{ex} is the excision rate of the terminal nucleotide at the assigned site (e.g.e.g. Eq.(A1-A6) in Ref.Fersht et al. (1982)). Pel,RP_{el,R} is usually assumed close to 100%, so fprof_{pro} equals approximately to 1+kex,W/kel,W1+k_{ex,W}/k_{el,W}. Although these expressions seem reasonable, there are some problems that were not clarified. First, the definition of fprof_{pro} is subjective though intuitive, so a rigorous theoretical foundation is needed. Second, the rate parameters kelk_{el} and kexk_{ex} are not well defined since both the elongation and the excision are multi-step processes, i.e. kelk_{el} and kexk_{ex} are unknown functions of the involved rate constants but there is not a unique way to define them. They could be theoretically defined under steady-state assumptions (Eq.(6) in Ref.Fersht (1979)) or operationally defined by experiment assays (e.g.e.g. steady-state assaysWingert et al. (2013); Vashishtha and Kuchta (2015) or transient-state assays Donlin et al. (1991)), but different ways often lead to inconsistent interpretations and quite different estimates of fprof_{pro} (as will be clarified in RESULTS AND DISCUSSION Sec.2). Additionally, kelk_{el} should be more properly understood as the effective elongation rate in the sense that the elongated terminal (the added nucleotide) is no longer excised. This condition is not met if the exo+exo^{+}-DNAP can proofread the buried mismatches (e.g.e.g. the penultimate or antepenultimate mismatches, etc.) . In these cases, kelk_{el} is affected not only by the next template site but also by further sites. Such far-neighbor effects were not seriously considered in previous studies. So, what on earth is the relation between the total fidelity ftotf_{tot} (=finifpro=f_{ini}\cdot f_{pro}) and ftruef_{true}?

Recently two equivalent rigorous theories were proposed to investigate the true fidelity of either exoexo^{-}-DNAP or exo+exo^{+}-DNAP, i.e. the iterated function systems by P.Gaspard Gaspard (2017) and the first-passage (FP) method by us Li et al. (2019). In particular, we have obtained very simple and intuitive mathematical formulas by FP method to compute rigorously ftruef_{true} of exo+exo^{+}-DNAP, which can not be achieved by the steady-state or the transient-state analysis. With these firmly established results, we can address all the above questions in detail. In the following sections, we will first give a brief review of the FP method and the major conclusions already obtained for simplified kinetic models of DNA replication. Then we will generalize these conclusions to more realistic kinetic models for exoexo^{-}-DNAP and exo+exo^{+}-DNAP, and carefully examine the relations between fssf_{s\cdot s}, ftsf_{t\cdot s} and ftruef_{true}. In particular, the FP analysis makes it possible to take full advantage of single-molecule techniques to investigate the site-specific fidelity, whereas the conventional steady-state or transient-state analysis applies only to ensemble reactions but not to single-molecule processes. Feasible single-molecule assays for either exoexo^{-}-DNAP or exo+exo^{+}-DNAP will also be suggested.

1. Basics of the FP method

The first-passage (FP) method was proposed to study the replication of the entire template by exo+exo^{+}-DNAP Li et al. (2019), which also applies to single-nucleotide incorporation reactions.

Refer to caption
Figure 1: The highly simplified reaction scheme of DNA replication. EE: the enzyme DNAP. DiD_{i}: the primer-template duplex with primer terminal at the template site ii.

Here the highly simplified reaction scheme Fig.1 is taken as an example to illustrate the basic logic of this method. k¯i\overline{k}_{i} is the incorporation rate of dNTP to the primer terminal at the template site i1i-1 (the dNTP-concentration dependence of k¯i\overline{k}_{i} is not explicitly shown here), r¯i\overline{r}_{i} is the excision rate of the primer terminal at the template site ii. In Fig.1, dRTP and dWTP compete for each template site during the replication, so there will be various sequences in the final full-length products. The FP method describes the entire template-directed replication process by chemical kinetic equations, and directly compute the sequence distribution of the full-length products from which ftruef_{true} can be precisely calculated. It is worth noting that the FP method does not need any extra assumptions like steady-state or quasi-equilibrium assumptions, or need to explicitly solve the kinetic equations as done in the transient-state analysis which is often a formidable task. Some illustrative examples of FP calculations will be given in later sections. Here we only list the major results in terms of k¯i\overline{k}_{i} and r¯i\overline{r}_{i}. Detailed computation can be found in Ref.Li et al. (2019)

Intuitively, k¯i\overline{k}_{i} and r¯i\overline{r}_{i} depend on the identity (A, G, T or C) and the state (matched or mismatched) not only of the base pair at site ii but also of the one or more preceding base pairs. If there are only nearest-neighbor (first-order) effects, k¯i\overline{k}_{i} and r¯i\overline{r}_{i} can be written as k¯αi1αiXi1Xi{\overline{k}}^{X_{i-1}X_{i}}_{\alpha_{i-1}\alpha_{i}} and r¯αi1αiXi1Xi{\overline{r}}^{X_{i-1}X_{i}}_{\alpha_{i-1}\alpha_{i}}, Xi1X_{i-1} (or XiX_{i}) represents the nucleotide at site i1i-1 (or ii) on the template, αi1\alpha_{i-1} represents the nucleotide at site i1i-1 on the primer, αi\alpha_{i} represents the the next nucleotide to be incorporated to the primer terminal at site ii (for k¯αi1αiXi1Xi{\overline{k}}^{X_{i-1}X_{i}}_{\alpha_{i-1}\alpha_{i}}) or the terminal nucleotide of the primer at site ii to be excised (for r¯αi1αiXi1Xi{\overline{r}}^{X_{i-1}X_{i}}_{\alpha_{i-1}\alpha_{i}}). XX and α\alpha can be any of the four types of nucleotides A, G, T and C. Similarly, there are k¯αi2αi1αiXi2Xi1Xi{\overline{k}}^{X_{i-2}X_{i-1}X_{i}}_{\alpha_{i-2}\alpha_{i-1}\alpha_{i}} etc. for the second-order neighbor effects, and so on for far-neighbor (higher-order) effects.

2. The true fidelity calculated by the FP method

For DNAP having first-order neighbor effects, in a wide range of the involved rate constants, we have derived the analytical expression of the fidelity at site ii Li et al. (2019),

ftrue,i[WiRik¯Ri1WiXi1Xik¯Ri1RiXi1Xik¯WiRi+1XiXi+1k¯WiRi+1XiXi+1+r¯Ri1WiXi1Xi]1\displaystyle f_{true,i}\approx\left[\sum\limits_{W_{i}\neq R_{i}}\frac{\overline{k}^{X_{i-1}X_{i}}_{R_{i-1}W_{i}}}{{\overline{k}}^{X_{i-1}X_{i}}_{R_{i-1}R_{i}}}\frac{{\overline{k}}^{X_{i}X_{i+1}}_{W_{i}R_{i+1}}}{{\overline{k}}^{X_{i}X_{i+1}}_{W_{i}R_{i+1}}+{\overline{r}}^{X_{i-1}X_{i}}_{R_{i-1}W_{i}}}\right]^{-1} (1)

RR represents the matched nucleotide, and WW represents any one of the three types of mismatched nucleotides. For simplicity, we omit all the superscripts below unless it causes misunderstanding. Each term in the sum represents the error frequency of a particular type of mismatch, whose reciprocal is the mismatch-specific fidelity studied in the conventional steady-state assay or transient-state assay,

ftrue,ifipolfiexo\displaystyle f_{true,i}\approx f^{pol}_{i}\cdot f^{exo}_{i} (2)

where

fipolk¯Ri1Rik¯Ri1Wi\displaystyle f^{pol}_{i}\approx\frac{\overline{k}_{R_{i-1}R_{i}}}{\overline{k}_{R_{i-1}W_{i}}} (3)

is the initial discrimination, and

fiexo1+r¯Ri1Wik¯WiRi+1\displaystyle f^{exo}_{i}\approx 1+\frac{\overline{r}_{R_{i-1}W_{i}}}{\overline{k}_{W_{i}R_{i+1}}} (4)

is the proofreading efficiency. This is similar to fprof_{pro} defined in INTRODUCTION, if k¯WiRi+1,r¯Ri1Wi\overline{k}_{W_{i}R_{i+1}},\overline{r}_{R_{i-1}W_{i}} are regarded as kel,Wk_{el,W} kex,Wk_{ex,W} respectively.

For DNAP having second-order neighbor effects, with some reasonable assumptions about the rate parameters, we can obtain the fidelity at site ii Li et al. (2019),

ftrue,i\displaystyle f_{true,i}\approx
[WiRik¯Ri2Ri1Wik¯Ri2Ri1RikRi1WiRi+1elkRi1WiRi+1el+r¯Ri2Ri1Wi]1\displaystyle\Big{[}\sum\limits_{W_{i}\neq R_{i}}\frac{\overline{k}_{R_{i-2}R_{i-1}W_{i}}}{\overline{k}_{R_{i-2}R_{i-1}R_{i}}}\frac{k^{el}_{R_{i-1}W_{i}R_{i+1}}}{k^{el}_{R_{i-1}W_{i}R_{i+1}}+\overline{r}_{R_{i-2}R_{i-1}W_{i}}}\Big{]}^{-1}
kRi1WiRi+1el=k¯Ri1WiRi+1k¯WiRi+1Ri+2k¯WiRi+1Ri+2+r¯Ri1WiRi+1\displaystyle k^{el}_{R_{i-1}W_{i}R_{i+1}}=\frac{\overline{k}_{R_{i-1}W_{i}R_{i+1}}\overline{k}_{W_{i}R_{i+1}R_{i+2}}}{\overline{k}_{W_{i}R_{i+1}R_{i+2}}+\overline{r}_{R_{i-1}W_{i}R_{i+1}}} (5)

Each term in the sum represents the mismatch-specific error frequency at site ii. Its reciprocal defines the mismatch-specific fidelity which again consists of the initial discrimination and the proofreading efficiency, but the latter differs significantly from fprof_{pro} in INTRODUCTION, since the effective elongation rate is not k¯Ri1WiRi+1\overline{k}_{R_{i-1}W_{i}R_{i+1}} but instead kRi1WiRi+1elk^{el}_{R_{i-1}W_{i}R_{i+1}} which includes the next-nearest neighbor effects. The same logic can be readily generalized to higher-order neighbor effects where the proofreading efficiency will be more complicated Song et al. (2017); Li et al. (2019).

In real DNA replication, either the dNTP incorporation or the dNMP excision is a multi-step process. By using the FP method, the complex reaction scheme can be reduced to the simplified scheme Fig.1, and the fidelity can still be calculated by Eq.(2)or Eq.(2.), with only one modification: k¯\overline{k} and r¯\overline{r} are now the effective incorporation rates and the effective excision rates respectively which are functions of the involved rate constants. In the following sections, we will derive these functions for different multi-step reaction models, and compare them with those obtained by steady-state or transient-state assays. For simplicity, we only discuss DNAP having first-order neighbor effects in details, since almost all the existing literature focused on this case. Higher-order neighbor effects will also be mentioned in SUMMARY.

1. Fidelity assays of exo--DNAP

1.1. The true fidelity measured by the direct competition assay

Refer to caption
Figure 2: The minimal reaction scheme of the competitive incorporation of dRTP and dWTP. EE: exoexo^{-}-DNAP. DiD_{i}: the primer-template duplex with the matched(R) terminal at site ii. For brevity, the subscript ii in each rate constant is omitted.

Fig.2 shows a three-step kinetic model of the competitive incorporation of a single dRTP or dWTP to site i+1i+1. The true fidelity is precisely given by the ratio of the final product DiRD_{i}R to DiWD_{i}W when the substrate DNA are totally consumed, i.e.,

fture=[DiR][DiW]\displaystyle f_{ture}=\frac{[D_{i}R]}{[D_{i}W]} (6)

which can be calculated by FP method.

A part of the kinetic equations for this model are given below ,

ddt[Diα]\displaystyle\frac{d}{dt}[D_{i}\alpha] =\displaystyle= k2,Rα[EDidαTP]\displaystyle k_{2,R\alpha}[E\cdot D_{i}\cdot d\alpha TP]
ddt[EDidαTP]\displaystyle\frac{d}{dt}[E\cdot D_{i}\cdot d\alpha TP] =\displaystyle= k1,Rα0[EDi][dαTP]\displaystyle k^{0}_{1,R\alpha}[E\cdot D_{i}][d\alpha TP]
\displaystyle- (k2,Rα+rRα)[EDidαTP]\displaystyle(k_{2,R\alpha}+r_{R\alpha})[E\cdot D_{i}\cdot d\alpha TP]

here α=\alpha=R,W. The dNTP binding rate is denoted as k1,Rα=k1,Rα0[dαTP]k_{1,R\alpha}=k^{0}_{1,R\alpha}[d\alpha TP]. The basic idea of FP method is not to directly solve the kinetic equations rigorously (e.g.e.g. in the transient-state analysis) or approximately by imposing extra assumptions (e.g.e.g. the steady-state assumption) . Instead, the two equations are integrated to give the products at time tt,

[Diα](t)\displaystyle[D_{i}\alpha](t) =\displaystyle= k1,Rα0k2,Rαk2,Rα+rRα0t([EDi](τ)[dαTP](τ))𝑑τ\displaystyle\frac{k^{0}_{1,R\alpha}k_{2,R\alpha}}{k_{2,R\alpha}+r_{R\alpha}}\int^{t}_{0}([E\cdot D_{i}](\tau)[d\alpha TP](\tau))d\tau (8)
\displaystyle- k2,Rαk2,Rα+rRα[EDidαTP](t)\displaystyle\frac{k_{2,R\alpha}}{k_{2,R\alpha}+r_{R\alpha}}[E\cdot D_{i}\cdot d\alpha TP](t)

The second term approaches to zero with tt increases to infinity. dNTP is usually in large excess to template DNA either invivoin\ vivo or invitroin\ vitro, so [dNTP] remains approximately a constant during the reaction. Then the fidelity is simply given by

fture\displaystyle f_{ture} =\displaystyle= k¯RR/k¯RW\displaystyle\overline{k}_{RR}\Big{/}\overline{k}_{RW}
k¯RR\displaystyle\overline{k}_{RR} =\displaystyle= k1,RR0k2,RRk2,RR+rRR[dRTP]\displaystyle\frac{k^{0}_{1,RR}k_{2,RR}}{k_{2,RR}+r_{RR}}\textmd{[dRTP]}
k¯RW\displaystyle\overline{k}_{RW} =\displaystyle= k1,RW0k2,RWk2,RW+rRW[dWTP]\displaystyle\frac{k^{0}_{1,RW}k_{2,RW}}{k_{2,RW}+r_{RW}}\textmd{[dWTP]} (9)

ftruef_{true} is exactly the initial discrimination defined by Eq.(3) with the two effective incorporation rates k¯RR\overline{k}_{RR} and k¯RW\overline{k}_{RW}.

In practice, when the reaction time tt is large enough for sufficient product accumulation (i.e., the second term on the right side of Eq.(8) is far smaller than the first term), the measured [DiR](t)/[DiW](t)[D_{i}R](t)/[D_{i}W](t) becomes nearly time-invariant, and thus it is a good measure of ftruef_{true}. In the direct competition assay conducted by Bertram et.alBertram et al. (2010), the incorporation reaction was terminated when about half of the substrate DNA were reacted. This termination criteria per se does not meet the above requirement. Other evidences should be considered. For instance, [DiR](t)/[DiW](t)[D_{i}R](t)/[D_{i}W](t) is proportional to [dRTP]/[dWTP]\textmd{[dRTP]}/\textmd{[dWTP]} if the reaction time tt is large, so one can decide whether tt is sufficient large by examining whether [DiR](t)[dWTP]/[DiW](t)[dRTP][D_{i}R](t)\textmd{[dWTP]}/[D_{i}W](t)\textmd{[dRTP]} becomes nearly a constant when [dWTP] or [dRTP] is changed. Combined with these evidences, Bertram et.al were able to show that [DiR](t)/[DiW](t)[D_{i}R](t)/[D_{i}W](t) measured under their termination condition is really a good measure of the true fidelity.

1.2. Effective rates of multi-step reactions uniquely determined by FP method

Refer to caption
Figure 3: The multi-step incorporation scheme. The enzyme-substrate complex (EDE\cdot D) goes through NN states (indicated by subscripts 1,,N1,...,N) to successfully incorporate a single nucleotide(indicated by subscript ii). To simplify the notation of the rate constants, the superscripts indicating the template nucleotide XiX_{i} and the subscripts indicating the primer nucleotide αi\alpha_{i} are omitted. This omission rule also applies to other figures in this paper, unless otherwise specified.

The above FP treatment can be directly extended to multi-state incorporation schemes like Fig.3 to get the effective incorporation rate, as below,

k¯=k=kN1kN/(rN1+kN)\displaystyle\overline{k}=k^{*}=k^{{}^{\prime}}_{N-1}k_{N}\big{/}(r^{{}^{\prime}}_{N-1}+k_{N})
kj=kj1kj/(rj1+kj)\displaystyle k^{{}^{\prime}}_{j}=k^{{}^{\prime}}_{j-1}k_{j}\big{/}(r^{{}^{\prime}}_{j-1}+k_{j})
rj=rj1rj/(rj1+kj)\displaystyle r^{{}^{\prime}}_{j}=r^{{}^{\prime}}_{j-1}r_{j}\big{/}(r^{{}^{\prime}}_{j-1}+k_{j})
(j=N,N1,,3)\displaystyle(j=N,N-1,...,3)
k2=k1k2/(r1+k2)\displaystyle k^{{}^{\prime}}_{2}=k_{1}k_{2}\big{/}(r_{1}+k_{2})
r2=r1r2/(r1+k2)\displaystyle r^{{}^{\prime}}_{2}=r_{1}r_{2}\big{/}(r_{1}+k_{2}) (10)

Details of the calculation can be found in Supplementary Materials (SM) Sec.I B. Here k1k_{1} is proportional to dNTP concentration k1=k10[dNTP]k_{1}=k^{0}_{1}\textmd{[dNTP]}, so k=k0[dNTP]k^{*}=k^{*0}\textmd{[dNTP]}. The true fidelity is still given by Eq.(3) where k¯RR\overline{k}_{RR} and k¯RW\overline{k}_{RW} are effective rates defined here. Fig.3 describes the processive dNTP incorporation by DNAP without dissociation from the substrate DNA. If the dissociation is considered, the reaction scheme will be more complex, but the effective incorporation rates can still be given as above, as will be shown in RESULTS AND DISCUSSION Sec.2.2.

1.3. The steady-state assay measures the true fidelity

The steady-state assays measure the initial velocity of product generation under the condition that the substrate is in large excess to the enzyme. The normalized velocity per enzyme is in general given by the Michaelis-Menten equation

vsspol=kcat[dNTP][dNTP]+Km\displaystyle v^{pol}_{s\cdot s}=\frac{k_{cat}\textmd{[dNTP]}}{\textmd{[dNTP]}+K_{m}} (11)

Here the superscript polpol indicates the polymerase activity, the subscript sss\cdot s indicates the steady state. Fitting the experimental data by this equation, one can get the specific constant kcat/Kmk_{cat}/K_{m} either for dRTP incorporation or dWTP incorporation and estimate the fidelity (the initial discrimination) by fss=(kcat/Km)R[dRTP]/(kcat/Km)W[dWTP]f_{s\cdot s}=(k_{cat}/K_{m})_{R}\textmd{[dRTP]}\big{/}(k_{cat}/K_{m})_{W}\textmd{[dWTP]}. What is the relation between kcat/Kmk_{cat}/K_{m} and the effective incorporation rate k¯\overline{k} in Eq.(3)?

Refer to caption
Figure 4: The reaction scheme for the steady-state assay to measure the specificity constant of the nucleotide incorporation reaction of exoexo^{-}DNAP.

To understand the exact meaning of kcat/Kmk_{cat}/K_{m}, the complete multi-step incorporation reaction scheme Fig.(4) must be considered, which explicitly includes the DNAP binding step and the dissociation step. The last dissociation step is reasonably assumed irreversible, since the enzyme will much unlikely rebind to the same substrate molecule after dissociation because the substrate is in large excess to the enzyme. Under the steady-state condition, it can be easily shown

kcat[dNTP]Km=kKss\displaystyle\frac{k_{cat}\textmd{[dNTP]}}{K_{m}}=\frac{k^{*}}{K_{s\cdot s}} (12)

Here kk^{*} is defined in Eq.(1.2.). Kss=1+koff/konK_{s\cdot s}=1+k_{off}/k_{on}, kon=kon0[DNA]k_{on}=k^{0}_{on}\textmd{[DNA]}. KssK_{s\cdot s} is exactly the same for either dRTP incorporation or dWTP incorporation. So the fidelity is given as

fss(kcat[dNTP]/Km)Ri1Ri(kcat[dNTP]/Km)Ri1Wi=kRi1RikRi1Wi=fture\displaystyle f_{s\cdot s}\equiv\frac{(k_{cat}\textmd{[dNTP]}/K_{m})_{R_{i-1}R_{i}}}{(k_{cat}\textmd{[dNTP]}/K_{m})_{R_{i-1}W_{i}}}=\frac{k^{*}_{R_{i-1}R_{i}}}{k^{*}_{R_{i-1}W_{i}}}=f_{ture}

This is understandable: the steps before dNTP binding should not contribute to the initial discrimination. However, it does not mean that those steps do not contribute to the total fidelity. Actually they can affect the proofreading efficiency, as will be demonstrated in RESULTS AND DISCUSSION Sec.2.

1.4. The transient-state assay measures the true fidelity

The transient-state assay often refers to two different methods, the pre-steady-state assay or the single-turnover assay. Since the theoretical foundations of these two methods are the same, we only discuss the latter below for simplicity.

In single-turnover assays, the enzyme is in large excess to the substrate, and so the dissociation of the enzyme from the product is neglected. The time course of the product accumulation or the substrate consumption is monitored. The data is then fitted by exponential functions (single-exponential or multi-exponential) to give one or more exponents (i.e. the characteristic rates). In DNAP fidelity assay, these rates are complex functions of all the involved rate constants and dNTP concentration, which in principle can be analytically derived for any given kinetic model. For instance, for the commonly-used simple model including only substrate binding and the subsequent irreversible chemical step, one can directly solve the kinetic equations to get two rate functions. It was proved by K. Johnson that the smaller one obeys approximately the Michaelis-Menten-like equationsJohnson (1992).

vtspol=kpol[dNTP][dNTP]+Kd\displaystyle v^{pol}_{t\cdot s}=\frac{k_{pol}\textmd{[dNTP]}}{\textmd{[dNTP]}+K_{d}} (14)

The subscript tst\cdot s indicates the transient state. Similar to the steady-state assays, kpol/Kdk_{pol}/K_{d} is regarded as the specific constant and thus DNAP fidelity is defined as the enzyme specificity fts=(kpol/Kd)R[dRTP]/(kpol/Kd)W[dWTP]f_{t\cdot s}=(k_{pol}/K_{d})_{R}\textmd{[dRTP]}/(k_{pol}/K_{d})_{W}\textmd{[dWTP]}. It was also shown that kpol/Kdk_{pol}/K_{d} equals to kcat/Kmk_{cat}/K_{m} for the two-step modelJohnson (1992), so fts=fssf_{t\cdot s}=f_{s\cdot s}.

Refer to caption
Figure 5: The reaction scheme for the transient-state assay to measure the specificity constant of the nucleotide incorporation reaction of exoexo^{-}-DNAP.

The equality kpol/Kd=kcat/Kmk_{pol}/K_{d}=k_{cat}/K_{m} actually holds for more general models like Fig.5. The rigorous proof is too lengthy to be presented here (details can be found in SM Sec.III B). Below we only give some intuitive explanations.

Since there are NN states in the reaction scheme Fig.5, the time evolution of the system can be described by NN exponentially-decay functions with NN characteristic rates. If the smallest rate vtspolv^{pol}_{t\cdot s} is much smaller than the others, it can be easily proven that vtspolv^{pol}_{t\cdot s} follows the same form as Eq.(14) in general. Thus kpol/Kdk_{pol}/K_{d} can be obtained by mathematically extrapolating [dNTP] to zero, vtspolkpol[dNTP]/Kdv^{pol}_{t\cdot s}\approx k_{pol}\textmd{[dNTP]}/K_{d}. Intuitively, when [dNTP] approaches to zero, dNTP binding is the rate-limiting step, and all the steps after it will be so slow that the accumulation of each intermediate state is almost zero, i.e. they are approximately in steady state. This gives the velocity per substrate DNA, vtspolk[E1Di1]/[D0]v^{pol}_{t\cdot s}\approx k^{*}[E_{1}\cdot D_{i-1}]/[D_{0}]. kk^{*} is defined by Eq.(1.2.), [D0][D_{0}] is the total concentration of DNA. On the other hand, all the steps before dNTP binding are relatively much faster and approximately in equilibrium, which leads to [E1Di1](kon/koff)[Di1][E_{1}\cdot D_{i-1}]\approx(k_{on}/k_{off})[D_{i-1}], i.e. [E1Di1][D0]/(1+koff/kon)[E_{1}\cdot D_{i-1}]\approx[D_{0}]/(1+k_{off}/k_{on}). Here kon=kon0[E]k_{on}=k^{0}_{on}[E], [E][E] equals almost to the total DNAP concentration [E0][E_{0}] since DNAP is in large excess to DNA. So the normalized velocity per substrate is vtspolk/Ktsv^{pol}_{t\cdot s}\approx k^{*}/K_{t\cdot s}, which leads to

kpol[dNTP]Kd=kKts\displaystyle\frac{k_{pol}\textmd{[dNTP]}}{K_{d}}=\frac{k^{*}}{K_{t\cdot s}} (15)

Kts=1+koff/konK_{t\cdot s}=1+k_{off}/k_{on}. This is exactly the same as Eq.(12). So the fidelity can be given by

fts(kpol[dNTP]/Kd)Ri1Ri(kpol[dNTP]/Kd)Ri1Wi=kRi1RikRi1Wi=fture\displaystyle f_{t\cdot s}\equiv\frac{(k_{pol}\textmd{[dNTP]}/K_{d})_{R_{i-1}R_{i}}}{(k_{pol}\textmd{[dNTP]}/K_{d})_{R_{i-1}W_{i}}}=\frac{k^{*}_{R_{i-1}R_{i}}}{k^{*}_{R_{i-1}W_{i}}}=f_{ture}

1.5. A new approach to measure the true fidelity: a single-molecule assay

As stated above, neither the steady-state assay nor the transient-state assay can give the effective incorporation rates. This is not a problem for fidelity assay of exoexo^{-}-DNAP, but is indeed a serious problem for exo+exo^{+}-DNAP (as shown in later sections). Then, how can one estimate the effective rates by a general method ? A possible way is to directly dissect the reaction mechanism, i.e. measuring the rate constants of each step by transient-state experiments Wong et al. (1991); Tsai and Johnson (2006); Purohit et al. (2003); Joyce et al. (2008); Santoso et al. (2010); Hohlbein et al. (2013), and then one can calculate the effective rate according to Eq.(1.2.). This is a perfect approach but needs heavy work. Are there direct measurements of the effective rates? Here we suggest a possible single-molecule approach based on the FP analysis.

In a typical single-molecule experiment, the different states of the enzyme or the substrate can be distinguished by techniques such as smFRETSantoso et al. (2010); Hohlbein et al. (2013). So, if the state E1Di1E_{1}\cdot D_{i-1} and E1DiE^{{\dagger}}_{1}\cdot D_{i} in Fig.4 can be properly identified, the following single-molecule experiment can be done to measure the effective incorporation rates.

1. Initiate the nucleotide incorporation reaction by adding exoexo^{-}-DNAP and dNTP to the substrate Di1D_{i-1} and begin to record the state-switching trajectory of a single enzyme-DNA complex. Here dNTP can be dRTP or dWTP, and the primer terminal can be matched(R) or mismatched(W). When a single DNAP is captured by the substrate DNA, it can catalyze the incorporation of one or more nucleotides, depending on the template sequence context and the dNTP used. Then one can select a particular time window from the recorded trajectory, starting from the first-arrival at E1Di1E_{1}\cdot D_{i-1} (denoted as starting point) and ending at the first-arrival at E1DiE_{1}\cdot D_{i} (denoted as ending point).

2. In this time window, the system may make multiple visits to E1Di1E_{1}\cdot D_{i-1}. Count the total time the system resides at E1Di1E_{1}\cdot D_{i-1}. This so-called residence time may be clearly measured under low concentrations of dNTP.

3. Collect sufficient samples to get the averaged residence time Γ1,i1\Gamma_{1,i-1}, which gives directly the required effective incorporation rate kαi1αi=1/Γ1αi1k^{*}_{\alpha_{i-1}\alpha_{i}}=1/{\Gamma_{1}}_{\alpha_{i-1}}. Here kαi1αi=kαi1αi0[dαiTP]k^{*}_{\alpha_{i-1}\alpha_{i}}=k^{*0}_{\alpha_{i-1}\alpha_{i}}[d\alpha_{i}TP], α=\alpha=A,T,G,C. The proof of this equality is given in SM Sec.IV A.

The advantage of this single-molecule analysis is its model-independence. Since ki=1/Γ1,i1k^{*}_{i}=1/\Gamma_{1,i-1} holds in general, the measurement of Γ1,i1\Gamma_{1,i-1} does not depend on any hypothesis about the details of the reaction scheme (in fact, steps after dNTP binding are often unclear). So this method is hopefully an alternative of the conventional ensemble assays, particularly in cases where the latter may fail (see later sections).

2. Fidelity assays of exo+-DNAP

It is widely conjectured that the total fidelity of exo+exo^{+}-DNAP consists of the initial discrimination finif_{ini} and the proofreading efficiency fprof_{pro}. The former finif_{ini} can be well characterized by the methods introduced in the preceding sections. The latter fprof_{pro}, however, is assumed equal to 1+kex,W/kel,W1+k_{ex,W}/k_{el,W}, where kexk_{ex} and kelk_{el} are not well defined and may have different meanings in different assays. Below we discuss some usual ways to characterize these rates. The reaction scheme under discussion is shown in Fig.6.

Refer to caption
Figure 6: The multi-step reaction scheme of exo+exo^{+}-DNAP.

In this reaction scheme, before the excision, the primer terminal can transfer between Pol and Exo in two different ways, i.e. the intramolecular transfer without DNAP dissociation (the transfer rates are denoted as kpek_{pe} and kepk_{ep}), and the intermolecular transfer in which DNAP can dissociate from and rebind to either Pol or Exo (the rates are denoted as konk_{on} and koffk_{off}). These two modes have been revealed by single-turnover experimentsDonlin et al. (1991) and directly observed by smFRETLamichhane et al. (2013). kk^{*} is the effective incorporation rate, as explained below.

2.1. Effective rates uniquely determined by the FP method

Applying the FP method to the kinetic equations for the reaction scheme Fig.6, one can reduce this complex scheme to the simplified scheme Fig.1, with rigorously defined effective rates given below. The logic of the reduction is the same as that in RESULTS AND DISCUSSION Sec.1.2. Details can be found in SM Sec.I C.

k¯=k\displaystyle\overline{k}=k^{*} , r¯=k~peqk~ep+q\displaystyle\overline{r}=\frac{\widetilde{k}_{pe}q}{\widetilde{k}_{ep}+q}
k~pe=kpe+kpe\displaystyle\widetilde{k}_{pe}=k_{pe}+k_{p\rightarrow e} , k~ep=kep+kep\displaystyle\widetilde{k}_{ep}=k_{ep}+k_{e\rightarrow p}
kpe=koffpkonekonp+kone\displaystyle k_{p\rightarrow e}=\frac{k^{p}_{off}k^{e}_{on}}{k^{p}_{on}+k^{e}_{on}} , kep=koffekonpkonp+kone\displaystyle k_{e\rightarrow p}=\frac{k^{e}_{off}k^{p}_{on}}{k^{p}_{on}+k^{e}_{on}} (17)

The rate constants can be written more explicitly such as k¯RR=kRR\overline{k}_{RR}=k^{*}_{RR}, if the states of the base pairs at site i,i1,etc.i,i-1,etc. are explicitly indicated. All the rate constants in the same formula have the same state-subscript. kk^{*} is defined by Eq.(1.2.). kpek_{p\rightarrow e} and kepk_{e\rightarrow p} define the effective intermolecular transfer rates between Pol and Exo. So k~pe\widetilde{k}_{pe} and k~ep\widetilde{k}_{ep} represent the total transfer rates via both intramolecular and intermolecular ways. With these effective rates, the real initial discrimination and the real proofreading efficiency can be calculated by ftrue,ini=kRR/kRWf_{true,ini}=k^{*}_{RR}/k^{*}_{RW} and ftrue,pro=1+r¯RW/kWRf_{true,pro}=1+\overline{r}_{RW}/k^{*}_{WR} respectively.

ftrue,prof_{true,pro} differs much from that given by K.Johnson et al. who may be the first to discuss the contribution of the two transfer pathways to the proofreading efficiency of T7 DNAP. Without a rigorous theoretical foundation, they gave intuitively fpro=1+(kpe+θkoffp)W/kel,Wf_{pro}=1+(k_{pe}+\theta k^{p}_{off})_{W}/k_{el,W} Donlin et al. (1991). The effective elongation rate kel,Wk_{el,W} was interpreted as the steady-state incorporation velocity vss,WRpolv^{pol}_{s\cdot s,WR} (at certain [dNTP]), which is incorrect as will be explained in the section below. The ambiguous quantity θ\theta was supposed between 0 and 1 (depending on the fate of the DNA after dissociation) and, unlike r¯\overline{r}, can not be expressed explicitly in terms of konp,koffp,kone,koffek^{p}_{on},k^{p}_{off},k^{e}_{on},k^{e}_{off},etc. So fprof_{pro} is not equivalent to ftrue,prof_{true,pro}.

2.2. The steady-state assay can not measure the proofreading efficiency

Because of the co-existence of the polymerase activity and the exonuclease activity, reaction schemes consisting of a single-nucleotide incorporation and a single-nucleotide excision are theoretically unacceptable and also impossible to implement in experiments. So the usual steady-state assay does not apply to exo+exo^{+}-DNAP. It’s also improper to define the elongation probability by imposing steady-state assumptions to such unrealistic reaction models, as given by Eq.(6) in Ref.Fersht (1979). Nevertheless, the steady-state assay can still be employed to study the polymerase and exonuclease separately.

When mixed with the exo+exo^{+}-DNAP, the substrate DNA can bind either to the polymerase domain or to the exonuclease domain. For some exoexo^{-}-DNAP, the exonuclease domain may exist and still be able to bind (but not excise) the substrate DNA, which is not discussed in preceding sections. For the steady-state assays of dNTP incorporation by such DNAPs, the reaction scheme becomes complicated (Fig.7. Again, the last enzyme dissociation steps are reasonably assumed irreversible under steady-state condition).

Refer to caption
Figure 7: The reaction scheme for the steady-state assay to measure the specificity constant of the nucleotide incorporation reaction of DNAP with deficient exonuclease domain.

Under the steady-state condition, one can easily compute the specific constant

kcat[dNTP]Km=kKss\displaystyle\frac{k_{cat}\textmd{[dNTP]}}{K_{m}}=\frac{k^{*}}{K^{{}^{\prime}}_{s\cdot s}} (18)

Kss=1+kpe/kep+koffp/konpK^{{}^{\prime}}_{s\cdot s}=1+k_{pe}/k_{ep}+k^{p}_{off}/k^{p}_{on}, konp=konp0[DNA]k^{p}_{on}=k^{p0}_{on}\textmd{[DNA]}. So the fidelity is given by

fini(kcat[dNTP]/Km)Ri1Ri(kcat[dNTP]/Km)Ri1Wi=kRi1RikRi1Wi\displaystyle f_{ini}\equiv\frac{(k_{cat}\textmd{[dNTP]}/K_{m})_{R_{i-1}R_{i}}}{(k_{cat}\textmd{[dNTP]}/K_{m})_{R_{i-1}W_{i}}}=\frac{k^{*}_{R_{i-1}R_{i}}}{k^{*}_{R_{i-1}W_{i}}} (19)

which is exactly ftrue,inif_{true,ini}. Combined with Eq.(12), we conclude that the form of Eq.(18) is universal: kk^{*} represents the effective incorporation rate of the subprocess beginning from dNTP binding (DNAP dissociation is not involved), and KssK^{{}^{\prime}}_{s\cdot s} is a simple function of the equilibrium constants of all steps before dNTP binding, no matter how complex the reaction scheme is. Since KssK^{{}^{\prime}}_{s\cdot s} depends only on the identity and the state of the primer terminal but not on the next incoming dNTP, the enzyme specificity is indeed equal to ftrue,inif_{true,ini} in general.

There were also some studies trying the steady-state assay to define the effective elongation rate k¯WR\overline{k}_{WR} and the effective excision rate r¯RW\overline{r}_{RW}. For instance, some works used the specific constantDonlin et al. (1991); Wong et al. (1991) or the maximal turn-over rate kcatk_{cat} Wingert et al. (2013) as k¯WR\overline{k}_{WR}. As shown above, however, k¯\overline{k} is not equal to the specific constant or kcatk_{cat}. So the steady-state assay fails in principle to measure k¯WR\overline{k}_{WR}, unless Kss1K^{{}^{\prime}}_{s\cdot s}\approx 1. This condition is met by T7 DNAP (kpekepk_{pe}\ll k_{ep} and koffpkonpk^{p}_{off}\ll k^{p}_{on} were observed for the mismatched terminalDonlin et al. (1991)), and may even be generally met since replicative DNAPs are believed to be highly processive (i.e. koffpkonpk^{p}_{off}\ll k^{p}_{on} ) and always tend to bind DNA preferentially at the polymerase domain (i.e. kpe/kep1k_{pe}/k_{ep}\ll 1). So the the specific constant, but not kcatk_{cat}, might be used in practice as k¯WR\overline{k}_{WR}.

Refer to caption
Figure 8: The reaction scheme for the steady-state assay to measure the effective excision rate of exo+exo^{+}-DNAP.

In the steady-state assay of the excision reaction (Fig.8), one may measure the initial velocity vssexov^{exo}_{s\cdot s} and interpret it as the effective excision rate r¯\overline{r} Wingert et al. (2013). Whereas vssexov^{exo}_{s\cdot s} is determined by all the rate constants in Fig.8, some rate constants like koffpk^{p{\dagger}}_{off} and kpek^{{\dagger}}_{pe} are absent from r¯\overline{r}. So in principle vssexov^{exo}_{s\cdot s} is not equal to r¯\overline{r}. Under some conditions, e.g.e.g. konekonpk^{e}_{on}\ll k^{p}_{on} and the dissociation of the enzyme from the substrate is fast enough after the excision, the initial velocity vssexov^{exo}_{s\cdot s} may be approximately equal to r¯\overline{r} (details can be found in SM Sec.II C). But these conditions may not be met by real DNAP, e.g.e.g., kone>konpk^{e}_{on}>k^{p}_{on} was observed for T7 polymeraseDonlin et al. (1991). Unless there are carefully designed control tests to provide compelling evidence, vssexov^{exo}_{s\cdot s} itself is not a good measure for r¯\overline{r}.

One can also change the concentration of the substrate DNA to obtain the specific constant of the excision reaction in experimentsVashishtha and Kuchta (2015), as can be shown theoretically

kcat′′[DNA]Km′′=q(kpekonp/(kpe+koffp)+kone)q+kepkoffp/(kpe+koffp)+koffe\displaystyle\frac{k^{{}^{\prime\prime}}_{cat}\textmd{[DNA]}}{K^{{}^{\prime\prime}}_{m}}=\frac{q(k_{pe}k^{p}_{on}/(k_{pe}+k^{p}_{off})+k^{e}_{on})}{q+k_{ep}k^{p}_{off}/(k_{pe}+k^{p}_{off})+k^{e}_{off}} (20)

which is just irrelevant to r¯RW\overline{r}_{RW}. It can be shown further in any case

(kcat′′[DNA]/Km′′)Ri1Wi(kcat[dNTP]/Km)WiRi+1r¯Ri1Wik¯WiRi+1\displaystyle\frac{(k^{{}^{\prime\prime}}_{cat}\textmd{[DNA]}/K^{{}^{\prime\prime}}_{m})_{R_{i-1}W_{i}}}{(k_{cat}\textmd{[dNTP]}/K_{m})_{W_{i}R_{i+1}}}\neq\frac{\overline{r}_{R_{i-1}W_{i}}}{\overline{k}_{W_{i}R_{i+1}}} (21)

Details can be found in SM Sec.II C. In the experiment to estimate fprof_{pro} for ap-polymerseWingert et al. (2013), the authors wrongly interpreted k¯WR\overline{k}_{WR} and r¯RW\overline{r}_{RW} as kcatk_{cat} and vssexov^{exo}_{s\cdot s} respectively , and gave that excision/elongation=(vssexo)Ri1Wi/(kcat)WiRi+1excision/elongation=(v^{exo}_{s\cdot s})_{R_{i-1}W_{i}}/(k_{cat})_{W_{i}R_{i+1}}. They thought this measure roughly reflects the true fidelity. Now it is clear that the two quantities are completely unrelated.

2.3. The transient-state assay can measure the proofreading efficiency conditionally

Refer to caption
Figure 9: The reaction scheme for the transient-state assay to measure the specificity constant of the nucleotide incorporation reaction of DNAP with deficient exonuclease domain.

When DNA can bind to either the polymerase domain or the deficient exonuclease domain, the scheme for the transient-state assay of the dNTP incorporation is depicted in Fig.9. By the same logic presented in preceding sections, the specificity constant defined by transient-state assays can be written as

kpol[dNTP]Kd=kKts\displaystyle\frac{k_{pol}\textmd{[dNTP]}}{K_{d}}=\frac{k^{*}}{K^{{}^{\prime}}_{t\cdot s}} (22)

Kts=1+kpe/kep+koffp/konpK^{{}^{\prime}}_{t\cdot s}=1+k_{pe}/k_{ep}+k^{p}_{off}/k^{p}_{on}. konp=konp0[E0]k^{p}_{on}=k^{p0}_{on}[E_{0}] . This is exactly the same as Eq.(18). So, like the steady-state assay, the transient-state assay also applies in general to estimate ftrue,inif_{true,ini}. Additionally, the specificity constant, but not kpolk_{pol}, can be used to estimate k¯\overline{k} when Kts1K^{{}^{\prime}}_{t\cdot s}\sim 1, as mentioned in the above section.

The transient-state assay of the exonuclease activity is often done under single-turnover conditions. The time course of product accumulation or substrate consumption is fitted by a single exponential or a double exponential to give one or two characteristic rates. In the single exponential case, the rate is simply taken as the effective excision rate r¯\overline{r}. In the double exponential case, however, there is no criteria which one to select. This causes large uncertainty since the two rates often differ by one or more orders of magnitude. In the experiment of T7 polymeraseDonlin et al. (1991), two types of excision reactions were conducted, with or without preincubation of DNA and DNAP. A single characteristic rate was obtained in the former, while two rates were obtained in the latter where the smaller one almost equals to the rate in the former case. So this smaller one was selected as r¯\overline{r}. In the experiment of human mitochondrial DNAPJohnson and Johnson (2001a); Johnson and Johnson (2001b), however, the larger one of the two fitted exponents was selected in some cases. For instance, two fitted exponents of the excision reaction of the substrate 25x1/45 (DNA contains a single mismatch in the primer terminal) are 1.1s11.1s^{-1} and 0.04s10.04s^{-1} and the former was selected as r¯\overline{r}Johnson and Johnson (2001a). Different choices of the exponents can result in estimates of r¯\overline{r} differing by orders of magnitude. In the following, we show that the smallest of the fitted exponents may probably be equal to r¯\overline{r} under some conditions.

Refer to caption
Figure 10: The reaction scheme for the transient-state assay to measure the effective excision rate of exo+exo^{+}-DNAP.

The minimal scheme for the transient-state assay of the excision reaction is depicted in Fig.10. By solving the corresponding kinetic equations, one can get three characteristic rates and the smallest one is given by

vtsexo=\displaystyle v^{exo}_{t\cdot s}=
kpeq(konp+kone)+qkoffpkone(konp+kone)(q+kpe+kep)+koffekonp+koffpkone+ϵ\displaystyle\frac{k_{pe}q(k^{p}_{on}+k^{e}_{on})+qk^{p}_{off}k^{e}_{on}}{(k^{p}_{on}+k^{e}_{on})(q+k_{pe}+k_{ep})+k^{e}_{off}k^{p}_{on}+k^{p}_{off}k^{e}_{on}+\epsilon}

Here ϵ=kpeq+kepkoffp+kpekoffe+qkoffp\epsilon=k_{pe}q+k_{ep}k^{p}_{off}+k_{pe}k^{e}_{off}+qk^{p}_{off}, kon=kon0[E]k_{on}=k^{0}_{on}[E]. When the concentration of DNAP is large enough to ensure konp>koffp,kone>koffe,konp>kpek^{p}_{on}>k^{p}_{off},k^{e}_{on}>k^{e}_{off},k^{p}_{on}>k_{pe} and ϵ0\epsilon\approx 0 (compared to other terms in the denominator), Eq.2.3. can be simplified as

vtsexok~peqk~pe+k~ep+q\displaystyle v^{exo}_{t\cdot s}\approx\frac{\widetilde{k}_{pe}q}{\widetilde{k}_{pe}+\widetilde{k}_{ep}+q} (24)

If k~ep>k~pe\widetilde{k}_{ep}>\widetilde{k}_{pe}, which is met if DNA binds preferentially to the polymerase domain, then we get vtsexor¯v^{exo}_{t\cdot s}\approx\overline{r} (of the same order of magnitude), r¯\overline{r} is defined in Eq.(2.1.). So, if the real excision reaction follows the minimal scheme, vtsexov^{exo}_{t\cdot s} may be interpreted as r¯\overline{r}. In the experiment of human mitochondrial DNAP Johnson and Johnson (2001a); Johnson and Johnson (2001b), the author adopted this interpretation, but used kpolk_{pol} as the effective elongation rate, and calculated the proofreading efficiency as (vtsexo)RW/(kpol)WR(v^{exo}_{t\cdot s})_{RW}/(k_{pol})_{WR}. It is now clear that this quantity is not a proper measure of ftrue,pro(=(vtsexo)RW/kWR)f_{true,pro}\big{(}=(v^{exo}_{t\cdot s})_{RW}/k^{*}_{WR}\big{)}. Here kWRk^{*}_{WR} may probably be replaced by (kpol[dNTP]/Kd)WR(k_{pol}\textmd{[dNTP]}/K_{d})_{WR}, if konp>koffpk^{p}_{on}>k^{p}_{off} and kep>kpek_{ep}>k_{pe} .

It is worth emphasizing that the interpretation of vtsexov^{exo}_{t\cdot s} is severely model-dependent. The reaction scheme could be more complicated than the minimal model in Fig.10, e.g.e.g. there may be multiple substeps in the intramolecular transfer process since the two domains are far apart (2-4 nmBȩbenek and Ziuzia-Graczyk (2018)), particularly when there are buried mismatches in the primer terminal. For any complex scheme, one can calculate the smallest characteristic rate vtsexov^{exo}_{t\cdot s} and the real excision rate r¯\overline{r}. These two functions always differ greatly (examples can be found in SM Sec.III D). So the single-turnover assay per se is not a universally reliable method to measure the effective excision rate. Is there a model-independent method to measure such excision rates? Below we suggest a possible single-molecule assay.

2.4. The single-molecule assay measures the ture fidelity

Similar to RESULTS AND DISCUSSION Sec.1.5, a single-molecule assay can be proposed to directly measure the effective rates k¯\overline{k} and r¯\overline{r}, if the states in Fig.9 and Fig.10 can be well defined in the experiments. For instance, the states EpDE_{p}\cdot D, EeDE_{e}\cdot D and E+DE+D can be clearly resolved by smFRET Markiewicz et al. (2012); Lamichhane et al. (2013).

To measure k¯\overline{k}, the experiment is initiated by mixing DNAP and dNTP to the single molecule DNA. If EpDiE_{p}\cdot D_{i}, EeDiE_{e}\cdot D_{i}, E+DiE+D_{i} and EpDi+1E_{p}\cdot D_{i+1} in Fig.9 can be distinguished in the nucleotide incorporation process, then the residence time at EpDiE_{p}\cdot D_{i} can be counted from a time window of the state-switching trajectory of the enzyme-DNA complex with the starting point EpDiE_{p}\cdot D_{i} and the ending point EpDi+1E_{p}\cdot D_{i+1}. Collecting sufficient samples to obtain the averaged residence time Γp,i\Gamma_{p,i}, one can get ki+1=1/Γp,ik^{*}_{i+1}=1/\Gamma_{p,i} or ki+10=1/(Γp,i[dαi+1TP])k^{*0}_{i+1}=1/(\Gamma_{p,i}[d\alpha_{i+1}TP]) by the FP analysis

The measurement of r¯\overline{r} follows the same logic. The experiment is initiated by adding DNAPs to the single molecule DNA. If the states EpDiE_{p}\cdot D_{i}, EeDiE_{e}\cdot D_{i}, E+DiE+D_{i} and EpDi1E_{p}\cdot D_{i-1} in Fig.10 can be distinguished in the excision process, the state-switching trajectory between the starting point EpDiE_{p}\cdot D_{i} and the ending point EpDi1E_{p}\cdot D_{i-1} can be recorded. Then the averaged residence time Γp,i\Gamma_{p,i} at EpDiE_{p}\cdot D_{i} is obtained, which gives r¯i=1/Γp,i\overline{r}_{i}=1/\Gamma_{p,i}. Sometimes, however, the excision may occur without visiting EpDE_{p}\cdot D. The trajectory recorded in such cases are not taken for the averaging. Detailed explanations can be found in SM Sec.IV B. This analysis also applies to more complex reaction mechanisms and one can always get r¯i=1/Γp,i\overline{r}_{i}=1/\Gamma_{p,i}.

3. More realistic models including DNAP translocation

So far we have not considered the important step, DNAP translocation, in the above kinetic models. Goodman et al. had discussed the effect of translocation on the transient-state gel assay very earlyGoodman et al. (1993), and recently DNAP translocation has been observed for phi29 DNAP by using nanopore techniquesDahl et al. (2012); Lieberman et al. (2012, 2013, 2014) or optical tweezersMorin et al. (2015). However, so far there is no any theory or experiment to study the effect of translocation on the replication fidelity.

Refer to caption
Figure 11: The multi-step reaction scheme of exo+exo^{+}-DNAP including the translocation step.

By using optical tweezers, Morin et al. had shown that DNAP translocation is not powered by PPi release or dNTP bindingMorin et al. (2015) and it’s indeed a thermal ratchet process. So the minimal scheme accounting for DNAP translocation can be depicted as Fig.11. ktk_{t} and rtr_{t} are the forward and the backward translocation rate respectively. EpreDiE_{pre}\cdot D_{i} and EpostDiE_{post}\cdot D_{i} indicate the pre-translocation and the post-translocation state of DNAP respectively. Here, the primer terminal can only switch intramolecularly between EeE_{e} and EpreE_{pre} (but not EpostE_{post}), according to the experimental observation Lieberman et al. (2014). We also assume DNAP can bind DNA either in state EpreDiE_{pre}\cdot D_{i} or in state EpostDiE_{post}\cdot D_{i} with possibly different binding rates and dissociation rates.

This complex scheme can be reduced to the simplified scheme Fig.1 by using the FP analysis. The obtained effective rates are briefly written as k¯=k(1qη/ξ)\overline{k}=k^{*}(1-q\eta^{{}^{\prime}}\big{/}\xi) and r¯=qη/ξ\overline{r}=q\eta\big{/}\xi. Here η,η,ξ\eta,\eta^{{}^{\prime}},\xi are complex functions of all the rate constants in Fig.11 except kk^{*}, which are too complex to be given here (see details in SM Sec.V A). These effective rates are much different from that defined by steady-state or transient-state assays. Below we only show the difference between the calculated ftruef_{true} and the operationally-defined ftsf_{t\cdot s} in transient-state assays.

(ftrue)i\displaystyle(f_{true})_{i} =\displaystyle= (ftrue,ini)i(ftrue,pro)i\displaystyle(f_{true,ini})_{i}(f_{true,pro})_{i}
(fts)i\displaystyle(f_{t\cdot s})_{i} =\displaystyle= (fts,ini)i(fts,pro)i\displaystyle(f_{t\cdot s,ini})_{i}(f_{t\cdot s,pro})_{i} (25)

The initial discrimination can be precisely measured by transient-state assays, (ftrue,ini)i=(fts,ini)i=kRi1Ri/kRi1Wi(f_{true,ini})_{i}=(f_{t\cdot s,ini})_{i}=k^{*}_{R_{i-1}R_{i}}/k^{*}_{R_{i-1}W_{i}}. But the two proofreading efficiencies are hugely different, (fts,pro)i(ftrue,pro)i(f_{t\cdot s,pro})_{i}\neq(f_{true,pro})_{i}. The complex functions (ftrue,pro)i(f_{true,pro})_{i} and (fts,pro)i(f_{t\cdot s,pro})_{i} are given in SM Sec.V A,B. They may be approximately equal only under some extreme conditions, e.g. ktkoffprek_{t}\gg k^{pre}_{off}, ktkpek_{t}\gg k_{pe} and rt>koffpostr_{t}>k^{post}_{off}. This may be true when the terminal is in the matched state, so the translocation is in fast equilibrium and the states prepre and postpost can be treated as a single state, as always assumed in conventional gel assays or other ensemble assays. But these conditions may not be met if there is a terminal mismatch or a buried mismatch which may slow down the translocation Ren (2016). In such cases, ftsf_{t\cdot s} is quite different from fturef_{ture}. To reliably estimate fturef_{ture}, we suggest the following single-molecule assay to directly measure the effective rates.

First, if the states pre and post cannot be distinguished in the experiment, indicating that the translocation is a fast process, the assays presented in preceding sections ( RESULTS AND DISCUSSION Sec.1.5 or Sec.2.4) can be used.

Second, if the translocation is a relatively slow process, either pre or post can be directly observed (e.g. for Dpo4 polymerase by smFRET Brenlla et al. (2014)), then the effective incorporation rate k¯\overline{k} is no longer kk^{*} but k(1qη/ξ)k^{*}(1-q\eta^{{}^{\prime}}\big{/}\xi). However, it’s hard to obtain this effective rate in a single measurement, since it consists of both the polymerase and the exonuclease contributions. Fortunately we can measure the factors kk^{*} and 1qη/ξ1-q\eta^{{}^{\prime}}\big{/}\xi separately. The measurement of kk^{*} is basically the same as that given in RESULTS AND DISCUSSION Sec.1.5 and Sec.2.4. The reaction scheme is shown in Fig.12. The experiment is initiated by mixing DNAP and dNTP to the single molecule DNA. The time trajectory between the starting point EpostDi1E_{post}\cdot D_{i-1} and the ending point EpostDiE_{post}\cdot D_{i} is selected, if EpostDi1E_{post}\cdot D_{i-1}, EpostDiE_{post}\cdot D_{i} and other states can be well distinguished. Then the average residence time at EpostDi1E_{post}\cdot D_{i-1} gives ki=1/Γpost,i1k^{*}_{i}=1/\Gamma_{post,i-1} or ki0=1/(Γpost,i1[dαiTP])k^{*0}_{i}=1/(\Gamma_{post,i-1}[d\alpha_{i}TP]), which defines ftrue,inif_{true,ini}. Detailed explanations can be found in SM Sec.V C.

Refer to caption
Figure 12: The reaction scheme for the suggested single-molecule experiment to measure kk^{*}. The dashed circle represents the starting point, and the dashed rectangle represents the ending point.
Refer to caption
Figure 13: The suggested reaction scheme to interpret the factor qη/ξq\eta^{{}^{\prime}}\big{/}\xi, based on FP analysis. The dashed circle represents the starting point, and the dashed rectangle represents the ending point.

The logic to measure qη/ξq\eta^{{}^{\prime}}\big{/}\xi is given below, as shown in Fig.13.

1. The experiment is initiated by mixing DNAP with DNA.

2. Record the state-switching trajectory of the complex. It may go directly to EpostDi1E_{post}\cdot D_{i-1} without visiting EpreDiE_{pre}\cdot D_{i}. Or it may arrive at EpreDiE_{pre}\cdot D_{i} via whatever pathway before the excision, and then go to EpostDi1E_{post}\cdot D_{i-1} (with or without visiting EpostDiE_{post}\cdot D_{i}). We collect trajectories of the latter case, and denote EpreDiE_{pre}\cdot D_{i} as the starting point (it may be visited repeatedly), EpostDiE_{post}\cdot D_{i} and EpostDi1E_{post}\cdot D_{i-1} as the two ending points.

3. Select all the windows from the trajectories, which are between the starting point and either ending point. The windows are classified in two types, i.e. between EpreDiE_{pre}\cdot D_{i} and EpostDiE_{post}\cdot D_{i}, or between EpreDiE_{pre}\cdot D_{i} and EpostDi1E_{post}\cdot D_{i-1} without visiting EpostDiE_{post}\cdot D_{i}.

4. Count the total number of either type of window npost,in_{post,i}, npost,i1n_{post,i-1}, and one gets npost,i1/(npost,i1+npost,i)=(qη/ξ)in_{post,i-1}\Big{/}\Big{(}n_{post,i-1}+n_{post,i}\Big{)}=(q\eta^{{}^{\prime}}\big{/}\xi)_{i}. Details can be found in SM Sec.V C.

Refer to caption
Figure 14: The reaction scheme for the suggested single-molecule assay to measure r¯\overline{r}. The dashed circle represents the starting point, and the dashed rectangle represents the ending point.

The measurement of r¯\overline{r} follows the same logic in RESULTS AND DISCUSSION Sec.2.4. The reaction scheme is shown in Fig.14. The experiment is initiated by adding DNAPs to the single molecule DNA. The time window selected from the trajectory is between the starting point EpostDiE_{post}\cdot D_{i} and the ending point EpostDi1E_{post}\cdot D_{i-1}, if EpostDiE_{post}\cdot D_{i}, EpostDi1E_{post}\cdot D_{i-1} and other states can be well distinguished. Then the average residence time at EpostDiE_{post}\cdot D_{i} gives r¯i=1/Γpost,i\overline{r}_{i}=1/\Gamma_{post,i}. Similarly, the trajectory recorded without visiting EpostDiE_{post}\cdot D_{i} are not taken for the averaging. Detailed explanations can be found in SM Sec.V C.

The conventional kinetic assays of DNAP fidelity, i.e. the steady-state assay or the transient-state assay, have indicated that the initial discrimination finif_{ini} is about 104510^{4\sim 5} and the proofreading efficiency fprof_{pro} is about 102310^{2\sim 3} Bȩbenek and Ziuzia-Graczyk (2018). Although these assays have been widely used for decades and these estimates of finif_{ini} and fprof_{pro} have been widely cited in the literatures, they are not unquestionable since the logic underlying these methods are not well founded. No rigorous theories about the true fidelity fturef_{ture} have ever been proposed, and its relation to the operationally defined fssf_{s\cdot s} or ftsf_{t\cdot s} has never been clarified.

In this paper, we examined carefully the relations among fssf_{s\cdot s}, ftsf_{t\cdot s} and ftruef_{true}, based on the the FP method recently proposed by us to investigate the true fidelity of exoexo^{-}-DNAP or exo+exo^{+}-DNAP. We conclude that these three definitions are equivalent in general for exoexo^{-}-DNAP, i.e. either the steady-state assay or the transient-state assay can give fturef_{ture} precisely just by measuring the specificity constant (kcat/Kmk_{cat}/K_{m} or kpol/Kdk_{pol}/K_{d}).

For exo+exo^{+}-DNAP, however, the situation is more complicated. The steady-state assay or the transient-state assay can still be applied to measure the initial discrimination finif_{ini}, as done for exoexo^{-}-DNAP (so the above cited estimates of finif_{ini} are reliable). But either method fails to measure the effective elongation rate and the effective excision rate and thus in principle can not characterize fprof_{pro}. So the widely cited estimates fpro1023f_{pro}\sim 10^{2\sim 3} are very suspicious. Our analysis shows that only if the involved rate constants meet some special conditions, the two assays can give approximately the effective incorporation rates, but only the transient-state assay can give approximately the effective excision rate. If there are no other supporting evidences to ensure the required conditions are met, the conventional fidelity assays of fprof_{pro} per se are not reliable. Of course, the transient-state method can be used to measure the rate constants of each step of the excision reactionDonlin et al. (1991) and then fprof_{pro} can be calculated, but this is definitely a hard work.

So we proposed an alternative method, the single-molecule assay, to obtain the fidelity of either exoexo^{-}-DNAP or exo+exo^{+}-DNAP by directly measure all the required effective rates, without dissecting the details of the reaction scheme. It is hopefully a general and reliable method for fidelity assay if some key states of the enzyme-substrate complex can be well resolved by the single-molecule techniques. In RESULTS AND DISCUSSION Sec.1.5, Sec.2.4 and Sec.3, we have designed several protocols to conduct the single-molecule experiment and data analysis, which are feasible in principle though it may be hard to implement in practice.

Last but not least, we have focused on the first-order (nearest) neighbor effects in this paper, but higher-order neighbor effects may also be important to the fidelity. Here we take the second-order (next-nearest) neighbor effect as an example. According to Eq.(2.), the initial discrimination finif_{ini} is of the same form as that defined by Eq.(3), so it can still be correctly given by the steady-state or the transient-state assays (in fact, these assays can measure finif_{ini} for any-order neighbor effects). The proofreading efficiency fprof_{pro} of exo+exo^{+}-DNAP consists of two factors, r¯Ri2Ri1Wi/k¯Ri1WiRi+1\overline{r}_{R_{i-2}R_{i-1}W_{i}}/\overline{k}_{R_{i-1}W_{i}R_{i+1}} and r¯Ri1WiRi+1/k¯WiRi+1Ri+2\overline{r}_{R_{i-1}W_{i}R_{i+1}}/\overline{k}_{W_{i}R_{i+1}R_{i+2}}, which can be regarded as the first-order and the second-order proofreading efficiency respectively. These two factors are both dependent on the stability of the primer-template duplex. For naked dsDNA duplex, numerous experiments have shown that a penultimate mismatch leads to much lower stability than a terminal mismatch SantaLucia and Hicks (2004). This implies that a penultimate mismatch may more significantly disturb the base stacking of the primer-template conjunction in the polymerase domain and thus the forward translocation of DNAP will be slower and the PolPol-to-ExoExo transfer of the primer terminal will be faster, if compared with the terminal mismatch. In such cases, the second-order factor may be larger than the first-order factor. This enhancement has been mentioned in Ref.Johnson and Johnson (2001a), though the steady-state and transient-state assays used in that work and thus the obtained estimates of the two factors are all questionable (as pointed out in RESULTS AND DISCUSSION Sec.2). The single-molecule assay can be directly adopted to demonstrate the second-order effects by measuring the effective rates r¯Ri1WiRi+1,k¯WiRi+1Ri+2\overline{r}_{R_{i-1}W_{i}R_{i+1}},\overline{k}_{W_{i}R_{i+1}R_{i+2}} by the same protocols. We hope the analysis and the suggestions presented in this paper will urge a serious examination of the conventional fidelity assays and offer some new inspirations to single-molecule experimentalists to conduct more accurate fidelity analysis.

The authors thank the financial support by National Natural Science Foundation of China (No.11675180,11774358), the CAS Strategic Priority Research Program (No.XDA17010504), Key Research Program of Frontier Sciences of CAS (No.Y7Y1472Y61), WIUCASYJ2020004 and WIUCASQD2020009.

References

  • Trautner et al. (1962) T. A. Trautner, M. N. Swartz, and A. Kornberg, Proceedings of the National Academy of Sciences of the United States of America 48, 449 (1962).
  • Hall and Lehman (1968) Z. W. Hall and I. R. Lehman, Journal of Molecular Biology 36, 321 (1968).
  • Lee et al. (2016) D. F. Lee, J. Lu, S. Chang, J. J. Loparo, and X. S. Xie, Nucleic Acids Research 44, e118 (2016).
  • de Paz et al. (2018) A. M. de Paz, T. R. Cybulski, A. H. Marblestone, B. M. Zamft, G. M. Church, E. S. Boyden, K. P. Kording, and K. E. Tyo, Nucleic Acids Research 46, e78 (2018).
  • Clayton and Branscomb (1979) K. Clayton and W. Branscomb, Journal of Biological Chemistry 254, 1902 (1979).
  • Bertram et al. (2010) J. G. Bertram, K. Oertell, J. Petruska, and M. F. Goodman, Biochemistry 49, 20 (2010).
  • Fersht (1985) A. R. Fersht, Enzyme Structure and Mechanism (W.H.Freeman & Co Ltd., 1985), 2nd ed.
  • Kati et al. (1992) W. M. Kati, K. A. Johnson, L. F. Jerva, and K. S. Anderson, Journal of Biological Chemistry 267, 25988 (1992).
  • Johnson (1993) K. A. Johnson, Annual Review of Biochemistry 62, 685 (1993).
  • Johnson (1992) K. A. Johnson, The Enzymes 20, 1 (1992).
  • Fersht et al. (1982) A. R. Fersht, J. W. Knill-Jones, and W. C. Tsui, Journal of Molecular Biology 156, 37 (1982).
  • Fersht (1979) A. R. Fersht, Proceedings of the National Academy of Sciences of the United States of America 76, 4946 (1979).
  • Wingert et al. (2013) B. M. Wingert, E. E. Parrott, and S. W. Nelson, Biochemistry 52, 7723 (2013).
  • Vashishtha and Kuchta (2015) A. K. Vashishtha and R. D. Kuchta, Biochemistry 54, 240 (2015).
  • Donlin et al. (1991) M. J. Donlin, S. S. Patel, and K. A. Johnson, Biochemistry 30, 538 (1991).
  • Gaspard (2017) P. Gaspard, Physical Review E 96, 1 (2017).
  • Li et al. (2019) Q. S. Li, P. D. Zheng, Y. G. Shu, Z. C. Ou-Yang, and M. Li, Physical Review E 100 (2019), eprint 1901.01495.
  • Song et al. (2017) Y. S. Song, Y. G. Shu, X. Zhou, Z. C. Ou-Yang, and M. Li, Journal of Physics Condensed Matter 29, 25101 (2017), eprint 1603.02453.
  • Wong et al. (1991) I. Wong, S. S. Patel, and K. A. Johnson, Biochemistry 30, 526 (1991).
  • Tsai and Johnson (2006) Y. C. Tsai and K. A. Johnson, Biochemistry 45, 9675 (2006).
  • Purohit et al. (2003) V. Purohit, N. D. Grindley, and C. M. Joyce, Biochemistry 42, 10200 (2003).
  • Joyce et al. (2008) C. M. Joyce, O. Potapova, A. M. DeLucia, X. Huang, V. P. Basu, and N. D. Grindley, Biochemistry 47, 6103 (2008).
  • Santoso et al. (2010) Y. Santoso, C. M. Joyce, O. Potapova, L. Le Reste, J. Hohlbein, J. P. Torella, N. D. Grindley, and A. N. Kapanidis, Proceedings of the National Academy of Sciences of the United States of America 107, 715 (2010).
  • Hohlbein et al. (2013) J. Hohlbein, L. Aigrain, T. D. Craggs, O. Bermek, O. Potapova, P. Shoolizadeh, N. D. Grindley, C. M. Joyce, and A. N. Kapanidis, Nature Communications 4 (2013).
  • Lamichhane et al. (2013) R. Lamichhane, S. Y. Berezhna, J. P. Gill, E. Van Der Schans, and D. P. Millar, Journal of the American Chemical Society 135, 4735 (2013).
  • Johnson and Johnson (2001a) A. A. Johnson and K. A. Johnson, Journal of Biological Chemistry 276, 38097 (2001a).
  • Johnson and Johnson (2001b) A. A. Johnson and K. A. Johnson, Journal of Biological Chemistry 276, 38090 (2001b).
  • Bȩbenek and Ziuzia-Graczyk (2018) A. Bȩbenek and I. Ziuzia-Graczyk, Current Genetics 64, 985 (2018).
  • Markiewicz et al. (2012) R. P. Markiewicz, K. B. Vrtis, D. Rueda, and L. J. Romano, Nucleic Acids Research 40, 7975 (2012).
  • Goodman et al. (1993) M. F. Goodman, S. Creighton, L. B. Bloom, J. Petruska, and T. A. Kunkel, Critical Reviews in Biochemistry and Molecular Biology 28, 83 (1993).
  • Dahl et al. (2012) J. M. Dahl, A. H. Mai, G. M. Cherf, N. N. Jetha, D. R. Garalde, A. Marziali, M. Akeson, H. Wang, and K. R. Lieberman, Journal of Biological Chemistry 287, 13407 (2012).
  • Lieberman et al. (2012) K. R. Lieberman, J. M. Dahl, A. H. Mai, M. Akeson, and H. Wang, Journal of the American Chemical Society 134, 18816 (2012).
  • Lieberman et al. (2013) K. R. Lieberman, J. M. Dahl, A. H. Mai, A. Cox, M. Akeson, and H. Wang, Journal of the American Chemical Society 135, 9149 (2013).
  • Lieberman et al. (2014) K. R. Lieberman, J. M. Dahl, and H. Wang, Journal of the American Chemical Society 136, 7117 (2014).
  • Morin et al. (2015) J. A. Morin, F. J. Cao, J. M. Lázaro, J. R. Arias-Gonzalez, J. M. Valpuesta, J. L. Carrascosa, M. Salas, and B. Ibarra, Nucleic Acids Research 43, 3643 (2015).
  • Ren (2016) Z. Ren, Nucleic Acids Research 44, 7457 (2016).
  • Brenlla et al. (2014) A. Brenlla, R. P. Markiewicz, D. Rueda, and L. J. Romano, Nucleic Acids Research 42, 2555 (2014).
  • SantaLucia and Hicks (2004) J. SantaLucia and D. Hicks, Annual Review of Biophysics and Biomolecular Structure 33, 415 (2004).