Prethermalization in Fermi-Pasta-Ulam-Tsingou chains
Abstract
The observation of the Fermi-Pasta-Ulam-Tsingou (FPUT) paradox, namely the lack of equipartition in the evolution of a normal mode in a nonlinear chain on unexpectedly long times, is arguably the most famous numerical experiment in the history of physics. Since seventy years after its publication, most studies in FPUT chains still focus on long wavelength initial states similar to the original paper. It is shown here that all characteristic features of the FPUT paradox are rendered even more striking if short(er) wavelength modes are evolved instead. Since not every normal mode leads to equipartition, we also provide a simple technique to predict which modes, and in what order, are excited starting from an initial mode (root) in -FPUT chains. The excitation sequences associated with a root are then shown to spread energy at different speeds, leading to prethermalization regimes that become longer as a function of mode excitation number. This effect is visible in observables such as mode energies and spectral entropies and, surprisingly, also in the time evolution of invariant quantities such as Lyapunov times and Kolmogorov-Sinai entropies. Our findings generalize the original FPUT experiment, provide an original look at the paradox’s source, and enrich the vast literature dedicated to studying equipartition in classical many-body systems.
I Introduction
The numerical experiment of Fermi, Pasta, Ulam and Tsingou (FPUT) [1] was simultaneously the first computational approach in the study of out-of-equilibrium many-body systems and a source of considerable headache shared by generations of physicists in the past 70 years [1, 2, 3, 4]. A proper understanding of FPUT’s findings, which contradicted standard knowledge at the time and were deemed a paradox, relies heavily on modern mathematical and physical tools unavailable to the authors at that time, e.g. a mature formulation of Kolmogorov-Arnol’d-Moser (KAM) theory [5, 6, 7], resonance overlap criteria [8] and Birkhoff-Gustavson normalization [9, 10, 11, 12, 13], and also on the power of modern computers. Being at the intersection of several fields, it is not surprising that this topic has attracted the attention of so many researchers working on nonlinear dynamical systems and statistical mechanics.
The original FPUT experiment consists in evolving the fundamental mode of a harmonic chain in a weakly perturbed, nonintegrable system of which this mode is no longer a stationary state. More specifically, the authors studied the evolution of the first normal mode of a discretized string under the dynamics generated by the Hamiltonian
(1) |
where and are the momentum and position of each discretized mass in the chain, and fixed boundary conditions were used, namely . A very small energy density was chosen for the initial state, i.e. , limiting the size of the cubic term and rendering the system a very weak perturbation of the integrable harmonic chain corresponding to setting . Even for such a weak perturbation the dynamics under (1) ends up being chaotic, justifying the original FPUT expectation that the energy of the initially excited mode would be shared among all other normal modes in a finite (possibly short) time. However, what was verified is that not only energy equipartition was not reached for very long times, but that the energy periodically returned to the initial state and formed recurrences, with a large fraction of modes being barely excited during evolution.
Nowadays, the intersection of a plethora of studies provides a rather converged explanation for the originally observed apparent lack of energy equipartition in finite, weakly anharmonic systems (for reviews, see [2, 14, 15, 16, 4, 17]), although speculations on whether or not the original FPUT setup would eventually reach equilibrium have become a prominent topic in metaphysics. First of all, it is important to note that the energy density chosen by FPUT was extremely small; indeed, a subsequent investigation by Chirikov and Izrailev showed that larger energy densities did result in energy equipartition [18]. Secondly, KAM theory was later proved to be applicable to many types of FPUT-like systems, showing that there is indeed a bound in energy density below which equipartition will never take place [19, 20, 21, 22]. Thirdly, and most relevant to this manuscript, is the fact that the initial state chosen by FPUT is a highly atypical initial state, being a stationary solution of a nearby harmonic chain [23]. Even more, such an atypical initial state is also close to exact stationary time-periodic states coined q-breathers of the anharmonic system under consideration [24, 25]. This strongly contrasts with typical initial conditions, which when expressed in the normal mode basis are combinations of all of its elements. Since such modes form the canonical action variables of the harmonic chain, perturbing a typical initial condition involves an instantaneous coupling of all actions, while for the atypical FPUT initial state only a single action is perturbed. It is therefore only logical that time evolution would be atypical in the original FPUT study, given the atypicality already present in its chosen initial state.
Atypical initial states are, moreover, not all built the same. Their time evolution behaves very differently depending on which initial mode, or root, is excited. The dynamics of roots is characterized by excitation sequences that spread in mode space and, depending on relatively simple algebraic rules, completely or only partially fill it. The case of complete filling is associated to thermal roots, namely states for which time-dependent observables will generally reach equipartition; If the root only excites a subset of modes, known as its bush [26], then equipartition does not take place and the corresponding root is deemed nonthermal. A particularly striking example of nonthermal roots is given by q-breathers, namely families of exact time-periodic orbits which localize in mode space and reduce to the root mode upon tuning the anharmonicity down to zero [24, 25]. Such states show that specific normal modes and their perturbations can also be stationary states of the perturbed chain and provide remarkable counterexamples to the expectation that observables in nonlinear, classical many-body systems necessarily undergo some form of equilibration. They also show that ergodicity-breaking behavior can be observed not only as a function of decreasing energy density, as in the original FPUT experiment, but also as a function of the initial state, as we shall explore here.
This manuscript is devoted to investigating the properties of atypical initial conditions (normal modes) in FPUT chains, and also comparing them to the ones of typical (random) ones. We provide explicit algebraic rules to write mode excitation sequences as a function of the initial root, together with simple criteria to know if such root is thermal or not. We then numerically investigate the time evolution of thermal roots of several different excitation numbers, showing that the paradox associated to the fundamental mode is substantially maximized for higher excited roots, e.g. the prethermal regime and its metastable plateaus become increasingly longer as a function of root excitation number for a fixed energy density. This is shown both by a direct tracking of mode energies as a function of time and by computing equipartition times from spectral entropies.
The impact of root excitation number is not restricted to quantities that depend on the initial state. Indeed, albeit Lyapunov times and Kolmogorov-Sinai entropies being invariant with respect to a large number of transformations and being constant in an ergodic system regardless of the initial state used for computing them [27], we show here that their transient properties are strikingly different from those of typical initial states and clearly reflect the fact that the system is trapped in phase space. More specifically, both the maximal Lyapunov exponent (the inverse of the Lyapunov time) and the Kolmogorov-Sinai entropy show a dip in their time evolution when computed from roots, which is absent if the initial state chosen is typical. Moreover, the duration and depth of this dip increases with excitation number, showing that excited modes remain trapped for even longer in near-integrable portions of phase space. An explanation to this phenomenon is provided by the previously computed excitation sequences of roots, which show that equipartition is fastest for states which spread energy efficiently among modes. The fastest spreader in this sense is always the fundamental root, with the equipartition time slowly increasing with excitation number. Evidently, nonthermal roots are extremely bad spreaders since a portion of mode space is never filled.
The paper is structured as follows. Sec. II discusses the model, which is entirely focused in (1), introducing normal modes, spectral entropies and Lyapunov data which will be thoroughly employed in the rest of the paper. Sec. III presents the rules for computing excitation sequences for roots, which are then evolved in an FPUT chain with masses in Sec. IV. A discussion of our findings can be found in Sec. V, together with a conclusion in Sec. VI. An appendix is also included providing examples of pathological numerical behavior due to bush instability.
II Model and observables
We restrict our analysis to cubic FPUT chains, also known as the -FPUT model, with Hamiltonian (1). In the following we recall how to rewrite the FPUT Hamiltonian in normal mode (phonon) basis, and introduce quantities such as mode energies, spectral entropies and equipartition times. We then describe how to obtain important invariant quantities that will be used in the following sections, such as Lyapunov times and Kolmogorov-Sinai entropies.
II.1 Eigenmode basis
Since (1) has degrees of freedom its harmonic limit has a set of eigenmodes, denoted , with which we can rewrite the position and momentum in (1) at any time as
(2) |
each mode having eigenfrequency and eigenenergy
(3) |
In these canonical coordinates, the Hamiltonian (1) takes the form [19]
(4) |
where [28]
(5) |
Note that is invariant with respect to permutations of its indices.
II.2 Spectral entropy
Eq. (1) is much easier to deal with, numerically speaking, than (II.1). However, we are interested in several properties that need to be computed in the eigenmode basis, such as the eigenenergies themselves and the [normalized] spectral entropy
(6) |
where the Shannon entropy and normalized eigenenergy are given, respectively, by
(7) |
Spectral entropy has been used several times in order to quantify equipartition in nonlinear systems [23, 4, 29, 30]. The logic for employing it is simple: If ideal equipartition is reached, will be stationary and the same for every mode , freezing at some value . If one assumes a Gibbs distribution at equilibrium, can be calculated analytically as a function of the Shannon entropy of the initial state, being given by
(8) |
where is the Euler constant [31]. Since for any initial state, one expects to decrease and follow a complicated and possibly highly oscillatory evolution until eventually hitting its stationary value. Once this happens the dynamics cannot, at least on average, depart significantly from thermal oscillations around . It is then natural to consider that equipartition is reached once hits its equilibrium value for the first time [30]. Such first-passage time furnishes our definition of equipartition time, , and is appealing because it does not require any ad hoc assumptions.
II.3 Lyapunov invariants and Kolmogorov-Sinai entropy
Equipartition time as defined in Sec. II.2 is, evidently, attached to the observables being monitored, namely the mode energies. Different observables will generally reach equipartition at different times (if at all), such that observable equipartition does not actually measure properties intrinsic to the system at hand. Nevertheless, since mode energies are conserved in the unperturbed harmonic chain, one knows well how these observables should behave upon approaching an integrable limit, i.e., equipartition should slow down and eventually stop for sufficiently small energy densities.
There is, however, an observable-independent time scale that is completely intrinsic to the system and should not depend on the initial state, given by the Lyapunov time (the inverse of the maximal Lyapunov exponent). Since, like the maximal Lyapunov exponent (MLE), all exponents in the Lyapunov spectrum (LS) are invariant with respect to homeomorphisms [27], the LS provides a valuable set of time scales related to the average “pull” along each 1-dimensional submanifold that forms the hyperbolic skeleton of the system’s dynamics in phase space. The information in the LS can also be condensed in the form of a scalar, the Kolmogorov-Sinai entropy, which by Pesin’s theorem can be obtained as the sum of all positive exponents in the LS [32].
In order to compute the LS we employ the well-known prescription of [33, 34], which amounts to computing the spectrum of the symmetric matrix
(9) |
In the above represents the system’s stability matrix, which is obtained by solving Hamilton’s equations in tangent space:
(10) |
where is the Hessian with respect to . Since the stability matrix has to be computed along a trajectory, the above equation is solved in parallel with Hamilton’s equations for an initial phase-space point until a final time that is long enough to result in a converged LS. During this evolution, one must also perform QR-diagonalizations of in order to avoid numerical denegenacies, as is well-known and described in, e.g., [34].
As can be seen in (9), the stability matrix will generally depend on the trajectory along which it is calculated, i.e., on the initial phase-space point . However, for a sufficiently large number of degrees of freedom and energy density, phase space will consist of a single metrically intransitive set that is equally accessible to all trajectories in the system [35, 36]. Equivalently, one can formulate this previous property as stating that, for sufficiently large and , the system lies outside the KAM regime in which phase space can be split into a chaotic web and pockets of near-integrability. Thus, since all trajectories are “free to roam” around the entire accessible phase space, the LS is expected to be independent of the trajectory along which it is computed. This independence is a stronger evidence that the system at hand is ergodic than computing observables, and can be ascertained directly from the verification that the KS entropy is independent of the trajectory along which it is computed.
III Excitation sequences
We start this section by writing a very abridged version of Newton’s second law for a given mode , obtained from (II.1):
(11) |
Thus, mode is excited either by a lone mode or a mixture of modes and . Excitations from single modes have and the coupling tensor assumes the form
(12) |
where the first two terms vanished because and are never zero. Now, for mixed excitations we have , so
(13) |
The above expression is composed of: An ascending term, namely , which dictates which higher-order mode to excite starting from and ; A right reflection term, , which dictates which term to excite if , i.e. if the next mode in the ascending sequence is too large and leaves mode space; And a descending term, , which propagates the excitation backward in mode space and is also responsible for controlling left reflections, which take place when the next excited mode in a descending sequence is smaller than zero.
III.1 First excitation cycle and q-breathers
Initializing the system in a root means setting unless . The first excitation cycle, therefore, starts with a lone mode , which by (III) excites modes and . If the first condition is met, then the second delta will be one iff , with even. Thus, if the chain has an odd number of sites and the root chosen is , then all coefficients in the coupling tensor vanish and the energy remains forever localized in the root, i.e. this mode is a q-breather.
Now, assuming that the second Kronecker delta in (III) vanishes, i.e. that , then the only mode that is excited starting from is . This constitutes the first excitation cycle, namely the first run over and indices in (11). Note that excites only if , since cannot be negative or zero. This justifies splitting excitation sequences into two types, namely low- and high-mode excitations, depending on whether the root fulfills or not. The only difference is that for low-mode excitations the initial sequence will be ascending, while it is descending in the high-mode case. Although these scenarios are essentially identical, we will assume low-mode excitations here in order for sequences to start in the ascending phase.
III.2 First ascending sequence
The second cycle for low-mode excitations starts with two excited modes, and . Evidently, will re-excite , which on the other hand will hit due to (III). According to (III) mode will also be excited. Thus, the second excitation cycle results in the sequence . For the third cycle, and will be present due to (III), together with all terms that can be formed from sums as dictated by (III). In this ascending phase, terms created from are not important, since any and will be multiples of that were already excited in previous cycles. Thus, by induction, the first ascending sequence starting from a root , with , is simply
(14) |
until a reflection takes place in (III), i.e. until for some and . The reflection, however, will only excite new modes if is not a multiple of , since otherwise the terms would have been already excited during the ascending sequence. This will be explored in the following.
III.3 First descending sequence
At the end of the first ascending sequence every term is a multiple of the root, . Thus, at the reflection, substituting in the last delta of (III) singles out , allowing us to state two important facts:
-
1.
If divides the first ascending sequence will be reflected into itself (for low modes) or be equivalent to a low mode excitation sequence that reflects into itself (for high modes). This shows, in particular, that is nonthermal for all , since is a always divisible by 2. It also shows that is a trivial thermal root, since it excites all modes in the first ascending sequence for any ;
-
2.
The reflection takes place when a multiple of is larger than for the first time, i.e. when . Evidently this mode lies outside mode space and the excited mode is actually after the reflection. A consequence is that even roots can never excite odd modes, since for even modes is always even. This shows that are nonthermal roots for all and all .
Once a reflection occurs, the selection rule for generates a descending sequence starting from , namely,
(15) |
The descent continues in multiples of until the next mode would exit the chain, i.e. until .
III.4 Full excitation sequences
Once the first descending sequence is over, another reflection takes place, now going from left to right, mediated by in (III). Substituting the last mode number from the descending sequence in this delta, we see that the excited mode after this reflection is . The second ascending sequence will start from this mode number, once again ascending in steps of until . Then, we have another reflection from left to right that excites mode , etc. Instead of writing full abstract sequences, we now provide a few examples showing how easy such sequences are to write in practice.

Let us start with a chain with masses. Since , the only three nontrivial thermal roots in this chain are , and . The root is evidently nonthermal since it divides 24, but let’s take a look at its excitation sequence. The first ascending sequence is , and we stop here because the next term falls outside mode space. The reflection term comes from , which is already excited. This will be now propagated backwards to and accoding to (III), retracing the ascending sequence and never exciting any new mode. Now, for the leftmost reflection, the excited mode is , which does not exist. Thus, the bush of is given by . This is verified numerically in Fig. 1(a).

Let us now consider what happens to for instead. The first ascending sequence is now updated to . The reflection gives , which was not previously excited and is not a multiple of . This will then propagate backwards to , , etc. The full excitation sequence is then given by
(16) |
where the arrows indicate whether the sequence is ascending () or descending (). This excitation sequence is seen to match numerics in Fig. 1(b).
As another example, for and we have
(17) |
In both (III.4) and (III.4) the root resulted in three excitation sequences: two ascending and one descending. If is chosen as the root, then the full excitation sequence for is
(18) |
In general, it is easy to see that since every step in the sequence is performed in multiples of , one needs ascents and descents (or reflections) to cover mode space for a thermal root , no matter what is chosen. Since large requires more cycles to cover mode space, it means less modes are excited at the end of each descending or ascending sequence. One can once again use the case of as an example: Here, at the end of the first ascending sequence, the root has already excited all modes in the chain, which is a unique property of the fundamental root; , however, has only excited 5 of the 13 available modes, and only 3 out of 13. In Sec. IV we will show that this effect strongly impacts any quantity computed from evolving roots.
IV Numerical experiments

In this section we perform computations in the -FPUT chain (1) with sites. This value of is chosen because is not divisible by any odd integer other than 1, such that all odd roots are thermal for this chain. Although initial states, final propagation times and energy densities will vary, evolution is always computed using an optimized second order symplectic integrator [37] with step size . This results in a maximum relative energy error of around for the largest energy densities chosen, and much smaller for most of them. We will focus on the evolution of random initial conditions and the first five odd roots , , , and .
IV.1 Metastable plateaus
Sec. III described how energy can remain localized within a small subset of modes before spreading throughout mode space: The higher excited the root, the smaller the initial subset of excited modes, and also the slower the energy transfer. During this transfer time the dynamics is mostly localized within the excited modes, which essentially “wait” while slowly transferring energy to the ones in the next cycles, forming the metastable plateaus well known in FPUT literature [17, 38]. The longer it takes for the energy to reach the end of the sequence, the longer a metastable plateau survives.
The well-known fact that the metastable plateaus associated to roots become more visible at lower energy densities is unsurprising, since in the limit of zero energy density such modes are normal modes of the associated harmonic lattice. However, the behavior of plateaus for excited roots was not previously addressed. To this end, Fig. 2 shows a comparison of evolving mode energies and spectral entropies for the prototypical -FPUT chain (1) computed starting from the first 5 odd roots in a chain with masses, together with a random initial state. Fig. 2(a) displays the standard mode energy dynamics found in literature: An excitation of monotonically spreads to all other modes and eventually results in energy equipartition, with all ’s converging to the mean energy (black dashed line). Metastable plateaus are clearly visible in the prethermal regime, in which a measurement would indicate that the system is not ergodic and cannot be described by equilibrium statistical mechanics. However, such a statement is evidently incorrect, as waiting a bit longer would result in a system that does display statistical behavior.
The approach from an out-of-equilibrium initial state towards a thermal one is most clearly seen in tracking the spectral entropy as a function of time, shown in the insets: The moment it touches the red line, which represents its equilibrium value (8), is where we consider equipartition to have been achieved. The associated equipartition times are displayed in the main plots as an orange triangle and slowly increase as we move from up to , with the shortest time corresponding to the random initial condition. Evidently, this latter case is less interesting since the equipartition time depends strongly on the initial sampling, but for roots the structures formed by mode energies as they evolve in time show that, indeed, higher-excited roots isolate an increasingly smaller subset of modes in mode space. Fig. 2(a) also shows a dense subset of modes homogeneously concentrated around the mean, while Fig. 2(e) clearly shows that two modes, namely and , remain relatively isolated from the others for much longer times. To see this more clearly, in Fig. 3 we take vertical slices of Fig. 2 at three different fixed times for , and . At low frequency modes that are multiples of 3 and 9 can still be seen to have above-average energies for roots and . At this effect has already dissipated for , but the energies of modes and remain slightly above the mean energy when evolving , as visible in Fig. 3(c).

Evidently, the longer it takes for a subset of modes to reach equipartition, the longer the prethermal regime. This is clearly seen in Fig. 2: The relatively obfuscated metastability associated to the -root seen in panel (a) is already maximized in (b), where is evolved, and becomes more and more blatant as higher-excited modes are used as roots. Evidently, the random initial condition in Fig. 2(e) does not display metastability, since here all modes are excited at once and the slow, selection-rule mediated transferring of energy from excited modes to previously unexcited ones does not take place. Since the mean energy is a constant of motion, the prethermal regime’s duration can also be visualized by plotting the standard deviation of , , as a function of time, which will be approximately constant while the energy is still strongly localized in a subset of normal modes. This is indeed seen to be true in Fig. 4, where it is also clear that the metastability’s lifetime increases monotonically as a function of root excitation for this energy density.
IV.2 Equipartition and Lyapunov times

Spectral entropy (6) is a function of the initial state, so it is no surprise that the equipartition times computed from it in Fig. 2 depend on it. These times have no direct relation to the duration of prethermalization, except for the fact that they are necessarily longer than the length of metastable plateaus. Since random initial conditions do not, at least generally, undergo prethermal regimes, it is expected that the equipartition times of such typical initial conditions will be shorter than the ones of roots for any energy density chosen. This expectation is confirmed in Fig. 5, where we compare equipartition times for 280 typical conditions with the ones obtained from odd roots as a function of the energy density. Clearly, the former are bounded from above by the latter, which slowly increase as a function of excitation number.
If equipartition times reach the order of the propagation time used to compute them, namely , convergence can no longer be achieved for these parameters, which is clear in Fig. 5 by the presence of missing data for some values of energy density. However, the “disappearance” of points does not increase fully monotonically with mode excitation number, with some data displaying what could be described as “erratic behavior”. Since dependence on initial conditions is tantamount to ergodicity breaking, investigating quantities that are proved to be invariant in any ergodic system will shed light into the source of the instabilities seen in Fig. 5. The most important of such quantities is the MLE, , which indicates the presence of chaotic behavior and is not only invariant with respect to diffeomorphisms but also constant for ergodic dynamical systems [27]. From this exponent one can extract the Lyapunov time, , which measures the time a vector tangent to a trajectory takes to align with the maximally chaotic, one-dimensional submanifold in the tangent bundle [39, 40]. Since the Lyapunov time is computed from the divergence of two initially close trajectories, it takes a finite time to converge and it is often useful to observe its evolution as a function of time, as done in Fig. 6.

Panels (a) and (b) of Fig. 6 display the evolution of extracted from propagating odd roots and random initial conditions for an energy density of . Not only is convergence achieved for all initial states employed, but the time-dependent portrait of both typical and atypical initial conditions is identical. Upon decreasing the energy density, however, Fig. 6(c) shows that atypical initial conditions start behaving very differently not only with respect to typical ones, but also among themselves. While the Lyapunov times obtained from and are similar to the ones of the random initial conditions in Fig. 6(d), the data obtained from evolving is seen to start forming a “belly” before convergence. Thus, it shows that lies in a region of phase space that takes longer to align with the maximally chaotic direction, similarly to the trapping times described in [23]. The difference is that here we observe longer trapping times for a fixed energy density by changing only the initial conditions.

By evolving even higher excited roots such as , Fig. 6(c) shows that the “belly” increases when compared to the one of , such that is trapped for a time one order of magnitude longer. Extrapolating to this transient trapping regime, which is a manifestation of prethermalization in an invariant quantity and therefore must disappear for sufficiently long times, ends up being longer than the propagation time and does not converge in our computations. The reason for the longer transient times as we go up from to is that the higher the excitation number, the smaller the subspace of mode space in which dynamics is approximately restricted to during the prethermal regime, as discussed in Sec. III. Indeed, Fig. 3 shows that a large fraction of the energy is still concentrated in multiples of the initial excitation number at equipartition time, which as seen by comparing Figs. 6 and 7 is always longer than Lyapunov time when dealing with atypical initial conditions. Thus, as long as the root is a low lying mode (that is, with ), the trapping time will increase with the excitation number. The breather at marks the boundary between low-mode and high-mode excitations, with being the slowest mode to thermalize for –so slow that there is no point in attempting to obtain data from it, since this is already computationally hard for . It is interesting to note that components of the trajectory that “scan” phase space in order to obtain convergence in are the low-energy ripples that are not multiples of for a root . Evidently, these ripples might not have enough energy to go very far, requiring longer transient times. The energy is spread among all modes most efficiently when evolving , which explains why this state behaves essentially as a random state despite its extremely long equipartition time when compared to the latter.

Fig. 7, which shows the final (converged) Lyapunov times as a function of energy density as obtained from typical and atypical initial conditions, is clear evidence that the system we are dealing with is numerically ergodic: The times are the same, no matter what initial conditions are chosen (although the larger error bars and spread for show that is barely long enough to reach numerical ergodicity for the smallest values of used here). This does not mean, as discussed before, that all data for roots converges, with the main reason for failure being the presence of extremely long prethermal regimes as seen in Fig. 6.
IV.3 Kolmogorov-Sinai entropies
Given the imprint prethermalization has in a dynamical invariant such as the Lyapunov time, in Fig. 8 we display the time evolution of one of the most meaningful, yet computationally expensive, invariant quantities in dynamical systems, namely the Kolmogorov-Sinai entropy, . This quantity, just like the Lyapunov time and the associated maximal exponent, displays a clear dip when computed by propagating roots, while for a typical initial condition it approaches its converged value from above. Since is the sum of all positive Lyapunov exponents, what Fig. 8 shows is that the prethermal regime is present in all phase-space directions, not only the maximal one associated to the MLE. Upon decreasing the energy density we also see a tendency of a larger dip for higher-excited modes, in accordance with our expectation that the prethermal regime is more pronounced for higher excitation numbers. We also note that convergence of Kolmogorov-Sinai entropies to the same final value is likely the strongeast display of numerical ergodicity achievable in simulations.
V Discussion
The root chosen in the original FPUT experiment and most subsequent investigations of the FPUT paradox was the fundamental mode. In essence, and as previously shown by comparisons with the nearby and integrable Toda chain [23], the proximity of the fundamental mode to a stationary state of the FPUT chain ends up trapping it in a near-regular portion of phase space for a finite time. We have shown here that higher-excited roots are trapped for even longer than the fundamental mode in such near-regular regions, even though all other parameters in the system are held constant.
Traditionally, the Lyapunov time is considered to characterize alignment along the maximally chaotic direction in a decomposition of the tangent space known as Oseledec splitting, which is covariant with respect to the Hamiltonian flow [40]. Evidently, such decomposition is meaningless in the case of a near-regular trajectory, and it is expected that the transient regime of will be different depending on how close the initial state is to a near-integrable region. However, the Lyapunov time itself is an invariant quantity in ergodic systems, such that at the end all initial states must provide the same value for the converged MLE (that is, the asymptotic limit of . This was verified in Figs. 6 and 7. Thus, the time needed to escape a region of near-integrability is more related to the equipartition time than to Lyapunov time, since we expect mode space to be fully covered only once a trajectory starting in a root becomes sufficiently chaotic. The Lyapunov time itself then carries absolutely no information on prethermalization, although its time-evolution certainly does. This leads to a reinterpretation of as the time a typical initial condition takes to align with the maximally chaotic covariant direction in tangent space, since atypical ones might not align with it at all. This results in the Lyapunov time being always shorter than the equipartition time of a root, as seen by plotting them together in Fig. 9.

The only time scale longer than the equipartition time of roots appears to be the convergence time of their time-dependent Kolmogorov-Sinai entropies, . Like , this quantity will only converge once the trajectory leaves a near-regular region, but now alignment with respect to a large fraction of covariant Lyapunov directions is required instead of only the maximal one. Since several directions are associated to small Lyapunov exponents it is not necessary to align with all of them to have a well-converged result, but Fig. 8(c) shows that the convergence times are, nevertheless, longer than the equipartition times shown in Fig. 2. Thus, the inescapable conclusion is that equipartition is reached before the trajectory has fully explored the chaotic sea.
It would be interesting to attempt a computation of the Lyapunov time of a submanifold, i.e. calculate starting from a nonthermal mode, e.g. , and see how this exponent relates to the true obtained from a thermal initial condition. Unfortunately this type of numerical investigation is hard, if not impossible, to carry on. This is due to the fact that the bushes associated to nonthermal modes in -FPUT chains are unstable and will not be preserved in long-time evolution no matter which numerical integrator is chosen [26]. Indeed, if the root chosen is nonthermal, by tracking the time-evolution of mode energies one sees that the initial excitation leaves the bush due to numerical errors and eventually covers the whole of mode space. This instability might be responsible for the observation that the evolution of nonthermal modes results in approximately the same Lyapunov times as thermal ones [23], which is surprising given that bush dynamics takes place only in a submanifold of mode space. Nevertheless, it is unclear how ergodicity in mode space relates to that of phase space, such that it is possible that a partial covering of mode space by a large enough bush is enough to resolve the “true” MLE. Besides, it might even happen that dynamics in phase space is ergodic while that in mode space is not, since ergodicity always depends on the observables chosen [41] unless it is tracked by means of invariant quantities [42]. At present, the authors are unaware of a special type of integrator that is capable of preserving bushes of excitations, such that studies regarding dynamical properties of chaotic subspaces cannot be performed in a numerically meaningful fashion. A consequence of this is that the energy in Fig. 1(a), which is correctly localized in the bush associated to , will eventually spread to all modes for long enough times. Nevertheless, the way “equipartition” takes place in this case is pathological and clearly traceable to numerical errors, as can be seen in Appendix A.
VI Conclusion
We have extended previous studies on the cubic Fermi-Pasta-Ulam-Tsingou (-FPUT) model by investigating the consequences of evolving higher-excited normal modes instead of the fundamental. The evolution of such atypical initial conditions was also compared to that obtained from typical (random) ones. We provided an explicit and simple way of predicting mode excitation sequences when exciting only a single normal mode (the root), showing that higher-excited roots take longer to cover mode space, while the fundamental is the fastest energy spreader. Thorough numerical investigations based on computing Lyapunov exponents and Kolmogorov-Sinai entropies show that even invariant quantities carry imprints of the atypical initial states used to compute them in their transient regime, despite converging to the correct invariant value for long times. The prethermal dynamics taking place before convergence is confirmed to be a consequence of some select roots lying close to stationary states also in the -FPUT system, and therefore being trapped in near-regular regions. The escape from such regions is what puts an end to the paradigmatic metastable plateaus in mode energies, whose end marks the moment invariant quantities computed from roots start to converge.
Acknowledgements.
This research was supported by the Institute for Basic Science through Project Code (No. IBS-R024-D1).Appendix A Numerical instability of bushes

The bushes associated with nonthermal roots in -FPUT chains appear to be unstable as they evolve in time. The main consequence of this instability is that energy, which should be only shared among modes in the bush and never leave for any finite time, will end up exciting other modes due to numerical errors. In Fig. 10 we provide two examples of this behavior when evolving provenly nonthermal modes: The roots and in a chain. The first case is essentially a continuation of Fig. 1 for much longer times, until one can see an anomalous and simultaneous jump in the energy of modes that should not be excited starting from ; The root , on the other hand, is even and therefore always nonthermal, undergoing a very similar pathological jump and reaching equipartition in Fig. 10 exclusively due to numerical errors.
References
- Fermi et al. [1955] E. Fermi, J. Pasta, S. Ulam, and M. Tsingou, Studies of nonlinear problems. I, Tech. Rep. LA-1940 (Los Alamos Scientific Lab., N. Mex., 1955).
- Ford [1992] J. Ford, The Fermi-Pasta-Ulam problem: paradox turns discovery, Physics Reports 213, 271 (1992).
- Weissert [1997] T. P. Weissert, The Genesis of Simulation in Dynamics: Pursuing the Fermi-Pasta-Ulam Problem (Springer-Verlag, New York, NY, 1997).
- Gallavotti [2007] G. Gallavotti, The Fermi-Pasta-Ulam problem: a status report, Vol. 728 (Springer Science & Business Media, 2007).
- Kolmogorov [1954] A. N. Kolmogorov, On conservation of conditionally periodic motions for a small change in hamilton’s function, Doklady Akademii Nauk SSSR 98, 527 (1954), translated into English in Lecture Notes in Physics, vol. 93, 1979, Springer.
- Arnold [1963] V. I. Arnold, Small denominators. i. mappings of the circle onto itself, Izvestiya Akademii Nauk SSSR. Seriya Matematicheskaya 25, 21 (1963), translated into English in American Mathematical Society Translations, Series 2, vol. 46, 1965, pp. 213-284.
- Moser [1962] J. Moser, On invariant curves of area-preserving mappings of an annulus, Nachrichten der Akademie der Wissenschaften in Göttingen. II. Mathematisch-Physikalische Klasse 1962, 1 (1962), reprinted in J. Moser, Stable and Random Motions in Dynamical Systems, Princeton University Press, 1973, pp. 89-112.
- Chirikov [1979] B. V. Chirikov, A universal instability of many-dimensional oscillator systems, Physics reports 52, 263 (1979).
- Birkhoff [1927] G. D. Birkhoff, On the periodic motions of dynamical systems, Acta Mathematica 50, 359 (1927).
- Gustavson [1966] F. G. Gustavson, On constructing formal integrals of a hamiltonian system near an equilibrium point, Astronomical Journal 71, 670 (1966).
- Ozorio de Almeida [1990] A. M. Ozorio de Almeida, Hamiltonian Systems: Chaos and Quantization, Cambridge Monographs on Mathematical Physics (Cambridge University Press, Cambridge, 1990).
- Murdock [2003] J. A. Murdock, Normal Forms and Unfoldings for Local Dynamical Systems, Applied Mathematical Sciences, Vol. 182 (Springer, New York, 2003).
- Wiggins [2003] S. Wiggins, Introduction to Applied Nonlinear Dynamical Systems and Chaos, 2nd ed., Texts in Applied Mathematics, Vol. 2 (Springer, New York, 2003).
- Poggi and Ruffo [1997] P. Poggi and S. Ruffo, Exact solutions in the FPU oscillator chain, Physica D: Nonlinear Phenomena 103, 251 (1997).
- Berman and Izrailev [2005] G. Berman and F. Izrailev, The Fermi–Pasta–Ulam problem: fifty years of progress, Chaos: An Interdisciplinary Journal of Nonlinear Science 15 (2005).
- Campbell et al. [2005] D. K. Campbell, P. Rosenau, and G. M. Zaslavsky, Introduction: the Fermi–Pasta–Ulam problem—the first fifty years, Chaos: An Interdisciplinary Journal of Nonlinear Science 15 (2005).
- Benettin et al. [2008] G. Benettin, A. Carati, L. Galgani, and A. Giorgilli, The Fermi-Pasta-Ulam problem and the metastability perspective, The Fermi-Pasta-Ulam problem: a status report , 151 (2008).
- Chirikov and Izrailev [1970] B. V. Chirikov and F. M. Izrailev, Statistical properties of a nonlinear string, Soviet Physics Doklady 11, 30 (1970).
- Nishida [1971] T. Nishida, A note on an existence of conditionally periodic oscillation in a one-dimensional anharmonic lattice, Memoirs of the Faculty of Engineering, Kyoto University 33, 27 (1971).
- Rink and Verhulst [2000] B. Rink and F. Verhulst, Near-integrability of periodic fpu-chains, Physica A: Statistical Mechanics and its Applications 285, 467 (2000).
- Rink [2001] B. Rink, Symmetry and resonance in periodic fpu chains, Communications in Mathematical Physics 218, 665 (2001).
- Verhulst [2020] F. Verhulst, Variations on the fermi-pasta-ulam chain, a survey, in Chaotic Modeling and Simulation International Conference (Springer, 2020) pp. 1025–1042.
- Casetti et al. [1997] L. Casetti, M. Cerruti-Sola, M. Pettini, and E. Cohen, The fermi-pasta-ulam problem revisited: Stochasticity thresholds in nonlinear hamiltonian systems, Physical Review E 55, 6566 (1997).
- Flach et al. [2005] S. Flach, M. Ivanchenko, and O. Kanakov, q-breathers and the fermi-pasta-ulam problem, Physical review letters 95, 064102 (2005).
- Flach et al. [2006] S. Flach, M. Ivanchenko, and O. Kanakov, Q-breathers in Fermi-Pasta-Ulam chains: Existence, localization, and stability, Physical Review E—Statistical, Nonlinear, and Soft Matter Physics 73, 036618 (2006).
- Chechin et al. [2002] G. Chechin, N. Novikova, and A. Abramenko, Bushes of vibrational modes for fermi–pasta–ulam chains, Physica D: Nonlinear Phenomena 166, 208 (2002).
- Eichhorn et al. [2001] R. Eichhorn, S. J. Linz, and P. Hänggi, Transformation invariance of lyapunov exponents, Chaos, Solitons & Fractals 12, 1377 (2001).
- Flach and Ponno [2008] S. Flach and A. Ponno, The fermi–pasta–ulam problem: Periodic orbits, normal forms and resonance overlap criteria, Physica D: Nonlinear Phenomena 237, 908 (2008).
- Cretegny et al. [1998] T. Cretegny, T. Dauxois, S. Ruffo, and A. Torcini, Localization and equipartition of energy in the -fpu chain: Chaotic breathers, Physica D: Nonlinear Phenomena 121, 109 (1998).
- Danieli et al. [2017] C. Danieli, D. Campbell, and S. Flach, Intermittent many-body dynamics at equilibrium, Physical Review E 95, 060202 (2017).
- Goedde et al. [1992] C. G. Goedde, A. J. Lichtenberg, and M. A. Lieberman, Chaos and the approach to equilibrium in a discrete sine-Gordon equation, Physica D: Nonlinear Phenomena 59, 200 (1992).
- Pesin [1977] Y. B. Pesin, Characteristic Lyapunov exponents and smooth ergodic theory, Russian Mathematical Surveys 32, 55 (1977).
- Benettin et al. [1980] G. Benettin, L. Galgani, A. Giorgilli, and J.-M. Strelcyn, Lyapunov characteristic exponents for smooth dynamical systems and for hamiltonian systems; a method for computing all of them. part 1: Theory, Meccanica 15, 9 (1980).
- Geist et al. [1990] K. Geist, U. Parlitz, and W. Lauterborn, Comparison of different methods for computing lyapunov exponents, Progress of theoretical physics 83, 875 (1990).
- Abraham and Marsden [2008] R. Abraham and J. E. Marsden, Foundations of mechanics, 364 (American Mathematical Soc., 2008).
- Palis and De Melo [2012] J. J. Palis and W. De Melo, Geometric theory of dynamical systems: an introduction (Springer Science & Business Media, 2012).
- McLachlan and Atela [1992] R. I. McLachlan and P. Atela, The accuracy of symplectic integrators, Nonlinearity 5, 541 (1992).
- Matsuyama and Konishi [2015] H. J. Matsuyama and T. Konishi, Multistage slow relaxation in a Hamiltonian system: The Fermi-Pasta-Ulam model, Physical Review E 92, 022917 (2015).
- Eckmann and Ruelle [1985] J.-P. Eckmann and D. Ruelle, Ergodic theory of chaos and strange attractors, Reviews of modern physics 57, 617 (1985).
- Ginelli et al. [2007] F. Ginelli, P. Poggi, A. Turchi, H. Chaté, R. Livi, and A. Politi, Characterizing dynamics with covariant Lyapunov vectors, Physical review letters 99, 130601 (2007).
- Baldovin et al. [2021] M. Baldovin, A. Vulpiani, and G. Gradenigo, Statistical mechanics of an integrable system, Journal of Statistical Physics 183, 41 (2021).
- Lando and Flach [2023] G. M. Lando and S. Flach, Thermalization slowing down in multidimensional josephson junction networks, Physical Review E 108, L062301 (2023).