Ergodic observables in non-ergodic systems: the example of the harmonic chain
Abstract
In the framework of statistical mechanics the properties of macroscopic systems are deduced starting from the laws of their microscopic dynamics. One of the key assumptions in this procedure is the ergodic property, namely the equivalence between time averages and ensemble averages. This property can be proved only for a limited number of systems; however, as proved by Khinchin [1], weak forms of it hold even in systems that are not ergodic at the microscopic scale, provided that extensive observables are considered.
Here we show in a pedagogical way the validity of the ergodic hypothesis, at a practical level, in the paradigmatic case of a chain of harmonic oscillators. By using analytical results and numerical computations, we provide evidence that this non-chaotic integrable system shows ergodic behavior in the limit of many degrees of freedom. In particular, the Maxwell-Boltzmann distribution turns out to fairly describe the statistics of the single particle velocity. A study of the typical time-scales for relaxation is also provided.
I Introduction
Since the seminal works by Maxwell, Boltzmann and Gibbs, statistical mechanics has been conceived as a link between the microscopic world of atoms and molecules and the macroscopic one where everyday phenomena are observed [2]. The same physical system can be described, in the former, by an enormous number of degrees of freedom (of the same order of the Avogadro number) or, in the latter, in terms of just a few thermodynamics quantities. Statistical mechanics is able to describe in a precise way the behavior of these macroscopic observables, by exploiting the knowledge of the laws for the microscopic dynamics and classical results from probability theory. Paradigmatic examples of this success are, for instance, the possibility to describe the probability distribution of the single-particle velocity in an ideal gas [2, 3], as well as the detailed behavior of phase transitions [4, 5] and critical phenomena [6, 7]. In some cases (Bose-Einstein condensation [8], absolute negative temperature systems [9, 10, 11]) the results of statistical mechanics were able to predict states of the matter that were never been observed before.
In spite of the above achievements, a complete consensus about the actual reasons for such a success has not been yet reached within the statistical mechanics community. The main source of disagreement is the so-called “ergodic hypothesis”, stating that time averages (the ones actually measured in physics experiments) can be computed as ensemble averages (the ones appearing in statistical mechanics calculations). Specifically, a system is called ergodic when the value of the time average of any observable is the same as the one obtained by taking the average over the energy surface, using the microcanonical distribution [12]. It is worth mentioning that, from a mathematical point of view, ergodicity holds only for a small amount of physical systems: the KAM theorem [13, 14, 15] establishes that, strictly speaking, non-trivial dynamics cannot be ergodic. Nonetheless, the ergodic hypothesis happens to work extremely well also for non-ergodic systems. It provides results in perfect agreement with the numerical and experimental observations, as seen in a wealth of physical situations [16, 17, 18].
Different explanations for this behavior have been provided. Without going into the details of the controversy, three main points of view can be identified: (i) the “classical” school based on the seminal works by Boltzmann and the important contribution of Khinchin, where the main role is played by the presence of many degrees of freedom in the considered systems [1, 19, 20, 21, 22, 23]; (ii) those, like the Prigogine school, who recognize in the chaotic nature of the microscopic evolution the dominant ingredient [24, 25, 26, 27]; (iii) the maximum entropy point of view, which does not consider statistical mechanics as a physical theory but as an inference methodology based on incomplete information [28, 29, 30, 31].
The main aim of the present contribution is to clarify, at a pedagogical level, how ergodicity manifests itself for some relevant degrees of freedom, in non-ergodic systems. We say that ergodicity occurs “at a practical level”. To this end, a classical chain of coupled harmonic oscillators turns out to be an excellent case study: being an integrable system, it cannot be suspected of being chaotic; still, “practical” ergodicity is recovered for relevant observables, in the limit of . We believe that this kind of analysis supports the traditional point of view of Boltzmann, which identifies the large number of degrees of freedom as the reason for the occurrence of ergodic behavior for physically relevant observables. Of course, these conclusions are not new. In the works of Khinchin (and then Mazur and van der Lynden) [1, 32, 33, 34, 35] it is rigorously shown that the ergodic hypothesis holds for observables that are computed as an average over a finite fraction of the degrees of freedom, in the limit of . Specifically, if we limit our interest to this particular (but non-trivial) class of observables, the ergodic hypothesis holds for almost all initial conditions (but for a set whose probability goes to zero for ), within arbitrary accuracy. In addition, several numerical results for weakly non-linear systems [36, 37, 38], as well as integrable systems [14, 39], present strong indications of the poor role of chaotic behaviour, implying the dominant relevance of the many degrees of freedom. Still, we think it may be useful, at least from a pedagogical point of view, to analyze an explicit example where analytical calculations can be made (to some extent), without losing physical intuition about the model.
The rest of this paper is organized as follows. In Section II we briefly recall basic facts about the chosen model, to fix the notation and introduce some formulae that will be useful in the following. Section III contains the main result of the paper. We present an explicit calculation of the empirical distribution of the single-particle momentum, given a system starting from out-of-equilibrium initial conditions. We show that in this case the Maxwell-Boltzmann distribution is an excellent approximation in the limit. Section IV is devoted to an analysis of the typical times at which the described ergodic behavior is expected to be observed; a comparison with a noisy version of the model (which is ergodic by definition) is also provided. In Section V we draw our final considerations.
II Model
We are interested in the dynamics of a one-dimensional chain of classical harmonic oscillators of mass . The state of the system is described by the canonical coordinates with ; here identifies the momentum of the -th oscillator at time , while represents its position. The -th and the -th particles of the chain interact through a linear force of intensity , where is the elastic constant. We will assume that the first and the last oscillator of the chain are coupled to virtual particles at rest, with infinite inertia (the walls), i.e. .
The Hamiltonian of the model reads therefore
(1) |
where .
Such a system is integrable and, therefore, trivially non-ergodic. This can be easily seen by considering the normal modes of the chain, i.e. the set of canonical coordinates
(2a) | |||
(2b) |
with . Indeed, by rewriting the Hamiltonian in terms of these new canonical coordinates one gets
(3) |
where the frequencies of the normal modes are given by
(4) |
In other words, the system can be mapped into a collection of independent harmonic oscillators with characteristic frequencies . This system is clearly non-ergodic, as it admits integrals of motion, namely the energies
associated to the normal modes.
In spite of its apparent simplicity, the above system allows the investigation of some nontrivial aspects of the ergodic hypothesis, and helps clarifying the physical meaning of this assumption.
III Ergodic behavior of the momenta
In this section we analyze the statistics of the single-particle momenta of the chain. We aim to show that they approximately follow a Maxwell-Boltzmann distribution
(5) |
in the limit of large , where is the inverse temperature of the system. With the chosen initial conditions, . Firstly, extending some classical results by Kac [40, 41], we focus on the empirical distribution of the momentum of one particle, computed from a unique long trajectory, namely
(6) |
Then we consider the marginal probability distribution computed from the momenta of all the particles at a specific time , i.e.
(7) |
In both cases we assume that the system is prepared in an atypical initial condition. More precisely, we consider the case in which , for all , and the total energy , at time , is equally distributed among the momenta of the first normal modes, with :
(8) |
In this case, the dynamics of the first normal modes is given by
(9) | ||||
III.1 Empirical distribution of single-particle momentum
Our aim is to compute the empirical distribution of the momentum of a given particle , i.e., the distribution of its values measured in time. This analytical calculation was carried out rigorously by Mazur and Montroll in Ref. [42]. Here, we provide an alternative argument that has the advantage of being more concise and intuitive, in contrast to the mathematical rigour of [42]. Our approach exploits the computation of the moments of the distribution; by showing that they are the same, in the limit of infinite measurement time, as those of a Gaussian, it is possible to conclude that the considered momentum follows the equilibrium Maxwell-Boltzmann distribution. The assumption will enter explicitly the calculation.
The momentum of the -th particle can be written as a linear combination of the momenta of the normal modes by inverting Eq. (2b):
(10) | ||||
where the ’s are defined by Eq. (4), and the dynamics (9) has been taken into account. The -th empirical moment of the distribution is defined as the average of the -th powerof over a measurement time :
(11) | ||||
with
(12) |
We want to study the integral appearing in the last term of the above equation. To this end it is useful to recall that
(13) |
As a consequence, one has
(14) |
Indeed, we are just averaging over periods of the integrated function, obtaining the same result we get for a single period, with a correction of the order . This correction comes from the fact that is not, in general, an exact multiple of . If , , …, are incommensurable (i.e., their ratios cannot be expressed as rational numbers), provided that is much larger than for each choice of , a well known result [40] assures that
(15) | ||||
where the last step is a consequence of Eq. (14). Instead, if at least one of the ’s is odd, the above quantity vanishes, again with corrections due to the finite time . Since the smallest sfrequency is , one has that the error is at most of the order .

Let us consider again the integral in the last term of Eq. (11). The ’s are, in general, incommensurable. Therefore, the integral vanishes when is odd, since in that case at least one of the , , will be odd. When is even, the considered quantity is different from zero as soon as the ’s are pairwise equal, so that . In the following we will neglect the contribution of terms containing groups of four or more equal ’s: if , the number of these terms is indeed times less numerous than the pairings, and it can be neglected if (which is one of our assumptions on the initial condition). Calling the set of possible pairings for the vector , we have then
(16) |
with an error of due to neglecting groups of 4, 6 and so on, and an error due to the finite averaging time , as discussed before. Factor comes from the explicit evaluation of Eq. (15) .
At fixed , we need now to estimate the sums appearing in the above equation, recalling that the ’s are pairwise equal. If , the arguments of the periodic functions can be thought as if independently extracted from a uniform distribution . One has:
(17) |
and
(18) |
if . As a consequence
(19) |
where is the number of ways in which we can choose the pairings. These are the moments of a Gaussian distribution with zero average and variance.
Summarising, it is possible to show that, if , the first moments of the distribution are those of a Maxwell-Boltzmann distribution. In the limit of with fixed, the Gaussian distribution is thus recovered up to an arbitrary number of moments. Let us note that the assumption , while allowing to make the calculations clearer, is not really relevant. Indeed, if we can repeat the above computation while replacing by , where the phases take into account the initial conditions.
Fig. 1 shows the standardized histogram of the relative frequencies of single-particle velocities of the considered system, in the limit, with the initial conditions discussed before. As expected, the shape of the distribution tends to a Gaussian in the large-time limit.
III.2 Distribution of momenta at a given time
A similar strategy can be used to show that, at any given time large enough, the histogram of the momenta is well approximated by a Gaussian distribution. Again, the large number of degrees of freedom plays an important role. We want to compute the empirical moments
(20) |
defined according to the distribution introduced by Eq. (6). Using again Eq. (10) we get
(21) | ||||
Reasoning as before, we see that the sum over vanishes in the large limit unless the ’s are pairwise equal. Again, we neglect the terms including groups of 4 or more equal ’s, assuming that , so that their relative contribution is . That sum selects paired values of for the product inside the square brackets, and we end with
(22) |
If is “large enough” (we will come back to this point in the following section), different values of lead to completely uncorrelated values of . Hence, as before, we can consider the arguments of the cosines as extracted from a uniform distribution, obtaining
(23) |
These are again the moments of the equilibrium Maxwell-Boltzmann distribution. We had to assume , meaning that a Gaussian distribution is recovered only in the limit of large number of degrees of freedom.
The empirical distribution can be compared with the Maxwell-Boltzmann by looking at the Kullback-Leibler divergence which provides a sort of distance between the empirical and the Maxwell-Boltzmann:
(24) |
Figure 2 shows how the Kullback-Leibler divergences approach their equilibrium limit, for different values of . As expected, the transition happens on a time scale that depends linearly on .
A comment is in order: even if this behaviour may look similar to the H-Theorem for diluited gases, such a resemblance is only superficial. Indeed, while in the cases of diluited gases the approach to the Maxwell-Boltzmann is due to the collisions among different particles that actually exchange energy and momentum, in the considered case the “thermalization” is due to a dephasing mechanism.

IV Analysis of the time scales
In the previous section, when considering the distribution of the momenta at a given time, we had to assume that was “large enough” in order for our approximations to hold. In particular we required and to be uncorrelated as soon as . Such a dephasing hypothesis amounts to asking that
(25) |
where is the number of phases by which the two oscillator have to differ before they can be considered uncorrelated. The constant may be much larger than 1, but it is not expected to depend strongly on the size of the system. In other words, we require
(26) |
for each choice of and . To estimate this typical relaxation time, we need to pick the minimum value of among the possible pairs . This term is minimized when and (or vice-versa), with chosen such that is minimum. In the large- limit this quantity is approximated by
(27) |
which is minimum when is maximum, i.e. for .
Dephasing is thus expected to occur at
(28) |
i.e. in the limit.
It is instructive to compare this characteristic time with the typical relaxation time of the “damped” version of the considered system. For doing so, we assume that our chain of oscillators is now in contact with a viscous medium which acts at the same time as a thermal bath and as a source of viscous friction. By considering the (stochastic) effect of the medium, one gets the Klein-Kramers stochastic process [43, 44]
(29) |
where is the damping coefficient and is the temperature of the thermal bath (we are taking the Boltzmann constant equal to 1). Here the are time-dependent, delta-correlated Gaussian noises such that . Such a system is surely ergodic and the stationary probability distribution is the familiar equilibrium one
(30) |
Also in this case we can consider the evolution of the normal modes. By taking into account Eqs. (2) and (29) one gets
(31) | ||||
where the are again delta-correlated Gaussian noises. It is important to notice that also in this case the motion of the modes is independent (i.e. the friction does not couple normal modes with different ); nonetheless, the system is ergodic, because the presence of the noise allows it to explore, in principle, any point of the phase-space.
The Fokker-Planck equation for the evolution of the probability density function of the -th normal mode can be derived using standard methods [43]:
(32) |
The above equation allows to compute also the time dependence of the correlation functions of the system in the stationary state. In particular one gets
(33) |
and
(34) |
which, once combined together, lead to
(35) |
For the solution of this equation admits two characteristic frequencies , namely
(36) |
In the limit one has therefore
(37) |
Therefore, as a matter of fact, even in the damped case the system needs a time that scales as in order to get complete relaxation for the modes. As we discussed before, the dephasing mechanism that guarantees for “practical” ergodicity in the deterministic version is instead expected to occur on time scales of order .
V Conclusions
The main aim of this paper was to expose, at a pedagogical level, some aspects of the foundation of statistical mechanics, namely the role of ergodicity for the validity of the statistical approach to the study of complex systems.
We analyzed a chain of classical harmonic oscillators (i.e. a paradigmatic example of integrable system, which cannot be suspected to show chaotic behaviour). By extending some well-known results by Kac [40], we showed that the Maxwell-Bolzmann distribution approximates with arbitrary precision (in the limit of large number of degrees of freedom) the empirical distribution of the momenta of the system, after a dephasing time which scales with the size of the chain. This is true also for quite pathological initial conditions, where only a small fraction of the normal modes is excited at time . The scaling of the typical dephasing time with the number of oscillators may appear as a limit of our argument, since this time will diverge in the thermodynamic limit; on the other hand one should consider, as explicitely shown before, that the damped version of this model (which is ergodic by definition) needs times of the order to reach thermalization for each normal mode.
This comparison clearly shows that the effective thermalization observed in large systems has little to do with the mathematical concept of ergodicity, and it is instead related to the large number of components concurring to define the global observales that are usually taken into account (in our case, the large number of normal modes that define the momentum of a single particle). When these components cease to be in phase, the predictions of statistical mechanics start to be effective; this can be observed even in integrable systems, without need for the mathematical notion of ergodicity to hold.
In other words, we believe that the present work give further evidence of the idea (which had been substantiated mathematically by Khinchin, Mazur and van der Linden) that the most relevant ingredient of statistical mechanics is the large number of degrees of freedom, and the global nature of the observables that are typically taken into account.
Acknowledgements
RM is supported by #NEXTGENERATIONEU (NGEU) and funded by the Ministry of University and Research (MUR), National Recovery and Resilience Plan (NRRP), project MNESYS (PE0000006) ”A Multiscale integrated approach to the study of the nervous system in health and disease” (DN. 1553 11.10.2022).
References
- Khinchin [1949] A. I. Khinchin, Mathematical foundations of statistical mechanics (Courier Corporation, 1949).
- Huang [2009] K. Huang, Introduction to statistical physics (CRC press, 2009).
- Peliti and Epstein [2003] L. Peliti and M. Epstein, Statistical Mechanics in a Nutshell (Princeton University Press, 2003).
- Ehrenfest and Ehrenfest [1990] P. Ehrenfest and T. Ehrenfest, The conceptual foundations of the statistical approach in mechanics (Courier Corporation, 1990).
- Di Castro and Raimondi [2015] C. Di Castro and R. Raimondi, Statistical mechanics and applications in condensed matter (Cambridge University Press, 2015).
- Ma [1985] S.-K. Ma, Statistical mechanics (World Scientific Publishing Company, 1985).
- Caracciolo et al. [2021] S. Caracciolo, R. Fabbricatore, M. Gherardi, R. Marino, G. Parisi, and G. Sicuro, Criticality and conformality in the random dimer model, Phys. Rev. E 103, 042127 (2021).
- Anderson et al. [1995] M. H. Anderson, J. R. Ensher, M. R. Matthews, C. E. Wieman, and E. A. Cornell, Observation of Bose-Einstein condensation in a dilute atomic vapor, Science 269, 198 (1995).
- Ramsey [1956] N. F. Ramsey, Thermodynamics and statistical mechanics at negative absolute temperatures, Physical Review 103, 20 (1956).
- Braun et al. [2013] S. Braun, J. P. Ronzheimer, M. Schreiber, S. S. Hodgman, T. Rom, I. Bloch, and U. Schneider, Negative absolute temperature for motional degrees of freedom, Science 339, 52 (2013).
- Baldovin et al. [2021a] M. Baldovin, S. Iubini, R. Livi, and A. Vulpiani, Statistical mechanics of systems with negative temperature, Phys. Rep. 923, 1 (2021a).
- Lebowitz and Penrose [1973] J. L. Lebowitz and O. Penrose, Modern ergodic theory, Physics Today 26, 23 (1973).
- Arnold [2009] V. I. Arnold, Proof of a theorem of A N Kolmogorov on the invariance of quasi-periodic motions under small perturbations of the Hamiltonian, Collected Works: Representations of Functions, Celestial Mechanics and KAM Theory, 1957–1965 , 267 (2009).
- Arnold [2013] V. I. Arnold, Mathematical methods of classical mechanics, Vol. 60 (Springer Science & Business Media, 2013).
- Arnold et al. [1988] V. Arnold, V. Kozlov, and A. Neishtadt, Dynamical Systems III, Encyclopaedia of Mathematical Sciences, Vol. 3 (1988).
- Livi et al. [1987] R. Livi, M. Pettini, S. Ruffo, and A. Vulpiani, Chaotic behavior in nonlinear Hamiltonian systems and equilibrium statistical mechanics, Journal of statistical physics 48, 539 (1987).
- Baldovin et al. [2021b] M. Baldovin, A. Vulpiani, and G. Gradenigo, Statistical mechanics of an integrable system, Journal of Statistical Physics 183, 41 (2021b).
- Cocciaglia et al. [2022] N. Cocciaglia, A. Vulpiani, and G. Gradenigo, Thermalization without chaos in harmonic systems, Physica A: Statistical Mechanics and its Applications 601, 127581 (2022).
- Schrödinger [1944] E. Schrödinger, The statistical law in nature, Nature 153, 704 (1944).
- Darrigol [2018] O. Darrigol, Atoms, Mechanics, and Probability: Ludwig Boltzmann’s Statistico-Mechanical Writings - An Exegesis (Oxford University Press, 2018).
- Boltzmann [2022] L. Boltzmann, Lectures on gas theory (Univ of California Press, 2022).
- Boltzmann [2012] L. Boltzmann, Theoretical physics and philosophical problems: Selected writings, Vol. 5 (Springer Science & Business Media, 2012).
- Castiglione et al. [2008] P. Castiglione, M. Falcioni, A. Lesne, and A. Vulpiani, Chaos and Coarse Graining in Statistical Mechanics (Cambridge University Press, 2008).
- Prigogine [1978] I. Prigogine, Time, structure, and fluctuations, Science 201, 777 (1978).
- Prigogine [2017] I. Prigogine, Non-equilibrium statistical mechanics (Courier Dover Publications, 2017).
- Prigogine [1996] I. Prigogine, The laws of chaos, Review (Fernand Braudel Center) , 1 (1996).
- Lando and Flach [2023] G. M. Lando and S. Flach, Thermalization slowing-down in multidimensional Josephson junction networks (2023), arXiv:2306.14398 [cond-mat.stat-mech] .
- Jaynes [1982] E. T. Jaynes, On the rationale of maximum-entropy methods, Proceedings of the IEEE 70, 939 (1982).
- Uffink [1995] J. Uffink, Can the maximum entropy principle be explained as a consistency requirement?, Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics 26, 223 (1995).
- Jaynes [1957] E. T. Jaynes, Information theory and statistical mechanics, Physical review 106, 620 (1957).
- Jaynes [2003] E. T. Jaynes, Probability theory: The logic of science (Cambridge university press, 2003).
- Halmos [2017] P. R. Halmos, Lectures on ergodic theory (Courier Dover Publications, 2017).
- Van der Linden and Mazur [1967] J. Van der Linden and P. Mazur, On the asymptotic problem of statistical thermodynamics for a real system ii. equivalence of canonical and pressure ensemble, Physica 36, 491 (1967).
- Mazur and Van der Linden [1963] P. Mazur and J. Van der Linden, Asymptotic form of the structure function for real systems, Journal of Mathematical Physics 4, 271 (1963).
- Casetti et al. [1997] L. Casetti, M. Cerruti-Sola, M. Pettini, and E. Cohen, The fermi-pasta-ulam problem revisited: Stochasticity thresholds in nonlinear hamiltonian systems, Physical Review E 55, 6566 (1997).
- Chakraborty and Mallik [2001] G. Chakraborty and A. Mallik, Dynamics of a weakly non-linear periodic chain, International Journal of Non-Linear Mechanics 36, 375 (2001).
- Arnold et al. [1978] L. Arnold, W. Horsthemke, and R. Lefever, White and coloured external noise and transition phenomena in nonlinear systems, Zeitschrift für Physik B Condensed Matter 29, 367 (1978).
- Arnold and Lefever [2012] L. Arnold and R. Lefever, Stochastic Nonlinear Systems in Physics, Chemistry, and Biology: Proceedings of the Workshop Bielefeld, Fed. Rep. of Germany, October 5–11, 1980, Vol. 8 (Springer Science & Business Media, 2012).
- Goldstein et al. [2002] H. Goldstein, C. Poole, and J. Safko, Classical mechanics (2002).
- Kac [1959] M. Kac, Statistical Independence in Probability, Analysis, and Number Theory, Carus Mathematical Monographs (Mathematical Association of America, 1959).
- Ford et al. [1965] G. Ford, M. Kac, and P. Mazur, Statistical mechanics of assemblies of coupled oscillators, Journal of Mathematical Physics 6, 504 (1965).
- Mazur and Montroll [1960] P. Mazur and E. Montroll, Poincaré cycles, ergodicity, and irreversibility in assemblies of coupled harmonic oscillators, Journal of mathematical physics 1, 70 (1960).
- Gardiner [1985] C. W. Gardiner, Handbook of stochastic methods (Springer, 1985).
- Marino and Aurell [2016] R. Marino and E. Aurell, Advective-diffusive motion on large scales from small-scale dynamics with an internal symmetry, Phys. Rev. E 93, 062147 (2016).