Naoki Tani, Satoru S. Kano and Yasunari Zempo
11institutetext: Hosei University, Computer and Information Sciences, Tokyo 182-8584, Japan,
11email: [email protected],
https://cis.hosei.ac.jp/
Singular Spectrum Analysis of Time-series Data from Time-dependent density-functional theory in Real-time
Abstract
This paper introduces a spectral analysis of time-series data derived from real-time time-dependent density-functional theory (TDDFT) using Singular Spectrum Analysis (SSA). TDDFT is a robust method for obtaining molecular excited states and optical spectra by tracking the time evolution of dynamical dipole moments. However, the spectral resolution can be compromised when Fourier transform’s total time duration is insufficient. SSA enabled the extraction of specific oscillation components from the time-series data, facilitating the generation of higher-precision spectra. Even with relatively short time-series datasets, the predictive extension of SSA yielded high-resolution spectra, demonstrating substantial agreement with results obtained through conventional methods. The efficacy of this approach was validated for several small molecules, including ethylene, benzene, and others. SSA’s ability to conduct detailed spectral analysis in specific energy regions enhance spectral resolution and facilitates the clarification of oscillation components within these regions. Real-time TDDFT combined with SSA provides a new analytical method for analyzing the optical properties of molecules, significantly improving the accuracy of the analysis of emission and absorption spectra analysis. This method is expected to have various applications.
keywords:
Spectral Analysis, TDDFT Time-Series, Singular Spectrum Analysis1 Introduction
Time-dependent density-functional theory (TDDFT) [1] is a powerful method for calculating the electronic structure and excited states of materials, and plays an important role in materials development and optical property analysis. In particular, real-time, real-space based TDDFT is widely used, due to its simplicity and intuitive operability [2].
We employed real-time TDDFT to analyze the optical properties of organic light-emitting diodes and other devices [3]. This approach is easy to parallelize, enabling stable and efficient calculations. However, the accuracy and resolution of the calculated results are influenced by the length of the time evolution data. For example, when applying Fourier Transform (FT) to time-dependent dynamic dipole moment data to obtain absorption and emission spectra, insufficient total time can lead to reduced spectral resolution, making features like band-edge peaks indistinct. Since TDDFT’s are based on first-principles calculations, they demand significant computational resources. Inadequate computing resources or data may result in broadened spectra, complicating the accurate evaluation of optical properties. To solve this problem, we developed a new method for extracting useful information from short time-step data [4, 5, 6]
This study proposes the application of Singular Spectrum Analysis (SSA) to real-time TDDFT results to extract essential oscillational components from short time-series data. By using SSA, fundamental oscillational components that contribute to the spectrum can be separated, allowing for a clearer analysis of band-edge peaks. Furthermore, by adding the predicted data to the insufficient time-series data based on the separated oscillation components and extending the effective total time, it is possible to achieve higher resolution spectral analysis.
2 Methods
2.1 Time-dependent density-functional theory
In this section, we outline the real-time TDDFT calculation procedure. The total energy of the ground state, based on density-functional theory (DFT) with the local density approximation, is derived from the Kohn–Sham (KS) equation. While DFT is inadequate for accurately describing optical responses and absorption spectra involving electronically excited states, this limitation was addressed by Runge and Gross through the introduction of time-dependent evolution of the DFT equation [1]. The equations of motion of TDDFT coupled with pseudopotentials can be written as follows:
(3) |
where represents the th wave function. Hamiltonian is the KS Hamiltonian to which a perturbation is added, i.e., . represents the ionic pseudopotential, is the Hartree potential, and is the exchange-correlation potential. The Hartree potential and the exchange-correlation potential are expressed by the electronic charge density . The summation is performed over all occupied states , and the Hartree potential is determined by . For simplicity, all calculations in this study are performed using the atomic unit system.
Prior to the calculation of the optical response, we first determine the stationary state using conventional DFT to optimize the electronic structure. Next, we added the external potential as a perturbation to the system and tracked the linear response of the system in real-time. In our calculations, we employ the real-time, real-space approach to solve Eq.(3) using the finite difference method. This approach is advantageous due to its suitability for parallel computation. In this study, a uniform grid is used for simplicity.
The external electric field is applied as a perturbation in the form in the -direction at . When a very weak electric field is applied, a dipole moment is generated as the system’s response. The polarizability, , which characterizes the linear response, is expressed as follows. Here, represents the directions.
(4) |
When an external stimulus in the form of a delta function, , is applied, the polarizability can be directly obtained from FT of the dipole moment
(5) |
where represents the strength of the external perturbation in the direction. Furthermore, the imaginary part of the polarizability is used to compute the total oscillator strength
(6) |
where is the average polarizability. From Eq.(3), the time-dependent wave function is expressed as follows:
(7) |
The initial wave function for this perturbation at is obtained from the following equation:
(8) |
It becomes a very simple expression:
(9) |
The time evolution of the dipole moment, , is numerically computed up to a total time using discrete time steps of , where . The spectral resolution of , obtained from Eq.(6), is approximately . Therefore, insufficient data points reduce spectral resolution, requiring a sufficiently large for a clear spectrum. To address this limitation, we propose applying SSA to the time-series to enable clear spectral analysis even with limited data.
2.2 Singular spectrum analysis
The procedure for applying SSA to the time-series data is described below. SSA is a method for decomposing time-series data and extracting components based on eigenvalues. A trajectory matrix is created from the time-series data, followed by Singular Value Decomposition (SVD). The resulting components were then reconstructed to identify the fundamental oscillation components in the data, which were analyzed using FT [7, 8].
From the time-series data , a trajectory matrix of size is created using window width , where . This matrix is skew-symmetric and captures the correlation in the original time-series data:
(14) |
The column vectors of are expressed as . SVD is applied to this matrix, decomposing it into , where is a matrix with column vectors . is a diagonal matrix of size , and each diagonal component is given by . is also a orthogonal matrix with row vectors , respectively. To remove high-frequency noise, low-rank approximations are used in SVD. Based on the singular values, is decomposed into its cor-responding components :
(15) |
where is the adopted rank. For simplicity, the product of and is defined as . The th component can be expressed as
(16) |
In general, this matrix is not skew-symmetric like the original matrix . Using the average as shown in Eq.(20), we reconstruct the following time-series data , which corresponds to each singular value :
(20) |
Consequently, we can obtain a matrix
(25) |
The reconstructed matrix retains symmetry similar to that of the original trajectory matrix . Now we also place the column vector as . The original time-series can be approximated as .
In some cases, multiple components may exhibit oscillations of similar frequency and amplitude. These components are grouped into the same cluster and treated as a single oscillation. To classify these components into clusters, the similarity of their oscillation behavior was evaluated using the W-correlation matrix
(26) |
2.3 Forecast of the time-series
This section explains the forecasting of the time-series data using SSA [9]. As mentioned above, since the time-series data , extracted as specific oscillation components, is significantly simpler than the original , it allows for stable forecasting. Forecasting using involves adding a new column vector of as shown Eq.(25).
(31) |
Since only is unknown, we use the first to the of , and obtain the following linear combination of the components with coefficients as follows:
(32) | |||||
where are the components of , obtained from the decomposition as per Eq.(15). The first term in Eq.(32) is the average of each row of , and the second term is the deviation of the component. The new point, , is obtained by solving Eq.(32) for using the least-squares method. The forecast is made by
(33) |
3 Results and discussion
3.1 Ethylene

To evaluate the effectiveness of SSA, we applied it to the dipole moments obtained from TDDFT calculations for ethylene, a molecule with well-characterized electronic states. The initial external stimulus was applied along the C-C bond direction, resulting in a slowly varying dipole moment . Spectral analysis of was performed using FT as described in Eq.(5), focusing on the band-edges associated with optical absorption. Figure 1 shows the dipole moments and the resulting spectrum for two cases: (1) sufficient data () and (2) insufficient data (), with a time step [1/eV]. When sufficient time evolution is available, the band-edge spectra are clear, with distinct peaks around eV. In contrast, with insufficient data points (), the spectrum becomes weak and broad, making it challenging to discern whether it is a single peak or composed of multiple oscillations. Identifying peaks associated with the band-edges also becomes difficult. We then analyzed the spectra using the shorter time-evolved time-series up to steps. It is necessary to select the appropriate oscillation components related to the band-edge among several obtained by decomposing the time-series . To identify the major oscillation components in , a trajectory matrix is constructed with the window width according to Eq.(14). The results are obtained in the order of the largest singular values. Using according to Eq.(16), and by averaging according to Eq.(20), each time-series data corresponding to each singular value is obtained.

Figure 2 shows the first six oscillations. Pairs such as (,), (,), and (,) exhibit similar frequencies and amplitudes, indicating they are related oscillations. Each pair was clustered and denoted as . For these reconstructed oscillations, we counted the wavenumbers, excluding the first and the last approximately steps to avoid boundary effects. The spectra derived from this wavenumber analysis showed that the main oscillations correspond to peaks around eV for , eV for and eV for . Comparing these results with Fig.1(b) confirms that these peaks are included.
The peaks near the corresponding energies are notable. In particular, represent the lowest energy peaks, which are associated with the band-edge. These oscillations exhibit minimal beating and large amplitudes, indicating that they are relatively simple signals. In contrast, and show noticeable beats and are mixed with adjacent oscillations. This indicates that SSA does not always achieve a perfect decomposition into single-frequency components.
Figure 3 shows the correlation matrix for , calculated using Eq.(26). The diagonal components are, as expected, equal to , as they represent self-correlation. If different time-series are completely separated, their off-diagonal correlations vanish. However, Fig.(3) shows high correlations within the pairs ,, and , confirming their relationship. The effectiveness of this decomposition depends on the window width , which must be optimized for proper separation. In SSA, the outputs are ordered from the highest singular value to the lowest, which means that the main oscillations may appear in the high-energy region, depending on the direction of the dipole moment oscillation. To analyze the band-edges effectively, it is useful to rearrange the decomposed and reconstructed time-series data in ascending order of wavenumber, renaming them as , for clarity.

Figure 4 demonstrates the analysis and forecasting of oscillations using the method described in Section 2.3. In Fig.4(a), the time-series data is shown for the dynamic dipole moment up to steps. Fig.4(b) displays the time-series data , and the forecasted oscillation data using the prediction method from Section 2.3. In the figure, represents . For this (blue), the prediction is repeated up to steps, and the result is shown in orange. The reconstructed oscillation remains almost simple and stable over a long period of time. Figure.4(c) shows the spectrum obtained by applying FT directly to the original time-series from Fig.4(a). Since time-series up to steps are used, the total time is not sufficient, and the spectrum is very broad with low resolution. Focusing on the low-energy peak, the spectrum in the energy region from eV to eV is shown in Fig.4(d), with each peak normalized to the magnitude obtained from the results at steps (orange). The spectra compare the FT results for using data from steps (blue), steps (green), and steps (orange). It is evident that the signal becomes stronger and sharper as the number of steps increases.

3.2 Small molecules (benzene / naphthalene / anthracene / tetracene)
In this section, we confirm the effect of the SSA forecast by comparing our results with those of TDDFT up to the same number of steps, using small molecules. For molecules with simple electronic structures, such as benzene, naphthalene, anthracene, and tetracene, we examined the effect of forecasting on the spectra near the band-edge, as in Section 3.1 It is well-known that the energy of the peak positions decreases as the molecular size increases.
Figure 5 compares spectra derived from the dynamic dipole moments calculated by TDDFT with those extended and forecasted using SSA. Fig.5(a) shows the spectra near the band-edge, based on the dynamic dipole moment obtained by TDDFT up to the step number (). The results reveal a shift of the band-edge to lower energy as the number of benzene rings and atomic size increase. However, the spectrum in this energy region appears ambiguous, influenced by other nearby peaks.
To clarify the band-edge spectrum, we applied SSA to extract from each time-series data at steps, isolating the oscillations associated with the band-edge. The extracted fundamental oscillation was then forecasted and extended to steps, as shown in Fig.5(b), with a window width of . Since SSA effectively isolates the oscillation components near the band-edge, the influence of other peaks is relatively reduced. On the other hand, Fig.5(c) shows spectra obtained from the dipole moments calculated up to the step number () simply by the TDDFT calculation.
Comparing the peak shapes of these spectra, it is found that they are nearly identical and equivalent.

4 Conclusion
Real-time TDDFT is a powerful method for obtaining the excited states and optical spectra of molecules. Spectra across all energy regions can be derived by applying the FT to the dynamic dipole moment. However, this approach assumes that the total time of the time evolution is sufficiently long. When the total time is insufficient, the spectral resolution is reduced, leading to broad spectral features. Consequently, the conventional FT method struggles with broad and ambiguous spectral shapes when the number of data points is limited. In addition, the dynamical dipole moment contains multiple frequency components, complicating the analysis. Recognizing that the dynamical dipole moment is time-series data, we applied SSA to focus on specific oscillation components within the dipole moment. By decomposing the dynamical dipole moment into groups of simpler oscillations, we were able to isolate and extract oscillations associated with specific spectral peaks. This approach allowed us to identify low-energy oscillations, especially those at the band-edges related to emission and absorption, and to analyze the spectra of individual oscillation components.
This decomposition provides relatively simple oscillation components within specific energy regions. Leveraging these components, we can forecast and extend the oscillations, effectively increasing the total time available for FT. As a result, the spectral shapes became very clear, yielding sharp and high-resolution spectra for specific energy regions. Importantly, the spectra obtained from the forecasted and extended time-series showed excellent agreement with those obtained from real-time TDDFT calculations using sufficiently long time-series data. These results were validated through analysis of ethylene and small molecules such as benzene, naphthalene, anthracene, and tetracene. This demonstrates the utility of SSA in enhancing the spectral analysis of real-time TDDFT calculations, particularly for detailed investigations of specific energy regions.
Acknowledgment
This work was partially supported by the Takahashi Industrial and Economic Research Foundation, and Sumitomo Chemical Co., Ltd.
References
- [1] Runge, E., Gross, E. K. U. : Density-functional theory for time-dependent system. Phys. Rev. Lett. 52(12), 997-1000 (1984).
- [2] Yabana, K., Bertsch, G. F.: Time-dependent local-density approximation in real-time. Phys. Rev. B 54(7), 4484-4487 (1996).
- [3] Zempo, Y., Akino, N., Ishida, M., Ishitobi M., Kurita Y.: Optical properties in conjugated polymers. J. Phys.: Condens. Matter 20, 064231 (2008), and the references therein.
- [4] Toogoshi, M., Kato, M., Kano, S. S., Zempo, Y.: Optical spectrum analysis of real-time TDDFT using the maximum entropy method. J. Phys.: Conf. Ser. 510, 012027 (2014).
- [5] Toogoshi, M., Kano, S. S., Zempo, Y.: The maximum entropy method for optical spectrum analysis of real-time TDDFT. J. Phys.: Conf. Ser. 640, 012069 (2015).
- [6] Toogoshi, M., Kano, S. S., Zempo, Y.: Improved maximum entropy method applied to real-time time-dependent density functional theory. J. Phys.: Conf. Ser. 905, 012006 (2017).
- [7] Golyandina, N., and Zhigljavsky, A.: Singular spectrum analysis for time series. 2nd edn. Springer, Berlin, ch.2 (2013).
- [8] Brunton, S. L., and Kutz, J. N.: Data-driven science and engineering: machine learning, dynamical systems and control. 2nd edn. Cambridge University Press, Cambridge, ch.1 (2022).
- [9] Danilov, D. L.: Principal components in time series forecast. J. Comp. Graph. Stat. 6 (1), 112-121 (1997).