This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\tocauthor

Naoki Tani, Satoru S. Kano and Yasunari Zempo 11institutetext: Hosei University, Computer and Information Sciences, Tokyo 182-8584, Japan,
11email: [email protected],
https://cis.hosei.ac.jp/

Singular Spectrum Analysis of Time-series Data from Time-dependent density-functional theory in Real-time

Naoki Tani    Satoru S. Kano    Yasunari Zempo
Abstract

This paper introduces a spectral analysis of time-series data derived from real-time time-dependent density-functional theory (TDDFT) using Singular Spectrum Analysis (SSA). TDDFT is a robust method for obtaining molecular excited states and optical spectra by tracking the time evolution of dynamical dipole moments. However, the spectral resolution can be compromised when Fourier transform’s total time duration is insufficient. SSA enabled the extraction of specific oscillation components from the time-series data, facilitating the generation of higher-precision spectra. Even with relatively short time-series datasets, the predictive extension of SSA yielded high-resolution spectra, demonstrating substantial agreement with results obtained through conventional methods. The efficacy of this approach was validated for several small molecules, including ethylene, benzene, and others. SSA’s ability to conduct detailed spectral analysis in specific energy regions enhance spectral resolution and facilitates the clarification of oscillation components within these regions. Real-time TDDFT combined with SSA provides a new analytical method for analyzing the optical properties of molecules, significantly improving the accuracy of the analysis of emission and absorption spectra analysis. This method is expected to have various applications.

keywords:
Spectral Analysis, TDDFT Time-Series, Singular Spectrum Analysis

1 Introduction

Time-dependent density-functional theory (TDDFT) [1] is a powerful method for calculating the electronic structure and excited states of materials, and plays an important role in materials development and optical property analysis. In particular, real-time, real-space based TDDFT is widely used, due to its simplicity and intuitive operability [2].

We employed real-time TDDFT to analyze the optical properties of organic light-emitting diodes and other devices [3]. This approach is easy to parallelize, enabling stable and efficient calculations. However, the accuracy and resolution of the calculated results are influenced by the length of the time evolution data. For example, when applying Fourier Transform (FT) to time-dependent dynamic dipole moment data to obtain absorption and emission spectra, insufficient total time can lead to reduced spectral resolution, making features like band-edge peaks indistinct. Since TDDFT’s are based on first-principles calculations, they demand significant computational resources. Inadequate computing resources or data may result in broadened spectra, complicating the accurate evaluation of optical properties. To solve this problem, we developed a new method for extracting useful information from short time-step data [4, 5, 6]

This study proposes the application of Singular Spectrum Analysis (SSA) to real-time TDDFT results to extract essential oscillational components from short time-series data. By using SSA, fundamental oscillational components that contribute to the spectrum can be separated, allowing for a clearer analysis of band-edge peaks. Furthermore, by adding the predicted data to the insufficient time-series data based on the separated oscillation components and extending the effective total time, it is possible to achieve higher resolution spectral analysis.

2 Methods

2.1 Time-dependent density-functional theory

In this section, we outline the real-time TDDFT calculation procedure. The total energy of the ground state, based on density-functional theory (DFT) with the local density approximation, is derived from the Kohn–Sham (KS) equation. While DFT is inadequate for accurately describing optical responses and absorption spectra involving electronically excited states, this limitation was addressed by Runge and Gross through the introduction of time-dependent evolution of the DFT equation [1]. The equations of motion of TDDFT coupled with pseudopotentials can be written as follows:

itψj(𝐫,t)=Hψj(𝐫,t)H=122+Vionps(𝐫)+VH(𝐫,t)+VXC[ρ(𝐫,t)]+Vext(𝐫,t),\displaystyle\begin{array}[]{c}i\frac{\partial}{{\partial t}}\psi_{j}({\bf r},t){\ }=H\psi_{j}({\bf r},t)\\[5.69054pt] H=-\frac{1}{2}\nabla^{2}+V^{ps}_{ion}({\bf r})+V_{H}({\bf r},t)+V_{XC}[\rho({\bf r},t)]+V_{ext}({\bf r},t),\end{array} (3)

where ψj\psi_{j} represents the jj th wave function. Hamiltonian HH is the KS Hamiltonian HKSH_{KS} to which a perturbation Vext(𝐫,t)V_{ext}({\bf r},t) is added, i.e., H=HKS+Vext(𝐫,t)H=H_{KS}+V_{ext}({\bf r},t). VionpsV_{ion}^{ps} represents the ionic pseudopotential, VHV_{H} is the Hartree potential, and VXCV_{XC} is the exchange-correlation potential. The Hartree potential and the exchange-correlation potential are expressed by the electronic charge density ρ(𝐫,t)=j|ψj(𝐫,t)|2\rho({\bf r},t)=\sum_{j}|\psi_{j}({\bf r},t)|^{2}. The summation is performed over all occupied states jj, and the Hartree potential is determined by 2VH=4πρ\nabla^{2}V_{H}=-4\pi\rho. For simplicity, all calculations in this study are performed using the atomic unit system.

Prior to the calculation of the optical response, we first determine the stationary state using conventional DFT to optimize the electronic structure. Next, we added the external potential VextV_{ext} as a perturbation to the system and tracked the linear response of the system in real-time. In our calculations, we employ the real-time, real-space approach to solve Eq.(3) using the finite difference method. This approach is advantageous due to its suitability for parallel computation. In this study, a uniform grid is used for simplicity.

The external electric field Vext(t)V_{ext}(t) is applied as a perturbation in the form Vext(t)=E(t)ξV_{ext}(t)=E(t)\xi in the ξ\xi-direction at t=0t=0. When a very weak electric field E(t)E(t) is applied, a dipole moment μξ(t)\mu_{\xi}(t) is generated as the system’s response. The polarizability, αξ(ω)\alpha_{\xi}(\omega), which characterizes the linear response, is expressed as follows. Here, ξ\xi represents the x,y,zx,y,z directions.

𝑑teiωtμξ(t)=α(ω)𝑑teiωtE(t).\int dt{\ }e^{i\omega t}{\ }\mu_{\xi}(t)=\alpha(\omega)\int dt{\ }e^{i\omega t}{\ }E(t). (4)

When an external stimulus in the form of a delta function, Vext(t)=kξδ(t)V_{ext}(t)=-k\xi\delta(t), is applied, the polarizability αξ(ω)\alpha_{\xi}(\omega) can be directly obtained from FT of the dipole moment μξ(t)\mu_{\xi}(t)

αξ=1k𝑑teiωtμξ(t),\alpha_{\xi}=\frac{1}{k}\int dt{\ }e^{i\omega t}{\ }\mu_{\xi}(t), (5)

where kk represents the strength of the external perturbation in the ξ\xi direction. Furthermore, the imaginary part of the polarizability is used to compute the total oscillator strength S(ω)S(\omega)

S(ω)=2ωπImα(ω),S(\omega)=\frac{{2\omega}}{\pi}{\mathop{\rm Im}\nolimits}{\ }\alpha(\omega), (6)

where α=(αx+αy+αz)/3\alpha=(\alpha_{x}+\alpha_{y}+\alpha_{z})/3 is the average polarizability. From Eq.(3), the time-dependent wave function ψj(𝐫,t)\psi_{j}({\bf r},t) is expressed as follows:

ψj(𝐫,t)=eiHtψj(𝐫,0).\psi_{j}({\bf r},t)=e^{-iHt}\psi_{j}({\bf r},0). (7)

The initial wave function ψ~j{\tilde{\psi}}_{j} for this perturbation at t=0t=0 is obtained from the following equation:

ψ~|t=0=exp[i0+0𝑑t(HKSkξδ(t))]ψj(𝐫,0).{\tilde{\psi}}{\big{|}}_{\small t=0}=\exp{{\bigl{[}}-i\int_{-0}^{+0}dt{\ }(H_{KS}-k\xi\delta(t)){\bigr{]}}}{\ }\psi_{j}({\bf r},0). (8)

It becomes a very simple expression:

ψ~|t=0=eikξψj(𝐫,0).{\tilde{\psi}}{\big{|}}_{\small t=0}=e^{ik\xi}\psi_{j}({\bf r},0). (9)

The time evolution of the dipole moment, μξ(t)\mu_{\xi}(t), is numerically computed up to a total time TT using discrete time steps of Δt\Delta t, where T=NΔtT=N\Delta t. The spectral resolution of S(ω)S(\omega), obtained from Eq.(6), is approximately O(1/T)O(1/T). Therefore, insufficient data points reduce spectral resolution, requiring a sufficiently large TT for a clear spectrum. To address this limitation, we propose applying SSA to the time-series μξ(t)\mu_{\xi}(t) to enable clear spectral analysis even with limited data.

2.2 Singular spectrum analysis

The procedure for applying SSA to the time-series data μξ(t)\mu_{\xi}(t) is described below. SSA is a method for decomposing time-series data and extracting components based on eigenvalues. A trajectory matrix is created from the time-series data, followed by Singular Value Decomposition (SVD). The resulting components were then reconstructed to identify the fundamental oscillation components in the data, which were analyzed using FT [7, 8].

From the time-series data F=(f1,f2,,fN)F=(f_{1},f_{2},\dots,f_{N}), a trajectory matrix YY of size τ×n\tau\times n is created using window width τ(1<τ<N/2)\tau(1<\tau<N/2), where n=Nτ+1n=N-\tau+1. This matrix is skew-symmetric and captures the correlation in the original time-series data:

Y=(f1f2fnf2f3fn+1fτfτ+1fN).\displaystyle Y=\left(\begin{array}[]{cccc}f_{1}&f_{2}&\cdots&f_{n}\\ f_{2}&f_{3}&\cdots&f_{n+1}\\ \vdots&\vdots&}{\hfil&\vdots\\ f_{\tau}&f_{\tau+1}&\cdots&f_{N}\\ \end{array}\right). (14)

The column vectors of YY are expressed as 𝐲(1),𝐲(2),,𝐲(n){\bf y}^{(1)},{\bf y}^{(2)},\dots,{\bf y}^{(n)}. SVD is applied to this matrix, decomposing it into Y=UΣVTY=U\Sigma V^{T}, where UU is a τ×τ\tau\times\tau matrix with column vectors (𝐮1,,𝐮τ)T({\bf u}_{1},\dots,{\bf u}_{\tau})^{T}. Σ\Sigma is a diagonal matrix of size τ×n\tau\times n, and each diagonal component is given by (σ1,,στ)(\sigma_{1},\dots,\sigma_{\tau}). VTV^{T} is also a n×nn\times n orthogonal matrix with row vectors (𝐯1,,𝐯τ)T({\bf v}_{1},\dots,{\bf v}_{\tau})^{T}, respectively. To remove high-frequency noise, low-rank approximations are used in SVD. Based on the singular values, YY is decomposed into its cor-responding components YiY_{i}:

YUrΣrVrT=UrVrT=i=1rYi,Y\approx U_{r}\Sigma_{r}V_{r}^{T}={U^{\prime}}_{r}V_{r}^{T}=\sum_{i=1}^{r}Y_{i}, (15)

where rr is the adopted rank. For simplicity, the product of UrU_{r} and Σr\Sigma_{r} is defined as Ur{U^{\prime}}_{r}. The ii th component YiY_{i} can be expressed as

Yi=(𝐲i(1),𝐲i(2),,𝐲i(n)).Y_{i}=\left({\bf y}_{i}^{(1)},{\bf y}_{i}^{(2)},\dots,{\bf y}_{i}^{(n)}\right). (16)

In general, this matrix YiY_{i} is not skew-symmetric like the original matrix YY. Using the average as shown in Eq.(20), we reconstruct the following time-series data F~i=(f~i,1,f~i,2,,f~i,N){\tilde{F}}_{i}=({\tilde{f}}_{i,1},{\tilde{f}}_{i,2},\dots,{\tilde{f}}_{i,N}), which corresponds to each singular value σi\sigma_{i}:

f~i,s={1sj=1syi(j,sj+1)(1sτ)1τj=1τyi(j,sj+1)(τsn)1Ns+1j=1Ns+1yi(j+sn,nj+1)(nsN).\displaystyle{\tilde{f}}_{i,s}=\left\{\begin{array}[]{ccc}\frac{1}{s}\sum\limits_{j=1}^{s}y_{i{\tiny(j,s-j+1)}}&}{\hfil&(1\leq s\leq\tau)\\ \frac{1}{\tau}\sum\limits_{j=1}^{\tau}y_{i{\tiny(j,s-j+1)}}&}{\hfil&(\tau\leq s\leq n)\\ \frac{1}{N-s+1}\sum\limits_{j=1}^{N-s+1}y_{i{\tiny(j+s-n,n-j+1)}}&}{\hfil&(n\leq s\leq N).\\ \end{array}\right. (20)

Consequently, we can obtain a matrix Y~i{\tilde{Y}}_{i}

Y~i=(f~i,1f~i,2f~i,nf~i,2f~i,3f~i,n+1f~i,τf~i,τ+1f~i,N).\displaystyle{\tilde{Y}}_{i}=\left(\begin{array}[]{cccc}\tilde{f}_{i,1}&\tilde{f}_{i,2}&\cdots&\tilde{f}_{i,n}\\ \tilde{f}_{i,2}&\tilde{f}_{i,3}&\cdots&\tilde{f}_{i,n+1}\\ \vdots&\vdots&}{\hfil&\vdots\\ \tilde{f}_{i,\tau}&\tilde{f}_{i,\tau+1}&\cdots&\tilde{f}_{i,N}\\ \end{array}\right). (25)

The reconstructed matrix Y~i{\tilde{Y}}_{i} retains symmetry similar to that of the original trajectory matrix YY. Now we also place the column vector Y~i{\tilde{Y}}_{i} as 𝐲~i(1),𝐲~i(2),,𝐲~i(n){\tilde{\bf y}}_{i}^{(1)},{\tilde{\bf y}}_{i}^{(2)},\dots,{\tilde{\bf y}}_{i}^{(n)}. The original time-series can be approximated as FΣiF~iF\approx\Sigma_{i}{\tilde{F}}_{i}.

In some cases, multiple F~i\tilde{F}_{i} components may exhibit oscillations of similar frequency and amplitude. These components are grouped into the same cluster and treated as a single oscillation. To classify these components into clusters, the similarity of their oscillation behavior was evaluated using the W-correlation matrix

Wi,j=(F~i,F~j)F~iF~j.W_{i,j}=\frac{(\tilde{F}_{i},\tilde{F}_{j})}{\|{\tilde{F}}_{i}\|{\ }\|{\tilde{F}}_{j}\|}. (26)

2.3 Forecast of the time-series

This section explains the forecasting of the time-series data F~i\tilde{F}_{i} using SSA [9]. As mentioned above, since the time-series data F~i\tilde{F}_{i}, extracted as specific oscillation components, is significantly simpler than the original FF, it allows for stable forecasting. Forecasting F~i,N+1\tilde{F}_{i,N+1} using F~i=(f~i,1,f~i,2,,f~i,N)\tilde{F}_{i}=(\tilde{f}_{i,1},\tilde{f}_{i,2},\dots,\tilde{f}_{i,N}) involves adding a new column vector of Y~i\tilde{Y}_{i} as shown Eq.(25).

𝐲~i(n+1)=(f~i,n+1f~i,Nf~i,N+1).\displaystyle{\bf\tilde{y}}_{i}^{(n+1)}=\left(\begin{array}[]{c}{\tilde{f}}_{i,n+1}\\ \vdots\\ {\tilde{f}}_{i,N}\\ {\tilde{f}}_{i,N+1}\\ \end{array}\right). (31)

Since only f~i,N+1{\tilde{f}}_{i,N+1} is unknown, we use the first to the τ1\tau-1 of Y~i{\tilde{Y}}_{i}, and obtain the following linear combination of the components u~i,jk{\tilde{u}}_{i,jk} with coefficients hkh_{k} as follows:

f~i,n+1=\displaystyle\displaystyle{\tilde{f}}_{i,n+1}= 1n+1s=1n+1f~i,s\displaystyle\frac{1}{n+1}\sum\limits_{s=1}^{n+1}{\tilde{f}}_{i,s} +k=1rhku~1,k,\displaystyle+\sum\limits_{k=1}^{r}h_{k}{\tilde{u}}_{1,k}, (32)
\displaystyle\vdots
f~i,n+τ1=\displaystyle\displaystyle{\tilde{f}}_{i,n+\tau-1}= 1n+τ1s=τ1n+τ+1f~i,s\displaystyle\frac{1}{n+\tau-1}\sum\limits_{s=\tau-1}^{n+\tau+1}{\tilde{f}}_{i,s} +k=1rhku~τ1,k,\displaystyle+\sum\limits_{k=1}^{r}h_{k}{\tilde{u}}_{\tau-1,k},

where u~j,k(j=1,,τ1){\tilde{u}}_{j,k}~{}(j=1,\dots,\tau-1) are the components of Ur(i){U^{\prime}}_{r}^{(i)}, obtained from the decomposition as per Eq.(15). The first term in Eq.(32) is the average of each row fi,a(a=1,,τ1)f_{i,a}~{}(a=1,\dots,\tau-1) of Y~i{\tilde{Y}}_{i}, and the second term is the deviation of the component. The new point, f~i,N+1{\tilde{f}}_{i,N+1}, is obtained by solving Eq.(32) for hkh_{k} using the least-squares method. The forecast is made by

f~i,N+1=1ns=τNf~i,s+n+1nk=1rhku~τ,k.{\tilde{f}}_{i,N+1}=\frac{1}{n}\sum\limits_{s=\tau}^{N}{\tilde{f}}_{i,s}+\frac{n+1}{n}\sum\limits_{k=1}^{r}h_{k}{\tilde{u}}_{\tau,k}. (33)

By incorporating this value into Y~i{\tilde{Y}}_{i} as a new component to 𝐲~i(n+1){\bf\tilde{y}}_{i}^{(n+1)}, an updated Y~i{\tilde{Y}}_{i} is obtained, expanded by one column. SVD is then reapplied to the updated Y~i{\tilde{Y}}_{i}, allowing the operations described in Eq.(32) and (33) to be repeated iteratively, enabling forecasts to be extended to the desired number of points.

3 Results and discussion

3.1 Ethylene

Refer to caption

Figure 1: Dipole moment μ(t)\mu(t) and oscillator strength S(ω)S(\omega) for steps (a)(b) N=5000N=5000 and (c)(d) N=20000N=20000, respectively.

To evaluate the effectiveness of SSA, we applied it to the dipole moments obtained from TDDFT calculations for ethylene, a molecule with well-characterized electronic states. The initial external stimulus was applied along the C-C bond direction, resulting in a slowly varying dipole moment μ(t)\mu(t). Spectral analysis of μ(t)\mu(t) was performed using FT as described in Eq.(5), focusing on the band-edges associated with optical absorption. Figure 1 shows the dipole moments and the resulting spectrum S(ω)S(\omega) for two cases: (1) sufficient data (N20000N\simeq 20000) and (2) insufficient data (N5000N\simeq 5000), with a time stepΔt=0.002\Delta t=0.002 [1/eV]. When sufficient time evolution is available, the band-edge spectra are clear, with distinct peaks around 7.57.5 eV. In contrast, with insufficient data points (N5000N\simeq 5000), the spectrum becomes weak and broad, making it challenging to discern whether it is a single peak or composed of multiple oscillations. Identifying peaks associated with the band-edges also becomes difficult. We then analyzed the spectra using the shorter time-evolved time-series F=(μ1,μ2,,μ5000)F=(\mu_{1},\mu_{2},\dots,\mu_{5000}) up to N=5000N=5000 steps. It is necessary to select the appropriate oscillation components related to the band-edge among several obtained by decomposing the time-series FF. To identify the major oscillation components in FF, a trajectory matrix YY is constructed with the window width τ=1000\tau=1000 according to Eq.(14). The results are obtained in the order of the largest singular values. Using Y1,Y2,,YrY_{1},Y_{2},\dots,Y_{r} according to Eq.(16), and by averaging according to Eq.(20), each time-series data F~1,F~2,{\tilde{F}}_{1},{\tilde{F}}_{2},\dots corresponding to each singular value is obtained.

Refer to caption

Figure 2: Oscillations of the dipole moment μ(t)\mu(t) for ethylene were decomposed and reconstructed into individual signal components using SSA with a bandwidth τ=1000\tau=1000.

Figure 2 shows the first six oscillations. Pairs such as (F~1{\tilde{F}}_{1},F~2{\tilde{F}}_{2}), (F~3{\tilde{F}}_{3},F~4{\tilde{F}}_{4}), and (F~5{\tilde{F}}_{5},F~6{\tilde{F}}_{6}) exhibit similar frequencies and amplitudes, indicating they are related oscillations. Each pair was clustered and denoted as (F~i,F~j)({\tilde{F}}_{i},{\tilde{F}}_{j}). For these reconstructed oscillations, we counted the wavenumbers, excluding the first and the last approximately 100100 steps to avoid boundary effects. The spectra derived from this wavenumber analysis showed that the main oscillations correspond to peaks around 7.57.5 eV for (F~1,F~2)({\tilde{F}}_{1},{\tilde{F}}_{2}), 11.811.8 eV for (F~3,F~4)({\tilde{F}}_{3},{\tilde{F}}_{4}) and 18.418.4 eV for (F~5,F~6)({\tilde{F}}_{5},{\tilde{F}}_{6}). Comparing these results with Fig.1(b) confirms that these peaks are included.

The peaks near the corresponding energies are notable. In particular, (F~1,F~2)({\tilde{F}}_{1},{\tilde{F}}_{2}) represent the lowest energy peaks, which are associated with the band-edge. These oscillations exhibit minimal beating and large amplitudes, indicating that they are relatively simple signals. In contrast, (F~3,F~4)({\tilde{F}}_{3},{\tilde{F}}_{4}) and (F~5,F~6)({\tilde{F}}_{5},{\tilde{F}}_{6}) show noticeable beats and are mixed with adjacent oscillations. This indicates that SSA does not always achieve a perfect decomposition into single-frequency components.

Figure 3 shows the correlation matrix WijW_{ij} for F~1F~10{\tilde{F}}_{1}-{\tilde{F}}_{10}, calculated using Eq.(26). The diagonal components are, as expected, equal to 11, as they represent self-correlation. If different time-series are completely separated, their off-diagonal correlations vanish. However, Fig.(3) shows high correlations within the pairs (F~1,F~2)({\tilde{F}}_{1},{\tilde{F}}_{2}),(F~3,F~4)({\tilde{F}}_{3},{\tilde{F}}_{4}), and (F~5,F~6)({\tilde{F}}_{5},{\tilde{F}}_{6}), confirming their relationship. The effectiveness of this decomposition depends on the window width τ\tau, which must be optimized for proper separation. In SSA, the outputs are ordered from the highest singular value to the lowest, which means that the main oscillations may appear in the high-energy region, depending on the direction of the dipole moment oscillation. To analyze the band-edges effectively, it is useful to rearrange the decomposed and reconstructed time-series data in ascending order of wavenumber, renaming them as F~1,F~2,{\tilde{F}}_{1},{\tilde{F}}_{2},\dots, for clarity.

Refer to caption
Figure 3: Correlation matrix on the decomposed and reconstructed time-series data {F~1,,F~10}\{{\tilde{F}}_{1},\dots,{\tilde{F}}_{10}\} of ethylene dipole moment.

Figure 4 demonstrates the analysis and forecasting of oscillations using the method described in Section 2.3. In Fig.4(a), the time-series data F=(μ1,μ2,,μ5000)F=(\mu_{1},\mu_{2},\dots,\mu_{5000}) is shown for the dynamic dipole moment μ(t)\mu(t) up to N=5000N=5000 steps. Fig.4(b) displays the time-series data F~=F~1+F~2{\tilde{F}}={\tilde{F}}_{1}+{\tilde{F}}_{2}, and the forecasted oscillation data using the prediction method from Section 2.3. In the figure, μ~(t){\tilde{\mu}}(t) represents F~=(μ~1,μ~2,,μ~5000){\tilde{F}}=({\tilde{\mu}}_{1},{\tilde{\mu}}_{2},\dots,{\tilde{\mu}}_{5000}). For this F~{\tilde{F}}(blue), the prediction is repeated up to N=20000N=20000 steps, and the result is shown in orange. The reconstructed oscillation remains almost simple and stable over a long period of time. Figure.4(c) shows the spectrum obtained by applying FT directly to the original time-series FF from Fig.4(a). Since time-series up to N=5000N=5000 steps are used, the total time TT is not sufficient, and the spectrum is very broad with low resolution. Focusing on the low-energy peak, the spectrum in the energy region from 66 eV to 99 eV is shown in Fig.4(d), with each peak normalized to the magnitude obtained from the results at N=2000N=2000 steps (orange). The spectra compare the FT results for F~{\tilde{F}} using data from N=5000N=5000 steps (blue), N=10000N=10000 steps (green), and N=20000N=20000 steps (orange). It is evident that the signal becomes stronger and sharper as the number of steps increases.

Refer to caption

Figure 4: Dynamic dipole moment and spectrum of ethylene. (a) the base μ(t)\mu(t) and (c) its spectrum, (b) the extraction of the band-edge component by SSA and its spectral prediction μ(t)\mu(t), and (d) its spectral improvement, normalized by the maximum of the peak obtained from N=20000N=20000. Dynamic dipole moment and spectrum of ethylene. (a) the base μ(t)\mu(t) and (c) its spectrum, (b) the extraction of the band-edge component by SSA and its spectral predictionμ(t)\mu(t), and (d) its spectral improvement, normalized by the maximum of the peak obtained from N=20000N=20000.

3.2 Small molecules (benzene / naphthalene / anthracene / tetracene)

In this section, we confirm the effect of the SSA forecast by comparing our results with those of TDDFT up to the same number of steps, using small molecules. For molecules with simple electronic structures, such as benzene, naphthalene, anthracene, and tetracene, we examined the effect of forecasting on the spectra near the band-edge, as in Section 3.1 It is well-known that the energy of the peak positions decreases as the molecular size increases.

Figure 5 compares spectra derived from the dynamic dipole moments calculated by TDDFT with those extended and forecasted using SSA. Fig.5(a) shows the spectra near the band-edge, based on the dynamic dipole moment μ(t)\mu(t) obtained by TDDFT up to the step number (N=2000N=2000). The results reveal a shift of the band-edge to lower energy as the number of benzene rings and atomic size increase. However, the spectrum in this energy region appears ambiguous, influenced by other nearby peaks.

To clarify the band-edge spectrum, we applied SSA to extract μ~(t){\tilde{\mu}}(t) from each time-series data at N=2000N=2000 steps, isolating the oscillations associated with the band-edge. The extracted fundamental oscillation was then forecasted and extended to N=8000N=8000 steps, as shown in Fig.5(b), with a window width of τ=500\tau=500. Since SSA effectively isolates the oscillation components near the band-edge, the influence of other peaks is relatively reduced. On the other hand, Fig.5(c) shows spectra obtained from the dipole moments calculated up to the step number (N=8000N=8000) simply by the TDDFT calculation.

Comparing the peak shapes of these spectra, it is found that they are nearly identical and equivalent.

Refer to caption

Figure 5: Comparison of the calculated band-edge spectra for benzene (solid line), naphthalene (dashed line), anthracene (dash dot line), and tetracene (dotted line). (a) Spectrum obtained from the original μ(t)\mu(t) calculated using TDDFT up to N=2000N=2000 steps. (b) Spectrum derived from the extracted oscillation μ~(t){\tilde{\mu}}(t), associated with the band-edge, extended to N=8000N=8000 steps using SSA and forecasting. (c) Spectrum obtained from the time evolution of up to N=8000N=8000 steps simply using TDDFT with Δt=0.002\Delta t=0.002 [1/eV].

4 Conclusion

Real-time TDDFT is a powerful method for obtaining the excited states and optical spectra of molecules. Spectra across all energy regions can be derived by applying the FT to the dynamic dipole moment. However, this approach assumes that the total time of the time evolution is sufficiently long. When the total time is insufficient, the spectral resolution is reduced, leading to broad spectral features. Consequently, the conventional FT method struggles with broad and ambiguous spectral shapes when the number of data points is limited. In addition, the dynamical dipole moment contains multiple frequency components, complicating the analysis. Recognizing that the dynamical dipole moment is time-series data, we applied SSA to focus on specific oscillation components within the dipole moment. By decomposing the dynamical dipole moment into groups of simpler oscillations, we were able to isolate and extract oscillations associated with specific spectral peaks. This approach allowed us to identify low-energy oscillations, especially those at the band-edges related to emission and absorption, and to analyze the spectra of individual oscillation components.

This decomposition provides relatively simple oscillation components within specific energy regions. Leveraging these components, we can forecast and extend the oscillations, effectively increasing the total time available for FT. As a result, the spectral shapes became very clear, yielding sharp and high-resolution spectra for specific energy regions. Importantly, the spectra obtained from the forecasted and extended time-series showed excellent agreement with those obtained from real-time TDDFT calculations using sufficiently long time-series data. These results were validated through analysis of ethylene and small molecules such as benzene, naphthalene, anthracene, and tetracene. This demonstrates the utility of SSA in enhancing the spectral analysis of real-time TDDFT calculations, particularly for detailed investigations of specific energy regions.

Acknowledgment

This work was partially supported by the Takahashi Industrial and Economic Research Foundation, and Sumitomo Chemical Co., Ltd.

References

  • [1] Runge, E., Gross, E. K. U. : Density-functional theory for time-dependent system. Phys. Rev. Lett.  52(12), 997-1000 (1984).
  • [2] Yabana, K., Bertsch, G. F.: Time-dependent local-density approximation in real-time. Phys. Rev. 54(7), 4484-4487 (1996).
  • [3] Zempo, Y., Akino, N., Ishida, M., Ishitobi M., Kurita Y.: Optical properties in conjugated polymers. J. Phys.: Condens. Matter 20, 064231 (2008), and the references therein.
  • [4] Toogoshi, M., Kato, M., Kano, S. S., Zempo, Y.: Optical spectrum analysis of real-time TDDFT using the maximum entropy method. J. Phys.: Conf. Ser.  510, 012027 (2014).
  • [5] Toogoshi, M., Kano, S. S., Zempo, Y.: The maximum entropy method for optical spectrum analysis of real-time TDDFT. J. Phys.: Conf. Ser.  640, 012069 (2015).
  • [6] Toogoshi, M., Kano, S. S., Zempo, Y.: Improved maximum entropy method applied to real-time time-dependent density functional theory. J. Phys.: Conf. Ser.  905, 012006 (2017).
  • [7] Golyandina, N., and Zhigljavsky, A.: Singular spectrum analysis for time series. 2nd edn. Springer, Berlin, ch.2 (2013).
  • [8] Brunton, S. L., and Kutz, J. N.: Data-driven science and engineering: machine learning, dynamical systems and control. 2nd edn. Cambridge University Press, Cambridge, ch.1 (2022).
  • [9] Danilov, D. L.: Principal components in time series forecast. J. Comp. Graph. Stat.  6 (1), 112-121 (1997).