This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Thermal Conductivity Calculation using Homogeneous Non-equilibrium Molecular Dynamics Simulation with Allegro

Kohei Shimamura [email protected]. Department of Physics, Kumamoto University Advanced Research Laboratory, Technology Infrastructure Center, Technology Platform, Sony Group Corporation Collaboratory for Advanced Computing and Simulations, University of Southern California Shinnosuke Hattori Ken-ichi Nomura Akihide Koura Fuyuki Shimojo
Abstract

In this study, we derive the heat flux formula for the Allegro model, one of machine-learning interatomic potentials using the equivariant deep neural network, to calculate lattice thermal conductivity using the homogeneous non-equilibrium molecular dynamics (HNEMD) method based on the Green-Kubo formula. Allegro can construct more advanced atomic descriptors than conventional ones, and can be applied to multicomponent and large-scale systems, providing a significant advantage in estimating the thermal conductivity of anharmonic materials, such as thermoelectric materials. In addition, the spectral heat current (SHC) method, recently developed for the HNEMD framework (HNEMD-SHC), allows the calculation of not only the total thermal conductivity but also its frequency components. The verification of the heat flux and the demonstration of HNEMD-SHC method are performed for the extremely anharmonic low-temperature phase of Ag2Se.

keywords:
Machine-learning interatomic potential (MLIP) , Allegro , Thermal conductivity , Homogeneous non-equilibrium molecular dynamics (HNEMD) , Spectral heat current (SHC) , First-principles molecular dynamics (FPMD)

1 Introduction

The computational framework based on the Green-Kubo (GK) formula for molecular dynamics (MD) simulations using machine-learning interatomic potential (MLIP), has attracted attention in estimating the lattice thermal conductivities of various materials [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]. The GK formula is as follows:

κ\displaystyle\kappa =\displaystyle= Ω3kBT20𝐉Q(t)𝐉Q(0)𝑑t,\displaystyle\frac{\varOmega}{3k_{\rm B}T^{2}}\int^{\infty}_{0}\langle{\bf{J}}_{Q}(t)\cdot{\bf{J}}_{Q}(0)\rangle dt, ( 1.1)

where kBk_{\rm B}, TT, Ω\varOmega, and 𝐉Q{\bf{J}}_{Q} are the Boltzmann constant, temperature, volume of the supercell, and heat flux, respectively. Two significant advantages of MLIP underpin this framework: low computational cost and high accuracy achieved through training on first-principles MD (FPMD) data. While this framework is employed to explore and investigate materials with low thermal conductivity such as thermoelectric materials, MLIPs that can precisely describe complex atomic motions with strong anharmonicity are required to estimate thermal conductivity more accurately. In particular, appropriate descriptors that can characterize the atomic motions should be designed.

The recently proposed Allegro model [11] is a useful MLIP candidate for thermal conductivity calculations. The descriptors used in Allegro, similar to many conventional descriptors such as the Behler-Parrinello symmetry functions [12], characterize the local atomic environment, however, with significant improvements. Allegro is based on atomic cluster expansion (ACE) [13], which allows the construction of descriptors that incorporate body-order (BO) expansion. Although most conventional descriptors consist of up to three-body orders (e.g., radial and angular parts) constrained by computational costs, four or more body orders can be incorporated. Allegro also can efficiently generate descriptors with higher-order rotational symmetry with the aid of rotational equivariance. Furthermore, although conventional descriptors require careful adjustment of the number of descriptors and functional forms of several hyperparameters, in Allegro the order of BO expansion and rotational symmetry can be controlled with NlayerN_{\rm layer} and lmaxl_{\rm max} parameters, respectively. Note that Allegro is constructed using a graph neural network (GNN) with NlayerN_{\rm layer} corresponding to the number of convolutional layers. The use of GNNs also solves the cumbersome problem of exponential increase in the number of conventional descriptors with an increase in the number of atomic species, leading to a high computational cost. This is an indispensable capability for studying thermoelectric candidates in multicomponent systems.

However, Allegro does not implement the appropriate atomic virial tensor for the heat flux 𝐉Q{\bf{J}}_{Q}:

𝐉Q\displaystyle{\bf{J}}_{Q} =\displaystyle= 1ΩiNatomti𝐯i+1ΩiNatomϵi𝐯i+1ΩiNatomWi𝐯i,\displaystyle\frac{1}{\varOmega}\sum^{N_{\rm atom}}_{i}t_{i}{\bf{v}}_{i}\ +\frac{1}{\varOmega}\sum^{N_{\rm atom}}_{i}\epsilon_{i}{\bf{v}}_{i}\ +\frac{1}{\varOmega}\sum^{N_{\rm atom}}_{i}{\rm W}_{i}{\bf{v}}_{i}, ( 1.2)

where tit_{i}, 𝐯i{\bf{v}}_{i}, ϵi\epsilon_{i}, and Wi{\rm W}_{i} are the atomic kinetic energy, velocity, potential energy, and virial tensor of the iith atom, respectively. NatomN_{\rm atom} denotes the number of atoms in the system. The atomic virial tensor plays an essential role, because its contribution accounts for the majority of the thermal conductivity [10, 14, 15]. In addition, recent studies [16, 17, 18] report that the atomic virial tensor can have a strong effect on the accuracy of the thermal conductivity especially when dealing with many-body potentials, including MLIPs. While an infinite number of atomic virial tensors can be defined in the many-body potential, an appropriate tensor must be used for the heat flux [10, 19]. Therefore, in this study, we derive the atomic virial tensor appropriate for the heat flux of Allegro and attempt to calculate the thermal conductivity based on the GK formula in Allegro. We used the extended version of Allegro proposed by Yu et al. [20]. The test system was the highly anharmonic, low-temperature phase of Ag2Se (β\beta-Ag2Se) [21], and we investigated the dependence of lmaxl_{\rm max} and NlayerN_{\rm layer} on the atomic structure and thermal conductivity.

The homogeneous non-equilibrium MD (HNEMD) method [22, 23, 24] based on the GK formula was employed. This method has the advantage of saving computational time compared to calculating the GK formula directly and can decompose the thermal conductivity into its partial contributions [25, 26]. In particular, the recently proposed scheme combined with the spectral heat current (SHC) method (hereafter referred to as HNEMD-SHC) [26] allows the analysis of thermal conductivity in frequency space, even for highly anharmonic materials. We used the HNEMD-SHC method along with the vibrational density of states (VDOS) [27], for the thermal conductivity analysis.

In Section 2, we explain the computational methods, such as the creation of training data for β\beta-Ag2Se, construction of Allegro models with NlayerN_{\rm layer} and lmaxl_{\rm max} parameters, formulas of the atomic virial tensors for Allegro, thermal conductivity calculation using the HNEMD method, and its decomposition in frequency space using the HNEMD-SHC method. In Section 3, we first discuss the accuracy of the Allegro models using not only root mean square errors in training, but also physical quantities obtained from the MD simulations, such as stress and radial distribution functions. Second, we discuss the accuracy of the total thermal conductivities, comparing them with experimental value of β\beta-Ag2Se, using the HNEMD-SHC method to explain the differences in total thermal conductivity between the Allegro models in frequency space. Finally, conclusions are presented in Section 4.

2 Computational Methods

2.1 Generation of β\beta-Ag2Se training data by FPMD simulation

All the FPMD and MD simulations using the Allegro models in this study were executed using the QXMD code [28]. Common to all the MD simulations is the time step Δt\Delta t that was set to 2.42 fs, performing at 300 K and 0 GPa. In addition, a system comprising 256 Ag and 128 Se atoms was used.

In the FPMD simulations, the electronic states were calculated using the projector augmented wave method [29, 30] within the framework of the density functional theory (DFT) [31, 32]. Projector functions were generated for the 4dd, 5ss, and 5pp states of Ag and for the 4ss, 4pp, and 4dd states of Se. The Perdew-Burke-Ernzerhof generalized gradient approximation [33] was used for the exchange correlation energy. To correctly represent the electronic states in the localized dd orbitals of Ag, the DFT+U method [34] with an effective parameter for the Coulomb interaction Ueff=6.0U_{\rm eff}=6.0 eV [35, 36] was used. An empirical correction of the van der Waals interactions using the DFT-D approach [37] was employed. The plane-wave cutoff energies were 20 and 200 Ry for the electronic pseudo-wave function and pseudo-charge density, respectively. The energy functional was minimized iteratively using a preconditioned conjugate-gradient method [38]. The Γ\Gamma point was used for Brillouin zone sampling.

The training data were generated for β\beta-Ag2Se at 300 K and 0 GPa by performing FPMD simulations. The unit cell of β\beta-Ag2Se has an orthorhombic structure consisting of four Ag2Se groups with lattice parameters: a=4.333a=4.333 Å, b=7.062b=7.062 Å, and c=7.764c=7.764 Å [21]. 32 unit cells were arranged to construct an atomic configuration similar to cubic structure containing 256 Ag and 128 Se atoms, as shown in Fig. S1 in Supplementary Material (SM). The lattice parameters of supercell were L1=17.332L_{1}=17.332 Å, L2=20.991L_{2}=20.991 Å, L3=20.991L_{3}=20.991 Å, α=84.58\alpha=84.58^{\circ}, β=90.00\beta=90.00^{\circ}, and γ=90.00\gamma=90.00^{\circ}. Using the configuration under periodic boundary conditions, an FPMD simulation with an isothermal and isobaric (NPTNPT) ensemble was performed in 4,100 steps. A total of 820 MD steps were used as training data by extracting every five steps. The IIth MD step data contained the atomic coordinates {𝐫I,i{\bf{r}}_{I,i}}, total potential energy EIE_{I}, atomic force {𝐅I,i{\bf{F}}_{I,i}}, virial tensor WI{\rm W}_{I}, and supercell volume ΩI\varOmega_{I}.

2.2 Allegro model

2.2.1 Setting of hyperparemters, NlayerN_{\rm layer} and lmaxl_{\rm max}

The Allegro model [11] can be constructed using NequIP [39] which introduces an E(3) symmetry equivariant neural network (NN) coded using PyTorch [40]. However, unlike NequIP, Allegro questions the need for the message passing scheme that incorporates atomic interactions beyond the cutoff radius RcR_{\rm c} and instead adopts the conventional method of restricting the interacting atoms to within RcR_{\rm c} from the central atom. The atomic potential energy of Allegro εiAllegro\varepsilon^{\rm Allegro}_{i} is calculated as the sum of the pairwise energies εijAllegro\varepsilon^{\rm Allegro}_{ij} defined between two atoms within RcR_{\rm c}:

εiAllegro=jεijAllegro,\displaystyle\varepsilon^{\rm Allegro}_{i}=\sum_{j}\varepsilon^{\rm Allegro}_{ij}, ( 2.1)

and the total potential energy is defined by the sum of εiAllegro\varepsilon^{\rm Allegro}_{i} over all atoms in the system,

EAllegro=iNatomεiAllegro.\displaystyle E^{\rm Allegro}=\sum^{N_{\rm atom}}_{i}\varepsilon^{\rm Allegro}_{i}. ( 2.2)

This definition of EAllegroE^{\rm Allegro} provides Allegro the advantage of parallel computation with space partitioning using the well-known Verlet neighbor list and the cell-index method required for large-scale MD simulations.

The motivation for developing Allegro was also to overcome the difficulties of ACE [13]. ACE can construct descriptors that incorporate BO expansion up to higher orders at a feasible computational cost. While most conventional descriptors are composed of up to three-body order of radial and angular parts, such as the Behler-Parrinello symmetry functions [12], ACE descriptors, which incorporates four-body order or more, have been employed [41]. Furthermore, ACE also efficiently generates descriptors with higher-order rotational symmetry characterized by irreducible representation ll, which paved the way for MLIP construction with descriptors incorporating rotational equivariance [42]. However, a drawback is that the number of descriptors increases exponentially as the number of atomic species increases. In addition, the number of different descriptors, characterized by BO or ll, to generate is determined by trial and error. ACE descriptors are often combined nonlinearly to improve accuracy [13], and the manner in which they are combined requires trial and error.

Allegro employs a GNN to distinguish atomic species by utilizing embedded vectors, which significantly reduce the computational cost resulting from an increase in atomic species. Furthermore, the descriptor generation process is combined with the layer structure of GNN to efficiently construct nonlinear descriptors. The number of convolution layers in GNN, which is controlled by hyperparameter NlayerN_{\rm layer}, is devised to correspond to the order of BO expansion. As the tensor computation of the spherical harmonics is performed in each convolution layer of the GNN, the BO order of the descriptor increases by one. For example, Allegro with NlayerN_{\rm layer} = 2 produces a nonlinear combination of descriptors of up to four body orders. In other words, an Allegro model with NlayerN_{\rm layer} has (NlayerN_{\rm layer} + 2) body-order descriptors. In addition, the diversity of descriptors with irreducible representation ll can be increased through the composition of spherical harmonics via Clebsch–Gordan coefficients during the tensor computation. The hyperparameter lmaxl_{\rm max} controls the maximum irreducible representation ll of the descriptors produced after composition. For lmaxl_{\rm max} = 0, only rotation-invariant descriptors are produced, whereas, for lmaxl_{\rm max} \geq 1, rotation-equivariant descriptors are also produced, which can improve feature extraction for the atomic local environment.

Therefore, because NlayerN_{\rm layer} and lmaxl_{\rm max} play important roles in Allegro, this study focuses on these two hyperparameters and evaluates their dependence on the thermal conductivity. Six models are created using combinations of NlayerN_{\rm layer} = {1,2} and lmaxl_{\rm max} = {0,1,2}, that is, (lmaxl_{\rm max},NlayerN_{\rm layer}) = (0,1), (0,2), (1,1), (1,2), (2,1), and (2,2) models.

Other hyperparameters related to the descriptors and architecture of GNN are the same for all Allegro models. The details are explained in Section 2 of SM.

2.2.2 Training of Allegro

The Adam optimizer [43], which is a stochastic gradient descent method, was adopted to train all the Allegro models. The default parameters of Adam were set to β1\beta_{1} = 0.9, β2\beta_{2} = 0.999, and ϵ\epsilon = 10-8 without weight decay. The total number of epochs was 1,000. All the models were trained with float64 precision by minimizing the following cost function CC, which comprises three loss functions associated with the total potential energy (first term), atomic force (second term), and total virial (third term):

C\displaystyle C =\displaystyle= pE21BIB(EIAllegroEIFPMDNatom)2\displaystyle\frac{p_{E}}{2}\frac{1}{B}\sum^{B}_{I}\left(\frac{E^{\rm Allegro}_{I}-E^{\rm FPMD}_{I}}{N_{{\rm atom}}}\right)^{2} ( 2.3)
+\displaystyle+ pF21BIB13NatomiNatom(𝐅I,iAllegro𝐅I,iFPMD)2\displaystyle\frac{p_{F}}{2}\frac{1}{B}\sum^{B}_{I}\frac{1}{3N_{{\rm atom}}}\sum^{N_{{\rm atom}}}_{i}\ \left({\bf{F}}^{\rm Allegro}_{I,i}-{\bf{F}}^{\rm FPMD}_{I,i}\right)^{2}
+\displaystyle+ pW21BIB16j6(WI,jAllegroWI,jFPMD)2,\displaystyle\frac{p_{W}}{2}\frac{1}{B}\sum^{B}_{I}\frac{1}{6}\sum^{6}_{j}\ \left({W}^{\rm Allegro}_{I,j}-{W}^{\rm FPMD}_{I,j}\right)^{2},

where BB denotes the batch size used for the Adam optimizer and was set to BB = 5 in this study. As these terms differ in dimension and size, pEp_{E}, pFp_{F}, and pWp_{W} were introduced as adjustable parameters; pEp_{E} = pFp_{F} = 1.0 and pWp_{W} = 0.0005 were used. Note that the training for WIAllegro{\rm W}^{\rm Allegro}_{I} is equivalent to that of the potential part of virial stress tensor calculated by dividing WIAllegro{\rm W}^{\rm Allegro}_{I} by the volume Ω\varOmega. The training was performed on 80% of the data mentioned in Section 2.1 with the remaining 20% of the data used for validation. Hereafter, the former and latter are referred to as the Train and Validation datasets, respectively. The training was performed using System C at the Institute for Solid State Physics, the University of Tokyo. The training wall times for the six Allegro models described in Section 2.2.1 are shown in Fig. S2(a) in SM.

There is a concern that the physical quantities calculated from the Allegro models depend on the initial weight parameters. Therefore, for statistical evaluation, five models with different initial weight parameters were constructed.

2.3 Thermal conductivity calculation by HNEMD method

2.3.1 Atomic virial tensors for Allegro

The heat flux 𝐉Q{\bf{J}}_{Q} in Eq. (1.2) requires an atomic virial tensor Wi{\rm W}_{i}. However, this is not present in the original Allegro model [11]. The derivation of an atomic virial tensor that is appropriate for the heat flux requires careful consideration of the nature of Allegro as a many-body potential. The extended code [20] proposed by Yu et al. is helpful for this purpose. They introduced relative coordinates {𝐫ij\{{\bf{r}}_{ij} (\equiv 𝐫j{\bf{r}}_{j} - 𝐫i{\bf{r}}_{i})} into the computational graph of the atomic potential energy εiAllegro\varepsilon^{\rm Allegro}_{i} in PyTorch. This allows the calculation of its derivatives εiAllegro𝐫ij\frac{\partial\varepsilon^{\rm Allegro}_{i}}{\partial{\bf{r}}_{ij}} by automatic differentiation, and Yu et al. defined the following type of atomic virial tensor Wisym{\rm W}^{\rm sym}_{i}:

Wisym=ji12[𝐫ijϵiAllegro𝐫ji+(𝐫ijϵiAllegro𝐫ji)T].\displaystyle{\rm W}^{\rm sym}_{i}=\sum_{j\neq i}\frac{1}{2}\left[{\bf{r}}_{ij}\otimes\frac{\partial\epsilon^{\rm Allegro}_{i}}{\partial{\bf{r}}_{ji}}+\left({\bf{r}}_{ij}\otimes\frac{\partial\epsilon^{\rm Allegro}_{i}}{\partial{\bf{r}}_{ji}}\right)^{\rm T}\right]. ( 2.4)

This definition is based on the objective of finding an atomic virial tensor that is symmetric as well as the total virial tensor. In practice, this can be expressed as follows:

Wisym=ji𝐫ijϵiAllegro𝐫ji.\displaystyle{\rm W}^{\rm sym}_{i}=\sum_{j\neq i}{\bf{r}}_{ij}\otimes\frac{\partial\epsilon^{\rm Allegro}_{i}}{\partial{\bf{r}}_{ji}}. ( 2.5)

However, Wisym{\rm W}^{\rm sym}_{i} would not be suitable for thermal conductivity calculations. According to the work by Fan et al. on Tersoff potential [16], the rigorous heat flux JQ for many-body potentials can be derived from the following equation [44].

𝐉Q\displaystyle{\bf{J}}_{Q} =\displaystyle= 1ΩddtiNatom(ti+εi)𝐫i\displaystyle\frac{1}{\varOmega}\frac{d}{dt}\sum^{N_{\rm atom}}_{i}\left(t_{i}+\varepsilon_{i}\right){\bf{r}}_{i} ( 2.6)
=\displaystyle= 1ΩiNatom(ti+εi)𝐯i+1ΩiNatom𝐫iddt(ti+εi).\displaystyle\frac{1}{\varOmega}\sum^{N_{\rm atom}}_{i}\left(t_{i}+\varepsilon_{i}\right){\bf{v}}_{i}+\frac{1}{\varOmega}\sum^{N_{\rm atom}}_{i}{\bf{r}}_{i}\frac{d}{dt}\left(t_{i}+\varepsilon_{i}\right).

The second term of Eq. (2.6), which coincides with the third term of Eq. (1.2), yields a rigorous formula for the atomic virial tensor. As the atomic potential energy of Allegro ϵiAllegro\epsilon^{\rm Allegro}_{i} is only a function of relative coordinates 𝐫ij{\bf{r}}_{ij} within a sphere of radius RcR_{\rm c}

ϵiAllegro=ϵiAllegro({𝐫ij}ji),\displaystyle\epsilon^{\rm Allegro}_{i}=\epsilon^{\rm Allegro}_{i}\left(\{{\bf{r}}_{ij}\}_{j\neq i}\right), ( 2.7)

the second term of Eq. (2.6) can be expressed as [16]:

1ΩiNatom𝐫iddt(ti+εiAllegro)=1ΩiNatomji(𝐫ijεjAllegro𝐫ji)𝐯i.\displaystyle\frac{1}{\varOmega}\sum^{N_{\rm atom}}_{i}{\bf{r}}_{i}\frac{d}{dt}\left(t_{i}+\varepsilon^{\rm Allegro}_{i}\right)=\frac{1}{\varOmega}\sum^{N_{\rm atom}}_{i}\sum_{j\neq i}\left({\bf{r}}_{ij}\otimes\frac{\partial\varepsilon^{\rm Allegro}_{j}}{\partial{\bf{r}}_{ji}}\right){\bf{v}}_{i}. ( 2.8)

Thus, the following atomic virial tensor Wiasym{\rm W}^{\rm asym}_{i} would be appropriate for the heat flux [10],

Wiasym=ji𝐫ijϵjAllegro𝐫ji,\displaystyle{\rm W}^{\rm asym}_{i}=\sum_{j\neq i}{\bf{r}}_{ij}\otimes\frac{\partial\epsilon^{\rm Allegro}_{j}}{\partial{\bf{r}}_{ji}}, ( 2.9)

which is an asymmetric tensor. Although the mathematical properties of the two atomic virial tensors are different, their summation over NatomN_{\rm atom} corresponds to the same total virial tensor WAllegro{\rm W}^{\rm Allegro}. To demonstrate that the derivation of Wiasym{\rm W}^{\rm asym}_{i} is indispensable, the thermal conductivities obtained using these two atomic virial tensors are compared later. The code of the Allegro model, extended to the output Wiasym{\rm W}^{\rm asym}_{i} and Wisym{\rm W}^{\rm sym}_{i}, is available in Ref. [45].

2.3.2 HNEMD method

The homogeneous non-equilibrium MD (HNEMD) method [22, 23, 24], which is based on the GK formula, successfully reduces the computational cost by enabling the calculation of the thermal conductivity κ\kappa in the xx direction using the time average of the heat flux 𝐉Q{\bf{J}}_{Q} instead of integrating the autocorrelation function in Eq. (1.1):

κ=ΩFextTlimtJQ,xt,\displaystyle\kappa=\frac{\varOmega}{F_{\rm ext}T}\lim_{t\to\infty}\langle J_{Q,x}\rangle_{t}, ( 2.10)

where FextF_{\rm ext} denotes the magnitude of the perturbation along the xx direction of the system. An appropriate FextF_{\rm ext} is determined from the linear response regime by evaluating the thermal conductivity as a function of FextF_{\rm ext}. Based on the evaluation results (Fig. S3 in SM), we selected FextF_{\rm ext} = 0.005 bohr-1 in this study.

2.3.3 HNEMD-SHC method

The SHC method, originally developed for the NEMD method [46], was recently extended to HNEMD by Fan et al. [26]. First, we define the cross-correlation function 𝐊(t){\bf{K}}(t) between atomic enthalpy hi{\rm h}_{i} and atomic velocity 𝐯i{\bf{v}}_{i}:

𝐊(t)\displaystyle{\bf{K}}(t) =\displaystyle= 1ΩiNatomhi(0)𝐯i(t).\displaystyle\frac{1}{\varOmega}\sum^{N_{\rm atom}}_{i}{\rm h}_{i}(0){\bf{v}}_{i}(t). ( 2.11)

hi{\rm h}_{i} is given by:

hi\displaystyle{\rm h}_{i} =\displaystyle= (ti+εi)I+Wi,\displaystyle(t_{i}+{\varepsilon}_{i}){\rm I}+{\rm W}_{i}, ( 2.12)

where I denotes the identity matrix. Note that Fan et al. constructed a correlation function using only Wi{\rm W}_{i} [26], whereas we used hi{\rm h}_{i} for the construction. When the Fourier transform of 𝐊(t){\bf{K}}(t) is expressed as

𝐊~(ω)\displaystyle{\tilde{\bf{K}}}(\omega) =\displaystyle= eiωt𝐊(t)𝑑t,\displaystyle\int^{\infty}_{-\infty}e^{i\omega t}{\bf{K}}(t)dt, ( 2.13)

𝐊(t){\bf{K}}(t) can also be written using the inverse Fourier transform as

𝐊(t)\displaystyle{\bf{K}}(t) =\displaystyle= 12πeiωt𝐊~(ω)𝑑ω\displaystyle\frac{1}{2\pi}\int^{\infty}_{-\infty}e^{-i\omega t}{\tilde{\bf{K}}}(\omega)d\omega ( 2.14)
=\displaystyle= 12π2Re[0eiωt𝐊~(ω)𝑑ω],\displaystyle\frac{1}{2\pi}2\ {\rm Re}\left[\int^{\infty}_{0}e^{-i\omega t}{\tilde{\bf{K}}}(\omega)d\omega\right],

where Re[cc] is the real part of the complex number cc. As 𝐊(t=0){\bf{K}}(t=0) of Eq. (2.11) corresponds to the heat flux formula in Eq. (1.2), the thermal conductivity κ\kappa can be expressed as follows via Eqs. (2.10) and (2.14):

κ\displaystyle\kappa =\displaystyle= ΩFextTKx(0)\displaystyle\frac{\varOmega}{F_{\rm ext}T}K_{x}(0) ( 2.15)
=\displaystyle= ΩFextT0dω2π2Re[K~x(ω)]\displaystyle\frac{\varOmega}{F_{\rm ext}T}\int^{\infty}_{0}\frac{d\omega}{2\pi}2\ {\rm Re}\left[\tilde{K}_{x}(\omega)\right]
=\displaystyle= 0dω2πκ~(ω),\displaystyle\int^{\infty}_{0}\frac{d\omega}{2\pi}\tilde{\kappa}(\omega),

where κ~(ω)2ΩFextTRe[K~x(ω)]\tilde{\kappa}(\omega)\equiv\frac{2\varOmega}{F_{\rm ext}T}{\rm Re}\left[\tilde{K}_{x}(\omega)\right]; Kx(0)K_{x}(0) and K~x(ω)\tilde{K}_{x}(\omega) denote the xx direction components of 𝐊(0){\bf{K}}(0) and 𝐊~(ω){\tilde{\bf{K}}}(\omega), respectively. Thus, using spectral thermal conductivity κ~(ω)\tilde{\kappa}(\omega), the total thermal conductivity κ\kappa can be decomposed in frequency space ω\omega. For convenience, the cumulative κcum(ω)\kappa^{\rm cum}(\omega) is defined as:

κcum(ω)\displaystyle\kappa^{\rm cum}(\omega) =\displaystyle= 0ωdω2πκ~(ω).\displaystyle\int^{\omega}_{0}\frac{d\omega^{\prime}}{2\pi}\tilde{\kappa}(\omega^{\prime}). ( 2.16)

2.3.4 HNEMD simulations with Allegro models

All HNEMD simulations were executed using the NVTNVT ensemble. The values of the cell vectors of the MD cell were averaged over those of the FPMDs with the NPTNPT ensemble that were used as the training data. For each HNEMD simulation, after relaxation for 100,000 steps, a 1,000,000-MD step simulation was performed to sample 𝐉Q{\bf{J}}_{Q} of Eq. (1.2) and calculate K(t) in Eq. (2.11). Due to its anisotropic structure, the thermal conductivity of β\beta-Ag2Se may depend on the direction. The thermal conductivity along the aa-axis of the unit cell (see Fig. S1 in SM) was calculated as an example. The thermal conductivities were calculated using Eqs. (2.10), (2.15), and (2.16). HNEMD simulations were performed using System C at the Institute for Solid State Physics, the University of Tokyo. The wall times of the simulations for the six Allegro models described in section 2.2.1 are shown in Fig. S2(a) in SM.

3 Results and Discussion

3.1 The accuracy of Allegro models

First, we present the training results of the six Allegro models described in Section 2.2.1 for β\beta-Ag2Se., i.e., (lmaxl_{\rm max},NlayerN_{\rm layer}) = (0,1), (0,2), (1,1), (1,2), (2,1), and (2,2) models. The root-mean-square errors (RMSEs) of the total potential energy (ΔE\Delta E), atomic force (ΔF\Delta F), and potential part of virial stress (ΔP\Delta P) are shown in Fig. 1. All models showed low RMSEs of the orders of 10-3 eV/atom for ΔE\Delta E, 10-2 eV/Å for ΔF\Delta F, and 10-2 GPa for ΔP\Delta P, which are comparable to the RMSEs of silver chalcogenides in our previous studies using other types of MLIP [10, 14, 47]. However, nontrivial decreases in ΔF\Delta F and ΔP\Delta P are observed between the (0,2) and (1,1) models in Fig. 1, which would cause discrepancies in some physical accuracy, such as atomic structure and stress, between these models. Hence, evaluating the accuracy of the atomic structure and stress through MD simulations is of interest.

Figure 2(a) shows the time evolution of the stress through HNEMD simulations for each model. The stresses of all models oscillate around the training target of 0 GPa. The time-averaged stresses for each Allegro model presented in Fig. 2(b) are sufficiently close to 0 GPa and display no clear correlation with ΔP\Delta P in Fig. 1(c). The differences observed in ΔP\Delta P between the models are considered to have practically no effect on the stress in the MD simulations. Incidentally, without virial training, the stress exhibits finite values, far from the target stress of 0 GPa (\sim6 GPa in the case shown in Fig. S4 in SM). This indicates that virial training is essential for accurately reproducing the stress values in the Allegro model for β\beta-Ag2Se. The accuracy of the virial stress would be important because it affects thermal conductivity via the atomic virial tensor [10].

The reproducibility of the atomic structure with FPMD data was evaluated using partial radial distribution functions gαβg_{\alpha\beta}(rr) for the Ag-Ag, Ag-Se, and Se-Se pairs. Figure 3(a) shows gαβg_{\alpha\beta}(rr) from FPMD and MD simulations using the Allegro models. We also quantified the correctness of gαβg_{\alpha\beta}(rr) using the following ΔG\Delta G from our previous studies [48, 49]:

ΔG\displaystyle\Delta G =\displaystyle= 0R[(ΔgAgAg(r))2+(ΔgAgSe(r))2+(ΔgSeSe(r))2]𝑑r\displaystyle\int_{0}^{R}\left[\left(\Delta g_{\rm AgAg}(r)\right)^{2}+\left(\Delta g_{\rm AgSe}(r)\right)^{2}+\left(\Delta g_{\rm SeSe}(r)\right)^{2}\right]dr ( 3.1)
=\displaystyle= ΔGAgAg+ΔGAgSe+ΔGSeSe,\displaystyle\Delta G_{\rm AgAg}+\Delta G_{\rm AgSe}+\Delta G_{\rm SeSe},

where Δgαβ\Delta g_{\alpha\beta}(rr) denotes the difference between gαβg_{\alpha\beta}(rr) obtained from the FPMD and MD simulations using the Allegro models. ΔGαβ\Delta G_{\alpha\beta} is calculated by integrating (Δgαβ(\Delta g_{\alpha\beta}(rr))2 from 0 to RR = 6.0 Å. The values of ΔG\Delta G and ΔGαβ\Delta G_{\alpha\beta} are presented in Fig. 3(b). ΔG\Delta G values of the (0,1) and (0,2) models are more than one order of magnitude larger than those of the other four models with lmaxl_{\rm max} = 1 and 2. This is because of the obvious deviation in gSeSeg_{\rm SeSe}(rr) of the (0,1) and (0,2) models from that of FPMD in the region from the first peak position at 4.2 Å to the subsequent 5.2 Å in Fig. 3(a). Since increasing NlayerN_{\rm layer} from 1 to 2 does not change gSeSeg_{\rm SeSe}(rr), increasing the BO of the descriptors did not lead to an essential improvement in reproducing the structure of Se. However, the remaining four models with lmaxl_{\rm max} = 1 and 2 match to such an extent that no recognizable difference from FPMD is observed in Fig. 3(a). ΔG\Delta G exhibits relatively lower values than the two models with lmaxl_{\rm max} = 0. This improvement indicates that the dimensionality of the rotationally symmetric descriptors in the (0,1) and (0,2) models was insufficient. Since the values of ΔG\Delta G for the (1,1), (1,2), (2,1), and (2,2) models are almost the same, they reproduce gSeSeg_{\rm SeSe}(rr) with similar accuracy. Considering the computational cost (Fig. S2(a) in SM), we conclude that the (1,1) model is the most reasonable among the four models in terms of structural properties.

3.2 Thermal conductivity

3.2.1 Effect of atomic virial tensor on thermal conductivity

Figure 4 shows the results of the thermal conductivities obtained from the HNEMD simulations using two types of atomic virial tensors Wisym{\rm W}^{\rm sym}_{i} and Wiasym{\rm W}^{\rm asym}_{i} defined in Section 2.3.1. As obtaining the reference value from FPMD is quite difficult because of the high computational cost, the thermal conductivities of the models were compared with the experimental value. The lattice thermal conductivity of β\beta-Ag2Se was experimentally estimated to be 0.3 ±\pm 0.1 Wm-1K-1 [50], which is quite low despite the crystalline phase. In terms of the results calculated using Wisym{\rm W}^{\rm sym}_{i} in Eq. (2.5), the four (0,1), (0,2), (1,1), and (1,2) models had almost the same thermal conductivities \sim0.45 Wm-1K-1, which were slightly larger than the experimental value. However, the thermal conductivities of the (2,1) and (2,2) models further increased by \sim0.54 and \sim0.64 Wm-1K-1, respectively, clearly deviating from the experimental value. By contrast, the thermal conductivities calculated using Wiasym{\rm W}^{\rm asym}_{i} in Eq. (2.9) are \sim0.35 Wm-1K-1 for the two models with lmaxl_{\rm max} = 0, and \sim0.40 Wm-1K-1 for the four models with lmaxl_{\rm max} = 1 and 2. These values fall within the range of experimental error. The fact that the former had a smaller thermal conductivity by \sim0.05 Wm-1K-1 would be attributed to the deviation of the structure from the FPMD simulation, however, the latter appeared to be converging, unlike those of Wisym{\rm W}^{\rm sym}_{i}. From these observations, we conclude that our derivation of Wiasym{\rm W}^{\rm asym}_{i} would be essential for the thermal conductivity calculations using Allegro.

3.2.2 Thermal conductivity analysis in frequency space

The increase in thermal conductivity by \sim0.05 Wm-1K-1 between the models with lmaxl_{\rm max} = 0 and lmaxl_{\rm max} = 1 and 2 (filled black circles in Fig. 4) would reflect the differences in the atomic structures, as shown in previous Section 3.1. Using the HNEMD-SHC method [26], we can delve deeper into the cause of such small differences in thermal conductivity. The differences in the atomic structures are reflected in the phonon distribution. The vibrational density of states (VDOSs) has often been employed in many previous studies to elucidate the affected frequency regions [27, 51, 52, 53, 54, 55]. The partial VDOSs of Ag and Se (VDOSAg and VDOSSe) were calculated using the velocity correlation function [27], as shown in Figs. 5(a) and 5(b). The profiles of VDOSs for the (0,1) and (0,2) models were similar. However, the VDOSs for the (1,1), (1,2), (2,1), and (2,2) models were indistinguishable from one another. Interestingly, a significant influence was observed not only on VDOSSe but also on VDOSAg, although the contributions of ΔGAgAg\Delta G_{\rm AgAg} and ΔGAgSe\Delta G_{\rm AgSe} to ΔG\Delta G were quite small, as shown in Fig. 3(b). The peak near 8 meV and shoulder near 4 meV in VDOSAg of the (0,1) model shift inward in the (1,1) model. In addition, the peak near 12 meV in the (0,1) model had disappeared. The peaks at approximately 5 and 18 meV for VDOSSe in the (0,1) model shift slightly outward in the (1,1) model. This may have caused the difference in total thermal conductivity; however, the contribution of each is unknown.

To this end, the HNEMD-SHC method is useful because the thermal conductivity can be decomposed in frequency space. Figures 5(c) and 5(d) show the spectral thermal conductivity κ~(ω)\tilde{\kappa}(\omega) in Eq. (2.15) and its cumulative κcum(ω)\kappa^{\rm cum}(\omega) in Eq. (2.16), respectively, for the (0,1) and (1,1) models. We found that κ~(ω)\tilde{\kappa}(\omega) from 15 to 20 meV in the (1,1) model was larger than that in the (0,1) model, which mainly contributes to the deviation in κ\kappa \sim0.05 Wm-1K-1 in Fig. 4. This can be understood more easily by κcum(ω)\kappa^{\rm cum}(\omega). Both models showed similar profiles from 0 to 15 meV. After 15 meV, a difference emerged, and the values converged with a difference of \sim 0.05 Wm-1K-1 after 20 meV. The region from 15 to 20 meV corresponds to the peak shift of VDOSSe in Fig. 5(b).

Considering these results, Allegro, which provides a high level description of atomic structures, is combined with the HNEMD-SHC method, which is expected to be even more useful for the thermal conductivity analysis of more complex systems, such as those with structural defects.

4 Conclusions

The atomic virial tensor appropriate for the heat flux of the Allegro model was derived to calculate thermal conductivity using the HNEMD method based on the GK formula. The derived atomic virial tensor works well for β\beta-Ag2Se as a test system, showing good agreement with the experimental value. Even for Ag2Se, with its extremely anharmonic atomic behavior, the relatively low descriptor levels of lmaxl_{\rm max} = 1 and NlayerN_{\rm layer} = 1 were sufficient. However, the system used in this study was limited to a binary system with no defects. By taking advantage of Allegro’s ability to easily construct sophisticated descriptors, it would be possible to easily and accurately determine the thermal conductivities of a wide variety of materials with multicomponent complex structures that include defects. In addition, using the recently developed HNEMD-SHC method, not only the total thermal conductivity but also the partial thermal conductivity of the frequency components can be obtained. This would play an important role in clarifying the factors that generate thermal conductivity, as demonstrated in this study. The effectiveness of the combination of Allegro with HNEMD and SHC methods will be further verified in the future.

CRediT authorship contribution statement

Kohei Shimamura: Methodology, Software, Formal analysis, Data curation, Writing - original draft, Visualization. Shinnosuke Hattori: Conceptualization, Methodology, Software, Writing - review & editing. Ken-ichi Nomura: Conceptualization, Methodology, Software, Writing - review & editing. Akihide Koura: Conceptualization, Methodology, Writing - review & editing, Supervision. Fuyuki Shimojo: Conceptualization, Methodology, Software, Resources, Writing - review & editing, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This study was supported by MEXT/JSPS KAKENHI Grant Numbers Nos. 22K03454 and 21H01766, and JST CREST Grant Number JPMJCR18I2, Japan. The authors thank the Supercomputer Center, the Institute for Solid State Physics, University of Tokyo for the use of the facilities.

Data Availability

An Allegro model is available at Ref. [45] that allows the output of atomic virial tensors in Wiasym{\rm W}^{\rm asym}_{i} and Wisym{\rm W}^{\rm sym}_{i}.

References

  • Liang et al. [2023] T. Liang, P. Ying, K. Xu, Z. Ye, C. Ling, Z. Fan, J. Xu, Mechanisms of temperature-dependent thermal transport in amorphous silica from machine-learning molecular dynamics, Phys. Rev. B 108 (2023) 184203. doi:10.1103/PhysRevB.108.184203.
  • Brorsson et al. [2022] J. Brorsson, A. Hashemi, Z. Fan, E. Fransson, F. Eriksson, T. Ala-Nissila, A. V. Krasheninnikov, H.-P. Komsa, P. Erhart, Efficient calculation of the lattice thermal conductivity by atomistic simulations with ab initio accuracy, Adv. Theory Simul. 5 (2022) 2100217. doi:https://doi.org/10.1002/adts.202100217.
  • Tisi et al. [2021] D. Tisi, L. Zhang, R. Bertossa, H. Wang, R. Car, S. Baroni, Heat transport in liquid water from first-principles and deep neural network simulations, Phys. Rev. B 104 (2021) 224202. doi:10.1103/PhysRevB.104.224202.
  • Fan et al. [2021] Z. Fan, Z. Zeng, C. Zhang, Y. Wang, K. Song, H. Dong, Y. Chen, T. Ala-Nissila, Neuroevolution machine learning potentials: Combining high accuracy and low cost in atomistic simulations and application to heat transport, Phys. Rev. B 104 (2021) 104309. doi:10.1103/PhysRevB.104.104309.
  • Verdi et al. [2021] C. Verdi, F. Karsai, P. Liu, R. Jinnouchi, G. Kresse, Thermal transport and phase transitions of zirconia by on-the-fly machine-learned interatomic potentials, Npj Comput. Mater. 7 (2021) 156. doi:10.1038/s41524-021-00630-5.
  • Han et al. [2021] L. Han, X. Chen, Q. Wang, Y. Chen, M. Xu, L. Wu, C. Chen, P. Lu, P. Guan, Neural network potential for studying the thermal conductivity of sn, Comput. Mater. Sci. 200 (2021) 110829. doi:10.1016/j.commatsci.2021.110829.
  • Li et al. [2020] R. Li, Z. Liu, A. Rohskopf, K. Gordiz, A. Henry, E. Lee, T. Luo, A deep neural network interatomic potential for studying thermal conductivity of β\beta-ga2o3, Appl. Phys. Lett. 117 (2020) 152102. doi:10.1063/5.0025051.
  • Huang et al. [2019] Y. Huang, J. Kang, W. A. Goddard, L.-W. Wang, Density functional theory based neural network force fields from energy decompositions, Phys. Rev. B 99 (2019) 064103. doi:10.1103/PhysRevB.99.064103.
  • Korotaev et al. [2019] P. Korotaev, I. Novoselov, A. Yanilkin, A. Shapeev, Accessing thermal conductivity of complex compounds by machine learning interatomic potentials, Phys. Rev. B 100 (2019) 144308. doi:10.1103/PhysRevB.100.144308.
  • Shimamura et al. [2021] K. Shimamura, Y. Takeshita, S. Fukushima, A. Koura, F. Shimojo, Estimating thermal conductivity of α\alpha-ag2se using ann potential with chebyshev descriptor, Chem. Phys. Lett. 778 (2021) 138748. doi:10.1016/j.cplett.2021.138748.
  • Musaelian et al. [2023] A. Musaelian, S. Batzner, A. Johansson, L. Sun, C. J. Owen, M. Kornbluth, B. Kozinsky, Learning local equivariant representations for large-scale atomistic dynamics, Nat. Commun. 14 (2023) 579. doi:10.1038/s41467-023-36329-y.
  • Behler and Parrinello [2007] J. Behler, M. Parrinello, Generalized neural-network representation of high-dimensional potential-energy surfaces, Phys. Rev. Lett. 98 (2007) 146401. doi:10.1103/PhysRevLett.98.146401.
  • Drautz [2019] R. Drautz, Atomic cluster expansion for accurate and transferable interatomic potentials, Phys. Rev. B 99 (2019) 014104. doi:10.1103/PhysRevB.99.014104.
  • Takeshita et al. [2022] Y. Takeshita, K. Shimamura, S. Fukushima, A. Koura, F. Shimojo, Thermal conductivity calculation based on green–kubo formula using ann potential for β\beta\mathchar 45\relaxag2se, J. Phys. Chem. Solids 163 (2022) 110580. doi:10.1016/j.jpcs.2022.110580.
  • de Andrade and Stassen [2004] J. de Andrade, H. Stassen, Molecular dynamics studies of thermal conductivity time correlation functions, J. Mol. Liq. 110 (2004) 169–176. doi:10.1016/j.molliq.2003.09.012.
  • Fan et al. [2015] Z. Fan, L. F. C. Pereira, H.-Q. Wang, J.-C. Zheng, D. Donadio, A. Harju, Force and heat current formulas for many-body potentials in molecular dynamics simulations with applications to thermal conductivity calculations, Phys. Rev. B 92 (2015) 094301. doi:10.1103/PhysRevB.92.094301.
  • Surblys et al. [2019] D. Surblys, H. Matsubara, G. Kikugawa, T. Ohara, Application of atomic stress to compute heat flux via molecular dynamics for systems with many-body interactions, Phys. Rev. E 99 (2019) 051301. doi:10.1103/PhysRevE.99.051301.
  • Boone et al. [2019] P. Boone, H. Babaei, C. E. Wilmer, Heat flux for many-body interactions: Corrections to lammps, J. Chem. Theory Comput. 15 (2019) 5579–5587. doi:10.1021/acs.jctc.9b00252.
  • Shimamura et al. [2020] K. Shimamura, Y. Takeshita, S. Fukushima, A. Koura, F. Shimojo, Computational and training requirements for interatomic potential based on artificial neural network for estimating low thermal conductivity of silver chalcogenides, J. Chem. Phys. 153 (2020) 234301. doi:10.1063/5.0027058.
  • Yu et al. [2023] H. Yu, Y. Zhong, L. Hong, C. Xu, W. Ren, X. Gong, H. Xiang, Spin-dependent graph neural network potential for magnetic materials, 2023. arXiv:2203.02853.
  • Wiegers [1971] G. A. Wiegers, The Crystal Structure of the Low-Temperature form of Silver Selenide, Am. Min. 56 (1971) 1882–1888.
  • Ciccotti et al. [1979] G. Ciccotti, G. Jacucci, I. R. McDonald, “Thought-experiments” by molecular dynamics, J. Stat. Phys. 21 (1979) 1–22. doi:10.1007/BF01011477.
  • Evans [1982] D. J. Evans, Homogeneous nemd algorithm for thermal conductivity–application of non-canonical linear response theory, Phys. Lett. A 91 (1982) 457 – 460. doi:10.1016/0375-9601(82)90748-4.
  • Yoshiya et al. [2004] M. Yoshiya, A. Harada, M. Takeuchi, K. Matsunaga, H. Matsubara, Perturbed molecular dynamics for calculating thermal conductivity of zirconia, Mol. Simulat. 30 (2004) 953–961. doi:10.1080/08927020410001709389.
  • Fujii et al. [2020] S. Fujii, T. Yokoi, C. Fisher, H. Moriwake, M. Yoshiya, Quantitative prediction of grain boundary thermal conductivities from local atomic environments, Nat. Commun. 11 (2020) 1854. doi:10.1038/s41467-020-15619-9.
  • Fan et al. [2019] Z. Fan, H. Dong, A. Harju, T. Ala-Nissila, Homogeneous nonequilibrium molecular dynamics method for heat transport and spectral decomposition with many-body potentials, Phys. Rev. B 99 (2019) 064308. doi:10.1103/PhysRevB.99.064308.
  • Loong et al. [1992] C.-K. Loong, P. Vashishta, R. K. Kalia, W. Jin, M. H. Degani, D. G. Hinks, D. L. Price, J. D. Jorgensen, B. Dabrowski, A. W. Mitchell, D. R. Richards, Y. Zheng, Phonon density of states and oxygen-isotope effect in ba1x{\mathrm{ba}}_{1\mathrm{-}\mathit{x}}kx{\mathrm{k}}_{\mathit{x}}bio3{\mathrm{bio}}_{3}, Phys. Rev. B 45 (1992) 8052–8064. doi:10.1103/PhysRevB.45.8052.
  • Shimojo et al. [2019] F. Shimojo, S. Fukushima, H. Kumazoe, M. Misawa, S. Ohmura, P. Rajak, K. Shimamura, L. Bassman, S. Tiwari, R. K. Kalia, A. Nakano, P. Vashishta, Qxmd: An open-source program for nonadiabatic quantum molecular dynamics, SoftwareX 10 (2019) 100307. doi:10.1016/j.softx.2019.100307.
  • Blöchl [1994] P. E. Blöchl, Projector augmented-wave method, Phys. Rev. B 50 (1994) 17953–17979. doi:10.1103/PhysRevB.50.17953.
  • Kresse and Joubert [1999] G. Kresse, D. Joubert, From ultrasoft pseudopotentials to the projector augmented-wave method, Phys. Rev. B 59 (1999) 1758–1775. doi:10.1103/PhysRevB.59.1758.
  • Hohenberg and Kohn [1964] P. Hohenberg, W. Kohn, Inhomogeneous electron gas, Phys. Rev. 136 (1964) B864. doi:10.1103/PhysRev.136.B864.
  • Kohn and Sham [1965] W. Kohn, L. J. Sham, Self-consistent equations including exchange and correlation effects, Phys. Rev. 140 (1965) A1133. doi:10.1103/PhysRev.140.A1133.
  • Perdew et al. [1996] J. P. Perdew, K. Burke, M. Ernzerhof, Generalized gradient approximation made simple, Phys. Rev. Lett. 77 (1996) 3865–3868. doi:10.1103/PhysRevLett.77.3865.
  • Anisimov et al. [1997] V. I. Anisimov, F. Aryasetiawan, A. I. Lichtenstein, First-principles calculations of the electronic structure and spectra of strongly correlated systems: theLDA++umethod, J. Phys. Condens. Matter 9 (1997) 767–808. doi:10.1088/0953-8984/9/4/002.
  • Fukushima et al. [2019] S. Fukushima, M. Misawa, A. Koura, F. Shimojo, Gga+u molecular dynamics study of structural and dynamic properties of superionic conductor ag2se, J. Phys. Soc. Jpn. 88 (2019) 115002. doi:10.7566/JPSJ.88.115002.
  • Santamaría-Pérez et al. [2012] D. Santamaría-Pérez, M. Marqués, R. Chuliá-Jordán, J. Menendez, O. Gomis, J. Ruiz-Fuertes, J. Sans, D. Errandonea, J. Recio, Compression of silver sulfide: X-ray diffraction measurements and total-energy calculations, Inorg. Chem. 51 (2012) 5289–5298. doi:10.1021/ic300236p.
  • Grimme [2006] S. Grimme, Semiempirical gga-type density functional constructed with a long-range dispersion correction, J. Comput. Chem. 27 (2006) 1787–1799. doi:10.1002/jcc.20495.
  • Shimojo et al. [2001] F. Shimojo, R. K. Kalia, A. Nakano, P. Vashishta, Linear-scaling density-functional-theory calculations of electronic structure based on real-space grids: design, analysis, and scalability test of parallel algorithms, Comput. Phys. Commun. 140 (2001) 303 – 314. doi:10.1016/S0010-4655(01)00247-8.
  • Batzner et al. [2022] S. Batzner, A. Musaelian, L. Sun, M. Geiger, J. P. Mailoa, M. Kornbluth, N. Molinari, T. E. Smidt, B. Kozinsky, E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials, Nat. Commun. 13 (2022) 2453. doi:10.1038/s41467-022-29939-5.
  • Paszke et al. [2019] A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. Köpf, E. Yang, Z. DeVito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, S. Chintala, PyTorch: an imperative style, high-performance deep learning library, Curran Associates Inc., Red Hook, NY, USA, 2019.
  • Araki et al. [2023] T. Araki, S. Hattori, T. Nishi, Y. Kudo, Atomic cluster expansion force field based thermal property material design with density functional theory level accuracy in non-equilibrium molecular dynamics calculations over sub-million atoms, 2023. arXiv:2309.11026.
  • Batatia et al. [2023] I. Batatia, D. P. Kovács, G. N. C. Simm, C. Ortner, G. Csányi, Mace: Higher order equivariant message passing neural networks for fast and accurate force fields, 2023. arXiv:2206.07697.
  • Kingma and Ba [2017] D. P. Kingma, J. Ba, Adam: A method for stochastic optimization, 2017. arXiv:1412.6980.
  • Helfand [1960] E. Helfand, Transport coefficients from dissipation in a canonical ensemble, Phys. Rev. 119 (1960) 1–9. doi:10.1103/PhysRev.119.1.
  • git [2024] https://github.com/koheishimamura/nequip_allegro_tc, 2024.
  • Zhou et al. [2015] Y. Zhou, X. Zhang, M. Hu, Quantitatively analyzing phonon spectral contribution of thermal conductivity based on nonequilibrium molecular dynamics simulations. i. from space fourier transform, Phys. Rev. B 92 (2015) 195204. doi:10.1103/PhysRevB.92.195204.
  • Shimamura et al. [2024] K. Shimamura, A. Koura, F. Shimojo, Construction of machine-learning interatomic potential under heat flux regularization and its application to power spectrum analysis for silver chalcogenides, Comput. Phys. Commun. 294 (2024) 108920. doi:https://doi.org/10.1016/j.cpc.2023.108920.
  • Irie et al. [2022] A. Irie, K. Shimamura, A. Koura, F. Shimojo, Importance of adjusting coefficients in cost function for construction of high-accuracy machine-learning interatomic potential, J. Phys. Soc. Jpn 91 (2022) 045002. doi:10.7566/JPSJ.91.045002.
  • Fukushima et al. [2023] S. Fukushima, K. Shimamura, A. Koura, F. Shimojo, Efficient training of the machine-learning interatomic potential based on an artificial neural network for estimating the helmholtz free energy of alkali metals, J. Phys. Soc. Jpn 92 (2023) 054005. doi:10.7566/JPSJ.92.054005.
  • Matsunaga et al. [2021] T. Matsunaga, K. Hirata, S. Singh, M. Matsunami, T. Takeuchi, A field effect heat flow switching device, Mater. Trans. 62 (2021) 16–19. doi:10.2320/matertrans.E-M2020844.
  • Bryk et al. [2020] T. Bryk, T. Demchuk, J.-F. Wax, N. Jakse, Pressure-induced effects in the spectra of collective excitations in pure liquid metals, J. Condens. Matter Phys. 32 (2020) 184002. doi:10.1088/1361-648X/ab6a31.
  • Shenogin et al. [2009] S. Shenogin, A. Bodapati, P. Keblinski, A. J. H. McGaughey, Predicting the thermal conductivity of inorganic and polymeric glasses: The role of anharmonicity, J. Appl. Phys. 105 (2009) 034906. doi:10.1063/1.3073954.
  • Varshney et al. [2009] V. Varshney, S. S. Patnaik, A. K. Roy, B. L. Farmer, Heat transport in epoxy networks: A molecular dynamics study, Polymer 50 (2009) 3378–3385. doi:10.1016/j.polymer.2009.05.027.
  • Minakov et al. [2017] D. Minakov, P. Levashov, V. Fokin, Vibrational spectrum and entropy in simulation of melting, Comput. Mater. Sci. 127 (2017) 42–47. doi:https://doi.org/10.1016/j.commatsci.2016.10.023.
  • Zhou et al. [2020] T. Zhou, Z. Li, Y. Cheng, Y. Ni, S. Volz, D. Donadio, S. Xiong, W. Zhang, X. Zhang, Thermal transport in amorphous small organic materials: a mechanistic study, Phys. Chem. Chem. Phys. 22 (2020) 3058–3065. doi:10.1039/C9CP05938E.

Figures

Refer to caption
Figure 1: Averaged root mean square errors and corresponding standard deviations (error bars) of (a) total potential energy (ΔE\Delta E), (b) atomic force (ΔF\Delta F), and (c) virial stress from the contribution of the potential energy (ΔP\Delta P) over the five Allegro models for each (lmaxl_{\rm max},NlayerN_{\rm layer}) = (0,1), (0,2), (1,1), (1,2), (2,1), and (2,2). Black and red circles represent results obtained for the Train and Validation datasets, respectively.
Refer to caption
Figure 2: (a) Stress profiles of the six Allegro models in the HNEMD simulation as a function of time. The results are shown for the Allegro models with the worst stress accuracy among the five models with different initial weight parameters. (b) The time-averaged stresses (the filled black circles) of the Allegro models with five different initial weight parameters for each (lmaxl_{\rm max},NlayerN_{\rm layer}) = (0,1), (0,2), (1,1), (1,2), (2,1), and (2,2). Their standard deviations are represented by error bars.
Refer to caption
Figure 3: (a) Partial radial distribution functions gαβg_{\alpha\beta}(rr) for Ag-Ag, Ag-Se, and Se-Se pairs obtained from FPMD (solid lines) and MD simulations using six Allegro models (dashed lines). gαβg_{\alpha\beta}(rr) from Allegro models are averaged over five models with different initial values of the weight parameters. (b) Averaged residuals ΔG\Delta G and their components (ΔGAgAg\Delta G_{\rm AgAg}, ΔGAgSe\Delta G_{\rm AgSe}, and ΔGSeSe\Delta G_{\rm SeSe}) over the five models with different initial weight parameters. Their standard deviations are represented by error bars. ΔG\Delta G is defined in Eq. (3.1).
Refer to caption
Figure 4: With two atomic virial tensors, Wiasym{\rm W}^{\rm asym}_{i} (black circles) and Wisym{\rm W}^{\rm sym}_{i} (red squares), the averaged thermal conductivities κ\kappa are calculated from HNEMD simulations with FextF_{\rm ext} = 0.005 bohr-1 using six Allegro models. The averaged and corresponding standard deviations (error bars) are calculated from models with five different initial weight parameters.
Refer to caption
Figure 5: Elemental vibrational density of states (VDOSs) for (a) Ag and (b) Se calculated from HNEMD simulations with the six Allegro models. (c) κ~(ω)\tilde{\kappa}(\omega) defined in Eq. (2.15) and (d) κcum(ω)\kappa^{\rm cum}(\omega) defined in Eq. (2.16) obtained by the HNEMD-SHC method using the (0,1) and (1,1) Allegro models. VDOSs, κ~(ω)\tilde{\kappa}(\omega), and κcum(ω)\kappa^{\rm cum}(\omega) are averaged over five Allegro models with different initial values of the weight parameters.

Supplementary Material for “Thermal Conductivity Calculation using Homogeneous Non-equilibrium Molecular Dynamics Simulation with Allegro”

1 Atomic configuration of β\beta-Ag2Se

Refer to caption
Figure S 1: Atomic configurations of the systems consisting of 256 Ag (pink) and 128 Se (yellow) atoms for β\beta-Ag2Se. The arrows aa, bb, and cc represent orthorhombic unit cell vectors.

2 Other hyperparameters related with descriptors and architecture of GNN

We employed the cutoff radius RcR_{\rm c} = 7 Å. A radial basis of eight trainable Bessel functions for the basis encoding with the polynomial envelope function using pp = 6 is employed. The parameter parity is set to o3_full. The 2-body latent multi-layer perceptron (MLP) consists of three hidden layers of dimensions [32, 64, 128], using SiLU nonlinearities [1]. The later latent MLPs consist of one hidden layer of dimension 128, also using SiLU nonlinearities. The final edge energy MLP has one hidden layer of dimension 32 with no nonlinearity.

3 Wall times of tranining and HNEMD simulation

As described in section 2.2.1 of the main text, we constructed six Allegro models with different NlayerN_{\rm layer} and lmaxl_{\rm max}, i.e, (lmaxl_{\rm max},NlayerN_{\rm layer}) = (0,1), (0,2), (1,1), (1,2), (2,1), and (2,2) models. For these models, training and HNEMD simulations were performed using System C at the Institute for Solid State Physic, the University of Tokyo. A single node GPU (NVIDIA A100) was used for training. The HNEMD simulations, on the other hand, were performed on our MD code QXMD [2] using 128 CPU cores. The wall times for the training and HNEMD calculations for the six models are shown in Fig. S2(a). The number of weight parameters for the six models is also shown in Fig. S2(b). The wall time (training + HNEMD simulation) strongly correlate with the number of weights of Allegro models.

Refer to caption
Figure S 2: (a)Wall times of training and HNEMD simulation and (b)the numbers of weights parameters of six Allegro models, i.e, (lmaxl_{\rm max},NlayerN_{\rm layer}) = (0,1), (0,2), (1,1), (1,2), (2,1), and (2,2) models. The wall times are averaged over five Allegro models with different initial values of the weight parameters.

4 Determination of appropriate perturbation value

Using the (lmaxl_{\rm max},NlayerN_{\rm layer}) = (1,1) model as a representative, we determined the appropriate value of perturbation FextF_{\rm ext} of HNEMD simulation, where the dependence of thermal conductivity κAllegro\kappa^{\rm Allegro} on FextF_{\rm ext} ranged from 0.001 to 0.1 bohr-1, as shown in Fig. S3. The averaged thermal conductivities and corresponding standard deviations obtained among the five Allegro models with defferent initial weight parameters are represented by filled circles and error bars, respectively. We adopted FextF_{\rm ext} = 0.005 bohr-1, which has the lowest error bar.

Refer to caption
Figure S 3: Averaged thermal conductivities calculated by HNEMD simulations using five Allegro models belonging to (lmaxl_{\rm max},NlayerN_{\rm layer}) = (1,1) with different initial weight parameters as a function of the magnitude of the perturbation FextF_{\rm ext}. Error bars are illustrated using standard deviations of the thermal conductivities.

5 Stress deviation seen in HNEMD simulation using Allegro model without virial training

The stress behavior in the HNEMD simulation for the (lmaxl_{\rm max},NlayerN_{\rm layer}) = (1,1) model without virial training (i.e., with pWp_{W} = 0.0 in the cost function) is shown in Fig. S4. The value clearly deviated from the target stress of training, 0 GPa. The time-averaged stress of the Allegro models with five different initial weight parameters is 5.43 ±0.15\pm 0.15 GPa.

Refer to caption
Figure S 4: Stress profile of the (1,1) Allegro model without virial training in the HNEMD simulation as a function of time. The result is shown for the Allegro model with the smallest RMSE of virial stress from the contribution of the potential energy among the five models with different initial weight parameters.

References

  • Elfwing et al. [2017] S. Elfwing, E. Uchibe, and K. Doya, (2017), arXiv:1702.03118.
  • Shimojo et al. [2019] F. Shimojo, S. Fukushima, H. Kumazoe, M. Misawa, S. Ohmura, P. Rajak, K. Shimamura, L. Bassman, S. Tiwari, R. K. Kalia, et al., SoftwareX 10, 100307 (2019).