This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Machine learning potentials for multicomponent systems: The Ti-Al binary system

Atsuto Seko [email protected] Department of Materials Science and Engineering, Kyoto University, Kyoto 606-8501, Japan
Abstract

Machine learning potentials (MLPs) are becoming powerful tools for performing accurate atomistic simulations and crystal structure optimizations. An approach to developing MLPs employs a systematic set of polynomial invariants including high-order ones to represent the neighboring atomic density. In this study, a formulation of the polynomial invariants is extended to the case of multicomponent systems. The extended formulation is more complex than the formulation for elemental systems. This study also shows its application to Ti-Al binary system. As a result, an MLP with the lowest error and MLPs with high computational cost performance are selected from the many MLPs developed systematically. The predictive powers of the developed MLPs for many properties, such as the formation energy, elastic constants, thermodynamic properties, and mechanical properties, are examined. The MLPs exhibit high predictive power for the properties in a wide variety of ordered structures. The present scheme should be systematically applicable to other multicomponent systems.

I Introduction

Machine learning potentials (MLPs) have been developed from extensive datasets generated by density functional theory (DFT) calculation, and can significantly improve the accuracy and transferability of interatomic potentials. Therefore, MLPs are becoming useful tools for performing crystal structure optimizations and accurate large-scale atomistic simulations, which are prohibitively expensive by DFT calculation. Over the last decade, a number of methods that can be used to develop MLPs and their applications have been reported Lorenz et al. (2004); Behler and Parrinello (2007); Bartók et al. (2010); Behler (2011); Han et al. (2018); Artrith and Urban (2016); Artrith et al. (2017); Szlachta et al. (2014); Bartók et al. (2018); Li et al. (2015); Glielmo et al. (2017); Seko et al. (2014, 2015); Takahashi et al. (2017); Thompson et al. (2015); Wood and Thompson (2018a); Chen et al. (2017); Shapeev (2016); Deringer et al. (2018); Podryabinkin et al. (2018); Gubaev et al. (2019); Mueller et al. (2020). In these studies, the contribution of an atom to potential energy is given as a function of quantities depending on its neighboring environment, called structural features. Also, several models are employed to describe a mapping from structural features to the atomic contribution, including artificial neural network models Lorenz et al. (2004); Behler and Parrinello (2007); Behler (2011); Han et al. (2018); Artrith and Urban (2016); Artrith et al. (2017), Gaussian process models Bartók et al. (2010); Szlachta et al. (2014); Bartók et al. (2018); Li et al. (2015); Glielmo et al. (2017), and linear models Seko et al. (2014, 2015); Takahashi et al. (2017); Thompson et al. (2015); Wood and Thompson (2018a); Chen et al. (2017); Shapeev (2016).

Structural features play an essential role in controlling the accuracy and computational efficiency of MLPs, which are conflicting properties in general Seko et al. (2019); Hernandez et al. (2019); Zuo et al. (2020). A systematic set of structural features is composed of polynomial invariants. The polynomial invariants include second- and third-order bond-orientational order parameters Steinhardt et al. (1983), angular Fourier series Bartók et al. (2013), the bispectrum Kondor (2008); Bartók et al. (2013), and moment tensors Shapeev (2016); Flusser et al. (2009), which have been adopted to develop MLPs and machine learning models of physical properties in compounds. Recently, a group-theoretical procedure for enumerating the polynomial invariants derived from spherical harmonics, including high-order ones, was proposed Seko et al. (2019). The angular Fourier series, bond-orientational order parameters, and bispectrum can be included in the enumeration by this procedure. MLPs developed with the polynomial invariants for a wide variety of elemental systems exhibit high predictive power for a wide range of structures. They are available in the Machine Learning Potential Repository Seko (2020); Mac . However, the polynomial invariants and related potential energy models must be generalized for developing MLPs in multicomponent systems.

In this study, polynomial invariants for developing MLPs for a multicomponent system are formulated. The present formulation of the polynomial invariants should be helpful in developing MLPs for multicomponent systems even within other frameworks. Polynomial models combined with the polynomial invariants are also introduced to describe the potential energy. This study also shows an application of the polynomial models to the development of Pareto optimal MLPs in Ti-Al binary alloy system. The predictive power of the Pareto optimal MLPs is examined for the cohesive energy, the formation energy, the elastic constants, the phonon density of states and dispersion curves, the thermal expansion, the energy profile along the Bain path, and the stacking fault properties.

Section II introduces potential energy models in multicomponent systems, including the polynomial invariants representing the neighboring atomic density and polynomial models for the potential energy. In Sec. III, datasets required to develop MLPs for the Ti-Al binary system are explained. Computational procedures for constructing datasets and estimating coefficients in the potential energy models are also shown. In Sec. IV, the development of Pareto optimal MLPs for the Ti-Al binary system is demonstrated. The predictive power of the MLPs for many properties, such as the formation energy, the elastic constants, the thermodynamic properties, and the mechanical properties, is also investigated. Finally, this paper is concluded in Sec. V.

Refer to caption
Figure 1: Schematic illustration of the neighboring atomic density around atom ii in a binary structure. Its decomposition into the neighboring atomic densities of elements A and B around atom ii is also shown.

II Potential energy models

This section shows an extension of polynomial models for the potential energy proposed to develop MLPs for elemental systems Seko et al. (2019). This section is composed of a general description of the potential energy that is useful for deriving new potential energy models for a multicomponent system, a systematic set of structural features representing the neighboring atomic densities, and polynomial models for the potential energy used in this study.

II.1 General description of potential energy

Given a cutoff radius rcr_{c}, the short-range part of the potential energy for a structure, EE, may be decomposed as

E=iE(i),E=\sum_{i}E^{(i)}, (1)

where E(i)E^{(i)} denotes the contribution of atom ii within cutoff radius rcr_{c}. The atomic contribution to the potential energy can be referred to as the atomic energy. The atomic energy is then assumed to be expressed in a functional form of the neighboring atomic densities. Figure 1 shows a schematic illustration of the neighboring atomic density around atom ii within cutoff radius rcr_{c} and its decomposition into the neighboring atomic densities of the elements. In a multicomponent system composed of elements {A,B,}\{\rm A,\rm B,\cdots\}, the atomic energy is written using functional \mathcal{F} dependent on the element of atom ii as

E(i)=si[ρ(si,A)(i),ρ(si,B)(i),],E^{(i)}=\mathcal{F}_{s_{i}}\left[\rho^{(i)}_{(s_{i},\rm A)},\rho^{(i)}_{(s_{i},\rm B)},\cdots\right], (2)

where ρ(si,s)(i)\rho^{(i)}_{(s_{i},s)} denotes the neighboring atomic density of element ss (s{A,B,}s\in\{{\rm A},{\rm B},\cdots\}) around atom ii of element sis_{i}.

Subsequently, the neighboring atomic density of element ss around atom ii is expanded in terms of a basis set, because the expansion enables the functional form to be replaced with a function of its expansion coefficients. For a given basis set {bn}\{b_{n}\}, the neighboring atomic densities can be expanded as

ρ(si,A)(i)(𝒓)\displaystyle\rho^{(i)}_{(s_{i},{\rm A})}(\bm{r}) =\displaystyle= nan,(si,A)(i)bn(𝒓)\displaystyle\sum_{n}a_{n,(s_{i},{\rm A})}^{(i)}b_{n}(\bm{r})
ρ(si,B)(i)(𝒓)\displaystyle\rho^{(i)}_{(s_{i},{\rm B})}(\bm{r}) =\displaystyle= nan,(si,B)(i)bn(𝒓)\displaystyle\sum_{n}a_{n,(s_{i},{\rm B})}^{(i)}b_{n}(\bm{r})
\displaystyle\vdots

where an,(si,s)(i)a_{n,(s_{i},s)}^{(i)} denotes an order parameter characterizing the neighboring atomic density of element ss around atom ii of element sis_{i}. Using the order parameters, the atomic energy may be rewritten as

E(i)=Fsi(a1,(si,A)(i),a1,(si,B)(i),,a2,(si,A)(i),a2,(si,B)(i),).E^{(i)}=F^{\prime}_{s_{i}}\left(a_{1,(s_{i},\rm A)}^{(i)},a_{1,(s_{i},\rm B)}^{(i)},\cdots,a_{2,(s_{i},\rm A)}^{(i)},a_{2,(s_{i},\rm B)}^{(i)},\cdots\right). (4)

Although function FF^{\prime} of Eqn. (4) depends on the element of atom ii, it is convenient to introduce a unified function that is independent of the element by combining functions FF^{\prime} for all elements. In this study, a unified function for the atomic energy is formulated using order parameters defined for unordered pairs of elements. This means that order parameter an,(si,sj)(i)a_{n,(s_{i},s_{j})}^{(i)} of atom ii and its swapped order parameter an,(sj,si)(j)a_{n,(s_{j},s_{i})}^{(j)} of atom jj are considered as the same variable in the unified function. They are represented by an,{si,sj}(i)a_{n,\{s_{i},s_{j}\}}^{(i)} and an,{si,sj}(j)a_{n,\{s_{i},s_{j}\}}^{(j)}, respectively. Defining order parameter an,{s1,s2}(i)a_{n,\{s_{1},s_{2}\}}^{(i)} to be zero if sis_{i} is not included in {s1,s2}\{s_{1},s_{2}\} (si{s1,s2}s_{i}\notin\{s_{1},s_{2}\}), the atomic energy is written in an independent form of the element of atom ii as

E(i)\displaystyle E^{(i)} =\displaystyle= F′′(a1,{A,A}(i),a1,{A,B}(i),a1,{B,B}(i),,\displaystyle F^{\prime\prime}\left(a_{1,\{\rm A,\rm A\}}^{(i)},a_{1,\{\rm A,\rm B\}}^{(i)},a_{1,\{\rm B,\rm B\}}^{(i)},\cdots,\right. (5)
a2,{A,A}(i),a2,{A,B}(i),a2,{B,B}(i),),\displaystyle\left.a_{2,\{\rm A,\rm A\}}^{(i)},a_{2,\{\rm A,\rm B\}}^{(i)},a_{2,\{\rm B,\rm B\}}^{(i)},\cdots\right),

where all combinations with the replacement of elements are enumerated for each nn.

Moreover, an arbitrary rotation leaves the atomic energy invariant, although it generally changes the neighboring atomic densities and their order parameters. Therefore, the atomic energy is required to be a function of O(3) invariants {dn(i)}\{d_{n^{\prime}}^{(i)}\} as

E(i)=F(d1(i),d2(i),),E^{(i)}=F\left(d_{1}^{(i)},d_{2}^{(i)},\cdots\right), (6)

where the invariants are derived from the order parameters {an,{s1,s2}(i)}\{a_{n,\{s_{1},s_{2}\}}^{(i)}\}. Hereafter, the invariants representing the neighboring atomic density are referred to as “structural features”. The present formulation is useful for deriving potential energy models, the accuracy and computational efficiency of which can be controlled by the selections of structural features and function FF.

II.2 Structural features

In this study, the neighboring atomic densities are expanded in terms of a basis set composed of radial functions or a basis set composed of products of radial functions and spherical harmonics Bartók et al. (2013); Seko et al. (2019), although it is also possible to use other basis sets in principle. When the neighboring atomic density is expanded in terms of radial functions {fn}\{f_{n}\}, the neighboring atomic density around atom ii is expressed as

ρ(si,s)(i)(r)=nan,{si,s}(i)fn(r),\rho^{(i)}_{(s_{i},s)}(r)=\sum_{n}a^{(i)}_{n,\{s_{i},s\}}f_{n}(r), (7)

where rr denotes the distance from the position of atom ii. Since order parameter an,{si,s}(i)a^{(i)}_{n,\{s_{i},s\}} is invariant for the O(3) group, it can be a pairwise structural feature denoted as

dn0,t(i)=an,t(i),d^{(i)}_{n0,t}=a^{(i)}_{n,t}, (8)

where tt identifies the unordered pair of elements, i.e., t{{A,A},{A,B},}t\in\{\{A,A\},\{A,B\},\cdots\}.

When the neighboring atomic density is expanded in terms of products of radial functions {fn}\{f_{n}\} and spherical harmonics {Ylm}\{Y_{lm}\}, the neighboring atomic density of element ss at a position (r,θ,ϕ)(r,\theta,\phi) in spherical coordinates centered at the position of atom ii is expressed as

ρ(si,s)(i)(r,θ,ϕ)=nlmanlm,{si,s}(i)fn(r)Ylm(θ,ϕ),\rho^{(i)}_{(s_{i},s)}(r,\theta,\phi)=\sum_{nlm}a^{(i)}_{nlm,\{s_{i},s\}}f_{n}(r)Y_{lm}(\theta,\phi), (9)

where order parameter anlm,{si,s}(i)a^{(i)}_{nlm,\{s_{i},s\}} is component nlmnlm of the neighboring atomic density. Since the order parameters are not generally invariant for the O(3) group, polynomial invariants of the order parameters are adopted as structural features. A ppth-order polynomial invariant for a radial index nn and a set of pairs composed of the angular number and the element unordered pair {(l1,t1),(l2,t2),,(lp,tp)}\{(l_{1},t_{1}),(l_{2},t_{2}),\cdots,(l_{p},t_{p})\} is defined as a linear combination of products of pp order parameters, expressed as

dnl1l2lp,t1t2tp,(σ)(i)=m1,m2,,mpcm1m2mpl1l2lp,(σ)anl1m1,t1(i)anl2m2,t2(i)anlpmp,tp(i).d_{nl_{1}l_{2}\cdots l_{p},t_{1}t_{2}\cdots t_{p},(\sigma)}^{(i)}=\sum_{m_{1},m_{2},\cdots,m_{p}}c^{l_{1}l_{2}\cdots l_{p},(\sigma)}_{m_{1}m_{2}\cdots m_{p}}a_{nl_{1}m_{1},t_{1}}^{(i)}a_{nl_{2}m_{2},t_{2}}^{(i)}\cdots a_{nl_{p}m_{p},t_{p}}^{(i)}. (10)

Nonzero polynomial invariants are derived from the sets satisfying the condition that the intersection of tpt_{p} is not the empty set,

ptp.\prod_{p}t_{p}\neq\varnothing. (11)

Therefore, possible sets of pairs composed of the angular number and the element unordered pair {(l1,t1),(l2,t2),,(lp,tp)}\{(l_{1},t_{1}),(l_{2},t_{2}),\cdots,(l_{p},t_{p})\} are enumerated for a given maximum angular number to obtain the entire set of polynomial invariants. An example where the intersection of element pairs becomes the empty set is the case that a polynomial invariant is composed of order parameters with t1={A,A}t_{1}={\rm\{A,A\}} and t2={B,B}t_{2}={\rm\{B,B\}}. Such polynomial invariants are eliminated.

A coefficient set {cm1m2mpl1l2lp,(σ)}\{c^{l_{1}l_{2}\cdots l_{p},(\sigma)}_{m_{1}m_{2}\cdots m_{p}}\} is independent of the radial index nn and the element unordered pair tt. Therefore, linearly independent coefficient sets are obtained using the group-theoretical projector operation method for a given set {l1,l2,,lp}\{l_{1},l_{2},\cdots,l_{p}\}, as proposed in Ref. Seko et al., 2019, ensuring that the linear combinations are invariant for arbitrary rotation. In terms of fourth- and higher-order polynomial invariants, multiple invariants are linearly independent for most of the set {l1,l2,,lp}\{l_{1},l_{2},\cdots,l_{p}\}, which are distinguished by index σ\sigma if necessary. Note that the second- and third-order invariants are equivalent to a multicomponent extension of the angular Fourier series and the bispectrum reported in the literature, respectively Kondor (2008); Bartók et al. (2013).

In this study, a finite set of Gaussian-type functions is adopted as radial functions in basis sets to expand the neighboring atomic density, expressed as

fn(r)=exp[βn(rrn)2]fc(r),f_{n}(r)=\exp\left[-\beta_{n}(r-r_{n})^{2}\right]f_{c}(r), (12)

where βn\beta_{n} and rnr_{n} are parameters. The cutoff function fcf_{c} is given by a cosine-based function as

fc(r)={12[cos(πrrc)+1](rrc)0(r>rc).\displaystyle f_{c}(r)=\left\{\begin{aligned} &\frac{1}{2}\left[\cos\left(\pi\frac{r}{r_{c}}\right)+1\right]&(r\leq r_{c})\\ &0&(r>r_{c})\end{aligned}\right.. (13)

The order parameters of atom ii and element pair {si,s}\{s_{i},s\} are approximately estimated from the neighboring atomic density of element ss around atom ii as

anlm,{si,s}(i)={j|rijrc,sj=s}fn(rij)Ylm(θij,ϕij),a_{nlm,\{s_{i},s\}}^{(i)}=\sum_{\{j|r_{ij}\leq r_{c},s_{j}=s\}}f_{n}(r_{ij})Y_{lm}^{*}(\theta_{ij},\phi_{ij}), (14)

where (rij,θij,ϕij)(r_{ij},\theta_{ij},\phi_{ij}) denotes the spherical coordinates of neighboring atom jj centered at the position of atom ii. Although the Gaussian-type radial functions are not orthonormal, such an approximation of the order parameters is acceptable in the present polynomial-based framework, as also discussed in Ref. Seko et al., 2019.

II.3 Polynomial models for atomic energy

Polynomial models are here employed to represent the atomic energy as a function of structural features. Given a set of structural features D={d1,d2,}D=\set{d_{1},d_{2},\cdots}, polynomial functions are written as

f1(D)\displaystyle f_{1}\left(D\right) =\displaystyle= iwidi\displaystyle\sum_{i^{\prime}}w_{i^{\prime}}d_{i^{\prime}}
f2(D)\displaystyle f_{2}\left(D\right) =\displaystyle= {i,j}wijdidj\displaystyle\sum_{\{i^{\prime},j^{\prime}\}}w_{i^{\prime}j^{\prime}}d_{i^{\prime}}d_{j^{\prime}} (15)
f3(D)\displaystyle f_{3}\left(D\right) =\displaystyle= {i,j,k}wijkdidjdk\displaystyle\sum_{\{i^{\prime},j^{\prime},k^{\prime}\}}w_{i^{\prime}j^{\prime}k^{\prime}}d_{i^{\prime}}d_{j^{\prime}}d_{k^{\prime}}
\displaystyle\vdots

where ww denotes a regression coefficient. Although the polynomial functions are described by all combinations of structural features, only nonzero polynomial terms are retained, which is analogous to the enumeration of nonzero polynomial invariants. In a multicomponent system, a structural feature is composed of order parameters, each of which has an attribute on the element unordered pair tt. Therefore, when the element pair of the pp^{\prime}th order parameter in structure feature did_{i^{\prime}} of a polynomial term is denoted by ti,pt_{i^{\prime},p^{\prime}}, a nonzero polynomial term satisfies the condition that the intersection of {ti,p}\{t_{i^{\prime},p^{\prime}}\} is not the empty set:

(pti,p)(ptj,p).\left(\prod_{p^{\prime}}t_{i^{\prime},p^{\prime}}\right)\cap\left(\prod_{p^{\prime}}t_{j^{\prime},p^{\prime}}\right)\cap\cdots\neq\varnothing. (16)

For example, the intersection of element pairs becomes the empty set if a polynomial term is composed of structural features with ti,pi={A,A}t_{i^{\prime},p_{i}^{\prime}}={\rm\{A,A\}} and tj,pj={B,B}t_{j^{\prime},p_{j}^{\prime}}={\rm\{B,B\}}. Such polynomial terms are eliminated from the polynomial functions.

The following polynomial models for the atomic energy are systematically applied to obtain Pareto optimal MLPs as described in Sec. IV.1. The first model is a polynomial of pairwise structural features. When a set of pairwise structural features is described as

Dpair(i)={dn0,t(i)},D_{\rm pair}^{(i)}=\set{d_{n0,t}^{(i)}}, (17)

the first model is expressed as

E(i)=f1(Dpair(i))+f2(Dpair(i))+f3(Dpair(i))+.E^{(i)}=f_{1}\left(D_{\rm pair}^{(i)}\right)+f_{2}\left(D_{\rm pair}^{(i)}\right)+f_{3}\left(D_{\rm pair}^{(i)}\right)+\cdots. (18)

The first model includes the special case that only powers of the pairwise structural features are considered, which was introduced for elemental systems in Refs. Seko et al., 2014 and Seko et al., 2015. The first model can also be regarded as a straightforward extension of embedded atom method (EAM) potentials Takahashi et al. (2017).

The second model for the atomic energy is a linear polynomial of polynomial invariants given by Eqn. (10). A set of polynomial invariants is described as

D(i)=Dpair(i)D2(i)D3(i)D4(i),D^{(i)}=D_{\rm pair}^{(i)}\cup D_{2}^{(i)}\cup D_{3}^{(i)}\cup D_{4}^{(i)}\cup\cdots, (19)

where a set of ppth-order polynomial invariants is denoted by

D2(i)\displaystyle D_{2}^{(i)} =\displaystyle= {dnll,t1t2(i)}\displaystyle\set{d_{nll,t_{1}t_{2}}^{(i)}}
D3(i)\displaystyle D_{3}^{(i)} =\displaystyle= {dnl1l2l3,t1t2t3(i)}\displaystyle\set{d_{nl_{1}l_{2}l_{3},t_{1}t_{2}t_{3}}^{(i)}} (20)
D4(i)\displaystyle D_{4}^{(i)} =\displaystyle= {dnl1l2l3l4,t1t2t3t4,(σ)(i)}.\displaystyle\set{d_{nl_{1}l_{2}l_{3}l_{4},t_{1}t_{2}t_{3}t_{4},(\sigma)}^{(i)}}.

A second-order invariant is identified with a single ll value because second-order linear combinations are invariant only when l1=l2l_{1}=l_{2} El-Batanouny and Wooten (2008); Tolédano and Tolédano (1987). The second model is then written as

E(i)=f1(D(i)),E^{(i)}=f_{1}\left(D^{(i)}\right), (21)

which was introduced in Ref. Seko et al., 2019 for elemental systems. Note that a linear polynomial model with up to third-order invariants is equivalent to a spectral neighbor analysis potential (SNAP) Thompson et al. (2015), expressed as

E(i)=f1(Dpair(i)D2(i)D3(i)).E^{(i)}=f_{1}\left(D_{\rm pair}^{(i)}\cup D_{2}^{(i)}\cup D_{3}^{(i)}\right). (22)

An extension of the second model is a polynomial of polynomial invariants described as

E(i)=f1(D(i))+f2(D(i))+f3(D(i))+.E^{(i)}=f_{1}\left(D^{(i)}\right)+f_{2}\left(D^{(i)}\right)+f_{3}\left(D^{(i)}\right)+\cdots. (23)

Note that a quadratic polynomial model of polynomial invariants up to the third order is equivalent to a quadratic SNAP Wood and Thompson (2018b). Other extended models are also introduced, which are given by

E(i)\displaystyle E^{(i)} =\displaystyle= f1(D(i))+f2(Dpair(i))+f3(Dpair(i))\displaystyle f_{1}\left(D^{(i)}\right)+f_{2}\left(D_{\rm pair}^{(i)}\right)+f_{3}\left(D_{\rm pair}^{(i)}\right)
E(i)\displaystyle E^{(i)} =\displaystyle= f1(D(i))+f2(Dpair(i)D2(i))\displaystyle f_{1}\left(D^{(i)}\right)+f_{2}\left(D_{\rm pair}^{(i)}\cup D_{2}^{(i)}\right) (24)
E(i)\displaystyle E^{(i)} =\displaystyle= f1(D(i))+f2(Dpair(i)D2(i)D3(i)).\displaystyle f_{1}\left(D^{(i)}\right)+f_{2}\left(D_{\rm pair}^{(i)}\cup D_{2}^{(i)}\cup D_{3}^{(i)}\right).

They are decomposed into a linear polynomial of structural features and a polynomial of a subset of the structural features.

The atomic energy in all the models is measured from the sum of the energies of isolated atoms. Moreover, the forces acting on atoms and the stress tensors in a multicomponent system can be derived in a similar manner to those in the elemental system derived in Ref. Seko et al., 2019. Note that the above polynomial function forms were also applied to develop MLPs for elemental systems included in the Machine Learning Potential Repository Seko (2020); Mac .

III Datasets and computational procedure

Table 1: List of structure generators used for developing the training and test datasets.
ICSD CollCode Structure type ICSD CollCode Structure type ICSD CollCode Structure type
239 Cu3Se2 106786 Hg2Pt 618295 MoC1-x
5258 FeSi2 107998 MoNi4 618702 ScTe
16504 CrSi2 108707 HgMn 625334 Laves(2H)-MgZn2
16606 Nb3Te4 108762 Hg4Pt 626692 NiAs
30446 Fe2B 150584 Fe13Ge3 629380 Al3Os2
42428 Fe3Pt 155842 Co5Fe11 629406 Cu4Ti3
42472 CoO 161109 CoSn 633467 FeSe(tP2tP2)
42773 IrGe4 161133 Fe2Si(HT) 635060 FeSi
52294 GeTe 167735 Ru2B3 635208 CoGa3
55492 BaPt 168897 LaI 635642 Hg5Mn2
58471 CuZr2 169457 ZrH2 638227 CaF2
58607 Au2Ti 181127 AuCu3 639037 HgIn
58745 Fe6Ge6Mg 181788 NaCl 639148 NiHg4
59508 AuCu 185626 Al3Ni2 639227 Si2U3
59586 Pd5Th3 188260 Heusler-AlCu2Mn 639879 In5In4
69199 U3Si 189695 CuHg2Ti 640726 CuSmP2
69557 CdI2(hP9hP9) 189711 Heusler-AlLiSi 643301 Au3Cd
73839 Ni3S2 240119 AlLi 644708 WC
97006 InMg2 246555 Co2Nd 648572 CuInPt2
99787 Fe3Pt 248490 Pt2Si 648748 Pd4Se
100654 BiSe 260285 UCl3 649037 Ni3Ti
102712 CoU 262070 AlLi(hP8hP8) 650527 CsCl
103775 NaTl 409859 La2Sb 652553 AlCr2-MoSi2
103995 Ga3Ti2 416747 Al3Zr 655706 Cu2Te(HT)
104506 Ni3Sn 420250 LiPd2Tl 659806 GeTe
105191 Al3Ti 424636 MnGa4 659829 Al2Li3
105521 Al5W 609153 AlPt3 659856 LiPt
105636 PbU 610464 PbClF/Cu2Sb
105726 Pd5Ti3 611176 Fe2P
105948 InNi2 611457 NbAs
106325 BiIn 611618 TiAs

A straightforward development of training and test datasets for the Ti-Al binary system begins with a set of structure generators. In this study, prototype structures reported as binary alloy entries in the Inorganic Crystal Structure Database (ICSD) Bergerhoff and Brown (1987) are used as structure generators such that the datasets can cover a wide variety of structures. Moreover, the prototype structures are restricted to those represented by unit cells with up to eight atoms. A structure generator made by swapping elements in each prototype structure is also introduced; hence, the total number of structure generators is 150. The structure generators are listed in Table 1.

Given the structure generators, the atomic positions and lattice constants of the structure generators are fully optimized by DFT calculation to obtain their equilibrium structures. A structure in the datasets is then generated by introducing random lattice expansion, random lattice distortion, and random atomic displacements into a supercell of each of the equilibrium structures. A mathematical description of the procedure can be found in Ref. Seko et al., 2019. By repeatedly applying the procedure to the structure generators, 27,394 binary structures are generated for the datasets. In addition to the binary structures and the equilibrium structures of the structure generators, existing data for elemental Ti and Al Seko (2020) are also included in the present datasets.

For the total of 41,508 structures, DFT calculations are performed using the plane-wave-basis projector augmented wave (PAW) method Blöchl (1994) within the Perdew–Burke–Ernzerhof exchange-correlation functional Perdew et al. (1996) as implemented in the vasp code Kresse and Hafner (1993); Kresse and Furthmüller (1996); Kresse and Joubert (1999). The cutoff energy is set to 300 eV. The total energies converge to less than 10-3 meV/supercell. The atomic positions and lattice constants for the structure generators are optimized until the residual forces are less than 10-2 eV/Å.

The regression coefficients of a potential energy model are estimated by linear ridge regression. The DFT total energies, the DFT forces acting on atoms, and the DFT stress tensors of structures in the training dataset are simultaneously used to estimate the regression coefficients, as adopted in Refs. Seko et al., 2019 and Seko, 2020. Therefore, the total number of training data reaches 5,178,510.

IV Results and discussion

IV.1 Pareto optimal MLPs

Refer to caption
Refer to caption
Figure 2: (a) Distribution of MLPs for the Ti-Al binary system. The purple closed circles show Pareto optimal points of the distribution with different trade-offs between accuracy and computational efficiency. The green closed circles indicate the MLP with the lowest prediction error denoted by “MLP3” and the Pareto optimal MLPs with high computational cost performance denoted by “MLP1” and “MLP2”. The computational time indicates the elapsed time for a single-point calculation normalized by the number of atoms. The elapsed time is measured using a single core of Intel® Xeon® E5-2695 v4 (2.10 GHz). (b) Dependence of the computational time required for a single-point calculation on the number of atoms. The dependence of the computational time using the EAM potential Zope and Mishin (2003) is also shown for comparison.
Table 2: Model parameters and prediction errors of MLP1, MLP2, and MLP3 for the Ti-Al binary system.
MLP1 MLP2 MLP3
Number of coefficients 7875 27,520 61,605
RMS error (energy, meV/atom) 4.51 2.09 1.67
RMS error (force, eV/Å) 0.138 0.074 0.066
Cutoff radius (Å) 8.0 6.0 8.0
Number of radial functions 10 10 15
Polynomial order (function FF) 2 2 2
Polynomial order (invariants) 3 2 2
{lmax(2),lmax(3),}\set{l_{\rm max}^{(2)},l_{\rm max}^{(3)},\cdots} [0,0] [4] [4]

The polynomial models given in Sec. II.3 are systematically applied to develop MLPs in the Ti-Al binary system. Because the accuracy and computational efficiency of the MLPs strongly depend on several input parameters, a systematic grid search is performed to find their optimal values. The parameters in the grid search are the cutoff radius, the type of structural features, the type of potential energy model, the number of radial functions, the polynomial order in the potential energy model, and the truncation of the polynomial invariants, i.e., the maximum angular numbers of spherical harmonics, {lmax(2),lmax(3),,lmax(pmax)}\{l_{\rm max}^{(2)},l_{\rm max}^{(3)},\cdots,l_{\rm max}^{(p_{\rm max})}\}, and the polynomial order of the invariants, pmaxp_{\rm max}.

Figure 2 (a) shows the distribution of MLPs obtained from the grid search. The root mean squared (RMS) error for the test dataset is used as an estimator of the accuracy of MLPs. The computational time indicates the elapsed time for a single point calculation normalized by the number of atoms. Figure 2 (a) also shows the Pareto optimal MLPs when optimizing both the accuracy and computational efficiency simultaneously. As can be seen in Fig. 2 (a), the accuracy and computational efficiency of MLPs are conflicting properties; hence, the Pareto optimal MLPs can be optimal ones with different trade-offs.

In performing an atomistic simulation, an appropriate MLP must be chosen from the Pareto optimal ones according to its target system and purpose. Therefore, a convenient score that can estimate the computational cost performance is required to find an MLP with high computational cost performance in a simplified manner. In this study, functions t+ΔEt+\Delta E and 10t+ΔE10t+\Delta E that should be minimized are introduced, where tt and ΔE\Delta E denote the computational time with the unit of ms/atom/step and the RMS error with the unit of meV/atom, respectively.

Figure 2 (a) shows the MLP with the lowest RMS error and two Pareto optimal MLPs showing high computational cost performance. The MLP with the lowest RMS error is denoted by “MLP3”, whereas the two MLPs showing high computational cost performance are denoted by “MLP1” and “MLP2”. MLP1 and MLP2 are obtained by minimizing the scores 10t+ΔE10t+\Delta E and t+ΔEt+\Delta E, respectively. As can be seen in Fig. 2 (a), they exhibit high computational efficiency without significantly increasing the RMS error. The model parameters and RMS errors of MLP1, MLP2, and MLP3 are listed in Table 2. The RMS errors of MLP2 and MLP3 are close, 2.09 and 1.67 meV/atom, respectively. The RMS error of MLP1 is 4.51 meV/atom, which is greater than those of MLP2 and MLP3, whereas MLP1 is ten times more computationally efficient than MLP2. As described in Table 2, all the MLPs are derived from polynomial invariants. However, MLP1 is developed only using the l=0l=0 component of spherical harmonics. Therefore, MLP1 can be regarded as a polynomial model of pairwise structural features.

Figure 2 (b) shows the computational time required for a single point calculation using the EAM potential Zope and Mishin (2003), MLP1, MLP2, and MLP3. The computational time is evaluated for structures with up to 32,000 atoms constructed by the expansions of the four-atom unit cell of a AuCu-type (L10L1_{0}) structure with a lattice constant of 4 Å. The computational time is measured by implementing the present MLPs Lam in the lammps code lammps code ; Plimpton (1995). As can be seen in Fig. 2 (b), the EAM potential and the MLPs show linear scaling of the computational time with respect to the number of atoms. Therefore, the computational time normalized by the number of atoms can be an estimator of the computational efficiency of MLPs, and the computational time required for a simulation of nstepn_{\rm step} steps using a structure with natomn_{\rm atom} atoms can be estimated as t×natom×nstept\times n_{\rm atom}\times n_{\rm step}.

IV.2 Cohesive energy

Refer to caption
Figure 3: (a) Distribution of the prediction errors for structures in the training and test datasets. (b) Distribution of the prediction errors for the equilibrium structure generators listed in Table 1.

Figure 3 (a) shows the distribution of the prediction errors for structures in the training and test datasets. The degree of scattering in the distribution decreases in the order of MLP1, MLP2, and MLP3, which coincides with the decreasing order of the RMS errors shown in Table 2. Figure 3 (b) shows the distribution of the prediction errors for the equilibrium structure generators listed in Table 1. The distribution of the prediction errors for the structure generators indicates the predictive power for the cohesive energy. The prediction errors for some of the structure generators are significant in MLP1. On the other hand, the prediction errors are trivial for all the structure generators in MLP2 and MLP3, which indicates that MLP2 and MLP3 should have high predictive power for the cohesive energy in a wide variety of binary ordered structures.

IV.3 Formation energy

Table 3: Formation energies of selected ordered structures. (unit: meV/atom)
Structure type EAM111Ref. Zope and Mishin,2003 MLP1 MLP2 MLP3 DFT
Ti5Al
Al5W -180 -156 -135 -134 -136
Ti4Al
MoNi4 (D1aD1_{a}) -221 -202 -180 -178 -179
Ti3Al
Ni3Sn (D019D0_{19}) -288 -294 -278 -277 -280
Ni3Ti (D024D0_{24}) -288 -290 -267 -265 -267
AuCu3 (L12L1_{2}) -288 -283 -263 -262 -264
Al3Zr (D023D0_{23}) -282 -276 -258 -257 -259
Al3Ti (D022D0_{22}) -275 -275 -253 -252 -254
AlCu2Mn (L21L2_{1}) -229 -164 -143 -142 -143
Ti5Al2
Hg5Mn2 -161 -70 -53 -51 -53
Ti11Al5
Co5Fe11 -246 -254 -235 -234 -236
Ti2Al
InNi2 (B82B8_{2}) -208 -317 -305 -304 -305
CuZr2 -299 -284 -268 -267 -269
Fe2P (C22C22) -276 -205 -237 -228 -228
CrSi2 (C40C40) -245 -210 -187 -185 -186
Fe2B -137 -160 -143 -141 -143
Cu2Sb (C38C38) -128 -160 -139 -137 -139
FeSi2 -118 -22 -7 -4 -7
Ti5Al3
Pd5Th3 -255 -204 -192 -191 -193
Ti3Al2
Ga3Ti2 -363 -344 -332 -331 -331
Al3Os2 -290 -302 -288 -287 -288
Si2U3 (D5aD5_{a}) -140 -253 -246 -244 -245
Ti4Al3
Nb3Te4 -340 -340 -327 -327 -329
Cu4Ti3 -281 -232 -217 -216 -217
TiAl
AuCu (L10L1_{0}) -404 -417 -404 -403 -404
PbU -370 -375 -366 -367 -368
CoU (BaB_{a}) -340 -298 -283 -282 -283
CsCl (B2B2) -286 -280 -265 -263 -262
NiAs (B81B8_{1}) -311 -263 -257 -251 -251
WC (BhB_{h}) -296 -255 -250 -249 -250
ScTe -307 -238 -223 -224 -225
BiSe -265 -185 -172 -172 -173
FeSi (B20B20) -206 -145 -118 -118 -120
NaTl (B32B32) -319 -83 -70 -69 -71
Ti2Al3
Ga3Ti2 -370 -408 -416 -418 -419
Al3Os2 -353 -319 -309 -309 -310
Si2U3 (D5aD5_{a}) -91 -80 -73 -72 -72
Table 4: Formation energies of selected ordered structures (continued). (unit: meV/atom)
Structure type EAM222Ref. Zope and Mishin,2003 MLP1 MLP2 MLP3 DFT
Ti3Al5
Pd5Th3 -239 -332 -322 -324 -325
Pd5Ti3 -340 -293 -285 -284 -284
TiAl2
MgZn2 (C14C14) -331 -334 -326 -325 -326
Co2Nd -313 -323 -315 -314 -315
CuZr2 -320 -252 -244 -243 -245
La2Sb -181 -248 -238 -238 -239
Cu2Sb (C38C38) -184 -239 -232 -229 -230
FeSi2 -156 -212 -210 -210 -212
Fe2P (C22C22) -232 -206 -191 -192 -195
Hg2Pt 183 -45 -40 -41 -43
Ti5Al11
Co5Fe11 -331 -380 -373 -373 -374
Ti2Al5
Hg5Mn2 -168 -295 -289 -289 -289
TiAl3
Al3Zr (D023D0_{23}) -297 -407 -402 -402 -403
Al3Ti (D022D0_{22}) -289 -407 -403 -397 -397
AuCu3 (L12L1_{2}) -303 -375 -369 -369 -370
Ni3Ti (D024D0_{24}) -293 -349 -338 -338 -338
Ni3Sn (D019D0_{19}) -286 -321 -318 -318 -319
TiAl4
MoNi4 (D1aD1_{a}) -240 -223 -216 -215 -216
TiAl5
Al5W -188 -184 -180 -180 -180

The formation energy of a given ordered structure is more challenging to predict accurately than its cohesive energy because high predictive power is required not only for the ordered structure but also for the reference structures of hexagonal-close-packed (hcp) Ti and face-centered-cubic (fcc) Al. Furthermore, the local geometry relaxation for the ordered structure is crucial for evaluating the formation energy. Therefore, MLPs are needed to derive an accurate potential energy surface around the initial and equilibrium structures.

Tables 3 and 4 show the formation energies of selected ordered structures predicted using the EAM potential, MLP1, MLP2, and MLP3, compared with those predicted by DFT calculation. Note that the ordered structures cover a wide range of compositions. The RMS errors of the EAM potential, MLP1, MLP2, and MLP3 for the formation energy are 68.0, 13.8, 2.0, and 1.5 meV/atom, respectively. This reveals that MLP2 and MLP3 have high predictive power for the formation energy in a wide range of structures. The RMS error of MLP1 for the formation energy is significant as a consequence of the systematic deviation of the formation energy for the overall ordered structures. As can be seen in Tables 3 and 4, the formation energies of most of the ordered structures predicted using MLP1 are approximately 10 meV/atom lower than those predicted by DFT calculation, which originates from the fact that the prediction error of MLP1 for hcp-Ti is significant (++27.4 meV/atom). Moreover, the prediction error of the EAM potential is much more significant, and the EAM potential fails to reconstruct the hierarchy of the formation energies predicted by DFT calculation.

IV.4 Elastic constants

Table 5: Lattice constants and elastic constants of TiAl (L10L1_{0}, B2B2, and B81B8_{1}).
EAM333Ref. Zope and Mishin,2003 MLP1 MLP2 MLP3 DFT
γ\gamma-TiAl (CuAu, L10L1_{0})
a0a_{0} (Å) 2.827 2.812 2.812 2.812 2.813
c0c_{0} (Å) 4.187 4.080 4.078 4.080 4.079
C11C_{11} (GPa) 237 219 195 190 195
C12C_{12} (GPa) 67 35 63 65 66
C13C_{13} (GPa) 114 87 90 90 89
C33C_{33} (GPa) 213 189 176 176 173
C44C_{44} (GPa) 92 112 114 114 113
C66C_{66} (GPa) 45 52 39 39 38
TiAl (CsCl, B2B2)
a0a_{0} (Å) 3.278 3.184 3.183 3.182 3.182
C11C_{11} (GPa) 80 67 82 69 74
C12C_{12} (GPa) 121 132 134 141 136
C44C_{44} (GPa) 95 80 87 82 66
TiAl (NiAs, B81B8_{1})
a0a_{0} (Å) 2.853 2.880 2.878 2.877 2.879
c0c_{0} (Å) 9.370 9.269 9.264 9.276 9.263
C11C_{11} (GPa) 157 149 126 132 136
C12C_{12} (GPa) 94 113 99 94 96
C13C_{13} (GPa) 93 88 65 72 74
C33C_{33} (GPa) 292 277 250 220 223
C44C_{44} (GPa) 67 73 73 71 75
C66C_{66} (GPa) 32 18 14 19 20
Table 6: Lattice constants and elastic constants of Ti3Al (D019D0_{19}, D022D0_{22}, and L12L1_{2}).
EAM444Ref. Zope and Mishin,2003 MLP1 MLP2 MLP3 DFT
Ti3Al (Ni3Sn, D019D0_{19})
a0a_{0} (Å) 5.784 5.728 5.731 5.729 5.726
c0c_{0} (Å) 4.750 4.646 4.643 4.644 4.646
C11C_{11} (GPa) 199 187 181 196 195
C12C_{12} (GPa) 89 113 104 99 90
C13C_{13} (GPa) 74 61 67 71 70
C33C_{33} (GPa) 224 237 235 231 232
C44C_{44} (GPa) 51 44 47 63 59
C66C_{66} (GPa) 55 37 39 48 53
Ti3Al (TiAl3, D022D0_{22})
a0a_{0} (Å) 4.082 3.943 3.961 3.962 3.960
c0c_{0} (Å) 8.252 8.479 8.423 8.433 8.444
C11C_{11} (GPa) 161 192 173 177 173
C12C_{12} (GPa) 100 145 105 92 97
C13C_{13} (GPa) 93 119 101 89 87
C33C_{33} (GPa) 146 260 168 151 171
C44C_{44} (GPa) 71 73 84 79 87
C66C_{66} (GPa) 78 98 94 98 89
Ti3Al (Cu3Au, L12L1_{2})
a0a_{0} (Å) 4.089 4.036 4.036 4.036 4.035
C11C_{11} (GPa) 165 143 165 172 175
C12C_{12} (GPa) 97 102 91 89 89
C44C_{44} (GPa) 76 92 93 92 90
Table 7: Lattice constants and elastic constants of TiAl3 (D022D0_{22}, D019D0_{19}, and L12L1_{2}).
EAM555Ref. Zope and Mishin,2003 MLP1 MLP2 MLP3 DFT
TiAl3 (TiAl3, D022D0_{22})
a0a_{0} (Å) 4.049 3.838 3.842 3.842 3.841
c0c_{0} (Å) 8.139 8.626 8.621 8.609 8.608
C11C_{11} (GPa) 170 190 213 188 196
C12C_{12} (GPa) 98 100 76 81 87
C13C_{13} (GPa) 89 2 26 38 47
C33C_{33} (GPa) 140 212 192 223 220
C44C_{44} (GPa) 62 70 83 83 95
C66C_{66} (GPa) 71 140 126 120 129
TiAl3 (Ni3Sn, D019D0_{19})
a0a_{0} (Å) 5.704 5.565 5.565 5.564 5.563
c0c_{0} (Å) 4.810 4.722 4.724 4.725 4.726
C11C_{11} (GPa) 205 219 203 206 209
C12C_{12} (GPa) 88 58 67 66 67
C13C_{13} (GPa) 62 53 63 59 60
C33C_{33} (GPa) 189 163 161 168 167
C44C_{44} (GPa) 34 46 53 62 65
C66C_{66} (GPa) 58 80 68 70 71
TiAl3 (Cu3Au, L12L1_{2})
a0a_{0} (Å) 4.050 3.976 3.977 3.977 3.979
C11C_{11} (GPa) 179 183 181 190 191
C12C_{12} (GPa) 95 56 68 69 66
C44C_{44} (GPa) 73 84 77 76 77

Tables 5, 6, and 7 show the lattice constants and the elastic constants of nine structures, i.e., TiAl (L10L1_{0}), TiAl (B2B2), TiAl (B81B8_{1}), Ti3Al (D019D0_{19}), Ti3Al (D022D0_{22}), Ti3Al (L12L1_{2}), TiAl3 (D022D0_{22}), TiAl3 (D019D0_{19}), and TiAl3 (L12L1_{2}), which are predicted using the EAM potential, MLP1, MLP2, and MLP3, along with the lattice constants and the elastic constants obtained by DFT calculation. The lattice constants predicted using the EAM potential deviate from those obtained by DFT calculation in most of the structures. This deviation may arise from the fact that the EAM potential was developed by fitting to experimental lattice constants Zope and Mishin (2003), excluding the descriptive power of the EAM potential model. In contrast, the MLPs can compute the lattice constants accurately in all of the structures. Regarding the elastic constants, the elastic constants predicted using the EAM potential and MLP1 are close to those obtained by DFT calculation in many structures; however, the EAM potential and MLP1 fail to predict the elastic constants accurately in a few structures. Also, the elastic constants predicted using MLP2 and MLP3 are almost the same as those obtained by DFT calculation in all structures.

IV.5 Phonon properties

Refer to caption
Figure 4: (a) Phonon density of states for 13 selected compounds in the Ti-Al binary system, predicted using the EAM potential and the MLPs. The shaded region indicates the phonon density of states computed by DFT calculation. (b) Phonon dispersion curves for γ\gamma-TiAl predicted using the EAM potential and the MLPs. The solid black lines indicate the phonon dispersion curves predicted by DFT calculation. (c) Temperature dependence of the thermal expansion predicted using the EAM potential and the MLPs. The broken black lines show the thermal expansion computed by DFT calculation.

The predictive power of the MLPs for phonon properties and thermal expansion is examined. The phonon properties and thermal expansion are calculated using a finite displacement method implemented in the phonopy code Togo and Tanaka (2015). Figure 4 (a) shows the phonon density of states for 13 structures predicted using the EAM potential, MLP1, MLP2, and MLP3. The supercells required to compute the phonon properties are constructed by the expansions of the conventional unit cells of the structures. The number of atoms included in the supercells ranges from 54 to 162. The EAM potential predicts the phonon density of states well in the low-frequency region for many structures, while the deviation from the DFT phonon density of states is large in the high-frequency region. Conversely, the phonon density of states predicted using the MLPs and those predicted by DFT calculation overlap for all the structures, particularly those predicted using MLP2 and MLP3.

Figure 4 (b) shows the phonon dispersion curves for TiAl (AuCu-type, L10L1_{0}) predicted using the EAM potential and the MLPs, compared with those predicted by DFT calculation. The deviation of the EAM phonon dispersions from the DFT phonon dispersions is significant in the high-frequency region. On the other hand, the phonon dispersions of MLP2 and MLP3 are consistent with those of DFT calculation.

Figure 4 (c) shows the temperature dependence of the thermal expansion, calculated using a quasi-harmonic approximation, in Ti3Al (D019D0_{19}), TiAl (L10L1_{0}), and TiAl3 (D022D0_{22}), which are experimentally observed in the Ti-Al binary system. As can be seen in Fig. 4 (c), the thermal expansion of the EAM potential differs from that of the DFT calculation in all the structures. On the other hand, MLP2 and MLP3 derive the temperature dependence of the thermal expansion accurately in all the structures, even though the thermal expansion is more challenging to predict accurately than the phonon density of states and the phonon dispersion curves. The accurate prediction of the thermal expansion indicates that MLP2 and MLP3 can accurately evaluate the volume dependence of the whole range of phonon frequencies.

IV.6 Bain path between 𝜸\bm{\gamma}-TiAl and 𝑩𝟐\bm{B2}

Refer to caption
Figure 5: Energy profiles along the Bain path between the L10L1_{0} structure and the B2B2 structure.

The energy profile along the Bain path between the γ\gamma-TiAl (L10L1_{0}) structure and the B2B2 structure is calculated using the EAM potential and the MLPs, and compared with that obtained by DFT calculation. The structures required to compute the energy profile are obtained by transforming the c/ac/a ratio while keeping their volumes fixed to that of the equilibrium L10L1_{0} structure. The energy profile is then evaluated by single-point calculations without geometry optimization for the structures. Figure 5 shows the energy profiles along the Bain path predicted using the EAM potential, the MLPs, and DFT calculation. The energy profiles along the Bain path of the MLPs, particularly MLP2 and MLP3, are consistent with that obtained by DFT calculation. The EAM profile is also close to the DFT profile for a c/ac/a ratio of 1.0–1.5, although the c/ac/a ratio of the equilibrium L10L1_{0} structure of the EAM potential is slightly different from that obtained by DFT calculation. The behavior of the energy profile along the Bain path predicted by the EAM and the deviation of the c/ac/a ratio are the same as those discussed by Zope and Mishin in Ref. Zope and Mishin, 2003, who developed the EAM potential.

IV.7 Stacking faults in 𝜸\bm{\gamma}-TiAl

Table 8: Excessive energies of special stacking faults in γ\gamma-TiAl (unit: mJ/m2).
EAM666Ref. Zope and Mishin,2003 MLP1 MLP2 MLP3 DFT
SISF (111)(111) 108 281 165 258 194
APB (111)(111) 249 680 616 681 694
CSF (111)(111) 282 555 431 410 388

Computational models for the superlattice intrinsic stacking fault (SISF), the antiphase boundary (APB), and the complex stacking fault (CSF) are constructed using the procedure shown in Ref. Dumitraschkewitz et al., 2017. First, the supercell is constructed by the expansion along the 111\langle 111\rangle direction of γ\gamma-TiAl with ideal cubic lattice parameters, and the resultant supercell is composed of 24 atoms. The equilibrium structure of the supercell is then obtained by local geometry optimization. A computational model with a stacking fault is generated by introducing a tilt into the equilibrium structure. A displacement vector defining a tilt is given by a linear combination of two vectors corresponding to the 1¯10/2\langle\bar{1}10\rangle/2 and 112¯/2\langle 11\bar{2}\rangle/2 directions in ideal cubic γ\gamma-TiAl. The displacement vectors for the SISF, the APB, and the CSF are expressed as

𝒃SISF\displaystyle\bm{b}_{\rm SISF} =\displaystyle= 13[12112¯],\displaystyle\frac{1}{3}\left[\frac{1}{2}\langle 11\bar{2}\rangle\right],
𝒃APB\displaystyle\bm{b}_{\rm APB} =\displaystyle= 12[121¯10]+12[12112¯],\displaystyle\frac{1}{2}\left[\frac{1}{2}\langle\bar{1}10\rangle\right]+\frac{1}{2}\left[\frac{1}{2}\langle 11\bar{2}\rangle\right], (25)
𝒃CSF\displaystyle\bm{b}_{\rm CSF} =\displaystyle= 12[121¯10]+56[12112¯],\displaystyle\frac{1}{2}\left[\frac{1}{2}\langle\bar{1}10\rangle\right]+\frac{5}{6}\left[\frac{1}{2}\langle 11\bar{2}\rangle\right],

respectively. Finally, single-point calculations are performed for the tilted structures.

Table 8 summarizes the stacking fault energies computed using the EAM potential, the MLPs, and DFT calculation. The cohesive energy of the equilibrium structure of γ\gamma-TiAl is used as a reference to compute the stacking fault energy. As can be seen in Table 8, the MLPs predict the stacking fault energies of the SISF, the APB, and the CSF accurately. On the other hand, the EAM potential lacks the predictive power for the stacking fault energy of the APB.

Refer to caption
Figure 6: (a) Profile of the stacking fault energy in γ\gamma-TiAl along the path γ\gamma-TiAl \rightarrow SISF \rightarrow APB \rightarrow CSF \rightarrow γ\gamma-TiAl \rightarrow APB. (b) GSFE surface in γ\gamma-TiAl obtained using the EAM, the MLPs, and DFT calculation.

The stacking fault energy can be defined not only for these special stacking faults but also for other general stacking faults defined by displacement vectors. A collection of the excessive energies of the general stacking faults comprises a generalized stacking fault energy (GSFE) surface. The displacement vector identifying a tilt of the supercell for a general stacking fault is given by

𝒃=u[121¯10]+v[12112¯],\bm{b}=u\left[\frac{1}{2}\langle\bar{1}10\rangle\right]+v\left[\frac{1}{2}\langle 11\bar{2}\rangle\right], (26)

where uu and vv denote the fractional coordinates defined by the vectors 1¯10/2\langle\bar{1}10\rangle/2 and 112¯/2\langle 11\bar{2}\rangle/2, respectively.

Figure 6 (a) shows the profiles of the stacking fault energy along the path γ\gamma-TiAl \rightarrow SISF \rightarrow APB \rightarrow CSF \rightarrow γ\gamma-TiAl \rightarrow APB, predicted using the EAM potential, the MLPs, and DFT calculation. The displacement vector changes continuously along the path. The stacking fault energy profiles of MLP2 and MLP3 are close to that obtained by DFT calculation. The MLP1 profile agrees with the DFT profile along a major part of the path, while it deviates from the DFT profile along the path between the APB and the CSF. The EAM profile also deviates from the DFT profile along the path between the SISF and the CSF. Figure 6 (b) shows the GSFE surface in γ\gamma-TiAl predicted using the EAM potential, the MLPs, and DFT calculation. The GSFE surfaces predicted using the MLPs and DFT calculation are similar, while that predicted using the EAM potential is different from that predicted by the DFT calculation. Thus, the present MLPs should have high predictive power for the stacking faults and related properties.

V Conclusion

In this study, O(3) polynomial invariants representing the neighboring atomic density in a multicomponent system have been formulated. Polynomial models have also been introduced to describe the relationship between the atomic energy and the polynomial invariants. Although the present formulation to develop MLPs in a multicomponent system is more complex than the formulation for elemental systems, it enables the accuracy and computational efficiency of MLPs to be controlled systematically.

This study also shows an application of the formulation of MLPs to the Ti-Al binary alloy system. Pareto optimal MLPs have been developed by applying the polynomial models combined with the polynomial invariants. These MLPs are available in the Machine Learning Potential Repository Seko (2020); Mac . The predictive power of the Pareto optimal MLPs has been examined for the cohesive energy, the formation energy, the elastic constants, the phonon density of states and dispersion curves, the thermal expansion, the energy profile along the Bain path, and the stacking fault properties. The MLP with the lowest prediction error (MLP3) and that with high computational cost performance (MLP2) have high predictive power for all the properties, whereas an MLP showing higher computational cost performance (MLP1) than MLP2 fails to predict some of the properties with high accuracy. This study reveals that the present framework provides a systematic way to develop MLPs with high computational cost performance in multicomponent systems.

Acknowledgements.
This work was supported by a Grant-in-Aid for Scientific Research (B) (Grant Number 19H02419), and a Grant-in-Aid for Scientific Research on Innovative Areas (Grant Number 19H05787) from the Japan Society for the Promotion of Science (JSPS).

References

  • Lorenz et al. (2004) S. Lorenz, A. Groß,  and M. Scheffler, Chem. Phys. Lett. 395, 210 (2004).
  • Behler and Parrinello (2007) J. Behler and M. Parrinello, Phys. Rev. Lett. 98, 146401 (2007).
  • Bartók et al. (2010) A. P. Bartók, M. C. Payne, R. Kondor,  and G. Csányi, Phys. Rev. Lett. 104, 136403 (2010).
  • Behler (2011) J. Behler, J. Chem. Phys. 134, 074106 (2011).
  • Han et al. (2018) J. Han, L. Zhang, R. Car,  and E. Weinan, Commun. Comput. Phys. 23, 629 (2018).
  • Artrith and Urban (2016) N. Artrith and A. Urban, Comput. Mater. Sci. 114, 135 (2016).
  • Artrith et al. (2017) N. Artrith, A. Urban,  and G. Ceder, Phys. Rev. B 96, 014112 (2017).
  • Szlachta et al. (2014) W. J. Szlachta, A. P. Bartók,  and G. Csányi, Phys. Rev. B 90, 104108 (2014).
  • Bartók et al. (2018) A. P. Bartók, J. Kermode, N. Bernstein,  and G. Csányi, Phys. Rev. X 8, 041048 (2018).
  • Li et al. (2015) Z. Li, J. R. Kermode,  and A. De Vita, Phys. Rev. Lett. 114, 096405 (2015).
  • Glielmo et al. (2017) A. Glielmo, P. Sollich,  and A. De Vita, Phys. Rev. B 95, 214302 (2017).
  • Seko et al. (2014) A. Seko, A. Takahashi,  and I. Tanaka, Phys. Rev. B 90, 024101 (2014).
  • Seko et al. (2015) A. Seko, A. Takahashi,  and I. Tanaka, Phys. Rev. B 92, 054113 (2015).
  • Takahashi et al. (2017) A. Takahashi, A. Seko,  and I. Tanaka, Phys. Rev. Mater. 1, 063801 (2017).
  • Thompson et al. (2015) A. Thompson, L. Swiler, C. Trott, S. Foiles,  and G. Tucker, J. Comput. Phys. 285, 316 (2015).
  • Wood and Thompson (2018a) M. A. Wood and A. P. Thompson, J. Chem. Phys. 148, 241721 (2018a).
  • Chen et al. (2017) C. Chen, Z. Deng, R. Tran, H. Tang, I.-H. Chu,  and S. P. Ong, Phys. Rev. Mater. 1, 043603 (2017).
  • Shapeev (2016) A. V. Shapeev, Multiscale Model. Simul. 14, 1153 (2016).
  • Deringer et al. (2018) V. L. Deringer, C. J. Pickard,  and G. Csányi, Phys. Rev. Lett. 120, 156001 (2018).
  • Podryabinkin et al. (2018) E. V. Podryabinkin, E. V. Tikhonov, A. V. Shapeev,  and A. R. Oganov, arXiv preprint arXiv:1802.07605  (2018).
  • Gubaev et al. (2019) K. Gubaev, E. V. Podryabinkin, G. L. Hart,  and A. V. Shapeev, Comput. Mater. Sci. 156, 148 (2019).
  • Mueller et al. (2020) T. Mueller, A. Hernandez,  and C. Wang, J. Chem. Phys. 152, 050902 (2020).
  • Seko et al. (2019) A. Seko, A. Togo,  and I. Tanaka, Phys. Rev. B 99, 214108 (2019).
  • Hernandez et al. (2019) A. Hernandez, A. Balasubramanian, F. Yuan, S. A. Mason,  and T. Mueller, npj Comput. Mater. 5, 1 (2019).
  • Zuo et al. (2020) Y. Zuo, C. Chen, X. Li, Z. Deng, Y. Chen, J. Behler, G. Csányi, A. V. Shapeev, A. P. Thompson, M. A. Wood,  and S. P. Ong, J. Phys. Chem. A 124, 731 (2020), pMID: 31916773.
  • Steinhardt et al. (1983) P. J. Steinhardt, D. R. Nelson,  and M. Ronchetti, Phys. Rev. B 28, 784 (1983).
  • Bartók et al. (2013) A. P. Bartók, R. Kondor,  and G. Csányi, Phys. Rev. B 87, 184115 (2013).
  • Kondor (2008) I. R. Kondor, Group Theoretical Methods in Machine Learning (Columbia University, 2008).
  • Flusser et al. (2009) J. Flusser, B. Zitova,  and T. Suk, Moments and Moment Invariants in Pattern Recognition (John Wiley & Sons, 2009).
  • Seko (2020) A. Seko, arXiv:2007.14206  (2020).
  • (31) A. Seko, Machine Learning Potential Repository at Kyoto University, https://sekocha.github.io/repository/index-e.html.
  • El-Batanouny and Wooten (2008) M. El-Batanouny and F. Wooten, Symmetry and Condensed Matter Physics: A Computational Approach (Cambridge University Press, 2008).
  • Tolédano and Tolédano (1987) J.-C. Tolédano and P. Tolédano, The Landau Theory of Phase Transitions: Application to Structural, Incommensurate, Magnetic and Liquid Crystal Systems (World Scientific Press, 1987).
  • Wood and Thompson (2018b) M. A. Wood and A. P. Thompson, J. Chem. Phys. 148, 241721 (2018b).
  • Bergerhoff and Brown (1987) G. Bergerhoff and I. D. Brown, in Crystallographic Databases, edited by F. H. Allen et al. (International Union of Crystallography, 1987).
  • Blöchl (1994) P. E. Blöchl, Phys. Rev. B 50, 17953 (1994).
  • Perdew et al. (1996) J. P. Perdew, K. Burke,  and M. Ernzerhof, Phys. Rev. Lett. 77, 3865 (1996).
  • Kresse and Hafner (1993) G. Kresse and J. Hafner, Phys. Rev. B 47, 558 (1993).
  • Kresse and Furthmüller (1996) G. Kresse and J. Furthmüller, Phys. Rev. B 54, 11169 (1996).
  • Kresse and Joubert (1999) G. Kresse and D. Joubert, Phys. Rev. B 59, 1758 (1999).
  • Zope and Mishin (2003) R. R. Zope and Y. Mishin, Phys. Rev. B 68, 024102 (2003).
  • (42) A. Seko, lammps-mlip-package, https://github.com/sekocha/lammps-mlip-package.
  • (43) lammps code, http://lammps.sandia.gov.
  • Plimpton (1995) S. Plimpton, J. Comput. Phys. 117, 1 (1995).
  • Togo and Tanaka (2015) A. Togo and I. Tanaka, Scr. Mater. 108, 1 (2015).
  • Dumitraschkewitz et al. (2017) P. Dumitraschkewitz, H. Clemens, S. Mayer,  and D. Holec, Appl. Sci. 7, 1193 (2017).