This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Research on Sparsity Measures for Rotating Machinery Health Monitoring

Yudong Cao,  Minping Jia, Jichao Zhuang, and Xiaoli Zhao Yudong Cao, Jichao Zhuang and Prof. Minping Jia was with Southeast University, Southeast University, Nanjing, Jiangsu, China.Xiaoli Zhao was with Southeast University, Nanjing University of Science and Technology, Nanjing, Jiangsu, China.Manuscript received XXX; revised XXX.
Abstract

Machine health management is one of the main research contents of PHM technology, which aims to monitor the health states of machines online and evaluate degradation stages through real-time sensor data. In recent years, classic sparsity measures such as kurtosis, Lp/Lq norm, pq-mean, smoothness index, negative entropy, and Gini index have been widely used to characterize the impulsivity of repetitive transients. Since smoothness index and negative entropy were proposed, the sparse properties have not been fully analyzed. The first work of this paper is to analyze six properties of smoothness index and negative entropy. In addition, this paper conducts a thorough investigation on multivariate power average function and finds that existing classical sparsity measures can be respectively reformulated as the ratio of multivariate power mean functions (MPMFs). Finally, a general paradigm of index design are proposed for the expansion of sparsity measures family, and several newly designed dimensionless health indexes are given as examples. Two different run-to-failure bearing datasets were used to analyze and validate the capabilities and advantages of the newly designed health indexes. Experimental results prove that the newly designed health indexes show good performance in terms of monotonic degradation description, first fault occurrence time determination and degradation state assessment.

Index Terms:
Sparsity measure, Multivariate power mean function, Health monitoring, First fault occurrence time, Degradation assessment

I Introduction

The prognostics and Health Management (PHM) includes anomaly detection, fault diagnosis, and remaining useful life prediction[1, 2, 3]. The research on PHM can improve the reliability and safety of mechanical equipment, which is related to production safety and economic benefits. Therefore, it is of great significance to study effective and accurate online health monitoring and degradation assessment methods for mechanical equipment, and then to carry out predictive management and maintenance. At the same time, with the maturity of industrial field monitoring technology, more and more sensor data can be collected to analyze the running state of mechanical equipment[4, 5]. Health indexes (HIs) are designed to extract characteristic information from collected field data to identify and quantify historical and ongoing health processes of mechanical system components. The design of reasonable health indexes has become one of the main research directions of PHM technology, which has attracted more and more attention in recent years[6]. Due to the complexity of mechanical systems and the benefit factors in actual production, it is impossible to frequently shut down the equipment to check the health status, and some early faults or damages are difficult to detect in time. Generally speaking, a ideal health index should have good monotonic performance and be able to detect abnormal states and initial fault occurrence time[7].

Sparsity measures (SMs)[8, 9] as a kind of statistical parameters provide a new perspective for machine health index construction, including kurtosis, Lp/LqLp/Lq norm, pq-mean, smoothness index, negative entropy, and Gini index, etc. In general, rotating machinery fault signals have two characteristics: one is impulsiveness and the other is cyclo-stationarity[10]. In view of this, Dwyer and Antoni[11, 12, 13] proposed spectral kurtosis successively, gave a formalization of spectral kurtosis to quantify the impulsiveness of repetitive transients, and realized its successful application in vibration-based condition monitoring. Wang et al.[14] demonstrated the application of spectral kurtosis in fault detection and diagnosis of key components of rotating machinery and verified the ability of spectral kurtosis to quantify cyclo-stationarity. Zhong et al.[15] proposed a weighted residual regression-based index to relieve the sensitivity of kurtosis and entropy to impulsive noise for gear and bearing degradation evaluation. The latest research mathematically proved that spectral kurtosis can be decomposed into squared L2/L1 norm and squared envelope[16]. The Gini index was first proposed in the economic field to measure the economic wealth inequality of residents of a country or region[17]. Later, it was introduced into the PHM field and considered a more effective criterion than kurtosis and correlated kurtosis for machine condition monitoring. Hurley et al.[18] compared several popular sparsity measures based on different attributes, including pq-mean and Gini index, etc. Miao et al.[19] designed enhancing signal processing methods via Gini index and proposed two blind deconvolution methods based on Gini index and its variants.

In recent years, spectral negative entropy and smoothness index have been added to the sparsity measures family as new members. Antoni[20] introduced negative entropy in the field of thermodynamics as a sparse metric to characterize repetitive fault transients. Smoothness index[21] was first designed to guide the selection of wavelet parameters, which can be expressed in mathematical form as the ratio of geometric mean and arithmetic mean. Miao et al.[22] applied singular value negentropy (SVN) to measure the periodicity of signal without prior knowledge. However, the sparse properties of negative entropy and smoothness index are rarely analyzed. This paper conducts an in-depth analysis of these two sparsity measures from six attributes, including Robin Hood, Scaling, Rising Tide, Cloning, Bill Gates, and Babies. In addition, this paper firstly investigates the multivariate power mean functions (MPMFs) and further finds that most of the currently used sparsity measures can be reformulated as ratios of different MPMFs. On this basis, a general paradigm are proposed for the design of new sparsity measures for machine health monitoring. The main contributions of this paper can be summarized as follows:

(1)The sparse properties of negative entropy and smoothness index are fully studied from six aspects for the first time, and the detailed proofs and final conclusions are given.

(2) The ratio of multivariate power average function is introduced into the domain of online machine monitoring. Mathematically, six existing sparsity measures (kurtosis, Lp/LqLp/Lq norm, pq-mean, smoothness index, negative entropy, and Gini index) are respectively reformulated into the form of ratios of multivariate power mean functions.

(3) A general paradigm of health index design based on the ratios of various MPMFs are proposed, and some examples of dimensionless health indexes designed from different perspectives are presented. The capabilities and advantages of newly designed HIs are specifically analyzed and validated by two run-to-failure bearing datasets.

The rest of the paper is organized as follows. Section II analyzes the sparse properties of smoothness index and negative entropy. Six existing sparsity measures are reformulated in the form of ratios of different MPMFs and a general paradigm are introduced for the expansion of sparsity measures family in Section III. In Section IV, two run-to-failure bearing datasets are employed to validate the generalization and reproducibility of proposed paradigm and newly designed health indexes. Finally, conclusions are drawn in Section V.

II Sparse Attribute Analysis of Smoothness Index and Negative Entropy

II-A Six Attributes of Ideal Sparsity Measures

Suppose a sparse measure SS as a function with the mapping as follows

S:(n1𝕏n)S:\left(\bigcup_{n\geq 1}\mathbb{X}^{n}\right)\rightarrow\mathbb{R} (1)

where 𝕏n\mathbb{X}^{n} represents a complex vector of length N. Metric SS maps complex vectors into a real number. According to previous research and inspiration from economics[23, 18], it is generally believed that ideal SMs should own six attributes.

D1. Robin Hood

Dalton’s 1st Law states that Robin Hood decreases sparsity. Robin Hood operation subtracts a positive constant α\alpha from any relatively large element xix_{i} in the vector, and adds α\alpha to a relatively small element xjx_{j}. After operation, new xix_{i} is still larger than new xjx_{j}. The mapping value of the sparse metric SS on the new vector becomes smaller. Robin Hood can be formally defined as follows:

S([x1xiαxj+α])<S(x)S\left(\left[x_{1}\dots x_{i}-\alpha\dots x_{j}+\alpha\dots\right]\right)<S(\overset{\rightarrow}{x}) (2)

for all α\alpha, xix_{i}, xjx_{j} such that xi>xjx_{i}>x_{j} and 0<α<xixj20<\alpha<\frac{x_{i}-x_{j}}{2}.

D2. Scaling

Dalton’s modified 2nd Law states that Sparsity is scale invariant. After all elements in the vector are multiplied by a constant factor α\alpha, the sparse value of the vector remains invariant. Scaling can be formally defined as follows:

S(αx)=S(x),α,α>0S(\alpha\overset{\rightarrow}{x})=S(\overset{\rightarrow}{x}),\forall\alpha\in\mathbb{R},\alpha>0 (3)

D3. Rising Tide

Dalton’s 3rd Law states that adding a constant to each element in vector can reduce sparsity. After adding a positive constant α\alpha to all elements in the vector, the sparse value of the vector becomes smaller. We exclude the case where all elements are equal as this is equivalent to D2. Rising Tide can be formally defined as follows:

S(α+x)<S(x),α,α>0S(\alpha+\overset{\rightarrow}{x})<S(\overset{\rightarrow}{x}),\forall\alpha\in\mathbb{R},\alpha>0 (4)

D4. Cloning

Dalton’s 4th Law states that sparsity is invariant under cloning operation. The sparse value is unchanged after the vector is joined with itself. Cloning can be formally defined as follows:

S(x)=S(xx)=S(xxx)=S(xxx)S(\vec{x})=S(\vec{x}\|\vec{x})=S(\vec{x}\|\vec{x}\|\vec{x})=S(\vec{x}\|\vec{x}\|\cdots\|\vec{x}) (5)

P1. Bill Gates

An element xix_{i} in the vector becomes infinitely large, and the sparse value of the vector also tends to infinitely large. Bill Gates can be formally defined as follows:

i\forall i, β=βi>0\exists\beta=\beta_{i}>0, such that α>0\forall\alpha>0

S([x1xi+β+α])>S([x1xi+β])S\left(\left[x_{1}\dots x_{i}+\beta+\alpha\dots\right]\right)>S\left(\left[x_{1}\dots x_{i}+\beta\dots\right]\right) (6)

P2. Babies

After adding an element of zero into the vector, and the sparse value of the new vector will increase. Babies can be formally defined as follows:

S(x0)>S(x)S(\vec{x}\|0)>S(\vec{x}) (7)

II-B Attributes of SI and NE

In previous work, Hurely[18] proved the sparsity of kurtosis, Hoyer, Pq-mean and Gini index and other classical sparse measures. Since smoothness index and negative entropy were added to the sparsity measures family, their sparse properties have not been fully studied. The first work of this paper is to analyze the sparse quantification capability of smoothness index and negative entropy from the above six attributes. The result of mathematical derivation shows that smoothness index satisfies nonnegativity, D2 and D4; and negative entropy satisfies nonnegativity, D1, D2, D3, D4, P1, as shown in Table I. The detailed proofs are given in Appendix. Based on our research with Dr. Hurely, the following conclusion can be drawn: Pq-mean, Hoyer, Gini index and negative entropy have better sparse quantification capability among existing popular sparse measures.

TABLE I: Detailed Attributes of SI and NE.
Sparsity Measures Nonnegativity D1 D2 D3 D4 P1 P2
Smoothness index
Negative entropy

III Reformulation and Family Extension of Sparsity Measures

III-A Definition of Multivariate Power Mean Function

Suppose a function has the form of Eq. (8), then it can be called a power mean function, where ai(i=1,2,,n)a_{i}(i=1,2,\dots,n) are incomplete equal positive constants. This function is defined on the entire real number axis, and there is always F(x)>0F(x)>0. Obviously, no matter how x varies, the power mean function does not tend to infinity. Therefore, power mean function is bounded. If min{ai}=amin{\left\{a_{i}\right\}}=a, max{ai}=Amax{\left\{a_{i}\right\}}=A, then aF(x)Aa\leq F(x)\leq A.

F(x)={min(a1,,an) if x=(a1x++anxn)1/x if x\{0}a1ann if x=0max(a1,,an) if x=+F(x)=\begin{cases}\min\left(a_{1},\ldots,a_{n}\right)&\text{ if }x=-\infty\\ \left(\frac{a_{1}^{x}+\cdots+a_{n}^{x}}{n}\right)^{1/x}&\text{ if }x\in\mathbb{R}\backslash\{0\}\\ \sqrt[n]{a_{1}\cdots a_{n}}&\text{ if }x=0\\ \max\left(a_{1},\ldots,a_{n}\right)&\text{ if }x=+\infty\end{cases} (8)

Based on the above, we define a function with the form Γ(𝐱,y)=[1Nn=1N(xn)y]1y\Gamma(\mathbf{x},y)=[\frac{1}{N}\sum_{n=1}^{N}\left(x_{n}\right)^{y}]^{\frac{1}{y}} as a multivariate power mean function (MPMF), where 𝐱=[x1,x2,xn]\mathbf{x}=\left[x_{1},x_{2},\cdots x_{n}\right] represents a row vector. The classical RMS metric can be expressed as Γ(𝐱,2)\Gamma(\mathbf{x},2) in the form of MPMF. In addition, the multivariate power mean inequality needs to be introduced: If y1>y2y_{1}>y_{2}, then Γ(𝐱,y1)Γ(𝐱,y2)\Gamma\left(\mathbf{x},y_{1}\right)\geq\Gamma\left(\mathbf{x},y_{2}\right), and the equality holds if and only if x1=x2=xnx_{1}=x_{2}=\cdots x_{n}. The proof only needs to use Jensen inequality in mathematical analysis (i.e., up and down convexity), and take the auxiliary function f(x)=xaf(x)=x^{a}.

III-B Reformulation of Six Classic Sparsity Measures

Assume that a raw vibration signal is denoted as x[n],n=1,2,3,Nx[n],n=1,2,3,\dots N. Then, a bandpass filtered signal xl,h[n]x_{l,h}[n] can be obtained by a using bandpass filter with a passband [l,h][l,h]. Further, a complex signal xl,h¯[n]=xl,h[n]+jHilbert{xl,h[n]}\overline{x_{l,h}}[n]=x_{l,h}[n]+j\cdot\text{{Hilbert}}\left\{x_{l,h}[n]\right\} and square envelope SEl,h[n]=|xl,h¯[n]|2SE_{l,h}[n]=\left|\overline{x_{l,h}}[n]\right|^{2} can be obtained by Hilbert transform. To simplify symbols, SEl,h[n]SE_{l,h}[n] is replaced by xl,h¯[n]\overline{x_{l,h}}[n] for the rest of the paper.

Six classic sparsity measures are respectively reformulated as the ratios of different multivariate power mean functions in this section, including spectral kurtosis, spectral Lp/LqLp/Lq norm, pq-mean, spectral smoothness index, spectral negative entropy and Gini index. The following subsections give the details of the reformulation.

III-B1 Spectral Kurtosis

The original definition of kurtosis is the normalized 4th order center moment. Spectral kurtosis can be reformulated by combining the ratios of MPMFs as follows:

SK\displaystyle SK =m4{|xl,h¯[n]|}(m2{|xl,h¯[n]|})2=n=1NSEl,h2[n]N(n=1NSEl,h[n]N)2\displaystyle=\frac{m_{4}\left\{\left|\overline{x_{l,h}}[n]\right|\right\}}{\left(m_{2}\left\{\left|\overline{x_{l,h}}[n]\right|\right\}\right)^{2}}=\frac{\sum_{n=1}^{N}\frac{SE_{l,h}^{2}[n]}{N}}{\left(\sum_{n=1}^{N}\frac{SE_{l,h}[n]}{N}\right)^{2}} (9)
=(n=1NXl,h2[n]/Nn=1NXl,h[n]/N)2=[Γ(𝐗,2)Γ(𝐗,1)]2\displaystyle=\left(\frac{\sqrt{\sum_{n=1}^{N}X_{l,h}^{2}[n]/N}}{\sum_{n=1}^{N}X_{l,h}[n]/N}\right)^{2}=\left[\frac{\Gamma(\mathbf{X},2)}{\Gamma(\mathbf{X},1)}\right]^{2}

III-B2 Spectral Lp/LqLp/Lq norm

Spectral Lp/LqLp/Lq norm can be reformulated as the ratio of two different MPMFs as follows:
when p>q>0p>q>0

GSK\displaystyle GSK =N1q1pSEl,h[n]LPSEl,h[n]Lqp!pq!q\displaystyle=N^{\frac{1}{q}-\frac{1}{p}}\frac{\left\|SE_{l,h}[n]\right\|_{L_{P}}}{\left\|SE_{l,h}[n]\right\|_{Lq}}-\frac{\sqrt[p]{p!}}{\sqrt[q]{q!}} (10)
=(n=1NXl,hp[n]N)1p(n=1NXl,hq[n]N)1qp!pq!q=Γ(𝐗,p)Γ(𝐗,q)p!pq!q\displaystyle=\frac{\left(\sum_{n=1}^{N}\frac{X_{l,h}^{p}[n]}{N}\right)^{\frac{1}{p}}}{\left(\sum_{n=1}^{N}\frac{X_{l,h}^{q}[n]}{N}\right)^{\frac{1}{q}}}-\frac{\sqrt[p]{p!}}{\sqrt[q]{q!}}=\frac{\Gamma(\mathbf{X},p)}{\Gamma(\mathbf{X},q)}-\frac{\sqrt[p]{p!}}{\sqrt[q]{q!}}

when p>q=0p>q=0

GSK\displaystyle GSK =N1pSEl,h[n]LPn=1NSEl,h[n]Np!peγ\displaystyle=N^{-\frac{1}{p}}\frac{\left\|SE_{l,h}[n]\right\|_{L_{P}}}{\sqrt[N]{\prod_{n=1}^{N}SE_{l,h}[n]}}-\frac{\sqrt[p]{p!}}{e^{-\gamma}} (11)
=(n=1NXl,hp[n]/N)1pn=1NXl,h[n]Np!peγ=Γ(𝐗,p)eΓ(ln(𝐗),1)p!peγ\displaystyle=\frac{\left(\sum_{n=1}^{N}X_{l,h}^{p}[n]/N\right)^{\frac{1}{p}}}{\sqrt[N]{\prod_{n=1}^{N}X_{l,h}[n]}}-\frac{\sqrt[p]{p!}}{e^{-\gamma}}=\frac{\Gamma(\mathbf{X},p)}{e^{\Gamma(\ln(\mathbf{X}),1)}}-\frac{\sqrt[p]{p!}}{e^{-\gamma}}

where eΓ(ln(𝐗),1)e^{\Gamma(\ln(\mathbf{X}),1)} is equal to Γ(𝐗,0)\Gamma(\mathbf{X},0); γ\gamma is the Euler-Mascheroni constant 0.5772156649….

III-B3 PqmeanPq-mean

Pq-mean is also a widely used sparsity measure with good performance. By the ratio of multivariate power mean functions, the pq-mean can be reformulated as follows:

when 0<p1,q>10<p\leq 1,q>1

pq mean\displaystyle pq-\text{ mean } =(1Nn=1NXl,hp[n])1p(1Nn=1NXl,hq[n])1q\displaystyle=-\left(\frac{1}{N}\sum_{n=1}^{N}X_{l,h}^{p}[n]\right)^{\frac{1}{p}}\left(\frac{1}{N}\sum_{n=1}^{N}X_{l,h}^{q}[n]\right)^{-\frac{1}{q}} (12)
=Γ(𝐗,p)Γ(𝐗,q)\displaystyle=-\frac{\Gamma(\mathbf{X},p)}{\Gamma(\mathbf{X},q)}

III-B4 Spectral smoothness

Spectral smoothness index can also be reformulated in the form of the ratio of different MPMFs as follows:

SI\displaystyle SI =n=1NSEl,h[n]Nn=1NSEl,h[n]N=n=1NXl,h[n]Nn=1NXl,h[n]N\displaystyle=\frac{\sqrt[N]{\prod_{n=1}^{N}SE_{l,h}[n]}}{\frac{\sum_{n=1}^{N}SE_{l,h}[n]}{N}}=\frac{\sqrt[N]{\prod_{n=1}^{N}X_{l,h}[n]}}{\frac{\sum_{n=1}^{N}X_{l,h}[n]}{N}} (13)
=eΓ(ln(𝐗),1)Γ(𝐗,1)\displaystyle=\frac{e^{\Gamma(\ln(\mathbf{X}),1)}}{\Gamma(\mathbf{X},1)}

III-B5 Spectral negative entropy

Similarly, spectral negative entropy can be redefined as follows:

SNE\displaystyle SNE =|xl,h¯[n]|2|xl,h¯[n]|2ln(|xl,h¯[n]|2|xl,h¯[n]|2)\displaystyle=\left\langle\frac{\left|\overline{x_{l,h}}[n]\right|^{2}}{\left\langle\left|\overline{x_{l,h}}[n]\right|^{2}\right\rangle}\ln\left(\frac{\left|\overline{x_{l,h}}[n]\right|^{2}}{\left\langle\left|\overline{x_{l,h}}[n]\right|^{2}\right\rangle}\right)\right\rangle (14)
=SEl,h[n]SEl,h[n]ln(SEl,h[n]SEl,h[n])\displaystyle=\left\langle\frac{SE_{l,h}[n]}{\left\langle SE_{l,h}[n]\right\rangle}\ln\left(\frac{SE_{l,h}[n]}{\left\langle SE_{l,h}[n]\right\rangle}\right)\right\rangle
=Xl,h[n]Γ(𝐗,1)ln(Xl,h[n]Γ(𝐗,1))\displaystyle=\left\langle\frac{X_{l,h}[n]}{\Gamma(\mathbf{X},1)}\ln\left(\frac{X_{l,h}[n]}{\Gamma(\mathbf{X},1)}\right)\right\rangle
=Γ(𝐗Γ(𝐗,1)ln(𝐗Γ(𝐗,1)),1)Γ(1,1)\displaystyle=\frac{\Gamma\left(\frac{\mathbf{X}}{\Gamma(\mathbf{X},1)}\ln\left(\frac{\mathbf{X}}{\Gamma(\mathbf{X},1)}\right),1\right)}{\Gamma(1,1)}

By further mathematical derivation, the following relationship between SNE and SI can be found:

SNE\displaystyle SNE =(n=1NXl,h[n]1Nln(Xl,h[n]Xl,h[n]))ln(n=1NXl,h[n]Nn=1NXl,h[n]/N)(n=1NXl,h[n]/N)ln(SI)\displaystyle=\frac{\left(\sum_{n=1}^{N}X_{l,h}[n]\frac{1}{N}\ln\left(\frac{X_{l,h}[n]}{\left\langle X_{l,h}[n]\right\rangle}\right)\right)}{\ln\left(\frac{\sqrt[N]{\prod_{n=1}^{N}X_{l,h}[n]}}{\sum_{n=1}^{N}X_{l,h}[n]/N}\right)\left(\sum_{n=1}^{N}X_{l,h}[n]/N\right)}\ln(SI) (15)
=(1Nn=1NXl,h[n]ln(Xl,h[n]Γ(𝐱,1)))ln(eΓ(ln(𝐗),1)Γ(𝐗,1))Γ(𝐗,1)ln(SI)\displaystyle=\frac{\left(\frac{1}{N}\sum_{n=1}^{N}X_{l,h}[n]\ln\left(\frac{X_{l,h}[n]}{\Gamma(\mathbf{x},1)}\right)\right)}{\ln\left(\frac{e^{\Gamma(\ln(\mathbf{X}),1)}}{\Gamma(\mathbf{X},1)}\right)\Gamma(\mathbf{X},1)}\ln(SI)
=Γ(𝐗ln(𝐗Γ(𝐗,1)),1)Γ(𝐗,1)(Γ(ln(𝐗),1)ln(Γ(𝐗,1)))ln(SI)\displaystyle=\frac{\Gamma\left(\mathbf{X}\ln\left(\frac{\mathbf{X}}{\Gamma(\mathbf{X},1)}\right),1\right)}{\Gamma(\mathbf{X},1)(\Gamma(\ln(\mathbf{X}),1)-\ln(\Gamma(\mathbf{X},1)))}\ln(SI)

III-B6 Gini index

The Gini index was originally proposed in the field of economics. It was later introduced into the field of fault diagnosis and has become a reliable measure in signal processing techniques. The value of the Gini index ranges from 0 to 1. The definition of Gini index is shown in Eq. (16).

GI=12n=1NXorder [n]X[n]L1(Nn+0.5N)GI=1-2\sum_{n=1}^{N}\frac{X^{\text{order }}[n]}{\|X[n]\|_{L1}}\left(\frac{N-n+0.5}{N}\right) (16)

Gini index can be reformulated as the ratio of two different MPMFs as follows:

GI\displaystyle GI =12n=1N((Nn+12N)×SEl,horder [n]SEl,h[n]L1)\displaystyle=1-2\sum_{n=1}^{N}\left(\left(\frac{N-n+\frac{1}{2}}{N}\right)\times\frac{SE_{l,h}^{\text{order }}[n]}{\left\|SE_{l,h}[n]\right\|_{L1}}\right) (17)
=12n=1N((Nn+12N2)×Xl,horder [n]n=1NXl,h[n]/N)\displaystyle=1-2\sum_{n=1}^{N}\left(\left(\frac{N-n+\frac{1}{2}}{N^{2}}\right)\times\frac{X_{l,h}^{\text{order }}[n]}{\sum_{n=1}^{N}X_{l,h}[n]/N}\right)
=1n=1N(2(Nn+12N2)×Xl,horder [n])n=1NXl,h[n]/N\displaystyle=1-\frac{\sum_{n=1}^{N}\left(2\left(\frac{N-n+\frac{1}{2}}{N^{2}}\right)\times X_{l,h}^{\text{order }}[n]\right)}{\sum_{n=1}^{N}X_{l,h}[n]/N}
=1Γ(2(Nn+12N)×𝐗order ,1)Γ(𝐗,1)\displaystyle=1-\frac{\Gamma\left(2\left(\frac{N-n+\frac{1}{2}}{N}\right)\times\mathbf{X}_{\text{order }},1\right)}{\Gamma(\mathbf{X},1)}

where 2(Nn+12N)×𝐗order2\left(\frac{N-n+\frac{1}{2}}{N}\right)\times\mathbf{X}_{\text{order}} can be viewed as a weighted and ordered vector.

An important point needs to be emphasized additionally. Hou et al.[24] expressed the classical sparsity measures as ratio of quasi-arithmetic means in their previous work. The multivariate power mean we mentioned can generally be expressed as a kind of generalized arithmetic mean. Through multivariate power mean function, the reformulation expression can be made more popular. At the same time, it is more convenient to reconstruct the spectral negative entropy and Gini index.

III-C Family Extension of Sparsity Measures

Inspired by reformulation of the six classical sparsity measures in Subection B, we propose a general paradigm based on ratios of multivariate power mean functions for designing more advanced health indexes for machine health monitoring and degradation state assessment.

Complex health indexes designed by combining ratios of the sum of multiple MPMFs:

PHI=λm=1MpmΓ(𝐱,am)n=1NqnΓ(𝐱,bm)+cPHI=\lambda\frac{\sum_{m=1}^{M}p_{m}\Gamma\left(\mathbf{x},a_{m}\right)}{\sum_{n=1}^{N}q_{n}\Gamma\left(\mathbf{x},b_{m}\right)}+c (18)

Complex health indexes designed by combining ratios of the product of multiple MPMFs:

MHI=λm=1MpmΓ(𝐱,am)n=1NqnΓ(𝐱,bm)+cMHI=\lambda\frac{\prod_{m=1}^{M}p_{m}\Gamma\left(\mathbf{x},a_{m}\right)}{\prod_{n=1}^{N}q_{n}\Gamma\left(\mathbf{x},b_{m}\right)}+c (19)

where pmp_{m} and qnq_{n} represent different weights of the power average function respectively, and satisfy m=1Mpm=1,n=1Nqn=1\sum_{m=1}^{M}p_{m}=1,\sum_{n=1}^{N}q_{n}=1, λ\lambda is a tradeoff parameter, and cc is an offset coefficient providing a baseline for indexes. From previous studies, it is clear that Γ(α𝐱,y)=αΓ(𝐱,y)\Gamma(\alpha\mathbf{x},y)=\alpha\Gamma(\mathbf{x},y) and Γ(𝐱𝐱𝐱,y)=Γ(𝐱,y)\Gamma(\mathbf{x}\|\mathbf{x}\|\cdots\|\mathbf{x},y)=\Gamma(\mathbf{x},y) , so PHIPHI and MHIMHI satisfy at least two sparse attributes: D2 and D4.

According to Eq. (18) and Eq. (19), we can give a general paradigm as follows:

GHI=jPHI+jMHIGHI=\sum_{j}PHI+\sum_{j}MHI (20)

The specific expressions for the four newly designed HIs as examples are as follows:

HI1=1Γ(𝐗+τ,2)+Γ(𝐗+τ,1)Γ(𝐗+τ,1)+Γ(𝐗+τ,1)\displaystyle HI_{1}=1-\frac{\Gamma(\mathbf{X}+\tau,-2)+\Gamma(\mathbf{X}+\tau,-1)}{\Gamma(\mathbf{X}+\tau,-1)+\Gamma(\mathbf{X}+\tau,1)} (21)
HI2=1Γ(𝐗+τ,2)+Γ(𝐗+τ,2)eΓ(ln(𝐗+τ),1)+Γ(𝐗+τ,1)HI_{2}=1-\frac{\Gamma(\mathbf{X}+\tau,-2)+\Gamma(\mathbf{X}+\tau,-2)}{e^{\Gamma(\ln(\mathbf{X}+\tau),1)}+\Gamma(\mathbf{X}+\tau,-1)} (22)
HI3=1Γ(𝐗+τ,2)Γ(𝐗+τ,2)eΓ(ln(𝐗+τ),1)Γ(𝐗+τ,1)HI_{3}=1-\frac{\Gamma(\mathbf{X}+\tau,-2)\Gamma(\mathbf{X}+\tau,-2)}{e^{\Gamma(\ln(\mathbf{X}+\tau),1)}\Gamma(\mathbf{X}+\tau,-1)} (23)
HI4=1eΓ(ln(𝐗+τ),1)Γ(𝐗+τ,2)Γ(𝐗+τ,1)Γ(𝐗+τ,1)HI_{4}=1-\frac{e^{\Gamma(\ln(\mathbf{X}+\tau),1)}\Gamma(\mathbf{X}+\tau,-2)}{\Gamma(\mathbf{X}+\tau,1)\Gamma(\mathbf{X}+\tau,-1)} (24)

where τ\tau represents slip coefficient which is related to shifted percentage [25]. In our case, τ\tau is set to 0.065. At the same time, an extra property of the newly designed HIs need to be emphasized. According to multivariate power mean inequality, HI1HI_{1}-HI4HI_{4} have strict upper and lower bounds [0,1)[0,1). Therefore, it is helpful to set a reasonable and effective failure threshold for machine health monitoring under different operating conditions.

IV Validation And Analysis Based On Two Run-To-Failure Bearing Datasets

IV-A Case Study I: IMS Bearing Experimental Dataset

The accelerated bearing degradation experimental dataset provided by the NSF I/UCR Center for Intelligent Maintenance Systems was firstly used to verify the effectiveness and superiority of the designed health indexes. The experimental platform is shown in Fig. 1 with four bearings installed on one shaft. The sampling frequency is set at 20KHz, and 20,480 points are collected each time with an interval of 1 second. Set No.2 collected a total of 984 files in Channel 1. The bearing finally produced an outer ring failure, and the outer ring failure frequency could be calculated as 236.4Hz according to the theoretical formula of failure frequency.

The bearing degradation curves are plotted by classic spectral kurtosis, spectral smoothness index, spectral negative entropy and Gini index as shown in Fig. 2, where the real values of spectral smoothness index are multiplied by -1 to achieve an increasing trend. From the figures, it can be seen that these classical sparsity measures can reflect the bearing degradation trend to a certain extent and help in incipient fault detection, but they cannot characterize the bearing degradation monotonically and the degradation curve is highly fluctuant. Our designed index degradation curves are shown in Fig. 3. As can be seen from the figures, HI1HI_{1}-HI4HI_{4} have good monotonic degradation trends with less fluctuation. Enlarged pictures around FFOT at file No.460 to No.580 quantified by designed indexes are respectively plotted tt the top left of each figure. It can be easily concluded that HI1HI_{1}-HI4HI_{4} can detect the first failure occurrence time (FFOT) more precisely. In the following subsections, we will further evaluate the performance of the designed health metrics in three aspects.

Refer to caption


Figure 1: The bearing experimental platform of IMS.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 2: Degradation curves plotted by classic sparsity measures: (a) spectral kurtosis; (b) spectral smoothness index; (c) spectral negative entropy; (d) Gini index.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 3: Bearing degradation curves plotted by (a) Index1; (b) Index2; (c) Index3; (d) Index4.

IV-A1 First Fault Occurrence Time (FFOT) Determination and Threshold Analysis

Hou et al. [25] analyzed the healthy state of the first to 300-th samples of the dataset and used the Lilliefors test to ensure that the historical health data followed the Gaussian distribution in their previous work. Therefore, 3σ3\sigma rule can be used to establish statistical thresholds for the detection of the first failure occurrence time and to determine upper and lower thresholds for the division of health and fault states. As marked by the red dotted lines and magenta dotted circles in Fig. 4, our proposed improved indexes with their associate thresholds can effectively detecting FFOT at file No.533. Based on the above analysis, some conclusions can be drawn. The designed HIs show a better monotonic degradation trend than the original measures with a more pronounced detection of FFOT and effectively suppress impulsive noise. This is mainly reflected in two aspects, firstly, the degradation curve is smoother, and secondly, the detection of fault-induced repetitive transients is more sensitive.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 4: First fault occurrence time determination and threshold analysis by three-sigma rule: (a) Index1; (b) Index2; (c) Index3; (d) Index4.

IV-A2 Incipient Fault Diagnosis Around FFOT

To verify correctness of indication FFOT based on designed HIs, the bearing fault signal at file No.534 was selected for incipient fault diagnosis. Fast Kurtogram method can help us to determine the optimal filter band center and band frequency, and then perform resonant envelope demodulation to extract the early fault characteristics of the bearing. The optimal filter band results are shown in Fig. 5, and the envelope demodulation results are shown in Fig. 6. The final envelope demodulation results can clearly find the bearing outer ring fault frequency of 263.4Hz and its multiples, indicating that the bearing enters the fault state from the healthy state.

Refer to caption

Figure 5: The optimal filter band obtained by Fast Kurtogram.

Refer to caption

Figure 6: The analysis results by envelope demodulation: (a)Original signal; (b)Envelope of filtered signal; (c)Amplitude spectrum of squared envelope.

IV-A3 Isolated-Forest Based Assessment of Degradation Stages

In this subsection, we combine the unsupervised isolated forest algorithm with newly designed HIs for the assessment of different degradation states of rolling bearings. Isolated forest algorithm was proposed by Liu et al. [26] for anomaly detection. Isolated forest is suitable for anomaly detection of continuous numerical data, defining anomalies as ”outliers that are more likely to be separated”. We consider that the degradation state of a machine changes when there are multiple consecutive outliers within a certain time interval. Specific hyperparameter settings of isolated forest algorithm are shown in Table II. The analysis results of four HIs are respectively shown in Fig. 7.

TABLE II: Specific Hyperparameter Settings of Isolated Forest Algorithm.
Hyperparameter Value
Maximum upper bound of outlier 10
The number of base estimators: n_estimarors 256
The number of samples: max_sample 0.5
The number of features: max_features 1.0
Controls the pseudo-randomness: random_state 42
The number of jobs: n_jobs -1
Controls the verbosity: verbose 0
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 7: Results of degradation stages assessment by Isolated-Forest algorithm applied on different HIs: (a) Index1; (b) Index2; (c) Index3; (d) Index4.

From the results of isolated forest analysis, it can be seen that five different degradation states were identified in the whole process of bearing degradation. The first state change occurs at file No.533 when the health state changes to the incipient fault state, which is consistent with the results analyzed in the previous subsections. A large number of continuous outliers began to appear after file No.892 so that it can be considered that the bearing entered a complete failure state starting from file No.892. This conclusion is basically consistent with that analyzed by RMS metric. Recognizing machine degradation state from the perspective of outliers is a new attempt in this paper. It is helpful to study the degradation mode of similar machines under the same operating condition and to further study the prediction of remaining useful life. However, how to further quantitatively evaluate the degree of machine degradation in different degradation states is a direction of future research, which will not be discussed in this paper.

IV-B Case Study II: XJTU-SY Run-To-Failure Bearing Dataset

To further validate the effectiveness and generalization of our proposed framework, this section is validated on the XJTU-SY run-to-failure bearing dataset. The bearing testbed is shown in Fig. 8. The sampling frequency is set at 25.6 kHz, and 32768 points (1.28 seconds) are collected each time with an interval of 1 minute. Details are described in [27].

Refer to caption

Figure 8: The tested bearing platform of the XJTU-SY bearing dataset.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 9: Bearing degradation curves plotted by classic sparsity measures: (a) spectral kurtosis; (b) spectral smoothness index; (c) spectral negative entropy; (d) Gini index.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 10: Bearing degradation curves plotted by: (a) Index1 ; (b) Index2; (c) Index3; (d) Index4.

The curves of bearing degradation quantified by classic spectral kurtosis, spectral smoothness index, spectral negative entropy and Gini index are plotted in Fig. 9. Meanwhile, the curves of bearing degradation quantified by HI1HI_{1}-HI4HI_{4} are plotted in Fig 10. From the above figures, it can be concluded that the degradation trends quantified by four classic sparsity measures are poor and can only roughly identify the moment of incipient failure of bearings. The degradation curves quantified by HI1HI_{1}-HI4HI_{4} can show good degradation trends and own the ability to identify incipient and severe bearing faults. In terms of fluctuation, except for degradation curves quantified by HI3HI_{3}, the vast majority of HIs show less fluctuant. The validity of the proposed indexes and generalization of the framework based on ratios of multivariate power mean functions are further verified by the XJTU-SY run-to-failure bearing dataset. Based on the general paradigm, more reasonable and advanced health indexes can be easily designed, and the specific index design formulas can be determined depending on the specific conditions of industrial sites.

V Conclusion

In this paper, we have studied the sparse attributes of smoothness index and negative entropy and given detailed proofs. To develop a general method for construction of health indexes with monotonicity that enables incipient fault detection, multivariate power mean functions (MPMFs) were introduced and investigated. Then, it is found that the existing classical sparsity measures can be reformulated as ratios of different MPMFs, including spectral kurtosis, spectral smoothness index, spectral negative entropy and Gini index. Finally, the general health index construction paradigm based on ratios of MPMFs are proposed for family expansion of sparsity measures. Four specific health indexes are designed as examples, and their sparse attributes and other properties are analyzed in detail. Their ability to monitor health of machines has been verified through two run-to-life bearings datasets.

Appendix A

A-A Proofs Of Smoothness Index

A-A1 SI and Nonnegativity

Theorem A.1: SI is non-negative.

SI(x)0SI(\overset{\rightarrow}{x})\geq 0 (25)

Proof: As each element in vector 𝐱=[x1,x2,,xN]\mathbf{x}=[x_{1},x_{2},\dots,x_{N}] is non-negative. Clearly, SI is non-negative.

A-A2 SI and D1

Theorem A.2: SI does not satisfy D1.

SI([x1xiαxj+α])<SI(x)SI\left(\left[x_{1}\dots x_{i}-\alpha\dots x_{j}+\alpha\dots\right]\right)<SI(\overset{\rightarrow}{x})

for all α\alpha, xix_{i}, xjx_{j} such that xi>xjx_{i}>x_{j} and 0<α<xixj20<\alpha<\frac{x_{i}-x_{j}}{2}.

Proof: As SI=i=1NxiNi=1Nxi/NSI=\frac{\sqrt[N]{\prod_{i=1}^{N}x_{i}}}{\sum_{i=1}^{N}x_{i}/N}we can restate the above as

ki,jNxk(xiα)(xj+α)((k=1Nxk)+αα)/N<i=1Nxkk=1Nxk/N\frac{\sqrt{\prod_{k\neq i,j}^{N}x_{k}\left(x_{i}-\alpha\right)\left(x_{j}+\alpha\right)}}{\left(\left(\sum_{k=1}^{N}x_{k}\right)+\alpha-\alpha\right)/N}<\frac{\sqrt{\prod_{i=1}^{N}x_{k}}}{\sum_{k=1}^{N}x_{k}/N}

This simplifies to

ki,jNxk(xiα)(xj+α)<i=1Nxk\sqrt{\prod_{k\neq i,j}^{N}x_{k}\left(x_{i}-\alpha\right)\left(x_{j}+\alpha\right)}<\sqrt{\prod_{i=1}^{N}x_{k}}
x1x2(xiα)(xj+α)xN<x1x2xNx_{1}x_{2}\cdots\left(x_{i}-\alpha\right)\cdots\left(x_{j}+\alpha\right)\cdots x_{N}<x_{1}x_{2}\cdots x_{N}
(xiα)(xj+α)<xixj\left(x_{i}-\alpha\right)\left(x_{j}+\alpha\right)<x_{i}x_{j}
xixjα<0x_{i}-x_{j}-\alpha<0

which we konw is not true as xi>xjx_{i}>x_{j} and 0<α<xixj20<\alpha<\frac{x_{i}-x_{j}}{2}.

A-A3 SI and D2

Theorem A.3: SI satisfies D2.

SI(αx)=SI(x),α,α>0SI(\alpha\overset{\rightarrow}{x})=SI(\overset{\rightarrow}{x}),\forall\alpha\in\mathbb{R},\alpha>0

Proof:

SI(αx)=i=1NαxiNi=1Nαxi/N=αi=1Nxiαi=1Nxi/N=SI(x)SI(\alpha\vec{x})=\frac{\sqrt[N]{\prod_{i=1}^{N}\alpha x_{i}}}{\sum_{i=1}^{N}\alpha x_{i}/N}=\frac{\alpha\sqrt{\prod_{i=1}^{N}x_{i}}}{\alpha\sum_{i=1}^{N}x_{i}/N}=SI(\vec{x})

A-A4 SI and D3

Theorem A.4: SI does not satisfy D3.

SI(α+x)<SI(x),α,α>0SI(\alpha+\overset{\rightarrow}{x})<SI(\overset{\rightarrow}{x}),\forall\alpha\in\mathbb{R},\alpha>0

Proof: Set

F(α)=i=1N(xi+α)N(i=1Nxi/N)+α=A(α)NB(α)F(\alpha)=\frac{\sqrt[N]{\prod_{i=1}^{N}\left(x_{i}+\alpha\right)}}{\left(\sum_{i=1}^{N}x_{i}/N\right)+\alpha}=\frac{\sqrt[N]{A(\alpha)}}{B(\alpha)}
ln[A(α)]=ln[(α+x1)(α+x2)(α+xN)]\ln[A(\alpha)]=\ln\left[\left(\alpha+x_{1}\right)\left(\alpha+x_{2}\right)\cdots\left(\alpha+x_{N}\right)\right]

Derivation on both sides of the equation

A(α)A(α)=i=1N1α+xi\frac{A^{\prime}(\alpha)}{A(\alpha)}=\sum_{i=1}^{N}\frac{1}{\alpha+x_{i}}
A(α)=A(α)i=1N1α+xiA^{\prime}(\alpha)=A(\alpha)\sum_{i=1}^{N}\frac{1}{\alpha+x_{i}}

It follows that

Fα=1NA(α)1N1A(α)B2(α)=1NA(α)1Ni=1N1α+xiA(α)1NB2(α)\frac{\partial F}{\partial\alpha}=\frac{\frac{1}{N}A(\alpha)^{\frac{1}{N}^{-1}}A^{\prime}(\alpha)}{B^{2}(\alpha)}=\frac{\frac{1}{N}A(\alpha)^{\frac{1}{N}}\sum_{i=1}^{N}\frac{1}{\alpha+x_{i}}-A(\alpha)^{\frac{1}{N}}}{B^{2}(\alpha)}

We can ignore the denominator as it is clearly positive.It is clear that Fα\frac{\partial F}{\partial\alpha} could be positive. Hence SI does not satisfy D3.

A-A5 SI and D4

Theorem A.5: SI satisfies D4.

SI(x)=SI(xx)=SI(xxx)=SI(xxx)SI(\vec{x})=SI(\vec{x}\|\vec{x})=SI(\vec{x}\|\vec{x}\|\vec{x})=SI(\vec{x}\|\vec{x}\|\cdots\|\vec{x})

Proof:

SI(xxxm)=i=1NximmN(mi=1Nxi)/mN=i=1NxiNi=1Nxi/N=SI(x)\operatorname{SI}(\underbrace{\vec{x}\|\vec{x}\cdots\|\vec{x}}_{m})=\frac{\sqrt[mN]{\prod_{i=1}^{N}x_{i}^{m}}}{\left(m\sum_{i=1}^{N}x_{i}\right)/mN}=\frac{\sqrt[N]{\prod_{i=1}^{N}x_{i}}}{\sum_{i=1}^{N}x_{i}/N}=\operatorname{SI}(\vec{x})

A-A6 SI and P1

Theorem A.6: SI does not satisfy P1.

i\forall i, β=βi>0\exists\beta=\beta_{i}>0, such that α>0\forall\alpha>0

SI([x1xi+β+α])>SI([x1xi+β])SI\left(\left[x_{1}\dots x_{i}+\beta+\alpha\dots\right]\right)>SI\left(\left[x_{1}\dots x_{i}+\beta\dots\right]\right)

Proof: Set

F(β)=xi+βNi=1Nxi+βF(\beta)=\frac{\sqrt[N]{x_{i}+\beta}}{\sum_{i=1}^{N}x_{i}+\beta}

It follows that

Fβ\displaystyle\frac{\partial F}{\partial\beta} =[1N(xi+β)1N1](i=1Nxi+β)(xi+β)1N(i=1Nxi+β)2\displaystyle=\frac{\left[\frac{1}{N}\left(x_{i}+\beta\right)^{\frac{1}{N}-1}\right]\left(\sum_{i=1}^{N}x_{i}+\beta\right)-\left(x_{i}+\beta\right)^{\frac{1}{N}}}{\left(\sum_{i=1}^{N}x_{i}+\beta\right)^{2}}
=[1N(xi+β)1N1][i=1Nxi+βN(xi+β)](i=1Nxi+β)2\displaystyle=\frac{\left[\frac{1}{N}\left(x_{i}+\beta\right)^{\frac{1}{N}-1}\right]\left[\sum_{i=1}^{N}x_{i}+\beta-N\left(x_{i}+\beta\right)\right]}{\left(\sum_{i=1}^{N}x_{i}+\beta\right)^{2}}
=\displaystyle= [1N(xi+β)1N1](i=1Nxi+β)2[(1N)β+i=1NxiNxi]\displaystyle\frac{\left[\frac{1}{N}\left(x_{i}+\beta\right)^{\frac{1}{N}-1}\right]}{\left(\sum_{i=1}^{N}x_{i}+\beta\right)^{2}}\left[(1-N)\beta+\sum_{i=1}^{N}x_{i}-Nx_{i}\right]

Clearly, for i\forall i, Fβ\frac{\partial F}{\partial\beta} could be negative. So SI does not satisfy P1.

A-A7 SI and P2

Theorem A.7: SI does not satisfy P2.

SI(x0)>SI(x)SI(\vec{x}\|0)>SI(\vec{x})

Proof:

SI(x0)=0SI(x)SI(\vec{x}\|0)=0\leq SI(\vec{x})

A-B Proofs Of Spectral Negative Entropy

A-B1 SNE and Nonnegativity

Theorem B.1: SNE is non-negative.

SNE(x)0SNE(\overset{\rightarrow}{x})\geq 0

Proof: According to Gibbs’ inequality, we can know that if i=1npi=i=1nqi=1,pi,qi(0,1]\sum_{i=1}^{n}p_{i}=\sum_{i=1}^{n}q_{i}=1,p_{i},q_{i}\in(0,1], then i=1npilogpii=1npilogqi-\sum_{i=1}^{n}p_{i}\log p_{i}\leq-\sum_{i=1}^{n}p_{i}\log q_{i}. Suppose x¯=i=1Nxi/N\bar{x}=\sum_{i=1}^{N}x_{i}/N, pi=xiNx¯p_{i}=\frac{x_{i}}{N\bar{x}}, qi=1Nq_{i}=\frac{1}{N}. It follows that

SNE(x)\displaystyle SNE(\vec{x}) =1Ni=1Nxix¯lnxix¯=i=1NxiNx¯lnxiNx¯+lnN\displaystyle=\frac{1}{N}\sum_{i=1}^{N}\frac{x_{i}}{\bar{x}}\ln\frac{x_{i}}{\bar{x}}=\sum_{i=1}^{N}\frac{x_{i}}{N\bar{x}}\ln\frac{x_{i}}{N\bar{x}}+\ln N
ln1Ni=1NxiNx¯+lnN=0\displaystyle\geq\ln\frac{1}{N}\sum_{i=1}^{N}\frac{x_{i}}{N\bar{x}}+\ln N=0

Clearly, SNE is non-negative.

A-B2 SNE and D1

Theorem B.2: SNE satisfies D1.

SNE([x1xiαxj+α])<SNE(x)SNE\left(\left[x_{1}\dots x_{i}-\alpha\dots x_{j}+\alpha\dots\right]\right)<SNE(\overset{\rightarrow}{x})

for all α\alpha, xix_{i}, xjx_{j} such that xi>xjx_{i}>x_{j} and 0<α<xixj20<\alpha<\frac{x_{i}-x_{j}}{2}.

Proof:

SNE([x1xiαxj+α])SNE(x)SNE\left(\left[x_{1}\cdots x_{i}-\alpha\cdots x_{j}+\alpha\cdots\right]\right)-SNE(\vec{x})
=xiαAlnxiαA+xj+αAlnxj+αAxiAlnxiAxjAlnxjA=\frac{x_{i}-\alpha}{A}\ln\frac{x_{i}-\alpha}{A}+\frac{x_{j}+\alpha}{A}\ln\frac{x_{j}+\alpha}{A}-\frac{x_{i}}{A}\ln\frac{x_{i}}{A}-\frac{x_{j}}{A}\ln\frac{x_{j}}{A}

Converted to Function F(x)=xlnxF(x)=xlnx, in any interval [xj,xi][x_{j},x_{i}], the sum of the function values at the endpoints of the interval must be greater than the sum of the function values of two points in any interval [xj+a,xia][x_{j}+a,x_{i}-a], which is symmetric about the midpoint of interval [xj,xi][x_{j},x_{i}]. Clearly, F′′(x)>0F^{\prime\prime}(x)>0, and F(x)F(x) is a lower convex function. It follows that in any interval [xj,xi][x_{j},x_{i}]

F(xi)+F(xj)>F(xiα)+F(xj+α)F\left(x_{i}\right)+F\left(x_{j}\right)>F\left(x_{i}-\alpha\right)+F\left(x_{j}+\alpha\right)

where xi>xjx_{i}>x_{j} and 0<α<xixj20<\alpha<\frac{x_{i}-x_{j}}{2}. Define an arbitrary interval [xj,xi][x_{j},x_{i}] of length 2d2d, d>0d>0.

Set

H(d)=F(xd)+F(x+d)H(d)=F(x-d)+F(x+d)
Hd=ln(x+d)+1ln(xd)+(1)=lnx+dxd>0\frac{\partial H}{\partial d}=\ln(x+d)+1-\ln(x-d)+(-1)=\ln\frac{x+d}{x-d}>0

Hence, the function H(d)H(d) increases strictly monotonically. To sum up, we claim that SNE satisfies D1. In addition, Lagrange mean value theorem can also be used to prove the conclusion.

A-B3 SNE and D2

Theorem B.3: SNE satisfies D2.

SNE(αx)=SNE(x),α,α>0SNE(\alpha\overset{\rightarrow}{x})=SNE(\overset{\rightarrow}{x}),\forall\alpha\in\mathbb{R},\alpha>0

Proof: Suppose xN=A\underbrace{\langle x\rangle}_{N}=A, so αxN=αA\underbrace{\langle\alpha x\rangle}_{N}=\alpha A
It follows that

SNE(αx)=i=1N(αxiαAlnαxiαA)N=SNE(x)\operatorname{SNE}(\alpha\vec{x})=\frac{\sum_{i=1}^{N}\left(\frac{\alpha x_{i}}{\alpha A}\ln\frac{\alpha x_{i}}{\alpha A}\right)}{N}=\operatorname{SNE}(\vec{x})

A-B4 SNE and D3

Theorem B.4: SNE satisfies D3.

SNE(α+x)<SNE(x),α,α>0SNE(\alpha+\overset{\rightarrow}{x})<SNE(\overset{\rightarrow}{x}),\forall\alpha\in\mathbb{R},\alpha>0

Proof: Set

F(α)=i=1Nxi+αx+αlnxi+αx+αF(\alpha)=\sum_{i=1}^{N}\frac{x_{i}+\alpha}{\langle x\rangle+\alpha}\ln\frac{x_{i}+\alpha}{\langle x\rangle+\alpha}

It follows that

Fα=i=1Nx¯xi((x¯)+α)2lnxi+αx¯+α+i=1Nx¯xi((x¯)+α)2\frac{\partial F}{\partial\alpha}=\sum_{i=1}^{N}\frac{\bar{x}-x_{i}}{((\bar{x})+\alpha)^{2}}\ln\frac{x_{i}+\alpha}{\bar{x}+\alpha}+\sum_{i=1}^{N}\frac{\bar{x}-x_{i}}{((\bar{x})+\alpha)^{2}}

Clearly, i=1Nx¯xi((x¯)+α)2=0\sum_{i=1}^{N}\frac{\bar{x}-x_{i}}{((\bar{x})+\alpha)^{2}}=0; when x¯xi\bar{x}\leq{x_{i}}, x¯xi((x¯)+α)2lnxi+αx¯+α0\frac{\bar{x}-x_{i}}{((\bar{x})+\alpha)^{2}}\ln\frac{x_{i}+\alpha}{\bar{x}+\alpha}\leq{0}; and when x¯>xi\bar{x}>{x_{i}}, x¯xi((x¯)+α)2lnxi+αx¯+α<0\frac{\bar{x}-x_{i}}{((\bar{x})+\alpha)^{2}}\ln\frac{x_{i}+\alpha}{\bar{x}+\alpha}<{0}. Hence Fα<0\frac{\partial F}{\partial\alpha}<{0}, F(α)F(\alpha) is strictly monotonically decreasing. We claim that SNE satisfies D3.

A-B5 SNE and D4

Theorem B.5: SNE satisfies D4.

SNE(x)=SNE(xx)=SNE(xxx)=SNE(xxx)SNE(\vec{x})=SNE(\vec{x}\|\vec{x})=SNE(\vec{x}\|\vec{x}\|\vec{x})=SNE(\vec{x}\|\vec{x}\|\cdots\|\vec{x})

Proof:

xxxm=xmN=xN=A\underbrace{\vec{x}\|\vec{x}\cdots\|\vec{x}}_{m}=\underbrace{\langle x\rangle}_{mN}=\underbrace{\langle x\rangle}_{N}=A
SNE(xxxm)=i=1N(mxixmNlnmxixmN)mN=SNE(x).SNE(\underbrace{\vec{x}\|\vec{x}\cdots\|\vec{x}}_{m})=\frac{\sum_{i=1}^{N}\left(\frac{mx_{i}}{\underbrace{\langle x\rangle}_{mN}}\ln\frac{mx_{i}}{\underbrace{\langle x\rangle}_{mN}}\right)}{mN}=SNE(\vec{x}).

A-B6 SNE and P1

Theorem B.6: SNE does not satisfy P1.

i\forall i, β=βi>0\exists\beta=\beta_{i}>0, such that α>0\forall\alpha>0

SNE([x1xi+β+α])>SNE([x1xi+β])SNE\left(\left[x_{1}\dots x_{i}+\beta+\alpha\dots\right]\right)>SNE\left(\left[x_{1}\dots x_{i}+\beta\dots\right]\right)

Proof: Suppose N=2N=2, x=[x1,x2]\vec{x}=\left[x_{1},x_{2}\right]. We can set

F(β)=x1x+β/2lnx1x+β/2+x2+βx+β/2lnx2+βx+β/2F(\beta)=\frac{x_{1}}{\langle x\rangle+\beta/2}\ln\frac{x_{1}}{\langle x\rangle+\beta/2}+\frac{x_{2}+\beta}{\langle x\rangle+\beta/2}\ln\frac{x_{2}+\beta}{\langle x\rangle+\beta/2}

It follows that

Fβ\displaystyle\frac{\partial F}{\partial\beta} =2x1(2x¯+β)2ln2x12x¯+β2x1(2x¯+β)2\displaystyle=-\frac{2x_{1}}{(2\bar{x}+\beta)^{2}}\ln\frac{2x_{1}}{2\bar{x}+\beta}-\frac{2x_{1}}{(2\bar{x}+\beta)^{2}}
+4x¯2x2(2x¯+β)2ln2x2+2β2x¯+β+4x¯2x2(2x¯+β)2\displaystyle+\frac{4\bar{x}-2x_{2}}{(2\bar{x}+\beta)^{2}}\ln\frac{2x_{2}+2\beta}{2\bar{x}+\beta}+\frac{4\bar{x}-2x_{2}}{(2\bar{x}+\beta)^{2}}
=4x¯2x2(2x¯+β)2ln2x2+2β2x¯+β2x1(2x¯+β)2ln2x12x¯+β\displaystyle=\frac{4\bar{x}-2x_{2}}{(2\bar{x}+\beta)^{2}}\ln\frac{2x_{2}+2\beta}{2\bar{x}+\beta}-\frac{2x_{1}}{(2\bar{x}+\beta)^{2}}\ln\frac{2x_{1}}{2\bar{x}+\beta}
=2x1(2x¯+β)2lnx2+βx1\displaystyle=\frac{2x_{1}}{(2\bar{x}+\beta)^{2}}\ln\frac{x_{2}+\beta}{x_{1}}

For i\forall i, Fβ\frac{\partial F}{\partial\beta} could be negative. Hence SNE does not satisfy P1.

A-B7 SNE and P2

Theorem B.7: SNE satisfies P2.

SNE(x0)>SNE(x)SNE(\vec{x}\|0)>SNE(\vec{x})

Proof:

x0N+1=NAN+1\underbrace{\langle x\|0\rangle}_{N+1}=\frac{NA}{N+1}
SNE(x0)\displaystyle SNE(\vec{x}\|0) =i=1N+1((N+1)xiNAln(N+1)xiNA)N+1\displaystyle=\frac{\sum_{i=1}^{N+1}\left(\frac{(N+1)x_{i}}{NA}\ln\frac{(N+1)x_{i}}{NA}\right)}{N+1}
=i=1N(xiAln(N+1)xiNA)N\displaystyle=\frac{\sum_{i=1}^{N}\left(\frac{x_{i}}{A}\ln\frac{(N+1)x_{i}}{NA}\right)}{N}

Clearly, xiAln(N+1)xiNA>xiAlnxiA\frac{x_{i}}{A}\ln\frac{(N+1)x_{i}}{NA}>\frac{x_{i}}{A}\ln\frac{x_{i}}{A}. Hence, SNE satisfies P2.

Acknowledgment

The authors would like to thank…

References

  • [1] Y. Lei, N. Li, L. Guo, N. Li, T. Yan, and J. Lin, “Machinery health prognostics: A systematic review from data acquisition to RUL prediction,” 2018.
  • [2] M. Ma and Z. Mao, “Deep-Convolution-Based LSTM Network for Remaining Useful Life Prediction,” IEEE Transactions on Industrial Informatics, 2021.
  • [3] J. Lee, F. Wu, W. Zhao, M. Ghaffari, L. Liao, and D. Siegel, “Prognostics and health management design for rotary machinery systems - Reviews, methodology and applications,” Mechanical Systems and Signal Processing, vol. 42, no. 1-2, pp. 314–334, 2014.
  • [4] X. Zhang, Y. Qin, C. Yuen, L. Jayasinghe, and X. Liu, “Time-Series Regeneration with Convolutional Recurrent Generative Adversarial Network for Remaining Useful Life Estimation,” IEEE Transactions on Industrial Informatics, 2021.
  • [5] J. Park, S. Kim, J. H. Choi, and S. H. Lee, “Frequency energy shift method for bearing fault prognosis using microphone sensor,” Mechanical Systems and Signal Processing, vol. 147, p. 107068, 2021. [Online]. Available: https://doi.org/10.1016/j.ymssp.2020.107068
  • [6] H. Miao, B. Li, C. Sun, and J. Liu, “Joint Learning of Degradation Assessment and RUL Prediction for Aeroengines via Dual-Task Deep LSTM Networks,” IEEE Transactions on Industrial Informatics, 2019.
  • [7] P. Baraldi, G. Bonfanti, and E. Zio, “Differential evolution-based multi-objective optimization for the definition of a health indicator for fault diagnostics and prognostics,” Mechanical Systems and Signal Processing, 2018.
  • [8] Y. Miao, M. Zhao, and J. Hua, “Research on sparsity indexes for fault diagnosis of rotating machinery,” Measurement: Journal of the International Measurement Confederation, 2020.
  • [9] B. Hou, D. Wang, T. Yan, Y. Wang, Z. Peng, and K. L. Tsui, “Gini Indices and : Two New Sparsity Measures and Their Applications to Machine Condition Monitoring,” IEEE/ASME Transactions on Mechatronics, 2021.
  • [10] C. Li, D. Cabrera, J. V. De Oliveira, R. V. Sanchez, M. Cerrada, and G. Zurita, “Extracting repetitive transients for rotating machinery diagnosis using multiscale clustered grey infogram,” Mechanical Systems and Signal Processing, 2016.
  • [11] R. F. Dwyer, “DETECTION OF NON-GAUSSIAN SIGNALS BY FREQUENCY DOMAIN KURTOSIS ESTIMATION.” in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1983.
  • [12] J. Antoni and R. B. Randall, “The spectral kurtosis: Application to the vibratory surveillance and diagnostics of rotating machines,” Mechanical Systems and Signal Processing, vol. 20, no. 2, pp. 308–331, 2006.
  • [13] J. Antoni, “The spectral kurtosis: A useful tool for characterising non-stationary signals,” Mechanical Systems and Signal Processing, vol. 20, no. 2, pp. 282–307, 2006.
  • [14] Y. Wang, J. Xiang, R. Markert, and M. Liang, “Spectral kurtosis for fault detection, diagnosis and prognostics of rotating machines: A review with applications,” 2016.
  • [15] J. Zhong, D. Wang, J. Guo, D. Cabrera, and C. Li, “Theoretical Investigations on Kurtosis and Entropy and Their Improvements for System Health Monitoring,” IEEE Transactions on Instrumentation and Measurement, 2021.
  • [16] D. Wang, “Some further thoughts about spectral kurtosis, spectral L2/L1 norm, spectral smoothness index and spectral Gini index for characterizing repetitive transients,” Mechanical Systems and Signal Processing, 2018.
  • [17] H. Dalton, “The Measurement of the Inequality of Incomes,” The Economic Journal, 1920.
  • [18] N. Hurley and S. Rickard, “Comparing measures of sparsity,” IEEE Transactions on Information Theory, 2009.
  • [19] Y. Miao, J. Wang, B. Zhang, and H. Li, “Practical framework of Gini index in the application of machinery fault feature extraction,” Mechanical Systems and Signal Processing, 2022.
  • [20] J. Antoni, “The infogram: Entropic evidence of the signature of repetitive transients,” Mechanical Systems and Signal Processing, 2016.
  • [21] I. S. Bozchalooi and M. Liang, “A smoothness index-guided approach to wavelet parameter selection in signal de-noising and fault detection,” Journal of Sound and Vibration, 2007.
  • [22] Y. Miao, M. Zhao, and J. Lin, “Periodicity-Impulsiveness Spectrum Based on Singular Value Negentropy and Its Application for Identification of Optimal Frequency Band,” IEEE Transactions on Industrial Electronics, 2019.
  • [23] R. Dykstra and B. C. Arnold, “Majorization and the Lorenz Order: A Brief Introduction.” Journal of the American Statistical Association, 1988.
  • [24] B. Hou, D. Wang, T. Xia, Y. Wang, Y. Zhao, and K. L. Tsui, “Investigations on quasi-arithmetic means for machine condition monitoring,” Mechanical Systems and Signal Processing, vol. 151, p. 107451, 2021. [Online]. Available: https://doi.org/10.1016/j.ymssp.2020.107451
  • [25] B. Hou, D. Wang, Y. Wang, T. Yan, Z. Peng, and K. L. Tsui, “Adaptive Weighted Signal Preprocessing Technique for Machine Health Monitoring,” IEEE Transactions on Instrumentation and Measurement, vol. 70, 2021.
  • [26] F. T. Liu, K. M. Ting, and Z. H. Zhou, “Isolation-based anomaly detection,” ACM Transactions on Knowledge Discovery from Data, 2012.
  • [27] B. Wang, Y. Lei, N. Li, and N. Li, “A Hybrid Prognostics Approach for Estimating Remaining Useful Life of Rolling Element Bearings,” IEEE Transactions on Reliability, vol. 69, no. 1, pp. 401–412, mar 2020.