This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: Department of Aerospace Engineering, Indian Institute of Science, Bangalore
11email: [email protected]
22institutetext: School of Mechanical Engineering, Purdue University, IN 47907-2088, USA

Deep generative models for unsupervised delamination detection using guided waves

Mahindra Rautela 1122    Amin Maghareh Shirley Dyke 2222    S. Gopalakrishnan 11
Abstract

With the rising demands for robust structural health monitoring procedures for aerospace structures, the scope of intelligent algorithms and learning techniques is expanding. Supervised algorithms have shown promising results in the field provided a large, balanced, and labeled amount of data for training. For some applications like aerospace, the data collection process is cumbersome, time-taking, and costly. Also, generating possible damage scenarios in a laboratory setup is challenging because of the complexity of the damage initiation and failure mechanism. Besides this, the uncertainties of the real-time operation restrict the online prediction accuracy with supervised learning. In this paper, deep generative models are proposed for unsupervised delamination prediction as an anomaly detection problem. In this one-class-based model, the deep learning network is trained to learn the distribution of baseline signals. In the testing phase, damage signals and unseen baseline signals are fed into the trained network to predict the state of the structure, i.e., healthy or unhealthy (delamination). It is seen that the proposed method can successfully predict the delamination with high accuracy.

keywords:
Deep Learning, Unsupervised Learning, Generative Models, Structural Health Monitoring, Delamination Detection, Composite Structures

1 Introduction

Composite materials fail via complex mechanisms, and delaminations are the most common ones which need proper attention. Condition-based monitoring can eliminate costly and time-taking scheduled maintenance procedures [1]. Ultrasonic guided wave is one of the popular techniques to actively interrogate the composite structures to identify the delaminations [2]. Recently, data-driven supervised deep learning algorithms have shown promising results for damage identification using ultrasonic guided waves [3]. However, the well-labeled data-collection process is cumbersome, costly, and time-taking. It is difficult to obtain data for possible damage scenarios in some applications. Due to this, the datasets become unbalanced, which restricts the use of supervised deep learning algorithms. In such scenarios, damage detection can be posed as an unsupervised one-class anomaly detection problem. In this framework, the network is trained to learn the distribution of the pristine state of the structure. In testing phase, the out-of-distribution signals are classified as damages [4].

Deep generative models are one of the advance unsupervised learning techniques aimed at capturing the probabilistic governing distribution that generates a class of data to generate similar data [5]. Deep generative models can be divided into two main categories:

  1. 1.

    Cost-function based models like Variational Autoencoders (VAE) [6] and Generative Adversarial Networks (GAN) [7].

  2. 2.

    Energy-based models like Deep Boltzmann Machines [8] and Deep Belief Networks [9].

Out of different deep generative models and their variants, VAE and GAN are most popularly used for anomaly detection. In the context of one-class infrastructure health monitoring problems, limited research investigations are carried out in the literature. VAE is used for damage detection in bridges [10], rail squats localization [11] and damage detection in tunnels [12]. GAN is used for road damage detection [13], post-disaster building damage detection [14], bridges health monitoring [15], concrete cracking and spalling [16]. Most of the literature is inclined toward civil-related infrastructures with little to no investigations of aerospace structures. Besides this, vibration-based or vision-based sensing technologies are prominently used in the literature.

In this work, we have focused on the one-class version of VAE for delamination detection in aerospace composite panels. The ultrasonic guided waves are used to interrogate the panel using a sensor array. The raw time-series guided wave signals are converted into time-frequency-based higher-order representations for training purposes. The paper is presented as follows: Section-2 contains theoretical background on variational autoencoders, Section-3 consists of the description of the dataset and training procedures, Section-4 presented the results of delamination detection using VAE and the paper is concluded in Section-5.

2 Theoretical Background

2.1 Variational Autoencoders

A basic autoencoder has three components, i.e., encoder, latent space (or code/hidden representation), and decoder, as shown in Figure-1(a). An encoder compresses the input into latent space, and a decoder reconstructs backs the input from the latent space. During this process, the latent layer learns valuable features of the dataset.

Refer to caption
Refer to caption
Figure 1: General architecture of (a) Basic autoencoder (b) Variational autoencoder

Variational autoencoder (VAE) is similar to basic autoencoder, but it provides a probability distribution instead of a single value for each latent variable. The decoder is fed with a randomly generated vector from the latent state distribution. In addition to regularization, this helps VAE to enforce a continuous and smooth latent space representation [6]. Mathematically, p(x,z)p(x,z) needs to be calculated to infer the latent space using Bayesian inference, which is represented by Eq.1. The term p(x|z)p(z)𝑑z\int p(x|z)p(z)dz represents an intractable distribution, so Eq.1 cannot be solved directly [17].

p(z|x)=p(x|z)p(z)p(x|z)p(z)𝑑zp(z|x)=\frac{p(x|z)p(z)}{\int p(x|z)p(z)dz} (1)

However, variational inference is used to estimate the integral. For this, the p(z|x)p(z|x) distribution is approximated with q(z|x)q(z|x) in such a way that it can be a tractable distribution. To ensure both distributions are similar, Kullback-Leibler (KL) divergence is minimized, minDKL(q(z|x)||p(z|x))min\hskip 2.84526ptD_{KL}(q(z|x)||p(z|x)). This expression can be minimized by maximizing Eq(z|x)logp(x|z)DKL(q(z|x)||p(z))E_{q(z|x)}\log p(x|z)-D_{KL}(q(z|x)||p(z)) [18]. The first term is the reconstruction likelihood which is similar to reconstruction error between input and output and the second term imposes similarity between prior and posterior distribution for latent variables. The prior distribution is assumed to be standard Normal distribution N(0,I)N(0,I) and posterior is estimated of the form N(μϕ(x),σϕ2(x))N(\mu_{\phi}(x),\sigma^{2}_{\phi}(x)). To implement it in backpropagation based neural-network framework, reparameterization trick is performed [19]. The overall loss function is the summation of reconstruction error and the KL divergence.

2.2 One-class Variational Autoencoder

In this work, a one-class version of VAE is used for delamination detection. Here, the network is trained to learn the distribution of baseline signals. In the testing phase, unseen baseline and delamination signals are used to test the network. The proposed methodology belongs to a broader class of anomaly detection problems. The strategy for unsupervised delamination detection using VAE is shown in Fig. 2. In this strategy, the errors (mean squared reconstruction + KL divergence) are collected after the training process. Based on the distribution of errors, thresholds are selected to detect the delaminations. In this work, the 99th quartile and maximum value of errors are used as two different thresholds. Selection of an optimum threshold depends on uncertainties of real-time operation, and application-dependent safety-cost tradeoffs [4]. This investigation is not in the scope of the current work.

Refer to caption
Figure 2: Delamination detection using VAE and thresholds. If error is greater than the threshold, the signal is classified as anomaly (delamination).

3 Materials & Methods

3.1 Dataset

In this work, an experimental guided-wave benchmark dataset from the Open Guided Waves platform is used [20]. The experiments are performed on a 500 ×\times 500 ×\times 2 mm T700M21 carbon fiber reinforced polymer with a quasi-isotropic layup placed in a climatic chamber. Twelve piezoelectric transducers are arranged, giving rise to 66 signals per experiment. Twelve different frequencies ranging from 40 kHz to 260 kHz are used for excitation. Damages at 28 different positions are introduced using a reversible damage model. The wavefield analysis shows that the reversible damage model behaves like real delamination. The original dataset contains 47,520 baseline and 22,176 damage signals. 5000 baseline and damage signals are randomly picked for the training and testing purpose. This random selection also includes new samples picked every time the algorithm runs. In this work, the 1d raw guided-wave signals are converted into 2d time-frequency representations using continuous wavelet transformation (CWT) [4]. Mathematically, CWT of a function F(t)F(t) is represented by FWF^{W} [21], shown by Eq. 2.

FW(a,b)=+F(t)Φ(tba)𝑑tF^{W}(a,b)=\int_{-\infty}^{+\infty}F(t)\Phi\Bigg{(}\frac{t-b}{a}\Bigg{)}dt (2)

where, Φ\Phi(t) is a wavelet basis function, ’a’ and ’b’ defines the width and position in time of Φ(t)\Phi(t). Fig. 3 shows time-series signals from the dataset and it’s corresponding CWT representation.

Refer to caption
Refer to caption
Figure 3: Experimental dataset (a) Raw guided wave signals (Baseline and Damage), (b) CWT representation of the baseline signal.

3.2 Training procedure

The network is trained with the summation of mean squared reconstruction error and KL-divergence as the loss function (See Section-2.1). The Adam optimization scheme with a learning rate of 1e-3 and a batch size of 32. The network consists of three parts, encoder, decoder, and latent space. Both encoder and decoder consist of convolutional layers with a kernel size of 3 × 3, filters: 16, 32, 64, 128, 256, a stride length of 2, and a Leaky ReLU activation function. The dimension of the latent space is 2. The code files are developed in Python programming language, and the code package is made open-source on GitHub*.

4 Results & Discussions

After training the networks, the delamination detection strategy, as shown in Fig. 2 is implemented for testing. The errors are collected corresponding to the training samples. Two different thresholds are calculated, i.e., 99th quartile and maximum value of the error of the training samples. The trained network is fed with unseen healthy and delamination signals. If the error of the test sample exceeds the threshold, then it is classified as delamination. The results are plotted in Fig. 4. It is seen that threshold-2 (maximum error) gives better classification accuracy. One of the reasons for such promising results is due to the use of higher-order wavelet enhanced representations instead of raw guided-wave signals [4]. The latent space distribution (mean value) for all the healthy and unhealthy signals is shown in Fig. 5. It can be seen that both baseline and delamination signals are well separated in the latent space. It is also observed that there exists a significant separation between the overall error of healthy and unhealthy signals (Fig. 4). A similar distinction can be seen between healthy and unhealthy clusters in the latent space. (Fig. 5). For much-complicated problems and datasets, there may not be an evident separation between the distribution of baseline signals and anomalies [4]. In such cases, the selection of decision thresholds is decided based on the tradeoff between false positives rate (FPR) and false negatives rate (FNR). This tradeoff directly translates to the safety-cost tradeoff, which is application-specific. For safety-critical health monitoring applications like space habitats and nuclear power plants, reducing FPR is considered preferable to FNR.

Refer to caption
Figure 4: Delamination detection using VAE and thresholds. The confusion matrix shows the classification results of unseen healthy and unhealthy (delamination) signals.
Refer to caption
Figure 5: Two-dimensional latent space visualization of all healthy and unhealthy (delamination) signals.

5 Conclusion

In this paper, a one-class version of the variational autoencoder is proposed for unsupervised delamination detection. The raw guided wave signals are converted to higher-order representations using continuous wavelet transformation. The network is trained to learn the distribution of baseline signals. Once the network is trained, thresholds are calculated and used to detect delaminations as out-of-distribution signals. This anomaly detection data-driven framework is helpful for data-scarce applications where balanced and labeled data collection is infeasible, costly, or time-taking. It is seen that the network provides promising results on the open-source benchmark dataset. This work will be extended and validated on different datasets in our upcoming work. Besides this, one-class GAN is another direction for future research investigations. The comparison between VAE and GAN capabilities to learn the baseline distribution will be an important highlight of our future work.

References

  • [1] Rautela, M., Huber, A., Senthilnath, J., Gopalakrishnan, S.: Inverse characterization of composites using guided waves and convolutional neural networks with dual-branch feature fusion. Mechanics of Advanced Materials and Structures pp. 1–17 (2021)
  • [2] Mitra, M., Gopalakrishnan, S.: Guided wave based structural health monitoring: A review. Smart Materials and Structures 25(5), 053001 (2016)
  • [3] Rautela, M., Senthilnath, J., Moll, J., Gopalakrishnan, S.: Combined two-level damage identification strategy using ultrasonic guided waves and physical knowledge assisted machine learning. Ultrasonics 115, 106451 (2021)
  • [4] Rautela, M., Senthilnath, J., Monaco, E., Gopalakrishnan, S.: Delamination prediction in composite panels using unsupervised-feature learning methods with wavelet-enhanced guided wave representations. Composite Structures p. 115579 (2022)
  • [5] Oussidi, A., Elhassouny, A.: Deep generative models: Survey. In: 2018 International Conference on Intelligent Systems and Computer Vision (ISCV). pp. 1–8. IEEE (2018)
  • [6] Kingma, D.P., Welling, M.: Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013)
  • [7] Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. Advances in neural information processing systems 27 (2014)
  • [8] Hinton, G.E., Salakhutdinov, R.R.: A better way to pretrain deep boltzmann machines. Advances in Neural Information Processing Systems 25 (2012)
  • [9] Hinton, G.E., Osindero, S., Teh, Y.W.: A fast learning algorithm for deep belief nets. Neural computation 18(7), 1527–1554 (2006)
  • [10] Ma, X., Lin, Y., Nie, Z., Ma, H.: Structural damage identification based on unsupervised feature-extraction via variational auto-encoder. Measurement 160, 107811 (2020)
  • [11] Yuan, Z., Zhu, S., Chang, C., Yuan, X., Zhang, Q., Zhai, W.: An unsupervised method based on convolutional variational auto-encoder and anomaly detection algorithms for light rail squat localization. Construction and Building Materials 313, 125563 (2021)
  • [12] Zhang, Y., Xie, X., Li, H., Zhou, B.: An unsupervised tunnel damage identification method based on convolutional variational auto-encoder and wavelet packet analysis. Sensors 22(6), 2412 (2022)
  • [13] Maeda, H., Kashiyama, T., Sekimoto, Y., Seto, T., Omata, H.: Generative adversarial network for road damage detection. Computer-Aided Civil and Infrastructure Engineering 36(1), 47–60 (2021)
  • [14] Tilon, S., Nex, F., Kerle, N., Vosselman, G.: Post-disaster building damage detection from earth observation imagery using unsupervised and transferable anomaly detecting generative adversarial networks. Remote sensing 12(24), 4193 (2020)
  • [15] Jiang, H., Wan, C., Yang, K., Ding, Y., Xue, S.: Continuous missing data imputation with incomplete dataset by generative adversarial networks–based unsupervised learning for long-term bridge health monitoring. Structural Health Monitoring p. 14759217211021942 (2021)
  • [16] Gao, Y., Zhai, P., Mosalam, K.M.: Balanced semisupervised generative adversarial network for damage assessment from low-data imbalanced-class regime. Computer-Aided Civil and Infrastructure Engineering 36(9), 1094–1113 (2021)
  • [17] Zhang, S., Ye, F., Wang, B., Habetler, T.G.: Semi-supervised learning of bearing anomaly detection via deep variational autoencoders. arXiv preprint arXiv:1912.01096 (2019)
  • [18] Pol, A.A., Berger, V., Germain, C., Cerminara, G., Pierini, M.: Anomaly detection with conditional variational autoencoders. In: 2019 18th IEEE international conference on machine learning and applications (ICMLA). pp. 1651–1657. IEEE (2019)
  • [19] Kingma, D.P., Welling, M.: An introduction to variational autoencoders. arXiv preprint arXiv:1906.02691 (2019)
  • [20] Moll, J., Kathol, J., Fritzen, C.P., Moix-Bonet, M., Rennoch, M., Koerdt, M., Herrmann, A.S., Sause, M.G., Bach, M.: Open guided waves: online platform for ultrasonic guided wave measurements. Structural Health Monitoring 18(5-6), 1903–1914 (2019)
  • [21] Rautela, M., Gopalakrishnan, S.: Ultrasonic guided wave based structural damage detection and localization using model assisted convolutional and recurrent neural networks. Expert Systems with Applications 167, 114189 (2021)