
Robust training approach of neural networks for fluid flow state estimations

Taichi Nakamura Department of Mechanical Engineering, Keio University, Yokohama 223-8522, Japan Koji Fukagata Department of Mechanical Engineering, Keio University, Yokohama 223-8522, Japan [email protected]
Abstract

State estimation from limited sensor measurements is a common challenge across a broad range of fields including mechanics, astronomy, and geophysics. Fluid mechanics is no exception: state estimation of fluid flows is particularly important for flow control and the processing of experimental data. However, strong nonlinearities and high spatio-temporal degrees of freedom of fluid flows make accurate estimation difficult. To handle these issues, neural networks (NNs) have recently been applied to fluid flow estimation in place of conventional linear methods. The present study focuses on the capability of NNs for various fluid flow estimation problems from a practical viewpoint regarding robust training. Three types of unsteady laminar and turbulent flows are considered for the present demonstration: 1. square cylinder wake, 2. turbulent channel flow, and 3. laminar-to-turbulent transitional boundary layer. We utilize a convolutional neural network (CNN) to estimate velocity fields from sectional sensor measurements. To assess the practicability of the CNN models, the physical quantities required for the input and the robustness against a lack of sensors are investigated. We also examine the effectiveness of several candidate training approaches for gaining robustness against the lack of sensors. The knowledge acquired through the present study in terms of effective training approaches can be transferred towards practical machine learning in fluid flow modeling.

1 Introduction

State estimation from limited available measurements is a challenging task over a wide range of fields such as engineering, economics, ecology, and biology (Simon, 2006). Among classical methods for state estimation, Bayesian estimation (Bayes, 1763) and the least-squares method (Gauss, 1857) can be regarded as fundamental techniques and were used in mathematical astronomy. Although various methods had been suggested and applied to canonical problems, it was the Apollo program that made state estimation famous in terms of practicability. The Kalman filter (Kalman, 1960) was applied to Apollo's navigation system and successfully helped take it to the Moon (Bar-Shalom et al., 2004; Sorenson, 1970). The Kalman filter then became popular, and various extensions have since been developed to deal with more complex applications (Hoshiya and Saito, 1984; Wan and Van Der Merwe, 2000; Evensen, 2003).

Including such extensions of the Kalman filter, various ideas have recently been utilized in solving problems across a wide range of fields, e.g., disease detection in the medical field (Moreno and Pigazo, 2009), recognition of the external world in robotics (Chen, 2011; Mitsantisuk et al., 2011), and power electronic control and distribution in electrical engineering (De Brabandere et al., 2006). Among these applications, fluid state estimation has been recognized as a particularly difficult example due to strong nonlinearities and an enormous number of degrees of freedom in space and time. The state estimation of fluid flows can be applied, among others, to flow control (Bewley, 2001; Brunton and Noack, 2015) and weather forecasting (Wunsch and Heimbach, 2007; Cushman-Roisin and Beckers, 2011). To the best of our knowledge, the first attempt to estimate flow fields was performed by Adrian and Moin (1988). They estimated large-scale structures in turbulent shear flow using linear stochastic estimation (LSE). However, the LSE has a limitation in estimating finer-scale structures due to its linear operations.

To overcome this limitation, Bewley's group examined the possibilities of the four-dimensional variational method and Kalman filters to estimate the turbulent channel flow from wall information (Bewley et al., 2001; Chevalier et al., 2006; Colburn et al., 2011). The Kalman filter estimated the velocity field near the wall relatively well, although the accuracy deteriorates away from the wall. They also reported that the nonlinear Kalman filter outperforms the linear Kalman filter and that only the ensemble Kalman filter is able to estimate the entire field. However, this favorable result with the ensemble Kalman filter can be considered limited according to the additional verification performed by Suzuki and Hasegawa (2006), which also highlights the difficulty of estimating the entire field of wall-bounded turbulence with high accuracy.

A large number of degrees of freedom is also a major hindrance for several fluid flow estimation methods. Due to the large number of discretization points required to represent fluid flow data, models have to estimate an extremely high-dimensional state from low-dimensional sensor information. This hindrance can be mitigated by representing the high-dimensional estimation target in a low-dimensional form (Everson and Sirovich, 1995; Candès et al., 2006; Donoho, 2006). For fluid flow estimation, proper orthogonal decomposition (POD)-based methods such as Gappy POD (Everson and Sirovich, 1995) have been considered (Bui-Thanh et al., 2004; Willcox, 2006; Manohar et al., 2018; Maulik et al., 2020); however, feature extraction from fluid flows is again limited by the linear nature of these methods.

As a new method to deal with the strong nonlinearity and high degrees of freedom of fluid flows, neural networks (NNs) have recently shown great potential in various flow problems (Brenner et al., 2019; Brunton et al., 2020a; Brunton et al., 2020b; Fukami et al., 2020a; Fukami et al., 2020b; Duraisamy, 2021). For instance, Güemes et al. (2019) estimated large-scale motions in wall-bounded turbulence as a combination of POD modes by utilizing a convolutional neural network (CNN) (LeCun et al., 1998). They reported that the CNN model successfully estimates large-scale motions and outperforms the POD-based estimation method. Guastoni et al. (2020) also utilized a CNN to estimate velocity fields of a turbulent channel flow and demonstrated the superiority of the CNN over LSE in terms of estimation accuracy. The comparison between CNN and LSE has also been demonstrated from the viewpoint of noise robustness (Nakamura et al., 2022). More recently, generative adversarial networks (GANs) have also been considered for state estimation (Güemes et al., 2021; Kim et al., 2021).

Figure 1: Overview of the velocity estimation problems covered in the present study. (a) Square cylinder wake, (b) turbulent channel flow, and (c) transitional boundary layer.

Since NNs have great potential for fluid flow estimation as introduced above, of particular interest here as the next step is their capability from a practical viewpoint. To this end, this paper discusses the practicability of the NN-based estimation method for fluid flows from the viewpoint of robust model construction. Considering the three types of unsteady flows summarized in figure 1, we investigate the dependence of the model performance on the physical quantities required for the estimation and on the minimum number of sensors. We also evaluate several approaches for the model to obtain robustness against the lack of sensors. The present paper is organized as follows. Details of the estimation model and training data are provided in Sections 2 and 3. The estimation performance, including the dependence on input attributes and the robustness against the lacked input, is demonstrated in Section 4.1. Several methods to acquire robustness against the lacked input are investigated in Section 4.2. We lastly present a summary and provide some outlooks in Section 5.

2 Convolutional neural network-based state estimator for fluid flows

As presented in figure 1, we consider state estimation of fluid flows from measurements available at locations distant from the target region. In this study, both input and output are two-dimensional sections such that the present model attempts to estimate sectional data from sectional inputs. We perform this sectional estimation capitalizing on a convolutional neural network (LeCun et al., 1998). The CNN has recently been identified as one of the promising tools for data-driven fluid flow analyses including state estimation (Fukami et al., 2019a; Fukami et al., 2021a; Kobayashi et al., 2021), flow control (Lee et al., 1997; Rabault et al., 2019), reduced-order modeling (Murata et al., 2020; Hasegawa et al., 2020b; Hasegawa et al., 2020a; Fukami et al., 2020c; Kim and Lee, 2020; Nakamura et al., 2021; Maulik et al., 2021), and turbulence modeling (Fukami et al., 2019b; Duraisamy et al., 2019; Lapeyre et al., 2019; Pawar et al., 2020; Thuerey et al., 2020; Font et al., 2021), thanks to the filter operation inside the CNN (Fukami et al., 2020a).

Table 1: Structure of the 2D CNN used in this study. The number of input attributes for the turbulent channel flow or the transitional boundary layer is denoted $N_P\,(=1,2,3)$. “Conv.” stands for a convolutional layer.
Square cylinder                      Turbulent channel flow                 Transitional boundary layer
Layer        Data size               Layer              Data size           Layer              Data size
Input (u)    (256, 128, 3)           Input (τx, τz, p)  (32, 32, N_P)       Input (τx, τz, p)  (128, 128, N_P)
1st Conv.    (256, 128, 64)          1st Conv.          (32, 32, 32)        1st Conv.          (128, 128, 32)
2nd Conv.    (256, 128, 64)          2nd Conv.          (32, 32, 32)        2nd Conv.          (128, 128, 32)
3rd Conv.    (256, 128, 32)          3rd Conv.          (32, 32, 32)        3rd Conv.          (128, 128, 32)
4th Conv.    (256, 128, 32)          4th Conv.          (32, 32, 32)        4th Conv.          (128, 128, 32)
5th Conv.    (256, 128, 32)          5th Conv.          (32, 32, 32)        5th Conv.          (128, 128, 32)
6th Conv.    (256, 128, 32)          6th Conv.          (32, 32, 32)        6th Conv.          (128, 128, 32)
7th Conv.    (256, 128, 32)          7th Conv.          (32, 32, 32)        7th Conv.          (128, 128, 32)
8th Conv.    (256, 128, 32)          8th Conv.          (32, 32, 32)        8th Conv.          (128, 128, 32)
9th Conv.    (256, 128, 32)          9th Conv.          (32, 32, 32)        9th Conv.          (128, 128, 32)
10th Conv.   (256, 128, 32)          10th Conv.         (32, 32, 32)        10th Conv.         (128, 128, 32)
11th Conv.   (256, 128, 32)          Output (u)         (32, 32, 3)         11th Conv.         (128, 128, 32)
12th Conv.   (256, 128, 3)           —                  —                   12th Conv.         (128, 128, 3)
Output (u)   (256, 128, 3)           —                  —                   Output (u)         (128, 128, 3)

The present CNN is composed of convolutional layers, which allow us to extract spatially coherent features of the data through filter operations. Note that pooling or upsampling operations are not considered in the present study because no dimension reduction or expansion is required for the present model, such that $\mathbb{R}^{d_{\rm input}}=\mathbb{R}^{d_{\rm output}}$, where $d_{\rm input}$ and $d_{\rm output}$ denote the vector dimensions of the input and output data, respectively (Morimoto et al., 2021b). The operation at the $s$-th convolutional layer $q^{(s)}$ can mathematically be expressed as

$$q^{(s)}_{ijn}=\varphi\left(\sum_{m=1}^{M}\sum_{p=0}^{H-1}\sum_{q=0}^{H-1}h^{(s)}_{pqmn}\,q^{(s-1)}_{i+p-G,\,j+q-G,\,m}+b_{n}^{(s)}\right), \qquad (1)$$

where $G=\lfloor H/2\rfloor$, $H$ is the width and height of the filter, $M$ is the number of input channels, $n$ is the index of the output channel, $b$ is a bias, and $\varphi$ denotes an activation function. We set the filter size to $H=3$ in this study. Although a nonlinear activation function can be chosen from various candidates (Fukami et al., 2021b; Murata et al., 2020), we use the ReLU (Nair and Hinton, 2010) to avoid vanishing gradients of the weights. The training of the CNN can be regarded as an optimization process for the weights ${\bm w}$. The weights are optimized through backpropagation (Hecht-Nielsen, 1992), minimizing a cost function between the estimated data and the reference data ${\bm q}_{\rm Ref}$,

$${\bm w}={\rm argmin}_{\bm w}\,||{\bm q}^{(s_{\rm max})}-{\bm q}_{\rm Ref}||_{2}, \qquad (2)$$

where ${\bm q}^{(s_{\rm max})}$ is the output of the CNN at the last layer $s_{\rm max}$. We use the $L_2$ error norm as the cost function. The Adam optimizer (Kingma and Ba, 2014) is utilized to perform the present optimization.
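To make the setup concrete, a minimal sketch of such an all-convolutional estimator is given below, assuming a TensorFlow/Keras implementation; the layer counts and channel widths loosely follow table 1 for the turbulent channel flow case, while the function name build_estimator and the final linear output layer are our own illustrative choices rather than the authors' exact code.

```python
from tensorflow.keras import layers, models

def build_estimator(nx=32, nz=32, n_in=2, n_out=3, n_filters=32, n_layers=10):
    """All-convolutional estimator mapping wall inputs (e.g., tau_x, tau_z)
    to a velocity field on a parallel plane; a sketch following table 1."""
    inp = layers.Input(shape=(nx, nz, n_in))
    x = inp
    for _ in range(n_layers):
        # 3x3 filters (H = 3) with ReLU; no pooling or upsampling, since the
        # input and output planes share the same resolution.
        x = layers.Conv2D(n_filters, (3, 3), padding='same', activation='relu')(x)
    out = layers.Conv2D(n_out, (3, 3), padding='same', activation='linear')(x)
    model = models.Model(inp, out)
    # L2 (mean-squared) loss minimized with the Adam optimizer, cf. Eq. (2).
    model.compile(optimizer='adam', loss='mse')
    return model

model = build_estimator()
model.summary()
```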

The structures of the CNN models are summarized in table 1. Note that we do not consider other model structures because we focus on the robust training approach in this study. For more information about the effect of model structure on robustness, please refer to Nakamura et al. (2022).

Figure 2: Vortical structures of the flow fields covered in the present study. (a) Square cylinder wake visualized by the $\lambda_2$ criterion with $\lambda_2=-0.001$, (b) turbulent channel flow visualized by the $Q$ criterion with $Q^+=0.01$, and (c) transitional boundary layer ($Q=0.04$).

3 Fluid flow data sets

This study demonstrates the capability of the CNN for fluid flow estimation tasks by considering three types of fluid flow data sets that cover a broad range of spatial length scales of laminar and turbulent flows. We hereafter introduce the data sets for the present demonstration.

3.1 Wake around a square cylinder

A flow around a square cylinder at the Reynolds number $Re_D=300$ is first considered. Although this is a laminar example, the flow at the present Reynolds number can be regarded as a good candidate to discuss the reconstructability of the present CNN model because there are complex three-dimensional structures associated with two- and three-dimensional separated shear layers (Bai and Alam, 2018), as shown in figure 2(a). A direct numerical simulation (DNS) is used to prepare the training data by numerically solving the incompressible Navier–Stokes equations with a penalization term (Caltagirone, 1994; Matsuo et al., 2021; Morimoto et al., 2021a), i.e.,

$$\bm{\nabla}\cdot{\bm u}=0,\qquad \partial_t{\bm u}+\bm{\nabla}\cdot\left({\bm u}{\bm u}\right)=-\bm{\nabla}p+Re_D^{-1}\nabla^2{\bm u}+\lambda\chi\left({\bm u}_b-{\bm u}\right), \qquad (3)$$

where ${\bm u}=\{u,v,w\}$ and $p$ are the velocity vector and pressure, which are nondimensionalized with the fluid density $\rho$, the length of the square cylinder $D$, and the uniform velocity $U_\infty$. The penalization term expresses the bluff body with a penalty parameter $\lambda$, a mask value $\chi$, and the velocity of the object ${\bm u}_b$, which is zero in the present case. The mask value takes $\chi=0$ outside the body and $\chi=1$ inside it. The spatial domain of the present simulation covers $(L_x,L_y,L_z)=(20D,20D,4D)$.

The DNS code is based on that developed for turbulent channel flows by Fukagata et al. (2006): the spatial discretization is done by using the energy-conservative second-order finite difference method, and the time integration is done by using the low-storage third-order Runge–Kutta/Crank–Nicolson method. The computation is carried out with a time step of $\Delta t=2.5\times10^{-3}$. A uniform velocity is imposed at the inflow boundary, while the convective boundary condition is applied at the outflow boundary. We consider the slip boundary condition at $y=0$ and $y=L_y$, and the periodic boundary condition at $z=0$ and $z=L_z$. The center of the square cylinder is located $5.5D$ downstream of the inflow boundary.

For the present training data, the part of the computational volume around the square cylinder, $(12.8D,4D,4D)$ with the grid numbers of $(N_x^\sharp,N_y^\sharp,N_z^\sharp)=(256,128,160)$, is extracted, and 1000 snapshots are prepared. We choose 70% of the snapshots for the training, while the remaining 30% is used for the validation. We also consider an additional 1000 snapshots for the assessment in Section 4.1.1. For this square cylinder example, we use the velocity vector ${\bm u}=\{u,v,w\}$ as the quantities of interest for both input and output.

3.2 Turbulent channel flow

Similar to our previous work on a CNN autoencoder (Nakamura et al., 2021), a minimal turbulent channel flow (Jiménez and Moin, 1991) at $Re_\tau=110$ is then considered, as shown in figure 2(b). The training data are obtained by a DNS which numerically solves the incompressible continuity and Navier–Stokes equations,

$$\bm{\nabla}\cdot{\bm u}=0,\qquad \partial_t{\bm u}+\bm{\nabla}\cdot({\bm u}{\bm u})=-\bm{\nabla}p+Re_\tau^{-1}\nabla^2{\bm u}, \qquad (4)$$

where ${\bm u}$ and $p$ represent the velocity vector and pressure, respectively. The quantities used in the equations are nondimensionalized with the channel half-width $\delta$ and the friction velocity $u_\tau$.

The computational domain covers $(L_x,L_y,L_z)=(\pi\delta,2\delta,0.5\pi\delta)$ with the grid numbers of $(N_x,N_y,N_z)=(32,64,32)$. A uniform grid is used in the streamwise ($x$) and spanwise ($z$) directions, while a nonuniform grid is used in the wall-normal ($y$) direction. The numerical scheme of the DNS is exactly the same as that for the square cylinder case. The time step is set to $\Delta t^+=0.0385$, where the superscript $+$ denotes wall units.

We use 10000 snapshots for the present CNN: 70% of the snapshots are used for the training, and the remaining 30% for the validation. Note that the channel flow is considered both for the comparison among the various fluid flow data sets (section 4.1) and for the investigation of robustness acquisition within the training framework (section 4.2). For both assessments, an additional 5000 snapshots are prepared. Although details will be provided later, the present CNN for the turbulent channel flow example attempts to estimate the velocity vector ${\bm u}=\{u,v,w\}$ from wall measurements. The dependence of the estimation accuracy on the choice of input quantities will also be investigated.

3.3 Transitional boundary layer

As a more complex example, we also consider a transitional boundary layer prepared from the Johns Hopkins Turbulence Databases (Li et al., 2008; Perlman et al., 2007). For details of the computational conditions, please refer to Zaki (2013). The data sets are obtained by DNS of incompressible flow over a flat plate with an elliptical leading edge. The Reynolds number based on the plate half-thickness $L$ is $Re_L=800$. The computational domain non-dimensionalized by $L$ is $(L_x,L_y,L_z)=(1099,40,240)$, and the simulation time step is $\Delta t=0.005$. A no-slip boundary condition is applied at the wall.

From the original computational domain of the database, we extract the transition region defined by the momentum-thickness Reynolds number so that the extracted domain contains both laminar and turbulent structures, as shown in figure 2(c). The extracted domain size is $(149.7,26.4,60.0)$ with $(N_x,N_y,N_z)=(128,224,128)$ grid points. Based on the above configuration, we prepare 700 snapshots, 70% of which are used for the training and the remaining 30% for the validation. We also prepare an additional 250 snapshots as the test data for the assessment in Section 4.1.3. Similar to the channel flow case, the present CNN for this transitional example also aims to estimate the velocity field ${\bm u}=\{u,v,w\}$ from wall measurements.

4 Results

4.1 Demonstration of CNN-based state estimator for unsteady laminar and turbulent flows

4.1.1 Square cylinder wake

Figure 3: Estimation of a square cylinder wake. (a) Velocity fields and the $L_2$ error norms. (b) Mean streamwise velocity profile at $x=10.0$.
Figure 4: Robustness of the CNN model against the lack of sensors for the estimation of a square cylinder wake. (a) Relationship between the $L_2$ error norm and the lack ratio. (b) Streamwise velocity fields of input and output data. The values beneath the contours represent the $L_2$ error norm.

Let us first apply the CNN to a square cylinder wake at $Re_D=300$. The present CNN model ${\mathcal F}$ estimates an $x$-$y$ sectional velocity field ${\bm u}$ at $z=1.5$ (non-dimensionalized by $D$) from velocity sensors ${\bm s}$ collected from the $x$-$y$ cross-section at $z=2.0$ such that ${\bm u}_{z=1.5}={\mathcal F}({\bm s}_{z=2.0})$. The estimated velocity fields with the $L_2$ error norm are presented in figure 3(a). Here, the $L_2$ error norm $\epsilon$ is defined as $\epsilon=||{\bm u}_{\rm ML}-{\bm u}_{\rm DNS}||_2/||{\bm u}^{\prime}_{\rm DNS}||_2$, where ${\bm u}_{\rm ML}$ is the estimated velocity field, ${\bm u}_{\rm DNS}$ is the reference DNS field, and ${\bm u}^{\prime}_{\rm DNS}$ denotes the velocity fluctuations of the DNS data. The reconstructed velocity fields are in reasonable agreement with the DNS data. The reasonable reconstruction can also be observed in the streamwise mean velocity profile in figure 3(b).
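For reference, a minimal sketch of this error metric is given below; the array names are our own, and the fluctuation is taken about the temporal mean, which is our assumption about the definition of ${\bm u}^{\prime}_{\rm DNS}$.

```python
import numpy as np

def l2_error_norm(u_ml, u_dns):
    """L2 error norm: ||u_ML - u_DNS||_2 / ||u'_DNS||_2."""
    # fluctuation about the temporal mean (assumed definition of u'_DNS)
    u_dns_fluc = u_dns - u_dns.mean(axis=0, keepdims=True)
    return np.linalg.norm(u_ml - u_dns) / np.linalg.norm(u_dns_fluc)

# Example with placeholder arrays of shape (n_snapshots, ny, nx, 3).
rng = np.random.default_rng(4)
u_dns = rng.standard_normal((10, 128, 256, 3))
u_ml = u_dns + 0.1 * rng.standard_normal(u_dns.shape)
print(l2_error_norm(u_ml, u_dns))
```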

We then investigate the capability of the present CNN model from a practical viewpoint. In the assessment above, we used information from all grid points on the cross-section as the input, which implies that sensors must be arranged without gaps; this is not realistic. Hence, the robustness of the CNN model against the lacked input data is examined here. The sensor placements are randomly determined, and this information is fed into the CNN model already trained with the full data. However, since a two-dimensional CNN can only handle sectional data (i.e., not local sensor measurements), an appropriate preprocessing is required to feed the sensor information into the CNN directly. To do this, we consider four methods to treat the lack of input: 1. zero-fill (i.e., substituting zero for the lacked part), 2. linear interpolation, 3. cubic interpolation, and 4. Voronoi tessellation. The Voronoi tessellation (Voronoi, 1908) can handle random sensor placements, and its usefulness for fluid flow data has been demonstrated by Fukami et al. (2021c). Note that for the linear and cubic interpolation, the edge region not surrounded by the sensors is filled by linear extrapolation.
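As an illustration, a minimal sketch of such interpolation-based preprocessing is given below, assuming SciPy's griddata for the zero/linear/cubic filling; nearest-neighbor filling stands in as a crude proxy for the Voronoi-tessellation input of Fukami et al. (2021c), the edge handling differs from the paper's linear extrapolation, and all function names are our own.

```python
import numpy as np
from scipy.interpolate import griddata

def fill_missing_sensors(field, mask, method='cubic'):
    """Fill missing sensor locations on a 2D sectional input.

    field : 2D array of sensor values (arbitrary entries where mask is False)
    mask  : 2D boolean array, True where a sensor is available
    method: 'zero', 'linear', 'cubic', or 'nearest' (Voronoi-like filling)
    """
    if method == 'zero':
        return np.where(mask, field, 0.0)
    ny, nx = field.shape
    yy, xx = np.meshgrid(np.arange(ny), np.arange(nx), indexing='ij')
    pts = np.column_stack((yy[mask], xx[mask]))   # available sensor locations
    vals = field[mask]                            # sensor readings
    filled = griddata(pts, vals, (yy, xx), method=method)
    # griddata leaves NaNs outside the convex hull of the sensors; fill those
    # with nearest-neighbor values (simpler than the paper's linear extrapolation).
    nan = np.isnan(filled)
    if nan.any():
        filled[nan] = griddata(pts, vals, (yy[nan], xx[nan]), method='nearest')
    return filled

# Example: keep only 3% of the sensors (lack ratio r_lack = 0.97).
rng = np.random.default_rng(0)
u = rng.standard_normal((128, 256))
mask = rng.random(u.shape) > 0.97
u_input = fill_missing_sensors(np.where(mask, u, 0.0), mask, method='cubic')
```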

The error for each ratio of lacked data $r_{\rm lack}=n_{\rm lack}/n_{\rm all}$ is presented in figure 4(a). The total number of sensors in this problem corresponds to the number of grid points in the DNS, i.e., $n_{\rm all}=256\times128=32768$, while $(n_{\rm all}-n_{\rm lack})$ is the number of randomly placed sensors used for the input. As shown, the error of the interpolation methods is considerably smaller than that of the zero-fill, which implies the effectiveness of interpolation in maintaining robustness. The estimated velocity fields from the lacked input are also visualized in figure 4(b). We can observe the superiority of the cubic interpolation method for all of the lack ratios. The estimated fields using cubic interpolation with lack ratios $r_{\rm lack}$ of $\{0.88, 0.97\}$ are in qualitative agreement with the reference DNS data, though the wake structures slightly deform or disappear at $r_{\rm lack}=0.99$. Therefore, the present CNN model for a square cylinder wake retains its robustness up to $r_{\rm lack}=0.97$ when the cubic interpolation is utilized.

4.1.2 Turbulent channel flow

Let us then use a turbulent channel flow as a complex flow example. The CNN models ${\mathcal F}$ estimate the velocity field $\{u,v,w\}$ at $y^+=15.4$ from sensor measurements on the wall such that ${\bm u}_{y^+=15.4}={\mathcal F}({\bm s}_{\rm wall})$. From the perspective of saving sensors, it is important to know which physical quantities can contribute to the estimation. Hence, we here examine the dependence of the model performance on the input attributes. As candidates, we consider several combinations of three physical quantities, the streamwise and spanwise wall shear stresses $\tau_x,\tau_z$ and the pressure $p$, which are often used for state estimation in turbulent channel flow (Suzuki and Hasegawa, 2006; Guastoni et al., 2020).

Figure 5: Dependence of the model performance on input attributes for the turbulent channel flow. (a) Velocity and vorticity fields. The values beneath the contours represent the $L_2$ error norm. (b) $L_2$ error norm of each model. “All” represents an ensemble error over the three velocity components. (c) Probability density function of each velocity component. (d) Temporal two-point correlation coefficient. (e) $L_2$ error map of the energy spectrum between the DNS and the field reconstructed with the input of $\{\tau_x,\tau_z\}$.

The estimation performance for each input case is summarized in figure 5. Note that the vorticity field $\omega_y$ in figure 5(a) is obtained from the estimated velocity fields $u$ and $w$ such that $\omega_{y,{\rm ML}}=f(u_{\rm ML},w_{\rm ML})=\partial u_{\rm ML}/\partial z-\partial w_{\rm ML}/\partial x$. Hence, the assessment of $\omega_{y,{\rm ML}}$ can be regarded as tougher than that of the velocities alone, because the first-order derivatives must be computed from the machine-learned velocities. The reconstructed velocity and vorticity fields are in reasonable agreement with the reference DNS. The $L_2$ error is also summarized in figure 5(b). Among the cases with a single input quantity, i.e., $\{\tau_x\}$, $\{\tau_z\}$, and $\{p\}$, the $L_2$ error with the streamwise wall shear stress $\tau_x$ is the smallest, which suggests that $\tau_x$ contributes most significantly to the estimation. This can particularly be found from the vorticity contours $\omega_y$ in figure 5(a). The model with the $\{\tau_x\}$ input can estimate the DNS-like structures well, while the models with the $\{\tau_z\}$ and $\{p\}$ inputs cannot estimate the vortex structures. However, for $v$ and $w$, the $L_2$ error with the $\{\tau_x\}$ input alone is larger than that with $\tau_z$. This implies that a reasonable estimation of all velocity components requires both $\tau_x$ and $\tau_z$. The necessity of the spanwise wall shear stress input $\{\tau_z\}$ can also be observed in the probability density functions in figure 5(c). For $v^{\prime}$ and $w^{\prime}$, the curve of the CNN with the $\{\tau_x\}$ input is slightly shifted from that of the DNS, while that of the CNN with the $\{\tau_x,\tau_z\}$ input is in reasonable agreement with the DNS. The pressure input, in contrast, contributes only marginally to the estimation, and the performance of the model with the $\{\tau_x,\tau_z,p\}$ input is not notably different from that with the $\{\tau_x,\tau_z\}$ input. We also assess the reconstruction of the temporal behavior by considering the temporal two-point correlation coefficient $R^+_{vv}(t^+)/R^+_{vv}(t^+=0)$, as presented in figure 5(d). The coefficient is defined as

$$R^+_{vv}(t^+)=\overline{v^{\prime}(t_0^{+}+t^{+},x,z)\,v^{\prime}(t_0^{+},x,z)}^{\,t_0^{+},x,z}. \qquad (5)$$

The curves for the models with the $\{\tau_x,\tau_z\}$ and $\{\tau_x,\tau_z,p\}$ inputs are in good agreement with the DNS. Together with the other assessments above, this indicates that the input attribute set $\{\tau_x,\tau_z\}$ is sufficient for the estimation, also from the viewpoint of temporal behavior.
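For reference, a minimal sketch of how the normalized temporal two-point correlation of Eq. (5) can be evaluated from a series of estimated snapshots is shown below; the array layout (time, x, z) and the variable names are our own assumptions.

```python
import numpy as np

def temporal_correlation(v, max_lag):
    """Normalized temporal two-point correlation R_vv(t)/R_vv(0), cf. Eq. (5).

    v : array of shape (n_time, n_x, n_z) containing the fluctuation v'
    """
    n_t = v.shape[0]
    r = np.empty(max_lag + 1)
    for lag in range(max_lag + 1):
        # average over the starting time t0 and the homogeneous x, z directions
        r[lag] = np.mean(v[: n_t - lag] * v[lag:])
    return r / r[0]

# Example with synthetic data standing in for estimated v' snapshots.
rng = np.random.default_rng(1)
v_prime = rng.standard_normal((200, 32, 32))
R = temporal_correlation(v_prime, max_lag=50)
```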

Figure 6: Robustness of the CNN model against the lack of sensors for the turbulent channel flow example. (a) Relationship between the $L_2$ error norm and the lack ratio. (b) Dependence of the velocity reconstruction ($u^{\prime}$) on the interpolation methods and the lack ratio. The values beneath the contours represent the $L_2$ error norm.

Let us further assess the capability of the model in wavenumber space, focusing on the case with the $\{\tau_x,\tau_z\}$ input. The normalized $L_2$ error map of the energy spectrum in the streamwise and spanwise directions is presented in figure 5(e). Here, the two-dimensional energy spectrum is defined as

$$E_{uu}(k_x,k_z)=\overline{\hat{u}^{*}\hat{u}}^{\,t}, \qquad (6)$$

where $\hat{(\cdot)}$ represents the two-dimensional Fourier transform and $(\cdot)^{*}$ denotes the complex conjugate. For both the streamwise and spanwise directions, the error is small up to a wavenumber of $10^{0.5}$. This implies that the CNN model can make physically reasonable estimations, although much finer-scale structures cannot be estimated. This is likely due to the weaker correlation in the dissipation range (Scherl et al., 2020; Fukami et al., 2021b).
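A minimal sketch of the two-dimensional energy spectrum of Eq. (6), computed with NumPy FFTs and averaged over snapshots, is shown below; the array shapes, normalization, and names are our own assumptions.

```python
import numpy as np

def energy_spectrum_2d(u):
    """Time-averaged two-dimensional energy spectrum E_uu(kx, kz), cf. Eq. (6).

    u : array of shape (n_time, n_x, n_z) containing the fluctuation u'
        on the x-z plane (periodic directions).
    """
    n_t, n_x, n_z = u.shape
    u_hat = np.fft.fft2(u, axes=(1, 2)) / (n_x * n_z)   # 2D Fourier transform
    e_uu = np.mean(np.abs(u_hat) ** 2, axis=0)          # conj(u_hat) * u_hat, time-averaged
    k_x = np.fft.fftfreq(n_x) * n_x                     # integer wavenumber indices
    k_z = np.fft.fftfreq(n_z) * n_z
    return k_x, k_z, e_uu

# Example with synthetic data standing in for estimated u' snapshots.
rng = np.random.default_rng(2)
kx, kz, E = energy_spectrum_2d(rng.standard_normal((100, 32, 32)))
```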

Figure 7: Dependence of the model performance on input attributes for the transitional boundary layer. The values underneath the contours represent the $L_2$ error norm.
Figure 8: Robustness of the CNN model against the lack of sensors for the transitional boundary layer. (a) Relationship between the $L_2$ error norm and the lack ratio. (b) Dependence of the streamwise velocity reconstruction on the interpolation methods and the lack ratio. The values beneath the contours represent the $L_2$ error norm.
Figure 9: Dependence of the model performance on the streamwise position for the example of the transitional boundary layer. (a) 45-degree map of the reference DNS and the estimated field. (b) Joint probability density function of the DNS and the CNN. (c) Intermittency factor map of the DNS and the CNN.
Figure 10: Assessment of the CNN models trained with augmented data sets. (a) The regions of each training data set. (b) $L_2$ error norm of each model. (c) Velocity contours and $L_1$ error norm map. The values underneath the contours represent the $L_2$ error norm of the snapshot.

We also investigate the robustness against the lacked input data for the turbulent channel flow example, analogous to the square cylinder example. Following the discussion above, we here use the CNN model with the $\{\tau_x,\tau_z\}$ input. The error for each lack ratio is shown in figure 6(a). The smaller error with the interpolation methods compared to the zero-fill indicates the effectiveness of interpolation, similarly to the square cylinder example. We then check the velocity contours to compare the interpolation methods and to see what percentage of lack can be tolerated, as summarized in figure 6(b). The superiority of the cubic interpolation can again be observed. For 13% lack, the estimated flow field is in qualitative agreement with the DNS, whereas the fine structures are lost with 50% lack, and the estimated structure becomes substantially different from that of the DNS with 75% lack. Therefore, we can conclude that the cubic interpolation is the best choice for preprocessing and that the present CNN model for the turbulent channel flow estimation can accept at most 50% lacked input data.

4.1.3 Transitional boundary layer

As a more complex problem, let us apply the present CNN model to the transitional boundary layer flow. The model ${\mathcal F}$ estimates the velocity field $\{u,v,w\}$ at $y=0.96L$ from sensor information on the flat plate such that ${\bm u}_{0.96L}={\mathcal F}({\bm s}_{\rm plate})$. Analogous to the channel flow example, we investigate which input attributes contribute to the transitional boundary layer estimation. We consider seven cases that are combinations of the streamwise and spanwise shear stresses $\tau_x,\tau_z$ and the pressure $p$. The estimated velocity fields and the $L_2$ error norms are summarized in figure 7. As clearly seen, the models which include the streamwise shear stress $\tau_x$ in the input show better performance than those without the $\tau_x$ input. However, we should note that the $\tau_x$ input alone is insufficient for the estimation of $v$ and $w$. Hence, additional information such as $\tau_z$ and $p$ is required to enhance the estimation accuracy for $v$ and $w$, as presented in figure 7.

We also investigate the robustness against the lacked input, as shown in figure 8. The effectiveness of interpolation can again be observed. We cannot observe a significant difference between the interpolation methods in figure 8(a), which motivates the comparison of the flow fields in figure 8(b). The $L_2$ error norm in figure 8(b) indicates that the cubic interpolation is slightly better than the other methods. Note that the larger error of the cubic interpolation in the range $r_{\rm lack}\gtrsim 0.75$ in figure 8(a) is not meaningful because the error level there is already quite large. The lack tolerance of the models is then examined by observing the flow fields in figure 8(b). For 25% lack, the overall trends of the estimated field are retained compared to the DNS. On the other hand, we gradually start to see flow structures different from the DNS with 50% lack. Therefore, the present CNN model for the transitional boundary layer example can accept at most 25% lacked input data. However, we should note that the error even without lack is already large, which prompts us to examine its origin.

As an additional assessment to clarify the point above, the maps of estimated values versus the reference DNS values, usually called 45-degree maps, are shown in figure 9(a). We here consider the CNN model with the $\{\tau_x,\tau_z\}$ input. A large error can be found in the downstream region for all velocity components. The same trend can be seen from the comparison of the joint probability density functions in figure 9(b). The difference between the upstream and downstream regions can also be assessed from a physical viewpoint by introducing the intermittency factor $\gamma$, defined as the fraction of time during which the flow in a given region is turbulent. Continuous laminar and turbulent flows correspond to $\gamma=0$ and 1, respectively. There are several criteria to determine whether a specific region is turbulent or not. We here use the modified turbulent energy recognition algorithm (M-TERA) method (Zhang et al., 1995). The M-TERA method identifies a region as turbulent when it satisfies the following condition:

$$\overline{\left|u^{\prime}\frac{\partial u^{\prime}}{\partial t}\right|}>C\left[\overline{u}\,\frac{(\partial u^{\prime}/\partial t)_{\rm rms}}{(u^{\prime}\,\partial u^{\prime}/\partial t)_{\rm rms}}\right], \qquad (7)$$

where $\overline{(\cdot)}$ represents the mean over a short time interval and $(\cdot)_{\rm rms}$ denotes the long-time standard deviation. The contours of the intermittency factor are presented in figure 9(c). As can be seen, the downstream region estimated by the CNN model is mostly identified as laminar, although the region is almost entirely turbulent in the DNS. One candidate to improve the estimation performance in the downstream region is the addition of training data (so-called data augmentation in the field of machine learning (Shorten and Khoshgoftaar, 2019; Morimoto et al., 2022)). Hence, our next interest is what kind of structures should be contained in the additional training data.
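Before turning to that question, a minimal sketch of the M-TERA criterion of Eq. (7) is given below; the window length, the threshold constant C, and the function names are our own illustrative choices rather than the settings of Zhang et al. (1995).

```python
import numpy as np

def mtera_turbulent(u_prime, u_mean, dt, c=1.2, window=64):
    """Flag short time windows as turbulent using the M-TERA criterion, cf. Eq. (7).

    u_prime  : 1D time series of the streamwise velocity fluctuation u'
    u_mean   : local mean streamwise velocity
    dt       : sampling interval
    c, window: threshold constant and window length (illustrative values)
    """
    dudt = np.gradient(u_prime, dt)          # du'/dt
    detector = np.abs(u_prime * dudt)        # |u' du'/dt|
    # long-time standard deviations appearing on the right-hand side of Eq. (7)
    rms_dudt = np.std(dudt)
    rms_udu = np.std(u_prime * dudt)
    threshold = c * u_mean * rms_dudt / rms_udu
    # short-time average of the detector over non-overlapping windows
    n_win = len(u_prime) // window
    flags = [detector[i * window:(i + 1) * window].mean() > threshold
             for i in range(n_win)]
    return np.array(flags)

# Intermittency factor gamma = fraction of windows flagged as turbulent.
rng = np.random.default_rng(3)
signal = rng.standard_normal(4096)
gamma = mtera_turbulent(signal, u_mean=1.0, dt=0.005).mean()
```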

To clarify this point, we here consider two types of training data addition, as presented in figure 10(a):

  1. Add 700 snapshots from both the laminar and turbulent regions to the original 700 snapshots of the transient region, giving 2100 snapshots in total (Case 1).

  2. Add 700 snapshots from both the upper and lower portions of the transient region, giving 2100 snapshots in total (Case 2).

We use the streamwise and spanwise wall shear stresses $\tau_x,\tau_z$ as the input attributes for the CNN model. The performance of each model is then assessed using the test data from the original transient region, as summarized in figures 10(b) and (c). Both cases show improvements in reconstruction, especially in the $u$ component. In this particular example, Case 2 improves the accuracy more than Case 1, which suggests that, for transitional flow estimation, transient structures are more important than laminar/turbulent structures as additional data. In this way, proper data augmentation offers a practical path to improve the estimation accuracy. We also note that the reason why we cannot find a significant improvement for the $v$ and $w$ components is merely that the training data sampling is determined based on the structural distribution of the streamwise velocity $u$, as shown in figure 10(a). Hence, the use of the $v$ and $w$ components for the training data sampling, or incorporating intelligent data preparation (Sapsis, 2020), could further promote the estimation capability of the model.

4.2 Investigation of several methods to acquire robustness against the lacked input with turbulent channel flow

In this section, let us examine the capability of several methods to construct a CNN model that is robust against lacked input data, especially for fluid flow estimation. We aim to construct a robust model while maintaining the estimation performance for input data without lack. We here consider three methods: 1. regularization, 2. dropout, and 3. noise-perturbed data training. For the demonstrations in what follows, we use the turbulent channel flow example with the streamwise wall shear stress $\tau_x$ as the input.

4.2.1 Regularization

Regularization is often used in the fields of machine learning and statistics as a method of preventing overfitting and gaining robustness (Schölkopf et al., 2002). This method adds a penalization term to the loss function as

$${\bm w}={\rm argmin}_{\bm w}\left[||{\bm q}_{\rm ML}-{\bm q}_{\rm Ref}||_{A}+\alpha||{\bm w}||_{B}\right], \qquad (8)$$

where $\alpha$ is a hyperparameter that determines the magnitude of the regularization, and $A$ and $B$ indicate the norms used for the loss and regularization terms, respectively. In this study, a set of $\{A,B\}=\{2,[1,2]\}$ is considered, where $B=1$ and $2$ correspond to Lasso ($L_1$ regularization) (Tibshirani, 1996) and Ridge ($L_2$ regularization) (Hoerl and Kennard, 1970), respectively.
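As an illustration, a minimal sketch of attaching such weight regularization to the convolutional layers is given below, assuming a TensorFlow/Keras implementation; the function name regularized_conv and the specific alpha value are placeholders corresponding to the hyperparameter in Eq. (8).

```python
from tensorflow.keras import layers, regularizers

def regularized_conv(x, n_filters=32, alpha=1e-6, kind='l2'):
    """3x3 convolution with L1 (Lasso) or L2 (Ridge) weight regularization."""
    reg = regularizers.l1(alpha) if kind == 'l1' else regularizers.l2(alpha)
    return layers.Conv2D(n_filters, (3, 3), padding='same',
                         activation='relu', kernel_regularizer=reg)(x)

# The penalty alpha * ||w||_B of Eq. (8) is added automatically to the loss
# (e.g., loss='mse' for the L2 data term) when the model is compiled.
inp = layers.Input(shape=(32, 32, 1))   # e.g., tau_x input on the wall
x = regularized_conv(inp, kind='l2', alpha=1e-6)
```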

Figure 11: Effect of regularization on the robustness. “Regular” denotes the model without regularization. $L_2$ error norm of the model with (a) $L_1$ and (b) $L_2$ regularization. The error bar is based on the standard deviation over the five-fold cross validation. (c) Streamwise velocity fields. The values underneath the contours represent the $L_2$ error norm.

The influence of the parameter $\alpha$ is investigated for both the $L_1$ and $L_2$ regularizations in figure 11. We have performed a five-fold cross validation for each $\alpha$ (Brunton and Kutz, 2019), as presented by the error bars in figures 11(a) and (b). As the method for interpolating the lacked inputs, we use the cubic interpolation. The mean value of the $L_2$ error norm among the present cross-validation models does not show a clear dependence on the value of $\alpha$. In contrast, focusing on the standard deviation, we can see that there is a large variation in the error depending on the hyperparameter $\alpha$. In particular, the standard deviation with $\alpha=1\times10^{-6}$ is considerably larger than the others, which suggests that one of the models with $\alpha=1\times10^{-6}$ exhibits much greater robustness than the others. The reason for the large variance in the error is likely that the optimization of the neural network is carried out in a high-dimensional solution space associated with the update of an immense number of weights. The same trend can also be found with the $L_2$ regularization in figure 11(b). In sum, cross validation is mandatory to reach a robust model.

Figure 12: Effect of dropout on the robustness. “Regular” denotes the model without dropout. (a) Dependence of the $L_2$ error norm on the dropout ratio and the sensor lack. (b) Streamwise velocity fields. The values underneath the contours represent the $L_2$ error norm.

The velocity fields for the regularization cases are summarized in figure 11(c). Note that we here visualize only the best case for each $r_{\rm lack}$ since there is a high variation among the cross-validated models. The regular CNN model without regularization is also shown for comparison. With the regular model, the correct structures cannot be recovered at 75% lack; however, this issue can be mitigated by capitalizing on regularization, especially the $L_2$ regularization in the present case. The superiority of the regularized models can also be observed from the $L_2$ error norm. In summary, regularization is effective in obtaining robustness, although care should be taken in the choice of the hyperparameter $\alpha$. Constructing several models with an appropriate parameter $\alpha$ results in a large variation in robustness, and users should choose the most robust model among them.

Figure 13: Effect of noise training on the robustness. “Regular” denotes the model without noise training. (a) Dependence of the $L_2$ error norm on the noise ratio of the training data and the sensor lack. (b) Streamwise velocity fields. The values underneath the contours represent the $L_2$ error norm.

4.2.2 Dropout

Let us then consider dropout (Srivastava et al., 2014), which is often used in machine learning to prevent overfitting. This method randomly deactivates a certain percentage of nodes during the training of a model, which essentially enables us to construct multiple models within a single model. We examine the robustness against the lacked input by constructing several models with different dropout rates, as shown in figure 12(a). The dropout rate here represents the deactivation rate of the nodes during training. The lacked portion is interpolated using the cubic interpolation, analogous to the regularization investigation. The models with dropout outperform the regular model, especially at large lack ratios. Thus, a larger dropout ratio is indeed effective in obtaining robustness; however, we should note that too large a dropout ratio deteriorates the model performance when the lack ratio is small, since the connections inside the model become too sparse. The velocity contours are also checked in figure 12(b). Overall, we can observe the superiority of the dropout models over the regular one. In particular, the flow field estimated by the model with 20% dropout is in reasonable agreement with the DNS even with 75% lacked input. Therefore, we can conclude that dropout can make the CNN model robust for up to about 75% lack in this example.
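A minimal sketch of adding dropout between the convolutional layers is given below, again assuming a TensorFlow/Keras implementation; the rate of 0.2 mirrors the 20% case above, while the layer arrangement and the function name are our own illustration.

```python
from tensorflow.keras import layers

def conv_block_with_dropout(x, n_filters=32, rate=0.2):
    """3x3 convolution followed by dropout, active only during training."""
    x = layers.Conv2D(n_filters, (3, 3), padding='same', activation='relu')(x)
    # Randomly deactivates the given fraction of activations at training time;
    # at inference time all units are kept and rescaled automatically.
    return layers.Dropout(rate)(x)

inp = layers.Input(shape=(32, 32, 1))   # e.g., tau_x input on the wall
x = conv_block_with_dropout(inp, rate=0.2)
```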

4.2.3 Noise-addition training

At last, we investigate the effect of noise-addition training. Noise perturbation of the training data is one of the data augmentation techniques used to gain robustness of machine learning models, because test data can generally be regarded as "noise" relative to the training data (Shorten and Khoshgoftaar, 2019; Morimoto et al., 2022). We here consider Gaussian noise whose magnitude is defined by the signal-to-noise ratio (SNR), ${\rm SNR}=\sigma^2_{\rm data}/\sigma^2_{\rm noise}$, where $\sigma^2_{\rm data}$ and $\sigma^2_{\rm noise}$ are the variances of the input data and the noise, respectively. The present study covers three different magnitudes of noise, ${\rm 1/SNR}=\{0.01, 0.05, 0.10\}$. The Gaussian noise is added to 5000 of the 10000 training snapshots. A machine learning model is constructed for each noise level, with the corresponding training data set (i.e., 5000 noisy snapshots plus 5000 clean snapshots) prepared for each case.
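A minimal sketch of this noise-addition augmentation is given below; the snapshot array layout, the random seed, and the choice of which half of the training set to perturb are our own assumptions for illustration.

```python
import numpy as np

def add_gaussian_noise(data, inv_snr, seed=0):
    """Perturb snapshots with zero-mean Gaussian noise at a prescribed 1/SNR.

    data   : array of shape (n_snapshots, nx, nz, n_channels)
    inv_snr: noise-to-signal variance ratio, e.g., 0.01, 0.05, or 0.10
    """
    # SNR = var(data) / var(noise)  =>  sigma_noise = sqrt(var(data) / SNR)
    sigma_noise = np.sqrt(inv_snr * np.var(data))
    rng = np.random.default_rng(seed)
    return data + rng.normal(0.0, sigma_noise, size=data.shape)

# Perturb half of the training snapshots and keep the other half clean.
rng = np.random.default_rng(0)
x_train = rng.standard_normal((10000, 32, 32, 1))   # placeholder wall inputs
noisy = add_gaussian_noise(x_train[:5000], inv_snr=0.10)
x_train_aug = np.concatenate([noisy, x_train[5000:]], axis=0)
```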

The error of each model for the lacked input is summarized in figure 13(a). Compared to the regular model, the noise-trained models are more robust to the lacked input, and the robustness is strengthened by adding stronger noise, although there is an upper limit. The performance of the models is also verified using the streamwise velocity contours in figure 13(b). The flow fields estimated by the models trained with noise-perturbed data are in good agreement with the DNS data even with the lacked input. Noteworthy here is that the $L_2$ error of the model trained with 10% noise at 50% lack is smaller than that of the model trained without noise at 0% lack. This suggests that the noise perturbation of the training data significantly helps the CNN model to obtain strong robustness.

5 Concluding remarks

The practicability of neural network-based state estimation from limited sensor measurements in fluid flows was investigated. We constructed estimation models utilizing convolutional neural networks and applied them to three types of unsteady laminar and turbulent flows which cover a wide range of spatial length scales associated with complex fluid flow phenomena. The models were able to estimate a target two-dimensional plane from the input measurements. From the viewpoint of practicability, we also investigated the physical quantities required for the input in the problems of the turbulent channel flow and the transitional boundary layer. For both cases, the wall shear stress contributed significantly to the estimation performance. The robustness of the models against the lacked input was further investigated towards state estimation from far fewer available sensors. We found that reasonable estimations can be achieved from lacked input measurements by utilizing cubic interpolation. Moreover, the possibility of utilizing several approaches for the models to gain more robustness against a lack of sensors was demonstrated.

We can consider several a posteriori applications of the present robust neural network-based fluid flow estimator. For example, it is expected that a machine learning-based estimator can help to control a flow by sensing and estimating the whole flow state (Brunton and Noack, 2015). In fact, the seminal work by Lee et al. (1997) used a shallow multi-layer perceptron to aid the opposition control (Choi et al., 1994) of the channel flow. In addition to this study, several reports have demonstrated the applicability of the aforementioned combination based on the concept of estimating a velocity field on the detection plane from wall measurements using a machine-learning model (Han and Huang, 2020; Park and Choi, 2020; Li et al., 2021). However, it is also true that there are several remaining issues, including the applicability of a model trained with uncontrolled cases to controlled flows in an online manner (Park and Choi, 2020) and the limited sensor availability in terms of both number and quality. We believe that the present investigation can directly address these issues from the perspective of robustness against noise and lack of sensors. As future work, the combination with data-driven optimal sensor placement (Manohar et al., 2018; Saito et al., 2020; Nakai et al., 2021; Morita et al., 2022) and digital twins (Rasheed et al., 2020) can also be considered.

Acknowledgments

We are grateful to Mr. Kai Fukami (UCLA) for fruitful discussion. This work was supported by JSPS KAKENHI Grant Numbers 18H03758 and 21H05007.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Declaration of interest

The authors report no conflict of interest.

CRediT Author contributions

Taichi Nakamura: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Data curation, Writing - Original draft preparation, Visualization. Koji Fukagata: Conceptualization, Formal analysis, Investigation, Resources, Writing - Review & Editing, Supervision, Project administration, Funding acquisition.

References

  • Adrian and Moin, (1988) Adrian, R. J. and Moin, P. (1988). Stochastic estimation of organized turbulent structure: homogeneous shear flow. J. Fluid Mech., 190:531–559.
  • Bai and Alam, (2018) Bai, H. and Alam, M. M. (2018). Dependence of square cylinder wake on Reynolds number. Phys. Fluids, 30(1):015102.
  • Bar-Shalom et al., (2004) Bar-Shalom, Y., Li, X. R., and Kirubarajan, T. (2004). Estimation with Applications to Tracking and Navigation: Theory Algorithms and Software. John Wiley & Sons.
  • Bayes, (1763) Bayes, T. (1763). An essay towards solving a problem in the doctrine of chances. by the late Rev. Mr. Bayes, FRS communicated by Mr. Price, in a letter to John Canton, AMFR S. Philos. Trans. R. Soc. Lond., B, Biol. Sci., 53:370–418.
  • Bewley, (2001) Bewley, T. R. (2001). Flow control: new challenges for a new renaissance. Prog. Aerosp. Sci., 37(1):21–58.
  • Bewley et al., (2001) Bewley, T. R., Moin, P., and Temam, R. (2001). DNS-based predictive control of turbulence: an optimal benchmark for feedback algorithms. J. Fluid Mech., 447:179–225.
  • Brenner et al., (2019) Brenner, M. P., Eldredge, J. D., and Freund, J. B. (2019). Perspective on machine learning for advancing fluid mechanics. Phys. Rev. Fluids, 4:100501.
  • (8) Brunton, S. L., Hemati, M. S., and Taira, K. (2020a). Special issue on machine learning and data-driven methods in fluid dynamics. Theor. Comput. Fluid Dyn., 34(4):333–337.
  • Brunton and Kutz, (2019) Brunton, S. L. and Kutz, J. N. (2019). Data-driven Science and Engineering: Machine Learning, Dynamical Systems, and Control. Cambridge University Press.
  • Brunton and Noack, (2015) Brunton, S. L. and Noack, B. R. (2015). Closed-loop turbulence control: Progress and challenges. Appl. Mech. Rev., 67(5).
  • (11) Brunton, S. L., Noack, B. R., and Koumoutsakos, P. (2020b). Machine learning for fluid mechanics. Annu. Rev. Fluid Mech., 52:477–508.
  • Bui-Thanh et al., (2004) Bui-Thanh, T., Damodaran, M., and Willcox, K. (2004). Aerodynamic data reconstruction and inverse design using proper orthogonal decomposition. AIAA J., 42(8).
  • Caltagirone, (1994) Caltagirone, J. P. (1994). Sur l’interaction fluide-milieu poreux: application au calcul des efforts excerses sur un obstacle par un fluide visqueux. C. R. Acad. Sci. Paris, 318:571–577.
  • Candès et al., (2006) Candès, E. J., Romberg, J., and Tao, T. (2006). Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inf. Theory, 52(2):489–509.
  • Chen, (2011) Chen, S. Y. (2011). Kalman filter for robot vision: a survey. IEEE Trans. Ind. Electron., 59(11):4409–4420.
  • Chevalier et al., (2006) Chevalier, M., Hœpffner, J., Bewley, T. R., and Henningson, D. S. (2006). State estimation in wall-bounded flow systems. part 2. turbulent flows. J. Fluid Mech., 552:167–187.
  • Choi et al., (1994) Choi, H., Moin, P., and Kim, J. (1994). Active turbulence control for drag reduction in wall-bounded flows. J. Fluid Mech., 262:75–110.
  • Colburn et al., (2011) Colburn, C. H., Cessna, J. B., and Bewley, T. R. (2011). State estimation in wall-bounded flow systems. part 3. the ensemble kalman filter. J. Fluid Mech., 682:289–303.
  • Cushman-Roisin and Beckers, (2011) Cushman-Roisin, B. and Beckers, J. M. (2011). Introduction to Geophysical Fluid Dynamics: Physical and Numerical Aspects. Academic press.
  • De Brabandere et al., (2006) De Brabandere, K., Loix, T., Engelen, K., Bolsens, B., Van den Keybus, J., Driesen, J., and Belmans, R. (2006). Design and operation of a phase-locked loop with kalman estimator-based filter for single-phase applications. In IECON 2006-32nd Annual Conference on IEEE Industrial Electronics, pages 525–530. IEEE.
  • Donoho, (2006) Donoho, D. L. (2006). Compressed sensing. IEEE Trans. Inf. Theory, 52(4):1289–1306.
  • Duraisamy, (2021) Duraisamy, K. (2021). Perspectives on machine learning-augmented Reynolds-averaged and large eddy simulation models of turbulence. Phys. Rev. Fluids, 6(5):050504.
  • Duraisamy et al., (2019) Duraisamy, K., Iaccarino, G., and Xiao, H. (2019). Turbulence modeling in the age of data. Annu. Rev. Fluid. Mech., 51:357–377.
  • Evensen, (2003) Evensen, G. (2003). The ensemble kalman filter: Theoretical formulation and practical implementation. Ocean Dyn., 53(4):343–367.
  • Everson and Sirovich, (1995) Everson, R. and Sirovich, L. (1995). Karhunen–Loeve procedure for gappy data. J. Opt. Soc. Am., 12(8):1657–1664.
  • Font et al., (2021) Font, B., Weymouth, G. D., Nguyen, V. T., and Tutty, O. R. (2021). Deep learning of the spanwise-averaged Navier–Stokes equations. J. Comput. Phys., 434:110199.
  • Fukagata et al., (2006) Fukagata, K., Kasagi, N., and Koumoutsakos, P. (2006). A theoretical prediction of friction drag reduction in turbulent flow by superhydrophobic surfaces. Phys. Fluids, 18:051703.
  • (28) Fukami, K., Fukagata, K., and Taira, K. (2019a). Super-resolution reconstruction of turbulent flows with machine learning. J. Fluid Mech., 870:106–120.
  • (29) Fukami, K., Fukagata, K., and Taira, K. (2020a). Assessment of supervised machine learning for fluid flows. Theor. Comput. Fluid Dyn., 34(4):497–519.
  • (30) Fukami, K., Fukagata, K., and Taira, K. (2021a). Machine-learning-based spatio-temporal super resolution reconstruction of turbulent flows. J. Fluid Mech., 909:A9.
  • (31) Fukami, K., Hasegawa, K., Nakamura, T., Morimoto, M., and Fukagata, K. (2021b). Model order reduction with neural networks: Application to laminar and turbulent flows. SN Comput. Sci., 2(6):1–16.
  • (32) Fukami, K., Maulik, R., Ramachandra, N., Fukagata, K., and Taira, K. (2021c). Global field reconstruction from sparse sensors with Voronoi tessellation-assisted deep learning. Nat. Mach. Intell., 3:945–951.
  • (33) Fukami, K., Murata, T., Zhang, K., and Fukagata, K. (2020b). Sparse identification of nonlinear dynamics with low-dimensionalized flow representations. J. Fluid Mech., 926:A10.
  • (34) Fukami, K., Nabae, Y., Kawai, K., and Fukagata, K. (2019b). Synthetic turbulent inflow generator using machine learning. Phys. Rev. Fluids, 4:064603.
  • (35) Fukami, K., Nakamura, T., and Fukagata, K. (2020c). Convolutional neural network based hierarchical autoencoder for nonlinear mode decomposition of fluid field data. Phys. Fluids, 32:095110.
  • Gauss, (1857) Gauss, C. F. (1857). Theory of the Motion of the Heavenly Bodies Moving about the Sun in Conic Sections: A Translation of Gauss’s" Theoria Motus." With an Appendix. Little, Brown.
  • Guastoni et al., (2020) Guastoni, L., Encinar, M. P., Schlatter, P., Azizpour, H., and Vinuesa, R. (2020). Prediction of wall-bounded turbulence from wall quantities using convolutional neural networks. In J. Phys. Conf. Ser., volume 1522, page 012022. IOP Publishing.
  • Güemes et al., (2019) Güemes, A., Discetti, S., and Ianiro, A. (2019). Sensing the turbulent large-scale motions with their wall signature. Phys. Fluids, 31(12):125112.
  • Güemes et al., (2021) Güemes, A., Tober, H., Discetti, S., Ianiro, A., Sirmacek, B., Azizpour, H., and Vinuesa, R. (2021). From coarse wall measurements to turbulent velocity fields with deep learning. Phys. Fluids, page 075121.
  • Han and Huang, (2020) Han, B.-Z. and Huang, W.-X. (2020). Active control for drag reduction of turbulent channel flow based on convolutional neural networks. Phys. Fluids, 32(9):095108.
  • (41) Hasegawa, K., Fukami, K., Murata, T., and Fukagata, K. (2020a). CNN-LSTM based reduced order modeling of two-dimensional unsteady flows around a circular cylinder at different Reynolds numbers. Fluid Dyn. Res., 52:065501.
  • (42) Hasegawa, K., Fukami, K., Murata, T., and Fukagata, K. (2020b). Machine-learning-based reduced-order modeling for unsteady flows around bluff bodies of various shapes. Theor. Comput. Fluid Dyn., 34(4):367–388.
  • Hecht-Nielsen, (1992) Hecht-Nielsen, R. (1992). Theory of the backpropagation neural network. In Neural Networks for Perception, pages 65–93. Elsevier.
  • Hoerl and Kennard, (1970) Hoerl, A. E. and Kennard, R. W. (1970). Ridge regression: Biased estimation for nonorthogonal problems. Technometrics, 12(1):55–67.
  • Hoshiya and Saito, (1984) Hoshiya, M. and Saito, E. (1984). Structural identification by extended Kalman filter. J. Eng. Mech., 110(12):1757–1770.
  • Jiménez and Moin, (1991) Jiménez, J. and Moin, P. (1991). The minimal flow unit in near-wall turbulence. J. Fluid Mech., 225:213–240.
  • Kalman, (1960) Kalman, R. (1960). Contributions to the theory of optimal control. Bol. Soc. Mat. Mexicana, 5(2):102–119.
  • Kim et al., (2021) Kim, H., Kim, J., Won, S., and Lee, C. (2021). Unsupervised deep learning for super-resolution reconstruction of turbulence. J. Fluid Mech., 910:A29.
  • Kim and Lee, (2020) Kim, J. and Lee, C. (2020). Prediction of turbulent heat transfer using convolutional neural networks. J. Fluid Mech., 882:A18.
  • Kingma and Ba, (2014) Kingma, D. P. and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv:1412.6980.
  • Kobayashi et al., (2021) Kobayashi, W., Shimura, T., Mitsuishi, A., Iwamoto, K., and Murata, A. (2021). Prediction of the drag reduction effect of pulsating pipe flow based on machine learning. Int. J. Heat Fluid Flow, 88:108783.
  • Lapeyre et al., (2019) Lapeyre, C. J., Misdariis, A., Cazard, N., Veynante, D., and Poinsot, T. (2019). Training convolutional neural networks to estimate turbulent sub-grid scale reaction rates. Combust. Flame, 203:255–264.
  • LeCun et al., (1998) LeCun, Y., Bottou, L., Bengio, Y., and Haffner, P. (1998). Gradient-based learning applied to document recognition. Proc. IEEE, 86(11):2278–2324.
  • Lee et al., (1997) Lee, C., Kim, J., Babcock, D., and Goodman, R. (1997). Application of neural networks to turbulence control for drag reduction. Phys. Fluids, 9(6):1740–1747.
  • Li et al., (2008) Li, Y., Perlman, E., Wan, M., Yang, Y., Meneveau, C., Burns, R., Chen, S., Szalay, A., and Eyink, G. (2008). A public turbulence database cluster and applications to study Lagrangian evolution of velocity increments in turbulence. J. Turbul., 9:N31.
  • Li et al., (2021) Li, Z., Dang, X., Lv, P., and Duan, H. (2021). Blowing-only opposition control: Characteristics of turbulent drag reduction and implementation by deep learning. AIP Adv., 11(3):035016.
  • Manohar et al., (2018) Manohar, K., Brunton, B. W., Kutz, J., and Brunton, S. L. (2018). Data-driven sparse sensor placement for reconstruction: Demonstrating the benefits of exploiting known patterns. IEEE Control Syst., 38(3):63–86.
  • Matsuo et al., (2021) Matsuo, M., Nakamura, T., Morimoto, M., Fukami, K., and Fukagata, K. (2021). Supervised convolutional network for three-dimensional fluid data reconstruction from sectional flow fields with adaptive super-resolution assistance. arXiv:2103.09020.
  • Maulik et al., (2020) Maulik, R., Fukami, K., Ramachandra, N., Fukagata, K., and Taira, K. (2020). Probabilistic neural networks for fluid flow surrogate modeling and data recovery. Phys. Rev. Fluids, 5:104401.
  • Maulik et al., (2021) Maulik, R., Lusch, B., and Balaprakash, P. (2021). Reduced-order modeling of advection-dominated systems with recurrent neural networks and convolutional autoencoders. Phys. Fluids, 33(3):037106.
  • Mitsantisuk et al., (2011) Mitsantisuk, C., Ohishi, K., and Katsura, S. (2011). Estimation of action/reaction forces for the bilateral control using Kalman filter. IEEE Trans. Ind. Electron., 59(11):4383–4393.
  • Moreno and Pigazo, (2009) Moreno, V. M. and Pigazo, A. (2009). Kalman Filter: Recent Advances and Applications. BoD–Books on Demand.
  • Morimoto et al., (2021a) Morimoto, M., Fukami, K., and Fukagata, K. (2021a). Experimental velocity data estimation for imperfect particle images using machine learning. Phys. Fluids, 33(8):087121.
  • Morimoto et al., (2022) Morimoto, M., Fukami, K., Zhang, K., and Fukagata, K. (2022). Generalization techniques of neural networks for fluid flow estimation. Neural Comput. Appl., 34:3647–3669.
  • Morimoto et al., (2021b) Morimoto, M., Fukami, K., Zhang, K., Nair, A. G., and Fukagata, K. (2021b). Convolutional neural networks for fluid flow analysis: toward effective metamodeling and low-dimensionalization. Theor. Comput. Fluid Dyn., 35:633–658.
  • Morita et al., (2022) Morita, Y., Rezaeiravesh, S., Tabatabaei, N., Vinuesa, R., Fukagata, K., and Schlatter, P. (2022). Applying Bayesian optimization with Gaussian process regression to computational fluid dynamics problems. J. Comput. Phys., 449:110788.
  • Murata et al., (2020) Murata, T., Fukami, K., and Fukagata, K. (2020). Nonlinear mode decomposition with convolutional neural networks for fluid dynamics. J. Fluid Mech., 882:A13.
  • Nair and Hinton, (2010) Nair, V. and Hinton, G. E. (2010). Rectified linear units improve restricted Boltzmann machines. Proc. Int. Conf. Mach. Learn., pages 807–814.
  • Nakai et al., (2021) Nakai, K., Yamada, K., Nagata, T., Saito, Y., and Nonomura, T. (2021). Effect of objective function on data-driven greedy sparse sensor optimization. IEEE Access, 9:46731–46743.
  • Nakamura et al., (2022) Nakamura, T., Fukami, K., and Fukagata, K. (2022). Identifying key differences between linear stochastic estimation and neural networks for fluid flow regressions. Sci. Rep., 12:3726.
  • Nakamura et al., (2021) Nakamura, T., Fukami, K., Hasegawa, K., Nabae, Y., and Fukagata, K. (2021). Convolutional neural network and long short-term memory based reduced order surrogate for minimal turbulent channel flow. Phys. Fluids, 33:025116.
  • Park and Choi, (2020) Park, J. and Choi, H. (2020). Machine-learning-based feedback control for drag reduction in a turbulent channel flow. J. Fluid Mech., 904:A24.
  • Pawar et al., (2020) Pawar, S., San, O., Rasheed, A., and Vedula, P. (2020). A priori analysis on deep learning of subgrid-scale parameterizations for Kraichnan turbulence. Theor. Comput. Fluid Dyn., 34(4):429–455.
  • Perlman et al., (2007) Perlman, E., Burns, R., Li, Y., and Meneveau, C. (2007). Data exploration of turbulence simulations using a database cluster. In Proceedings of the 2007 ACM/IEEE Conference on Supercomputing, pages 1–11.
  • Rabault et al., (2019) Rabault, J., Kuchta, M., Jensen, A., Réglade, U., and Cerardi, N. (2019). Artificial neural networks trained through deep reinforcement learning discover control strategies for active flow control. J. Fluid Mech., 865:281–302.
  • Rasheed et al., (2020) Rasheed, A., San, O., and Kvamsdal, T. (2020). Digital twin: Values, challenges and enablers from a modeling perspective. IEEE Access, 8:21980–22012.
  • Saito et al., (2020) Saito, Y., Nonomura, T., Nankai, K., Yamada, K., Asai, K., Sasaki, Y., and Tsubakino, D. (2020). Data-driven vector-measurement-sensor selection based on greedy algorithm. IEEE Sensors Letters, 4(7):7002604.
  • Sapsis, (2020) Sapsis, T. P. (2020). Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples. Proc. Roy. Soc. A, 476(2234):20190834.
  • Scherl et al., (2020) Scherl, I., Storm, B., Shang, J. K., Williams, O., Polagye, B. L., and Brunton, S. L. (2020). Robust principal component analysis for modal decomposition of corrupt fluid flows. Phys. Rev. Fluids, 5:054401.
  • Schölkopf et al., (2002) Schölkopf, B., Smola, A. J., Bach, F., et al. (2002). Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT press.
  • Shorten and Khoshgoftaar, (2019) Shorten, C. and Khoshgoftaar, T. M. (2019). A survey on image data augmentation for deep learning. J. Big Data, 6(1):1–48.
  • Simon, (2006) Simon, D. (2006). Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches. John Wiley & Sons.
  • Sorenson, (1970) Sorenson, H. W. (1970). Least-squares estimation: from Gauss to Kalman. IEEE Spectr., 7(7):63–68.
  • Srivastava et al., (2014) Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R. (2014). Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res., 15(1):1929–1958.
  • Suzuki and Hasegawa, (2006) Suzuki, T. and Hasegawa, Y. (2006). Estimation of turbulent channel flow at ${\rm Re}_{\tau}=100$ based on the wall measurement using a simple sequential approach. J. Fluid Mech., 830:760–796.
  • Thuerey et al., (2020) Thuerey, N., Weißenow, K., Prantl, L., and Hu, X. (2020). Deep learning methods for Reynolds-averaged Navier–Stokes simulations of airfoil flows. AIAA J., 58(1):25–36.
  • Tibshirani, (1996) Tibshirani, R. (1996). Regression shrinkage and selection via the Lasso. J. R. Stat. Soc. Series B Stat. Methodol., 58(1):267–288.
  • Voronoi, (1908) Voronoi, G. (1908). New applications of continuous parameters to the theory of quadratic forms. J. Reine Angew. Math., 134:198.
  • Wan and Van Der Merwe, (2000) Wan, E. A. and Van Der Merwe, R. (2000). The unscented Kalman filter for nonlinear estimation. In Proceedings of the IEEE 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium (Cat. No. 00EX373), pages 153–158. IEEE.
  • Willcox, (2006) Willcox, K. (2006). Unsteady flow sensing and estimation via the gappy proper orthogonal decomposition. Comput. Fluids, 35(2):208–226.
  • Wunsch and Heimbach, (2007) Wunsch, C. and Heimbach, P. (2007). Practical global oceanic state estimation. Physica D: Nonlinear Phenomena, 230(1-2):197–208.
  • Zaki, (2013) Zaki, T. A. (2013). From streaks to spots and on to turbulence: exploring the dynamics of boundary layer transition. Flow Turbul. Combust., 91(3):451–473.
  • Zhang et al., (1995) Zhang, D., Chew, Y. T., and Winoto, S. H. (1995). A proposed intermittency measurement method for transitional boundary layer flows. Exp. Fluids, 19(6):426–428.