Data-Driven Discovery of a New Ginzburg-Landau
Reduced-Order Model for Vortex Shedding

Joseph J. Williams^1∗, Zachary G. Nicolaou¹, J. Nathan Kutz^1,2, Steven L. Brunton³
¹ Department of Applied Mathematics, University of Washington, Seattle, WA 98195, United States
² Department of Electrical and Computer Engineering, University of Washington, Seattle, WA 98195, United States
³ Department of Mechanical Engineering, University of Washington, Seattle, WA 98195, United States

Abstract

Vortex shedding is an important physical phenomenon observed across many spatial and temporal scales in fluids. Previous experimental and theoretical studies have established a hierarchy of local and global reduced-order models for vortex shedding based on the Stuart-Landau and Ginzburg-Landau equations. In this work, we employ data-driven methods to discover a new Landau variable for the complex Ginzburg-Landau equation (CGLE) for periodic two-dimensional vortex shedding past a cylinder. We first coarse grain vorticity field data from direct numerical simulations of the fluid by integrating over the vertical spatial dimension, and then time-delay embed this new variable to obtain phase information necessary for CGLE analysis. Finally, we use the sparse identification of nonlinear dynamics (SINDy) to learn an interpretable system of coupled real partial differential equations that balance model complexity and prediction error. We develop local homogeneous models in key regions of the wake, as well as global heterogeneous models that are parameterized by the streamwise coordinate across the entire wake. After computing the normal-form parameters from the discovered equations, we classify the different models and determine behavior and stability in the different regions of the wake, demonstrating the validity of these models across a range of spatiotemporal subdomains. Different dynamics are found to dominate at different stations within the wake. In addition to introducing a novel Landau variable for vortex shedding, this work may also help characterize other vortex shedding systems and inform the design of control schemes.

^†^†^∗ Corresponding author ([email protected]).

1 Introduction

Vortices are generally shed behind solid bodies moving through a fluid as a result of the Bénard-von Karman instability [1, 2]. Vortex shedding produces low-dimensional patterns of activity that are important in engineering applications, as the low-pressure vortices result in oscillating forces on the immersed bodies. These forces can, for example, excite instabilities in flexible structures like towers and bridges, leading to undesirable behavior, fatigue, and potentially failure [1, 3, 4, 5]. Consequently, many control strategies have been proposed to suppress vortex formation, including modifications to the bluff body [6, 7, 8], placement of a secondary control cylinder close to the main body [9], and rotation of the main body [10].

The mathematical modeling of vortex shedding begins with the Navier-Stokes equations, which are a set of coupled, nonlinear, partial differential equations. The onset of vortex shedding represents a transition from steady to oscillatory flow; this presents a case where the Navier-Stokes equations undergo a Hopf bifurcation, leading to a study of bifurcation theory and stability analysis [11, 12, 13, 14]. For such modeling and control purposes, the Stuart-Landau equation (SLE) and complex Ginzburg-Landau equation (CGLE) often serve as important reduced-order models (ROMs). These ROMs help determine, for example, the frequency and growth or decay rate of vortices [15, 16] and the optimal sensor and actuator placements for measurement, reconstruction, and control [17, 18, 19]. The SLE and CGLE derive from universal oscillatory instabilities in general complex and pattern-forming systems, and they serve as ROMs for other fluid instabilities, including Rayleigh-Benard convection and Taylor-Couette flow, as well as phenomena from fields as diverse as biology, chemistry, solid-state physics, and optics [20, 21, 22, 23].

Reduced-order models extract the dominant features of the original system while dramatically reducing the computational cost, enabling downstream tasks such as design, optimization, and control. ROMs are particularly important in fluid dynamics, which involves complex, multiscale, and nonlinear behaviors that require significant computational resources to properly resolve [24, 25, 26]. For example, the Landau equations [11, 13] capture the temporal evolution of the vortex shedding instability with a simpler differential equation in a reduced number of dimensions. Many ROMs have been classically derived from the governing equations [27, 28], and § 2 will discuss the asymptotic analysis and derivation of the SLE and CGLE from the Navier-Stokes equations in the case of vortex shedding past a cylinder. In contrast, data-driven methods and machine learning encompass a broad range of techniques that leverage data to develop ROMs [29, 24, 30].

One common data-driven method to study dynamics is proper orthogonal decomposition (POD), a projection method which reveals the energy associated with each mode of the data [30, 31]. The distribution of energy among modes reveals which flow structures dominate the overall system behavior. POD was originally developed by Lumley to study turbulence [32], and POD lends itself well to other fluid dynamic data particularly where the behavior is periodic, such as vortex shedding past a cylinder [27] or a mixing layer [29]. POD is also a starting point for the development of more sophisticated data-driven methods, including dynamic mode decomposition (DMD), which identifies spatial modes and their corresponding time dynamics, and Koopman operator theory, which provides a linear framework to study nonlinear systems [24, 25, 30, 31]. Specific methods for particular classes of models have been developed, such as the operator inference method for models with complex non-polynomial nonlinearities [31, 33]. Finally, machine learning techniques such as auto-encoders and neural networks have been more recently developed for building data-driven ROMs of fluid dynamics systems [34, 35].

Across all domains, interpretable ROMs are desired. POD modes are interpretable as maximizing energy content; however, other data-driven methods may provide accurate predictions but less interpretable models [30]. Neural networks, for example, depend on the many hidden variables of the inner layers, which may have limited physical meaning. In this work, we employ the sparse identification of nonlinear dynamics (SINDy) [36, 37], which learns sparse and interpretable nonlinear ROMs directly from data. Loiseau et al. [38] and Callaham et al. [17] both use SINDy to learn models capable of flowfield reconstruction and prediction from limited flowfield measurements. SINDy has also been used in turbulence modeling, for example to develop Reynolds-stress models for turbulent RANS closure [39, 40] and for eddy behavior and unresolved turbulent processes in ocean dynamics [41].

Prior to using SINDy, important data post-processing steps must be taken; in this work, we post-process our data by coarse-graining [42, 43, 44] and time-delay embedding [45, 46]. Coarse-graining includes a variety of methods from different fields, including mean-flow analysis in fluid dynamics [43], large-eddy simulation (LES) in computational fluid dynamics [47, 48], methods in molecular dynamics and other particle methods [42, 44], and methods in computational biology and chemistry [49, 50]. In these applications, coarse-graining is achieved by, for example, averaging across time or samples [42, 43], averaging across, simplifying, or removing a dimension or area [44, 47, 48], or reducing resolution at smaller scales [44, 49, 50]. Once the data is coarse-grained, a lower-dimensional, real-valued signal may be used for model learning. However, to model the complex phase behavior in CGLE, we must augment the coarse-grained signal via time-delay embedding. Time-delay embedding allows us to reconstruct the underlying phase space of a system from only a single time-series measurement $x(t)$ [45]. With an appropriate choice $\tau$ of how much to lag the time-series measurement, we may generate one or more additional vectors of delayed observations $[x(t+\tau),x(t+2\tau)...]$ , embedding a one-dimensional time series into a higher-dimensional space. Originally developed to analyze turbulence, time-delay embedding has been used in a wide range of topics such as health, ecology, finance, and other physical and mathematical systems [46, 51, 52, 53, 54].

Our method will be demonstrated by finding a novel Landau equation for the canonical vortex shedding past a cylinder. The SLE and CGLE have been widely used as ROMs for cylinder vortex shedding, in both two-dimensional and three-dimensional domains [11, 12, 13, 14, 27]. Previous work has only linked the one-dimensional CGLE to three-dimensional vortex shedding past a cylinder, using the spanwise $z-$ coordinate as the spatial coordinate of the CLGE [13, 14, 55]. There have been limited previous attempts at incorporating the streamwise $x-$ coordinate into a model for vortex shedding, mainly through spatial variation of the model coefficients [56, 57].

In this work, we present a modern analysis of a classic problem by developing a data-driven one-dimensional CGLE model for two-dimensional vortex shedding. After coarse-graining and time-delay embedding our flowfield data, we use SINDy to find coefficients of a Ginzburg-Landau model that best captures the observed dynamics. We first develop an inhomogeneous CGLE in a near-wake subdomain, aiming to characterize the local dynamics in the wavemaker region [19, 58]. We then investigate the spatial dependence of the generated model to determine how the local models vary downstream of the cylinder. In so doing, we show for the first time a link between two-dimensional vortex shedding flow and the one-dimensional CGLE through the streamwise coordinate. This method can easily be extended to more sophisticated problems, elucidating more complex bifurcations in systems such as a rotating cylinder [59] or the fluidic pinball [60].

This paper is organized as follows: In § 2, we will further discuss the various analytic, numerical, and experimental work on cylinder vortex shedding, including its relationship to the SLE and CGLE. In § 3, we detail the data and methods used in our work, including the fluid dynamic simulations of vortex shedding and our methods of coarse-graining, time delay-embedding, and model discovery. In § 4, we show that this method generates a pair of sparse, interpretable PDEs in the form of the CGLE. By tuning the spatial domain over which we develop our model, we highlight differences in the behavior of local models for the near-, mid-, and far-wakes. We find that the stability properties of the local models vary smoothly downstream, with a variety of transitions between behaviors before dissipative dynamics ultimately dominate the behavior. In $\S~{}5$ we offer concluding remarks about our CGLE models and the vortex shedding phenomenon.

2 Background

The dynamics of vortex shedding can be explained and justified by directly linking it to the Stuart-Landau equation (SLE) and the Hopf bifurcation. We discuss the wide array of experimental, numerical, and analytic work undertaken to explore the connection between the SLE and vortex shedding [2, 11, 27, 61] and explain how the SLE, to leading order, serves as ROM for vortex shedding past a cylinder. Then we discuss the spatial extension of the SLE, the complex Ginzburg-Landau equation (CGLE), a partial differential equation which may be thought of as a coupled field of Landau oscillators. The CGLE has also been linked to the phenomenon of vortex shedding, but the addition of a new spatial variable allows for more variety in analyses [13, 57, 62]. This section explores the history and context of this problem before presenting in $\S~{}3$ our own novel contribution of a one-dimensional CGLE as a ROM for two-dimensional vortex shedding past a cylinder.

2.1 Vortex Shedding and the Stuart-Landau Equation

The simplest case of vortex shedding over a cylinder can be modeled by the incompressible, two-dimensional Navier-Stokes equations, together with mass continuity:


$\displaystyle u_{x}+v_{y}$	$\displaystyle=0,$	(1a)
$\displaystyle u_{t}+uu_{x}+vu_{y}$	$\displaystyle=-\frac{1}{\rho}p_{x}+\nu(u_{xx}+u_{yy}),$	(1b)
$\displaystyle v_{t}+uv_{x}+vv_{y}$	$\displaystyle=-\frac{1}{\rho}p_{y}+\nu(v_{xx}+v_{yy}),$	(1c)

where $\rho$ is the fluid’s density, $\nu$ is the fluid’s kinematic viscosity, and the state variables are $(\textbf{u},p)=(u,v,p)$ , the fluid’s horizontal and vertical velocities and pressure at every point in spacetime. At the fluid-body interface on the cylinder’s surface, the boundary conditions are the no-slip condition on the surface of the body, $\textbf{u}=0$ on $\Omega$ . The nondimensional parameter Reynolds number, $\text{Re}={U_{\infty}L}/{\nu}$ , characterizes the dynamics, where $U_{\infty}$ is the freestream velocity of the flow at the inlet before interacting with the body and $L$ is a characteristic length scale, in this case the diameter of the cylinder. The behavior of the system is then parameterized by Re and displays a wide variety of behaviors from creeping flow at low Re to fully turbulent flow at high Re [1]. Relevant for this work, for $\text{Re}\lesssim 47$ , the solution $(u(x,y,t),v(x,y,t),p(x,y,t))$ is stable and steady, and presents as a long, narrow tail behind the cylinder that grows with increasing Re [2]. Once Re increases past a critical value $\text{Re}_{c}\approx 47$ , the wake begins oscillating as discrete vortices begin to break free and are advected downstream [2]. Eventually the behavior saturates onto a limit cycle – the periodic vortex shedding. Figure 1 (left) shows the saturated flowfield at values of Re below, at, and above the critical threshold.

Refer to caption — Figure 1: Left: Snapshots colored by vorticity $\omega(x,y;t)$ of fully saturated flowfields, from two-dimensional cylinder flow at: (a) $\text{Re}=40$ ; (b) $\text{Re}=50$ ; and (c) $\text{Re}=100$ . The critical value at which vortex shedding begins is $\text{Re}_{c}\approx 47$ , hence these flowfields represent systems below, in the vicinity of, and above the critical threshold. Middle: The phase space $(\mathbf{Re}(A),\mathbf{Im}(A))$ of a numerical solution of the Stuart-Landau equation (Eq. 2), parameterized by $\lambda_{r}$ (with $\mu_{r}<0$ ). The behavior of the SLE depends on the value of the control parameter $\lambda_{r}$ , spanning from stationary ( $\lambda_{r}<0$ ) to oscillatory ( $\lambda_{r}>0$ ) solutions. In phase space, saturated oscillatory solutions with a given $\lambda_{r}$ trace out a circular path with a radius that increases with increasing $\lambda_{r}$ . Initial conditions that start beyond this radius (such as for solutions (a) and (b)) decay to their fixed point (when $\lambda_{r}<0$ ) or their limit cycle (when $\lambda_{r}>0$ ), while initial conditions that start at 0 (such as for solution (c)) grow and saturate on their limit cycle (when $\lambda_{r}>0$ ). Given that Re is the control parameter for vortex shedding, the behavior of the flowfields (a), (b), and (c) are analogous to the respective labeled points in the SLE phase space. Right: The first two oscillatory POD modes of the vorticity flowfield at $\text{Re}=50$ (b). These critical eigenvectors are the most critical spatial modes of the system and represent the oscillatory behavior of the solution. They are associated with the critical eigenvalues whose real parts becoming positive cause linear instability in the system, which is later stabilized by nonlinear effects and higher-order modes. As established by Noack et al. [27], the amplitudes of these critical modes have Stuart-Landau dynamics, hence the pair of POD modes are analogous to the real and imaginary components of the SLE amplitude.

This behavior of transitioning from steady flow to periodic flow as Re increases past $\text{Re}_{c}$ is a hallmark of a supercritical Hopf bifurcation [63]. A Hopf bifurcation occurs in a system of differential equations when the real part of a pair of complex eigenvalues (the critical eigenvalues) switches from negative to positive as a critical parameter increases past its critical value [63]. The real part of the critical eigenvalues determines the linear growth rate, so a positive real part indicates that even infinitesimal perturbations will grow in amplitude and the steady state of the system is unstable [63]. In fluid flows, such perturbations may arise as small fluctuations in the velocity or pressure fields, for example due to physically imperfect boundaries or numerical round-off errors [2, 11, 12]. In a supercritical Hopf bifurcation, the growth of the perturbation is countered by nonlinear effects, which ultimately causes the amplitude of the perturbation to saturate onto a stable limit cycle.

The competing influence of the linear and nonlinear effects is captured by the Stuart-Landau equation (SLE) [64, 65],

\frac{d}{dt}A=\lambda A-\mu|A|^{2}A,

(2)

where $\lambda=\lambda_{r}+\mathbf{i}\lambda_{i}$ and $\mu=\mu_{r}+\mathbf{i}\mu_{i}$ are complex parameters. $A$ is known as the Landau variable, or order parameter [20], and in the vicinity of the bifurcation it represents the complex amplitude of the perturbation of the system away from the unstable equilibrium. Given that the perturbation starts with small amplitude, $A>|A|^{2}A$ for small enough $t$ and the linear term will dominate the early-time behavior of the solution. Positive $\lambda_{r}$ leads to exponential growth (i.e. linear instability) and $\lambda_{i}\neq 0$ causes oscillations. Once $A$ grows large enough, then the nonlinear terms begin to influence the solution in a significant way. When $\mu_{r}<0$ , the nonlinear terms will counteract the exponential growth of the linear terms, resulting in a stable oscillatory solution. When $\mu_{r}$ is also positive, however, then the nonlinear terms further the growth of $A$ . This is the subcritical case, where perturbations will grow to large amplitude and no nearby stable limit cycle exists [63]. Figure 1 (middle) shows the phase space of the SLE $(\mathbf{Re}(A),\mathbf{Im}(A))$ parameterized by the critical parameter $\lambda_{r}$ , with $\mu_{r}<0$ . The different flowfields (left) are linked to different behaviors in phase space depending on the value of the control parameter.

For a general nonlinear system, the SLE can be derived by expanding the dynamics for the amplitude of the critical eigenvectors (associated with the critical eigenvalues) in a Taylor series. Then to leading order, the supercritical Hopf bifurcation is modeled in the normal form by the SLE. The components of the complex amplitude are then the most critical spatial modes of the system, dominating the saturated oscillatory dynamics. These modes can be found, for example, via POD [27]. In the specific case of vortex shedding past a cylinder, much experimental, numerical, and analytic work has been done to show and justify the SLE as a ROM of vortex shedding [2, 11, 12, 27, 61]. Early investigations focused on the wake of a three-dimensional cylinder, using experimental fluid flow measurements [2, 11, 16, 66] and numerical simulations [12] to derive effective coefficients for the SLE governing the local oscillations at some point in the wake.

Noack et al. [27] developed a hierarchy of ROMs for the transient and steady cylinder wake using POD and Galerkin projection, first finding the dominant modes of the data, and then projecting the original data onto this lower-order subspace. The POD modes of flowfield data are the most critical spatial modes and represent the dominant oscillatory dynamics. Subsequent modes contribute progressively less energy to the overall dynamics, and higher modes capture nonlinear interactions and less dominant features [27, 30]. The models presented by Noack et al. capture both the saturated and transient behavior of vortex shedding. Their low-order model achieves this by incorporating a “shift” mode to account for the underlying unstable steady flow in addition to incorporating the critical modes capturing the dominant oscillatory behavior. Higher-order models include higher-order effects from the less-dominant modes, increasing the accuracy at the cost of additional computations and model complexity [27]. The critical POD modes for the flowfield from $\text{Re}=50$ can be found in Figure 1 (right). The POD modes, being coupled, display similar but offset periodic patterns. The work of Noack et al., along with the rest discussed in this section, establishes that the amplitudes of these modes can be effectively modeled by the SLE [11, 27, 61].

In [43], Barkley studies the time-averaged mean flow of the cylinder wake and undertakes a two-dimensional linear stability analysis of the base and mean flows for increasing Re. Contrasting the base and mean flows, he confirms via the leading eigenvalues of the base flow that the onset of oscillation corresponds to a Hopf bifurcation in which the growth rate crosses zero at $\text{Re}_{c}$ , whereas the growth rate of the mean flow remains zero. Sipp and Lebedev [61] expand upon Barkley’s mean flow analysis by undertaking a global weakly nonlinear multiple scales analysis of the two-dimensional Navier-Stokes equations, developing a fast time coordinate $t_{1}=\epsilon t$ . From this, they derive the SLE with the Landau variable as the slow time behavior of the critical global mode. Although the purely temporal SLE does not incorporate a spatial coordinate directly like the spatiotemporal CGLE does, their weakly nonlinear analysis is global in nature and provides the coefficients of the SLE governing the local oscillator at any point in space [61].

2.2 The Complex Ginzburg-Landau Equation

Although the SLE is a purely temporal ordinary differential equation, it was found that the measured coefficients varied depending on spatial location within the wake [2, 11], hinting at deeper spatial dependence [61]. It was later found that the values of the coefficients of the SLE vary depending both on measurements’ spatial locations within the wake and very strongly on end effects from finite cylinder length, and hence a more complete model was sought in the spatiotemporal one-dimensional complex Ginzburg-Landau equation (CGLE) [13, 14, 55, 67],

\frac{\partial}{\partial t}A=\lambda A-\mu|A|^{2}A+\kappa\frac{\partial}{\partial x}A+\gamma\frac{\partial^{2}}{\partial x^{2}}A,

(3)

where in general $A$ and all parameters are again complex. The CGLE is a partial differential equation (PDE), whose solution varies with both time and space. The CGLE may be thought of as an advective and diffusive coupling of an infinite field of Stuart-Landau oscillators. It is derived under the assumption of a long wavelength, supercritical Hopf instability in a homogeneous medium. However, the translational invariance and symmetry assumptions of the CGLE are not satisfied by the vortex shedding system [23], hence significant effort has been made in establishing a connection between vortex shedding and the CGLE.

In first proposing the CGLE as a ROM of vortex shedding past a cylinder, the one-dimensional CGLE was proposed with its spatial coordinate corresponding to the $z$ -coordinate of the flowfield i.e. the spanwise coordinate of the cylinder; various boundary conditions were imposed to model different experimental setups [13, 14]. With coefficients derived from measurements of three-dimensional cylinder wake flows, simulations of the CGLE accurately recreated many of the patterns and regimes of behavior observed in three-dimensional cylinder flow, such as chevrons [62]. Some works also attempted to incorporate the streamwise- as well as the spanwise-coordinate into a two-dimensional CGLE for the three-dimensional cylinder wake, mainly in the form of spatially varying coefficients and under certain assumptions such as slow variations in the streamwise direction [56, 57]. These works expand on the applicability of the CGLE as a ROM for three-dimensional vortex shedding past a cylinder and further motivate the incorporation of streamwise dependence.

Analytically, via weakly nonlinear asymptotic expansion, Le Dizes et al. [68] deduced the one-dimensional CGLE from the Navier-Stokes equations as the equation governing the amplitude of the critical global mode, again in the spanwise direction. Noack [15] used a low-dimensional Galerkin method to analyze the stability of both steady and periodic cylinder wakes, finding that both the onset of periodicity in two dimensions at $\text{Re}_{c}$ and the onset of periodicity in three dimensions at $\text{Re}_{c,2}\approx 170$ are supercritical Hopf bifurcations modeled by the Landau equation. Zuccoli et al. [21] also used a multiple scales analysis on the Navier-Stokes equations for Taylor-Couette flow, using both slow time and space scales to derive a form of the one-dimensional CGLE, with the spatial coordinate as the spanwise coordinate and the Landau variable as the amplitude of a perturbation.

The discussion here and in the previous subsection firmly establishes that the Bénard-von Karman instability is characterized by the Hopf bifurcation, justifying the SLE and CGLE as ROMs of vortex shedding. From this, a hierarchy of ROMs for vortex shedding past a cylinder emerges, from an SLE description of the two-dimensional wake to local SLE and global CGLE descriptions of the three-dimensional wake. Thus the SLE has been rigorously linked to two-dimensional and three-dimensional vortex shedding, but the CGLE has only been linked to three-dimensional vortex shedding, mainly through the spanwise $z$ -coordinate. Our contribution to this hierarchy of ROMs will be the data-driven discovery and development of a one-dimensional CGLE for the two-dimensional cylinder wake; by incorporating the streamwise $x$ -coordinate into a novel Landau variable, we will answer whether streamwise dependence could be included in a CGLE ROM for vortex shedding.

More generally, ROMS capture the dominant features of the flow in a simpler mathematical framework. Modeling and solving the full Navier-Stokes equations, even with simplifications such as two-dimensionality and incompressibility as in Eqns. (1a)–(1c), is significantly more involved and expensive than modeling the SLE or CGLE. With reduced-order modeling, we capture the essential features of vortex shedding with a single complex variable rather than with a solution to the Navier-Stokes equations, and a better understanding of ROMs may lead to a better understanding of the original phenomenon

2.3 Properties of the Stuart- and Ginzburg-Landau Equations

The CGLE is a complex partial differential equation. From Eqn. (3), the CGLE can be expanded into a system of two coupled, real differential equations; the two equations in the system describe the dynamics of the real and imaginary parts of the complex amplitude. We perform this derivation, first for clarity expanding the complex terms:

\frac{\partial(A_{r}+\mathbf{i}A_{i})}{\partial t}=(\lambda_{r}+\mathbf{i}\lambda_{i})(A_{r}+\mathbf{i}A_{i})-(\mu_{r}+\mathbf{i}\mu_{i})|A_{r}+\mathbf{i}A_{i}|^{2}(A_{r}+\mathbf{i}A_{i})+(\kappa+\mathbf{i}\kappa)\frac{\partial(A_{r}+\mathbf{i}A_{i})}{\partial x}+(\gamma_{r}+\mathbf{i}\gamma_{i})\frac{\partial^{2}(A_{r}+\mathbf{i}A_{i})}{\partial x^{2}}.

(4)

Then, expanding and separating the real and imaginary parts of the equation and relabeling the complex amplitude as $A=\omega+\mathbf{i}\eta$ , we obtain:


	$\displaystyle A_{r}=\omega\;\;\Rightarrow\;\;\frac{\partial\omega}{\partial t}=\lambda_{r}\omega-\lambda_{i}\eta-(\omega^{2}+\eta^{2})(\mu_{r}\omega-\mu_{i}\eta)+\kappa_{r}\omega_{x}-\kappa_{i}\eta_{x}+\gamma_{r}\omega_{xx}-\gamma_{i}\eta_{xx},$		(5a)
	$\displaystyle A_{i}=\eta\;\;\Rightarrow\;\;\frac{\partial\eta}{\partial t}=\lambda_{i}\omega+\lambda_{r}\eta-(\omega^{2}+\eta^{2})(\mu_{i}\omega+\mu_{r}\eta)+\kappa_{i}\omega_{x}+\kappa_{r}\eta_{x}+\gamma_{i}\omega_{xx}+\gamma_{r}\eta_{xx}.$		(5b)

Finally, we expand the nonlinear terms:


	$\displaystyle\Rightarrow\;\;\frac{\partial\omega}{\partial t}=\lambda_{r}\omega-\lambda_{i}\eta-\mu_{r}\omega^{3}+\mu_{i}\omega^{2}\eta-\mu_{r}\omega\eta^{2}+\mu_{i}\eta^{3}+\kappa_{r}\omega_{x}-\kappa_{i}\eta_{x}+\gamma_{r}\omega_{xx}-\gamma_{i}\eta_{xx},$		(6a)
	$\displaystyle\Rightarrow\;\;\frac{\partial\eta}{\partial t}=\lambda_{i}\omega+\lambda_{r}\eta-\mu_{i}\omega^{3}-\mu_{r}\omega^{2}\eta-\mu_{i}\omega\eta^{2}-\mu_{r}\eta^{3}+\kappa_{i}\omega_{x}+\kappa_{r}\eta_{x}+\gamma_{i}\omega_{xx}+\gamma_{r}\eta_{xx}.$		(6b)

Notice the symmetry in the coefficients, with the real and imaginary parts of $\lambda$ , $\mu$ , $\kappa$ and $\gamma$ appearing on alternate terms between Eqns. (6a) and (6b) (e.g. $\gamma_{r}$ is on $\omega_{xx}$ for (6a), but on $\eta_{xx}$ for (6b)). This system of two real PDEs is the form of the CGLE we will derive from our data and numerically solve.

More generally, any generic two-dimensional system undergoing a Hopf bifurcation can be converted to its normal form, from which essential features of the dynamics can be determined and compared across systems. The conversion process involves applying a series of near-identity transformations, rescalings, and coefficient normalizations to the system in order to distill the normal form parameters [63]. The normal form of the Hopf bifurcation is:

\begin{bmatrix}\dot{y}_{1}\\ \dot{y}_{2}\end{bmatrix}=\begin{bmatrix}\beta&-1\\ 1&\beta\end{bmatrix}\begin{bmatrix}y_{1}\\ y_{2}\end{bmatrix}+\sigma^{\prime}(y_{1}^{2}+y_{2}^{2})\begin{bmatrix}y_{1}\\ y_{2}\end{bmatrix}+D_{x}(y_{1},y_{2}),

(7)

where $\beta\equiv{\zeta}/{\phi}$ and $\zeta$ and $\phi$ are respectively the real and imaginary parts of the complex eigenvalues. $D_{x}(y_{1},y_{2})$ represents the spatial derivative terms. $\sigma$ is the first Lyapunov coefficient and is computed as a function of second- and third-order derivatives of the nonlinear terms of the original system; reference Kuznetsov [63] for further details. The parameter $\beta$ determines whether a perturbation experiences growth ( $\beta>0$ ) or decay ( $\beta<0$ ), and the parameter $\sigma^{\prime}=\operatorname{sgn}(\sigma)=\pm 1$ determines whether the nonlinearity will saturate a perturbation’s growth ( $\sigma^{\prime}=-1$ ) or amplify it ( $\sigma^{\prime}=+1$ ). The case where a perturbation tends to grow but its growth is saturated by the nonlinear terms has parameters ( $\beta>0$ , $\sigma<0$ ) and is the supercritical Hopf bifurcation. This bifurcation results in a stable limit cycle, while in the subcritical case, an unstable limit cycle exists for $\beta<0$ . By computing the parameters ( $\beta$ , $\sigma$ ), we can distill and compare basic features across different systems; in § 4 of this work, we will compute ( $\beta$ , $\sigma$ ) for a variety of systems generated with SINDy in order to compare and discuss them.

3 Method, Simulations, and Data

We obtain vorticity flowfield data $\omega(x,y,t)$ for two-dimensional cylinder shedding from a direct numerical simulation [69, 70] at $\text{Re}~{}=~{}50$ , just above $\text{Re}_{c}$ . We then coarse-grain the data by integrating along the $y$ -coordinate, compressing two-dimensional spatiotemporal data $\omega(x,y,t)$ into one-dimensional spatiotemporal $\overline{\omega}(x,t)$ . We time-delay embed this data, producing the system $\mathcal{U}=[\overline{\omega},\eta]$ and find the dynamical system that produces this data through the sparse identification of nonlinear dynamics (SINDy) [36], specifically the PDE-FIND variant for discovering partial differential equations [37, 71]. SINDy has previously been used to correctly find the Navier-Stokes equations from $\omega(x,y,t)$ for two-dimensional cylinder shedding [37]; we extend that work by coarse-graining and time-delay embedding to produce a ROM for the same vortex shedding.

3.1 Simulating the Fluid Equations

The data is obtained through a computational fluid dynamics (CFD) simulation using the immersed boundary projection method (IBPM) [69, 70]. IBPM uses 3rd order Runge-Kutta integration to solve the vorticity-stream function formulation of the two-dimensional Navier-Stokes equations, simulated here in a domain of $x\in[-2,52]$ and $y\in[-8,8]$ with a cylinder of diameter $D=1$ situated at the origin. We used a grid of $dx=dy=0.02$ resulting in 2.16 million mesh points, and the time step was $dt=0.02$ with an initial condition of zero velocity and pressure everywhere. We impose no-slip boundary conditions on the surface of the body, far-field freestream conditions on the vertical domain boundaries, and inflow and outflow conditions on the left and right horizontal domain boundaries, respectively.

A nondimensional measure of the frequency of the system is the Strouhal number, $\text{St}=fL/U_{\infty}$ , where $f$ is the frequency of vortex shedding. St can be measured from the coefficient of lift $C_{l}(t)$ (discussed more later), and for $\text{Re}=50$ , we obtain $\text{St}=0.126$ . This is in-line with Williamson [72] and Park et al. [73] who give 0.12 and 0.124, respectively. The specific flowfield data used throughout this work in order to learn a sparse, interpretable model is vorticity $\omega(x,y,t)$ at $\text{Re}=50$ , specifically a single period of flowfield data after saturation onto the limit cycle.

3.2 Coarse-Graining

We seek to transform our flowfield data into a form that can be modeled as a one-dimensional Ginzburg-Landau system depending on $(x,t)$ , hence we must find a way to transform our two-dimensional data $\omega(x,y,t)$ into one-dimensional data $\overline{\omega}(x,t)$ . There are many reasonable ways to do this. For example, taking the vorticity along a single horizontal line at a specified $y-$ coordinate is simple, but neglects to incorporate the vertical distribution of vorticity in any way. Instead, we choose to integrate along the $y$ -coordinate for each $(x,t)$ ,

\overline{\omega}(x,t)=\int_{-H}^{+H}\omega(x,y,t)dy,

(8)

which incorporates the vertical distribution of vorticity while still producing one-dimensional data in a simple manner. The vortices spread out vertically as they dissipate downstream, hence the ideal bounds of integration would be $-\infty\leq y\leq\infty$ ; in practice the vertical movement of a vortex is slow relative to its horizontal movement, and a vertical limit of $H=8D$ at $x=50D$ downstream is more than sufficient. We perform this integration at every point $(x,t)$ , effectively removing the $y$ -coordinate and thus reducing the dimension of our data from $(x,y,t)$ to $(x,t)$ . Figure 2a depicts a snapshot of the vorticity flowfield data from the CFD simulation overlaid with the bounds of integration for a particular $x_{i}$ ; Figure 2b shows the instantaneous coarse-grained vorticity $\overline{\omega}(x;t=0)$ . Figures 2c and 2d repeat this process through time to produce the full spatiotemporal coarse-grained vorticity $\overline{\omega}(x,t)$ .

The coarse-grained system captures the spatiotemporal regularity and pattern-forming behavior of the original vortex shedding flow. Each peak or trough in the periodic signal respectively corresponds to a vortex of positive or negative vorticity in the original flowfield. The signal is zero upstream of the cylinder where the flow is unperturbed, then begins to oscillate sharply in the region of the cylinder $x\in[-0.5,0.5]$ . As the vortices detach and shed downstream, the coarse-grained signal smoothly propagates, its amplitude growing and peaking in the mid-wake region, after which it slowly decays. Since vorticity is diffusive and the signal is clearly decaying, in a semi-infinite domain $x\in[0,+\infty)$ we expect that $\overline{\omega}(x,t)\to 0$ as $x\to\infty$ .

We propose this method of coarse-graining for two reasons: first, it is a simple method to reduce the spatiotemporal dimension of the data while incorporating information from the original flowfield; and second, it is in analogy to an analysis of the time-averaged mean flow, as in Barkley [43]. In this work, we propose a spatial analog of mean flow analysis that leads to novel, interpretable results.

3.3 Time-Delay Embedding

We seek to discover the real and imaginary components of the CGLE in the 2-equation form of Eqns. (6a)–(6b), hence we require a second signal in addition to our coarse-grained vorticity $\overline{\omega}(x,t)$ . As with coarse-graining, there are many reasonable ways to lift the dimension of a system, in this case from 1 signal to 2 signals. For example, Loiseau et al. based their system off of the coefficient of lift $C_{l}(t)=2L/(\rho U_{\infty}^{2}L)$ , a nondimensional measure of the lift force on the body [38]. Reference Figure 3 (left) for a plot of $C_{l}(t)$ from our data at $\text{Re}=50$ . Loiseau et al. then lifted the dimension of their system by differentiating $C_{l}(t)$ with respect to time; differentiating the signal $C_{l}(t)$ produces a second periodic signal of the same frequency and similar (but not the same) amplitude but with a phase shift. Loiseau went on to use SINDy on these signals to discover a Stuart-Landau-like model for the coefficient of lift. Our discovery of a Ginzburg-Landau-like model from spatiotemporal data is a natural extension of that work; here however, we produce this second signal via time-delay embedding, which also produces a second periodic signal of the same frequency and same amplitude, and which is less prone to amplifying numerical errors than derivative embedding.

Specifically, if our signal is $C_{l}(t)$ , then the time-delay embedded system is $\mathcal{U}=[C_{l}(t),C_{l}(t+\tau)]$ , where $\tau$ is the degree of time-delay embedding [45, 46]. Choosing a good $\tau$ is crucial to recovering a good model, but it’s not obvious a priori what that choice should be. Figure 3 shows the phase space of the system under different time-delay embeddings and under time-differentiation.

Given a period of oscillation $\mathcal{T}$ of $C_{l}(t)$ , the orbit in phase space of the time-delay embedded system is most circular for $\tau=\frac{1}{4}\mathcal{T}$ , hence maximizing the independence of the signals. As the time-delay embedding increases or decreases, the signals become increasingly in-phase until a trivial time-delay embedding of $\tau=\frac{1}{2}n\mathcal{T},n=0,1,2...$ and we recover perfectly colinear features. If instead of time-delay embedding, we pair $C_{l}(t)$ with its time derivative, as can be seen in Figure 3(d), the effect is to produce an unrotated ellipsoidal orbit in phase space, resulting in a similar effect of obtaining mostly independent features. However, finding the numerical derivative of $C_{l}(t)$ will introduce and amplify numerical errors. Recalling Figure 1, the real and imaginary parts of the Stuart-Landau equation also produce a circular orbit in phase space.

Returning to our spatiotemporal coarse-grained vorticity data $\overline{\omega}(x,t)$ , we make an initial choice of $\tau=\frac{1}{4}\mathcal{T}$ as this produced maximally independent features in the related system $C_{l}(t)$ . Doing so produces the system of time-delay embedded coarse-grained data $\mathcal{U}=[\overline{\omega}(x,t)\,,\,\eta(x,t)]$ where $\eta(x,t)\equiv\overline{\omega}(x,t+\tau)$ is the time-delay embedded signal. Figure 4a shows line plots of the instantaneous system of coarse-grained data $\mathcal{U}$ . We seek a model for the dynamics $\mathcal{U}_{t}$ of this system, and the instantaneous dynamics are shown in Figure 4b. Time-differentiating the spatiotemporal coarse-grained vorticity produces higher-order behavior evident at the peaks of the oscillations in $\mathcal{U}_{t}$ at $10\lesssim x\lesssim 30$ , in which the curve doubles back on itself. This is also the area of the global maximum of $\overline{\omega}$ and $\eta$ , occurring at $x\approx 20$ . Higher-order behavior is less easily explained by a sparse system of PDEs, hence the inclusion of a time-differentiated signal in the base system of signals $\mathcal{U}$ would produce more complex dynamics. By time-delay embedding our data, we avoid introducing additional complexity into the signals and dynamics. Figures 4c and 4d show the full spatiotemporal plots of the coarse-grained vorticity $\overline{\omega}(x,t)$ and its time derivative $\frac{\partial}{\partial t}\overline{\omega}(x,t)$ . Note that the color plots of the time-delay embedded data (not shown) would look identical except for a phase shift of $\frac{1}{4}\mathcal{T}$ .

As a final note, our system of data $\mathcal{U}=[\overline{\omega},\eta]$ was a result of specific choices: what CFD flowfield data to use, how to coarse-grain it, and how to produce a second commensurate signal. There are other choices that may also produce a system of signals that could be used as a basis for developing a ROM for cylinder vortex shedding. For example, one might directly incorporate the time derivative of the data by using a system like $\mathcal{U}=[\overline{\omega},\overline{\omega}_{t}]$ , though with the drawback that numerical differentiation will introduce and amplify numerical errors. Alternately, one might change the underlying flowfield data by coarse-graining the horizontal $u(x,y,t)$ and vertical $v(x,y,t)$ components of the velocity flowfield, $\mathcal{U}=(\overline{u},\overline{v})$ .

3.4 Model Identification

Having obtained our flowfield data $\omega(x,y,t)$ and post-processed it into our system of coarse-grained, time-delay embedded signals $\mathcal{U}$ , we now develop a sparse, interpretable system of PDEs to describe the dynamics of our spatiotemporal data using SINDy [36, 37, 71]. Given a library of candidate functions, SINDy leverages sparsity-promoting and regularizing techniques to determine which subset of candidate terms explains most of the dynamics: the reconstruction error is minimized under some norm while a sparsity-promoting regularization term ensures a final model which is sparse and hence interpretable. In this work, we specifically use an extension of SINDy known as PDE-FIND [37], which extends the SINDy framework from ODEs to PDEs.

Given data $\mathcal{U}$ , we propose to model it as a dynamical system

\mathcal{U}_{t}=\mathcal{N}(f_{1}(\mathcal{U}),...,f_{i}(\mathcal{U}),...),

(9)

where the library terms $f_{i}$ are a set of functions which we apply to the original data, for example powers, spatial derivatives, and products thereof. For a homogeneous system, the coefficients are assumed constant. We can frame this as a matrix-vector product:

\mathcal{U}_{t}=\Theta\>\cdot\>\xi,

(10)

where $\Theta$ is the library of candidate functions $f_{i}(\mathcal{U}$ ) and $\xi$ is the vector of coefficients for each function. Given some data $\mathcal{U}$ we wish to model as a dynamic system as in (9), we find the numerical time-derivative of the data $\mathcal{U}_{t}$ and evaluate the candidate functions $f_{i}$ within $\Theta$ . The choice of candidate function library $\Theta$ is significant. In theory, one may fill the library with as many functions as desired, but in practice, prior knowledge of the system and dynamics helps to select the most appropriate functions. Previous work has established a clear relationship between vortex shedding and the Stuart- and Ginzburg-Landau equations [13, 68, 11, 66]. Given this, we choose a library of functions inspired by the form of the CGLE, including polynomial terms up to third order, spatial derivatives up to second order, and variations thereof; the exact libraries of functions used throughout this study can be found in Appendix A. With many possible candidate functions, the regression problem for $\xi$ is underconstrained. We desire that $\xi$ be sparse (i.e. it has many zero entries), since having fewer terms with nonzero values results in a more interpretable dynamical system for $\mathcal{U}_{t}$ . This is achieved by solving an optimization problem,

\hat{\xi}=\text{argmin}||\Theta\xi-\mathcal{U}_{t}||_{2}^{2}+\lambda||\xi||_{0},

(11)

where $\hat{\xi}$ is the sparse version of $\xi$ returned from SINDy. In practice, the use of the $l_{0}$ norm to regularize $\xi$ results in an $np$ -hard optimization problem, so we solve the optimization problem with a convex relaxation, such as the sequentially thresholded least-squares (STLS) regression algorithm [36].

We can then compute the matrix-vector product $\hat{\mathcal{U}}_{t}=\Theta\cdot\hat{\xi}$ , obtaining SINDy’s reconstruction of the time-derivative of the data. We then calculate the error between the data and the SINDy reconstruction:

\epsilon=\mathcal{U}_{t}-\hat{\mathcal{U}}_{t}=\mathcal{U}_{t}-\Theta\cdot\hat{\xi}.

(12)

We then find the normalized root-mean-square error (NRMSE), defined as:

\text{NRMSE}=\sqrt{\frac{\displaystyle\sum_{t}\sum_{x}\epsilon^{2}}{\displaystyle\sum_{t}\sum_{x}\mathcal{U}_{t}^{2}}}.

(13)

The NRMSE is computed for each signal within $\mathcal{U}$ , and the total NRMSE is the $l^{2}$ norm of the individual NRMSEs. This single error measurement for the entire system is a metric of the model’s quality. After obtaining a PDE with low NRMSE, the final step in this analysis is to numerically solve the system of PDEs and compare the result to the original data, observing the quality and similarities of the models.

4 Results and Discussion

For our initial results, we report the fit from SINDy for a library inspired by the CGLE: first-, second-, and third-order products of the data $\mathcal{U}=[\overline{\omega},\eta]$ as well as first- and second-order spatial derivatives (advection and diffusion, respectively). This is “Library GL” and the full list of terms is shown in Appendix A. Following the work of Rudy et al. [37] in which SINDy correctly discovered the Navier-Stokes equations directly from the vorticity flowfield in a spatial domain of $x\in[2,9]$ , we first investigate this near-wake region as a base case, developing a CGLE model for the local dynamics. We then investigate the global dynamics by developing a model for the coarse-grained vorticity of the entire wake $x\in[2,50]$ . A pseudospectral numerical solution of the generated CGLE model presents nearly identically to our original data, confirming the validity of the CGLE as a ROM for cylinder vortex shedding.

In § 4.2, we expand the SINDy function libraries to include more terms, seeking models with higher-order terms and additional nonlinear terms. While the base case NRMSE between the data and the SINDy fit is already small, it is clear that some of the behavior is not modeled by the CGLE. Additional terms in the function library are added to reduce the NRMSE and more closely model the dynamics. We also investigate the spatial variation of the local CGLE models. By computing the normal form parameters of models that are generated across different subdomains of the entire wake, we find that different regions of the wake can be characterized according to the stability properties of their local models. Finally, in § 4.3, we introduce spatially-varying coefficients on the model terms for a heterogeneous CGLE model of the global wake dynamics. These analyses shed light on the spatial character of our local and global ROMs of cylinder vortex shedding

4.1 Initial Results and Numerical Solution of Fitted Model

To begin our analysis, we use SINDy to fit a Ginzburg-Landau model to the coarse-grained vorticity. We first find a model for the local dynamics of the wake near the cylinder, $x\in[2,9]$ . Then we find a global model for a much larger wake of $x\in[2,50]$ . For both spatial domains, Figure 5 shows the input dynamics $\overline{\omega}_{t}$ , the SINDy fit $\hat{\overline{\omega}}_{t}$ , and their error $\epsilon$ ; as before, the accompanying signal $\eta_{t}$ (not shown) is similar to $\overline{\omega}_{t}$ but with a phase shift of $\frac{1}{4}\mathcal{T}$ . The identified coefficients of the CGLE as well as the NRMSE and normal form parameters $(\sigma,\beta)$ are shown in Table 1. $\overline{\omega}_{t}$ and $\hat{\overline{\omega}}_{t}$ overlap on the line plots, with the error at any given point being at most one order of magnitude smaller than $\overline{\omega}_{t}$ and $\hat{\overline{\omega}}_{t}$ at that point. The NRMSE is NRMSE = $6.7\times 10^{-3}$ for the near-wake and NRMSE = $3.2\times 10^{-2}$ for the entire wake. SINDy is clearly able to reconstruct the dynamics in the near-wake and the entire wake with little error.

The spatial behavior of the coarse-grained dynamics and their time derivative have been discussed in § 3; like $\overline{\omega}_{t}$ , $\epsilon$ also varies with $x$ , but whereas $\overline{\omega}_{t}$ attains its global maximum at $x\approx 20$ and decays from there, $\epsilon$ has a global maximum in the mid-wake ( $x\approx 10$ ) and a secondary local maximum in the far-wake ( $x\approx 30$ ); these are areas in which a greater amount of the dynamics are not explained by the terms in $\Theta(\overline{\omega},\eta)$ . These areas respectively coincide with the beginning and end of the higher-order behavior in $\overline{\omega}_{t}$ and $\hat{\overline{\omega}}_{t}$ , and $\epsilon$ reaches a local minimum at $x\approx 20$ , in between its primary global maximum and the secondary local maximum. This coincides with the global maximum of $\overline{\omega}_{t}$ and the peak of its higher-order behavior. This indicates that the two areas of transition between lower- and higher-order behaviors and back are least explained by the modeled CGLE. This supports the notion that the wake may be separated into three regimes of behavior: the near-wake, the mid-wake, and the far-wake. These subdomains will be explored more in the following subsections, where we both vary the subdomain of the data and tune the library terms and sparsity used to generate the model.

For both spatial domains, SINDy naturally identified coefficients whose values and signs show the same symmetry pattern expected from Eqns. (6a)–(6b), for example identical coefficients ( $\lambda_{r}$ ) between the $\overline{\omega}$ term in the equation for $\overline{\omega}_{t}$ and the $\eta$ term in the equation for $\eta_{t}$ . There is one key difference however: in a given equation, all four cubic terms have different coefficients, instead of two terms with coefficient $\mu_{r}$ and two with $\mu_{i}$ . However, the four terms in one equation are still matched with four opposing terms in the other equation, hence instead of a single $\mu=\mu_{r}+\mathbf{i}\mu_{i}$ suggested by Eqns. (6a)–(6b), we instead have $\mu_{1}=\mu_{1_{r}}+\mathbf{i}\mu_{1_{i}}$ and $\mu_{2}=\mu_{2_{r}}+\mathbf{i}\mu_{2_{i}}$ . While this means the generated equation cannot be neatly factored back into the form of Eqn. (3), we may still compute the normal form parameters $(\sigma,\beta)$ . The normal form characterizes the differences in behaviors of different systems arising from differences in signs and magnitudes of some terms between the different spatial domains (e.g. $\lambda_{r}$ and $\lambda_{i}$ ); variations across the terms give rise to different system behaviors. For example, $(\sigma<0,\beta>0)$ for $x\in[2,9]$ corresponds to the classic supercritical Hopf bifurcation with an unstable fixed point and a stable limit cycle, consistent with previous conclusions that vortex shedding is characterized by a supercritical Hopf bifurcation [11, 12, 66]. Conversely, $(\sigma>0,\beta<0)$ for $x\in[2,50]$ corresponds to a subcritical Hopf bifurcation with a stable fixed point and unstable limit cycle. Differences in the local and global ROMs will be explore further in § 4.2.

Domain	Near-Wake: $x\in[2,9]$				Entire Wake: $x\in[2,50]$
Term	$\overline{\omega}_{t}$		$\eta_{t}$		$\overline{\omega}_{t}$		$\eta_{t}$
$\overline{\omega}$	$\lambda_{r}$	0.0483	$\lambda_{i}$	0.222	$\lambda_{r}$	–0.0269	$\lambda_{i}$	0.499
$\eta$	$\lambda_{i}$	0.222	$\lambda_{r}$	0.0483	$\lambda_{i}$	0.499	$\lambda_{r}$	–0.0269
$\overline{\omega}^{3}$	$\mu_{1,r}$	0.0583	$\mu_{1,i}$	–0.0732	$\mu_{1,r}$	–0.0154	$\mu_{1,i}$	0.0787
$\eta^{3}$	$\mu_{1,i}$	–0.0732	$\mu_{1,r}$	0.0583	$\mu_{1,i}$	0.0787	$\mu_{1,r}$	–0.0154
$\overline{\omega}^{2}\eta$	$\mu_{2,r}$	0.154	$\mu_{2,i}$	–0.209	$\mu_{2,r}$	–0.0347	$\mu_{2,i}$	–0.00668
$\overline{\omega}\eta^{2}$	$\mu_{2,i}$	–0.209	$\mu_{2,r}$	0.154	$\mu_{2,i}$	–0.00668	$\mu_{2,r}$	–0.0347
$\overline{\omega}_{x}$	$\kappa_{r}$	–0.613	$\kappa_{i}$	0.0130	$\kappa_{r}$	–0.539	$\kappa_{i}$	0.00715
$\eta_{x}$	$\kappa_{i}$	0.0130	$\kappa_{r}$	–0.613	$\kappa_{i}$	0.00715	$\kappa_{r}$	–0.539
$\overline{\omega}_{xx}$	$\gamma_{r}$	–0.00687	$\gamma_{i}$	0.171	$\gamma_{r}$	–0.0102	$\gamma_{i}$	0.231
$\eta_{xx}$	$\gamma_{i}$	0.171	$\gamma_{r}$	–0.00687	$\gamma_{i}$	0.231	$\gamma_{r}$	–0.0102
NRMSE	$6.7\times 10^{-3}$				$3.2\times 10^{-2}$
$\sigma$	–0.389				0.0308
$\beta$	0.218				–0.0538

Table 1: Coefficients of generated sparse PDEs for coarse-grained vorticity in the form of Eqs. (6a)–(6b) in two different spatial domains:

x\in[2,9]

(left) and

x\in[2,50]

(right). For a given spatial domain’s model, notice the symmetry in coefficient values, for example between the coefficient for

\overline{\omega}^{3}

in the equation for

\overline{\omega}_{t}

and the coefficient for

\eta^{3}

in the equation for

\eta_{t}

. Such symmetries in terms are required by the form of the CGLE and discovered by SINDy naturally. The numerical solution of the near-wake model is shown in Figure 6 (bottom row).

Having found a sparse system of PDEs that closely fits our data with minimal reconstruction error, we use a pseudospectral integration method to compute a numerical solution for the SINDy model generated from the near-wake. We focus on this model for now because of its lower error; local models for different subregions of the wake and global models for the entire wake will be explored more later. Figure 6 compares the original data $\mathcal{U}$ in the top row to the numerical solution of the generated system of PDEs in the bottom row. The numerical solution shows a close agreement with the data in the near-wake, reinforcing the validity of the CGLE as a model for our data and as a ROM for vortex shedding.

The numerical solution of our generated model closely matches the original data and lends great insight into its behavior, but some of its features are an approximation. Although convenient and simple to implement, a pseudospectral method supposes periodic boundaries which our original data clearly does not have. Our data originally comes from the vortex shedding wake behind a cylinder, and once coarse-grained, there is clearly strongly time-dependent behavior at the boundaries. In particular, the behavior of $\overline{\omega}$ near the surface of the cylinder is nearly discontinuous. This poses a challenge, because the model found by SINDy is generated without consideration of well-posedness and boundary conditions.

Further, periodic boundary conditions allow our initial pulse to eventually self-interact once it has propagated the length of the domain. To overcome this, we embed the solution in a large spatial domain and investigate a spatiotemporal subdomain comparable to the original data, early enough in time that the initial pulse has not yet significantly interacted with itself. Although this strategy works well, it works better when we also include regularization in the form of hyperdiffusion terms (fourth-order spatial derivatives e.g. $\eta_{xxxx}$ ) in each equation with coefficient $R$ . Hyperdiffusion serves to damp high frequency oscillations that result from self-interaction in a periodic domain. With distinct $\mu_{1}$ and $\mu_{2}$ and the inclusion of hyperdiffusion, our final equations then become the following:


	$\displaystyle\frac{\partial\overline{\omega}}{\partial t}=\lambda_{r}\overline{\omega}-\lambda_{i}\eta-\mu_{1,r}\overline{\omega}^{3}+\mu_{2,i}\overline{\omega}^{2}\eta-\mu_{2,r}\overline{\omega}\eta^{2}+\mu_{1,i}\eta^{3}+\kappa_{r}\overline{\omega}_{x}-\kappa_{i}\eta_{x}+\gamma_{r}\overline{\omega}_{xx}-\gamma_{i}\eta_{xx}+R\overline{\omega}_{xxxx},$		(14a)
	$\displaystyle\frac{\partial\eta}{\partial t}=\lambda_{i}\overline{\omega}+\lambda_{r}\eta-\mu_{1,i}\overline{\omega}^{3}-\mu_{2,r}\overline{\omega}^{2}\eta-\mu_{2,i}\overline{\omega}\eta^{2}-\mu_{1,r}\eta^{3}+\kappa_{i}\overline{\omega}_{x}+\kappa_{r}\eta_{x}+\gamma_{i}\overline{\omega}_{xx}+\gamma_{r}\eta_{xx}+R\eta_{xxxx}.$		(14b)

While we use a large spatial domain in order to prevent self-interaction for as long as possible, too large of a domain requires infeasibly long calculation time. Hyperdiffusion with sufficiently large $R$ serves to damp the higher-order oscillations that begin once the signal begins self-interacting; however, we generally want a value of $R$ as low as possible as it will damp the main oscillations as well. For the numerical solution in the bottom row of Figure 6, we use $R=10^{-3}$ . This value of $R$ was found to be optimal, with larger values of $R$ resulting in the same signal but with lower amplitude, and smaller values of $R$ resulting in quickly unstable solutions. Regularization in the form of hyperdiffusion is easy to include in the framework of a pseudospectral numerical solution and is highly effective.

The analysis of this section has found close agreement between the original data, the model generated by SINDy, and the numerical solution of that model. As such, our method of coarse-graining, time-delay embedding, and model discovery with SINDy has resulted in a novel one-dimensional CGLE for two-dimensional cylinder vortex shedding, incorporating for the first time the streamwise coordinate into the Landau variable. In the following subsection, we will explore the spatial variation of local models found at different points in the wake.

4.2 Parameter Tuning – Spatial Domain and Model Term Libraries

In the previous subsection, we found a sparse model in the form of the CGLE to explain the dynamics of the coarse-grained vorticity in both the near-wake and the entire wake. Now we refine our method by varying and tuning certain parameters of our data-driven method. First we vary the spatial subdomain of interest and show how the error of the fitted model varies as the subdomain moves downstream. Then we expand the library of terms used to generate the model in order to better capture higher-order behavior in our data. We discuss each analysis in turn.

Regarding the spatial domain of the data, the initial work of § 4.1 investigated both a specific flowfield subdomain of $x\in[2,9]$ inspired by Rudy et al. [37], and a much larger flowfield of $x\in[2,50]$ to fully investigate the global wake dynamics. From this, it was clear that the data, the model coefficients, and the error are spatially dependent, so we next investigate how generated CGLE models vary depending on the location within the wake they are modeling. Specifically, for a subdomain of length $W_{L}$ , SINDy generates a model from a specified library. The spatial subdomain has a midpoint $x_{mid}$ that then increments downstream, and the SINDy fit process is repeated. At each station within the wake, the NRMSE of the fitted model and its normal form parameters are computed. This entire process is then repeated for different values of $W_{L}$ . In the limit $W_{L}\rightarrow 0$ , the spatial coordinate of the coarse-grained data vanishes and our spatiotemporal data $\mathcal{U}(x,t)$ collapses to purely temporal data $\mathcal{U}(t;x_{mid})$ at that point $x_{mid}$ . For this limiting case, we use SINDy to fit an ordinary differential equation in the form of the SLE (Eq. (2)), which mirrors the CGLE (Eqs. (6a)–(6b)) but without the spatial derivative terms. This is “Library SL” and the full list of terms is shown in Appendix A.

Regarding the library of model terms, the initial work applied SINDy to the data with Library GL, a judiciously chosen library of candidate functions $\Theta$ with terms exactly matching those in the CGLE. Now we will allow for additional nonlinear terms in three forms: higher-order polynomial terms up to fifth order, products of the polynomial and derivative terms (e.g. $\eta^{2}\eta_{xx}$ ), and combinations of those. These libraries are named for the total number of terms they feature, and respectively are referred to as “GL₂₄”, “GL₄₉”, and “GL₁₀₄”. For reference, the SL and GL Libraries have 9 and 13 terms. The full list of terms for all libraries are shown in Appendix A. These libraries contain many terms, and some of the larger libraries will have terms that contribute very little to the observed dynamics. Later we will tune the sparsity parameter $\lambda$ from Eq. (11) in order to eliminate some terms from these larger libraries, leaving behind only those additional nonlinear terms that better explain the dynamics of the coarse-grained vorticity. The best sparse models will explain as much of the dynamics with as few terms as possible. For now, the NRMSE of the nonsparse systems will provide a baseline to which we may later compare the sparse models.

As before, we quantify the goodness of fit of the model returned by SINDy with the NRMSE. Figure 7 shows how the local models’ NRMSE varies spatially in the wake, with parameterization by the spatial subdomain size and by the choice of library. Figure 7(a) shows how the NRMSE from models generated with our initial GL Library varies with spatial location in the wake, plotted specifically against $x_{mid}$ , the midpoint of the spatial domain. In dark blue is the base case using (GL,7). In a lighter shade of blue and with different linestyles are the NRMSE for cases with larger and smaller $W_{L}$ , (GL,15) and (GL,2). In red is the case (SL,0), which does not incorporate any spatial derivative terms. Figure 7(b) shows the spatial variation of NRMSE from models generated with equal subdomain sizes $W_{L}=7$ and increasingly larger libraries “GL₂₄”, “GL₄₉”, and “GL₁₀₄”. The dark blue case of (GL,7) is identical on both plots of 7.

In both Figures 7(a)-(b), the NRMSE shows clear spatial dependence in for all $W_{L}$ and model library. The error is moderate in the near-wake, then grows to a global maximum at $x\approx 20$ in the mid-wake before a sharp decline in the far-wake. Indeed, the NRMSE is 1-2 orders of magnitude lower in the far-wake than at its maximum in the mid-wake. In the mid-wake, the dynamics are characterized by the nonlinear growth and saturation of the SLE and CGLE and potentially by higher-order effects, but in the far-wake, the dynamics are primarily dominated by diffusion. Since the NRMSE is lower where diffusive dynamics dominate, this behavior is better explained by the terms in GL Library. Furthermore, in Figure 7(a), the NRMSE generally decreases as $W_{L}$ decreases but variation with $x_{mid}$ remains similar, suggesting more accurate models can be fit to a more narrow spatial slice of the data. However, the limiting case of $W_{L}=0$ , which necessarily has no spatial component and hence no spatial derivatives in its model term library, has larger NRMSE in the far-wake region than does the base case of (GL, $W_{L}=7$ ), reinforcing that derivative and specifically diffusive terms dominate the far-wake coarse-grained dynamics.

In Figure 7(b), the NRMSE of all local models improves as the library of candidate functions $\Theta$ becomes larger. Interestingly, the peak at $x\approx 20$ is much less pronounced for Library GL₁₀₄ (the largest library), though the error in the far-wake is still an order of magnitude lower. This suggests that the mid-wake region of the flowfield is more influenced by higher-order effects, hence when higher-order terms are included in the model, the error at the mid-wake greatly decreases. In the far-wake, where the dynamics are dominated by diffusion, the inclusion of additional higher-order terms helps the least. The minimum NRMSE achieved anywhere across this parameter and hyperparameter tuning is $\approx 2\times 10^{-5}$ , occurring far downstream for a model generated from Library GL₁₀₄. This is over 2 magnitudes of an improvement in NRMSE over the initial case (Table 1). However, this has come at the cost of interpretability, as with zero sparsity ( $\lambda=0$ ) this model has over 100 terms and cannot feasibly be numerically simulated; in Appendix B, we will tune our sparsity parameter $\lambda$ to remove those terms which have the least effect on the error.

The local models clearly vary spatially, hence by investigating the coefficients of the models in each subdomain and computing their normal form parameters $(\sigma,\beta)$ , we can draw conclusions about the behavior of that local model and the spatial characteristics of the coarse-grained data. Figure 8 shows $\sigma$ and $\beta$ vs. $x_{mid}$ for both the spatiotemporal ( $W_{L}\neq 0$ ) CGLE models from Library GL and for purely temporal ( $W_{L}=0$ ) SLE models from Library SL. The normal form parameters were also computed for models (GL,15) and (GL,2) from Figure 7(a), but were found to deviate only marginally from the base case (GL,7), and hence are not shown. This suggests that $W_{L}$ does not have a strong effect on the normal form parameters of the local model, at least not until it vanishes for (SL,0).

The behavior of a Hopf bifurcation depends on the signs of $(\sigma,\beta)$ ; specifically, the sign of $\beta$ determines whether the fixed point is stable ( $\beta<0$ ) or unstable ( $\beta<0$ ), and the sign of $\sigma$ determines whether the nonlinear terms cause a stable limit cycle ( $\sigma<0$ ) or unstable limit cycle ( $\sigma>0$ ). As the spatial domain moves downstream, the local model transitions between several distinct regions of behavior, delineated in Figures 8(a)-(b). For each of these regions, Table 2 shows the normal form parameters $(\sigma,\beta)$ , the stability of the equilibrium at the origin, the existence and stability of the limit cycle, and figures of the normal form phase portraits.

[Uncaptioned image] — Table 2: The range of Hopf bifurcation behaviors and normal form parameter regimes present in the coarse-grained data from vortex shedding past a cylinder near the bifurcation point. From, Table 1, recall that the global model for the entire domain $x\in[2,50]$ had normal form parameters $(\sigma>0,\beta<0)$ , matching Region II.

	Region I	Region II	Region III
Spatial Domain	$0\lesssim x\lesssim 5$ and $x\approx 30$	$5\lesssim x\lesssim 18$ and $25\lesssim x<+\infty$	$18\lesssim x\lesssim 25$
$\sigma$	$\sigma<0$	$\sigma>0$	$\sigma>0$
$\beta$	$\beta>0$	$\beta<0$	$\beta>0$
Origin Stability	Unstable	Asymptotically Stable	Unstable
Limit Cycle	Stable	Unstable	None
Phase Portrait

Near the cylinder for $0\lesssim x\lesssim 5$ , Region I has normal form parameters ( $\sigma<0$ , $\beta>0$ ). Moving downstream, $\sigma$ and $\beta$ undergo a variety of sign and magnitude changes, transitioning through three unique sets of ( $\sigma$ , $\beta$ ). Throughout the wake, the normal form parameters from the GL Library and SL Library fits and generally agree with each other. While derivative terms are excluded from the SL Library, the nature of the data clearly suggests advective and diffusive behavior. In the absence of derivative terms, the values of the remaining coefficients compensate, and their changing values produce different $(\sigma,\beta)$ . The main distinction is the differing behavior in the far wake, namely the $(SL,0)$ fits never experience Region III and instead experience a brief return of Region I at $x\approx 30$ . Both regions feature $\beta>0$ and are distinguished by whether $\sigma<0$ (Region I) or $\sigma>0$ (Region III). For the (GL,7) model, $\beta>0$ for $18\lesssim x\lesssim 25$ while $\sigma$ remains positive but decreasing to an asymptotic value. For the $(SL,0)$ model however, $\sigma$ becomes negative and $\beta$ simultaneously becomes positive, then both quickly switch back. Although difficult to determine visually, close inspection reveals that the bounds of these regions for both parameters coincide exactly, and both parameters are $\mathcal{O}(10^{-4})$ . Recalling Figure 7, the NRMSE produced by the model $(SL,0)$ sees a pause in its decline at $x\approx 30$ while other models’ NRMSEs continue to decrease. This indicates that the fit from $(SL,0)$ in this region is relatively worse and the dynamics here may be due to higher-order effects not captured by the CGLE and not represented by its normal form parameters.

Further regarding the far-wake behavior, there is suggestive behavior indicative of long-range spatial variation in both parameters. While $\sigma$ reaches an asymptotic value in the far-wake for Library GL fits, $\sigma$ from Library SL briefly becomes negative in the far-wake before increasing and possibly attaining a secondary local maxima at $x\approx 50$ . Regarding $\beta$ , for all three fits the parameter decreases moving downstream after its maximum, but appears to begin increasing again far downstream at $x\approx 45$ . Although the present spatial domain of analysis is insufficient to clearly capture the behavior, it appears that the local models’ evolve on a large length scale and the normal forms have long-range spatial variation.

The spatial variation of normal forms may also suggest a connection with the wavemaker region, a region of local absolute instability known to exist in the wake close to the cylinder, $0\lesssim x\lesssim 4$ [19, 58]. In this region, perturbations grow and ultimately affect the entire domain. This near-wake region corresponds to Region I in Figure 8, where $\beta>0$ indicating that the fixed point is unstable and perturbations to this state grow. Sufficiently far downstream, the local flowfield is convectively but not absolutely unstable, and correspondingly, $\beta<0$ , signaling that perturbations from the stable fixed point decay, consistent with a flowfield in which the vorticity slowly decays away far downstream. The recurrence of regions of $\beta>0$ amid the downstream diffusion may again indicate longer-range behavior or a secondary wavemaker region.

4.3 Heterogeneous Models

The analysis of the previous section explored generated local models’ spatial behaviors and variations. In this section, we further iterate on the form of the model and minimize the NRMSE by fitting a global heterogeneous CGLE to the data of the entire wake $x\in[2,50]$ . This is done by introducing $x-$ dependent coefficients to replace constant coefficients on the model terms. By explicitly allowing for spatial variation in the coefficients of the terms, we find a global model for the entire wake that captures the spatially changing dynamics. The general form of our dynamical system becomes:

\mathcal{U}_{t}=\mathcal{N}(g_{1}(x)\cdot f_{1}(\mathcal{U}),...,g_{i}(x)\cdot f_{i}(\mathcal{U}),...),

(15)

where our data $\mathcal{U}=[\overline{\omega},\eta]$ is again our coarse-grained vorticity and its time-delay embedding. First, we will try using $g_{i}(x)$ as polynomials in $x$ up to a specified order as an approximation to the Taylor series of the true, unknown coefficient functions for each term in the Library GL. In Figure 9(a), the NRMSE of a generated model is plotted against the order $n$ of the Taylor polynomials. The different libraries are:

•

H-GL₁ – Taylor series in $x$ on the linear terms of the GL library
•

H-GL₃ – Taylor series in $x$ on the cubic terms of the GL library
•

H-GL_1,3 – Taylor series in $x$ on both the linear and cubic terms of the GL library

At $n=0$ , the coefficients are constant and we reproduce the initial case with Library GL on the global wake. The NRMSE then improves with Taylor polynomials on the linear terms, more with Taylor polynomials on the cubic terms, and even more with Taylor polynomials on both the linear and cubic terms. The NRMSE generally improves with higher order $n$ of the Taylor polynomials, although improvements generally diminish around $n=7$ , after which the NRMSE of the generated model experiences only marginal improvements. At $n=9$ , Library H-GL_1,3 achieves a minimum NRMSE of $1.2\times 10^{-3}$ , an order of magnitude improvement over the initial case (NRMSE = $3.2\times 10^{-2}$ ). The heterogeneous system’s improvement of the global model’s NRMSE further reinforces that the vortex shedding wake and its ROMs have spatially-dependent behavior that a homogeneous global model fails to capture.

5 Conclusion

In this work, we developed a novel one-dimensional Ginzburg-Landau model for two-dimensional cylinder vortex shedding, incorporating the streamwise coordinate of the flow as the spatial coordinate of the Landau model. After simulating flow past a cylinder at $\text{Re}=50$ , the two-dimensional vorticity field $\omega(x,y,t)$ is first coarse-grained, producing $\overline{\omega}(x,t)$ . This signal is then time-delay embedded, producing the system $\mathcal{U}=[\overline{\omega},\eta](x,t)$ . SINDy is used to fit a model to the observed dynamics. Using a library of terms modeled after the CGLE, the data is modeled and reconstructed with minimal error. The numerical solution of the generated model reproduced the near-wake coarse-grained dynamics almost exactly.

From this initial analysis, it was clear that the data and models were spatially dependent, so we next investigated how the generated CGLE models varied depending on the location within the wake they were modeling. By fitting a CGLE model to a given spatial subdomain and then sliding downstream, we developed smoothly varying local models throughout the entire wake. At each station within the wake, the fitted model’s NRMSE and normal form parameters ( $\sigma,\beta$ ) were computed. Figures 7 and 8 showed the spatial variation in NRMSE and ( $\sigma,\beta$ ) across different model libraries. These findings clearly showed how the properties of the local models divided the entire wake into distinct subregions. Furthermore, libraries with higher-order terms beyond the base CGLE terms uniformly improved the NRMSE of local models, but this improvement was most pronounced in the mid-wake region and least evident in the far-wake region. From this, it was clear that the near- and mid-wake regions were characterized by higher-order behavior that was less explained by the CGLE models, while the far-wake was dominated by diffusive dynamics. Once sparsity is introduced (Appendix B), using larger libraries of higher-order terms provided only limited reduction in error; at their core, the coarse-grained dynamics are modeled by the CGLE. Finally, the behavior of local and global models was contrasted by fitting a global heterogeneous model to the coarse-grained data of the entire wake. This was done by applying Taylor polynomials in $x$ of some order $n$ to the linear and cubic terms of the CGLE. The best global heterogeneous model improved upon the NRMSE of a global homogeneous model by over an order of magnitude, further confirming that the local behavior of the wake varies downstream.

These findings validate the CGLE as a ROM for two-dimensional cylinder vortex shedding. They join a host experimental, numerical, and analytic work which has previously investigated the Stuart-Landau and Ginzburg-Landau equations as ROMs for two- and three-dimensional cylinder vortex shedding [11, 12, 13, 61, 66, 68]. In particular, the CGLE had previously only been associated with three-dimensional cylinder vortex shedding, incorporating the spanwise coordinate of the flow as the spatial coordinate the CGLE [13, 68]. Our work uniquely incorporates the streamwise coordinate from the flowfield as the spatial coordinate in our one-dimensional CGLE, which had seen only limited prior attempts [56, 57]. The incorporation the streamwise coordinate into the CGLE model for cylinder vortex shedding has helped shed light on the changes in stability and behavior of downstream local models.

A further contribution of this work is the justification of time-delay embedding as an appropriate method for discovering a multidimensional system from a single data signal. The SLE and CGLE may be presented as either a single complex differential equation or a pair of coupled, real differential equations, hence two (real) data signals are required in order to discover the SLE and CGLE with SINDy. Whereas in [38] the coefficient of lift $C_{l}(t)$ is time-differentiated in order to generate a second oscillatory, out-of-phase signal, we time-delay embed our data. In the case of $C_{l}(t)$ , time-delay embedding generates similar results as time-differentiating: both cases produce an SLE model with low NRMSE and whose numerical solution closely matches the behavior of the original signal. In the case of $\overline{\omega}(x,t)$ , time-differentiating introduces additional higher-order behavior less explainable by a sparse model, whereas time-delay embedding does not. Time-differentiation also introduces and amplifies numerical errors, whereas time-delay embedding does not.

Future work includes increasing the value of the control parameter Re past the bifurcation and using the recent parameterized SINDy [74] to parameterize the local ROMs vs. Re. The SLE and CGLE only describe the Hopf bifurcation when the control parameter is in the vicinity of the bifurcation point [63], hence a similar study of local models but with $\text{Re}>>47$ may help explain the effect of increasing Re on the cylinder vortex shedding wake. Our methods may also help elucidate the normal forms of other bifurcations experienced in vortex shedding flows, for example those experienced in three-dimensional cylinder flow [75], in two-dimensional rotating cylinder flow [59], or in the flow over the fluidic pinball [76, 60]. Common in biological applications, some vortex streets do not present as rows of alternating vortices but instead may shed multiple large or small vortices at once [77, 78]; coarse-graining such data would result in rich spatiotemporal signals, prime candidates for our method of data-driven model discovery.

A final extension of this work may be towards feedback control systems. The SINDy method has been applied to controlled systems with great success [79], and control and suppression of vortex shedding, particularly vortex shedding past a cylinder, is a well-researched topic [1, 4]. There are many different strategies for vortex shedding control, from passive modifications to the body [6, 9] to active actuation of control surfaces [10]. Analysis of the coarse-grained wake dynamics under different control strategies may help identify the extent to which these control strategies are successful and what regions of the wake experience the greatest vortex suppression effects.

Acknowlegments

The authors acknowledge support from the National Science Foundation AI Institute in Dynamic Systems (grant number 2112085), the US Air Force Office of Scientific Research (FA9550-21-1-0178), and The Boeing Company.

References

[1] B. Mutlu Sumer. Hydrodynamics around Cylindrical Strucures, volume 26. World scientific.
[2] David J. Tritton. Experiments on the flow past a circular cylinder at low Reynolds numbers. 6(4):547–567.
[3] M. Matsumoto. Vortex shedding of bluff bodies: A review. 13(7-8):791–811.
[4] Saman Rashidi, Masoud Hayatdavoodi, and Javad Abolfazli Esfahani. Vortex shedding suppression and wake control: A review. 126:57–80.
[5] Giorgio Diana, Ferruccio Resta, Marco Belloli, and Daniele Rocchi. On the vortex shedding forcing on suspension bridge deck. 94(5):341–363.
[6] Allan Larsen, Søren Esdahl, Jacob E. Andersen, and Tina Vejrum. Storebælt suspension bridge–vortex shedding excitation and mitigation by guide vanes. 88(2-3):283–296.
[7] S. Mittal and A. Raghuvanshi. Control of vortex shedding behind circular cylinder for flows at low Reynolds numbers. 35(4):421–447.
[8] Kiyoung Kwon and Haecheon Choi. Control of laminar vortex shedding behind a circular cylinder using splitter plates. 8(2):479–486.
[9] Pj J. Strykowski and Kr R. Sreenivasan. On the formation and suppression of vortex ‘shedding’at low Reynolds numbers. 218:71–107.
[10] N. Fujisawa, Yl Kawaji, and K. Ikemoto. Feedback control of vortex shedding from a circular cylinder by rotational oscillations. 15(1):23–37.
[11] K. R. Sreenivasan, P. J. Strykowski, and D. J. Olinger. Hopf bifurcation, Landau equation, and vortex shedding behind circular cylinders. In Forum on Unsteady Flow Separation, volume 1, pages 1–13. ASME New York.
[12] Jan Dušek, Patrice Le Gal, and Philippe Fraunié. A numerical and theoretical study of the first Hopf bifurcation in a cylinder wake. 264:59–80.
[13] P. Albarède, M. Provansal, and L. Boyer. Modélisation par l’équation de Ginzburg-Landau du sillage tridimensionnel d’un obstacle allongé. 310(5):459–464.
[14] Pierre Albarède, Thomas Leweke, and Michel Provansal. The Ginzburg-Landau Equation as a Model of the Three-Dimensional Circular Cylinder Wake at Low Reynolds Numbers. In Bluff-Body Wakes, Dynamics and Instabilities: IUTAM Symposium, Göttingen, Germany September 7–11, 1992, pages 313–316. Springer.
[15] Bernd R. Noack and Helmut Eckelmann. A global stability analysis of the steady and periodic cylinder wake. 270:297–330.
[16] Michael Schumm, Eberhard Berger, and Peter A. Monkewitz. Self-excited oscillations in the wake of two-dimensional bluff bodies and their control. 271:17–53.
[17] Jared L. Callaham, Kazuki Maeda, and Steven L. Brunton. Robust flow reconstruction from limited measurements via sparse representation. 4(10):103907.
[18] Kimon Roussopoulos and Peter A. Monkewitz. Nonlinear modelling of vortex shedding control in cylinder wakes. 97(1-3):264–273.
[19] Shervin Bagheri, Dan S. Henningson, J. Hoepffner, and Peter J. Schmid. Input-output analysis and control design applied to a linear model of spatially developing flows. 62(2).
[20] Mark C. Cross and Pierre C. Hohenberg. Pattern formation outside of equilibrium. 65(3):851.
[21] Emanuele Zuccoli. Derivation of the Ginzburg-Landau equation and its application to the Taylor-Couette flow.
[22] Pierre C. Hohenberg and Alexei P. Krekhov. An introduction to the Ginzburg–Landau theory of phase transitions and nonequilibrium patterns. 572:1–42.
[23] Igor S. Aranson and Lorenz Kramer. The world of the complex Ginzburg-Landau equation. 74(1):99.
[24] Kunihiko Taira, Steven L. Brunton, Scott T. M. Dawson, Clarence W. Rowley, Tim Colonius, Beverley J. McKeon, Oliver T. Schmidt, Stanislav Gordeyev, Vassilios Theofilis, and Lawrence S. Ukeiley. Modal Analysis of Fluid Flows: An Overview. 55(12):4013–4041.
[25] Clarence W. Rowley and Scott T.M. Dawson. Model Reduction for Flow Analysis and Control. 49(1):387–417.
[26] Bernd R. Noack, Marek Morzynski, and Gilead Tadmor. Reduced-Order Modelling for Flow Control, volume 528. Springer Science & Business Media.
[27] Bernd R. Noack, Konstantin Afanasiev, Marek Morzyński, Gilead Tadmor, and Frank Thiele. A hierarchy of low-dimensional models for the transient and post-transient cylinder wake. 497:335–363.
[28] Bernd R. Noack, Michael Schlegel, Marek Morzynski, and Gilead Tadmor. Galerkin Method for Nonlinear Dynamics. In Reduced-Order Modelling for Flow Control, volume 528, pages 111–149. Springer Vienna.
[29] Eurika Kaiser, Bernd R. Noack, Laurent Cordier, Andreas Spohn, Marc Segond, Markus Abel, Guillaume Daviller, Jan Östh, Siniša Krajnović, and Robert K. Niven. Cluster-based reduced-order modelling of a mixing layer. 754:365–414.
[30] Steven L. Brunton and J. Nathan Kutz. Data-Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control. Cambridge University Press.
[31] Peter Benner, Serkan Gugercin, and Karen Willcox. A Survey of Projection-Based Model Reduction Methods for Parametric Dynamical Systems. 57(4):483–531.
[32] John L. Lumley. Toward a turbulent constitutive relation. 41(2):413–434.
[33] Peter Benner, Pawan Goyal, Boris Kramer, Benjamin Peherstorfer, and Karen Willcox. Operator inference for non-intrusive model reduction of systems with non-polynomial nonlinear terms. 372:113433.
[34] Steven L. Brunton, Bernd R. Noack, and Petros Koumoutsakos. Machine Learning for Fluid Mechanics. 52(1):477–508.
[35] Steven L. Brunton. Applying machine learning to study fluid mechanics. 37(12):1718–1726.
[36] Steven L. Brunton, Joshua L. Proctor, and J. Nathan Kutz. Discovering governing equations from data by sparse identification of nonlinear dynamical systems. 113(15):3932–3937.
[37] Samuel H. Rudy, Steven L. Brunton, Joshua L. Proctor, and J. Nathan Kutz. Data-driven discovery of partial differential equations. 3(4):e1602614.
[38] Jean-Christophe Loiseau, Bernd R. Noack, and Steven L. Brunton. Sparse reduced-order modelling: Sensor-based dynamics to full-state estimation. 844:459–490.
[39] Martin Schmelzer, Richard P. Dwight, and Paola Cinnella. Discovery of Algebraic Reynolds-Stress Models Using Sparse Symbolic Regression. 104(2-3):579–603.
[40] Jasper P. Huijing, Richard P. Dwight, and Martin Schmelzer. Data-driven RANS closures for three-dimensional flows around bluff bodies. 225:104997.
[41] Laure Zanna and Thomas Bolton. Data‐Driven Equation Discovery of Ocean Mesoscale Closures. 47(17):e2020GL088376.
[42] James M. Haile. Molecular Dynamics Simulation: Elementary Methods. John Wiley & Sons, Inc.
[43] Dwight Barkley. Linear analysis of the cylinder wake mean flow. 75(5):750.
[44] Joseph Bakarji and Daniel M. Tartakovsky. Data-driven discovery of coarse-grained equations. 434:110219.
[45] Floris Takens. Detecting strange attractors in turbulence. In Dynamical Systems and Turbulence, Warwick 1980, volume 898, pages 366–381. Springer Berlin Heidelberg.
[46] Steven L. Brunton, Bingni W. Brunton, Joshua L. Proctor, Eurika Kaiser, and J. Nathan Kutz. Chaos as an intermittently forced linear system. 8(1):19.
[47] Joseph Smagorinsky. General circulation experiments with the primitive equations: I. The basic experiment. 91(3):99–164.
[48] James W. Deardorff. A numerical study of three-dimensional turbulent channel flow at large Reynolds numbers. 41(2):453–480.
[49] Sebastian Kmiecik, Dominik Gront, Michal Kolinski, Lukasz Wieteska, Aleksandra Elzbieta Dawid, and Andrzej Kolinski. Coarse-Grained Protein Models and Their Applications. 116(14):7898–7936.
[50] Florian Müller-Plathe. Coarse-Graining in Polymer Simulation: From the Atomistic to the Mesoscopic Scale and Back. 3(9):754–769.
[51] George Sugihara, Robert May, Hao Ye, Chih-hao Hsieh, Ethan Deyle, Michael Fogarty, and Stephan Munch. Detecting Causality in Complex Ecosystems. 338(6106):496–500.
[52] Zhen Wang, Xia Huang, and Guodong Shi. Analysis of nonlinear dynamics and chaos in a fractional order financial system with time delay. 62(3):1531–1539.
[53] George Sugihara and Robert M. May. Nonlinear forecasting as a way of distinguishing chaos from measurement error in time series. 344(6268):734–741.
[54] Henry D. I. Abarbanel, Reggie Brown, John J. Sidorowich, and Lev Sh. Tsimring. The analysis of observed chaotic data in physical systems. 65(4):1331–1392.
[55] Pierre Albarède and Michel Provansal. Quasi-periodic cylinder wakes and the Ginzburg–Landau model. 291:191–222.
[56] A. Chiffaudel. Nonlinear stability analysis of two-dimensional patterns in the wake of a circular cylinder. 18(7):589.
[57] D. S. Park and L. G. Redekopp. A model for pattern selection in wake flows. 4(8):1697–1706.
[58] Jean-Marc Chomaz. Global instabilities in spatially developing flows: Non-normality and nonlinearity. 37:357–392.
[59] J. Sierra, David Fabre, Vincenzo Citro, and Flavio Giannetti. Bifurcation scenario in the two-dimensional laminar flow past a rotating cylinder. 905:A2.
[60] Nan Deng, Bernd R. Noack, Marek Morzyński, and Luc R. Pastur. Low-order model for successive bifurcations of the fluidic pinball. 884:A37.
[61] Denis Sipp and Anton Lebedev. Global stability of base and mean flows: A general approach and its applications to cylinder and open cavity flows. 593:333–358.
[62] Charles HK Williamson. Oblique and parallel modes of vortex shedding in the wake of a circular cylinder at low Reynolds numbers. 206:579–627.
[63] Yuri A. Kuznetsov, Iu A. Kuznetsov, and Y. Kuznetsov. Elements of Applied Bifurcation Theory, volume 112. Springer.
[64] Lev D. Landau. On the problem of turbulence. In Dokl. Akad. Nauk USSR, volume 44, page 311.
[65] John Trevor Stuart. On the non-linear mechanics of hydrodynamic stability. 4(1):1–21.
[66] M. Provansal, C. Mathis, and L. Boyer. Bénard-von Kármán instability: Transient and forced regimes. 182:1–22.
[67] Pierre Albarède and Peter A. Monkewitz. A model for the formation of oblique shedding and “chevron”patterns in cylinder wakes. 4(4):744–756.
[68] S. Le Dizès, P. A. Monkewitz, and P. Huerre. Weakly nonlinear analysis of spatially developing shear flows. 36:2675.
[69] Kunihiko Taira and Tim Colonius. The immersed boundary method: A projection approach. 225(2):2118–2137.
[70] Tim Colonius and Kunihiko Taira. A fast immersed boundary method using a nullspace approach and multi-domain far-field boundary conditions. 197(25-28):2131–2146.
[71] Hayden Schaeffer. Learning partial differential equations via data discovery and sparse optimization. 473(2197):20160446.
[72] Charles HK Williamson. Defining a universal and continuous Strouhal–Reynolds number relationship for the laminar vortex shedding of a circular cylinder. 31(10):2742–2744.
[73] Jeongyoung Park, Kiyoung Kwon, and Haecheon Choi. Numerical solutions of flow past a circular cylinder at Reynolds numbers up to 160. 12:1200–1205.
[74] Zachary G. Nicolaou, Guanyu Huo, Yihui Chen, Steven L. Brunton, and J. Nathan Kutz. Data-driven discovery and extrapolation of parameterized pattern-forming dynamics.
[75] Dwight Barkley, Laurette S. Tuckerman, and Martin Golubitsky. Bifurcation theory for three-dimensional flow in the wake of a circular cylinder. 61(5):5247.
[76] Nan Deng, Luc R. Pastur, Marek Morzyński, and Bernd R. Noack. Route to chaos in the fluidic pinball. In Fluids Engineering Division Summer Meeting, volume 51555, page V001T01A005. American Society of Mechanical Engineers.
[77] Eva Kanso, Jerrold E. Marsden, Clarence W. Rowley, and Juan B. Melli-Huber. Locomotion of articulated bodies in a perfect fluid. 15:255–289.
[78] Brendan Colvert, Mohamad Alsalman, and Eva Kanso. Classifying vortex wakes using neural networks. 13(2):025003.
[79] Eurika Kaiser, J. Nathan Kutz, and Steven L. Brunton. Sparse identification of nonlinear dynamics for model predictive control in the low-data limit. 474(2219):20180335.

Appendix A Libraries of Candidate Functions

In this work, SINDy is used to find a dynamical system modeling given input data. The input data $\mathcal{U}=[\overline{\omega},\eta]$ consists of two spatiotemporal signals: the coarse-grained vorticity and its time-delay embedding. Thus, the dynamical system will consist of two differential equations for $\overline{\omega}_{t}$ and $\eta_{t}$ . The matrix of candidate functions $\Theta(u,v)$ contains the functions possible in both equations, where $u$ and $v$ are our signals $\overline{\omega}$ and $\eta$ .

SL Library: Stuart-Landau equation (SLE). Polynomials up to 3rd order.
$\Theta=[\;u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3}\;]$

GL Library: Complex Ginzburg-Landau equation (CGLE). Polynomials up to 3rd order and derivatives up to 2nd order.
$\Theta=[\;u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3},u_{x},u_{xx},v_{x},v_{xx}\;]$

GL₂₄ Library: polynomials up to 5th order and derivatives up to 2nd order.
$\Theta=[\;u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3},u^{4},u^{3}v,u^{2}v^{2},uv^{3},v^{4},u^{5},u^{4}v,u^{3}v^{2},u^{2}v^{3},uv^{4},v^{5},u_{x},u_{xx},v_{x},v_{xx}\;]$

GL₄₉ Library: polynomials up to 3rd order, derivatives up to 2nd order, and cross-terms thereof.
$\Theta=[\;u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3},u_{x},u_{xx},v_{x},v_{xx},...$
$...\;u_{x}\cdot(u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3}),\;...$
$...\;u_{xx}\cdot(u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3}),\;...$
$...\;v_{x}\cdot(u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3}),\;...$
$...\;v_{xx}\cdot(u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3})\;]$

GL₁₀₄ Library: polynomials up to 5th order, derivatives up to 2nd order, and cross-terms thereof.
$\Theta=[\;u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3},u^{4},u^{3}v,u^{2}v^{2},uv^{3},v^{4},u^{5},u^{4}v,u^{3}v^{2},u^{2}v^{3},uv^{4},v^{5},u_{x},u_{xx},v_{x},v_{xx},...$
$...\;u_{x}\cdot(u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3},u^{4},u^{3}v,u^{2}v^{2},uv^{3},v^{4},u^{5},u^{4}v,u^{3}v^{2},u^{2}v^{3},uv^{4},v^{5}),\;...$
$...\;u_{xx}\cdot(u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3},u^{4},u^{3}v,u^{2}v^{2},uv^{3},v^{4},u^{5},u^{4}v,u^{3}v^{2},u^{2}v^{3},uv^{4},v^{5}),\;...$
$...\;v_{x}\cdot(u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3},u^{4},u^{3}v,u^{2}v^{2},uv^{3},v^{4},u^{5},u^{4}v,u^{3}v^{2},u^{2}v^{3},uv^{4},v^{5}),\;...$
$...\;v_{xx}\cdot(u,v,u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3},u^{4},u^{3}v,u^{2}v^{2},uv^{3},v^{4},u^{5},u^{4}v,u^{3}v^{2},u^{2}v^{3},uv^{4},v^{5})\;]$

H-GL₁ Library: GL library with Taylor polynomials in $x$ up to $n$ th order on the linear terms.
$\Theta=[\;(x+...+x^{n})u,(x+...+x^{n})v,...\\ u^{2},uv,v^{2},u^{3},u^{2}v,uv^{2},v^{3},u_{x},u_{xx},v_{x},v_{xx}\;]$

H-GL₃ Library: GL library with Taylor polynomials in $x$ up to $n$ th order on the cubic terms.
$\Theta=[\;u,v,u^{2},uv,v^{2},...\\ (x+...+x^{n})u^{3},(x+...+x^{n})u^{2}v,(x+...+x^{n})uv^{2},(x+...+x^{n})v^{3},...\\ u_{x},u_{xx},v_{x},v_{xx}\;]$

H-GL_1,3 Library: GL library with Taylor polynomials in $x$ up to $n$ th order on both the linear and cubic terms.
$\Theta=[\;(x+...+x^{n})u,(x+...+x^{n})v,...\\ u^{2},uv,v^{2},...\\ (x+...+x^{n})u^{3},(x+...+x^{n})u^{2}v,(x+...+x^{n})uv^{2},(x+...+x^{n})v^{3},...\\ u_{x},u_{xx},v_{x},v_{xx}\;]$

Appendix B Sparse Approximations of the CGLE

In this section, we realize the full potential of SINDy by iterating upon the sparsity parameter $\lambda$ . In previous sections, models with extremely low error were produced, but they have too many terms to feasibly implement numerically and solve. By tuning $\lambda$ , SINDy will retain only those terms which most explain the observed dynamics, with larger $\lambda$ eliminating more terms. We focus on 3 equally-sized subdomains of the wake: the near-, mid-, and far-wakes, encompassing $x\in[2,9]$ , $x\in[10,19]$ , and $x\in[38,45]$ , respectively. As bases for our sparse models, we use Libraries GL and its extensions GL₂₄, GL₄₉, and GL₁₀₄. We exclude heterogeneous libraries from this process as we seek sparse, interpretable equations that can be solved with our pseudospectral numerical method from § 4.1. For each subdomain, Figures 10(a)-(c) show NRMSE vs. $\lambda$ for each library and Figures 10(d)-(f) show the number of nonzero terms in the model vs. $\lambda$ .

The top row of Figure 10 shows the NRMSE vs. $\lambda$ for each of the four libraries across the three wake subregions. The bottom row of Figure 7 shows the number of nonzero terms of the generate models vs. $\lambda$ . As expected, the NRMSE is lowest for each library when $\lambda=0$ . At this value, Library GL₁₀₄ has the lowest NRMSE, although this effect is least noticeable for the far-wake region. This suggests the dynamics of the near- and mid-wake regions are better explained by the additional nonlinear terms than are the far-wake dynamics, which are known to be dominated by diffusion. These findings align with those of Figure 7, which shows generally similar dependence of the NRMSE on spatial region and model complexity. For each library and subdomain, as sparsity increases, the numbers of nonzero terms decrease and the NRMSEs increase from their nonsparse minimum values, with some of the curves reaching various plateaus. Once $\lambda$ has been increased large enough to eliminate all terms from a given model, that NRMSE reaches its maximum value of $\sqrt{2}$ . For a given library, the errors are comparable in the near- and mid-wakes and an order of magnitude lower in the far-wake, reinforcing that the diffusion-dominated dynamics of the far-wake are most easily explained.

Regarding the plateaus, the models produced here are relatively stable with respect to the sparsity, as opposed to regions where the model produced (and hence NRMSE) changes with even small changes in the sparsity. Additionally, for a given region, the observed plateaus generally overlap between the different libraries. While Libraries GL₂₄, GL₄₉, and GL₁₀₄ outperform Library GL at $\lambda=0$ , once any sparsity is introduced, models from these libraries generally either underperform or perform no better than models from Library GL. Further, the higher-order terms in Libraries GL₂₄, GL₄₉, and GL₁₀₄ are among the first terms dropped as $\lambda$ increases, and the terms retained are either the standard CGLE terms from Library GL, or a higher-order version (e.g. $u_{xx}$ and $v_{xx}$ from Library GL vs. $u^{2}u_{xx}$ and $u^{2}v_{xx}$ from Library GL₄₉). Hence while larger libraries do improve the error for nonsparse sparse models, at their core the coarse-grained dynamics are still modeled very well by the CGLE, even with with only a subset of terms. When interpretability and solvability of the model are desired and sparsity is introduced, the higher-order terms vanish and there is no benefit to using Libraries GL₂₄, GL₄₉, and GL₁₀₄ over Library GL. Altogether, these findings support our development of a one-dimensional Ginzburg-Landau model as a ROM for two-dimensional cylinder vortex shedding.

Data-Driven Discovery of a New Ginzburg-Landau Reduced-Order Model for Vortex Shedding