This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

The buckling load of cylindrical shells under axial compression depends on the cross-section curvature

Davit Harutyunyan and Andre Martins Rodrigues University of California Santa Barbara, [email protected] of California Santa Barbara, [email protected]
Abstract

It is known that the famous theoretical formula by Koiter for the critical buckling load of circular cylindrical shells under axial compression does not coincide with the experimental data. Namely, while Koiter’s formula predicts linear dependence of the buckling load λ(h)\lambda(h) of the shell thickness hh (h>0h>0 is a small parameter), one observes the dependence λ(h)h3/2\lambda(h)\sim h^{3/2} in experiments; i.e., the shell buckles at much smaller loads for small thickness. This theoretical prediction failure is believed to be caused by the so-called sensitivity to imperfections phenomenon (both, shape and load). Grabovsky and the first author have rigorously proven in [J. Nonl. Sci., Vol. 26, Iss. 1, pp. 83–119, Feb. 2016], that in the problem of circular cylindrical shells buckling under axial compression, a small load twist leads to the buckling load scaling λ(h)h5/4,\lambda(h)\sim h^{5/4}, while shape imperfections are likely to result in the scaling λ(h)h3/2.\lambda(h)\sim h^{3/2}. In this work we prove, that in fact the buckling load λ(h)\lambda(h) of cylindrical (not necessarily circular) shells under vertical compression depends on the curvature of the cross section curve. When the cross section is a convex curve with uniformly positive curvature, then λ(h)h,\lambda(h)\sim h, and when the the cross section curve has positive curvature except at finitely many points, then C1h8/5λ(h)C2h3/2C_{1}h^{8/5}\leq\lambda(h)\leq C_{2}h^{3/2} for hh small thickness h>0.h>0.

1 Introduction

Thin-walled shells are, in general, highly efficient structures. In order to produce reliable designs and to avoid unexpected catastrophic failures, one needs to understand buckling. Buckling occurs in a thin structure under loading, when the structure undergoes an overall change in configuration instead of acting in the primary fashion intended by their designers, leading to the failure of the structures. Physically speaking, buckling in a thin shell occurs, when the shell can absorb a great deal of membrane strain energy without deforming too much but it must deform much more in order to absorb an equivalent amount of bending strain energy. When this stored energy is converted into bending energy, buckling occurs, creating a visible change in the geometry of the shell (typically in the form of several dimples) to accommodate all the energy, e.g., Figure 1. Mathematically speaking, buckling can be interpreted as the instability of the equilibrium state that for a certain load will have two possible trajectories to follow, i.e., when in the stress-strain (or stress-deformation) diagram a bifurcation occurs. This phenomenon is mathematically described as the loss of positivity of the second variation of the total energy of the system. In engineering it is essential to have a good estimate on the critical stress that will trigger buckling. In the present work we revisit the problem of buckling of cylindrical shells under axial compression. In that problem, one starts applying homogeneous load of magnitude λ\lambda to the top of a cylindrical shell that is resting on a substrate, where λ\lambda is increased continuously from zero. It is observed that at very small load magnitudes λ,\lambda, the cylindrical shell will undergo a homogeneous deformation with no visible geometric changes. Then at some critical value λ=λ(h),\lambda=\lambda(h), the shell will buckle, producing a variety of deformation patterns, typically in the form of several (or single) dimples [References,References,References,References,References,References,References] shown in Figure 1111The second and the third cylinders are apparently deeper into the post-buckling regime. The dimple (dimples) typically appear with a ”click” and drop in the load magnitude (which corresponds to the bifurcation point), and disappear when unloading the shell. Some less common buckling patterns, such as formation of waves in the longitudinal direction or periodic-like wrinkling are also possible, see [References] and Figure 14 of [References] for more details.

Refer to caption
Refer to caption
Refer to caption
Figure 1: Buckled paper cylindrical shells under vertical load.

Despite being a very old problem with a lot of data available in the literature, (studies and experiments have been made since the mid-18th century), it still contains several unsolved puzzles even for the simplest geometry of perfectly circular cylindrical shells. For the case of circular cylindrical shells, it has been observed that the buckling load measured by experiments has a large discrepancy with the theoretical predictions made by the classical asymptotic formula [References,References,References,References]

λ(h)=Eh3(1ν2),\lambda(h)=\frac{Eh}{\sqrt{3\left(1-\nu^{2}\right)}}, (1.1)

which predicts a linear relation between the thickness of the cylinder and the critical buckling load λ(h).\lambda(h). Here EE and ν\nu are the Young modulus and the Poisson ratio of the material, respectively, and h=t/Rh=t/R is the shell dimensionless thickness, i.e., the ratio of the cylinder wall thickness tt to the cylinder radius RR. Note that in fact formula (1.1) was first derived by Lorenz [References] in 1911 and independently by Timoshenko [References] in 1914, but sometimes in the literature it also carries the name of Koiter, as Koiter derived the so-called Koiter circle associated to (1.1) in his Ph.D. thesis [References] in 1945, see also [References] for Koiter’s circle. On the experimental side, a great deal of experiments since the 1930’s show that the experimental critical stress is actually much lower than the theoretical formula (1.1), scaling like h3/2h^{3/2} with hh, e.g., [References,References]. It has been believed in the engineering and applied mathematics communities, that such a paradoxical behavior is in general due to the fact that the buckling load may be highly sensitive to shape or load imperfections [References,References,References,References,References,References,References,References,References]. Note that general geometric symmetry breaking or (even a very small) preexisting deformation have been believed to be important factors in the asymptotics of the critical buckling load. These questions have been addressed in the works [References,References,References,References] in an attempt to resolve the paradox using mainly numerical approach and/or reduced shell theory equations. One possible gap in these approaches may be whether the utilized reduced shell theory equations are indeed applicable to the problem under consideration and capture the sought parameters within the acceptable error. This concern is based in particular on the works [References,References,References], that reveal whether a specific reduced shell theory holds in a specific applied load and elastic energy regime. In the meantime the answer to those specific and important question is unknown. Another weakness was the presence of some heuristic arguments. On the other hand, on the rigorous side, Grabovsky and the first author rigorously proved in [References] that indeed Koiter’s asymptotic formula (1.1) must hold in the case of perfect cylindrical shells and perfect axial homogeneous loading. This was achieved by the improvement and application of the ”Thin structure buckling theory” by Grabovsky and Truskinovsky [References]. Another crucial component of the analysis in [References] was the derivation of the optimal asymptotic constants (not only the asymptotics, but also the leading term in it) in Korn and generalized Korn inequalities for circular cylindrical shells. Then Grabovsky and the first author went on to prove in [References] that in fact if even a very small twist (in the angular direction) is present in the shell loading, then the asymptotics of the buckling load has to drop to h5/4,h^{5/4}, see Table 1 below.

Cross-section (C.-S.) and load Circular C.-S., vertical load Circular C.-S., twist in the load
𝜶(θ)=(cos(θ),sin(θ))\bm{\alpha}(\theta)=(\cos(\theta),\sin(\theta)), 𝒕=λ𝒆z\bm{t}=-\lambda\bm{e}_{z} 𝜶(θ)=(cos(θ),sin(θ))\bm{\alpha}(\theta)=(\cos(\theta),\sin(\theta)), 𝒕=λ(𝒆z+ϵ𝒆θ)\bm{t}=-\lambda(\bm{e}_{z}+\epsilon\bm{e}_{\theta})
Buckling load asymptotics λ(h)=Eh3(1ν2)\lambda(h)=\frac{Eh}{\sqrt{3\left(1-\nu^{2}\right)}} λ(h)=Ch5/4\lambda(h)=Ch^{5/4}

Table 1: The dependence of the critical buckling load of circular cylindrical shells on the type of loading. Vertical load versus vertical load with a small twist.

This analysis to some extent gave an explanation to the fact of sensitivity of the buckling load to load imperfections. Also, a somewhat less rigorous argument in [References] demonstrated why one should expect the buckling load to drop to h3/2h^{3/2} in the presence of some small dimples in the shell.

Our task in the present work is the analysis of the ”sensitivity to imperfections” problem for some other kind of shape imperfections, that are diversions from the perfect cylindrical shell. Namely, we will consider cylindrical shells, generated by cylindrical surfaces, that are non-circular but have convex cross sections: Two main families will be analyzed.

  • (i)

    The first family contains cylindrical surfaces with convex cross sections that have uniformly positive curvature (when regarded from the exterior of the curve). An illustrative example of such a curve is given in the left half of Figure 2.

  • (ii)

    The second family contains cylindrical suraces with convex cross sections that have uniformly positive curvature (when regarded from the exterior of the curve) except from finitely many points on the curve, where the curvature vanishes. An illustrative example of such a curve is given in the right half of Figure 2.

Refer to caption
Refer to caption
Figure 2: Cylindrical surface cross-section curves with uniformly positive curvature (on the left), and uniformly positive curvature, except at finitely many points on the curve, where it vanishes (on the right), e.g., k(θ0)=0.k(\theta_{0})=0. The curve on the right behaves like (t,t4)(t,t^{4}) near the zero curvature points.

Note that in both cases the cylinder cross-section does not have to have any geometric symmetry property, but rather only has to be convex with some imposed curvature condition. We will prove that in case (i) the critical buckling load λ(h)\lambda(h) has the asymptotics ChCh, and in case (ii) we will prove the bounds C1h8/5λ(h)C2h3/2C_{1}h^{8/5}\leq\lambda(h)\leq C_{2}h^{3/2} in the vanishing thickness regime h0.h\to 0. The result in part (i) in particular disproves the longstanding believe that geometric symmetry breaking would lead to the drop in the asymptotics of λ(h),\lambda(h), i.e., there is an ϵ>0,\epsilon>0, such that λ(h)Ch1+ϵ\lambda(h)\leq Ch^{1+\epsilon}. Also, the result in part (ii) provides new evidence on how a geometric shape imperfection, which is a diversion from the perfect cylindrical shell may lead to the drop in the asymptotics of λ(h)\lambda(h) to at least h3/2.h^{3/2}.

As already pointed out, we will be working in the framework of the (improved) ”thin structure buckling theory” of Grabovsky and Truskinovsky [References] rigorously derived from three-dimensional nonlinear hyper-elasticity. Some of the main components in the analysis will be asymptotically sharp Korn inequalities for the displacement gradient components, proven for the shells under consideration.

The paper is organized as follows: In Section 2 we present a brief introduction to the general theory of slender structure buckling [References,References]. In Section 3 we will be formulating the main results of the paper. In order to apply the theory in Section 2, on one hand one needs to determine (with some amount of proximity) the so-called ”trivial branch”, and the asymptotic stress tensor, which will be done in Section 4.1. Also, on the other hand one needs to prove asymptotically sharp Korn and Korn-type inequalities for the displacement gradient components, thus in Sections 4.2-4.3 we will formulate and prove those. Finally, in Section 4.4 we will prove the main results on the buckling load.

2 Buckling of slender Structures

For the convenience of the reader, this section is devoted to the presentation of the general theory ”Buckling of slender structures” by Grabovsky and Truskinovsky [References], that was later extended by Grabovsky and the first author in [References]. In the presentation there will be some little amount of focus on the buckling of cylindrical shells, which is the subject of the paper. While we will try to keep the paper as self contained as possible, we will avoid going into the technical details and proofs referring the reader to the papers [References,References] for details.

2.1 Loading, deformation, energy

In what follows we will be working in the framework of hyper-elasticity. Let Ω3\Omega\subset\mathbb{R}^{3} be a bounded, simply-connected, and open set with a Lipschitz boundary, representing a hyper-elastic body in space, let 𝒕(𝒙)\bm{t}(\bm{x}) be dead traction applied to the boundary (or just some of it) of Ω.\Omega. A resulting deformation 𝒚=𝒚(𝒙),(𝒙Ω\bm{y}=\bm{y}(\bm{x}),\ (\bm{x}\in\Omega) is said to be stable, if it is a weak local minimizer of the total energy of the system, that is given by the formula

E(𝒚)=ΩW(𝒚(𝒙))𝑑𝒙Ω𝒚𝒕(𝒙)𝑑S(𝒙),E(\bm{y})=\int_{\Omega}W(\nabla\bm{y(\bm{x})})d\bm{x}-\int_{\partial\Omega}\bm{y}\cdot\bm{t}(\bm{x})dS(\bm{x}), (2.1)

where W(𝑭):3×3W(\bm{F})\colon\mathbb{R}^{3\times 3}\to\mathbb{R} is the elastic energy density function of the body. We will assume that WW is of class C3C^{3} in some neighborhood of the identity matrix 𝑰.\bm{I}. Also, in hyper-elasticity, WW satisfies the following fundamental properties:

  1. (P1)

    Positivity: W(𝑭)0W(\bm{F})\geq 0 for all 𝑭3×3,\bm{F}\in\mathbb{R}^{3\times 3}, and W(𝑰)=0.W(\bm{I})=0.

  2. (P2)

    Absence of prestress: W𝑭(𝑰)=𝟎.W_{\bm{F}}(\bm{I})=\mathbf{0}.

    Note that this condition follows from (P1) and the fact that WW is C3C^{3}-regular at 𝑰.\bm{I}. However, this condition is traditionally mentioned as it has the mechanical meaning of the absence of prestress.

  3. (P3)

    Frame indifference: W(𝑹𝑭)=W(𝑭)W(\bm{R}\bm{F})=W(\bm{F}) for every 𝑹SO(3).\bm{R}\in SO(3).

  4. (P4)

    Local stability of the trivial deformation 𝒚(𝒙)=𝒙:(𝑳0𝝃,𝝃)0\bm{y}(\bm{x})=\bm{x}:\left(\bm{L}_{0}\bm{\xi},\bm{\xi}\right)\geq 0 for any 𝝃3×3,\bm{\xi}\in\mathbb{R}^{3\times 3}, where 𝑳0=W𝑭𝑭(𝑰)\bm{L}_{0}=W_{\bm{F}\bm{F}}(\bm{I}) is the linearly elastic tensor of material properties.

  5. (P5)

    Non-degeneracy: (𝑳0𝝃,𝝃)=0\left(\bm{L}_{0}\bm{\xi},\bm{\xi}\right)=0 if and only if 𝝃T=𝝃\bm{\xi}^{T}=-\bm{\xi}.

In addition to the above standard properties (P1)-(P5), we will assume that the material is isotropic, i.e., the energy density WW satisfies the additional property (P6) below.

  • (P6)

    Isotropy: W(𝑭𝑹)=W(𝑭)W(\bm{F}\bm{R})=W(\bm{F}) for every 𝑹SO(3).\bm{R}\in SO(3).

Here, for a function W(𝑭):3,W(\bm{F})\colon\mathbb{R}^{3}\to\mathbb{R}, the symbols W𝑭W_{\bm{F}} and W𝑭𝑭W_{\bm{F}\bm{F}} denote the gradient and the Hessian of WW respectively, i.e.,

W𝑭(𝑭)=(Wfij(𝑭))1i,j3andW𝑭𝑭(𝑭)=(2Wfijfkl(𝑭))1i,j,k,l3,W_{\bm{F}}(\bm{F})=\left(\frac{\partial W}{\partial f_{ij}}(\bm{F})\right)_{1\leq i,j\leq 3}\quad\text{and}\quad W_{\bm{F}\bm{F}}(\bm{F})=\left(\frac{\partial^{2}W}{\partial f_{ij}\partial f_{kl}}(\bm{F})\right)_{1\leq i,j,k,l\leq 3},

where 𝑭=(fij)1i,j3.\bm{F}=(f_{ij})_{1\leq i,j\leq 3.} As it is known in mechanics, in the presence of isotropy homogeneous deformations are possible. We will also see later in Section 4.1 how the assumption of isotropy substantially simplifies the job of proving the existence of a trivial branch. Here and in what follows, (𝑨,𝑩)(\bm{A},\bm{B}) is the Frobenius inner product of matrices 𝑨=(aij)i,j=13,𝑩=(bij)i,j=133×3:\bm{A}=(a_{ij})_{i,j=1}^{3},\bm{B}=(b_{ij})_{i,j=1}^{3}\in\mathbb{R}^{3\times 3}: (𝑨,𝑩)=i,j=13aijbij.(\bm{A},\bm{B})=\sum_{i,j=1}^{3}a_{ij}b_{ij}. In the theory of hyper-elasticity, one typically chooses for the admissible set 𝒜\cal A of the deformations 𝒚(𝒙)\bm{y}(\bm{x}) to be the subspace of functions that belong to some Sobolev space W1,p(Ω)W^{1,p}(\Omega) (1<p1<p), which in addition satisfy some Dirichlet or/and natural Neumann boundary conditions on some complementary portions of Ω\partial\Omega yielded from the loading. Then the resulting deformation 𝒚(𝒙)\bm{y}(\bm{x}) must be a local minimizer of the energy E(𝒚)E(\bm{y}) in the admissible set 𝒜.\cal A. A more detailed discussion on the deformations of slender structures is presented in the section to follow.

2.2 Trivial branch

In the general theory of buckling of slender structures originated in [References], one considers a sequence of slender domains Ωh\Omega_{h} parametrized by a small parameter hh. The notion of slenderness used in this paper will be given a precise definition later in Section 2.3. In the case when Ωh\Omega_{h} is a shell, the parameter hh typically represents the thickness of the shell, or the dimensionless thickness, i.e., the thickness divided by one of the in-plane parameters. In our case, hh will represent the thickness of the non-circular cylinders Ωh\Omega_{h} under consideration, with constant height LL. Consider a loading program

𝒕(𝒙,h,λ)=λ𝒕h(𝒙)+O(λ2),𝒙Γ1h,\bm{t}(\bm{x},h,\lambda)=\lambda\bm{t}^{h}(\bm{x})+O(\lambda^{2}),\qquad\bm{x}\in\Gamma_{1}^{h}, (2.2)

applied to the part Γ1h\Gamma_{1}^{h} of the boundary of Ωh,\Omega_{h}, where λ\lambda is the load magnitude and 𝒕h\bm{t}^{h} is the load direction. For the problem of cylindrical shell axial compression, the part Γ1h\Gamma_{1}^{h} would be the top and the bottom parts of the boundary only. Assume for some λ0>0\lambda_{0}>0 the loading (2.2) results in a family of Lipschitz deformations 𝒚(𝒙;h,λ)W1,(Ωh,3)\bm{y}(\bm{x};h,\lambda)\in W^{1,\infty}(\Omega_{h},\mathbb{R}^{3}) for all λ[0,λ0],\lambda\in[0,\lambda_{0}], where as mentioned above, the field 𝒚(𝒙;h,λ)\bm{y}(\bm{x};h,\lambda) is a stable deformation for given boundary conditions, i.e., it is a weak local minimizer of the elastic energy

Eel.(𝒚)=ΩW(𝒚(𝒙))𝑑𝒙E_{el.}(\bm{y})=\int_{\Omega}W(\nabla\bm{y(\bm{x})})d\bm{x}

in the admissible set

𝒜λ={𝒚(𝒙,h,λ)W1,(Ωh,3):𝒚(𝒙,λ,h)=𝒈(𝒙,λ,h)onΓ1hin the trace sence},\mathcal{A}_{\lambda}=\{\bm{y}(\bm{x},h,\lambda)\in W^{1,\infty}(\Omega_{h},\mathbb{R}^{3})\ :\ \bm{y}(\bm{x},\lambda,h)=\bm{g}(\bm{x},\lambda,h)\ \text{on}\ \ \Gamma_{1}^{h}\ \text{in the trace sence}\}, (2.3)

for all λ[0,λ0],\lambda\in[0,\lambda_{0}], where the vector field 𝒈(𝒙,λ,h)W1,(Ωh;3)\bm{g}(\bm{x},\lambda,h)\in W^{1,\infty}\left(\Omega_{h};\mathbb{R}^{3}\right) represents the boundary conditions on the part Γ1h\Gamma_{1}^{h} where the load is applied, and it is typically defined specifically for each problem and results from the loading program (2.2). The family of deformations 𝒚(𝒙;h,λ)\bm{y}(\bm{x};h,\lambda) is then called a trivial branch. Next, one defines the so-called linearly elastic trivial branch.

Definition 2.1.

A family of stable (within the admissible set 𝒜λ\mathcal{A}_{\lambda}) Lipschitz deformations 𝐲(𝐱,h,λ)W1,(Ωh,3)\bm{y}(\bm{x},h,\lambda)\in W^{1,\infty}(\Omega_{h},\mathbb{R}^{3}) is called a linearly elastic trivial branch, if there exist h0>0,h_{0}>0, so that for every h[0,h0]h\in\left[0,h_{0}\right] and λ[0,λ0]\lambda\in\left[0,\lambda_{0}\right] one has:

  1. (i)

    𝒚(𝒙;h,0)=𝒙.\bm{y}(\bm{x};h,0)=\bm{x}.

  2. (ii)

    There exist a family of Lipschitz functions 𝒖h(𝒙)\bm{u}^{h}(\bm{x}), independent of λ\lambda, such that

    𝒚(𝒙;h,λ)𝑰λ𝒖h(𝒙)L(Ωh)Cλ2,\left\|\nabla\bm{y}(\bm{x};h,\lambda)-\bm{I}-\lambda\nabla\bm{u}^{h}(\bm{x})\right\|_{L^{\infty}\left(\Omega_{h}\right)}\leq C\lambda^{2}, (2.4)
  3. (iii)
    (𝒚)λ(𝒙;h,λ)𝒖h(𝒙)L(Ωh)Cλ.\left\|\frac{\partial(\nabla\bm{y})}{\partial\lambda}(\bm{x};h,\lambda)-\nabla\bm{u}^{h}(\bm{x})\right\|_{L^{\infty}\left(\Omega_{h}\right)}\leq C\lambda. (2.5)

    Here the constant C>0C>0 is independent of hh and λ\lambda.

We remark that neither uniqueness nor general stability of the trivial branch are assumed. It is important to note that, here, the term general stability is stability without the boundary conditions in (2.3). It is worth mentioning that as understood in [References], usually the family 𝒚(𝒙,h,λ)\bm{y}(\bm{x},h,\lambda) is not stable in general due to the possibility of infinitesimal flips. This is always the case for cylindrical shell compression problems due to possible infinitesimal rotations in the cross-section plane. Additionally the leading order term λ𝒖h(𝒙)\lambda\bm{u}^{h}(\bm{x}) of the nonlinear trivial branch is nothing else but the linear elastic displacement, that can be calculated solving the equations of linear elasticity (the Lamé system) (𝑳0e(𝒖h))=𝟎\nabla\cdot\left(\bm{L}_{0}e\left(\bm{u}^{h}\right)\right)=\mathbf{0} with the imposed boundary conditions, where e(λ𝒖h)=λ2(𝒖h+(𝒖h)T)e\left(\lambda\bm{u}^{h}\right)=\frac{\lambda}{2}(\nabla\bm{u}^{h}+\left(\nabla\bm{u}^{h})^{T}\right) is the linear strain.

2.3 The buckling load and buckling modes

Buckling of the thin structure Ωh\Omega^{h} occurs when the trivial branch 𝒚(𝒙;h,λ)\bm{y}(\bm{x};h,\lambda) becomes unstable for some value λ=λcrit(h)\lambda=\lambda_{crit}(h) for the first time. This happens because it becomes energetically more favorable to bend the structure rather than store more compressive energy. This bifurcation is detected by the change in the sign of the second variation of the energy:

δ2E(ϕ;h,λ)=Ωh(WFF(𝒚(x;h,λ))ϕ,ϕ)𝑑x,ϕVh,\delta^{2}E(\bm{\phi};h,\lambda)=\int_{\Omega_{h}}\left(W_{FF}(\nabla\bm{y}(x;h,\lambda))\nabla\bm{\phi},\nabla\bm{\phi}\right)dx,\qquad\bm{\phi}\in V_{h}, (2.6)

where VhW1,(Ωh)V_{h}\subset W^{1,\infty}(\Omega^{h}) is the vector space of admissible variations resulting from the loading program (2.2), i.e., from the admissible set 𝒜λ\mathcal{A}_{\lambda} in (2.3). Namely, this means that there exists λcrit(h)>0\lambda_{crit}(h)>0 such that the second variation is non negative when 0<λ<λcrit(h)0<\lambda<\lambda_{crit}(h) for all test functions ϕVh,\bm{\phi}\in V_{h}, but for λ>λcrit(h)\lambda>\lambda_{crit}(h) it can become negative for some choice of ϕ.\bm{\phi}. This observation leads to the following mathematical definition of the critical buckling load:

λcrit (h)=inf{λ>0:δ2E(ϕ;h,λ)<0 for some ϕVh}.\lambda_{\text{crit }}(h)=\inf\left\{\lambda>0:\delta^{2}E(\bm{\phi};h,\lambda)<0\text{ for some }\bm{\phi}\in V_{h}\right\}. (2.7)

The body Ωh\Omega_{h} is said to undergo a near-flip buckling, if limh0λcrit (h)=0.\lim_{h\to 0}\lambda_{\text{crit }}(h)=0. A buckling mode is generally understood as a different from zero variation ϕhVh\bm{\phi}^{*}_{h}\in V_{h}, such that δ2E(ϕh;h,λcrit(h))=0\delta^{2}E\left(\bm{\phi}_{h}^{*};h,\lambda_{crit}(h)\right)=0. In practice one is only interested in the leading term asymptotics (or sometimes even in the scaling) of the buckling load in hh as h0.h\to 0. Note that if one perturbs λcrit (h)\lambda_{\text{crit }}(h) by a small factor ϵ,\epsilon, i.e., one replaces it by (1+ϵ)λcrit (h),(1+\epsilon)\lambda_{\text{crit }}(h), then the second variation changes approximately by ϵλcrit (h)(δ2E)λ(ϕh;h,λcrit (h))\epsilon\lambda_{\text{crit }}(h)\frac{\partial\left(\delta^{2}E\right)}{\partial\lambda}\left(\bm{\phi}_{h}^{*};h,\lambda_{\text{crit }}(h)\right). This means that second variations at two different variations ϕ1\bm{\phi}_{1} and ϕ2\bm{\phi}_{2} differing by an infinitesimal value compared to λcrit (h)(δ2E)λ(ϕh;h,λcrit (h))\lambda_{\text{crit }}(h)\frac{\partial\left(\delta^{2}E\right)}{\partial\lambda}\left(\bm{\phi}_{h}^{*};h,\lambda_{\text{crit }}(h)\right) should not be distinguished. This observation leads to the following new definition of buckling loads and buckling modes in the broader sense.

Definition 2.2.

We say that a function λ(h)0\lambda(h)\rightarrow 0, as h0h\rightarrow 0 is a buckling load if

limh0λ(h)λcrit (h)=1.\lim_{h\rightarrow 0}\frac{\lambda(h)}{\lambda_{\text{crit }}(h)}=1.

Similarly, a buckling mode is a family of variations ϕhVh\{0},\bm{\phi}_{h}\in V_{h}\backslash\{0\}, such that

limh0δ2E(ϕh;h,λcrit (h))λcrit (h)(δ2E)λ(ϕh;h,λcrit (h))=0\lim_{h\rightarrow 0}\frac{\delta^{2}E\left(\bm{\phi}_{h};h,\lambda_{\text{crit }}(h)\right)}{\lambda_{\text{crit }}(h)\frac{\partial\left(\delta^{2}E\right)}{\partial\lambda}\left(\bm{\phi}_{h};h,\lambda_{\text{crit }}(h)\right)}=0

It turns out that under the conditions in Definition 2.1 [References], the buckling load and buckling modes can be captured by the so called constitutively linearized second variation:

δ2Ecl(ϕ;h,λ)=Ωh(𝑳0e(ϕ),e(ϕ)+λ𝝈h,ϕTϕ)𝑑𝒙,ϕVh,\delta^{2}E_{\mathrm{cl}}(\bm{\phi};h,\lambda)=\int_{\Omega_{h}}\left(\left\langle\bm{L}_{0}e(\bm{\phi}),e(\bm{\phi})\right\rangle+\lambda\left\langle\bm{\sigma}_{h},\nabla\bm{\phi}^{T}\nabla\bm{\phi}\right\rangle\right)d\bm{x},\qquad\bm{\phi}\in V_{h}, (2.8)

where 𝝈h(𝒙)=𝑳0e(𝒖h(𝒙))\bm{\sigma}_{h}(\bm{x})=\bm{L}_{0}e\left(\bm{u}^{h}(\bm{x})\right) is the linear elastic stress. From the condition (P4) on 𝑳0,\bm{L}_{0}, the first term in (2.8) is always nonnegative, thus the potentially destabilizing variations will be in the set:

Vhd={ϕVh:𝝈h,ϕTϕ0}.V_{h}^{d}=\left\{\phi\in V_{h}:\left\langle\bm{\sigma}_{h},\nabla\bm{\phi}^{T}\nabla\bm{\phi}\right\rangle\leq 0\right\}. (2.9)

Further, the constitutively linearized critical strain will be obtained by minimizing the Rayleigh quotient

(h,ϕ)=Ωh𝑳0e(ϕ),e(ϕ)𝑑xΩh𝝈h,ϕTϕ𝑑x,\Re(h,\bm{\phi})=-\frac{\int_{\Omega_{h}}\left\langle\bm{L}_{0}e(\bm{\phi}),e(\bm{\phi})\right\rangle dx}{\int_{\Omega_{h}}\left\langle\bm{\sigma}_{h},\nabla\bm{\phi}^{T}\nabla\bm{\phi}\right\rangle dx}, (2.10)

over the set of destabilizing variations Vhd.V_{h}^{d}. One can formalize this in the following definition.

Definition 2.3.

The constitutively linearized buckling load λcl(h)\lambda_{\mathrm{cl}}(h) is defined by

λcl(h)=infϕVhd(h,ϕ).\lambda_{\mathrm{cl}}(h)=\inf_{\bm{\phi}\in V_{h}^{d}}\Re(h,\bm{\phi}). (2.11)

The family of variations {ϕhVhd:h(0,h0)}\left\{\bm{\phi}_{h}\in V_{h}^{d}:h\in\left(0,h_{0}\right)\right\} is called a constitutively linearized buckling mode if

limh0(h,ϕh)λcl(h)=1.\lim_{h\rightarrow 0}\frac{\Re\left(h,\bm{\phi}_{h}\right)}{\lambda_{\mathrm{cl}}(h)}=1. (2.12)
Definition 2.4.

The body Ωh\Omega_{h} is slender if limh0K(Ωh)=0,\lim_{h\rightarrow 0}K\left(\Omega_{h}\right)=0, where the Korn constant is defined as

K(Ωh)=infϕVhe(ϕ)L2(Ωh)2ϕL2(Ωh)2.K\left(\Omega_{h}\right)=\inf_{\phi\in V_{h}}\frac{\|e(\bm{\phi})\|_{L^{2}\left(\Omega_{h}\right)}^{2}}{\|\nabla\bm{\phi}\|_{L^{2}\left(\Omega_{h}\right)}^{2}}.

The following theorem proven in [References] provides a formula for the buckling load and buckling modes.

Theorem 2.5.

Suppose that the body is slender in the sense of Definition 2.4. Assume that the constitutively linearized critical load λcl(h),\lambda_{\mathrm{cl}}(h), defined in (2.11) satisfies λcl(h)>0\lambda_{\mathrm{cl}}(h)>0 for all sufficiently small hh and

limh0λcl(h)2K(Ωh)=0.\lim_{h\rightarrow 0}\frac{\lambda_{\mathrm{cl}}(h)^{2}}{K\left(\Omega_{h}\right)}=0.

Then λcl(h)\lambda_{\mathrm{cl}}(h) is the buckling load and any constitutively linearized buckling mode ϕh\bm{\phi}_{h} is a buckling mode in the sense of Definition 2.2.

Now, in order to study the problem of buckling of cylindrical shells under axial compression, we see from Theorem 2.5 and from the above formalism, that we need to gain appropriate information about the trivial branch associated to the axial compression, the associated stress tensor, the Korn constant, and the load λcl(h).\lambda_{\mathrm{cl}}(h). This is done in the next sections.

3 Problem setting and main results

In this section we formulate the problems under consideration and the main results. Let 𝜶(θ)\bm{\alpha}(\theta) be a closed convex curve in the XYXY-plane that is parameterized by the arc length and has period/length pp. Then the cylindrical surface SS having horizontal cross section as the curve 𝜶\bm{\alpha} and height L>0L>0 will be given by the formula

S:𝒓(θ,z)=𝜶(θ)+z𝒆z,θ[0,p],z[0,L].S:\ \ \bm{r}(\theta,z)=\bm{\alpha}(\theta)+z\bm{e}_{z},\quad\theta\in[0,p],\ z\in[0,L].

Denote 𝒆θ=𝜶(θ)\bm{e}_{\theta}=\bm{\alpha}^{\prime}(\theta) and by 𝒆t\bm{e}_{t} the tangent and normal vectors to 𝜶\bm{\alpha} respectively. This will give rise to the local orthonormal basis (𝒆t,𝒆θ,𝒆z).(\bm{e}_{t},\bm{e}_{\theta},\bm{e}_{z}). We will be dealing with cylindrical shells Ωh\Omega_{h} with mid-surface SS and thickness h,h, i.e.,

Ωh={(t,θ,z):t[h/2,h/2],θ[0,p],z[0,L]}.\Omega_{h}=\left\{(t,\theta,z)\ :\ t\in[-h/2,h/2],\ \theta\in[0,p],\ z\in[0,L]\right\}. (3.1)

Denoting kθ=𝜶′′k_{\theta}=\|\bm{\alpha}^{\prime\prime}\| the curvature of the cross-section 𝜶,\bm{\alpha}, we have that the gradient of any vector field ϕ=(ϕt,ϕθ,ϕz)H1(Ωh,3)\bm{\phi}=(\phi_{t},\phi_{\theta},\phi_{z})\in H^{1}(\Omega_{h},\mathbb{R}^{3}) will be given in the local basis (𝒆t,𝒆θ,𝒆z)(\bm{e}_{t},\bm{e}_{\theta},\bm{e}_{z}) by

ϕ=[ϕt,tϕt,θkθϕθkθt+1ϕt,zϕθ,tϕθ,θ+kθϕtkθt+1ϕθ,zϕz,tϕz,θkθt+1ϕz,z],\nabla\bm{\phi}=\begin{bmatrix}\phi_{t,t}&\frac{\phi_{t,\theta}-k_{\theta}\,\phi_{\theta}}{k_{\theta}\,t+1}&\phi_{t,z}\\[10.0pt] \phi_{\theta,t}&\frac{\phi_{\theta,\theta}+k_{\theta}\,\phi_{t}}{k_{\theta}\,t+1}&\phi_{\theta,z}\\[10.0pt] \phi_{z,t}&\frac{\phi_{z,\theta}}{k_{\theta}\,t+1}&\phi_{z,z}\end{bmatrix}, (3.2)

where f,ηf_{,\eta} inside the gradient matrix denotes the partial derivative ηf.\partial_{\eta}f. Assume that the shell Ωh\Omega_{h} is resting on a substrate and is undergoing uniform and homogeneous vertical/axial loading222The terminology ”axial load” will be used even if 𝜶\bm{\alpha} does not have point-symmetry, and thus Ωh\Omega_{h} will have no axis.

𝒕(𝒙,h,λ)=λ𝒆z\bm{t}(\bm{x},h,\lambda)=-\lambda\bm{e}_{z} (3.3)

applied at the top of the shell. The main problem that we are concerned with is the determination of the asymptotics of critical buckling load λ(h)\lambda(h) in the thickness hh as h0.h\to 0. As already mentioned in Section 1, one has λ(h)h\lambda(h)\sim h as h0h\to 0 in the case when the cross section 𝜶\bm{\alpha} is a circle. We will be considering the cases when 𝜶\bm{\alpha} has positive curvature everywhere (k(θ)>0k(\theta)>0 for all θ[0,p]\theta\in[0,p]), and the case when 𝜶\bm{\alpha} has positive curvature everywhere except for finitely many points on the curve, at which the curvature has to vanish as 𝜶\bm{\alpha} is convex and thus k(θ)0k(\theta)\geq 0 for all θ[0,p].\theta\in[0,p]. Let us mention that in what follows the norm f\|f\| will be the L2L^{2} norm fL2(Ωh).\|f\|_{L^{2}(\Omega_{h})}. For the problem under consideration, the natural choice for the vector space VhV_{h} will be the subspace of all displacements ϕH1(Ωh)\bm{\phi}\in H^{1}(\Omega_{h}) that vanish at the top and the bottom of the shell, i.e,

Vh={ϕH1(Ωh):ϕ(t,θ,0)=ϕ(t,θ,L)=0,(t,θ)[h/2,h/2]×[0,p]}.V_{h}=\{\bm{\phi}\in H^{1}(\Omega_{h})\ :\ \bm{\phi}(t,\theta,0)=\bm{\phi}(t,\theta,L)=0,\ \ (t,\theta)\in[-h/2,h/2]\times[0,p]\}. (3.4)

This means that one shall study the stability of the trivial branch within the set of all Lipschitz deformations satisfying the same Dirichlet boundary conditions on the top and bottom of the cylinder. However, we will prove a stronger stability result, i.e., stability within a wider class of deformations. Note that if one only prescribes the values of the vertical component yzy_{z} of the trivial branch 𝒚\bm{y} at the top and the bottom of the shell, then the cylinder may undergo flip instability through infinitesimal rotations in the cross section plane [References,References]. Therefore one needs to impose some condition on yty_{t} or yθy_{\theta} to rule this possibility out. Following [References], we choose to impose zero integral condition on the tangential component ϕθ\phi_{\theta} in the zz direction, which gives the alternative subspace

Vhθ={ϕH1(Ωh):ϕz|z=0=ϕz|z=L=0Lϕθ(t,θ,z)𝑑z=0,(t,θ)[h/2,h/2]×[0,p]}.V_{h}^{\theta}=\{\bm{\phi}\in H^{1}(\Omega_{h})\ :\ \phi_{z}|_{z=0}=\phi_{z}|_{z=L}=\int_{0}^{L}\phi_{\theta}(t,\theta,z)dz=0,\ \ \forall(t,\theta)\in[-h/2,h/2]\times[0,p]\}. (3.5)

Let us explain this choice. Observe that we can extend the cylinder to the lower half-space by mirror reflection about the plane z=0,z=0, and consequently the components ϕt\phi_{t} and ϕθ\phi_{\theta} as even functions and ϕz\phi_{z} as an odd function, preserving the structure of the symmetric gradient e(ϕ).e(\bm{\phi}). This would allow us to expand the components ϕt\phi_{t} and ϕθ\phi_{\theta} in Fourier space in cosine series and the ϕz\phi_{z} component in sine series in the zz variable; see (4.19) and the paragraph between (4.18) and (4.19). Then the zero integral condition simply means that the independent of zz term in the expansion of ϕθ\phi_{\theta} is nonexistent. This would clearly prevent the cylinder from undergoing infinitesimal rotations in the cross section plane, which affects only the independent of zz variable terms in ϕ.\bm{\phi}. Note also, that unlike perfect circular cylindrical shells, where homogeneous deformations result in no change in the tangential component, (thus in [References], the authors have prescribed the tangential component ϕθ\phi_{\theta} at the top and the bottom of the cylinder too), homogeneous deformations in general cylindrical shells with any cross sections result in nonzero displacements in the tangential component too, thus a more relaxed subspace, such as (3.5) has to be considered. We will prove the following theorems, where the vector space VV is either VhV_{h} or Vhθ.V_{h}^{\theta}.

Theorem 3.1.

Assume 𝛂\bm{\alpha} is a convex C2C^{2} regular curve in the XYXY plane and assume the cylindrical shell Ωh\Omega_{h} is given as in (3.1). Assume Ωh\Omega_{h} is undergoing vertical loading as in (3.3), and the admissible variations ϕ\bm{\phi} belong to the subspace V.V. The following statements hold:

  • (i)

    If

    minθ[0,p]kθ(θ)=k>0,\min_{\theta\in[0,p]}k_{\theta}(\theta)=k>0,

    then one has

    λ(h)h,\lambda(h)\sim h,

    as h0.h\to 0.

  • (ii)

    Assume in addition that the cross section 𝜶\bm{\alpha} is of class333Note that the higher regularity C5C^{5} is only required for the Ansatz construction in (4.50). C5,C^{5}, and assume kθ(θ)>0k_{\theta}(\theta)>0 except for finitely many points θ1,θ2,,θn(0,p),\theta_{1},\theta_{2},\dots,\theta_{n}\in(0,p), where one has kθ(θi)=0k_{\theta}(\theta_{i})=0 and kθ′′(θi)0k_{\theta}^{\prime\prime}(\theta_{i})\neq 0444Due to the fact kθ0,k_{\theta}\geq 0, we have kθ(θi)=0k_{\theta}^{\prime}(\theta_{i})=0 for i=1,2,,n.i=1,2,\dots,n. for i=1,2,,n.i=1,2,\dots,n. This will imply quadratic growth of the curvature at the points θi,\theta_{i}, i.e.,

    c|θθi|2kθ(θ)1c|θθi|2,θ[0,p],i=1,2,,n,c|\theta-\theta_{i}|^{2}\leq k_{\theta}(\theta)\leq\frac{1}{c}|\theta-\theta_{i}|^{2},\qquad\theta\in[0,p],\ i=1,2,\dots,n, (3.6)

    for some constant c>0.c>0. Then there exist constants C1,C2>0,C_{1},C_{2}>0, depending only on the cylinder height LL and the cross-section 𝜶,\bm{\alpha}, such that one has

    C1h8/5λ(h)C2h3/2,C_{1}h^{8/5}\leq\lambda(h)\leq C_{2}h^{3/2},

    as h0.h\to 0.

A remark is in order.

Remark 3.2.

Part (i) of Thereom 3.1 implies that no matter what the geometry of the cross section 𝛂\bm{\alpha} is, the critical buckling load λ(h)\lambda(h) will scale like hh as h0h\to 0 as long as 𝛂\bm{\alpha} has strictly positive curvature everywhere. This in particular means that the buckling load asymptotics in the vanishing thickness is independent of the fact whether 𝛂\bm{\alpha} has any kind of symmetry or not. Hence, the critical buckling load asymptotics is not sensitive to the initial symmetry of the undeformed configuration. Part (ii) provides some evidence for the phenomenon that the buckling load may drop to h3/2h^{3/2} if the cylinder has some zero longitudinal curvature sections.

In order to prove Theorem 3.1, we will need the following Korn and Korn-like inequalities that can be considered as part of the main results of the paper.

Theorem 3.3.

Let the cylindrical shell Ωh\Omega_{h} be as in Theorem 3.1. The following statements hold:

  • (i)

    If

    minθ[0,p]kθ(θ)=k>0\min_{\theta\in[0,p]}k_{\theta}(\theta)=k>0

    then there exist a constant h~>0,\tilde{h}>0, depending only on L,maxkθ,L,\max k_{\theta}, and k,k, such that

    inf𝒖Ve(ϕ)2col3(ϕ)2h,\inf_{\bm{u}\in V}\frac{\|e(\bm{\phi})\|^{2}}{\|\mathrm{col}_{3}(\nabla\bm{\phi})\|^{2}}\sim h, (3.7)

    for all h(0,h~).h\in(0,\tilde{h}).

  • (ii)

    Assume the conditions in part (ii) of Theorem 3.1 are satisfied. Then there exist constants C1,C2,C3,C4,h~>0,C_{1},C_{2},C_{3},C_{4},\tilde{h}>0, depending only on L,maxkθ,L,\max k_{\theta}, and c,c, such that the Korn and Korn-like inequalities hold:

    C1h12/7infϕVe(ϕ)2ϕ2C2h5/3,C3h8/5infϕVe(ϕ)2col3(ϕ)2C4h3/2,C_{1}h^{12/7}\leq\inf_{\bm{\phi}\in V}\frac{\|e(\bm{\phi})\|^{2}}{\|\nabla\bm{\phi}\|^{2}}\leq C_{2}h^{5/3},\quad\quad C_{3}h^{8/5}\leq\inf_{\bm{\phi}\in V}\frac{\|e(\bm{\phi})\|^{2}}{\|\mathrm{col}_{3}(\nabla\bm{\phi})\|^{2}}\leq C_{4}h^{3/2}, (3.8)

    for all h(0,h~).h\in(0,\tilde{h}). These results are presented in the schematic table 2 below.

Remark 3.4.

It is proven in [References, Theorem 3.2] that under the assumptions in part (i) of Theorem 3.3, Korn’s first inequality holds:

ϕ2Ch3/2e(ϕ)2,\|\nabla\bm{\phi}\|^{2}\leq\frac{C}{h^{3/2}}\|e(\bm{\phi})\|^{2}, (3.9)

for all vector fields ϕVh\bm{\phi}\in V_{h} and all h(0,h~).h\in(0,\tilde{h}). This will be a useful component in the proof of part (i) of Theorem 3.1.

Cross-section (C.-S.) and load C.-S. with uniformly positive curvature, vertical load C.-S. with positive curvature, except at finitely many points, vertical load
k(θ)>0,k(\theta)>0, 𝒕=λ𝒆z\bm{t}=-\lambda\bm{e}_{z} k(θ)0k(\theta)\geq 0 and k(θn)=0k(\theta_{n})=0 for finitely many θn\theta_{n}, 𝒕=λ𝒆z\bm{t}=-\lambda\bm{e}_{z}
Buckling load asymptotics λ(h)=Ch\lambda(h)=Ch Ch8/5λ(h)Ch3/2Ch^{8/5}\leq\lambda(h)\leq Ch^{3/2}

Table 2: The dependence of the critical buckling load of convex cylindrical shells on the cross-section curvature. Cross-sections with uniformly positive curvature versus cross-sections with nonnegative curvature, that vanishes only at finitely many points on the curve.

4 Proof of the main results

4.1 The trivial branch and the stress tensor

In this section we will demonstrate how one can apply the theory presented in Section 2 to the problem of axial compression of cylindrical shells with any convex cross-section. First we will calculate the trivial branch 𝒚(𝒙,λ,h)\bm{y}(\bm{x},\lambda,h) resulting from the compression 𝒕(𝒙,λ,h)=λ𝒆z\bm{t}(\bm{x},\lambda,h)=-\lambda\bm{e}_{z} at the top of the shell. The vector field 𝒖h\bm{u}^{h} then will be obtained, which will yield a formula for the linear elastic stress tensor 𝝈h\bm{\sigma}_{h}, thus the minimization problem (2.11) will be identified, which will be studied in the next section. The compression problem is equivalent to the boundary-value problem, where one prescribes the displacement at the top and the bottom of the shell and at the same time solving the system of Euler-Lagrange equations (equations of equilibrium). The boundary conditions on the vertical component of 𝒚\bm{y} yielding from the axial compression will be yz(t,θ,0,λ,h)=0y_{z}(t,\theta,0,\lambda,h)=0 at the bottom and yz(t,θ,L,λ,h)=(1λ)Ly_{z}(t,\theta,L,\lambda,h)=(1-\lambda)L at the top. It is natural to seek the trivial branch among homogeneous deformations, yielding homogeneous thickening of the shell in the cross-section plane. Those are clearly given by

yt=(1+a(λ))(t+𝜶(θ)𝒆t(θ)),yθ=(1+a(λ))𝜶(θ)𝒆θ(θ),yz=(1λ)z,y_{t}=(1+a(\lambda))(t+\bm{\alpha}(\theta)\cdot\bm{e}_{t}(\theta)),\qquad y_{\theta}=(1+a(\lambda))\bm{\alpha}(\theta)\cdot\bm{e}_{\theta}(\theta),\qquad y_{z}=(1-\lambda)z, (4.1)

where 𝒚(t,θ,z)=yt𝒆t+yθ𝒆θ+yz𝒆z,\bm{y}(t,\theta,z)=y_{t}\bm{e}_{t}+y_{\theta}\bm{e}_{\theta}+y_{z}\bm{e}_{z}, and a(λ)a(\lambda) is a smooth enough function satisfying a(0)=0,a(0)=0, and will be determined by the equations of equilibrium and the natural boundary conditions yielding from the energy minimization problem. For the sake of notation simplicity, we may abbreviate 𝒚(t,θ,z;h,λ)=𝒚(t,θ,z)=𝒚(𝒙)\bm{y}(t,\theta,z;h,\lambda)=\bm{y}(t,\theta,z)=\bm{y}(\bm{x}) while keeping in mind that the trivial branch can depend on hh and λ\lambda. The plan is to prove the existence of a trivial branch given as in (4.1). It is easy to see that when minimizing the elastic energy ΩhW(𝒚¯)𝑑𝒙\int_{\Omega_{h}}W(\nabla\bar{\bm{y}})d\bm{x} subject to boundary conditions 𝒚¯(𝒙)\bar{\bm{y}}(\bm{x})=𝒚(𝒙)\bm{y}(\bm{x}) at z=0,Lz=0,L for 𝒚\bm{y} as in (4.1), then any local minimizer 𝒚¯H1(Ωh)\bar{\bm{y}}\in H^{1}(\Omega_{h}) must satisfy the equations of equilibrium

𝑷(𝒚¯(𝒙))=0,𝒙Ωh,\nabla\cdot\bm{P}(\nabla\bar{\bm{y}}(\bm{x}))=0,\qquad\bm{x}\in\Omega_{h}, (4.2)

together with the natural boundary conditions

𝑷(𝒚¯)𝒆t=0att=±h/2,\bm{P}(\nabla\bar{\bm{y}})\bm{e}_{t}=0\qquad\text{at}\qquad t=\pm h/2, (4.3)

where 𝑷(𝑭)=W𝑭(𝑭)\bm{P}(\bm{F})=W_{\bm{F}}(\bm{F}) is the Piola-Kirchhoff stress tensor. Also, note that if we do not prescribe the tt or θ\theta component of the field y¯\bar{y} at z=0,L,z=0,L, then we will get additional natural boundary conditions such as

𝑷(𝒚¯)𝒆z𝒆t=0atz=0,L,\displaystyle\bm{P}(\nabla\bar{\bm{y}})\bm{e}_{z}\cdot\bm{e}_{t}=0\qquad\text{at}\qquad z=0,L, (4.4)
𝑷(𝒚¯)𝒆z𝒆θ=0atz=0,L,\displaystyle\bm{P}(\nabla\bar{\bm{y}})\bm{e}_{z}\cdot\bm{e}_{\theta}=0\qquad\text{at}\qquad z=0,L,

respectively. The existence of the trivial branch is proved in the following Lemma:

Lemma 4.1.

Assume that W(𝐅)W(\bm{F}) is three times continuously differentiable in a neighborhood of the identity matrix 𝐅=𝐈\bm{F}=\bm{I} and satisfies the properties (P1)-(P6). Then there exists a constant λ0>0\lambda_{0}>0 and a unique function a(λ)C2([0,λ0],)a(\lambda)\in C^{2}([0,\lambda_{0}],\mathbb{R}), such that a(0)=0a(0)=0 and the family 𝐲(𝐱)=(yt,yθ,yz)\bm{y}(\bm{x})=(y_{t},y_{\theta},y_{z}) given by (4.1) satisfies the equations of equilibrium (4.2) and all of the boundary conditions (4.3)-(4.4). Moreover, the trivial branch 𝐲(𝐱)\bm{y}(\bm{x}) also fulfills all the conditions in Definition 2.1 as is required for the general theory to apply.

Proof.

For the proof we adopt the strategy in [References]. First of all note that as homogeneous deformations always satisfy the equations of equilibrium, we only have to verify the boundary conditions in (4.3) and (4.4). Letting 𝑭=𝒚\bm{F}=\nabla\bm{y} and 𝑪=𝑭T𝑭\bm{C}=\bm{F}^{T}\bm{F}, we have by the isotropy (P6) that W(𝑭)=W~(𝑪)W(\bm{F})=\tilde{W}\left(\bm{C}\right) for some function W~\tilde{W} that is three times continuously differentiable in a neighborhood of the identity matrix 𝑰\bm{I}. We have by simple algebra for the Piola-Kirchhoff stress tensor the formula

𝑷(𝑭)=W𝑭(𝑭)=2𝑭W~𝑪(𝑪).\bm{P}(\bm{F})=W_{\bm{F}}(\bm{F})=2\bm{F}\tilde{W}_{\bm{C}}\left(\bm{C}\right). (4.5)

Taking into account the form given in (4.1) and Frenet?Serret formulas 𝜶′′=kθ𝒆t,\bm{\alpha}^{\prime\prime}=-k_{\theta}\bm{e}_{t}, 𝒆t=kθ𝜶,\bm{e}^{\prime}_{t}=k_{\theta}\bm{\alpha}^{\prime}, we have

yt,t\displaystyle y_{t,t} =1+a(λ),yt,θ=kθ(1+a(λ))𝜶𝜶,yt,z=0,\displaystyle=1+a(\lambda),\quad y_{t,\theta}=k_{\theta}(1+a(\lambda))\bm{\alpha}\cdot\bm{\alpha}^{\prime},\quad y_{t,z}=0,
yθ,t\displaystyle y_{\theta,t} =0,yθ,θ=(1+a(λ))(1+𝜶𝜶′′),yθ,z=0,\displaystyle=0,\quad y_{\theta,\theta}=(1+a(\lambda))(1+\bm{\alpha}\cdot\bm{\alpha}^{\prime\prime}),\quad y_{\theta,z}=0,
yz,t\displaystyle y_{z,t} =0,yz,θ=0,yz,z=1λ.\displaystyle=0,\quad y_{z,\theta}=0,\quad y_{z,z}=1-\lambda.

Consequently, recalling the formula (3.2), we have that the gradient 𝒚\nabla\bm{y} in the curvilinear coordinates 𝒆t,𝒆θ,𝒆z\bm{e}_{t},\bm{e}_{\theta},\bm{e}_{z} is represented as:

𝒚=[yt,tyt,θkθyθkθt+1yt,zyθ,tyθ,θ+kθytkθt+1yθ,zyz,tyz,θkθt+1yz,z]=[1+a(λ)0001+a(λ)0001λ],\nabla\bm{y}=\begin{bmatrix}y_{t,t}&\frac{y_{t,\theta}-k_{\theta}\,y_{\theta}}{k_{\theta}\,t+1}&y_{t,z}\\[10.0pt] y_{\theta,t}&\frac{y_{\theta,\theta}+k_{\theta}\,y_{t}}{k_{\theta}\,t+1}&y_{\theta,z}\\[10.0pt] y_{z,t}&\frac{y_{z,\theta}}{k_{\theta}\,t+1}&y_{z,z}\end{bmatrix}=\begin{bmatrix}1+a(\lambda)&0&0\\[10.0pt] 0&1+a(\lambda)&0\\[10.0pt] 0&0&1-\lambda\end{bmatrix}, (4.6)

and thus we can calculate

𝑪=[(1+a(λ))2000(1+a(λ))2000(1λ)2].\bm{C}=\begin{bmatrix}(1+a(\lambda))^{2}&0&0\\[10.0pt] 0&(1+a(\lambda))^{2}&0\\[10.0pt] 0&0&(1-\lambda)^{2}\end{bmatrix}. (4.7)

This clearly shows that the matrices 𝑭\bm{F} and 𝑪=𝑭T𝑭\bm{C}=\bm{F}^{T}\bm{F} are diagonal. Additionally, the frame indifference condition (P3) implies that W~(𝑹𝑪𝑹T)=W~(𝑪)\tilde{W}\left(\bm{R}\bm{C}\bm{R}^{T}\right)=\tilde{W}(\bm{C}) for all 𝑹SO(3)\bm{R}\in SO(3). Differentiating this relation in 𝑹\bm{R} at 𝑹=𝑰\bm{R}=\bm{I} one concludes that W~𝑪(𝑪)\tilde{W}_{\bm{C}}(\bm{C}) commutes with 𝑪\bm{C} for any 𝑭3×3\bm{F}\in\mathbb{R}^{3\times 3} and 𝑪=𝑭T𝑭;\bm{C}=\bm{F}^{T}\bm{F}; see [References, page 290] for details. Consequently if 𝑪\bm{C} is diagonal with distinct diagonal entries, then W~𝑪(𝑪)\tilde{W}_{\bm{C}}(\bm{C}) must be diagonal as well, while if 𝑪\bm{C} is diagonal with possibly equal diagonal entries, then small diagonal perturbations will reduce the situation to the first case, and thus W𝑪~(𝑪)\tilde{W_{\bm{C}}}(\bm{C}) will be diagonal too as long as 𝑪\bm{C} is so. Hence, as in our case the matrix 𝑪\bm{C} is diagonal, also W𝑪~(𝑪)\tilde{W_{\bm{C}}}(\bm{C}) and the Piola-Kirchhoff stress tensor 𝑷(𝑭)\bm{P}(\bm{F}) given in (4.5) have to be diagonal as well. Therefore the boundary conditions in (4.4) are automatically satisfied, and the one in (4.3) reduces to the single equation

W^𝑪((1+a(λ))2(𝒆t𝒆t+𝒆θ𝒆θ)+(1λ)2𝒆z𝒆z)𝒆t𝒆t=0,\hat{W}_{\bm{C}}\left((1+a(\lambda))^{2}\left(\bm{e}_{t}\otimes\bm{e}_{t}+\bm{e}_{\theta}\otimes\bm{e}_{\theta}\right)+(1-\lambda)^{2}\bm{e}_{z}\otimes\bm{e}_{z}\right)\bm{e}_{t}\cdot\bm{e}_{t}=0, (4.8)

the solvability of which is basically guaranteed by the implicit function theorem. Indeed, the absence of prestress condition (P2) and (4.5) for 𝑭=𝑰\bm{F}=\bm{I} imply that equality (4.8) is fulfilled at the point (0,a(0))=(0,0).(0,a(0))=(0,0). For the nonzero condition in the implicit function theorem at λ=0\lambda=0 we get

𝑳0𝒆t𝒆t0,\bm{L}_{0}\bm{e}_{t}\cdot\bm{e}_{t}\neq 0,

which is fulfilled due to (P5). Consequently, the implicit function theorem guarantees the existence of a C2C^{2} smooth a(λ)a(\lambda) function in some neighborhood of λ=0,\lambda=0, which completes the proof of the existence part of Lemma 4.1. Finally we calculate

𝒖h(𝒙)=𝒚(𝒙;h,λ)λ|λ=0=a(0)(t+𝜶𝒆t)𝒆t+(a(0)𝜶𝒆θ)𝒆θz𝒆z,\bm{u}^{h}(\bm{x})=\frac{\partial\bm{y}(\bm{x};h,\lambda)}{\partial\lambda}\bigg{|}_{\lambda=0}=a^{\prime}(0)(t+\bm{\alpha}\cdot\bm{e}_{t})\bm{e}_{t}+(a^{\prime}(0)\bm{\alpha}\cdot\bm{e}_{\theta})\bm{e}_{\theta}-z\bm{e}_{z},

which does not depend on hh and thus by the C2C^{2} smoothness of the function a(λ)a(\lambda) we conclude that the conditions (2.4) and (2.5) are fulfilled as well.

Now in order to identify the minimization problem (2.11), we need to calculate the linear elastic stress tensor. We have differentiating (4.6) in λ\lambda at λ=0,\lambda=0, that

𝒖h(x)=[a(0)000a(0)0001],\nabla\bm{u}^{h}(x)=\begin{bmatrix}a^{\prime}(0)&0&0\\[10.0pt] 0&a^{\prime}(0)&0\\[10.0pt] 0&0&-1\end{bmatrix},

thus we get

e(𝒖h(x))=[a(0)000a(0)0001],e(\bm{u}^{h}(x))=\begin{bmatrix}a^{\prime}(0)&0&0\\[10.0pt] 0&a^{\prime}(0)&0\\[10.0pt] 0&0&-1\end{bmatrix},

where a(0)=νa^{\prime}(0)=\nu is the Poisson’s ratio. As the material is isotropic we have for the linear elastic stress tensor 𝝈h\bm{\sigma}_{h} the formula σhij=2μν12νδijTr(e(𝒖h))+2μeij(𝒖h),\sigma_{h}^{ij}=\frac{2\mu\nu}{1-2\nu}\delta_{ij}\mathrm{Tr}(e(\bm{u}^{h}))+2\mu e_{ij}(\bm{u}^{h}), where μ\mu is the Lamé parameter. This leads to the form

𝝈h=[00000000E],\bm{\sigma}_{h}=\begin{bmatrix}0&0&0\\[10.0pt] 0&0&0\\[10.0pt] 0&0&-E\end{bmatrix}, (4.9)

where EE is the Young’s modulus. Now we can identify the set VhdV_{h}^{d} of destabilizing variations given in (2.9), which will be the set all variations ϕVh\bm{\phi}\in V_{h} such that

Ωh(𝝈h,ϕTϕ)𝑑𝒙=E(ϕt,z2+ϕθ,z2+ϕz,z2)0,-\int_{\Omega^{h}}(\bm{\sigma}_{h},\nabla\bm{\phi}^{T}\nabla\bm{\phi})d\bm{x}=E(\left\|\phi_{t,z}\right\|^{2}+\left\|\phi_{\theta,z}\right\|^{2}+\left\|\phi_{z,z}\right\|^{2})\geq 0,

i.e., it coincides exactly with Vh.V_{h}. Finally, the minimization problem in (2.11) to be studied reduces to

λcl(h)=infϕVhΩh𝑳0e(ϕ),e(ϕ)𝑑𝒙Ecol3(ϕ)2,\lambda_{cl}(h)=\inf_{\bm{\phi}\in V_{h}}\frac{\int_{\Omega_{h}}\left\langle\bm{L}_{0}e(\bm{\phi}),e(\bm{\phi})\right\rangle d\bm{x}}{E\cdot\|\mathrm{col}_{3}(\nabla\bm{\phi})\|^{2}}, (4.10)

which will be addressed in Section 4.4.

4.2 Korn’s Inequalities: Ansatz free lower bounds

Proof of Theorem 3.3.

Part (i). In what follows all the constants C>0C>0 and Ci>0C_{i}>0 will depend only on L,maxkθ,k,L,\max k_{\theta},k, and c.c. While we will do the proof for both spaces VhV_{h} and VhθV_{h}^{\theta} in parallel, we will provide additional details to address the more specific case V=VhθV=V_{h}^{\theta} when needed. The tools and strategies developed in [References,References,References] will be adopted. Namely, we introduce the so-called simplified gradient 𝑮,\bm{G}, which is obtained by inserting t=0t=0 in the denominators of the second column of (3.2):

𝑮=[ϕt,tϕt,θkθϕθϕt,zϕθ,tϕθ,θ+kθϕtϕθ,zϕz,tϕz,θϕz,z].\bm{G}=\begin{bmatrix}\phi_{t,t}&\phi_{t,\theta}-k_{\theta}\phi_{\theta}&\phi_{t,z}\\[10.0pt] \phi_{\theta,t}&\phi_{\theta,\theta}+k_{\theta}\phi_{t}&\phi_{\theta,z}\\[10.0pt] \phi_{z,t}&\phi_{z,\theta}&\phi_{z,z}\end{bmatrix}. (4.11)

For the sake of simplicity we will be proving all the estimates in Theorem 3.3 with the gradient ϕ\nabla\bm{\phi} replaced by the simplified gradient 𝑮\bm{G} first, and then we will make the reverse step by virtue of the obvious bounds

𝑮syme(ϕ)𝑮ϕhϕfor allϕH1(Ωh),\|\bm{G}^{sym}-e(\bm{\phi})\|\leq\|\bm{G}-\nabla\bm{\phi}\|\leq h\|\nabla\bm{\phi}\|\quad\text{for all}\quad\bm{\phi}\in H^{1}(\Omega_{h}), (4.12)

where 𝑮sym=12(𝑮+𝑮T)\bm{G}^{sym}=\frac{1}{2}(\bm{G}+\bm{G}^{T}) is the symmetric part of 𝑮.\bm{G}. We aim to prove the following bounds

ϕt,z2Che(ϕ)2,ϕθ,z2Che(ϕ)2,ϕz,z2Ce(ϕ)2,for allϕV.\left\|\phi_{t,z}\right\|^{2}\leq\frac{C}{h}\|e(\bm{\phi})\|^{2},\qquad\left\|\phi_{\theta,z}\right\|^{2}\leq\frac{C}{\sqrt{h}}\|e(\bm{\phi})\|^{2},\qquad\left\|\phi_{z,z}\right\|^{2}\leq C\|e(\bm{\phi})\|^{2},\quad\text{for all}\quad\bm{\phi}\in V. (4.13)

First of all note that by density we can without loss of generality assume that the field ϕ\bm{\phi} is of class C2.C^{2}. For the zzzz component we have the obvious estimate

ϕz,ze(ϕ).\|\phi_{z,z}\|\leq\|e(\bm{\phi})\|. (4.14)

For the sake of simplicity, denote for any functions f,gH1(Ωh)f,g\in H^{1}(\Omega_{h}) the inner product (f,g)=Ωhfg.(f,g)=\int_{\Omega_{h}}fg. In order to estimate the second component of the third column, we can integrate by parts using the boundary conditions on the component ϕz\phi_{z} in z,z, and periodicity in θ:\theta:

ϕθ,z2\displaystyle\|\phi_{\theta,z}\|^{2} =(ϕθ,z,2𝑮θzsymϕz,θ)\displaystyle=(\phi_{\theta,z},2\bm{G}^{sym}_{\theta z}-\phi_{z,\theta})
=2(ϕθ,z,𝑮θzsym)(ϕθ,z,ϕz,θ)\displaystyle=2(\phi_{\theta,z},\bm{G}^{sym}_{\theta z})-(\phi_{\theta,z},\phi_{z,\theta})
=2(ϕθ,z,𝑮θzsym)(ϕθ,θ,ϕz,z)\displaystyle=2(\phi_{\theta,z},\bm{G}^{sym}_{\theta z})-(\phi_{\theta,\theta},\phi_{z,z})
=2(ϕθ,z,𝑮θzsym)(𝑮θθsymkθϕt,𝑮zzsym),\displaystyle=2(\phi_{\theta,z},\bm{G}^{sym}_{\theta z})-(\bm{G}^{sym}_{\theta\theta}-k_{\theta}\phi_{t},\bm{G}^{sym}_{zz}),

thus we obtain by the Cauchy-Schwartz inequality the estimate

ϕθ,z26𝑮sym(𝑮sym+kθϕt)C𝑮sym(𝑮sym+ϕt).\|\phi_{\theta,z}\|^{2}\leq 6\|\bm{G}^{sym}\|(\|\bm{G}^{sym}\|+\|k_{\theta}\phi_{t}\|)\leq C\|\bm{G}^{sym}\|(\|\bm{G}^{sym}\|+\|\phi_{t}\|). (4.15)

Next we recall the following Korn and Korn interpolation inequalities proven in [References, Thorems 3.1 and 3.2]:

ϕ2Ch3/2e(ϕ)2,ϕ2Ce(ϕ)(ϕth+e(ϕ)),for allϕVh.\|\nabla\bm{\phi}\|^{2}\leq\frac{C}{h^{3/2}}\|e(\bm{\phi})\|^{2},\quad\|\nabla\bm{\phi}\|^{2}\leq C\|e(\bm{\phi})\|\left(\frac{\left\|\phi_{t}\right\|}{h}+\|e(\bm{\phi})\|\right),\quad\text{for all}\quad\bm{\phi}\in V_{h}. (4.16)

Note that the inequalities in (4.16) were derived in [References] under the boundary conditions in Vh,V_{h}, thus we will need to prove similar to (4.16) estimates for the vector space VhθV_{h}^{\theta} too. To that end, we invoke the following universal Korn interpolation inequality proven in [References, Theorem 3.1]. The estimate holds for any shells Ωh\Omega_{h} with bounded principal curvatures and for any vector fields ϕH1(Ωh)\bm{\phi}\in H^{1}(\Omega_{h}) (even without any boundary conditions):

ϕ2C(1he(ϕ)ϕt+ϕ2+e(ϕ)2),for allϕH1(Ωh).\|\nabla\bm{\phi}\|^{2}\leq C\left(\frac{1}{h}\|e(\bm{\phi})\|\cdot\|\phi_{t}\|+\|\bm{\phi}\|^{2}+\|e(\bm{\phi})\|^{2}\right),\qquad\text{for all}\qquad\bm{\phi}\in H^{1}(\Omega_{h}). (4.17)

Observe that zz and θ\theta components of fields ϕVhθ,\bm{\phi}\in V_{h}^{\theta}, satisfy Poincaré inequality in the zz direction, thus keeping in mind (4.14) and (4.15), we obtain from (4.17) the simplified estimate

ϕ2C(1he(ϕ)ϕt+ϕt2+e(ϕ)2),for allϕV.\|\nabla\bm{\phi}\|^{2}\leq C\left(\frac{1}{h}\|e(\bm{\phi})\|\cdot\|\phi_{t}\|+\|\phi_{t}\|^{2}+\|e(\bm{\phi})\|^{2}\right),\qquad\text{for all}\qquad\bm{\phi}\in V. (4.18)

Note next that (4.18) combined with (4.12) and the Cauchy inequality implies the bound

e(ϕ)2𝑮sym+Ch1/2ϕt,\|e(\bm{\phi})\|\leq 2\|\bm{G}^{sym}\|+Ch^{1/2}\|\phi_{t}\|, (4.19)

thus we get from (4.18) an analogous estimate

ϕ2C(1h𝑮symϕt+ϕt2h1/2+𝑮sym2),for allϕV.\|\nabla\bm{\phi}\|^{2}\leq C\left(\frac{1}{h}\|\bm{G}^{sym}\|\cdot\|\phi_{t}\|+\frac{\|\phi_{t}\|^{2}}{h^{1/2}}+\|\bm{G}^{sym}\|^{2}\right),\qquad\text{for all}\quad\bm{\phi}\in V. (4.20)

In order now to get a sharp estimate on the zz derivative of ϕt,\phi_{t}, we extend the cylinder Ωh\Omega_{h} to the lower half-space by mirror reflection and accordingly the tt and θ\theta components of vector field ϕ\bm{\phi} as even functions, and the zz component as an odd function, which is possible due to the imposed zero boundary conditions. The extended version are denoted by Ω¯h\bar{\Omega}_{h} and ϕ¯,\bar{\bm{\phi}}, where we clearly have ϕ¯H1(Ω¯h).\bar{\bm{\phi}}\in H^{1}(\bar{\Omega}_{h}). The point is that this extensions simply double all the norms under consideration, thus we can prove all the inequalities under consideration for the extended fields. The functions ϕ¯t,\bar{\phi}_{t}, ϕ¯θ\bar{\phi}_{\theta} and ϕ¯z\bar{\phi}_{z} can then be written in Fourier space in the zz variable in H1:H^{1}:

{ϕ¯t=m=0ϕ¯tm(t,θ)cos(πmzL),ϕ¯θ=m=0ϕ¯θm(t,θ)cos(πmzL),ϕ¯z=m=0ϕ¯zm(t,θ)sin(πmzL).\begin{cases}\bar{\phi}_{t}=\sum_{m=0}^{\infty}\bar{\phi}_{t}^{m}(t,\theta)\cos(\frac{\pi mz}{L}),\\ \bar{\phi}_{\theta}=\sum_{m=0}^{\infty}\bar{\phi}_{\theta}^{m}(t,\theta)\cos(\frac{\pi mz}{L}),\\ \bar{\phi}_{z}=\sum_{m=0}^{\infty}\bar{\phi}_{z}^{m}(t,\theta)\sin(\frac{\pi mz}{L}).\end{cases} (4.21)

Observe that in each norm under consideration the Fourier modes separate, thus we can prove all the inequalities under consideration for a fixed Fourier mode with a wavenumber m0.m\geq 0. If m=0,m=0, then we have ϕt,z0=0\phi_{t,z}^{0}=0 and ϕθ,z0=0,\phi_{\theta,z}^{0}=0, thus (4.13) holds. Assuming m>0,m>0, we have ϕ¯θ,z=mϕ¯θ,\|\bar{\phi}_{\theta,z}\|=m\|\bar{\phi}_{\theta}\|, thus (4.15) implies the bound

m2ϕ¯θ2C𝑮¯sym(𝑮¯sym+ϕ¯t).m^{2}\|\bar{\phi}_{\theta}\|^{2}\leq C\|\bar{\bm{G}}^{sym}\|(\|\bar{\bm{G}}^{sym}\|+\|\bar{\phi}_{t}\|). (4.22)

Next we have e(ϕ¯)θθ=𝑮¯θθ=ϕ¯θ,θ+k(θ)ϕ¯te(\bar{\bm{\phi}})_{\theta\theta}=\bar{\bm{G}}_{\theta\theta}=\bar{\phi}_{\theta,\theta}+k(\theta)\bar{\phi}_{t}, thus we have integrating by parts in θ\theta and utilizing periodicity:

Ωhk(θ)ϕ¯t2\displaystyle\int_{\Omega_{h}}k(\theta)\bar{\phi}_{t}^{2} =Ωh𝑮¯θθsymϕ¯t+Ωh(ϕ¯θϕ¯t),θ+Ωhϕ¯θϕ¯t,θ\displaystyle=\int_{\Omega_{h}}\bar{\bm{G}}^{sym}_{\theta\theta}\bar{\phi}_{t}+\int_{\Omega_{h}}(\bar{\phi}_{\theta}\bar{\phi}_{t})_{,\theta}+\int_{\Omega_{h}}\bar{\phi}_{\theta}\bar{\phi}_{t,\theta}
=Ωh𝑮¯θθsymϕ¯t+Ωhϕ¯θϕ¯t,θ\displaystyle=\int_{\Omega_{h}}\bar{\bm{G}}^{sym}_{\theta\theta}\bar{\phi}_{t}+\int_{\Omega_{h}}\bar{\phi}_{\theta}\bar{\phi}_{t,\theta}

We can substitute ϕ¯t,θ=𝑮¯tθ+k(θ)ϕ¯θ\bar{\phi}_{t,\theta}=\bar{\bm{G}}_{t\theta}+k(\theta)\bar{\phi}_{\theta} to get

k(θ)ϕ¯t2CΩh|ϕ¯t𝑮¯θθsym|+|ϕ¯θ𝑮¯tθ|+|ϕ¯θ|2.\left\lVert\sqrt{k(\theta)}\bar{\phi}_{t}\right\rVert^{2}\leq C\int_{\Omega_{h}}|\bar{\phi}_{t}\bar{\bm{G}}^{sym}_{\theta\theta}|+|\bar{\phi}_{\theta}\bar{\bm{G}}_{t\theta}|+|\bar{\phi}_{\theta}|^{2}. (4.23)

Observe that (4.22), (4.23) and an application of the Cauchy inequality imply the bound

ϕ¯t4C(𝑮¯sym4+ϕ¯θ2𝑮¯2).\left\lVert\bar{\phi}_{t}\right\rVert^{4}\leq C\left(\left\lVert\bar{\bm{G}}^{sym}\right\rVert^{4}+\left\lVert\bar{\phi}_{\theta}\right\rVert^{2}\left\lVert\bar{\bm{G}}\right\rVert^{2}\right). (4.24)

Note that (4.12) implies that analogous to (4.20) inequalities hold for ϕ\nabla\bm{\phi} replaced by 𝑮\bm{G}, thus owing to (4.24), (4.22) and the new version off (4.20), we discover

ϕ¯t2C(𝑮¯sym2+1mhϕ¯t𝑮¯sym).\left\lVert\bar{\phi}_{t}\right\rVert^{2}\leq C\left(\left\lVert\bar{\bm{G}}^{sym}\right\rVert^{2}+\frac{1}{m\sqrt{h}}\left\lVert\bar{\phi}_{t}\right\rVert\left\lVert\bar{\bm{G}}^{sym}\right\rVert\right). (4.25)

It remains to note that, upon an application of the Cauchy inequality, (4.25) implies the desired bound

ϕ¯t,z2=m2ϕ¯t2𝑮¯sym2h.\|\bar{\phi}_{t,z}\|^{2}=m^{2}\left\lVert\bar{\phi}_{t}\right\rVert^{2}\leq\frac{\left\lVert\bar{\bm{G}}^{sym}\right\rVert^{2}}{h}. (4.26)

Observe that on one hand (4.26) implies that ϕ¯t𝑮¯symh\left\lVert\bar{\phi}_{t}\right\rVert\leq\frac{\left\lVert\bar{\bm{G}}^{sym}\right\rVert}{\sqrt{h}}, thus we get from (4.20) that

ϕ2Chh𝑮sym2for allϕV.\|\nabla\bm{\phi}\|^{2}\leq\frac{C}{h\sqrt{h}}\|\bm{G}^{sym}\|^{2}\qquad\text{for all}\quad\bm{\phi}\in V. (4.27)

Next, owing back to (4.12), we derive from (4.27) the analogous estimate

ϕ2Chhe(ϕ)2for allϕV.\|\nabla\bm{\phi}\|^{2}\leq\frac{C}{h\sqrt{h}}\|e(\bm{\phi})\|^{2}\qquad\text{for all}\quad\bm{\phi}\in V. (4.28)

On one hand, it remains to note that (4.28) combined with (4.12), (4.14), (4.15), and (4.26) imply the bound

col3(ϕ)2Che(ϕ)2,for allϕV,\|\mathrm{col}_{3}(\nabla\bm{\phi})\|^{2}\leq\frac{C}{h}\|e(\bm{\phi})\|^{2},\qquad\text{for all}\qquad\bm{\phi}\in V, (4.29)

which confirms one direction in (3.7). The proof of the other direction is by an Ansatz construction and is postponed until Section 4.4. We now turn to the second part, where the curvature k(θ)k(\theta) vanishes at finitely many points.
Part (ii). We will prove the Ansatz-free lower bounds

col3(ϕ)2Ch8/5e(ϕ)2,ϕ2Ch12/7e(ϕ)2,for allϕV.\|\mathrm{col}_{3}(\nabla\bm{\phi})\|^{2}\leq\frac{C}{h^{8/5}}\|e(\bm{\phi})\|^{2},\qquad\|\nabla\bm{\phi}\|^{2}\leq\frac{C}{h^{12/7}}\|e(\bm{\phi})\|^{2},\qquad\text{for all}\qquad\bm{\phi}\in V. (4.30)

In the sequel we will be working with the extended vector field ϕ¯,\bar{\bm{\phi}}, but will drop the ”bar” to keep the notation simpler. Assume first that the wavenumber mm is nonzero, i.e., m.m\in\mathbb{N}. Observe that if ϕte(ϕ)\left\lVert\phi_{t}\right\rVert\leq\left\lVert e(\bm{\phi})\right\rVert then (4.18) would imply (4.30). We assume in the sequel that

e(ϕ)ϕt.\left\lVert e(\bm{\phi})\right\rVert\leq\left\lVert\phi_{t}\right\rVert. (4.31)

We can assume without loss of generality that the domain of the θ\theta variable is [1,1].[-1,1]. Also, for the simplicity of the presentation, we will assume that there is only one point on the curve 𝜶\bm{\alpha} where the curvature vanishes, the general case being analogous. Consequently, assume n=1n=1 and θ1=0\theta_{1}=0 in (3.6). Let δ>0\delta>0 be a small parameter yet to be chosen and let I0=[δ,δ],I1=[1,1]I0I_{0}=[-\delta,\delta],I_{1}=[-1,1]-I_{0}, and ϕi=ϕχIi\bm{\phi}^{i}=\bm{\phi}\chi_{I_{i}}, i=0,1.i=0,1. Recall that (4.23) implies

k(θ)ϕt2C(ϕt𝑮sym+ϕθ2ϵ+ϵ𝑮2),\|\sqrt{k(\theta)}\phi_{t}\|^{2}\leq C\left(\left\lVert\phi_{t}\right\rVert\cdot\left\lVert\bm{G}^{sym}\right\rVert+\frac{\left\lVert\phi_{\theta}\right\rVert^{2}}{\epsilon}+\epsilon\left\lVert\bm{G}\right\rVert^{2}\right), (4.32)

for any ϵ(0,)\epsilon\in(0,\infty). From the obvious bound k(θ)Ck(θ)k(\theta)\leq C\sqrt{k(\theta)} and inequality (4.15) we have

ϕθ2Cm2(𝑮sym2+𝑮symk(θ)ϕt).\left\lVert\phi_{\theta}\right\rVert^{2}\leq\frac{C}{m^{2}}(\left\lVert\bm{G}^{sym}\right\rVert^{2}+\left\lVert\bm{G}^{sym}\right\rVert\cdot\|{\sqrt{k(\theta)}}\phi_{t}\|). (4.33)

Therefore combining (4.32) and (4.33) we arrive at

k(θ)ϕt2\displaystyle\|\sqrt{k(\theta)}\phi_{t}\|^{2} C(ϕt𝑮sym+𝑮sym2+𝑮symk(θ)ϕtm2ϵ+ϵ𝑮)2),\displaystyle\leq C\left(\left\lVert\phi_{t}\right\rVert\cdot\left\lVert\bm{G}^{sym}\right\rVert+\frac{\left\lVert\bm{G}^{sym}\right\rVert^{2}+\left\lVert\bm{G}^{sym}\right\rVert\cdot\|{\sqrt{k(\theta)}}\phi_{t}\|}{m^{2}\epsilon}+\epsilon\left\lVert\bm{G})\right\rVert^{2}\right),

and applying Young’s inequality again we arrive at the key estimate

k(θ)ϕt2C(ϕt𝑮sym+𝑮sym2m2ϵ+𝑮sym2m4ϵ2+ϵ𝑮2).\|\sqrt{k(\theta)}\phi_{t}\|^{2}\leq C\left(\left\lVert\phi_{t}\right\rVert\cdot\left\lVert\bm{G}^{sym}\right\rVert+\frac{\left\lVert\bm{G}^{sym}\right\rVert^{2}}{m^{2}\epsilon}+\frac{\left\lVert\bm{G}^{sym}\right\rVert^{2}}{m^{4}\epsilon^{2}}+\epsilon\|\bm{G}\|^{2}\right).\\ (4.34)

Next we utilize (4.34) to bound ϕt12.\left\lVert\phi_{t}^{1}\right\rVert^{2}. We have by (3.6) that minI1k(θ)=kδcδ2,\min_{I_{1}}k(\theta)=k_{\delta}\geq c\delta^{2}, thus substituting ϵ=δ2η2\epsilon=\delta^{2}\eta^{2} we get from (4.34) the bound

ϕt12\displaystyle\left\lVert\phi_{t}^{1}\right\rVert^{2} 1kδk(θ)ϕt2\displaystyle\leq\frac{1}{k_{\delta}}\|\sqrt{k(\theta)}\phi_{t}\|^{2} (4.35)
ϕt250+C((1δ4+1m2δ4η2+1m4δ6η4)𝑮sym2+η2𝑮2).\displaystyle\leq\frac{\|\phi_{t}\|^{2}}{50}+C\left(\left(\frac{1}{\delta^{4}}+\frac{1}{m^{2}\delta^{4}\eta^{2}}+\frac{1}{m^{4}\delta^{6}\eta^{4}}\right)\left\lVert\bm{G}^{sym}\right\rVert^{2}+\eta^{2}\|\bm{G}\|^{2}\right).

In order to bound ϕt02\left\lVert\phi_{t}^{0}\right\rVert^{2} we choose a smooth cut-off function φ:[1,1]\varphi\colon[-1,1]\to\mathbb{R} supported in [2δ,2δ][-2\delta,2\delta] such that

φ(θ)={1,θI0,0,θ[1,1]2I0,|φ(θ)|2δ,θ[1,1].\varphi(\theta)=\begin{cases}1,&\theta\in I_{0},\\ 0,&\theta\in[1,-1]-2I_{0},\\ |\varphi^{\prime}(\theta)|\leq\frac{2}{\delta},&\theta\in[1,-1].\end{cases}

We have by the Poincare inequality that

ϕt02\displaystyle\left\lVert\phi^{0}_{t}\right\rVert^{2} ϕtφ2\displaystyle\leq\left\lVert\phi_{t}\varphi\right\rVert^{2} (4.36)
δ2θ(ϕtφ)2\displaystyle\leq\delta^{2}\left\lVert\partial_{\theta}(\phi_{t}\varphi)\right\rVert^{2}
δ2(𝑮χ2I02+k(θ)ϕθχ2I02+4δ2ϕtχI12)\displaystyle\leq\delta^{2}\left(\left\lVert\bm{G}\chi_{2I_{0}}\right\rVert^{2}+\left\lVert k(\theta)\phi_{\theta}\chi_{2I_{0}}\right\rVert^{2}+\frac{4}{\delta^{2}}\left\lVert\phi_{t}\chi_{I_{1}}\right\rVert^{2}\right)
Cδ2(𝑮2+ϕθ2)+4ϕt12\displaystyle\leq C\delta^{2}\left(\left\lVert\bm{G}\right\rVert^{2}+\left\lVert\phi_{\theta}\right\rVert^{2}\right)+4\left\lVert\phi^{1}_{t}\right\rVert^{2}
Cδ2𝑮2+4ϕt12,\displaystyle\leq C\delta^{2}\left\lVert\bm{G}\right\rVert^{2}+4\left\lVert\phi^{1}_{t}\right\rVert^{2},

where we used the obvious bound ϕθC𝑮.\|\phi_{\theta}\|\leq C\|\bm{G}\|. Putting now (4.35) and (4.36) together we arrive at

ϕt2C(δ2+η2)𝑮2+C(1δ4+1m2δ4η2+1m4δ6η4)𝑮sym2\left\lVert\phi_{t}\right\rVert^{2}\leq C(\delta^{2}+\eta^{2})\left\lVert\bm{G}\right\rVert^{2}+C\left(\frac{1}{\delta^{4}}+\frac{1}{m^{2}\delta^{4}\eta^{2}}+\frac{1}{m^{4}\delta^{6}\eta^{4}}\right)\left\lVert\bm{G}^{sym}\right\rVert^{2} (4.37)

Observe that the universal interpolation inequality in (4.18) together with the bounds (4.14), (4.22), and (4.31) imply another key estimate:

𝑮2Ch𝑮symϕt.\|\bm{G}\|^{2}\leq\frac{C}{h}\|\bm{G}^{sym}\|\|\phi_{t}\|. (4.38)

In order to estimate 𝑮,\|\bm{G}\|, we first combine (4.37) and (4.38) to get by an application of the Cauchy inequality:

ϕt2C(δ4+η4h2+1δ4+1m2δ4η2+1m4δ6η4)𝑮sym2.\left\lVert\phi_{t}\right\rVert^{2}\leq C\left(\frac{\delta^{4}+\eta^{4}}{h^{2}}+\frac{1}{\delta^{4}}+\frac{1}{m^{2}\delta^{4}\eta^{2}}+\frac{1}{m^{4}\delta^{6}\eta^{4}}\right)\left\lVert\bm{G}^{sym}\right\rVert^{2}. (4.39)

Finally keeping in mind that m1,m\geq 1, we choose η=δ=h1/7\eta=\delta=h^{1/7} to optimize (4.39). This gives

𝑮2Ch12/7𝑮sym2,\left\lVert\bm{G}\right\rVert^{2}\leq\frac{C}{h^{12/7}}\left\lVert\bm{G}^{sym}\right\rVert^{2}, (4.40)

and consequently also the second inequality in (4.30) through (4.12). In order to prove the first inequality in (4.30), we note that (4.38) implies in particular the bound

ϕt,z2Ch𝑮symϕt,\|\phi_{t,z}\|^{2}\leq\frac{C}{h}\|\bm{G}^{sym}\|\|\phi_{t}\|,

which is equivalent to

m2ϕt2Cm2h2𝑮sym2.m^{2}\|\phi_{t}\|^{2}\leq\frac{C}{m^{2}h^{2}}\left\lVert\bm{G}^{sym}\right\rVert^{2}. (4.41)

Next we choose η=δ\eta=\delta in (4.39) to get the simplified variant

m2ϕt2C(m2δ4h2+m2δ4+1δ6+1m2δ10)𝑮sym2.m^{2}\left\lVert\phi_{t}\right\rVert^{2}\leq C\left(\frac{m^{2}\delta^{4}}{h^{2}}+\frac{m^{2}}{\delta^{4}}+\frac{1}{\delta^{6}}+\frac{1}{m^{2}\delta^{10}}\right)\left\lVert\bm{G}^{sym}\right\rVert^{2}. (4.42)

We need to obtain an optimal estimate for m2ϕt2m^{2}\left\lVert\phi_{t}\right\rVert^{2} from (4.41) and (4.42) regardless of the value of m,m, by choosing the parameter δ>0\delta>0 appropriately. We choose δ\delta so that the values of the first and last summands on the right-hand side of (4.39) coincide: m2δ4h2=1m2δ10\frac{m^{2}\delta^{4}}{h^{2}}=\frac{1}{m^{2}\delta^{10}}. This gives δ=h1/7m2/7\delta=\frac{h^{1/7}}{m^{2/7}} and (4.39) reduces to

m2ϕt2C(m6/7h10/7+m22/7h4/7+m12/7h6/7)𝑮sym2.m^{2}\left\lVert\phi_{t}\right\rVert^{2}\leq C\left(\frac{m^{6/7}}{h^{10/7}}+\frac{m^{22/7}}{h^{4/7}}+\frac{m^{12/7}}{h^{6/7}}\right)\left\lVert\bm{G}^{sym}\right\rVert^{2}. (4.43)

It remains to note that if m1h1/5,m\geq\frac{1}{h^{1/5}}, then (4.38) would give

m2ϕt2Ch8/5𝑮sym2.m^{2}\|\phi_{t}\|^{2}\leq\frac{C}{h^{8/5}}\left\lVert\bm{G}^{sym}\right\rVert^{2}. (4.44)

If otherwise m1h1/5,m\geq\frac{1}{h^{1/5}}, then we would get the same estimate (4.41) this time from (4.40) instead. Consequently (4.41) is fulfilled independently of m.m\in\mathbb{N}. Finally putting together (4.30) and (4.41) we arrive at the first estimate in (4.26) in the case m1.m\geq 1. In the case m=0m=0 there is no zz-variable dependence, thus obviously have col3(ϕ)=0\mathrm{col}_{3}(\nabla\bm{\phi})=0 and ϕθ=ϕz=0,\phi_{\theta}=\phi_{z}=0, hence both lower bounds in (3.8) become trivial. This completes the proof of the Ansatz-free lower bound parts of Theorem 3.3.

4.3 The Ansätze

Part (i). An Ansatz realizing the asymptotics in (3.7) can be constructed in numerous ways. For instance, one Kirchhoff-like Ansatz which can be as such was constructed in [References], see also [References]. We present it here for the convenience of the reader. One chooses

{ϕt=W(θh,zL/2h),ϕθ=thW,θ(θh,zL/2h),ϕz=thW,z(θh,zL/2h),\begin{cases}\phi_{t}=W(\frac{\theta}{\sqrt{h}},\frac{z-L/2}{\sqrt{h}}),\\[10.0pt] \phi_{\theta}=-\frac{t}{\sqrt{h}}\cdot W_{,\theta}(\frac{\theta}{\sqrt{h}},\frac{z-L/2}{\sqrt{h}}),\\[10.0pt] \phi_{z}=-\frac{t}{\sqrt{h}}\cdot W_{,z}(\frac{\theta}{\sqrt{h}},\frac{z-L/2}{\sqrt{h}}),\end{cases} (4.45)

where WW is a smooth function compactly supported in (0,p)×(0,L).(0,p)\times(0,L). Also, WW is chosen so that WW and all its first and second order derivatives be of order one. It is then easy to see that one gets for this choice e(ϕ)h\|e(\bm{\phi})\|\sim h and col3(ϕ)=h\|\mathrm{col}_{3}(\nabla\bm{\phi})\|=\sqrt{h} as h0.h\to 0. This gives the asymptotics

e(ϕ)2col3(ϕ)2h,\frac{\|e(\bm{\phi})\|^{2}}{\|\mathrm{col}_{3}(\nabla\bm{\phi})\|^{2}}\sim h,

as h0,h\to 0, i.e., (3.7).
Part (ii): Second estimate in (3.8). The idea and the novelty in this case is to localize the Ansätze at the zero curvature points, and make use of the fact that the curvature vanishes. Namely, assume again the domain of the variable θ\theta is [1,1][-1,1] and the point θ=0\theta=0 is a zero curvature point, i.e., θ(0)=0\theta(0)=0 and cθ2k(θ)1cθ2c\theta^{2}\leq k(\theta)\leq\frac{1}{c}\theta^{2} for all θ[1,1].\theta\in[-1,1]. Let δ=hα\delta=h^{\alpha} be a small parameter (α>0\alpha>0) yet to be chosen. We adjust (4.45) as

{ϕt=W(θδ,zL/2δ),ϕθ=tδW,θ(θδ,zL/2δ),ϕz=tδW,z(θδ,zL/2δ),\begin{cases}\phi_{t}=W(\frac{\theta}{\delta},\frac{z-L/2}{\delta}),\\[10.0pt] \phi_{\theta}=-\frac{t}{\delta}\cdot W_{,\theta}(\frac{\theta}{\delta},\frac{z-L/2}{\delta}),\\[10.0pt] \phi_{z}=-\frac{t}{\delta}\cdot W_{,z}(\frac{\theta}{\delta},\frac{z-L/2}{\delta}),\end{cases} (4.46)

where W(θ,z)W(\theta,z) is again a smooth function, compactly supported on D=(1,1)2D=(-1,1)^{2} such that WW and all its first and second order derivatives be of order one. Computing the simplified gradient 𝑮\bm{G} in (4.11) we get:

𝑮=[0W,θk(θ)tW,θδW,zδW,θδtW,θθδ2+k(θ)WtW,θzδ2W,zδtW,zθδ2tW,zzδ2]\displaystyle\bm{G}=\begin{bmatrix}0&\frac{W_{,\theta}-k(\theta)tW_{,\theta}}{\delta}&\frac{W_{,z}}{\delta}\\[10.0pt] -\frac{W_{,\theta}}{\delta}&\frac{tW_{,\theta\theta}}{\delta^{2}}+k(\theta)W&\frac{tW_{,\theta z}}{\delta^{2}}\\[10.0pt] -\frac{W_{,z}}{\delta}&\frac{tW_{,z\theta}}{\delta^{2}}&-\frac{tW_{,zz}}{\delta^{2}}\end{bmatrix}

and

𝑮sym=[0k(θ)tW,θ2δ0k(θ)tW,θ2δtW,θθδ2+k(θ)W2tW,θzδ202tW,θzδ2tW,zzδ2].\bm{G}^{sym}=\begin{bmatrix}0&-\frac{k(\theta)tW_{,\theta}}{2\delta}&0\\[10.0pt] -\frac{k(\theta)tW_{,\theta}}{2\delta}&\frac{tW_{,\theta\theta}}{\delta^{2}}+k(\theta)W&\frac{2tW_{,\theta z}}{\delta^{2}}\\[10.0pt] 0&\frac{2tW_{,\theta z}}{\delta^{2}}&-\frac{tW_{,zz}}{\delta^{2}}\end{bmatrix}.

Now choosing δ=h1/4\delta=h^{1/4} and recalling that k(θ)θ2,k(\theta)\sim\theta^{2}, it is easy to see that col3(ϕ)2=h,\|\mathrm{col}_{3}(\nabla\bm{\phi})\|^{2}=h, and e(ϕ)2h5/2,\|e(\bm{\phi})\|^{2}\sim h^{5/2}, as h0.h\to 0. This realizes the asymptotics

e(ϕ)2col3(ϕ)2h3/2\frac{\|e(\bm{\phi})\|^{2}}{\|\mathrm{col}_{3}(\nabla\bm{\phi})\|^{2}}\sim h^{3/2}

as h0,h\to 0, i.e., the right-hand side of the second inequality in (3.8).

Remark 4.2.

It is easy to see that if the curvature k(θ)k(\theta) has a zero of order β2,\beta\geq 2, then by choosing δ=h1β+2,\delta=h^{\frac{1}{\beta+2}}, the Ansatz in (4.43) would actually give us

e(ϕ)2col3(ϕ)2h2β+2β+2,\frac{\|e(\bm{\phi})\|^{2}}{\|\mathrm{col}_{3}(\nabla\bm{\phi})\|^{2}}\sim h^{\frac{2\beta+2}{\beta+2}},

as h0.h\to 0.

Part (iii): First estimate in (3.8). We construct the Ansatz utilizing the idea of linearization in tt suggested in [References]. Namely we seek the Ansatz in the following form

{ϕt=u,ϕθ=tv1+v2,ϕz=tw1+w2,\begin{cases}\phi_{t}=u,\\ \phi_{\theta}=tv_{1}+v_{2},\\ \phi_{z}=tw_{1}+w_{2},\end{cases} (4.47)

where the functions u,v1,v2,w1u,v_{1},v_{2},w_{1} and w2w_{2} depend only on θ\theta and z.z. The implified gradient will then be given by

𝑮=[0u,θk(θ)(tv1+v2)u,zv1tv1,θ+v2,θ+k(θ)utv1,z+v2,zw1tw1,θ+w2,θtw1,z+w2,z].\bm{G}=\begin{bmatrix}0&u_{,\theta}-k(\theta)\left(tv_{1}+v_{2}\right)&u_{,z}\\[10.0pt] v_{1}&tv_{1,\theta}+v_{2,\theta}+k(\theta)u&tv_{1,z}+v_{2,z}\\[10.0pt] w_{1}&tw_{1,\theta}+w_{2,\theta}&tw_{1,z}+w_{2,z}\end{bmatrix}.

In order to make the symmetric part of the gradient small, we choose the functions u,v1,v2,w1,u,v_{1},v_{2},w_{1}, and w2w_{2} to satisfy the relationships

v1=u,θ,w1=u,z,v2,θ=k(θ)u.v_{1}=-u_{,\theta},\qquad w_{1}=-u_{,z},\qquad v_{2,\theta}=-k(\theta)u. (4.48)

This will reduce the simplified gradient to

𝑮=[0u,θk(θ)(tv1+v2)u,zv1tv1,θtv1,z+v2,zw1tw1,θ+w2,θtw1,z+w2,z],\bm{G}=\begin{bmatrix}0&u_{,\theta}-k(\theta)\left(tv_{1}+v_{2}\right)&u_{,z}\\[10.0pt] v_{1}&tv_{1,\theta}&tv_{1,z}+v_{2,z}\\[10.0pt] w_{1}&tw_{1,\theta}+w_{2,\theta}&tw_{1,z}+w_{2,z}\end{bmatrix}, (4.49)

and the symmetric part will be

𝑮sym=[012k(θ)(tv1+v2)012k(θ)(tv1+v2)tv1,θ12(t(v1,z+w1,θ)+v2,z+w2,θ)012(t(v1,z+w1,θ)+v2,z+w2,θ)tw1,z+w2,z]\bm{G}^{sym}=\begin{bmatrix}0&-\frac{1}{2}k(\theta)\left(tv_{1}+v_{2}\right)&0\\[10.0pt] -\frac{1}{2}k(\theta)\left(tv_{1}+v_{2}\right)&tv_{1,\theta}&\frac{1}{2}(t(v_{1,z}+w_{1,\theta})+v_{2,z}+w_{2,\theta})\\[10.0pt] 0&\frac{1}{2}(t(v_{1,z}+w_{1,\theta})+v_{2,z}+w_{2,\theta})&tw_{1,z}+w_{2,z}\end{bmatrix} (4.50)

Thus to make the θz\theta z component small, we need a new relationship:

w2,θ=v2,z,w_{2,\theta}=-v_{2,z}, (4.51)

which simplifies 𝑮sym\bm{G}^{sym} further to

e(𝑮)=[012k(θ)(tv1+v2)012k(θ)(tv1+v2)tv1,θt2(v1,z+w1,θ)0t2(v1,z+w1,θ)tw1,z+w2,z].e(\bm{G})=\begin{bmatrix}0&-\frac{1}{2}k(\theta)\left(tv_{1}+v_{2}\right)&0\\[10.0pt] -\frac{1}{2}k(\theta)\left(tv_{1}+v_{2}\right)&tv_{1,\theta}&\frac{t}{2}(v_{1,z}+w_{1,\theta})\\[10.0pt] 0&\frac{t}{2}(v_{1,z}+w_{1,\theta})&tw_{1,z}+w_{2,z}\end{bmatrix}. (4.52)

Let now W(θ,z)W(\theta,z) be a smooth compactly supported function on D=(1,1)×(0,L)D=(-1,1)\times(0,L) such that WW and all its first, second, and third order derivatives be of order one. We choose

{w2=k2(θ)W,z(θδ,z)v2=2k(θ)k(θ)W(θδ,z)+k(θ)2δW,θ(θδ,z)u=2k2(θ)k(θ)W(θδ,z)2k′′(θ)W(θδ,z)4k(θ)δW,θ(θδ,z)k(θ)δ2W,θθ(θδ,z)v1=u,θw1=u,z,\begin{cases}w_{2}=-k^{2}(\theta)W_{,z}(\frac{\theta}{\delta},z)\\[5.0pt] v_{2}=2k(\theta)k^{\prime}(\theta)W(\frac{\theta}{\delta},z)+\frac{k(\theta)^{2}}{\delta}W_{,\theta}(\frac{\theta}{\delta},z)\\[5.0pt] u=-\frac{2k^{\prime 2}(\theta)}{k(\theta)}W(\frac{\theta}{\delta},z)-2k^{\prime\prime}(\theta)W(\frac{\theta}{\delta},z)-\frac{4k^{\prime}(\theta)}{\delta}W_{,\theta}(\frac{\theta}{\delta},z)-\frac{k(\theta)}{\delta^{2}}W_{,\theta\theta}(\frac{\theta}{\delta},z)\\[5.0pt] v_{1}=-u_{,\theta}\\[5.0pt] w_{1}=-u_{,z},\end{cases} (4.53)

where δ=hβ\delta=h^{\beta} (β>0\beta>0) is a small parameter yet to be chosen. From the fact that 𝜶\bm{\alpha} is of class C5C^{5} and k(0)=k(0)=0,k(0)=k^{\prime}(0)=0, k′′(0)>0,k^{\prime\prime}(0)>0, we have for small enough hh the obvious bounds |k′′′(θ)|+|k′′(θ)|C,|k^{\prime\prime\prime}(\theta)|+|k^{\prime\prime}(\theta)|\leq C, |k(θ)|C|θ|,|k^{\prime}(\theta)|\leq C|\theta|, cθ2|k(θ)|1cθ2,c\theta^{2}\leq|k(\theta)|\leq\frac{1}{c}\theta^{2}, for θ(δ,δ).\theta\in(-\delta,\delta). Consequently we can easily verify that

𝑮sym2𝑮2max{h6β(h22β+h6β),h22β,h2+h10β}\frac{\left\lVert\bm{G}^{sym}\right\rVert^{2}}{\left\lVert\bm{G}\right\rVert^{2}}\sim\max\{h^{6\beta}(h^{2-2\beta}+h^{6\beta}),h^{2-2\beta},h^{2}+h^{10\beta}\}

as h0.h\to 0. In order to minimize the left hand side we choose β=1/6\beta=1/6 that gives the desired result

e(ϕ)2ϕ2h53.\frac{\left\lVert e(\bm{\phi})\right\rVert^{2}}{\left\lVert\nabla\bm{\phi}\right\rVert^{2}}\sim h^{\frac{5}{3}}.

4.4 The buckling load

Proof of Theorem 3.1.

In this section we will study the stability of the homogeneous trivial branch given by (in (4.1))

yt=(1+a(λ))(t+𝜶(θ)𝒆t(θ)),yθ=(1+a(λ))𝜶(θ)𝒆θ(θ),yz=(1λ)z,y_{t}=(1+a(\lambda))(t+\bm{\alpha}(\theta)\cdot\bm{e}_{t}(\theta)),\qquad y_{\theta}=(1+a(\lambda))\bm{\alpha}(\theta)\cdot\bm{e}_{\theta}(\theta),\qquad y_{z}=(1-\lambda)z, (4.54)

within the theory of buckling of slender structures presented in Section 2. Let us start by verifying that the conditions for the applicability of the theory are fulfilled. These consist of the conditions in Definition 2.1 and Theorem 2.5. The conditions in Definition 2.1 have already been verified by Lemma 4.1, while the conditions in Theorem 2.5 immediately follow from Theorem 3.3 and Remark 3.4. This implies that in fact we can calculate the buckling load as in (2.11), which we have proven reduces to the minimization problem in (4.10), i.e.,

λ(h)=infϕVΩh𝑳0e(ϕ),e(ϕ)𝑑𝒙Ecol3(ϕ)2.\lambda(h)=\inf_{\bm{\phi}\in V}\frac{\int_{\Omega_{h}}\left\langle\bm{L}_{0}e(\bm{\phi}),e(\bm{\phi})\right\rangle d\bm{x}}{E\cdot\|\mathrm{col}_{3}(\nabla\bm{\phi})\|^{2}}. (4.55)

By properties (P4) and (P5) the tensor 𝑳0\bm{L}_{0} is positive definite on symmetric matrices, i.e., there exist a positive constant C>0C>0 such that

1Ce(ϕ)2Ωh𝑳0e(ϕ),e(ϕ)𝑑𝒙Ce(ϕ)2,\frac{1}{C}\|e(\bm{\phi})\|^{2}\leq\int_{\Omega_{h}}\left\langle\bm{L}_{0}e(\bm{\phi}),e(\bm{\phi})\right\rangle d\bm{x}\leq C\|e(\bm{\phi})\|^{2},

for all ϕH1(Ωh).\bm{\phi}\in H^{1}(\Omega_{h}). Hence (4.52) yields the asymptotics

λ(h)infϕVe(ϕ)2col3(ϕ)2,\lambda(h)\sim\inf_{\bm{\phi}\in V}\frac{\|e(\bm{\phi})\|^{2}}{\|\mathrm{col}_{3}(\nabla\bm{\phi})\|^{2}}, (4.56)

and Theorem 3.1 immediately follows from Theorem 3.3.

Acknowledgement

We would like to thank Yury Grabovsky for helpful comments. Anonymous referees are also thanked for useful suggestions on how to improve the presentation of the manuscript. This material is supported by the National Science Foundation under Grants No. DMS-1814361.

Appendix A Explicit trivial branch for Neo-Hookean materials

In this section we provide an explicit form for the function a(λ)a(\lambda) in (4.1) and (4.8) for the special case of Neo-Hookean solids. In that case the Piola-Kirchhoff stress tensor is given by

𝑷(𝑭)=μ~(𝑭𝑭T)+2λ~(J1)J𝑭T,J=det(𝑭),\bm{P}(\bm{F})=\tilde{\mu}(\bm{F}-\bm{F}^{-T})+2\tilde{\lambda}(J-1)J\bm{F}^{-T},\quad J=\det(\bm{F}),

where μ~\tilde{\mu} and λ~\tilde{\lambda} are the Lam’e parameters555Note that we are using the symbol ”tilde” over the letters to avoid confusion between the Lamé parameter λ~\tilde{\lambda} and the loading parameter λ.\lambda.. Letting b(λ)=(1+a(λ))2,b(\lambda)=(1+a(\lambda))^{2}, the system (4.3) will reduce to

b(λ)2(1λ)2b(λ)(1λμ~λ~)μ~λ~=0,b(\lambda)^{2}(1-\lambda)^{2}-b(\lambda)(1-\lambda-\frac{\tilde{\mu}}{\tilde{\lambda}})-\frac{\tilde{\mu}}{\tilde{\lambda}}=0,

which gives the solution

b(λ)=1λμ~λ~+(1λμ~λ~)2+4(1λ)2μ~λ~2(1λ)2,b(\lambda)=\frac{1-\lambda-\frac{\tilde{\mu}}{\tilde{\lambda}}+\sqrt{(1-\lambda-\frac{\tilde{\mu}}{\tilde{\lambda}})^{2}+4(1-\lambda)^{2}\frac{\tilde{\mu}}{\tilde{\lambda}}}}{2(1-\lambda)^{2}},

and for aa we get

a(λ)=1λμ~λ~+(1λμ~λ~)2+4(1λ)2μ~λ~2(1λ)1.a(\lambda)=\frac{\sqrt{1-\lambda-\frac{\tilde{\mu}}{\tilde{\lambda}}+\sqrt{(1-\lambda-\frac{\tilde{\mu}}{\tilde{\lambda}})^{2}+4(1-\lambda)^{2}\frac{\tilde{\mu}}{\tilde{\lambda}}}}}{\sqrt{2}(1-\lambda)}-1.

References

  • [1] B. O. Almroth. Postbuckling behaviour of axially compressed circular cylinders. AIAA J, 1:627–633, 1963.
  • [2] B. Budiansky and J. Hutchinson. A survey of some buckling problems. Technical Report CR-66071, NASA, February 1966.
  • [3] D. Bushnell. Buckling of shell-pitfall of designers. AIAA J, 19(9): 1183–1226, 1981.
  • [4] C. R. Calladine. A shell-buckling paradox resolved. In D. Durban, G. Givoli, and J. G. Simmonds, editors, Advances in the Mechanics of Plates and Shells, pages 119–134. Kluwer Academic Publishers, Dordrecht, 2000.
  • [5] R. Degenhardt, A. Kling, R. Zimmermann, F. Odermann, and F. de Araujo. Dealing with imperfection sensitivity of composite structures prone to buckling. In S. B. Coskun, editor, Advances in Computational Stability Analysis. InTech, 2012.
  • [6] G. Friesecke, R. D. James, M. G. Mora, and S. Müller. Derivation of nonlinear bending theory for shells from three-dimensional nonlinear elasticity by gamma-convergence. Comptes Rendus Mathematique, Volume 336, Issue 8, 15 April 2003, Pages 697–702.
  • [7] D. J. Gorman and R. M. Evan-Iwanowski. An analytical and experimental investigation of the effects of large prebuckling deformations on the buckling of clamped thin-walled circular cylindrical shells subjected to axial loading and internal pressure. Developments in Theoretical and Applied Mechanics , 4:415–426, 1970.
  • [8] Y. Grabovsky and D. Harutyunyan. Exact scaling exponents in korn and korn-type inequalities for cylindrical shells. SIAM Journal on Mathematical Analysis, 46(5), pp. 3277–3295, 2014.
  • [9] Y. Grabovsky and D. Harutyunyan. Scaling instability of the buckling load in axially compressed circular cylindrical shells. Journal of Nonlinear Science, 26, 83–119, (2016).
  • [10] Y. Grabovsky and D. Harutyunyan. Rigurous derivation of the formula for the buckling load in axially compressed circular cylindrical shells. Journal of Elasticity, 120, 249–276, (2015).
  • [11] Y. Grabovsky and D. Harutyunyan. Korn inequalities for shells with zero Gaussian curvature. Ann. d’Inst. Henri Poincare (C), Nonl. Anal.. 2018, Vol. 35, Iss. 1, pp. 267–282.
  • [12] Y. Grabovsky and L. Truskinovsky. The flip side of buckling. Cont. Mech. Thermodyn., 19(3-4):211–243, 2007.
  • [13] M. Gurtin, E. Fried, and L. Anand. The and Thermodynamics of Continua. Cambridge University Press, 2010.
  • [14] D. Harutyunyan. On the Korn interpolation and second inequalities in thin domains. SIAM J. Math. Anal., Vol. 50, Iss. 5, 2018, pp. 4964–4982.
  • [15] D. Harutyunyan. Gaussian curvature as an identifier of shell rigidity. Archive for Rational Mechanics and Analysis, Nov. 2017, Vol. 226, Iss. 2, pp. 743–766.
  • [16] D. Harutyunyan. A hint on the localization of the buckling deformation at vanishing curvature points on thin elliptic shells. Preprit, https://arxiv.org/abs/2104.11853
  • [17] J. Horák, G. J. Lord, and M. A. Peletier. Cylinder buckling: the mountain pass as an organizing center. SIAM J. Appl. Math., 66(5):1793–1824 (electronic), 2006.
  • [18] P. Hornung, M. Lewicka and R. Pakzad. Infinitesimal isometries on developable surfaces and asymptotic theories for thin developable shells. Journal of Elasticity, Volume 111, Number 1 (2013).
  • [19] G. Hunt, G. Lord, and M. Peletier. Cylindrical shell buckling: a characterization of localization and periodicity. Discrete and Continuous Dynamical Systems Series B, 3(4):505–518, 2003.
  • [20] G. Hunt and E. L. Neto. Localized buckling in long axially-loaded cylindrical shells. Journal of the Mechanics and Physics of Solids, 39(7):881–894, 1991.
  • [21] G. W. Hunt and E. L. Neto. Maxwell critical loads for axially loaded cylindrical shells. Trans. ASME, 60:702–706, 1993.
  • [22] W. T. Koiter. On the stability of elastic equilibrium. PhD thesis, Technische Hogeschool (Technological University of Delft), Delft, Holland, 1945.
  • [23] E. Lancaster, C. Calladine, and S. Palmer. Paradoxical buckling behaviour of a thin cylindrical shell under axial compression. International Journal of Mechanical Sciences, 42(5):843–865, 2000.
  • [24] M. Lewicka, M.G. Mora and M. Pakzad. The matching property of infinitesimal isometries on elliptic surfaces and elasticity of thin shells, Arch. Ration. Mech. Anal. (3) Vol. 200 (2011), 1023–1050.
  • [25] G. Lord, A. Champneys, and G. Hunt. Computation of localized post buckling in long axially compressed cylindrical shells. Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences, 355(1732):2137–2150, 1997.
  • [26] R. Lorenz. Die nicht achsensymmetrische knickung dünnwandiger hohlzylinder. Physikalische Zeitschrift, 12(7):241–260, 1911.
  • [27] R. C. Tennyson. An experimental investigation of the buckling of circular cylindrical shells in axial compression using the photoelastic technique. Tech. Report 102, University of Toronto, Toronto, ON, Canada, 1964.
  • [28] S. Timoshenko. Towards the question of deformation and stability of cylindrical shell. Vesti Obshestva Tekhnologii, 21:785–792, 1914.
  • [29] S. Timoshenko and S. Woinowsky-Krieger. Theory of plates and shells, volume 2. McGraw-hill New York, second edition, 1959.
  • [30] V. I. Weingarten, E. J. Morgan, and P. Seide. Elastic stability of thin-walled cylindrical and conical shells under axial compression. AIAA J, 3:500–505, 1965.
  • [31] F. Xu and M. Potier-Ferry. On axisymmetric/diamond-like mode transitions in axially compressed core-shell cylinders, Journal of the Mechanics and Physics of Solids, Volume 94, September 2016, Pages 68–87.
  • [32] N. Yamaki. Elastic Stability of Circular Cylindrical Shells, volume 27 of Appl. Math. Mech. North Holland, 1984.
  • [33] Y. Youshimura. On the mechanism of buckling of a circular shell under axial compression. Technical Report 1390, National advisory committee for aeronautics, Washington, DC, 1955.
  • [34] E. Zhu, P. Mandal, and C. Calladine. Buckling of thin cylindrical shells: an attempt to resolve a paradox. International Journal of Mechanical Sciences, 44(8):1583–1601, 2002.