
Low-rank approximation for multiscale PDEs

Ke Chen Ke Chen is a postdoctoral researcher in the Department of Mathematics at University of Texas at Austin. His email address is [email protected].    Shi Chen Shi Chen is a graduate student in the Department of Mathematics at University of Wisconsin, Madison. His email address is [email protected].    Qin Li Qin Li is an associate professor in the Department of Mathematics and the Wisconsin Institutes for Discovery at University of Wisconsin, Madison. Her email address is [email protected].    Jianfeng Lu Jianfeng Lu is a professor in the Department of Mathematics, the Department of Physics and the Department of Chemistry at Duke University. His email address is [email protected].    Stephen J. Wright Stephen J. Wright is a professor in the Department of Computer Sciences at University of Wisconsin, Madison. His email address is [email protected].

1 Introduction

Multiscale phenomena are ubiquitous, with applications in many physical sciences and engineering fields: aerospace, material sciences, geological structure analysis, and many others. The different scales often have different physics, which entangle to produce complicated nonlinearities. Partial differential equations (PDEs) are often used to model these problems, with different scales captured in the coefficients and functions that define the PDE. These PDE models are challenging to compute directly, so analysis and algorithms specifically targeted to multiscale problems have been developed and investigated. Following convention, we focus in this review on problems with two distinct scales, with a small positive parameter ϵ capturing the ratio between the small and large scale.

Though modern multiscale analysis dates back to asymptotic PDE analysis that was seen already in Hilbert and Poincaré expansions early last century (see review in [MR2382139]), the impetus for computations involving multiscale PDEs came largely from the US Department of Energy (DOE) National Labs within the ASCI (Advanced Strategic Computing Initiative) [Ho:2009multiscale]. Since that time, analysis and computation in multiscale PDEs have taken different paths. Analysis has tended to follow a single “universal” strategy, passed down from tradition. The equation is decomposed into several levels according to asymptotic expansions involving the scale parameters, with the subequation at each level representing physics at a single scale, and subequations at the finer level feeding information to those at the coarser level. This analytical machinery has been used to treat multiscale PDEs arising from such varied backgrounds as kinetic theory, semi-classical quantum systems, and homogenization of composite materials, among others [MR2382139, MR2830582].

On the computational side, strategies for handling multiscale PDEs are more varied. Problems are usually handled by specifically designed solvers. One class of solvers called asymptotic-preserving schemes [MR3645390] are designed to preserve asymptotic limits of kinetic equations. These schemes usually contain some component of macro-solvers and micro-solvers, integrated in a clever way to reveal different structures in different regimes. Another class of solvers called numerical homogenization methods [MR2830582, MR3971243, MR1979846] usually target elliptic and parabolic equations in which the coefficients that represent media have oscillatory elements. These methods usually consist of offline and online stages, with either the homogenized media or the representative basis functions being prepared in the offline stage.

Why are most numerical schemes for multiscale PDEs equation-specific despite the analytical tools being largely unified? This intriguing question has motivated our investigations into devising a universal numerical strategy for solving multiscale PDEs. While the approach is yet to be developed fully, we believe that our progress on this issue is of wide interest, and this article surveys our progress to date. Crucially, our approach exploits the low-rank structure present in discretizations of multiscale PDEs.

To demonstrate the fundamental idea, we consider the following problem:

\mathcal{L}^{\epsilon}u^{\epsilon}=f\,,\qquad(1)

where ℒ^ϵ is a linear partial differential operator that depends explicitly on the small parameter ϵ, while f represents the boundary condition or the source term, which is assumed to have no dependence on ϵ. Multiscale problems that can be formulated in this way include elliptic equations with highly oscillatory media and the neutron transport equation with small Knudsen number. Due to the ϵ-dependence of the operator, the solution u^ϵ inherits structures at both fine and coarse scales.

An asymptotic limit is revealed by multiscale analysis using asymptotic expansions as ϵ → 0. In this limit, the oscillation at the fine scale is fast and the detailed oscillation pattern no longer matters; only macroscopic quantities are relevant. Formally, writing the homogenization limit as

\mathcal{L}^{\ast}u^{\ast}=f\,,\qquad(2)

we have

\|u^{\epsilon}-u^{\ast}\|\to 0\quad\textrm{as}\quad\epsilon\to 0\,.\qquad(3)

The norm in which this approximation holds depends heavily on the particular equation at hand.

The numerical challenge in solving (1) is that many degrees of freedom may be needed. Naïve finite element or finite difference methods would require mesh size h ≪ ϵ to resolve the fine-scale structure of the solution at the ϵ level. For a problem on ℝ^d, the discretized system would therefore have O(ϵ^{-d}) degrees of freedom, leading to prohibitive computational and memory cost for small ϵ. From an application perspective, it often suffices to characterize the solutions on the macroscopic level, where oscillations at the ϵ scale are largely absent. This property raises the question of whether we can obtain an approximate solution of this type using only O(1) degrees of freedom. If we know how to derive (2), we can simply solve for u*, which has the required macroscopic properties and typically requires a discretization with only O(1) degrees of freedom. Often, though, the limiting equation (2) and its solution u* are difficult to find explicitly, even when it is possible to establish their existence. These difficulties have led researchers to propose problem-specific solutions.

We believe that a universal approach for finding the large-scale solution can be devised, and that exploitation of the low-rank structure of the solution space is the key to developing such an approach. As suggested above, the Green's matrix 𝖦^ϵ (the discretized Green's function on fine grids) for the multiscale system (1) requires dimension O(ϵ^{-d}) to represent the underlying Green's function accurately. However, if a limiting system such as (2) exists, this limiting system can be well represented numerically by 𝖦*, a Green's matrix with dimension only O(1). This phenomenon suggests the system can largely be "compressed" and hence is of low rank; see the illustration in Figure 1. In the language of numerical linear algebra, this transition amounts to performing a truncated singular value decomposition (SVD) of 𝖦^ϵ to obtain 𝖦*.

Figure 1: PDEs with small parameters have homogenized limits, meaning the solutions to the original PDEs can be well-approximated by the solutions to the limiting equations. While analytically the two solution spaces are “close”, the original equation requires many more degrees of freedom to solve numerically than its limiting counterpart. The numerical Green’s matrix is intrinsically low rank.

If we computed the truncated SVD of the matrix 𝖦^ϵ by starting with a full SVD, the resulting algorithms would be impractical because of the large dimension of the matrix and the expense of preparing and storing the full matrix 𝖦^ϵ and computing its SVD. Several new linear algebra solvers take a quite different approach. Instead of accessing the full matrix, these solvers merely require computation of matrix-vector products involving the target matrix and several randomly selected vectors (typically vectors with i.i.d. Gaussian entries). Translated to the PDE solver setting, these matrix-vector multiplications amount to computing numerical solutions to PDEs with random source terms, a task that is practical when the number of such operations required is modest. The randomized SVD (rSVD) approach is one method of this type. It is equipped with a thorough analysis and achieves optimality in terms of computational efficiency. We make use of this method in the techniques described in the remainder of this article.

The main theme of our article, then, is the use of randomized SVD solvers to exploit the low-rank features of multiscale PDEs. We describe two strategies, both of which are divided into "offline" and "online" stages. In the offline stage, either the solution space or the boundary-to-boundary map used in domain decomposition is prepared; in the online stage, the specific solution for the given source f is singled out. The two strategies are described in Sections 4.1 and 4.2, respectively. In Section 5, we present a nonlinear extension that uses manifold learning algorithms to reconstruct the low-rank features of the solution manifold. Prior to these discussions, we describe in Section 2 two algorithm classes, asymptotic preserving methods and numerical homogenization, for identifying the asymptotic limits of multiscale problems. As examples, we use the multiscale radiative transfer equation (RTE) and the elliptic equation with rough media. Section 3 explores the two main elements of our approach: the numerical low-rank feature of multiscale PDEs and the randomized SVD solver for efficient reconstruction of low-rank operators and spaces. We conclude with a discussion of future work in Section 6.

2 Examples

Kinetic equations and elliptic equations with oscillating media are two examples of multiscale PDEs, for which computational schemes were developed separately. The specific features of these problems were incorporated into the design of asymptotic preserving schemes and numerical homogenization methods, respectively. We review these techniques and highlight the shared low-rank property of these two problems.

2.1 Kinetic equations and asymptotic preserving methods

Kinetic equations, which originate from statistical mechanics, describe the evolution of probability density for identical particles in phase space. A model equation, the radiative transfer equation (RTE), characterizes the evolution of photon density. In the steady state, this equation is

-v\cdot\nabla_{x}u^{\epsilon}+\mathsf{S}^{\epsilon}[u^{\epsilon}]=f(x,v)\,,\quad(x,v)\in\mathcal{K}\times\mathbb{V}\,,\qquad(4)

where f(x,v) is the light source, and the linear collision operator 𝖲^ϵ describes the interaction of photons with the optical media. The small parameter ϵ is encoded in this operator.

The operator 𝖲^ϵ defines several distinct regimes. In the optically thick regime, it is defined by

\mathsf{S}^{\epsilon}u(x,v)=\frac{1}{\epsilon}\int_{\mathbb{V}}k(x,v,v^{\prime})u(x,v^{\prime})\,\mathrm{d}v^{\prime}-\frac{1}{\epsilon}\int_{\mathbb{V}}k(x,v^{\prime},v)u(x,v)\,\mathrm{d}v^{\prime}\,.\qquad(5)

In this case, k(x,v,v′) is the scattering coefficient, describing the probability of a photon located at x changing its velocity from v′ to v, and the parameter ϵ is called the Knudsen number: the ratio of the mean free path to the typical domain length. When the medium is optically thick, the mean free path is small, so that ϵ ≪ 1. This means the photon particles are scattered fairly often, and the system statistically achieves an equilibrium state, which can itself be characterized mathematically. One example is the observation of light in the atmosphere, where the average mean free path is about 10 m and the observation is conducted at the scale of 10 km, leading to ϵ ∼ 10⁻³. By performing an asymptotic expansion in ϵ, the inhomogeneity in the velocity domain vanishes, and one can show that u^ϵ(x,v) asymptotically approximates u*(x), a function without dependence on v that solves a diffusion equation. We have the following result from [MR2839402].

Theorem 1.

Suppose that u^ϵ solves (4) with the collision term 𝖲^ϵ being isotropic, that is, k(x,v,v′) = σ(x) for some function σ. Let 𝒦 ⊂ ℝ^d be bounded with smooth boundary, and let 𝕍 = 𝕊^{d-1}. Assume that the boundary condition is

u^{\epsilon}(x,v)=\phi(x,v)\quad\text{for}\quad x\in\partial\mathcal{K}\,,\ v\cdot n_{x}<0\,.\qquad(6)

Then

\|u^{\epsilon}-u^{\ast}\|_{L_{2}(\mathrm{d}x\,\mathrm{d}v)}\to 0\,,\qquad(7)

where u* = u*(x) solves

\nabla_{x}\cdot\left(\frac{1}{\sigma(x)}\nabla_{x}u^{\ast}(x)\right)=g(x)\,,\quad x\in\mathcal{K}\,,\qquad(8)

with the boundary condition

u^{\ast}(x)=\xi_{\phi}(x)\quad\text{on}\quad x\in\partial\mathcal{K}\,,

where ξ_ϕ(x) solves a proper boundary-layer equation and g can be obtained from ∫ f(x,v) dv.

This result indicates that the homogenized operator as ϵ → 0 is ℒ* ∝ ∇_x·((1/σ)∇_x). Similar results are available when k(x,v,v′) does not have the form σ(x), as in anisotropic optical media, but the explicit form of ℒ* is then no longer known.

A second regime of interest for 𝖲^ϵ is one in which the media is highly heterogeneous [MR1760042]:

\mathsf{S}^{\epsilon}u(x,v)=\int_{\mathbb{V}}k\left(\frac{x}{\epsilon},v,v^{\prime}\right)u(x,v^{\prime})\,\mathrm{d}v^{\prime}-\int_{\mathbb{V}}k\left(\frac{x}{\epsilon},v^{\prime},v\right)u(x,v)\,\mathrm{d}v^{\prime}\,.\qquad(9)

In this case, the photons pass through media that oscillate on a small scale: for example, sunlight passing through a heavy cloud with a large number of small droplets, or a laser beam passing through a crystal. The amplitude of k determines the photon scattering frequency. Since k oscillates rapidly, photons also change rapidly between the high- and low-scattering regimes. On a large scale, the photons can be viewed approximately as scattering with an averaged frequency. A mathematical result is as follows [MR1760042].

Theorem 2.

Let the conditions of Theorem 1 hold, and suppose that the collision term 𝖲^ϵ is defined in (9). Then

\|u^{\epsilon}-u^{\ast}\|_{L_{2}(\mathrm{d}x\,\mathrm{d}v)}\to 0\,,\qquad(10)

where u* = u*(x,v) solves

-v\cdot\nabla_{x}u^{\ast}+\mathsf{S}^{\ast}[u^{\ast}]=f(x,v)\,,\quad(x,v)\in\mathcal{K}\times\mathbb{V}\,,\qquad(11)

where 𝖲*u(x,v) = σ*(x) ∫_𝕍 (u(x,v′) − u(x,v)) dv′ for some σ*(x). Furthermore, if k(x,v,v′) = σ(x) is periodic in x with period [0,1]^d, then σ* = ∫_{[0,1]^d} σ(x) dx.

In special cases, such as under periodic or random ergodic conditions, the function σ* can be computed explicitly. (There are also works that investigate the asymptotic limit of the RTE when the system is both highly oscillatory and in the diffusion regime; see [MR1878799].)

In both limiting regimes, the limiting equations (8) and (11) can be solved much more efficiently than the original equation (4). The discretization of (4) is constrained strongly by ϵ, due either to stability (as in (5)) or accuracy (as in (9)). By contrast, the solution u* varies smoothly, containing no ϵ-scale effects, so it can be obtained accurately by applying a discretization with mesh width O(1) to the asymptotic limiting equation. When the latter equation is available, computing u* by this means is the recommended methodology.

Methods for kinetic equations are termed "asymptotic preserving" (AP) if they relax the requirement that the discretization width h satisfy h = o(ϵ) yet still capture the asymptotic limits. Many different AP approaches have been proposed. For linear equations, existing AP methods rely on even-odd or micro-macro decompositions. For nonlinear equations, knowledge of the specific forms of the limits is usually required, and this knowledge is built into the solvers [MR3645390]. As mentioned above, these specific forms are often not available, so many AP methods cannot be applied to a large set of multiscale kinetic equations. This observation raises the question: knowing the existence of the limit, but not its particular form, can we still devise efficient methods for solving kinetic equations?

2.2 Elliptic equations and numerical homogenization

Another class of multiscale equations that has been investigated deeply is elliptic equations with highly oscillatory coefficients. These problems have the form

-\nabla_{x}\cdot\left(a^{\epsilon}(x)\nabla_{x}u^{\epsilon}\right)=f\,,\qquad(12)

where ϵ ≪ 1 is the scale on which the media oscillates. (The source term f has no small-scale contribution.)

This equation is a model problem from petroleum engineering, where it is crucial to precompute the underground flow before expensive construction of infrastructure takes place [MR2801210]. The problem is typically solved on kilometer-scale domains, but the heterogeneities in the media can be at the scale of centimeters. Certain forms of this equation can be approximated effectively by an equation that can be solved efficiently. Suppose the media coefficient a^ϵ(x) has the form a(x, x/ϵ), that is, it varies on two scales (1 and ϵ), and moreover is periodic with respect to the fast variable (the second argument in a(x, x/ϵ)). Then in the limiting regime as ϵ → 0, the solution u^ϵ converges to that of a homogenized equation, with the media "smoothed out," as described in the following result [MR1185639].

Theorem 3.

Let u^ϵ solve (12) in the domain 𝒦 with zero boundary condition. Suppose a(x, x/ϵ) is periodic with respect to the second argument. Then

\|u^{\epsilon}-u^{\ast}\|_{L^{2}}\lesssim\epsilon\left\|u^{\ast}\right\|_{H^{2}}\,,\qquad(13)

where u* solves the following effective equation with zero boundary condition:

-\nabla_{x}\cdot(a^{\ast}(x)\nabla_{x}u^{\ast})=f\,,\quad x\in\mathcal{K}\,,\qquad(14)

where a*, the effective media, can be computed from a cell problem (see Definition 2.1 in [MR1185639]).

As in the previous section, when a limiting equation can be derived explicitly, the best course for obtaining a useful solution is to solve this equation directly, since the mesh width in the discretization scheme can be much larger than ϵ. See [MR2477579] for a discussion of a reduced number of basis functions and [MR2830582] for computation of the effective media.
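In one space dimension the cell problem collapses to a closed form: the effective coefficient is the harmonic mean of a over one period. The sketch below (our own illustrative choice a(x/ϵ) = 2 + sin(2πx/ϵ) with f ≡ 1, not a computation from the literature) solves a 1D version of (12) on a fine grid and compares it against the homogenized solution of (14); the arithmetic mean, by contrast, gives the wrong limit.

```python
import numpy as np
from scipy.sparse import diags
from scipy.sparse.linalg import spsolve

eps = 0.01                    # fast-scale parameter
n = 20000                     # fine grid with h << eps
h = 1.0 / n
x = np.linspace(0.0, 1.0, n + 1)
xm = (x[:-1] + x[1:]) / 2     # midpoints, where the coefficient is sampled

a = lambda y: 2.0 + np.sin(2.0 * np.pi * y)   # periodic in the fast variable

def solve_dirichlet(coef_mid):
    """Finite-difference solve of -(a u')' = 1 on (0,1), u(0) = u(1) = 0."""
    main = (coef_mid[:-1] + coef_mid[1:]) / h**2
    off = -coef_mid[1:-1] / h**2
    A = diags([off, main, off], offsets=[-1, 0, 1], format="csc")
    u = np.zeros(n + 1)
    u[1:-1] = spsolve(A, np.ones(n - 1))
    return u

u_eps = solve_dirichlet(a(xm / eps))          # oscillatory problem

a_star = np.sqrt(3.0)          # harmonic mean of 2 + sin(2*pi*y) over one period
u_star = x * (1.0 - x) / (2.0 * a_star)       # exact homogenized solution for f = 1
u_arith = x * (1.0 - x) / (2.0 * 2.0)         # wrong limit: arithmetic mean <a> = 2

err_star = np.abs(u_eps - u_star).max()
err_arith = np.abs(u_eps - u_arith).max()
print(err_star < err_arith)    # the harmonic mean matches; the arithmetic mean does not
```

Here the harmonic mean (∫₀¹ a(y)⁻¹ dy)⁻¹ = √3 can be computed by hand; in higher dimensions a* genuinely requires solving the cell problem of [MR1185639].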

However, the validity and the specific form of the effective limit are known only in special cases like the one described in Theorem 3. In other cases, we seek a solver that relies on as little analytical knowledge as possible. An approach known as numerical homogenization has been investigated extensively. This approach is founded on two principles: a discretization scheme independent of ϵ, and a numerical solution scheme that captures the true limiting behavior of the solution on the discrete level. Variants of numerical homogenization include ℋ-matrix techniques, a purely algebraic approach [MR3445676], and a Bayesian approach that views the source f, and hence the solution u^ϵ, as Gaussian fields [MR3369060], which further translates to game theory [MR3971243]. All these methods are successful, but they all implicitly rely on properties of the underlying elliptic equation. Can we devise an approach for general problems with oscillatory media that exploits the low-rank property of the solution space without using analytical structure explicitly?

3 A unified framework for multiscale PDEs based on random sampling

We have given several examples of multiscale models that arise in applications, and mentioned several algorithmic approaches that make use of the limiting equations, when available. We describe next the foundations of a unified scheme that captures asymptotic limiting behavior automatically, even when the asymptotic limits are unavailable. Our method exploits low-rank structure and uses random sampling to discover this structure. We describe the low-rank property in Section 3.1 and the randomized SVD method for revealing this structure in Section 3.2.

3.1 Numerical rank

We consider a bounded linear operator 𝒜 that maps f ∈ 𝒳 into a space 𝒴, that is,

\mathcal{A}:\ \mathcal{X}\rightarrow\mathcal{Y}\,,\qquad f\mapsto u\,.

In the PDE setting, 𝒜 is the solution operator that maps the boundary conditions and/or source term f to the solution u. The numerical rank of such an operator is defined as follows.

Definition 1 (Numerical rank).

The numerical τ-rank of 𝒜 is the rank of the lowest-rank operator within the τ-neighborhood of 𝒜, that is,

k_{\tau}(\mathcal{A}):=\min\{\dim\mathrm{ran}\,\tilde{\mathcal{A}}\,:\,\tilde{\mathcal{A}}\in\mathcal{L}(\mathcal{X},\mathcal{Y})\,,\ \|\tilde{\mathcal{A}}-\mathcal{A}\|\leq\tau\}\,.

In other words, k_τ(𝒜) is the smallest dimension of the range among all operators within distance τ of 𝒜.
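For matrices, this definition is directly computable: by the Eckart–Young theorem, the closest rank-k matrix in the spectral norm lies at distance σ_{k+1}, so the numerical τ-rank equals the number of singular values exceeding τ. A minimal sketch on a synthetic matrix with prescribed singular values (our own illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic operator with prescribed singular values: two large, three tiny.
sigma = np.array([1.0, 0.5, 0.01, 0.005, 0.001])
U, _ = np.linalg.qr(rng.standard_normal((60, 5)))
V, _ = np.linalg.qr(rng.standard_normal((40, 5)))
A = U @ np.diag(sigma) @ V.T

def numerical_rank(A, tau):
    """Numerical tau-rank in the spectral norm: by Eckart--Young, the
    closest rank-k matrix is at distance sigma_{k+1}, so the answer is
    the number of singular values strictly larger than tau."""
    s = np.linalg.svd(A, compute_uv=False)
    return int((s > tau).sum())

print(numerical_rank(A, 0.1))    # -> 2: a rank-2 matrix already lies within 0.1 of A
```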

When 𝒜 is the PDE solution map, a low-rank operator 𝒜̃ is a linear map with finite-dimensional range: it can be viewed as a discrete version (a matrix of dimension k_τ(𝒜)) that approximates 𝒜 to within accuracy τ. The definition suggests that if such an 𝒜̃ can be found, it is optimal in the sense of numerical efficiency. The concept is closely related to the Kolmogorov N-width, defined as follows.

Definition 2 (Kolmogorov NN-width).

Given a linear operator 𝒜 : 𝒳 → 𝒴, the Kolmogorov N-width d_N(𝒜) is the shortest distance from its range to any N-dimensional subspace, that is,

d_{N}(\mathcal{A}):=\min_{S:\dim S=N}d(\mathcal{A},S)=\min_{S:\dim S=N}\,\sup_{f}\,\min_{v\in S}\frac{\|\mathcal{A}f-v\|_{\mathcal{Y}}}{\|f\|_{\mathcal{X}}}\,.\qquad(15)

Indeed, the Kolmogorov N-width and the numerical rank are related by the following result [MR4155236].

Proposition 1.

For any linear operator 𝒜 : 𝒳 → 𝒴, we have the following.

(a) If the numerical τ-rank is N, then d_N(𝒜) ≤ τ.

(b) If d_N(𝒜) ≤ τ < d_{N−1}(𝒜), then the numerical τ-rank is N.

For the three examples presented in Section 2, the numerical ranks can be calculated from their limiting equations. For the one-dimensional RTE in the diffusion regime, denote by 𝒜^ϵ and 𝒜* the solution operators of (4) and (8), respectively. Noting that 𝒜* can be approximated using 1/√τ grid points to achieve τ accuracy, when ϵ < τ the numerical rank satisfies k_τ(𝒜^ϵ) ≲ 1/√(τ−ϵ). Without employing knowledge of the existence of the limit, however, a brute-force discretization requires O(1/(ϵτ^{α+1})) degrees of freedom: O(1/(ϵτ)) for the upwind discretization in x and O(1/τ^α) for the discretization in v, where α depends on the accuracy of the particular numerical integration. Translated into Green's-matrix language, this observation means that 𝖦^ϵ is represented by O(1/(ϵτ^{α+1})) degrees of freedom, but its range can be captured by a compressed Green's matrix 𝖦* with just O(1) column vectors.

The same argument applies to the elliptic equation on a two-dimensional domain with high oscillations. When second-order linear finite elements are used with no knowledge of the limiting system, O(1/(ϵ²τ)) degrees of freedom are required, dropping to O(1/τ) when the limiting system is known. In other words, the full Green's matrix 𝖦^ϵ, requiring O(1/ϵ²) degrees of freedom, can be well represented using just O(1) column vectors.

In all these cases, the degrees of freedom for a given numerical method are substantially larger than the numerical rank of the problem. Thus, much of the information in these full-blown representations is redundant and compressible: a far more economical low-rank representation exists.
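This compressibility can be observed directly. The following sketch (our own illustrative 1D setup, not one of the computations above) assembles the fine-grid Green's matrix of a 1D oscillatory elliptic problem and counts its numerical τ-rank: although the matrix has about a thousand columns, a few dozen suffice to represent it to a tolerance of 10⁻² relative to its largest singular value.

```python
import numpy as np
from scipy.sparse import diags
from scipy.sparse.linalg import splu

eps = 0.02
n = 1000                               # fine grid, h << eps
h = 1.0 / n
xm = (np.arange(n) + 0.5) * h          # midpoints
a = 2.0 + np.sin(2.0 * np.pi * xm / eps)   # oscillatory media

# Fine-grid matrix for -(a u')' with homogeneous Dirichlet conditions
main = (a[:-1] + a[1:]) / h**2
off = -a[1:-1] / h**2
L = diags([off, main, off], offsets=[-1, 0, 1], format="csc")

# Columns of the Green's matrix = responses to unit sources at each node
G = splu(L).solve(np.eye(n - 1))

s = np.linalg.svd(G, compute_uv=False)
tau = 1e-2 * s[0]
k_tau = int((s > tau).sum())
print(k_tau, n - 1)                    # numerical rank is far below the grid size
```

The count is set by the target accuracy τ rather than by the fine scale, and should change little if ϵ is halved; this is exactly the structure the randomized solver of Section 3.2 exploits.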

3.2 Random sampling in numerical linear algebra

Knowing the existence of a low-rank structure and finding such a structure are very different goals. The Kolmogorov N-width is a concept developed in approximation theory, but it has made little impact in numerical PDEs for a simple reason: traditional PDE solvers require a predetermined set of basis functions, while the Kolmogorov N-width looks for "optimal" basis functions. How can an optimal basis be found without first forming the full basis? Translated to linear algebra, this question is about finding the dominant singular vectors of a matrix without forming the whole matrix. Specifically, if 𝖠 ∈ ℝ^{m×n} is known to be approximately low rank, meaning that there exists an m × r matrix 𝖴_r with orthonormal columns, with r ≪ min(m,n), such that

\|\mathsf{A}-\mathsf{A}_{r}\|=\|\mathsf{A}-\mathsf{U}_{r}\mathsf{U}_{r}^{\top}\mathsf{A}\|\ll\|\mathsf{A}\|\,,

can we find 𝖴_r without forming the full matrix 𝖠?

In linear algebra, it is well known that the best such 𝖴_r is the collection of the first r left singular vectors of 𝖠. Writing

\mathsf{A}=\mathsf{U}\Sigma\mathsf{V}^{\top}=\sum_{i=1}^{n}\sigma_{i}u_{i}v^{\top}_{i}\,,\qquad(16)

where 𝖴 = [u₁, u₂, …, u_n] ∈ ℝ^{m×n} and 𝖵 = [v₁, v₂, …, v_n] ∈ ℝ^{n×n} contain the left and right singular vectors, respectively, and Σ = diag(σ₁, σ₂, …, σ_n) contains the singular values, 𝖴_r consists of the first r columns of 𝖴.

The standard method for computing the SVD requires 𝖠 to be stored and operated on in full. But the celebrated randomized SVD (rSVD) method [MR2806637] captures the range of a given matrix by random sampling of its column space, which requires only computation of matrix-vector products involving 𝖠 and random vectors, operations that can be performed without full storage or knowledge of 𝖠. Implementation of rSVD is easy and its performance is robust.

The idea behind the algorithm is simple. If the matrix 𝖠 ∈ ℝ^{m×n} has approximate low rank r ≪ min{m,n}, it maps an n-dimensional sphere to an m-dimensional ellipsoid that is "thin": r of its axes are significantly larger than the rest. With high probability, vectors that are randomly sampled on the n-dimensional sphere are mapped to vectors that lie mostly in an r-dimensional subspace of ℝ^m, the dominant range of 𝖠. An approximation to 𝖠_r can be obtained by projecting onto this subspace.

A precise statement of the performance of randomized SVD is as follows [MR2806637].

Theorem 4.

Let 𝖠 be an m × n matrix. Define

\mathsf{Y}=\mathsf{A}\Omega\,,\qquad(17)

where Ω = [ω₁, …, ω_{r+p}] is a matrix of size n × (r+p) with entries drawn i.i.d. from the normal distribution, and p is an oversampling parameter. If σ_{r+1} ≪ σ₁ = O(1), then the projection of 𝖠 onto the space spanned by the columns of 𝖸, defined by

\mathsf{P}_{\mathsf{Y}}(\mathsf{A})=\mathsf{Y}(\mathsf{Y}^{\top}\mathsf{Y})^{-1}\mathsf{Y}^{\top}\mathsf{A}\,,

satisfies ‖𝖠 − 𝖯_𝖸(𝖠)‖ ≪ σ₁ with high probability, and

\mathbb{E}\,\|\mathsf{A}-\mathsf{P}_{\mathsf{Y}}(\mathsf{A})\|\lesssim\frac{r}{p-1}\,\sigma_{r+1}\ll\sigma_{1}\,.

This result shows that rSVD reconstructs the range of 𝖠 in a nearly optimal way. It is optimal in efficiency because to capture a rank-r matrix, only r + p matrix-vector products involving 𝖠 are required for the calculation of 𝖸, and the oversampling parameter p is typically quite modest (p = 5 is a typical value). The result is nearly optimal in accuracy as well: the error bound depends only on σ_{r+1}, which is expected to be much smaller than σ₁, and the decay profile of the singular values does not affect the approximation accuracy.

If a low-rank approximation to the matrix 𝖠\mathsf{A} is required, and not just an approximation of its range, another step involving multiplications with its transpose is needed. The full method is shown in Algorithm 1.

Algorithm 1 Randomized SVD
1: Given an m × n matrix 𝖠, target rank r, and oversampling parameter p;
2: Set k = r + p;
3: Stage A:
4:     Generate an n × k Gaussian test matrix Ω;
5:     Form 𝖸 = 𝖠Ω;
6:     Perform the QR decomposition of 𝖸: 𝖸 = 𝖰𝖱;
7: Stage B:
8:     Form 𝖡 = 𝖠^⊤𝖰;
9:     Compute the SVD of the k × n matrix 𝖡^⊤ = Ũ Σ 𝖵^⊤;
10:     Set 𝖴 = 𝖰Ũ;
11: Return: 𝖴, Σ, 𝖵.
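A direct transcription of Algorithm 1 into NumPy (our own sketch): the matrix enters only through matrix-vector product callbacks, which in the PDE setting become fine-grid solves against random data.

```python
import numpy as np

def randomized_svd(matvec, rmatvec, n, r, p=5, seed=0):
    """Algorithm 1: matvec(X) = A @ X and rmatvec(Y) = A.T @ Y; the matrix A
    itself is never formed. Returns an approximate rank-(r+p) SVD of A."""
    k = r + p
    rng = np.random.default_rng(seed)
    Omega = rng.standard_normal((n, k))       # Gaussian test matrix
    Y = matvec(Omega)                         # Stage A: sample the range of A
    Q, _ = np.linalg.qr(Y)                    # orthonormal basis of the samples
    B = rmatvec(Q)                            # Stage B: B = A^T Q, size n x k
    U_tilde, S, Vt = np.linalg.svd(B.T, full_matrices=False)
    return Q @ U_tilde, S, Vt.T               # U = Q U_tilde, Sigma, V

# Check on a matrix of exact rank 10: r + p samples recover it to round-off.
rng = np.random.default_rng(1)
A = rng.standard_normal((300, 10)) @ rng.standard_normal((10, 200))
U, S, V = randomized_svd(lambda X: A @ X, lambda Y: A.T @ Y, n=200, r=10)
rel_err = np.linalg.norm(A - U @ np.diag(S) @ V.T) / np.linalg.norm(A)
print(rel_err)                                # near machine precision
```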

4 Random sampling for multiscale computation

Here we describe how rSVD can be incorporated into multiscale PDE solvers to exploit the low-rank structure of these equations. Our procedure is composed of offline and online stages. The low-rank structure is learned in the offline stage, while in the online stage the solution for the given source/boundary term f in (1) is extracted.

We consider in particular the following boundary value problem:

\begin{cases}(\mathcal{L}^{\epsilon}u^{\epsilon})(x)=0\,,&x\in\mathcal{K}\,,\\ (\mathcal{B}u^{\epsilon})(x)=\phi(x)\,,&x\in\partial\mathcal{K}\,,\end{cases}\qquad(18)

where ℬ is the boundary condition operator, ∂𝒦 is the boundary associated with the domain 𝒦, and we now denote the source term (the boundary data) by ϕ. Our fundamental goal is to construct a low-rank approximation to the Green's operator 𝖦^ϵ for (18). With this operator in hand, the solution can be computed for any boundary condition ϕ at a relatively small incremental cost.

If we apply rSVD to approximate 𝖦ϵ\mathsf{G}^{\epsilon} directly, we need to compute products of this operator with random vectors. For the problem (18), each such product corresponds to solving the problem with ϕ\phi replaced by random boundary conditions. Even one such solve is computationally challenging when ϵ\epsilon is small, so we work within the domain decomposition framework.

We start by partitioning the domain 𝒦\mathcal{K} into subdomains as follows:

𝒦=m=1M𝒦m,\mathcal{K}=\bigcup_{m=1}^{M}\mathcal{K}_{m}\,, (19)

where the patches 𝒦m\mathcal{K}_{m} overlap, in general. We denote by 𝒦m\partial\mathcal{K}_{m} the boundary associated with 𝒦m\mathcal{K}_{m}. Furthermore, we identify the subregions that intersect with 𝒦m\mathcal{K}_{m} as follows:

m={n:1nM,nm,𝒦m𝒦n},\mathcal{I}_{m}=\{n\in\mathbb{N}:1\leq n\leq M\,,n\neq m\,,\mathcal{K}_{m}\cap\mathcal{K}_{n}\neq\varnothing\}\,,

and define the interior of the patch to be

𝒦~m=𝒦m\(nm𝒦n).\widetilde{\mathcal{K}}_{m}=\mathcal{K}_{m}\backslash\left(\bigcup_{n\in\mathcal{I}_{m}}\mathcal{K}_{n}\right)\,.

For this particular partition of the domain, we define the partition-of-unity functions χm\chi_{m}, m=1,2,,Mm=1,2,\dotsc,M to have the following properties:

m=1Mχm(x)=1,x𝒦,\displaystyle\sum_{m=1}^{M}\chi_{m}(x)=1\,,\quad\forall x\in\mathcal{K}\,, (20)
with{0χm(x)1,x𝒦mχm(x)=0,x𝒦\𝒦m.\displaystyle\text{with}\quad\begin{cases}0\leq\chi_{m}(x)\leq 1\,,&x\in\mathcal{K}_{m}\\ \chi_{m}(x)=0\,,&x\in\mathcal{K}\backslash\mathcal{K}_{m}.\end{cases}

We choose a discretization that resolves the small scales in the solution, defining a mesh width hϵh\ll\epsilon. (The number of subdomains MM is independent of ϵ\epsilon.) A typical decomposition is illustrated in Figure 2.

Refer to caption
Figure 2: Illustration of domain decomposition of a rectangular domain 𝒦\mathcal{K} in 2D with overlap.
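As a small illustration of the requirements (20) — our own 1D construction, not taken from the paper — a partition of unity subordinate to overlapping patches can be obtained by normalizing tent-shaped weights:

```python
import numpy as np

# Three overlapping 1D patches covering [0, 1]; the end patches extend
# slightly past the domain so that every point of [0, 1] is covered.
patches = [(-0.1, 0.45), (0.15, 0.85), (0.55, 1.1)]
x = np.linspace(0.0, 1.0, 201)

# Tent weights: distance to the patch boundary, clipped to zero outside.
w = np.array([np.clip(np.minimum(x - a, b - x), 0.0, None) for a, b in patches])

# Normalize so the functions sum to one, giving the properties in (20).
chi = w / w.sum(axis=0)
```

Each χm\chi_{m} lies in [0,1][0,1], vanishes outside its patch, and the family sums to one on the whole domain.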

How do we design the offline stage to “learn” the low-rank approximation? We propose two approaches that lead to two different kinds of algorithms. The first approach learns the optimal basis functions within each subdomain, while the second employs Schwarz iteration, preparing the boundary-to-boundary map in the offline stage. Other PDE solvers that utilize randomness can be found in [MR3477310, MR3824169, MR4050504]. In particular, the authors of [MR3477310] studied, specifically for elliptic-type equations, the generalized eigenvalue problem of the stiffness and mass matrices, and gave an error bound in terms of the largest eigenvalue obtained offline.

4.1 Learning basis functions

In standard domain decomposition, the local discretized Green’s matrix 𝖦m\mathsf{G}_{m} is assembled from a “full” collection of basis functions in the patch 𝒦m\mathcal{K}_{m}. The global solution to (18), confined to each 𝒦m\mathcal{K}_{m}, is a linear combination of the columns of 𝖦m\mathsf{G}_{m}. The coefficients of these combinations are chosen so that the continuity conditions across patches and the exterior boundary condition are all satisfied. The complete process can be outlined as follows.

  1. (1)

    Offline stage: For m=1,2,,Mm=1,2,\dots,M, find

    𝖦m=[bm,1,bm,2],\mathsf{G}_{m}=\left[b_{m,1}\,,b_{m,2}\dots\right]\,,

    where each local function bm,nb_{m,n} is a solution to (18) restricted to the subdomain 𝒦m\mathcal{K}_{m}, with fine grid hϵh\ll\epsilon and delta-function boundary conditions. That is,

    {ϵbm,n=0,x𝒦mbm,n=δm,n,x𝒦m,\begin{cases}\mathcal{L}^{\epsilon}b_{m,n}=0\,,\quad&x\in\mathcal{K}_{m}\\ b_{m,n}=\delta_{m,n}\,,\quad&x\in\partial\mathcal{K}_{m}\,,\end{cases} (21)

    where δm,n\delta_{m,n} is the Kronecker delta that singles out the nn-th grid point on the boundary 𝒦m\partial\mathcal{K}_{m}.

  2. (2)

    Online stage: The global solution is

    u=m=1Mumχm=m=1Mχm𝖦mcm,u=\sum_{m=1}^{M}u_{m}\chi_{m}=\sum_{m=1}^{M}\chi_{m}\mathsf{G}_{m}{c}_{m},

    with the support of each um=𝖦mcmu_{m}=\mathsf{G}_{m}{c}_{m} confined to 𝒦m\mathcal{K}_{m}, where cmc_{m} is a vector of coefficients determined by the boundary conditions ϕ\phi and continuity conditions across the patches.

The complete basis represented by 𝖦m\mathsf{G}_{m} has a low-rank structure that can be revealed using randomized SVD. Instead of using delta functions as the boundary conditions, we propose to obtain basis functions by setting random values on 𝒦m\partial\mathcal{K}_{m}, as follows:

{ϵrm,n=0,x𝒦m,rm,n=ωm,n,x𝒦m,\begin{cases}\mathcal{L}^{\epsilon}r_{m,n}=0\,,\quad&x\in\mathcal{K}_{m},\\ r_{m,n}=\omega_{m,n}\,,\quad&x\in\partial\mathcal{K}_{m},\end{cases} (22)

where ωm,n\omega_{m,n} is defined to have a random value drawn i.i.d. from a normal distribution at each grid point in 𝒦m\partial\mathcal{K}_{m}. Denoting 𝖦mr=[rm,1,rm,2,]\mathsf{G}_{m}^{\mathrm{r}}=\left[r_{m,1}\,,r_{m,2}\,,\cdots\right], we have from linearity of the equation that

𝖦mr=𝖦mΩ,\mathsf{G}_{m}^{\mathrm{r}}=\mathsf{G}_{m}\Omega\,,

where Ω\Omega is the Gaussian matrix whose nn-th column is ωm,n\omega_{m,n}. This 𝖦mr\mathsf{G}_{m}^{\mathrm{r}} is used in the online stage as an accurate surrogate for 𝖦m\mathsf{G}_{m}; see Algorithm 2.
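The identity 𝖦mr=𝖦mΩ\mathsf{G}_{m}^{\mathrm{r}}=\mathsf{G}_{m}\Omega is nothing more than linearity of the local solve. The following toy check, with a discrete 1D Laplace problem standing in for a patch solve (our own construction), verifies it numerically:

```python
import numpy as np

# Tiny 1D "patch": -u'' = 0 discretized on n interior points, with boundary
# values entering through the right-hand side.
n = 5
L = 2 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)   # discrete -d^2/dx^2
rhs = np.zeros((n, 2))
rhs[0, 0] = 1.0    # unit (delta) value at the left boundary point, as in (21)
rhs[-1, 1] = 1.0   # unit value at the right boundary point
G = np.linalg.solve(L, rhs)                 # columns = delta-boundary basis (21)

rng = np.random.default_rng(0)
Omega = rng.standard_normal((2, 3))         # 3 random boundary samples
G_r = np.linalg.solve(L, rhs @ Omega)       # solves with random data, as in (22)
assert np.allclose(G_r, G @ Omega)          # linearity: G^r = G Omega
```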

Algorithm 2 A general framework for multiscale PDE ϵuϵ=0\mathcal{L}^{\epsilon}u^{\epsilon}=0 over 𝒦\mathcal{K} with uϵ=f\mathcal{B}u^{\epsilon}=f on 𝒦\partial\mathcal{K}
1:Domain Decomposition
2:     Partition domain according to (19).
3:Offline Stage:
4:     Prepare i.i.d. Gaussian vectors ωm,i,i=1,,km\omega_{m,i},i=1,\ldots,k_{m} on each 𝒦m\partial\mathcal{K}_{m}.
5:     Solve for the basis functions rm,ir_{m,i} in (22) on each 𝒦m\mathcal{K}_{m}, and collect the local basis in 𝖦mr\mathsf{G}_{m}^{\mathrm{r}}.
6:Online Stage:
7:     Use continuity conditions and global boundary data ϕ\phi to determine coefficient vectors c1,c2,,cMc_{1},c_{2},\dotsc,c_{M}, and set
u=m=1Mχm𝖦mrcmu=\sum_{m=1}^{M}\chi_{m}\mathsf{G}_{m}^{\mathrm{r}}c_{m} (23)
8:Return: approximate global solution uu.

Although we do not apply full-blown rSVD here, the homogenizability and low-rank property of the local solution space imply that 𝖦mr\mathsf{G}_{m}^{\mathrm{r}} and 𝖦m\mathsf{G}_{m} have similar ranges, with the number kmk_{m} of basis functions in 𝖦mr\mathsf{G}_{m}^{\mathrm{r}} much smaller than nmn_{m}, the number of grid points on 𝒦m\partial\mathcal{K}_{m}, and independent of ϵ\epsilon. In Figure 3 we plot the angles between 𝖦mr\mathsf{G}_{m}^{\mathrm{r}} and 𝖦m\mathsf{G}_{m} for two of the equations discussed in Section 2. In both cases, and for small ϵ\epsilon, the approximate Green’s matrix quickly recovers the true Green’s matrix as the number of samples kmk_{m} increases, and thus captures the local solution space. We should note that if 𝖦m\mathsf{G}_{m} does not have low-rank structure, in the sense that kmnmk_{m}\sim n_{m}, then solving (22) is as expensive as solving (21), so the random sampling technique gains no computational efficiency when the system is not homogenizable.
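The angle plotted in Figure 3 can be computed as the largest principal angle between the dominant subspace and the span of the random samples. The sketch below (our own construction, with a synthetic matrix playing the role of the Green's matrix) reproduces the qualitative behavior: fewer samples than the dimension of the dominant subspace cannot capture it (the angle stays close to π/2), while slightly more than its dimension suffice.

```python
import numpy as np

def largest_angle(target, approx):
    """Largest principal angle (radians) between span(target) and its best
    approximation inside span(approx): arcsin of ||(I - P_approx) Q_target||_2."""
    Qt, _ = np.linalg.qr(target)
    Qa, _ = np.linalg.qr(approx)
    s = np.linalg.norm(Qt - Qa @ (Qa.T @ Qt), 2)
    return np.arcsin(min(1.0, s))

# Synthetic "Green's matrix" with 5 significant singular values.
rng = np.random.default_rng(0)
m, n = 200, 80
sigma = np.array([1.0, 0.5, 0.3, 0.2, 0.1] + [1e-9] * (n - 5))
U0, _ = np.linalg.qr(rng.standard_normal((m, n)))
V0, _ = np.linalg.qr(rng.standard_normal((n, n)))
G = U0 @ np.diag(sigma) @ V0.T
target = U0[:, :5]                       # dominant left singular subspace

# Random sampling as in (22): more samples -> smaller angle.
angle3 = largest_angle(target, G @ rng.standard_normal((n, 3)))
angle10 = largest_angle(target, G @ rng.standard_normal((n, 10)))
```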

In Figure 4 we showcase the basis functions on a patch for the elliptic equation with media coefficient a(x1,x2)=1+1000 1S(x1,x2)a(x_{1},x_{2})=1+1000\,\mathbf{1}_{S}(x_{1},x_{2}), with S={(x1,x2)[0,1]2:(x1cos(100(x10.5)2+(x20.5)2))x20.5}S=\{(x_{1},x_{2})\in[0,1]^{2}:(x_{1}\cos(100\sqrt{(x_{1}-0.5)^{2}+(x_{2}-0.5)^{2}}))\leq x_{2}-0.5\}. For this non-conventional medium, which has no periodic structure, traditional multiscale methods are no longer valid, but our method still quickly captures the optimal basis.

Refer to caption
Refer to caption
Figure 3: Angle between the true Green’s matrix 𝖦m\mathsf{G}_{m} and the approximate version 𝖦mr\mathsf{G}_{m}^{\mathrm{r}}, confined on the interior of a patch 𝒦~m\widetilde{\mathcal{K}}_{m} for some mm, as the number of random samples increases. Left plot: Angle for 1D RTE (4) with diffusive kernel (5) and various values of ϵ\epsilon. Right plot: Angle for elliptic equation (12) on a rectangular local patch with ϵ=24\epsilon=2^{-4}.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 4: Optimal basis functions and their projections onto the approximate spaces. First row: First two singular vectors of the Green’s matrix on a local patch. Second row: Projections onto the space spanned by 6 random sampled basis functions. Note that the small random sample captures well the leading eigenvectors of the true Green’s operator.

For particular boundary conditions ϕ\phi, the global solution is assembled from the local basis functions in the online stage. Two numerical examples are shown in Figure 5. In both examples, there is little visible difference between the reference solution and the approximated one computed from the reduced random basis. Only 8.3% and 62.5%, respectively, of the degrees of freedom required by the full basis are needed to represent these solutions using a random basis; see details in [MR4155236].

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 5: First row: Solutions for 1D RTE (4) with Henyey-Greenstein kernel in (5) with ϵ=26\epsilon=2^{-6}. Second row: Solutions for the elliptic equation (12) with Dirichlet boundary condition and highly oscillatory medium a(x,x/ϵ)=2+sin(2πx1)cos(2πx2)+2+1.8sin(2πx1/ϵ)2+1.8cos(2πx2/ϵ)+2+sin(2πx2/ϵ)2+1.8cos(2πx1/ϵ)a(x,x/\epsilon)=2+\sin(2\pi x_{1})\cos(2\pi x_{2})+\frac{2+1.8\sin(2\pi x_{1}/\epsilon)}{2+1.8\cos(2\pi x_{2}/\epsilon)}+\frac{2+\sin(2\pi x_{2}/\epsilon)}{2+1.8\cos(2\pi x_{1}/\epsilon)} with ϵ=24\epsilon=2^{-4}. The left column shows reference solutions while the right column is obtained from randomized reduced bases.

Since we do not have access to the full set of basis functions, the condition that uu defined in (23) is continuous across subdomains can be satisfied only in a least-squares sense; see [MR4155236] for details.

4.2 A low-rank Schwarz method

Our second approach for exploiting the low-rank property in multiscale computations is based on Schwarz iteration. The Schwarz method is a standard iteration algorithm within the domain decomposition framework, in which boundary-value problems are solved on the patches, with neighboring patches subsequently exchanging information and re-solving until consistency is attained. The exchange of boundary information between neighboring patches is known as the boundary-to-boundary (BtB) map. The map has an exploitable low-rank property.

To develop the approach, we write the solution of (18) as

uϵ(x)=m=1Mχm(x)umϵ(x),u^{\epsilon}(x)=\sum_{m=1}^{M}\chi_{m}(x)u^{\epsilon}_{m}(x)\,, (24)

where the partition-of-unity functions χm\chi_{m} are defined in (20). The solution umϵu^{\epsilon}_{m} on patch mm is uniquely determined by fmf_{m}, its local boundary condition, according to the equation

{ϵumϵ=0,x𝒦mumϵ(x)=fm(x),x𝒦m.\begin{cases}\mathcal{L}^{\epsilon}u^{\epsilon}_{m}=0\,,&\quad x\in\mathcal{K}_{m}\\ u_{m}^{\epsilon}(x)=f_{m}(x)\,,&\quad x\in\partial\mathcal{K}_{m}\,.\end{cases} (25)

The Schwarz method starts with initial guesses fm=fm0f_{m}=f_{m}^{0} for the local boundary conditions; on iteration tt, it solves the subproblems (25) with fm=fmtf_{m}=f_{m}^{t} to obtain all local solutions umϵu_{m}^{\epsilon}. By restricting umϵu_{m}^{\epsilon} to the boundaries of adjacent patches, one updates the boundary conditions for the surrounding patches:

fmt(x)𝒮mumϵ(x)|𝒦~m𝒫mfnt+1(x),nm.f^{t}_{m}(x)\xrightarrow{\mathcal{S}_{m}}u_{m}^{\epsilon}(x)|_{\widetilde{\mathcal{K}}_{m}}\xrightarrow{\mathcal{P}_{m}}f^{t+1}_{n}(x)\,,\forall n\in\mathcal{I}_{m}\,. (26)

Here 𝒮m\mathcal{S}_{m} denotes the solution operator of (25) restricted to the interior of 𝒦m\mathcal{K}_{m}, and 𝒫m\mathcal{P}_{m} takes the trace of the solution on the neighboring boundaries 𝒦m𝒦n\mathcal{K}_{m}\cap\partial\mathcal{K}_{n} for nmn\in\mathcal{I}_{m}, yielding the updated boundary conditions.

Define the BtB map by 𝒜m:=𝒫m𝒮m\mathcal{A}_{m}:=\mathcal{P}_{m}\circ\mathcal{S}_{m}, and define 𝒜\mathcal{A} and ftf^{t} to be the aggregations of 𝒜m\mathcal{A}_{m} and fmtf_{m}^{t}, respectively, over m=1,2,,Mm=1,2,\dotsc,M. We can then write the updating procedure as

ft+1=𝒜ft.f^{t+1}=\mathcal{A}f^{t}\,.

The overall method is summarized in Algorithm 3.

Algorithm 3 Schwarz method for multiscale PDE ϵuϵ=0\mathcal{L}^{\epsilon}u^{\epsilon}=0 over 𝒦\mathcal{K} with uϵ=f\mathcal{B}u^{\epsilon}=f on 𝒦\partial\mathcal{K}
1:Given tolerance TOL;
2:Domain Decomposition
3:     Partition domain according to (19).
4:Schwarz Iteration:
5:     Initialize fm0f_{m}^{0} for each 𝒦m\partial\mathcal{K}_{m} and set t=0t=0 .
6:     While maxn|fntfnt1|>TOL\max_{n}|f_{n}^{t}-f_{n}^{t-1}|>\text{TOL}
7:         Solve (25) for umtu_{m}^{t} using fmtf_{m}^{t} for each mm .
8:         Update fnt+1=umtf_{n}^{t+1}=u_{m}^{t} on 𝒦m𝒦n\mathcal{K}_{m}\cap\partial\mathcal{K}_{n}, nmn\in\mathcal{I}_{m} .
9:         tt+1t\to t+1.      
10:     End
11:     Solve (25) for umtu_{m}^{t} using fmtf_{m}^{t} for each mm .
12:     Assemble global solution u=m=1Mχmumtu=\sum_{m=1}^{M}\chi_{m}u_{m}^{t} .
13:Return: approximate global solution uu.
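To make the iteration concrete, here is a minimal toy version of Algorithm 3 (our own example, not from the paper): alternating Schwarz for -u''=0 on [0,1] with two overlapping patches. In 1D a harmonic function is linear, so each local solve (25) is exact interpolation, and one sweep below is precisely the boundary-to-boundary update ft+1=𝒜ftf^{t+1}=\mathcal{A}f^{t}.

```python
# Toy Schwarz iteration for -u'' = 0 on [0,1] with u(0)=0, u(1)=1, and two
# overlapping patches K1 = [0, b], K2 = [a, 1], a < b.  The exact solution is
# u(x) = x, so the interface traces should converge to u(b)=b and u(a)=a.
a, b = 0.3, 0.7
g1 = 0.0   # current guess for u at x = b (boundary datum for patch K1)
g2 = 0.0   # current guess for u at x = a (boundary datum for patch K2)
for t in range(60):
    # Solve on K1 with u(0)=0, u(b)=g1 (a linear function); trace at x = a.
    g2 = g1 * a / b
    # Solve on K2 with u(a)=g2, u(1)=1; trace at x = b.
    g1 = g2 + (1.0 - g2) * (b - a) / (1.0 - a)
```

The sweep is a contraction (here with factor 9/49 per sweep), so the interface data, and with it the glued global solution, converges geometrically.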

Most of the computation in the Schwarz method during the iteration comes from solving the boundary-value PDEs on the patches, to implement the map 𝒜\mathcal{A}. Since the PDE is homogenizable, the solution space on each patch is approximately low rank, and the map 𝒜\mathcal{A} can be expected to inherit this property. If we can “learn” this operator in an offline stage, and simply apply a low-rank approximation repeatedly in the online stage, the online part of Algorithm 3 can be made much more efficient. In our approach, Algorithm 1 is used to compress the map 𝒜\mathcal{A}.

This approach is quite different from the one described in Section 4.1, in the sense that we approximate not only the range of the solution space but the whole operator. To apply Algorithm 1, we need to define the “adjoint operator” for 𝒜\mathcal{A} at the PDE level. This operator is composed of the adjoints 𝒮m\mathcal{S}_{m}^{\ast} of the local solution operators 𝒮m\mathcal{S}_{m} of (25) on each domain 𝒦m\mathcal{K}_{m}. The form of 𝒮m\mathcal{S}_{m}^{\ast} is specific to the PDE; we use the elliptic equation as an example. Taking ϵ=(a(x,x/ϵ))\mathcal{L}^{\epsilon}=\nabla\cdot\left(a(x,x/\epsilon)\nabla\right), 𝒮m\mathcal{S}_{m}^{\ast} is given in the following result.

Theorem 5.

Let 𝒮m\mathcal{S}_{m} be the confined solution operator for the elliptic equation with Dirichlet boundary condition on patch 𝒦m\mathcal{K}_{m}. Given any function gg supported on 𝒦~m\widetilde{\mathcal{K}}_{m}, the adjoint operator 𝒮m\mathcal{S}_{m}^{\ast} acting on gg is given by:

𝒮mg=ahn|𝒦m,{\mathcal{S}}_{m}^{\ast}g=a\frac{\partial h}{\partial n}\Big{|}_{\partial\mathcal{K}_{m}}\,, (27)

where hn\frac{\partial h}{\partial n} is the outer normal derivative on 𝒦m\partial\mathcal{K}_{m} and hh solves the following sourced elliptic equation:

{(a(x,xϵ)h(x))=g,x𝒦mh(x)=0,x𝒦m.\begin{cases}\nabla\cdot\left(a\left(x,\frac{x}{\epsilon}\right)\nabla h(x)\right)={g}\,,&x\in\mathcal{K}_{m}\\ h(x)=0\,,&x\in\partial\mathcal{K}_{m}\,.\end{cases} (28)

We also describe calculation of the adjoint operator 𝒮m\mathcal{S}_{m}^{\ast} for the RTE (4).

Theorem 6.

Let 𝒮m\mathcal{S}_{m} be the confined solution operator for the RTE (4), and suppose the conditions in Theorem 1 hold. Given any function gg supported on 𝒦~m×𝕍\widetilde{\mathcal{K}}_{m}\times\mathbb{V}, the adjoint operator 𝒮m\mathcal{S}_{m}^{\ast} is given by

𝒮mg(x,v)=h(x,v),x𝒦m,vnx<0,\mathcal{S}_{m}^{\ast}g(x,v)=h(x,v)\,,\quad x\in\partial\mathcal{K}_{m}\,,\quad v\cdot n_{x}<0\,, (29)

where hh solves the adjoint RTE over 𝒦m\mathcal{K}_{m}, which is

vxh𝒮ϵ[h]=g(x,v),(x,v)𝒦m×𝕍,-v\cdot\nabla_{x}h-\mathcal{S}^{\epsilon}[h]=g(x,v)\,,\quad(x,v)\in\mathcal{K}_{m}\times\mathbb{V}\,, (30)

with outgoing boundary condition h(x,v)=0h(x,v)=0 on x𝒦mx\in\partial\mathcal{K}_{m} and vnx>0v\cdot n_{x}>0.

The specific form of the adjoint operator 𝒮m\mathcal{S}_{m}^{\ast} allows us to adapt the randomized SVD algorithm to compress the confined solution map 𝒮m\mathcal{S}_{m}; see Algorithm 4. This method requires only kk solves of the local PDE (25) and kk solves of the sourced adjoint PDE (28) (or (30)), together with a QR factorization and an SVD of relatively small matrices. The overall low-rank Schwarz iteration is summarized in Algorithm 5.

Algorithm 4 Randomized SVD for 𝒮m\mathcal{S}_{m}
1:Given target rank rr and oversampling parameter pp;
2:Set k=r+pk=r+p;
3:Stage A:
4:     Generate kk random boundary conditions ξj\xi_{j} on 𝒦m\partial\mathcal{K}_{m}.
5:     Solve (25) using ξj\xi_{j} as boundary conditions and restrict the solutions to 𝒦~m\widetilde{\mathcal{K}}_{m} to obtain ujϵu_{j}^{\epsilon}.
6:     Find an orthonormal basis 𝖰=[q1,,qk]\mathsf{Q}=[q_{1},\dots,q_{k}] for the range of [u1ϵ,,ukϵ][u_{1}^{\epsilon},\dots,u_{k}^{\epsilon}].
7:Stage B:
8:     Construct the zero extension q~j\widetilde{q}_{j} of each qjq_{j} over 𝒦m\mathcal{K}_{m}.
9:     Solve (28) or (30) for hjh_{j} using q~j\widetilde{q}_{j} as the source.
10:     Compute bjb_{j} from hjh_{j} via the flux (27) or the restriction to the incoming boundary (29).
11:     Assemble all fluxes B=[b1,,bk]B=[b_{1},\dots,b_{k}].
12:     Compute SVD of B=𝖴~kΣk𝖵kB^{\ast}=\widetilde{\mathsf{U}}_{k}\Sigma_{k}\mathsf{V}_{k}^{\ast}.
13:     Compute 𝖴k=𝖰𝖴~k\mathsf{U}_{k}=\mathsf{Q}\widetilde{\mathsf{U}}_{k}.
14:Return 𝖴k,Σk,𝖵k\mathsf{U}_{k},\Sigma_{k},\mathsf{V}_{k}.
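In linear algebra terms, Algorithm 4 is randomized SVD with access to the operator only through its action and the action of its adjoint. A matrix-free sketch follows (function and argument names are ours; the test uses an explicit matrix standing in for the discretized 𝒮m\mathcal{S}_{m}):

```python
import numpy as np

def randomized_svd_matfree(matvec, rmatvec, n, r, p=5, rng=None):
    """Randomized SVD of a linear map given only its action (matvec) and its
    adjoint's action (rmatvec).  In Algorithm 4, matvec is a local solve of
    (25) with a random boundary condition, and rmatvec a sourced adjoint
    solve of (28) or (30)."""
    rng = np.random.default_rng(rng)
    k = r + p
    # Stage A: push k random inputs through the forward map.
    Y = np.column_stack([matvec(rng.standard_normal(n)) for _ in range(k)])
    Q, _ = np.linalg.qr(Y)
    # Stage B: adjoint solves against the orthonormal basis vectors.
    B = np.column_stack([rmatvec(Q[:, j]) for j in range(k)])   # n-by-k
    U_tilde, S, Vt = np.linalg.svd(B.T, full_matrices=False)
    return Q @ U_tilde, S, Vt.T
```

The operator is never formed: 2k2k solves replace the nmn_{m} solves needed to tabulate the map column by column.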

In Figure 6 we present the confined solution operator 𝒮m\mathcal{S}_{m} and its low-rank approximation 𝒮mr\mathcal{S}_{m}^{\mathrm{r}}. For the radiative transfer equation, the map is a 2880-by-40 matrix, with the size of each patch being 0.2×\times[-1,1], with Δx=1/360\Delta x=1/360 and Δv=1/40\Delta v=1/40. The random sampling procedure reconstructs it with just 6 samples. For the elliptic equation, the map is a 1600-by-160 matrix, and the size of each patch is 1×\times[0,1] with Δx=1/40\Delta x=1/40. The random sampling approximates it well with 60 samples. The compression rates for these examples are thus 6.7 and 2.7, respectively. See details in [MR4252068].

In Figure 7, we show numerical examples for the global solutions of two problems obtained from the approach of this section. The reference solution (obtained with a fine mesh) is well captured by the approximation that uses the low-rank BtB map as the surrogate in the Schwarz iteration. These two cases use just 15%15\% and 43%43\%, respectively, of the number of local solves needed to capture the BtB map at fine scale. While the relative error of the reduced Schwarz method decays as fast as the standard Schwarz iteration, as shown in Figure 8, the cost is much reduced. See Table 1 for a comparison of computation times.

Algorithm 5 Reduced Schwarz method for multiscale PDE ϵuϵ=0\mathcal{L}^{\epsilon}u^{\epsilon}=0 over 𝒦\mathcal{K} with uϵ=ϕ\mathcal{B}u^{\epsilon}=\phi on 𝒦\partial\mathcal{K}
1:Given rank kk and tolerance TOL.
2:Domain Decomposition
3:     Partition domain according to (19).
4:Offline Stage:
5:     For all mm, use Algorithm 4 to find the rank-kk approximation to 𝒮m\mathcal{S}_{m}, denoted by 𝖴kmΣkm𝖵km,\mathsf{U}_{k}^{m}\Sigma_{k}^{m}\mathsf{V}_{k}^{m,\ast}.
6:Online:
7:     Initialize fm0(x)f_{m}^{0}(x) for each 𝒦m\partial\mathcal{K}_{m}, and set t=0t=0.
8:     While maxn|fntfnt1|>TOL\max_{n}|f_{n}^{t}-f_{n}^{t-1}|>\text{TOL}
9:         Evaluate umt=𝖴kmΣkm𝖵km,fmtu_{m}^{t}=\mathsf{U}_{k}^{m}\Sigma_{k}^{m}\mathsf{V}_{k}^{m,\ast}f_{m}^{t} for each mm.
10:         Update fnt+1=umtf_{n}^{t+1}=u_{m}^{t} on 𝒦m𝒦n\mathcal{K}_{m}\cap\partial\mathcal{K}_{n}, nmn\in\mathcal{I}_{m}.
11:         tt+1t\to t+1.      
12:     End
13:     Solve (25) for umtu_{m}^{t} using fmtf_{m}^{t} for each mm .
14:     Assemble global solution u=m=1Mχmumtu=\sum_{m=1}^{M}\chi_{m}u^{t}_{m}.
15:Return u(x)u(x).
Refer to caption
Refer to caption
Figure 6: The singular value decay of the restricted local solution operator 𝒮m\mathcal{S}_{m} and its low-rank approximation 𝒮mr\mathcal{S}_{m}^{\mathrm{r}} for the RTE (left) and the elliptic equation (right). In RTE (4) we use heterogeneous collision kernel (9) k(x/ϵ,v,v)=σϵ(x)=1811.1+cos(4πx)1.1+sin(2πx/ϵ)k(x/\epsilon,v,v^{\prime})=\sigma^{\epsilon}(x)=\frac{1}{81}\frac{1.1+\cos(4\pi x)}{1.1+\sin(2\pi x/\epsilon)}, and in the elliptic equation we use a(x,x/ϵ)=2+1.8sin(πx1/ϵ)2+1.8cos(πx2/ϵ)+2+sin(πx2/ϵ)2+1.8sin(πx1)a(x,x/\epsilon)=\frac{2+1.8\sin(\pi x_{1}/\epsilon)}{2+1.8\cos(\pi x_{2}/\epsilon)}+\frac{2+\sin(\pi x_{2}/\epsilon)}{2+1.8\sin(\pi x_{1})} with ϵ=24\epsilon=2^{-4}.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 7: The comparison between the reference solutions (left column) and the approximation using reduced Schwarz method (right column). The first and second rows are for the RTE and the elliptic equation, respectively. In RTE (4) we use heterogeneous collision kernel (9) k(x/ϵ,v,v)=σϵ(x)=1ϵ11.1+cos(4πx)1.1+sin(2πx/ϵ2)k(x/\epsilon,v,v^{\prime})=\sigma^{\epsilon}(x)=\frac{1}{\epsilon_{1}}\frac{1.1+\cos(4\pi x)}{1.1+\sin(2\pi x/\epsilon_{2})}, with (ϵ1,ϵ2)=(1/81,1/9)(\epsilon_{1},\epsilon_{2})=(1/81,1/9), and in elliptic equation we use a(x,x/ϵ)=2+1.8sin(πx1/ϵ)2+1.8cos(πx2/ϵ)+2+sin(πx2/ϵ)2+1.8sin(πx1)a(x,x/\epsilon)=\frac{2+1.8\sin(\pi x_{1}/\epsilon)}{2+1.8\cos(\pi x_{2}/\epsilon)}+\frac{2+\sin(\pi x_{2}/\epsilon)}{2+1.8\sin(\pi x_{1})} with ϵ=24\epsilon=2^{-4}.
Refer to caption
Figure 8: Relative error for reduced Schwarz methods for various ranks kk for elliptic equation with oscillatory medium that is the same as Figure 7.
RTE offline (s) online (s)
Rank = 5 227.99 0.0029
Rank = 6 268.46 0.0162
Full rank 706.99 0.0148
Schwarz 1027.40
elliptic offline (s) online (s)
Rank = 40 49.7 0.049
Rank = 70 87.3 0.061
Schwarz 31.4
Table 1: Run time comparison between vanilla Schwarz method and the reduced Schwarz method. k=5,6k=5,6 for the RTE and k=40,70k=40,70 for the elliptic equation with Dirichlet boundary condition. The configuration of the media is the same as those in Figure 7.

5 Manifold learning and nonlinear multiscale problems

It is not straightforward to extend the techniques of the previous section to nonlinear PDEs. Although low-rank properties still hold, due to the existence of a limiting equation, the argument based on compressing the Green’s matrix no longer applies. The collection of solutions for different source / boundary terms is no longer a linear subspace, but a solution manifold.

We consider the general nonlinear multiscale problem in the following form

𝒩ϵuϵ=f,\mathcal{N}^{\epsilon}u^{\epsilon}=f\,, (31)

where 𝒩ϵ\mathcal{N}^{\epsilon} is a nonlinear differential operator that depends explicitly on the small parameter ϵ\epsilon. The term ff can be the source term, boundary conditions or initial conditions. Assume further that the equation has an asymptotic limit

𝒩u=f\mathcal{N}^{\ast}u^{\ast}=f (32)

as ϵ0\epsilon\to 0, that is, uϵu0\|u^{\epsilon}-u^{\ast}\|\to 0 as ϵ0\epsilon\to 0. The argument for the linear problem is still applicable: the degrees of freedom required by classical numerical methods for solving (31) grow rapidly as ϵ0\epsilon\to 0, while the existence of the homogenized equation (32) indicates that only O(1)O(1) degrees of freedom should be needed to resolve the macro-scale features. From a manifold perspective, the solutions to (31) vary in a high-dimensional space as ff changes, but this manifold is approximated to within distance O(ϵ)O(\epsilon) by another manifold whose dimension is O(1)O(1).

If a manifold in a high-dimensional space is approximately low-dimensional, can we learn it quickly without paying the high-dimensional cost? We turn to manifold learning for answers to this question. We are particularly interested in adopting ideas from local linear embedding and multiscale SVD approaches, which learn the manifold from observed point clouds and interpolate the local solution manifold using multiple tangent-space patches; see the references in [chen2020manifold].

We denote the nonlinear solution map of (31) by 𝒮ϵ:f𝒳uϵ𝒴\mathcal{S}^{\epsilon}:f\in\mathcal{X}\to u^{\epsilon}\in\mathcal{Y}, which maps the source term or initial / boundary conditions f(x)f(x) to the solution of the equation. In the offline stage, we randomly sample a large number of configurations fif_{i} in 𝒳\mathcal{X} and compute the associated solutions uiϵ=𝒮ϵfi𝒴u^{\epsilon}_{i}=\mathcal{S}^{\epsilon}f_{i}\in\mathcal{Y} on fine grids. These solutions form a point cloud in the high-dimensional space 𝒴\mathcal{Y}. The samples fif_{i} are then subdivided into a number of small neighborhoods, and we construct tangential approximations to the mapping 𝒮ϵ\mathcal{S}^{\epsilon} on each neighborhood. In the online stage, given a new configuration ff, we identify the small neighborhood to which it belongs and find the corresponding solution by linear interpolation. The overall offline-online strategy is summarized in Algorithm 6. We stress that some modifications are needed to reduce the cost of implementation; for example, the algorithm should be combined with domain decomposition (such as Schwarz iteration) to confine the computation to local domains.

In Figure 9 we plot the local low-dimensional solution manifold for a nonlinear RTE (specifically, a linear RTE nonlinearly coupled with a temperature term). The solution manifold appears to have a local two-dimensional structure; the point cloud lies nearly on a two-dimensional plane. We refer to [chen2020manifold] for more details of the implementation and numerical results.

Algorithm 6 Manifold learning algorithm for solving 𝒩ϵuϵ=f\mathcal{N}^{\epsilon}u^{\epsilon}=f
1:Offline
2:     Randomly sample fi(x)f_{i}(x), i=1,,Ni=1,\dots,N, and find solutions uiϵ=𝒮ϵfiu^{\epsilon}_{i}=\mathcal{S}^{\epsilon}f_{i}.
3:Online: Given f(x)f(x):
4:     Step 1: Identify the kk-nearest neighbors of f(x)f(x), call them fij,j=1,2,,kf_{i_{j}},j=1,2,\dots,k, with fi1f_{i_{1}} being the nearest neighbor;
5:     Step 2: Compute 𝒮ϵfui1ϵ+𝖴c\mathcal{S}^{\epsilon}f\approx u^{\epsilon}_{i_{1}}+\mathsf{U}\cdot{c} with
𝖴=[||ui2ϵui1ϵuikϵui1ϵ||],\mathsf{U}=\begin{bmatrix}|&&|\\ u^{\epsilon}_{i_{2}}-u^{\epsilon}_{i_{1}}&\dots&u^{\epsilon}_{i_{k}}-u^{\epsilon}_{i_{1}}\\ |&&|\end{bmatrix},
where cc is a vector of coefficients that fits ffi1f-f_{i_{1}} with a linear combination of fijfi1f_{i_{j}}-f_{i_{1}}, for j=2,3,,kj=2,3,\dotsc,k.
6:Return uϵ𝒮ϵfu^{\epsilon}\approx\mathcal{S}^{\epsilon}f.
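The online stage of Algorithm 6 amounts to a k-nearest-neighbor local linear interpolation of the solution map. A minimal sketch follows (the function name is ours; in the test we use a linear map, on which the interpolation is exact whenever the neighbor directions span the input space):

```python
import numpy as np

def local_linear_solve(f, f_samples, u_samples, k=4):
    """Online step of Algorithm 6: approximate the solution map at f from
    stored pairs (f_i, u_i) using the k nearest neighbors of f."""
    dist = np.linalg.norm(f_samples - f, axis=1)
    idx = np.argsort(dist)[:k]                       # idx[0] = nearest neighbor
    F = (f_samples[idx[1:]] - f_samples[idx[0]]).T   # input-space directions
    U = (u_samples[idx[1:]] - u_samples[idx[0]]).T   # matching output directions
    # Fit f - f_{i1} by a linear combination of the input directions (Step 2).
    c, *_ = np.linalg.lstsq(F, f - f_samples[idx[0]], rcond=None)
    return u_samples[idx[0]] + U @ c
```

The accuracy of this tangent-plane approximation degrades with the curvature of the solution manifold and the sampling density, which is why the offline point cloud must be dense enough within each neighborhood.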
Refer to caption
Figure 9: Point cloud and its fitting plane for a 1D nonlinear RTE with Knudsen number ϵ=26\epsilon=2^{-6}, when confined in a small patch containing an interval [0.625,1.375][0.625\,,1.375]. The solution profile is approximately determined by the temperature TT at two grid points x=0.625x=0.625 and x=1.375x=1.375. The zz-axis shows the value of uϵ(x=1)u^{\epsilon}(x=1). The dependence is clearly linear and two-dimensional.

6 Looking Forward

We have seen a vast literature addressing all aspects of the computation of multiscale problems. Over the years, the research has drifted gradually away from its origin, where solvers were informed by analytical understanding, specifically of the limiting behavior of the specific PDE. Machine learning algorithms have shown increasing power in sketching the solution profile at much reduced numerical cost. In particular, the existence of the homogenized limit suggests that there are low-rank features in the discrete system, and that randomized linear algebra techniques and manifold learning methods, when utilized properly, can identify these features for a compressed representation of the PDE solutions.

We have reviewed two methods, both of which make use of the domain decomposition framework. They compress either the basis functions or the boundary-to-boundary map in an offline learning stage. This review article serves as a showcase of the power of randomized solvers in numerical PDEs. For time-dependent problems, and for homogenization problems with weak convergence (instead of strong convergence), such as quantum systems in the semi-classical regime, further development of these approaches is needed. Incorporating time dependence and weak limits into the algorithm design lies at the core of future challenges.

7 Acknowledgements

The authors acknowledge generous support from NSF, ONR, DOE, and AFOSR. Due to restrictions on the allowed number of references, many important contributions are omitted. The reader is invited to consult the bibliographies in the cited references for a more extensive view of the literature.

References