This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A joint reconstruction and lambda tomography regularization technique for energy-resolved X-ray imaging
\ddmmyyyydate \currenttime

James Webber Department of Electrical and Computer Engineering, Tufts University, Medford, MA USA [email protected] Eric Todd Quinto Department of Mathematics, Tufts University, Medford, MA USA [email protected]  and  Eric L. Miller Department of Electrical and Computer Engineering, Tufts University, Medford, MA USA [email protected]
Abstract.

We present new joint reconstruction and regularization techniques inspired by ideas in microlocal analysis and lambda tomography, for the simultaneous reconstruction of the attenuation coefficient and electron density from X-ray transmission (i.e., X-ray CT) and backscattered data (assumed to be primarily Compton scattered). To demonstrate our theory and reconstruction methods, we consider the “parallel line segment” acquisition geometry of [54], which is motivated by system architectures currently under development for airport security screening. We first present a novel microlocal analysis of the parallel line geometry which explains the nature of image artefacts when the attenuation coefficient and electron density are reconstructed separately. We next introduce a new joint reconstruction scheme for low effective ZZ (atomic number) imaging (Z<20Z<20) characterized by a regularization strategy whose structure is derived from lambda tomography principles and motivated directly by the microlocal analytic results. Finally we show the effectiveness of our method in combating noise and image artefacts on simulated phantoms.

1. Introduction

In this paper we introduce new joint reconstruction and regularization techniques based on ideas in microlocal analysis and lambda tomography [13, 14] (see also [32] for related work). We consider the simultaneous reconstruction of the attenuation coefficient μE\mu_{E} and electron density nen_{e} from joint X-ray CT (transmission) and Compton scattered data, with particular focus on the parallel line segment X-ray scanner displayed in figures 1 and 2. The acquisition geometry in question is based on a new airport baggage scanner currently in development, and has the ability to measure X-ray CT and Compton data simultaneously. The line segment geometry was first considered in [54], where injectivity results are derived in Compton Scattering Tomography (CST). We provide a stability analysis of the CST problem of [54] here, from a microlocal perspective. The scanner depicted in figure 1 consists of a row of fixed, switched, monochromatic fan beam sources (SS), a row of detectors (DAD_{A}) to measure the transmitted photons, and a second (slightly out of plane) row of detectors (DCD_{C}) to measure Compton scatter. The detectors are assumed to energy-resolved, a common assumption in CST [35, 51, 41, 43, 55], and the sources are fan-beam (in the plane) with opening angle π\pi (so there is no restriction due to cropped fan-beams).

DAD_{A} at {z=2rm}\{z=2-r_{m}\}DCD_{C}SSx2x_{2}x1x_{1}OOLLCRC_{R}aa
Figure 1. Parallel line X-ray CT geometry. Here SS, DCD_{C} and DAD_{A} denote the source and detector rows. The length of the detector (and source) array is 2a2a. A cone CRS1C_{R}\subset S^{1} is highlighted in orange. We will refer to CRC_{R} later for visualisation in section 4.

The attenuation coefficient relates to the X-ray transmission data by the Beer-Lambert law, log(I0IA)=LμEdl\log\left(\frac{I_{0}}{I_{A}}\right)=\int_{L}\mu_{E}\mathrm{d}l [31, page 2] where IAI_{A} is the photon intensity measured at the detector, I0I_{0} is the initial source intensity and μE\mu_{E} is the attenuation coefficient at energy EE. Here LL is a line through SS and DAD_{A}, with arc measure dl\mathrm{d}l. Hence the transmission data determines a set of integrals of μE\mu_{E} over lines, and the problem of reconstructing μE\mu_{E} is equivalent to inversion of the line Radon transform with limited data (e.g., [31, 33]). Note that we need not account for the energy dependence of μE\mu_{E} in this case as the detectors are energy-resolved, and hence there are no issues due to beam-hardening. See figure 1.

When the attenuation of the incoming and scattered rays is ignored, the Compton scattered intensity in two-dimensions can be modelled as integrals of nen_{e} over toric sections [35, 51, 55]

(1.1) IC=Tnedt,I_{C}=\int_{T}n_{e}\mathrm{d}t,

where ICI_{C} is the Compton scattered intensity measured at a point on DCD_{C}. A toric section T=C1C2T=C_{1}\cup C_{2} is the union of two intersecting circles of the same radii (as displayed in figure 2), and dt\mathrm{d}t is the arc measure on TT. The recovery of nen_{e} is equivalent to inversion of the toric section Radon transform [34, 35, 51, 55]. See figure 2. See also [41, 43] for alternative reconstruction methods. We now discuss the approximation made above to neglect the attenuative effects from the CST model. When the attenuation effects are included, the inverse scattering problem becomes nonlinear [42]. We choose to focus on the analysis of the idealised linear case here, as this allows us to apply the well established theory on linear Fourier Integral Operators (FIO) and microlocal analysis to derive expression for the image artefacts. Such analysis will likely give valuable insight into the expected artefacts in the nonlinear case. The nonlinear models and their inversion properties are left for future work.

DAD_{A} at {z=2rm}\{z=2-r_{m}\}DCD_{C}111111SSaax2x_{2}OOx1x_{1}x0x_{0}rrss𝐱{\mathbf{x}}𝝃{\boldsymbol{\xi}}CTC_{T}C1(s,x0)C_{1}(s,x_{0})C2(s,x0)C_{2}(s,x_{0})β\beta
Figure 2. Parallel line CST geometry. SS, DCD_{C} and DAD_{A} denote the source and detector rows. The remaining labels are referenced in the main text. A cone CTS1C_{T}\subset S^{1} is highlighted in orange. We will refer to CTC_{T} later for visualisation in section 3. Note that we have cropped out part of the left side (left of OO) of the scanner of figure 1 in this picture.

The line and circular arc Radon transforms with full data, are known [35, 30] to have inverses that are continuous in some range of Sobolev norms. Hence with adequate regularization we can reconstruct an image free of artefacts. With limited data however [31, 51], the solution is unstable and the image wavefront set (see Definition 2.2) is not recovered stably in all directions. We will see later in section 3 through simulation that such data limitations in the parallel line geometry cause a blurring artefact over a cone in the reconstruction. There may also be nonlocal artefacts specific to the geometry (as in [55]), which we shall discover later in section 3 in the geometry of figure 2.

The main goal of this paper is to combine limited datasets in X-ray CT and CST with new lambda tomography regularization techniques, to recover the image edges stably in all directions. We focus particularly on the geometry of figure 1. In lambda tomography the image reconstruction is carried out by filtered backprojection of the Radon projections, where the filter is chosen to emphasize boundaries. This means that the jump singularities in the lambda reconstruction have the same location and direction to those of the target function, but the smooth parts are undetermined. A common choice of filter is a second derivative in the linear variable [7, 43]. The application of the derivative filter emphasizes the singularities in the Radon projections, and this is a key idea behind lambda tomography [13, 14, 52], and the microlocal view on lambda CT (e.g., [7, 39, 43]). The regularization penalty we propose aims to minimize the difference dmdsmR(μEne)L2(×S1)\|\frac{\mathrm{d}^{m}}{\mathrm{d}s^{m}}R(\mu_{E}-n_{e})\|_{L^{2}(\mathbb{R}\times S^{1})} for some m1m\geq 1, where RR denotes the Radon line transform. Therefore, with a full set of Radon projections, the lambda penalties enforce a similarity in the locations and direction of the image singularities (edges) of μE\mu_{E} and nen_{e}. Further dmdsmR\frac{\mathrm{d}^{m}}{\mathrm{d}s^{m}}R for m1m\geq 1 is equivalent to taking m1/2m-1/2 derivatives of the object (this operation is continuous of positive order m1/2m-1/2 in Sobolev scales), and hence its inverse is a smoothing operation, which we expect to be of aid in combatting the measurement noise. In addition, the regularized inverse problem we propose is linear (similarly to the Tikhonov regularized inverse [21, page 99]), which (among other benefits of linearity) allows for the fast application of iterative least squares solvers in the solution.

The literature considers joint image reconstruction and regularization in for example, [1, 19, 40, 46, 47, 6, 50, 12, 20, 11, 48, 10, 45, 4]. See also the special issue [3] for a more general review of joint reconstruction techniques. In [12] the authors consider the joint reconstruction from Positron Emission Tomography (PET) and Magnetic Resonance Imaging (MRI) data and use a Parallel Level Set (PLS) prior for the joint regularization. The PLS approach (first introduced in [10]) imposes soft constraints on the equality of the image gradient location and direction, thus enforcing structural similarity in the image wavefront sets. This follows a similar intuition to the “Nambu” functionals of [48] and the “cross-gradient” methods of [16, 17] in seismic imaging, the latter of which specify hard constraints that the gradient cross products are zero (i.e. parallel image gradients). The methods of [12] use linear and quadratic formulations of PLS, denoted by Linear PLS (LPLS) and Quadratic PLS (QPLS). The LPLS method will be a point of comparison with the proposed method. We choose to compare with LPLS as it is shown to offer greater performance than QPLS in the experiments conducted in [12].

In [20] the authors consider a class of techniques in joint reconstruction and regularization, including inversion through correspondence mapping, mutual information and Joint Total Variation (JTV). In addition to LPLS, we will compare against JTV as the intuition of JTV is similar to that of lambda regularization (and LPLS), in the sense that a structural similarity is enforced in the image wavefronts. Similar to standard Total Variation (TV), which favours sparsity in the (single) image gradient, the JTV penalties (first introduced in [45] for colour imaging) favour sparsity in the joint gradient. Thus the image gradients are more likely to occur in the same location and direction upon minimization of JTV. The JTV penalties also have generalizations in colour imaging and vector-valued imaging [4].

In [54] the authors introduce a new toric section transform 𝒯\mathcal{T} in the geometry of figure 2. Here explicit inversion formulae are derived, but the stability analysis is lacking. We aim to address the stability of 𝒯\mathcal{T} in this work from a microlocal perspective. Through an analysis of the canonical relations of 𝒯\mathcal{T}, we discover the existence of nonlocal artefacts in the inversion, similarly to [55]. In [40] the joint reconstruction of μE\mu_{E} and nen_{e} is considered in a pencil beam scanner geometry. Here gradient descent solvers are applied to nonlinear objectives, derived from the physical models, and a weighted, iterative Tikhonov type penalty is applied. The works of [6] improve the wavefront set recovery in limited angle CT using a partially learned, hybrid reconstruction scheme, which adopts ideas in microlocal analysis and neural networks. The fusion with Compton data is not considered however. In our work we assume an equality in the wavefront sets of nen_{e} and μE\mu_{E} (in a similar vein to [47]), and we investigate the microlocal advantages of combining Compton and transmission data, as such an analysis is lacking in the literature.

The remainder of this paper is organized as follows. In section 2, we recall some definitions and theorems from microlocal analysis. In section 3 we present a microlocal analysis of 𝒯\mathcal{T} and explain the image artefacts in the nen_{e} reconstruction. Here we prove our main theorem (Theorem 3.2), where we show that the canonical relation 𝒞\mathcal{C} of 𝒯\mathcal{T} is 2–1. This implies the existence of nonlocal image artefacts in a reconstruction from toric section integral data. Further we find explicit expressions for the nonlocal artefacts and simulate these by applying the normal operations 𝒯𝒯\mathcal{T}^{*}\mathcal{T} to a delta function. In section 4 we consider the microlocal artefacts from X-ray (transmission) data. This yields a limited dataset for the Radon transform, whereby we have knowledge the line integrals for all LL which intersect SS and DAD_{A} (see figure 1). We use the results in [5] to describe the resulting artifacts in the X-ray CT reconstruction. In section 5, we detail our joint reconstruction method for the simultaneous reconstruction of μE\mu_{E} and nen_{e}. Later in section 5.4 we present simulated reconstructions of μE\mu_{E} and nen_{e} using the proposed methods and compare against JTV [20] and LPLS [12] from the literature. We also give a comparison to a separate reconstruction using TV.

2. Microlocal definitions

We next provide some notation and definitions. Let XX and YY be open subsets of n{{\mathbb{R}}^{n}}. Let 𝒟(X)\mathcal{D}(X) be the space of smooth functions compactly supported on XX with the standard topology and let 𝒟(X)\mathcal{D}^{\prime}(X) denote its dual space, the vector space of distributions on XX. Let (X)\mathcal{E}(X) be the space of all smooth functions on XX with the standard topology and let (X)\mathcal{E}^{\prime}(X) denote its dual space, the vector space of distributions with compact support contained in XX. Finally, let 𝒮(n)\mathcal{S}({{\mathbb{R}}^{n}}) be the space of Schwartz functions, that are rapidly decreasing at \infty along with all derivatives. See [44] for more information.

Definition 2.1 ([25, Definition 7.1.1]).

For a function ff in the Schwartz space 𝒮(n)\mathcal{S}(\mathbb{R}^{n}), we define the Fourier transform and its inverse as

(2.1) f(𝝃)=nei𝐱𝝃f(𝐱)d𝐱,1f(𝐱)=(2π)nnei𝐱𝝃f(𝝃)d𝝃.\mathcal{F}f({\boldsymbol{\xi}})=\int_{\mathbb{R}^{n}}e^{-i{\mathbf{x}}\cdot{\boldsymbol{\xi}}}f({\mathbf{x}})\mathrm{d}{\mathbf{x}},\qquad\mathcal{F}^{-1}f({\mathbf{x}})=(2\pi)^{-n}\int_{\mathbb{R}^{n}}e^{i{\mathbf{x}}\cdot{\boldsymbol{\xi}}}f({\boldsymbol{\xi}})\mathrm{d}{\boldsymbol{\xi}}.

We use the standard multi-index notation: if α=(α1,α2,,αn){0,1,2,}n\alpha=(\alpha_{1},\alpha_{2},\dots,\alpha_{n})\in\left\{0,1,2,\dots\right\}^{n} is a multi-index and ff is a function on n{{\mathbb{R}}^{n}}, then

αf=(x1)α1(x2)α2(xn)αnf.\partial^{\alpha}f=\left(\frac{\partial}{\partial x_{1}}\right)^{\alpha_{1}}\left(\frac{\partial}{\partial x_{2}}\right)^{\alpha_{2}}\cdots\left(\frac{\partial}{\partial x_{n}}\right)^{\alpha_{n}}f.

If ff is a function of (𝐲,𝐱,𝝈)({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}}) then 𝐲αf\partial^{\alpha}_{\mathbf{y}}f and 𝝈αf\partial^{\alpha}_{\boldsymbol{\sigma}}f are defined similarly.

We identify cotangent spaces on Euclidean spaces with the underlying Euclidean spaces, so we identify T(X)T^{*}(X) with X×nX\times{{\mathbb{R}}^{n}}.

If ϕ\phi is a function of (𝐲,𝐱,𝝈)Y×X×N({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}})\in Y\times X\times{{\mathbb{R}}}^{N} then we define d𝐲ϕ=(ϕy1,ϕy2,,ϕyn)\mathrm{d}_{{\mathbf{y}}}\phi=\left(\frac{\partial\phi}{\partial y_{1}},\frac{\partial\phi}{\partial y_{2}},\cdots,\frac{\partial\phi}{\partial y_{n}}\right), and d𝐱ϕ\mathrm{d}_{\mathbf{x}}\phi and d𝝈ϕ\mathrm{d}_{\boldsymbol{\sigma}}\phi are defined similarly. We let dϕ=(d𝐲ϕ,d𝐱ϕ,d𝝈ϕ)\mathrm{d}\phi=\left(\mathrm{d}_{{\mathbf{y}}}\phi,\mathrm{d}_{{\mathbf{x}}}\phi,\mathrm{d}_{\boldsymbol{\sigma}}\phi\right).

The singularities of a function and the directions in which they occur are described by the wavefront set [9, page 16]:

Definition 2.2.

Let XX Let an open subset of n{{\mathbb{R}}^{n}} and let ff be a distribution in 𝒟(X)\mathcal{D}^{\prime}(X). Let (𝐱0,𝝃0)X×(n{𝟎})({\mathbf{x}}_{0},{\boldsymbol{\xi}}_{0})\in X\times(\mathbb{R}^{n}\setminus\left\{\mathbf{0}\right\}). Then ff is smooth at 𝐱0{\mathbf{x}}_{0} in direction 𝛏0{\boldsymbol{\xi}_{0}} if exists a neighbourhood UU of 𝐱0{\mathbf{x}}_{0} and VV of 𝝃0{\boldsymbol{\xi}}_{0} such that for every ϕ𝒟(U)\phi\in\mathcal{D}(U) and NN\in\mathbb{R} there exists a constant CNC_{N} such that

(2.2) |(ϕf)(λ𝝃)|CN(1+|λ|)N.\left|\mathcal{F}(\phi f)(\lambda{\boldsymbol{\xi}})\right|\leq C_{N}(1+\left|\lambda\right|)^{-N}.

The pair (𝐱0,𝝃0)({\mathbf{x}}_{0},{\boldsymbol{\xi}_{0}}) is in the wavefront set, WF(f)\mathrm{WF}(f), if ff is not smooth at 𝐱0{\mathbf{x}}_{0} in direction 𝝃0{\boldsymbol{\xi}_{0}}.

This definition follows the intuitive idea that the elements of WF(f)\mathrm{WF}(f) are the point–normal vector pairs above points of XX where ff has singularities. For example, if ff is the characteristic function of the unit ball in 3\mathbb{R}^{3}, then its wavefront set is WF(f)={(𝐱,t𝐱):𝐱S2,t0}\mathrm{WF}(f)=\{({\mathbf{x}},t{\mathbf{x}}):{\mathbf{x}}\in S^{2},t\neq 0\}, the set of points on a sphere paired with the corresponding normal vectors to the sphere.

The wavefront set of a distribution on XX is normally defined as a subset the cotangent bundle T(X)T^{*}(X) so it is invariant under diffeomorphisms, but we will continue to identify T(X)=X×nT^{*}(X)=X\times{{\mathbb{R}}^{n}} and consider WF(f)\mathrm{WF}(f) as a subset of X×n{𝟎}X\times{{\mathbb{R}}^{n}}\setminus\left\{\mathbf{0}\right\}.

Definition 2.3 ([25, Definition 7.8.1]).

We define Sm(Y×X×N)S^{m}(Y\times X\times\mathbb{R}^{N}) to be the set of a(Y×X×N)a\in\mathcal{E}(Y\times X\times\mathbb{R}^{N}) such that for every compact set KY×XK\subset Y\times X and all multi–indices α,β,γ\alpha,\beta,\gamma the bound

|𝐲γ𝐱β𝝈αa(𝐲,𝐱,𝝈)|CK,α,β(1+|𝝈|)m|α|,(𝐲,𝐱)K,𝝈N,\left|\partial^{\gamma}_{{\mathbf{y}}}\partial^{\beta}_{{\mathbf{x}}}\partial^{\alpha}_{{\boldsymbol{\sigma}}}a({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}})\right|\leq C_{K,\alpha,\beta}(1+|{\boldsymbol{\sigma}}|)^{m-|\alpha|},\ \ \ ({\mathbf{y}},{\mathbf{x}})\in K,\ {\boldsymbol{\sigma}}\in\mathbb{R}^{N},

holds for some constant CK,α,β>0C_{K,\alpha,\beta}>0. The elements of SmS^{m} are called symbols of order mm.

Note that these symbols are sometimes denoted S1,0mS^{m}_{1,0}.

Definition 2.4 ([26, Definition 21.2.15]).

A function ϕ=ϕ(𝐲,𝐱,𝝈)(Y×X×N{𝟎})\phi=\phi({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}})\in\mathcal{E}(Y\times X\times\mathbb{R}^{N}\setminus\left\{\mathbf{0}\right\}) is a phase function if ϕ(𝐲,𝐱,λ𝝈)=λϕ(𝐲,𝐱,𝝈)\phi({\mathbf{y}},{\mathbf{x}},\lambda{\boldsymbol{\sigma}})=\lambda\phi({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}}), λ>0\forall\lambda>0 and dϕ\mathrm{d}\phi is nowhere zero. A phase function is clean if the critical set Σϕ={(𝐲,𝐱,𝝈):d𝝈ϕ(𝐲,𝐱,𝝈)=0}\Sigma_{\phi}=\{({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}})\ :\ \mathrm{d}_{\boldsymbol{\sigma}}\phi({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}})=0\} is a smooth manifold with tangent space defined by d(d𝝈ϕ)=0\mathrm{d}\left(\mathrm{d}_{\boldsymbol{\sigma}}\phi\right)=0.

By the implicit function theorem the requirement for a phase function to be clean is satisfied if d(d𝝈ϕ)\mathrm{d}\left(\mathrm{d}_{\boldsymbol{\sigma}}\phi\right) has constant rank.

Definition 2.5 ([26, Definition 21.2.15] and [27, Section 25.2]).

Let XX and YY be open subsets of n{{\mathbb{R}}^{n}}. Let ϕ(Y×X×N)\phi\in\mathcal{E}\left(Y\times X\times{{{\mathbb{R}}}}^{N}\right) be a clean phase function. In addition, we assume that ϕ\phi is nondegenerate in the following sense:

d𝐲,𝝈ϕ\mathrm{d}_{{\mathbf{y}},{\boldsymbol{\sigma}}}\phi and d𝐱,𝝈ϕ\mathrm{d}_{{\mathbf{x}},{\boldsymbol{\sigma}}}\phi are never zero.

The critical set of ϕ\phi is

Σϕ={(𝐲,𝐱,𝝈)Y×X×N{𝟎}:d𝝈ϕ=0}.\Sigma_{\phi}=\{({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}})\in Y\times X\times\mathbb{R}^{N}\setminus\left\{\mathbf{0}\right\}:\mathrm{d}_{{\boldsymbol{\sigma}}}\phi=0\}.

The canonical relation parametrised by ϕ\phi is defined as

(2.3) 𝒞=\displaystyle\mathcal{C}= {((𝐲,d𝐲ϕ(𝐲,𝐱,𝝈));(𝐱,d𝐱ϕ(𝐲,𝐱,𝝈))):(𝐲,𝐱,𝝈)Σϕ},\displaystyle\left\{\left(\left({\mathbf{y}},\mathrm{d}_{{\mathbf{y}}}\phi({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}})\right);\left({\mathbf{x}},-\mathrm{d}_{{\mathbf{x}}}\phi({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}})\right)\right):({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}})\in\Sigma_{\phi}\right\},
Definition 2.6.

Let XX and YY be open subsets of n{{\mathbb{R}}^{n}}. A Fourier integral operator (FIO) of order m+N/2n/2m+N/2-n/2 is an operator A:𝒟(X)𝒟(Y)A:\mathcal{D}(X)\to\mathcal{D}^{\prime}(Y) with Schwartz kernel given by an oscillatory integral of the form

(2.4) KA(𝐲,𝐱)=Neiϕ(𝐲,𝐱,𝝈)a(𝐲,𝐱,𝝈)d𝝈,K_{A}({\mathbf{y}},{\mathbf{x}})=\int_{\mathbb{R}^{N}}e^{i\phi({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}})}a({\mathbf{y}},{\mathbf{x}},{\boldsymbol{\sigma}})\mathrm{d}{\boldsymbol{\sigma}},

where ϕ\phi is a clean nondegenerate phase function and aSm(Y×X×N)a\in S^{m}(Y\times X\times\mathbb{R}^{N}) is a symbol. The canonical relation of AA is the canonical relation of ϕ\phi defined in (2.3).

This is a simplified version of the definition of FIO in [8, Section 2.4] or [27, Section 25.2] that is suitable for our purposes since our phase functions are global. For general information about FIOs see [8, 27, 26].

Definition 2.7.

Let 𝒞T(Y×X)\mathcal{C}\subset T^{*}(Y\times X) be the canonical relation associated to the FIO A:(X)𝒟(Y)A:\mathcal{E}^{\prime}(X)\to\mathcal{D}^{\prime}(Y). Then we let πL\pi_{L} and πR\pi_{R} denote the natural left- and right-projections of 𝒞\mathcal{C}, πL:𝒞T(Y)\pi_{L}:\mathcal{C}\to T^{*}(Y) and πR:𝒞T(X)\pi_{R}:\mathcal{C}\to T^{*}(X).

Because ϕ\phi is nondegenerate, the projections do not map to the zero section. If a FIO \mathcal{F} satisfies our next definition, then \mathcal{F}^{*}\mathcal{F} (or ϕ\mathcal{F}^{*}\phi\mathcal{F} if \mathcal{F} does not map to (Y)\mathcal{E}^{\prime}(Y)) is a pseudodifferential operator [18, 37].

Definition 2.8.

Let :(X)𝒟(Y)\mathcal{F}:\mathcal{E}^{\prime}(X)\to\mathcal{D}^{\prime}(Y) be a FIO with canonical relation 𝒞\mathcal{C} then \mathcal{F} (or 𝒞\mathcal{C}) satisfies the semi-global Bolker Assumption if the natural projection πY:𝒞T(Y)\pi_{Y}:\mathcal{C}\to T^{*}(Y) is an embedding (injective immersion).

3. Microlocal properties of translational Compton transforms

Here we present a microlocal analysis of the toric section transform in the translational (parallel line) scanning geometry. Through an analysis of two separate limited data problems for the circle transform (where the integrals over circles with centres on a straight line are known) and using microlocal analysis, we show that the canonical relation of the toric section transform is 2–1. The analysis follows in a similar way to the work of [55]. We discuss the nonlocal artefacts inherent to the toric section inversion in section 3.1, and then go on to explain the artefacts due to limited data in section 3.2.

We first define our geometry and formulate the toric section transform of [54] in terms of δ\delta functions, before proving our main microlocal theory.

Let rm>1r_{m}>1 and define the set of points to be scanned as

X:={(x1,x2)2:2rm<x2<1}.X:=\{(x_{1},x_{2})\in{{\mathbb{R}}^{2}}\hskip 0.85358pt:\hskip 0.85358pt2-r_{m}<x_{2}<1\}.

Note that rmr_{m} controls the depth of the scanning tunnel as in figures 1 and 2. Let

Y:=(0,)×Y:=(0,\infty)\times\mathbb{R}

then for j=1,2j=1,2, and (s,x0)Y(s,x_{0})\in Y, we define the circles CjC_{j} and their centers 𝐜j\mathbf{c}_{j}

(3.1) r=s2+1,𝐜j(s,x0)=((1)js+x0,2)Cj(s,x0)={𝐱2:|𝐱𝐜j(s,x0)|2s21=0}.\begin{gathered}r=\sqrt{s^{2}+1},\quad\mathbf{c}_{j}(s,x_{0})=((-1)^{j}s+x_{0},2)\\ C_{j}(s,x_{0})=\{{\mathbf{x}}\in\mathbb{R}^{2}:|{\mathbf{x}}-\mathbf{c}_{j}(s,x_{0})|^{2}-s^{2}-1=0\}.\end{gathered}

Note that r=s2+1r=\sqrt{s^{2}+1} is the radius of the circle CjC_{j}. The union of the reflected circles C1C2C_{1}\cup C_{2} is called a toric section. Let fL02(X)f\in L^{2}_{0}(X) be the electron charge density. To define the toric section transform we first introduce two circle transforms

(3.2) 𝒯1f(s,x0)=C1fds,𝒯2f(s,x0)=C2fds.\mathcal{T}_{1}f(s,x_{0})=\int_{C_{1}}f\mathrm{d}s,\ \ \ \ \ \ \ \ \mathcal{T}_{2}f(s,x_{0})=\int_{C_{2}}f\mathrm{d}s.

Now we have the definition of the toric section transform [54]

(3.3) 𝒯f(s,x0)=C1C2fds=𝒯1(f)(s,x0)+𝒯2(f)(s,x0)\mathcal{T}f(s,x_{0})=\int_{C_{1}\cup C_{2}}f\mathrm{d}s=\mathcal{T}_{1}(f)(s,x_{0})+\mathcal{T}_{2}(f)(s,x_{0})

where ds\mathrm{d}s denotes the arc element on a circle and (s,x0)Y(s,x_{0})\in Y.

We express 𝒯\mathcal{T} in terms of delta functions as is done for the generalized Funk-Radon transforms studied by Palamodov. [36]

(3.4) 𝒯f(s,x0)=𝒯1f(s,x0)+𝒯2f(s,x0)=12rj=122δ(|𝐱𝐜j(s,x0)|2s21)f(𝐱)d𝐱=12rj=122eiσ(|𝐱((1)js+x0,2)|2s21)f(𝐱)d𝐱dσ.\begin{split}\mathcal{T}f(s,x_{0})&=\mathcal{T}_{1}f(s,x_{0})+\mathcal{T}_{2}f(s,x_{0})\\ &=\frac{1}{2r}\sum_{j=1}^{2}\int_{\mathbb{R}^{2}}\delta(|{\mathbf{x}}-\mathbf{c}_{j}(s,x_{0})|^{2}-s^{2}-1)f({\mathbf{x}})\mathrm{d}{\mathbf{x}}\\ &=\frac{1}{2r}\sum_{j=1}^{2}\int_{-\infty}^{\infty}\int_{\mathbb{R}^{2}}e^{-i\sigma(|{\mathbf{x}}-((-1)^{j}s+x_{0},2)|^{2}-s^{2}-1)}f({\mathbf{x}})\mathrm{d}{\mathbf{x}}\mathrm{d}\sigma.\end{split}

Note that the factor in front of the integrals comes about using the change of variables formula and that 𝒯jf=δ(|𝐱𝐜j(s,x0)|s2+1)f(x)d𝐱\mathcal{T}_{j}f=\int\delta\left(|{\mathbf{x}}-\mathbf{c}_{j}(s,x_{0})|-\sqrt{s^{2}+1}\right)f(x)\,\mathrm{d}{\mathbf{x}}. So the toric section transform is the sum of two FIO’s with phase functions

ϕj(s,x0,𝐱,σ)=σ(|𝐱((1)js+x0,2)|2s21)\phi_{j}(s,x_{0},{\mathbf{x}},\sigma)=\sigma(|{\mathbf{x}}-((-1)^{j}s+x_{0},2)|^{2}-s^{2}-1)

for j=1,2j=1,2. Our distributions ff are supported away from the intersection points of C1C_{1} and C2C_{2}, and hence we can consider the microlocal properties of 𝒯1\mathcal{T}_{1} and 𝒯2\mathcal{T}_{2} separately to describe the microlocal properties of 𝒯\mathcal{T}.

Proposition 3.1.

For j=1,2j=1,2, the circle transform 𝒯j\mathcal{T}_{j} is an FIO or order 1/2-1/2 with canonical relation

(3.5) 𝒞j={(\displaystyle\mathcal{C}_{j}=\Big{\{}\big{(} (s,x0,(1)j1σ(x1x0),σ((1)j1s+x1x0));(𝐱,σ(𝐱𝐜j(s,x0)))):\displaystyle\left(s,x_{0},(-1)^{j-1}\sigma(x_{1}-x_{0}),-\sigma((-1)^{j-1}s+x_{1}-x_{0})\right);\left({\mathbf{x}},-\sigma({\mathbf{x}}-\mathbf{c}_{j}(s,x_{0}))\right)\big{)}:
(s,x0)Y,σ{𝟎},𝐱Cj(s,x0){x2<1}}.\displaystyle\quad(s,x_{0})\in Y,\sigma\in\mathbb{R}\setminus\left\{\mathbf{0}\right\},{\mathbf{x}}\in C_{j}(s,x_{0})\cap\{x_{2}<1\}\Big{\}}.

Furthermore 𝒞j\mathcal{C}_{j} satisfies the semi-global Bolker assumption for j=1,2j=1,2.

Proof.

First, one can check that ϕj\phi_{j} and 𝒯j\mathcal{T}_{j} both satisfy the restrictions in Definition 2.6 so 𝒯j\mathcal{T}_{j} is a FIO. Using this definition again and the fact that its symbol is order zero [37], one sees that it has order 1/2-1/2.

A straightforward calculation using Definition 2.5 shows that the canonical relation of 𝒯j\mathcal{T}_{j} is as given in (3.5). Note that we have absorbed a factor of 22 into σ\sigma in this calculation. Global coordinates on 𝒞j\mathcal{C}_{j} are given by

(3.6) (s,x0,x1,σ)\displaystyle(s,x_{0},x_{1},\sigma)\mapsto (s,x0,(1)j1σ(x1x0),σ((1)j1s+x1x0);\displaystyle\big{(}s,x_{0},(-1)^{j-1}\sigma(x_{1}-x_{0}),-\sigma((-1)^{j-1}s+x_{1}-x_{0});
(x1,x2),σ((x1,x2)𝐜j(s,x0)))\displaystyle\qquad(x_{1},x_{2}),-\sigma((x_{1},x_{2})-\mathbf{c}_{j}(s,x_{0}))\big{)}
            where x2=2s2+1(x1(x0+(1)js))2x_{2}=2-\sqrt{s^{2}+1-(x_{1}-(x_{0}+(-1)^{j}s))^{2}}

because x2<1x_{2}<1. Recall that 𝐜j\mathbf{c}_{j} is given in (3.1).

We now show that 𝒞j\mathcal{C}_{j} satisfies the semiglobal Bolker assumption by finding a smooth inverse in these coordinates to the projection ΠL:𝒞jT(Y)\Pi_{L}:\mathcal{C}_{j}\to T^{*}(Y). Let λ=(s,x0,τ1,τ2)ΠL(𝒞j)\lambda=(s,x_{0},\tau_{1},\tau_{2})\in\Pi_{L}\left(\mathcal{C}_{j}\right). We solve for x1x_{1} and σ\sigma in the equation ΠL(s,x0,x1,σ)=λ\Pi_{L}(s,x_{0},x_{1},\sigma)=\lambda. Then, ss and x0x_{0} are known as are

(3.7) τ1=(1)j1σ(x1x0)τ2=σ((1)j1s+x1x0).\tau_{1}=(-1)^{j-1}\sigma(x_{1}-x_{0})\qquad\tau_{2}=-\sigma((-1)^{j-1}s+x_{1}-x_{0}).

A straightforward linear algebra exercise shows that the unique solutions for σ\sigma and x1x_{1} are

(3.8) σ=(1)jτ2τ1s,x1=sτ1(1)jτ1τ2+x0\sigma=\frac{(-1)^{j}\tau_{2}-\tau_{1}}{s},\qquad x_{1}=\frac{s\tau_{1}}{(-1)^{j}\tau_{1}-\tau_{2}}+x_{0}

This gives a smooth inverse to ΠL\Pi_{L} on the image ΠL(𝒞j)\Pi_{L}\left(\mathcal{C}_{j}\right) and finishes the proof. ∎

Because 𝒞j\mathcal{C}_{j} satisfies the Bolker Assumption, the composition 𝒞j𝒞jΔ\mathcal{C}^{*}_{j}\circ\mathcal{C}_{j}\subset\Delta, where Δ\Delta is the diagonal in T(X)T^{*}(X). Hence in a reconstruction from circular integral data with centres on a line we would not expect to see image artefacts for functions supported in x2>0x-2>0 unless one uses a sharp cutoff on the data.

The canonical relation 𝒞\mathcal{C} of 𝒯\mathcal{T} can be written as the disjoint union 𝒞=𝒞1𝒞2\mathcal{C}=\mathcal{C}_{1}\cup\mathcal{C}_{2} since (C1(s,x0)C2(s,x0))supp(f)=(C_{1}(s,x_{0})\cap C_{2}(s,x_{0}))\cap\text{supp}(f)=\emptyset for any (s,x0)Y(s,x_{0})\in Y.

For convenience, we will sometimes label the coordinate x0x_{0} in (3.6) as (x0)1(x_{0})_{1} it is associated with 𝒞1\mathcal{C}_{1} and (x0)2(x_{0})_{2} if it is associated with 𝒞2\mathcal{C}_{2}.

Theorem 3.2.

For j=1,2j=1,2, the projection πR:𝒞jT(X)\pi_{R}:\mathcal{C}_{j}\to T^{*}(X) is bijective onto the set

(3.9) D={(𝐱,𝝃)T(X):ξ20}.D=\left\{({\mathbf{x}},\boldsymbol{\xi})\in T^{*}(X)\hskip 0.85358pt:\hskip 0.85358pt\xi_{2}\neq 0\right\}.

In addition, πR:𝒞T(X)\pi_{R}:\mathcal{C}\to T^{*}(X) is two-to one onto DD.

Proof.

Let μ=(𝐱;𝝃)T(X){𝟎}\mu=({\mathbf{x}};\boldsymbol{\xi})\in T^{*}(X)\setminus\left\{\mathbf{0}\right\} and let 𝐱=(x1,x2){\mathbf{x}}=(x_{1},x_{2}) and 𝝃=(ξ1,ξ2)\boldsymbol{\xi}=(\xi_{1},\xi_{2}). If μπR(𝒞j)\mu\in\pi_{R}(\mathcal{C}_{j}) for either j=1j=1 or j=2j=2, then ξ20\xi_{2}\neq 0 by (3.5) since x2<2x_{2}<2. For the rest of the proof, assume μ\mu is in the set DD given by (3.9)

We will now describe the preimage of μ\mu in 𝒞j\mathcal{C}_{j}. The covector μ\mu is conormal to a unique circle centered on x2=2x_{2}=2, and its center is on the line through 𝐱{\mathbf{x}} and parallel 𝝃\boldsymbol{\xi}. If the center has coordinates (c,2)(c,2), then a calculation shows that cc is given by

(3.10) c=c(𝐱,𝝃)=x1ξ1(x22)ξ2.c=c({\mathbf{x}},\boldsymbol{\xi})=x_{1}-\frac{\xi_{1}(x_{2}-2)}{\xi_{2}}.

Using this calculation, one sees that the radius of the circle and coordinate ss are given by

(3.11) r=r(𝐱,𝝃)=(2x2)|𝝃||ξ2|,s=s(𝐱,𝝃)=r21r=r({\mathbf{x}},\boldsymbol{\xi})=\frac{(2-x_{2})\left|\boldsymbol{\xi}\right|}{\left|\xi_{2}\right|},\quad s=s({\mathbf{x}},\boldsymbol{\xi})=\sqrt{r^{2}-1}

and the coordinate (x0)j(x_{0})_{j} is given by

(3.12) (x0)j=(x0)j(𝐱,𝝃)=x1+ξ1(2x2)ξ2+(1)j1sfor j=1,2.(x_{0})_{j}=(x_{0})_{j}({\mathbf{x}},\boldsymbol{\xi})=x_{1}+\frac{\xi_{1}(2-x_{2})}{\xi_{2}}+(-1)^{j-1}s\ \ \text{for $j=1,2$}.

A straightforward calculation shows that

(3.13) σ=σ(𝐱,𝝃)=ξ22x2.\sigma=\sigma({\mathbf{x}},\boldsymbol{\xi})=\frac{-\xi_{2}}{2-x_{2}}.

This gives the coordinates (3.6) on 𝒞j\mathcal{C}_{j} and shows that πR:𝒞jD\pi_{R}:\mathcal{C}_{j}\to D is injective with smooth inverse.

Now, we consider the projection from 𝒞\mathcal{C}. Given (𝐱,𝝃)D({\mathbf{x}},\boldsymbol{\xi})\in D, our calculations show that the preimage in 𝒞\mathcal{C}, in coordinates (3.6) is given by two distinct points

(s(𝐱,𝝃),(x0)j(𝐱,𝝃),x1,σ(𝐱,𝝃)) for j=1,2 .(s({\mathbf{x}},\boldsymbol{\xi}),(x_{0})_{j}({\mathbf{x}},{\boldsymbol{\xi}}),x_{1},\sigma({\mathbf{x}},{\boldsymbol{\xi}}))\ \text{ for $j=1,2$ }.

The coordinates are given by (3.11), (3.12) and (3.13) respectively. ∎

The abstract adjoint 𝒯jt\mathcal{T}_{j}^{t} cannot be composed with 𝒯i\mathcal{T}_{i} for i=1,2i=1,2, because the support of 𝒯if\mathcal{T}_{i}f can be unbounded in rr, even for f(X)f\in\mathcal{E}^{\prime}(X) and 𝒯jt\mathcal{T}_{j}^{t} is not defined for such distributions. Therefore, we introduce a smooth cutoff function. Choose rM>2r_{M}>2 and let ψ(s)\psi(s) be a smooth compactly supported function equal to one for s[1,1rM2]s\in\left[1,\sqrt{1-r^{2}_{M}}\right] and define

(3.14) 𝒯jg=𝒯jt(ψg)\mathcal{T}_{j}^{*}g=\mathcal{T}^{t}_{j}(\psi g)

for all g𝒟(Y)g\in\mathcal{D}^{\prime}(Y) because our bound on rr introduces a bound on x0x_{0} so the integral is over a bounded set for each 𝐱X{\mathbf{x}}\in X.

3.1. The nonlocal artefacts

Now, we can state our next theorem, which describes the artifacts that can be added to the reconstruction using the normal operator, 𝒯𝒯\mathcal{T}^{*}\mathcal{T}.

Theorem 3.3.

If f(X)f\in\mathcal{E}^{\prime}(X) then

(3.15) WF(𝒯𝒯f)(WF(f)D)Λ12(f)Λ21(f)\mathrm{WF}\left(\mathcal{T}^{*}\mathcal{T}f\right)\subset\left(\mathrm{WF}(f)\cap D\right)\cup\Lambda_{12}(f)\cup\Lambda_{21}(f)

where DD is given by (3.9), and the sets Λij\Lambda_{ij} are given for (𝐱,𝛏)D({\mathbf{x}},{\boldsymbol{\xi}})\in D by

(3.16) Λij(f)={λij(𝐱,𝝃):(𝐱,𝝃)WF(f)D}\Lambda_{ij}(f)=\left\{\lambda_{ij}({\mathbf{x}},{\boldsymbol{\xi}})\hskip 0.85358pt:\hskip 0.85358pt({\mathbf{x}},{\boldsymbol{\xi}})\in\mathrm{WF}(f)\cap D\right\}

where the functions λ12\lambda_{12} and λ21\lambda_{21} are given by (3.19) and (3.20) respectively. Note that the functions λij\lambda_{ij} are defined for only some (𝐱,𝛏)D({\mathbf{x}},\boldsymbol{\xi})\in D and singularities at other points do not generate artifacts.

Therefore, 𝒯𝒯\mathcal{T}^{*}\mathcal{T} recovers most singularities of ff, as indicated in the first term in (3.15), but it adds two sets of nonlocal singularities, as given by Λ12(f)\Lambda_{12}(f) and Λ21(f)\Lambda_{21}(f). Note that, even if 𝒯j\mathcal{T}_{j}^{*} and 𝒯j\mathcal{T}_{j} are both elliptic above a covector (𝐱,𝝃)({\mathbf{x}},\boldsymbol{\xi}), artifacts caused by other points could mask singularities of ff that “should” be visible in 𝒯𝒯f\mathcal{T}^{*}\mathcal{T}f.

Proof.

Let f(X)f\in\mathcal{E}^{\prime}(X). By the Hörmander-Sato Lemma [25, Theorem 8.2.13] We have the expansion

(3.17) WF(𝒯𝒯(f))\displaystyle\mathrm{WF}(\mathcal{T}^{*}\mathcal{T}(f))\subset (𝒞𝒞)WF(f)\displaystyle\left(\mathcal{C}^{*}\circ\mathcal{C}\right)\circ\mathrm{WF}(f)
=[(𝒞1𝒞1)(𝒞2𝒞2)]WF(f)\displaystyle\quad=\left[(\mathcal{C}_{1}^{*}\circ\mathcal{C}_{1})\cup(\mathcal{C}_{2}^{*}\circ\mathcal{C}_{2})\right]\circ\mathrm{WF}(f)
(𝒞2𝒞1)WF(f)(𝒞1𝒞2)WF(f)\displaystyle\quad\quad\cup\ \left(\mathcal{C}_{2}^{*}\circ\mathcal{C}_{1}\right)\circ\mathrm{WF}(f)\ \cup\ \left(\mathcal{C}_{1}^{*}\circ\mathcal{C}_{2}\right)\circ\mathrm{WF}(f)

The first term in brackets in (3.17) is {(𝐱,𝝃;𝐱,𝝃):(𝐱,𝝃)D}WF(f)=WF(f)D\left\{({\mathbf{x}},\boldsymbol{\xi};{\mathbf{x}},\boldsymbol{\xi})\hskip 0.85358pt:\hskip 0.85358pt({\mathbf{x}},\boldsymbol{\xi})\in D\right\}\circ\mathrm{WF}(f)=\mathrm{WF}(f)\cap D. This proves the first part of the inclusion (3.15).

We now analyze the other two terms to define the functions λij\lambda_{ij} and finish the proof. Let (𝐱,𝝃)WF(f)D({\mathbf{x}},{\boldsymbol{\xi}})\in\mathrm{WF}(f)\cap D. First, consider λ12(𝐱,𝝃)=𝒞2𝒞1(𝐱,𝝃)\lambda_{12}({\mathbf{x}},\boldsymbol{\xi})=\mathcal{C}_{2}^{*}\circ\mathcal{C}_{1}\circ({\mathbf{x}},\boldsymbol{\xi}).111For convenience, we will abbreviate the set theoretic composition 𝒞i{(𝐱,𝝃)}\mathcal{C}_{i}\circ\left\{({\mathbf{x}},{\boldsymbol{\xi}})\right\} by 𝒞i(𝐱,𝝃)\mathcal{C}_{i}\circ({\mathbf{x}},{\boldsymbol{\xi}}). Using the calculations in the proof of Theorem 3.2 one sees that 𝒞1(𝐱,𝝃)\mathcal{C}_{1}\circ({\mathbf{x}},{\boldsymbol{\xi}}) is given by

(3.18) (s,x0,τ1,τ2) where s=s(𝐱,𝝃)=(2x2)2|𝝃|2ξ221x0=(x0)1(𝐱,𝝃)=x1+ξ1(2x2)ξ2+sσ=σ(𝐱,𝝃)=ξ22x2τ1=σ(x1x0)τ2=σ(s+x1x0)\begin{gathered}(s,x_{0},\tau_{1},\tau_{2})\ \text{ where }\ s=s({\mathbf{x}},{\boldsymbol{\xi}})=\sqrt{\frac{(2-x_{2})^{2}\left|\boldsymbol{\xi}\right|^{2}}{\xi_{2}^{2}}-1}\\ x_{0}=(x_{0})_{1}({\mathbf{x}},\boldsymbol{\xi})=x_{1}+\frac{\xi_{1}(2-x_{2})}{\xi_{2}}+s\qquad\sigma=\sigma({\mathbf{x}},\boldsymbol{\xi})=\frac{-\xi_{2}}{2-x_{2}}\\ \tau_{1}=\sigma(x_{1}-x_{0})\qquad\tau_{2}=-\sigma(s+x_{1}-x_{0})\end{gathered}

where we have taken these from the proof of Theorem 3.2. To find 𝒞2𝒞1(𝐱,𝝃)\mathcal{C}_{2}^{*}\circ\mathcal{C}_{1}\circ({\mathbf{x}},{\boldsymbol{\xi}}) we calculate the composition of the covector described in (3.18) with 𝒞2\mathcal{C}_{2}^{*}. Note that the values of x0x_{0} and ss are the same in both calculations and are given by (3.18). After using (3.8) and that ξ1(2x2)ξ2=x0sx1\frac{\xi_{1}(2-x_{2})}{\xi_{2}}=x_{0}-s-x_{1}, one sees that

(3.19) λ12(𝐱,𝝃)\displaystyle\lambda_{12}({\mathbf{x}},{\boldsymbol{\xi}}) =((y1,y2),𝜼) where\displaystyle=\left((y_{1},y_{2}),\boldsymbol{\eta}\right)\ \text{ where }
y1=y1(𝐱,𝝃)\displaystyle y_{1}=y_{1}({\mathbf{x}},{\boldsymbol{\xi}}) =s(x1x0)2(x1x0)+s+x0\displaystyle=\frac{s(x_{1}-x_{0})}{2(x_{1}-x_{0})+s}+x_{0}
y2=y2(𝐱,𝝃)\displaystyle y_{2}=y_{2}({\mathbf{x}},{\boldsymbol{\xi}}) =2(2x2)2|𝝃|2ξ22(y1(x0+s))2\displaystyle=2-\sqrt{\frac{(2-x_{2})^{2}\left|\boldsymbol{\xi}\right|^{2}}{\xi_{2}^{2}}-(y_{1}-(x_{0}+s))^{2}}
𝜼\displaystyle\boldsymbol{\eta} =(2ξ1sξ22x2)(𝐲𝐜2(s(𝐱,𝝃)(x0)1(𝐱,𝝃)))\displaystyle=\left(-2\xi_{1}-\frac{s\xi_{2}}{2-x_{2}}\right)({\mathbf{y}}-\mathbf{c}_{2}(s({\mathbf{x}},{\boldsymbol{\xi}})(x_{0})_{1}({\mathbf{x}},{\boldsymbol{\xi}})))

where x0=(x0)1(𝐱,𝝃)x_{0}=(x_{0})_{1}({\mathbf{x}},{\boldsymbol{\xi}}) and s=s(𝐱,𝝃)s=s({\mathbf{x}},{\boldsymbol{\xi}}) are given in (3.18) and 𝜼\boldsymbol{\eta} is calculated using the expression (3.8) with j=2j=2.

Note that the function λ12\lambda_{12} is defined for only some (𝐱,𝝃)D({\mathbf{x}},\boldsymbol{\xi})\in D; for example if the argument for the square root defining y2(𝐱,𝝃)y_{2}({\mathbf{x}},{\boldsymbol{\xi}}) is negative, then y2(𝐱,𝝃)y_{2}({\mathbf{x}},{\boldsymbol{\xi}}) is not defined and the point (𝐱,𝝃)({\mathbf{x}},{\boldsymbol{\xi}}) will not generate artifacts in Λ12\Lambda_{12}.

A similar calculation shows for (𝐲,𝜼)D({\mathbf{y}},\boldsymbol{\eta})\in D that

(3.20) λ21(𝐲,𝜼)\displaystyle\lambda_{21}({\mathbf{y}},\boldsymbol{\eta}) =((x1,x2),𝝃) where\displaystyle=((x_{1},x_{2}),\boldsymbol{\xi})\ \text{ where }
x1=x1(𝐲,𝜼)\displaystyle x_{1}=x_{1}({\mathbf{y}},\boldsymbol{\eta}) =s(y1x0)2(y1x0)+s+x0\displaystyle=\frac{s(y_{1}-x_{0})}{-2(y_{1}-x_{0})+s}+x_{0}
x2=x2(𝐲,𝜼)\displaystyle x_{2}=x_{2}({\mathbf{y}},\boldsymbol{\eta}) =2s2(𝐲,𝜼)+1(x1(x0s))2\displaystyle=2-\sqrt{s^{2}({\mathbf{y}},\boldsymbol{\eta})+1-(x_{1}-(x_{0}-s))^{2}}
𝝃\displaystyle\boldsymbol{\xi} =(2η1sη22y2)(𝐱𝐜1(s(𝐲,𝜼),(x0)2(𝐲,𝜼)))\displaystyle=\left(2\eta_{1}-\frac{s\eta_{2}}{2-y_{2}}\right)({\mathbf{x}}-\mathbf{c}_{1}(s({\mathbf{y}},\boldsymbol{\eta}),(x_{0})_{2}({\mathbf{y}},\boldsymbol{\eta})))

where

s=s(𝐲,𝜼)=(2y2)2|𝜼|2η221x0=(x0)2(𝐲,𝜼)=y1+η1(2y2)η2s.s=s({\mathbf{y}},\boldsymbol{\eta})=\sqrt{\frac{(2-y_{2})^{2}\left|\boldsymbol{\eta}\right|^{2}}{\eta_{2}^{2}}-1}\qquad x_{0}=(x_{0})_{2}({\mathbf{y}},\boldsymbol{\eta})=y_{1}+\frac{\eta_{1}(2-y_{2})}{\eta_{2}}-s.

Note that the function λ21\lambda_{21} is not defined for all (𝐲,𝜼)D({\mathbf{y}},\boldsymbol{\eta})\in D, and other points (𝐲,𝜼)({\mathbf{y}},\boldsymbol{\eta}) do not generate artifacts. This is for the same reason as for λ12\lambda_{12}. ∎

Remark 3.4.

The artefacts caused by a singularity of ff are as strong as the reconstruction of that singularity. To see this, first note that each 𝒯j𝒯i\mathcal{T}_{j}^{*}\mathcal{T}_{i} smooths of order one in Sobolev scale since it an FIO of order 1-1 [24, Theorem 4.3.1].

The visible singularities in the reconstruction come from the compositions 𝒯1𝒯1\mathcal{T}_{1}^{*}\mathcal{T}_{1} and 𝒯2𝒯2\mathcal{T}_{2}^{*}\mathcal{T}_{2} since these are pseudodifferential operators of order 1-1. The artefacts come from the “cross” compositions 𝒯2𝒯1\mathcal{T}_{2}^{*}\mathcal{T}_{1} and 𝒯1𝒯2\mathcal{T}_{1}^{*}\mathcal{T}_{2}, and they are FIO of order 1-1. Therefore, since the terms that preserve the real singularities of ff, 𝒯i𝒯i\mathcal{T}_{i}^{*}\mathcal{T}_{i}, i=1,2i=1,2, are also of order 1-1, 𝒯𝒯\mathcal{T}^{*}\mathcal{T} smooths each singularity of ff by one order in Sobolev scale and the composition 𝒯2𝒯1\mathcal{T}^{*}_{2}\mathcal{T}_{1} (corresponding to the artifact λ12\lambda_{12}, if defined at this covector) can create an artefact from that singularity that are also one order smoother than that singularity, and similarly with the composition 𝒯1𝒯2\mathcal{T}^{*}_{1}\mathcal{T}_{2}.

Second, our results are valid, not only for the normal operator 𝒯𝒯\mathcal{T}^{*}\mathcal{T} but for any filtered backprojection method 𝒯P𝒯\mathcal{T}^{*}P\mathcal{T} where PP is a pseudodifferential operator. This is true since pseudodifferential operators have canonical relation Δ\Delta and they do not move singularities, so our microlocal calculations are the same. If PP has order kk, then 𝒯P𝒯\mathcal{T}^{*}P\mathcal{T} decreases the Sobolev order of each singularity of ff by order (k1)(k-1) in Sobolev norm and can create an artefact from that singularity of the same order.

3.2. Artifacts for 𝒯𝒯\mathcal{T}^{*}\mathcal{T} due to limited data

In practice we do not have access to 𝒯f(s,x0)\mathcal{T}f(s,x_{0}) for all s(0,)s\in(0,\infty) (or r(1,)r\in(1,\infty)) and x0x_{0}\in\mathbb{R}, and will have knowledge of x0(a,a)x_{0}\in(-a,a) and r(1,rM)r\in(1,r_{M}) for some a>0a>0 (see figures 1 and 2) and maximum radius rM>1r_{M}>1.

We now evaluate which wavefront directions (𝐱,𝝃)({\mathbf{x}},{\boldsymbol{\xi}}) will be visible from this limited data. Let us consider the pair (𝐱,𝝃)C2(s,x0)×S1({\mathbf{x}},\boldsymbol{\xi})\in C_{2}(s,x_{0})\times S^{1} and let β\beta be the angle of 𝝃\boldsymbol{\xi} from the vertical as depicted in figure 2. Then 𝐜2(s,x0)=((2x2)tanβ+x1,2)\mathbf{c}_{2}(s,x_{0})=((2-x_{2})\tan\beta+x_{1},2) and

|𝐱𝐜2(s,x0)|2=r2(1+tan2β)(2x2)2=r2tanβ=r2(2x2)21.|{\mathbf{x}}-\mathbf{c}_{2}(s,x_{0})|^{2}=r^{2}\implies(1+\tan^{2}\beta)(2-x_{2})^{2}=r^{2}\implies\tan\beta=\sqrt{\frac{r^{2}}{(2-x_{2})^{2}}-1}.

Let βm=βm(𝐱)(0,π/2)\beta_{m}=\beta_{m}({\mathbf{x}})\in(0,\pi/2) be defined by

(3.21) tanβm=rM2(2x2)21\tan\beta_{m}=\sqrt{\frac{r_{M}^{2}}{(2-x_{2})^{2}}-1}

(noting that we only consider 𝐱{\mathbf{x}} such that 1>x2>2rM1>x_{2}>2-r_{M}). Then the maximum directional coverage of the singularities (wavefront set) at a given 𝐱X{\mathbf{x}}\in X which are resolved by the Compton data are described by the open cone of 𝝃S1\boldsymbol{\xi}\in S^{1}

(3.22) CT={±(sinβ,cosβ):βm<β<βm},C_{T}=\{\pm(\sin\beta,-\cos\beta):-\beta_{m}<\beta<\beta_{m}\},

and the opening angle of the cone depends on the depth of 𝐱{\mathbf{x}} (i.e. x2x_{2}). See figure 2. The cone CTC_{T} illustrated corresponds to the case when β=βm\beta=\beta_{m}.

In all of our numerical experiments, we set the tunnel height as rm1=6r_{m}-1=6 and the detector line width is 2a=82a=8. We let rM>rmr_{M}>r_{m} be large enough to penetrate the entire scanning tunnel (up to the line {x2=2rm}\{x_{2}=2-r_{m}\} as highlighted in figures 1 and 2), so as to imply a unique reconstruction [55]. Specifically we set the maximum radius rM=9r_{M}=9 and simulate 𝒯(r,x0)\mathcal{T}(r,x_{0}) for r{1+0.02j:1j400}r\in\{1+0.02j:1\leq j\leq 400\} and x0{4+0.04j:1j200}x_{0}\in\{-4+0.04j:1\leq j\leq 200\}. Further the densities considered are represented on [2,2]×[3,1][-2,2]\times[-3,1] (200×200200\times 200 pixel grid) in the reconstructions shown. The machine design considered is such that for any 𝐱[2,2]×[1.5,1]{\mathbf{x}}\in[-2,2]\times[-1.5,1] we have the maximal directional coverage in CTC_{T} allowed for the limited r<rMr<r_{M} (see figure 7). With the exception of the horizontal bar phantom depicted in figure 14, all objects considered for reconstruction are approximately in this region.

Refer to caption
(a) 𝒯𝒯δ\mathcal{T}^{*}\mathcal{T}\delta.
Refer to caption
(b) (𝒯1𝒯1+𝒯2𝒯2)δ(\mathcal{T}_{1}^{*}\mathcal{T}_{1}+\mathcal{T}_{2}^{*}\mathcal{T}_{2})\delta.
Refer to caption
(c) (𝒯1𝒯2+𝒯2𝒯1)δ(\mathcal{T}_{1}^{*}\mathcal{T}_{2}+\mathcal{T}_{2}^{*}\mathcal{T}_{1})\delta.
Refer to caption
(d) χS12S21\chi_{S_{12}\cup S_{21}}.
Refer to caption
(e) Λij\Lambda_{ij} artefacts.
Refer to caption
(f) Λij\Lambda_{ij} artefacts on [2,2]×[3,1][-2,2]\times[-3,1].
Figure 3. 𝒯𝒯δ\mathcal{T}^{*}\mathcal{T}\delta (the δ\delta function is centered at 𝐎=(0,1)\mathbf{O}=(0,-1)) images with the predicted artefacts due to the limited data backprojection (on S12S21S_{12}\cup S_{21}) and those induced by Λ12\Lambda_{12} and Λ21\Lambda_{21}.

To demonstrate the artefacts, we apply a discrete form of 𝒯𝒯\mathcal{T}^{*}\mathcal{T} to a delta function. We have the expansion

(3.23) 𝒯𝒯=(𝒯1+𝒯2)(𝒯1+𝒯2)=𝒯1𝒯1+𝒯2𝒯2+𝒯1𝒯2+𝒯2𝒯1.\mathcal{T}^{*}\mathcal{T}\ =\ (\mathcal{T}_{1}+\mathcal{T}_{2})^{*}(\mathcal{T}_{1}+\mathcal{T}_{2})\ =\ \mathcal{T}_{1}^{*}\mathcal{T}_{1}+\mathcal{T}_{2}^{*}\mathcal{T}_{2}+\mathcal{T}_{1}^{*}\mathcal{T}_{2}+\mathcal{T}_{2}^{*}\mathcal{T}_{1}.

Using equations (3.18), (3.19), and (3.20) one can show for gL2(Y)g\in L^{2}(Y) that the backprojection operators 𝒯j\mathcal{T}_{j}^{*}, j=1,2j=1,2 can be written

(3.24) 𝒯jg(𝐱)=βmβmg(r21,x1+(1)j1r21+rsinβ)|r=(2x2)cosβdβ.\mathcal{T}_{j}^{*}g({\mathbf{x}})=\int_{-\beta_{m}}^{\beta_{m}}g\left(\sqrt{r^{2}-1},x_{1}+(-1)^{j-1}\sqrt{r^{2}-1}+r\sin\beta\right)\bigg{|}_{r=\frac{(2-x_{2})}{\cos\beta}}\mathrm{d}\beta.

Note that we are not restricting x0x_{0} to [a,a][-a,a] but we are restricting ss to (0,rM21)\left(0,\sqrt{r_{M}^{2}-1}\right), and hence the cutoff function ψ\psi of equation 3.14 is equal to one on the bounds of integration.

Now, let ff be a delta function at 𝐲{\mathbf{y}}. We calculate the artifacts

(3.25) 𝒯1𝒯2f(𝐱)0β[βm,βm]s.t.|𝐲𝐜2(s,x0)|=r,\mathcal{T}_{1}^{*}\mathcal{T}_{2}f({\mathbf{x}})\neq 0\iff\exists\beta\in[-\beta_{m},\beta_{m}]\ \ \text{s.t.}\ \ |{\mathbf{y}}-\mathbf{c}_{2}(s,x_{0})|=r,

where r=2x2cosβr=\frac{2-x_{2}}{\cos\beta}, s=r21s=\sqrt{r^{2}-1} and x0=x1+s+rsinβx_{0}=x_{1}+s+r\sin\beta. Similarly

(3.26) 𝒯2𝒯1f(𝐱)0β[βm,βm]s.t.|𝐲𝐜1(s,x0)|=r,\mathcal{T}_{2}^{*}\mathcal{T}_{1}f({\mathbf{x}})\neq 0\iff\exists\beta\in[-\beta_{m},\beta_{m}]\ \ \text{s.t.}\ \ |{\mathbf{y}}-\mathbf{c}_{1}(s,x_{0})|=r,

where r=2x2cosβr=\frac{2-x_{2}}{\cos\beta}, s=r21s=\sqrt{r^{2}-1} and x0=x1s+rsinβx_{0}=x_{1}-s+r\sin\beta. Hence the only contributions to the backprojection from 𝒯1𝒯2\mathcal{T}_{1}^{*}\mathcal{T}_{2} and 𝒯2𝒯1\mathcal{T}_{2}^{*}\mathcal{T}_{1} are on the following sets:

(3.27) S12={𝐱:β[βm,βm]s.t.|𝐲𝐜2(s,x0)|=r}S_{12}=\{{\mathbf{x}}:\exists\beta\in[-\beta_{m},\beta_{m}]\ \ \text{s.t.}\ \ |{\mathbf{y}}-\mathbf{c}_{2}(s,x_{0})|=r\}

where r=2x2cosβr=\frac{2-x_{2}}{\cos\beta} and x0=x1+s+rsinβx_{0}=x_{1}+s+r\sin\beta and

(3.28) S21={𝐱:β[βm,βm]s.t.|𝐲𝐜1(s,x0)|=r}.S_{21}=\{{\mathbf{x}}:\exists\beta\in[-\beta_{m},\beta_{m}]\ \ \text{s.t.}\ \ |{\mathbf{y}}-\mathbf{c}_{1}(s,x_{0})|=r\}.

where r=2x2cosβr=\frac{2-x_{2}}{\cos\beta} and x0=x1s+rsinβx_{0}=x_{1}-s+r\sin\beta. This means that all Λij\Lambda_{ij} artifacts will be in these sets. Note that besides the Λij\Lambda_{ij} artifacts shown in figure 4(e) and 4(f) there are limited data artifacts caused by circles meeting 𝐲{\mathbf{y}} of radius rMr_{M} (figures 4(a)-4(c)) and these are of higher strength in Sobolev norm.

Refer to caption
(a) 𝒯𝒯δ\mathcal{T}^{*}\mathcal{T}\delta.
Refer to caption
(b) (𝒯1𝒯1+𝒯2𝒯2)δ(\mathcal{T}_{1}^{*}\mathcal{T}_{1}+\mathcal{T}_{2}^{*}\mathcal{T}_{2})\delta.
Refer to caption
(c) (𝒯1𝒯2+𝒯2𝒯1)δ(\mathcal{T}_{1}^{*}\mathcal{T}_{2}+\mathcal{T}_{2}^{*}\mathcal{T}_{1})\delta.
Refer to caption
(d) χS12S21\chi_{S_{12}\cup S_{21}}.
Refer to caption
(e) Λij\Lambda_{ij} artefacts.
Refer to caption
(f) Λij\Lambda_{ij} artefacts on [2,2]×[3,1][-2,2]\times[-3,1].
Figure 4. 𝒯𝒯δ\mathcal{T}^{*}\mathcal{T}\delta (the δ\delta function is centered at (0,0.9)(0,0.9)) images with the predicted artefacts due to the limited data backprojection (on S12S21S_{12}\cup S_{21}) and those induced by Λ12\Lambda_{12} and Λ21\Lambda_{21}.

To simulate a δ\delta function discretely we assign a value of 1 to nine neighbouring pixels in a 200–200 grid (which will represent [2,2]×[3,1][-2,2]\times[-3,1]) and set all other pixel values to zero. Letting our discrete delta function be denoted by xδx_{\delta}, we approximate 𝒯𝒯δATAxδ\mathcal{T}^{*}\mathcal{T}\delta\approx A^{T}Ax_{\delta}, where AA is the discrete form of 𝒯\mathcal{T}. For comparison we show images of

(𝒯1𝒯2+𝒯2𝒯1)δ(A1TA2+A2TA1)xδ,(\mathcal{T}_{1}^{*}\mathcal{T}_{2}+\mathcal{T}_{2}^{*}\mathcal{T}_{1})\delta\approx(A_{1}^{T}A_{2}+A_{2}^{T}A_{1})x_{\delta},

a characteristic function on the set S12S21S_{12}\cup S_{21}, and the artefacts induced by Λ12\Lambda_{12} and Λ21\Lambda_{21}. Here AjA_{j} is the discrete form of 𝒯j\mathcal{T}_{j}, for j=1,2j=1,2. See figure 3. For example, in figure 3(b) we see a butterfly wing type artefact in (A1TA1+A2TA2)xδ(A_{1}^{T}A_{1}+A_{2}^{T}A_{2})x_{\delta}. This is due to the limited rr and x0x_{0} data inherent to our acquisition geometry (there are unresolved wavefront directions). In the (A1TA2+A1TA2)xδ(A_{1}^{T}A_{2}+A_{1}^{T}A_{2})x_{\delta} image of figure 3(c) we see artefacts appearing on the set S12S21S_{12}\cup S_{21} as shown in figure 3(d). This is as predicted by our theory. The artefacts induced by the Λij\Lambda_{ij} in this case lie outside the scanning region ([2,2]×[3,1][-2,2]\times[-3,1]), and hence they are not observed in the reconstruction. See figures 3(e) and 3(f). In figure 4 the artefact curves intersect [2,2]×[3,1][-2,2]\times[-3,1] in the top left and right-hand corners respectively. See figures 4(e) and 4(f). In this case the artefacts are observed faintly in the reconstructions (their magnitude is small compared to the delta function), and it is unclear whether they align with our predictions.

Refer to caption
(a) 𝒯Φ𝒯δ\mathcal{T}^{*}\Phi\mathcal{T}\delta (location 1).
Refer to caption
(b) (𝒯1Φ𝒯2+𝒯2Φ𝒯1)δ(\mathcal{T}_{1}^{*}\Phi\mathcal{T}_{2}+\mathcal{T}_{2}^{*}\Phi\mathcal{T}_{1})\delta.
Refer to caption
(c) Λij\Lambda_{ij} artefacts.
Refer to caption
(d) 𝒯Φ𝒯δ\mathcal{T}^{*}\Phi\mathcal{T}\delta (location 2).
Refer to caption
(e) (𝒯1Φ𝒯2+𝒯2Φ𝒯1)δ(\mathcal{T}_{1}^{*}\Phi\mathcal{T}_{2}+\mathcal{T}_{2}^{*}\Phi\mathcal{T}_{1})\delta.
Refer to caption
(f) Λij\Lambda_{ij} artefacts.
Figure 5. 𝒯Φ𝒯δ\mathcal{T}^{*}\Phi\mathcal{T}\delta and (𝒯1Φ𝒯2+𝒯2Φ𝒯1)δ(\mathcal{T}_{1}^{*}\Phi\mathcal{T}_{2}+\mathcal{T}_{2}^{*}\Phi\mathcal{T}_{1})\delta images with the predicted artefacts induced by Λ12\Lambda_{12} and Λ21\Lambda_{21}. We give examples for two δ\delta function locations. Location 1 is (0,0.85) and location 2 is (-2.8,0.9).

To show the artefacts induced by the Λij\Lambda_{ij} more clearly, we repeat the analysis above using filtered backprojection, and a second derivative filter Φ=d2dr2\Phi=\frac{\mathrm{d}^{2}}{\mathrm{d}r^{2}}. That is we show images of 𝒯Φ𝒯δ\mathcal{T}^{*}\Phi\mathcal{T}\delta. Note that Φ\Phi is applied in the variable rr (the torus radius). The application of derivative filters is a common idea in lambda tomography [13, 14], and is known to highlight the image contours (singularities or edges) in the reconstruction [43, Theorem 3.5]. See figure 5. As the artefacts induced by Λij\Lambda_{ij} appeared to be largely outside the scanning region ([2,2]×[3,1][-2,2]\times[-3,1]) in our previous simulations, we have increased the scanning region size to [3,3]×[4,2][-3,3]\times[-4,2], to show more the effects of the Λij\Lambda_{ij} in the observed reconstruction. Here Φ\Phi suppresses the artefacts due to limited data, and the Λij\Lambda_{ij} artefacts appear as additional contours in the reconstruction. The observed artefacts appear most clearly in figures 5(b) and 5(e), and align exactly with our predictions in figures 5(c) and 5(f).

Remark 3.5.

With precise knowledge of the locations of the artefacts induced by the Λij\Lambda_{ij} we can assist in the design of the proposed parallel line scanner. That is we can choose aa, rMr_{M} and the scanning tunnel size to minimize the presence of the nonlocal artefacts in the reconstruction (i.e., those from Λij(f)\Lambda_{ij}(f)). Such advice would be of benefit to our industrial partners in airport screening to remove the concern for nonlocal artefacts in the image reconstruction of baggage. Indeed the machine dimensions we have chosen seem to be suitable as the artefacts appear largely outside the reconstruction space (see figures 3 and 4).

4. The transmission artefacts

The detector row DCD_{C} collects Compton (back) scattered data, which determines 𝒯f(s,x0)\mathcal{T}f(s,x_{0}) for a range of ss and x0x_{0}, where f=nef=n_{e} is the electron charge density. The forward detector array DAD_{A} collects transmission (standard X-ray CT) data, which determine a set of straight line integrals over the attenuation coefficient f=μEf=\mu_{E}, for some photon energy EE. The data is limited to the set of lines which intersect SS (the source array) and DAD_{A}. This limited data can cause artefacts in the X-ray reconstruction, and we will analyze these artefacts using the theory in [5]. Let Ls,θ={𝐱2:𝐱Θ=s}L_{s,\theta}=\{{\mathbf{x}}\in\mathbb{R}^{2}:{\mathbf{x}}\cdot\Theta=s\} be the line parameterized by a rotation θ[0,π]\theta\in[0,\pi] and a directed distance from the origin ss\in\mathbb{R}. Here Θ=(cosθ,sinθ)\Theta=(\cos\theta,\sin\theta) and Ls,θL_{s,\theta} is the line containing sΘs\Theta and perpendicular to Θ\Theta.

In the scanning geometry of this article, the set SS of X-ray transmitters is the segment between (4,3)(-4,3) and (4,3)(4,3) and the set of X-ray detectors, DAD_{A}, is the segment between (4,5)(-4,-5) and (4,5)(4,-5) as in figure 1. For this reason, the cutoff in the sinogram space is described by the set

(4.1) H={(s,θ)×[π/2,π/2]:Ls,θSandLs,θDA}.H=\{(s,\theta)\in\mathbb{R}\times[-\pi/2,\pi/2]:L_{s,\theta}\cap S\neq\emptyset\ \ \text{and}\ \ L_{s,\theta}\cap D_{A}\neq\emptyset\}.

The characteristic function of HH appears as a skewed diamond shape in sinogram space.

To illustrate the added artifacts inherent in this incomplete data problem, we simulate reconstructions of delta functions with transmission CT data on HH. That is, we apply the backprojection operator RLRLR_{L}^{*}R_{L} to δ\delta functions, where RLfR_{L}f denotes RfRf for (s,θ)H(s,\theta)\in H. See figure 6.

Refer to caption
Refer to caption
Refer to caption
Figure 6. RLTRLδ(t,1)R_{L}^{T}R_{L}\delta_{(t,-1)} images at varying δ\delta function translations along the line x2=1x_{2}=-1.

By the theory in [5], artefacts caused by the incomplete data occur on lines Ls,θL_{s,\theta} for (s,θ)(s,\theta) in the boundary of HH. Each delta function in Figure 6 is at a point (t,1)(t,-1) for some tt with 0<t<20<t<2, so the lines that meet the support of the delta function, (t,1)(t,-1) that are in the boundary of HH must also contain either (4,3)(4,3) or (4,5)(4,-5). This is true because SS and DAD_{A} are mirror images about the line y=1y=-1 and t(0,2)t\in(0,2).

Furthermore, by symmetry (the δ\delta functions are on the center line of the scanning region), these artifact lines will be reflections of each other in the vertical line x1=tx_{1}=t. This is illustrated in our reconstructions in Figure 6. The opening angle of the cone in the delta reconstructions decreases (fewer wavefront set directions are stably resolved) as we translate δ\delta to the right on the line x1=1x_{1}=-1.

Example 4.1.

We now use these ideas to analyze the visible wavefront directions for the joint problem. Let 𝒮=[4,4]×[5,3]\mathcal{S}=[-4,4]\times[-5,3] be the square between SS and DAD_{A} and let 𝐎=(0,1)\mathbf{O}=(0,-1), the center of 𝒮\mathcal{S}. We consider wavefronts at points (x1,x2)[2,2]×[3,1](x_{1},x_{2})\in[-2,2]\times[-3,1] which is a square centered at 𝐎\mathbf{O} and the region in which our simulated reconstructions are done.

By (4.1), lines in the data set must intersect both SS and DAD_{A}, so lines through 𝐎\mathbf{O} in the data set are all lines through 𝐎\mathbf{O} that are more vertical than the diagonals of 𝒮\mathcal{S}. Because visible wavefront directions are normal to lines in the X-ray CT data set [38], the wavefront directions which are resolved lie in the horizontal open cone between normals to these diagonals. Therefore, they are in the cone

CR={±(cosα,sinα):π/4<α<π/4},C_{R}=\{\pm(\cos\alpha,\sin\alpha):-\pi/4<\alpha<\pi/4\},

which is shown in figure 1.

An analysis of the singularities that are visible by the Compton data was done in section 3.2. For the point 𝐎\mathbf{O}, the angle defined by (3.21) gives βm=1.23\beta_{m}=1.23 and the cone of visible directions given by (3.22) is the vertical cone with angles from the vertical between βm-\beta_{m} and βm\beta_{m} since the parameter rM=9r_{M}=9. A calculation shows that 2(π/4+βm)>π2(\pi/4+\beta_{m})>\pi and this implies CRCT=S1C_{R}\cup C_{T}=S^{1} and we have a full resolution of the image singularities at 𝐎\mathbf{O}.

Refer to caption
Figure 7. Picture of the range of wavefront directions that are visible at points in [2,2]×[3,1][-2,2]\times[-3,1] from the joint data. Angles are measured from 00^{\circ} = no coverage to 360360^{\circ} = full coverage. We let Γ\Gamma denote the solid light-colored (yellow) region (roughly the top 3/4 of the figure) in which all wavefront directions are recovered.

However, for points near the bottom x2=3x_{2}=-3, there are invisible singularities that are not visible from either the Compton or X-ray data. For example, the vertical direction (0,1)(0,1) is not normal to any circle in the Compton data set at any point (0,x2)(0,x_{2}) for x2[2.5,3]x_{2}\in[-2.5,-3]. Figure 7 shows the points for which all wavefront directions are visible at those points (yellow color–roughly for points (x1,x2)(x_{1},x_{2}) for x2>2x_{2}>-2) and near the bottom of the reconstruction region, there are more missing directions.

5. A joint reconstruction approach and results

In this section we detail our joint reconstruction scheme and lambda tomography regularization technique, and show the effectiveness of our methods in combatting the artefacts observed and predicted by our microlocal theory. We first explain the physical relationship between μE\mu_{E} and nen_{e}, which will be needed later in the formulation of our regularized inverse problem.

5.1. Relating μE\mu_{E} and nen_{e}

The attenuation coefficient and electron density satisfy the formula [53, page 36]

(5.1) μE(Z)=ne(Z)σE(Z),\mu_{E}(Z)=n_{e}(Z)\sigma_{E}(Z),

where σE\sigma_{E} denotes the electron cross section, at energy EE. Here ZZ denotes the effective atomic number. In the proposed application in airport baggage screening (among many other applications such as medical CT) we are typically interested in the materials with low effective ZZ. Hence we consider the materials with Z<20Z<20 in this paper. For large enough EE and Z<20Z<20, σE(Z)\sigma_{E}(Z) is approximately constant as a function of ZZ. Equivalently μE\mu_{E} and nEn_{E} are approximately proportional for low ZZ and high EE by equation 5.1. See figure 8. We see a strong correlation between nen_{e} and μE\mu_{E} when E=100E=100keV and Z<20Z<20, and even more so when EE is increased to E=1E=1MeV. The sample of materials considered consists of 153 compounds (e.g. wax, soap, salt, sugar, the elements) taken from the NIST database [29]. In this case σEν\sigma_{E}\approx\nu for some ν\nu\in\mathbb{R} is approximately constant and we have μEνne\mu_{E}\approx\nu n_{e}.

Refer to caption
Refer to caption
Figure 8. Scatter plot of nen_{e} vs μE\mu_{E} for E=100E=100keV (left) and E=1E=1MeV (right), for 153 compounds with effective Z<20Z<20 taken from the NIST [28] database. The correlation is R=0.93 (left) and R=0.98 (right).

For a given energy EE, ν\nu is the slope of the straight line fit as in figure 8. Throughout the rest of this paper, we set ν\nu as the slope of the straight line in the left hand of figure 8 (i.e. ν0.57\nu\approx 0.57), and present reconstructions of μE\mu_{E} for E=100E=100keV.

5.2. Lambda regularization; the idea

In sections 3 and 4 we discovered that the RLμER_{L}\mu_{E} and 𝒯ne\mathcal{T}n_{e} data provide complementary information regarding the detection and resolution of edges in an image. More specifically the line integral data resolved singularities in an open cone CRC_{R} with central axis x1x_{1} and the toric section integral data resolved singularities in a cone CTC_{T} with central axis x2x_{2}. So the overlapping cones CRCTC_{R}\cup C_{T} give a greater coverage of S1S^{1} than when considered separately. In figures 3, 4 and 6, this theory was later verified through reconstructions of a delta functions by (un)filtered backprojection.

For a further example, let us consider a more complicated phantom than a delta function, one which is akin to densities considered later for testing our joint reconstruction and lambda regularization method. In figure 9 we have presented reconstructions of an image phantom ff (with no noise) from RLfR_{L}f (transmission data–middle figure) using FBP, and from 𝒯f\mathcal{T}f (Compton data–right figure) by an application of 𝒯d2dr2\mathcal{T}^{*}\frac{\mathrm{d}^{2}}{\mathrm{d}r^{2}} (a contour reconstruction). In the reconstruction from Compton data, we see that the image singularities are well resolved in the vertical direction (x2x_{2}), and conversely in the horizontal direction (x1x_{1}) in the reconstruction from transmission data. In the middle picture (reconstruction from RLR_{L}), the visible singularities of the object are tangent to lines in the data set (normal wavefront set) and the artifacts are along lines at the end of the data set that are tangent to boundaries of the objects. In the right-hand reconstruction from Compton data, the visible boundaries are tangent to circles in the data set and the streaks are along circles at the end of the data set. Note that the visible boundaries in each picture complement each other and together, image the full objects. This is all as predicted by the theory of sections 3 and 4 (and is consistent with the theory in [5] and [15]) and highlights the complementary nature of the Compton and transmission data in their ability to detect and resolve singularities.

Given the complementary edge resolution capabilities of RLμER_{L}\mu_{E} and 𝒯ne\mathcal{T}n_{e}, and given the approximate linear relationship between μE\mu_{E} and nen_{e}, we can devise a joint linear least squares reconstruction scheme with the aim to recover the image singularities stably in all directions in the nen_{e} and μE\mu_{E} images simultaneously.

Refer to caption
Refer to caption
Refer to caption
Figure 9. Image phantom ff (left), a reconstruction from RLfR_{L}f using FBP (middle) and 𝒯d2dr2𝒯f\mathcal{T}^{*}\frac{\mathrm{d}^{2}}{\mathrm{d}r^{2}}\mathcal{T}f (right).

To this end we employ ideas in lambda tomography and microlocal analysis.

Let f(n)f\in\mathcal{E}(\mathbb{R}^{n}) and let Rf(s,θ)=Rfθ(s)=Ls,θfdlRf(s,\theta)=Rf_{\theta}(s)=\int_{L_{s,\theta}}f\mathrm{d}l denote the hyperplane Radon transform of ff, where Ls,θL_{s,\theta} is as defined in section 4. The Radon projections RfθRf_{\theta} detect singularities in ff in the direction Θ=(cosθ,sinθ)\Theta=(\cos\theta,\sin\theta) (i.e. the elements (𝐱,Θ)WF(f)({\mathbf{x}},\Theta)\in\text{WF}(f)). Applying a derivative filter dmdsmRθ\frac{\mathrm{d}^{m}}{\mathrm{d}s^{m}}R_{\theta}, for some m1m\geq 1, increases the strength of the singularities in the Θ\Theta direction by order mm in Sobolev scale.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 10. Top row – Simple density (left) and attenuation (right) phantoms. Bottom row – Complex density (left) and attenuation (right) phantoms. The associated materials are labelled in each case.

Given f,g(n)f,g\in\mathcal{E}(\mathbb{R}^{n}), we aim to enforce a similarity in WF(f)\text{WF}(f) and WF(g)\text{WF}(g) through the addition of the penalty term dmdsmR(fg)L2(×S1)\|\frac{\mathrm{d}^{m}}{\mathrm{d}s^{m}}R(f-g)\|_{L^{2}(\mathbb{R}\times S^{1})} to the least squares solution. Note that we are integrating over all directions in S1S^{1} to enforce a full directional similarity in WF(f)\text{WF}(f) and WF(g)\text{WF}(g). Specifically in our case f=μEf=\mu_{E}, g=neg=n_{e} and we aim to minimize the quadratic functional

(5.2) argminμE,ne(wRL00𝒯α[dmdsmRνdmdsmR])(μEne)(wb1b20)22,\operatorname*{arg\,min}_{\mu_{E},n_{e}}\left\|\begin{pmatrix}wR_{L}&0\\ 0&\mathcal{T}\\ \alpha[\frac{\mathrm{d}^{m}}{\mathrm{d}s^{m}}R&-\nu\frac{\mathrm{d}^{m}}{\mathrm{d}s^{m}}R]\end{pmatrix}\begin{pmatrix}\mu_{E}\\ n_{e}\\ \end{pmatrix}-\begin{pmatrix}wb_{1}\\ b_{2}\\ 0\end{pmatrix}\right\|^{2}_{2},

where RLR_{L} denotes a discrete, limited data Radon operator, RR is the discrete form of the full data Radon operator, 𝒯\mathcal{T} is the discrete form of the toric section transform, b1b_{1} is known transmission data and b2b_{2} is the Compton scattered data. Here α\alpha is a regularization parameter which controls the level of similarity in the image wavefront sets. The lambda regularizers enforce the soft constraint that μE=νne\mu_{E}=\nu n_{e} (since dmdsmRf=0f=0\frac{\mathrm{d}^{m}}{\mathrm{d}s^{m}}Rf=0\iff f=0 for fLc2(X)f\in L^{2}_{c}(X)), but with emphasis on the location, direction and magnitude of the image singularities in the comparison.

ϵ\epsilon TV JLAM JTV LPLS
nen_{e} .26.26 .14.14 .05.05 .03.03
μE\mu_{E} .40.40 .15.15 .05.05 .03.03
FF-score TV JLAM JTV LPLS
supp(ne)\text{supp}(n_{e}) .81.81 .99.99 .99.99 1\sim 1
ne\nabla n_{e} .76.76 .87.87 .82.82 .87.87
supp(μE)\text{supp}(\mu_{E}) .78.78 1\sim 1 1\sim 1 1\sim 1
μE\nabla\mu_{E} .49.49 .87.87 .82.82 .86.86
Table 1. Simple phantom ϵ\epsilon and FF-score comparison using TV, JLAM, JTV and LPLS.
ϵ±\epsilon_{\pm} JLAM JTV LPLS
nen_{e} .15±.01.15\pm.01 .05±.005.05\pm.005 .06±.002.06\pm.002
μE\mu_{E} .16±.02.16\pm.02 .05±.005.05\pm.005 .06±.03.06\pm.03
F±F_{\pm} JLAM JTV LPLS
supp(ne)\text{supp}(n_{e}) .99±.01.99\pm.01 .99±.004.99\pm.004 1±.005\sim 1\pm.005
ne\nabla n_{e} .86±.01.86\pm.01 .84±.02.84\pm.02 .86±.04.86\pm.04
supp(μE)\text{supp}(\mu_{E}) .98±.04.98\pm.04 .99±.005.99\pm.005 1±.005\sim 1\pm.005
μE\nabla\mu_{E} .86±.01.86\pm.01 .85±.02.85\pm.02 .87±.04.87\pm.04
Table 2. Randomized simple phantom ϵ±\epsilon_{\pm} and F±F_{\pm} comparison over 100 runs using JLAM, JTV and LPLS.

Further we expect the lambda regularizers to have a smoothing effect given the nature of dmdsmR\frac{\mathrm{d}^{m}}{\mathrm{d}s^{m}}R as a differential operator (i.e. the inverse is a smoothing operation). Hence we expect α\alpha to also act as a smoothing parameter. The weighting w=𝒯2/RL2w=\|\mathcal{T}\|_{2}/\|R_{L}\|_{2} is included so as to give equal weighting to the transmission and scattering datasets in the inversion. We denote the joint reconstruction method using lambda tomography regularizers as “JLAM”. A common choice for mm in lambda tomography applications is m=2m=2 [7, 43] (hence the name “lambda regularizers”).

Density nen_{e} Attenuation μE\mu_{E}, E=100E=100keV

TV

Refer to caption Refer to caption

JLAM

Refer to caption Refer to caption

JTV

Refer to caption Refer to caption

LPLS

Refer to caption Refer to caption
Figure 11. Simple phantom reconstructions, noise level η=0.1\eta=0.1. Comparison of methods TV, JLAM, JTV and LPLS.

With complete X-ray data, the application of a Lambda term yields this Rd2ds2Rf=4πΔfR^{*}\frac{d^{2}}{ds^{2}}Rf={-4\pi}\sqrt{-\Delta}f [31, Example 9], so the singularities of ff are preserved and emphasized by order 11 in Sobolev scale, so they will dominate the Lambda reconstruction. Hence choosing m=2m=2 is sufficient for a full recovery of the image singularities. Since the singularities are dominant in the lambda term, they are matched accurately in (5.2). Indeed we have already seen the effectiveness of such a filtering approach in recovering the image contours earlier in the right hand of figure 9. We find that setting m=2m=2 here works well as a regularizer on synthetic image phantoms and simulated data with added pseudo random noise, as we shall now demonstrate. We note that the derivative filters for m2m\neq 2 are also worth exploration but we leave such analysis for future work.

5.3. Proposed testing and comparison to the literature

To test our reconstruction method, we first consider two test phantoms, one simple and one complex (as in [55]). See figure 11. The phantoms considered are supported on Γ\Gamma, the region in figure 7 in which there is full wavefront coverage from joint X-ray and Compton scattered data. The simple density phantom consists of a Polyvinyl Chloride (PVC) cuboid and an Aluminium sphere with an approximate density ratio of 1:2 (PVC:Al). The complex density phantom consists of a water ellipsoid, a Sulfur ellipsoid, a Calcium sulfate (CaSO4\text{CaSO}_{4}) right-angled-triangle and a thin film of Titanium dioxide (TiO2\text{TiO}_{2}) in the shape of a cross. The density ratio of the materials which compose the complex phantom is approximately 1:2:3:4 (H2O\text{H}_{2}\text{O}:S:CaSO4\text{CaSO}_{4}:TiO2\text{TiO}_{2}). The density values used are those of figure 8 taken from the NIST database [28], and the background densities (0\approx 0) were set to the density of dry air (near sea level). The corresponding attenuation coefficient phantoms are simulated similarly. The materials considered are widely used in practice. For example CaSO4\text{CaSO}_{4} is used in the production of plaster of Paris and stucco (a common construction material) [57], and TiO2\text{TiO}_{2} is used in the making of decorative thin films (e.g. topaz) and in pigmentation [56, page 15].

To simulate data we set

(5.3) b=(b1b2)=(RLμE𝒯ne,)b=\begin{pmatrix}b_{1}\\ b_{2}\\ \end{pmatrix}=\begin{pmatrix}R_{L}\mu_{E}\\ \mathcal{T}n_{e},\\ \end{pmatrix}

and add a Gaussian noise

(5.4) bη=b+ηb2vGl,b_{\eta}=b+\eta\|b\|_{2}\frac{v_{G}}{\sqrt{l}},

for some noise level η\eta, where ll is the length of bb and vGv_{G} is a vector of length ll of draws from 𝒩(0,1)\mathcal{N}(0,1). For comparison we present separate reconstructions of μE\mu_{E} and nen_{e} using Total Variation (TV regularizers). That is we will find

(5.5) argminμERLμEb122+αTV(μE)\operatorname*{arg\,min}_{\mu_{E}}\left\|R_{L}\mu_{E}-b_{1}\right\|^{2}_{2}+\alpha\text{TV}(\mu_{E})

to reconstruct μE\mu_{E} and

(5.6) argminne𝒯neb222+αTV(ne)\operatorname*{arg\,min}_{n_{e}}\left\|\mathcal{T}n_{e}-b_{2}\right\|^{2}_{2}+\alpha\text{TV}(n_{e})

for nen_{e}, where TV(f)=f1\text{TV}(f)=\|\nabla f\|_{1} and α>0\alpha>0 is a regularization parameter. We will denote this method as “TV”. In addition we present reconstructions using the state-of-the-art joint reconstruction and regularization techniques from the literature, namely the Joint Total Variation (JTV) methods of [20] and the Linear Parallel Level Sets (LPLS) methods of [12]. To implement JTV we minimize

(5.7) argminμE,ne(wRL00𝒯)(μEne)(wb1b2)22+αJTVβ(μE,ne),\operatorname*{arg\,min}_{\mu_{E},n_{e}}\left\|\begin{pmatrix}wR_{L}&0\\ 0&\mathcal{T}\\ \end{pmatrix}\begin{pmatrix}\mu_{E}\\ n_{e}\\ \end{pmatrix}-\begin{pmatrix}wb_{1}\\ b_{2}\\ \end{pmatrix}\right\|^{2}_{2}+\alpha\text{JTV}_{\beta}(\mu_{E},n_{e}),

where w=𝒯2/RL2w=\|\mathcal{T}\|_{2}/\|R_{L}\|_{2} as before, and

(5.8) JTVβ(μE,ne)=[2,2]×[3,1](μE(𝐱)22+ne(𝐱)22+β2)12d𝐱,\text{JTV}_{\beta}(\mu_{E},n_{e})=\int_{[-2,2]\times[-3,1]}\left(\|\nabla\mu_{E}({\mathbf{x}})\|_{2}^{2}+\|\nabla n_{e}({\mathbf{x}})\|_{2}^{2}+\beta^{2}\right)^{\frac{1}{2}}\mathrm{d}{\mathbf{x}},

where β>0\beta>0 is an additional hyperparameter included so that the gradient of JTVβ\text{JTV}_{\beta} is defined at zero. This allows one to apply techniques in smooth optimization to solve (5.7).

Density nen_{e} Attenuation μE\mu_{E}, E=100E=100keV

TV

Refer to caption Refer to caption

JLAM

Refer to caption Refer to caption

JTV

Refer to caption Refer to caption

LPLS

Refer to caption Refer to caption
Figure 12. Complex phantom reconstructions, noise level η=0.1\eta=0.1. Comparison of methods TV, JLAM, JTV and LPLS.

To implement LPLS we minimize

(5.9) argminμE,ne(wRL00𝒯)(μEne)(wb1b2)22+αLPLSβ(μE,ne),\operatorname*{arg\,min}_{\mu_{E},n_{e}}\left\|\begin{pmatrix}wR_{L}&0\\ 0&\mathcal{T}\\ \end{pmatrix}\begin{pmatrix}\mu_{E}\\ n_{e}\\ \end{pmatrix}-\begin{pmatrix}wb_{1}\\ b_{2}\\ \end{pmatrix}\right\|^{2}_{2}+\alpha\text{LPLS}_{\beta}(\mu_{E},n_{e}),

where

(5.10) LPLSβ(μE,ne)=[2,2]×[3,1]μE(𝐱)βne(𝐱)β|μE(𝐱)ne(𝐱)|β2d𝐱,\text{LPLS}_{\beta}(\mu_{E},n_{e})=\int_{[-2,2]\times[-3,1]}\|\nabla\mu_{E}({\mathbf{x}})\|_{\beta}\|\nabla n_{e}({\mathbf{x}})\|_{\beta}-|\nabla\mu_{E}({\mathbf{x}})\cdot\nabla n_{e}({\mathbf{x}})|_{\beta^{2}}\mathrm{d}{\mathbf{x}},

where |𝐱|β=|𝐱|2+β2|{\mathbf{x}}|_{\beta}=\sqrt{|{\mathbf{x}}|^{2}+\beta^{2}} and 𝐱β=𝐱22+β2\|{\mathbf{x}}\|_{\beta}=\sqrt{\|{\mathbf{x}}\|^{2}_{2}+\beta^{2}} for β>0\beta>0. The JTV and LPLS penalties seek to impose soft constraints on the equality of the image wavefront sets of μE\mu_{E} and nen_{e}. For example setting β=0\beta=0 in the calculation of LPLSβ\text{LPLS}_{\beta} yields

(5.11) LPLSβ(μE,ne)=μE2ne2|μEne|=μE2ne2(1|cosθ|),\begin{split}\text{LPLS}_{\beta}(\mu_{E},n_{e})&=\|\nabla\mu_{E}\|_{2}\|\nabla n_{e}\|_{2}-|\nabla\mu_{E}\cdot\nabla n_{e}|\\ &=\|\nabla\mu_{E}\|_{2}\|\nabla n_{e}\|_{2}(1-|\cos\theta|),\end{split}

where θ\theta is the angle between ne\nabla n_{e} and μE\nabla\mu_{E}. Hence (5.11) is minimized for the gradients which are parallel (i.e. when θ=0,π\theta=0,\pi), and thus using LPLSβ\text{LPLS}_{\beta} as regularization serves to enforce equality in the image gradient direction and location (i.e. when LPLSβ\text{LPLS}_{\beta} is small, the gradient directions are approximately equal).

We wish to stress that the comparison with JTV and LPLS is included purely to illustrate the potential advantages (and disadvantages) of the lambda regularizers when compared to the state-of-the-art regularization techniques. Namely is the improvement in image quality due to joint data, lambda regularizers or are they both beneficial? We are not claiming a state-of-the-art performance using JLAM, but our results show JLAM has good performance, and it is numerically easier to implement, requiring only least squares solvers. There are two hyperparameters (α\alpha and β\beta) to be tuned in order to implement JTV and LPLS, which is more numerically intensive (e.g. using cross validation) in contrast to JLAM with only one hyperparameter (α\alpha). Moreover the LPLS objective is non-convex [12, appendix A], and hence there are potential local minima to contend with, which is not an issue with JLAM, being a simple quadratic objective.

ϵ\epsilon TV JLAM JTV LPLS
nen_{e} .36.36 .24.24 .16.16 .09.09
μE\mu_{E} .63.63 .30.30 .19.19 .13.13
FF-score TV JLAM JTV LPLS
supp(ne)\text{supp}(n_{e}) .78.78 .98.98 .97.97 .99.99
ne\nabla n_{e} .73.73 .83.83 .84.84 .84.84
supp(μE)\text{supp}(\mu_{E}) .65.65 .94.94 .98.98 .99.99
μE\nabla\mu_{E} .39.39 .83.83 .84.84 .85.85
Table 3. Complex phantom ϵ\epsilon and FF-score comparison using TV, JLAM, JTV and LPLS.
ϵ±\epsilon_{\pm} JLAM JTV LPLS
nen_{e} .25±.02.25\pm.02 .13±.03.13\pm.03 .13±.03.13\pm.03
μE\mu_{E} .31±.03.31\pm.03 .16±.02.16\pm.02 .17±.04.17\pm.04
F±F_{\pm} JLAM JTV LPLS
supp(ne)\text{supp}(n_{e}) .98±.01.98\pm.01 .98±.005.98\pm.005 .99±.004.99\pm.004
ne\nabla n_{e} .76±.06.76\pm.06 .78±.05.78\pm.05 .75±.05.75\pm.05
supp(μE)\text{supp}(\mu_{E}) .90±.06.90\pm.06 .96±.03.96\pm.03 .98±.007.98\pm.007
μE\nabla\mu_{E} .73±.07.73\pm.07 .77±.05.77\pm.05 .74±.05.74\pm.05
Table 4. Randomized complex phantom ϵ±\epsilon_{\pm} and F±F_{\pm} comparison over 100 runs using JLAM, JTV and LPLS.

To minimize (5.2), we store the discrete forms of RLR_{L}, RR and 𝒯\mathcal{T} as sparse matrices and apply the Conjugate Gradient Least Squares (CGLS) solvers of [22, 23] (specifically the “IRnnfcgls” code) with non-negativity constraints (since the physical quantities nen_{e} and μE\mu_{E} are known a-priori to be nonnegative). To solve equations (5.5) and (5.6) we apply the heuristic least squares solvers of [22, 23] (specifically the “IRhtv” code) with TV penalties and non-negativity constraints. To solve (5.7) and (5.9) we apply the codes of [12], modified so as to suit a Gaussian noise model (a Poisson model is used in [12, equation 3]). The relative reconstruction error ϵ\epsilon is calculated as ϵ=𝐱𝐲2/𝐱2\epsilon=\|{\mathbf{x}}-{\mathbf{y}}\|_{2}/\|{\mathbf{x}}\|_{2}, where 𝐱{\mathbf{x}} is the ground truth image and 𝐲{\mathbf{y}} is the reconstruction. For all methods compared against we simulate data and added noise as in equations (5.3) and (5.4), and the noise level added for each simulation is η=0.1\eta=0.1 (10%10\% noise). We choose α\alpha for each method such that ϵ\epsilon is minimized for a noise level of η=0.1\eta=0.1. That is we are comparing the best possible performance of each method. We set β\beta for JTV and LPLS to the values used on the “lines2” data set of [12]. We do not tune β\beta to the best performance (as with α\alpha) so as to give fair comparison between TV, JLAM, JTV and LPLS. After the optimal hyperparameters were selected, we performed 100 runs of TV, JLAM, JTV and LPLS on both phantoms for 100 randomly selected sets of materials. That is, for 100 randomly chosen sets of values from figure 8 and the NIST database, with the NIST values corresponding to the nonzero parts of the phantoms. We present the mean (μϵ\mu_{\epsilon}) and standard deviation (σϵ\sigma_{\epsilon}) relative errors over 100 runs in the left-hand of tables 2 and 4 for the simple and complex phantom respectively. The results are given in the form ϵ±=μϵ±σϵ\epsilon_{\pm}=\mu_{\epsilon}\pm\sigma_{\epsilon} for each method. In addition to the relative error ϵ\epsilon, we also provide metrics to measure the structural accuracy of the results. Specifically we will compare FF-scores on the image gradient and support, as is done in [49, 2]. The gradient FF-score [2] measures the wavefront set reconstruction accuracy, and the support FF-score [49, page 5] (see DICE score) is a measure of the geometric accuracy. That is, the support FF-score checks whether the reconstructed phantoms are the correct shape and size. The FF-score takes values on [0,1][0,1]. For this metric, values close to one indicate higher performance, and conversely for values close to zero. Similarly to ϵ\epsilon, we present the FF-scores of the randomized tests in the form F±=μF±σFF_{\pm}=\mu_{F}\pm\sigma_{F}, where μF\mu_{F} and σF\sigma_{F} are the mean and standard deviation FF-scores respectively. In all tables, the support FF-scores are labelled by supp(ne)\text{supp}(n_{e}) and supp(μE)\text{supp}(\mu_{E}), and by ne\nabla n_{e} and μE\nabla\mu_{E} for the gradient FF-scores.

5.4. Results and discussion

See figure 11 for image reconstructions of the simple phantom using TV, JLAM, JTV and LPLS, and see table 1 for the corresponding ϵ\epsilon and FF-score values. See table 2 for the ϵ±\epsilon_{\pm} and F±F_{\pm} values calculated over 100 randomized simple phantom reconstructions. For the complex phantom, see figure 12 for image reconstructions, and table 3 for the ϵ\epsilon and FF-score values. See table 4 for ϵ±\epsilon_{\pm} and F±F_{\pm}. In the separate reconstruction of nen_{e} (using method TV) we see a blurring of the ground truth image edges (wavefront directions) in the horizontal direction and there are artefacts in the reconstruction due to limited data, as predicted by our microlocal theory. In the TV reconstruction of μE\mu_{E} we see a similar effect, but in this case we fail to resolve the wavefront directions in the vertical direction due to limited line integral data. This is as predicted by the theory of section 4 and [5].

Refer to caption
Refer to caption
Figure 13. Horizontal Al bar density (left) and attenuation (right) phantoms.

In section 3 we discovered the existence also of nonlocal artefacts in the nen_{e} reconstruction, which were induced by the mappings λij\lambda_{ij}. However these were found to lie largely outside the imaging space unless the singularity in question (𝐱,𝝃)WF(ne)({\mathbf{x}},\boldsymbol{\xi})\in\text{WF}(n_{e}) were such that 𝐱{\mathbf{x}} is close to the detector array (see figures 3 and 4). Hence why we do not see the effects of the λij\lambda_{ij} in the phantom reconstructions, as the phantoms are bounded sufficiently away from the detector array. The added regularization may smooth out such artefacts also, which was found to be the case in [55].

Using the joint reconstruction methods (i.e. JLAM, JTV and LPLS) we see a large reduction in the image artefacts in nen_{e} and μE\mu_{E}, since with joint data we are able to stably resolve the image singularities in all directions. The improvement in ϵ\epsilon and the FF-score is also significant, particularly in the μE\mu_{E} reconstruction. Thus it seems that the joint data is the greater contributor (over the regularization) to the improvement in the image quality, as the approaches with joint data each perform well.

Density nen_{e} Attenuation μE\mu_{E}, E=100E=100keV

TV

Refer to caption Refer to caption

JLAM

Refer to caption Refer to caption

JTV

Refer to caption Refer to caption

LPLS

Refer to caption Refer to caption
Figure 14. Horizontal bar phantom reconstructions, noise level η=0.1\eta=0.1. Comparison of methods TV, JLAM, JTV and LPLS.

Upon comparison of JLAM, JTV and LPLS, the ϵ\epsilon metrics are significantly improved when using JTV and LPLS over JLAM, but the image quality and FF-scores are highly comparable. This indicates that, while the noise in the reconstruction is higher using JLAM, the recovery of the image edges and support is similar using JLAM, JTV and JLAM. As theorized, the lambda regularizers were successful in preserving the wavefront sets of μE\mu_{E} and nen_{e}. However there is a distortion present in the nonzero parts of the JLAM reconstruction. This is the most notable difference in JLAM and JTV/LPMS, and is likely the cause of the ϵ\epsilon discrepancy. So while the edge preservation and geometric accuracy of JLAM is of a high quality (and this was our goal), the smoothing properties of JLAM are not up to par with the state-of-the-art currently. We note however that the JTV and LPLS objectives are nonlinear (with LPLS also non-convex) and require significant additional machinery (e.g. in the implementation of the code of [12] used here) in the inversion when compared to JLAM, which is a straight forward implementation of linear least squares solvers.

ϵ\epsilon TV JLAM JTV LPLS
nen_{e} .28.28 .09.09 .04.04 .02.02
μE\mu_{E} .68.68 .11.11 .09.09 .07.07
FF-score TV JLAM JTV LPLS
supp(ne)\text{supp}(n_{e}) .71.71 .99.99 1\sim 1 1\sim 1
ne\nabla n_{e} .64.64 .82.82 .79.79 .83.83
supp(μE)\text{supp}(\mu_{E}) .54.54 1\sim 1 1\sim 1 1\sim 1
μE\nabla\mu_{E} .53.53 .82.82 .80.80 .90.90
Table 5. Al bar phantom ϵ\epsilon and FF-score comparison using TV, JLAM, JTV and LPLS.
ϵ±\epsilon_{\pm} JLAM JTV LPLS
nen_{e} .10±.02.10\pm.02 .04±.004.04\pm.004 .03±.02.03\pm.02
μE\mu_{E} .12±.03.12\pm.03 .12±.03.12\pm.03 .08±.03.08\pm.03
F±F_{\pm} JLAM JTV LPLS
supp(ne)\text{supp}(n_{e}) .99±.02.99\pm.02 1±.003\sim 1\pm.003 1±.001\sim 1\pm.001
ne\nabla n_{e} .83±.02.83\pm.02 .79±.02.79\pm.02 .84±.02.84\pm.02
supp(μE)\text{supp}(\mu_{E}) .98±.06.98\pm.06 .99±.02.99\pm.02 1±.008\sim 1\pm.008
μE\nabla\mu_{E} .81±.03.81\pm.03 .77±.02.77\pm.02 .85±.02.85\pm.02
Table 6. Randomized bar phantom ϵ±\epsilon_{\pm} and F±F_{\pm} comparison over all NIST materials considered (153 runs) using JLAM, JTV and LPLS.

5.5. Reconstructions with limited data

The simple and complex phantoms considered thus far are supported within Γ\Gamma (the yellow region of figure 7) so as to allow for a full wavefront coverage in the reconstruction. To investigate what happens when the object is supported outside of Γ\Gamma, we present additional reconstructions of an Aluminium bar phantom with support towards the bottom (close to x2=3x_{2}=-3) of the reconstruction space. See figure 13. In this case we have limited data and the full wavefront coverage is not available with the combined X-ray and Compton data sets. Image reconstructions of the Al bar phantom are presented in figure 14, and the corresponding ϵ\epsilon and FF-score values are displayed in table 5. See table 4 for the ϵ±\epsilon_{\pm} and F±F_{\pm} values corresponding to the randomized bar phantom reconstructions. In this case ϵ±\epsilon_{\pm} and F±F_{\pm} were calculated from reconstructions of 153 bar phantoms (we used 100 runs previously), replacing the Al density value of figure 13 with one of each NIST value considered (153 in total). The reconstruction processes and hyperparameter selection applied here were exactly the same as for the simple and complex phantom. In this case we see artefacts in the Compton reconstruction along curves which follow the shape of the boundary of Γ\Gamma, and the X-ray artefacts constitute a vertical blurring as before. The ϵ\epsilon error when using JLAM, JTV and LPLS is more comparable in this example (compared to tables 1 and 3), particularly in the case of the μE\mu_{E} phantom. The image quality and FF-scores are again similar as with the simple and complex phantom examples. All joint reconstruction methods were successful in removing the image artefacts observed in the separate reconstructions, and thus can offer satisfactory image quality under the constraints of limited data. However this is only a single test of the capabilities of JLAM, JTV and LPLS with limited data and we leave future work to conclude such analysis.

6. Conclusions and further work

Here we have introduced a new joint reconstruction method “JLAM” for low effective ZZ imaging (Z<20Z<20), based on ideas in lambda tomography. We considered primarily the “parallel line segment” geometry of [54], which is motivated by system architectures for airport security screening applications. In section 3 we gave a microlocal analysis of the toric section transform 𝒯\mathcal{T}, which was first proposed in [54] for a CST problem. Explicit expressions were provided for the microlocal artefacts and verified through simulation. Section 4 explained the X-ray CT artefacts using the theory of [5]. Following the theory of sections 3 and 4, we detailed the JLAM algorithm in section 5. Here we conducted simulation testing and compared JLAM to separate reconstructions using TV, and to the nonlinear joint inversion methods, JTV [20] and LPLS [12] from the literature. The joint inversion methods considered (i.e. JLAM, JTV and LPLS) were successful in preserving the image contours in the reconstruction, as predicted. However the smoothing applied by JLAM was not as effective as JTV and LPLS, and we saw a distortion in the JLAM reconstruction (see figures 11 and 12). JTV and LPLS were thus shown to offer better performance than JLAM, with LPLS producing the best results overall. The advantages of JLAM over JTV and LPLS are in the fast, linear inversion, and the reduction in tuning parameters (one for JLAM, two for JTV/LPLS). Given the linearity of JLAM, the ideas of JTV and LPLS can be easily integrated with lambda regularization to modify the objectives of the literature and improve further the edge resolution of the reconstruction. To preserve the linearity of JLAM we could also combine JLAM with a Tikhonov regularizer. This may help smooth out the distortion observed in the JLAM reconstruction. We leave such ideas for future work.

Acknowledgements

We would like to thank the journal reviewers for their helpful comments and insight towards this article, particularly in regards to the simulation study. This material is based upon work supported by the U.S. Department of Homeland Security, Science and Technology Directorate, Office of University Programs, under Grant Award 2013-ST-061-ED0001. The views and conclusions contained in this document are those of the authors and should not be interpreted as necessarily representing the official policies, either expressed or implied, of the U.S. Department of Homeland Security. The work of the second author was partially supported by U.S. National Science Foundation grant DMS 1712207. The authors thank the referees for thorough reviews and thoughtful comments that improved the article.

References

  • [1] A. Aghasi, I. Mendoza-Sanchez, E. L. Miller, C. A. Ramsburg, and L. M. Abriola. A geometric approach to joint inversion with applications to contaminant source zone characterization. Inverse problems, 29(11):115014, 2013.
  • [2] H. Andrade-Loarca, G. Kutyniok, and O. Öktem. Shearlets as feature extractor for semantic edge detection: The model-based and data-driven realm. arXiv preprint arXiv:1911.12159, 2019.
  • [3] S. R. Arridge, M. Burger, and M. J. Ehrhardt. Preface to special issue on joint reconstruction and multi-modality/multi-spectral imaging. Inverse Problems, 36(2):020302, 2020.
  • [4] P. Blomgren and T. F. Chan. Color tv: total variation methods for restoration of vector-valued images. IEEE Transactions on Image Processing, 7(3):304–309, 1998.
  • [5] L. Borg, J. Frikel, J. S. Jørgensen, and E. T. Quinto. Analyzing reconstruction artifacts from arbitrary incomplete x-ray ct data. SIAM Journal on Imaging Sciences, 11(4):2786–2814, 2018.
  • [6] T. A. Bubba, G. Kutyniok, M. Lassas, M. Maerz, W. Samek, S. Siltanen, and V. Srinivasan. Learning the invisible: A hybrid deep learning-shearlet framework for limited angle computed tomography. Inverse Problems, 2019.
  • [7] A. S. Denisyuk. Inversion of the generalized Radon transform. In Applied problems of Radon transform, volume 162 of Amer. Math. Soc. Transl. Ser. 2, pages 19–32. Amer. Math. Soc., Providence, RI, 1994.
  • [8] J. J. Duistermaat. Fourier integral operators, volume 130 of Progress in Mathematics. Birkhäuser, Inc., Boston, MA, 1996.
  • [9] J. J. Duistermaat and L. Hormander. Fourier integral operators, volume 2. Springer, 1996.
  • [10] M. J. Ehrhardt and S. R. Arridge. Vector-valued image processing by parallel level sets. IEEE Transactions on Image Processing, 23(1):9–18, 2013.
  • [11] M. J. Ehrhardt and M. M. Betcke. Multicontrast mri reconstruction with structure-guided total variation. SIAM Journal on Imaging Sciences, 9(3):1084–1106, 2016.
  • [12] M. J. Ehrhardt, K. Thielemans, L. Pizarro, D. Atkinson, S. Ourselin, B. F. Hutton, and S. R. Arridge. Joint reconstruction of PET-MRI by exploiting structural similarity. Inverse Problems, 31(1):015001, dec 2014.
  • [13] A. Faridani, D. Finch, E. L. Ritman, and K. T. Smith. Local tomography, II. SIAM J. Appl. Math., 57:1095–1127, 1997.
  • [14] A. Faridani, E. L. Ritman, and K. T. Smith. Local tomography. SIAM J. Appl. Math., 52:459–484, 1992.
  • [15] J. Frikel and E. T. Quinto. Artifacts in incomplete data tomography with applications to photoacoustic tomography and sonar. SIAM J. Appl. Math., 75(2):703–725, 2015.
  • [16] L. A. Gallardo and M. A. Meju. Joint two-dimensional dc resistivity and seismic travel time inversion with cross-gradients constraints. Journal of Geophysical Research: Solid Earth, 109(B3), 2004.
  • [17] L. A. Gallardo and M. A. Meju. Joint two-dimensional cross-gradient imaging of magnetotelluric and seismic traveltime data for structural and lithological classification. Geophysical Journal International, 169(3):1261–1272, 2007.
  • [18] V. Guillemin and S. Sternberg. Geometric Asymptotics. American Mathematical Society, Providence, RI, 1977.
  • [19] H. E. Guven, E. L. Miller, and R. O. Cleveland. Multi-parameter acoustic imaging of uniform objects in inhomogeneous soft tissue. IEEE transactions on ultrasonics, ferroelectrics, and frequency control, 59(8):1700–1712, 2012.
  • [20] E. Haber and M. H. Gazit. Model fusion and joint inversion. Surveys in Geophysics, 34(5):675–695, 2013.
  • [21] P. C. Hansen. Rank-deficient and discrete ill-posed problems: numerical aspects of linear inversion, volume 4. Siam, 2005.
  • [22] P. C. Hansen. Regularization Tools version 4.0 for Matlab 7.3. Numer. Algorithms, 46(2):189–194, 2007.
  • [23] P. C. Hansen and J. S. Jø rgensen. AIR Tools II: algebraic iterative reconstruction methods, improved implementation. Numer. Algorithms, 79(1):107–137, 2018.
  • [24] L. Hörmander. Fourier Integral Operators, I. Acta Mathematica, 127:79–183, 1971.
  • [25] L. Hörmander. The analysis of linear partial differential operators. I. Classics in Mathematics. Springer-Verlag, Berlin, 2003. Distribution theory and Fourier analysis, Reprint of the second (1990) edition [Springer, Berlin].
  • [26] L. Hörmander. The analysis of linear partial differential operators. III. Classics in Mathematics. Springer, Berlin, 2007. Pseudo-differential operators, Reprint of the 1994 edition.
  • [27] L. Hörmander. The analysis of linear partial differential operators. IV. Classics in Mathematics. Springer-Verlag, Berlin, 2009. Fourier integral operators, Reprint of the 1994 edition.
  • [28] J. Hubbell. Photon cross sections, attenuation coefficients and energy absorption coefficients. National Bureau of Standards Report NSRDS-NBS29, Washington DC, 1969.
  • [29] J. H. Hubbell and S. M. Seltzer. Tables of x-ray mass attenuation coefficients and mass energy-absorption coefficients 1 kev to 20 mev for elements z= 1 to 92 and 48 additional substances of dosimetric interest. Technical report, National Inst. of Standards and Technology-PL, Gaithersburg, MD (United …, 1995.
  • [30] W. A. Kalender. X-ray computed tomography. Physics in Medicine & Biology, 51(13):R29, 2006.
  • [31] V. P. Krishnan and E. T. Quinto. Microlocal analysis in tomography. Handbook of mathematical methods in imaging, pages 1–50, 2014.
  • [32] A. K. Louis. Combining Image Reconstruction and Image Analysis with an Application to Two-Dimensional Tomography. SIAM J. Img. Sci., 1:188–208, 2008.
  • [33] F. Natterer. The mathematics of computerized tomography. Classics in Mathematics. Society for Industrial and Applied Mathematics (SIAM), New York, 2001.
  • [34] S. J. Norton. Compton scattering tomography. Journal of applied physics, 76(4):2007–2015, 1994.
  • [35] V. Palamodov. An analytic reconstruction for the compton scattering tomography in a plane. Inverse Problems, 27(12):125004, 2011.
  • [36] V. P. Palamodov. A uniform reconstruction formula in integral geometry. Inverse Problems, 28(6):065014, 2012.
  • [37] E. T. Quinto. The dependence of the generalized Radon transform on defining measures. Trans. Amer. Math. Soc., 257:331–346, 1980.
  • [38] E. T. Quinto. Singularities of the X-ray transform and limited data tomography in 2{\mathbb{R}}^{2} and 3{\mathbb{R}}^{3}. SIAM J. Math. Anal., 24:1215–1225, 1993.
  • [39] E. T. Quinto and O. Öktem. Local tomography in electron microscopy. SIAM J. Appl. Math., 68:1282–1303, 2008.
  • [40] H. Rezaee, B. Tracey, and E. L. Miller. On the fusion of compton scatter and attenuation data for limited-view x-ray tomographic applications. arXiv preprint arXiv:1707.01530, 2017.
  • [41] G. Rigaud. Compton scattering tomography: feature reconstruction and rotation-free modality. SIAM J. Imaging Sci., 10(4):2217–2249, 2017.
  • [42] G. Rigaud. 3d compton scattering imaging: study of the spectrum and contour reconstruction. arXiv preprint arXiv:1908.03066, 2019.
  • [43] G. Rigaud and B. N. Hahn. 3d compton scattering imaging and contour reconstruction for a class of radon transforms. Inverse Problems, 34(7):075004, 2018.
  • [44] W. Rudin. Functional analysis. McGraw-Hill Book Co., New York, 1973. McGraw-Hill Series in Higher Mathematics.
  • [45] G. Sapiro and D. L. Ringach. Anisotropic diffusion of multivalued images with applications to color filtering. IEEE transactions on image processing, 5(11):1582–1586, 1996.
  • [46] O. Semerci, N. Hao, M. E. Kilmer, and E. L. Miller. Tensor-based formulation and nuclear norm regularization for multienergy computed tomography. IEEE Transactions on Image Processing, 23(4):1678–1693, 2014.
  • [47] O. Semerci and E. L. Miller. A parametric level-set approach to simultaneous object identification and background reconstruction for dual-energy computed tomography. IEEE transactions on image processing, 21(5):2719–2734, 2012.
  • [48] N. Sochen, R. Kimmel, and R. Malladi. A general framework for low level vision. IEEE transactions on image processing, 7(3):310–318, 1998.
  • [49] A. A. Taha and A. Hanbury. Metrics for evaluating 3d medical image segmentation: analysis, selection, and tool. BMC medical imaging, 15(1):29, 2015.
  • [50] B. H. Tracey and E. L. Miller. Stabilizing dual-energy x-ray computed tomography reconstructions using patch-based regularization. Inverse Problems, 31(10):105004, 2015.
  • [51] T.-T. Truong and M. K. Nguyen. Compton scatter tomography in annular domains. Inverse Problems, 2019.
  • [52] E. Vainberg, I. A. Kazak, and V. P. Kurozaev. Reconstruction of the internal three-dimensional structure of objects based on real-time integral projections. Soviet Journal of Nondestructive Testing, 17:415–423, 1981.
  • [53] N. Wadeson. Modelling and correction of Scatter in a switched source multi-ring detector x-ray CT machine. PhD thesis, The University of Manchester (United Kingdom), 2011.
  • [54] J. Webber and E. Miller. Compton scattering tomography in translational geometries. Technical report, Tufts University, 2019.
  • [55] J. Webber and E. T. Quinto. Microlocal analysis of a Compton tomography problem. SIAM Journal on Imaging Sciences, 2020. to appear.
  • [56] J. Winkler. Titanium Dioxide: Production, Properties and Effective Usage 2nd Revised Edition. Vincentz Network, 2014.
  • [57] F. Wirsching. Calcium sulfate. Ullmann’s encyclopedia of industrial chemistry, 2000.

Appendix A Generating the plots of figure 8

The generation of the plots of figure 8 is explained in more detail here. We will explain the generation of the plot for E=100E=100keV. Refer to figure 15. We first plotted μE\mu_{E} for E=100E=100keV against nen_{e} for all materials in the NIST database [28] with effective ZZ less than 20. This is the left hand plot of figure 15. The set of materials with effective Z<20Z<20 was

Zeff={Z:σE(Z)<σE(20),E=100keV},Z_{\text{eff}}=\{Z:\sigma_{E}(Z)<\sigma_{E}(20),E=100\text{keV}\},

where σE\sigma_{E} is the electron cross section. We noticed a large outlier (coal, or amorphous Carbon) which corrupts the correlation in our favour, and hence we chose to remove the material from consideration in simulation. The outlier is highlighted in the left hand plot. After the outlier was removed we noticed a number of materials located at the origin (with negligible attenuation coefficient and density, such as air) in the middle scatter plot of figure 15. As such materials again bias the correlation and plot standard deviation in our favour, these were removed to produce the left hand plot of figure 8 in the right hand of figure 15. The same points were removed in the generation of the right hand plot of figure 8 also, for E=1E=1MeV.

Refer to caption
Refer to caption
Refer to caption
Figure 15. Scatter plot with outlier and origin points included (left, R=0.98), scatter plot with the outlier removed and origin points included, the origin points highlighted by an orange circle (middle, R=0.95), and the scatter plot of figure 8 with outliers and origin points removed (right, R=0.93).