This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Time-Resolved Focused Ion Beam Microscopy: Modeling, Estimation Methods, and Analyses

Minxu Peng, John Murray-Bruce, and Vivek K Goyal M. Peng and V. K. Goyal are with the Department of Electrical and Computer Engineering, Boston University, Boston, MA 02215 USA (e-mail: [email protected]; [email protected]).J. Murray-Bruce is with the Department of Computer Science and Engineering, University of South Florida, Tampa, FL 33620 USA (e-mail: [email protected]).This work was supported in part by the US National Science Foundation under Grant No. 1422034 and Grant No. 1815896.
Abstract

In a focused ion beam (FIB) microscope, source particles interact with a small volume of a sample to generate secondary electrons that are detected, pixel by pixel, to produce a micrograph. Randomness of the number of incident particles causes excess variation in the micrograph, beyond the variation in the underlying particle–sample interaction. We recently demonstrated that joint processing of multiple time-resolved measurements from a single pixel can mitigate this effect of source shot noise in helium ion microscopy. This paper is focused on establishing a rigorous framework for understanding the potential for this approach. It introduces idealized continuous- and discrete-time abstractions of FIB microscopy with direct electron detection and estimation-theoretic limits of imaging performance under these measurement models. Novel estimators for use with continuous-time measurements are introduced and analyzed, and estimators for use with discrete-time measurements are analyzed and shown to approach their continuous-time counterparts as time resolution is increased. Simulated FIB microscopy results are consistent with theoretical analyses and demonstrate that substantial improvements over conventional FIB microscopy image formation are made possible by time-resolved measurement.

Index Terms:
computational imaging, Fisher information, gallium ion microscopy, helium ion microscopy, scanning electron microscopy, Poisson processes, shot noise, statistical modeling and estimation.

I Introduction

The ability to image the structure of a sample at nanoscale resolution using microscopes that scan samples with a focused beam of particles is critical in material science and the life sciences. In a scanning electron microscope (SEM) [1], a focused electron beam is raster scanned over the sample, causing the sample to emit secondary electrons (SEs). An SEM is capable of providing information regarding composition and distribution of sample components. Resembling an SEM, a focused ion beam (FIB) microscope [2] instead uses a focused beam of ions, such as gallium, helium, or xenon. As one member of the FIB microscope family, a helium ion microscope (HIM) [3] offers many advantages compared to an SEM, leading to widespread use in semiconductor and biological imaging [4, 5, 6, 7]. The interaction volume with the sample is much smaller for ions compared to that for electrons, resulting in higher contrast [8]. Higher particle mass also leads to higher overall SE yield [9, 10]. Furthermore, to image insulating samples with an SEM, prior deposition of a conductive coating is required to prevent the accumulation of electron charges on the sample; an HIM uses an electron flood gun to prevent charge accumulation, thus avoiding the masking of subtle features by a coating [11].

The number of incident ions determines the number of sample interactions that can be measured. Hence, increasing this number—through increased beam current or increased dwell time—will ideally create measurements that are more informative about the mean number of SEs per incident ion, which is the sample property of interest. However, the vastly greater mass of ions compared to electrons (by a factor of 7.3×1037.3\times 10^{3} for helium or 1.3×1051.3\times 10^{5} for gallium) makes sputtering much more significant for FIB microscopy than for SEM. Various studies have shown how the sputtering damage induced by helium ions evolves with increasing numbers of incident ions [12, 13, 14], and this damage is often determinative of the best possible image quality. Dose is conventionally defined as the number of incident ions per unit area, and dose limits to prevent significant damage have been measured for certain materials and imaging configurations. For example, a safe imaging dose for suspended graphene is as low as 101310^{13} to 101410^{14} per square centimeter [15]. It follows, for example, that a (10nm)2(10\,\mathrm{nm})^{2} pixel should be subjected to only 10 to 100 ions. In our abstractions, we dispense with spatial dimensions and hereafter express doses per pixel rather than per unit area; expressed per pixel, finer resolution necessitates lower dose limits. While randomized subsampling combined with regularized reconstruction can sometimes yield high-quality images from reduced doses [16, 17, 18], this requires piecewise smooth image structure. Here we restrict our attention to pixelwise acquisition and estimation methods that do not rely on such assumptions.

We recently introduced the concept of time-resolved (TR) measurements in FIB microscopy to mean dividing any given pixel dwell time tt into nn dwell times t/nt/n for some integer n>1n>1 [19]. Without proofs, we gave theoretical evidence that a set of TR measurements is fundamentally more informative than a single measurement with the same total dose. Experiments with HIM data demonstrated mean-squared error (MSE) improvement by about a factor of 4 at doses of 1.0 and 2.5 incident ions per pixel. Importantly, this use of time resolution is entirely for the purpose of making the measurements more informative without increasing the total dose. It is not for imaging of dynamic samples and hence not comparable to any previous use of time resolution in microscopy.

The Zeiss ORION NanoFab HIM used in [19], like most commercial FIB microscopes, uses indirect electron detection with a scintillator and photomultiplier tube. Although not yet prevalent, direct electron detection offers higher signal-to-noise ratio (SNR) as it avoids the statistical noises brought by electron–photon conversion and additional readout noise [20]. For example, Yamada et al. [21] demonstrated that the SNR of direct electron detection is 2.5 times higher than that of indirect mode at low dose. Furthermore, direct electron detection technology has also been applied to imaging in transmission electron microscopy to improve resolution [22].

In this paper, we develop comprehensive theoretical results for FIB microscopy with time-resolved direct SE detection. Indirect detection introduces many sources of noise, including spatial nonuniformity in the scintillator response, nonideal light transport from the scintillator to the photomultiplier tube, and variations in pulses generated by the photomultiplier tube. Though these effects were empirically modeled in [19], including them here would make already lengthy expressions considerably more complicated and more difficult to interpret. Restricting attention to direct detection allows us to concentrate on the implications of TR sensing for mitigation of source shot noise, separated from the effects of detection noise. While some results presented here add rigor to statements in [19], we more importantly introduce a new continuous-time abstraction for FIB microscopy that yields to more elegant analyses while also representing the ultimate limit of this technology. In addition to maximum likelihood (ML) estimation, our study includes plug-in estimators for which we can complete analytical performance analyses and that are easily generalized to settings in which SE detection is indirect.

I-A Main Contributions

  • A new continuous-time probabilistic model for FIB microscopy wherein the data at any one pixel are related to a marked Poisson process. Conventional and continuous- and discrete-time time-resolved observation models are different functions of the marked Poisson process.

  • Fisher information analyses. We show that, at any ion dose level, continuous-time time-resolved measurements have Fisher information matching an upper bound that conventional measurements meet only in a low-dose limit. For conventional measurements, asymptotic expressions presented without proof in [19] are proven here.

  • Estimator analyses. Biases and variances of quotient-mode estimators in both the continuous- and discrete-time settings are derived. Convergence of the discrete-time estimator’s performance to the performance of the continuous-time estimator is proven.

I-B Outline

We introduce our mathematical abstraction for the operation of a FIB microscope in Section II. This leads to four measurement models: an unimplementable oracle model, conventional measurement, and continuous- and discrete-time time-resolved measurement. The oracle and conventional cases are analyzed within Section II. Section III develops the novel continuous-time case in detail. Measurement distributions are derived, three estimators are introduced and simulated, and the performance of a quotient-mode estimator is rigorously analyzed. Section IV develops the discrete-time case first introduced in [19] in detail. Three estimators are simulated, and the performance of the discrete-time quotient-mode estimator is rigorously analyzed. Convergences of Fisher information and quotient-mode estimator performance to their continuous-time counterparts are shown. Section V compares all the estimators in a simulated HIM experiment, demonstrating substantial improvement of the time-resolved methods over the conventional interpretation of the collected data. Section VI provides concluding comments on how mean SE yield influences the advantages of TR methods, the time resolution necessary to capitalize on these advantages, the roles of the various estimators, and generalizations to indirect SE detection.

Table I summarizes the variables, symbols, and acronyms used in the manuscript.

Table I: List of symbols and acronyms
η\eta mean secondary electron yield
η^oracle\widehat{\eta}_{\rm oracle} oracle estimator (8)
η^baseline\widehat{\eta}_{\rm baseline} baseline estimator (14)
η^CTQM\widehat{\eta}_{\rm CTQM} continuous-time quotient mode estimator (28)
η^CTLQM\widehat{\eta}_{\rm CTLQM} continuous-time Lambert quotient mode estimator (29)
η^CTML\widehat{\eta}_{\rm CTML} continuous-time maximum likelihood estimator (30)
η^DTQM\widehat{\eta}_{\rm DTQM} discrete-time quotient mode estimator (40)
η^DTLQM\widehat{\eta}_{\rm DTLQM} discrete-time Lambert quotient mode estimator (41)
η^DTML\widehat{\eta}_{\rm DTML} discrete-time maximum likelihood estimator (42)
λ\lambda ion dose per pixel
Λ\Lambda dose per unit time
ρ\rho P(Xi>0)=1eη\mathrm{P}({X_{i}>0})=1-e^{-\eta}
ii ion index
kk discrete time index
nn number of subacquisitions
pp P(Yk>0)=1exp((λ/n)(1eη))\mathrm{P}({Y_{k}>0})=1-\exp(-({{\lambda}/{n}})(1-e^{-\eta}))
tt dwell time
Z(η;λ)\mathcal{I}_{Z}(\eta\,;\,\lambda) Fisher information about η\eta in ZZ with λ\lambda available
LL number of subacquisitions with positive SEs
L~\widetilde{L} zero-truncated version of LL
MM number of incident ions
M~\widetilde{M} number of incident ions yielding positive SEs
TiT_{i} iith ion incidence time
T~i\widetilde{T}_{i} incidence time of iith ion to yield positive SEs
XiX_{i} SEs detected due to iith incident ion
X~i\widetilde{X}_{i} SEs detected due to iith ion to yield positive SEs
YY total detected SEs
YkY_{k} detected SEs in kkth subacquisition
Y~k\widetilde{Y}_{k} positive SE counts in a subacquisition
CRB Cramér–Rao bound
CTTR continuous-time time-resolved (see observation (6))
DTTR discrete-time time-resolved (see observation (7))
FI Fisher information
LQM Lambert quotient mode
ML maximum likelihood
QM quotient mode
SE secondary electron
TR time-resolved

II Measurement Models and Basic Analyses

II-A Physical Abstractions

Throughout this paper, we model the incident ions at the sample to be imaged as a Poisson process with known rate Λ\Lambda per unit time. Imaging proceeds by raster scanning with known dwell time tt at each pixel. Hence, the number of ions MM incident on a pixel is a Poisson random variable with known parameter λ=Λt\lambda=\Lambda t. Since pixel area is not relevant in our abstraction, we refer to λ\lambda as the dose. The interaction of the iith incident ion with the sample causes a number XiX_{i} of SEs to be detected. All of these SE counts are mutually independent Poisson random variables with parameter η\eta, independent of the incident-ion Poisson process, and estimation of the mean SE yield η\eta is the objective of the imaging experiment. Our analysis is for each pixel separately, so no pixel indexing is necessary.

Note that a somewhat high 1 pA beam current corresponds to a rate of 6.2×1066.2\times 10^{6} ions per second, or a mean ion interarrival time of 160 ns. The interaction between an incident ion and the sample and the subsequent detection of SEs occurs within a few femtoseconds [23]. With the SE detections happening so quickly, we abstract the SE detections caused by an incident ion to be simultaneous with the ion incidence. Thus, the model can be described as a marked Poisson process {(T1,X1),(T2,X2),}\{(T_{1},X_{1}),\,(T_{2},X_{2}),\,\ldots\}, where (T1,T2,)(T_{1},\,T_{2},\,\ldots) is the arrival time sequence of the ions. The ion count MM is the largest ii such that TitT_{i}\leq t (with M=0M=0 when T1>tT_{1}>t). One realization on an interval [0,t][0,t] is illustrated in Fig. 1(a). Note that the arrival times (horizontal) are arbitrary positive real numbers and the marks (vertical) are nonnegative integers.

Refer to caption
(a) Underlying marked Poisson process.
Refer to caption
(b) Process observed where mark is positive.
Refer to caption
(c) Dwell time divided into n=10n=10 subintervals.
Refer to caption
(d) Discrete-time time-resolved measurement, n=10n=10.
Figure 1: Illustration of the random processes generated in the abstraction of FIB microscopy through one possible realization. (a) The underlying marked Poisson process {(T1,X1),(T2,X2),}\{(T_{1},X_{1}),\,(T_{2},X_{2}),\,\ldots\}, with ion incident at times T1,T2,T_{1},\,T_{2},\,\ldots generating detected SE counts X1,X2,X_{1},\,X_{2},\,\ldots. (b) The marked Poisson process {(T~1,X~1),(T~2,X~2),}\{(\widetilde{T}_{1},\widetilde{X}_{1}),\,(\widetilde{T}_{2},\widetilde{X}_{2}),\,\ldots\}, produced by discarding the ions for which no SEs are detected. (c) Illustration of dividing dwell time of t=20t=20 s into n=10n=10 subintervals of equal length. (d) The resulting discrete-time SE count process.

Since cases of Xi=0X_{i}=0 produce no detected SEs, the corresponding ion arrival time is not observable in practice. Thus consider also the thinned process {(T~1,X~1),(T~2,X~2),}\{(\widetilde{T}_{1},\widetilde{X}_{1}),\,(\widetilde{T}_{2},\widetilde{X}_{2}),\,\ldots\}, where T~i\widetilde{T}_{i} is the arrival time of the iith ion that produces a positive number of detected SEs and X~i\widetilde{X}_{i} is the corresponding number of detected SEs. Define M~\widetilde{M} to be the largest ii such that T~it\widetilde{T}_{i}\leq t (with M~=0\widetilde{M}=0 when T~1>t\widetilde{T}_{1}>t). Note that the thinned process is also a marked Poisson process because the events of the form {Xi=0}\{X_{i}=0\}, which determine whether an arrival in the original Poisson process is retained, are independent of the arrival time process. Fig. 1(b) illustrates the thinned process for the realization of the underlying process in Fig. 1(a).

Now suppose the observation time interval [0,t][0,t] is evenly divided into nn subintervals of length t/nt/n. Counting the total number of SEs detected in each subinterval produces a discrete-time, discrete-valued random process:

Yk={i:Ti[(k1)t/n,kt/n)}Xi,k=1, 2,,n.Y_{k}=\sum_{\{i\ :\ T_{i}\in[(k-1)t/n,\,kt/n)\}}X_{i},\qquad k=1,\,2,\,\ldots,\,n. (1)

We call an observation over a subinterval a subacquisition. Fig. 1(c) illustrates the partition of [0,t][0,t] into subintervals and Fig. 1(d) illustrates the resulting discrete-time process for the realization of the underlying process in Fig. 1(a). Because of the independence of a Poisson process over disjoint intervals, {Yk}k=1n\{Y_{k}\}_{k=1}^{n} is an independent and identically distributed (i.i.d.) process. We can also view {Yk}k=1n\{Y_{k}\}_{k=1}^{n} as a marked Bernoulli process, where {Yk>0}\{Y_{k}>0\} indicates an arrival in discrete time slot kk and the “mark” is then the (nonzero) value YkY_{k}.

The abstraction described here applies similarly to SEM and FIB microscopy. The main difference is the typical values of the mean SE yield η\eta. In SEM, neglecting topographical effects (which tend to increase yield), for a sample with atomic number up to 83, the SE yield at the maximizing electron energy is typically 0.6 to 2 [24]. In FIB microscopy, SE yield is typically between 1 and 8 [25]. The advantages of TR measurements that are established in this paper diminish at smaller η\eta values. Furthermore, in FIB microscopy, sample damage is more of an impediment to improving image quality by increasing dose [12, 13, 14]. Thus, we concentrate on FIB microscopy.

II-B Measurement Models

We consider four measurement models for the probabilistic experiment described in Section II-A:

  • Oracle: Observe

    {M,(T1,X1),(T2,X2),,(TM,XM)}.\{M,\,(T_{1},X_{1}),\,(T_{2},X_{2}),\,\ldots,\,(T_{M},X_{M})\}. (2)

    Though no current instrument provides this information, this measurement model provides a useful benchmark.

  • Conventional: Observe only

    Y=i=iMXi.Y=\sum_{i=i}^{M}X_{i}. (3)

    This would be the standard operation of a FIB microscope that has direct detection of SEs. Note that

    Y=i=1M~X~iY=\sum_{i=1}^{\widetilde{M}}\widetilde{X}_{i} (4)

    and

    Y=k=1nYkY=\sum_{k=1}^{n}Y_{k} (5)

    are equivalent to the definition of YY. As the sum of a Poisson(λ)\operatorname{Poisson}(\lambda) number of mutually independent Poisson(η)\operatorname{Poisson}(\eta) random variables, (3) is the simplest of the expressions.

  • Continuous-time time-resolved (CTTR): Observe

    {M~,(T~1,X~1),(T~2,X~2),(T~M~,X~M~)}.\{\widetilde{M},\,(\widetilde{T}_{1},\widetilde{X}_{1}),\,(\widetilde{T}_{2},\widetilde{X}_{2}),\,\ldots\,\,(\widetilde{T}_{\widetilde{M}},\widetilde{X}_{\widetilde{M}})\}. (6)

    This is an idealization of a FIB microscope with direct detection of SEs with perfect temporal precision.

  • Discrete-time time-resolved (DTTR): Observe

    {Y1,Y2,,Yn}.\{Y_{1},\,Y_{2},\,\ldots,\,Y_{n}\}. (7)

    This is a model for the use of a FIB microscope to collect a set of low-dose subacquisitions.

Having established these abstractions, we can reiterate that our principal goal is to demonstrate substantial improvements from time-resolved measurements. In our previous work [19], we introduced the concept of DTTR measurement along with estimators to apply with these measurements, and we showed empirical improvements over the trivial estimator that is routinely applied with conventional measurements. The analysis of estimators in that work is limited, and certain theoretical assertions are made without proofs. The CTTR model introduced here is easier to analyze and represents a bound for what can be done with DTTR measurements. We also provide new analyses of estimators for the DTTR model. Through these results and Monte Carlo simulations, the convergence of DTTR estimators to CTTR estimators as nn\rightarrow\infty can be understood precisely (see Sections IV-B and IV-D).

II-C Analyses for Oracle Measurement

We initially assume M=m>0M=m>0. Then the oracle measurement (2) includes nonempty sets of ion arrival times {T1,T2,,Tm}\{T_{1},\,T_{2},\,\ldots,\,T_{m}\} and of SE counts {X1,X2,,Xm}\{X_{1},\,X_{2},\,\ldots,\,X_{m}\}. The arrival times have beta distributions, with no dependence on parameter of interest η\eta. Thus, arrival times are immaterial to estimation of η\eta (i.e., redundant with knowing MM). The SE counts are i.i.d. observations with the Poisson(η)\operatorname{Poisson}(\eta) distribution, and it is elementary to show that Y=X1+X2++XmY=X_{1}+X_{2}+\cdots+X_{m} is a sufficient statistic and that Y/mY/m is an efficient estimator of η\eta from the available data. In the case of M=0M=0, there is no basis for any estimate of η\eta, so we must assign some arbitrarily chosen number η0\eta_{0}.

From the arguments above, we define the oracle estimator

η^oracle(M,X1,X2,,XM)={η0,M=0;Y/M,M>0.\widehat{\eta}_{\rm oracle}(M,X_{1},X_{2},\ldots,X_{M})=\left\{\begin{array}[]{@{\,}rl}\eta_{0},&M=0;\\ {{Y}/{M}},&M>0.\end{array}\right. (8)

Conditioned on M=m>0M=m>0, the mean-squared error (MSE) of this estimator is η/m\eta/m because YY has mean mηm\eta and variance mηm\eta. Using the Poisson distribution for MM and the total expectation theorem,

MSE(η^oracle)=eλ(ηη0)2+m=1ηmλmm!eλ.\mathrm{MSE}(\widehat{\eta}_{\rm oracle})=e^{-\lambda}(\eta-\eta_{0})^{2}+\sum_{m=1}^{\infty}\frac{\eta}{m}\frac{\lambda^{m}}{m!}e^{-\lambda}. (9)

For large enough λ\lambda, the arbitrary guess of η0\eta_{0} when M=0M=0 has little impact on the MSE. Approximating the second term using (60) from Appendix A gives

MSE(η^oracle)ηλfor large λ.\mathrm{MSE}(\widehat{\eta}_{\rm oracle})\approx\frac{\eta}{\lambda}\qquad\mbox{for large $\lambda$}. (10)

II-D Analyses for Conventional Measurement

It is straightforward to show that the conventional measurement YY has Neyman Type A probability mass function (PMF)

PY(y;η,λ)=eληyy!m=0(λeη)mmym!,y=0, 1,,\mathrm{P}_{Y}(y\,;\,\eta,\lambda)=\frac{e^{-\lambda}\eta^{y}}{y!}\sum_{m=0}^{\infty}\frac{(\lambda e^{-\eta})^{m}m^{y}}{m!},\quad y=0,\,1,\,\ldots, (11)

mean

E[Y]=λη,\mathrm{E}\!\left[\,{Y}\,\right]=\lambda\eta, (12)

and variance

var(Y)=λη+λη2.\mathrm{var}\!\left({Y}\right)=\lambda\eta+\lambda\eta^{2}. (13)

Starting from (3), (11) follows from the law of total probability, (12) from iterated expectation with conditioning on MM, and (13) from the law of total variance with conditioning on MM [19]. Reaching the same conclusions starting from (4) or (5) involves more complicated computations.

From (12), the baseline estimator

η^baseline(Y)=Yλ\widehat{\eta}_{\rm baseline}(Y)=\frac{Y}{\lambda} (14)

is unbiased, and from (13) its MSE is

MSE(η^baseline)=η(1+η)λ.\mathrm{MSE}(\widehat{\eta}_{\rm baseline})=\frac{\eta(1+\eta)}{\lambda}. (15)

If λ\lambda ions were deterministically incident upon the sample, YY would be a Poisson(λη)\operatorname{Poisson}(\lambda\eta) random variable, with variance λη\lambda\eta, and the MSE of the baseline estimator would be η/λ\eta/\lambda. The excess variance in (13), consistent with experimental observations [26], and excess MSE in (15) are due to the random variation in the number of incident ions or the source shot noise. Our line of work mitigates this noise.

Since estimation under a Neyman Type A observation model is not well known, the potential efficiency of the baseline estimator is not evident. To this end, it is natural to evaluate the Fisher information (FI) about η\eta in YY with λ\lambda as a known parameter, which we denote by Y(η;λ)\mathcal{I}_{Y}(\eta\,;\,\lambda).

The FI is defined as

Y(η;λ)=E[(logPY(y;η,λ)η)2;η],\mathcal{I}_{Y}(\eta\,;\,\lambda)=\mathrm{E}\!\left[\,{\left(\frac{\partial\log\mathrm{P}_{Y}(y\,;\,\eta,\lambda)}{\partial\eta}\right)^{2}\,;\,\eta}\,\right], (16)

where a known non-random parameter in the expectation is emphasized by putting it after a semicolon. From (11),

log\displaystyle\log PY(y;η,λ)\displaystyle\mathrm{P}_{Y}(y\,;\,\eta,\lambda)
=λ+ylogηlogy!+log(m=0(λeη)mmym!).\displaystyle=-\lambda+y\log\eta-\log y!+\log\Bigg{(}\sum_{m=0}^{\infty}\frac{(\lambda e^{-\eta})^{m}m^{y}}{m!}\Bigg{)}.

Then taking the derivative with respect to η\eta, we find that

logPY(y;η,λ)η\displaystyle\frac{\partial\log\mathrm{P}_{Y}(y\,;\,\eta,\lambda)}{\partial\eta} =\displaystyle\!=\! yηm=0m(λeη)mmym!m=0(λeη)mmym!\displaystyle\frac{y}{\eta}-\frac{\sum_{m=0}^{\infty}\dfrac{m(\lambda e^{-\eta})^{m}m^{y}}{m!}}{\sum_{m=0}^{\infty}\dfrac{(\lambda e^{-\eta})^{m}m^{y}}{m!}}
=(a)\displaystyle\!\stackrel{{\scriptstyle(a)}}{{=}}\! yηPY(y+1;η,λ)/eληy+1(y+1)!PY(y;η,λ)/eληyy!\displaystyle\frac{y}{\eta}-\frac{{\mathrm{P}_{Y}(y+1\,;\,\eta,\lambda)}\bigg{/}{\dfrac{e^{-\lambda}\eta^{y+1}}{(y+1)!}}}{{\mathrm{P}_{Y}(y\,;\,\eta,\lambda)}\bigg{/}{\dfrac{e^{-\lambda}\eta^{y}}{y!}}}
=\displaystyle\!=\! yηPY(y+1;η,λ)PY(y;η,λ)y+1η,\displaystyle\frac{y}{\eta}-\frac{\mathrm{P}_{Y}(y+1\,;\,\eta,\lambda)}{\mathrm{P}_{Y}(y\,;\,\eta,\lambda)}\frac{y+1}{\eta},

where (a) follows from (11). The FI is the second moment of the above expression:

Y(η;λ)=y=0(yηPY(y+1;η,λ)PY(y;η,λ)y+1η)2PY(y;η,λ).\mathcal{I}_{Y}(\eta\,;\,\lambda)=\sum_{y=0}^{\infty}\left(\frac{y}{\eta}{-}\frac{\mathrm{P}_{Y}(y+1\,;\,\eta,\lambda)}{\mathrm{P}_{Y}(y\,;\,\eta,\lambda)}\frac{y+1}{\eta}\right)^{\!2}\mathrm{P}_{Y}(y\,;\,\eta,\lambda). (17)

While (17) is not readily comprehensible, it can be used to numerically evaluate Y(η;λ)\mathcal{I}_{Y}(\eta\,;\,\lambda) and to derive certain useful asymptotic approximations and limits. One can interpret the ratio Y(η;λ)/λ\mathcal{I}_{Y}(\eta\,;\,\lambda)/\lambda as the information gain per incident ion. As illustrated in Fig. 2, this normalized Fisher information is a decreasing function of λ\lambda, with

limλ0Y(η;λ)λ=1ηeη\lim_{\lambda\rightarrow 0}\frac{\mathcal{I}_{Y}(\eta\,;\,\lambda)}{\lambda}=\frac{1}{\eta}-e^{-\eta} (18)

and

limλY(η;λ)λ=1η(1+η)=1η11+η.\lim_{\lambda\rightarrow\infty}\frac{\mathcal{I}_{Y}(\eta\,;\,\lambda)}{\lambda}=\frac{1}{\eta(1+\eta)}=\frac{1}{\eta}-\frac{1}{1+\eta}. (19)

Detailed derivations are provided in Appendix B.

Refer to caption
Figure 2: Normalized Fisher information Y(η;λ)/λ\mathcal{I}_{Y}(\eta\,;\,\lambda)/\lambda as a function of λ\lambda for η=3\eta=3. The marked asymptotes are derived in Appendix B.

Using (19) to write

Y(η;λ)λη(1+η)for large λ,\mathcal{I}_{Y}(\eta\,;\,\lambda)\approx\frac{\lambda}{\eta(1+\eta)}\qquad\mbox{for large $\lambda$}, (20)

we have a match to the reciprocal of the MSE in (15). Thus, the baseline estimator achieves the Cramér–Rao bound (CRB) asympotically as λ\lambda\rightarrow\infty, but not otherwise (since the normalized FI is a decreasing function of λ\lambda).

One could seek improved estimators for low λ\lambda or improved lower bounds to demonstrate that substantially better estimators do not exist. We do not pursue those goals here. In practice, improved estimators for low λ\lambda may be of limited interest—even if they improve significantly upon the baseline estimator. For example, with reference to Fig. 2, FI suggests that one may be able to improve upon the baseline estimator by a factor of 3 at dose λ=102\lambda=10^{-2}. However, even the lowest possible MSE would be quite high at such a low dose.

The key observation from Fig. 2 and the limits in (18) and (19) is that low-dose measurements are more informative per incident ion than high-dose measurements. The remainder of the paper studies methods to realize improvements related to this gap while operating at any dose level—not only low dose. Furthermore, notice that the MSE in (15) has a simple inversely proportional relationship with λ\lambda. Most performance bounds and empirical performances in this paper share this simple 1/λ1/\lambda behavior, so we place little emphasis on the performance as λ\lambda is varied. Instead, we concentrate on comparisons among different methods and the performance dependence on η\eta.

III Continuous-Time Time-Resolved Measurement

III-A Measurement Distributions

The CTTR measurement (6) contains the number of incident ions that result in positive detected SEs M~\widetilde{M}, the arrival times of these ions {T~1,T~2,,T~M~}\{\widetilde{T}_{1},\,\widetilde{T}_{2},\,\ldots,\,\widetilde{T}_{\widetilde{M}}\}, and the corresponding SE counts {X~1,X~2,,X~M~}\{\widetilde{X}_{1},\,\widetilde{X}_{2},\,\ldots,\,\widetilde{X}_{\widetilde{M}}\}. As noted in Section II-A, the mutual independence of the arrival times in the underlying process {T1,T2,}\{T_{1},\,T_{2},\,\ldots\} and all events of the form {Xi=0}\{X_{i}=0\} cause the Poisson process property to be preserved.

Since P(Xi=0)=eη\mathrm{P}({X_{i}=0})=e^{-\eta}, the rate Λ\Lambda of the underlying process is reduced to Λ(1eη)\Lambda(1-e^{-\eta}) for the thinned process. For the thinned ion count over dwell time tt, we have M~Poisson(λ(1eη))\widetilde{M}\sim\operatorname{Poisson}(\lambda(1-e^{-\eta})), or more explicitly the PMF

PM~(m~;η,λ)=exp(λ(1eη))(λ(1eη))m~m~!,\mathrm{P}_{\widetilde{M}}(\widetilde{m}\,;\,\eta,\lambda)=\exp(-\lambda(1-e^{-\eta}))\frac{(\lambda(1-e^{-\eta}))^{\widetilde{m}}}{{\widetilde{m}}!}, (21)

for m~=0, 1,\widetilde{m}=0,\,1,\,\ldots.

The distribution of the X~i\widetilde{X}_{i} variables is simply the zero-truncation of the Poisson(η)\operatorname{Poisson}(\eta) distribution:

PX~i(j;η)=eη1eηηjj!,j=1, 2,.\mathrm{P}_{\widetilde{X}_{i}}(j\,;\,\eta)=\frac{e^{-\eta}}{1-e^{-\eta}}\cdot\frac{\eta^{j}}{j!},\qquad j=1,\,2,\,\ldots. (22)

While the interarrival times of the thinned process have a simple exponential distribution, this is not relevant to our estimation tasks: Under the CTTR measurement model, we have M~\widetilde{M} available, and conditioned on M~\widetilde{M}, the thinned arrival times have beta distributions with no dependence on the parameter of interest η\eta.

III-B Fisher Information

We would like to evaluate the FI about η\eta in the CTTR measurement (6) with λ\lambda as a known parameter:

CTTR(η;λ)\displaystyle\mathcal{I}_{\rm CTTR}(\eta\,;\,\lambda) =(a)(T~1,X~1),,(T~M~,X~M~)|M~(η;λ)+M~(η;λ)\displaystyle\stackrel{{\scriptstyle(a)}}{{=}}\mathcal{I}_{(\widetilde{T}_{1},\widetilde{X}_{1}),\ldots,(\widetilde{T}_{\widetilde{M}},\widetilde{X}_{\widetilde{M}})|\widetilde{M}}(\eta\,;\,\lambda)+\mathcal{I}_{\widetilde{M}}(\eta\,;\,\lambda)
=(b)X~1,,X~M~|M~(η;λ)+M~(η;λ)\displaystyle\stackrel{{\scriptstyle(b)}}{{=}}\mathcal{I}_{\widetilde{X}_{1},\ldots,\widetilde{X}_{\widetilde{M}}|\widetilde{M}}(\eta\,;\,\lambda)+\mathcal{I}_{\widetilde{M}}(\eta\,;\,\lambda)
=(c)E[M~]X~i(η;λ)+M~(η;λ)\displaystyle\stackrel{{\scriptstyle(c)}}{{=}}\mathrm{E}\!\left[\,{\widetilde{M}}\,\right]\mathcal{I}_{\widetilde{X}_{i}}(\eta\,;\,\lambda)+\mathcal{I}_{\widetilde{M}}(\eta\,;\,\lambda)
=(d)λ(1eη)X~i(η;λ)+M~(η;λ),\displaystyle\stackrel{{\scriptstyle(d)}}{{=}}\lambda(1-e^{-\eta})\mathcal{I}_{\widetilde{X}_{i}}(\eta\,;\,\lambda)+\mathcal{I}_{\widetilde{M}}(\eta\,;\,\lambda), (23)

where (a) follows from the chain rule for FI [27]; (b) from the conditional distribution of each T~i\widetilde{T}_{i} given M~\widetilde{M} having no dependence on η\eta; (c) from additivity of FI and the independence of {M~,X~1,X~2,,X~M~}\{\widetilde{M},\,\widetilde{X}_{1},\,\widetilde{X}_{2},\,\ldots,\,\widetilde{X}_{\widetilde{M}}\}; and (d) from substitution of the mean of M~\widetilde{M}. Thus, we need to evaluate X~i(η;λ)\mathcal{I}_{\widetilde{X}_{i}}(\eta\,;\,\lambda) and M~(η;λ)\mathcal{I}_{\widetilde{M}}(\eta\,;\,\lambda). The former, which represents the FI for estimating η\eta from X~i\widetilde{X}_{i} is

X~i(η;λ)\displaystyle\mathcal{I}_{\widetilde{X}_{i}}(\eta\,;\,\lambda) =E[(logPX~i(X~i;η,λ)η)2;η]\displaystyle=\mathrm{E}\!\left[\,{\left(\frac{\partial\log\mathrm{P}_{\widetilde{X}_{i}}(\widetilde{X}_{i}\,;\,\eta,\lambda)}{\partial\eta}\right)^{\!2}\,;\,\eta}\,\right]
=(a)j=1(jη11eη)2eη1eηηjj!\displaystyle\stackrel{{\scriptstyle(a)}}{{=}}\sum_{j=1}^{\infty}\left(\frac{j}{\eta}-\frac{1}{1-e^{-\eta}}\right)^{\!2}\frac{e^{-\eta}}{1-e^{-\eta}}\frac{\eta^{j}}{j!}
=11eηη+1η1(1eη)2,\displaystyle=\frac{1}{1-e^{-\eta}}\frac{\eta+1}{\eta}-\frac{1}{(1-e^{-\eta})^{2}}, (24)

where (a) uses the PMF in (22). Similarly, M~(η;λ)\mathcal{I}_{\widetilde{M}}(\eta\,;\,\lambda), which represents the FI for estimating η\eta from M~\widetilde{M}, is

M~(η;λ)=E[(logPM~(M~;η,λ)η)2;η]\displaystyle\mathcal{I}_{\widetilde{M}}(\eta\,;\,\lambda)=\mathrm{E}\!\left[\,{\left(\frac{\partial\log P_{\widetilde{M}}(\widetilde{M}\,;\,\eta,\lambda)}{\partial\eta}\right)^{\!2}\,;\,\eta}\,\right]
=(a)j=0(eη1eηjλeη)2eλ(1eη)[λ(1eη)]jj!\displaystyle\stackrel{{\scriptstyle(a)}}{{=}}\sum_{j=0}^{\infty}\left(\frac{e^{-\eta}}{1-e^{-\eta}}j-\lambda e^{-\eta}\right)^{\!2}\frac{e^{-\lambda(1-e^{-\eta})}[\lambda(1-e^{-\eta})]^{j}}{j!}
=λeηeη1,\displaystyle=\frac{\lambda e^{-\eta}}{e^{\eta}-1}, (25)

where (a) uses the PMF in (21). Finally, substituting (24) and (25) into (III-B) gives

CTTR(η;λ)\displaystyle\mathcal{I}_{\rm CTTR}(\eta\,;\,\lambda) =λ(1ηeη).\displaystyle=\lambda\left(\frac{1}{\eta}-{e^{-\eta}}\right). (26)

Notice that the FI for CTTR measurement matches the low-dose asymptote given in (18). It is exact and holds for all values of λ\lambda. The greater FI for CTTR measurement than for conventional measurement is suggestive of being able to improve upon the baseline estimator (14). In the following sections, we define new estimators and demonstrate their improvements.

III-C Estimators

In this section, we introduce estimators applicable to CTTR measurement (6). From (4), it is clear that the total number of detected SEs YY is available. As we have explained in the derivation of CTTR(η;λ)\mathcal{I}_{\rm CTTR}(\eta\,;\,\lambda) in Section III-B, with M~\widetilde{M} available, there is no additional information about η\eta in the thinned ion incidence times {T~1,T~2,,T~M~}\{\widetilde{T}_{1},\,\widetilde{T}_{2},\,\ldots,\,\widetilde{T}_{\widetilde{M}}\}. Furthermore, with YY and M~\widetilde{M} available, there is no additional information about η\eta in the positive SE counts {X~1,X~2,,X~M~}\{\widetilde{X}_{1},\,\widetilde{X}_{2},\,\ldots,\,\widetilde{X}_{\widetilde{M}}\}. To see this, consider any observation vector (X~1,X~2,,X~M~)=(j1,j2,,jm)(\widetilde{X}_{1},\,\widetilde{X}_{2},\,\ldots,\,\widetilde{X}_{\widetilde{M}})=(j_{1},\,j_{2},\,\ldots,\,j_{m}) conditioned on M~=m~\widetilde{M}=\widetilde{m}. Using independence and (22), the likelihood of the observation (conditioned on M~=m~\widetilde{M}=\widetilde{m}) is

i=1m~PX~i(ji;η)=(eη1eη)m~ηj1+j2++jm~j1!j2!jm~!.\prod_{i=1}^{\widetilde{m}}\mathrm{P}_{\widetilde{X}_{i}}(j_{i}\,;\,\eta)=\left(\frac{e^{-\eta}}{1-e^{-\eta}}\right)^{\!\widetilde{m}}\frac{\eta^{j_{1}+j_{2}+\cdots+j_{\widetilde{m}}}}{j_{1}!\,j_{2}!\,\cdots\,j_{\widetilde{m}}!}. (27)

As a function of η\eta, the dependence on SE counts is only through their sum. Hence, all the estimators in the section depend only on YY and M~\widetilde{M}.

III-C1 Continuous-Time Quotient Mode Estimator

Recall that the oracle estimator (8) divides the total SE count YY by the number of incident ions MM. Using M~\widetilde{M} as a proxy for the number of incident ions yields the continuous-time quotient mode (CTQM) estimator

η^CTQM(M~,Y)={0,M~=0;Y/M~,M~>0.\widehat{\eta}_{\rm CTQM}(\widetilde{M},Y)=\left\{\begin{array}[]{@{\,}rl}0,&\widetilde{M}=0;\\ {{Y}/{\widetilde{M}}},&\widetilde{M}>0.\end{array}\right. (28)

Note that the 0 estimate for M~=0\widetilde{M}=0 is not arbitrary, it is the ML estimate of η\eta for this case. The name “quotient mode” is to acknowledge a similar concept in a presentation by John Notte of Zeiss [28] and in a patent application [29].

III-C2 Continuous-Time Lambert Quotient Mode Estimator

The CTTR measurement observes a thinned version of the underlying ion incidence process, so M~M\widetilde{M}\leq M. We can do better than to use M~\widetilde{M} as a proxy for MM. Since

E[M~|M]=(1eη)M,\mathrm{E}\!\left[\,{\widetilde{M}\,|\,M}\,\right]=(1-e^{-\eta})M,

(1eη)1M~(1-e^{-\eta})^{-1}\widetilde{M} would be an unbiased proxy for MM. Unfortunately, this has dependence on η\eta, which is not known. In the spirit of the oracle and CTQM estimators, we may seek an estimate η^\widehat{\eta} that satisfies

η^=Y(1eη^)1M~.\widehat{\eta}=\frac{Y}{(1-e^{-\widehat{\eta}})^{-1}\widetilde{M}}.

The solution of this equation gives the continuous-time Lambert quotient mode (CTLQM) estimator:

η^CTLQM=W(η^CTQMeη^CTQM)+η^CTQM,\widehat{\eta}_{\rm CTLQM}=W(-\widehat{\eta}_{\rm CTQM}e^{-\widehat{\eta}_{\rm CTQM}})+\widehat{\eta}_{\rm CTQM}, (29)

where W()W(\cdot) is the Lambert W function [30].

III-C3 Continuous-Time Maximum Likelihood Estimator

Rather than use a heuristic approximation for the number of incident ions MM, one could instead use the statistically principled ML estimation approach as follows. The ML estimate is the value of η\eta that maximizes the joint likelihood of the full CTTR observation.

We have already seen that we can drop the times {T~1,T~2,T~M~}\{\widetilde{T}_{1},\,\widetilde{T}_{2},\,\ldots\,\widetilde{T}_{\widetilde{M}}\}, and the conditional likelihood of {X~1,X~2,X~M~}\{\widetilde{X}_{1},\,\widetilde{X}_{2},\,\ldots\,\widetilde{X}_{\widetilde{M}}\} given M~\widetilde{M} was given in (27). Thus, we must maximize the product of (21) and (27) over η\eta. For observation (M~,X~1,X~2,,X~M~)=(m~,j1,j2,,jm~)(\widetilde{M},\,\widetilde{X}_{1},\,\widetilde{X}_{2},\,\ldots,\,\widetilde{X}_{\widetilde{M}})=(\widetilde{m},\,j_{1},\,j_{2},\,\ldots,\,j_{\widetilde{m}}), by dropping factors that do not depend on η\eta, we obtain continuous-time ML (CTML) estimator

η^CTML=argmaxηeλ(1eη)em~ηηy,\widehat{\eta}_{\rm CTML}=\operatorname*{arg\,max}_{\eta}e^{-\lambda(1-e^{-\eta})}e^{-{\widetilde{m}}\eta}\eta^{y},

where y=j1+j2++jm~y=j_{1}+j_{2}+\cdots+j_{\widetilde{m}}. The unique maximizer satisfies

η^CTML=YM~+λeη^CTML,\displaystyle\widehat{\eta}_{\rm CTML}=\frac{Y}{\widetilde{M}+\lambda e^{-\widehat{\eta}_{\rm CTML}}}, (30)

which can be solved by using an appropriate root-finding algorithm.

III-D Analyzing the Continuous-Time Quotient Mode Estimator

By computing the MSE of η^CTQM\widehat{\eta}_{\rm CTQM}, we can evaluate the efficacy of the quotient mode estimator. We begin by noting that

MSE(η^CTQM)=bias(η^CTQM)2+var(η^CTQM).\displaystyle\mathrm{MSE}(\widehat{\eta}_{\rm CTQM})=\operatorname{bias}(\widehat{\eta}_{\rm CTQM})^{2}+\mathrm{var}\!\left({\widehat{\eta}_{\rm CTQM}}\right). (31)

As detailed in Appendix C, the bias is given by

bias(η^CTQM)\displaystyle\operatorname{bias}(\widehat{\eta}_{\rm CTQM}) =E[η^CTQM]η\displaystyle=\mathrm{E}\!\left[\,{\widehat{\eta}_{\rm CTQM}}\,\right]-\eta
=(eηeλ(1eη)1eη)η.\displaystyle=\left(\frac{e^{-\eta}-e^{-\lambda(1-e^{-\eta})}}{1-e^{-\eta}}\right)\eta. (32)

For fixed η\eta, this is a nonzero bias even as dose λ\lambda\rightarrow\infty, consistent with the motivation for defining the CTLQM estimator to improve upon the CTQM estimator. If η\eta\rightarrow\infty as well, the bias vanishes, which is consistent with the convergence in distribution of M~\widetilde{M} to MM.

As also detailed in Appendix C, the variance is given by

var(η^CTQM)\displaystyle\mathrm{var}\!\left({\widehat{\eta}_{\rm CTQM}}\right) =η2ρ2eλρ(1eλρ)\displaystyle=\frac{\eta^{2}}{\rho^{2}}e^{-\lambda\rho}(1-e^{-\lambda\rho})
+η(ρηeη)ρ2j=11jeλρ(λρ)jj!,\displaystyle\quad+\frac{\eta(\rho-\eta e^{-\eta})}{\rho^{2}}\sum_{j=1}^{\infty}\frac{1}{j}\frac{e^{-\lambda\rho}(\lambda\rho)^{j}}{j!}, (33)

where we introduce

ρ=P(Xi>0)=1eη\rho=\mathrm{P}({X_{i}>0})={1-e^{-\eta}}

as a shorthand to make certain expressions more compact. Expression (33) can be combined with (60) from Appendix A to show that the variance vanishes as λ\lambda\rightarrow\infty, decaying asymptotically as 1/λ\sim 1/\lambda. However, because of nonzero bias, the MSE is not inversely proportion to λ\lambda, and the MSE of η^CTQM\widehat{\eta}_{\rm CTQM} relative to other estimates depends on λ\lambda. Substituting (III-D) and (33) into (31) gives an expression for the MSE of the CTQM estimator:

MSE(η^CTQM)=\displaystyle\mathrm{MSE}(\widehat{\eta}_{\rm CTQM})= η2ρ2(eηeλρ)2+η2ρ2eλρ(1eλρ)\displaystyle\ \frac{\eta^{2}}{\rho^{2}}\left(e^{-\eta}-e^{-\lambda\rho}\right)^{\!2}+\frac{\eta^{2}}{\rho^{2}}e^{-\lambda\rho}(1-e^{-\lambda\rho})
+η(ρηeη)ρ2j=11jeλρ(λρ)jj!.\displaystyle+\frac{\eta(\rho-\eta e^{-\eta})}{\rho^{2}}\sum_{j=1}^{\infty}\frac{1}{j}\frac{e^{-\lambda\rho}(\lambda\rho)^{j}}{j!}. (34)

Substituting the upper bound (61) from Appendix A for the series in (34) gives

MSE(η^CTQM)<\displaystyle\mathrm{MSE}(\widehat{\eta}_{\rm CTQM})< η2ρ2(eηeλρ)2+η2ρ2eλρ(1eλρ)\displaystyle\ \frac{\eta^{2}}{\rho^{2}}\left(e^{-\eta}-e^{-\lambda\rho}\right)^{\!2}+\frac{\eta^{2}}{\rho^{2}}e^{-\lambda\rho}(1-e^{-\lambda\rho})
+0.518η(ρηeη)ρ2.\displaystyle+0.518\frac{\eta(\rho-\eta e^{-\eta})}{\rho^{2}}. (35)
Refer to caption
(a) MSE across η\eta.
Refer to caption
(b) Bias across η\eta.
Refer to caption
(c) Variance across η\eta.
Figure 3: Comparison of continuous-time time-resolved measurement estimators as a function of η\eta. Conventional, oracle, continuous-time quotient mode, continuous-time Lambert quotient mode, and continuous-time maximum likelihood estimators are simulated for dose rate Λ=1/1600\Lambda=1/1600 ions per ns and dwell time t=32 000t=32\,000 ns, hence total dose λ=20\lambda=20 ions per pixel. (a) MSE. (b) Bias. (Conventional estimator omitted because its bias is zero.) (c) Variance.

III-E Numerical Comparisons of Estimators

To demonstrate the benefits afforded by CTTR measurement, in Fig. 3 we compare the conventional, oracle, CTQM, CTLQM and CTML estimators across ground truth η[0,10]\eta\in[0,10]. The MSE values in Fig. 3(a) are computed from 150 000150\,000 independent Monte Carlo trials, using a dose rate Λ=1/1600\Lambda=1/1600 ions per ns and dwell time t=32 000t=32\,000 ns, for a total dose λ=20\lambda=20 ions per pixel. The curve for η^baseline\widehat{\eta}_{\rm baseline} matches the theoretical MSE expression (15). Although unimplementable, the curve for η^oracle\widehat{\eta}_{\rm oracle} also matches the theoretical MSE in (9).111We did not need to choose a value for η0\eta_{0} because the event {M=0}\{M=0\}, which has probability e202109e^{-20}\approx 2\cdot 10^{-9}, did not occur in any of the trials. We can observe that η^CTQM\widehat{\eta}_{\rm CTQM} has a large MSE for small η\eta, caused largely by M~\widetilde{M} severely underestimating MM in these cases. Predictably, η^CTLQM\widehat{\eta}_{\rm CTLQM} reduces MSE tremendously for small values of η\eta because the role of M~\widetilde{M} is modified the most in these cases. For low η\eta (about 0 to 2.5), η^CTML\widehat{\eta}_{\rm CTML} achieves lowest MSE amongst all implementable estimators; for moderate η\eta (about 2.5 to 5.5), η^CTQM\widehat{\eta}_{\rm CTQM} is slightly better than the others; and for large η\eta (about 5.5 and above), η^CTQM\widehat{\eta}_{\rm CTQM}, η^CTLQM\widehat{\eta}_{\rm CTLQM} and η^CTML\widehat{\eta}_{\rm CTML} all give nearly identical performance. It is noteworthy that CTQM converges with the oracle at η\eta above about 3.0, though the oracle is unimplementable in practice.

The MSE trends and comparisons can be better understood through the biases in Fig. 3(b) and variances in Fig. 3(c). Curves for η^CTQM\widehat{\eta}_{\rm CTQM} coincide with the bias and variance expressions derived in (III-D) and (33). As previously noted, the large bias of η^CTQM\widehat{\eta}_{\rm CTQM} for small values of η\eta is caused by the number of incident ions MM being severely underestimated by M~\widetilde{M}. The bias of η^CTQM\widehat{\eta}_{\rm CTQM} can be corrected by the use of η^CTLQM\widehat{\eta}_{\rm CTLQM}. The dashed cyan-colored curve in Fig. 3(c) is the CRB for any unbiased estimator, which is the reciprocal of (26). The variances of the implementable and approximately unbiased η^CTLQM\widehat{\eta}_{\rm CTLQM} and η^CTML\widehat{\eta}_{\rm CTML} estimators approximately coincide with the CRB.

IV Discrete-Time Time-Resolved Measurement

IV-A Measurement Distributions

The DTTR measurement (7) is a length-nn vector of SE counts collected over subacquisition dwell times of t/nt/n. Thus, some modeling and analysis for DTTR measurement follows from scaling of λ\lambda in expressions from Section II-D. Since {Y1,Y2,,Yn}\{Y_{1},\,Y_{2},\,\ldots,\,Y_{n}\} are independent, their joint PMF is simply

PY1,Y2,,Yn(y1,y2,,yn;η,λ)=k=1nPY(yk;η,λ/n),\mathrm{P}_{Y_{1},Y_{2},\ldots,Y_{n}}(y_{1},y_{2},\ldots,y_{n}\,;\,\eta,\lambda)=\prod_{k=1}^{n}\mathrm{P}_{Y}(y_{k}\,;\,\eta,\lambda/n), (36)

written in terms of the PMF in (11).

IV-B Fisher Information

To evaluate the FI about η\eta in the DTTR measurement (7) with λ\lambda as a known parameter is also quite simple. Because FI is additive over independent observations,

DTTR(η;λ,n)\displaystyle\mathcal{I}_{\rm DTTR}(\eta\,;\,\lambda,n) =nY(η;λ/n),\displaystyle=n\,\mathcal{I}_{Y}(\eta\,;\,\lambda/n), (37)

expressed in terms of the FI in (17). While this FI inherits the complexity and lack of interpretability of (17), the distinction is that the relevant dose parameter in Y\mathcal{I}_{Y} has been reduced from λ\lambda to λ/n\lambda/n, so it is more reasonable to approximate with the low-dose asymptote (18). Specifically, we can write

DTTR(η;λ,n)\displaystyle\mathcal{I}_{\rm DTTR}(\eta\,;\,\lambda,n) =λY(η;λ/n)λ/n\displaystyle=\lambda\,\frac{\mathcal{I}_{Y}(\eta\,;\,\lambda/n)}{\lambda/n}
λ(1ηeη),\displaystyle\approx\lambda\left(\frac{1}{\eta}-e^{-\eta}\right), (38)

where the approximation holds for large enough nn because of (18). Note that (38) has the same expression as (26), so as nn\rightarrow\infty, the FI of DTTR measurement converges from below to the FI of CTTR measurement.

IV-C Estimators

In this section, we present estimators applicable to DTTR measurement (7) that were first introduced in [19]. The subsequent analyses are new.

IV-C1 Discrete-Time Quotient Mode Estimator

Similar to the principle behind the CTQM estimator (28), the number of subacquisitions with positive SEs,

L=k=1n𝟙{Yk>0},L=\sum_{k=1}^{n}\mathbbm{1}_{\{Y_{k}>0\}}, (39)

can be a proxy for the number of incident ions MM. The discrete-time quotient mode (DTQM) estimator is then defined as

η^DTQM(L,Y)={0,L=0;Y/L,L>0.\widehat{\eta}_{\rm DTQM}(L,Y)=\left\{\begin{array}[]{@{\,}rl}0,&L=0;\\ {{Y}/{L}},&L>0.\end{array}\right. (40)

IV-C2 Discrete-Time Lambert Quotient Mode Estimator

At low η\eta, MM is severely underestimated by LL. Correspondingly, the discrete-time Lambert quotient mode (DTLQM) estimator is obtained by incorporating the correction factor (1eη)1(1-e^{-\eta})^{-1},

η^=Y(1eη^)1L.\widehat{\eta}=\frac{Y}{(1-e^{-\widehat{\eta}})^{-1}L}.

The solution of this equation gives the estimator:

η^DTLQM=W(η^DTQMeη^DTQM)+η^DTQM.\widehat{\eta}_{\rm DTLQM}=W(-\widehat{\eta}_{\rm DTQM}e^{-\widehat{\eta}_{\rm DTQM}})+\widehat{\eta}_{\rm DTQM}. (41)

IV-C3 Discrete-Time Maximum Likelihood Estimator

The discrete-time ML (DTML) estimate is the value of η\eta that maximizes the joint likelihood in (36):

η^DTML=argmaxηk=1nPY(yk;η,λ/n),\displaystyle\widehat{\eta}_{\rm DTML}=\operatorname*{arg\,max}_{\eta}\,\prod_{k=1}^{n}\mathrm{P}_{Y}(y_{k}\,;\,\eta,\lambda/n), (42)

where PY(;,)\mathrm{P}_{Y}(\cdot\,;\,\cdot,\cdot) is given by (11). Unlike in the CTTR case, we have no fixed-point form for the estimator. Instead, it can be computed by direct numerical optimization. Since the decision variable is scalar, even a simple grid search is not impractical.

IV-D Analyzing the Discrete-Time Quotient Mode Estimator

Like in Section III-D, we analyze the MSE of η^DTQM\widehat{\eta}_{\rm DTQM} by finding expressions for its bias and variance. In both calculations, we make use of a zero-truncated modification of the Neyman Type A distribution at ion incidence parameter λ/n\lambda/n. Let p=P(Yk>0)p=\mathrm{P}({Y_{k}>0}). Then using (11), we find

p=1PY(0;η,λ/n)=1exp(λn(1eη)).p=1-\mathrm{P}_{Y}(0\,;\,\eta,\lambda/n)=1-\exp\!\left(-\frac{\lambda}{n}(1-e^{-\eta})\right)\!. (43)

If Y~k\widetilde{Y}_{k} is the zero-truncated version of YkY_{k}, then its PMF is

PY~k(yk)=1peλ/nηykyk!m=0(λneη)mmykm!,yk=1,2,,\mathrm{P}_{\widetilde{Y}_{k}}(y_{k})=\frac{1}{p}\frac{e^{-{{\lambda}/{n}}}\eta^{y_{k}}}{y_{k}!}\sum_{m=0}^{\infty}\frac{(\frac{\lambda}{n}e^{-\eta})^{m}m^{y_{k}}}{m!},\quad y_{k}=1,2,\ldots, (44)

its mean is

E[Y~k]=1pληn,\mathrm{E}[\,{\widetilde{Y}_{k}}\,]=\frac{1}{p}\frac{\lambda\eta}{n}, (45)

and its variance is

var(Y~k)=1p(λnη+λnη2+(λnη)2)1p2(λnη)2.\mathrm{var}({\widetilde{Y}_{k}})=\frac{1}{p}\left(\frac{\lambda}{n}\eta+\frac{\lambda}{n}\eta^{2}+\left(\frac{\lambda}{n}\eta\right)^{2}\right)-\frac{1}{p^{2}}\left(\frac{\lambda}{n}\eta\right)^{2}. (46)

Furthermore, as a sum of independent indicator random variables, LL is a binomial random variable with nn trials and success probability pp for each trial.

For >0\ell>0,

E[η^DTQM|L=]\displaystyle\mathrm{E}\!\left[\,{\widehat{\eta}_{\rm DTQM}\,|\,L=\ell}\,\right] =E[YL|L=]=1E[Y|L=]\displaystyle=\mathrm{E}\!\left[\,{\frac{Y}{L}\,|\,L=\ell}\,\right]=\frac{1}{\ell}\mathrm{E}\!\left[\,{Y\,|\,L=\ell}\,\right]
=(a)1E[j=1Y~j]=E[Y~k],\displaystyle\stackrel{{\scriptstyle(a)}}{{=}}\frac{1}{\ell}\mathrm{E}\!\left[\,{\sum_{j=1}^{\ell}\widetilde{Y}_{j}}\,\right]=\mathrm{E}\!\left[\,{\widetilde{Y}_{k}}\,\right], (47)

where (a) follows from using Y~j\widetilde{Y}_{j} to denote the jjth positive subacquistion YkY_{k}. Trivially,

E[η^DTQM|L=0]=0.\mathrm{E}\!\left[\,{\widehat{\eta}_{\rm DTQM}\,|\,L=0}\,\right]=0. (48)

Using P(L>0)=1(1p)n\mathrm{P}({L>0})=1-(1-p)^{n} and the total expectation theorem to combine (47) and (48) gives

E[η^DTQM]=E[Y~k](1(1p)n)).\mathrm{E}\!\left[\,{\widehat{\eta}_{\rm DTQM}}\,\right]=\mathrm{E}[\,{\widetilde{Y}_{k}}\,](1-(1-p)^{n})).

The bias of η^DTQM\widehat{\eta}_{\rm DTQM} is thus given by

bias(η^DTQM)\displaystyle\operatorname{bias}(\widehat{\eta}_{\rm DTQM}) =E[η^DTQM]η\displaystyle=\mathrm{E}\!\left[\,{\widehat{\eta}_{\rm DTQM}}\,\right]-\eta
=ληnp[1(1p)n]η,\displaystyle=\frac{\lambda\eta}{np}\left[1-(1-p)^{n}\right]-\eta, (49)

where (45) has been substituted.

Similar arguments, detailed in Appendix D, yield

var(η^DTQM)\displaystyle\mathrm{var}\!\left({\widehat{\eta}_{\rm DTQM}}\right) =(E[Y~k])2[1(1p)n](1p)n\displaystyle=\left(\mathrm{E}[\,{\widetilde{Y}_{k}}\,]\right)^{2}\left[1-(1-p)^{n}\right](1-p)^{n}
+var(Y~k)=1n1(n)p(1p)n,\displaystyle\quad+\mathrm{var}({\widetilde{Y}_{k}})\sum_{\ell=1}^{n}\frac{1}{\ell}\binom{n}{\ell}p^{\ell}(1-p)^{n-\ell}, (50)

where E[Y~k]\mathrm{E}[\,{\widetilde{Y}_{k}}\,] is given in (45) and var(Y~k)\mathrm{var}({\widetilde{Y}_{k}}) is given in (46). Since the behavior of the series in (IV-D) for large nn is not evident, we also derive in Appendix D the lower bound

var(η^DTQM)\displaystyle\mathrm{var}\!\left({\widehat{\eta}_{\rm DTQM}}\right) (E[Y~k])2[1(1p)n](1p)n\displaystyle\geq\left(\mathrm{E}[\,{\widetilde{Y}_{k}}\,]\right)^{2}\left[1-(1-p)^{n}\right](1-p)^{n}
+var(Y~k)[1(1p)n]2np.\displaystyle\quad+\mathrm{var}({\widetilde{Y}_{k}})\frac{[1-(1-p)^{n}]^{2}}{np}. (51)

The expressions in (IV-D) and (IV-D) can be added to the square of the bias from (IV-D) to obtain the MSE of η^DTQM\widehat{\eta}_{\rm DTQM} and a lower bound for this MSE.

Refer to caption
(a) MSE across nn
Refer to caption
(b) Bias across nn
Refer to caption
(c) Variance across nn
Figure 4: MSE, bias, and variance of η^DTQM\widehat{\eta}_{\rm DTQM} as functions of the number of subacquisitions nn and those of η^CTQM\widehat{\eta}_{\rm CTQM} (dashed lines) for λ=10\lambda=10 and η=5\eta=5. The Cramér–Rao lower bound for time-resolved measurements (yellow, see (37)) is plotted as well.

High nn Limits

One of the themes of this paper is that the CTTR measurement model is easier to analyze than the DTTR measurement model, yet it gives expressions relevant to understanding the practical DTTR measurement setting. We would like to examine properties of η^DTQM\widehat{\eta}_{\rm DTQM} at the high nn limit, in part to demonstrate that we obtain matches to η^CTQM\widehat{\eta}_{\rm CTQM} behavior.

For the limit of the bias, we will use two facts proven in Appendix E:

limnnp\displaystyle\lim_{n\to\infty}np =λ(1eη),\displaystyle=\lambda(1-e^{-\eta}), (52)
limn(1p)n\displaystyle\lim_{n\to\infty}(1-p)^{n} =eλ(1eη).\displaystyle=e^{-\lambda(1-e^{-\eta})}. (53)

By substituting (52) and (53) into (IV-D),

limnbias(η^DTQM)\displaystyle\lim_{n\to\infty}\operatorname{bias}(\widehat{\eta}_{\rm DTQM}) =ληλ(1eη)(1eλ(1eη))η\displaystyle=\frac{\lambda\eta}{\lambda(1-e^{-\eta})}(1-e^{-\lambda(1-e^{-\eta})})-\eta
=(eηeλ(1eη)1eη)η,\displaystyle=\left(\frac{e^{-\eta}-e^{-\lambda(1-e^{-\eta})}}{1-e^{-\eta}}\right)\eta, (54)

an exact match to bias(η^CTQM)\operatorname{bias}(\widehat{\eta}_{\rm CTQM}) in (III-D).

Refer to caption
(a) MSE across η\eta
Refer to caption
(b) Bias across η\eta
Refer to caption
(c) Variance across η\eta
Figure 5: Comparison of discrete-time time-resolved estimators as a function of η\eta. Conventional, oracle, discrete-time quotient mode, discrete-time Lambert quotient mode, and discrete-time maximum likelihood estimators are simulated for total dose λ=20\lambda=20 split over n=200n=200 subacquisitions. (a) MSE. (b) Bias. (Conventional estimator excluded because its bias is zero.) (c) Variance.

For the limit of the variance, we will use two additional facts proven in Appendix E:

limnE[Y~k]\displaystyle\lim_{n\to\infty}\mathrm{E}[\,{\widetilde{Y}_{k}}\,] =η1eη,\displaystyle=\frac{\eta}{1-e^{-\eta}}, (55)
limnvar(Y~k)\displaystyle\lim_{n\to\infty}\mathrm{var}({\widetilde{Y}_{k}}) =η(η+η2)eη(1eη)2.\displaystyle=\frac{\eta-(\eta+\eta^{2})e^{-\eta}}{(1-e^{-\eta})^{2}}. (56)

Along with substitution of limits, we recognize that the summand in (IV-D) includes a binomial probability for which there is a Poisson limit because npnp approaches a positive constant [31]:

(n)p(1p)n(λ(1eη))!eλ(1eη).\binom{n}{\ell}p^{\ell}(1-p)^{n-\ell}\rightarrow\frac{(\lambda(1-e^{-\eta}))^{\ell}}{\ell!}e^{-\lambda(1-e^{-\eta})}. (57)

It is now tedious but straightforward to substitute (52), (53), (55), (56), and (57) into (IV-D) to obtain

limnvar(η^DTQM)\displaystyle\lim_{n\to\infty}\mathrm{var}\!\left({\widehat{\eta}_{\rm DTQM}}\right) =var(η^CTQM),\displaystyle=\mathrm{var}\!\left({\widehat{\eta}_{\rm CTQM}}\right), (58)

where var(η^CTQM)\mathrm{var}\!\left({\widehat{\eta}_{\rm CTQM}}\right) is given in (33).

IV-E Numerical Comparisons of Estimators

Fig. 4 shows the MSE, bias, and variance of η^DTQM\widehat{\eta}_{\rm DTQM} as functions of the number of subacquisitions nn when λ=10\lambda=10 and η=5\eta=5. Fig. 4(b) shows the bias approaching the asymptote given by (IV-D). Fig. 4(a) and Fig. 4(c) show the MSE and variance and their lower bounds based on (IV-D). When nn is sufficiently large, these are small and close to the Cramér–Rao  bound.

Fig. 5 compares the conventional, oracle, DTQM, DTLQM, and DTML estimators across ground truth η[0,10]\eta\in[0,10]. MSE, bias, and variance are computed by Monte Carlo simulation using total dose λ=20\lambda=20 split over n=200n=200 subacquisitions. The conventional curves match those in the CT setting in Fig. 3. The DTQM estimator has a large bias for smaller η\eta, which is absent from the DTLQM estimator. Unlike in CT, where the bias for all the studied estimators vanish at moderate and high η\eta, both the DTQM and DTLQM estimator have substantial bias that is approximately linear in η\eta for moderate and high η\eta. The lack of bias of the DTML estimator explains its uniform superiority over the DTQM and DTLQM estimators; this contrasts with the CT setting in which CTQM, CTLQM, and CTML estimators have nearly equal MSE at moderate and high η\eta.

Absolute errors of time-resolved estimators
Ground truth DT n=50n=50 DT n=100n=100 DT n=200n=200 DT n=500n=500 CT

QM

Absolute error of
conventional estimator

LQM

Absolute error of
oracle estimator

ML

Refer to caption
Refer to caption
MSE=1.111\mathrm{MSE}=1.111
Refer to caption
MSE=0.566\mathrm{MSE}=0.566
Refer to caption
MSE=0.392\mathrm{MSE}=0.392
Refer to caption
MSE=0.313\mathrm{MSE}=0.313
Refer to caption
MSE=0.272\mathrm{MSE}=0.272
Refer to caption
MSE=0.599\mathrm{MSE}=0.599
Refer to caption
MSE=0.877\mathrm{MSE}=0.877
Refer to caption
MSE=0.398\mathrm{MSE}=0.398
Refer to caption
MSE=0.259\mathrm{MSE}=0.259
Refer to caption
MSE=0.206\mathrm{MSE}=0.206
Refer to caption
MSE=0.181\mathrm{MSE}=0.181
Refer to caption
MSE=0.135\mathrm{MSE}=0.135
Refer to caption
MSE=0.246\mathrm{MSE}=0.246
Refer to caption
MSE=0.211\mathrm{MSE}=0.211
Refer to caption
MSE=0.191\mathrm{MSE}=0.191
Refer to caption
MSE=0.178\mathrm{MSE}=0.178
Refer to caption
MSE=0.166\mathrm{MSE}=0.166
Figure 6: Simulated FIB microscopy experiment with time-resolved estimators under discrete- and continuous-time settings. Aside from the ground truth in the upper-left corner, with η[1,8]\eta\in[1,8], all images are of the absolute value of the error. All results are for total dose λ=20\lambda=20 mean incident ions per pixel. The columns of time-resolved estimators are for increasing numbers of subacquisitions nn, culminating in the limiting continuous-time case. Quotient mode, Lambert quotient mode, maximum likelihood estimators are compared, with the conventional estimator (14) and oracle estimator (8) provided for context. None of these estimators include spatial regularization.

V Simulated Microscopy Results

Figures 3 and 5 show that time-resolved estimators improve upon the conventional processing of abstracted FIB microscope data, and Fig. 4 shows that the performances of DTTR estimators improve with increasing numbers of subacquisitions, converging to the performances of corresponding CTTR estimators. We conclude with visual results to demonstrate these properties in simulated FIB microscopy experiments.

We use the Hairstyle222A SEM image of the upper part of the style and stigma from an Arabidopsis flower, https://www.flickr.com/photos/fei_company/9316514268/in/set-72157634429801580/ image from ThermoFisher Scientific (upper-left of Fig 6) as the ground truth image, scaled to have SE yield η[1, 8]\eta\in[1,\,8]. All experiments use total dose λ=20\lambda=20. Fig. 6 displays absolute error images for quotient mode, Lambert quotient mode and maximum likelihood estimators. CTTR measurements are simulated and, by division of the dwell time, also interpreted as DTTR measurements for n{50, 100, 200, 500}n\in\{50,\,100,\,200,\,500\} subacquisitions. Fig. 7 complements Fig. 6 by plotting MSEs as functions of nn for these and additional values of nn.

In Fig. 6, each estimator shows improving performance as nn increases, with n=500n=500 coming close to the CT performance. With nn increased to 2000 in Fig. 7, convergence to CT performance is more clearly indicated. For each value of nn, the ranking of estimators has ML best, LQM second, and QM worst. For n100n\geq 100, all the TR methods perform better than the conventional estimator. One way to summarize is to see that with a factor of 4.4 separating the MSEs of the conventional and oracle estimators, the CTML estimator achieves a 3.6 times lower MSE than the conventional estimator.

Refer to caption
Figure 7: Mean-squared errors as functions of number of subacquisitions nn in simulated FIB microscopy of the image in Fig. 6 using conventional, oracle, quotient mode, Lambert quotient mode, and maximum-likelihood estimators. Total dose is λ=20\lambda=20, and mean secondary yield η\eta of the ground truth image is scaled to [1,8][1,8].

VI Conclusion

In this work, we establish an abstract framework for TR measurement in FIB microscopy with direct electron detection. Through estimation-theoretic analyses, analyses of estimators, and Monte Carlo FIB imaging simulations, we show the extent to which source shot noise can be mitigated by TR measurement methods. The most easily interpreted conclusion comes from the Fisher information of continuous-time TR measurements λ(1/ηeη)\lambda(1/\eta-e^{-\eta}) in (26); when η\eta is not too small, this is only slightly smaller than the λ/η\lambda/\eta, which is equal to the FI that would be obtained with a deterministic incident particle beam. The dependence on mean SE yield η\eta shows that TR methods have greater potential in FIB microscopy than in SEM.

Continuous-time measurement is not implementable in any foreseeable technology. Instead, it is intended as a greatly simpler way to understand the limits of performance under very fine time resolution than to consider nn\rightarrow\infty limits for discrete-time results. Through performance comparisons such as those in Figs. 4, 6, and 7, one can predict the time resolution that is necessary to approach the CT limit within a desired margin. Necessary time resolution can also be understood through the use of Fig. 2 to choose a sufficiently small value for λ/n\lambda/n. For example, for the illustrated value of η=3\eta=3, the normalized Fisher information plot suggests that when the time resolution is fine enough for λ/n<0.1\lambda/n<0.1, at least 83% of the improvement created by time-resolved measurement will be attained.

We study three types of unconventional estimators for both continuous- and discrete-time TR measurements. QM estimators are the simplest to implement and are similar to estimators proposed by Zeiss but not made commercially available. LQM estimators greatly reduce a source of bias at low η\eta and merely require a table lookup to be applied to QM estimates. ML estimators require root-finding or minimization of a non-convex function. The relationships among the estimators are nontrivial: though generally best, the ML estimator does not outperform the others uniformly over η\eta; and though far better at low η\eta, the LQM estimator does not outperform the QM estimator uniformly over η\eta.

The improvements presented here seem to be rooted entirely in making the number of incident ions estimable, and this is potentially applicable even without direct SE detection. Indirect electron detection creates uncertainty in the number of detected SEs, including uncertainty in whether any SEs were detected and hence in whether an ion was incident. However, the experimental results of [19] suggest that the degradation in mitigating source shot noise can be small, since the improvements presented therein are similar to the results presented here. The QM and LQM estimators may be extended to cases in which the probabilistic model relating measurements to numbers of SEs is complicated or uncertain. For an advantage from TR measurement, it may be enough to have a mean instrument output that is monotonically increasing with the number of detected SEs. Then a QM or LQM estimator can use M~\widetilde{M} or (1eη)1M~(1-e^{-\eta})^{-1}\widetilde{M} to normalize a sum of nonlinearly scaled TR measurements to mitigate source shot noise. This is one of several lines of inquiry suggested by the results presented here.

Appendix A High-Dose Performance of Oracle Estimator

Let

g(x)=exm=11mxmm!.g(x)=e^{-x}\sum_{m=1}^{\infty}\frac{1}{m}\frac{x^{m}}{m!}. (59)

This function appears in the performance of the oracle estimator (9) and the variance (33) and MSE (34) of the CTQM estimator. We are interested in approximating and bounding g(x)g(x) to better understand those expressions.

For x1x\ll 1, the first term is dominant, so g(x)xg(x)\approx x. The behavior at large xx is less obvious. The series in (59) converges to Ei(x)γlogx\mathrm{Ei}(x)-\gamma-\log x, where Ei(x)\mathrm{Ei}(x) is the exponential integral function and γ\gamma is the Euler–Mascheroni constant [32]. Asympototically for xx\rightarrow\infty, Ei(x)ex/x\mathrm{Ei}(x)\sim{{e^{x}}/{x}}, meaning that the ratio of the expressions approaches 1. Thus,

g(x)1x.g(x)\sim\frac{1}{x}. (60)

The log-log plot of g(x)g(x) in Fig. 8 shows the accuracies of the low- and high-xx approximations. Being nonnegative, continuous, and vanishing as xx\rightarrow\infty, g(x)g(x) has a finite upper bound:

g(x)0.518,for all x[0,).g(x)\leq 0.518,\qquad\mbox{for all $x\in[0,\infty)$.} (61)
Refer to caption
Figure 8: Function g(x)g(x) in (59) along with low- and high-xx asymptotes.

Appendix B Normalized Fisher Information Limits

B-A Low-Dose Limit

To evaluate limλ0Y(η;λ)/λ\lim_{\lambda\rightarrow 0}{{\mathcal{I}_{Y}(\eta\,;\,\lambda)}/{\lambda}}, we first find λ0\lambda\rightarrow 0 limits of expressions that appear in (17), including both the PMF in (11) and the probability ratio PY(y+1;η,λ)/PY(y;η,λ){{\mathrm{P}_{Y}(y+1\,;\,\eta,\lambda)}/{\mathrm{P}_{Y}(y\,;\,\eta,\lambda)}}.

For y=0y=0,

PY(0;η,λ)\displaystyle\mathrm{P}_{Y}(0\,;\,\eta,\lambda) =eλη00!m=0(λeη)mm0m!\displaystyle=\frac{e^{-\lambda}\eta^{0}}{0!}\sum_{m=0}^{\infty}\frac{(\lambda e^{-\eta})^{m}m^{0}}{m!}
=(a)eλm=0(λeη)mm!=(b)eλexp(λeη),\displaystyle\stackrel{{\scriptstyle(a)}}{{=}}e^{-\lambda}\sum_{m=0}^{\infty}\frac{(\lambda e^{-\eta})^{m}}{m!}\stackrel{{\scriptstyle(b)}}{{=}}e^{-\lambda}\exp(\lambda e^{-\eta}), (62)

where (a) follows from m0=1m^{0}=1; and (b) from identifying the series expansion of the exponential function. Similarly, for y=1y=1,

PY(1;η,λ)\displaystyle\mathrm{P}_{Y}(1\,;\,\eta,\lambda) =eλη11!m=0(λeη)mm1m!\displaystyle=\frac{e^{-\lambda}\eta^{1}}{1!}\sum_{m=0}^{\infty}\frac{(\lambda e^{-\eta})^{m}m^{1}}{m!}
=(eλη)(λeη)exp(λeη),\displaystyle=(e^{-\lambda}\eta)(\lambda e^{-\eta})\exp(\lambda e^{-\eta}), (63)

and for y=2y=2,

PY(2;η,λ)\displaystyle\mathrm{P}_{Y}(2\,;\,\eta,\lambda) =eλη22!m=0(λeη)mm2m!\displaystyle=\frac{e^{-\lambda}\eta^{2}}{2!}\sum_{m=0}^{\infty}\frac{(\lambda e^{-\eta})^{m}m^{2}}{m!}
=eλη22(λeη)(1+λeη)exp(λeη).\displaystyle=\frac{e^{-\lambda}\eta^{2}}{2}(\lambda e^{-\eta})(1+\lambda e^{-\eta})\exp(\lambda e^{-\eta}). (64)

For general y>0y>0,

PY(y;η,λ)\displaystyle\mathrm{P}_{Y}(y\,;\,\eta,\lambda) =eληyy!m=0(λeη)mmym!\displaystyle=\frac{e^{-\lambda}\eta^{y}}{y!}\sum_{m=0}^{\infty}\frac{(\lambda e^{-\eta})^{m}m^{y}}{m!}
=eληyy!(λeη)polyy1(λeη)exp(λeη),\displaystyle=\frac{e^{-\lambda}\eta^{y}}{y!}(\lambda e^{-\eta})\,\mathrm{poly}_{y-1}(\lambda e^{-\eta})\exp(\lambda e^{-\eta}), (65)

where polyy(λeη)\mathrm{poly}_{y}(\lambda e^{-\eta}) is a degree-yy polynomial in λeη\lambda e^{-\eta} with unit constant term. This allows us to conclude, for any y>0y>0,

limλ0PY(y;η,λ)λ=ηyy!eη.\lim_{\lambda\rightarrow 0}\frac{\mathrm{P}_{Y}(y\,;\,\eta,\lambda)}{\lambda}=\frac{\eta^{y}}{y!}e^{-\eta}. (66)

From (62) and (63), we obtain, for y=0y=0,

PY(y+1;η,λ)PY(y;η,λ)=PY(1;η,λ)PY(0;η,λ)=ηλeη.\frac{\mathrm{P}_{Y}(y+1\,;\,\eta,\lambda)}{\mathrm{P}_{Y}(y\,;\,\eta,\lambda)}=\frac{\mathrm{P}_{Y}(1\,;\,\eta,\lambda)}{\mathrm{P}_{Y}(0\,;\,\eta,\lambda)}=\eta\lambda e^{-\eta}. (67)

From (63) and (64), we obtain, for y=1y=1,

PY(y+1;η,λ)PY(y;η,λ)=PY(2;η,λ)PY(1;η,λ)=12η(1+λeη).\frac{\mathrm{P}_{Y}(y+1\,;\,\eta,\lambda)}{\mathrm{P}_{Y}(y\,;\,\eta,\lambda)}=\frac{\mathrm{P}_{Y}(2\,;\,\eta,\lambda)}{\mathrm{P}_{Y}(1\,;\,\eta,\lambda)}=\frac{1}{2}\eta(1+\lambda e^{-\eta}). (68)

For general y>0y>0, it follows from (66) that

limλ0PY(y+1;η,λ)PY(y;η,λ)=ηy+1.\lim_{\lambda\rightarrow 0}\frac{\mathrm{P}_{Y}(y+1\,;\,\eta,\lambda)}{\mathrm{P}_{Y}(y\,;\,\eta,\lambda)}=\frac{\eta}{y+1}. (69)

Now to evaluate limλ0Y(η;λ)/λ\lim_{\lambda\rightarrow 0}{{\mathcal{I}_{Y}(\eta\,;\,\lambda)}/{\lambda}}, we can pass the limit through to each term in (17). The first term is

limλ0(0ηPY(1;η,λ)PY(0;η,λ)1η)2PY(0;η,λ)λ\displaystyle\lim_{\lambda\rightarrow 0}\left(\frac{0}{\eta}-\frac{\mathrm{P}_{Y}(1\,;\,\eta,\lambda)}{\mathrm{P}_{Y}(0\,;\,\eta,\lambda)}\frac{1}{\eta}\right)^{2}\frac{\mathrm{P}_{Y}(0\,;\,\eta,\lambda)}{\lambda}
=(a)limλ0(0ηηλeη1η)2eλexp(λeη)λ=0,\displaystyle\quad\stackrel{{\scriptstyle(a)}}{{=}}\lim_{\lambda\rightarrow 0}\left(\frac{0}{\eta}-\eta\lambda e^{-\eta}\frac{1}{\eta}\right)^{2}\frac{e^{-\lambda}\exp(\lambda e^{-\eta})}{\lambda}=0,

where (a) follows from (62) and (67). By substituting (66) and (69) in (17), the remaining terms give

limλ0Y(η;λ)λ=y=1(yηηy+1y+1η)2ηyy!eη\displaystyle\lim_{\lambda\rightarrow 0}\frac{\mathcal{I}_{Y}(\eta\,;\,\lambda)}{\lambda}=\sum_{y=1}^{\infty}\left(\frac{y}{\eta}-\frac{\eta}{y+1}\frac{y+1}{\eta}\right)^{2}\frac{\eta^{y}}{y!}e^{-\eta}
=y=1(yη1)2ηyy!eη=(eηη1)eη=1ηeη.\displaystyle\quad=\sum_{y=1}^{\infty}\left(\frac{y}{\eta}-1\right)^{2}\frac{\eta^{y}}{y!}e^{-\eta}=\left(\frac{e^{\eta}}{\eta}-1\right)e^{-\eta}=\frac{1}{\eta}-e^{-\eta}.

This proves (18), as desired.

B-B High-Dose Limit

Let us first compute the Fisher information for the parameter η\eta when a Gaussian random variable has mean η\eta and variance f(η)f(\eta) for some twice-differentiable function ff. Let S𝒩(η,f(η))S\sim\mathcal{N}(\eta,\,f(\eta)). Then the log-likelihood of SS is

logfS(s;η)=12log(2π)12logf(η)(sη)22f(η).\log f_{S}(s\,;\,\eta)=-\frac{1}{2}\log(2\pi)-\frac{1}{2}\log f(\eta)-\frac{(s-\eta)^{2}}{2f(\eta)}. (70)

The derivative of logfS(s;η)\log f_{S}(s\,;\,\eta) with respect to η\eta is

logfS(s;η)η\displaystyle\frac{\partial\log f_{S}(s\,;\,\eta)}{\partial\eta} =f(η)2f(η)2(ηs)f(η)(ηs)2f(η)2f(η)2.\displaystyle=-\frac{f^{\prime}(\eta)}{2f(\eta)}-\frac{2(\eta-s)f(\eta)-(\eta-s)^{2}f^{\prime}(\eta)}{2f(\eta)^{2}}.

The second derivative is then

2logfS(s;η)η2\displaystyle\frac{\partial^{2}\log f_{S}(s\,;\,\eta)}{\partial\eta^{2}} =f′′(η)f(η)f(η)22f(η)21f(η)\displaystyle=-\frac{f^{\prime\prime}(\eta)f(\eta)-f^{\prime}(\eta)^{2}}{2f(\eta)^{2}}-\frac{1}{f(\eta)}
+2f(η)f(η)2(ηs)\displaystyle\quad+\frac{2f^{\prime}(\eta)}{f(\eta)^{2}}(\eta-s)
2[f(η)]2f′′(η)f(η)2f(η)3(ηs)2.\displaystyle\quad-\frac{2[f^{\prime}(\eta)]^{2}-f^{\prime\prime}(\eta)f(\eta)}{2f(\eta)^{3}}(\eta-s)^{2}.

The Fisher information for the estimation of η\eta is

S(η)\displaystyle\mathcal{I}_{S}(\eta) =E[2logfS(S;η)η2;η]\displaystyle=\mathrm{E}\!\left[\,{-\frac{\partial^{2}\log f_{S}(S\,;\,\eta)}{\partial\eta^{2}}\,;\,\eta}\,\right]
=f′′(η)f(η)f(η)22f(η)2+1f(η)2f(η)f(η)2E[ηS]\displaystyle=\frac{f^{\prime\prime}(\eta)f(\eta)-f^{\prime}(\eta)^{2}}{2f(\eta)^{2}}+\frac{1}{f(\eta)}-\frac{2f^{\prime}(\eta)}{f(\eta)^{2}}\mathrm{E}\!\left[\,{\eta-S}\,\right]
+2[f(η)]2f′′(η)f(η)2f(η)3E[(ηS)2]\displaystyle\quad+\frac{2[f^{\prime}(\eta)]^{2}-f^{\prime\prime}(\eta)f(\eta)}{2f(\eta)^{3}}\mathrm{E}\!\left[\,{(\eta-S)^{2}}\,\right]
=(a)f′′(η)f(η)f(η)22f(η)2+1f(η)2f(η)f(η)20\displaystyle\stackrel{{\scriptstyle(a)}}{{=}}\frac{f^{\prime\prime}(\eta)f(\eta)-f^{\prime}(\eta)^{2}}{2f(\eta)^{2}}+\frac{1}{f(\eta)}-\frac{2f^{\prime}(\eta)}{f(\eta)^{2}}\cdot 0
+2[f(η)]2f′′(η)f(η)2f(η)3f(n)\displaystyle\quad+\frac{2[f^{\prime}(\eta)]^{2}-f^{\prime\prime}(\eta)f(\eta)}{2f(\eta)^{3}}\cdot f(n)
=1f(η)+[f(η)]22f(η)2,\displaystyle=\frac{1}{f(\eta)}+\frac{[f^{\prime}(\eta)]^{2}}{2f(\eta)^{2}}, (71)

where (a) follows from substituting E[ηS]=0\mathrm{E}\!\left[\,{\eta-S}\,\right]=0 and E[(ηS)2]=var(S)=f(η)\mathrm{E}[\,{(\eta-S)^{2}}\,]=\mathrm{var}\!\left({S}\right)=f(\eta). (Note that this simplifies to the familiar reciprocal of the variance when f(η)f(\eta) is a constant.)

At high dose, Y/λY/\lambda is well-approximated as a 𝒩(η,η(η+1)/λ)\mathcal{N}(\eta,\,\eta(\eta+1)/\lambda) random variable [33, Sect. IV]. Thus, define f(η)=η(η+1)/λf(\eta)={{\eta(\eta+1)}/{\lambda}} so that Y/λY/\lambda is approximated well by SS. Substituting f(η)=(2η+1)/λf^{\prime}(\eta)={{(2\eta+1)}/{\lambda}} into (71) gives

S(η)\displaystyle\mathcal{I}_{S}(\eta) =λη(η+1)+(2η+1)22η2(η+1)2.\displaystyle=\frac{\lambda}{\eta(\eta+1)}+\frac{(2\eta+1)^{2}}{2\eta^{2}(\eta+1)^{2}}. (72)

Since YλSY\approx\lambda S,

limλY(η;λ)λ\displaystyle\lim_{\lambda\rightarrow\infty}\frac{\mathcal{I}_{Y}(\eta\,;\,\lambda)}{\lambda} =limλ[1η(η+1)+(2η+1)22λη2(η+1)2]\displaystyle=\lim_{\lambda\rightarrow\infty}\left[\frac{1}{\eta(\eta+1)}+\frac{(2\eta+1)^{2}}{2\lambda\eta^{2}(\eta+1)^{2}}\right]
=1η(η+1),\displaystyle=\frac{1}{\eta(\eta+1)},

as desired.

Appendix C Derivation of Mean-Squared Error for Continuous-Time Quotient Mode Estimator

C-A Bias of η^CTQM\widehat{\eta}_{\rm CTQM}

For m>0m>0,

E[η^CTQM|M~=m]=E[YM~|M~=m]\displaystyle\mathrm{E}\!\left[\,{\widehat{\eta}_{\rm CTQM}\,|\,\widetilde{M}=m}\,\right]=\mathrm{E}\!\left[\,{\frac{Y}{\widetilde{M}}\,|\,\widetilde{M}=m}\,\right]
=1mE[Y|M~=m]=(a)1mE[i=1mX~i]=E[X~i]\displaystyle\quad=\frac{1}{m}\mathrm{E}\!\left[\,{Y\,|\,\widetilde{M}=m}\,\right]\stackrel{{\scriptstyle(a)}}{{=}}\frac{1}{m}\mathrm{E}\!\left[\,{\sum_{i=1}^{m}\widetilde{X}_{i}}\,\right]=\mathrm{E}\!\left[\,{\widetilde{X}_{i}}\,\right]
=(b)η1eη,\displaystyle\quad\stackrel{{\scriptstyle(b)}}{{=}}\frac{\eta}{1-e^{-\eta}}, (73)

where (a) follows from using (4) as an expression for YY; and (b) from the effect of zero-truncation on the Poisson(η)\operatorname{Poisson}(\eta) distribution. Trivially,

E[η^CTQM|M~=0]=0.\displaystyle\mathrm{E}\!\left[\,{\widehat{\eta}_{\rm CTQM}\,|\,\widetilde{M}=0}\,\right]=0. (74)

Using P(M~>0)=1eλ(1eη)\mathrm{P}({\widetilde{M}>0})=1-e^{-\lambda(1-e^{-\eta})} from (21) and the total expectation theorem to combine (73) and (74) gives

E[η^CTQM]=(1eλ(1eη))η1eη.\mathrm{E}\!\left[\,{\widehat{\eta}_{\rm CTQM}}\,\right]=\left(1-e^{-\lambda(1-e^{-\eta})}\right)\frac{\eta}{1-e^{-\eta}}. (75)

Subtracting η\eta gives (III-D).

C-B Variance of η^CTQM\widehat{\eta}_{\rm CTQM}

From (73) and (74), E[η^CTQM|M~]\mathrm{E}[\,{\widehat{\eta}_{\rm CTQM}\,|\,\widetilde{M}}\,] is a two-valued random variable equal to η/(1eη)\eta/(1-e^{-\eta}) with probability 1eλ(1eη)1-e^{\lambda(1-e^{-\eta})} and equal to 0 otherwise. Thus,

var(E[η^CTQM|M~])\displaystyle\mathrm{var}\!\left({\mathrm{E}\!\left[\,{\widehat{\eta}_{\rm CTQM}\,|\,\widetilde{M}}\,\right]}\right)
=η2(1eη)2eλ(1eη)(1eλ(1eη))\displaystyle\ =\frac{\eta^{2}}{(1-e^{-\eta})^{2}}e^{-\lambda(1-e^{-\eta})}\left(1-e^{-\lambda(1-e^{-\eta})}\right) (76)

by direct calculation.

Toward computing E[var(η^CTQM|M~)]\mathrm{E}[\,{\mathrm{var}({\widehat{\eta}_{\rm CTQM}\,|\,\widetilde{M}})}\,], let us first examine var(η^CTQM|M~=m)\mathrm{var}({\widehat{\eta}_{\rm CTQM}\,|\,\widetilde{M}=m}). For m>0m>0,

var(η^CTQM|M~=m)=var(YM~|M~=m)\displaystyle\mathrm{var}\!\left({\widehat{\eta}_{\rm CTQM}\,|\,\widetilde{M}=m}\right)=\mathrm{var}\!\left({\frac{Y}{\widetilde{M}}\,|\,\widetilde{M}=m}\right)
=1m2var(Y|M~=m)=(a)1m2var(i=1mX~i)\displaystyle\quad=\frac{1}{m^{2}}\mathrm{var}\!\left({Y\,|\,\widetilde{M}=m}\right)\stackrel{{\scriptstyle(a)}}{{=}}\frac{1}{m^{2}}\mathrm{var}\!\left({\sum_{i=1}^{m}\widetilde{X}_{i}}\right)
=1mvar(X~i)=(b)1m(η+η21eηη2(1eη)2),\displaystyle\quad=\frac{1}{m}\mathrm{var}\!\left({\widetilde{X}_{i}}\right)\stackrel{{\scriptstyle(b)}}{{=}}\frac{1}{m}\left(\frac{\eta+\eta^{2}}{1-e^{-\eta}}-\frac{\eta^{2}}{(1-e^{-\eta})^{2}}\right), (77)

where (a) follows from using (4) as an expression for YY; and (b) from the effect of zero-truncation on the Poisson(η)\operatorname{Poisson}(\eta) distribution. Trivially,

var(η^CTQM|M~=0)=0.\displaystyle\mathrm{var}\!\left({\widehat{\eta}_{\rm CTQM}\,|\,\widetilde{M}=0}\right)=0. (78)

Using the PMF of M~\widetilde{M} from (21) and the total expectation theorem to combine (77) and (78) gives

E[var(η^CTQM|M~)]\displaystyle\mathrm{E}\!\left[\,{\mathrm{var}\!\left({\widehat{\eta}_{\rm CTQM}\,|\,\widetilde{M}}\right)}\,\right] =(η+η21eηη2(1eη)2)eλ(1eη)\displaystyle=\left(\frac{\eta+\eta^{2}}{1-e^{-\eta}}-\frac{\eta^{2}}{(1-e^{-\eta})^{2}}\right)\!e^{-\lambda(1-e^{-\eta})}
m=11m[λ(1eη)]mm!.\displaystyle\qquad\cdot\sum_{m=1}^{\infty}\frac{1}{m}\frac{[\lambda(1-e^{-\eta})]^{m}}{m!}. (79)

By summing (76) and (C-B) we obtain the variance of η^CTQM\widehat{\eta}_{\rm CTQM}, which verifies (33).

Appendix D Derivation of Mean-Squared Error for Discrete-Time Quotient Mode Estimator

D-A Variance of η^DTQM\widehat{\eta}_{\rm DTQM}

From (47) and (48), E[η^DTQM|L]\mathrm{E}[\,{\widehat{\eta}_{\rm DTQM}\,|\,L}\,] is a two-valued random variable equal to E[Y~k]\mathrm{E}[\,{\widetilde{Y}_{k}}\,] with probability 1(1p)n1-(1-p)^{n} and equal to 0 otherwise. Thus,

var(E[η^DTQM|L])=(E[Y~k])2[1(1p)n](1p)n,\displaystyle\mathrm{var}\!\left({\mathrm{E}\!\left[\,{\widehat{\eta}_{\rm DTQM}\,|\,L}\,\right]}\right)=\left(\mathrm{E}\!\left[\,{\widetilde{Y}_{k}}\,\right]\right)^{2}[1-(1-p)^{n}](1-p)^{n}, (80)

where E[Y~k]\mathrm{E}[\,{\widetilde{Y}_{k}}\,] is given in (45).

Toward computing E[var(η^DTQM|L)]\mathrm{E}[\,{\mathrm{var}({\widehat{\eta}_{\rm DTQM}\,|\,L})}\,], let us first examine var(η^DTQM|L=)\mathrm{var}({\widehat{\eta}_{\rm DTQM}\,|\,L=\ell}). For >0\ell>0,

var(η^DTQM|L=)=var(YL|L=)=12var(Y|L=)\displaystyle\mathrm{var}\!\left({\widehat{\eta}_{\rm DTQM}\,|\,L=\ell}\right)=\mathrm{var}\!\left({\frac{Y}{L}\,|\,L=\ell}\right)=\frac{1}{\ell^{2}}\mathrm{var}\!\left({Y\,|\,L=\ell}\right)
=12var(j=1Y~j)=1var(Y~k),\displaystyle\quad=\frac{1}{\ell^{2}}\mathrm{var}\!\left({\sum_{j=1}^{\ell}\widetilde{Y}_{j}}\right)=\frac{1}{\ell}\mathrm{var}\!\left({\widetilde{Y}_{k}}\right), (81)

where var(Y~k)\mathrm{var}({\widetilde{Y}_{k}}) is given in (46). Trivially,

var(η^DTQM|L=0)=0.\displaystyle\mathrm{var}\!\left({\widehat{\eta}_{\rm DTQM}\,|\,L=0}\right)=0. (82)

Using the binomial PMF of LL and the total expectation theorem to combine (81) and (82) gives

E[var(η^DTQM|L)]\displaystyle\mathrm{E}\!\left[\,{\mathrm{var}\!\left({\widehat{\eta}_{\rm DTQM}\,|\,L}\right)}\,\right] =var(Y~k)=1n1(n)p(1p)n.\displaystyle=\mathrm{var}\!\left({\widetilde{Y}_{k}}\right)\sum_{\ell=1}^{n}\frac{1}{\ell}\binom{n}{\ell}p^{\ell}(1-p)^{n-\ell}. (83)

By summing (80) and (83) we obtain the variance of η^DTQM\widehat{\eta}_{\rm DTQM}, which verifies (IV-D).

D-B Variance Lower Bound

Let L~\widetilde{L} be the zero-truncated version of binomial random variable LL, with PMF

PL~()=11(1p)n(n)p(1p)n,=1,2,,n.\mathrm{P}_{\widetilde{L}}(\ell)=\frac{1}{1-(1-p)^{n}}\binom{n}{\ell}p^{\ell}(1-p)^{n-\ell},\quad\ell=1,2,\ldots,n.

Then the series in (IV-D) equals (1(1p)n)E[ 1/L~](1-(1-p)^{n})\,\mathrm{E}[\,{1/\widetilde{L}}\,]. Using Jensen’s inequality, we can bound E[ 1/L~]\mathrm{E}[\,{1/\widetilde{L}}\,] using E[L~]\mathrm{E}[\,{\widetilde{L}}\,], which has a simple closed-form expression:

E[L~]=11(1p)nE[L]=np1(1p)n.\mathrm{E}\!\left[\,{\widetilde{L}}\,\right]=\frac{1}{1-(1-p)^{n}}\mathrm{E}\!\left[\,{L}\,\right]=\frac{np}{1-(1-p)^{n}}. (84)

Specifically,

=1n1(n)p(1p)n=(1(1p)n)E[1L~]\displaystyle\sum_{\ell=1}^{n}\frac{1}{\ell}\binom{n}{\ell}p^{\ell}(1-p)^{n-\ell}=(1-(1-p)^{n})\,\mathrm{E}\!\left[\,{\frac{1}{\widetilde{L}}}\,\right]
(a)(1(1p)n)1E[L~]=(b)(1(1p)n)2np,\displaystyle\quad\stackrel{{\scriptstyle(a)}}{{\geq}}(1-(1-p)^{n})\,\frac{1}{\mathrm{E}\!\left[\,{\widetilde{L}}\,\right]}\stackrel{{\scriptstyle(b)}}{{=}}\frac{(1-(1-p)^{n})^{2}}{np}, (85)

where (a) follows from Jensen’s inequality; and (b) from (84). Substituting (85) in (IV-D) gives (IV-D).

Appendix E Derivation of High nn Limits

E-A Proof of (52)

limnnp\displaystyle\lim_{n\to\infty}np =limn1e(λ/n)(1eη)1/n\displaystyle=\lim_{n\to\infty}\frac{1-e^{-{({{\lambda}/{n}})}(1-e^{-\eta})}}{1/n}
=(a)limne(λ/n)(1eη)(1eη)λ/n21/n2\displaystyle\stackrel{{\scriptstyle(a)}}{{=}}\lim_{n\to\infty}\frac{-e^{-{({{\lambda}/{n}})}(1-e^{-\eta})}(1-e^{-\eta})\lambda/n^{2}}{-1/n^{2}}
=limne(λ/n)(1eη)(1eη)λ\displaystyle=\lim_{n\to\infty}e^{-{({{\lambda}/{n}})}(1-e^{-\eta})}(1-e^{-\eta})\lambda
=λ(1eη),\displaystyle=\lambda(1-e^{-\eta}),

where (a) follows from L’Hôpital’s rule.

E-B Proof of (53)

limn(1p)n\displaystyle\lim_{n\to\infty}(1-p)^{n} =(a)limn(exp(λn(1eη)))n\displaystyle\stackrel{{\scriptstyle(a)}}{{=}}\lim_{n\to\infty}\left(\exp\!\left(-\frac{\lambda}{n}(1-e^{-\eta})\right)\right)^{n}
=exp(λ(1eη))\displaystyle=\exp(-\lambda(1-e^{-\eta}))

where (a) follows from substitution of (43).

E-C Proof of (55)

limnE[Y~k]\displaystyle\lim_{n\to\infty}\mathrm{E}[\,{\widetilde{Y}_{k}}\,] =(a)limnληnp\displaystyle\stackrel{{\scriptstyle(a)}}{{=}}\lim_{n\to\infty}\frac{\lambda\eta}{np}
=(b)limnληλ(1eη)=η1eη,\displaystyle\stackrel{{\scriptstyle(b)}}{{=}}\lim_{n\to\infty}\frac{\lambda\eta}{\lambda(1-e^{-\eta})}=\frac{\eta}{1-e^{-\eta}},

where (a) follows from substitution of (45); and (b) from substitution of (52).

E-D Proof of (56)

limnvar(Y~k)\displaystyle\lim_{n\to\infty}\mathrm{var}({\widetilde{Y}_{k}}) =(a)limn1p(λnη+λnη2+(λnη)2)1p2(λnη)2\displaystyle\stackrel{{\scriptstyle(a)}}{{=}}\lim_{n\to\infty}\frac{1}{p}\!\left(\frac{\lambda}{n}\eta+\frac{\lambda}{n}\eta^{2}+\left(\frac{\lambda}{n}\eta\right)^{\!2}\right)-\frac{1}{p^{2}}\!\left(\frac{\lambda}{n}\eta\right)^{\!2}
=limnληnp+λη2np+λ2η2n2pλ2η2n2p2\displaystyle=\lim_{n\to\infty}\frac{\lambda\eta}{np}+\frac{\lambda\eta^{2}}{np}+\frac{\lambda^{2}\eta^{2}}{n^{2}p}-\frac{\lambda^{2}\eta^{2}}{n^{2}p^{2}}
=(b)limnλη+λη2λ(1eη)λ2η2λ2(1eη)2\displaystyle\stackrel{{\scriptstyle(b)}}{{=}}\lim_{n\to\infty}\frac{\lambda\eta+\lambda\eta^{2}}{\lambda(1-e^{-\eta})}-\frac{\lambda^{2}\eta^{2}}{\lambda^{2}(1-e^{-\eta})^{2}}
=η(η+η2)eη(1eη)2,\displaystyle=\frac{\eta-(\eta+\eta^{2})e^{-\eta}}{(1-e^{-\eta})^{2}},

where (a) follows from substitution of (46); and (b) from substitution of (52) and noting that the third term vanishes.

References

  • [1] D. McMullan, “Scanning electron microscopy 1928–1965,” Scanning, vol. 17, no. 3, pp. 175–185, May–Jun. 1995.
  • [2] E. W. Müller and T. T. Tsong, Field Ion Microscopy: Principles and Applications.   American Elsevier, 1969.
  • [3] B. W. Ward, J. A. Notte, and N. P. Economou, “Helium ion microscope: A new tool for nanoscale microscopy and metrology,” J. Vacuum Sci. Technol. B, vol. 24, no. 6, pp. 2871–2874, Nov. 2006.
  • [4] D. Emmrich, A. Beyer, A. Nadzeyka, S. Bauerdick, J. Meyer, J. Kotakoski, and A. Gölzhäuser, “Nanopore fabrication and characterization by helium ion microscopy,” Appl. Phys. Lett., vol. 108, no. 16, p. 163103, 2016.
  • [5] M. S. Joens, C. Huynh, J. M. Kasuboski, D. Ferranti, Y. J. Sigal, F. Zeitvogel, M. Obst, C. J. Burkhardt, K. P. Curran, S. H. Chalasani, L. A. Stern, B. Goetze, and J. A. J. Fitzpatrick, “Helium ion microscopy (HIM) for the imaging of biological samples at sub-nanometer resolution,” Sci. Rep., vol. 3, no. 3514, 2013.
  • [6] S. Bazargan, N. F. Heinig, J. F. Rios, and K. Leung, “Electronic transport in tin (iv) oxide nanocrystalline films: two-medium transport with three-dimensional variable-range hopping mechanism for the ultrasmall nanocrystallite size regime,” J. Physical Chemistry C, vol. 116, no. 8, pp. 4979–4985, 2012.
  • [7] T. Wirtz, O. De Castro, J.-N. Audinot, and P. Philipp, “Imaging and analytics on the helium ion microscope,” Ann. Rev. Analytical Chem., vol. 12, pp. 523–543, 2019.
  • [8] L. Scipioni, C. A. Sanford, J. Notte, B. Thompson, and S. McVey, “Understanding imaging modes in the helium ion microscope,” J. Vac. Sci. & Technol. B, vol. 27, no. 6, pp. 3250–3255, 2009.
  • [9] R. Ramachandra, B. Griffin, and D. Joy, “A model of secondary electron imaging in the helium ion scanning microscope,” Ultramicroscopy, vol. 109, no. 6, pp. 748–757, 2009.
  • [10] J. Morgan, J. Notte, R. Hill, and B. Ward, “An introduction to the helium ion microscope,” Microscopy Today, vol. 14, no. 4, pp. 24–31, 2006.
  • [11] G. Hlawacek, V. Veligura, R. van Gastel, and B. Poelsema, “Helium ion microscopy,” J. Vac. Sci. & Technol. B, vol. 32, no. 2, p. 020801, 2014.
  • [12] R. Livengood, S. Tan, Y. Greenzweig, J. Notte, and S. McVey, “Subsurface damage from helium ions as a function of dose, beam energy, and dose rate,” J. Vac. Sci. & Technol. B, vol. 27, no. 6, pp. 3244–3249, 2009.
  • [13] V. Castaldo, C. W. Hagen, P. Kruit, E. Van Veldhoven, and D. Maas, “On the influence of the sputtering in determining the resolution of a scanning ion microscope,” J. Vac. Sci. & Technol. B, vol. 27, no. 6, pp. 3196–3202, 2009.
  • [14] J. Orloff, L. W. Swanson, and M. Utlaut, “Fundamental limits to imaging resolution for focused ion beams,” J. Vac. Sci. & Technol. B, vol. 14, no. 6, pp. 3759–3763, 1996.
  • [15] D. Fox, Y. B. Zhou, A. O’Neill, S. Kumar, J. J. Wang, J. N. Coleman, G. S. Duesberg, J. F. Donegan, and H. Z. Zhang, “Helium ion microscopy of graphene: Beam damage, image quality and edge contrast,” Nanotechnology, vol. 24, no. 33, p. 335702, 2013.
  • [16] H. S. Anderson, J. Ilic-Helms, B. Rohrer, J. Wheeler, and K. Larson, “Sparse imaging for fast electron microscopy,” in SPIE Computational Imaging XI, vol. 8657, 2013, p. 86570C.
  • [17] L. Donati, M. Nilchian, S. Trépout, C. Messaoudi, S. Marco, and M. Unser, “Compressed sensing for STEM tomography,” Ultramicroscopy, vol. 179, pp. 47–56, 2017.
  • [18] F. Boughorbel, P. Potocek, M. Hovorka, L. Strakos, J. Mitchels, T. Vystavel, P. Trampert, B. Lich, and T. Dahmen, “High-throughput large volume sem workflow using sparse scanning and in-painting algorithms inspired by compressive sensing,” Microscopy and Microanalysis, vol. 23, no. S1, pp. 150–151, 2017.
  • [19] M. Peng, J. Murray-Bruce, K. K. Berggren, and V. K. Goyal, “Source shot noise mitigation in focused ion beam microscopy by time-resolved measurement,” Ultramicroscopy, p. 112948, 2020.
  • [20] G. McMullan, S. Chen, R. Henderson, and A. Faruqi, “Detective quantum efficiency of electron area detectors in electron microscopy,” Ultramicroscopy, vol. 109, no. 9, pp. 1126–1143, 2009.
  • [21] S. Yamada, T. Ito, K. Gouhara, and Y. Uchikawa, “Electron-count imaging in SEM,” Scanning, vol. 13, no. 2, pp. 165–171, 1991.
  • [22] L. Jin, A.-C. Milazzo, S. Kleinfelder, S. Li, P. Leblanc, F. Duttweiler, J. C. Bouwer, S. T. Peltier, M. H. Ellisman, and N.-H. Xuong, “Applications of direct detection device in transmission electron microscopy,” J. Structural Biology, vol. 161, no. 3, pp. 352–358, 2008.
  • [23] C. Li, S. F. Mao, and Z. J. Ding, “Time-dependent characteristics of secondary electron emission,” J. Appl. Physics, vol. 125, no. 024902, Jan. 2019.
  • [24] Y. Lin and D. C. Joy, “A new examination of secondary electron yield data,” Surface Interface Anal., vol. 37, no. 11, pp. 895–900, 2005.
  • [25] J. Notte, B. Ward, N. Economou, R. Hill, R. Percival, L. Farkas, and S. McVey, “An introduction to the helium ion microscope,” in AIP Conference Proceedings, vol. 931.   AIP, 2007, pp. 489–496.
  • [26] L. Frank, “Noise in secondary electron emission: the low yield case,” J. Electron Microscopy, vol. 54, no. 4, pp. 361–365, Aug. 2005.
  • [27] H. L. Van Trees, Detection, Estimation and Modulation Theory, Part I.   New York, NY: Wiley, 2001.
  • [28] J. A. Notte, “Imaging with helium ions – A new detector regime with new challenges and new opportunities,” American Vacuum Society 60th Int. Symp. Exhibit., Oct. 31 (presented, no proceedings), 2013.
  • [29] J. A. Notte, S. Carousso, and B. W. Ward, “Ion beam imaging,” World Intellectual Property Organization publication WO 2009/079195 Al, 25 Jun. 2009.
  • [30] R. M. Corless, G. H. Gonnet, D. E. G. Hare, D. J. Jeffrey, and D. E. Knuth, “On the Lambert WW function,” Advances in Computational Mathematics, vol. 5, no. 1, pp. 329–359, Dec. 1996.
  • [31] R. J. Serfling, “Some elementary results on Poisson approximation in a sequence of Bernoulli trials,” SIAM Rev., vol. 20, no. 3, pp. 567–579, Jul. 1978.
  • [32] E. Masina, “A review on the exponential-integral special function and other strictly related special functions,” arXiv:1907.12373v1 [math.GM], Jul. 2019.
  • [33] M. C. Teich, “Role of the doubly stochastic Neyman type-A and Thomas counting distributions in photon detection,” Appl. Optics, vol. 20, no. 14, pp. 2457–2467, Jul. 1981.