¹¹institutetext: Department of Health Technology, Technical University of Denmark, Denmark ²²institutetext: Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Harvard Medical School, USA ³³institutetext: Department of Radiology, Harvard Medical School, United States ⁴⁴institutetext: Department of Neurology and TUM-Neuroimaging Center, School of Medicine, Technical University of Munich, Germany

A Longitudinal Method for Simultaneous Whole-Brain and Lesion Segmentation in Multiple Sclerosis

Stefano Cerri 11 Andrew Hoopes 22 Douglas N. Greve 2233 Mark Mühlau 44 Koen Van Leemput 1122

Abstract

In this paper we propose a novel method for the segmentation of longitudinal brain MRI scans of patients suffering from Multiple Sclerosis. The method builds upon an existing cross-sectional method for simultaneous whole-brain and lesion segmentation, introducing subject-specific latent variables to encourage temporal consistency between longitudinal scans. It is very generally applicable, as it does not make any prior assumptions on the scanner, the MRI protocol, or the number and timing of longitudinal follow-up scans. Preliminary experiments on three longitudinal datasets indicate that the proposed method produces more reliable segmentations and detects disease effects better than the cross-sectional method it is based upon.

1 Introduction

Multiple Sclerosis (MS) is an inflammatory autoimmune disorder of the central nervous system. It is characterized by the formation of lesions in the white matter, as well as marked brain atrophy primarily in deep gray matter structures [1, 2]. The increased availability of longitudinal magnetic resonance imaging (MRI) scans opens up the prospect of tracking lesion evolution and atrophy trajectories over time, enabling a better assessment of disease progression and treatment efficacy [3].

Despite high potential clinical impact, work on computational methods for quantifying longitudinal changes in MS has remained fairly limited to date (cf. [4] for an overview). The methods that do exist suffer from one or more of the following limitations: They only assess changes in white matter lesions [5, 6, 7, 8] or in aggregate measures of brain atrophy such as global brain or gray matter volume [9, 10], but not in individual brain structures; they can only compare between two consecutive time points [11, 12, 13, 14] instead of characterizing entire temporal trajectories; or they are developed and tested in very specific settings only, with degraded performance when applied to data from different scanners and acquisition protocols [15] which limits their usefulness in practice.

In order to address these limitations, here we propose a dedicated model for simultaneously segmenting anatomical brain structures and white matter lesion from longitudinal multi-contrast MRI scans. The proposed method builds upon a contrast-adaptive method for simultaneous whole-brain and lesion segmentation that we previously developed and validated [16]. Here we extend this approach to the longitudinal setting by additionally modeling the expected temporal consistency between repeated scans of the same subject, using latent variables that introduce a statistical dependency between the time points. By segmenting both white matter lesions and anatomical brain structures across time, the resulting method enables tracking deep gray matter atrophy trajectories and lesion evolution simultaneously. The model is fully adaptive to different MRI contrasts and scanners, and does not put any constraints on the number or the timing of longitudinal follow-up scans. To the best of our knowledge, no other method with these capabilities currently exists.

We assessed the segmentation performance of the proposed method on three longitudinal datasets. Preliminary results indicate that it produces more reliable segmentations and detects disease effects better than the cross-sectional method. An example result produced by the longitudinal method is shown in Fig.1.

2 Existing cross-sectional method

We first summarize the existing cross-sectional method for simultaneous whole-brain and lesion segmentation [16] the proposed method builds upon.

Let $\mathbf{D}=(\mathbf{d}_{1},\dots,\mathbf{d}_{I})$ be the image intensities of a multi contrast MRI scan with $I$ voxels, where the vector $\mathbf{d}_{i}=(d_{i}^{1},\dots,d_{i}^{N})^{T}$ represents the log-transformed image intensity of voxel $i$ for all the available $N$ contrasts. Moreover, let $\mathbf{l}=(l_{1},\dots,l_{I})^{T}$ be corresponding segmentation labels, where $l_{i}\in\{1,\dots,K\}$ denotes one of the $K$ possible anatomical structures assigned to voxel $i$ . In order for the model to be capable of segmenting white matter lesions, a binary lesion map $\mathbf{z}=(z_{1},\dots z_{I})$ is introduced, where $z_{i}\in\{0,1\}$ indicates the presence of lesion in voxel $i$ . We use a generative model, illustrated in black in Fig. 3, to estimate a joint segmentation $\{\mathbf{l},\mathbf{z}\}$ from MRI data $\mathbf{D}$ . The model consists of a segmentation prior $p(\mathbf{l},\mathbf{z}|\mathbf{h},\mathbf{x})$ with parameters $\mathbf{h}$ and $\mathbf{x}$ that encode shape information, and a likelihood function $p(\mathbf{D}|\mathbf{l},\mathbf{z},\boldsymbol{\theta},\boldsymbol{\theta}_{z})$ with parameters $\boldsymbol{\theta}$ and $\boldsymbol{\theta}_{z}$ that govern intensity appearance. Below we briefly describe the segmentation prior and the likelihood function, as well as how the model is “inverted” to obtain automatic segmentations.

Refer to caption — Figure 1: Example segmentation produced by the proposed method on a longitudinal scan with T1w and FLAIR contrast.

2.0.1 Segmentation prior:

The segmentation prior is composed of two components $p(\mathbf{l}|\mathbf{x})$ and $p(\mathbf{z}|\mathbf{h},\mathbf{x})$ that encode spatial information regarding the neuroanatomical labels $\mathbf{l}$ and the lesion map $\mathbf{z}$ respectively: $p(\mathbf{l},\mathbf{z}|\mathbf{h},\mathbf{x})=p(\mathbf{l}|\mathbf{x})p(\mathbf{z}|\mathbf{h},\mathbf{x}).$ The first component is a deformable probabilistic atlas, encoded as a tetrahedral mesh [17] with node positions $\mathbf{x}$ and with a deformation prior distribution defined as:

p(\mathbf{x})\propto\exp\left[-K\sum_{d}U_{d}(\mathbf{x},\mathbf{x}_{ref})\right].

Here $K$ controls the stiffness of the mesh deformations, $d$ loops over the tetrahedra in the mesh, and $U_{d}(\mathbf{x},\mathbf{x}_{ref})$ is a cost [18] associated with deforming the $d^{th}$ tetrahedron from its shape in the atlas’s reference position $\mathbf{x}_{ref}$ . Letting $p(l_{i}=k|\mathbf{x})$ denote the probability of observing label $k$ at voxel $i$ for a given deformation, assuming conditional independence of the labels between voxels yields $p(\mathbf{l}|\mathbf{x})=\prod_{i=1}^{I}p(l_{i}|\mathbf{x}).$

The second component of the segmentation prior is a model of the form: $p(\mathbf{z}|\mathbf{h},\mathbf{x})=\prod_{i=1}^{I}p(z_{i}|\mathbf{h},\mathbf{x}),\textbf{ }p(\mathbf{h})=\mathcal{N}(\mathbf{h}|\mathbf{0},\mathbf{I}),$ where $p(z_{i}=1||\mathbf{h},\mathbf{x})$ is the probability that voxel $i$ is part of a lesion. This model takes into account both a voxel’s spatial location within its neuroanatomical context (through $\mathbf{x}$ ), as well as lesion shape constraints through a variational autoencoder (VAE) [19] that “decodes” a low-dimensional latent code $\mathbf{h}$ using a convolutional neural network.

2.0.2 Likelihood:

For the likelihood, which links segmentations $\{\mathbf{l},\mathbf{z}\}$ to intensities $\mathbf{D}$ , we use a multivariate Gaussian intensity model for each structure, and model the MRI bias field artifact as a linear combination of spatially smooth basis functions that is added to the local voxel intensities [20, 21]. Letting $\boldsymbol{\theta}_{z}=\{\boldsymbol{\mu}_{z},\boldsymbol{\Sigma}_{z}\}$ denote the mean and variance of lesion intensities, and $\boldsymbol{\theta}$ the collection of bias field parameters and intensity means and variances $\{\boldsymbol{\mu}_{k},\boldsymbol{\Sigma}_{k}\}$ of all $K$ anatomical structures, the likelihood is defined as $p(\mathbf{D}|\mathbf{l},\mathbf{z},\boldsymbol{\theta},\boldsymbol{\theta}_{z})=\prod_{i=1}^{I}p(\mathbf{d}_{i}|l_{i},z_{i},\boldsymbol{\theta},\boldsymbol{\theta}_{z}),$ where

\displaystyle p(\mathbf{d}_{i}|l_{i}=k,z_{i},\boldsymbol{\theta},\boldsymbol{\theta}_{z})=\begin{cases}\mathcal{N}(\mathbf{d}_{i}|\boldsymbol{\mu}_{z}+\mathbf{C}^{T}\boldsymbol{\phi}_{i},\boldsymbol{\Sigma}_{z})&\text{if }z_{i}=1,\\ \mathcal{N}(\mathbf{d}_{i}|\boldsymbol{\mu}_{k}+\mathbf{C}^{T}\boldsymbol{\phi}_{i},\boldsymbol{\Sigma}_{k})&\text{otherwise}.\end{cases}

Here $\boldsymbol{\phi}_{i}$ evaluates the bias field basis functions at the $i^{th}$ voxel, and $\mathbf{C}=(\mathbf{c}_{1},\dots,\mathbf{c}_{N})$ , where $\mathbf{c}_{n}$ denotes the parameters of the bias field model for the $n^{th}$ contrast. The model is completed by a flat prior on $\boldsymbol{\theta}$ , and a weak conditional prior $p(\boldsymbol{\theta}_{z}|\boldsymbol{\theta})$ that ensures that the method can be robustly applied to scans with no or very small lesion loads [16].

2.0.3 Segmentation:

Given an MRI scan $\mathbf{D}$ , segmentation proceeds by approximating the segmentation posterior using point estimates of the parameters $\mathbf{x}$ and $\boldsymbol{\theta}$ :

p(\mathbf{l},\mathbf{z}|\mathbf{D})\simeq p(\mathbf{l},\mathbf{z}|\mathbf{D},\hat{\boldsymbol{\theta}},\hat{\mathbf{x}}),

(1)

and Markov chain Monte Carlo sampling to marginalize over the remaining, lesion-specific parameters $\boldsymbol{\theta}_{z}$ and $\mathbf{h}$ . For the purpose of finding the point estimates $\mathbf{\hat{x}}$ and $\boldsymbol{\hat{\theta}}$ , a simplified model is fitted to the data:

\boldsymbol{\hat{\Omega}}=\operatornamewithlimits{argmax}_{\boldsymbol{\Omega}}p(\boldsymbol{\Omega}|\mathbf{D})\quad\mathrm{with}\quad\boldsymbol{\Omega}=\{\mathbf{x},\boldsymbol{\theta},\boldsymbol{\theta}_{z}\},

(2)

where the lesion-shape encoding VAE and its parameters $\mathbf{h}$ are temporarily removed to simplify the optimization process. More details can be found in [16].

3 Longitudinal extension

In the longitudinal setting we are given $T$ scans with image intensities $\{\mathbf{D}_{t}\}_{t=1}^{T}$ , and we wish to compute for each time point $t$ the corresponding segmentation $\{\mathbf{l}_{t},\mathbf{z}_{t}\}$ . In contrast to the cross-sectional setting where each image is treated independently, here we can exploit the fact that all images belong to the same subject to produce more consistent (and potentially more accurate) segmentations. Towards this end, we introduce subject-specific latent variables $\mathbf{x}_{0}$ and $\boldsymbol{\theta}_{0}$ in the segmentation prior and likelihood, respectively, imposing a statistical dependency between the time points that encourages the segmentations to be similar to one another. The augmented generative model is depicted in Fig. 3, where the parameters $\mathbf{x}_{t},\mathbf{h}_{t},\boldsymbol{\theta}_{t}$ and $\boldsymbol{\theta}_{t,z}$ denote the model parameters of time point $t$ , and the blue parts indicate the additional components compared to the cross-sectional model.

3.0.1 Segmentation prior:

In order to obtain temporal consistency in the segmentation prior, we use the concept of a “subject-specific atlas” [22]: a deformation of the cross-sectional atlas to represent the average subject-specific anatomy across all time points. In particular,

p(\{\mathbf{x}_{t}\}_{t=1}^{T}|\mathbf{x}_{0})=\prod_{t=1}^{T}p(\mathbf{x}_{t}|\mathbf{x}_{0}),\quad p(\mathbf{x}_{t}|\mathbf{x}_{0})\propto\exp\left[-K_{1}\sum_{d}U_{d}(\mathbf{x}_{t},\mathbf{x}_{0})\right],

where $\mathbf{x}_{0}$ are latent atlas node positions encoding subject-specific brain shape, with prior $p(\mathbf{x}_{0})\propto\exp\left[-K_{0}\sum_{d}U_{d}(\mathbf{x}_{0},\mathbf{x}_{ref})\right].$ Here the mesh stiffnesses $K_{0}$ and $K_{1}$ are hyperparameters of the model; by choosing $K_{0}=\infty$ and $K_{1}=K$ the model devolves into the cross-sectional segmentation prior for each time point separately.

3.0.2 Likelihood:

In a similar vein, we also introduce subject-specific latent variables to encourage temporal consistency in the Gaussian intensity models. For each anatomical structure, we condition the Gaussian parameters $\{\boldsymbol{\mu}_{t,k},\boldsymbol{\Sigma}_{t,k}\}$ on latent variables $\{\boldsymbol{\mu}_{0,k},\boldsymbol{\Sigma}_{0,k}\}$ using a normal-inverse-Wishart (NIW) distribution: $p(\{\boldsymbol{\theta}_{t}\}_{t=1}^{T}|\boldsymbol{\theta}_{0})=\prod_{t=1}^{T}p(\boldsymbol{\theta}_{t}|\boldsymbol{\theta}_{0})$ with

\displaystyle p(\boldsymbol{\theta}_{t}|\boldsymbol{\theta}_{0})\propto\prod_{k=1}^{K}\mathcal{N}(\boldsymbol{\mu}_{t,k}|\boldsymbol{\mu}_{0,k},P_{0,k}\boldsymbol{\Sigma}_{0,k})\mathrm{IW}(\boldsymbol{\Sigma}_{t,k}|P_{0,k}\boldsymbol{\Sigma}_{0,k},P_{0,k}-N-2).

Here $\boldsymbol{\theta}_{0}=\{\boldsymbol{\mu}_{0,k},\boldsymbol{\Sigma}_{0,k}\}_{k=1}^{K}$ with prior $p(\boldsymbol{\theta}_{0})\propto 1$ , and $P_{0,k}\geq 0$ is a hyperparameter that governs the strength of the regularization across time for label $k$ . Note that choosing $P_{0,k}=0,\forall k$ yields the cross-sectional likelihood for each time point independently.

3.0.3 Segmentation:

We follow the same overall segmentation strategy as in the cross-sectional setting: we first compute point estimates $\{\boldsymbol{\hat{\theta}}_{t},\mathbf{\hat{x}}_{t}\}_{t=1}^{T}$ using a simplified model in which the lesion shape codes $\{\mathbf{h}_{t}\}_{t=1}^{T}$ are removed, and subsequently obtain segmentations as described in the cross-sectional setting, i.e., by using (1) for each time point separately. As in the cross-sectional case, we obtain the required point estimates by fitting the longitudinal model to the data:

\{\boldsymbol{\hat{\Omega}}_{1},\ldots,\boldsymbol{\hat{\Omega}}_{T},\boldsymbol{\hat{\theta}}_{0},\mathbf{\hat{x}}_{0}\}=\operatornamewithlimits{argmax}_{\{\boldsymbol{\Omega}_{1},\ldots,\boldsymbol{\Omega}_{T},\boldsymbol{\theta}_{0},\mathbf{x}_{0}\}}p(\boldsymbol{\Omega}_{1},\ldots,\boldsymbol{\Omega}_{T},\boldsymbol{\theta}_{0},\mathbf{x}_{0}|\mathbf{D}_{1},\ldots,\mathbf{D}_{T}),

(3)

where $\boldsymbol{\Omega}_{t}=\{\mathbf{x}_{t},\boldsymbol{\theta}_{t},\boldsymbol{\theta}_{t,z}\}$ . For optimizing (3) we use coordinate ascent, updating one variable at a time in an iterative fashion. Because $p(\mathbf{x}_{t}|\mathbf{x}_{0})$ is of the same form as the cross-sectional deformation prior, and the NIW distribution used in $p(\boldsymbol{\theta}_{t}|\boldsymbol{\theta}_{0})$ is the conjugate prior for the mean and variance of a Gaussian distribution, estimating $\boldsymbol{\Omega}_{t}$ from $\mathbf{D}_{t}$ for given values of $\boldsymbol{\theta}_{0}$ and $\mathbf{x}_{0}$ simply involves performing an optimization of the form of (2) for each time point $t$ separately. Conversely, for given values $\{\boldsymbol{\Omega}_{t}\}_{t=1}^{T}$ the update for $\boldsymbol{\theta}_{0}$ is given in closed form:

\displaystyle\boldsymbol{\mu}_{0,k}\leftarrow\left(\sum_{t=1}^{T}\boldsymbol{\Sigma}_{t,k}^{-1}\right)^{-1}\sum_{t=1}^{T}\boldsymbol{\Sigma}_{t,k}^{-1}\boldsymbol{\mu}_{t,k},\quad\boldsymbol{\Sigma}^{-1}_{0,k}\leftarrow\left(\frac{1}{T}\sum_{t=1}^{T}\boldsymbol{\Sigma}_{t,k}^{-1}\right)\frac{P_{0,k}}{P_{0,k}-N-2},

whereas updating $\mathbf{x}_{0}$ involves the optimization (cf. [22])

\operatornamewithlimits{argmin}_{\mathbf{x}_{0}}\sum_{d}\left[K_{0}U_{d}(\mathbf{x}_{0},\mathbf{x}_{ref})+K_{1}\sum_{t=1}^{T}U_{d}(\mathbf{x}_{0},\mathbf{x}_{t})\right],

which we solve numerically using a limited-memory BFGS algorithm.

3.0.4 Implementation:

In order to avoid longitudinal processing biases resulting from not treated all time points in exactly the same way, we first compute an unbiased within-subject template using an inverse consistent registration method [23]. This template is a robust representation of the average subject anatomy over time, and we use it as an unbiased reference to register all time points to in a preprocessing step. We also use it to start the proposed iterative algorithm optimizing (3): we apply the cross-sectional method to the template, and use the estimated model parameters $\boldsymbol{\Omega}$ to initialize $\boldsymbol{\Omega}_{t},t=1,\ldots,T$ . The proposed algorithm, which interleaves updating the latent variables $\boldsymbol{\theta}_{0}$ and $\mathbf{x}_{0}$ with updating the parameters $\{\boldsymbol{\Omega}_{t}\}_{t=1}^{T}$ , is then run for five iterations, which we have found to be sufficient to reach convergence.

Based on initial pilot experiments on scans from the ADNI project¹¹1http://adni.loni.usc.edu/ (distinct from the ones used in the experiments below), we set the method’s hyperparameter values to $K_{1}=14K$ and $K_{0}=K$ , where $K$ is the mesh stiffness in the existing cross-sectional method, and $P_{0,k}$ to the number of voxels assigned to class $k$ in the segmentation of the within-subject template.

Our implementation builds upon the C++ and Python code of [24, 16], and is publicly available from FreeSurfer²²2http://freesurfer.net/. Segmenting one subject takes approximately 15 minutes per time point on an Intel 12-core i7-8700K processor with a GeForce GTX 1060 graphics card.

4 Experiments and Results

In order to assess whether introducing subject-specific latent variables leads to better longitudinal performance, we compared the proposed method and the cross-sectional method on three different datasets:

•

Test-retest [25]: This dataset consists of longitudinal T1-weighted (T1w) and FLuid Attenuation Inversion Recovery (FLAIR) scans of 2 MS subjects. For each subject 6 repeated scans were acquired from 3 different 3T scanners (Philips Achieva; Siemens Verio; GE Signa MR750) within 3 weeks.
•

Achieva: This dataset consists of longitudinal T1w and FLAIR scans of 86 MS subjects. The subjects were scanned between 3 and 6 times (time between scans between 6 and 12 months) with a 3T Philips Achieva scanner at the Department of Neurology, School of Medicine, at the Technical University of Munich in the context of the in-house cohort study on MS named TUM-MS. All the subjects were diagnosed as relapsing-remitting MS.
•

ADNI: This dataset consists of longitudinal T1w scans of 135 subjects randomly selected from the ADNI project. Scanners from multiple sites were used to acquired the scans, and subjects were scanned between 2 and 6 times, with 6 or 12 months between scans. The subjects were divided into 3 groups: cognitively normal (CN, n=45), mild cognitive impairment (MCI, n=54), and Alzheimer disease (AD, n=36).

We report results on the estimated volumes of the following 26 regions: left and right cerebral white matter, cerebellum white matter, cerebral cortex, cerebellum cortex, lateral ventricle, hippocampus, thalamus, putamen, pallidum, caudate, amygdala and nucleus accumbens, as well as brain stem and lesions. To avoid cluttering, results for left and right structures are averaged.

4.0.1 Temporal consistency:

We wished to assess whether the proposed method is able to reduce non-biological variations in longitudinal volume measurements, both within the short ( $<3$ weeks) and longer ( $<6$ years) time intervals of the test-retest and the Achieva datasets, respectively. For the test-retest dataset one can expect true biological changes to be minimal, and we therefore computed the coefficient of variation (ratio of the standard deviation to the mean) for each brain structure. The results, shown in Table 3, indicate that the longitudinal method indeed performs better in this respect than the cross-sectional one for almost all the structures.

For the Achieva dataset, one may assume that the true change in volume of a structure over the span of a few years is approximately linear, except for lesions whose temporal trajectory is affected by disease effects, with growing and shrinking lesions occurring at the same time. We therefore fitted, for each structure and for each subject, a linear regression model to the longitudinal volumes estimated by each method, and computed the ratio of the standard deviation of the residuals to the intersect (time of the first scan is taken as zero). The results are summarized in Table 3 and indicate that the proposed model indeed yields generally better results in this respect.

4.0.2 Detecting disease effects:

In order to ensure that the proposed method is not simply over-regularizing, we also assessed whether it can capture known group differences in the temporal evolution of certain brain structures better than the cross-sectional method. Towards this end, we compared the annualized percentage change (the slope of a linear regression model divided by its intersect) in the volume of the hippocampus between the CN, MCI and AD groups in the ADNI dataset. The results, shown in Fig. 3, indicate that the longitudinal method can indeed detect group differences better this way.

	Subject 1	Subject 2	Avg
Lesions	3.79	3.90	4.78	3.79	6.27	3.39	2.84	4.04	2.57	1.95	2.35	1.22	3.77	3.05
Cerebral white matter	0.38	0.46	0.14	0.12	1.70	1.24	0.93	0.79	0.78	0.17	1.16	1.00	0.85	0.63
Cerebellum white matter	0.82	0.56	0.28	0.19	2.03	1.96	1.28	0.91	0.71	0.41	1.14	1.00	1.04	0.84
Cerebral cortex	0.53	0.50	0.50	0.44	1.50	1.84	0.60	1.08	0.30	0.40	0.74	1.10	0.70	0.89
Cerebellum cortex	0.38	0.34	0.31	0.36	1.13	1.25	0.61	0.63	0.42	0.36	0.86	0.57	0.62	0.58
Lateral ventricles	1.10	1.03	1.37	0.38	1.11	0.57	3.73	2.95	1.62	1.10	2.71	1.24	1.94	1.21
Hippocampus	1.03	0.67	0.55	0.43	0.84	1.46	1.88	1.39	0.55	0.57	1.37	0.91	1.04	0.90
Thalamus	0.44	0.44	0.52	0.31	1.21	0.54	0.92	0.79	0.50	0.35	1.02	0.36	0.77	0.46
Putamen	0.58	0.14	0.84	0.76	0.70	0.71	0.51	0.31	1.10	0.52	1.44	1.08	0.86	0.59
Pallidum	2.25	1.35	2.04	1.58	3.94	1.31	2.77	2.39	1.12	0.77	4.35	1.84	2.74	1.54
Caudate	0.80	1.09	0.96	0.90	1.00	0.84	1.79	1.04	0.92	0.54	0.61	0.41	1.01	0.80
Amygdala	1.51	0.47	0.42	0.20	1.85	1.04	1.17	0.64	1.05	0.41	0.60	0.42	1.10	0.53
Accumbens	1.84	1.27	1.77	1.40	1.80	1.39	1.96	0.69	0.80	0.80	1.73	0.80	1.65	1.06
Brain stem	0.70	0.53	0.32	0.19	1.10	0.93	0.78	0.59	0.54	0.34	0.61	0.63	0.67	0.53
Intracranial volume	0.30	0.06	0.13	0.18	0.53	0.56	0.46	0.06	0.22	0.08	0.52	0.36	0.36	0.22

Structure	Cross	Long	Structure	Cross	Long
Lesions	9.41	9.27	Cerebral white matter	0.56	0.31
Cerebellum white matter	0.59	0.45	Cerebral cortex	0.54	0.59
Cerebellum cortex	0.45	0.42	Lateral ventricles	2.04	1.85
Hippocampus	0.69	0.55	Thalamus	0.51	0.49
Putamen	0.70	0.42	Pallidum	1.23	0.78
Caudate	1.01	0.80	Amygdala	0.70	0.40
Accumbens	1.39	0.82	Brain stem	0.62	0.46
Intracranial volume	0.28	0.08

5 Discussion and conclusion

In this paper we have proposed a novel method for the segmentation of longitudinal brain MRI scans of patients suffering from MS. The method is based on an existing cross-sectional method for simultaneous whole-brain and lesion segmentation, and leverages subject-specific latent variables to encourage segmentations across time points to be similar to each other. Preliminary results indicate that it is able to produce more consistent and reliable segmentations compared to the cross-sectional version, while being more sensitive to group differences. Future work will involve an extensive analysis of disease progression in different MS patient groups, as well as a more careful tuning of the hyperparameters of the model.

5.0.1 Acknowledgments:

This project has received funding from the European Union’s Horizon 2020 research and innovation program under the Marie Sklodowska-Curie project TRABIT agreement No 765148, the German Research Foundation No 428223038, and the NIH NINDS No R01NS112161.

References

[1] F. Barkhof et al. Imaging outcomes for neuroprotection and repair in multiple sclerosis trials. Nature Reviews Neurology, 5(5):256–266, may 2009.
[2] C.J. Azevedo et al. Thalamic atrophy in multiple sclerosis: A magnetic resonance imaging marker of neurodegeneration throughout disease. Annals of Neurology, 83(2):223–234, 2018.
[3] A.J. Thompson et al. Diagnosis of multiple sclerosis: 2017 revisions of the McDonald criteria. The Lancet Neurology, 17(2):162–173, 2018.
[4] A. Carass et al. Longitudinal multiple sclerosis lesion segmentation data resource. Data in Brief, 12:346–350, 2017.
[5] C.R.G. Guttmann et al. Quantitative follow-up of patients with multiple sclerosis using mri: Reproducibility. Journal of Magnetic Resonance Imaging, 9(4):509–518, 1999.
[6] G. Gerig et al. Exploring the discrimination power of the time domain for segmentation and characterization of active lesions in serial MR data. Medical Image Analysis, 4(1):31–42, 2000.
[7] P. Schmidt et al. Automated segmentation of changes in FLAIR-hyperintense white matter lesions in multiple sclerosis on serial magnetic resonance imaging. NeuroImage: Clinical, 23, 2019.
[8] R. McKinley et al. Automatic detection of lesion load change in Multiple Sclerosis using convolutional neural networks with segmentation confidence. NeuroImage: Clinical, 25, 2020.
[9] S.M. Smith et al. Accurate, Robust, and Automated Longitudinal and Cross-Sectional Brain Change Analysis. NeuroImage, 17(1):479–489, 2002.
[10] D. Smeets et al. Reliable measurements of brain atrophy in individual patients with multiple sclerosis. Brain and Behavior, 6(9), 2016.
[11] S.M. Smith et al. Normalized accurate measurement of longitudinal brain change. Journal of Computer Assisted Tomography, 25(3):466–475, 2001.
[12] D. Rey et al. Automatic detection and segmentation of evolving processes in 3D medical images: Application to multiple sclerosis. Medical Image Analysis, 6(2):163–179, 2002.
[13] M. Battaglini et al. Automated identification of brain new lesions in multiple sclerosis using subtraction images. Journal of Magnetic Resonance Imaging, 39(6):1543–1549, 2014.
[14] S. Jain et al. Two time point MS lesion segmentation in brain MRI: An expectation-maximization framework. Frontiers in Neuroscience, 10, 2016.
[15] D. García-Lorenzo et al. Review of automatic segmentation methods of multiple sclerosis white matter lesions on conventional magnetic resonance imaging. Medical Image Analysis, 17(1):1–18, 2013.
[16] S. Cerri et al. A Contrast-Adaptive Method for Simultaneous Whole-Brain and Lesion Segmentation in Multiple Sclerosis, 2020.
[17] K. Van Leemput. Encoding probabilistic brain atlases using Bayesian inference. IEEE Transactions on Medical Imaging, 28(6):822–837, 2009.
[18] J. Ashburner et al. Image registration using a symmetric prior - In three dimensions. Human Brain Mapping, 9(4):212–225, 2000.
[19] D.P. Kingma and M. Welling. Auto-encoding variational bayes, 2013.
[20] W.M. Wells et al. Adaptive segmentation of MRI data. IEEE Transactions on Medical Imaging, 15(4):429–442, 1996.
[21] K. Van Leemput et al. Automated model-based bias field correction of MR images of the brain. IEEE Transactions on Medical Imaging, 18(10):885–896, 1999.
[22] J.E. Iglesias et al. Bayesian longitudinal segmentation of hippocampal substructures in brain MRI using subject-specific atlases. NeuroImage, 141:542–555, 2016.
[23] M. Reuter et al. Within-subject template estimation for unbiased longitudinal image analysis. NeuroImage, 61(4):1402–1418, 2012.
[24] O. Puonti et al. Fast and sequence-adaptive whole-brain segmentation using parametric Bayesian modeling. NeuroImage, 143:235–249, 2016.
[25] V. Biberacher et al. Intra- and interscanner variability of magnetic resonance imaging based volumetry in multiple sclerosis. NeuroImage, 142:188–197, 2016.

	Subject 1						Subject 2						Avg
	GE		Philips		Siemens		GE		Philips		Siemens
Structure	Cross	Long	Cross	Long	Cross	Long	Cross	Long	Cross	Long	Cross	Long	Cross	Long
Lesions	3.79	3.90	4.78	3.79	6.27	3.39	2.84	4.04	2.57	1.95	2.35	1.22	3.77	3.05
Cerebral white matter	0.38	0.46	0.14	0.12	1.70	1.24	0.93	0.79	0.78	0.17	1.16	1.00	0.85	0.63
Cerebellum white matter	0.82	0.56	0.28	0.19	2.03	1.96	1.28	0.91	0.71	0.41	1.14	1.00	1.04	0.84
Cerebral cortex	0.53	0.50	0.50	0.44	1.50	1.84	0.60	1.08	0.30	0.40	0.74	1.10	0.70	0.89
Cerebellum cortex	0.38	0.34	0.31	0.36	1.13	1.25	0.61	0.63	0.42	0.36	0.86	0.57	0.62	0.58
Lateral ventricles	1.10	1.03	1.37	0.38	1.11	0.57	3.73	2.95	1.62	1.10	2.71	1.24	1.94	1.21
Hippocampus	1.03	0.67	0.55	0.43	0.84	1.46	1.88	1.39	0.55	0.57	1.37	0.91	1.04	0.90
Thalamus	0.44	0.44	0.52	0.31	1.21	0.54	0.92	0.79	0.50	0.35	1.02	0.36	0.77	0.46
Putamen	0.58	0.14	0.84	0.76	0.70	0.71	0.51	0.31	1.10	0.52	1.44	1.08	0.86	0.59
Pallidum	2.25	1.35	2.04	1.58	3.94	1.31	2.77	2.39	1.12	0.77	4.35	1.84	2.74	1.54
Caudate	0.80	1.09	0.96	0.90	1.00	0.84	1.79	1.04	0.92	0.54	0.61	0.41	1.01	0.80
Amygdala	1.51	0.47	0.42	0.20	1.85	1.04	1.17	0.64	1.05	0.41	0.60	0.42	1.10	0.53
Accumbens	1.84	1.27	1.77	1.40	1.80	1.39	1.96	0.69	0.80	0.80	1.73	0.80	1.65	1.06
Brain stem	0.70	0.53	0.32	0.19	1.10	0.93	0.78	0.59	0.54	0.34	0.61	0.63	0.67	0.53
Intracranial volume	0.30	0.06	0.13	0.18	0.53	0.56	0.46	0.06	0.22	0.08	0.52	0.36	0.36	0.22