This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: School of computer science and technology, Harbin Institute of Technology, China 22institutetext: CVTE Research, China 33institutetext: Research Institute of Trustworthy Autonomous Systems, Southern University of Science and Technology, China 33email: [email protected] 44institutetext: Department of Radiation Oncology, Sun Yat-sen University Cancer Center, China

Weighted Concordance Index Loss-based Multimodal Survival Modeling for Radiation Encephalopathy Assessment in Nasopharyngeal Carcinoma Radiotherapy

Jiansheng Fang 112233    Anwei Li 22    Pu-Yun OuYang 44    Jiajian Li 22    Jingwen Wang 22    Hongbo Liu 22    Fang-Yun Xie 44    Jiang Liu Corresponding author. Co-first authors: Jiansheng Fang, Anwei Li, Pu-Yun OuYang.33
Abstract

Radiation encephalopathy (REP) is the most common complication for nasopharyngeal carcinoma (NPC) radiotherapy. It is highly desirable to assist clinicians in optimizing the NPC radiotherapy regimen to reduce radiotherapy-induced temporal lobe injury (RTLI) according to the probability of REP onset. To the best of our knowledge, it is the first exploration of predicting radiotherapy-induced REP by jointly exploiting image and non-image data in NPC radiotherapy regimen. We cast REP prediction as a survival analysis task and evaluate the predictive accuracy in terms of the concordance index (CI). We design a deep multimodal survival network (MSN) with two feature extractors to learn discriminative features from multimodal data. One feature extractor imposes feature selection on non-image data, and the other learns visual features from images. Because the priorly balanced CI (BCI) loss function directly maximizing the CI is sensitive to uneven sampling per batch. Hence, we propose a novel weighted CI (WCI) loss function to leverage all REP samples effectively by assigning their different weights with a dual average operation. We further introduce a temperature hyper-parameter for our WCI to sharpen the risk difference of sample pairs to help model convergence. We extensively evaluate our WCI on a private dataset to demonstrate its favourability against its counterparts. The experimental results also show multimodal data of NPC radiotherapy can bring more gains for REP risk prediction.

Keywords:
Dose-Volume Histogram Survival Modeling Multimodal Learning Loss Function Concordance Index.

1 Introduction

Radiotherapy is the standard radical treatment for nasopharyngeal carcinoma (NPC) and has considerably improved disease control and survival [3]. However, NPC radiotherapy may induce radiation encephalopathy (REP), which makes the brain suffer irreversible damage during the incubation period [22]. REP diagnosis remains challenging in clinical research due to its various clinical symptoms, insidious onset, long incubation period [6]. Currently, conventional magnetic resonance (MR) diagnosis can only discern REP at the irreversible stage [5]. Hence, it is highly desirable to assist clinicians in optimizing the radiotherapy regimen to reduce radiotherapy-induced temporal lobe injury (RTLI) according to the probability of REP onset. In this work, we study how to speculate REP risk in the pre-symptomatic stage by jointly exploiting the diagnosis and treatment data generated in NPC radiotherapy regimen, including computed tomography (CT) images, radiotherapy dose (RD) images, radiotherapy struct (RS) images, dose-volume histogram (DVH), and demographic characteristics.

Recently, the data-driven approach has attracted a wide range of attention in the field of NPC. For example, many works for NPC segmentation have shifted from traditional hand-engineered models [13, 14, 30] to automatic deep learning models [12, 19]. Currently, there are a few studies that use data-driven approaches to predict the onset of REP. Zhao et al. predict REP in NPC by analyzing the whole-brain resting-state functional connectivity density (FCD) of pre-symptomatic REP patients using multivariate pattern analysis (MVPA) [29]. Zhang et al. explore fractional amplitude of low-frequency fluctuation (fALFF) as an imaging biomarker for predicting or diagnosing REP in patients with NPC by using the support vector machine (SVM) [28]. However, existing studies mainly consider the contribution of hand-engineered features computed from MR images to the REP prediction. The lesion region of REP stemming from NPC radiotherapy usually lie in the bilateral temporal lobe, and its severity is positively related to the dose [24]. Hence, according to the clinical evaluation mechanism, by casting REP risk prediction as a survival analysis task, we aim to build a deep multimodal survival network (MSN) to learn valuable and discriminative features from multiple data types generated during NPC radiotherapy.

The data types of NPC radiotherapy can be grouped into two modalities: image and non-image data. In MSN, we instinctively design two feature extractors to capture the discriminative information from multimodal data. One feature extractor explores the image data by convolution module, and the other mines the one-dimensional data by multi-layer perception (MLP) module. Given the concordance index (CI) [9] is a standard performance metric for survival models, the existing balanced CI (BCI) loss function formulates the learning problem as directly maximizing the CI [21]. However, BCI exhibits poor predictive accuracy due to its sensitivity to the sample pairing per batch. Hence, we propose a novel weighted CI (WCI) loss function with a temperature hyper-parameter τ\tau to combat the sensitivity to uneven sampling and sharpen the relative risk of sample pairs, thus helping model convergence. We demonstrate our WCI advantage in terms of CI and 3-years area under the curve (AUC).

Our contributions are: (a) To advance REP risk prediction to assess the rationality of NPC radiotherapy regimen, we propose a deep multimodal survival network (MSN) equipped with two feature extractors to learn discriminative features from image and non-image data. (b) We propose a novel WCI loss function with a temperature hyper-parameter for our MSN training to effectively leverage REP samples of each batch and help model convergence. (c) We confirm the benefits of our WCI with other loss functions used for survival models by extensively experimenting on a private dataset.

2 Materials and Methods

2.1 NPC-REP Dataset Acquisition

Refer to caption
Figure 1: Schematic of image data in NPC-REP dataset. The upper row is a patient that confirms REP during follow-up, and the lower row is a patient without temporal lobe injury. The bilateral temporal lobe in MR images during follow-up place on column (MR). And by traceback, corresponding CT, RD, RS images generated in NPC radiotherapy lie in columns (CT), (RD), and (RS), respectively.

We acquire eligible 4,816 patients from a cancer center. The eligibility of enrolling in the NPC-REP dataset is that diagnosis and treatment data in NPC radiotherapy and follow-up are traceable and recorded. We binarize a label for each patient according to whether the REP confirmation by MR image diagnosis during follow-up. If diagnosing out RTLI, the label of NPC patient is REP, otherwise non-REP. At the same time, we record the time intervals (in Months) between NPC radiotherapy and REP confirmation or the last follow-up of non-REP. Then we trace back the diagnosis and treatment data during NPC radiotherapy for each patient. RD images show the radiation dose distribution to the human body during radiation therapy. Clinicians sketch the tumor outline in closed curve coordinate form to yield the RS image. Then we apply the mask of RS images to generate the input ROIs of RD and CT images. As Fig 1 shows, we crop out the identical regions of interest (ROI) for CT images for diagnosis and RD images according to regions of the bilateral temporal lobe in RS images. Then we feed the three 3D ROIs into the multimodal learning network for extracting visual features. It is confirmed that the imaging dose delivered during NPC radiotherapy is positively related to REP risk [24]. Hence, in addition to considering image data, we introduce 18 features manually calculated from DVH, which relates radiation dose to tissue volume in radiation therapy planning [8]. Moreover, we also observe the affection of demographic information (age, gender), clinic stages (T/N/TNM), and treatment options (radio-chemotherapy and radiotherapy) to REP risk. The detailed statistics of demographic and clinical characteristics for the NPC-REP dataset are shown in Appendix A1.

2.2 Multimodal Survival Modeling

Refer to caption
Figure 2: The architecture of our MSN with two feature extractors. Characteristics c1,2,6c_{1,2,\dots 6} denote age, gender, T stage, N stage, TNM stage and treatment option. d1,2,18d_{1,2,\dots 18} are manually calculated on DVH. FCs includes two fully-connected layers. GAP is a global average pooling layer. B is the batch size, and c indicates a concatenation operator.

We notice that most NPC patients with REP are diagnosed after radiotherapy for about 37 months. And it is almost impossible to occur REP when the follow-up time is less than two months and more than 73 months. Such a statistics observation about the onset time of REP illustrates that the risk probability of suffering REP is highly related to some factors in NPC radiotherapy. These factors cause the onset of REP in a relatively definite period. At the same time, we can cast REP assessment as a 3-years survival analysis task according to the average REP confirmation time intervals. Thus we build a deep multimodal survival network (MSN) to identify potential factors from the NPC-REP dataset, which consists of a binarized label, time intervals, image data (CT, RD, and RS images), and non-image data (24 features in total). As Fig 2 shows, one feature extractor learns visual features of 3D ROIs intercepted from CT, RD, and RS images, and the other explores the contributions of non-image data to REP risk. The 24 features of non-image data are directly fed into the MLP module to output the REP risk RnvR_{nv}. In the branch of visual feature extractor, we first apply convolution network as a backbone, followed by non-local attention capturing the global inter-dependencies [23]. Next, we use a global average pooling (GAP) to merge feature maps and two fully-connected layers (FCs) to generate the REP risk RvR_{v}. After generating the two risk probabilities for two modality data, we further utilize our WCI loss function to train the network. The combination of the two predicted probabilities is defined as follows:

R=wnvRnv+wvRv,R=w_{nv}R_{nv}+w_{v}R_{v}, (1)

where Rnv,Rv[0,1]R_{nv},R_{v}\in[0,1] and wnvw_{nv} and wvw_{v} are their corresponding weights. Both are initially set to 0.5. We can tune the two weights to observe the contribution of different modal data on assessing the REP risk.

2.3 WCI Loss Function

CI is a standard performance metric for survival models that corresponds to the probability that the model correctly orders a randomly chosen pair of patients in terms of event time [4, 25]. To calculate this indicator, we first pair all samples in the dataset of sample size nn with each other to get n(n1)/2n(n-1)/{2} sample pairs. Then, we filter out evaluation sample pairs from all sample pairs, as follows:

𝔼{(i,j)|(Oi=1TjTi)},\mathbb{E}\coloneqq\{(i,j)|(O_{i}=1\wedge T_{j}\geq T_{i})\}, (2)

where TiT_{i} and TjT_{j} are the time intervals of ithi^{th} and jthj^{th} NPC patients respectively, and Oi=1O_{i}=1 denotes that ithi^{th} NPC patient is observed as REP. 𝔼\mathbb{E} is the set of sample pairs (i,j)(i,j) based on the ordering of survival times, i.e., the time intervals of ithi^{th} NPC patient is shorter than jthj^{th} NPC patient. Assuming the REP risks predicted by models for sample pairs (i,j)(i,j) are RiR_{i} and RjR_{j}, we assert the prediction outcomes are correct if Ri>RjR_{i}>R_{j} and Ti<TjT_{i}<T_{j}. We mark those sample pairs whose predicted results coincide with the actual as a set 𝔼t\mathbb{E}_{t}. The CI value is indicated as the number ratio of 𝔼t/𝔼\mathbb{E}_{t}/{\mathbb{E}}. The higher CI explains away the better competence of survival models in predicting REP risks.

Because the assumptions that proportional hazards are invariant over time and log-linear limit the Cox loss function (See Appendix A2) used for survival models, BCI loss function (See Appendix A3) directly formulates the optimizing problem of survival models as maximizing the CI. The Cox depends only on the ranks of the observed survival time rather than on their actual numerical value [21]. Given the CI as a performance measure in survival analysis, BCI can effectively exploit the changes of proportional hazards with survival time. However, BCI is sensitive to uneven sampling, thus yielding poor predictive accuracy. Uneven sampling refers to the sample pairs meeting the constraints of Equation 2 in each batch failing to equally cover REP samples (Oi=1O_{i}=1) which are the key learning objectives. BCI balances CI loss for all REP samples in each batch by adopting a global average operation, not paying more attention to those REP samples with fewer sample pairs.

By referring to the weighted cross-entropy loss function, our WCI enhances the contributions of those REP samples with fewer sample pairs in each batch by a dual average operation, as follows:

wci=1NO=1i:Oi=1(1NTjTij:TjTie(RiRj)/τ),\mathcal{L}_{wci}=\frac{1}{N_{O=1}}\sum_{i:O_{i}=1}(\frac{1}{N_{T_{j}\geq T_{i}}}\sum_{j:T_{j}\geq T_{i}}e^{-(R_{i}-R_{j})/{\tau}}), (3)

where τ\tau is a temperature hyper-parameter [10, 26] and initially set as 0.1. NO=1N_{O=1} (outer) is the number of REP samples, and NTjTiN_{T_{j}\geq T_{i}} (inner) is the number of sample pairs of ithi^{th} REP sample. If the relation of predicted risk probabilities is Ri<RjR_{i}<R_{j} (wrong prediction), the CI loss is amplified, and if Ri>RjR_{i}>R_{j} (correct prediction), the CI loss infinitely approximates to zero. After tuning the risk difference, we use an inner average operation 1/NTjTi1/{N_{T_{j}\geq T_{i}}} and an outer average operation 1/NO=11/{N_{O=1}} to prevent fluctuation in the optimization process caused by uneven sampling per batch. The outer average operation is balanced for all REP samples, while the inner average operation assigns different weights for REP samples according to their number of sample pairs.

3 Experiments

3.1 Implementation Details

Dataset. We split the NPC patients in the NPC-REP dataset as a training set with 3,253 patients (264 REP and 2,989 non-REP), a validation set with 575 patients (43 REP and 532 non-REP), and a test set with 988 patients (61 REP and 927 non-REP). Moreover, we view left and right temporal lobes as independent samples during model training and inference.

Loss functions. We experiment with four loss functions to demonstrate the gains of our WCI, including standard cross-entropy (CE) [27], censored cross-entropy (CCE) [25], BCI [21], and Cox [16]. We adopt the survival time of 3-years (36 Months) for training and evaluation in terms of the time intervals in the NPC-REP dataset. For the CE, we define the label as 1 if REP confirmation and as 0 if the last follow-up time of non-REP is beyond 36 months. And the CCE extends CE used for classification models to train survival predictions with right-censored data by discretizing event time into intervals (0,36],(36,)(0,36],(36,\infty). The BCI maximizes a lower bound on CI, explaining the high CI-scores of proportional hazard models observed in practice. The Cox for every sample is a function of all samples in the training data and is usually used for fitting Cox proportional hazard models [2, 7].

Evaluation metrics. CI is the most frequently used evaluation metric of survival models [1, 11]. It is defined as the ratio of correctly ordered (concordant) pairs to comparable pairs. Two samples ii and jj are comparable if the sample with lower observed time TT confirmed REP during follow-up, i.e., if Tj>TiT_{j}>T_{i} and Oi=1O_{i}=1, where Oi=1O_{i}=1 is a binary event indicator. A comparable pair (i,j)𝔼(i,j)\in\mathbb{E} is concordant if the estimated risk RR by an assessment model is higher for NPC patients with lower REP confirmation time, i.e., Ri>RjTj>TiR_{i}>R_{j}\wedge T_{j}>T_{i}, otherwise the pair is discordant. We also report the time-dependent AUC [15, 20] for all loss functions, apart from CI.

Optimizer. Our MSN model is trained from scratch using the SGD optimizer for all compared loss functions. Specifically, the learning rate is decay scheduled by cosine annealing [18] setting 2e-4 as an initial value and 1e-3 from 5th epochs. We set the batch size as a multiple of 8 for all tasks, i.e., 128 (8 GPUs with 16 input images per GPU). The parameters of networks are optimized in 60 epochs with a weight decay of 3e-5 and a momentum of 0.8.

3.2 Results and Analysis

Refer to caption
Figure 3: Comparison of different loss functions on our MSN for the NPC-REP dataset in terms of CI and 3-years AUC (in %). DT6 covers age, gender, T/N/TNM stage, and treatment option, six features in total.

Multimodal data Analysis. The intention of acquiring treatment and diagnosis data in NPC radiotherapy launches on the confirmation that the imaging dose delivered during NPC radiotherapy is positively related to REP risk. Hence, we employ our MSN to jointly leverage non-image data (DVH+DT6) and images (CT+RD+RS) to predict REP risk. We report the CI and 3-years AUC in Fig 3 for compared loss functions used for training our MSN. We can observe that combining DVH and DT6 (orange line) brings more gains than DVH (blue line) alone or CT+RD+RS (green line), and using all three data types (red line) achieves the best predictive accuracy. Such experimental results can demonstrate the contributions of multimodal data of NPC radiotherapy to the improvement of REP prediction and give supportive evidence for the above confirmation. Next, we analyze the contribution proportion of two modality data: images and non-image data. We conduct an ablation study for Equation 1 to find the best weight ratio for two probabilities predicted by the two branches in our MSN. Table 1 shows the best CI (bold) achieved by setting the proportion of wnvw_{nv} and wvw_{v} as 0.7:0.3. The more contributions of non-image data than images coincide with the reports of Fig 3 in which non-image data (orange line) yields more gains than image data (green line).

Table 1: CI (in %) of different loss functions with varying weight ratios.
𝒘𝒏𝒗:𝒘𝒗\bm{w_{nv}:w_{v}} CE Cox BCI CCE WCI w/o τ\tau WCI (Ours)
0.1:0.90.1:0.9 79.54 79.33 79.47 79.65 80.25 80.53
0.3:0.70.3:0.7 79.59 80.54 80.58 80.61 80.77 81.27
0.5:0.50.5:0.5 80.01 81.47 81.66 81.54 81.66 81.89
0.7:0.30.7:0.3 80.79 81.96 82.05 82.06 82.24 82.52
0.9:0.10.9:0.1 80.65 81.32 81.42 81.50 81.53 81.62

Loss Function Analysis. After confirming the utility of multimodal data and analyzing the contributions of different modalities, we further discuss the benefits of our WCI according to the report in Fig 3. We interpret CI as the probability of correct ranking of REP occurring probability by casting predicting REP risk as a survival analysis task. Hence, CE used for learning classification models can not achieve sound accuracy without considering event times. Compared to CE, Cox can significantly improve the performance by exploiting the ranks of the observed survival time. On the other hand, Cox is beat by BCI due to ignoring the changes of proportional hazards with survival time. Although BCI enjoys the gain of directly maximizing CI, its performance is slightly lower than CCE due to sensitivity to uneven sampling. Identical to BCI, CCE also considers the contributions of the proportional hazard of sample pairs in learning the survival model. Based on the comparable performance of BCI and CCE, WCI without τ\tau can further obtain better performance by encouraging our MSN to effectively learn REP samples on the condition of uneven sampling per batch. By using τ\tau help model convergence, our WCI achieves the highest CI (82.52) and 3-years AUC (83.75) on two modality data (DVH+DT6+CT+RD+RS). Although WCI is designed initially for the CI indicator used for evaluating survival models, the best performance on 3-years AUC can also prove the excellent generalization of our MSN trained by our WCI. We also conduct McNemar tests [17] on CI between our WCI and other loss functions, including CE, Cox, BCI, CCE, and WCI w/o τ\tau. All comparisons in a p-value of less than 0.01 can support that our WCI brings significant improvement to CI.

Table 2: CI (in %) of different data types with varying τ\tau for our WCI loss function.
Data types 𝝉=𝟏𝟎\bm{\tau=10} 𝝉=𝟏\bm{\tau=1} 𝝉=0.1\bm{\tau=0.1} 𝝉=0.05\bm{\tau=0.05} 𝝉=0.02\bm{\tau=0.02}
DVH 80.23 80.27 80.45 80.23 79.38
DVH+DT6 80.85 80.94 81.02 80.65 80.69
CT+RD+RS 80.32 80.57 80.62 80.29 79.53
DVH+DT6+CT+RD+RS 80.93 82.24 82.52 81.16 80.76

Ablation studies of τ\tau. By casting REP assessment as a survival analysis task, our WCI exhibits manifest advantage over other loss functions by introducing the dual average operation and temperature hyper-parameter τ\tau. Apart from the dual average operation preventing our MSN from being sensitive to uneven sampling, we also utilize τ\tau to sharpen the risk difference of sample pairs to help model convergence. We conduct an ablation study for τ\tau to observe the heating and cooling effect of tuning the CI loss for our WCI. Table 2 shows that the best value of τ\tau used to train our MSN is 0.1 (bold). And when τ=1.0\tau=1.0, our WCI without imposing the tuning on CI achieves the second-highest CI of 82.24 (underline) over two modality data (DVH+DT6+CT+RD+RS). When τ>1.0\tau>1.0, it occurs the opposite effect. At the same time, the adjustment force should not be too large from the performance of setting τ=0.05\tau=0.05 or τ=0.02\tau=0.02. The ablation studies in Table 2 confirm that τ\tau can help model convergence by sharpening the risk difference. Theoretically, the significant risk difference helps improve the predicted certainty probability, i.e., reduces the entropy of probability distribution. Benefiting from the insensitive to uneven sampling and sharpening risk difference, our WCI exhibits more stability and convergence than BCI by comparing standard deviations of batch loss (See Appendix A4).

4 Conclusions

We present a novel WCI loss function to effectively exploit REP samples per batch by a dual average operation and help model convergence by sharpening risk difference with a temperature hyper-parameter. It is the first exploration of jointly leveraging multimodal data to predict the probability of REP onset to help optimize the NPC radiotherapy regimen. We acquire a private dataset and experiment on it to demonstrate the favorability of our WCI by comparing it to four popular loss functions used for survival models. We also confirm the contributions of multimodal data to REP risk prediction.

References

  • [1] Brentnall, A.R., Cuzick, J.: Use of the concordance index for predictors of censored survival data. Statistical methods in medical research 27(8), 2359–2373 (2018)
  • [2] Breslow, N.: Covariance analysis of censored survival data. Biometrics pp. 89–99 (1974)
  • [3] Cavanna, A.E., Trimble, M.R.: The precuneus: a review of its functional anatomy and behavioural correlates. Brain 129(3), 564–583 (2006)
  • [4] Chaudhary, K., Poirion, O.B., Lu, L., Garmire, L.X.: Deep learning–based multi-omics integration robustly predicts survival in liver cancer. Clinical Cancer Research 24(6), 1248–1259 (2018)
  • [5] Chen, Q., Lv, X., Zhang, S., Lin, J., Song, J., Cao, B., Weng, Y., Li, L., Huang, R.: Altered properties of brain white matter structural networks in patients with nasopharyngeal carcinoma after radiotherapy. Brain imaging and behavior 14(6), 2745–2761 (2020)
  • [6] Chen, Y.P., Chan, A.T., Le, Q.T., Blanchard, P., Sun, Y., Ma, J.: Nasopharyngeal carcinoma. The Lancet 394(10192), 64–80 (2019)
  • [7] Cox, D.R.: Partial likelihood. Biometrika 62(2), 269–276 (1975)
  • [8] Drzymala, R., Mohan, R., Brewster, L., Chu, J., Goitein, M., Harms, W., Urie, M.: Dose-volume histograms. International Journal of Radiation Oncology* Biology* Physics 21(1), 71–78 (1991)
  • [9] Harrell Jr, F.E., Lee, K.L., Mark, D.B.: Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Statistics in medicine 15(4), 361–387 (1996)
  • [10] He, K., Fan, H., Wu, Y., Xie, S., Girshick, R.: Momentum contrast for unsupervised visual representation learning. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 9729–9738 (2020)
  • [11] Heller, G., Mo, Q.: Estimating the concordance probability in a survival analysis with a discrete number of risk groups. Lifetime data analysis 22(2), 263–279 (2016)
  • [12] Huang, J.b., Zhuo, E., Li, H., Liu, L., Cai, H., Ou, Y.: Achieving accurate segmentation of nasopharyngeal carcinoma in mr images through recurrent attention. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 494–502. Springer (2019)
  • [13] Huang, K.W., Zhao, Z.Y., Gong, Q., Zha, J., Chen, L., Yang, R.: Nasopharyngeal carcinoma segmentation via hmrf-em with maximum entropy. In: 2015 37th annual international conference of the IEEE engineering in medicine and biology society (EMBC). pp. 2968–2972. IEEE (2015)
  • [14] Huang, W., Chan, K.L., Zhou, J.: Region-based nasopharyngeal carcinoma lesion segmentation from mri using clustering-and classification-based methods with learning. Journal of digital imaging 26(3), 472–482 (2013)
  • [15] Hung, H., Chiang, C.T.: Estimation methods for time-dependent auc models with survival data. Canadian Journal of Statistics 38(1), 8–26 (2010)
  • [16] Katzman, J.L., Shaham, U., Cloninger, A., Bates, J., Jiang, T., Kluger, Y.: Deepsurv: personalized treatment recommender system using a cox proportional hazards deep neural network. BMC medical research methodology 18(1), 1–12 (2018)
  • [17] Lachenbruch, P.A.: Mcnemar test. Wiley StatsRef: Statistics Reference Online (2014)
  • [18] Loshchilov, I., Hutter, F.: Sgdr: Stochastic gradient descent with warm restarts. arXiv preprint arXiv:1608.03983 (2016)
  • [19] Men, K., Chen, X., Zhang, Y., Zhang, T., Dai, J., Yi, J., Li, Y.: Deep deconvolutional neural network for target segmentation of nasopharyngeal cancer in planning computed tomography images. Frontiers in oncology 7,  315 (2017)
  • [20] Nuño, M.M., Gillen, D.L.: Censoring-robust time-dependent receiver operating characteristic curve estimators. Statistics in Medicine 40(30), 6885–6899 (2021)
  • [21] Steck, H., Krishnapuram, B., Dehing-Oberije, C., Lambin, P., Raykar, V.C.: On ranking in survival analysis: Bounds on the concordance index. In: Advances in neural information processing systems. pp. 1209–1216 (2008)
  • [22] Tang, Y., Zhang, Y., Guo, L., Peng, Y., Luo, Q., Xing, Y.: Relationship between individual radiosensitivity and radiation encephalopathy of nasopharyngeal carcinoma after radiotherapy. Strahlentherapie und Onkologie 184(10), 510–514 (2008)
  • [23] Wang, X., Girshick, R., Gupta, A., He, K.: Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 7794–7803 (2018)
  • [24] Wei, W.I., Sham, J.S.: Nasopharyngeal carcinoma. The Lancet 365(9476), 2041–2054 (2005)
  • [25] Wulczyn, E., Steiner, D.F., Xu, Z., Sadhwani, A., Wang, H., Flament-Auvigne, I., Mermel, C.H., Chen, P.H.C., Liu, Y., Stumpe, M.C.: Deep learning-based survival prediction for multiple cancer types using histopathology images. PLoS One 15(6), e0233678 (2020)
  • [26] Yi, X., Yang, J., Hong, L., Cheng, D.Z., Heldt, L., Kumthekar, A., Zhao, Z., Wei, L., Chi, E.: Sampling-bias-corrected neural modeling for large corpus item recommendations. In: Proceedings of the 13th ACM Conference on Recommender Systems. pp. 269–277 (2019)
  • [27] Zadeh, S.G., Schmid, M.: Bias in cross-entropy-based training of deep survival networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 43(9), 3126–3137 (2020)
  • [28] Zhang, Y.m., Kang, Y.f., Zeng, J.j., Li, L., Gao, J.m., Liu, L.z., Shi, L.r., Liao, W.h.: Surface-based falff: a potential novel biomarker for prediction of radiation encephalopathy in patients with nasopharyngeal carcinoma. Frontiers in Neuroscience p. 873 (2021)
  • [29] Zhao, L.M., Kang, Y.F., Gao, J.M., Li, L., Chen, R.T., Zeng, J.J., Zhang, Y.M., Liao, W.: Functional connectivity density for radiation encephalopathy prediction in nasopharyngeal carcinoma. Frontiers in oncology p. 2515 (2021)
  • [30] Zhou, J., Chan, K.L., Xu, P., Chong, V.F.: Nasopharyngeal carcinoma lesion segmentation from mr images by support vector machine. In: 3rd IEEE International Symposium on Biomedical Imaging: Nano to Macro, 2006. pp. 1364–1367. IEEE (2006)