
Fluorescence angiography classification in colorectal surgery: a preliminary report (2021)

Antonio S Soares [1,2], Sophia Bano, Neil T Clancy, Laurence B Lovat, Danail Stoyanov, Manish Chand

[1] Wellcome/EPSRC Interventional and Surgical Sciences (WEISS) Centre, University College London, 43-45 Foley Street, London, W1W 7JN, United Kingdom
[2] Division of Surgery and Interventional Sciences, University College London
[3] Department of Computer Science, University College London
[4] Department of Medical Physics and Biomedical Engineering, University College London
Abstract

Background: Fluorescence angiography has shown very promising results in reducing anastomotic leaks by allowing the surgeon to select optimally perfused tissue. However, subjective interpretation of the fluorescent signal still hinders broad application of the technique, as significant variation exists between different surgeons. Our aim was to develop an artificial intelligence algorithm to classify colonic tissue as ‘perfused’ or ‘not perfused’ based on intraoperative fluorescence angiography data.

Methods: A classification model with a ResNet architecture was trained on a dataset of fluorescence angiography videos of colorectal resections at a tertiary referral centre. Frames corresponding to fluorescent and non-fluorescent segments of colon were used to train a classification algorithm. Validation was performed using frames from patients not in the training set, including both data collected with the same equipment and data collected with a different camera. Performance metrics were calculated, and saliency maps were used to further analyse the output. A decision boundary was identified based on the tissue classification.

Results: A convolutional neural network was successfully trained on 1790 frames from 7 patients and validated on 30 frames from 14 patients. Accuracy was 100% on the training set and 80% on the validation set. Recall and precision were, respectively, 100% and 100% on the training set, and 68.8% and 91.7% on the validation set.

Conclusion: Automated classification of intraoperative fluorescence angiography with a high degree of accuracy is possible and allows automated decision boundary identification. This will enable surgeons to standardise the technique of fluorescence angiography. A web-based app was made available to deploy the algorithm.

keywords:
fluorescence angiography, computer vision, colorectal surgery


1 Introduction

Anastomotic leak is an important source of morbidity, mortality and rising healthcare costs in colorectal surgery [Ashraf2013]. Surgeons use fluorescence angiography to optimise the perfusion of the tissue used to create the anastomosis [Keller_ICG]. The use of fluorescence angiography has been shown to reduce anastomotic leaks in non-randomized studies [Blanco-Colino2018; Degett2016] and has been recommended by the European Association for Endoscopic Surgery [submitted]. The true utility of the technique remains unquantified.

The interpretation of the signal in fluorescence angiography is surgeon dependent and perhaps represents the actual ‘learning curve’ for this technique. These decisions show a high degree of variability, demonstrated for both static image [Soares, AS et al., Interobserver agreement study, under review] and dynamic video [hardy2021inter] assessments. Automating the interpretation of the signal would eliminate this variability and provide an alternative means of quantification, thereby leveraging the usefulness of surgical data science [Maier-Hein2022] in the clinical setting.

Convolutional neural networks are especially suited to image analysis. The inputs to these networks are the pixel values of a given image, and the outputs are the probabilities of the image belonging to each of a set of pre-defined classes.
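As an illustration of this input-output mapping, a minimal sketch is shown below. The two-class head, input size and untrained weights are illustrative assumptions, not the trained model described in this paper.

```python
# Sketch: an image (pixel values) goes in, class probabilities come out.
import torch
from torchvision import models

model = models.resnet34(weights=None)                 # ResNet34 backbone (weights omitted here)
model.fc = torch.nn.Linear(model.fc.in_features, 2)   # two-class output head
model.eval()

image = torch.rand(1, 3, 224, 224)                    # one RGB image as a tensor of pixel values
with torch.no_grad():
    logits = model(image)                             # raw class scores
    probs = torch.softmax(logits, dim=1)              # probabilities per pre-defined class
print(probs)                                          # e.g. tensor([[0.48, 0.52]])
```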

This paper describes the development of an artificial intelligence algorithm to automatically classify tissue based on perfusion using fluorescence angiography during colorectal surgery. A decision boundary is estimated based on the classification of different areas of the tissue of interest.

2 Methods

2.1 Dataset

Patients undergoing colorectal surgery that involved construction of an anastomosis were included in the study. De-identified data were collected between May and November 2019, as previously described by our group [soares2022multisensor]. Additional data used for validation were collected between January and August 2021. This was a convenience sample. In all cases, the bowel transection was performed following extracorporealisation of the bowel. Prior to transection, the bowel was placed on clean white swabs to act as ‘background’. A continuous sequence was recorded from the point of indocyanine green (ICG) injection; 10 mg of ICG was injected intravenously in all cases. The recording lasted until the operating surgeon decided fluorescence intensity was adequate to choose the point of proximal transection (maximal fluorescence intensity), as per current standard practice in fluorescence angiography (FA). The training data were collected using the PINPOINT camera (Stryker, Michigan, USA), and the validation data were collected with PINPOINT, Stryker 1688 (Stryker, Michigan, USA) and Arthrex (Arthrex Inc., Florida, USA) cameras. All patients provided written informed consent for video collection, and standard clinical practice was followed in all cases. There were no missing data. The TRIPOD statement [TRIPOD] was followed.

2.2 Model development

In the deep learning literature, convolutional neural networks (CNNs) have shown remarkable performance on classification tasks. Transfer learning was used, starting from a pre-trained ResNet34 model. The neural network was then fine-tuned using the Fast AI library in Python [howard2018fastai]. Before training, the data were augmented using randomly selected crops from the complete 1440 × 1080 pixel frames. Training ran for 4 epochs. A random selection of 20% of the training data was used for initial validation. A frame was classified as fluorescent when the probability output of the algorithm exceeded 80%. The final result is an algorithm that takes a colorectal image as input and outputs a binary decision: whether or not the given image contains fluorescence.
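A minimal Fast AI sketch of this training setup follows. The folder layout, class names, crop size and batch size are assumptions for illustration; only the ResNet34 backbone, 4 fine-tuning epochs, 20% validation split and 80% threshold come from the description above.

```python
# Sketch of the training setup, assuming frames sorted into class folders:
# frames/fluorescent/*.jpg and frames/not_fluorescent/*.jpg
from fastai.vision.all import *

path = Path("frames")
dls = ImageDataLoaders.from_folder(
    path,
    valid_pct=0.2,                      # random 20% of training data held out for initial validation
    item_tfms=RandomResizedCrop(460),   # random crop from the full 1440 x 1080 frame
    bs=16,
)
learn = vision_learner(dls, resnet34, metrics=accuracy)  # transfer learning from a pre-trained ResNet34
learn.fine_tune(4)                                       # 4 epochs of fine-tuning

# Classification rule: call a frame "fluorescent" only when the predicted
# probability for that class exceeds 80%.
pred_class, pred_idx, probs = learn.predict(PILImage.create("frame.jpg"))
is_fluorescent = probs[dls.vocab.o2i["fluorescent"]] > 0.8
```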

2.3 Model evaluation

The model was deployed through a web interface using the Gradio library and Huggingface Spaces (https://huggingface.co/spaces/asampaiosoares/fluorescence_id_app). This platform enabled serial testing of additional frames that were not used for training the algorithm. Saliency maps were created to analyse the predictions from the validation set.
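The exact saliency method is not specified here; a minimal sketch of one common choice, vanilla gradient saliency, is shown below under that assumption.

```python
# Sketch: vanilla gradient saliency - how strongly each input pixel
# influences the network's top class score.
import torch

def saliency_map(model, image):
    """Gradient of the top class score with respect to the input pixels."""
    model.eval()
    image = image.clone().requires_grad_(True)   # (1, 3, H, W) input tensor
    score = model(image).max()                   # top class score
    score.backward()                             # backpropagate to the pixels
    # Max over colour channels gives one importance value per pixel location.
    return image.grad.abs().max(dim=1)[0].squeeze(0)
```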

2.4 Decision boundary estimation

The estimation of the decision boundary was performed on the images considered positive for fluorescence in the validation set. These images were divided into 100-pixel-wide JPEG segments along the colonic longitudinal axis using ImageJ [Schneider2012] and input into the algorithm. Both the binary classification results and the associated probabilities were recorded for each case. The decision boundary was defined as the limit between the most distal area classified as fluorescent and the areas classified as not fluorescent.
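A minimal sketch of this procedure is given below; `classify_strip` stands in for the trained classifier, and the orientation of the longitudinal axis (fluorescent region on the left) is an assumption.

```python
# Sketch: slice the frame into 100-pixel-wide strips along the longitudinal
# axis, classify each strip, and place the boundary where the labels flip.
from PIL import Image

def estimate_boundary(frame_path, classify_strip, strip_width=100):
    frame = Image.open(frame_path)
    width, height = frame.size
    labels = []
    for x in range(0, width, strip_width):
        strip = frame.crop((x, 0, min(x + strip_width, width), height))
        labels.append(classify_strip(strip))   # True if classified "fluorescent"
    fluorescent = [i for i, lab in enumerate(labels) if lab]
    # Boundary: x coordinate where the most distal fluorescent strip ends
    # (assumes fluorescence lies toward the left of the frame).
    return (max(fluorescent) + 1) * strip_width if fluorescent else None
```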

2.5 Model deployment

The algorithm was deployed through Gradio, an online platform that allows users to upload images to be classified by the developed algorithm.
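A minimal sketch of such a Gradio deployment follows; the exported model filename is an assumption.

```python
# Sketch: serve the trained classifier behind a simple image-upload interface.
import gradio as gr
from fastai.vision.all import load_learner, PILImage

learn = load_learner("fluorescence_model.pkl")  # assumed export of the trained model

def classify(img):
    _, _, probs = learn.predict(PILImage.create(img))
    return dict(zip(learn.dls.vocab, map(float, probs)))  # class name -> probability

gr.Interface(fn=classify, inputs=gr.Image(), outputs=gr.Label()).launch()
```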

3 Results

A dataset of patients undergoing colorectal resection was created, with the characteristics shown in Table 1. Representative frames from the training set can be found in Figure 1.

                               Training set   Validation set
Number of patients             7              14
Number of frames               1790           30
Fluorescence-positive frames   19.6%          53.3%
Table 1: Dataset characteristics

3.1 Training and validation

Figure 1: Representative frames of the training set

Training took 21 minutes and 11 seconds. The validation on data not used in the training set was done manually using the Gradio app. The performance metrics can be seen in Table 2.

                              Recall/Sensitivity   Precision/PPV   Accuracy   F1 score
Training                      100.0%               100.0%          100.0%     100.0%
Validation - overall          68.8%                91.7%           80.0%      78.6%
Validation - internal data    60.0%                100.0%          80.0%      75.0%
Validation - external data    83.3%                83.3%           80.0%      83.3%
Table 2: Performance metrics for the classification algorithm. PPV: positive predictive value.
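As a consistency check (our inference, not figures reported above), the overall validation metrics are reproduced exactly by a confusion matrix of 11 true positives, 1 false positive, 5 false negatives and 13 true negatives over the 30 validation frames, 16 of which were fluorescence positive:

```python
# Arithmetic check of the overall validation row in Table 2
# (TP, FP, FN, TN inferred from the reported percentages).
tp, fp, fn, tn = 11, 1, 5, 13
recall = tp / (tp + fn)                             # 11/16 = 68.8%
precision = tp / (tp + fp)                          # 11/12 = 91.7%
accuracy = (tp + tn) / (tp + fp + fn + tn)          # 24/30 = 80.0%
f1 = 2 * precision * recall / (precision + recall)  # 78.6%
print(f"{recall:.1%} {precision:.1%} {accuracy:.1%} {f1:.1%}")
```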

Representative frames of true positive, true negative, false positive and false negative results can be seen in Figure 2, together with the respective saliency maps. These maps show the areas that contribute most to the activation of the neural network in a darker shade of red.

Figure 2: Representative frames of predictions performed by the algorithm

3.2 Decision boundary estimation

The decision boundary was estimated for all the positive cases used for validation, as shown in Figure 3.

Figure 3: Representative frame for decision boundary analysis. Green boxes denote segments of the image classified as fluorescent, grey boxes denote segments classified as not fluorescent, and the decision boundary between the two is shown in yellow.

4 Discussion

These results show that a neural network trained on clinical data achieved 100% accuracy on the training set. There was a drop in performance on validation data, including data acquired with different equipment, but accuracy remained at 80%.

The analysis of the saliency maps shows that, in the cases with false positive and false negative results, the pixels corresponding to the colon were not responsible for the main activation of the network. This leads us to hypothesize that an image analysis pipeline that first segments the colon from the background and then runs the classification algorithm on that output could lead to improved results, as sketched below. Based on previous work from our group [Soares, AS et al., Interobserver agreement study, under review] and others [hardy2021inter], significant variation exists between different surgeons’ assessments. Standardisation of fluorescence angiography is a necessary step to enable accurate comparison of results.
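A minimal sketch of this hypothesised two-stage pipeline; `segment_colon` and `classify` are assumed stand-ins, not implemented components of this work.

```python
# Sketch: segment the colon from the background first, then classify
# on the segmented region only.
def two_stage_prediction(frame, segment_colon, classify):
    # `frame` and the returned `mask` are assumed NumPy arrays of matching shape.
    mask = segment_colon(frame)    # binary mask of colon pixels
    colon_only = frame * mask      # suppress background (white swabs etc.)
    return classify(colon_only)    # classify using colon pixels alone
```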

Our algorithm provides an easily accessible tool, based on state-of-the-art deep learning techniques, that eliminates interobserver variability in interpreting the fluorescent signal. Previous efforts to standardise fluorescence angiography assessments have focused on quantitative measurements requiring post hoc signal processing [wada2017icg; son2019quantitative]. Our algorithm simplifies post hoc processing, as it was trained on data captured intraoperatively and is designed to produce a binary output (‘fluorescent’ vs ‘non-fluorescent’).

Some limitations should be addressed. Although a high number of frames was used for training, the validation set, while varied, was small. This should be mitigated by including data from more centres performing fluorescence angiography. Furthermore, some of the data collected was of low quality to begin with, which limited the usefulness of the algorithm; this should be interpreted with caution, as even an experienced surgeon would have some trouble analysing such data intraoperatively. A close collaboration between engineers and clinicians led to the development of the algorithm presented in this work. Further work should aim to enrich the dataset for further refinement of the algorithm. Clinical testing in a randomized clinical trial will be necessary before large-scale implementation can occur.

5 Conclusion

An artificial intelligence algorithm to classify colonic perfusion based on fluorescence angiography was developed using state-of-the-art deep learning techniques. This work paves the way for standardisation of fluorescence angiography in colorectal surgery.

References

  • (1) Ashraf, S.Q., Burns, E.M., Jani, A., Altman, S., Young, J.D., Cunningham, C., Faiz, O., Mortensen, N.J.: The economic impact of anastomotic leakage after anterior resections in English NHS hospitals: Are we adequately remunerating them? Colorectal Disease 15(4), 190–199 (2013). https://doi.org/10.1111/codi.12125
  • (2) Keller, D.S., Ishizawa, T., Cohen, R., Chand, M.: Indocyanine green fluorescence imaging in colorectal surgery: overview, applications, and future directions. The Lancet Gastroenterology & Hepatology 2(10), 757–766 (2017). https://doi.org/10.1016/S2468-1253(17)30216-9
  • (3) Blanco-Colino, R., Espin-Basany, E.: Intraoperative use of ICG fluorescence imaging to reduce the risk of anastomotic leakage in colorectal surgery: a systematic review and meta-analysis. Techniques in coloproctology 22(1), 15–23 (2018). https://doi.org/10.1007/s10151-017-1731-8
  • (4) Degett, T.H., Andersen, H.S., Gögenur, I.: Indocyanine green fluorescence angiography for intraoperative assessment of gastrointestinal anastomotic perfusion: a systematic review of clinical trials. Langenbeck’s Archives of Surgery 401(6), 767–775 (2016). https://doi.org/10.1007/s00423-016-1400-9
  • (5) Hardy, N.P., Dalli, J., Khan, M.F., Andrejevic, P., Neary, P.M., Cahill, R.A.: Inter-user variation in the interpretation of near infrared perfusion imaging using indocyanine green in colorectal surgery. Surgical Endoscopy, 1–8 (2021)
  • (6) Maier-Hein, L., Eisenmann, M., Sarikaya, D., März, K., Collins, T., Malpani, A., Fallert, J., Feussner, H., Giannarou, S., Mascagni, P., Nakawala, H., Park, A., Pugh, C., Stoyanov, D., Vedula, S.S., Cleary, K., Fichtinger, G., Forestier, G., Gibaud, B., Grantcharov, T., Hashizume, M., Heckmann-Nötzel, D., Kenngott, H.G., Kikinis, R., Mündermann, L., Navab, N., Onogur, S., Roß, T., Sznitman, R., Taylor, R.H., Tizabi, M.D., Wagner, M., Hager, G.D., Neumuth, T., Padoy, N., Collins, J., Gockel, I., Goedeke, J., Hashimoto, D.A., Joyeux, L., Lam, K., Leff, D.R., Madani, A., Marcus, H.J., Meireles, O., Seitel, A., Teber, D., Ückert, F., Müller-Stich, B.P., Jannin, P., Speidel, S.: Surgical data science – from concepts toward clinical translation. Medical Image Analysis 76, 102306 (2022) arXiv:2011.02284. https://doi.org/10.1016/j.media.2021.102306
  • (7) Soares, A.S., Bano, S., Clancy, N.T., Stoyanov, D., Lovat, L.B., Chand, M.: Multisensor perfusion assessment cohort study: Preliminary evidence toward a standardized assessment of indocyanine green fluorescence in colorectal surgery. Surgery (2022)
  • (8) Collins, G.S., Reitsma, J.B., Altman, D.G., Moons, K.G.M.: Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. Annals of Internal Medicine 162(1), 55–63 (2015). https://doi.org/10.7326/M14-0697. PMID: 25560714
  • (9) Howard, J., et al.: The fast.ai deep learning library. GitHub (2018)
  • (10) Schneider, C.A., Rasband, W.S., Eliceiri, K.W.: NIH Image to ImageJ: 25 years of image analysis (2012). https://doi.org/10.1038/nmeth.2089. http://www.nature.com/articles/nmeth.2089
  • (11) Wada, T., Kawada, K., Takahashi, R., Yoshitomi, M., Hida, K., Hasegawa, S., Sakai, Y.: ICG fluorescence imaging for quantitative evaluation of colonic perfusion in laparoscopic colorectal surgery. Surgical Endoscopy 31(10), 4184–4193 (2017)
  • (12) Son, G.M., Kwon, M.S., Kim, Y., Kim, J., Kim, S.H., Lee, J.W.: Quantitative analysis of colon perfusion pattern using indocyanine green (ICG) angiography in laparoscopic colorectal surgery. Surgical Endoscopy 33(5), 1640–1649 (2019)