This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Robust Modelling of Reflectance Pulse Oximetry for SpO2 Estimation

Sricharan Vijayarangan1∗, Prithvi Suresh1∗, Preejith SP1, Jayaraj Joseph1 and Mohansankar Sivaprakasam1,2 * Denotes Equal Contribution1 are with Healthcare Technology and Innovation Center (HTIC), Indian Institute of Technology (IIT-M), India [email protected]2 is with Department of Electrical Engineering, Indian Institute of Technology, Madras (IITM), India
Abstract

Continuous monitoring of blood oxygen saturation levels is vital for patients with pulmonary disorders. Traditionally, SpO2 monitoring has been carried out using transmittance pulse oximeters due to its dependability. However, SpO2 measurement from transmittance pulse oximeters is limited to peripheral regions. This becomes a disadvantage at very low temperatures as blood perfusion to the peripherals decreases. On the other hand, reflectance pulse oximeters can be used at various sites like finger, wrist, chest and forehead. Additionally, reflectance pulse oximeters can be scaled down to affordable patches that do not interfere with the user’s diurnal activities. However, accurate SpO2 estimation from reflectance pulse oximeters is challenging due to its patient dependent, subjective nature of measurement. Recently, a Machine Learning (ML) method was used to model reflectance waveforms onto SpO2 obtained from transmittance waveforms. However, the generalizability of the model to new patients was not tested. In light of this, the current work implemented multiple ML based approaches which were subsequently found to be incapable of generalizing to new patients. Furthermore, a minimally calibrated data driven approach was utilized in order to obtain SpO2 from reflectance PPG waveforms. The proposed solution produces an average mean absolute error of 1.81% on unseen patients which is well within the clinically permissible error of 2%. Two statistical tests were conducted to establish the effectiveness of the proposed method.
Clinical relevance— The proposed method ameliorates our current understanding of reflectance based pulse oximetry and provides a method to estimate SpO2 from reflectance pulse oximeters.

I INTRODUCTION

Pulse oximetry is a non intrusive method used to measure the amount of oxygen saturation in a person’s blood. Peripheral Capillary Oxygen Saturation (SpO2), an estimate of the amount of oxygen in the blood, is conventionally obtained by taking the ratio of volume of oxygenated haemoglobin to that of the total haemoglobin. This is estimated by passing light of two different wavelengths, generally Red (660 nm) and Infrared (IR)(890 nm), and analyzing the amount of light absorbed along the way [1]. This provides information about the pulsatile arterial blood flow.

The most prevalent type of pulse oximetry is transmittance pulse oximetry, wherein the light passes through the region of interest (traditionally the finger) and the required analysis is carried out. A relatively unconventional method is reflectance pulse oximetry [2], in which the intensity of the reflected light is analysed. Under a controlled clinical environment as described in Pälve et al. [3] and König et al. [4], the need and success of reflectance pulse oximetry over transmittance is well established.

Despite its clinical necessity, estimation of SpO2 from a reflectance photoplethysmogram requires a novel approach due to the inherent path difference between the red and IR wavelengths at the receiver. The variation in this path difference is not constant and depends upon the nature of one’s skin and tissue[5]. This invalidates the use of one-time calibration as done in transmittance pulse oximeters. As a result, a novel data driven approach to model reflectance pulse oximetry ,subsequently estimating SpO2, would enable a flexible yet robust approach that circumvents the current problems experienced by reflectance pulse oximetry

Venkat et al. [6] proposed a calibration-less Machine Learning (ML) approach to derive SpO2 from reflectance PPG waveforms. However, the number of data points corresponding to each SpO2 value obtained for the study is skewed in the favour of higher SpO2 values, implying that the algorithm was not tested across all possible SpO2 values. Furthermore, the proposed methodology was not tested on new patients. In order to arrive at a generalizable decision system centered around deriving SpO2 from reflectance pulse oximetry, our contributions are as follows:

  1. 1.

    We identified a data acquisition protocol that aims to acquire a wider range of values of SpO2 from the same patient in order to understand the variation of reflectance PPG waveform with SpO2.

  2. 2.

    We perform a comprehensive analysis on the data acquired and corroborate the requirement of a data-driven approach.

  3. 3.

    We examine previous ML methods and demonstrate its ineffectiveness in generalizing to unseen data.

  4. 4.

    We propose a method of estimation, requiring minimal calibration, that would generalize well to new patients. We quantify the estimation using Box plots and Bland Altman analysis.

II Design Methodology

II-A Data Acquisition Protocol

A clinical study was conducted to analyze the feasibility of estimating reflectance SpO2 measurements from transmittance SpO2. The hardware design used for all the experiments in the proposed method was the same as that used in Venkat et al. [6]. The device information, clinical protocols, and the study objectives were approved by the Institutional Review Board, Christian Medical College and Hospital (CMC), Vellore. Data was collected with informed consent from 28 subjects, suffering from various pulmonary disorders, with natural SpO2 levels without external aid varying from 60% to 100%. Reflectance probes were attached to the subject in the following locations: finger, wrist and forehead. Nellcor transmitance probe, which was used to obtain reference SpO2, was attached to the finger (same hand) when the data was collected.

Refer to caption
Figure 1: Data Collection Procedure

Arterial Blood Gas (ABG) test [7] was done on the subject, in parallel, to measure their SaO2. Prior to the ABG test, oxygen mask was removed from the subject to measure their natural SpO2 level. This period would correspond to a drop in the SpO2 levels of the patient. Once the ABG test was conducted, the supplemental oxygen supply was resumed which led to an increase in SpO2 levels. Note that the data collection (shown in Figure 1) was initiated a few minutes before the ABG test was conducted and was concluded a few minutes after the test. To the best of our knowledge, this is the first procedure in which a continuous wide range of SpO2 values were obtained for a single patient.

II-B Data Preprocessing

The transmittance and reflectance SpO2 obtained from the fingers with a sampling frequency of 600 Hz were preprocessed in a similar fashion. These steps are elucidated below:

  1. 1.

    Moving Average (MAV) Filter: The Red and IR signals were passed through a MAV filter with a window length of 50 samples to smoothen them, thus removing high frequency noise components.

  2. 2.

    Detrending: Since both the red and IR signals had exhibited baseline wander, detrending was carried out to ease the process of peak and valley detection.

  3. 3.

    Peak and valley Detection: Position of peaks and valleys were extracted from the detrended signals. Subsequently, erroneous peaks and valleys were discarded based on the average peak-to-peak and valley-to-valley distance.

  4. 4.

    Signal Quality Check: The signals were deemed fit based on the cross correlation between the red and IR signals.

The length of the window and the percentage overlap was empirically chosen to be 4 seconds and 25% respectively. This was decided based on the average change in the DC value within a window.222 code is available at https://github.com/prithusuresh/Reflectance-SPO2

II-C Problem Formulation

From each window in the transmittance waveform, a ratio was obtained by combining AC and DC components of both wavelengths, using the following formula

R_value=(REDAC/REDDC)(IRAC/IRDC)R\_value=\frac{(RED_{AC}/RED_{DC})}{(IR_{AC}/IR_{DC})}

The actual SpO2 was obtained from the R_valueR\_value, which is the standard method for SpO2 estimation. The goal of the proposed method is to estimate the SpO2 value by modelling the relationship between the reflectance PPG waveform and the transmittance SpO2.

Refer to caption
Figure 2: Distribution of SpO2 for overall data collected

II-D Exploratory Data Analysis (EDA)

Although the data acquisition obtained a wide range of SpO2 values across various patients, it is clear from Figure 2 that this approach does not reduce the massive imbalance against the lower values(<<75%) of SpO2. This was expected as it is prohibitively hard to allow a patient to have a low SpO2 level for longer periods.

Furthermore, a linear relationship between R_valueR\_value, calculated from the reflectance signal (reflectance-R_valueR\_value), and the corresponding SpO2, obtained from the transmittance signal (transmittance-SpO2), was observed for individual patients as shown in Figure 3. For each individual a unique straight line was obtained. Examples of this trend is shown on three patients who were randomly picked. However, there were two patients whose R_valueR\_value-SpO2 relationship deviated from the observation above. On closer examination of their PPG waveform, it was inferred that different reflectance-R_valueR\_values were getting mapped to the same transmittance-SpO2 values. The reason for the same is discussed in Section IV.

Refer to caption
Figure 3: Trends for SpO2 vs R for three patients selected at random from the train set. Note that these trends were obtained by fitting a line to the scatter plot

.

III Experiments and Results

III-A Examination of previous machine learning methods

Venkat et al. [6] proposed to model reflectance pulse oximetry by using a classifier wherein each target class corresponds to a rounded off SpO2 value. The data representing each SpO2 class was split into a train-validation set of 70%-30%, which was later accumulated. This was done to tackle the severe imbalance in the representation of lower SpO2 values. Venkat et al. showed that the bagging of decision trees provides the best accuracy across different values of SpO2. In order to test the ability of these models to generalize, the current work splits the data into three sets namely, train, validation and test sets. To deal with the imbalance, majority undersampling was utilized. Data obtained from 20 patients was split randomly into 10 folds of train and validation. Data from the rest of the patients was put into a test set. Both bagging (Random Forest Classifier and Random Forest Regressor) and boosting (Gradient boosting) were tried. Multiple features as given in Venkat et al. [6] were extracted from the transmittance waveform. The reported evaluation metrics are Average Mean Square Error (Avg MSE), Average Mean Absolute Error (Avg MAE) and the Average R2 Score (Avg R2 Score) across the validation and test set. Table I highlights the performance of multiple machine learning models on the validation set. Table II highlights the performance of all these models on the test set.

III-B Proposed Estimation Method

As the transmittance-SpO2 vary linearly with the reflectance-R_valuesR\_values (corroborated in Figure 3), we use a straight line to model the relationship between transmittance-SpO2 and reflectance-R_valueR\_value. Given a new patient, whose reflectance-R_valueR\_value and transmittance-SpO2 are known, the objective is now reduced to finding the best fit line. The training set was computed from the data of those patients (Π\Pi), whose SpO2 values varied over a range greater than 15, which totalled to 10 patients. The training set (τ\tau) consists of individual straight lines (lil_{i}) that best captured their respective reflectance-R_valueR\_value and transmittance-SpO2 relationship. Thus,

τ\tau = {lil_{i}} iΠ\forall\ i\ \in\ \Pi

The calibration set (γ\gamma) consists of five values of reflectance-R_valueR\_value and transmittance-SpO2 pairs. The number of points chosen in the calibration set is a trade off between an accurate representation of the new patient versus the calibration time. All these five points were values obtained between 90-95, as these are better represented in our dataset Figure 2. The lateral distance (LDLD), defined as the absolute difference in R_valueR\_value between each point in the γ\gamma and each line in τ\tau for the corresponding SpO2, is computed. A point is matched to a line in τ\tau if it has the least LDLD. The line lil_{i}, with the most number of matches, is selected to model the new patient. Table III highlights the results obtained using the proposed decision system on twelve unseen patients. Only these patients exhibited a SpO2 range over 10%. The other 6 patients displayed an inadequate range and were not considered.

TABLE I: Performance of ML models on Validation set
Model Avg MSE Avg MAE Avg R2 Score
RF Regressor 1.284 1.284 0.941
RF Classifier 0.941 0.580 0.928
Gradient Boosting 2.617 1.150 0.762
TABLE II: Performance of ML models on test set
Model Avg MSE Avg MAE Avg R2 Score
RF Regressor 8.213 6.557 -2.901
RF Classifier 8.213 6.557 -2.901
Gradient Boosting 5.175 4.490 -0.550
TABLE III: Mean Absolute errors across all patients
Patient Id MAE
1 3.18
2 1.80
3 3.28
4 1.40
5 0.92
6 2.79
7 1.08
8 1.55
9 0.66
10 1.87
11 1.75
12 1.39

IV Discussion

From Table I, it is observed that the ML methods do well on the validation set. This implies that the model seems to be performing well on the unseen SpO2 values of the same patient. However, Table II shows that the ML models do not generalize well to the test set. The poor performance of these models can be explained by the inherent difference in paths of the both the red and IR wavelengths at the receiver’s end for different patients. The path difference occurs as a result of difference in finger thickness, skin colour and other possible variances. These results show that the training set does not effectively capture all possible variations in SpO2 and that the features that were used were not discriminative of the path length in different people.

Table III demonstrates the effectiveness of the proposed method on the test set. The Mean Absolute Error (MAE) is computed for all the patients on the test set. The Average of the MAE across all patients is 1.81%. This significantly outperforms the best ML model which gave an Average MAE of 4.490. However, for certain patients it was observed that the MAE values were as high as 3.18. On closer analysis, it was observed that this was due to predictions on certain windows being highly erroneous, resulting in a skewed average value. This aberration is a result of abnormal R_valueR\_value computation due to a high DC component of the red wavelength. The high DC component caused mutliple reflectance-R_valueR\_values to be mapped to the same SpO2 values, as observed in section II-D. This observation can be corroborated by the box plot shown in Figure 4. It can be seen that the 75% quartile range lies within 2% for all except one patient. The few outliers had produced a large error thus increasing the average of the MAE. However, the box plot does not take into account the possibility of systematic and random errors or bias in the measurement and computation of SpO2 from the transmittance PPG waveform.

Refer to caption
Figure 4: Box plot of MAE’s for all windows across all patients in the unseen test set. y refers to actual SpO2 while y’ refers to the Predicted SpO2

A better understanding about the degree of agreement between the estimated SpO2 and the actual SpO2 that accounts for the errors in measurement of SpO2 is given by the Bland-Altman analysis. To this end, the Bland-Altman plots for four patients are shown in Figure 5. The Limits of Agreement (LoA) were defined as ±1.96SD\pm 1.96SD of the mean difference. Generally, it is recommended that the accurately predicted observations lie within the error band of ±2%\pm 2\%. Figure 5a and 5b show examples where the LoA is well within the error band indicating good estimations of SpO2. It is also clear that more than 95% of the points are within the LoA. These plots are representative of 83% of the patients. For two patients, due to the presence of large outliers, the LoA marginally eclipses the error band as shown in Figure 5c and 5d. Given more training data that covers a larger distribution and a robust preprocessing stage to discard signals with abnormally high values of DC amplitudes, the proposed method is likely to generalize even better to unseen data. Nevertheless, it provides a massive improvement over ML based methods across all patients.

Refer to caption
Figure 5: Bland Altman computed across all patients. y refers to the Actual SpO2 value and y’ refers to the predicted SpO2 value. The black,red and grey lines correspond to the means of differences, clinical threshold and the limits of agreement, respectively.

V CONCLUSIONS

This work proposes a novel method for estimating SpO2 from reflectance pulse oximetry with minimal calibration using SpO2 obtained from transmittance PPG signals. However, subsequent statistical analysis showed that the modelling could be improved with more data and better selectivity.

VI ACKNOWLEDGEMENT

The authors would like to acknowledge Dr Balamugesh and Dr Christopher DJ, Department of pulmonary medicine, Christian Medical College and Hospital (CMC), Vellore, who provided valuable expertise regrading the data and its acquisition process.

References

  • [1] M. S. Mortz, “System for pulse oximetry spo2 determination,” Aug. 10 1999, uS Patent 5,934,277.
  • [2] A. I. Eidelman, Y. Saroussi, and I. Y. Shemesh, “Reflectance pulse oximetry (rpo): A new tool for continous measurement of heart rate and oxygen saturation,” Pediatric Research, vol. 45, no. 2, pp. 16–16, 1999.
  • [3] H. Pälve, “Reflection and transmission pulse oximetry during compromised peripheral perfusion,” Journal of clinical monitoring, vol. 8, no. 1, pp. 12–15, 1992.
  • [4] V. König, R. Huch, and A. Huch, “Reflectance pulse oximetry–principles and obstetric application in the zurich system,” Journal of Clinical Monitoring and Computing, vol. 14, no. 6, pp. 403–412, 1998.
  • [5] Y. Mendelson and B. D. Ochs, “Noninvasive pulse oximetry utilizing skin reflectance photoplethysmography,” IEEE Transactions on Biomedical Engineering, vol. 35, no. 10, pp. 798–805, 1988.
  • [6] S. Venkat, M. T. A. PS, A. Alex, S. Preejith, D. Christopher, J. Joseph, M. Sivaprakasam et al., “Machine learning based spo2 computation using reflectance pulse oximetry,” in 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).   IEEE, 2019, pp. 482–485.
  • [7] M. D. Davis, B. K. Walsh, S. E. Sittig, and R. D. Restrepo, “Aarc clinical practice guideline: blood gas analysis and hemoximetry: 2013,” Respiratory care, vol. 58, no. 10, pp. 1694–1703, 2013.