Uncertainty Error Modeling for Non-Linear State Estimation With Unsynchronized SCADA and PMU Measurements
Abstract
Distribution systems of the future smart grid require enhancements to the reliability of distribution system state estimation (DSSE) in the face of low measurement redundancy, unsynchronized measurements, and dynamic load profiles. Micro phasor measurement units (PMUs) facilitate co-synchronized measurements with high granularity, albeit at an often prohibitively expensive installation cost. Supervisory control and data acquisition (SCADA) measurements can supplement PMU data, although they are received at a slower sampling rate. Further complicating matters is the uncertainty associated with load dynamics and unsynchronized measurements—not only are the SCADA and PMU measurements not synchronized with each other, but the SCADA measurements themselves are received at different time intervals with respect to one another. This paper proposes a non-linear state estimation framework which models dynamic load uncertainty error by updating the variances of the unsynchronized measurements, leading to a time-varying system of weights in the weighted least squares state estimator. Case studies are performed on the 33-Bus Distribution System in MATPOWER, using Ornstein–Uhlenbeck stochastic processes to simulate dynamic load conditions.
Index Terms:
Distribution system state estimation, micro-phasor measurement units, SCADA, uncertainty modelingI Introduction
Increasing power distribution system (PDS) complexity and distributed energy resource (DER) penetration present complicated challenges for the modern smart grid. As a consequence, great effort has been dedicated to bolster the reliability of distribution management systems (DMS) given the growing uncertainty associated with increasingly complicated load dynamics and the much improved, but nevertheless imperfect, synchronization of metering equipment [1].
This uncertainty error associated with unsynchronized measurements most significantly affects the distribution level, where non-observability issues due to measurement device scarcity at the low and medium voltage levels [2] further complicate obtaining a reliable snapshot of the system. This directly hinders distribution system state estimation (DSSE), which relies on system observability to ensure state variable estimation from the measurement set [3]. Pseudo measurements have been proposed to increase system observability artificially [4], however poor accuracy and reliability of these measurements may corrupt the DSSE, especially with the increasing complexity of PDS and DER penetration.
To compensate, advanced metering infrastructure (AMI) devices have been proposed to augment the measurement set. The work in [5] proposes a DSSE using AMI, although the measurements are considered synchronized. Supervisor control and data acquisition (SCADA) and micro-phasor measurement units (PMUs) have been used with AMI in [6] which, like the proposed work, models time-varying loads as Ornstein–Uhlenbeck stochastic processes. The referenced work uses linear state estimation (LSE) based on complex current and voltage measurements, leveraging the high granularity of PMUs to quickly obtain state variable estimates [7]. State estimation (SE) aims to obtain the complex voltages at each system bus based on measurement data, which is instrumental to power system monitoring [8]. In the referenced LSE approach, measurement variances are then updated based on the OU load model [9], improving SE results. Shortcomings of this implementation include high noise sensitivity. Further, the LSE’s necessary linearization of non-linear SCADA and AMI measurement functions introduces additional potential for error. The proposed model seeks to address these issues.
The idea for this work is to incorporate the uncertainty error associated with unsynchronized measurements into a two-step non-linear SE measurement model, rather than deciphering the cause of uncertainty error after the fact. The few high-granularity PMU measurements will be used to obtain synchronized active and reactive power injections at the PMU buses. Unsynchronized SCADA measurements, including active and reactive power flows and injections, will not require linearization. Case studies will explore various redundancy levels and SCADA sampling rates. Further, the co-asynchronicity among SCADA will be simulated by staggering their arrival times with respect to one another.
Three specific contributions of this paper towards the state-of-the-art include:
-
•
An enhanced two-step non-linear SE framework which includes uncertainty error in the measurement model.
-
•
A formal modeling of the LSE truncation error, which the proposed work circumvents.
-
•
Robustness to the asynchronicity between SCADA and PMU measurements, as well as the co-asynchronicity between SCADA. This is in contrast to traditional methods which treat measurements as synchronized.

II Background
II-A Non-Linear State Estimation
Classical power system SE follows a Weighted Least Squares (WLS) framework [8]. A power system with buses and measurements can be modeled as a set of non-linear algebraic equations in the following measurement model:
(1) |
where is the measurement vector, is the state variables vector, is a continuous non-linear differentiable function, and is the measurement error vector. Each measurement error is assumed to follow a zero mean Gaussian distribution. is the number of measurements while is the number of unknown state variables, i.e., the complex voltages at each bus.
In the traditional WLS approach, the best state vector estimate in (1) is determined by minimizing the weighted norm of the residual, represented with the cost function :
(2) |
where is the inverse covariance matrix of the measurements, otherwise known as the weight matrix. The solution to (2) is found through the iterative Newton-Raphson method. The linearized form of (1) then becomes:
(3) |
where is the Jacobian matrix of at the current state estimate , is the measurement vector correction and is the state vector correction. The WLS solution can further be understood geometrically [10] as the projection of onto the Jacobian space () by a linear projection matrix , i.e. :
(4) |
The SE can further be interpreted by visualizing the geometrical position of the measurement error in relation to the Jacobian range space (). Through decomposing the measurement vector space into a direct sum of () and H, it is possible to also decompose the measurement error vector into two components:
(5) |
The component is the detectable error, which is equivalent to the residual in the classical WLS model. The component is the undetectable component of the residual. lies in the space orthogonal to the Jacobian range space, while is hidden in the Jacobian space [8]. To formalize the impact of the undetectable component , the measurement Innovation Index () is introduced [11]:
(6) |
A low indicates that a large component of the error is not reflected in the residual. The composed measurement error () can then be formulated in terms of the residual and , after which the normalized is obtained:
(7) |
where is the -th measurement standard deviation.
The SE yields values based on measurements taken throughout the power system. Real measurements in this work are defined as those obtained from SCADA and PMUs. In addition, per the methodology in [12], synthetic measurements (SM) are artificially created in low redundancy areas considering measurement and -tuple of critical measurements. This serves to maintain a high global redundancy level (), i.e., the number of measurements divided by the number of state variables, by increasing the degrees of freedom, improving the reliability of the hypothesis test [8].
hypothesis testing is used for bad data detection during gross error (GE) analytics. The -based objective function value (8) is compared to a threshold. The value of this threshold depends on a chosen probability and the degrees of freedom , taken to be the number of measurements fed to the SE:
(8) |
If the value of is greater than or equal to the threshold, then a GE is detected. For this work, a two-step SE approach is used, illustrated in Fig. 1. The classical first step presented in [13] uses an empirical approach wherein all measurements are weighted equally proportional to the measurement magnitude, after which the is compared to . Only if does the procedure advance to the second SE step, during which the SE is repeated—this time with meter precision values for each measurement type per state-of-the-art methodologies [3]. This serves to strike a balance between GE detectability and deciphering the measurement(s) in error. Next, following [14], bad data is identified through analysis of the . This work seeks to improve upon the first-step of the classical two-step SE by replacing the empirical assumption with a continually updating model of the unsynchronized measurement uncertainty error.
II-B Dynamic Load Modeling
Due to the asynchronicity between PMU and SCADA measurements, as well as the co-asynchronicity among SCADA, load dynamics cause SCADA measurements to become obsolete, thus providing an inaccurate snapshot of the system state. This uncertainty error must be included into the error vector from (1) to better provide accurate state variables, error detection, and GE analytics. The drift in the system state due to load dynamics can be modelled as Ornstein–Uhlenbeck (OU) stochastic processes, which have been used to successfully reflect time-varying load profiles [15]. OU load modelling relies on the assumption that, although load demand profiles do fluctuate over time, they rarely deviate sharply from their previous value. Further, the OU is a mean-reverting process, making it an appropriate model for ascribing the uncertainty error to outdated measurement variance only.
A stochastic differential equation is used to represent the -th load apparent power, , where and are the -th load active and reactive power respectively:
(9) |
where 0 is the decay rate controlling to what degree the load varies from its initial value, 0 controls the scale of the noise, and denotes a complex Wiener process whose sample paths follow a continuous Gaussian process [16]. Statistical analysis of PMU data can be used to estimate these parameters [15]. The load variations can then be expressed as a discrete time process for every t PMU sample [6]:
(10) |
where t is the time when unsynchronized SCADA is received, and for and time indices, and is a complex random variable of zero mean and variance . Dynamic load uncertainty can substantially increase the in the absence of actual measurement error, especially when measurements are unsynchronized, as illustrated in Fig. 2. The variance of the discrete-time Gaussian variable can then be found recursively by:
(11) |
where . The updated variances of the unsynchronized SCADA measurements then constitute the time-varying inverse covariance matrix of the measurements, . The improvement in the due to the updating is observed in Fig. 3, as the should not exceed the threshold due to load dynamics only.
II-C Non-Linear Measurement Uncertainty
The classical WLS model in (1) does not consider the possibility of errors attributed to dynamic load measurement uncertainty. This model can be augmented by considering, for any -th measurement, a noiseless true measurement at time , which is related to a previous value plus random load variation , given by (12), whereas the noisy and out-of-date unsynchronized measurement, , is represented in (13):
(12) |
(13) |
where is the measurement noise of the same Gaussian distribution as in (1). At a given instance when the first step of the non-linear SE is performed, the precise value of is unobtainable; the best available measurement in time is . However, it is possible to approximate by assuming that is attributed only to load variations:
(14) |
The model in (14) is used for approximating the unsynchronized SCADA measurement variances, which are used to update the time-varying weight matrix.
The proposed work also seeks to formalize its assertion of preserving measurement fidelity. The work in [6] uses an OU process load modeling approach with variance update equations, however it relies on linearizing SCADA and AMI measurements. Unlike the proposed work, this linearization process introduces yet an additional error component exclusive to the LSE approach, which this work calls the linearization truncation error . To formalize this additive error, the Taylor series of the LSE measurement model can be developed as:
(15) |
By setting (15) equal to the standard SE measurement model , one obtains:
(16) |
By defining the linearized form of as , one finally obtains:
(17) |
where is the measurement error and is the LSE additive truncation error, which contributes to SE inaccuracies. The proposed model does not require measurement function linearization, and thus preserves measurement fidelity by avoiding the error component . Further, in the referenced LSE approach, additional error is introduced in the calculation of linearized measurement covariances, which must be approximated using the truncated Taylor series of their functions [6]. The proposed work thus further preserves measurement fidelity by circumventing measurement covariance approximation.
III Case Study
Validation was performed in MATPOWER on the 12.66 kV 33-bus distribution system from Baran and Wu [17]. OU processes were used to model each load using a value of 0.0125, as obtained from [15]. An optimal number of 11 PMUs was selected per the methodology in [18]. SCADA sampling rates of 1, 2, and 4 seconds were considered. To simulate asynchronicity, the SCADA measurements were organized into 33 groups. If a PMU is present at a given bus, then only the active and reactive power flows to and from that bus are considered as SCADA measurements for that group. Otherwise, the active and reactive power injections are also considered as SCADA for that group. The remaining power injections are obtained through the PMUs or SM.
For a given SCADA sampling rate, maximum asynchronicity was simulated by staggering the SCADA arrival rates as far apart as possible by a factor of integer division , where is the PMU sampling rate and are the number of SCADA groups. To test performance with smaller data sets, two separate measurement plans were considered of 180 and 195 measurements ( and respectively). = 0.95 was considered for the threshold, however this can be customized depending on desired dependability/security trade-off. Random errors with distribution were applied to all measurements.
The first SE iteration is performed once SCADA information from every group is collected, along with the PMU data. In the classical two-step SE approach described in Section II-A, an empirical approach is considered in which all measurements are considered as possibly having GEs, with each measurement standard deviation being a percentage of the measurement magnitude [19]. In the proposed work, however, the classical empirical approach is now replaced with the uncertainty error modeling embedded into the time-varying weight matrix , which is continually updated using the variances obtained from (11).
If left unmodeled, complications of the added uncertainty error to the error vector include: 1.) erroneous triggering of protection and control devices [20] due to the value exceeding the threshold—a process that should be reserved for detecting measurement errors [8] during GE analytics—and 2.) reduced state estimate accuracy. Therefore, the case study will focus on these two metrics.


To quantify improved state variable accuracy, the metric is defined as the L2-norm of the estimate error:
(18) |
where is the number of buses, is the actual voltage magnitude at the -th bus with zero error, and is the estimated voltage magnitude at the -th bus.
Three cases were explored for this analysis. An “Ideal SE” case considers perfect synchronicity across PMU and SCADA, meaning that the only error introduced into the SE process comes from measurement noise; uncertainty error is absent. Although unrealistic, the ideal case provides comparative insight between the traditional and proposed unsynchronized models. The “Traditional SE” case considers asynchronicity across all measurements with a static weight matrix . Finally, the “Proposed SE” employs the updating time-varying weight matrix . Each simulation was run for 6 hours in MATPOWER with varying OU load dynamics.
Fig. 4 illustrates the improvement of the proposed model’s state variable accuracy by plotting the cumulative sum of the metric, showing a 37.7746% decrease in the for moderate SCADA asynchronicity. As expected, the Proposed SE improved upon state variable accuracy by updating the measurement variances, as opposed to the Traditional SE which contains no such uncertainty modeling.
Next, the improvement in false positive rate (FPR) of the proposed model was evaluated. FPR refers to the erroneous triggering of the GE analytics process, which could result in a measurement being falsely deleted from the measurement set per standard SE practices [8], hindering the reliability of hypothesis testing. In a worse case scenario, this could lead to false activation of the protection and control apparatus [20]. Table I compares the FPRs across three different SCADA sampling rates at maximum asynchronicity and a of 3, where Table II reduces the measurement set to a of 2.769. A substantial decrease in FPR was observed with variance updating, which in turn reduced the proportion of uncertainty error in the error vector .
SCADA Sampling Rate | FPR Without Update | FPR With Update |
---|---|---|
1 s | 1.32% | 0.00% |
2 s | 11.81% | 0.00% |
4 s | 32.24% | 0.98% |
SCADA Sampling Rate | FPR Without Update | FPR With Update |
---|---|---|
1 s | 47.17% | 0.00% |
2 s | 64.49% | 5.23% |
4 s | 81.22% | 12.84% |

IV Summary and Conclusions
This work presents a two-step non-linear SE framework which includes load dynamics-induced uncertainty error into the measurement model. The performance of the proposed method was evaluated with worst case scenario asynchronicity, varying degrees of unsynchronized SCADA arrival time, and a reduced input measurement data set, showcasing its reliability and practicality compared to conventional SE. Two major improvements were observed in the proposed work when compared to conventional SE: increased accuracy of state variable estimates and decreased FPR associated with erroneous triggering of the bad data detection step. Further, a formalizing of the truncation error associated with a previous LSE approach with OU load modeling was presented, which the proposed work circumvents by preserving the non-linear measurement functions, leading to more accurate state variable estimates and robustness to approximation error.
References
- [1] F. Jiyuan and S. Borlase, “The evolution of distribution,” IEEE Power Energy Mag., vol. 7, no. 2, 2009.
- [2] A. G. Expósito and A. Abur, Power System State Estimation: Theory and Implementation. Marcel Dekker, 2004.
- [3] A. Monticelli, State estimation in electric power systems: a generalized approach. Springer, 2012.
- [4] A. K. Ghosh, D. L. Lubkeman, and R. H. Jones, “Load modeling for distribution circuit state estimation,” IEEE Trans. Power Del., vol. 12, no. 2, p. 999–1005, 1997.
- [5] M. Baran and T. E. McDermott, “Distribution system state estimation using ami data,” Proc. IEEE/PES Power Syst. Conf. Expo. (PSCE), pp. 1–3, 2009.
- [6] R. D. Trevizan, C. Ruben, A. Rossoni, S. C. Dhulipala, A. Bretas, and N. G. Bretas, “pmu-based temporal decoupling of parameter and measurement gross error processing in dsse,” electricity, vol. 2, pp. 423–438, 2021.
- [7] A. G. Phadke and J. S. Thorp, Synchronized Phasor Measurements and Their Applications. Springer, 2008.
- [8] A. Bretas, N. Bretas, J. B. London Jr, B. Carvalho et al., Cyber-Physical Power Systems State Estimation. Elsevier, 2021.
- [9] E. Bibbona, G. Panfilo, and P. Tavella, “The ornstein–uhlenbeck process as a model of a low pass filtered white noise,” Metrologia, vol. 45, no. 6, 2021.
- [10] N. G. Bretas, S. A. Piereti, A. S. Bretas, and A. C. P. Martins, “A geometrical view for multiple gross errors detection, identification, and correction in power system state estimation,” IEEE Transactions on Power Systems, vol. 28, no. 3, p. 2128–2135, 2013.
- [11] N. Bretas, A. Bretas, and S. A. R. Piereti, “Innovation concept for measurement gross error detection and identification in power system state estimation,” IET Generation, Transmission and Distribution, vol. 5, no. 6, p. 603–608, 2011.
- [12] A. Bretas, A. Rossoni, R. Trevizan, and N. Bretas, “Distribution networks nontechnical power loss estimation: A hybrid data-driven physics model-based framework,” Electric Power Systems Research, vol. 186, p. 106397, 2020.
- [13] N. Bretas and A. Bretas, “A two steps procedure in state estimation gross error detection, identification, and correction,” International Journal of Electrical Power & Energy Systems, vol. 73, p. 484–490, 2015.
- [14] A. Bretas, N. Bretas, and B. Carvalho, “Further contributions to smart grids cyber-physical security as a malicious data attack: Proof and properties of the parameter error spreading out to the measurements and a relaxed correction model,” International Journal of Electrical Power Energy Systems, vol. 104, p. 43–51, 2019.
- [15] C. Roberts, E. M. Stewart, and F. Milano, “Validation of the ornstein-uhlenbeck process for load modeling based on µpmu measurements,” Power Systems Computation Conference, 2016.
- [16] R. Zárate-Miñanoa, M. Anghel, and F. Milano, “Continuous wind speed models based on stochastic differential equations,” Applied Energy, vol. 104, pp. 42–49, 2013.
- [17] M. E. Baran and F. F. Wu, “Network reconfiguration in distribution systems for loss reduction and load balancing,” IEEE Transactions on Power Delivery, vol. 4, no. 2, pp. 1401–1407, 1989.
- [18] X. Chen, L. Sun, T. Chen, Y. Sun, Rusli, K. J. Tseng, K. V. Ling, W. K. Ho, and G. A. J. Amaratunga, “Full coverage of optimal phasor measurement unit placement solutions in distribution systems using integer linear programming,” energies, vol. 12, no. 8, 2019.
- [19] N. G. Bretas, A. S. Bretas, and A. C. P. Martins, “Convergence property of the measurement gross error correction in power system state estimation, using geometrical background,” IEEE Transactions on Power Systems, vol. 28, no. 4, p. 3729–3736, 2013.
- [20] S. H. Horowitz and A. G. Phadke, Power System Relaying. Wiley, 2013.