Hierarchical Spatio-Temporal Uncertainty Quantification for Distributed Energy Adoption
Abstract
The rapid deployment of distributed energy resources (DER) has introduced significant spatio-temporal uncertainties in power grid management, necessitating accurate multilevel forecasting methods. However, existing approaches often produce overly conservative uncertainty intervals at individual spatial units and fail to properly capture uncertainties when aggregating predictions across different spatial scales. This paper presents a novel hierarchical spatio-temporal model based on the conformal prediction framework to address these challenges. Our approach generates circuit-level DER growth predictions and efficiently aggregates them to the substation level while maintaining statistical validity through a tailored non-conformity score. Applied to a decade of DER installation data from a local utility network, our method demonstrates superior performance over existing approaches, particularly in reducing prediction interval widths while maintaining coverage.
Index Terms:
Distributed energy resource, uncertainty quantification, spatio-temporal hierarchical prediction.
I Introduction
In recent years, the global energy landscape has experienced a transformative shift, primarily characterized by an extraordinary increase in renewable energy sources [1]. One of its primary contributors is distributed energy resources (DER), a decentralized approach to power generation that includes a variety of technologies such as solar panels, wind turbines, and small-scale hydroelectric systems [2].
While DER deployment has accelerated rapidly, it introduces significant spatio-temporal variability and uncertainties that vary across regions and evolve over time. Understanding these uncertainties is crucial for effective energy management [3], facilitating DER integration into existing power grids [4], identifying grid enhancement opportunities [5], and planning for future energy demands [6]. More importantly, such uncertainty quantification (UQ) must also be available at multiple spatial scales, so that stakeholders obtain a more comprehensive view of future adoption to support their downstream strategic decision-making. For example, granular circuit-level predictions enable precise operational decisions, such as real-time load balancing and resource allocation. Meanwhile, aggregated substation-level forecasts provide the broader perspective needed for long-term infrastructure investments, including capacity enhancement and resilience improvements.
However, achieving multilevel uncertainty quantification presents two main challenges: First, jointly predicting multiple spatial units (e.g., circuits) can result in overly conservative uncertainty intervals, potentially diminishing their practical utility. Excessively wide prediction intervals fail to provide actionable insights for specific operational needs. Second, while aggregating these predictions from circuit to substation level may seem straightforward, doing so without careful adjustment can result in prediction intervals that do not fully capture the true underlying uncertainty at the aggregate level. Simple summation of circuit-level uncertainties often overestimates variability, leading to either inflated risk assessments or misleading confidence in grid performance metrics.
To address these challenges, we propose a hierarchical spatio-temporal model based on conformal prediction [7] to predict the spatio-temporal uncertainty in DER growth. Our approach begins by generating circuit-level growth predictions along with corresponding uncertainty measures, and then aggregates these predictions to the substation level based on the grid topology. A key methodological contribution of our method is a novel non-conformity score tailored to this multilevel spatio-temporal predictive task. Our results show that this method maintains both statistical validity and efficiency across multiple spatial levels, ensuring that prediction intervals remain informative and practically useful.
Finally, we apply our method to a real-world dataset containing rooftop solar panel installation records in a major U.S. city over the last decade, collected in collaboration with the local utility. Our findings demonstrate the method’s empirical effectiveness: compared with state-of-the-art approaches, it produces narrower prediction intervals while maintaining coverage. This highlights the utility of our model in providing stakeholders with actionable insights for future strategic decisions.
II Related work
Forecasting demand for DER has gained increased attention in recent years, particularly in predicting power and electricity consumption [8]. Driven by the high-stakes nature of their downstream operations, a wide range of UQ techniques have been explored, such as probabilistic forecasting [9], interval forecasting [10], and deep learning approaches [11]. Due to the unique distributed network topology structure of these renewable energy infrastructures, much work has also explored how these inherent hierarchies can be respected or exploited in various forecasting applications including electricity grid management [12] and power generation forecasts [13].
Our work builds on these studies by incorporating conformal prediction (CP) [7], a distribution-free UQ approach. We account for the distribution shift [14] induced by time series by adopting the sequential CP framework [15]. Additionally, our method draws inspiration from Probabilistic Conformal Prediction [16, 17], which allows it to handle more complex DER forecasting problems by integrating probabilistic models. Our work is also related to hierarchical time series prediction [18].
III Problem Setup
Consider a utility network consisting of $N$ distribution circuits and $M$ distribution substations. Each substation serves as a “hub”, connecting and coordinating multiple circuits within the network. The network topology between circuits and substations is defined by a binary matrix $A \in \{0,1\}^{N \times M}$, where $A_{ij} = 1$ indicates that circuit $i$ is associated with substation $j$, and $A_{ij} = 0$ otherwise.
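As a small illustration of this topology matrix, the sketch below builds a binary circuit-by-substation membership matrix and uses it to sum circuit-level counts to the substation level. The names `circuit_to_substation`, `build_topology`, and `aggregate`, as well as the toy values, are assumptions made for illustration and are not taken from the utility data.

```python
def build_topology(circuit_to_substation, n_substations):
    """Return a binary matrix A with A[i][j] = 1 iff circuit i belongs to substation j."""
    A = [[0] * n_substations for _ in circuit_to_substation]
    for i, j in enumerate(circuit_to_substation):
        A[i][j] = 1
    return A


def aggregate(A, values):
    """Sum circuit-level values into substation-level totals (the product A^T v)."""
    n_substations = len(A[0])
    totals = [0] * n_substations
    for i, row in enumerate(A):
        for j, a_ij in enumerate(row):
            totals[j] += a_ij * values[i]
    return totals


# Toy network (hypothetical): circuits 0 and 1 feed substation 0,
# circuits 2, 3 and 4 feed substation 1.
circuit_to_substation = [0, 0, 1, 1, 1]
A = build_topology(circuit_to_substation, n_substations=2)

counts = [3, 5, 2, 0, 4]  # hypothetical installation counts per circuit
substation_counts = aggregate(A, counts)
```

Each row of the matrix contains exactly one nonzero entry here because every circuit in the toy example is served by a single substation; this is an assumption of the sketch.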
For each circuit $i$ at time $t$, let $y_{i,t}$ represent the number of DER installations (e.g., rooftop solar panels). Additionally, let $z_{i,t}$ denote a set of demographic covariates associated with circuit $i$ at time $t$; these covariates may include factors like population density, average income, or degree of urbanization in the circuit’s vicinity. To simplify notation, define $Y_t$ as the vector capturing DER installation counts across all circuits at time $t$, and $Z_t$ as the corresponding vector of demographic covariates. Suppose we have a calibration dataset, denoted by $\mathcal{D} = \{(X_t, Y_t)\}_{t=1}^{T}$, where $Y_t$ represents the DER installation counts at time $t$, and $X_t$ represents all observed predictor variables, including both the historical DER installation counts across circuits up to time $t-1$ and the demographic covariates at time $t$.
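Our objective is to construct prediction intervals for each circuit $i$ at future time $T+1$, defined by lower bounds $L_i$ and upper bounds $U_i$, which we collect in the vectors $L$ and $U$. Let $Y_{T+1}$ represent the DER installation counts across all circuits for the next time step, and $A^\top Y_{T+1}$ represent aggregated DER installation counts at the substation level, where $A$ is the binary circuit-to-substation topology matrix. For a specified confidence level $1-\alpha$, we require these prediction intervals to satisfy:
$$
\begin{aligned}
\mathbb{P}\big( [Y_{T+1}]_i \in [L_i,\, U_i] \big) &\ge 1-\alpha \quad \text{for every circuit } i, \\
\mathbb{P}\big( [A^\top Y_{T+1}]_j \in \big[\, [A^\top L]_j,\, [A^\top U]_j \,\big] \big) &\ge 1-\alpha \quad \text{for every substation } j,
\end{aligned}
\tag{1}
$$
where $[\cdot]_j$ denotes the $j$-th element of a vector.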
We evaluate our prediction intervals based on two criteria: (i) Validity: Ensuring the intervals meet the coverage requirements specified in (1); (ii) Efficiency: Minimizing the width of the prediction intervals. While wide intervals could trivially achieve the desired coverage, our goal is to produce narrow, informative prediction sets that remain statistically valid and practically useful for decision-making.
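Both criteria can be measured empirically once intervals and observed counts are available. The following is a minimal sketch, assuming the intervals are given as per-circuit lower and upper bounds for a single forecast period; the helper names (`coverage`, `mean_width`, `sum_by_group`) and all numbers are hypothetical. In practice the averages would also run over test periods.

```python
def coverage(lower, upper, truth):
    """Fraction of entries whose true value lies inside [lower, upper]."""
    hits = [lo <= y <= hi for lo, hi, y in zip(lower, upper, truth)]
    return sum(hits) / len(hits)


def mean_width(lower, upper):
    """Average interval width, used here as the efficiency measure."""
    return sum(hi - lo for lo, hi in zip(lower, upper)) / len(lower)


def sum_by_group(values, groups, n_groups):
    """Sum circuit-level values within each substation (one group index per circuit)."""
    totals = [0.0] * n_groups
    for v, g in zip(values, groups):
        totals[g] += v
    return totals


# Hypothetical example: 4 circuits, 2 substations, one forecast period.
groups = [0, 0, 1, 1]
lower = [1.0, 2.0, 0.0, 3.0]
upper = [4.0, 6.0, 2.0, 5.0]
truth = [3.0, 7.0, 1.0, 4.0]  # circuit 1 falls outside its interval

circuit_cov = coverage(lower, upper, truth)
agg_cov = coverage(
    sum_by_group(lower, groups, 2),
    sum_by_group(upper, groups, 2),
    sum_by_group(truth, groups, 2),
)
avg_width = mean_width(lower, upper)
```

The toy numbers are chosen so that one circuit is missed while both aggregated intervals still cover, which shows that circuit-level and substation-level coverage are distinct quantities and have to be checked separately.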
IV Proposed Method
We propose a novel method called “Hierarchical Spatio-Temporal Conformal Prediction” (HST-Conformal) based on the conformal prediction (CP) framework [7, 14, 15, 16] to predict the spatio-temporal uncertainty in DER growth. Given a fitted prediction model, the key idea of CP is to construct the uncertainty intervals as a wrapper function around the model’s predictions. The size of the interval is calibrated on a separate calibration dataset through the use of a non-conformity score, which measures the deviation of the model’s predictions from the designated uncertainty quantification objective. CP is completely distribution-free, meaning that no distributional assumptions need to be imposed as part of the algorithmic procedure, and thus has gained broad popularity in application settings where complex statistical interdependencies may be present between prediction variables.
Specifically, our approach begins by dividing the data into two parts: a training set and a calibration set. The training set is used to create a probabilistic model that simulates future scenarios across all circuits, while the calibration set enables us to evaluate prediction errors and fine-tune prediction intervals for each circuit to ensure valid coverage. One key challenge in this process is the hierarchical constraint of substation-level validity, as described in (1). This constraint has not been considered in prior CP literature, making existing non-conformity scores inadequate for achieving (1). To address this, we design a novel non-conformity score that integrates knowledge of the network topology, ensuring compatibility with this hierarchical structure. As a result, the prediction intervals achieve statistical validity without sacrificing much efficiency relative to existing non-conformity score designs.
IV-A Data Splitting and Model Training
We first partition the dataset at a specified cut-off time index. This separation allows us to use historical data to train a model and use recent data to assess and calibrate its predictions. The first part of the dataset, the training set, is used to fit a probabilistic prediction model. The second part, the calibration set, is reserved for calibration in the next phase, where we evaluate how well the model’s predictions match observed outcomes and adjust prediction intervals to achieve valid coverage.
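The temporal split can be sketched in a few lines. This is an illustrative example only: the record layout `(time_index, predictors, counts)`, the function name `split_by_time`, and the choice to place the cut-off period itself in the training part are assumptions of the sketch.

```python
def split_by_time(records, cutoff):
    """Split (time_index, predictors, counts) records at a cut-off time index.

    Records with time_index <= cutoff form the training part; later records
    form the calibration part, so calibration never sees the past it is
    supposed to be evaluated against.
    """
    train = [r for r in records if r[0] <= cutoff]
    calib = [r for r in records if r[0] > cutoff]
    return train, calib


# Hypothetical records for ten periods and two circuits.
records = [(t, None, [t, 2 * t]) for t in range(1, 11)]
train, calib = split_by_time(records, cutoff=7)
```

Splitting by time rather than at random keeps the calibration periods strictly after the training periods, which matches the forecasting setting in which only past data are available.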
In our analysis, we specify the probabilistic prediction model as a multivariate spatio-temporal Hawkes process [19, 20, 21]. It is a widely used statistical framework for modeling discrete events across space and time, making it well-suited for capturing DER adoption dynamics. Specifically, it is specified by a vector-valued conditional intensity function $\lambda^*(t)$, whose $i$-th entry $\lambda_i^*(t)$ represents the instantaneous rate at which installations occur in circuit $i$ at time $t$ given the history. Here, the star notation emphasizes its dependence on the observed events in the past.
The intensity combines an exponentially decaying kernel function, which captures the self-exciting effect of DER growth, with pairwise coefficients that quantify the spatial spillover effect between circuits. These parameters are designed to capture the customer-level triggering effect of DER adoption, which is a phenomenon in which individuals exhibit an increased likelihood of adopting DER equipment after observing their peers do so. This social contagion effect in shaping adoption behaviors has been well-documented in previous empirical studies [22]. A decay-rate parameter governs how quickly the intensity decays, accounting for the saturation effect, which occurs as the penetration rate of DER approaches a population-based limit. Multivariate spatio-temporal Hawkes processes can be fitted by maximizing the likelihood function over observed data [21].
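To make the roles of the baseline rate, the spillover coefficients, and the decay rate concrete, the sketch below evaluates a generic multivariate Hawkes intensity with an exponential kernel. It is a textbook form written under assumptions, not the fitted model: the function name `hawkes_intensity`, the parameters `mu`, `alpha`, and `beta`, and the toy event history are hypothetical, and covariates are omitted.

```python
import math


def hawkes_intensity(t, history, mu, alpha, beta):
    """Conditional intensity of each circuit at time t given past events.

    history: list of (event_time, circuit_index) pairs.
    mu:      baseline rate per circuit.
    alpha:   alpha[i][j] is the influence of an event in circuit j on circuit i.
    beta:    decay rate of the exponential kernel.
    """
    lam = list(mu)
    for t_event, j in history:
        if t_event < t:  # only strictly earlier events contribute
            kernel = beta * math.exp(-beta * (t - t_event))
            for i in range(len(mu)):
                lam[i] += alpha[i][j] * kernel
    return lam


# Hypothetical two-circuit example.
mu = [0.1, 0.2]
alpha = [[0.5, 0.1],
         [0.0, 0.3]]
history = [(0.0, 0), (1.0, 1)]
lam = hawkes_intensity(2.0, history, mu, alpha, beta=1.0)
```

In this form, an installation in one circuit raises the intensity of every circuit it is coupled to through `alpha`, and the increase fades at rate `beta`, which mirrors the peer effect described above.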


IV-B Calibration
For each data pair $(X_t, Y_t)$ in the calibration set, the calibration procedure operates as follows:
1. Simulate $K$ outcomes from the fitted multivariate spatio-temporal Hawkes process via the thinning algorithm [23]:
$$
\hat{Y}_t^{(1)}, \dots, \hat{Y}_t^{(K)},
$$
where each sample $\hat{Y}_t^{(k)}$ is one possible joint prediction across all circuits at time $t$.
2. Let $S$ be a binary matrix representing shared substation memberships, where each entry $S_{ii'} = 1$ if circuits $i$ and $i'$ are associated with the same substation, and $S_{ii'} = 0$ otherwise. The non-conformity score for circuit $i$ can be written as
$$
e_{i,t} = \min_{k = 1, \dots, K} \big\| S_i \odot \big( \hat{Y}_t^{(k)} - Y_t \big) \big\|_\infty, \tag{2}
$$
where $\odot$ is the elementwise product, and $S_i$ is the $i$-th row of $S$, indicating the indices of circuits that share the same substation with circuit $i$.
Here, the proposed non-conformity score in (2) represents the smallest Substation Maximum Error (SME) for circuit $i$ across all $K$ samples. For each circuit, the SME is computed using the infinity norm to capture the largest prediction error among circuits sharing the same substation, thus quantifying the worst-case deviation within each substation group.
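The score described above can be sketched as follows: for every simulated sample, take the largest absolute error among the circuits that share a substation with the circuit of interest, then keep the smallest such value over the samples. The function names, the `circuit_to_substation` mapping, and the toy numbers are hypothetical, and the standardization step is omitted.

```python
def shared_substation_matrix(circuit_to_substation):
    """S[i][k] = 1 iff circuits i and k are served by the same substation."""
    n = len(circuit_to_substation)
    return [
        [int(circuit_to_substation[i] == circuit_to_substation[k]) for k in range(n)]
        for i in range(n)
    ]


def nonconformity_scores(samples, truth, S):
    """Per-circuit score: smallest Substation Maximum Error over all samples.

    samples: list of simulated count vectors (one joint prediction each).
    truth:   observed count vector for the same period.
    """
    scores = []
    for S_i in S:
        sme_per_sample = [
            max(
                s_ik * abs(pred_k - y_k)  # errors outside the substation are masked to 0
                for s_ik, pred_k, y_k in zip(S_i, sample, truth)
            )
            for sample in samples
        ]
        scores.append(min(sme_per_sample))
    return scores


# Hypothetical example: circuits 0 and 1 share a substation, circuit 2 is alone.
circuit_to_substation = [0, 0, 1]
S = shared_substation_matrix(circuit_to_substation)
samples = [[4, 8, 1], [2, 5, 3]]  # two simulated joint outcomes
truth = [3, 5, 3]
scores = nonconformity_scores(samples, truth, S)
```

Circuits that share a substation receive the same score by construction, because their rows of the membership matrix are identical; this is what ties the circuit-level calibration to the substation-level requirement.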
IV-C Constructing Prediction Intervals
To construct prediction intervals for each circuit $i$, we first compute non-conformity scores from the calibration data using (2), collecting them into a set of calibration scores for that circuit.
The prediction intervals for circuit $i$ are then defined as
(3) |
where the threshold is the empirical quantile of the calibration scores at the specified confidence level, estimated using quantile regression to account for potential temporal distributional shifts [15]. Substation-level prediction intervals are constructed by aggregating the prediction bounds of associated circuits, that is, by summing the lower bounds and, separately, the upper bounds of the circuits associated with each substation. We incorporate standardization for both predictive errors in (2) and estimated quantiles in (3) to further enhance prediction efficiency.
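The construction can be illustrated with a simplified sketch. Three assumptions are made here and none should be read as the exact procedure: the quantile is the plain split-conformal empirical quantile rather than the quantile-regression estimate of [15]; the circuit interval is taken to be the hull of the sample-centred balls, as in probabilistic conformal prediction; and standardization is omitted. All names and numbers are hypothetical.

```python
import math


def conformal_quantile(scores, alpha):
    """Split-conformal quantile: the ceil((n + 1) * (1 - alpha))-th smallest score."""
    n = len(scores)
    rank = min(math.ceil((n + 1) * (1 - alpha)), n)  # clip when n is small
    return sorted(scores)[rank - 1]


def circuit_intervals(samples, q):
    """ASSUMED form: hull of the balls of radius q[i] centred at each sample."""
    n_circuits = len(samples[0])
    lower = [min(s[i] for s in samples) - q[i] for i in range(n_circuits)]
    upper = [max(s[i] for s in samples) + q[i] for i in range(n_circuits)]
    return lower, upper


def substation_intervals(lower, upper, circuit_to_substation, n_substations):
    """Sum circuit-level bounds within each substation."""
    agg_lo = [0.0] * n_substations
    agg_hi = [0.0] * n_substations
    for lo, hi, j in zip(lower, upper, circuit_to_substation):
        agg_lo[j] += lo
        agg_hi[j] += hi
    return agg_lo, agg_hi


# Hypothetical calibration scores for 3 circuits over 9 periods each.
# Circuits 0 and 1 share a substation, so their scores coincide.
calib_scores = [
    [1, 2, 1, 3, 2, 1, 2, 4, 1],
    [1, 2, 1, 3, 2, 1, 2, 4, 1],
    [0, 1, 0, 2, 1, 0, 1, 1, 0],
]
q = [conformal_quantile(s, alpha=0.25) for s in calib_scores]

samples = [[4, 8, 1], [2, 5, 3]]  # simulated outcomes for the forecast period
lower, upper = circuit_intervals(samples, q)
agg_lower, agg_upper = substation_intervals(lower, upper, [0, 0, 1], 2)
```

Under the hull assumption, whenever some sample lies within the threshold of the truth for every circuit of a substation, each of those circuits is covered, and so is their sum by the summed bounds. Lower bounds can be negative in this sketch; clipping at zero would be a natural refinement for count data.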


V Results: DER Adoption Forecast
We validate our proposed method using a comprehensive real-world dataset of rooftop solar panel installations from a local utility’s network. The dataset spans from 2010 to 2024, consisting of installation records distributed across circuits and substations. This extensive network coverage not only provides an ideal testbed for validating our predictive framework but also enables us to generate actionable insights for system operators. Our analysis delivers valuable decision support for both operational planning and strategic infrastructure management.
In our implementation of HST-Conformal, the parameters of the Hawkes process are randomly initialized and trained until convergence, for a fixed number of epochs and at a fixed learning rate, using the Adam optimizer.
V-A Data Description and Preprocessing
In partnership with the local utility, we analyze detailed records of DER installations at the customer level. Each installation record contains geo-location coordinates, application date, and network topology information, including associated pole, circuit, and substation identifiers. The spatial distribution and temporal evolution of these installations over the past decade are visualized in Figure 1.
For our analysis, we aggregate installation data at six-month intervals for each circuit $i$, where $y_{i,t}$ represents the total number of installations within circuit $i$ during time period $t$. To incorporate socioeconomic factors, we supplement our dataset with five covariates for each circuit $i$ at time $t$, sourced from both the utility and the US Census Bureau. These variables are key socioeconomic indicators, including the average number of power outages, average electrical load, mean electricity price, average education level, and median household income.
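The six-month aggregation step can be sketched as below. The record layout `(circuit identifier, application date)`, the function names, and the period boundaries (January to June and July to December) are assumptions made for illustration; the sample records are fabricated placeholders, not utility data.

```python
from collections import Counter
from datetime import date


def half_year_index(d, start_year):
    """Map a date to a six-month period index counted from January of start_year."""
    return 2 * (d.year - start_year) + (0 if d.month <= 6 else 1)


def aggregate_installations(records, start_year):
    """Count installations per (circuit, six-month period)."""
    counts = Counter()
    for circuit_id, application_date in records:
        counts[(circuit_id, half_year_index(application_date, start_year))] += 1
    return counts


# Placeholder customer-level records: (circuit identifier, application date).
records = [
    ("C1", date(2010, 3, 14)),
    ("C1", date(2010, 5, 2)),
    ("C1", date(2010, 9, 30)),
    ("C2", date(2011, 1, 8)),
]
counts = aggregate_installations(records, start_year=2010)
```

A `Counter` returns zero for circuit and period pairs with no installations, which is convenient because many circuits have empty periods early in the record.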
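Table I: Circuit-level validity (Val), substation-level validity (AggVal), and average interval size (Size) for each method on the full network (Full) and on a random half of its nodes (Half).

| Method | Full: Val | Full: AggVal | Full: Size | Half: Val | Half: AggVal | Half: Size |
|---|---|---|---|---|---|---|
| LSTM | No (-) | No (-) | - | No (-) | No (-) | - |
| VAR | No (56%) | No (45%) | 0.37 | No (68%) | No (79%) | 0.37 |
| GPR | No (83%) | Yes (96%) | 1.24 | No (83%) | Yes (98%) | 1.24 |
| QFR | Yes (93%) | Yes (99%) | 1.09 | Yes (93%) | Yes (100%) | 1.09 |
| HST-Conformal | Yes (99%) | Yes (100%) | 1.06 | Yes (99%) | Yes (95%) | 0.77 |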
V-B Numerical Evaluation
We conduct a comprehensive numerical experiment on the dataset, benchmarking against four state-of-the-art load/demand forecasting methods: (i) Linear Vector Autoregression (VAR) [24], (ii) Long Short-Term Memory (LSTM) [25], (iii) Gaussian Process Regression (GPR) [26], and (iv) Quantile Regression (QFR) [27]. We use the same significance level across all methods, and assess the validity and efficiency of constructed prediction intervals by evaluating the circuit-level validity (Val), substation-level validity (AggVal), and average interval size (Size). To ensure the robustness of our results, the evaluation is performed iteratively by computing one-step-ahead monthly prediction intervals in the test set under two trials: the first uses the full network, and the second uses a random sample of half of its nodes.
As shown in Table I, our method satisfies both circuit-level validity and aggregated validity, and it produces the narrowest intervals among the methods that satisfy both criteria. Figures 2 and 3 illustrate this performance at both circuit and substation levels, which we attribute to a robust predictive model configuration combined with a well-designed uncertainty quantification approach, underscoring its practical reliability for deployment within the utility service territory.


V-C Long-Term Forecast and Analysis
To capture the long-term trajectory of DER growth, we present forecasts spanning 2024 to 2050 using the proposed HST-Conformal model. Recognizing the influence of regional economic development on energy infrastructure, we enhance our model with additional socioeconomic predictors, including economic indicators, population demographics, and load usage patterns. Expert insights from the local utility further refine our model to ensure alignment with localized industry trends and infrastructure needs.
As illustrated in Figure 4, our forecasts reveal a rapid initial growth phase for DER adoption, which gradually slows as saturation is reached, with a plateau projected around 2036. This trend holds consistently across the base, low, and high forecast scenarios, reflecting a natural adoption limit as market penetration reaches peak capacity within the service area. At the substation level, our analysis identifies significant disparity in growth magnitude and uncertainty. Figure 5 offers a spatial perspective on this heterogeneity, highlighting that the degree of uncertainty (prediction interval width) is closely tied to each location’s base adoption rate. This pattern suggests that high-adoption substations are also areas of high forecast uncertainty, a critical insight for grid operators and planners.
These findings underscore the importance of strategic investment and planning in high-demand substations, as these regions are especially susceptible to compounded risks of elevated demand and forecast uncertainty. Our work emphasizes the utility of our method in delivering granular, hierarchical interval forecasts that can support informed decision-making, ensuring resilience in planning for grid infrastructure under variable adoption trajectories.
VI Conclusion
This paper addresses the critical need for multilevel uncertainty quantification in forecasting DER growth by developing a hierarchical spatio-temporal model. Our method jointly predicts circuit-level growth and aggregates them to the substation level, tackling challenges associated with overly conservative prediction intervals and excessive variability in aggregated forecasts. By tailoring non-conformity scores to the unique demands of spatio-temporal data, we ensured that our method achieves both statistical validity and practical relevance across spatial scales. Applied to the local utility’s DER installation data, our model demonstrated improved prediction interval efficiency without sacrificing coverage, offering a robust tool for energy stakeholders navigating the dynamic landscape of DER integration.
References
- [1] Q. Hassan, S. Algburi, A. Z. Sameen, J. Tariq, A. K. Al-Jiboory, H. M. Salman, B. M. Ali, M. Jaszczur et al., “A comprehensive review of international renewable energy growth,” Energy and Built Environment, 2024.
- [2] H. A. Rahman, M. S. Majid, A. R. Jordehi, G. C. Kim, M. Y. Hassan, and S. O. Fadhl, “Operation and control strategies of integrated distributed energy resources: A review,” Renewable and Sustainable Energy Reviews, vol. 51, pp. 1412–1420, 2015.
- [3] Y. Zhang, J. Wang, and Z. Li, “Uncertainty modeling of distributed energy resources: techniques and challenges,” Current Sustainable/Renewable Energy Reports, vol. 6, pp. 42–51, 2019.
- [4] F. Ren, Z. Wei, and X. Zhai, “A review on the integration and optimization of distributed energy systems,” Renewable and Sustainable Energy Reviews, vol. 162, p. 112440, 2022.
- [5] Q. Hassan, C.-Y. Hsu, K. Mounich, S. Algburi, M. Jaszczur, A. A. Telba, P. Viktor, E. M. Awwad, M. Ahsan, B. M. Ali et al., “Enhancing smart grid integrated renewable distributed generation capacities: Implications for sustainable energy transformation,” Sustainable Energy Technologies and Assessments, vol. 66, p. 103793, 2024.
- [6] V. Vahidinasab, “Optimal distributed energy resources planning in a competitive electricity market: Multiobjective optimization and probabilistic design,” Renewable energy, vol. 66, pp. 354–363, 2014.
- [7] H. Papadopoulos, K. Proedrou, V. Vovk, and A. Gammerman, “Inductive confidence machines for regression,” in Machine learning: ECML 2002: 13th European conference on machine learning Helsinki, Finland, August 19–23, 2002 proceedings 13. Springer, 2002, pp. 345–356.
- [8] M. Sharifzadeh, A. Sikinioti-Lock, and N. Shah, “Machine-learning methods for integrated renewable power generation: A comparative study of artificial neural networks, support vector regression, and Gaussian process regression,” Renewable and Sustainable Energy Reviews, vol. 108, pp. 513–538, 2019.
- [9] D. W. Van der Meer, J. Widén, and J. Munkhammar, “Review on probabilistic forecasting of photovoltaic power production and electricity consumption,” Renewable and Sustainable Energy Reviews, vol. 81, pp. 1484–1512, 2018.
- [10] Z. Shi, H. Liang, and V. Dinavahi, “Direct interval forecast of uncertain wind power based on recurrent neural networks,” IEEE Transactions on Sustainable Energy, vol. 9, no. 3, pp. 1177–1187, 2017.
- [11] H. Quan, A. Khosravi, D. Yang, and D. Srinivasan, “A survey of computational intelligence techniques for wind power uncertainty quantification in smart grids,” IEEE transactions on neural networks and learning systems, vol. 31, no. 11, pp. 4582–4599, 2019.
- [12] V. Almeida, R. Ribeiro, and J. Gama, “Hierarchical time series forecast in electrical grids,” in Information Science and Applications (ICISA) 2016. Springer, 2016, pp. 995–1005.
- [13] T. Silveira Gontijo and M. Azevedo Costa, “Forecasting hierarchical time series in power generation,” Energies, vol. 13, no. 14, 2020.
- [14] R. J. Tibshirani, R. Foygel Barber, E. Candes, and A. Ramdas, “Conformal prediction under covariate shift,” Advances in neural information processing systems, vol. 32, 2019.
- [15] C. Xu and Y. Xie, “Sequential predictive conformal inference for time series,” in International Conference on Machine Learning. PMLR, 2023, pp. 38707–38727.
- [16] Z. Wang, R. Gao, M. Yin, M. Zhou, and D. Blei, “Probabilistic conformal prediction using conditional random samples,” in International Conference on Artificial Intelligence and Statistics. PMLR, 2023, pp. 8814–8836.
- [17] M. Zheng and S. Zhu, “Optimizing probabilistic conformal prediction with vectorized non-conformity scores,” arXiv preprint arXiv:2410.13735, 2024.
- [18] R. J. Hyndman, R. A. Ahmed, G. Athanasopoulos, and H. L. Shang, “Optimal combination forecasts for hierarchical time series,” Computational Statistics & Data Analysis, vol. 55, no. 9, pp. 2579–2589, 2011.
- [19] Z. Dong, S. Zhu, Y. Xie, J. Mateu, and F. J. Rodríguez-Cortés, “Non-stationary spatio-temporal point process modeling for high-resolution COVID-19 data,” Journal of the Royal Statistical Society Series C: Applied Statistics, vol. 72, no. 2, pp. 368–386, 2023.
- [20] S. Zhu and Y. Xie, “Spatiotemporal-textual point processes for crime linkage detection,” The Annals of Applied Statistics, vol. 16, no. 2, pp. 1151–1170, 2022.
- [21] S. Zhu, R. Yao, Y. Xie, F. Qiu, Y. Qiu, and X. Wu, “Quantifying grid resilience against extreme weather using large-scale customer power outage data,” arXiv preprint arXiv:2109.09711, 2021.
- [22] B. Bollinger and K. Gillingham, “Peer effects in the diffusion of solar photovoltaic panels,” Marketing Science, vol. 31, no. 6, pp. 900–912, 2012.
- [23] Y. Ogata, “On Lewis’ simulation method for point processes,” IEEE Transactions on Information Theory, vol. 27, no. 1, pp. 23–31, 1981.
- [24] A.-H. Jung, D.-H. Lee, J.-Y. Kim, C. K. Kim, H.-G. Kim, and Y.-S. Lee, “Regional photovoltaic power forecasting using vector autoregression model in South Korea,” Energies, vol. 15, no. 21, p. 7853, 2022.
- [25] K. Wang, X. Qi, and H. Liu, “Photovoltaic power forecasting based LSTM-convolutional network,” Energy, vol. 189, p. 116225, 2019.
- [26] D. W. Van der Meer, M. Shepero, A. Svensson, J. Widén, and J. Munkhammar, “Probabilistic forecasting of electricity consumption, photovoltaic power generation and net demand of an individual building using Gaussian processes,” Applied Energy, vol. 213, pp. 195–207, 2018.
- [27] P. Lauret, M. David, and H. Pedro, “Probabilistic solar forecasting using quantile regression models,” Energies, vol. 10, no. 10, 2017.