To main content

The value of multiple data sources in machine learning models for power system event prediction

Abstract

We describe a method for assessing the value of additional data sources used in the prediction of unwanted events (voltage dips, earth faults) in the power system. Using this method, machine learning models for event prediction using (combinations of) different data sources are developed. The value of each data source is the improvement in model performance it brings. In addition, feature importance is retrieved using SHapley Additive exPlanations (SHAP). The methodology is applied to models that predict faults based on power quality and weather data. We find that models that combine sources outperform models using either in isolation. They predict ground faults and voltage dips with AUCs (Area Under Curve) of 0.74 and 0.80, respectively. Meteorological data appears more valuable than power quality data and the most important features are dew point, month of the year, and the power spectral density at 4.7 Hz
Read the publication

Category

Academic chapter

Language

English

Affiliation

  • SINTEF Digital / Sustainable Communication Technologies
  • SINTEF Energy Research / Energisystemer

Year

2021

Publisher

IEEE (Institute of Electrical and Electronics Engineers)

Book

2021 International Conference on Smart Energy Systems and Technologies - SEST

ISBN

9781728176604

View this publication at Norwegian Research Information Repository