To main content

Enhancement of a Hydrogen Incident and Accident Database Using Large Language Models

Abstract

Hydrogen holds significant potential for decarbonizing various industries, including energy and mobility. However, the limited availability of accident data poses a significant challenge to effective safety risk analysis and assessment. This study leverages large language models to address the critical task of filling gaps in the Hydrogen Incidents and Accidents Database (HIAD) 2.1, a prominent repository of hydrogen-related unwanted events. A three-step Artificial Intelligence-driven algorithm is proposed: (i) a preprocessing phase to standardize and prepare an event description, (ii) a processing phase utilizing OpenAI’s sentence embedding technology to extract semantic relationships, and (iii) an enhancement phase employing trained multilayer perceptrons to impute missing data. The algorithm demonstrates promising results in predicting categorical entries and is applied to enhance the entire database, with a specific focus on the 2019 fueling station fire in Sandvika (Norway). This case study highlights the proposed algorithm’s potential to improve our understanding of hydrogen-related incidents and contribute to enhanced risk management strategies.

Category

Academic chapter/article/Conference paper

Language

English

Author(s)

Affiliation

  • SINTEF Energy Research / Gassteknologi
  • Norwegian University of Science and Technology
  • University of Bologna
  • University of Rome 'La Sapienza'

Year

2025

Publisher

Research Publishing Services

Book

Stavanger ESREL SRA-E 2025 Proceedings : 35th European Safety and Reliability Conference and the 33rd Society for Risk Analysis Europe Conference, 15-19 June 2025, Norway

ISBN

978-981-94-3281-3

Page(s)

1185 - 1192

View this publication at Cristin