To main content

Enhancement of a Hydrogen Incident and Accident Database Using Large Language Models

Abstract

Hydrogen holds significant potential for decarbonizing various industries, including energy and mobility. However, the limited availability of accident data poses a significant challenge to effective safety risk analysis and assessment. This study leverages large language models to address the critical task of filling gaps in the Hydrogen Incidents and Accidents Database (HIAD) 2.1, a prominent repository of hydrogen-related unwanted events. A three-step Artificial Intelligence-driven algorithm is proposed: (i) a preprocessing phase to standardize and prepare an event description, (ii) a processing phase utilizing OpenAI’s sentence embedding technology to extract semantic relationships, and (iii) an enhancement phase employing trained multilayer perceptrons to impute missing data. The algorithm demonstrates promising results in predicting categorical entries and is applied to enhance the entire database, with a specific focus on the 2019 fueling station fire in Sandvika (Norway). This case study highlights the proposed algorithm’s potential to improve our understanding of hydrogen-related incidents and contribute to enhanced risk management strategies.
Read the publication

Category

Academic chapter

Language

English

Author(s)

Affiliation

  • SINTEF Energy Research / Gassteknologi
  • University of Bologna
  • University of Rome 'La Sapienza'
  • Norwegian University of Science and Technology

Year

2025

Publisher

Research Publishing Services

Book

Stavanger ESREL SRA-E 2025 Proceedings : 35th European Safety and Reliability Conference and the 33rd Society for Risk Analysis Europe Conference, 15-19 June 2025, Norway

ISBN

9789819432813

Page(s)

1185 - 1192

View this publication at Norwegian Research Information Repository