From Checklists to Chatbots: Reimagining HRA with Generative AI
Abstract
This paper evaluates the capability of Large Language Models (LLMs) to support Human Reliability Assessment (HRA) through a systematic test using the Integrated Human Event Analysis System for Event and Condition Assessment (IDHEAS-ECA) methodology. Using Claude Opus 4.1, we generated Steam Generator Tube Rupture scenarios and subsequently tasked the model with producing a comprehensive HRA analysis, which was then independently reviewed by two IDHEAS-ECA method experts. The LLM demonstrated substantial domain knowledge, generating technically coherent scenarios with appropriate procedural details and system responses, and produced a structured analysis covering cognitive functions and performance influencing factors. However, expert review identified critical methodological gaps including conflation of concepts from different HRA methods, omission of formal task analysis steps required by NUREG-2256, and inadequate human failure events identification. While current LLMs show promise as auxiliary tools for scenario generation and preliminary analysis, they require significant enhancement before supporting safety-critical HRA applications. Future work should focus on method-specific training, integration with structured knowledge representations (e.g. knowledge graphs), and development of validation protocols to ensure appropriate application boundaries.
Keywords: Large Language Models, Human Reliability Assessment, Nuclear Power Operations, Knowledge Graphs, IDHEAS-ECA
DOI: 10.54941/ahfe1007027
Cite this paper
More from this volume
- Warnings and Multilingual Audiences
- EAT Da Vinci 3.0_Translating Cinematic Narrative into Media Art Installation
- From Manual to Automated: Enhancing Inclusivity in Foreign Language Education with Technology
- The effect of multi-sensory physical experiences in daily emotional self-tracking service for emotion self-awareness
- Parametric generation based graphic design and spatial expression research
- Gender Stereotypes in Video Gaming: Impacts of Anxiety Levels, Verbal Communication, and Performance
- Exploring Usability And User-experience Metrics With A Novel AR App In The MASTERLY Project
- Drawing Dialogues Between Generative AI and Children with Autism: A Qualitative Study on the Externalization of “Understanding”
- Human-Centered Design of Integrated Food Service Management Systems: Reducing Cognitive Load in Resource-Constrained Kitchen Operations
- The Design Futures Art-driven (DFA) Method: Structuring Art-Tech Collaboration for Sustainable Future of Food System
- Increasing importance of Instinct
- Bridging the Privacy Gap: Stakeholder Solutions to Support Transparent Data Management Practices in Digital Health Research


AHFE Open Access