TY - GEN
T1 - Natural Language Processing-Enhanced Nuclear Industry Operating Experience Data Analysis
T2 - 18th International Probabilistic Safety Assessment and Analysis, PSA 2023
AU - Zhang, Sai
AU - Xu, Fei
AU - Ma, Zhegang
AU - Xian, Min
N1 - Funding Information:
The authors would like to thank Michael Calley, Han Bao, and Katie Stokes for their reviews and edits. This paper was prepared as an account of work sponsored by an agency of the U.S. Government. Neither the U.S. Government nor any agency thereof, nor any of their employees, makes any warranty, expressed or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness, of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. References herein to any specific commercial product, process, or service by trade name, trade mark, manufacturer, or otherwise, do not necessarily constitute or imply its endorsement, recommendation, or favoring by the U.S. Government or any agency thereof. The views expressed herein do not necessarily state or reflect those of the U.S. Government or any agency thereof.
Publisher Copyright:
© 2023 Proceedings of 18th International Probabilistic Safety Assessment and Analysis, PSA 2023. All Rights Reserved.
PY - 2023
Y1 - 2023
N2 - Industry-wide operating experience is a critical source of raw data for reliability and risk model parameter estimations for nuclear power plants. A large portion of operating experience data are failure events stored as reports that contain unstructured data, such as narratives. In current practice, a failure report is usually reviewed and manually coded by analysts. The coding is based on extracting several event characteristics such as system name, component type, sub-part type, failure mode, and failure cause. Event narratives are mostly used to help understand events and extract their characteristics. In this line of research, we aim to maximize the usage of event narratives by leveraging natural language processing (NLP) methods to automatically convert an event narrative to a causal graph. This research has promise to improve physical understanding of failure initiation and propagation and to facilitate use of non-failure data (e.g., near-misses and degradations) to complement the limited data pool of failures. In our previous work, we developed an NLP tool and applied it to analyze a number of licensee event reports submitted by U.S. nuclear power plants to the Nuclear Regulatory Commission. In this paper, we will report our recent research progress in aggregating the results of multiple reports, developing network model(s), and drawing statistical insights.
AB - Industry-wide operating experience is a critical source of raw data for reliability and risk model parameter estimations for nuclear power plants. A large portion of operating experience data are failure events stored as reports that contain unstructured data, such as narratives. In current practice, a failure report is usually reviewed and manually coded by analysts. The coding is based on extracting several event characteristics such as system name, component type, sub-part type, failure mode, and failure cause. Event narratives are mostly used to help understand events and extract their characteristics. In this line of research, we aim to maximize the usage of event narratives by leveraging natural language processing (NLP) methods to automatically convert an event narrative to a causal graph. This research has promise to improve physical understanding of failure initiation and propagation and to facilitate use of non-failure data (e.g., near-misses and degradations) to complement the limited data pool of failures. In our previous work, we developed an NLP tool and applied it to analyze a number of licensee event reports submitted by U.S. nuclear power plants to the Nuclear Regulatory Commission. In this paper, we will report our recent research progress in aggregating the results of multiple reports, developing network model(s), and drawing statistical insights.
KW - causal learning
KW - event narrative
KW - natural language processing
KW - Nuclear power plant
KW - operating experience data
KW - probabilistic risk assessment
UR - http://www.scopus.com/inward/record.url?scp=85184350077&partnerID=8YFLogxK
U2 - 10.13182/PSA23-41355
DO - 10.13182/PSA23-41355
M3 - Conference contribution
AN - SCOPUS:85184350077
T3 - Proceedings of 18th International Probabilistic Safety Assessment and Analysis, PSA 2023
SP - 230
EP - 239
BT - Proceedings of 18th International Probabilistic Safety Assessment and Analysis, PSA 2023
PB - American Nuclear Society
Y2 - 15 July 2023 through 20 July 2023
ER -