Exploring use of transformer based models on incident reports in aviation

Samuel Kierszbaum; Laurent Lapasset; Thierry Klein

doi:10.24348/coria.2021.court_20

Exploring use of transformer based models on incident reports in aviation

Samuel Kierszbaum

Laurent Lapasset

Thierry Klein

DOI: 10.24348/coria.2021.court_20

Résumé

Recently, transformer-based models have beaten humans in Natural Language Understanding (NLU) tasks such as text classification, and have been used in specialized fields such as healthcare. In this context, our general aim is to explore how such models could help support analysts working in safety in aviation, in particular when they are used on incident reports. In this article, we work with the Aviation Safety Reporting System (ASRS) data set. It is made up of incident reports in English, as well as supporting metadata. Such reports are characterized by the heavy use of specialized language, abbreviations, and domain-specific vocabulary, as opposed to general day-to-day English. We explore the idea that analyst work can be re-framed as a set of NLU tasks. We then propose an experimental procedure to try and use transformer-based models on one of these tasks.

Exploring use of transformer based models on incident reports in aviation

Exploring use of transformer based models on incident reports in aviation

Sponsors de CORIA 2021