A Sequence-to-sequence Based Error Correction Model for Medical Automatic Speech Recognition

A Sequence-to-sequence Based Error Correction Model for Medical Automatic Speech Recognition Conference

Jiang, Y, Poellabauer, C. (2021). A Sequence-to-sequence Based Error Correction Model for Medical Automatic Speech Recognition . 3029-3035. 10.1109/BIBM52615.2021.9669554

cited authors

Jiang, Y; Poellabauer, C

authors

Poellabauer, Christian

abstract

The use of Automatic Speech Recognition (ASR) systems in medical applications is receiving rapidly growing interest due to their ability to reduce distractions and the cognitive workload of physicians, particularly during critical medical procedures. However, state-of-the-art ASR systems still experience recognition errors, especially in noisy environments where speakers rely on medical-domain terminologies. This paper proposes a customized language model and a neural network based sequence-to-sequence (seq2seq) error correction module for medical ASR systems to provide domain adaptation and more reliable transcription results. Specifically, the error correction module learns the error patterns in noisy scenarios and is able to correct such errors during inference. Our experiments show that the proposed method can reduce the sentence error rate (SER) by up to 81% for formatted input and up to 31% SER for unformatted input in noisy environments.

publication date

January 1, 2021

Digital Object Identifier (DOI)

https://doi.org/10.1109/bibm52615.2021.9669554

start page

3029

end page

3035

FIU Discovery

A Sequence-to-sequence Based Error Correction Model for Medical Automatic Speech Recognition Conference

Overview

cited authors

authors

abstract

publication date

Identifiers

Digital Object Identifier (DOI)

Additional Document Info

start page

end page