A Sequence-to-sequence Based Error Correction Model for Medical Automatic Speech Recognition Conference

Jiang, Y, Poellabauer, C. (2021). A Sequence-to-sequence Based Error Correction Model for Medical Automatic Speech Recognition . 3029-3035. 10.1109/BIBM52615.2021.9669554

cited authors

  • Jiang, Y; Poellabauer, C

abstract

  • The use of Automatic Speech Recognition (ASR) systems in medical applications is receiving rapidly growing interest due to their ability to reduce distractions and the cognitive workload of physicians, particularly during critical medical procedures. However, state-of-the-art ASR systems still experience recognition errors, especially in noisy environments where speakers rely on medical-domain terminologies. This paper proposes a customized language model and a neural network based sequence-to-sequence (seq2seq) error correction module for medical ASR systems to provide domain adaptation and more reliable transcription results. Specifically, the error correction module learns the error patterns in noisy scenarios and is able to correct such errors during inference. Our experiments show that the proposed method can reduce the sentence error rate (SER) by up to 81% for formatted input and up to 31% SER for unformatted input in noisy environments.

publication date

  • January 1, 2021

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 13

start page

  • 3029

end page

  • 3035