Technological background of a speech recognition system for the dictation of thyroid gland medical reports

Authors

  • András Kocsor
  • András Bánhalmi
  • Dénes Paczolay

Keywords:

continous speech recognizer, ASR, automatic speech recognition, dictation system, HMM, Hidden Markov Model, MSD, Morphosyntactical descriptor, grammar, accoustic model, N-gram

Abstract

With the considerable development of speech recognition technologies in several administration-requiring professions the demand for the so called speech-based documentation has grown. This is particularly true in the case of the documentation of medical reports therefore the acceleration of this procedure is of great importance for smaller languages whit special linguistic features few systems for dictating medical reports have been developed so far wich fact can be attributed to linguistic specialties and high development expenses. In Szeged we developed a core module capable of automatic recognition of the Hungarian language on wich several domain oriented system can be built- The core module contains the so called acoustic model, which is suitable for building of the model we used two significantly different approaches. One is the Hidden Markov Model well know in speech recognition, the other is the novel stochastic segmental approach developed in Szeged. For the developed of both models we used a large speech corpus with 500 speakers, and then the performance of the modules was tested on test databases. To accompany the core module we built languages module was tested on test databases. To accompany the core module we built a languages module (for Windows environment) suitable for the dictation of thyroid gland medical reports in order to justify the applicability of the developed methods. The module was built on 9231 written thyroid medical reports and over 2500 word forms. We present the structure of built language and acoustic models, the test results describing the efficiency of the models, furthermore we mention the different aspect of the application and technology of the software.

Published

2006-02-15

How to Cite

Technological background of a speech recognition system for the dictation of thyroid gland medical reports. (2006). ACTA AGRARIA KAPOSVARIENSIS, 10(1), 113-128. https://journal.uni-mate.hu/index.php/aak/article/view/1764