Graphical access to medical expert systems: V. Integration with continuous-speech recognition.

Overview

This paper describes three prototypes of computer-based clinical record-keeping tools that use a combination of window-based graphics and continuous speech in their user interfaces. Although many of today's commercial speech-recognition products achieve high rates of accuracy for large grammars (vocabularies of words or collections of sentences and phrases), they can only "listen for" (and therefore recognize) a limited number of words or phrases at a time. When a speech application requires a grammar whose size exceeds a speech-recognition product's limits, the application designer must partition the large grammar into several smaller ones and develop control mechanisms that permit users to select the grammar that contains the words or phrases they wish to utter. Furthermore, the user interfaces they design must provide feedback mechanisms that show users the scope of the selected grammars. The three prototypes described were designed to explore the use of window-based graphics as control and feedback mechanisms for continuous-speech recognition in medical applications. Our experiments indicate that window-based graphics can be effectively used to provide control and feedback for certain classes of speech applications, but they suggest that the techniques we describe will not suffice for applications whose grammars are very complex.