| ||||||||||||||
|
|
|||||||||||||

*Department of Anesthesiology, University of Arizona Health Sciences Center, Tucson, Arizona; and
Department of Psychology, Harvard University and the Massachusetts Institute of Technology, Boston, Massachusetts
Address correspondence and reprint requests to Robert Loeb, MD, PO Box 245114, Tucson, AZ 85724-5114. Address e-mail to RLoeb{at}U.Arizona.edu
| Abstract |
|---|
|
|
|---|
IMPLICATIONS: We developed and tested an auditory display of multivariable clinical data. With little training, clinicians successfully used the display to detect and diagnose simulated critical events. This suggests that a multivariable auditory display could enhance intraoperative monitoring.
| Introduction |
|---|
|
|
|---|
Auditory displays may overcome this inherent limitation of visual displays. The auditory system is omnidirectional. Even when otherwise occupied, humans continue to hear, and analyze, auditory stimuli at a preattentive level (5). Thus, an anesthesiologist could continuously monitor information coming from an auditory display, even while performing other duties.
The variable-pitch pulse oximeter tone is an auditory display of arterial hemoglobin saturation and heart rate that has been widely accepted in the practice of anesthesia (6). Its efficacy has been demonstrated; anesthesiologists react faster to changes in hemoglobin saturation when using a variable-pitch pulse oximeter than when using a fixed-pitch oximeter (7).
Other cardiovascular and respiratory variables deserve to be monitored as closely as are saturation and heart rate. Therefore, we used a patented technique (8) to develop an auditory display of six important physiologic variables. The purpose of this investigation was to evaluate whether anesthesiologists could use this display to detect and diagnose critical events.
| Methods |
|---|
|
|
|---|
In each study session, a single subject was trained to use the three display types and to recognize six event types (Table 1). After training, the subject was tested during 21 trials, each lasting a maximum of 2 min. Subjects monitored six variables: heart rate, blood pressure, hemoglobin saturation, respiratory rate, tidal volume, and end-tidal CO2. At the beginning of every trial, each variable was set to a baseline value. These values were held constant for a random period of 15 to 80 s until an event began, at which time all affected variables transitioned synchronously from baseline to abnormal over 30 s. An event occurred during each trial. The subjects task was to detect when the event started and then to identify the type of event.
|
Displays
The visual display replicated a conventional physiologic monitor, with five waveforms and six numeric values (Fig. 1). Subjects watched it on a 17-in. monitor (1024 x 768 resolution) and input their responses using a mouse and on-screen controls. They heard the auditory display through stereo speakers placed at either side of the computer monitor.
|
|
In the second stage of training, subjects learned to recognize the clinical events. First, they were asked to memorize a table showing the manifestations of each event (i.e., the bold-typeface items in Table 1). Then, they took two self-paced computerized tests to ensure that they could identify each event from its manifestations. Subjects received feedback on each response and were tested until they correctly identified 12 consecutive events.
During the third stage of training, subjects practiced the whole-task during 21 training scenarios. Each of seven event types (six "trained" and one "unfamiliar") was presented in random order, once each using each type of display. Subjects monitored vital signs and pressed an on-screen button when they first noted a change in any variable; this indicated their detection of an event. They then pressed a button, labeled with the name of the event, as soon as they deciphered the event type; this indicated their identification of the event. Subjects were instructed that the premature detection of an event or the misidentification of an event would be scored as an error; and that their first priority was to minimize error rate and their second priority was to minimize response time. To simplify event detection during training, variables were displayed without superimposed random noise factors (see next section). At the end of each trial, the subject received feedback regarding the true identity of the event.
Testing
In the testing phase of the study, subjects performed the experimental task during 21 testing scenarios. The experimental task, instructions to subjects, and data collection techniques were identical to those of the whole-task training exercise, with two exceptions. First, subjects did not receive feedback about the correctness of their responses. Second, a random noise factor (i.e., heart rate ± 3 bpm; blood pressure ± 10 mm Hg; hemoglobin saturation ± 0.5%; respiratory rate ± 1 breath/min; tidal volume ± 30 mL, end-tidal CO2 ± 1 mm Hg) was added to each variable on every update to prevent numerical fields on the visual display from remaining static while in their baseline state. The latter modification made it impossible for subjects to detect the onset of an event by simply detecting any change in a numerical field.
Statistical Analysis
Detection and identification scoring systems are summarized in Table 3. Detection and identification accuracy rates were calculated for each subject and display type. The effects of display type on detection accuracy rate and identification accuracy rate were analyzed by using one-factor within-subjects analysis of variance. Where appropriate, post hoc comparisons were made by using the Scheffé F test.
|
| Results |
|---|
|
|
|---|
Training
Display Training.
In the display training exercise, it took subjects between 12 and 50 attempts to record 12 consecutive correct responses. Subjects had no trouble identifying a change in a single variable when vital signs were presented visually (141/143 changes correctly identified when displayed visually, and 142/144 changes correctly identified when displayed visually and audibly). However, they did have difficulty identifying the changing variable when vital signs were only presented audibly (85/131 changes correctly identified when displayed audibly). Variables within the breath signal were more difficult to correctly identify than were those within the heart signal (52/63 heart sound changes correctly identified, versus 33/68 breath sound changes correctly identified). Subjects often confused blood pressure changes with saturation changes. They also often confused CO2 concentration, respiratory rate, and tidal volume changes with one another. However, they rarely confused cardiac changes with respiratory changes.
Whole-Task Training.
Subjects performances tended to improve over the course of the 21 whole-task training trials (Fig. 2). All performance measures tended to stabilize by the end of the training session, suggesting that the subjects reached a plateau in the learning curve.
|
Testing
Subjects detected every critical event within its 2-min scenario. They prematurely detected the event in approximately 20% of the scenarios; this was not affected by display type. When the event was correctly detected, it was detected 20% more quickly with the combined visual and auditory display than with either display alone (Table 4).
|
Subjects performances tended to improve over the course of the 21 testing trials (Fig. 2). Detection and identification accuracy were better in the last seven trials than in the first seven trials with all display types. Detection latency was shorter in the last seven trials than in the first seven trials with the auditory and combined displays.
The bronchospasm event was detected more quickly than all other event types (8.6 ± 5.8 s vs 12.6 ± 6.6 s, P = 0.002).
| Discussion |
|---|
|
|
|---|
The auditory display was designed to be easy to learn. Where possible, physiologic variables were mapped to sound qualities in a manner that we predicted would be compatible with clinicians natural expectations. For instance, oxygen saturation was linked to the pitch of the heart signal because clinicians have already learned that mapping by using pulse oximeters. The timbre (ringing quality versus dullness) of the heart signal was chosen to represent blood pressure because the timbre of Korotkoff sounds decreases as a blood pressure cuff deflates. We designed the breath signals to become closer together and shorter in duration when the respiratory rate increased, to mimic how natural breath sounds change when breathing becomes faster (at a fixed inspiratory-to-expiratory ratio). Finally, we made the breath signals longer in duration and more raucous sounding as the tidal volume increased to emulate the natural sound of taking larger breaths. Only one variable, end-tidal CO2, did not have an obvious natural mapping to any sound. But, inasmuch as end-tidal CO2 is somewhat analogous to saturation, we linked it to the pitch of the respiratory signal.
Even though the auditory display was designed to be intuitive, after the initial training period, some subjects had difficulty interpreting what they heard. Overall, they misidentified the variable that was changing in 35% of the presentations, and the error rate was more frequent with the respiratory signal than it was with the cardiovascular signal. This suggests that the design of the display can be improved in a few ways. First, changes in sound property could be made more perceivable. For instance, several subjects commented that timbre changes were difficult to hear. Second, some mappings could be modified to make them more compatible with clinicians expectations. For instance, four subjects suggested that the pitch of the respiratory signal should increase as end-tidal CO2 concentration decreases, because they associate lower pitch with central nervous system depression and decreased respiratory drive. Future studies will address these ergonomic issues.
Certain differences between the test conditions used in this study and a real clinical situation limit the clinical applicability of the results. In this study, subjects had no concurrent responsibilities to distract them from the primary task whereas, in the clinical setting, anesthesiologists perform many concurrent technical and cognitive tasks. On average, anesthesiologists look at monitor displays <30% of the time during an operation. They look at monitors even less during high workload periods, such as induction and intubation (2,4). Because they do not constantly watch visual displays, anesthesiologists react more quickly to auditory stimuli than they do to equivalent visual stimuli in the OR. For instance, Morris and Montano (11) found that anesthesiologists detected 90% of auditory alarms in 3 seconds, whereas it took them 40 seconds to detect the same proportion of visual alarms. Thus, we anticipate that when loaded with other tasks, anesthesiologists will react more quickly to critical events with an auditory or combined presentation than with a visual presentation. However, this prediction was not tested in the current study because our subjects were able to look at the visual display without interruption.
This study was conducted in a quiet setting that was not representative of an OR. ORs are noisy. Their average sound level has been compared with that of a freeway (12). To be useful, an auditory display must be clearly audible above the background noise. This can be achieved by increasing the volume. But, it is preferable to select sound frequencies and patterns that are not masked by the sounds of an OR (13,14). Because the auditory displays audibility was not evaluated in this study, it cannot be predicted how well it would be perceived in a noisy OR.
In this study, subjects were asked to diagnose events based solely on information coming from the auditory display. However, this is not how anesthesiologists typically use auditory displays, such as alarms, in the OR. Anesthesiologists cannot reliably identify OR alarms based on their sound (15). In responding to an auditory alarm, anesthesiologists often perform a visual search of the environment and look at visual information displays to identify and verify the alarm condition. The predominant function of alarms is to capture attention. The same can be said for the pulse oximeters auditory display. Its primary function is to alert the clinician to a change in heart rate or saturation. Thus, it was somewhat of a contrived task to have subjects diagnose events based solely on information coming from the auditory display. Although it is comforting to know that subjects usually made a correct diagnosis based solely on auditory information, in a natural setting, they would more likely refer to visual displays before making a diagnosis and performing interventions.
In summary, this study demonstrates that: 1) multivariable clinical data can be encoded in an auditory display, 2) with little training, clinicians can decipher and use an auditory display to detect and diagnose simulated critical events, and 3) detection and diagnosis of critical events is accelerated when an auditory display is added to a visual display of the same data. These results suggest that a multivariable auditory display could enhance intraoperative monitoring. However, such displays need to be tested under conditions more representative of the real clinical environment before they can be accepted for routine clinical use.
| Acknowledgments |
|---|
The authors gratefully acknowledge the technical assistance of Liza Kantor.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
P. M. Sanderson, M. O. Watson, W. J. Russell, S. Jenkins, D. Liu, N. Green, K. Llewelyn, P. Cole, V. Shek, and S. S. Krupenia Advanced Auditory Displays and Head-Mounted Displays: Advantages and Disadvantages for Monitoring by the Distracted Anesthesiologist Anesth. Analg., June 1, 2008; 106(6): 1787 - 1797. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. M. Sanderson, M. O. Watson, and W. J. Russell Advanced Patient Monitoring Displays: Tools for Continuous Informing Anesth. Analg., July 1, 2005; 101(1): 161 - 168. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|