| ||||||||||||||
|
|
|||||||||||||
¶


¶
*The Israel Center for Medical Simulation (M.S.R.,1);
Department of Anesthesiology and Intensive Care, Sheba Medical Center, Tel Hashomer, Israel;
Department of Anesthesiology, University Hospitals of Cleveland, Cleveland, Ohio;
The National Institution for Testing and Evaluation, Jerusalem, Israel; ||Department of Anesthesiology, Edith Wolfson Medical Center, Holon, Israel; and ¶Tel Aviv University Sackler School of Medicine, Tel Aviv, Israel
Address correspondence and reprint requests to Haim Berkenstadt, MD, Department of Anesthesiology and Intensive Care, The Israel Center for Medical Simulation, Sheba Medical Center, Tel Hashomer, Israel. Address e-mail to berken{at}netvision.net.il.
| Abstract |
|---|
|
|
|---|
statistic). Values in the original study were 0.720.76 for the Long and 0.710.75 for the Short Form. The reliability did not change when a revised Israeli version of the scoring was used. Interrater reliability measured by Pearson correlation was 0.91 for the Long and 0.96 for the Short Form (P < 0.01). The high scores for plausibility given to the scenarios and the similar reliability of the original assessment tool support the feasibility of using simulation-based evaluation tools, developed in the US, in Israel. The higher scores achieved by Israeli residents may be related to the fact that most Israeli residents are immigrants with previous training in anesthesia. | Introduction |
|---|
|
|
|---|
Another possible application for full-scale simulation is in the evaluation and accreditation of anesthesia trainees and graduates. Full-scale simulation has been used to assess medical student performance in anesthesia-related scenarios (4), to detect gaps in medical student knowledge in anesthesia (5,6), to observe and quantify technical performance of novice anesthesiologists (7), and in the process of evaluating physicians with lapsed medical skills (8). Other investigators have conducted studies aimed at development of valid, reliable, and realistic simulation-based evaluation methods (913). One such multi-institutional study was performed in 10 centers and involved 99 anesthesia residents (14). Another aspect of evaluation is the development of a behavioral rating system (15) by researchers interested not only in commonly assessed technical skills but also in behavioral performance assessment.
Despite the efforts involved in the development of valid and reliable simulation-based evaluation tools, a recent review concluded that development in this area has not yet progressed enough to justify its use in formal, summative evaluation of competence in anesthesia (16). Moreover, a recent international survey found that only 7%14% of simulation centers are using these methods for evaluation of competence. The suggested reasons for this apparent under-utilization include the lack of research in this area, and the lack of standardized, valid, and reliable tests (17). An accompanying editorial to that article, entitled "Anesthesiology Simulators: Networking Is the Key," expressed the belief that communication and collaboration among centers involved in simulator programs is of paramount importance to the future of this technology (18).
The combined effort of the anesthesia simulation community is needed to overcome the obstacles of using simulation for testing and evaluation. To lay the groundwork for collaboration among >150 anesthesia simulation centers worldwide, it would be helpful, given the differences in language, education, anesthesia practice, and health care delivery systems, to delineate the results of simulation-based evaluation in different countries.
We present the adaptation of simulation-based evaluation tools, developed in the United States (US) and published by Schwid et al. (14), in academic Israeli anesthesia departments, highlighting issues relevant to the possible sharing of scenarios and evaluation tools in different countries. The hypothesis of this study was that simulation-based evaluation tools used and published elsewhere are applicable to the Israeli setting.
| Methods |
|---|
|
|
|---|
Scenarios
Four simulation scenarios and grading forms used in this study were developed and used in a prior study by Schwid et al. (14). The human patient simulator from METI (Sarasota, FL), located in a fully equipped simulated OR, was used as the simulation platform in the present study.
Scenario 1: Esophageal Intubation
The anesthesia residents allowed a paramedic to intubate a patients trachea after induction of general anesthesia. According to a preconfigured standard script, the paramedic performed the laryngoscopy, reported visualization of the vocal cords, and then proceeded to intubate the esophagus while affirming that the endotracheal tube was placed correctly. Then, physiologic signs of esophageal intubation, including lack of breath sounds upon auscultation, increased airway pressure, absent exhaled carbon dioxide on the capnograph, and eventually decreased arterial oxygen saturation, were presented by the anesthesia simulator. The grading criteria for this scenario included the correct diagnosis of the problem as reported by the resident and the time taken to reestablish ventilation.
Scenario 2: Anaphylactic Reaction
A few minutes after tracheal intubation, the surgeon requested the administration of an antibiotic. At the same time, the anesthesia resident administered a muscle relaxant for surgical relaxation. An anaphylactic reaction was triggered, with an increase in heart rate to120 bpm and a decrease in systolic blood pressure to 5060 mm Hg, which was refractory to treatment with ephedrine and phenylephrine. The participant was informed that a rash was present if he or she inquired. The participant was given 15 min to diagnose and treat the anaphylactic reaction and grading was focused mainly on making the correct diagnosis, and appropriate and timely administration of IV fluids and epinephrine.
Scenario 3: Chronic Obstructive Pulmonary Disease (COPD) Exacerbation
After the previous scenario, the simulator was reset, and the participant induced anesthesia for a second patient with preexisting COPD. Shortly after intubation, bronchospasm was triggered with high peak airway pressure, decreased tidal volume, carbon dioxide retention, and decreased arterial oxygen saturation. Grading criteria were: consideration of differential diagnosis for bronchospasm and difficult ventilation, appropriate administration of bronchodilators and of an increased concentration of inhaled anesthetic. The bronchospasm lasted 15 min or until the resident administered bronchodilators twice.
Scenario 4: Myocardial Ischemia
A few minutes after resolution of the bronchospasm, ST depression on electrocardiogram, tachycardia, and hypotension developed. Grading criteria for the residents performance were: IV administration of vasopressors and fluids to increase arterial blood pressure, decreasing the inhaled anesthetic and administration of narcotics, use of ß-blockers to decrease heart rate, and appropriate titration of nitroglycerin. The participant was graded for selection of therapeutic drugs, doses administered, and timing of administration.
Performance Assessment
The simulation training sessions were videotaped using digital video recordings. Three cameras (PELCO, US), one of them a PTZ (pan-tilt zoom) camera, connected to a digital recording system (DARIM, Korea), were used. A four-quadrant screen that included two separate views of the participants and the mannequin and one screen demonstrating the patients vital signs monitoring system were used. Performance was assessed by two senior anesthesiologists who reviewed the videotapes separately and independently, using two scoring systems described by Schwid et al. (14). The first "Long Form" grading system had 108 assessment points, with many points determined by the participants verbalization of observations. The second "Short Form" grading system had 40 points, and was based mainly on therapeutic actions that would directly benefit the patient with no points for observations or differential diagnosis (both forms are presented in Appendix 1). It should be noted that, with the Short Form, it was possible to earn multiple points for a single action depending on the timing of action. The reviewers were also asked to report on common mistakes during management of the simulated critical incidents.
Subjective Assessment by Participants
After the simulation session, the participants globally rated the realism of the scenarios on a graded scale (4 = very realistic, 3 = somewhat realistic, 2 = somewhat unrealistic, 1 = very unrealistic).
Evaluation of the Grading System by Senior Anesthesiologists
To assess the relevance of the checklists developed for the US multi-institutional study to local medical practice, 15 senior anesthesiologists from 2 different hospitals in Israel were asked to mark each of the checklists items used for evaluation on a 15 scale (1 = irrelevant at all; 5 = very relevant). The reviewers were encouraged to add items to the checklist if needed. After the first session, the mean results were presented to the 15 senior anesthesiologists for a second opinion. In this second review, checklist items that achieved a score of 1 or 2 were excluded, and new items achieving a score of 4 or 5 were included in a revised version of the grading system used for a second reliability assessment of the simulator scenarios.
Statistical Analysis
Interrater reliability was measured using Pearson correlation for the two evaluators and the average score for the two raters was used for the remainder of the statistics. Reliability of the simulator evaluation was assessed by the Cronbach
statistic for internal consistency.
| Results |
|---|
|
|
|---|
Subjects scored from 37 to 95 (70 ± 12) of 108 possible points with the Long Form (data presented as percentage of the maximal score in Table 1). Short Form scores ranged from 18 to 35 (28.2 ± 4.5) of 40 possible points (data presented as percentage of the maximal score in Table 1).
|
The Israeli senior anesthesiologists recommended deleting four checklist items from the assessment forms (using ketamine or lidocaine in the treatment of bronchospasm, points for establishing ventilation >5 min after esophageal intubation and points for nitroglycerine administration >8 min after the beginning of myocardial ischemia). Two new items (changes in mechanical ventilation and sending arterial blood gases for analysis in the management of bronchospasm) were recommended.
Reliability of the original simulator assessment is demonstrated by 0.66 internal consistencies for the Long Form and 0.75 for the Short Form as measured by the Cronbach
statistic. The reliability after adaptation of the assessment tools according to the recommendations of the Israeli experts was 0.67 and 0.76, respectively. Interrater reliability measured by Pearson correlation was 0.91 for the Long Form and 0.96 for the Short Form (P < 0.01).
Scenarios
Scenario 1: Esophageal Intubation.
All 31 participants diagnosed and treated the esophageal intubation, 29 of them reestablished ventilation in <5 min and the other 2 within 3 more minutes. Only 12 participants performed laryngoscopy for diagnosis before taking the esophageal tube out. Only six participants left the esophageal tube in place until another tube was placed into the trachea, or pulled the tube while cricoid pressure was applied. Only five participants placed an oro/nasogastric tube to evacuate the stomach.
Scenario 2: Anaphylactic Reaction.
Five participants (16%) did not make the correct diagnosis. Four participants maintained that the tachycardia and hypotension were induced by preoperative fluid deficit and one participant misinterpreted sinus tachycardia as supraventricular tachycardia and attempted to apply electrical cardioversion (data on some common mistakes are presented in Table 2).
|
In this scenario, treatment of hypotension was essential, regardless whether or not the diagnosis of anaphylaxis was made. All participants but one increased fluid administration. Most of them (71%) used a pressurized infusion device and started a second IV line (64%). In addition to appropriate administration of IV fluids during anaphylaxis-induced hypotension, 90% of participants decreased the inhaled anesthetic and 84% administered epinephrine. One participant administered a 1-mg IV epinephrine bolus, a potentially arrhythmogenic dose.
Scenario 3: COPD Exacerbation.
All participants diagnosed bronchospasm correctly, but not all of them presented a comprehensive differential diagnosis. During patient assessment, 16 (52%) of the participants did not try manual, bag ventilation and 22 (71%) did not pass a suction catheter or perform fiberoptic bronchoscopy through the endotracheal tube. All participants administered a bronchodilator and most of them (90%) increased the concentration of the inhaled anesthetic to promote bronchodilation (in the multi-institutional study, 92% administered a bronchodilator and 52% increased the concentration of the inhaled anesthetic). Only 10 participants (33%) gave a bolus of a neuromuscular blocking drug, and no participant administered ketamine or lidocaine. Changes in mechanical ventilation setting were performed by 26 participants (84%), and 6 participants (19%) asked for arterial blood gases.
Scenario 4: Myocardial Ischemia.
In the assessment of the patient during this scenario, none of the 31 participants examined the skin. Errors in the treatment of myocardial ischemia included omission of analgesia (52%), omission of administration of fluids (48%), omission of vasopressors (16%), omission of a ß1 blocker to slow the heart rate (23%), and omission of nitroglycerin (13%) (Table 3). The initial doses of nitroglycerin as well as the titration to a target dose were correct in most of the cases.
|
| Discussion |
|---|
|
|
|---|
statistic. These findings are similar to the values found in the original US population with 0.720.76 internal consistencies for the Long Form and 0.710.75 for the Short Form. The interrater reliability of both the Israeli and the US raters was high, even though the Israeli raters knew the subjects and the US raters did not, because they were evaluated by raters at a different institution. This high interrater reliability might have been related to the fact that evaluation consists of a Yes/No predefined checklist. Although data on mistakes performed during training in the present study are presented in comparison to the data from the previous US study, our goal was not to compare the performance of Israeli and US residents. Such a comparison is difficult because postgraduate training year (CB, CA 13), although representing a well defined progression of knowledge and skill acquisition in the US, is not easily equated to the training of Israeli residents. Most Israeli residents are older and more experienced than their US counterparts because many were previously trained and certified as medical doctors in the former Soviet Union, and some were in anesthesia practice for several years before moving to Israel. Because of significant differences between the practice of medicine in Israeli and Eastern European systems, they are required to repeat their training. The differences in the medical background and experience of participants may help explain the higher scores of Israeli junior residents in the present study in comparison to the US residents in both Long and Short Form assessment scores. This may also account for the less frequent incidence of mistakes among Israeli residents.
Some items in the scenarios emphasize the differences in anesthesia practice between the US departments involved in the multicenter study and the departments participating in the Israeli study. Although no difference between the two studies in the response to esophageal intubation was found, only six Israeli participants left the esophageal tube in place until another endotracheal tube was correctly placed into the trachea, or removed the tube while cricoid pressure was applied, to prevent aspiration.
During the treatment of COPD exacerbation, none of the Israeli participants gave ketamine or lidocaine, and most participants changed the variables of mechanical ventilationan action not requested at all in the original US assessment forms. The disagreements in treatment protocols were highlighted by the evaluation of the grading system by senior Israeli anesthesiologists. Some of the 15 evaluators maintained that airway protection after esophageal intubation is not indicated and most supported the exclusion of lidocaine or ketamine treatment from the checklists.
Other common mistakes performed by the Israeli residents were not accepted by the Israeli senior anesthesiologists as standard of practice. The mistakes of not trying manual bag ventilation and not passing a suction catheter or fiberoptic bronchoscopy through the endotracheal tube in the assessment of COPD exacerbation might represent systemic problems in the training of anesthesiology residents and not a true difference in anesthesia practice.
Based on the subjective opinion of the participants and agreement among experts on performance assessment checklist items, we were able to demonstrate that simulation-based scenarios can be shared between simulation-based anesthesia training centers in two different countries. In our study, the reliability of the assessment tools was slightly lower and adaptation using local experts opinions failed to improve it. Because of differences in training systems and the lack of routine scored trainee evaluation, we were unable to assess construct validity of the scenarios or to compare the scores achieved in the simulation-based evaluation with departmental evaluation tools. In addition, evaluation outcome differences between the two populations of residents cannot be directly compared because of the confounding differences in practice patterns and level of resident training and experience.
Further prospective evaluation of scenarios should be part of a joint effort of the medical simulation community. Such evaluation should emphasize not only the evaluation of technical skills but also communication or leadership skills (19).
| Appendix 1 |
|---|
|
|
|---|
Case 1: Esophageal Intubation Followed by Anaphylaxis
Paramedic student performs esophageal intubation.
Elapsed time to reestablish ventilation: <2 min, 8 pts; 25 min, 5 pts; 58 min, 3 pts; >8 min, 0 ptsaccording to the Israeli experts, 0 pts were given if ventilation was reestablished after 58 min.
Airway protection:
A few minutes after esophageal intubation is corrected, surgeon asks for antibiotic and complains about relaxation. Anaphylaxis without bronchospasm is triggered, heart rate (HR) increases to 120, and arterial blood pressure (BP) decreases to 5060 systolic.
Elapsed time to turn off anesthetic: <5 min, 3 pts; 58 min, 2 pts; 812 min, 0 pts; >12 min or not done, 2 pts.
Case 2: Bronchospasm Followed by Myocardial Ischemia
A few minutes after bronchospasm is corrected, the patient develops ST segment depression, frequent premature ventricular contractions (PVCs), hypotension, and tachycardia.
Elapsed time to administer pressor: <8 min, 5 pts; 812 min, 3 pts; >12 min, 0 pts; no pressor, 4 pts.
Elapsed time to administer nitroglycerin (NTG): <8 min, 5 pts; 812 min, 3 pts; >12 min, 0 pts; NTG not used, 4 ptsaccording to the Israeli experts, 0 pts were given if NTG was administered after 812 min.
Start at 0.250.5 µg · kg1 · min1 (+2 pts); titrate up to 12 µg · kg1 · min1 (+2 pts)
Short Form Scoring System
Case 1: Esophageal Intubation Followed by Anaphylaxis
Anaphylaxis:
Case 2: Bronchospasm Followed by Myocardial Ischemia
Time ST segment changes begin:
| Footnotes |
|---|
Accepted for publication April 15, 2005.
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|