JOURNAL HOME CME HOME THIS MONTH PAST ISSUES ETOC COLLECTIONS
AUTHORS REVIEWERS EDITORIAL BOARD FEEDBACK RSS HELP
A&A International Anesthesia Research Society
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a colleague
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrowRequest Permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Shang, A. B.
Right arrow Articles by Weininger, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Shang, A. B.
Right arrow Articles by Weininger, S.


ARTICLE

Development of a Standardized Method for Motion Testing in Pulse Oximeters

Allan B. Shang, MD, MSE*{dagger}, Raymond T. Kozikowski, BSE{ddagger}, Andrew W. Winslow{ddagger}, and Sandy Weininger, PhD§

From the *Department of Anesthesiology, Duke University Medical Center; {dagger}The Fitzpatrick Institute for Photonics, Duke University; {ddagger}Department of Electrical and Computer Engineering, Duke University, Durham, North Carolina; and §The Center for Devices and Radiological Health, United States Food and Drug Administration, Silver Spring, MO.

Address correspondence and reprint requests to Allan Shang, MD, MSE, Box 3094 DUMC, Durham, NC 27710. Address e-mail to shang004{at}mc.duke.edu.

Abstract

BACKGROUND: Pulse oximeter performance in the presence of motion varies among devices and manufacturers because of variations in hardware, software, testing, and calibration. Compounding these differences is a lack of uniform characterization of motion, and the consequential effects of motion upon the wide range of normal and abnormal human physiology. Traditional motion testing attempts to standardize motion into a reproducible form by using a mechanical jig to produce passive motion of a known amplitude and frequency. This type of motion challenge fails to account for the physiologic changes induced by active movement.

METHODS: We postulate that a more appropriate method for testing the performance of pulse oximeters in the presence of motion is to create a feedback control loop between the device and the test subject, providing a reproducible, actively created, and controlled motion test suitable for standardized testing among manufacturers. It is hoped that relying on a signal as seen from the oximeter's perspective will enable the creation of a sensitive and reproducible test method capable of separating those oximeters that can reject motion artifact from those that cannot.

RESULTS: Preliminary results have concentrated on building the tools and clinical protocols needed to evaluate this method. Some basic observations are reported, but insufficient numbers of experienced subjects precludes rigorous conclusions.

CONCLUSION: We have set the stage for a feasibility demonstration using a novel form of testing. With sufficient subjects and proper statistical evaluation, a robust test method for assessing the performance of pulse oximeters in the presence of motion may be at hand.

In an ideal world, the performance of a medical device is fully characterized and understood before it becomes a standard of care. In pulse oximeters, the complex interaction between photons and tissue is not fully understood, leading to uncertainty in the true value of oxygen saturation. The accuracy specifications are stated as a range of acceptable values to account for some variation in human phenotype, and are at best an approximation, or "best fit" calibration. This situation is exacerbated by a historical lack of a standard method to calibrate and report performance. Despite these shortcomings, oximeters provide an invaluable vital sign in clinical monitoring, and therefore are considered a standard of care in many areas of medicine.

Assessing the uncertainty of pulse oximeters, independently characterizing their performance, has been a continuing quest of many researchers since modern oximeters first appeared on the market. The application of powerful microprocessors, modern signal processing theory, and improved electronics have enabled the current generation of oximeters to more reliably report saturation values. The new generation of pulse oximeters is able to obtain valid data despite noise and motion artifacts, and is capable of detecting ever smaller signals, thereby differentiating themselves from "first generation" oximeters (1) and intensifying the need for more powerful standardized performance assessment tools. An overview of pulse oximeter technology and testing is given by Jopling et al. (2). Issues that need to be investigated and resolved include the best method to assess accuracy, what effects contribute to calibration differences between brands, and actual performance under conditions of motion and perfusion metrics.

In this article we report the initial work to develop methods to assess motion performance claims on motion tolerance, state clearly the assumptions of the methods, and gather evidence to validate those assumptions. It is a collaboration among the United States Food and Drug Administration, academic anesthesiologists, and pulse oximeter manufacturers, with the goal of understanding and documenting the basic clinical performance of pulse oximeters.

BACKGROUND ON MOTION TESTING

The American Society for Testing and Materials (ASTM) pulse oximeter committee (composed of manufacturers, clinicians, and public representatives/regulators) wrote its first pulse oximeter standard, F1415, in 1987, which was adopted by the International Standards Organization (ISO) and the European Committee for Standardization. This draft had no requirements for motion artifact performance. In 1997, the ASTM committee was reconvened to update the standard, finally producing ISO 9919:2005/IEC 60601-2-54 in 2005. Even as evidence of the impact of motion was being published (3–5) the standard was still not able to establish definitions for motion types, a normative requirement for what constitutes motion tolerance, and a standardized method for testing motion resistance claims. The only requirement with respect to motion tolerance calls for disclosure of each manufacturer's test method. The consequence is that performance between brands may not be comparable. The ASTM pulse oximetry committee recognized this deficiency and launched an effort to develop the necessary test method.

As a prototype model of the motions seen in the clinical environment, and to construct a starting point to develop a test method, the Committee defined five types of motion to be used as exemplars: wave, scratch, clasp, tap, and squeeze. These are defined as follows.

  1. Flexing or waving of the wrist and forearm with the elbow resting on a countertop
  2. Scratching or rubbing motion of the fingertips against the countertop (the rest of the extremity should remain relatively motionless)
  3. Flexion-extension of the fingers ("clenching" of the hand) but without pressing against a surface (the elbow is stationary against a countertop and the arm is held perpendicular to the floor)
  4. Squeezing the "test" hand around a firm rubber handle, while compressing the tips of the fingers against the rubber handle with a force similar to that used to grip a handrail
  5. Tapping the fingers of the "test" hand against the countertop

These were not intended to be the definitive set of motions, as little research has been published to fully characterize the clinical environment, but instead were postulated to span the range of motions seen (6). Further work is needed to qualitatively characterize these motions so that they may be reproducibly communicated. It is recognized that low perfusion and desaturation can add to the motion challenge (7), but the main focus here is to identify the issues that influence testing using relatively simple room air tests.

METHODS

We propose a test method for motion tolerance that relies on controlling the signal that the pulse oximeter sees internally rather than imposing an external constraint. Although traditional motion testing was performed by externally generating the motion of an appendage, for example, by placing the hand in a mechanical jig (4,8,9) or adding volunteer reference motion to recorded signals (10), this new approach uses the raw plethysmogram, the actual data stream collected and analyzed by the oximeter, as an indicator of the intensity of the external motions. We hope that this method will produce a more sensitive and realistic assessment of an oximeter's ability to maintain accuracy in the presence of motion.

We intend to use a pulse oximeter probe as a plethysmographic measuring tool on one finger of the hand in motion to measure, control, and modulate motion intensity with devices under test (DUT) on the remaining two fingers. In this way, we hope to present an equivalent motion challenge to all three fingers on the test hand. The resulting saturation values recorded will be compared with reference saturations from a stationary hand. Measurements taken with the subject motionless are considered the rest signal, in that the only pulsations measured are those generated by arterial pressure wave forms. Measurements taken during periods of motion constitute signal plus noise because of the confounding effect of motion artifact overlaying the signal. Our goal is to find an expression for this signal that is proportional to the level of motion intensity. As the level of motion rises, we would like the test signal to rise accordingly and not be a function of the absolute amplitude of the plethysmogram, which is predominantly a "DC" or static signal (not time varying).

Several candidate signals were explored, including:

  1. The ratio of the alternating component (AC) to DC value computed using a Fourier transform over a short period, say 3–5 s
  2. The ratio of the short-term pulse amplitude, calculated as a local maximum – minimum (3–5 s), to the longer term average (30 s). This is representative of the "perfusion index" (PI) used by several manufacturers but only acts on the infrared (IR) signal
  3. The root mean square energy
  4. The smoothed plethysmogram itself

We intend to use this signal in a closed-loop feedback arrangement, taking advantage of the test subject's ability to modulate their own level of motion activity and cause the plethysmogram to adjust in proportion (Fig. 1). The subject adjusts their intensity of activity in response to the visual presentation of their plethysmogram (or the selected function of the plethysmogram) to the desired target level.


Figure 112
View larger version (24K):
[in this window]
[in a new window]

 
Figure 1. Subject sees the test signal presented on the monitor and the desired level. The subject adjusts their level of intensity (exertion levels) to keep it within the control limits.

 

Within this setup, we define A as the ratio of the test signal in a motion period to a resting period. This has the effect of normalizing the absolute amplitude of the signal across the population of test subjects. We then use A as an indicator of the intensity of the motion challenge. We hypothesize that there will be some motion conditions where changes in A are small (A close to 1 meaning the motion intensity is not distinguishable from the resting intensity) with the result that all oximeters can read through the motion with little error. Conversely, we hypothesize that there will be some intensities of motion where A is very large and no oximeters will be able to read accurately. In between, we hypothesize that there are levels of activity that we can "request," which will distinguish between the capable oximeters and those that are not motion tolerant (Figs. 2–4). The range of these levels of A define the motion performance of the oximeter for each type of motion. Error was not defined by the ASTM committee—it was left to the developers to find an acceptable level of error, but generally this meant that the observed accuracy of the oximeter on a motion hand was significantly different from some reference value, typically an oximeter on a stationary hand.


Figure 212
View larger version (21K):
[in this window]
[in a new window]

 
Figure 2. Subject is exerting sufficient activity to keep the test signal in the desired range.

 


Figure 312
View larger version (17K):
[in this window]
[in a new window]

 
Figure 3. Subject is not exerting sufficient activity to keep the test signal in the desired range. Subject needs to increase their level of activity.

 


Figure 412
View larger version (17K):
[in this window]
[in a new window]

 
Figure 4. Subject is exerting too much activity causing the test signal to exceed the desired range. Subject needs to decrease their level of activity.

 
Why do we say a different A for each type of motion? We did not want to make assumptions about the relative difficulty of one type of motion versus another. One aspect of this research is to determine which motions have independent characteristics, and thus find the minimum set of test motions necessary. We propose that characterizing the time-frequency behavior of the motion signals can give us evidence about the similarities and help reduce the number of test types or identify new test types not yet considered.

Upper and lower control limits (UCL, LCL) specify the upper and lower boundaries for the test signal being fed back to allow a tolerance band around the desired intensity. A 30% tolerance band was chosen. This produced an UCL defined as:

UCL = A x (1 + 0.3)KRavg

where KRavg is defined as the average amplitude (or whatever function we choose) of the resting IR plethysmogram measured immediately before the motion epoch. The LCL is similarly defined as:

LCL = A x (1 – 0.3)KRavg

Figure 5 shows an example of the test signal during test as an attempt is being made to keep it within the UCL and LCL, as indicated by the dotted red and white lines.


Figure 512
View larger version (25K):
[in this window]
[in a new window]

 
Figure 5. Example of trying to keep the feedback variable within the control limits (indicated by the red and white lines).

 

RESULTS

As our intent was to develop a complete test method, we started with an application (Fig. 14) that made many assumptions. It became evident after only a few subjects that we had no justification for the values we had chosen, nor could we find literature to support our position. Consultation with manufacturers produced many anecdotal recommendations, but no firm evidence upon which to develop a justifiable test method. We, therefore, began a journey to understand how our system behaved and how to set reasonable values for the parameters of our experiments. This phenomenon, ending with more questions than we started with after each new experimental protocol, repeated itself throughout our studies (and continues). For this reason, much of the work presented is shown empirically, with little group analysis performed, as new questions needed to be answered before the study could be completed. For example, after running our initial application on several subjects, we realized that the factor A was not easy to determine. This terminated the study of the motion testing until we could better understand how A was related to motion type or position within the test. It was observed that the order of motions, the durations of the periods of rest and motion, how long it takes a subject to recover back to the resting baseline, and how well a subject could keep the test signal within the control limits all needed further examination. We then decided to reexamine some basic assumptions, characterizing the electrical response of the instrumentation system and some basic physiological responses of the subject, so that we could move forward on a sound footing. The gain and position studies are reported to represent these findings.


Figure 1412
View larger version (53K):
[in this window]
[in a new window]

 
Figure 14. LabVIEW® screenshot of a data collection program designed to include the human user in the feedback loop. The motion sequence, duration of each period, and control limit parameters are configurable. Directions are given to the operator and subject. Program name: Motion Characterization 12-10-2003.vi.

 

Test Instrumentation and Data Collection
A method was required to measure the raw plethysmogram in real-time with sufficient resolution to implement our feedback approach. We were fortunate that two pulse oximeter manufacturers, GE/Ohmeda and Philips, make pulse oximeters available, an Ohmeda TruSat® and a Philips M3®, with a real-time raw plethysmographic output. With this equipment available, we have access to the raw IR plethysmogram (60 Hz for the Ohmeda TruSat and about 50 Hz for the Philips M3) and a computed PI at 1 Hz. It would be even more helpful to have access to both the red and IR plethysmograms, but the manufacturers are reticent to provide access to both wavelengths to prevent their calibration curves from being deduced.

The data collected were part of a Duke University IRB-approved study with all volunteers providing appropriate written consent. The Food and Drug Administration also follows a continuing review of the Duke University IRB process. The test uses a control oximeter on the middle finger of one hand to measure the raw plethysmogram and two other DUT on the index and ring fingers of the same hand. The thumb and fifth finger are not used. The saturations measured by the DUTs are compared with an oximeter on the rest hand. Failure is defined as being more than some amount different from the reference on the rest hand (10). This framework makes several important assumptions:

  1. Fingers on the same hand see an equivalent motion load
  2. Changes in the test variable are a function of the amount of motion and not a function of instrumentation

The first assumption can be tested after each trial, provided that we have appropriate tools. It is proposed to use the time-frequency characteristics of the signal. The second assumption can be validated with studies examining the response of the "system" to differing light-emitting diode (LED) excitation currents and to the position of the hand relative to the heart. We want to ensure that the selected motions impose a change in the test signal more than the changes because of relative position.

Testing and Validating Underlying Assumptions
The proposed method of analyzing the recorded signal from one "reference" finger as an indicator of motion intensity of the other fingers relies on the assumption that the plethysmogram is essentially equivalent for the middle three fingers on the same hand. More precisely, for these three fingers, we postulate the oximeters see a raw plethysmogram with comparable intensity and time-frequency characteristics. This relies on an assumption that the sampled tissue bed interacts linearly with light intensity, and that the feedback variable is independent of the level of the fingers with respect to the heart. The same techniques that we propose to use to explore the independence of the motion types can be used to draw conclusions about the similarity of the finger responses. This is the short-term frequency transform (STFT).

Before beginning to see if our test method would be feasible, we wanted to confirm the sensitivity of our analysis methods and ensure the signal detected was a result of a physiologic change and not an instrumentation gain change in the emitter or receiver circuitry. Confirmation of some basic physiologic responses to position was required to determine if these responses had a larger affect on our signal than the proposed motions. Two studies were performed: a gain study and a positional study.

Gain Studies: Response of Test Signal to Differing LED Intensities
With the help of custom software from Ohmeda, we used the TruSats in a mode where the LED emitter outputs could be controlled, thus allowing the transfer functions (output light/input light) to be measured. The subjects were resting with their hands at the level of their heart. The emitter outputs were set at auto-gain (where the oximeter chooses the levels), 8, 30, 60, and 120 mA, and the detected light measured. One study was performed using an opaque bag over only the emitters to determine normal levels of ambient light to distinguish the baseline signal from background electronic noise. The mean and standard deviation for each drive level were compared to see whether the differences in the intensity of the measured raw IR plethysmogram are proportional to the changes in emitter output.

Figure 6 shows an example of the IR plethysmogram and PI as a function of the variation of LED emitter gain as measured using the Ohmeda TruSat oximeter. Ideally, the raw plethsymogram curves should increase linearly, with the 30 mA state generating results 3.75x the 8 mA state. In turn, the 60 mA state should then generate results twice that of 30 mA, and so on. Instead, our experimental data demonstrated only a slight increase in measured light output from 8 to 30 mA. The data from 30 to 120 mA were linear within one significant digit. Additionally, as seen in Figure 6, there is also an unexpected difference in the data from two supposedly identical devices on adjacent fingers. Switching finger–probe-monitor combinations demonstrated that the differences were inherent in the devices but did not affect saturation values.


Figure 612
View larger version (64K):
[in this window]
[in a new window]

 
Figure 6. The upper graph show the resting raw infrared plethysmogram (pleth, sampled at approximately 50 Hz) and the lower graph shows the PI (sampled at 1 Hz), for pulse oximeters on two fingers, in this case index and middle, as a function of LED excitation state (auto, 8, 30, 60, 120 mA). The accompanying charts show the mean and standard deviations for each LED excitation state for each channel (60 s average).

 

The linear relationship observed in the plethysmographic amplitudes at the higher excitation currents was as expected but at the lower currents, particularly 8 to 30 mA, the plethysmographic response seemed to lose its linear relationship (Fig. 7). We wanted to know if this was a result of the signal getting lost in the noise floor or the effects of ambient light. An opaque covering was applied over the emitters to determine the background signal on the detectors. The covering was intermittently removed and replaced during this test, and the results are illustrated in Figure 8. The background signal with no LED illumination is effectively zero, excluding ambient light as the cause of the nonlinearity.


Figure 712
View larger version (28K):
[in this window]
[in a new window]

 
Figure 7. The value of the average plethysmogram versus LED excitation current for the subject in Figure 6 showing a roughly linear relationship for all but the lowest current setting.

 

Figure 812
View larger version (68K):
[in this window]
[in a new window]

 
Figure 8. This test explores the background signal by setting the TruSat to auto-gain and applying black tape to the emitter during time 1:00–2:00 and 3:00–4:00 min.

 

A STFT frequency analysis was performed of the data in Figure 6, with spectrograms displayed in Figure 9, to demonstrate that the signals seen by two adjacent fingers were equivalent. The curves look so similar when compared visually that no additional computation was used.


Figure 912
View larger version (31K):
[in this window]
[in a new window]

 
Figure 9. The spectrograms (short-term frequency transform, with a 5-s window) demonstrate that the energy spectra are similar.

 

There is a jump in the plethysmogram when the TruSat emitters were set to 8 mA from their initial auto-gain setting, thought to be because of the settling in the receiver circuitry. As the emitter gain is then increased, the spectrograms are observed to increase in amplitude, consistent with the increase in emitter drive current. At this juncture, we are unable to precisely determine why the 8 mA emitter setting causes this response, with the most likely theory being the possibility of an inherent nonlinearity in the response of the emitter diodes.

Positional Studies: Response of Test Signal to Varying Hand Position
Are the effects on our test signal of changing hand position larger than the effects we see with some of our exemplar motion types? Varying the position of the finger/probe with respect to the level of the heart provided some interesting data to address this issue. The protocol to investigate positional impact used the PI as calculated by the TruSat or M3 as a function of arm position (down, heart level, head level, up/raised). Position started at heart level and stepped through the following states: down, heart, head, up. After each state, the hand was returned to the heart position as a reference. To simplify our experimental setup, we used the same pulse oximeter on the middle finger to provide the raw IR plethysmogram. This is our control oximeter. The ring and index fingers then provide platforms for testing two additional devices. All the position and drive level studies used dual TruSat oximeters. Additional positional studies used two Philips M3 oximeters to confirm observations made with the TruSats.

The raw IR plethysmogram displayed an increasing trend with amplitude as the hand position was elevated higher toward head level with a concomitant increase in PI, apparently because of the larger AC component of the pulse, but seemed to decrease when the position was raised above the head. A sample of this response is seen in Figure 10. The upper left curve shows the raw plethysmogram acquired from a pair of Ohmeda TruSats as a function of hand position with respect to the heart (referred to as the state). The curve underneath this shows the PI as calculated by the TruSats and two prospective test signals, (max – min)/avg and AC/DC, again as a function of the position. The curve in the upper right shows a 7-s period comparing the time domain wave forms from the index and middle fingers. The curve underneath this shows the correlation between the two plethysmograms as a function of time. Underneath these curves are the two spectrograms (STFT). This response is found in both the TruSats and the M3s, but not consistently with all subjects. Both a normalized correlation coefficient calculation and the STFT spectrogram shown in Figure 10 indicate that the two oximeters on the same hand were seeing the same magnitude of activity in the IR plethysmogram.


Figure 1012
View larger version (80K):
[in this window]
[in a new window]

 
Figure 10. Tool developed in LabVIEW to explore the characteristics of plethysmogram signals on the same hand (not including thumb and pinky).

 

For some subjects, the PI "followed" the hand position, increasing as the hand was raised, as shown in Figure 11.


Figure 1112
View larger version (55K):
[in this window]
[in a new window]

 
Figure 11. The recorded raw plethysmogram is split into its AC and DC components using an Fourier transform based analysis calculated over a 3-s interval. Notice that the DC component (purple) is much less dependent on the position than the AC component (bright green).

 

The middle graph of Figure 11 demonstrates how the AC and DC components respond to the changes in position. From these curves, it appears that the change in the AC component is the greatest contributor to changes in PI (as measured by the TruSat), indicating that our test signal should focus on the AC characteristics. The TruSat oximeters continued to exhibit a small bias between the two monitors shown by the offset between the two signal curves in the top plot of Figure 11 but, importantly, the devices did not demonstrate an appreciable difference in measured saturation.

We continued to explore the question of how similar the finger plethysmograms are by comparing the wideband plethysmograms (the raw signal compensated for by the gain of the LEDs and the photodetector amplifier) from two fingers on the same hand. For both the TruSat and M3s, the motion generated with both fingers was synchronous, via visual assessment of an equivalent spectrogram. Figure 12, performed using the M3s, shows that, for noisy plethysmograms, the STFT is useful for comparing equivalency and that both [max – min]/avg (blue signal in the second trace) and AC/DC (green signal in the second trace) track perfusion though each can be quite noisy. Importantly, we were able to determine when the two fingers did not move synchronously, as can be seen by comparing the spectrogram in Figure 13.


Figure 1212
View larger version (80K):
[in this window]
[in a new window]

 
Figure 12. Same as Figure 10 but using the Philips M3. Notice the amplitude of the plethysmogram is 100x smaller than for the TruSats. We were unable to obtain the gain corrected plethysmogram values for the M3 but the signals were adequate for our studies anyway.

 

Figure 1312
View larger version (78K):
[in this window]
[in a new window]

 
Figure 13. Data collected using the Philips M3. These fingers are not presenting the same motion challenge. Differences can be seen in the raw plethysmograms and the short-term frequency transforms. Even the correlation function has more noise than usual.

 

DISCUSSION

We need to understand our instrumentation and the system that we were exploiting, namely the response of the tissue finger bed to light. It is important to be certain that the changes being measured were from the added motion and not from changes in underlying perfusion, possibly as a result of a change in position with respect to the heart, or a nonlinearity. Although much of what we seek is likely known to corporate research labs and anecdotally, we could not find published evidence of how the tissue bed responds to varying intensity exposure, nor could we find information about the plethysmographic response with respect to position for our particular instrumentation configuration. Rather than make assumptions, several simple experiments were devised to characterize our systems' performance. Two basic experiments were performed: 1) we exposed the tissue bed to a known intensity excitation holding the position constant, and 2) we varied the position of the hand and observed the response. In each of these experiments, we explored whether fingers on the same hand had equivalent plethysmograms, because our proposed test method uses one finger to control the amount of motion on the other two fingers. The visual representation of the STFT was used to compare finger responses with a more elaborate method for assessing similarity, based on the average difference of the point values under development.

In parallel with the continuing basic studies above, we developed an application to prototype a potential test method. Our application allows us to select the order of motions, the duration of the rest and motion periods, the desired A, and the desired control limits. We exercised this application on several volunteers, which raised several questions:

  • Does the type of motion that proceeds affect the results? Would the subject fatigue for one type of motion and not another and would this influence the subject's ability to keep the signal within the control limits?
  • Does the order of motion types influence the results?
  • Might exercise lead to exhaustion and the inability to maintain a high intensity state (not reach the control limits)?
  • Does the baseline recover to the same (resting) level? Which is the correct resting baseline to use in the calculation of A: the initial resting period, or the period immediately preceding the motion?

Figure 14 illustrates an example of the LabVIEW® based data collection system. Set-up parameters include length of rest period, length of motion period, the computation to be preformed on the raw plethysmogram (if any), and the control limits for that study. The subject performs the indicated motion, varying the intensity to attempt to maintain the computed feedback variable, K, within the range indicated by the control limits. The screen shot shows the data transition from a rest period to a motion period.

Where Do We Go from Here?
We have demonstrated the tools needed to scientifically and systematically create a motion standard for pulse oximetry. The next step will be selecting values for the parameters, using them to collect data from a representative population of subjects, and developing statistically significant conclusions. By creating a closed-loop control system, it is hoped that testing can be reproducible and will allow devices to be compared using an equivalently challenging test. To accomplish our test method, it was necessary to determine if the plethysmograms are equivalent in the fingers on the same hand, and that the signal observed is a result of the added motion and not an artifact of our measurement system, including the instrumentation and the human subject.

Some work is left to be done to select the computation of K, such that most subjects will be able to keep the test signal within the control limits. The ability of a subject to keep within the control limits is a function of how fast the system responds and the formulation of K. Our system could redraw the screen once per second, leading to a less than smooth feedback signal. Using a longer averaging filter or more points in the calculation will create a feedback variable that reacts more slowly. We instituted the concept of %IN, defined as a measure of time within the control limits compared with the time outside the control limits, when it was realized that many novice subjects had difficulty keeping their test signal within the desired range and it seemed the more experienced subjects had less difficulty. The degree to which the subject can keep the test signal within the control limits may be crucial to demonstrating the feasibility of this method. An acceptable value needs to be found, likely by experimentation and iteration.

Whether there is a critical threshold on %IN, below which the test becomes invalid, remains to be seen and is the focus of future work. It is expected from prior observations that training will improve the attainable %IN, and that %IN will also vary between motion types, as some motions are more difficult to maintain than others. We may use %IN as a quality control measure to discard invalid tests, just as the STFT may identify when the fingers are not acting in synchrony.

Proposed Protocol
Our proposed protocol will have the subject perform the motion for several increasing values of A. Preliminary experiments have shown us that A may not be the same value for all motions. An initial value of A will be set, and the subject will be asked to keep the amplitude of the generated test signal within the UCLs and LCLs for as much of the motion period as possible. At some point, the amount of motion generated with increasing A will cause failure (Af) of the tested device compared with the reference oximeter on the stationary hand. Barker and Shah (8) used the concept of percentage of time when the oximeter error exceeded a specified threshold. As additional subjects are tested for that same motion, additional values for Af will be determined. It remains to be seen how Af behaves over a population of subjects and whether it is reproducible for a subject and across subjects. With these data in hand, a fair threshold for a "pass" or "fail" can be determined. It is expected that some devices will have more difficulty with some motions than others, but a certain minimal level of performance should be mandatory to describe a device as "motion-resistant" or equivalent.

One concept for reporting the performance of each oximeter is a table format with the error measured as a function of motion type (Table 1).


View this table:
[in this window]
[in a new window]

 
Table 1. Proposed Testing Grid

 

Finally, the relative amplitudes and shapes in the STFT spectrograms we believe to be the motion signatures. We postulate that two motions with similar spectrogram signals are not independent, only one is needed in the test method. One of the next dilemmas needing to be addressed is how to validate this postulate. This will allow us to produce an efficient test, one that has a minimum amount of testing yet is still effective in differentiating motion-resistant oximeters from nonresistant ones.

CONCLUSIONS

We have proposed a method and rationale to perform motion testing that we believe will be reproducible and robust with respect to comparing motion-resistant pulse oximeter designs. Tools have been developed to explore and test the validity of the underlying assumptions, including the reliance on adjacent fingers seeing equivalent motion challenges and the independence of motion types. We believe we have identified variables that may have an influence on the ability of the test to be effective. What is left to be done is to perform rigorous, statistically valid protocols to select an appropriate form of the test statistic, optimize the duration of the resting and testing periods, and order the motions in an appropriate way. Our observations have demonstrated that the best chance for success relies on using trained volunteer subjects that can consistently control their motion and the resulting plethysmograms. Toward this end, we will seek additional opportunities to bring our agenda to completion. This work is just beginning.

Footnotes

Accepted for publication May 9, 2007.

Supported by Pooled funds donated by the Committee Members of ASTM committee F1415 on pulse oximetry, Duke University/Department of Anesthesiology fund 399-0046, and US FDA 1 year grant.

REFERENCES

  1. Shang AB. An investigational study to assess the performance of pulse oximeters under the presence of motion artifact. Anesth Analg 2002;94:S107
  2. Jopling MW, Mannheimer PD, Bebout DE. Issues in the laboratory evaluation of pulse oximeter performance. Anesth Analg 2002;94:S62–8[Web of Science][Medline]
  3. Severinghaus JW. History and recent developments in pulse oximetry. Scand J Clin Lab Invest Suppl 1993;214:105–11[Medline]
  4. Langton JA, Hanning CD. Effect of motion artefact on pulse oximeters: evaluation of four instruments and finger probes. Br J Anaesth 1990;65:564–70[Abstract/Free Full Text]
  5. Plummer JL, Zakaria AZ, Ilsley AH, Fronsko RR, Owen H. Evaluation of the influence of movement on saturation readings from pulse oximeters. Anaesthesia 1995;50:423–6[Web of Science][Medline]
  6. Tobin RM, Pologe JA, Batchelder PB. A characterization of motion affecting pulse oximetry in 350 patients. Anesth Analg 2002;94:S54–61[Web of Science][Medline]
  7. Gehring H, Hornberger C, Matz H, Konecny E, Schmucker P. The effects of motion artifact and low perfusion on the performance of a new generation of pulse oximeters in volunteers undergoing hypoxemia. Respir Care 2002;47:48–60[Medline]
  8. Barker SJ, Shah NK. The effects of motion on the performance of pulse oximeters in volunteers. Anesthesiology 1997;86:101–8[Web of Science][Medline]
  9. Barker SJ. "Motion-resistant" pulse oximetry: a comparison of new and old models. Anesth Analg 2002;95:967–72[Abstract/Free Full Text]
  10. Kästke SW, Konecny E. Determining the artifact sensitivity of recent pulse oximeters during laboratory benchmarking. J Clin Monit Comput 2000;16:509–22[Web of Science][Medline]




This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a colleague
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrowRequest Permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Shang, A. B.
Right arrow Articles by Weininger, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Shang, A. B.
Right arrow Articles by Weininger, S.


Lippincott, Williams & Wilkins Anesthesia & Analgesia® is published for the International Anesthesia Research Society® by Lippincott Williams & Wilkins and Stanford University Libraries' HighWire Press®. Copyright 2007 by the International Anesthesia Research Society. Online ISSN: 1526-7598   Print ISSN: 0003-2999 HighWire Press