Evidence Report/Technology Assessment: Number 1

Systematic Review of the Literature Regarding the Diagnosis of Sleep Apnea

Summary


Under its Evidence-based Practice Program, the Agency for Health Care Policy and Research (AHCPR) is developing scientific information for other agencies and organizations on which to base clinical guidelines, performance measures, and other quality improvement tools. Contractor institutions—such as the Metaworks, Inc., Evidence-based Practice Center—review all relevant scientific literature on assigned clinical care topics and produce evidence reports and technology assessments, conduct research on methodologies and the effectiveness of their implementation, and participate in technical assistance activities.

Overview / Reporting the Evidence / Methodology / Findings / Future Research / Availability of Full Report


Overview

In this study, MetaWorks investigators have developed an evidence base via a systematic review of the literature pertinent to diagnostic testing and screening in sleep apnea in adult patients. Sleep apnea (SA) is a recently recognized disorder of sleep characterized by recurrent apneic and hypopneic episodes.

Apnea was typically defined as complete cessation of airflow, but in some studies, a > 80 percent reduction in airflow was used. For defining hypopnea, most papers suggested a 50 percent or greater reduction in airflow was used, with or without a coincident O2 desaturation of anywhere from 2 percent to 4 percent from some average SaO2 over a preceding interval of time.

In view of its high prevalence and serious associated morbidity, SA has recently been described as a major public health concern. A major problem in the field in 1998 is diagnosis: who to test, how to test, and what are the implications of test results regarding the risk of serious clinical sequelae?

Sleep apnea is a condition where the gold standard diagnostic method, overnight full-channel polysomnography (PSG) in a sleep lab is intrusive and costly, and the interpretation can be difficult. A standard PSG typically consists of electroencephalogram (EEG), submental (±tibialis) electromyogram (EMG), electrooculogram (EOG), respiratory airflow (usually by oronasal flow monitors), respiratory effort (usually by plethysmography), and oxygen saturation (oximetry). Electrocardiography (ECG) and body position are also frequently monitored in formal sleep studies and stated to be standard requirements of PSG by some groups.

If, however, the estimated prevalence of sleep apnea at 2 percent to 4 percent of middle-aged adults is correct, the costs of full PSGs to diagnose all suspected cases would be prohibitive. The development of simpler and less costly alternatives for diagnostic testing would be highly desirable as would simpler prescreening tests prior to full PSG.

Diagnostic approaches which might be viewed either as alternatives to PSGs or as screening tests to better select patients for PSG include:

All such interventions were within the scope of this review, provided they compared results against the gold standard diagnostic test, the standard PSG.

Although the type of sleep evaluation study preferred (and reimbursed) varies widely among physicians, sleep centers, and managed care organizations, MetaWorks investigators have avoided making specific recommendations in this review. MetaWorks investigators also did not review technical considerations related to data acquisition, storage, retrieval, and analysis of various devices, which were beyond the scope of this project. Rather, it is intended that this synthesis of the best available evidence will serve as an information resource for local decisionmakers and developers of guidelines/recommendations. It should also serve to highlight gaps in literature and areas ripe for future research.

Return to Contents

Reporting the Evidence

The key questions that guided this review were:

  1. What diagnostic and screening tests are presently available?
  2. What is the strength of the evidence in support of each?
  3. What is the predictive value of these tests in different populations (which requires estimating the prevalence of SA in different populations)?
  4. What are the implications of certain PSG results in terms of serious clinical events occurring as comorbidities in association with a diagnosis of SA?

Return to Contents

Methodology

In general, MetaWorks investigators used systematic review methods derived from the evolving science of review research. The review followed a prospective protocol that was developed a priori and shared with the nominating partners on the project (Blue Cross/Blue Shield [BC/BS] of Massachusetts and the Sleep Disorders Centre of Metropolitan Toronto), a panel of technical experts (with representation from consumer groups and professional specialties: neurology, pulmonology, dentistry, otolaryngology, epidemiology, and nursing); and the Task Order Officers at AHCPR. The protocol outlined the methods to be used for the literature search, study eligibility criteria, data elements for extraction, and methodological strategies to minimize bias and maximize precision during the process of data collection, extraction, and synthesis.

The published literature was searched from 1980 to present. The search cutoff date was November 1, 1997, and the retrieval cutoff date was January 30, 1998. The search started with a broad Medline search using the terms "sleep apnea syndrome" and "monitoring, physiologic," "sleep apnea syndrome" and "airway resistance," and "human." Also, MetaWorks investigators searched "sleep apnea syndromes," "sleep apnea syndrome," and "index." In addition, the 1997 Current Contents CD-ROM was searched ("sleep apnea") to the same cutoff date. All citations and abstracts were printed and screened at MetaWorks for any mention of diagnostic tests in adults with SA, for which full papers were obtained. The electronic searches noted above were supplemented by a thorough search of the reference lists of all eligible studies and relevant review articles.

To be included in the review, studies had to report results of any diagnostic test or intervention to establish or support a diagnosis of SA in adults, with at least 10 patients as total sample size. Studies reported in the following Western European languages—English, German, French, Spanish, or Italian—were accepted.

All eligible papers were scored on features pertinent to diagnostic test study design, execution, and reporting, with a range of possible scores from 0 to 44. Those falling in the lowest 20 percent of the distribution of actual scores were dropped from data extraction and analysis. Each accepted diagnostic study was extracted in duplicate by investigators with one extractor using a blinded copy of each study report, masked as to source of financial support, authors, and journal. The agreement between extractors was approximately 78 percent and differences were resolved by consensus.

Key data elements sought for extraction from each study included study level, patient level, and test characteristics. Only clearly reported aggregate results were extracted from studies. Results that were only given for individual patients, and results that would require extrapolations from graphs or derivations from figures or tables were not captured. For all tests, sensitivity, specificity, positive predictive value, negative predictive value, and correlation coefficients of each test relative to PSG AI or AHI (RDI) results were sought.

Apnea index (AI) is defined as the number of apneic episodes/hour sleep, and apnea-hypopnea index (AHI) is the total apneas plus hypopneas during total time asleep, divided by the number of hours asleep. The respiratory distress index (RDI) is the same as AHI.

The main objective of the analysis was to evaluate the diagnostic accuracy of alternatives to full PSG for the diagnosis of SA as compared to a full PSG (gold standard). Initially, weighted averages using Mantel-Haenszel fixed effects models combining the comparative summary statistics were calculated and summarized for groups based on diagnostic test category. Study and patient-level covariates and study evidence scores were also summarized for each diagnostic test category. A summary receiver operating characteristic (ROC) curve was calculated for each diagnostic group where data were available. While differences among studies may be an argument against estimating one common sensitivity and specificity using fixed or random effects models, these factors can be described using the summary ROCs, which both display and summarize the heterogeneity.

A group of 22 peer reviewers drawn from consumer groups and professional organizations, along with our technical experts and partners, was assembled to review and provide suggestions to the draft final report describing this project. Their feedback, as well as that from AHCPR, was incorporated wherever possible within the original scope of the project.

Return to Contents

Findings

All Studies: PSG

Partial Channel PSGs

Portable Devices

Oximetry

Partial Time PSGs

Radiologic

Miscellaneous

Prediction Equations

Prevalence Studies

Comorbidity Studies

Conditions associated with SA:

Return to Contents

Future Research

Future studies of diagnostic test strategies should address the many limitations of the literature noted above. The field could benefit from adoption of a common terminology and definitions for fundamental concepts such as apnea and hypopnea, and the relation between AI and AHI should be established, in order to allow conversions and comparisons across studies. Researchers should seek to clarify what is the frequency of sleep apnea/hypopnea in general populations by gender and age. More naturalistic sleep studies (in the home) are still of interest, as MetaWorks investigators suspect much of the uncertainty about the nature of SA, its pathophysiology, the risk factors, and the clinical consequences, derive from the fact that the phenomenon of SA may be altered by the fact of observing it via standard PSG. Long-term followup studies are recommended to better document the findings of treated vs. untreated SA. Lastly, all sleep monitoring systems which are proposed as prequalifiers or replacements for PSG must be validated in the setting in which they are intended to be used.

Return to Contents

Availability of the Full Report

The full evidence report from which this summary was taken was prepared for AHCPR by MetaWorks, Inc., Boston, MA, under contract No. 290-97-0016. Print copies may be obtained free from the Publications Clearinghouse by calling 1-800-358-9295. Requesters should ask for Evidence Report/Technology Assessment No. 1, Systematic Review of the Literature Regarding the Diagnosis of Sleep Apnea (AHCPR Publication No. 99-E002). The Evidence Report is available online on the National Library of Medicine Bookshelf.

Return to Contents

AHCPR Publication Number 99-E001
Current as of October 1999


Internet Citation:

Systematic Review of the Literature Regarding the Diagnosis of Sleep Apnea. Summary, Evidence Report/Technology Assessment: Number 1, December 1998. Agency for Health Care Policy and Research, Rockville, MD. http://www.ahrq.gov/clinic/epcsums/apneasum.htm


Return EPC Evidence Reports
Clinical Information
AHRQ Home Page
Department of Health and Human Services