|
||||||||||||
|
||||||||||||
Patterns of Mental Health Service Utilization and Substance Use Among Adults, 2000 and 2001 |
This appendix describes statistical methods and data limitations of the 2001 National Household Survey on Drug Abuse (NHSDA). The 2000 NHSDA used a similar design and is described in Office of Applied Studies (OAS, 2001).
An important limitation of the NHSDA estimates of drug use prevalence is that they are only designed to describe the target population of the surveythe civilian, noninstitutionalized population aged 12 or older. Although this population includes almost 98 percent of the total U.S. population aged 12 or older, it excludes some important and unique subpopulations who may have very different drug-using patterns. For example, the survey excludes active military personnel, who have been shown to have significantly lower rates of illicit drug use. Persons living in institutional group quarters, such as prisons and residential drug treatment centers, are not included in the NHSDA and have been shown in other surveys to have higher rates of illicit drug use. Also excluded are homeless persons not living in a shelter on the survey date, another population shown to have higher than average rates of illicit drug use. Other surveys that provide data for these populations are described in other readily available reports produced by OAS (2001, 2002a, 2002b).
The national estimates, along with the associated variance components, were computed using a multiprocedure package, SUrvey DAta ANalysis (SUDAAN) Software for Statistical Analysis of Correlated Data, which was designed for the statistical analysis of sample survey data from stratified, multistage cluster samples (RTI, 2001). The final, nonresponse-adjusted, and poststratified analysis weights were used to compute unbiased design-based drug use estimates.
The sampling error (i.e., the standard error [SE]) of an estimate is the error caused by the selection of a sample instead of conducting a census of the population. Sampling error is reduced by selecting a large sample and by using efficient sample design and estimation strategies, such as stratification, optimal allocation, and ratio estimation.
With the use of probability sampling methods in the NHSDA, it is possible to develop estimates of sampling error from the survey data. These estimates have been calculated in SUDAAN for all estimates presented in this report using a Taylor series linearization approach that takes into account the effects of the complex NHSDA design features. The sampling errors are used to identify unreliable estimates and to test for the statistical significance of differences between estimates.
Estimates of proportions, d, such as drug use prevalence rates, take the form of nonlinear statistics where the variances cannot be expressed in closed form. Variance estimation for nonlinear statistics in SUDAAN is performed using a first-order Taylor series approximation of the deviations of estimates from their expected values.
Corresponding to proportion estimates, d, the number of drug users, d, can be estimated as
where d is the estimated population total for domain d, and d is the estimated proportion for domain d. The SE for the total estimate is obtained by multiplying the SE of the proportion by d, that is,
This approach is theoretically correct when the domain size estimates, d, are among those forced to Census Bureau population projections through the weight calibration process. In these cases, d is clearly not subject to sampling error.
For domain totals, d, where d is not fixed, this formulation may still provide a good approximation if it can be reasonably assumed that the sampling variation in d is negligible relative to the sampling variation in d. In most analyses conducted for prior years, this has been a reasonable assumption.
For a subset of the tables produced from the 2001 data, it was clear that the above approach yielded an underestimate of the variance of a total because d was subject to considerable variation. In these cases, a different method was used to estimate variances. SUDAAN provides an option to directly estimate the variance of the linear statistic that estimates a population total. Using this option did not affect the SE estimates for the corresponding proportions presented in the same sets of tables.
As has been done in past NHSDA reports, direct survey estimates considered to be unreliable due to unacceptably large sampling errors are not shown in this report and are noted by asterisks (*) in the tables containing such estimates. The criterion used for suppressing all direct survey estimates was based on the relative standard error (RSE), which is defined as the ratio of the standard error (SE) over the estimate.
Proportion estimates () within the range [0 < < 1], rates, and corresponding estimated number of users were suppressed if
or
Using a first-order Taylor series approximation to estimate RSE[-ln()] and RSE[-ln(1 - )], the following was obtained and used for computational purposes:
or
The separate formulas for 0.5 and > 0.5 produce a symmetric suppression rule (i.e., if is suppressed, then so will 1 - ). This ad hoc rule requires an effective sample size in excess of 50. When 0.05 < < 0.95, the symmetric property of the rule produces a local maximum effective sample size of 68 at = 0.5. Thus, estimates with these values of along with effective sample sizes falling below 68 are suppressed. A local minimum effective sample size of 50 occurs at = 0.2 and again at = 0.8 within this same interval, so estimates are suppressed for values of with effective sample sizes below 50.
Prior to the 2000 NHSDA, these varying sample size restrictions sometimes produced unusual occurrences of suppression for a particular combination of prevalence rates. For example, in some cases, lifetime prevalence rates near = 0.5 were suppressed (effective sample size was < 68 but > 50), while not suppressing the corresponding past year or past month estimates near = 0.2 (effective sample sizes were > 50). To reduce the occurrence of this type of inconsistency, a minimum effective sample size of 68 was added to the NHSDA suppression criteria starting in 2000. As approached 0.00 or 1.00 outside the interval (0.05, 0.95), the suppression criteria still required increasingly larger effective sample sizes. For example, if = 0.01 and 0.001, the effective sample size must exceed 152 and 684, respectively.
Also new to the NHSDA starting in 2000 were minimum nominal sample size suppression criteria (n = 100) that protect against unreliable estimates caused by small design effects and small nominal sample sizes. Prevalence estimates also were suppressed if they were close to 0 or 100 percent (i.e., if < 0.00005 or if 0.99995).
Estimates of other totals (e.g., number of initiates) along with means and rates (both not bounded between 0 and 1) were suppressed if RSE() > 0.5. Additionally, estimates of the mean age at first use were suppressed if the sample size was smaller than 10 respondents; moreover, the estimated incidence rate and number of initiates were suppressed if they rounded to 0.
The suppression criteria for various NHSDA estimates are summarized in Table C.1 at the end of this appendix.
This section describes the methods used to compare prevalence estimates in this report. Customarily, the observed difference between estimates is evaluated in terms of its statistical significance. "Statistical significance" refers to the probability that a difference as large as that observed would occur due to random error in the estimates if there were no difference in the prevalence rates for the population groups being compared. The significance of observed differences in this report is generally reported at the 0.05 and 0.01 levels. When comparing 2000 and 2001 prevalence estimates, the null hypothesis (no difference in the 2000 and 2001 prevalence rates) can be tested against the alternative hypothesis (there is a difference in prevalence rates) using the standard difference in proportions test expressed as follows:
where 1 = 2000 estimate, 2 = 2001 estimate, var(1) = variance of 2000 estimate, var(2) = variance of 2001 estimate, and cov(1, 2) = covariance between 1 and 2.
Under the null hypothesis, Z is asymptotically distributed as a normal random variable. Calculated values of Z can therefore be referred to as the unit normal distribution to determine the corresponding probability level (i.e., p value). Because there is a 50 percent overlap in the sampled segments between the 2000 and 2001 NHSDA, the covariance term in the formula for Z will, in general, be greater than 0. Estimates of Z, along with its p value, were calculated using SUDAAN, using the analysis weights and accounting for the sample design as described in Appendix B. A similar procedure and formula for Z were used for estimated totals and for comparing prevalence estimates for different population subgroups from the same data year.
When examining the effects of subgroup variables with more than two levels on a prevalence measure, a 2 test of independence of the subgroup and the prevalence variables was conducted first to control the error level for multiple comparisons. If the 2 test indicated some significant differences, the significance of each particular subgroup comparison discussed in the report was tested as indicated above. SUDAAN analytic procedures were used in all tests to properly account for the sample design.
Nonsampling errors can occur from nonresponse, coding errors, computer processing errors, errors in the sampling frame, reporting errors, and other errors not due to sampling. Nonsampling errors are reduced through data editing, statistical adjustments for nonresponse, close monitoring and periodic retraining of interviewers, and improvement in various quality control procedures.
Although nonsampling errors can often be much larger than sampling errors, measurement of most nonsampling errors is difficult or impossible. However, some indication of the effects of some types of nonsampling errors can be obtained through proxy measures, such as response rates and from other research studies.
Response rates for the NHSDA were stable for the period from 1994 to 1998, with the screening response rate at about 93 percent and the interview response rate at about 78 percent (response rates discussed in this appendix are weighted). In 1999, the computer-assisted interviewing (CAI) screening response rate was 89.6 percent, and the interview response rate was 68.6 percent. A more stable and experienced field interviewer (FI) workforce improved these rates in 2000 and continued in 2001. Of the 171,519 eligible households sampled for the 2001 NHSDA main study, 157,471 were successfully screened for a weighted screening response rate of 91.9 percent (Table C.2). In these screened households, a total of 89,745 sample persons were selected, and completed interviews were obtained from 68,929 of these sample persons, for a weighted interview response rate of 73.3 percent. A total of 13,478 (16.5 percent) sample persons were classified as refusals or parental refusals, 4,681 (5.3 percent) were not available or never at home, and 2,657 (4.9 percent) did not participate for various other reasons, such as physical or mental incompetence or language barrier (Table C.3). Tables C.4 and C.5 show the distribution of the selected sample by interview code and age group. The weighted interview response rate was highest among 12 to 17 year olds (82.2 percent), females (74.6 percent), blacks and Hispanics (75.0 and 78.8 percent, respectively), in nonmetropolitan areas (76.7 percent), and among persons residing in the South (74.4 percent) (Table C.6).
The overall weighted response rate, defined as the product of the weighted screening response rate and weighted interview response rate, was 61.5 percent in 1999, 68.6 percent in 2000, and 67.3 percent in 2001. Nonresponse bias can be expressed as the product of the nonresponse rate (1 - R) and the difference between the characteristic of interest between respondents and nonrespondents in the population (Pr - Pnr). Thus, assuming the quantity (Pr - Pnr) is fixed over time, the improvement in response rates in 2000 and 2001 over 1999 will result in estimates with lower nonresponse bias.
Among survey participants, item response rates were above 97 percent for most questionnaire items. However, inconsistent responses for some items, including the drug use items, were common. Estimates of substance use from the NHSDA are based on the responses to multiple questions by respondents, so that the maximum amount of information is used in determining whether a respondent is classified as a drug user. Inconsistencies in responses are resolved through a logical editing process that involves some judgment on the part of survey analysts and is a potential source of nonsampling error. Because of the automatic routing through the CAI questionnaire (e.g., lifetime drug use questions that skip entire modules when answered "no"), there is less editing of this type than in the paper-and-pencil interviewing (PAPI) questionnaire used prior to the NHSDA redesign in 1999.
In addition, logical editing is used less often because with the CAI data, statistical imputation is relied upon more heavily to determine the final values of drug use variables in cases where there is the potential to use logical editing to make a determination. The combined amount of editing and imputation in the CAI data is still considerably less than the total amount used in prior PAPI surveys. For the 2001 CAI data, for example, 6.7 percent of the estimate of past month hallucinogen use was based on logically edited cases and 6.6 percent on imputed cases, for a combined amount of 13.3 percent. In the 1998 NHSDA (administered using PAPI), the amount of editing and imputation for past month hallucinogen use was 60 and 0 percent, respectively, for a total of 60 percent. The combined amount of editing and imputation for the estimate of past month heroin use was 5.7 percent for the 2001 CAI and 37.0 percent for the 1998 PAPI data.
Nonresponse categories for survey items include "don't know" and "refused" responses, as well as missing data on subsequent items linked to those responses. Among survey participants, response rates to the treatment receipt survey items were above 95 percent. The majority (74 to 85 percent) of nonrespondents to treatment receipt items also did not respond to the unmet treatment need item. To examine the potential impact of missing data on the present study, sociodemographic distributions between respondents and nonrespondents on treatment receipt variables were compared. Nonrespondents also included individuals whose indicated treatment receipt was "unknown." Missing data on treatment were more common among men, non-high school graduates, the unemployed, those in the lowest family income category, and welfare recipients. Less missing data was indicated among nondrug users and those with no additional treatment need. Patterns of missing data were similar among persons receiving inpatient, outpatient, and prescription medication-only treatment, although some small differences were observed.
To further examine the interrelationship of demographic and socioeconomic variables to the "risk" of having missing data, adjusted odds ratios and confidence intervals were derived from multiple logistic regression models. Significant correlates of nonresponse to the "any inpatient mental health treatment" item included being male, non-Hispanic white, having lower family income, and no illicit drug use in the past year. Nonrespondents to the "any outpatient mental health treatment" item were more likely to be married, have a lower family income, and live in a small metropolitan area. Significant correlates of nonresponse to receiving "prescription medication" included having a family income of $50,000 to $74,999 and no illicit drug use. Nonrespondents to "any mental health treatment" item were more likely to have lower family incomes, live in a small metropolitan area, and report no illicit drug use in the past year. As the above sociodemographic groups were less likely to respond to the treatment receipt survey items, findings presented in the analyses in this report may be less generalizable to these population groups.
NHSDA estimates are based on self-reports of drug use, and their value depends on respondents' truthfulness and memory. Although many studies have generally established the validity of self-report data and the NHSDA procedures were designed to encourage honesty and recall, some degree of underreporting is assumed (Harrell, 1997; Harrison & Hughes, 1997; Rouse, Kozel, & Richards, 1985). No adjustment to NHSDA data is made to correct for this. The methodology used in the NHSDA has been shown to produce more valid results than other self-report methods (e.g., by telephone) (Aquilino, 1994; Turner, Lessler, & Gfroerer, 1992). However, comparisons of NHSDA data with data from surveys conducted in classrooms suggest that underreporting of drug use by youths in their homes may be substantial (Gfroerer, 1993; Gfroerer, Wright, & Kopstein, 1997).
Estimate | Suppress if: |
---|---|
Prevalence rate, , with nominal sample size, n, and design effect, deff | The estimated prevalence rate, , is < 0.00005 or 0.99995, or
, or , or Effective n < 68, or n < 100 where Note: The rounding portion of this suppression rule for prevalence rates will produce some estimates that round at one decimal place to 0.0 or 100.0 percent but are not suppressed from the tables. |
Estimated number (numerator of ) |
The estimated prevalence rate, , is suppressed.
Note: In some instances when is not suppressed, the estimated number may appear as a 0 in the tables; this means that the estimate is > 0 but < 500 (estimated numbers are shown in thousands). |
Mean age at first use, , with nominal sample size, n | , or
n < 10 |
Incidence rate, | Rounds to < 0.1 per 1,000 person-years of exposure, or |
Number of initiates, | Rounds to < 1,000 initiates, or |
Screening Result | 1999 NHSDA | 2000 NHSDA | 2001 NHSDA | |||
---|---|---|---|---|---|---|
Sample Size | Weighted Percentage | Sample Size | Weighted Percentage | Sample Size | Weighted Percentage | |
Total Sample | 223,868 | 100.00 | 215,860 | 100.00 | 203,544 | 100.00 |
Ineligible cases | 36,026 | 15.78 | 33,284 | 15.09 | 32,025 | 15.40 |
Eligible cases | 187,842 | 84.22 | 182,576 | 84.91 | 171,519 | 84.60 |
Ineligibles | 36,026 | 100.00 | 33,284 | 100.00 | 32,025 | 100.00 |
Vacant | 18,034 | 49.71 | 16,796 | 50.76 | 16,489 | 51.71 |
4,516 | 12.90 | 4,506 | 13.26 | 4,706 | 14.69 | |
Not a dwelling unit | 4,626 | 12.70 | 3,173 | 9.33 | 2,913 | 8.66 |
All military personnel | 482 | 1.22 | 414 | 1.21 | 327 | 0.93 |
Other, ineligible | 8,368 | 23.46 | 8,395 | 25.43 | 7,590 | 24.00 |
Eligible Cases | 187,842 | 100.00 | 182,576 | 100.00 | 171,519 | 100.00 |
Screening complete | 169,166 | 89.63 | 169,769 | 92.84 | 157,471 | 91.86 |
No one selected | 101,537 | 54.19 | 99,999 | 55.36 | 90,530 | 52.11 |
One selected | 44,436 | 23.63 | 46,981 | 25.46 | 43,601 | 25.94 |
Two selected | 23,193 | 11.82 | 22,789 | 12.03 | 23,340 | 13.82 |
18,676 | 10.37 | 12,807 | 7.16 | 14,048 | 8.14 | |
No one home | 4,291 | 2.38 | 3,238 | 1.82 | 3,383 | 1.90 |
Respondent unavailable |
651 | 0.36 | 415 | 0.24 | 392 | 0.24 |
incompetent |
419 | 0.24 | 310 | 0.16 | 357 | 0.20 |
Language barrier Hispanic |
102 | 0.06 | 83 | 0.05 | 130 | 0.09 |
Language barrier Other |
486 | 0.28 | 434 | 0.27 | 590 | 0.39 |
Refusal | 11,097 | 5.92 | 7,535 | 4.14 | 8,525 | 4.93 |
Other, access denied | 1,536 | 1.08 | 748 | 0.45 | 613 | 0.35 |
Other, eligible | 38 | 0.02 | 7 | 0.00 | 9 | 0.00 |
56 | 0.03 | 37 | 0.02 | 49 | 0.03 |
Final Interview Code | 1999 NHSDA | 2000 NHSDA | 2001 NHSDA | |||
---|---|---|---|---|---|---|
Sample Size | Weighted Percentage | Sample Size | Weighted Percentage | Sample Size | Weighted Percentage | |
Total Selected Persons | 89,883 | 100.00 | 91,961 | 100.00 | 89,745 | 100.00 |
Interview complete | 66,706 | 68.55 | 71,764 | 73.93 | 68,929 | 73.31 |
No one at dwelling unit | 1,795 | 2.13 | 1,776 | 2.02 | 1,728 | 2.00 |
Respondent unavailable | 3,897 | 4.53 | 3,058 | 3.52 | 2,953 | 3.30 |
Breakoff | 50 | 0.07 | 72 | 0.09 | 79 | 0.12 |
Physically/mentally incompetent | 1,017 | 2.62 | 1,053 | 2.57 | 1,020 | 2.43 |
168 | 0.12 | 109 | 0.08 | 190 | 0.17 | |
480 | 1.46 | 441 | 1.06 | 470 | 1.30 | |
Refusal | 11,276 | 17.98 | 10,109 | 14.99 | 10,961 | 15.60 |
Parental refusal | 2,888 | 1.01 | 2,655 | 0.88 | 2,517 | 0.92 |
Other | 1,606 | 1.53 | 924 | 0.86 | 898 | 0.86 |
Final Interview Code | 1999 NHSDA | 2000 NHSDA | 2001 NHSDA | |||
---|---|---|---|---|---|---|
Sample Size | Weighted Percentage | Sample Size | Weighted Percentage | Sample Size | Weighted Percentage | |
Total Selected Persons | 32,011 | 100.00 | 31,242 | 100.00 | 28,188 | 100.00 |
Interview complete | 25,384 | 78.07 | 25,756 | 82.58 | 23,178 | 82.18 |
No one at dwelling unit | 322 | 1.09 | 278 | 0.86 | 254 | 0.92 |
Respondent unavailable | 872 | 3.04 | 617 | 2.05 | 551 | 2.13 |
Breakoff | 13 | 0.03 | 18 | 0.05 | 17 | 0.05 |
Physically/mentally incompetent | 244 | 0.76 | 234 | 0.76 | 219 | 0.79 |
15 | 0.03 | 10 | 0.03 | 18 | 0.08 | |
58 | 0.18 | 50 | 0.20 | 34 | 0.11 | |
Refusal | 1,808 | 5.97 | 1,455 | 4.52 | 1,247 | 4.14 |
Parental refusal | 2,885 | 9.50 | 2,641 | 8.35 | 2,517 | 8.95 |
Other | 410 | 1.33 | 183 | 0.59 | 153 | 0.64 |
Final Interview Code |
1999 NHSDA | 2000 NHSDA | 2001 NHSDA | |||
---|---|---|---|---|---|---|
Sample Size | Weighted Percentage | Sample Size | Weighted Percentage | Sample Size | Weighted Percentage | |
Total Selected Persons | 57,872 | 100.00 | 60,719 | 100.00 | 61,557 | 100.00 |
Interview complete | 41,322 | 67.41 | 46,008 | 72.92 | 45,751 | 72.29 |
No one at dwelling unit | 1,473 | 2.25 | 1,498 | 2.16 | 1,474 | 2.12 |
Respondent unavailable | 3,025 | 4.71 | 2,441 | 3.69 | 2,402 | 3.43 |
Breakoff | 37 | 0.07 | 54 | 0.09 | 62 | 0.13 |
Physically/mentally incompetent | 773 | 2.85 | 819 | 2.78 | 801 | 2.62 |
153 | 0.13 | 99 | 0.09 | 172 | 0.18 | |
422 | 1.62 | 391 | 1.16 | 436 | 1.43 | |
Refusal | 9,468 | 19.41 | 8,654 | 16.22 | 9,714 | 16.92 |
Parental refusal | 3 | 0.00 | 14 | 0.01 | 0 | 0.00 |
Other | 1,196 | 1.55 | 741 | 0.89 | 745 | 0.88 |
1999 NHSDA | 2000 NHSDA | 2001 NHSDA | |||||||
---|---|---|---|---|---|---|---|---|---|
Selected Persons | Selected Persons | Weighted Response Rate | Selected Persons | Selected Persons | Weighted Response Rate | Selected Persons | Selected Persons | Weighted Response Rate | |
Total | 89,883 | 66,706 | 68.55% | 91,961 | 71,764 | 73.93% | 89,745 | 68,929 | 73.31% |
Age in Years | |||||||||
1217 | 32,011 | 25,384 | 78.07% | 31,242 | 25,756 | 82.58% | 28,188 | 23,178 | 82.18% |
1825 | 30,439 | 22,151 | 71.21% | 29,424 | 22,849 | 77.34% | 30,304 | 22,931 | 75.51% |
26 or older | 27,433 | 19,171 | 66.76% | 31,295 | 23,159 | 72.17% | 31,253 | 22,820 | 71.75% |
Gender | |||||||||
Male | 43,883 | 31,987 | 67.12% | 44,899 | 34,375 | 72.68% | 43,949 | 33,109 | 71.92% |
Female | 46,000 | 34,719 | 69.81% | 47,062 | 37,389 | 75.09% | 45,796 | 35,820 | 74.58% |
Race/Ethnicity | |||||||||
Hispanic | 11,203 | 8,755 | 74.59% | 11,454 | 9,396 | 77.95% | 10,885 | 8,777 | 78.78% |
White | 63,211 | 46,272 | 67.98% | 64,517 | 49,631 | 73.39% | 63,228 | 48,016 | 72.65% |
Black | 10,552 | 8,044 | 70.39% | 10,740 | 8,638 | 76.19% | 10,584 | 8,295 | 74.98% |
All other races | 4,917 | 3,635 | 59.28% | 5,250 | 4,099 | 67.31% | 5,048 | 3,841 | 66.65% |
Region | |||||||||
Northeast | 16,794 | 11,830 | 64.03% | 18,959 | 14,394 | 71.68% | 19,180 | 14,444 | 71.02% |
Midwest | 24,885 | 18,103 | 69.63% | 25,428 | 19,355 | 73.23% | 25,560 | 19,212 | 73.25% |
South | 27,390 | 21,018 | 70.93% | 27,217 | 22,041 | 76.38% | 26,278 | 20,609 | 74.44% |
West | 20,814 | 15,755 | 67.47% | 20,357 | 15,974 | 72.68% | 18,727 | 14,664 | 73.51% |
County Type | |||||||||
36,101 | 25,901 | 65.15% | 37,754 | 28,744 | 71.77% | 35,395 | 26,403 | 71.00% | |
30,642 | 22,612 | 69.98% | 31,400 | 24,579 | 74.96% | 31,740 | 24,575 | 74.66% | |
Nonmetropolitan | 23,140 | 18,193 | 74.97% | 22,807 | 18,441 | 77.58% | 22,610 | 17,951 | 76.72% |
|
|
Click to Return to OAS Home Page |
|
This page was last updated on June 08, 2004.
|