278 E
VIDENCE
-B
ASED
H
EALTH
P
RACTICE
utilized in treatment planning (Dobscha, Gerrity, & Ward, 2001). In fact, the U.S.
Preventive Services Task Force (2002) concluded that there is currently insuffi-
cient evidence to recommend for or against routine use of standardized question-
naires to screen for depression in primary care patients, and that if screening is
implemented, the settings must ensure that procedures conducive to accurate di-
agnosis, effective treatment, and careful follow-up are also implemented (Pignone
et al., 2002).
The failed attempts to improve the adequacy of depression treatment by solely
promoting the use of screening highlighted the need for sustainable, multifaceted
supports to the structure and organization of care (Gilbody, Whitty, Grimshaw, &
Thomas, 2003). Even though screening alone has not been associated with im-
proved depression rates of treatment and detection, screening is still a pivotal
component of evidence-based care. Clear evaluation of how the numerous depres-
sion screeners are appropriately or inappropriately used across settings and ser-
vice populations remains important.
The literature demonstrates that numerous screening options are available for
detecting depression. These scales vary in length, administration style, scoring
complexity, and underlying assumptions. Some of these scales were designed for
specific use in medical settings; others were designed specifically for older
adults. When selecting a screen for use, practitioners should compare the possible
depression screeners on four domains, as described by Corcoran (1991): (1) sam-
ple characteristics, (2) reliability, (3) validity, and (4) practicality.
In the first domain, sample characteristics, the focus was on identifying mea-
sures that were validated for older adult populations or specific clinic popula-
tions (e.g., primary care settings), diversity in populations, and languages
available. Examples of depression screens tested on older adults are listed in Table
11.1. The second domain for consideration in selecting a screening instrument is
the methodological concern for reliability—the ability of the measure to yield the
same value of a construct over time, situation, or other influences of error (Ker-
linger, 1986). Likewise, the third domain is the validity of the screening instru-
ment—the accuracy of the measure. Often, validity is measured by two factors:
specificity and sensitivity. Sensitivity refers to the instrument’s ability to cor-
rectly identify participants with depression (i.e., true positives). Specificity is the
ability to correctly identify those who do not have depression (true negatives; Ro-
bison, Gruman, Gaztambide, & Blank, 2002). In clinical practice, the focus has
been on ensuring high sensitivity because clinicians do not want to risk unrecog-
nized depression. However, low specificity also becomes a concern for resource
allocation because referring false-positive clients for full mental health evalua-
tion leads to resources being spent unnecessarily. This is especially problematic
in settings were competing demands and limited resources already confine the
reach of mental health service use (Nease, Klinkman, & Volk, 2002).
The sources provided for each screening instrument in Table 11.1 describe the
respective evidence for the validity and reliability of each measure identified for
use with older adults. Both Mulrow et al. (1995) and Williams, Pignone, Ramirez,
and Perez Stellato (2002) concluded that multiple instruments with reasonable
operating characteristics are available for depression screening. Williams et al.
concluded that because operating characteristics among the instruments are
equivalent, selecting a depression screener should depend on factors of feasibil-
ity, administration, and scoring ease.