Interpreting Validity Indexes for Diagnostic Tests: An Illustration Using the Berg Balance Test.


Key Words: Diagnosis; Tests and measurements, general.

Physical therapists routinely perform diagnostic tests on their patients. For diagnostic test results to be most useful, we contend that validity estimates from studies of the diagnostic test in question should be used to guide clinical decisions. The purpose of this perspective is to describe a conceptual model proposed by other authors[1,2] for the application of validity indexes for diagnostic (or prognostic) tests to clinical practice. We use a clinical illustration to demonstrate how measures, which we refer to as "validity indexes" (ie, sensitivity, specificity, positive and negative predictive values, likelihood ratios), can be interpreted for individual patients. The illustration combines data from 2 studies on the use (validity) of the Berg Balance Test (BBT) for predicting risk of falls among elderly people aged 65 to 94 years.[3,4] The illustration is meant only to demonstrate how validity indexes can be useful for practice and not necessarily to assist clinicians in the examination of patients suspected of having balance disorders.

Studies that can be used to determine whether meaningful clinical inferences can be made based on diagnostic tests are classified as "criterion-related validity studies."[5] Criterion-related validity studies take 1 of 2 forms. Researchers can compare a clinical measure with a "gold standard" measure (ideally, a valid diagnostic test or a definitive measure of whether the condition of interest is truly present) obtained at about the same time as the measure being studied. In our illustration, the patient's report of falling is considered the gold standard measure. In other cases, a gold standard measure may be a diagnosis made at the time of surgery or via an invasive diagnostic procedure. Studies in which some form of gold standard is obtained at about the same time as the diagnostic test being studied are commonly called "concurrent criterion-related validity studies."[5] Researchers can also compare a measure's prediction of a future event with what actually happens to a patient in the future. These studies are commonly termed "predictive criterion-related validity studies."[5]

Studies designed to estimate the risk of a future adverse event are often used by clinicians to make judgments about prognoses. For example, investigating whether the BBT can be used to predict whether a person will fall in the future is an illustration of a predictive criterion-related validity study. The gold standard for this type of study would be the subjects' report of falls for a period of time following administration of the BBT.

The Berg Balance Test

The BBT was designed to be an easy-to-administer, safe, simple, and reasonably brief measure of balance for elderly people. The developers expressed the hope that the BBT would be used to monitor the status of a patient's balance and to assess disease course and response to treatment.[6] Patients are asked to complete 14 tasks, and each task is rated by an examiner on a 5-point scale ranging from 0 (cannot perform) to 4 (normal performance). Elements of the test are supposed to be representative of daily activities that require balance, including tasks such as sitting, standing, leaning over, and stepping. Some tasks are rated according to the quality of the performance of the task, whereas the time taken to complete the task is measured for other tasks. The developers of the BBT provided operational definitions for each task and the criteria for grading each task. Overall scores can range from 0 (severely impaired balance) to 56 (excellent balance).

Data exist to support the reliability of BBT scores obtained from elderly subjects.[3,6,7] For example, Bogle Thorbahn and Newton[3] reported an intertester reliability (Spearman rho) value of .88 for 17 subjects aged 69 to 94 years. Evidence also exists to support the content validity,[6] construct validity,[7,8] and criterion-related validity[3,4,8] of test scores for inferring fall risk in elderly subjects tested in a variety of settings. Construct validity has been assessed using a variety of approaches. For example, construct validity was supported to the extent that BBT scores were shown to correlate reasonably well with other measures of balance (Pearson r=.38-.91) and measures of motor performance (Pearson r=.62-.94).[7,8] For example, the Pearson r correlation between the BBT and the balance subscale of the Tinetti Performance-Oriented Mobility Assessment[9] was .91.[8] The Pearson r correlation between the BBT and the Barthel Index mobility subscale[10] was .67.[8]

The Illustration

To illustrate how to interpret validity indexes, we have combined data from 2 studies[3,4] designed to determine whether BBT scores could identify elderly people (age range=65-94 years) who are at risk for falling. Subjects in both studies were of similar ages and had similar BBT scores, and the proportions of male and female subjects were also similar (Tab. 1). In both studies, the subjects reported whether they had fallen and the number of falls in the 6 months prior to being admitted to the study. In addition, for both studies, the authors appeared to use essentially the same definition for what constituted a fall. Bogle Thorbahn and Newton[3] defined a fall as an unexpected contact of any part of the body with the ground. Shumway-Cook and colleagues[4] defined a fall as any event that led to an unplanned, unexpected contact with a supporting surface.

Table 1. Characteristics of the Subjects Combined From Two Studies[3,4]

                        Study of Shumway-Cook   Study of Bogle Thorbahn
                        and colleagues[4]       and Newton[3]
Characteristic          (N=44)                  (N=66)

Age (y)
 Mean                   76.1                    79.2
 SD                      6.6                     6.2
 Range                  65-94                   69-94

Sex (%)
 Male                   27                      24
 Female                 73                      76

Berg Balance Test
 Mean                   46.1                    48.2
 SD                     10.5                     9.9
 Range                  18-56                    9-56

Gold standard           50                      17
classification of
fallers (%)


The 2 studies differed in 2 potentially important ways. First, Shumway-Cook et al[4] excluded subjects with comorbidities that may have affected balance. Bogle Thorbahn and Newton[3] did not exclude these types of subjects. Subjects in the study by Shumway-Cook and colleagues reported no comorbidities, whereas 38% of the subjects in the study by Bogle Thorbahn and Newton reported having diagnoses of neurological or orthopedic conditions. Second, subjects in the study by Shumway-Cook et al were required to have fallen at least twice in the previous 6 months, whereas subjects in the study by Bogle Thorbahn and Newton had to have fallen only once or more in the previous 6 months. It is unclear how these differences affected the validity estimates reported by these authors, but we believe the studies were similar enough to allow us to combine the data for the illustration in this article. It is also unclear why the proportion of fallers (50%) in the study by Shumway-Cook et al was much higher than the proportion of fallers (17%) in the study by Bogle Thorbahn and Newton.

Diagnostic Test Methodology

We believe that the subjects studied (the sample) should represent those types of patients who will be measured during clinical practice.[11] In our illustration, the sample of subjects was elderly people (ages ranging from 65 to 94 years) living independently. Some patients will have the disorder of interest (using our illustration, some subjects reported falls), and some patients will not have the disorder of interest (some reported no falls). The test being studied (ie, the BBT) and the gold standard or criterion measure (ie, determination of whether the subject had fallen in the past 6 months) are applied to all subjects, and the test's diagnostic accuracy (Tab. 2) is determined.[9]

Table 2. Two x Two Table, Formulas, and Definitions for Validity Indexes(a)

                    Gold Standard Test Result

Diagnostic    +                     -
Test Result   (Condition Present)   (Condition Absent)   Total

+             True Positive         False Positive       a + b
              (a)                   (b)

-             False Negative        True Negative        c + d
              (c)                   (d)

Total         a + c                 b + d                a + b + c + d


Sensitivity: Those people correctly identified by the test as having the condition of interest as a percentage of all those who truly have the condition of interest: [100% × (a/[a + c])].

Specificity: Those people correctly identified by the test as not having the condition of interest as a percentage of all those who truly do not have the condition of interest: [100% × (d/[b + d])].

False Positive Rate: Those people falsely identified by the test as having the condition of interest as a percentage of all patients without the condition of interest: [100% × (b/[b + d])].

False Negative Rate: Those people falsely identified by the test as not having the condition of interest as a percentage of all patients with the condition of interest: [100% × (c/[a + c])].

Positive Predictive Value: Those people correctly identified by the test as having the condition of interest as a percentage of all those identified by the test as having the condition of interest: [100% × (a/[a + b])].

Negative Predictive Value: Those people correctly identified by the test as not having the condition of interest as a percentage of all those identified by the test as not having the condition of interest: [100% × (d/[c + d])].

Diagnostic Accuracy: The percentage of people who are correctly diagnosed: [100% × (a + d)/(a + b + c + d)].

Prevalence: The percentage of people in a target population who truly have the condition of interest: [100% × (a + c)/(a + b + c + d)].

Likelihood Ratio for a Positive Test: Sensitivity divided by (1 - specificity): [{a/(a + c)}/{b/(b + d)}].

Likelihood Ratio for a Negative Test: (1 - sensitivity) divided by specificity: [{c/(a + c)}/{d/(b + d)}].

Pretest Probability of the Disorder: The therapist's estimate of the patient's chance of having the disorder (condition of interest) prior to the therapist doing the test. It is usually estimated by the clinician based on prior knowledge and experience.

Posttest Probability of the Disorder: The patient's chance of having the condition of interest after the results of the test are obtained.

(a) All definitions agree with the Standards for Tests and Measurements in Physical Therapy Practice.[5] Definitions for sensitivity, specificity, false positive rate, false negative rate, positive predictive value, and negative predictive value are derived from the Standards for Tests and Measurements in Physical Therapy Practice.[5] Definitions for diagnostic accuracy, prevalence, likelihood ratio for a positive test, likelihood ratio for a negative test, pretest probability of the disorder, and posttest probability of the disorder are derived from Sackett and colleagues.[1,2]

The results from diagnostic accuracy studies are often summarized in a format similar to that shown in Table 2.[12-14] In this table, the terms "condition present" and "condition absent" are used to identify people who truly have or do not have the condition of interest (the gold standard test is either positive or negative). The letters "a," "b," "c," and "d" are used to reference cells in the table, and the sums "a + b," "c + d," "a + c," "b + d," and "a + b + c + d" denote marginal values. The cell values and marginal values are combined in various ways to calculate validity indexes. Definitions of terms related to diagnostic testing and formulas for the many validity indexes also are presented in Table 2.
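The formulas in Table 2 can be sketched in a few lines of Python. This is a minimal illustration only: the function name `validity_indexes` and the dictionary keys are hypothetical choices made for this sketch. The cell counts are those shown in Table 3 for a BBT cutoff point of 45 (a=21, b=8, c=12, d=69).

```python
def validity_indexes(a, b, c, d):
    """Compute the Table 2 validity indexes from the cells of a 2 x 2 table.

    a = true positives, b = false positives,
    c = false negatives, d = true negatives.
    All values are returned as proportions (multiply by 100 for percentages).
    """
    total = a + b + c + d
    sensitivity = a / (a + c)
    specificity = d / (b + d)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "false_positive_rate": b / (b + d),
        "false_negative_rate": c / (a + c),
        "positive_predictive_value": a / (a + b),
        "negative_predictive_value": d / (c + d),
        "diagnostic_accuracy": (a + d) / total,
        "prevalence": (a + c) / total,
        # A likelihood ratio is undefined when its denominator is zero.
        "lr_positive": sensitivity / (1 - specificity) if specificity < 1 else None,
        "lr_negative": (1 - sensitivity) / specificity if specificity > 0 else None,
    }


# Cell counts for a BBT cutoff point of 45 (combined data, Table 3).
indexes = validity_indexes(a=21, b=8, c=12, d=69)
for name, value in indexes.items():
    print(f"{name}: {value:.2f}")
```

With these counts, the sketch gives a sensitivity of about 64%, a specificity of about 90%, and a prevalence of 30% (33 fallers among 110 subjects).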

Sensitivity and Specificity

Sensitivity indicates how often a diagnostic test detects a disease or condition when it is present. Sensitivity essentially tells the clinician how good the test is at correctly identifying patients with the condition of interest. Specificity indicates how often a diagnostic test is negative in the absence of the disease or condition. Specificity essentially tells the clinician how good the test is at correctly identifying the absence of disease.[15] The closer the sensitivity or specificity is to 100%, the more sensitive or specific the test.

The authors of both studies in our illustration reported the sensitivity and specificity of the BBT for determining current fall risk. Berg et al[8] contended that the best way to interpret scores on the BBT is to use a single cutoff point of 45 to differentiate those at risk for falls (those with scores of <45) and those who are not at risk for falls (those with scores of ≥45). Using a cutoff point of 45, as recommended by Berg et al, the sensitivity for the data collected by Shumway-Cook and colleagues[4] was 55% and the specificity was 95%. For the data collected by Bogle Thorbahn and Newton,[3] the sensitivity was 82% and the specificity was 87%. When we combined the data from both studies, a cutoff point of 45 yielded a sensitivity of 64% and a specificity of 90% (Tab. 3). A sensitivity of 64% indicates that 64% of subjects who were true fallers had a positive BBT (a score of <45). That is, approximately a third of the subjects who were fallers were missed by the BBT. Although there are no agreed-on standards for judging sensitivity and specificity, we believe the sensitivity of 64% should generally be considered quite low because more than a third of the fallers were misclassified.

Table 3. Sensitivity and Specificity for Four Cutoff Points of the Berg Balance Test (BBT)

2 x 2 Tables for Four BBT Cutoff Points

BBT Cutoff   BBT        Gold Standard         Sensitivity   Specificity
Point        Result     Fall      No Fall     a/(a + c)     d/(b + d)

40           +          15 (a)     3 (b)      45%           96%
             -          18 (c)    74 (d)

45           +          21 (a)     8 (b)      64%           90%
             -          12 (c)    69 (d)

50           +          28 (a)    21 (b)      85%           73%
             -           5 (c)    56 (d)

55           +          32 (a)    57 (b)      97%           26%
             -           1 (c)    20 (d)

+ = positive BBT (score below the cutoff point); - = negative BBT (score at or above the cutoff point).


A specificity of 90% indicates that 90% of subjects who were nonfallers had a negative BBT (a score of ≥45). That is, only 10% of the nonfallers were misclassified by the BBT. Specificity was much higher than sensitivity, indicating that the BBT does a better job of identifying subjects who are not fallers than subjects who are fallers. When we use diagnostic tests, we do not know who has the condition of interest and who does not have the condition of interest. That is, sensitivity and specificity have somewhat limited usefulness because they do not describe validity in the context of the test result.[1] Rather, they describe validity in the context of the gold standard, a value we do not know when we do diagnostic tests. Sensitivity, for example, does not take into account the false positive test results (Tab. 2) on a group of patients. Stated another way, sensitivity does not describe how often patients with positive tests have the disorder of interest. Sensitivity only describes the proportion of patients with the disorder of interest who have a positive test. Similarly, specificity does not take into account false negative test results (Tab. 2). Specificity does not describe how often patients with negative tests do not have the disorder of interest. Specificity only describes the proportion of patients without the disorder of interest who have a negative test.

Diagnostic testing, in our view, is used because clinicians want to know the probability of the condition existing. Because clinicians make decisions based on diagnostic test results and not necessarily on results of tests that are considered gold standards, some authors[1] have contended that positive and negative predictive values (see next section) are more important than sensitivity and specificity for clinical practice.

Positive and Negative Predictive Values

Before diagnostic testing, therapists usually have collected a variety of information (eg, medical history, some examination data) from the patient. Based on their knowledge, training, and experience, therapists can sometimes use these data, depending on what is known about various conditions, to estimate the probability the condition of interest is present. This is known as the pretest probability of the disorder.[1] For example, if a therapist found that an elderly patient had a history of dizziness and required assistance with most activities of daily living, the therapist might anticipate that the patient's risk of falling was quite high, say on the order of 60%. Because the therapist knew evidence existed to indicate that dizziness[16] and difficulty with home activities of daily living[17] increase fall risk, the therapist estimated the pretest probability for falls to be quite high. The pretest probability estimate of 60% is only an estimate and may contain some error. The therapist could then do a BBT to better estimate the patient's risk of falling. Positive and negative predictive values describe the probability of disease after the test is completed. The probability of the condition of interest after the test result is obtained is also known as the posttest probability of the disorder.[1]

For many clinicians, the idea of estimating the probability of a disorder prior to doing a diagnostic test (pretest probability) may seem like a new or unusual concept. We believe that some clinicians, based on their experience and training, may use an ordinal-based scale estimate of pretest probability, such as the disease is highly likely, somewhat likely, or not very likely given the patient's signs and symptoms. In our view, however, using percentage estimates of pretest probability is not commonly done by most therapists. We suggest that therapists should make percentage estimates of the pretest probability of the disorder of interest. For example, if a clinician used an ordinal scale similar to the one just described, we contend that the clinician should convert it to a percentage estimate of pretest probability in the following way. If the pretest probability of the disorder were judged to be highly likely, this judgment could be converted to a 75% pretest probability, whereas a rating of "somewhat likely" could be converted to a pretest probability of 50%. A rating of "not very likely" might be converted to a pretest probability of 25%. We believe that, as therapists become more comfortable with making percentage estimates of pretest probability, they will become more accurate, although we have no data to support this argument. By using percentage estimates for pretest probability, therapists can take full advantage of positive and negative predictive values (and likelihood ratios, to be discussed elsewhere in this article) reported in the literature. Several examples are discussed elsewhere in this article to illustrate how pretest probability can be estimated and how these estimates can influence the interpretation of the diagnostic test.

Positive predictive value is the proportion of patients with a positive test who have the condition of interest.[1] Negative predictive value is the proportion of patients with a negative test who do not have the condition of interest.[1] The closer the positive predictive value is to 100%, the more likely the disease is present with a positive test finding. The closer the negative predictive value is to 100%, the more likely the disease is absent with a negative test finding.

In our illustration, the combined data from both studies yielded a positive predictive value of 72% when using a cutoff point of 45 on the BBT (Tab. 4). A positive predictive value of 72% indicates that 72% of patients with a positive test (a BBT of <45) were classified as fallers (the gold standard) and 28% of the patients were misclassified as fallers based on the BBT, an error rate that we consider to be fairly high. A negative predictive value of 85% indicates that 85% of patients with a negative test (a BBT of ≥45) were classified as nonfallers (the gold standard). Our misclassification rate for nonfallers is less than for fallers (ie, we can be more confident about identifying nonfallers than fallers based on BBT test results).

Table 4. Validity Estimates for Several Different Cutoff Points of the Berg Balance Test

              Positive      Negative                                  Positive      Negative
Berg          Predictive    Predictive                                Likelihood    Likelihood
Balance       Value         Value         Sensitivity   Specificity   Ratio         Ratio
Test Result   (95% CI(a))   (95% CI)      (95% CI)      (95% CI)      (95% CI)      (95% CI)

35            77%           67%           30%           96%           7.8           0.7
              (54-100)      (58-76)       (14-46)       (92-100)      (2.3-26.4)    (0.6-0.9)

40            83%           67%           45%           96%           11.7          0.6
              (66-100)      (57-77)       (28-62)       (92-100)      (3.6-37.6)    (0.4-0.8)

45            72%           85%           64%           90%           6.1           0.4
              (56-88)       (77-93)       (48-80)       (83-97)       (3.0-12.4)    (0.3-0.6)

50            57%           92%           85%           73%           3.1           0.2
              (43-71)       (85-99)       (73-97)       (63-83)       (2.1-4.6)     (0.1-0.5)

55            36%           95%           97%           26%           1.3           0.1
              (26-46)       (86-100)      (91-100)      (16-36)       (1.1-1.5)     (0.02-0.8)

60            30%           100%          100%          1%            1.01          Undefined
              (21-39)       (5)           (91)          (0-3)         (1-1.04)


(a) CI=confidence interval.

As with sensitivity and specificity, no standard exists for what constitutes an acceptable level of positive or negative predictive value. In addition, interpretations of predictive values, sensitivity, and specificity are not always straightforward. In the next section, we attempt to describe the critical issues that we believe should be considered when interpreting validity indexes.

Issues Related to the Interpretation of Sensitivity, Specificity, and Predictive Values

Some tests have a binary outcome (2 mutually exclusive categories such as "present" or "absent"), but many other test results are reported on an ordinal scale (such as the manual muscle test) or a continuous scale (such as the BBT). When using sensitivity, specificity, and predictive values, the researcher is forced to dichotomize results for ordinal and continuous measures (such as the BBT) and, therefore, may lose information about the usefulness of the test. One example is the use of a single cutoff point of 45 for the BBT. We will show later how some researchers have dealt with the problem of only one cutoff point for continuous measures.

The choice of the cutoff point influences the sensitivity, specificity, and positive and negative predictive values. This concept is illustrated in Table 4. For example, if the cutoff point for the BBT were set at 40, the sensitivity would be 45% and the specificity would be 96%. With a cutoff point of 50, the sensitivity is 85% and the specificity is 73%. Generally, the choice of cutoff point by the researcher will increase one validity index (eg, sensitivity) but will decrease the other validity index (eg, specificity). For example, when sensitivity rises (as seen when going from a cutoff point of 40 to a cutoff point of 50 on the BBT), specificity falls. The same concept holds for positive and negative predictive values. When the positive predictive value rises (as seen when going from a cutoff point of 50 to a cutoff point of 40 on the BBT), the negative predictive value falls (Tab. 4).
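The trade-off between sensitivity and specificity can be reproduced directly from the cell counts in Table 3. The following is a minimal sketch; the names `TABLE_3_COUNTS` and `sensitivity_specificity` are hypothetical and chosen only for this illustration.

```python
# Cell counts (a, b, c, d) from Table 3 for each BBT cutoff point.
# a = fallers with a positive test, b = nonfallers with a positive test,
# c = fallers with a negative test, d = nonfallers with a negative test.
TABLE_3_COUNTS = {
    40: (15, 3, 18, 74),
    45: (21, 8, 12, 69),
    50: (28, 21, 5, 56),
    55: (32, 57, 1, 20),
}


def sensitivity_specificity(a, b, c, d):
    """Return (sensitivity, specificity) as proportions."""
    return a / (a + c), d / (b + d)


results = {
    cutoff: sensitivity_specificity(*cells)
    for cutoff, cells in TABLE_3_COUNTS.items()
}

for cutoff in sorted(results):
    sens, spec = results[cutoff]
    print(f"cutoff {cutoff}: sensitivity {sens:.0%}, specificity {spec:.0%}")
# cutoff 40: sensitivity 45%, specificity 96%
# cutoff 45: sensitivity 64%, specificity 90%
# cutoff 50: sensitivity 85%, specificity 73%
# cutoff 55: sensitivity 97%, specificity 26%
```

As the cutoff point rises from 40 to 55, more fallers test positive (sensitivity rises), but more nonfallers also test positive (specificity falls).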

The principal factor influencing the clinician's choice of a cutoff point is related to the consequence of misclassifying patients. Broadly speaking, there are 3 choices for a cutoff point: (1) maximize both sensitivity and specificity, (2) maximize sensitivity at the cost of minimizing specificity, and (3) maximize specificity at the cost of minimizing sensitivity. Maximizing sensitivity and specificity is appropriate when the consequences of false positives and false negatives are about equal. Maximizing sensitivity at the cost of minimizing specificity is desirable when the consequence of a false negative (eg, falsely identifying a subject as a nonfaller) exceeds the consequence of a false positive (eg, falsely identifying the subject as a faller). Conversely, maximizing specificity at the cost of minimizing sensitivity is desirable when the consequence of a false positive exceeds the consequence of a false negative. In the case of the BBT, it would appear that sensitivity should be optimized to avoid classifying a faller as a nonfaller. Misclassifying fallers would appear to have serious consequences (eg, fractures).

An important advantage associated with the use of sensitivity and specificity is that they are not influenced by prevalence. Prevalence is defined as the proportion of patients with the disorder of interest among all patients tested.[1] A therapist can use sensitivity and specificity estimates from a published report and apply these estimates to a patient as long as the patient is reasonably similar to the subjects in the study.

Predictive values should guide clinical decisions (they estimate validity in the context of the test result), but unlike sensitivity and specificity, predictive values are prevalence dependent.[1] That is, as the proportion of those with the disorder of interest changes, predictive values also change. As the prevalence increases, the positive predictive value increases and the negative predictive value decreases. When the prevalence decreases, the positive predictive value decreases and the negative predictive value increases. Because the chance that an individual patient will have a target disorder varies (ie, the pretest probability changes depending on the patient's signs and symptoms), the prevalence associated with a diagnostic accuracy study may not apply to a given patient. For example, in the study by Shumway-Cook et al,[4] there was a prevalence of fallers of 50%. If, for example, a clinician estimated the pretest probability of falling for a patient to be only 10%, the predictive values from the data of Shumway-Cook et al would not provide accurate estimates of positive or negative predictive values for the patient. The positive predictive value from the data of Shumway-Cook and colleagues would be spuriously high (because of the higher prevalence), and the negative predictive value would be spuriously low for the patient with a pretest probability of 10%.
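The prevalence dependence described above can be sketched numerically. The following is a minimal illustration, not a calculation from either BBT study: the sensitivity and specificity values are hypothetical, and the function name `predictive_values` is our own.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a test applied at a given prevalence."""
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    true_neg = specificity * (1 - prevalence)
    false_neg = (1 - sensitivity) * prevalence
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Hypothetical test characteristics, chosen only for illustration.
SENSITIVITY, SPECIFICITY = 0.80, 0.90

for prevalence in (0.10, 0.30, 0.50):
    ppv, npv = predictive_values(SENSITIVITY, SPECIFICITY, prevalence)
    print(f"prevalence={prevalence:.0%}  PPV={ppv:.1%}  NPV={npv:.1%}")
```

With the test characteristics held fixed, the positive predictive value rises and the negative predictive value falls as prevalence increases, which is why predictive values from a study with 50% prevalence overstate the positive predictive value for a patient whose pretest probability is 10%.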

Unfortunately, predictive values are influenced by prevalence, whereas sensitivity and specificity are not. Sensitivity and specificity, however, are related to positive and negative predictive values in the following way. When specificity is high, the positive predictive value tends to be high, and when sensitivity is high, the negative predictive value tends to be high. That is, when sensitivity is high, a negative test generally indicates the disorder is not present (or, in our illustration, the person is not at risk of falling). When specificity is high, a positive test generally indicates the disorder is present (the person is at risk of falling).[2] Table 4 illustrates this concept. When specificity is high, for example, for a BBT cutoff point of 40 (96%), the positive predictive value will generally be high (83%). A clinician might hypothetically believe, for example, that based on medical history and examination data, a patient had a pretest probability of falling of approximately 40% and the patient might subsequently have a score of 37 on the BBT, a score considered positive using a cutoff point of 40 (Tab. 4). The positive predictive value would be 83%, an increase of 43 percentage points from the pretest probability. We contend that the clinician can be reasonably confident the patient is a faller.

Similarly, when sensitivity is high (97% for a cutoff point of 55), the negative predictive value will also generally be high (95%). For example, a clinician might believe, based on a patient's medical history and examination data, that the patient had a pretest probability of falling of approximately 40% (or a pretest probability of not falling of 60%). The patient might subsequently have a score of 56 on the BBT, a score considered negative using a cutoff point of 55 (Tab. 4). The negative predictive value (posttest probability) in this hypothetical example would be 95%, and we argue that the clinician can be very confident the patient is not a faller. We noted earlier that predictive values are dependent on prevalence, and in our examples, the prevalence (pretest probability) for falls was estimated to be 40%, a reasonable approximation of the prevalence reported in our illustration using the BBT data. Had the pretest probabilities for the patient examples been appreciably lower or higher, the predictive values reported in the 2 examples above would not have been accurate estimates of posttest probability.
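The percentages quoted in the 2 examples above can be reproduced from the interval counts shown in Table 5, on the assumption that Table 4 was built by cumulating those same counts at each cutoff point. The sketch below is ours (the names `TABLE_5` and `validity_indexes` are hypothetical), and it treats a score below the cutoff as a positive test.

```python
# (lowest score in interval, fallers, nonfallers), copied from Table 5.
TABLE_5 = [(0, 15, 3), (40, 6, 5), (45, 7, 13), (50, 4, 36), (55, 1, 20)]


def validity_indexes(cutoff):
    """Dichotomize at `cutoff` (score < cutoff is positive) and return the indexes."""
    tp = sum(f for low, f, _ in TABLE_5 if low < cutoff)   # fallers, positive test
    fp = sum(n for low, _, n in TABLE_5 if low < cutoff)   # nonfallers, positive test
    fn = sum(f for low, f, _ in TABLE_5 if low >= cutoff)  # fallers, negative test
    tn = sum(n for low, _, n in TABLE_5 if low >= cutoff)  # nonfallers, negative test
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


for cutoff in (40, 55):
    indexes = validity_indexes(cutoff)
    summary = "  ".join(f"{name}={value:.0%}" for name, value in indexes.items())
    print(f"cutoff {cutoff}: {summary}")
```

The predictive values computed this way reflect the proportion of fallers in the combined sample (33 of 110), so they carry the same prevalence caveat discussed in the text.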

In summary, sensitivity and specificity are not dependent on prevalence and are therefore seen as useful for clinical practice.[1] As a general guide, we believe clinicians should conclude the condition is likely to be present when a test is positive and the specificity for the test is high. Conversely, clinicians should conclude the condition is likely to be absent when a test is negative and the sensitivity for the test is high.[1,2] Positive and negative predictive values are, in part, prevalence dependent. As a result, we argue that predictive values are meaningful only when the prevalence reported in a study approximates the pretest probability of the disorder the clinician has estimated for the patient. To be most accurate, pretest probability estimates should be based on sound scientific data.

Confidence Intervals for Validity Indexes

Sensitivity, specificity, positive and negative predictive values, and likelihood ratios represent point estimates of population values.[15] Point estimates are estimations of the true value for the index of interest. To determine the accuracy of a point estimate, confidence intervals (CIs) are calculated.[15] Confidence intervals indicate how closely a study's point estimate of these values approximates the population values.[15] Confidence intervals essentially describe for clinicians how confident they can be about a point estimate. For example, if sensitivity was 80%, with a 95% CI of 70% to 90%, the true value for sensitivity in the population (with 95% certainty) lies between 70% and 90%. The width of a CI becomes narrower as the sample size increases, and it becomes wider as the sample size decreases.[15] In addition, the width is dependent on the variability of the measure within the population.[15] The degree of confidence we place on these validity estimates can be calculated.[1,18] In our view, studies that examine the validity of diagnostic tests should provide CI estimates.

For example, the 95% CI for specificity reported by Bogle Thorbahn and Newton[3] ranged from 67% (not very specific) to 100% (perfect specificity). The 95% CI for specificity for the combined data from the studies of Bogle Thorbahn and Newton[3] and Shumway-Cook et al[4] ranged from 83% to 97% (both values, in our opinion, represent reasonably high specificity).
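The effect of sample size on CI width can be sketched with a simple normal-approximation (Wald) interval for a proportion. This is an assumption made for illustration: the method used to obtain the intervals reported in the studies is not stated here, and the counts below are hypothetical.

```python
import math


def wald_ci(successes, n, z=1.96):
    """Approximate 95% CI for a proportion (normal approximation, clipped to 0-1)."""
    p = successes / n
    half_width = z * math.sqrt(p * (1 - p) / n)
    return max(0.0, p - half_width), min(1.0, p + half_width)


# Hypothetical counts: the same 80% sensitivity observed in 2 sample sizes.
for detected, fallers in ((20, 25), (80, 100)):
    lower, upper = wald_ci(detected, fallers)
    print(f"{detected}/{fallers}: 95% CI {lower:.1%} to {upper:.1%}")
```

The point estimate is identical in both cases, but the interval from the larger sample is about half as wide. For small samples or proportions near 0% or 100%, other interval methods generally behave better than this approximation.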

Likelihood Ratios

Positive and negative likelihood ratios(*) are 2 additional validity indexes for diagnostic tests. Likelihood ratios have been proposed to be more efficient and more powerful than sensitivity, specificity, and predictive values.[15,19] Likelihood ratios essentially combine the benefits of both sensitivity and specificity into one index.[1] Likelihood ratios indicate by how much a given diagnostic test result will raise or lower the pretest probability of the target disorder.[20] Likelihood ratios are reported in a decimal number format rather than as percentages. A likelihood ratio of 1 means the posttest probability (probability of the condition after the test results are obtained) for the target disorder is the same as the pretest probability (probability of the condition before the test was done). Likelihood ratios greater than 1 increase the chance the target disorder is present, whereas likelihood ratios less than 1 decrease the chance the target disorder is present.[20]

Jaeschke and colleagues[20] proposed the following guide to interpreting likelihood ratios. Likelihood ratios greater than 10 or less than 0.1 generate large and often conclusive changes from pretest to posttest probability. Likelihood ratios between 5 and 10 or between 0.2 and 0.1 generate moderate changes from pretest to posttest probability. Likelihood ratios from 2 to 5 and from 0.5 to 0.2 result in small (but sometimes important) shifts in probability, and likelihood ratios from 0.5 to 2 result in small and rarely important changes in probability.
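The guide above can be written as a small lookup. This is a sketch only: the function name `describe_shift` is ours, and because the guide does not say which category a value exactly on a boundary (eg, 5 or 0.2) belongs to, the boundary handling below is an assumption.

```python
def describe_shift(likelihood_ratio):
    """Map a likelihood ratio to the size of the pretest-to-posttest shift."""
    if likelihood_ratio <= 0:
        raise ValueError("likelihood ratio must be positive")
    # The bands are reciprocal (10 and 0.1, 5 and 0.2, 2 and 0.5), so ratios
    # below 1 are folded onto the same scale as ratios above 1.
    if likelihood_ratio >= 1:
        magnitude = likelihood_ratio
    else:
        magnitude = 1 / likelihood_ratio
    if magnitude > 10:
        return "large and often conclusive"
    if magnitude >= 5:  # assumption: boundary values fall in the stronger band
        return "moderate"
    if magnitude >= 2:
        return "small (but sometimes important)"
    return "small and rarely important"


for lr in (11.7, 2.8, 1.3, 0.6, 0.05):
    print(lr, "->", describe_shift(lr))
```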

Because likelihood ratios can be applied to score intervals for tests with continuous measures, we believe they are more useful than sensitivity, specificity, and predictive values, which are limited to data presented in a dichotomous format. For example, the positive likelihood ratio for the score interval of 40 to 44 (a test score considered positive based on recommendations of Berg and colleagues[8]) is 2.8 (Tab. 5). This likelihood ratio indicates that a patient with a BBT score between 40 and 44 is 2.8 times more likely to be a faller than a nonfaller. The 95% CI ranges from 0.9 to 8.5. That is, the 95% CI overlaps 1 (no change in the probability of the disorder); therefore, a clinician cannot be very confident that a score between 40 and 44 increases the probability of identifying a patient at risk for falls. If a patient scores below 40 on the BBT, however, the likelihood ratio increases to 11.7 (95% CI=3.6-37.6). A patient with a BBT score below 40 is at greater risk for falls as compared with patients with scores between 40 and 44. On average, patients with BBT scores less than 40 are almost 12 times more likely to be a faller than a nonfaller.

Table 5. Positive Likelihood Ratios for Several Different Intervals of Berg Balance Test Scores

                        Gold Standard Test Result

                    Positive               Negative
Berg Balance                                                  Positive Likelihood
Test Result    Number   Proportion    Number   Proportion     Ratio (95% CI(a))

<40            15       15/33=0.455    3        3/77=0.039    11.7 (3.6-37.6)
40-44           6        6/33=0.182    5        5/77=0.065     2.8 (0.9-8.5)
45-49           7        7/33=0.212   13       13/77=0.169     1.3 (0.5-2.9)
50-54           4        4/33=0.121   36       36/77=0.467     0.3 (0.1-0.7)
>54             1        1/33=0.030   20       20/77=0.260     0.1 (0.02-0.8)

Total          33                     77

(a) CI=confidence interval.
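The likelihood ratios in Table 5 are the ratio of the 2 proportions in each row, which the following sketch recomputes from the counts. The CI calculation uses a common log-based approximation for a ratio of proportions; that choice is our assumption, because the method used for the table is not described here, although it gives limits close to those shown.

```python
import math

FALLERS_TOTAL, NONFALLERS_TOTAL = 33, 77

# Score interval -> (fallers, nonfallers), copied from Table 5.
COUNTS = {
    "<40": (15, 3),
    "40-44": (6, 5),
    "45-49": (7, 13),
    "50-54": (4, 36),
    ">54": (1, 20),
}


def interval_lr(fallers, nonfallers, z=1.96):
    """Return (LR, lower, upper) for one score interval."""
    lr = (fallers / FALLERS_TOTAL) / (nonfallers / NONFALLERS_TOTAL)
    # Standard error of log(LR) for a ratio of 2 independent proportions.
    se_log = math.sqrt(
        1 / fallers - 1 / FALLERS_TOTAL + 1 / nonfallers - 1 / NONFALLERS_TOTAL
    )
    lower = math.exp(math.log(lr) - z * se_log)
    upper = math.exp(math.log(lr) + z * se_log)
    return lr, lower, upper


for interval, (fallers, nonfallers) in COUNTS.items():
    lr, lower, upper = interval_lr(fallers, nonfallers)
    print(f"{interval:>5}: LR={lr:.1f} (95% CI {lower:.2f}-{upper:.1f})")
```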

Applications of Likelihood Ratios to Clinical Practice

Likelihood ratios can also be calculated for several different cutoff points of the BBT (Tab. 4). Scores below the cutoff are considered positive tests, and scores at or above the cutoff are considered to be negative tests. Because the scale is dichotomized when using cutoffs, both positive and negative likelihood ratios can be calculated. For example, given a BBT cutoff point of 40, the positive likelihood ratio is 11.7 (95% CI=3.6-37.6). That is, a patient with a score of less than 40 is approximately 12 times more likely to be a faller than a nonfaller. The negative likelihood ratio is 0.6 (95% CI=0.4-0.8). That is, a patient with a negative BBT score (score of ≥40) is 0.6 times as likely to be a faller as a nonfaller. When using a cutoff point of 40, for a negative score (score of ≥40), a patient is more likely to be a nonfaller than a faller. Based on the data summarized in Table 4, lower cutoffs will usually increase the magnitude of the positive likelihood ratio (a desirable trait), but they will also increase the magnitude of the negative likelihood ratio (an undesirable trait).
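The trade-off between the 2 likelihood ratios as the cutoff moves can be sketched from the Table 5 counts, assuming the cutoff-based values in Table 4 come from cumulating those counts. The names `TABLE_5` and `likelihood_ratios` are ours.

```python
# (lowest score in interval, fallers, nonfallers), copied from Table 5.
TABLE_5 = [(0, 15, 3), (40, 6, 5), (45, 7, 13), (50, 4, 36), (55, 1, 20)]


def likelihood_ratios(cutoff):
    """Return (LR+, LR-) when a score below `cutoff` counts as a positive test."""
    fallers_total = sum(f for _, f, _ in TABLE_5)
    nonfallers_total = sum(n for _, _, n in TABLE_5)
    sensitivity = sum(f for low, f, _ in TABLE_5 if low < cutoff) / fallers_total
    false_pos_rate = sum(n for low, _, n in TABLE_5 if low < cutoff) / nonfallers_total
    lr_pos = sensitivity / false_pos_rate              # sens / (1 - spec)
    lr_neg = (1 - sensitivity) / (1 - false_pos_rate)  # (1 - sens) / spec
    return lr_pos, lr_neg


for cutoff in (40, 45, 50, 55):
    lr_pos, lr_neg = likelihood_ratios(cutoff)
    print(f"cutoff {cutoff}: LR+={lr_pos:.1f}  LR-={lr_neg:.1f}")
```

Moving the cutoff down from 55 to 40 raises both ratios: a positive result becomes more persuasive, while a negative result becomes less reassuring.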

Another advantage of the use of likelihood ratios is that, along with the use of a nomogram (Figure), a clinician can determine the probability of a disorder, given the result of the test (also called "posttest probability").[21] Because likelihood ratios do not vary when disorder prevalence varies, likelihood ratios can be generalized to other patients. To use the nomogram, the clinician must first estimate the pretest probability of the disorder. The pretest probability of the disorder (likelihood of the presence of the disorder prior to doing the test) is estimated, as mentioned earlier, by the clinician's own clinical training and experience with similar types of patients in the specific setting in which the patients are seen.[2] The constellation of signs and symptoms also influences the clinician's judgment of the pretest probability of the disorder. If we knew the likelihood ratios for each of the medical history items and signs and symptoms of patients, we could repeatedly recalculate the pretest and posttest probability of the disorder of interest and come up with a very accurate estimate of the final posttest probability.[20] Most of these data, unfortunately, are unavailable, so clinicians typically must rely on training, experience, and knowledge of the literature to estimate the pretest probability of the disorder. To use the nomogram, the clinician simply estimates the pretest probability of the disorder and identifies this value in the left-hand column of the nomogram (Figure). A straightedge is then anchored on the left column of the Figure at the pretest probability estimate and aligned on the middle column at the likelihood ratio. The right column indicates the posttest probability.
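The nomogram is a graphical form of Bayes theorem expressed in odds, so the same reading can be obtained arithmetically. The sketch below shows the conversion; the function name `posttest_probability` is ours, and probabilities are entered as fractions rather than percentages.

```python
def posttest_probability(pretest_probability, likelihood_ratio):
    """Convert a pretest probability to a posttest probability via odds."""
    if not 0 < pretest_probability < 1:
        raise ValueError("pretest probability must be between 0 and 1")
    pretest_odds = pretest_probability / (1 - pretest_probability)
    posttest_odds = pretest_odds * likelihood_ratio
    return posttest_odds / (1 + posttest_odds)


# A likelihood ratio of 1 leaves the probability unchanged.
print(f"{posttest_probability(0.40, 1.0):.0%}")
```

Because the calculation is done on the odds scale, the same likelihood ratio produces different changes in probability at different pretest probabilities, which is why the pretest estimate must be made first.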

[Figure ILLUSTRATION OMITTED]

To demonstrate how likelihood ratios and the nomogram can be used to guide clinical decision making, we will apply our concept and argument to 2 hypothetical situations. For the first example, assume your 67-year-old patient lived alone in her home and was independent and relatively active. Her only comorbidity was that she had a hip joint replacement 1 year prior to testing. The therapist suspected the pretest probability of the disorder (falls, in this case) would be relatively low, perhaps on the order of 20%. The patient then had a BBT done and a score of 50 (a negative test, using a cutoff point of 50) was obtained. The negative likelihood ratio for a cutoff point of 50 is 0.2 (Tab. 4). We align a ruler with the left column of the nomogram (Figure) at 20 (20% pretest probability) and with the middle column at a likelihood ratio of approximately 0.2. We find that the posttest probability of current fall risk for this patient is approximately 5%, an improvement of 15 percentage points from the pretest probability (the chance of the patient being a faller has gone from 20% down to 5%). Hypothetically, we substantially increased our level of certainty about the patient's current risk of falling based on the BBT score.

Our second hypothetical example is about a 75-year-old man who was diagnosed with congestive heart failure approximately 5 years previously and requires assistance with some activities of daily living. He reports losing his balance occasionally and remembers falling once in the past few years. Based on the patient's medical history and functional status, the pretest probability for falls would be fairly high (ie, on the order of 50%). A BBT was done, and a score of 38 (a positive test, using a cutoff point of 40) was obtained. Using the data in Table 4, the positive likelihood ratio for a score of less than 40 is 11.7. That is, this patient is 11.7 times more likely to be a faller than a nonfaller. Using the nomogram shown in the Figure, the posttest probability for current fall risk is approximately 92%, an increase of 42 percentage points above the pretest probability. If we believe our data are correct and our estimates are appropriate, we can theoretically be confident that we have identified a patient who has a very high probability of falling. We again appear to have substantially increased our level of certainty about the patient's risk of falling.
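Both nomogram readings in the 2 hypothetical examples can be checked with the odds form of Bayes theorem. The sketch below uses only the pretest probabilities and likelihood ratios given in the text; the helper name `posttest_probability` and the case labels are ours.

```python
def posttest_probability(pretest_probability, likelihood_ratio):
    """Pretest probability -> pretest odds -> posttest odds -> posttest probability."""
    pretest_odds = pretest_probability / (1 - pretest_probability)
    posttest_odds = pretest_odds * likelihood_ratio
    return posttest_odds / (1 + posttest_odds)


# (description, pretest probability, likelihood ratio) from the 2 examples.
CASES = [
    ("67-year-old woman, BBT 50, negative LR at cutoff 50", 0.20, 0.2),
    ("75-year-old man, BBT 38, positive LR at cutoff 40", 0.50, 11.7),
]

for description, pretest, lr in CASES:
    posttest = posttest_probability(pretest, lr)
    change = (posttest - pretest) * 100
    print(f"{description}: {pretest:.0%} -> {posttest:.0%} ({change:+.0f} points)")
```

The computed values, roughly 5% and 92%, agree with the approximate readings taken from the nomogram.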

Summary

Validity indexes for diagnostic tests were reviewed, and terms used in studies designed to describe the validity of diagnostic tests were defined. Data from 2 studies examining the validity of measurements obtained with the BBT for inferring current fall risk were used as an illustration to demonstrate how clinicians could use diagnostic test studies to guide clinical decisions for individual patients. Unfortunately, there are only a small number of diagnostic test studies describing the validity of examination procedures commonly used by physical therapists. There is an urgent need to conduct more studies of the usefulness of diagnostic and prognostic tests in physical therapy.

Acknowledgments

We thank Dr Anne Shumway-Cook, Linda Thorbahn, and Dr Roberta Newton for their insights and for allowing us to use their data in this article. We also thank Cheryl Ford-Smith and Sue Cromwell for reviewing an earlier version of the manuscript.

(*) Likelihood ratios should not be confused with odds ratios. Odds ratios are an estimate of risk often expressed in case-control studies designed to investigate causation of a disease.

References

[1] Sackett DL, Haynes RB, Guyatt GH, Tugwell P. Clinical Epidemiology: A Basic Science for Clinical Medicine. 2nd ed. Boston, Mass: Little, Brown and Co Inc; 1991:85-86.

[2] Sackett DL, Richardson WS, Rosenberg W, Haynes RB. Evidence-based Medicine: How to Practice and Teach EBM. New York, NY: Churchill Livingstone Inc; 1997.

[3] Bogle Thorbahn LD, Newton RA. Use of the Berg Balance Test to predict falls in elderly persons. Phys Ther. 1996;76:576-583.

[4] Shumway-Cook A, Baldwin M, Polissar NL, Gruber W. Predicting the probability for falls in community-dwelling older adults. Phys Ther. 1997;77:812-819.

[5] Task Force on Standards for Measurement in Physical Therapy. Standards for tests and measurements in physical therapy practice. Phys Ther. 1991;71:589-622.

[6] Berg KO, Wood-Dauphinee SL, Williams JI, Gayton D. Measuring balance in the elderly: preliminary development of an instrument. Physiotherapy Canada. 1989;41:304-311.

[7] Berg KO, Maki BE, Williams JI, et al. Clinical and laboratory measures of postural balance in an elderly population. Arch Phys Med Rehabil. 1992;73:1073-1080.

[8] Berg KO, Wood-Dauphinee SL, Williams JI, Maki B. Measuring balance in the elderly: validation of an instrument. Can J Public Health. 1992;83(suppl 2):S7-S11.

[9] Tinetti ME. Performance-oriented assessment of mobility problems in elderly patients. J Am Geriatr Soc. 1986;34:119-126.

[10] Mahoney FL, Barthel DW. Functional evaluation: the Barthel index. Md State Med J. 1965;14:61-65.

[11] Department of Clinical Epidemiology and Biostatistics, McMaster University. How to read clinical journals, II: to learn about a diagnostic test. Can Med Assoc J. 1981;124:703-710.

[12] Department of Clinical Epidemiology and Biostatistics, McMaster University. Interpretation of diagnostic data, 2: how to do it with a simple table (part A). Can Med Assoc J. 1983;129:5-11.

[13] Department of Clinical Epidemiology and Biostatistics, McMaster University. Interpretation of diagnostic data, 2: how to do it with a simple table (part B). Can Med Assoc J. 1983;129:12-17.

[14] Department of Clinical Epidemiology and Biostatistics, McMaster University. Interpretation of diagnostic data, 2: how to do it with simple math. Can Med Assoc J. 1983;129:22-29.

[15] Sackett DL. A primer on the precision and accuracy of the clinical examination. JAMA. 1992;267:2638-2644.

[16] Luukinen H, Koski K, Kivela SL, Laippala P. Social status, life changes, housing conditions, health, functional abilities, and lifestyle as risk factors for recurrent falls among the home-dwelling elderly. Public Health. 1996;110:115-118.

[17] Tinetti ME, Speechley M, Ginter SF. Risk factors for falls among elderly persons living in the community. N Engl J Med. 1988;319:1701-1707.

[18] Colton T. Statistics in Medicine. Boston, Mass: Little, Brown and Co Inc; 1974:160.

[19] Crombie DL. Diagnostic process. J Coll Gen Prac. 1963;6:579-589.

[20] Jaeschke R, Guyatt GH, Sackett DL. Users' guides to the medical literature, III: how to use an article about a diagnostic test, B: What are the results and will they help me in caring for my patients? JAMA. 1994;271:703-707.

[21] Fagan TJ. Nomogram for Bayes theorem [letter]. N Engl J Med. 1975;293:257.

DL Riddle, PhD, PT, is Associate Professor, Department of Physical Therapy, Medical College of Virginia Campus, Virginia Commonwealth University, 1200 E Broad, Richmond, VA 23298-0224 (USA) (driddle@hsc.vcu.edu). Address all correspondence to Dr Riddle.

PW Stratford, PT, is Associate Professor, School of Rehabilitation Science, and Associate Member, Department of Clinical Epidemiology and Biostatistics, McMaster University, Hamilton, Ontario, Canada.

Concept, writing, and data analysis were provided by Riddle and Stratford. Consultation (including review of manuscript before submitting) was provided by Cheryl Ford-Smith, Susan Cromwell, Dr Roberta Newton, and Dr Anne Shumway-Cook.

This article was submitted December 7, 1998, and was accepted July 7, 1999.
COPYRIGHT 1999 American Physical Therapy Association, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 1999, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

Article Details
Author:Stratford, Paul W
Publication:Physical Therapy
Date:Oct 1, 1999
Words:7350