Printer Friendly
The Free Library
14,735,875 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Roland-Morris scale reliability. (Letters to the Editor).


Letter to the Editor:

There is much more research that describes the measurement properties of evaluative measures such as the Roland-Morris (RM) scale (1) today than there was a decade ago. The greater volume of studies provides more data that can be used to shape clinical decisions. This increased amount of research also increases the chance that the results of some studies, at times, may conflict with results of other studies. As the number of studies on a particular issue grows, the potential for conflicting results increases. The study of Davidson Da·vid·son   , Jo(seph) 1883-1952.

American sculptor best remembered for his vigorous portrait busts of Woodrow Wilson, Franklin D. Roosevelt, and Albert Einstein, among others.
 and Keating Keating may refer refer to the following: People
For people with the surname Keating, see Keating (surname) Places
Several places in the US:
  • Keating Township, Potter County, Pennsylvania
  • Keating Township, McKean County, Pennsylvania
 (2) seems to be an illustration of this phenomenon.

Davidson and Keating2 examined the reliability and responsiveness of 5 functional status questionnaires designed for patients with low back pain (LBP LBP

In currencies, this is the abbreviation for the Lebanese Pound.

Notes:
The currency market, also known as the Foreign Exchange market, is the largest financial market in the world, with a daily average volume of over US $1 trillion.
). One of the scales examined was the RM scale, a questionnaire that has been studied extensively by our group and many others. Davidson and Keating found that the reliability of RM scale measurements was low, with an intraclass correlation In statistics, the intraclass correlation (or the intraclass correlation coefficient[1]) is a measure of correlation, consistency or conformity for a data set when it has multiple groups.  coefficient coefficient /co·ef·fi·cient/ (ko?ah-fish´int)
1. an expression of the change or effect produced by variation in certain factors, or of the ratio between two different quantities.

2.
 (ICC ICC

See: International Chamber of Commerce
 [2,1]) of .53 (95% confidence interval confidence interval,
n a statistical device used to determine the range within which an acceptable datum would fall. Confidence intervals are usually expressed in percentages, typically 95% or 99%.
 [CI]=.29,.71) for a sample of 47 patients with LBP who reported that their LBP was "about the same," "a little better," or "a little worse." For a smaller subgroup sub·group  
n.
1. A distinct group within a group; a subdivision of a group.

2. A subordinate group.

3. Mathematics A group that is a subset of a group.

tr.v.
 that reported their LBP was "about the same," the ICC (2,1) was lower at .42 (95% CI=-.07, .75). Based in part on these findings, the authors concluded that the RM scale "appeared to lack sufficient reliability and scale width for clinical application." (2(p8))

In our opinion, these results are dramatically different from the large volume of evidence reported in the literature on the reliability of RM scale scores (Table). (1,2,6-17) The evidence summarized in the Table was collected on diverse samples of patients from different countries with many different LBP diagnoses. Davidson and Keating (2) attributed their findings, as compared with previous research, to a variety of "sample differences." For example, they suggested that, because their sample consisted of some patients who were self-referred for physical therapy, these patients added additional variability, leading to the low reliability. Several other researchers (10,12,17) investigated the reliability of RM scale scores on samples that included self-referred and physician-referred patients. The ICCs reported in these studies varied from .79 to .88. Another potential explanation for the lower reliability could be related to the interval between assessments--6 weeks in the study of Davidson and Keating. Yet, other studies (8,11) with reassessment Reassessment

The process of re-determining the value of property or land for tax purposes.

Notes:
Property is usually reassessed on an annual basis. You may request a "reassessment" if you disagree with your assessment.
 intervals of equal or longer duration reported ICCs on the order of .66 to .86.

A commonly accepted statistical concept is that the greater the number of studies (or statistical tests) conducted on a specific issue, the more likely an aberrant aberrant /ab·er·rant/ (ah-ber´ant) (ab´ur-ant) wandering or deviating from the usual or normal course.

ab·er·rant
adj.
1.
 finding will occur simply by chance alone. (18) We believe, given the large number of studies summarized in the Table, that the low reliability reported by Davidson and Keating (2) for the RM scale was likely to be due to random variation associated with making a point estimate, such as an ICC. A point estimate is a single value, whereas a confidence interval represents a range of likely values. The upper bounds of the 95% CIs reported by Davidson and Keating were .71 or higher for the RM scale. These upper-bound estimates are more in line with point estimates from the literature (Table).

Why were the point estimates reported by Davidson and Keating (2) for the RM scale so low relative to past research? Considering the relatively small sample size, especially for the subgroup self-classified as "about the same," we suspect that there were a few patients who had an unusual amount of variability in their scores. Large variability in a few subjects could lead to low reliability when sample sizes are small. The authors refer to a small group of subjects who had pain for greater than 6 months and who demonstrated "considerable variability" in RM scores despite reporting no change in their condition. Given the relatively small sample sizes in the 2 reliability analyses (n = 16 and n = 47), it would likely take only a few patients with unusual variability in their scores to skew (1) The misalignment of a document or punch card in the feed tray or hopper that prohibits it from being scanned or read properly.

(2) In facsimile, the difference in rectangularity between the received and transmitted page.
 the reliability data and produce point estimates that are atypical atypical /atyp·i·cal/ (-i-k'l) irregular; not conformable to the type; in microbiology, applied specifically to strains of unusual type.

a·typ·i·cal
adj.
 compared with the large amount of evidence that has already been published.

Given the extensive evidence that supports the reliability for RM scale scores, (1,6-17) we disagree with Verb 1. disagree with - not be very easily digestible; "Spicy food disagrees with some people"
hurt - give trouble or pain to; "This exercise will hurt your back"
 the authors' recommendations that the RM scale should not be used as a measure of functional outcome in a general clinical population. Some clinicians may be tempted, based on the results reported by Davidson and Keating, (2) to discontinue dis·con·tin·ue  
v. dis·con·tin·ued, dis·con·tin·u·ing, dis·con·tin·ues

v.tr.
1. To stop doing or providing (something); end or abandon:
 use of the RM scale or to consider other measures in lieu of Instead of; in place of; in substitution of. It does not mean in addition to.  the RM scale. We think this would be misguided mis·guid·ed  
adj.
Based or acting on error; misled: well-intentioned but misguided efforts; misguided do-gooders.



mis·guid
 when considering the evidence. We contend that the overwhelming majority of evidence supports use of the RM scale for routine clinical use or for research, and many experts agree with this view. (3-5)
Table.

Evidence Reported in the Literature on Reliability of Roland-Morris
Scale Scores (a)

                         Sample   Reliability Sample
Study                    Size     Type

Roland and Morris (1)    N=20     Unspecified sample of
                                    patients with LBP

Deyo (6)                 N=10     Outpatients with acute or
                                    chronic LBP without
                                    neurologic deficit

                         N=47

Kopec et al (7)          N=98     Patients seen by physician
                                    or physical therapist for
                                    acute or chronic LBP

Stratford et al (8)      N=36     Patients seen by physician
                                    or physical therapist for
                                    acute or chronic LBP

Nusbaum et al (9)        N=30     Patient with LBP >3 mo
                                    seen in rheumatology
                                    clinic

Stratford and            N=136    Patients with acute or
  Binkley (10)                      chronic LBP self-referred
                                    or physician-referred for
                                    physical therapy

Patrick et al (11,b)     N=52     Patients with acute or
                                    chronic sciatica

                         N=50

Johansson and            N=54     Patients with acute or
  Lindberg (12)                     chronic LBP self-referred
                                    or physician-referred for
                                    physical therapy

Wiesinger et al (13)     N=20     Patients with acute or
                                    chronic LBP

Underwood et al (14,c)   N=34     Patients with chronic LBP

                         N=14     Patients with acute or
                                    chronic LBP

Jacob et al (15,d)       N=96     Patients with acute or
                                    chronic LBP referred for
                                    physical therapy

Jensen et al (16)        N=50     Patients with chronic LBP
                                    seen in a pain clinic

Stratford et al (17)     N=28     Patients with LBP of at least
                                    2-weeks duration
                                  LBP self-referred or
                                    physician-referred for
                                    physical therapy

Davidson and             N=47     Patients with acute or
  Keating (2)                       chronic LBP self-referred
                                    or physician-referred for
                                    physical therapy

                         N=16

                              Time
                              Between
                      Sample  Repeated     Anchor for Judging
Study                 Size    Measures     Stability

Roland and            N=20    Same day     None (assumption of no
  Morris (1)                                 change)

Deyo (6)              N=10    3 wk         Pain rated as "unchanged" by
                                             physician and patient

                      N=47                 Patient report of not
                                             resuming full activity

Kopec et al (7)       N=98    1-14 d       Self report of "no change"

Stratford et al (8)   N=36    3-6 wk       -1 to +3 on -7 to +7 rating
                                             of change scale

Nusbaum et al (9)     N=30    7 d          Assumption of no change with
                                             no intervention

Stratford and         N=136   Within 48 h  Assumption of no change
  Binkley (10)

Patrick et al (11,b)  N=52    3 mo         No patient-reported
                                             improvement in leg pain at
                                             follow-up

                      N=50                 No patient-reported
                                             improvement in quality of
                                             life

Johansson and         N=54    1 wk         No intervention during 1-wk
  Lindberg (12)                              interval

Wiesinger et al (13)  N=20    1 d          No intervention during 1-d
                                             interval

Underwood et          N=34    1-2 wk       No intervention during 2-wk
  al (14,c)                                  interval

                      N=14    1-2 wk       Short time frame between
                                             measures

Jacob et al (15,d)    N=96    2-14 d       No intervention given during
                                             2- to 14-day interval

Jensen et al (16)     N=50    2 d to 7 mo  No intervention at the pain
                                (median=     clinic
                                39 d)

Stratford et al (17)  N=28    Within 48 h  Assumption of no change

Davidson and          N=47    6 wk         Patients self-rated as "a
  Keating (2)                                little better," "a little
                                             worse," or "about the
                                             same"

                      N=16    6 wk         Patients self-rated as
                                             "about the same"

                         Sample   Reliability Coefficient
Study                    Size     (95% CI if Reported)

Roland and Morris (1)    N=20     r=.91

Deyo (6)                 N=10     r=.83

                         N=47     r=.76

Kopec et al (7)          N=98     ICC (2,1)=.91

Stratford et al (8)      N=36     ICC (1,1)=.86 (.72, .94)

Nusbaum et al (9)        N=30     ICC=.94

Stratford and            N=136    ICC (2,1)=.79 (lower 1-sided
  Binkley (10)                      95% CI=.73)

Patrick et al (11,b)     N=52     ICC=.76

                         N=50     ICC=.66

Johansson and            N=54     ICC=.88
  Lindberg (12)

Wiesinger et al (13)     N=20     r=.82

Underwood et al (14,c)   N=34     ICC=.89
                         N=14     ICC=.96

Jacob et al (15,d)       N=96     ICC=.93

Jensen et al (16)        N=50     r=.72

Stratford et al (17)     N=28     ICC (2,1)=.81 (.62, .91)

Davidson and             N=47     ICC (2,1)=.53 (.29, .71)
  Keating (2)

                         N=16     ICC (2,1)=.42 (-.07, .75)

(a) CI=confidence interval, LBP=low back pain, ICC=intraclass
correlation coefficient.

(b) The RM scale was modified to contain 23 items, and the term
"because of my back problem or leg pain" was added.

(c) Modified to ask about symptoms over previous 4 weeks.

(d) Modification described by Patrick et al. (11)
Daniel L Riddle, PT, PhD
Associate Professor
Department of Physical Therapy
School of Allied Health Professions
Medical College of Virginia Campus
Virginia Commonwealth University
Box 980224
Richmond, VA 23298

Paul W Stratford, PT, MSc
Associate Professor
School of Rehabilitation Science
Associate Member
Department of Clinical Epidemiology and
  Biostatistics
McMaster University
Hamilton, Ontario, Canada


References

(1) Roland Roland (rō`lənd), the great French hero of the medieval Charlemagne cycle of chansons de geste, immortalized in the Chanson de Roland (11th or 12th cent.).  M, Morris R. A study of the natural history of back pain, part I: development of a reliable and sensitive measure of disability in low back pain. Spine. 1983;8:141-144.

(2) Davidson M, Keating JL. A comparison of five low back disability questionnaires: reliability and responsiveness. Phys Ther. 2002; 82:8-24.

(3 Bombardier C. Outcome assessments in the evaluation and treatment of spinal spinal /spi·nal/ (spi´n'l)
1. pertaining to a spine or to the vertebral column.

2. pertaining to the spinal cord's functioning independently from the brain.


spi·nal
adj.
 disorders. Spine. 2000;25:3100-3103.

(4) Kopec JA. Measuring functional outcomes in persons with back pain. Spine. 2000;25: 3110-3114.

(5) Deyo RA, Battie M, Beurskens AJH AJH American Journal of Hypertension
AJH Association des Journalistes Haïtiens (Haitian Journalists' Association)
AJH Anti-Jam Hopper
AJH American Journal of Hygiene
. Outcome measures for low back pain research: a proposal for standardized standardized

pertaining to data that have been submitted to standardization procedures.


standardized morbidity rate
see morbidity rate.

standardized mortality rate
see mortality rate.
 use. Spine. 1998;23:2003-2013.

(6) Deyo RA. Comparative validity of the Sickness Impact Profile Sickness Impact Profile Medtalk An instrument used to evaluate perceived health status–quality of life and changes in functional status in Pts being treated for a potentially fatal condition.  and shorter scales for functional assessment in low-back pain. Spine. 1986;9:951-954.

(7) Kopec JA, Esdaile JM, Abrahamowicz M, et al. The Quebec Quebec, city, Canada
Quebec, Fr. Québec, city (1991 pop. 167,517), provincial capital, S Que., Canada, at the confluence of the St. Lawrence and St. Charles rivers.
 Back Pain Disability Scale: measurement properties. Spine. 1995; 20: 341-352.

(8) Stratford Stratford, estate, United States
Stratford, home of the Lee family, overlooking the Potomac River, E Va., SE of Fredericksburg. A national shrine dedicated in 1935, the site was purchased in 1716 by Thomas Lee, who built the mansion Stratford Hall in
 PW, Finch finch, common name for members of the Fringillidae, the largest family of birds (including over half the known species), found in most parts of the world except Australia.  E. Solomon Solomon, d. c.930 B.C., king of the ancient Hebrews (c.970–c.930 B.C.), son and successor of David. His mother was Bath-sheba. His accession has been dated to c.970 B.C. According to the Bible.  P, et al. Using the Roland-Morris scale to make decisions about individual patients. Physiotherapy physiotherapy: see physical therapy.  Canada Canada (kăn`ədə), independent nation (2001 pop. 30,007,094), 3,851,787 sq mi (9,976,128 sq km), N North America. Canada occupies all of North America N of the United States (and E of Alaska) except for Greenland and the French islands of . 1996;48:107-110.

(9) Nusbaum Nusbaum is a village in the district Bitburg-Prüm in Rhineland-Palatinate, Germany, situated in the southern part of the Eifel. Links
  • Ortsgemeinde Nusbaum
Coordinates:  
 L, Natour J, Ferraz MB, Goldenberg Goldenberg may refer to:

People:
  • Billy Goldenberg (born 1936), American composer
  • Carl Goldenberg (1907-1996), Canadian lawyer & politician
  • Charles Goldenberg (1911-1986), American football player
  • Eddie Goldenberg, Canadian political advisor
 J. Translation, adaptation and validation See validate.

validation - The stage in the software life-cycle at the end of the development process where software is evaluated to ensure that it complies with the requirements.
 of the Roland-Morris questionnaire: Brazil Brazil (brəzĭl`), Port. Brasil, officially Federative Republic of Brazil, republic (2005 est. pop. 186,113,000), 3,286,470 sq mi (8,511,965 sq km), E South America.  Roland-Morris. Braz J Med Biol Res. 2001;34:203-210.

(10) Stratford PW, Binkley JM. A comparison study of the Back Pain Functional Scale and Roland Morris Questionnaire. North American North American

named after North America.


North American blastomycosis
see North American blastomycosis.

North American cattle tick
see boophilusannulatus.
 Orthopaedic 1. See otrthopedic and orthopedics.

Adj. 1. orthopaedic - of or relating to orthopedics; "orthopedic shoes"
orthopedic, orthopedical

orthopaedic (US), orthopedic adj
 Rehabilitation rehabilitation: see physical therapy.  Research Network. J Rheumatol. 2000;27:1928-1936.

(11) Patrick DL, Deyo RA, Atlas Atlas, in Greek mythology
Atlas (ăt`ləs), in Greek mythology, a Titan; son of Iapetus and Clymene and the brother of Prometheus.
 SJ, et al. Assessing health-related quality of life in patients with sciatica sciatica (sīăt`ĭkə), severe pain in the leg along the sciatic nerve and its branches. It may be caused by injury or pressure to the base of the nerve in the lower back, or by metabolic, toxic, or infectious disease. . Spine. 1995;20:1899-1908.

(12) Johansson Johansson is a family name which means "son of Johan". It is the most common Swedish family name, followed by Andersson.

People with the family name Johansson include:
  • Anders Johansson, heavy metal drummer
  • Bernt Johansson, road bicycle racer
 E, Lindberg P. Subacute subacute /sub·acute/ (-ah-kut´) somewhat acute; between acute and chronic.

sub·a·cute
adj.
Between acute and chronic.
 and chronic low back pain: reliability and validity of a Swedish version of the Roland and Morris Disability Questionnaire. Scand J Rehabil Med. 1998;30:139-143.

(13) Wiesinger GF, Nuhr M, Quittan M, et al. Cross-cultural adaptation of the Roland-Morris questionnaire for German-speaking patients with low back pain. Spine. 1999;24:1099-1103.

(14) Underwood MR, Barnett AG, Vickers MR. Evaluation of two time-specific back pain outcome measures. Spine. 1999;24:1104-1112.

(15) Jacob Jacob (jā`kəb), in the Bible, ancestor of the Hebrews, the younger of Isaac and Rebecca's twin sons; the older was Esau. In exchange for a bowl of lentil soup, Jacob obtained Esau's birthright and, with his mother's help, received the blessing  T, Baras M, Zeev A, Epstein L. Low back pain: reliability of a set of pain measurement tools. Arch Phys Med Rehabil. 2001;82:735-742.

(16) Jensen MP, Strom SE, Turner JA, Romano JM. Validity of the Sickness SICKNESS. By sickness is understood any affection of the body which deprives it temporarily of the power to fulfill its usual functions.
     2. Sickness is either such as affects the body generally, or only some parts of it.
 impact Roland scale as a measure of dysfunction dysfunction /dys·func·tion/ (dis-funk´shun) disturbance, impairment, or abnormality of functioning of an organ.dysfunc´tional

erectile dysfunction  impotence (2).
 in chronic pain patients. Pain. 1992;50:157-162.

(17) Stratford PW, Binkley JM, Riddle riddle, puzzling question, specifically one that consists of a fanciful description or definition of something to be guessed. A famous riddle was asked by the Sphinx: "What goes on four legs in the morning, on two at noon, on three at night?" Oedipus guessed the  DL. Development and initial validation of the Back Pain Functional Scale. Spine. 2000;25:2095-2102.

(18) Winer BJ. Statistical Principles in Experimental Design. New York New York, state, United States
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of
, NY: McGraw-Hill; 1962:4-13.

Author Response:

We would like to thank Riddle and Stratford for raising a number of issues in response to our article. We found that measurements taken with the Roland Morris Questionnaire (RMQ RMQ Risk Management Questionnaire ) of subjects who were classified as unchanged were, as a proportion of the utilized scale, more variable than measurements taken using the Oswestry Disability Questionnaire, the Quebec Back Pain Disability Scale, the Physical Function Scale, and the Medical Outcomes Study 36-Item Short-Form Health Survey (SF-36). Riddle and Stratford asked why our correlation coefficients Correlation Coefficient

A measure that determines the degree to which two variable's movements are associated.

The correlation coefficient is calculated as:
 for repeated RMQ measurements were at the lower end of the range of reported values and whether a few patients with unusual variability in their scores were responsible for the results.

Correlation indices of reliability such as intraclass correlation coefficients (ICCs) indicate the error in measurements as a proportion of the total variance The discrepancy between what a party to a lawsuit alleges will be proved in pleadings and what the party actually proves at trial.

In Zoning law, an official permit to use property in a manner that departs from the way in which other property in the same locality
 in scores. (1) They are affected by sample variance (ie, the range of scores demonstrated by subjects) as well as inconsistency in·con·sis·ten·cy  
n. pl. in·con·sis·ten·cies
1. The state or quality of being inconsistent.

2. Something inconsistent: many inconsistencies in your proposal.
 in measurements. In answering the question "Why are the ICCs lower?" we would like to examine the standard error of measurement (SEM), as we believe that expressions of error in the same scale units (in this case, RMQ units) provide a more useful basis of comparison than the correlation coefficients.

Table 1 shows the SEMs reported for, or that we have calculated from, a number of studies that have reported ICCs and Pearson correlation coefficients. (2-13) The SEM provides an indication of the extent to which the average respondent In Equity practice, the party who answers a bill or other proceeding in equity. The party against whom an appeal or motion, an application for a court order, is instituted and who is required to answer in order to protect his or her interests.  varies (in RMQ units) when retested at a time when his or her condition could reasonably be considered to be unchanged. (14) Table 1 shows that SEMs for measurements taken using the RMQ ranged from 1.5 to 4.1 RMQ units. This means that, on average, subjects who are assumed to be unchanged typically could be 1.5 to 4.1 RMQ units to either side of an obtained score. We believe it is likely that the amount of expected error in self-report measurements of activity limitation varies with time between test and retest re·test  
tr.v. re·test·ed, re·test·ing, re·tests
To test again.

n.
A second or repeated test.
. Our study and that of Patrick et al (12) identified comparable estimates of error. Patrick et al were the only other researchers to report on error associated with retesting a cohort cohort /co·hort/ (ko´hort)
1. in epidemiology, a group of individuals sharing a common characteristic and observed over time in the group.

2.
 at greater than 6 weeks from the first test. Other researchers who have conducted retests 6 weeks or more after an initial test have pooled these data with data obtained for subjects retested much closer to the first test. The RMQ measurements may display increasing variability as the time between tests increases. Clinicians should consider these findings in the light of the time frames over which they typically monitor patient progress. In contrast, the other instruments used in our study yielded scores that were relatively more stable for the same subjects.

There are, we believe, several reasons for doing a reliability study. One reason is to quantify Quantify - A performance analysis tool from Pure Software.  the magnitude of error in order to assess the suitability of the measurements for application to a specific task. Another reason is to determine whether measurement consistency can be improved. This might be achieved by identifying the source of error and refining refining, any of various processes for separating impurities from crude or semifinished materials. It includes the finer processes of metallurgy, the fractional distillation of petroleum into its commercial products, and the purifying of cane, beet, and maple sugar  the way measurements are taken in order to reduce this error. When the purpose of a reliability study is to determine which of several instruments provides the most consistent measurements, we believe a head-to-head comparison (ie, having the same subjects complete the questionnaires at the same time) is the best method because it ensures that the variability between measures is not due to sample differences but is related to questionnaire characteristics.

Riddle and Stratford make the reasonable suggestion that an explanation for our observation is that a few subjects exhibiting extreme test-retest differences in a small sample distorted the results. In our group of 47 unchanged subjects, RMQ change scores ranged from -9 to +19 points, and 13 subjects (28%) had change scores of 5 points (21% of the scale width) or more. The interesting question for us is why those subjects who exhibited large variations in RMQ change scores had more stable scores on the other questionnaires (Tab. 2).

On the RMQ, subjects are asked to choose only the statements that apply to them "today." Back pain can vary considerably from day to day, whereas the overall rating of change in our study related to a 6-week period. However, the Waddell and Quebec questionnaires also refer to "today," so this in itself does not sufficiently explain why subjects were more variable in their responses to the RMQ in our study. Of respondents In the context of marketing research, a representative sample drawn from a larger population of people from whom information is collected and used to develop or confirm marketing strategy.  classified as unchanged, more than 30% changed their response to the statements "I sleep less well because of my back," "Because of my back, I go upstairs more slowly than usual," and "Because of my back, I try not to bend or Bend Or (1877-1903) was a British Thoroughbred racehorse who won the 1880 edition of the Epsom Derby. His regular jockey Fred Archer, winner of thirteen consecutive British jockey titles, said Bend Or was probably the greatest horse he had ever ridden.  kneel down." More than 20% of the respondents changed their minds on 15 out of the 24 RMQ items.

The size of the ICC does not tell us what items are more or less useful or whether the magnitude of error is acceptable for the intended use of the instrument. Close examination of patterns in the data, we believe, allows us to explore ways to refine instruments that we use to evaluate people with back pain. Publication bias, in our opinion, almost certainly confers an optimistic op·ti·mist  
n.
1. One who usually expects a favorable outcome.

2. A believer in philosophical optimism.



op
 message about measurement utility. It is likely that some of the biases that result in underpublication of clinical trials with null A character that is all 0 bits. Also written as "NUL," it is the first character in the ASCII and EBCDIC data codes. In hex, it displays and prints as 00; in decimal, it may appear as a single zero in a chart of codes, but displays and prints as a blank space.  findings (15) also lead to underpublication of reliability studies with low reliability coefficient values. However, we are not aware of any studies that have explored the extent to which investigators fail to submit, or journals reject, such studies. The responsibility of researchers is to investigate and improve the instruments that we recommend for use in the examination of patients.

The incisive incisive /in·ci·sive/ (-si´siv)
1. having the power or quality of cutting.

2. pertaining to the incisor teeth.


in·ci·sive
adj.
1. Having the power to cut.
 questions raised by Riddle and Stratford regarding our article are appreciated. Our data set nevertheless speaks for itself. We do not consider our findings or conclusions to be aberrant simply because they vary from previous findings. Most previous studies were based on different samples using shorter time frames for retesting. Indeed, the study by Patrick et al, (12) in which longer retest periods than ours were used, produced findings that were similar to ours. Clinicians and researchers should weigh this evidence when considering examination instrument choice and should be prepared to change their choice of outcome measurement tools as better options present themselves.
Table 1.

Comparison of Standard Error of Measurement (SEM) Across Studies

Authors              Retest Period   Coefficient

Jacob et al (2)      2-14 d          ICC=.93
                                     r=.88
Wiesinger et al (3)  1 d             r=.82

Johansson and        1 wk            ICC=.88
  Lindberg (4)

Nusbaum et al (5)    7 d             ICC=.94
                                     r=.88

Stratford et al (6)  3-6 wk          ICC (1,1)=.86 (.72,.94)

Kopec et al (7)      1-14 d          ICC (2,1)=.91

Roland and           Same d          r=.91
  Morris (8)

Stratford et al (9)  48 hr           ICC (2,1)=.81 (.62,.91)

Jensen et al (10)    2 d-6 mo        r=.72
                       (median=39 d)

Stratford and        48 hr           ICC(2,1)=.79 (lower
  Binkley (11)                         confident interval=.73)

Patrick et al (12)   3 mo            ICC=.76
                                     ICC=.66

Davidson and         6 wk            ICC (2,1)=.53 (.29,.71)
  Keating (13)                       ICC (2,1)=.42 (-.07,.75)

Authors               Coefficient                 SD      SEM

Jacob et al (2)       ICC=.93                     5.8     1.5
                      r=.88                               2.0

Wiesinger et al (3)   r=.82                       4       1.7

Johansson and         ICC=.88                     5       1.7
  Lindberg (4)

Nusbaum et al (5)     ICC=.94                     7.3     1.8
                      r=.88                               2.5

Stratford et al (6)   ICC (1,1)=.86 (.72,.94)     4.7     1.8

Kopec et al (7)       ICC (2,1)=.91               6 (a)   1.8

Roland and            r=.91                       6.4     1.9
  Morris (8)

Stratford et al (9)   ICC (2,1)=.81 (.62,.91)     5.2     2.1

Jensen et al (10)     r=.72                       4.3     2.3

Stratford and         ICC(2,1)=.79 (lower         5.3     2.4
  Binkley (11)          confident interval=.73)

Patrick et al (12)    ICC=.76                     6 (b)   2.9
                      ICC=.66                     6 (b)   3.5

Davidson and          ICC (2,1)=.53 (.29,.71)     5.4     3.7
  Keating (13)        ICC (2,1)=.42 (-.07,.75)    5.4     4.1

(a) Estimated from frequency counts.

(b) Estimated from pooled subject data.
Table 2.

Comparison of Change Scores (a) for Subjects Who Are Stable With
Roland-Morris Questionnaire Raw Score Change of 9 or More

Subject  Roland-Morris     Oswestry Disability   Quebec Back Pain
No.      Questionnaire     Questionnaire         Disability Scale

1             79            7                    -9
2             50           16                    35
3             37.5         16                    17
4             37.5         -9                    13
5            -37.5          8                     5

Subject  Waddell           SF-36 (b) Physical
No.      Disability Index  Function Scale

1         22
2        -11                 0
3         11                 0
4         11                10
5          0               -10

(a) All scores transformed to 0-100 to facilitate direct comparison.
SF-36PF change score for subject 1 could not be calculated due to
missing data at time 2.

(b) SF-36=Medical Outcomes Study 36-Item Short-Form Health Survey.
Megan Davidson, PT, BApSc(Physio)
Lecturer
School of Physiotherapy
Faculty of Health Sciences
La Trobe University
Victoria 3086 Australia

Jennifer Keating, PT, PhD
Senior Lecturer
School of Physiotherapy
Faculty of Health Sciences
La Trobe University


References

(1) Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater rat·er  
n.
1. One that rates, especially one that establishes a rating.

2. One having an indicated rank or rating. Often used in combination: a third-rater; a first-rater. 
 reliability. Psychol Bull. 1979;86:420-428.

(2) Jacob T, Baras M, Zeev A, Epstein L. Low back pain: reliability of a set of pain measurement tools. Arch Phys Med Rehabil. 2001;82:735-742.

(3) Wiesinger GF, Nuhr M, Quittan M, et al. Cross-cultural adaptation of the Roland-Morris questionnaire for German-speaking patients with low back pain. Spine. 1999;24:1099-1103.

(4) Johansson E, Lindberg P. Subacute and chronic low back pain: reliability and validity of a Swedish version of the Roland and Morris Disability Questionnaire. Scand J Rehabil Med. 1998;30:139-143.

(5) Nusbaum L, Natour J, Ferraz MB, Goldenberg J. Translation, adaptation and validation of the Roland-Morris questionnaire: Brazil Roland-Morris. Braz J Med Biol Res. 2001;34:203-210.

(6) Stratford PW, Finch E. Solomon P, et al. Using the Roland-Morris scale to make decisions about individual patients. Physiotherapy Canada. 1996;48:107-110.

(7) Kopec JA, Esdaile JM, Abrahamowicz M, et al. The Quebec Back Pain Disability Scale: measurement properties. Spine. 1995;20: 341-352.

(8) Roland M, Morris R. A study of the natural history of back pain, part I: development of a reliable and sensitive measure of disability in low back pain. Spine. 1983;8:141-144.

(9) Stratford PW, Binkley JM, Riddle DL. Development and initial validation of the Back Pain Functional Scale. Spine. 2000;25:2095-2102.

(10) Jensen ME Strom SE, Turner JA, Romano JM. Validity of the Sickness Impact Profile Roland scale as a measure of dysfunction in chronic pain patients. Pain. 1992;50:157-162.

(11) Stratford PW, Binkley JM. A comparison of the Back Pain Functional Scale and Roland Morris Questionnaire. North American Orthopaedic Rehabilitation Research Network. J Rheumatol. 2000;8:1928-1936.

(12) Patrick DL, Deyo RA, Atlas SJ, et al. Assessing health-related quality of life in patients with sciatica. Spine. 1995;20:1899-1908.

(13) Davidson M, Keating JL. A comparison of five low back disability questionnaires: reliability and responsiveness. Phys Ther. 2002; 82:8-24.

(14) Keating J, Matyas T. Unreliable inferences from reliable measurements. Australian Australian

pertaining to or originating in Australia.


Australian bat lyssavirus disease
see Australian bat lyssavirus disease.

Australian cattle dog
a medium-sized, compact working dog used for control of cattle.
 Journal of Physiotherapy. 1998;44:5-10.

(15) Song F, Eastwood AJ, Gilbody S, et al. Publication and related biases. Health Technol Assess. 2000;4:10.
COPYRIGHT 2002 American Physical Therapy Association, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2002, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Stratford, Paul W
Publication:Physical Therapy
Date:May 1, 2002
Words:3936
Previous Article:Evidence in practice: patient examples are designed to illustrate how evidence is gathered and used to guide clinical decision making.
Next Article:Article titles in rehabilitation literature.(Bibliography)



Related Articles
Reliability and validity of the Dutch adaptation of the Quebec Back Pain Disability Scale.
Defining the minimum level of detectable change for the Roland-Morris Questionnaire. (disability evaluation standards)(includes commentary and author...
Physical therapy utilization by patients with acute low back pain.
A new approach to measuring recovery in injured workers with acute low back pain: Resumption of Activities of Daily Living Scale. (includes appendix)
Functional Abilities Confidence Scale: a clinical measure for injured workers with acute low back pain. (includes appendix)
A Comparison of a Modified Oswestry Low Back Pain Disability Questionnaire and the Quebec Back Pain Disability Scale.
A comparison of five low back disability questionnaires: reliability and responsiveness. (Research Report).(Statistical Data Included)
OLD, NEW LOVE IN 'POSSESSION'.(U)(Review)
MOM MAKES DO WHILE SON IN IRAQ.(News)
Management of symphysis pubis dysfunction during pregnancy using exercise and pelvic support belts.(Research Report)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles