Printer Friendly
The Free Library
6,672,202 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Performance measures provide assessments of pain and function in people with advanced osteoarthritis of the hip or knee.


Patients with osteoarthritis osteoarthritis
 or osteoarthrosis or degenerative joint disease

Most common joint disorder, afflicting over 80% of those who reach age 70. It does not involve excessive inflammation and may have no symptoms, especially at first.
 (OA) and those proto arthroplasty often present with pain limitations of physical function. The rate, pattern, and direction of change may differ for pain and function depending on the period in the natural or clinical history over which a patient is assessed. (1,2) For example, at 2 months after total joint arthroplasty total joint arthroplasty
n.
Arthroplasty in which both joint surfaces are replaced with artificial materials, usually metal and high-density plastic.
 of the hip or knee, patients' pain ratings were shown to be comparable to or lower than their preoperative pre·op·er·a·tive
adj.
Preceding a surgical operation.



preoperative

preceding an operation.


preoperative care
the preparation of a patient before operation.
 values; however, the time to complete performance tasks was substantially increased. (2,3) Because pain and physical function represent different but related health concepts and interventions targeting them frequently differ, separate assessments of these attributes were recommended at the Outcome Measures in Arthritis Clinical Trials conference (OMERACT OMERACT Outcome Measures in Rheumatoid Arthritis Clinical Trials ) III. (4) An intriguing in·trigue  
n.
1.
a. A secret or underhand scheme; a plot.

b. The practice of or involvement in such schemes.

2. A clandestine love affair.

v.
 aspect of the OMERACT III outcome model was that although the assessment of physical function was essential, the application of performance tests was optional. (4) Self-report measures of physical function also were favored over performance measures in an authoritative review of outcome measures for people with OA. (5) This recommendation apparently was based on the lower cost and ease of administration associated with self-report measures; however, it assumes that self-report measures and performance measures of physical function assess the same attribute and nothing else.

Although many self-report measures profess pro·fess  
v. pro·fessed, pro·fess·ing, pro·fess·es

v.tr.
1. To affirm openly; declare or claim: "a physics major
 to assess physical function, few provide an operational definition of its intended meaning. A noted exception is the Western Ontario and McMaster Universities McMaster University, at Hamilton, Ont., Canada; nondenominational; founded 1887. It has faculties of humanities, science, social sciences, business, engineering, and health sciences, as well as a school of graduate studies and a divinity college.  Osteoarthritis Index (WOMAC WOMAC Western Ontario McMaster University Osteoarthritis Index Rheumatology An arthritic pain scoring system ranging from 0–no pain/disability to 100–most severe pain/disability ) physical function subscale, which provides the following statement: "By this we mean your ability to move around and to look after yourself." (6) We suspect that this statement captures the intended meaning of lower-extremity physical functional status left undeclared by many researchers, and it is representative of our view of lower-extremity functional status.

Contrary to the belief that self-report measures and performance measures of physical function provide comparable information is a body of work refuting this idea. (2,3,7) Parent and Moffet, (2) in a study of patients after total knee arthroplasty, noted improvement in self-reported physical function as measured by the WOMAC and the Medical Outcomes Study 36-Item Health Survey Questionnaire (SF-36) physical function subscales but a significant reduction in the 6-minute walking distance when assessed at 2 months after arthroplasty. Maly et al, (7) in an investigation of patients with OA of the knee, reported higher correlations between pain and WOMAC and SF-36 physical function scores than between pain and 3 performance measures (Six-Minute Walk Test six-minute walk test

an assessment of a dog's ability to undertake daily activities.
, Timed "Up & Go" Test, and a stair stair  
n.
1. A series or flight of steps; a staircase. Often used in the plural.

2. One of a flight of steps.



[Middle English, from Old English
 test). Using a stepwise stepwise

incremental; additional information is added at each step.


stepwise multiple regression
used when a large number of possible explanatory variables are available and there is difficulty interpreting the partial regression
 linear regression Linear regression

A statistical technique for fitting a straight line to a set of data points.
 analysis that included pain and thigh thigh (thi) femur; the portion of the leg above the knee.

thigh
n.
The part of the leg between the hip and the knee. Also called femur.
 muscle strength as independent variables and WOMAC and SF-36 physical function subscales as dependent variables, these investigators also found that pain was more predictive of self-reported function than muscle strength. (7) Stratford and Kennedy, (3) in a study of patients after hip or knee arthroplasty, reported higher standardized standardized

pertaining to data that have been submitted to standardization procedures.


standardized morbidity rate
see morbidity rate.

standardized mortality rate
see mortality rate.
 regression coefficients Regression coefficient

Term yielded by regression analysis that indicates the sensitivity of the dependent variable to a particular independent variable. See: Parameter.


regression coefficient 
 between pain and WOMAC physical function scores and change scores than between pain and the time or distance associated with several performance tasks. Also reported in this article was the finding that self-reported Lower-Extremity Functional Scale (LEFS LEFS Local Enterprise Finance Scheme (Singapore) ) scores were most strongly associated with pain preoperatively, exertion exertion,
n vigorous action, a great effort, a strong influence.
 when assessed within 2 weeks of arthroplasty, and the time or distance associated with performance measures when evaluated approximately 2 months after arthroplasty. (3)

A further insight is provided in a study that examined the relationship between performance-rated components of pain, exertion, and function (time or distance) and LEFS scores. (8) Patients with end-stage OA of the hip or knee and awaiting arthroplasty performed 3 performance tasks--40-m self-paced walk, stair test, and Timed "Up & Go" Test--and completed the LEFS. Immediately following each performance task, patients reported the amount of pain and exertion that they experienced. (8) An exploratory factor analysis identified 3 factors, with pain responses loading on 1 factor, exertion loading on the second factor, and time loading on the third factor. The LEFS loaded on all 3 factors (pain=.44; exertion=.41; and time=.35). (8) Recently, Terwee et al (9) examined the relationship between the WOMAC and SF-36 pain and function subscales with the performance-based DynaPort Knee Test * for patients with OA of the knee before and after arthroplasty. Applying an exploratory factor analysis, these investigators found that the self-report measures of pain and function loaded on 1 factor and that the performance measure loaded on a second factor. (9) The 8F-36 function score loaded on both factors, with the higher loading on the factor composed of the self-report measures (.78 and .69). (9) Collectively, these findings support the premise that self-report measures of physical function assess more than a patient's ability to move around. (2,3,7-9) It appears that, in addition to providing patients' perceptions of their ability to move around, self-report measures of physical function also are influenced by what patients experience when moving around (eg, pain and exertion).

A further understanding of the relationship between self-report assessments of pain and physical function is offered by a number of studies that examined the factorial factorial

For any whole number, the product of all the counting numbers up to and including itself. It is indicated with an exclamation point: 4! (read “four factorial”) is 1 × 2 × 3 × 4 = 24.
 validity of the WOMAC. Factorial validity exists to the extent that items cluster in accordance Accordance is Bible Study Software for Macintosh developed by OakTree Software, Inc.[]

As well as a standalone program, it is the base software packaged by Zondervan in their Bible Study suites for Macintosh.
 with the specified domains to which they have been assigned by the measure's developer. The WOMAC was conceived to assess 3 domains: pain, stiffness, and physical function. (10) Accordingly, factorial validity would exist if the 5 pain items loaded on 1 factor, the 2 stiffness items loaded on a second factor, and the 17 physical function items loaded on a third factor. However, there is consistent evidence demonstrating that the WOMAC pain and physical function items group more by activity than by the hypothesized domains of pain and physical function. (11-14)

There is no doubt that pain and physical function are related health concepts. Yet to the extent that during assessments, clinicians routinely inquire in·quire   also en·quire
v. in·quired, in·quir·ing, in·quires

v.intr.
1. To seek information by asking a question: inquired about prices.

2.
 about pain and physical function separately, outcome measures have separate scales to assess pain and function, and due to the fact that authoritative groups such as OMERACT III have identified pain and physical function as 2 core outcome measures rather than 1, investigators are challenged to develop assessment methods that maximize valid information concerning the attributes of interest. It was with these challenges in mind that we undertook the present study.

Our intent was to determine whether performance test assessments of pain and physical function provided responses consistent with these 2 domains. Specifically, our goal was to evaluate the factorial validity of performance assessments of pain and physical function. Our specific hypotheses were as follows: (1) responses to the performance assessments could be explained by 2 factors, 1 consisting of pain items and the other consisting of time (distance) items; (2) each pain or performance item would be related only to the health concept that it was perceived to be assessing (each item would have a nonzero non·ze·ro  
adj.
Not equal to zero.



nonzero  

Not equal to zero.
 loading on the factor that it was conceived to measure and a zero loading on the other factor); (3) the factors pain and physical function would be correlated cor·re·late  
v. cor·re·lat·ed, cor·re·lat·ing, cor·re·lates

v.tr.
1. To put or bring into causal, complementary, parallel, or reciprocal relation.

2.
; and (4) the measurement error terms associated with the items would be uncorrelated.

Method

Subjects

Patients were eligible for this study if they were able to speak and comprehend written English; were diagnosed with end-stage OA of the hip or knee, as labeled by the surgeon and confirmed by radiographs; were scheduled to undergo primary total hip arthroplasty total hip arthroplasty,
n total hip replacement; surgical reconstruction of the hip in which the ball-and-socket joint is replaced with a prosthesis.
 or total knee arthroplasty; were able to complete the performance tests; and provided written informed consent. Patients undergoing revision or bilateral arthroplasty or additional operative procedures or those demonstrating comorbidities associated with cognitive impairment Impairment

1. A reduction in a company's stated capital.

2. The total capital that is less than the par value of the company's capital stock.

Notes:
1. This is usually reduced because of poorly estimated losses or gains.

2.
 were excluded. Of the 188 patients reviewed, 177 (94%) met the eligibility criteria. The study sample consisted of 81 participants with hip OA and 96 participants with knee OA. Eighty-five of the participants were women, 36 of whom received total hip arthroplasty. The mean age and body mass index of the sample were 65 years (first and third quartiles: 58.0 and 72.0) and 29.1 kg/[m.sup.2] (first and third quartiles: 26.4 and 33.0), respectively. A breakdown of the participants' characteristics by site of OA is shown in Table 1. The study took place at a tertiary-care orthopedic orthopedic /or·tho·pe·dic/ (-pe´dik) pertaining to the correction of deformities of the musculoskeletal system; pertaining to orthopedics.  hospital in Toronto, Ontario, Canada, and data were collected from November 2001 to February 2003.

Design

We applied a cross-sectional study cross-sectional study
n.
See synchronic study.


cross-sectional study,
n the scientific method for the analysis of data gathered from two or more samples at one point in time.
 design. Participants completed the performance tests at a median interval of 20 days (first and third quartiles: 12 and 22) before surgery.

Measures

Participants completed 4 performance measures in the following order: self-paced walk, Timed "Up & Go" Test, stair test, and Six-Minute Walk Test. Several minutes were provided between the self-paced walk, Timed "Up & Go" Test, and stair test. A 10-minute rest interval was provided between the stair test and the Six-Minute Walk Test. With the exception of the Six-Minute Walk Test, the outcome was the time to complete the task. Time was measured on a stopwatch to the nearest one-hundredth of a second, and distance was measured to the nearest meter.

Self-paced walk. Participants walked 2 lengths of a 20-m indoor course in response to the instructions, "Walk as quickly as you can without overexerting yourself." (15) The turnaround time (1) In batch processing, the time it takes to receive finished reports after submission of documents or files for processing. In an online environment, turnaround time is the same as response time.  was excluded. An intraclass correlation In statistics, the intraclass correlation (or the intraclass correlation coefficient[1]) is a measure of correlation, consistency or conformity for a data set when it has multiple groups.  coefficient coefficient /co·ef·fi·cient/ (ko?ah-fish´int)
1. an expression of the change or effect produced by variation in certain factors, or of the ratio between two different quantities.

2.
 (ICC ICC

See: International Chamber of Commerce
) for test-retest reliability test-retest reliability Psychology A measure of the ability of a psychologic testing instrument to yield the same result for a single Pt at 2 different test periods, which are closely spaced so that any variation detected reflects reliability of the instrument  of .91 and a standard error of measurement of 1.73 seconds have been reported for this measure for patients similar to the participants in the present study. (15)

Timed "Up & Go" Test. Participants were instructed to rise from a standard arm chair, walk at a safe and comfortable pace to a line 3 m away, cross the line, turn, and return to a sitting position in the chair. (16) An ICC for test-retest reliability of .75 and a standard error of measurement of 1.07 seconds have been reported for this measure for patients with OA and those undergoing arthroplasty of the hip or knee. (15)

Stair test. Participants ascended and descended 9 stairs (step height, 20 cm; step depth, 27 cm) in their usual manner at a safe and comfortable pace. (15) A handrail was available. An ICC for test-retest reliability of .90 and a standard error of measurement of 2.35 seconds have been reported for this measure for patients similar to the participants in the present study. (15)

Six-Minute Walk Test. Participants were instructed to cover as much distance as possible during the 6-minute time frame. Standardized encouragement--"You are doing well, keep up the good work"--was provided at 60-second intervals. The test was conducted on a premeasured, 46-m, unobstructed, uncarpeted, rectangular rec·tan·gu·lar  
adj.
1. Having the shape of a rectangle.

2. Having one or more right angles.

3. Designating a geometric coordinate system with mutually perpendicular axes.
 circuit. The outcome was the distance walked in 6 minutes. (15,17) An ICC for test-retest reliability of .94 and a standard error of measurement of 26.29 m have been reported for this measure for patients similar to the participants in the present study. (15)

Activity-specific pain rating. Participants marked the pain that they experienced on an 11-point (0-10) numeric numeric

see numerical.


numeric cluster
see ten-key pad.
 rating scale immediately following each performance test. (15) We are not aware of test-retest reliability values for patients similar to the participants in the present study; however, a reliability estimate (ICC) of .86 and a standard error of measurement of 1.04 have been reported for people with a spectrum of lower-extremity problems. (18)

Data Analysis

We applied confirmatory factor analysis In statistics, confirmatory factor analysis (CFA) is a special form of factor analysis. It is used to assess the the number of factors and the loadings of variables.  with a maximum-likelihood estimation estimation

In mathematics, use of a function or formula to derive a solution or make a prediction. Unlike approximation, it has precise connotations. In statistics, for example, it connotes the careful selection and testing of a function called an estimator.
 method (AMOS Amos (ā`məs), prophetic book of the Bible. The majority of its oracles are chronologically earlier than those of the Bible's other prophetic books. His activity is dated c.760 B.C.  4.0 ([dagger])) to assess the factorial validity of the performance tests. (19-22) Unlike exploratory factor analysis, which provides all possible factor loadings, confirmatory factor analysis provides factor loadings for the specified model only. We conceptualized a measurement model with 2 factors, which we labeled pain and physical function (Fig. 1). (22) We applied the following indexes to assess model fit: comparative fit index (CFI CFI
abbr.
cost, freight, and insurance
), relative fit (RF), Tucker-Lewis Index (TLI (Transport Level Interface) A common interface for transport services (layer 4 of the OSI model). It provides a common language to a transport protocol and allows client/server applications to be used in different networking environments. ), root-mean-square error of approximation approximation /ap·prox·i·ma·tion/ (ah-prok?si-ma´shun)
1. the act or process of bringing into proximity or apposition.

2. a numerical value of limited accuracy.
 (RMSEA RMSEA Root Mean Square Error of Approximation ), and the model fit chi-square test chi-square test: see statistics.  and associated P value. (22) Although no single standard exists for defining acceptable model fit, the following values are generally accepted: CFI, RF, and TLI values exceeding .95 indicate good fit; RMSEA values of less than .05 indicate good fit; and RMSEA values of less than .08 indicate reasonable fit. (22,23) A significant chi-square value (eg, P<.05) indicates that the data do not fit the model. Prior to conducting the analyses, we assessed the data and found several of the underlying distributions to be nonnormal. Accordingly, we applied the bootstrap See boot.

(operating system, compiler) bootstrap - To load and initialise the operating system on a computer. Normally abbreviated to "boot". From the curious expression "to pull oneself up by one's bootstraps", one of the legendary feats of Baron von Munchhausen.
 feature of AMOS 4.0 for 1,000 samples with replacement to estimate the parameter (1) Any value passed to a program by the user or by another program in order to customize the program for a particular purpose. A parameter may be anything; for example, a file name, a coordinate, a range of values, a money amount or a code of some kind.  values and model fit indexes. (22)

[FIGURE 1 OMITTED]

To enhance the validity and generalizability of our final model, we performed 2 cross-validation procedures. First, we stratified stratified /strat·i·fied/ (strat´i-fid) formed or arranged in layers.

strat·i·fied
adj.
Arranged in the form of layers or strata.
 by site (hip or knee) and used a random-number generator to create 2 samples with approximately equal representation of hips and knees.

One group was used to generate the initial model and modifications (n=88:48 knees, 40 hips), and the second group was used to cross-validate the model (n=89: 48 knees, 41 hips). The second cross-validation procedure repeated the steps described above; however, this time 1 group was composed of participants with knee OA and the other group was composed of participants with hip OA.

Results

Results for the first cross-validation analysis were similar for the combined samples, which included participants with knee OA and hip OA in each group (initial sample: [[chi square chi square (kī),
n a nonparametric statistic used with discrete data in the form of frequency count (nominal data) or percentages or proportions that can be reduced to frequencies.
].sub.8]=7.7, P=.473; cross-validation sample: [[chi square].sub.8]=9.6, P=.269; simultaneous test for a difference between model structures: [[chi square].sub.16]=17.6, P=.351). Similar results also were obtained for the second cross-validation analysis (knee sample: [[chi square].sub.8] 12.1, P=.148; hip sample: [[chi square].sub.8] = 13.3, P=. 108; simultaneous test for a difference between model structures: [[chi square].sub.16]=25.3, P=.064). Given that the cross-validation analyses supported the model for various independent subgroups of participants, we present the results for the entire sample of 177 participants.

Descriptive statistics descriptive statistics

see statistics.
 for the performance measures are shown in Table 2, Figure 1 shows the standardized factor loadings for the initial measurement model (model 1), and Table 3 shows the fit statistics. The observed or measured variables in Figure 1 are shown in rectangles, and the latent variables In statistics, Latent variables (as opposed to observable variables), are variables that are not directly observed but are rather inferred (through a mathematical model) from other variables that are observed and directly measured.  are shown in circles. The larger circles labeled "pain" and "function" designate des·ig·nate  
tr.v. des·ig·nat·ed, des·ig·nat·ing, des·ig·nates
1. To indicate or specify; point out.

2. To give a name or title to; characterize.

3.
 the factors, and the smaller circles with numbered "e" values signify sig·ni·fy  
v. sig·ni·fied, sig·ni·fy·ing, sig·ni·fies

v.tr.
1. To denote; mean.

2. To make known, as with a sign or word: signify one's intent.
 the measurement error terms associated with each observed variable. The numbers between the factors and observed variables connected by single-headed arrows represent the standardized factor loadings. The negative value associated with the function component of the 6-minute walk test occurs because higher functional levels are associated with greater distances, whereas shorter times reflect higher functional levels for the other 3 performance tests. The curved double-headed arrow showing a value of .48 represents the correlation between the factors pain and function. Although the CFI, RF, and TLI exceeded .90 (Tab. 3), the root-mean-square coefficient indicated a less-than-desirable fit. The modification index for this model (not shown) suggested that the model could be improved by adding a correlation between the stair pain and time error terms, and we elected to address this association with 2 revised models. To ascertain the magnitude of the correlated error terms, the first revised model (model 2a) specified a correlation between the stair pain and time error terms (Fig. 2a: curved double headed arrow showing a correlation of .41). The second revised model (model 2b) removed the stair pain and time terms (Fig. 2b). The fit statistics for both models are shown in Table 3. Both modified models improved the fit over that of the initial model. However, of the 2 modified models, only the 1 that removed the stair terms achieved a good fit for all indexes and was consistent with all of our initial hypotheses.

[FIGURE 2 OMITTED]

Discussion

Although OMERACT III identified pain and physical function as 2 core outcome measures requiring separate assessments, (4) we suspect that most investigators would agree that these health concepts are related in patients with OA and those progressing to arthroplasty. The question unanswered to this point is whether a more distinct assessment of these health concepts can be obtained than has been reported for self-report measures such as the WOMAC. Repeatedly, studies have failed to support the factorial validity of the WOMAC pain and physical function subscale (the principal self-report measure for patients with OA of the hip or knee), (11-14) and these data have led some investigators to abandon the notion of separate assessments of these concepts in favor of grouping WOMAC items by activity regardless of their health domain. For example, Angst angst 1
n.
A feeling of anxiety or apprehension often accompanied by depression.



angst 2
abbr.
angstrom
 et a124 suggested combining pain and physical function items to form 4 subscales, which they labeled lying/sitting, standing/walking, bending, and ascending/descending. This approach ignores the recommendation of OMERACT III that pain and physical function should represent 2 core outcome measures. Moreover, from a clinical perspective, combining pain and physical function items into a single domain does not assist clinicians in identifying treatment goals. Is the patient's principal problem pain, physical function, or a combination of these 2 concepts? In addition, after a course of treatment targeting pain is provided, is the patient's poor score on the standing/walking domain a result of ineffective pain management or a consequence of the patient's reduced ability to move around for reasons other than pain (eg, poor balance, muscle weakness, or restricted range of motion)?

Rather than adhering ADHERING. Cleaving to, or joining; as, adhering to the enemies of the United States.
     2. The constitution of the United States, art. 3, s 3, defines treason against the United States, to consist only in levying war against them or in adhering to their enemies,
 to the notion that self-report measures represent the preferred method of assessing physical function, we examined whether performance-specific evaluations of pain and physical function provide a viable method for obtaining a more distinct assessment of these 2 related health concepts than has been reported for self-report measures, such as the WOMAC. Our initial model yielded a correlation of .48 between pain and function, providing support for hypotheses 1 and 3, which conceptualized 2 correlated health concepts. Moreover, our second hypothesis was sustained in that significant correlations were obtained for the specified health concepts, and no evidence of loading on the nonspecified health concept was evident. However, our fourth hypothesis was not supported in that the error terms for stair pain and function were correlated. This finding led to the exploration of 2 revised models: 1 allowed a correlation between stair pain and time error terms, and the other removed the stair terms from the model. The intent of the model that allowed a correlation between the stair pain and time terms was to examine the extent to which these components were correlated beyond the correlation between the factors pain and function. The second revised model excluded the stair test from the analysis, and this model provided results consistent with our 4 hypotheses. The correlation between the factors pain and function was .43 for the final model. This correlation is lower than that typically reported between the pain and function subscales of the WOMAC (.74-.84) (7,9,25) and SF-36 (.57) (9) for patients reasonably similar to the participants in the present study.

Inclusion of the stair test makes the distinction between the health concepts of pain and function less discernable. This finding is reflected in the lower correlation between pain and function noted in model 2b (r=.43) than in models 1 and 2a (r =.48). Accordingly, when the clinical goal is to obtain as distinct an assessment as possible between the health concepts of pain and function, our results suggest that the stair test not be included in a composite score. However, we are not suggesting that the stair test be excluded from a patient's assessment. It is clear that 1 of the physical therapist's responsibilities for patients similar to the participants in the present study is to ascertain their ability to safely ascend and descend de·scend  
v. de·scend·ed, de·scend·ing, de·scends

v.intr.
1. To move from a higher to a lower place; come or go down.

2.
 stairs and to intervene when appropriate. We simply stress that if the results from the stair test are combined with the results from the other performance measures, then the impressions of pain and function will be less distinct.

Assessments of pain and function are important both to identify patients' problems at a point in time and to assess change over time. Information from these assessments is applied by clinicians to guide decisions concerning individual patients, by researchers to ascertain the relative effectiveness of competing interventions in clinical trials, and by health care policy makers to set benchmarks regarding the maximum number of patient visits and corresponding payment plans. Previous work demonstrated that self-reports of physical function after arthroplasty are strongly influenced by pain and change in pain. (3) The consequences are that patients report their physical function to be higher than is demonstrated by performance tests and that health care professionals who rely on self-reports alone overestimate o·ver·es·ti·mate  
tr.v. o·ver·es·ti·mat·ed, o·ver·es·ti·mat·ing, o·ver·es·ti·mates
1. To estimate too highly.

2. To esteem too greatly.
 patients' functional status levels. (2,3) The results of the confirmatory factor analysis of the present study indicate that performance-rated pain and function represent 2 factors that have not emerged in previous factor analyses Verb 1. factor analyse - to perform a factor analysis of correlational data
factor analyze

analyse, analyze - break down into components or essential features; "analyze today's financial market"
 of self-report measures. Accordingly, complementing existing self-report assessments of physical function with performance-rated pain and function tests may provide clinicians with a more valid assessment of these health concepts.

There are several potential limitations of the present study. First, the study sample was patients awaiting hip or knee arthroplasty. Presumably pre·sum·a·ble  
adj.
That can be presumed or taken for granted; reasonable as a supposition: presumable causes of the disaster.
, these patients have more severe OA than the typical patient seen in general physical therapist practice. However, in considering this point, it should be remembered that the study participants were able to complete all of the performance tests. A second limitation relates to the sample size for the cross-validation portion of the present study. Although there is no standard method for estimating sample size, it is generally agreed that the sample size should be at least 10 subjects per observed variable or a minimum of 100 subjects. (21,26) Although our overall sample size of 177 participants exceeded the recommended minimum sample size, the number of participants in each of the cross-validation samples was slightly smaller than the recommended sample size.

Conclusion

Our goal was to determine whether performance-specific assessments of pain and physical function could provide a more distinct evaluation of these attributes than has been found for self-report measures. We conceived a 2-factor model consisting of pain and physical function and tested the model with 4 activities (self-paced walk test, Timed "Up & Go" Test, stair test, and Six-Minute Walk Test) by use of a confirmatory factor analysis. Although the initial model appeared promising, the stair test pain and function error terms were correlated. Dropping the stair test from the analysis provided results that supported the application of performance-specific assessments of pain and function as a method of i obtaining reasonably distinct assessments of these attributes. We believe that performance-specific assessments of pain and function offer a more distinct method of assessing these attributes than can be obtained by self-reports alone and that performance measures should be viewed as core measures for people with OA of the hip or knee and those progressing to arthroplasty.

This article was received January 4, 2006, and was accepted July 5, 2006.

References

(1) Kennedy DM. Modelling Early Recovery Following Total Hip and Knee Replacement: The Case for Self-report and Physical Performance Measures [thesis]. Hamilton, Ontario, Canada: McMaster University; 2003.

(2) Parent E, Moffet H. Comparative responsiveness of locomotor lo·co·mo·tor or lo·co·mo·tive
adj.
Of or relating to movement from one place to another.



locomotor

of or pertaining to locomotion.
 tests and questionnaires used to follow early recovery after total knee arthroplasty. Arch Phys Med Rehabil. 2002;83:70-80.

(3) Stratford PW, Kennedy DM. Performance measures were necessary to obtain a complete picture of osteoarthritic patients. J Clin Epidemiol. 2006;59:160-167.

(4) Bellamy N, Kirwan J, Boers M, et al. Recommendations for a core set of outcome measures for future phase III clinical trials Noun 1. phase III clinical trial - a large clinical trial of a treatment or drug that in phase I and phase II has been shown to be efficacious with tolerable side effects; after successful conclusion of these clinical trials it will receive formal approval from the  in knee, hip, and hand osteoarthritis: consensus development at OMERACT III. J Rheumatol. 1997;24:799-802.

(5) Bellamy N. Osteoarthritis clinical trials: candidate variables and clinimetric properties. J Rheumatol. 1997;24:768-778.

(6) Bellamy N. WOMAC Osteoarthritis Index User Guide IV. Herston, Queensland Herston is an inner suburb of Brisbane, Australia located 3km north of the Brisbane CBD. Geography
Herston is dominated by the Royal Brisbane and Women's Hospital.
, Australia: University of Queensland The University of Queensland (UQ) is the longest-established university in the state of Queensland, Australia, a member of Australia's Group of Eight, and the Sandstone Universities. It is also a founding member of the international Universitas 21 organisation. ; 2000.

(7) Maly MR, Costigan PA, Olney SJ. Determinants of self-report outcome measures in people with knee osteoarthritis. Arch Phys Med Rehabil. 2006;87:96-104.

(8) Stratford PW, Kennedy D, Pagura SMC SMC Saint Mary's College
SMC Santa Monica College
SMC Solaris Management Console
SMC Smooth Muscle Cell
SMC Small Magellanic Cloud (also see LMC)
SMC Safety Management Certificate (maritime shipping) 
, et al. The relationship between self-report and performance-related measures: questioning the content validity content validity,
n the degree to which an experiment or measurement actually reflects the variable it has been designed to measure.
 of timed tests. Arthritis Rheum rheum (rldbomacm) any watery or catarrhal discharge.

rheum
n.
A watery or thin mucous discharge from the eyes or nose.



rheum

any watery or catarrhal discharge.
. 2003;49:535-540.

(9) Terwee CB, van der Slikke RMA (RealMedia Architecture) See RealMedia. , van Lummel RC, et al. Self-reported physical functioning was more influenced by pain than performance-based physical functioning in knee-osteoarthritis patients. J Clin Epidemiol. 2006;59:724-731.

(10) Bellamy N, Buchanan WW, Goldsmith CH, et al. Validation See validate.

validation - The stage in the software life-cycle at the end of the development process where software is evaluated to ensure that it complies with the requirements.
 study of WOMAC: a health status instrument for measuring clinically important patient relevant outcomes to antirheumatic drug therapy in patients with osteoarthritis of the hip or knee. J Rheumatol. 1988;15:1833-1840.

(11) Guermazi M, Poiraudean S, Yahia M, et al. Translation, adaptation and validation of the Western Ontario and McMaster Universities osteoarthritis index (WOMAC) for an Arab population: the Sfax modified WOMAC. Osteoarthritis Cartilage cartilage (kär`təlĭj), flexible semiopaque connective tissue without blood vessels or nerve cells. It forms part of the skeletal system in humans and in other vertebrates, and is also known as gristle. . 2004;12:459-468.

(12) Thumboo J, Chew LH, Soh CH. Validation of the Western Ontario and McMaster University osteoarthritis index in Asians with osteoarthritis in Singapore. Osteoarthritis Cartilage. 2001;9:440-446.

(13) Faucher M, Poiraudeau S, Lefevre-Colau MM, et al. Algo-functional assessment of knee osteoarthritis: comparison of the test-retest reliability and construct validity construct validity,
n the degree to which an experimentally-determined definition matches the theoretical definition.
 of the WOMAC and Lequesne indexes. Osteoarthritis Cartilage. 2002;10:602-610.

(14) Kennedy D, Stratford PW, Pagura SMC, et al. Exploring the factorial validity and clinical interpretability of the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC). Physiother Can. 2003;55:160-168.

(15) Kennedy DM, Stratford PW, Wessel J, et al. Assessing stability and change of four performance measures: a longitudinal study longitudinal study

a chronological study in epidemiology which attempts to establish a relationship between an antecedent cause and a subsequent effect. See also cohort study.
 evaluating outcome following total hip and knee arthroplasty. BMC (BMC Software, Inc., Houston, TX, www.bmc.com) A leading supplier of software that supports and improves the availability, performance, and recovery of applications in complex computing environments.  Musculoskelet Disord. 2005;6:3.

(16) Podsiadlo D, Richardson S Richardson, city (1990 pop. 74,840), Dallas and Collins counties, N Tex., a suburb of Dallas; founded in the 1850s, inc. as a city 1956. Richardson manufactures telecommunications equipment, medical devices, supercomputers, computer chips, and fiber optics. . The timed "Up & Go": a test of basic functional mobility for frail elderly frail elderly,
n.pl older persons (usually over the age of 75 years) who are afflicted with physical or mental disabilities that may interfere with the ability to independently perform activities of daily living.
 persons. J Am Geriatr Soc. 1991;39: 142-148.

(17) Guyatt GH, Sullivan MJ, Thompson PJ, et al. The 6-minute walk: a new measure of exercise capacity in patients with chronic heart failure. Can Med Assoc J. 1985;132:919-923.

(18) Stratford PW, Spadoni G. The reliability, consistency, and clinical application of a numeric pain rating scale. Physiother Can. 2001;53: 88-91, 114.

(19) Streiner DL, Norman GR. Health Measurement Scales: A Practical Guide to Their Development and Use. 3rd ed. New York New York, state, United States
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of
, NY: Oxford University Press; 2003.

(20) Swisher swisher Sexology A regional term for a really queer queer, not that there's anything wrong with that  LL, Beckstead JW, Bebeau MJ. Factor analysis as a tool for survey analysis using a professional role orientation inventory as an example. Phys Ther. 2004;84:784-799.

(21) de Vet HC, Ader HJ, Terwee CB, et al. Are factor analytical Adj. 1. factor analytical - of or relating to or the product of factor analysis
factor analytic
 techniques used appropriately in the validation of health status questionnaires? A systematic review on the quality of factor analysis of the SF-36. Qual Life Res. 2005;14:1203-1218.

(22) Byrne BM. Structural Equation Modeling Structural equation modeling (SEM) is a statistical technique for testing and estimating causal relationships using a combination of statistical data and qualitative causal assumptions.  With AMOS: Basic Concepts, Applications, and Programming. Mahwah, NJ: Lawrence Erlbaum Associates; 2001.

(23) Schumacker RE, Lomax RG. A Beginner's Guide to Structural Equation Modeling. Mahwah, NJ: Lawrence Erlbaum Associates; 1996.

(24) Angst F, Ewert T, Lehmann S Leh·mann   , Lotte 1888-1976.

German-born American soprano known for her performances in operas by Richard Strauss. She sang with the Metropolitan Opera in New York City (1934-1945).
, et al. The factor subdimensions of the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) help to specify hip and knee osteoarthritis: a prospective evaluation and validation study. J Rheumatol. 2005;32:1324-1330.

(25) Stratford PW, Kennedy DM, Hanna SE. Condition-specific Western Ontario McMaster Osteoarthritis Index was not superior to region-specific Lower Extremity lower extremity
n.
The hip, thigh, leg, ankle, or foot. Also called inferior limb, pelvic limb.
 Functional Scale at detecting change. J Clin Epidemiol. 2004;57:1025-1032.

(26) Norman GR, Streiner DL. Biostatistics biostatistics /bio·sta·tis·tics/ (-stah-tis´tiks) biometry.

bi·o·sta·tis·tics
n.
The science of statistics applied to the analysis of biological or medical data.
: The Bare Essentials. 2nd ed. Hamilton, Ontario, Canada: BC Decker; 2000.

* McRoberts BV, The Hague, the Hague, The (hāg), Du. 's Gravenhage or Den Haag, Fr. La Haye, city (1994 pop. 445,279), administrative and governmental seat of the Kingdom of the Netherlands, capital of South Holland prov., W Netherlands, on the North Sea.  Netherlands.

([dagger]) SmallWaters Corp, 1507 E 53rd St, Suite 452, Chicago, IL 60615.

PW Stratford, PT, MSc, is Professor, School of Rehabilitation rehabilitation: see physical therapy.  Science, and Associate Member, Department of Clinical Epidemiology epidemiology, field of medicine concerned with the study of epidemics, outbreaks of disease that affect large numbers of people. Epidemiologists, using sophisticated statistical analyses, field investigations, and complex laboratory techniques, investigate the cause  and Biostatistics, McMaster University, Hamilton, Ontario, and a Scientific Affiliate in the Department of Surgery, Sunnybrook Health Sciences Centre Sunnybrook Health Sciences Centre or Sunnybrook, is a hospital located in Toronto, Ontario. The origins of Sunnybrook go back to the 1880s. The Sunnybrook campus first opened in 1943 after the land, Sunnybrook Farm was donated by Joseph and Alice M. Kilgour. , Toronto, Ontario, Canada. Address all correspondence to Mr Stratford at: straffor@mcmaster.ca.

DM Kennedy, BScPT, MSc, is the Manager of Program Development for Hip and Knee Replacement, Holland Orthopaedic & Arthritic Centre of Sunnybrook Health Sciences Centre, and Part-time Assistant Clinical Professor, School of Rehabilitation Science, McMaster University.

LJ Woodhouse, PT, PhD, is Assistant Professor, School of Rehabilitation Science, McMaster University, and a Scientific Affiliate in the Department of Surgery, Sunnybrook Health Sciences Centre.

All authors provided concept/idea/research design and writing. Ms Kennedy provided data collection, subjects, and institutional liaisons. Mr Stratford and Dr Woodhouse provided data analysis and fund procurement The fancy word for "purchasing." The procurement department within an organization manages all the major purchases. .

The Research and Ethics Committee ethics committee A multidisciplinary hospital body composed of a broad spectrum of personnel–eg, physicians, nurses, social workers, priests, and others, which addresses the moral and ethical issues within the hospital. See DNR, Institutional review board.  of the Holland Orthopaedic & Arthritic Centre of Sunnybrook Health Sciences Centre approved this study.
Table 1.
Descriptive Statistics of Study Sample

                                Patients           Patients
                                With Hip           With Knee
                                Osteoarthritis     Osteoarthritis
Characteristic                  (n = 81)           (n = 96)

Sex (no. of women/men)          36/45              49/47
Age, y, median                  60.5 (53.0, 69.0)  69.0 (63.0, 74.0)
  (first, third quartiles)
Body mass index, kg/[m.sup.2],  27.3 (25.0, 31.6)  30.2 (27.6, 34.4)
  median (first, third
  quartiles)

Table 2.
Descriptive Statistics for Performance Measures

                                      Function, Time/
Test                       Pain (a)   Distance (a)

Self-paced walk (s)        3 (2, 5)   30.4 (25.6, 36.3)
Stair test (s)             4 (2, 6)   15.2 (11.1, 24.6)
Timed "Up & Go" Test (s)   3 (1, 5)   9.4 (7.7, 11.7)
Six-Minute Walk Test (m)   5 (3, 7)   407 (325, 499)

(a) Median (first, third quartiles).

Table 3.
Fit Statistics for Initial and Modified Measurement Models

Statistic                    Model 1       Model 2a (a)  Model 2b (b)

Comparative fit index         .957           .975           .991
Relative fit index            .914           .939           .964
Tucker-Lewis Index            .936           .961           .983
Root-mean-square error of     .124           .097           .071
  approximation
Model fit chi-square (P)   70.71 (<.001)  48.11 (<.001)  15.14 (.056)

(a) Stair test error terms were correlated.

(b) Stair test--both pain and time--was removed from the model.


Invited Commentary

The article by Stratford et al provides an excellent model for analyzing the performance of outcome measures. In the true context of construct validity, they evaluated whether the scales are measuring what they are intended to measure and they brought modern analytic techniques into the analysis to do so. They analyzed an·a·lyze  
tr.v. an·a·lyzed, an·a·lyz·ing, an·a·lyz·es
1. To examine methodically by separating into parts and studying their interrelations.

2. Chemistry To make a chemical analysis of.

3.
 4 performance-based measures of lower-extremity pain and function with the goal of seeing whether performance-based measures are better able to separate out the concepts of pain and function than previous experience with patient self-report measures. The work by Stratford et al is an excellent example of the need to push our measurement work and test all of the assumptions under which we believe we are measuring a certain construct.

There are 2 main points regarding the art of statistical modeling and conceptual frameworks For the concept in aesthetics and art criticism, see .

A conceptual framework is used in research to outline possible courses of action or to present a preferred approach to a system analysis project.
 that I would like to raise in this commentary, which the readers might consider as they review this work.

The Art of Statistical Modeling

Structural equation modeling is a powerful and attractive tool for this type of work. Indeed, it offers a route into exactly what the authors wanted to explore. However, there is always a bit of art to the modeling process, and guidelines guidelines,
n.pl a set of standards, criteria, or specifications to be used or followed in the performance of certain tasks.
 as to how to make it work best for your situation. (1,2) Structural equation modeling is a large-sample method. Stratford et al used a time-honored method of splitting their data in order to have a test and validation sample to confirm their methods; however, in so doing, they were working at or below the very lowest limits of the sample size requirements. Kline (1) suggested a minimum of 100 to 150 people, and approximately 10 to 20 people per parameter estimate. With 8 attributes loading onto 2 factors, the authors were estimating 17 parameters. The small sample size in general and the number of parameters being estimated could lead to a misestimation mis·es·ti·mate  
tr.v. mis·es·ti·mat·ed, mis·es·ti·mat·ing, mis·es·ti·mates
To estimate incorrectly.



mis·es
 of the model and an inability to converge con·verge  
v. con·verged, con·verg·ing, con·verg·es

v.intr.
1.
a. To tend toward or approach an intersecting point: lines that converge.

b.
. The latter was not the case in this study as the authors reached convergence. Two other things also support the conclusion that the model did not suffer from the small sample. First, the authors had similar findings on the second half of the data. Second, the variables in the model were highly correlated with the latent Hidden; concealed; that which does not appear upon the face of an item.

For example, a latent defect in the title to a parcel of real property is one that is not discoverable by an inspection of the title made with ordinary care.
 trait trait (trat)
1. any genetically determined characteristic; also, the condition prevailing in the heterozygous state of a recessive disorder, as the sickle cell trait.

2. a distinctive behavior pattern.
, which means they were less vulnerable to misestimation. The authors made the choice to split the data, taking the risks associated with small samples. Another choice would have been to only run the model on the full sample and take advantage of being closer to the recommended sample sizes. This is an important consideration for the application of analytic techniques such as structural equation modeling--they take large numbers of observations.

Analysts using the standard error of the mean also make a series of decisions about how to model. Stratford et al began with a 2-factor model to see whether there were 2 concepts being measured: pain and function. Their description seems to suggest that they loaded the items on the respective factors and tested the model. I wonder whether we would have had even more information had they allowed the items to load on both factors or if they had tried just 1 factor first--both of which would have allowed them to demonstrate that their measures did not fit a model with either of the cross loadings, or only one latent trait. This would add strength to their findings that they observed a 2-factor model, and it would have offered more information and better data fit than if they had tried to load all indicators onto 1 factor (latent trait). Indeed, had they done this, they could have tested for the improved explanatory ex·plan·a·to·ry  
adj.
Serving or intended to explain: an explanatory paragraph.



ex·plan
 power and fit of their model compared with one with just 1 factor with a chi-square difference test. (1) The authors were clear in their decisions as to how they modeled, and these comments highlight the interpretive in·ter·pre·tive   also in·ter·pre·ta·tive
adj.
Relating to or marked by interpretation; explanatory.



in·terpre·tive·ly adv.
 part of any form of path or structural equation modeling.

Conceptual Frameworks

Measurement requires 2 conceptual frameworks to be considered: (1) that of the target construct and (2) that of the instrument selected to define that construct. We have come a long way in rehabilitation toward an understanding of the former conceptual framework. Stratford et al stated that they were measuring physical function, and they used the definition based on the work of Bellamy (3,4) to define their target: "the ability to move around and to look after yourself." Pain was not specifically defined by the authors. They identified these as 2 distinct concepts using the Outcome Measures in Arthritis Clinical Trials conference (OMERACT) III recommendations. What they have not provided is an overall framework that would help "guide our communication, clinical research, and patient care" as Jette noted in this journal. (5) Such a framework helps to elaborate on the anticipated relationships between factors such as pain and function, and in this case may have helped in offering an explanation for the differences between self-report and performance-based measurement.

Stratford et al discussed the difference between their findings--where performance-based indicators of function and self-reported pain fell into separate factors. They contrasted this with past experiences of being unable to separate self-reported function and self-reported pain. A broader conceptual framework might have aided in this. If placed within Verbrugge and Jette's framework, (6) Stratford et al would have been measuring a functional limitation when using a time test of function. Several of the self-report measures might be measuring more at the level of the whole task in an unrestricted context--the disability level--and more likely to be influenced by personal and environmental factors than the structured timed tests. This supports Stratford and colleagues' findings, but from a position of difference in concept rather than difference in quality of either type of scale (timed versus self-report measures of physical function). The same would be true if put within the International Classification of Functioning, Disability and Health International Classification of Functioning, Disability and Health, also known as ICF, is a classification of the health components of functioning and disability.  (ICF (Internet Connection Firewall) The built-in firewall in Windows XP. It provides a stateful inspection of packets which accepts only responses to requests originated by the user. ) model, (7) where the timed test would be considered an activity limitation and appraisal of functioning as a whole might shift that to participation. In both situations, the timed test and self-report both tap physical functioning, but at different levels of the conceptual frameworks we use. Interestingly, the final revision of the model provided by Stratford et al removed one task due to an undesirable correlation between the timed outcome for this test and its pain rating. The task was performing the stair test, which was distinct from the other tasks in that it was moving toward a higher level of contextualized complexity, much like serf-report of physical function at the level of disability. (5) Just as with the self-report measures, perhaps the distinction between pain and disability in this severely disabled population is lost when you move toward more applied complex appraisals of activities in daily life.

The second framework that must be considered is that of the measure that has been chosen. Hopefully, the developers have provided their definition and have described how they operationalized it in the development of their outcome measure; however, many do not. (8) The user then must appraise appraise v. to professionally evaluate the value of property including real estate, jewelry, antique furniture, securities, or in certain cases the loss of value (or cost of replacement) due to damage.  this and make sure it matches with the intended target. Stratford et al have clearly articulated an a priori a priori

In epistemology, knowledge that is independent of all particular experiences, as opposed to a posteriori (or empirical) knowledge, which derives from experience.
 definition of their target--physical function--that includes both mobility and taking care of oneself. In the end, they have 3 of the 4 measures in the model. All 3 measures focus on the timed performance of mobility tasks: Timed "Up & Go" Test, serf-paced walk, and Six-Minute Walk Test. The second domain is the pain experience immediately after each of these tasks. As mentioned above, the most functional one--the stair test--was dropped. Were the 3 remaining tasks measuring something congruent con·gru·ent  
adj.
1. Corresponding; congruous.

2. Mathematics
a. Coinciding exactly when superimposed: congruent triangles.

b.
 with Stratford and colleagues' definition of physical function, or were they measuring timed performance of mobility tasks alone? Similarly, Stratford and colleagues' pain scale focused on the pain associated with the timed tasks, not pain as we typically might measure it. Is this a broad enough measure of pain related to osteoarthritis, or do we need to consider a broader scale (pain visual analog scale or numeric rating scale without attribution at·tri·bu·tion  
n.
1. The act of attributing, especially the act of establishing a particular person as the creator of a work of art.

2.
 to a task as suggested by OMERACT, or other arthritis-related pain scales)?

Stratford et al have pushed us to demand more from our measures, and specifically to make sure that our measures are measuring what they are supposed to be measuring in the way that we expect. The points raised above offer some possible explanations, all of which are testable in a study that fields both self-report and performance-based measures with attributed and non-attributed pain items and in a large enough sample to allow for even more confidence in large-sample modeling techniques, perhaps in people along the course of their experience with osteoarthritis rather than just at the end stage. The door has been opened, and the rehabilitation community could easily collaborate on such a venture.

Measurement is the application of a set of rules to get numeric quantification quan·ti·fy  
tr.v. quan·ti·fied, quan·ti·fy·ing, quan·ti·fies
1. To determine or express the quantity of.

2.
 of attributes--in Stratford and colleagues' case, the pain and physical function concepts. Once we believe we have a good way of measuring these concepts, we keep pushing the boundaries to make sure we can interpret the scores in ways that we should. Stratford and colleagues have done that in their study.

References

(1) Kline RB. Principles and Practice of Structural Equation Modeling. 2nd ed. New York, NY: The Guilford Press; 2005.

(2) Fayers PM, Hand DJ. Factor analysis, causal indicators and quality of life. Qual Life Res. 1997;6:139-150.

(3) Bellamy N. WOMAC Osteoarthritis Index User Guide IV. Herston, Queensland, Australia: University of Queensland; 2000.

(4) Bellamy N, Kirwan J, Boers M, et al. Recommendations for a core set of outcome measures for future phase III clinical trials in knee, hip, and hand osteoarthritis: consensus development at OMERACT III. J Rheumatol. 1997;24:799-802.

(5) Jette AM. Toward a common language for function, disability, and health. Phys Ther. 2006;86:726-734.

(6) Verbrugge LM, Jette AM. The disablement process. Soc Sci Med. 1994;38:1-14.

(7) International Classification of Functioning, Disabilty and Health: ICF. Geneva Geneva, canton and city, Switzerland
Geneva (jənē`və), Fr. Genève, canton (1990 pop. 373,019), 109 sq mi (282 sq km), SW Switzerland, surrounding the southwest tip of the Lake of Geneva.
, Switzerland: World Health Organization; 2001.

(8) Lohr KN, Aaronson NK, Alonso J, et al. Evaluating quality-of-life and health status instruments: development of scientific review criteria. Clin Ther. 1996;18:979-992.

Dorcas Beaton, BScOT, PhD

Scientist and Director

Mobility Program Clinical Research Unit

St Michael's Hospital

Assistant Professor

Department of Occupational Sciences

Graduate Departments of Rehabilitation Sciences and Health Policy Management and Evaluation

University of Toronto Research at the University of Toronto has been responsible for the world's first electronic heart pacemaker, artificial larynx, single-lung transplant, nerve transplant, artificial pancreas, chemical laser, G-suit, the first practical electron microscope, the first cloning of T-cells,

Toronto, Ontario, Canada

Author Response

We thank Dr Beaton for her stimulating commentary, which conveys important information concerning statistical modeling and conceptual frameworks relevant to the assessment of health outcomes. Her commentary includes general insights and issues specific to our article. Beaton has organized her review under 2 headings, and we will apply the same 2 headings in response to her comments.

The Art of Statistical Modeling

We agree with Beaton's position that art complements the science of statistical modeling in general and structural equation model specifically. Beaton wonders how information concerning a single-factor model Single-factor model

A model of security returns that acknowledges only one common factor. The single factor is usually the market return. See: Factor model.
 or a model that includes the cross-loading of items on both factors would affect our conclusions. Our rationale for starting with a 2-factor model without cross-loading was that we conceptualized 2 factors with the relevant items loading uniquely on their respective factors. Although we did not desire cross-loading at the item-factor level, we imagined the factors would be related and allowed for a correlation between the factors. However, in response to Beaton's query regarding additional factor loading models, we will briefly summarize sum·ma·rize  
intr. & tr.v. sum·ma·rized, sum·ma·riz·ing, sum·ma·riz·es
To make a summary or make a summary of.



sum
 the results from 2 additional analyses. The first was a confirmatory factor analysis that specified a single factor with all pain and time items loading on this factor. The chi-square value indicated a poorly fitting mode ([chi square]=419.4, df=20, P<.001) with the following factor loadings: self-paced walk-pain=.49, stair test-pain=.44, Timed "Up & Go" Test-pain=.49, Six-Minute Walk Test-pain=.34, self-paced walk-time =.94, stair test-time=.88, Timed "Up & Go" Test-time=.92, and 6-Minute Walk Test-distance= -.84. The negative loading with distance occurs because greater distances are associated with more desirable health status levels, whereas faster times and lower pain levels are related to more desirable health status levels. Our initial 2-factor model, which specified pain and time (distance) items loading on their respective factors, yielded a chi-square value of 70.7 on 19 degrees of freedom. The difference in chi-square values between the single-factor model and our initial 2-factor model is highly significant in favor of the 2-factor model ([chi square]= 348.7, df=1, P<.001).

The second supplementary analysis did not assume an a priori factor structure and allowed for all possible cross-loadings. We accomplished this by applying exploratory factor analysis using maximum likelihood estimation and oblique o·blique
adj.
Situated in a slanting position; not transverse or longitudinal.



oblique

slanting; inclined.
 rotation. Two factors had eigenvalues eigenvalues

statistical term meaning latent root.
 greater than 1. Time/distance items loaded highly on one factor, and pain items loaded highly on the other factor. The correlation between factors was .46. The Table reports a summary of the pattern matrix factor loadings. It is evident from the Table that cross-loadings on the no-congruent factor were negligible This article or section is written like a personal reflection or and may require .
Please [ improve this article] by rewriting this article or section in an .
. In summary, these supplementary analyses further support a 2-factor model that does not include a cross-loading of items.

Conceptual Frameworks

We have divided Beaton's comments into the following 3 topics and respond to each in turn: (1) our application of Bellamy's definition of lower-extremity physical function, (2) the proposition that the distinction between pain and function may diminish as the contextual complexity of an item increases (framed in the International Classification of Functioning, Disability and Health [ICF] lexicon, participation items are more contextually complex than activity items), and (3) the proposition that pain may be better assessed by applying nonattributed measures compared with attributed (eg, task-specific) measures of pain.

Bellamy is one of the few measure developers to offer a definition or clarifying phrase for the concept of lower-extremity functional status: "by this we mean your ability to move around and to look after yourself." (1,2) We applied Bellamy's definition; however, we agree with Beaton's point that, in the context of our assessment process, the performance measures focus on "the ability to move around" and not on "the ability to look after yourself." Thus, within the ICF classification scheme, our performance measures assess aspects of activity limitation and not participation restriction.

The second point offered by Beaton is the speculation that the distinction between pain and function may become less distinct as the contextual complexity of an item or performance task increases. We are not aware of a study that has explored this exciting hypothesis prospectively. However, in a previous study that attempted to explain why the WOMAC physical function subscale could not detect deterioration de·te·ri·o·ra·tion
n.
The process or condition of becoming worse.
 identified by performance measures in patients 2 weeks after hip or knee arthroplasty, physical function items were divided into 2 sets. (3) One set contained items similar to those on the WOMAC pain subscale, and the second set consisted of items not specific to the items on the WOMAC pain subscale. Item scores within each set were summed to yield total scores for the similar and not-specific item sets. The total item score for the not-specific item set detected deterioration consistent with the times from the performance tests. The item structure of the 2 sets of items is pertinent to our current discussion. All items in the set that did not detect deterioration tapped activity limitation; all but one item in the set that detected deterioration assessed participation restrictions. Although the results of this study are at odds with Beaton's hypothesis, the study was not conceived to investigate Beaton's hypothesis. We strongly support the need to investigate prospectively the hypothesis raised by Beaton. As a minor point, Beaton has suggested that our stair climbing Stair climbing is the climbing of a flight of stairs. It is often described as a "low-impact" exercise, often for people who have recently started trying to get in shape.

A common phrase in health pop culture is "Take the stairs, not the elevator".
 test may be more contextually complex; however, given the way this activity was framed in our investigation--without any more relevance to self-care, occupation, and recreation than the other 3 performance tests--we are uncomfortable accepting this specific example.

Beaton's third point addresses the assessment of pain. Given the body of evidence suggesting that self-report measures of physical function are strongly influenced by pain, (4-6) we examined whether a more distinct assessment of these 2 attributes could be achieved by performance measures. Our goal was to determine whether performance measures could bring into focus 2 health concepts that to date have been blurred blur  
v. blurred, blur·ring, blurs

v.tr.
1. To make indistinct and hazy in outline or appearance; obscure.

2. To smear or stain; smudge.

3.
 by self-report measures. We believe that complementing, not replacing, self-report functional status and pain measures with performance measures will increase the validity of impressions of pain and physical function formed by clinicians. We concur CONCUR - ["CONCUR, A Language for Continuous Concurrent Processes", R.M. Salter et al, Comp Langs 5(3):163-189 (1981)].  with Beaton's point that pain is a complex health concept and that further investigation concerning attributed and nonattributed pain ratings is warranted in a larger sample of patients.

References

(1) Bellamy N. An Evaluative Index for Clinical Trials. Hamilton, Ontario, Canada: McMaster University; 1982.

(2) Bellamy N. WOMAC Osteoarthritis Index User Guide IV. Herston, Queensland, Australia: University of Queensland; 2000.

(3) Stratford PW, Kennedy DM. Does parallel item content on WOMAC's Pain and Function Subscales limits its ability to detect change in functional status? BMC Musculoskelet Disord. 2004;5:17.

(4) Maly MR, Costigan PA, Olney SJ. Determinants of self-report outcome measures in people with knee osteoarthritis. Arch Phys Med Rehabil. 2006;87:96-104.

(5) Terwee CB, van der Slikke RM, van Lummel RC, et al. Self-reported physical functioning was more influenced by pain than performance-based physical functioning in knee-osteoarthritis patients. J Clin Epidemiol. 2006;59:724-731.

(6) Stratford PW, Kennedy DM. Performance measures were necessary to obtain a complete picture of osteoarthritic patients. J Clin Epidemiol. 2006;59:160-167.

Paul W Stratford, PT, MSc

Deborah M Kennedy, BScPT, MSc

Linda J Woodhouse, PT, PhD
Table.
Pattern Matrix Factor Loading

                                  Factor

                                 1       2

Self-paced walk-time           .992   -.063
Timed "Up & Go" Test-time      .898    .029
Stair test-time                .841    .057
6-Minute Walk Test-distance   -.841    .004
Self-paced walk-pain           .044    .869
Timed "Up & Go" Test-pain      .051    .852
6-Minute Walk Test-pain       -.077    .792
Stair test-pain                .024    .789
COPYRIGHT 2006 American Physical Therapy Association, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2006, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Research Report
Author:Woodhouse, Linda J.
Publication:Physical Therapy
Geographic Code:1CANA
Date:Nov 1, 2006
Words:7924
Previous Article:A comparison of physical functional performance and strength in women with fibromyalgia, age- and weight-matched controls, and older women who are...
Next Article:Gait characteristics of elderly people with a history of falls: a dynamic approach.(Research Report)



Related Articles
Osteoarthritis.(Pamphlet)
Philadelphia panel evidence-based clinical practice guidelines on selected rehabilitation interventions for knee pain. (Specail Issue).
Osteoarthritis: is more attention to nutritional health required? (Original Research).
Determinants of function after total knee arthroplasty. (Research Report).
California Hospital Medical Center: younger, more active people opt for joint replacement surgery.(Advertisement)
California Hospital Medical Center: younger, more active people opt for joint replacement surgery.(Advertisement)
The influence of pathology, pain, balance, and self-efficacy on function in women with osteoarthritis of the knee.(Research Report)
Treatment of osteoarthritis.(Featured CME Topic: Arthritis)
Contribution of psychosocial and mechanical variables to physical performance measures in knee osteoarthritis.(Research Report)
Aquatic physical therapy for hip and knee osteoarthritis: results of a single-blind randomized controlled trial.(Research Report)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles