Printer Friendly
The Free Library
5,666,273 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

On "prospective evaluation of the AM-PAC-CAT ...".


On "Prospective Evaluation of the AM-PAC-CAT ..." Jette et al. Phys Ther. 2007;87:385-398.

To the editor: The authors are to be complimented for a strong paper describing a methodologically complex process in an understandable manner. Although others have applied computerized adaptive testing Computerized adaptive testing is a more commonly used term [1] for Computer-adaptive testing.  (CAT) applications in outpatient rehabilitation rehabilitation: see physical therapy.  for several years, (1) Jette et al (2) are the first to publish results of a practical application of a CAT in a peer-reviewed journal peer-reviewed journal Refereed journal Academia A professional journal that only publishes articles subjected to a rigorous peer validity review process. Cf Throwaway journal. .

The strength of the work by Jette et al lies in the process used to develop the product. Item response theory Item response theory is a body of theory used in the field of psychometrics. Pychometrics is concerned with the theory and technique of educational and psychological measurement.  (IRT IRT Item Response Theory
IRT In Regard To
IRT Incident Response Team
IRT In Reference To
IRT In Regards To
IRT Icing Research Tunnel (wind tunnel)
IRT Interborough Rapid Transit
) methods and CAT applications have the potential to be the foundation of outcomes measurement development in rehabilitation just as they were in educational measurement. (3) We should not forget that IRT and CAT are not new; they are just new to rehabilitation and medicine. Jette et al discuss in the current paper and in earlier work how these methods can be used to develop a new outcomes scale, assess the strengths and weaknesses of the scale, and discuss how a scale can be improved when scale deficits are identified via practical application in busy clinics. These methods are sorely sore·ly  
adv.
1. Painfully; grievously.

2. Extremely; greatly: Their skills were sorely needed.
 needed for many common paper-and-pencil instruments that are so popular in rehabilitation.

The study is not without limitations, many of which are detailed nicely by Jette et al. One psychometric psy·cho·met·rics  
n. (used with a sing. verb)
The branch of psychology that deals with the design, administration, and interpretation of quantitative tests for the measurement of psychological variables such as intelligence, aptitude, and
 issue not discussed relates to differential item functioning Differential item functioning (DIF) occurs when people from different groups (commonly gender or ethnicity) with the same latent trait (the same ability/skill) have a different probability of giving a certain response on a questionnaire or test.  (DIF (1) (Data Interchange Format) A standard file format for spreadsheet and other data structured in row and column form. Originally developed for VisiCalc, DIF is now under Lotus' jurisdiction. ). (4) Differential item functioning occurs when patients from different groups--for example, patients with hip versus knee impairments--have different probabilities of endorsing item response categories. In clinical terms, that means patients with knee impairments perceive the act of squatting squatting /squat·ting/ (skwaht´ing) a position with hips and knees flexed, the buttocks resting on the heels; sometimes adopted by the parturient at delivery or by children with certain types of cardiac defects.  as more difficult compared with patients who have hip impairments, which is clinically logical and important. (5) Differential item functioning is common in patients treated in outpatient rehabilitation. (5,6) When DIF is present and of practical importance, the lack of control for DIF can erode Erode (ĕrōd`), city (1991 urban agglomeration pop. 361,755), Tamil Nadu state, S India, on the Kaveri River. The city is located in a cotton-growing region, and its industries include cotton ginning and the manufacture of transport equipment.  the validity of the outcomes measure. (3)

One of the strengths of IRT techniques is the ability to detect and possibly control for DIE However, when DIF is identified but of no practical importance, DIF can be ignored when calibrating items. (7,8) Discussion of DIF at least by body part treated would have strengthened the Jette et al paper, particularly because differences in item calibrations by body part treated (6) have been published for the physical functioning items of the SF-36, (9) which appear to be included in the AM-PAC-CAT item bank. (10) From previous evidence, (5,6) it would not be unexpected to find DIF between patients with hip, knee, or foot/ankle impairments, between patients with shoulder compared with elbow/wrist/hand impairments, and between patients with lumbar lumbar /lum·bar/ (lum´bar) pertaining to the loins.

lum·bar
adj.
Of, near, or situated in the part of the back and sides between the lowest ribs and the pelvis.
 compared with cervical impairments for body mobility and activity item banks.

The authors rely on earlier factor analytic Adj. 1. factor analytic - of or relating to or the product of factor analysis
factor analytical
 work (10) that identified the mobility and daily activity constructs. Although the conceptual foundation identifying these 2 factors appears sound, there is evidence that these factors might not be distinctly separate. In the original sample that did not contain patients treated in outpatient facilities, (10) factor loadings supported grouping items into the mobility and activity factors. However, in the current study of outpatients, using the CATs developed from a sample that may not have included outpatients provides some evidence supporting the need for further unidimensionality testing. Specifically, in the current study, mobility measures were most responsive for patients with lower-extremity impairments compared with patients with spine or upper-extremity impairments. The lowest effect size for both scales and all impairments was for patients with upper-extremity impairments using the mobility scale. However, the greatest effect size for the activity scale was recorded for the patients with upper-extremity impairments.

These results are clinically logical, given the items and sample. However, do the activity and mobility items really describe different constructs for patients treated in outpatient clinics? Could the mobility and activity items be combined into one item bank that is "essentially unidimensional u·ni·di·men·sion·al  
adj.
One-dimensional.

Adj. 1. unidimensional - relating to a single dimension or aspect; having no depth or scope; "a prose statement of fact is unidimensional, its value being measured wholly in terms
," where one dimension is dominant, possibly in the presence of one or more minor dimensions, (11) without erosion of the scale psychometrics psychometrics

Science of psychological measurement. Psychometricians design and administer psychological tests (see psychological testing), both to generate empirical data on mental processes and to refine their understanding of measurement techniques and the
? Do patients' impairments demand different scales in order to assess the most appropriate construct of interest to the patient, that is, mobility for lower-extremity impairments versus activity for upper-extremity impairments? Do more difficult items (assessed using item calibrations) describe a separate construct compared with easier items, regardless of construct (mobility versus activity)?

If payers were to reimburse re·im·burse  
tr.v. re·im·bursed, re·im·burs·ing, re·im·burs·es
1. To repay (money spent); refund.

2. To pay back or compensate (another party) for money spent or losses incurred.
 out-patient therapy services for value (unit of functional improvement per dollar cost), (12) which construct should be used, that is, should we assess mobility for patients with lower-extremity impairments, and activity for patients with upper-extremity impairments? Which construct is more important for patients with cervical or lumbar impairments? The Jette et al results combined with the results of Hart and colleagues (5,7,8) suggest the need for further assessment of item unidimensionality in patients receiving outpatient therapy. In addition, given that CATs are continuously evolving, how do developers, journal editors, and users keep current with pertinent CAT changes?

The results describing the responsiveness and construct validity construct validity,
n the degree to which an experimentally-determined definition matches the theoretical definition.
 of the AM-PAC-CAT measures support previous work using CATs applied in outpatient rehabilitation. For example, the effect size for prospectively collected data using body part-specific CATs on average was 0.92 in an earlier study, (1) which is similar to the highest effect size for patients with lower-extremity impairment Impairment

1. A reduction in a company's stated capital.

2. The total capital that is less than the par value of the company's capital stock.

Notes:
1. This is usually reduced because of poorly estimated losses or gains.

2.
 in the Jette et al study. However, Stratford and Riddle (13) recommend using an external standard to assess sensitivity to change in a sample of patients who are likely to change at different rates. Such analyses are recommended for future AM-PAC-CAT investigations. Furthermore, construct validity results using CATs in the Hart and Connolly (1) report are similar to the results reported by Jette et al. Taken together, results support that CAT administrations produce responsive and valid estimates of ICF (Internet Connection Firewall) The built-in firewall in Windows XP. It provides a stateful inspection of packets which accepts only responses to requests originated by the user.  activity measures in patients receiving outpatient therapy.

Jette et al describe in detail the content balancing performed by their CAT. However, is content balancing, which was developed for educational tests, as important in outpatient rehabilitation as it is in educational testing? The answer may be "probably." Given that the primary advantage of CAT applications is reduced respondent burden without erosion of measure precision and validity, the answer may be that providers should take advantage of the efficiency of CATs and collect more data. In this way, providers in busy clinics can assess multiple constructs of interest efficiently, such as mobility, activity, and fear-avoidance. (14) CATs should save clinicians time assessing multiple constructs.

Finally, the collaboration of good researchers and a proprietary database management company (eg, CRE CRE Commercial Real Estate
CRE Corporate Real Estate
CRE Commission for Racial Equality (Scotland)
CRE CCD (Charge Coupled Device) and Readout Electronics
CRE Camp Response Element
 Care LLC (Logical Link Control) See "LANs" under data link protocol.

LLC - Logical Link Control
) facilitated the implementation of the current study. The integration of good science, electronic application of psychometrically sound outcomes instruments in busy clinics, and a journal's need to publish scientifically sound material produced a result that may affect clinical practice positively. As payers progress toward new methods of payment that may include value-based purchasing, (2,12,15,16) proprietary database management companies may become more important, as they manage large databases of scientifically sound outcomes measures without undo political pressures. Jette et al and the editors of PTJ PTJ Part-Time Job  have taken the "high-road" by publishing this paper, and the readers will be the benefactors.

Thank you for the opportunity to contribute to this important discussion.

Dennis t Hart

DL Hart, PT, PhD, is Director of Consulting and Research, Focus on Therapeutic Outcomes Inc.

This letter was posted as a Rapid Response on February 22, 2007, at ww.ptjournal.org.

References

(1) Hart DL, Connolly JB. Pay-for-Performance for Physical Therapy and Occupational Therapy: Medicare Part B Services. Grant #18-P-93066/9-01: Health & Human Services/Centers for Medicare & Medicaid Services; 2006.

(2) Jette AM, Haley SM, Tao W, et al. Prospective evaluation of the AM-PAC-CAT in outpatient rehabilitation settings. Phys Ther. 2007;87:385-398.

(3) Wainer H, ed. Computerized Adaptive Testing. A Primer. 2nd ed. Mahway, NJ: Lawrence Erlbaum Associates; 2000.

(4) Crane PK, Hart DL, Gibbons Famous people named Gibbons include:
  • Beth Gibbons (born 1965), British singer
  • Billy Gibbons, guitarist for ZZ Top
  • Cedric Gibbons (1893–1960), American art director
  • Christopher Gibbons (1615 - 1676), English composer, son of Orlando
 LE, Cook KE A 37-item shoulder functional status item pool had negligible differential item functioning. J Clin Epidemiol. 2006;59:478-484.

(5) Hart DL, Mioduski JE, Stratford PW. Simulated computerized adaptive tests for measuring functional status were efficient with good discriminant validity Discriminant validity describes the degree to which the operationalization is not similar to (diverges from) other operationalizations that it theoretically should not be similar to.  in patients with hip, knee, or foot/ ankle impairments. J Clin Epidemiol. 2005;58:629-638.

(6) Hart DL. Assessment of unidimensionality of physical functioning in patients receiving therapy in acute, orthopedic outpatient centers. J Outcome Meas. 2000;4:413-430.

(7) Hart DL, Cook KF, Mioduski JE, et al. Simulated computerized adaptive test for patients with shoulder impairments was efficient and produced valid measures of function. J Clin Epidemiol. 2006;59:290-298.

(8) Hart DL, Mioduski JE, Werneke MW, Stratford PW. Simulated computerized adaptive test for patients with lumbar spine Lumbar spine
The segment of the human spine above the pelvis that is involved in low back pain. There are five vertebrae, or bones, in the lumbar spine.

Mentioned in: Low Back Pain
 impairments was efficient and produced valid measures of function. J Clin Epidemiol. 2006;59:947-956.

(9) Ware JE Jr, Sherbourne CD. The MOS (1) (Metal Oxide Semiconductor) See MOSFET.

(2) (Mean Opinion Score) The quality of a digitized voice line. It is a subjective measurement that is derived entirely by people listening to the calls and scoring the results from
 36-item short-form health survey (SF-36), I: conceptual framework For the concept in aesthetics and art criticism, see .

A conceptual framework is used in research to outline possible courses of action or to present a preferred approach to a system analysis project.
 and item selection. Med Care. 1992;30:473-483.

(10) Haley SM, Coster Cos´ter   

n. 1. One who hawks about fruit, green vegetables, fish, etc.
 WJ, Andres PL, et al. Activity outcome measurement for post-acute care. Med Care. 2004;42(1 suppl): 149-161.

(11) Stout WF. A new item response theory modeling approach with applications to unidimensionality assessment and ability estimation. Psychometrika. 1990;55:293-325.

(12) Porter ME, Teisberg EO. Redefining Health Care. Creating Value-Based Competition on Results. Boston, Mass: Havard Business School Press; 2006.

(13) Stratford PW, Riddle DL. Assessing sensitivity to change: choosing the appropriate change coefficient. Health Qual Life Outcomes. 2005;3:23.

(14) Waddell G, Newton M, Henderson I, et al. A Fear-Avoidance Beliefs Questionnaire (FABQ FABQ Fear Avoidance Beliefs Questionnaire ) and the role of fear-avoidance beliefs in chronic low back pain and disability. Pain. 1993;52:157-168.

(15) Institute of Medicine. Crossing the Quality Chasm: A New Health System for the 21st Century. Washington, DC: National Academy Press; 2001.

16 Institute of Medicine. Rewarding Provider Performance: Aligning Incentives in Medicare. Washington, DC: National Academies Press; 2006.

[DOI (Digital Object Identifier) A method of applying a persistent name to documents, publications and other resources on the Internet rather than using a URL, which can change over time. : 10.2522/ptj.2007.87.5.609]

Author Response

We appreciate Dr Hart's thoughtful letter in response to our article. We agree with his major points and would like to comment on a few of the issues he raises.

We agree with Hart that differential item functioning (DIF) across patients with various impairments is clinically relevant and an issue deserving of additional study. Although this issue was not the focus of our paper, we did examine the potential presence of DIF in the AM-PAC items most frequently administered in the AM-PAC-CAT across outpatients in our sample with different primary impairments. Only 1 item out of 36 displayed a significant DIE We suspect that more focused constructs such as basic mobility and daily activity function may have less potential for significant DIF than broader health-related concepts. We do agree with Dr Hart that DIF is an issue that should be examined both during item bank development and in CAT applications in various patient populations. (1)

With respect to Hart's suggestion that the daily activity and basic mobility domains of the AM-PAC might not be distinct in an outpatient population, we wish to clarify that outpatients were represented in our calibration samples. The AM-PAC's combined calibration samples of 1,041 patients in post-acute care included patients from 4 different care settings: outpatient therapy (n=237), home health care (n=246), skilled nursing or transitional care This article or section needs sources or references that appear in reliable, third-party publications. Alone, primary sources and sources affiliated with the subject of this article are not sufficient for an accurate encyclopedia article.  (n=138), and inpatient rehabilitation (n=420). (2) The AM-PAC was intentionally developed and tested in samples drawn from several post-acute care settings to provide users with one instrument that had the ability to track functional recovery across settings throughout an entire episode of post-acute care.

In separate analyses done on the outpatient sample used in this pilot study, we confirmed a distinction between the daily activities and basic mobility domains of the AM-PAC. We saw only a moderate positive correlation Noun 1. positive correlation - a correlation in which large values of one variable are associated with large values of the other and small with small; the correlation coefficient is between 0 and +1
direct correlation
 between the basic mobility and daily activity scales (0.40 on admission and 0.55 at discharge), suggesting the psychometric and clinical merits of keeping these 2 domains of activity function separate and distinct.

Given the dynamic nature of CAT outcome instruments, a feature that allows for periodic refinements and updating, Dr Hart raises an interesting concern about a potential challenge of keeping users (as well as journal editors and reviewers) current with pertinent changes in various CAT instruments being used with increasing frequency in health care. We agree that this is an important issue that must be taken seriously. Our current thinking is that CAT instrument developers might look to the broader software development field for guidance on how this might be efficiently accomplished by adopting a policy of labeling different versions of CAT software. For instance, although "version 1" (AM-PAC-CATv1) was examined in this pilot study, "version 2" (AM-PAC-CATv2) is soon to be released and will be the subject of future study. Accurate labeling of software and instruments may help readers and various users keep track of the evolution of CAT software.

Again, we thank Hart for his letter and look forward to further discussions of these and related issues relevant to the introduction and use of CAT instruments in health care.

Alan M Jette, Stephen M Haley, Wei Tao, Pengsheng Ni, Richard Moed, Doug Meyers, Mathew Zurek

AM Jette, PT, PhD, FAPTA FAPTA Fellows of the American Physical Therapy Association , is Director, Health & Disability Institute, Boston University Boston University, at Boston, Mass.; coeducational; founded 1839, chartered 1869, first baccalaureate granted 1871. It is composed of 16 schools and colleges.  of Public Health.

This letter was posted as a Rapid Response on March 6, 2007, at www.ptjoumal.org.

References

(1) Haley SM, Coster WJ, Andres PL, et al. Activity outcome measurement for postacute care. Med Care. 2004;42(1 suppl): 149-169.

(2) Haley SM, Ni P, Hambleton RK, et al. Computer adaptive testing improved accuracy and precision of scores over random item selection in a physical functioning item bank. J Clin Epidemiol. 2006;59:1174-1182.

[DOI: 10.2522/ptj.2007.87.5.611]
COPYRIGHT 2007 American Physical Therapy Association, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2007, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Letters to the Editor
Author:Zurek, Mathew
Publication:Physical Therapy
Article Type:Letter to the editor
Date:May 1, 2007
Words:2285
Previous Article:Thoracic kyphosis affects spinal loads and trunk muscle force.(Research Report)
Next Article:Scientific Evidence for Musculoskeletal, Bariatric, and Sports Nutrition.



Related Articles
LETTERS IN THE EDITOR'S MAILBAG.(Letters)(Letter to the editor)
Reform farm bill.(Letters)(Congress should replace wasteful subsidy system)(Letter to the editor)
LETTERS IN THE EDITOR'S MAILBAG.(Letters)(Letter to the editor)
LETTERS IN THE EDITOR'S MAILBAG.(Letters)(Letter to the editor)
SPORTS MAIL.(Letters)(Letter to the editor)
Prospective evaluation of the AM-PAC-CAT in outpatient rehabilitation settings.(Research Report)
Past princesses.(Letters)(Letter to the editor)
Elo's insights.(Letters)(Letter to the editor)
No faun.(Letters)(Letter to the editor)
Till we meet again.(Editor's Note)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles