Patient/Client Satisfaction.To the Editor: We write to comment on the article by Goldstein et al titled, "The Development of an Instrument to Measure Satisfaction With Physical Therapy" published in the September 2000 issue of Physical Therapy. Our concern is that we believe faulty procedures in that research quite likely led to incorrect conclusions. We contend that use of the instrument described in that article may perpetuate per·pet·u·ate tr.v. per·pet·u·at·ed, per·pet·u·at·ing, per·pet·u·ates 1. To cause to continue indefinitely; make perpetual. 2. misconceptions Misconceptions is an American sitcom television series for The WB Network for the 2005-2006 season that never aired. It features Jane Leeves, formerly of Frasier, and French Stewart, formerly of 3rd Rock From the Sun. and may mislead mis·lead tr.v. mis·led , mis·lead·ing, mis·leads 1. To lead in the wrong direction. 2. To lead into error of thought or action, especially by intentionally deceiving. See Synonyms at deceive. those who are interested in patient satisfaction. The introduction of the article professes an interest in the domains of patient satisfaction. The second paragraph on page 856 implies that no psychometric psy·cho·met·rics n. (used with a sing. verb) The branch of psychology that deals with the design, administration, and interpretation of quantitative tests for the measurement of psychological variables such as intelligence, aptitude, and analyses have been conducted in the area of patient satisfaction in physical therapy. This is puzzling because the previous year we published an extensive study on outpatient satisfaction in this same journal.[1] We reported on 3 samples with a total of 607 subjects from 21 different facilities. Our research was designed to cross-validate the Physical Therapy Outpatient Satisfaction Survey (PTOPS PTOPS Pilot Transportation Operational Personal Property Standard System ) across separate samples and by means of both exploratory and confirmatory factor analyses Verb 1. factor analyse - to perform a factor analysis of correlational data factor analyze analyse, analyze - break down into components or essential features; "analyze today's financial market" . We provided evidence to support cross-validation by replicating findings from phase 2 (model B) on a new sample in phase 3 through confirmatory factor analysis In statistics, confirmatory factor analysis (CFA) is a special form of factor analysis. It is used to assess the the number of factors and the loadings of variables. . We identified 4 domains of outpatient satisfaction: "Enhancers," "Detractors," "Location," and "Cost." We believe that there were several methodological flaws in the Goldstein et al study. First, the researchers used 20 items to assess 11 hypothesized satisfaction domains. The rationale for proposing several of these domains, in our view, was lacking. What is more important, however, is that there was no way to test the presence of 11 domains, because 6 of the hypothesized domains were assessed with only a single item. Therefore, we argue that it was impossible to develop any intra-domain variance. Because variance is a reflection of the extent to which the scores in a set differ among themselves"[2(p17)] and each set in this case consisted of only one item, it follows that there can be no variance when there is only one item per domain. While certainly there is intersubject variance, there can be no inter-item variance within domains. In the absence of this type of variance, those 6 items would gravitate grav·i·tate intr.v. grav·i·tat·ed, grav·i·tat·ing, grav·i·tates 1. To move in response to the force of gravity. 2. To move downward. 3. to a domain with a degree of similarity and reliability. Some authors have argued that a general rule in psychometrics psychometrics Science of psychological measurement. Psychometricians design and administer psychological tests (see psychological testing), both to generate empirical data on mental processes and to refine their understanding of measurement techniques and the is that hypothesized domains should contain a minimum of 3 items for purposes of domain reliability.[3] This was true of only 2 of the 11 hypothesized domains. Lacking sufficient alternatives, we believe the items clumped around a single factor. Another concern about the study results relates to the obtained item means. Table 3 on page 859 indicates that all of the items had extremely large means, that is, subjects were very favorably fa·vor·a·ble adj. 1. Advantageous; helpful: favorable winds. 2. Encouraging; propitious: a favorable diagnosis. 3. inclined toward physical therapy. Roush and Sonstroem[1] have discussed the unfavorable effects of high endorsement on test development. The article by Goldstein et al illustrates this well. Dividing the scale means in Table 3 by 19 (the number of items) indicates that each of the 20 items had a mean between 4.6 and 4.7 on a 5-point scale. It can be seen that items can experience a ceiling effect. For example, anyone who wants to test the effect of a particular program on patient satisfaction in a pretest-posttest design would find it difficult to show appreciable ap·pre·cia·ble adj. Possible to estimate, measure, or perceive: appreciable changes in temperature. See Synonyms at perceptible. gain. The extremely high scores at entry limit the degree of gain that can be demonstrated. Unfortunately, the authors did not address this important issue. The Table 3 data, in our view, indicate the presence of both compressed variance and ceiling effect. The Table 3 data indicate that the 20-item scale had a mean of about 93 (out of a possible 100 points). It is difficult to see how the standard deviation In statistics, the average amount a number varies from the average number in a series of numbers. (statistics) standard deviation - (SD) A measure of the range of values in a set of numbers. can be larger than 2, if that large. In the presence of this lack of variance and pronounced ceiling effect, it is difficult to believe that this scale can discriminate dis·crim·i·nate v. dis·crim·i·nat·ed, dis·crim·i·nat·ing, dis·crim·i·nates v.intr. 1. a. across levels of patient satisfaction. Finally, we have concerns about how the survey items were phrased. All of the survey items were phrased in a positive manner; no negative statements were included in the scale. The presence of positive statements exclusively tends to create an acquiescence Conduct recognizing the existence of a transaction and intended to permit the transaction to be carried into effect; a tacit agreement; consent inferred from silence. response set.[4,5] Using only positive items in the study quite probably promoted the high scores shown in Table 3, reduced the item pool variance, and led to the finding of a single factor. We are pleased with the interest being manifested in measuring patient satisfaction in physical therapy; understanding patient satisfaction, in our opinion, would be an outstanding asset for the profession. Additionally, we appreciate that different investigators may develop different tools to measure patient satisfaction, and we look forward to more research in this area. It is hoped that these endeavors will utilize sound test development procedures from the psychometric literature. We believe, however, that the issues presented above concerning the effort published in the September 2000 issue of the Journal discount the use of that instrument because of the failure to utilize such procedures. Susan E Roush, PT, PhD Associate Professor Physical Therapy Program University of Rhode Island History The University was first chartered as the state's agricultural school in 1888. The site of the school was originally the Oliver Watson Farm, and the original farmhouse still lies on the campus today. 25 W Independence Way Kingston, RI 02881 Robert J Sonstroem, PhD Professor Emeritus e·mer·i·tus adj. Retired but retaining an honorary title corresponding to that held immediately before retirement: a professor emeritus. n. pl. Physical Education Department University of Rhode Island References [1] Roush SE, Sontroem RJ. Development of the Physical Therapy Outpatient Satisfaction Survey (PTOPS). Phys Ther. 1999; 79:159-170. [2] Allen MJ, Yen WM. Introduction to Measurement Theory. Monterey, Calif. Brooks/Cole Publishing; 1979. [3] Comrey AL. Factor-analytic methods of scale development in personality and clinical psychology. J Consult Clin Psychol. 1988;56:754-761. [4] Kline P. A Handbook of Test Construction: Introduction to Psychometric Design. London, England: Methuen & Co Ltd; 1986:66-72. [5] Wiggins JS. Personality and Prediction: Principles of Personality Assessment. Reading, Mass: Addison-Wesley; 1973. Author Response: In response to Dr Roush and Dr Sonstroem, we contend that the use of the term "faulty procedures" in the first paragraph of their letter is too strong. We do not agree that the procedures were faulty, nor do we believe that they led to incorrect conclusions. We welcome the chance to respond to the questions and criticisms raised in the letter. Although it is clear that the authors of the letter believe that we blithely ignored their own article on patient satisfaction, published in Physical Therapy, this is not what happened. First, their article was cited in our report. Although the citation appeared later in our report, it was referred to and clearly appears in the list of references. In fact, we believe that the authors misinterpreted the statement in our article to which they refer. The statement that instruments generally have not been subjected to rigorous psychometric review is legitimate and refers to those instruments in the Patient Satisfaction Instruments: A Compenium.[1] The instruments listed in the Compendium com·pen·di·um n. pl. com·pen·di·ums or com·pen·di·a 1. A short, complete summary; an abstract. 2. A list or collection of various items. , which are typical of those currently used by physical therapists, have not been subjected to psychometric analysis. We did not imply that all instruments in use have not been evaluated, and we are pleased that Dr Roush and Dr Sonstroem developed an instrument that was rigorously tested. The third paragraph of the letter begins a series of statements about methodological flaws that Dr Roush and Dr Sonstroem perceive existed in our analysis of data. They state that we used 20 questions to assess 11 hypothesized domains of patient satisfaction. They misstated our intent and ignored what we wrote in several places. We never stated that items were selected to assess domains. Rather, we stated (paragraph 6, page 856) that 20 items were selected to represent domains. We assessed the construct validity construct validity, n the degree to which an experimentally-determined definition matches the theoretical definition. of the domains (see Tab. 4, page 860), but we, ourselves, stated (paragraph 2, page 859) that this type of analysis is not possible for domains with only 1 item. The letter goes on to raise the issue of lack of variance and the presence of a ceiling effect. We concur CONCUR - ["CONCUR, A Language for Continuous Concurrent Processes", R.M. Salter et al, Comp Langs 5(3):163-189 (1981)]. with this comment, and our concurrence CONCURRENCE, French law. The equality of rights, or privilege which several persons-have over the same thing; as, for example, the right which two judgment creditors, Whose judgments were rendered at the same time, have to be paid out of the proceeds of real estate bound by them. Dict. de Jur. h.t. is evident in the article. The second paragraph of the "Limitations" section stated that "a broad range of ratings for satisfaction is not available on which to establish and evaluate the psychometric properties of the instrument." We invite replication of the study. It hardly seems appropriate to say "the issues presented above ... discount the use of that instrument" when we, in fact, have pointed out the limitations of the analysis ourselves. We also do not agree with the comments about our decision to phrase all items in a positive direction. The authors cite references, stating that the "presence of positive statements exclusively tends to create an acquiescence response set." Survey instrument design is not an exact science. Shuman and Presser[2] contend that changing the wording of some items to a negative direction does not have a great effect on responses. Furthermore, having all questions phrased in a similar direction reduces response measurement error among respondents In the context of marketing research, a representative sample drawn from a larger population of people from whom information is collected and used to develop or confirm marketing strategy. who do not read each question carefully. It may also serve to decrease respondent In Equity practice, the party who answers a bill or other proceeding in equity. The party against whom an appeal or motion, an application for a court order, is instituted and who is required to answer in order to protect his or her interests. bias and increase return rate. A changing response set may very well tax the limited reading abilities of some patients/clients. Similarly, mental rotation of a visually presented response sequence might prove very challenging to individuals with certain kinds of learning disabilities. Our purpose in making all scales consistent was a conscious decision on our part to reduce respondent burden. At some point, clinical utility must be an important consideration in questionnaire design. Respondents--either patients/ clients or clinicians--will not complete an instrument that is time-consuming. Consideration of optimal response rate must be taken into account when designing a survey instrument. In fact, despite the extensive (and even dazzling) battery of psychometric testing psychometric test Any test used to quantify a particular aspect of a person's mental abilities or mindset–eg, aptitude, intelligence, mental abilities and personality. See IQ test, Personality testing, Psychological testing. to which Dr Roush and Dr Sonstroem subjected their own instrument, they perhaps failed to consider the most pertinent question in applied research. That is, having built the "better" mousetrap, will anyone actually use it, given the items it contains? Given the paucity pau·ci·ty n. 1. Smallness of number; fewness. 2. Scarcity; dearth: a paucity of natural resources. of the physical therapy literature on patient satisfaction, and the essential need to collect such data, we share Dr Roush's and Dr Sonstroem's concern that satisfaction instruments have not been widely incorporated into daily clinical practice. But, can one really imagine that all patients and clients will easily risk their relationships with physical therapists to whom they may have to return by plainly stating their dissatisfaction? It is quite difficult to challenge a perceived authority when one also perceives that same authority is responsible for his or her return to health. Although we still have much to learn about the psychometric properties of our instrument, and fully anticipate that a better one can and will be developed, we do believe that our instrument can and will be easily adopted into practice. Overall, we believe that the authors have created a straw man by claiming that we used wording that in fact we did not use. They then proceeded to knock it down with carefully calculated arguments. Our major problem with their approach is that their criticism of our study is based on a straw man. In summary, we believe that the instrument described in our article should and will be used in practice. We further believe that (1) the points alleged by the authors to be oversights were actually addressed in our article, (2) we cited the limitations of our study and made no grandiose grandiose /gran·di·ose/ (gran´de-os?) in psychiatry, pertaining to exaggerated belief or claims of one's importance or identity, often manifested by delusions of great wealth, power, or fame. claims about the instrument or our findings, (3) we stated there was little variance, (4) we acknowledged that we cannot establish construct validity on domains represented by a single item, and (5) we indicated that our results demonstrating that a single factor underlies the instrument were based solely on this sample. We invite additional testing of the instrument. It is hardly collegial col·le·gi·al adj. 1. a. Characterized by or having power and authority vested equally among colleagues: "He . . . , however, to recommend discounting the use of the instrument, and we applaud the expansion of the profession's body of knowledge that such testing will produce. Marc S Goldstein, EdD Director, Research Services Division of Practice and Research American Physical Therapy Association The American Physical Therapy Association (APTA) is a national professional organization representing more than 66,000 members. Its goal is to foster advancements in physical therapy practice, research, and education. 1111 N Fairfax St Alexandria, VA 22314 Steven D Elliott, PhD Senior Systems Analyst Westat Inc Rockville, Md Andrew A Guccione, PT, PhD, FAPTA FAPTA Fellows of the American Physical Therapy Association Senior Vice President Division of Practice and Research American Physical Therapy Association References [1] Patient Satisfaction Instruments: A Compendium. Alexandria, Va: American Physical Therapy Association; 1995. [2] Shuman H, Presser S. Questions and Answers in Attitude Surveys: Experiments on Question Form, Wording, and Context. San Diego San Diego (săn dēā`gō), city (1990 pop. 1,110,549), seat of San Diego co., S Calif., on San Diego Bay; inc. 1850. San Diego includes the unincorporated communities of La Jolla and Spring Valley. Coronado is across the bay. , Calif: Academic Press Inc; 1981. |
|
||||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion