Prospective evaluation of the AM-PAC-CAT in outpatient rehabilitation settings.


Computerized adaptive testing (CAT), an outcome measurement approach for comprehensive and precise assessment of patient-related outcomes, is being used with increasing frequency in the health care field. (1-3) This method of patient assessment uses a computer to administer test items to patients and is adaptive in the sense that each "test" is tailored to the unique level of each patient. Each person who takes an adaptive test is taking a different version of the test because the items are administered on the basis of the patient's previous responses. By avoiding the administration of a large number of questionnaire items and selecting only those questions from a large "item bank" that provide the maximum amount of information based on a person's responses to previous questions, CAT approaches allow for the rapid collection of accurate outcome information that can feasibly be implemented in busy clinical settings as well as in research settings. (4)

A CAT is programmed to first present an item from the mid-range of a predefined item bank of outcome questions and then directs subsequent questions to the patient's most appropriate level based on his or her previous responses. By having comprehensive item banks available for each outcome domain of interest, the CAT algorithm selects only the items that are needed to provide a score estimate based on a predetermined number of items or a predetermined level of measurement precision. This allows for fewer items to be administered to each patient while gaining accurate information regarding an individual's placement along an outcome continuum. (5) The development of comprehensive and methodologically sound item banks for each outcome of interest is a prerequisite to the development of psychometrically adequate CAT platforms that have clinical or research utility.

Item response theory (IRT) is both a theoretical framework and a collection of quantitative techniques used to construct outcome instruments, to scale responses to individual test items, and to equate scores, as well as to identify item bias and to facilitate CAT. (3,6) With IRT, items are calibrated on the same scale that is used to measure a patient's functional ability. As such, the items are inherently linked to the scale both in terms of ability and the amount of information that an item provides at some point along the scale. In a CAT application, items are selected based on maximum information near the individual's estimated level of ability, thus avoiding the administration of items that are too easy or too difficult. This property of IRT supports an efficient selection of items during a CAT administration. In essence, the CAT software is programmed to select the items that provide optimal information, thus leading to a precise and efficient estimate of the patient's ability. (7,8) This feature of CAT and IRT methods creates important flexibility in administering tests in a dynamic and tailored approach for each patient.
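The maximum-information selection step described above can be sketched in a few lines of Python. This is a minimal illustration, not the AM-PAC-CAT software: it assumes dichotomous items under a two-parameter logistic model (the AM-PAC itself uses polytomous items), and the function names and the three-item bank are hypothetical.

```python
import math

def item_information(theta, a, b):
    """Fisher information of a two-parameter logistic item at ability theta.

    a is the discrimination and b the difficulty (both hypothetical here).
    """
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

def select_next_item(theta, bank, administered):
    """Return the index of the unused item with the most information at theta."""
    candidates = [i for i in range(len(bank)) if i not in administered]
    return max(candidates, key=lambda i: item_information(theta, *bank[i]))

# Hypothetical bank of (discrimination, difficulty) pairs: easy, medium, hard.
bank = [(1.0, -2.0), (1.0, 0.0), (1.0, 2.0)]

first = select_next_item(0.1, bank, administered=set())
second = select_next_item(0.1, bank, administered={first})
print(first, second)
```

With equal discriminations, the item whose difficulty lies closest to the current ability estimate carries the most information, which is why items that are far too easy or too hard are passed over.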

Although CAT applications for health care have been recommended for nearly a decade (2,5,9) and a major set of papers on the subject was published in 2000, (7,10,11) the literature has been limited largely to position papers, (4,12-14) data simulations, (1,15-19) or small-scale prospective research demonstrations. (17,18,20)

If CAT applications are going to become widely accepted as a means of monitoring health care outcomes, prospective evaluations should become more readily available in the clinical literature. To our knowledge, no previous study has evaluated the prospective use of CAT in health care environments. Building on our previous work, (21-23) in this pilot study we prospectively evaluated the practical and psychometric adequacy of the Activity Measure for Post-Acute Care (AM-PAC) "item bank" and CAT assessment platform (AM-PAC-CAT) when applied within orthopedic outpatient physical therapy settings. Our evaluation consisted of 3 components: (1) a practical evaluation that included test efficiency of the CAT (ie, number of items used and amount of time needed to complete the CAT assessment); (2) a psychometric evaluation, including content range coverage, item exposure rate (IER), test precision, and person fit; and (3) an assessment of the validity and sensitivity to change of the score estimates derived by the AM-PAC-CAT.

In this study, we evaluated the Basic Mobility and Daily Activity scales of the AM-PAC-CAT. (24) Our intent was to identify areas where the prototype AM-PAC-CAT instrument was working well and where the CAT could be improved to enhance its utility for use in outpatient rehabilitation and related clinical settings.

Method

Instrument

The AM-PAC is an activity limitations instrument developed using the World Health Organization's International Classification of Functioning, Disability and Health (ICF). (25) Within the ICF, an activity limitation is defined as "difficulty in the execution of a task or action by an individual." (25(p14)) In developing the AM-PAC-CAT, we used 2 different samples, for a combined sample size of more than 1,000 patients in post-acute care settings. (17,22)

We developed an initial pool of AM-PAC items based on input from measurement and content experts, suggestions from several focus groups of people with disabilities, and a comprehensive literature review. Some items were modified from existing functional instruments, but adapted for difficulty or assistance response categories included in the AM-PAC. We framed the activity questions in a general fashion without specific attribution to health, medical conditions, or disabling factors. The AM-PAC data are collected by self-report, either through self-administration or through administration by a clinician or a trained data collector.

The Daily Activity scale item bank encompasses 65 distinct personal care and instrumental activities of daily living tasks. The Basic Mobility domain contains 120 basic physical activities such as bending, walking, carrying, or climbing stairs. Based on factor analytic work and IRT analyses, (21,23) Basic Mobility and Daily Activity scale domains were identified and confirmed. A third AM-PAC domain--applied cognitive activity--was not included because it was judged by the clinical sites participating in this study as not relevant to this patient population.

The IRT modeling of the items was conducted using the generalized partial credit model (GPCM). (26) The GPCM uses 2 parameters--item difficulty and discrimination--in estimating item locations and person scores and makes no assumptions regarding the similarity of item response categories across items. Adequate levels of reliability of individual items and validity of the AM-PAC have been established and have been reported previously. (21,27)
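For readers less familiar with the GPCM, the sketch below computes the response-category probabilities for a single polytomous item using the standard form of the model, in which the probability of category k is proportional to the exponential of the summed terms a(theta - b_v) over the first k step thresholds. The function name and the parameter values are hypothetical and are not AM-PAC calibrations.

```python
import math

def gpcm_probabilities(theta, discrimination, thresholds):
    """Category probabilities for one GPCM item at ability theta.

    thresholds holds one step difficulty per transition between adjacent
    categories, so an item with m thresholds has m + 1 response categories.
    """
    # Cumulative sums of a * (theta - b_v); the lowest category is fixed at 0.
    cumulative = [0.0]
    for b in thresholds:
        cumulative.append(cumulative[-1] + discrimination * (theta - b))
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(cumulative)
    weights = [math.exp(c - peak) for c in cumulative]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical three-category item with step thresholds at -1 and +1.
probs = gpcm_probabilities(theta=0.0, discrimination=1.0, thresholds=[-1.0, 1.0])
print([round(p, 3) for p in probs])
```

Because each item carries its own set of thresholds, nothing in the function requires the response categories to be spaced the same way from one item to the next, which matches the property of the model noted above.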

We developed a CAT version of the AM-PAC instrument (the prototype AM-PAC-CAT instrument) and have conducted a preliminary evaluation in samples of patients in post-acute care settings. (24) The CAT software includes options for item selection, score estimation using the expected a posteriori (EAP) estimator method, and stopping rules based on the number of items or level of precision.
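A rough sketch of EAP scoring and a two-part stopping rule follows. It is illustrative only and makes several assumptions that are not taken from the AM-PAC-CAT software: dichotomous two-parameter logistic items instead of polytomous ones, a standard normal prior, a fixed grid of ability values from -4 to 4, and hypothetical function names and defaults.

```python
import math

def eap_estimate(responses, grid=None):
    """Posterior mean and posterior SD of ability on a fixed grid.

    responses is a list of (discrimination, difficulty, answer) tuples,
    where answer is 1 for a positive response and 0 otherwise.
    """
    if grid is None:
        grid = [-4.0 + 0.1 * k for k in range(81)]
    posterior = []
    for theta in grid:
        weight = math.exp(-0.5 * theta * theta)  # unnormalized N(0, 1) prior
        for a, b, answer in responses:
            p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
            weight *= p if answer == 1 else 1.0 - p
        posterior.append(weight)
    total = sum(posterior)
    mean = sum(t * w for t, w in zip(grid, posterior)) / total
    variance = sum((t - mean) ** 2 * w for t, w in zip(grid, posterior)) / total
    return mean, math.sqrt(variance)

def should_stop(items_given, se, max_items=7, target_se=None):
    """Stop at the item limit, or earlier once an optional precision target is met."""
    if items_given >= max_items:
        return True
    return target_se is not None and se <= target_se

theta_hat, se = eap_estimate([(1.0, 0.0, 1), (1.0, 0.5, 1)])
print(round(theta_hat, 2), round(se, 2), should_stop(2, se))
```

The posterior standard deviation serves as the running standard error, so the same quantity feeds both the score report and the precision-based stopping option.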

In this study, we set a stop rule of administering no more than 7 items to each patient based on the participating clinic's desire to keep patient (and clinic staff) burden to an absolute minimum. We also used a content balancing algorithm that allowed AM-PAC items to be selected based on both content specifications and maximum information function for the first 3 items of the Basic Mobility scale and the first 4 items of the Daily Activity scale. (28)

The content balancing algorithm ensured that content chosen within the CAT item selection procedure was not limited to only one content aspect of the scale. For example, the first 3 items of the Basic Mobility scale were an item from each of the 3 major content areas: (1) bend/lift/reach/carry items, (2) mobility items, and (3) transfer items. Likewise, the CAT was programmed to select an item with the most information from one of each of the 4 Daily Activity scale content areas: (1) dressing items, (2) meal items, (3) grooming and hygiene items, and (4) instrumental activity items. Subsequent items in both scales then were selected based on maximum information at each iterative step.
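One way to picture content balancing is as a filter applied before the maximum-information choice: for the first few positions, only items from the required content area are eligible. The sketch below follows that idea under loud assumptions. The item parameters, the area labels, the order in which areas are visited, and the two-parameter logistic information function are all hypothetical stand-ins, not the published AM-PAC-CAT algorithm.

```python
import math

def information(theta, a, b):
    """Two-parameter logistic item information (a simplifying assumption)."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

def select_balanced(theta, bank, administered, area_order):
    """Pick the most informative unused item, honoring content areas first.

    For position k < len(area_order), only items from area_order[k] are
    eligible; afterwards the whole remaining bank is eligible.
    """
    candidates = [i for i in range(len(bank)) if i not in administered]
    position = len(administered)
    if position < len(area_order):
        restricted = [i for i in candidates if bank[i]["area"] == area_order[position]]
        if restricted:  # fall back to the full pool if the area is exhausted
            candidates = restricted
    return max(candidates, key=lambda i: information(theta, bank[i]["a"], bank[i]["b"]))

# Hypothetical four-item bank tagged with content areas.
bank = [
    {"a": 1.0, "b": 0.0, "area": "mobility"},
    {"a": 1.0, "b": 0.2, "area": "transfer"},
    {"a": 1.0, "b": 1.5, "area": "carry"},
    {"a": 1.0, "b": -0.1, "area": "mobility"},
]
area_order = ["carry", "mobility", "transfer"]

sequence = []
for _ in range(4):
    sequence.append(select_balanced(0.0, bank, sequence, area_order))
print(sequence)
```

Note that the first pick is the "carry" item even though it is the least informative at an ability of 0; that trade of a little precision for content coverage is the point of the balancing step.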

Estimated AM-PAC scores for each subject in the sample were converted to norm-based scoring, which is a simple linear translation that expresses scores as deviations from a measure of central tendency. In this study, we used a mean of 50 and a standard deviation of 10. By using norm-based scoring instead of the more traditional 0-100 scale, as we raise the ceiling or lower the floor of a scale in the future by adding and calibrating new items, the placement (and scoring) of the item thresholds in relation to the average does not change. We based the CAT algorithms used in this study on software developed at the Health and Disability Research Institute, Boston University.
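The linear translation to a mean of 50 and a standard deviation of 10 can be written as a one-line function. In this sketch the reference mean and standard deviation of the underlying ability scale are hypothetical inputs, since the calibration values are not given here.

```python
def norm_based_score(theta, reference_mean, reference_sd,
                     target_mean=50.0, target_sd=10.0):
    """Rescale an ability estimate so the reference group has mean 50 and SD 10."""
    return target_mean + target_sd * (theta - reference_mean) / reference_sd

# Hypothetical reference group with mean 0 and SD 1 on the ability scale.
for theta in (-0.5, 0.0, 1.0):
    print(theta, norm_based_score(theta, reference_mean=0.0, reference_sd=1.0))
```

Because the transformation depends only on the reference mean and standard deviation, adding harder or easier items later extends the reachable range without moving any existing score relative to the average.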

Subjects

Subjects for this study, conducted in 2005, consisted of a convenience sample of 1,815 patients with spine, lower-extremity (LE), or upper-extremity (UE) impairments who received outpatient physical therapy in 1 of 20 outpatient clinics across 5 states that were operated by HealthSouth's Outpatient Division Inc.

Background characteristics of the study sample, by major impairment grouping, are shown in Table 1. The sample was predominantly female, with a mean age between 46.8 and 51.4 years.

Data Collection

On their initial and discharge visits for physical therapy, subjects completed the self-report AM-PAC-CAT on a tablet computer provided to them in the clinic waiting room prior to their physical therapy visit. An office staff member was available to the subjects during the administration process to answer any questions. The 1,815 subjects included in this analysis completed both admission and discharge AM-PAC-CATs.

Subject demographic information, acuity level, surgical status, and major impairment were all available from administrative data collected routinely by each outpatient clinic. Reliability and validity data on these administrative data elements were not available. Acuity was defined as the number of days from the onset of the condition for which therapy was being sought to the admission visit to the physical therapy clinic. Payer source was defined as the primary source of payment for that physical therapy episode of care. Spine impairments included impairments of the cervical, thoracic, or lumbosacral region of the spine. Upper-extremity impairments included conditions of the shoulder, elbow, hand, or wrist. Lower-extremity impairments were conditions of the hip, knee, foot, or ankle.

Data Analysis

To evaluate the practical utility of the AM-PAC-CAT, we assessed the CAT's efficiency, which was defined as the number of CAT items administered per assessment and the amount of time taken to complete the CAT. In the psychometric evaluation, we assessed the content range of each scale item pool, IER, test precision, and model fit in this sample. We also evaluated scale score validity and sensitivity to change over the episode of care.

Content range coverage assessed how well the AM-PAC item bank captured the range of physical functioning experienced by the subjects in each Activity Limitation scale content domain. We examined potential ceiling effects (ie, the point at which subjects received the highest score) and floor effects (ie, the point at which subjects received the lowest possible score).

The IER identified which AM-PAC items were administered more often in the CAT application. Item exposure rate was defined as the ratio of the total number of times an item was administered over the total number of test occasions in a CAT study. Plots of the IER against item difficulty levels were constructed to detect possible relationships between frequencies of items being selected and their difficulty levels. The IER is influenced by the difficulty and discrimination of items, the distribution of ability of the patients, what other similar items are in the item bank, and the specific content balancing specifications developed for each scale. (29,30)
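The IER definition above (times an item was administered divided by the number of test occasions) translates directly into code. The sketch below uses hypothetical item identifiers and four made-up CAT sessions; it assumes an item can appear at most once per session.

```python
from collections import Counter

def item_exposure_rates(sessions, bank_ids):
    """Share of test occasions on which each item in the bank was administered.

    sessions is a list of item-id lists, one list per CAT administration.
    Items that were never selected are reported with a rate of 0.
    """
    counts = Counter(item for session in sessions for item in set(session))
    occasions = len(sessions)
    return {item: counts[item] / occasions for item in bank_ids}

# Hypothetical data: item "a" is the fixed starting item, "e" is never used.
sessions = [["a", "b"], ["a", "c"], ["a", "b"], ["a", "d"]]
rates = item_exposure_rates(sessions, bank_ids=["a", "b", "c", "d", "e"])
print(rates)
```

Plotting these rates against item difficulty, as described above, then only requires pairing each item's rate with its calibrated difficulty.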

Test precision was examined in this sample using the test information function (TIF). The TIF is a summary of information provided by individual items in the instrument and identifies where along an underlying scale the items have their best level of discrimination and measurement precision. Although the ideal for a CAT instrument is equal measurement precision (small standard errors of measurement) at all levels of ability, there is likely to be some variability of measurement precision for a certain group of people depending on their level of ability on the scale. The location on the scale where the test information curve peaks indicates the portion of the ability scale best measured by that instrument. When the test information peaks at around the same range on the scale as the peak of the patients' ability distribution, the instrument is assumed to "fit" the population being measured.

Test information function values are closely related to the calculation of standard error (SE) of the person ability estimates. Specifically, the SE of the person ability estimate is inversely proportional to the square root of the TIF value: SE = 1/square root(TIF). To illustrate the precision levels of CAT scores at different ability levels, we also calculated the average SE of estimates for people at different score ranges. Confidence intervals (CIs) of the estimates were generated by multiplying the SE by a z score corresponding to a certain confidence level.
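The relationship between test information, standard error, and confidence intervals can be checked numerically. In the sketch below the function names are hypothetical and the information value of 0.25 is invented for illustration; the score of 35 with an SE of 2.16 is the worked example reported in the Results for the Basic Mobility scale.

```python
import math

def standard_error(test_information):
    """SE of the ability estimate: the reciprocal square root of the TIF value."""
    return 1.0 / math.sqrt(test_information)

def confidence_interval(score, se, z=1.96):
    """Symmetric interval of score plus or minus z times SE (z = 1.96 for 95%)."""
    half_width = z * se
    return score - half_width, score + half_width

# Hypothetical information value: more information means a smaller SE.
print(standard_error(0.25))

# Worked example from the Results: score 35, SE 2.16, 95% confidence.
low, high = confidence_interval(35, 2.16)
print(round(low, 2), round(high, 2))
```

The second call reproduces the interval of 30.77 to 39.23 given in the Results.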

To assess sample fit to the CAT model, we estimated the degree to which the subjects' responses to items met the hierarchical assumptions of the fixed calibrations used in the CAT for the Basic Mobility and Daily Activity scales. For any IRT scale, an important assumption is that item difficulty locations on the underlying functional scale are similar for all people and that these locations have a predetermined hierarchy that applies to most individuals. To test this assumption, we used a standardized log-likelihood statistic (l_z) for polytomous items to test for person fit. (31) The empirical distribution of the log-likelihood statistic is reasonably close to a standardized normal distribution, so we calculated the percentage of administrations (both at admission and discharge) in which l_z exceeded an alpha level of .05.
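A standardized log-likelihood person-fit statistic for polytomous items is commonly computed as the observed log-likelihood of a response pattern minus its model-expected value, divided by its model-implied standard deviation. The sketch below follows that common form; it is not taken from the AM-PAC-CAT software. The category probabilities are hypothetical, and the one-sided cutoff of -1.645 used to flag misfit is an assumption about how an alpha level of .05 might be applied.

```python
import math

def lz_statistic(category_probs, responses):
    """Standardized log-likelihood of one person's response pattern.

    category_probs[i] lists the model probabilities of each response category
    for item i at the person's estimated ability (all must be > 0);
    responses[i] is the index of the category the person actually chose.
    """
    observed = expected = variance = 0.0
    for probs, chosen in zip(category_probs, responses):
        logs = [math.log(p) for p in probs]
        observed += logs[chosen]
        item_mean = sum(p * l for p, l in zip(probs, logs))
        expected += item_mean
        variance += sum(p * l * l for p, l in zip(probs, logs)) - item_mean ** 2
    return (observed - expected) / math.sqrt(variance)

# Hypothetical person: five items, each with the same three-category profile.
probs = [[0.7, 0.2, 0.1]] * 5
typical = lz_statistic(probs, [0, 0, 0, 0, 0])     # always the likeliest category
unexpected = lz_statistic(probs, [2, 2, 2, 2, 2])  # always the least likely one
print(round(typical, 2), round(unexpected, 2), unexpected < -1.645)
```

Large negative values mark response patterns that are much less likely than the model predicts, which is the sense of "misfit" used in person-fit analysis.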

Validity of CAT score estimates was assessed using construct validation techniques. To provide evidence for construct validity of the AM-PAC-CAT scales, we compared AM-PAC-CAT scores between subjects with less than 35 acuity days (the median) and subjects with more than 35 acuity days and between subjects who had postsurgery treatment and those who had not. We hypothesized that earlier treatment following the onset of a condition and treatment after surgery would be associated with more improvement on both AM-PAC outcome scales.

Sensitivity to change was examined using one-sample dependent t tests to determine whether the increase in AM-PAC-CAT scores between admission and discharge from therapy was significantly greater than zero. In addition, we calculated the minimal detectable change (MDC) and the MDC proportion. The MDC is considered the minimal amount of change that is not likely to be due to measurement error. It is one of the more common distribution-based change indexes, which can be used to identify reliable changes in function, strength (force-generating capacity), and walking efficiency. (32) The MDC can be reported at different confidence levels. We chose to report both the MDC_68 and MDC_90 confidence levels in this article. The MDC proportion was calculated as the proportion of people whose change scores were equal to or above the MDC. In calculating the MDC, we used test-retest reliability estimates on the short-form AM-PAC from our earlier work. (27)
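The exact MDC formula is not stated here, so the sketch below uses a widely used formulation as an assumption: the standard error of measurement (SEM) is the baseline standard deviation times the square root of 1 minus the test-retest reliability, and the MDC is the SEM times the square root of 2 times a z value (taken here as 1.0 for the 68% level and 1.645 for the 90% level). The standard deviation, reliability, and change scores are hypothetical.

```python
import math

def minimal_detectable_change(baseline_sd, reliability, z):
    """MDC = z * SEM * sqrt(2), with SEM = SD * sqrt(1 - reliability)."""
    sem = baseline_sd * math.sqrt(1.0 - reliability)
    return z * sem * math.sqrt(2.0)

def mdc_proportion(change_scores, mdc):
    """Share of people whose change score is equal to or above the MDC."""
    return sum(1 for change in change_scores if change >= mdc) / len(change_scores)

# Hypothetical inputs: SD of 10 points and test-retest reliability of 0.91.
mdc_68 = minimal_detectable_change(10.0, 0.91, z=1.0)
mdc_90 = minimal_detectable_change(10.0, 0.91, z=1.645)
print(round(mdc_68, 2), round(mdc_90, 2))

# Hypothetical admission-to-discharge change scores for four people.
print(mdc_proportion([0.0, 5.0, 7.0, 10.0], mdc_90))
```

A stricter confidence level raises the threshold, so the MDC proportion at the 90% level can never exceed the proportion at the 68% level for the same sample.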

Results

The mean length of time to complete the Basic Mobility scale was 1.9 minutes, using, on average, 6.6 items per CAT session from the Basic Mobility scale item pool. The mean length of time to complete the Daily Activity scale was 1.01 minutes, using, on average, 6.8 items from the Daily Activity scale item pool. The percentages of cases using the maximum number of items allowed in this application (7 items) were 72% for the Basic Mobility scale and 87% for the Daily Activity scale.

Content Coverage

The mean Basic Mobility scale scores at the admission and discharge visits, for the total sample and by impairment group, are listed in Table 2. The mean admission score of the total group was 63.3, and the mean discharge score of the total group was 68.7, an average increase of 5.4 units. When broken down into the 3 impairment groups, the UE group had the highest Basic Mobility scale scores and the LE group had the lowest scores in both admission and discharge sessions.

There was neither a ceiling effect (highest possible AM-PAC score estimate for a subject) nor a floor effect (lowest possible AM-PAC score estimate for a subject) in the admission Basic Mobility scale, but on discharge, 10% of the total sample achieved the highest possible score. Ceiling effects were the greatest for the UE group, where 12.7% scored the highest value at discharge. Figure 1 shows that the admission scores were roughly normally distributed for each impairment group. However, in the discharge session (Fig. 1), the Basic Mobility scale scores were negatively skewed, illustrating some ceiling effect.

[FIGURE 1 OMITTED]

The mean Daily Activity scale scores at the admission and discharge visits, for the total sample and by impairment group, are shown in Table 3. The mean Daily Activity scale admission score of the total group was 57.0, and the mean discharge score of the total group was 60.9, an increase of 3.9 units. When broken down into the 3 impairment groups, the LE group had the highest Daily Activity scale scores and the UE group had the lowest scores in both admission and discharge sessions. There was no floor effect at either visit, but there were substantial ceiling effects. A greater proportion of subjects scored at 65.3, very close to the maximum possible score of 67. Therefore, for this scale, we expanded the definition of ceiling effect to contain the score range from 65.3 to the maximum. In the admission session, 25% of the total sample displayed a ceiling effect on the Daily Activity scale. The greatest ceiling effect was seen for the LE group on admission, where 32.3% of the subjects scored at the ceiling on this scale. In the discharge session, almost half of the subjects scored at the ceiling on the Daily Activity scale, with the greatest ceiling effect (62.6%) seen for those subjects with an LE impairment. The frequency distributions presented in Figure 2 illustrate the negatively skewed distributions for the subjects at admission and at discharge.

[FIGURE 2 OMITTED]

Item Exposure Rate

In the Basic Mobility scale item pool, one item ("Bending over to pick up something") was administered at every test occasion (IER=100%) because it was the predetermined starting rule. Eighteen items (15%) were not administered, and 81 items (67.5%) were exposed below 5% of the time. Table 4 displays the 21 Basic Mobility scale items that achieved an IER greater than 5% in the total sample across admission and discharge administrations.

In the Daily Activity scale item pool, 2 items were highly used (with an IER between 90% and 100%). All 65 items in the pool were used in this study, and a majority of the items (76.9%) had an IER below 5%. The 15 items with an IER greater than 5% are shown in Table 5.

Figures 3 and 4 provide a chart of IER for the Basic Mobility and Daily Activity scale item pools by item difficulty level for the total sample. For the Basic Mobility scale item pool, although items on the upper half of the scale were exposed more often than items on the lower half of the scale, the distribution of the higher IER items was roughly even across the upper half of the scale. In contrast, for the Daily Activity scale domain, the higher IER items were clustered within a smaller range at the higher end of the scale. This pattern reflects the ceiling effect of the whole item bank illustrated in previously described results.

[FIGURES 3-4 OMITTED]

Test Precision

Figures 5 and 6 contrast the TIFs for the full set of items and the TIFs for the items selected most often by the CAT (across both admission and discharge sessions). A higher level of information indicates greater measurement precision at that point along the scale. For the Basic Mobility scale domain, the TIF curve for the entire test pool peaked around 55 units on the ability scale, and the TIF curve for the 16 most frequently exposed items in CAT administrations shifted somewhat to the right and peaked at around 60. In the Daily Activity scale domain, the full item bank TIF curve peaked at around 40 on the ability scale. The TIF for the items administered most frequently by the CAT peaked at around 45. As expected from the distribution of scores, the most frequently used CAT items had optimal precision at higher levels of daily activity functioning than the overall TIF for the full item bank.

[FIGURES 5-6 OMITTED]

Table 6 presents the SE of estimates for subjects at different ability levels, which were calculated after combining the admission and discharge sessions for each scale. As shown in the table, for the Basic Mobility scale, scores between 50 and 69 were estimated the most precisely (SE=1.99). As the ability level moved farther away from this range, the precision level decreased. This table also presents the 95% CI for each score range, obtained by multiplying the average SE of the estimate by 1.96 ([Z.sub.0.95]=1.96). For example, the average SE of the estimate for a Basic Mobility scale score between 30 and 49 is 2.16 points and the 95% confidence width is ±4.23 (1.96 x 2.16). Therefore, if a person scores 35, we are 95% confident that the true ability level of this person is between 30.77 (35-4.23) and 39.23 (35+4.23). For the Daily Activity scale, due to the ceiling effect, no subject scored above 70; thus, no SE is available for this range. The most precisely estimated score range is between 30 and 49, with SE equal to 1.98 points, and the least precisely estimated score range is between 50 and 69, with SE equal to 5.32 points.
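
The interval arithmetic in the Basic Mobility example above can be reproduced with a short sketch. The function name and its parameters are illustrative and are not part of the AM-PAC software; the score (35) and the average SE (2.16) are the values quoted in the text.

```python
def confidence_interval(score, se, z=1.96):
    """Return (half_width, lower, upper) for a score with standard error se."""
    half_width = z * se
    return half_width, score - half_width, score + half_width

# Values from the worked example: score 35, average SE 2.16.
half_width, lower, upper = confidence_interval(35, 2.16)
print(f"half-width {half_width:.2f}, interval {lower:.2f} to {upper:.2f}")
# prints: half-width 4.23, interval 30.77 to 39.23
```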

Model Fit

Person score misfit occurs when a person answers an item or items in a very unexpected way, given the estimate of functional ability from other item responses. Using the log-likelihood test, a misfitting item profile was detected in only 3% of the Basic Mobility scale test administrations and in only 2% of the Daily Activity scale test administrations.

Construct Validity

If both AM-PAC scales discriminated properly, we expected to see greater increases in basic mobility and daily activity for those subjects who were below the median level of acuity compared with those who were above the median level and for those who received postsurgical treatment compared with those who did not receive postsurgical treatment. As hypothesized, the data presented in Table 7 revealed that there were statistically significant differences in level of improvement in the Basic Mobility scale as a function of a subject's acuity level and his or her surgical status. The Daily Activity scale discriminated across acuity subgroups, but the difference was not statistically significant for the surgical status subgroups.

Sensitivity to Change

The sensitivity to change between admission and discharge visits of the Basic Mobility and Daily Activity scales is shown in Tables 2 and 3. The Basic Mobility and Daily Activity scales detected statistically significant mean score increases for the total sample and by each impairment group, with moderate to large effect sizes and standardized response means. Effect sizes for the Basic Mobility scale ranged from 0.34 for UE impairments to 0.91 for LE impairments. For the Daily Activity scale, the range was from 0.42 for spine impairments to 0.60 for UE impairments.
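
For readers less familiar with these indices, the sketch below shows how an effect size and a standardized response mean are conventionally computed. It assumes the usual definitions (mean change divided by the SD of admission scores, and mean change divided by the SD of the change scores); the formulas used in this study may differ in detail. The function names and the five admission and discharge scores are made up for illustration and are not study data.

```python
from statistics import mean, stdev

def effect_size(admission, discharge):
    """Mean change divided by the SD of admission scores (assumed definition)."""
    changes = [d - a for a, d in zip(admission, discharge)]
    return mean(changes) / stdev(admission)

def standardized_response_mean(admission, discharge):
    """Mean change divided by the SD of the change scores (assumed definition)."""
    changes = [d - a for a, d in zip(admission, discharge)]
    return mean(changes) / stdev(changes)

# Made-up scores for five hypothetical patients.
admission = [50, 55, 60, 65, 70]
discharge = [58, 57, 66, 75, 74]

es = effect_size(admission, discharge)
srm = standardized_response_mean(admission, discharge)
print(f"effect size {es:.2f}, standardized response mean {srm:.2f}")
```

Both indices share the same numerator, so they differ only when the spread of change scores differs from the spread of admission scores.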

Among the 3 impairment groups, the LE impairment group experienced the highest Basic Mobility scale mean score increase (8.32 units), followed by the spine impairment group (4.83 units) and then by the UE impairment group (2.78 units). The Daily Activity scale also detected significant mean score increases for the total sample (3.9 units) and by each impairment group. Among the impairment groups, the UE impairment group experienced the highest mean Daily Activity scale score increase (5.65 units), followed by the LE impairment group (3.69 units) and then the spine impairment group (2.89 units).

For the Basic Mobility scale, 60% of the patient episodes exceeded the [MDC.sub.68] and 49% exceeded the [MDC.sub.90]. For the Daily Activity scale, 50% of the patient episodes exceeded the [MDC.sub.68] and 42% exceeded the [MDC.sub.90]. The proportion of patients who exceeded the MDC varied across impairment groups, as shown in Tables 2 and 3.
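
The proportions above depend on an MDC threshold. As a rough sketch, assuming the commonly used formula (MDC = z x SEM x square root of 2, with SEM = SD x square root of 1 minus test-retest reliability, z=1.0 for the 68% level and z=1.645 for the 90% level), the calculation looks as follows. The SD, reliability, and change scores are hypothetical, the function names are illustrative, and the study's own computation may differ.

```python
import math

def standard_error_of_measurement(sd, reliability):
    """SEM from a score SD and a test-retest reliability (assumed formula)."""
    return sd * math.sqrt(1.0 - reliability)

def minimal_detectable_change(sem, z):
    """MDC at the confidence level implied by z (assumed formula)."""
    return z * sem * math.sqrt(2.0)

def proportion_exceeding(changes, threshold):
    """Share of change scores strictly greater than the threshold."""
    return sum(1 for c in changes if c > threshold) / len(changes)

# Hypothetical inputs, not taken from the study.
sd, reliability = 10.0, 0.91
sem = standard_error_of_measurement(sd, reliability)
mdc68 = minimal_detectable_change(sem, 1.0)
mdc90 = minimal_detectable_change(sem, 1.645)

changes = [1.0, 3.5, 5.0, 7.5, 9.0, 12.0]
share68 = proportion_exceeding(changes, mdc68)
share90 = proportion_exceeding(changes, mdc90)
print(f"MDC68 {mdc68:.2f}, MDC90 {mdc90:.2f}")
print(f"exceeding MDC68: {share68:.0%}, exceeding MDC90: {share90:.0%}")
```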

Discussion and Conclusions

The CAT outcome instruments are intuitively appealing for use as quality monitoring tools within and across various clinical settings due to their promise of reducing respondent burden and minimizing data collection costs without sacrificing their psychometric properties. The findings from this study provide initial prospective evidence that CAT instruments can deliver on this promise. The 2 AM-PAC-CAT scales in this study used, on average, 6 to 7 items per scale to estimate AM-PAC scores for the 120-item Basic Mobility scale and the 65-item Daily Activity scale. These 2 AM-PAC-CAT scales were completed, on average, in under 2 minutes, making them practical to use in busy clinical settings. However, to be truly useful in tracking functional outcomes for the purpose of quality monitoring, CAT scales must meet several psychometric standards as well.

The present study is the first attempt, to our knowledge, to prospectively evaluate the psychometric utility of 2 CAT-based Activity Limitation outcome scales within an actual clinical setting for the purpose of monitoring functional outcomes. This evaluation included an assessment of the scale distributions and content coverage, particularly their ceiling and floor effects; CAT selection of items from the underlying item pool; precision of the test; and construct validity and sensitivity to change, along with an examination of how well the item banks fit this sample of patients receiving outpatient orthopedic physical therapy services. Overall, the findings are encouraging, yet they do suggest areas for improvement that would advance measurement in this sample.

The AM-PAC-CAT's Basic Mobility scale demonstrated excellent psychometric properties when applied in this outpatient rehabilitation sample. The frequency distributions were roughly normally distributed, with no floor effects and only modest ceiling effects (10% at discharge). Although the Basic Mobility scale was sensitive to change in all 3 impairment groups, it was most sensitive to change among those subjects with primary LE and spinal impairments. The effect size level for the Basic Mobility scale was 0.34 for subjects with UE impairments, but an effect size level of 0.91 was achieved for subjects with LE impairments. The greatest proportion of subjects exceeding the Basic Mobility scale MDC (66.1%) was seen in those with LE impairments. The Basic Mobility scale worked least well for subjects with UE impairments, which makes clinical sense when one considers that people with UE conditions are less likely to experience mobility limitations in the types of activities measured by this scale.

The Basic Mobility scale also discriminated well among subjects as a function of their acuity level and their postsurgical status. Among a pool of 120 items, the CAT relied primarily on 21 Basic Mobility scale items. While the CAT relied most frequently on those items in the upper half of the Basic Mobility scale, the distribution of the higher-end items was roughly even across the upper half of the scale. Considering that the AM-PAC was designed for patients in post-acute care inpatient and outpatient settings, one would expect the items used in an outpatient sample to be selected from the upper half of the item bank. Based on the TIF of the most frequently used CAT items, analyses revealed that the greatest measurement precision occurred when a person's Basic Mobility scale score was between 50 and 60, with less precision being achieved above and below this range. It is clear that new items located at the upper (better functioning) end of the Basic Mobility scale could help reduce ceiling effects and improve measurement precision.

The AM-PAC-CAT's Daily Activity scale demonstrated less adequate psychometric properties than the Basic Mobility scale in this outpatient sample. Analyses revealed several areas where the Daily Activity scale is in need of revision and improvement to best suit the needs of people in orthopedic outpatient settings. The frequency distributions of the Daily Activity scale scores were negatively skewed for subjects in each impairment group on admission to and at discharge from physical therapy care. There was a substantial ceiling effect in the Daily Activity scale scores for all 3 impairment groups, especially in the LE impairment group, where the ceiling was reached by 32.3% of the subjects on admission and by 62.6% of the subjects at discharge. The Daily Activity scale discriminated among subjects as a function of their acuity status and detected significant mean score increases in function for all 3 impairment groups.

Despite the shortcomings of the Daily Activity scale, the group effect sizes achieved were substantial: the range was from 0.42 for spine impairments to 0.60 for UE impairments. The Daily Activity scale was most sensitive to change among those subjects with UE impairments, which also makes clinical sense because UE impairments are the type of condition most likely to affect personal care and performance of instrumental activities of daily living. Among a pool of 65 Daily Activity scale items, the CAT relied primarily on 15 items, which were predominantly located in the upper end of the item pool. The TIF curve for the items most frequently selected by the CAT on the Daily Activity scale revealed that these items provided less information for the subjects with Daily Activity scale scores above 60 units. For improved measurement precision at higher levels of functioning, particularly for improving the precision of individual score estimates, these findings suggest that the Daily Activity scale needs revision and addition of new items to make the scale more useful for outpatients of the type seen in this study.

An important advantage of CAT methodology, in contrast to traditional fixed-form measurement approaches, is the ability to readily update and improve the item bank as well as the CAT algorithms as problems are identified. Based on the results of this study, our research group has developed new questionnaire items for the Basic Mobility and Daily Activity scale item banks and has tested them, along with the existing AM-PAC-CAT scales, within a new sample of outpatients receiving physical therapy services. We are currently examining these new items in an IRT analysis to determine whether they fit the Basic Mobility or Daily Activity outcome scales, have content advantages over current items, and have locations on these outcome scales that fill in the content gap identified in the current study. Once these new IRT analyses are completed, the new items will be incorporated into the next revision of the Basic Mobility and Daily Activity scale item bank and CAT programs. In this sense, CAT outcome instruments can be viewed as dynamic, with the potential for continuous updating and improvement.

One of the concerns over using CAT-based outcome instruments is whether restricting the number of items administered to a patient (a maximum of 7 items in this study) could diminish the sensitivity of the instrument to change. With regard to this issue, it was encouraging to note that the effect sizes observed using the AM-PAC-CAT were comparable to those observed in similar types of patients followed with more traditional fixed-form functional outcome tools. For example, in this study, we observed an effect size of 0.91 with the AM-PAC Basic Mobility scale when used with patients with LE impairments. This finding compares with an effect size of 0.94 at 4 weeks for the Activities of Daily Living Scale in subjects with knee impairments, (33) an effect size of 0.93 for the Lysholm Knee Rating Scale, and an effect size of 0.81 that was observed on the Physical Function scale of the 36-Item Short-Form Health Status questionnaire (SF-36) when applied in a sample of outpatients with knee impairments. (35) In our study, the average AM-PAC-CAT Basic Mobility scale effect size was 0.62 in subjects with spinal impairments, which compares with an effect size of 0.70 that was observed on the Physical Function scale of the SF-36 when applied in a sample of outpatients with cervical and lumbar impairments. (36) Future studies will be directed at evaluating the extent to which adding more than the 7 items per scale may improve upon the levels of sensitivity observed in this study.

Limitations

There are several limitations to the pilot study that should be noted. The first is that the subjects were a convenience sample of outpatients drawn from 20 outpatient practices. As with any convenience sample, we have no way of determining the extent to which these subjects represent the populations served by these clinics.

The reader also should note that only those subjects who completed both admission and discharge AM-PAC-CATs were eligible for our analyses. Securing discharge CATs in these busy clinical practices was a problem. Although the sample for this paper consisted of only 38% of all subjects who had completed the admission AM-PAC-CAT, those subjects who completed an admission AM-PAC-CAT but not a discharge AM-PAC-CAT were very similar to those who completed both instruments. Subjects who completed only the admission AM-PAC-CAT versus subjects who completed both admission and discharge AM-PAC-CATs were slightly younger (mean age=48 years versus 50 years), were more likely to have a spinal impairment (36% versus 32%), and were less likely to be receiving postsurgical treatment (24% versus 28%). The mean Basic Mobility and Daily Activity scale scores for patients who completed only the admission AM-PAC-CAT were 62.7 and 56.5, not statistically different from the mean scores of 62.9 and 56.8 for the final sample. Finally, we used test-retest estimates from an earlier study of the AM-PAC that was done with both inpatients and outpatients who were receiving post-acute care. (27) The ideal approach would have been to derive test-retest estimates from a sample of subjects from orthopedic outpatient settings. We were unable to do so in this study, so we used the estimates from our earlier work. These methodological limitations should be kept in mind when interpreting our findings.

Implications

We believe that contemporary measurement methods such as IRT and CAT methodology present an exciting innovation that has the potential to transform the way in which patient-based outcome assessments are conducted within and across health care settings. The National Institutes of Health, for example, has recently included CAT approaches as part of its Roadmap and has funded major multi-year CAT projects to develop clinical research applications (14) designed to ensure more uniformity in outcome endpoints used for clinical trials. Because CAT assessments provide an accurate, real-time measurement of outcomes, the CATs can readily be used to track patient-reported outcomes to clinical interventions, making them attractive for use in quality-monitoring systems applied across various clinical settings. (37)

We believe that the advantages of CAT-based instruments are likely to be maximized when applied across various post-acute care settings where the breadth of the CAT-based instrument will be of maximum advantage. For instance, when used to monitor patient outcomes across inpatient rehabilitation, nursing home, and home health care settings, the sensitivity of the AM-PAC-CAT has been shown to be superior to traditional setting-specific instruments such as the Functional Independence Measure. (38,39)

Future CAT development should include work that attempts to balance the utility of generating scores for groups of patients (as was done in this study) with a desire by clinicians to use these CAT assessments as a source of usable information for individual treatment planning and specific patient monitoring. Past efforts that have tried to use group-level outcome assessment tools for individual patient assessment have largely been disappointing. (40) The problem is that group-level instruments yield imprecise and insensitive scores for individual patients. This problem might be solved using CAT methodology. (2)

In theory, it is possible to generate CAT item selection algorithms that would select items to be administered to a patient from the relevant underlying item pool based on clinical considerations as well as on maximizing information for the CAT estimate. Computerized adaptive testing methodology allows the user to obtain reliability estimates at the level of the individual person, thus facilitating the selection of a sufficient number of items for longitudinal assessment of individual change over time. An example of this individual patient approach, using a CAT version of the Pediatric Evaluation of Disability Inventory, was recently published. (32) A challenge to developing CAT applications that are useful at the individual patient level in rehabilitation is to provide sufficient information at the patient level while minimizing patient response burden so that CATs remain feasible to use in clinical practice.
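
A minimal sketch of such a selection rule is shown below. It assumes a two-parameter logistic information function as a simplified stand-in for the model behind the AM-PAC, and it treats the clinical consideration as a simple exclusion list. The function names, item names, and item parameters are all hypothetical.

```python
import math

def item_information(theta, a, b):
    """Fisher information of one 2PL item at ability theta (assumed model)."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

def select_next_item(theta, bank, administered, excluded=frozenset()):
    """Pick the most informative item not yet given and not clinically excluded."""
    candidates = [name for name in bank
                  if name not in administered and name not in excluded]
    if not candidates:
        return None
    return max(candidates, key=lambda name: item_information(theta, *bank[name]))

# Hypothetical item bank: name -> (discrimination, difficulty).
bank = {
    "walk_indoors": (1.2, -1.0),
    "climb_stairs": (1.5, 0.0),
    "run_short_distance": (1.3, 1.0),
}

# First item for a patient whose current ability estimate is 0.0.
first = select_next_item(0.0, bank, administered=set())

# Second item, with one item excluded on (hypothetical) clinical grounds.
second = select_next_item(0.0, bank, administered={first},
                          excluded={"run_short_distance"})
print(first, second)
```

Here the exclusion list overrides pure information maximization: without it, the second item chosen would be the excluded one, because it is more informative at the current ability estimate.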

If CAT outcome instruments such as the AM-PAC-CAT are shown to be beneficial for widespread application and use, a future challenge will be to develop effective and efficient methods to disseminate these innovations. It is essential not only that information about contemporary outcome instruments is communicated accurately and efficiently, but also that potential users understand what these instruments can offer and have the skill to appropriately implement them to assess functional outcomes. Without careful attention to dissemination and training, health care professionals may not know how to use these innovative tools and, consequently, outdated ordinal-scaled measures are likely to remain the outcome measurement norm for years to come.

To meet this challenge, new dissemination methods will need to be developed and implemented beyond the traditional methods of professional conference presentations and publication in scholarly journals. (41) Funding mechanisms will need to be developed that will support these dissemination tasks at every level. Future users must be provided with the software needed to apply, analyze, and interpret CAT-based outcome instruments. This may require the development of continuing education seminars or high-quality technical assistance vehicles to assist rehabilitation professionals and organizations in their understanding, application, and interpretation of contemporary outcome measurement tools. Accreditation or professional organizations might be able to play a crucial role in facilitating this dissemination process.

In addition, efforts need to be taken to ensure that future generations of clinicians are appropriately trained through the development of didactic courses and professional curricula on contemporary outcomes measurement. Specific courses on modern measurement technology can be incorporated into professional curricula as a new basic science in professional (entry-level) health professions education. Meeting this challenge will require efforts to educate faculty in the science of contemporary outcome measurement so that they have the skill to develop and deliver these courses to their future students. All of these dissemination steps are necessary to ensure that future generations of clinicians are familiar with and skilled in the application of contemporary outcomes measurement. Once developed and fully tested, these contemporary outcome instruments need to be widely disseminated and incorporated into clinical practice and research to improve our understanding of the effectiveness of health care interventions.

Dr Jette and Dr Haley provided concept/idea/research design and writing. Mr Meyers and Mr Zurek provided data collection. Dr Jette, Dr Haley, Ms Tao, and Dr Ni provided data analysis. Dr Jette and Mr Moed provided project management. Dr Jette and Dr Haley provided fund procurement. Mr Zurek provided institutional liaisons. Mr Meyers and Mr Zurek provided subjects, facilities/equipment, and consultation (including review of manuscript before submission).

This study was approved by the Institutional Review Board of Boston University.

This study was supported by HealthSouth Corporation's Outpatient Division. It also was supported, in part, by an Independent Scientist Award (K02 HD45354-01) to Dr Haley.

Dr Jette, Dr Haley, and Mr Moed have stock interest in CRE Care, LLC, which distributes the Activity Measure for Post-Acute Care products.

This article was received April 24, 2006, and was accepted November 29, 2006.

DOI: 10.2522/ptj.20060121

References

(1) Gardner W, Kelleher KJ, Pajer KA. Multidimensional adaptive testing for mental health problems in primary care. Med Care. 2002;40:812-823.

(2) McHorney CA. Generic health measurement: past accomplishments and a measurement paradigm for the 21st century. Ann Intern Med. 1997;127:743-750.

(3) Hays RD, Morales LS, Reise SP. Item response theory and health outcomes measurement in the 21st century. Med Care. 2000;38:1128-1142.

(4) Jette AM, Haley SM. Contemporary measurement techniques for rehabilitation outcomes assessment. J Rehabil Med. 2005;37:339-345.

(5) Revicki DA, Cella DF. Health status assessment for the twenty-first century: item response theory, item banking and computer adaptive testing. Qual Life Res. 1997;6:595-600.

(6) Cook KF, Monahan PO, McHorney CA. Delicate balance between theory and practice: health status assessment and item response theory. Med Care. 2003;41:571-574.

(7) Hambleton RK. Emergence of item response modeling in instrument development and data analysis. Med Care. 2000;38:1160-1165.

(8) Embretson S, Reise S. Item Response Theory for Psychologists. Mahwah, NJ: Lawrence Erlbaum Associates; 2000.

(9) Bjorner JB, Ware JE Jr. Using modern psychometric methods to measure health outcomes. Medical Outcomes Trust Monitor. 1998;3(2):14-18.

(10) Cella D, Chang C-H. A discussion of item response theory and its applications in health status assessment. Med Care. 2000;38:66-72.

(11) Ware JE Jr, Bjorner JB, Kosinski M. Practical implications of item response theory and computerized adaptive testing: a brief summary of ongoing studies of widely used headache impact scales. Med Care. 2000;38:1173-1182.

(12) Ware JE Jr. Conceptualization and measurement of health-related quality of life: comments on an evolving field. Arch Phys Med Rehabil. 2003;84:S43-S51.

(13) Cook KF, O'Malley KJ, Roddey TS. Dynamic assessment of health outcomes: time to let the CAT out of the bag? Health Serv Res. 2005;40:1694-1711.

(14) Fries J, Bruce B, Cella D. The promise of PROMIS: using item response theory to improve assessment of patient-reported outcomes. Clin Exp Rheumatol. 2005;23:S53-S57.

(15) Hart DL, Mioduski JE, Stratford PW. Simulated computerized adaptive tests for measuring functional status were efficient with good discriminant validity in patients with hip, knee, or foot/ankle impairments. J Clin Epidemiol. 2005;58:629-638.

(16) Hart DL, Cook KF, Mioduski JE, et al. Simulated computerized adaptive test for patients with shoulder impairments was efficient and produced valid measures of function. J Clin Epidemiol. 2006;59:290-298.

(17) Haley SM, Ni PS, Hambleton RK, et al. Computer adaptive testing improves accuracy and precision of scores over random item selection in a physical functioning item bank. J Clin Epidemiol. 2006;59:1174-1182.

(18) Ware JE Jr, Gandek B, Sinclair SJ, Bjorner JB. Item response theory in computer adaptive testing: implications for outcomes measurement in rehabilitation. Rehabil Psychol. 2005;50:71-78.

(19) Dijkers MP. A computer adaptive testing simulation applied to the FIM instrument motor component. Arch Phys Med Rehabil. 2003;84:384-393.

(20) Haley SM, Fragala-Pinkham MA, Ni PS. Sensitivity of a computer adaptive assessment for measuring functional mobility changes in children enrolled in a community fitness program. Clin Rehabil. 2006;20:616-622.

(21) Haley SM, Coster WJ, Andres PL, et al. Activity outcome measurement for post-acute care. Med Care. 2004;42:I49-I61.

(22) Haley SM, Andres PL, Coster WJ, et al. Short-form activity measure for post-acute care (AM-PAC). Arch Phys Med Rehabil. 2004;85:649-660.

(23) Coster WJ, Haley SM, Andres PL, et al. Refining the conceptual basis for rehabilitation outcome measurement: personal care and instrumental activities domain. Med Care. 2004;42:I62-I72.

(24) Haley SM, Coster WJ, Andres PL, et al. Score comparability of short-forms and computerized adaptive testing: simulation study with the Activity Measure for Post-Acute Care (AM-PAC). Arch Phys Med Rehabil. 2004;85:661-666.

(25) International Classification of Functioning, Disability and Health (ICF). Geneva, Switzerland: World Health Organization; 2001.

(26) Muraki E. A generalized partial credit model. In: van der Linden W, Hambleton RK, eds. Handbook of Modern Item Response Theory. New York, NY: Springer-Verlag New York Inc; 1997:153-168.

(27) Andres PL, Haley SM, Ni PS. Is patient-reported function reliable for monitoring post-acute outcomes? Am J Phys Med Rehabil. 2003;82:614-621.

(28) Kingsbury G, Zara A. A comparison of procedures for content-sensitive item selection in computerized adaptive testing. Applied Measurement in Education. 1991;4:241-261.

(29) Revuelta J, Ponsoda V. A comparison of item exposure control methods in computerized adaptive testing. J Educ Meas. 1998;35:311-327.

(30) Stocking M, Lewis C. Controlling item exposure conditional on ability in computerized adaptive testing. J Educ Behav Stat. 1998;23:57-75.

(31) Drasgow F, Levine M, Williams E. Appropriateness measurement with polytomous item response models and standardized indices. Br J Math Stat Psychol. 1985;38:67-86.

(32) Haley SM, Fragala-Pinkham MA. Interpreting change scores of tests and measures used in physical therapy. Phys Ther. 2006;86:735-743.

(33) Irrgang JJ, Snyder-Mackler L, Wainner RS, et al. Development of a patient-reported measure of function of the knee. J Bone Joint Surg Am. 1998;80:1132-1145.

(34) Tegner Y, Lysholm J. Rating systems in the evaluation of knee ligament injuries. Clin Orthop. 1985;190:43-49.

(35) Jette DU, Jette AM. Physical therapy and health outcomes in patients with knee impairments. Phys Ther. 1996;76:1178-1187.

(36) Jette DU, Jette AM. Physical therapy and health outcomes in patients with spinal impairments. Phys Ther. 1996;76:930-941.

(37) Wilkerson DL, Johnston MV. Clinical program monitoring systems: current capability and future directions. In: Fuhrer MJ, ed. Assessing Medical Rehabilitation Practices: The Promise of Outcomes Research. Baltimore, Md: Paul H Brookes Publishing Co Inc; 1997:275-306.

(38) Coster WJ, Haley SM, Jette AM. Measuring patient-reported outcomes after discharge from inpatient rehabilitation settings. J Rehabil Med. 2006;38:237-242.

(39) Haley SM, Siebens H, Coster WJ, et al. Computerized adaptive testing for follow-up after discharge from inpatient rehabilitation. Arch Phys Med Rehabil. 2006;87:1033-1042.

(40) McHorney C, Tarlov A. Individual-patient monitoring in clinical practice: are available health status surveys adequate? Qual Life Res. 1995;4:293-306.

(41) Farkas M, Jette AM, Tennstedt S, et al. Knowledge dissemination and utilization in gerontology: an organizing framework. Gerontologist. 2003;43:47-56.

AM Jette, PT, PhD, is Director, Health and Disability Research Institute, School of Public Health, Boston University, 580 Harrison Ave, 4th Floor, Boston, MA 02218 (USA). Address all correspondence to Dr Jette at: ajette@bu.edu.

SM Haley, PT, PhD, is Associate Director, Health and Disability Research Institute, School of Public Health, Boston University.

W Tao, BS, is Graduate Research Associate, Health and Disability Research Institute, School of Public Health, Boston University.

P Ni, MD, MPH, is Research Assistant Professor, Health and Disability Research Institute, School of Public Health, Boston University.

R Moed, MBA, is President, CRE Care, LLC, Boston, Mass.

D Meyers, MBA, is National Director of Trends and Outcomes, HealthSouth Outpatient Services, HealthSouth Corporation, Birmingham, Ala.

M Zurek, PT, is Vice President of Clinical Quality, HealthSouth Outpatient Services, HealthSouth Corporation.

[Jette AM, Haley SM, Tao W, et al. Prospective evaluation of the AM-PAC-CAT in outpatient rehabilitation settings. Phys Ther. 2007;87:385-398.]

Table 1.
Demographic Characteristics of the Study Sample, by Impairment
Group (a)

                       Spine (n = 717)   UE (n = 488)    LE (n = 610)

Age, mean (SD)         50.7 (17.40)      51.41 (17.49)   46.84 (19.26)
Acuity (d), median     32                46              40
Median no. of visits    8 (1-51)         10 (1-51)        9 (1-48)
  (range)
Average duration of    29 (17.4)         34 (20.6)       29 (17.4)
  episode care (d),
  mean (SD)
Sex (%)
  Male                 38.6              45.5            36.7
  Female               61.4              54.5            63.3
Postsurgical
  treatment (%)
  Yes                   9.2              29.5            40.3
BM on admission,       63.57 (7.78)      68.42 (8.23)    58.75 (9.16)
  mean (SD)
DA on admission,       57.74 (6.84)      53.15 (9.35)    59.17 (6.76)
  mean (SD)

(a) UE = upper extremity, LE = lower extremity, BM = Basic Mobility
scale, DA = Daily Activities scale.

Table 2.
Scale Distributions and Sensitivity to Change in the AM-PAC-CAT Basic
Mobility Scale, by Impairment Group (a)

        N       Mean (SD)

                Admission      Discharge      Difference

Total   1,703   63.26 (9.19)   68.71 (8.41)   5.45 (7.97) (c)
Spine     666   63.57 (7.78)   68.40 (8.59)   4.83 (7.56) (c)
UE        462   68.42 (8.24)   71.20 (7.74)   2.78 (6.87) (c)
LE        575   58.75 (9.16)   67.07 (8.26)   8.32 (8.35) (c)

        Mean (SD)            MDC (b)

        Effect Size   SRM    MDC_68         MDC_90

Total   0.59          0.68   1,024 (60%)    834 (49%)
Spine   0.62          0.64   380 (57%)      307 (46%)
UE      0.34          0.40   207 (45%)      146 (32%)
LE      0.91          1.00   437 (76%)      381 (66%)

(a) MDC = minimal detectable change, SRM = standardized response mean.

(b) $\mathrm{MDC}_{68} = Z_{68} \times \mathrm{SD}_{\mathrm{baseline}} \times \sqrt{2(1-r)} = 1 \times 9.19 \times \sqrt{2(1-0.96)} = 2.60$;
$\mathrm{MDC}_{90} = Z_{90} \times \mathrm{SD}_{\mathrm{baseline}} \times \sqrt{2(1-r)} = 1.645 \times 9.19 \times \sqrt{2(1-0.96)} = 4.28$.

(c) P ≤ .001.
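
The minimal detectable change arithmetic in footnote (b) can be checked with a short sketch. The function name `minimal_detectable_change` is a hypothetical helper introduced only for illustration; the inputs (baseline SD of 9.19 and 7.95, reliability r = 0.96, Z values of 1 and 1.645) are taken from the footnotes of Tables 2 and 3.

```python
import math


def minimal_detectable_change(z, sd_baseline, reliability):
    """MDC = z * SD_baseline * sqrt(2 * (1 - r)), as written in the table footnote."""
    return z * sd_baseline * math.sqrt(2.0 * (1.0 - reliability))


# Basic Mobility scale: SD_baseline = 9.19, r = 0.96 (Table 2 footnote)
mdc68_bm = minimal_detectable_change(1.0, 9.19, 0.96)
mdc90_bm = minimal_detectable_change(1.645, 9.19, 0.96)

# Daily Activity scale: SD_baseline = 7.95, r = 0.96 (Table 3 footnote)
mdc68_da = minimal_detectable_change(1.0, 7.95, 0.96)
mdc90_da = minimal_detectable_change(1.645, 7.95, 0.96)

print(f"Basic Mobility: MDC68 = {mdc68_bm:.2f}, MDC90 = {mdc90_bm:.2f}")
print(f"Daily Activity: MDC68 = {mdc68_da:.2f}, MDC90 = {mdc90_da:.2f}")
```

A patient whose change score exceeds the MDC value is counted in the corresponding MDC column of the table.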

Table 3.
Scale Distributions and Sensitivity to Change in the AM-PAC-CAT Daily
Activity Scale, by Impairment Group (a)

        N       Mean (SD)

                Admission      Discharge      Difference

Total   1,704   56.97 (7.95)   60.88 (6.53)   3.91 (6.91) (c)
Spine     666   57.74 (6.84)   60.63 (6.61)   2.89 (6.09) (c)
UE        462   53.13 (9.35)   58.79 (7.49)   5.65 (8.07) (c)
LE        576   59.17 (6.76)   62.86 (4.84)   3.69 (6.53) (c)

        Mean (SD)            MDC (b)

        Effect Size   SRM    MDC_68         MDC_90

Total   0.49          0.57   852 (50%)      720 (42%)
Spine   0.42          0.47   302 (45%)      251 (38%)
UE      0.60          0.70   267 (58%)      243 (53%)
LE      0.55          0.57   283 (49%)      226 (39%)

(a) MDC = minimal detectable change, SRM = standardized response mean.

(b) $\mathrm{MDC}_{68} = Z_{68} \times \mathrm{SD}_{\mathrm{baseline}} \times \sqrt{2(1-r)} = 1 \times 7.95 \times \sqrt{2(1-0.96)} = 2.25$;
$\mathrm{MDC}_{90} = Z_{90} \times \mathrm{SD}_{\mathrm{baseline}} \times \sqrt{2(1-r)} = 1.645 \times 7.95 \times \sqrt{2(1-0.96)} = 3.70$.

(c) P ≤ .001.
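
The Effect Size and SRM columns appear consistent with the conventional definitions: effect size as the mean change divided by the SD of admission scores, and standardized response mean as the mean change divided by the SD of the change scores. The sketch below assumes those definitions (the table itself does not state them) and uses hypothetical function names; the inputs are the Total rows of Tables 2 and 3.

```python
def effect_size(mean_change, sd_baseline):
    """Assumed definition: mean change divided by the SD of baseline scores."""
    return mean_change / sd_baseline


def standardized_response_mean(mean_change, sd_change):
    """Assumed definition: mean change divided by the SD of change scores."""
    return mean_change / sd_change


# Total row, Basic Mobility (Table 2): change 5.45 (7.97), admission SD 9.19
es_bm = effect_size(5.45, 9.19)
srm_bm = standardized_response_mean(5.45, 7.97)

# Total row, Daily Activity (Table 3): change 3.91 (6.91), admission SD 7.95
es_da = effect_size(3.91, 7.95)
srm_da = standardized_response_mean(3.91, 6.91)

print(f"Basic Mobility: ES = {es_bm:.2f}, SRM = {srm_bm:.2f}")
print(f"Daily Activity: ES = {es_da:.2f}, SRM = {srm_da:.2f}")
```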

Table 4.
Most Frequently Used Functional Tasks From the AM-PAC-CAT Basic
Mobility Scale Item Bank

Item Description                 Content (a)   Frequency   IER (b) (%)

Bending over to pick up          1             11,491      100
  something
Standing up from a low, soft     3              8,658       75.35
  couch
Walking 1.6 km (1 mile)          2              7,674       66.78
  briskly, without stopping to
  rest
Walking 30.48 m (100 ft)         2              5,315       46.25
  indoors
Walking outdoors on steep,       2              4,609       40.11
  unpaved inclines
Running outdoors short           2              4,026       35.04
  distances
Running outdoors for 5 min on    2              4,002       34.83
  level terrain
Running outdoors for 10 min on   2              3,933       34.23
y
Walking up and down a flight     2              3,336       29.03
  of outdoor stairs, without
  using a handrail
Vigorous activities              2              3,121       27.16
Making sharp turns when          2              2,904       25.27
  running fast
Standing up from a chair         3              2,779       24.18
  without side arms
Carrying object in both arms     1              2,644       23.01
  while climbing stairs
Walking indoors in a familiar    2              2,229       19.40
  setting
Walking outdoors on uneven       2              2,014       17.53
  surfaces
Strenuous activities             1              1,648       14.34
Walking indoors in an            2              1,637       14.25
  unfamiliar setting
Walking outdoors on slippery     2              1,119        9.74
  surfaces
Light housework                  1              1,068        9.29
Climbing 1 step with a           2                665        5.79
  handrail
Walking outdoors more than       2                651        5.67
  1.6 km

(a) Content: 1 = bend/lift/reach/carry/activity items, 2 = mobility
items, 3 = transfer items.

(b) IER = item exposure rate (exposed frequency divided by the total
number of computerized adaptive testing occasions).
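
Footnote (b) defines the item exposure rate as exposed frequency divided by the total number of CAT occasions. A minimal sketch follows, assuming the total number of occasions is 11,491 (inferred from the item with a 100% exposure rate, not stated directly in the table); `item_exposure_rate` is a hypothetical helper name.

```python
def item_exposure_rate(frequency, total_occasions):
    """IER (%) = exposed frequency / total CAT occasions * 100."""
    if total_occasions <= 0:
        raise ValueError("total_occasions must be positive")
    return 100.0 * frequency / total_occasions


# Assumption: 11,491 occasions, inferred from the 100% exposure item in Table 4.
TOTAL_OCCASIONS = 11_491

# A few frequencies copied from Table 4.
frequencies = {
    "Standing up from a low, soft couch": 8_658,
    "Walking 1.6 km (1 mile) briskly, without stopping to rest": 7_674,
    "Climbing 1 step with a handrail": 665,
}

rates = {
    item: round(item_exposure_rate(freq, TOTAL_OCCASIONS), 2)
    for item, freq in frequencies.items()
}

for item, rate in rates.items():
    print(f"{item}: {rate:.2f}%")
```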

Table 5.
Most Frequently Used Functional Tasks From the AM-PAC-CAT Daily
Activity Scale Item Bank

Item Description                 Content (a)   Frequency   IER (b) (%)

Tying shoes                      1             11,514      100
Chopping or slicing vegetables   2             11,270       97.88
Sewing on a button               4              9,551       82.95
Pounding nail into wall          4              8,202       71.24
Unscrewing lid                   2              7,288       63.30
Bathing or dressing              3              7,281       63.24
Shaving legs and underarms       3              5,555       48.25
  safely and thoroughly with a
  blade razor
Using manual screwdriver         4              5,540       48.12
Tightening small parts           4              3,776       32.79
Using common kitchen utensils    2              2,640       22.93
Cutting your toenails            3              2,059       17.88
Trimming and filing your         3              1,912       16.61
  fingernails
Breaking open plastic            4              1,803       15.66
  packaging using scissors
Washing small clothing items     4              1,719       14.93
  by hand in sink
Writing for 30 min               4                999        8.68

(a) Content: 1 = dressing, 2 = meals, 3 = grooming/hygiene, 4 =
instrumental activities of daily living.

(b) IER = item exposure rate (exposed frequency divided by the total
number of computerized adaptive testing occasions).

Table 6.
Average Standard Error (SE) at Different Ability Score Ranges for
AM-PAC-CAT Basic Mobility and Daily Activity Scales (a)

Scale            Score Range

                 10-29      30-49      50-69      70-90

Basic Mobility
  Average SE     4.45       2.16       1.99       3.19
  95% CI band    ±8.72      ±4.23      ±3.90      ±6.25
Daily Activity
  Average SE     2.60       1.98       5.32       n/a
  95% CI band    ±5.1       ±3.88      ±10.43     n/a

(a) n/a = not applicable, CI = confidence interval.
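
The 95% CI bands in Table 6 match the average SE multiplied by 1.96, the usual normal-approximation multiplier. The sketch below assumes that is how the bands were derived (the table does not say so explicitly); `ci_band` is a hypothetical helper name.

```python
Z_95 = 1.96  # assumed normal-approximation multiplier for a 95% interval


def ci_band(standard_error, z=Z_95):
    """Half-width of the confidence band around a score: z * SE."""
    return z * standard_error


# Average SE values for the Basic Mobility scale, by score range (Table 6).
basic_mobility_se = {"10-29": 4.45, "30-49": 2.16, "50-69": 1.99, "70-90": 3.19}

bands = {rng: round(ci_band(se), 2) for rng, se in basic_mobility_se.items()}

for rng, half_width in bands.items():
    print(f"Score range {rng}: ±{half_width:.2f}")
```

Read this way, a Basic Mobility score in the 50-69 range carries a band of about ±3.9 points, the narrowest of the four ranges for that scale.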

Table 7.
Difference Scores (Mean ± SD) for AM-PAC-CAT Basic Mobility and
Daily Activity Scales, by Acuity and Postsurgical Treatment Groups

Scale            Acuity

                 ≤35 d (n = 478)   >35 d (n = 533)   Difference

Basic Mobility
  Admission      63.45 (9.39)      62.85 (9.63)      0.60
  Discharge      70.11 (8.17)      68.14 (8.30)      1.97 (a)
  Increase        6.62 (8.60)       5.22 (7.94)      1.40 (a)
Daily Activity
  Admission      56.31 (8.28)      56.18 (8.43)      0.13
  Discharge      61.85 (5.96)      60.12 (7.06)      1.73 (a)
  Increase        5.53 (7.60)       3.89 (6.95)      1.64 (a)

Scale            Postsurgical Treatment

                 Yes (n = 456)     No (n = 1,358)    Difference

Basic Mobility
  Admission      59.00 (10.14)     64.69 (8.37)      -5.70 (a)
  Discharge      66.73 (7.79)      69.34 (8.52)      -2.61 (a)
  Increase        7.85 (8.62)      4.65 (7.57)        3.20 (a)
Daily Activity
  Admission      54.32 (8.19)      57.88 (7.67)      -3.56 (a)
  Discharge      60.70 (6.67)      60.95 (6.50)      -0.25
  Increase        6.38 (8.10)      3.07 (6.24)        3.31 (a)

(a) P ≤ .05.

COPYRIGHT 2007 American Physical Therapy Association, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2007, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

Article Details
Title Annotation:Research Report
Author:Zurek, Matthew
Publication:Physical Therapy
Date:Apr 1, 2007
Words:9186