
Development of the Patient Activation Measure (PAM): conceptualizing and measuring activation in patients and consumers.

Two significant emerging policy directions put patients and consumers in a key role for influencing health care quality and costs. First, consumer-directed health plans rely on informed consumer choices to contain costs and improve the quality of care. This approach assumes that consumers will make more prudent health and health care choices when they are given financial incentives along with access to comparative cost and quality information. This approach also assumes that the combination of financial incentives and relevant information will increase their "activation" (Gabel, Lo Sasso, and Rice 2002). Second, the Chronic Illness Care Model (Bodenheimer et al. 2002) emphasizes patient-oriented care, with patients and their families integrated as members of the care team. A critical element in the model is activated patients, with the skills, knowledge, and motivation to participate as effective members of the care team (Von Korff et al. 1997).

A key health policy question is, what would it take for consumers to become effective and informed managers of their health and health care? What skills, knowledge, beliefs, and motivations do they need to become "activated" or more effectual health care actors? These are essential questions if we hope to improve the health care process and the outcomes of care, and to control costs. This is true especially with regard to the 99 million Americans with a chronic disease. Because those with chronic illness need ongoing care, account for a large portion of health care costs, and must play an important role in maintaining their own functioning, encouraging their activation should be a priority.

Even though patient activation is a central concept in both the consumer-driven health care approach and the chronic illness care models, it remains conceptually and empirically underdeveloped. There has been a lack of conceptual clarity regarding "activation," and thus a lack of adequate measurement. There are a number of existing methods for assessing different aspects of activation, such as health locus of control (Wallston, Stein, and Smith), self-efficacy in self-managing behaviors (Lorig et al. 1996), and readiness to change health-related behaviors (DiClemente et al. 1991; Prochaska, Redding, and Evers 1997), but these measures tend to focus on the prediction of a single behavior. Moreover, there is no existing measure that includes the broad range of elements involved in activation, including the knowledge, skills, beliefs, and behaviors that a patient needs to manage a chronic illness.

In this paper we describe the development of the Patient Activation Measure (PAM), a measure of activation that is grounded in rigorous conceptualization and appropriate psychometric methods. The PAM was developed in four stages:

Stage 1. Conceptually defining activation involved a literature review, systematic consultation with experts using a "consensus method," and consultation with individuals with chronic disease using focus groups.

Stage 2. Preliminary scale development began by building on the domains identified in stage one and operationalizing them with survey items within each domain. Steps included generating, refining, and testing a large item pool. We used Rasch psychometric methods to develop the scale and test the preliminary measure's psychometric properties.

Stage 3. Stage three involved exploring the possibility of extending the range of the measure, refining the response categories, and testing whether the measure could be used with respondents who had no chronic illnesses.

Stage 4. In the fourth and final stage a national probability sample was used to assess the performance of the measure across different subsamples in the population and to assess the construct validity of the measure.


Literature Review

Methods. A review of published articles that discuss skills and knowledge needed to successfully manage a chronic illness was conducted. Articles on self-care, self-management, doctor-patient communication, and using comparative information to inform health care choices were reviewed.

Findings. The review findings indicated that being an engaged and active participant in one's own care is linked to better health outcomes (Von Korff et al. 1997; Lorig et al. 1999; Von Korff et al. 1998; Bodenheimer et al. 2002) and measurable cost savings (Glasgow et al. 2002). Training patients with chronic diseases to self-manage their disease is effective, at least in the short term, in increasing functioning, reducing pain, and reducing health care costs (Lorig et al. 1999). Research also indicated a positive relationship between self-efficacy, preventive actions, and health outcomes (Bandura 1991; Grembowski et al. 1993; O'Leary 1985; Day, Bodmer, and Dunn 1996; Kaplan, Greenfield, and Ware 1989).

Collaborating on care and engaging in shared clinical decision making are also linked with better health outcomes (Von Korff et al. 1997; Kaplan, Greenfield, and Ware 1989; Glasgow 2002). Coaching patients to be more involved and to have more control in the medical encounter has been shown to produce better health and functioning in patients (Wasson et al. 1999; Greenfield, Kaplan, and Ware 1985; Greenfield et al. 1988).

Several studies document the problems consumers have in understanding and navigating the health care system, which may lead to reduced access to appropriate and timely care (Isaacs 1996; Hibbard et al. 1998, 2001). Similarly, because of the documented variability in the quality of different health care providers and hospitals, it is hypothesized that consumers who use comparative quality information to choose health care providers will receive higher-quality medical care (Marshall et al. 2000).

To summarize, the review of the literature indicates that patients who are able to: (1) self-manage symptoms/problems; (2) engage in activities that maintain functioning and reduce health declines; (3) be involved in treatment and diagnostic choices; (4) collaborate with providers; (5) select providers and provider organizations based on performance or quality; and (6) navigate the health care system, are likely to have better health outcomes. We used these six domains as a starting point for an expert consensus process and for patient focus groups.

Expert Consensus

Methods. The expert consensus process was adapted from Kahn et al. (1997) and Thorndike and Hagen (1991) and was designed to identify consensus among experts who view the issue of activation from a wide range of perspectives. Twenty-one panelists were chosen, in part, because they had demonstrated national prominence in their area of expertise. To limit the influence of one respondent on another, we gathered input through mailed surveys.

The process involved two rounds of contact, and 18 of the 21 experts completed both rounds. The key question posed to the experts in each round was, "What are the knowledge, beliefs, and skills that a consumer needs to successfully manage when living with a chronic disease?"

The first round was designed to elicit a broad range of ideas about domains to be included. We began with the six "domains" developed from the literature review (listed above) and elaborated these to include patients' beliefs, knowledge, and skills associated with each of these areas. Thus we had a total of 18 possible domains: beliefs, knowledge, and skills for each of the six domains. Within each of these 18 domains we listed a number of subdomains involving specific characteristics, attributes, or behaviors. (Figure 1a shows examples of the subdomains.) We asked the experts to edit these subdomains, add any new subdomains, and rate the importance of each subdomain and each general domain to the construct.
Figure 1a: Example of Subdomains, under the Domain

Has the Skills for Maintaining Function, Prevention, and Making
Lifestyle Changes


1. Has the skills to reduce the impact of symptoms on ability to

2. Can self-monitor and make lifestyle and environmental changes
that improve functioning and well-being.

3. Maintains recommended diet and exercise regimens or other
lifestyle/environmental recommendations.

4. Maintains activities when experiencing some pain or fatigue.

5. Can use evidence-based strategy for managing their primary risks.

6. Ability to "come back" after a behavioral lapse.

Subdomains 1-3 were rated as important or very important to defining
the domain by the expert panel; Subdomains 4-6 were rated as less

Findings. The results of the first round indicated considerable consensus in conceptualizing activation with many of the experts providing similar comments and additions. On the basis of this expert feedback our original classification of beliefs, knowledge, and skills was altered slightly to include "accessing emotional support."

In the second expert consensus round the expert respondents were given an expanded set of subdomains that more clearly defined the larger domain and included subdomains suggested by the experts. The experts rated and rank-ordered the importance of each domain and each subdomain. The domains where there was expert consensus are identified in Figure 1b.
Figure 1b: Domains Endorsed by Expert Consensus and Patients

                     Self-    Collaborate     Maintain     Access
                     Manage   with Provider   Function/    Appropriate
                                              Prevent      and High-
                                              Declines     Quality Care

Believes patient role
is important in:       *            *             *

Has the knowledge
to:                    *                          *

Has the skills
to:                    *            *             *           ***

Can access
supports to:           **                         **

* Identified by experts and patients as a key component and retained in
later stages of scale development

** Identified by experts as a key component and omitted in later stages
of scale development

*** Identified by experts as a key component and identified by patients
as a secondary component and retained for preliminary scale development

Patient Focus Groups

Methods. Two focus groups explored the same potential domains with a convenience sample of chronic disease patients. One focus group had ten participants; the other group had nine participants. The domains that were explored with the experts were revised and reworded in layman's terms and were used as the basis for a discussion on the key components of successful management of chronic disease. Participants, like the expert panel, could also edit or add to the list. The participants were recruited with ads in the local newspaper and were paid $35 for their participation. The average age was 55 (range 39 to 78). Sixty-eight percent of the respondents were female. Ninety percent had more than one chronic condition.

Findings. The expert panel and focus group participants were in agreement regarding most of the domains (Figure 1b). However, the focus group participants were much less likely than the experts to view emotional support of family and friends as important in successful management of chronic disease.

Based on results from the expert panel and the consumer focus groups we derived a conceptual definition of health activation in patients and consumers: Those who are activated believe patients have important roles to play in self-managing care, collaborating with providers, and maintaining their health. They know how to manage their condition and maintain functioning and prevent health declines; and they have the skills and behavioral repertoire to manage their condition, collaborate with their health providers, maintain their health functioning, and access appropriate and high-quality care. We used this definition as the basis for developing the measure.



To operationalize the domains in Figure 1b, an 80-item pool was constructed by selecting questions from existing instruments and creating new ones where none existed. The items in the pool were categorized under the domains they were intended to measure and were reviewed by a subset of the expert panel for face and content validity.

All 80 items were further refined with three rounds of face-to-face cognitive testing with 20 respondents with chronic conditions. Items were evaluated in terms of how well they were understood, the degree to which there was variability in responses, and the adequacy of the response categories. Seventy-five items were retained after the cognitive interviews and used for the pilot study.

Study Sample

The pilot study was conducted with a convenience sample of 100 respondents. Participants were recruited through newspaper advertisements and were paid for their participation. Respondents ranged in age from 19 to 79 and reported a wide range of chronic conditions. Items were administered through a telephone interview that included the 75-item pool and a limited set of demographic and health status questions.

Psychometric Analysis

The initial set of items constituting the PAM was selected using Rasch analysis (Rasch 1960; Wright and Masters 1982; Wright and Stone 1979; Massof 2002). Rasch measurement can be used to create interval-level, unidimensional, probabilistic Guttman-like scales from ordinal data such as rating scale responses to survey questions. The measurement model calibrates the "difficulty" of the items in terms of response probabilities. The calibration of an item on the measurement scale indicates how much of the measured variable a respondent must exhibit to be able to endorse the item.

Once the measure is constructed, individuals are measured as to where they fall on the scale, and their location represents how much of the variable each respondent possesses. In the case of the PAM, an individual's location indicates how activated the person is. Both the people who are measured and the items doing the measurement are located on the same equal interval scale, yet these two parameters are statistically independent of each other. This concept of parameter separation means that the calibration of the items is independent of the activation levels of the particular respondents measured.
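For orientation, the simplest (dichotomous) form of the Rasch model can be written as below. The symbols are notational assumptions introduced here rather than taken from the text: \theta_n stands for the activation of person n and \delta_i for the difficulty calibration of item i. The PAM analyses used rating scale responses, for which the model generalizes to more than two categories.

```latex
P(X_{ni} = 1 \mid \theta_n, \delta_i)
  = \frac{\exp(\theta_n - \delta_i)}{1 + \exp(\theta_n - \delta_i)},
\qquad
\ln \frac{P(X_{ni} = 1)}{P(X_{ni} = 0)} = \theta_n - \delta_i
```

Because the log-odds depend only on the difference between the person and item parameters, both are expressed on the same interval scale, and a person located exactly at an item's calibration has a .5 probability of endorsing it.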

The precision with which an item's scale location, or calibration, has been estimated is represented by the item's standard error of measurement. Likewise, the precision of each individual respondent's estimated scale location is specified by the standard error of measurement of that person.

Item selection is based on item fit statistics representing how much responses to an item deviate from the model's expectations. A fit value of 1.0 indicates perfect fit to model expectations. Fit values > 1.0 indicate more stochastic variability in responses than expected (e.g., persons with low measured activation endorsing items requiring a high level of activation) and fit values < 1.0 indicate that responses to the item by persons of different activation levels do not vary as much as the model expects.

Two item fit statistics are calculated. Infit is an information-weighted residual and is most sensitive to item fit when the item's scale location is close to the respondent's scale location. Outfit is more sensitive to item fit for items with a scale location that is distant from the respondent's scale location. Simulation studies and experience suggest that item fit values between .5 and 1.5 produce sufficient unidimensionality and expected response variability for useful rating scale measurement (Smith 1996). All analyses were conducted with the Winsteps Rasch models software application (Linacre 2002).
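The two fit statistics can be illustrated with a minimal sketch for the dichotomous Rasch model. This is a simplification under stated assumptions: the study analyzed rating scale data in Winsteps, and the function names, person measures, and response patterns below are hypothetical, chosen only to show how infit and outfit mean squares respond to expected and unexpected responses.

```python
import math


def rasch_prob(theta, delta):
    """Probability of endorsing an item under the dichotomous Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - delta)))


def item_fit(responses, thetas, delta):
    """Return (infit, outfit) mean squares for one item.

    responses: 0/1 answers to the item, one per person (hypothetical data).
    thetas:    person measures in logits, in the same order.
    delta:     the item's difficulty calibration in logits.
    """
    squared_residuals = 0.0   # sum of (observed - expected)^2
    information = 0.0         # sum of model variances p * (1 - p)
    standardized = 0.0        # sum of squared standardized residuals
    for x, theta in zip(responses, thetas):
        p = rasch_prob(theta, delta)
        variance = p * (1.0 - p)
        residual_sq = (x - p) ** 2
        squared_residuals += residual_sq
        information += variance
        standardized += residual_sq / variance
    infit = squared_residuals / information   # information-weighted
    outfit = standardized / len(responses)    # unweighted, outlier-sensitive
    return infit, outfit


# Hypothetical persons spread around an item calibrated at 0 logits.
thetas = [-2.0, -1.0, 0.0, 1.0, 2.0]
orderly = item_fit([0, 0, 1, 1, 1], thetas, 0.0)    # Guttman-like pattern
reversed_ = item_fit([1, 1, 0, 0, 0], thetas, 0.0)  # unexpected pattern
```

In this sketch the orderly pattern yields values below 1.0 (less variability than the model expects), while the reversed pattern yields values well above 1.0, with outfit reacting most strongly to the unexpected responses from persons far from the item.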


Table 1 shows the 21 items constituting the preliminary activation measure, the calibrated scale location (difficulty) of each item, and the fit and item discrimination statistics. The item difficulty (the "calibration" shown in Table 1) indicates how much activation is required for a patient to have a .5 probability of responding "agree" to an item. Item scale locations have been transformed from the original logit metric to a user-friendly 0-100 metric where 0 = the lowest possible activation and 100 = the highest possible activation as measured by this set of items. While the metric allows for a potential range of 0-100, the items included in the measure covered only the range from 40 to 60, not tapping what would be theoretically the lowest or highest ranges of the construct.
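The text does not give the constants used to move from logits to the 0-100 metric. As a minimal sketch, assuming the transformation is linear, the rescaling could look like the following; the function name and the logit endpoints in the usage line are hypothetical.

```python
def rescale_logit(logit, logit_at_0, logit_at_100):
    """Linearly map a logit value onto a 0-100 metric.

    logit_at_0 and logit_at_100 are the logit values assigned to the
    scale endpoints (assumed values, not reported in the article).
    """
    if logit_at_100 <= logit_at_0:
        raise ValueError("logit_at_100 must exceed logit_at_0")
    return 100.0 * (logit - logit_at_0) / (logit_at_100 - logit_at_0)


# With hypothetical endpoints of -5 and +5 logits, 0 logits maps to 50.
midpoint = rescale_logit(0.0, -5.0, 5.0)
```

A linear rescaling of this kind preserves the interval property of the logit scale, so differences between item calibrations keep the same relative size after the change of units.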

All the domains derived through the conceptualization stage (Figure 1b) are reflected in the 21 items, except for the domain of accessing appropriate and high-quality care. While items addressing this domain correlate with the 21-item measure, fit statistics revealed these items tap a different construct than activation.

Most importantly, this analysis indicates that the items form a unidimensional, probabilistic Guttman-like scale. Close inspection of the difficulty order of items on the scale suggests that they reflect a developmental model of activation (Bond and Fox 2001). Beliefs about the patient role and basic knowledge about one's condition and treatment appear to be important early developmental steps. Items in this early stage involve areas such as knowledge of medications and needed lifestyle changes as well as a belief that active involvement in one's health care is important. Only a small amount of activation is required to be able to endorse these items. Skills and confidence appear to come at later developmental steps. Items at the midpoint of the scale involve confidence that one can identify when medical care is needed, and that one can follow through on medical recommendations and handle symptoms on one's own. Items at the top of the activation continuum, indicating greatest activation, include maintaining needed lifestyle changes, having the confidence to handle new situations or problems, and keeping chronic illness from interfering with one's life.

Reliability Assessments

Rasch person reliability is the proportion of the total sample variability in measured activation that is not measurement error. Rasch person reliability provides upper and lower bounds to the estimate of the "true score" reliability of a measure. Real person reliability is calculated under the assumption that all of the misfit in the responses is due to departure of the data from the model's expectations. This is the lower bound reliability of the measurement of persons in this sample with this set of items. Model person reliability is based on the assumption that the data fit model expectations and that the misfit in the data is due to the probabilistic nature of the model. This is the upper-bound reliability. The true reliability of the measure lies somewhere between these lower and upper bounds. The Rasch person reliability for the preliminary 21-item measure was between .85 (real) and .87 (model). Cronbach's alpha was .87.
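The definition of person reliability as the share of observed variance that is not measurement error can be sketched as follows. This is an illustration under assumptions, not the Winsteps computation itself: the function name and the data are hypothetical, and the choice of population variance is an assumption.

```python
from statistics import mean, pvariance


def person_reliability(measures, standard_errors):
    """Share of observed variance in person measures that is not error.

    measures:        estimated person locations (hypothetical values).
    standard_errors: the SEM of each person's estimate, same order.
    """
    observed_variance = pvariance(measures)
    error_variance = mean([se ** 2 for se in standard_errors])
    return (observed_variance - error_variance) / observed_variance


# Four hypothetical persons on the 0-100 metric, each with an SEM of 5.
reliability = person_reliability([40, 50, 60, 70], [5, 5, 5, 5])
```

Here the observed variance is 125 and the error variance is 25, giving a reliability of 0.8. Supplying larger standard errors, as when misfit is attributed to the data rather than to the model, lowers the result, which is the sense in which the "real" value is the lower bound and the "model" value the upper bound.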

We also conducted a test-retest reliability assessment. Thirty respondents from the pilot survey were reinterviewed two weeks after the initial interview with the same protocol. For each person we calculated the precision of their measured activation at test and again at retest, measured by the standard error of measurement (SEM) for each person's estimated activation at each time point. The estimate plus or minus 1.96 times the SEM provides the 95 percent confidence interval (CI) for each person's measured (estimated) activation. Twenty-eight of the 30 respondents had a retest activation estimate within the 95 percent CI of their test activation estimate.
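The test-retest check described above reduces to a simple interval comparison per respondent. A minimal sketch, with a hypothetical function name and hypothetical scores:

```python
def retest_within_ci(test_measure, test_sem, retest_measure, z=1.96):
    """True if the retest estimate lies inside the 95 percent CI of the test estimate.

    The interval is test_measure +/- z * test_sem, with z = 1.96 by default.
    """
    half_width = z * test_sem
    return abs(retest_measure - test_measure) <= half_width


# Hypothetical respondent: test score 50 with SEM 4, so the CI is 42.16 to 57.84.
stable = retest_within_ci(50.0, 4.0, 56.0)
shifted = retest_within_ci(50.0, 4.0, 60.0)
```

Applied to each of the 30 reinterviewed respondents, a count of the cases returning True corresponds to the figure of 28 reported in the text.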

Criterion Validity

To assess criterion validity, we interviewed 10 respondents from the pilot study: five who scored at the lowest end of the activation scale, and five who scored at the highest. An in-depth, open-ended, semistructured interview protocol was used to elicit elaborated explanations of how respondents dealt with common problems and challenges associated with managing their conditions, such as handling a situation with a physician who did not answer questions well, their responses to recommendations to change their lifestyle, and handling self-treatments on their own. The interviews were transcribed and three judges, blinded to the person's measured activation, reviewed and independently categorized each transcript as that of a person "low" or "high" in activation.

The three independent judges' classification of respondents agreed with their measured activation level (high or low) 83 percent of the time (or 25 of the 30 classifications were correct). Cohen's kappas for measured activation and each judge's classifications were .80, .90, and .90 (p<.001 for all three kappas). No one respondent was misclassified by all three judges. These findings suggested that the preliminary measure had criterion validity when evaluated using the key criterion of self-described behavior.
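Cohen's kappa corrects raw agreement for the agreement expected by chance. The sketch below shows the calculation for one judge; the function name and the judge's classifications are hypothetical and are not the study data.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two sets of categorical labels on the same cases."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and equal in length")
    n = len(labels_a)
    # Proportion of cases on which the two sources agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance from the marginal frequencies.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / n ** 2
    # Note: undefined (division by zero) if expected agreement equals 1.
    return (observed - expected) / (1.0 - expected)


# Hypothetical judge who misclassifies one of five "low" respondents as "high".
measured = ["low"] * 5 + ["high"] * 5
judge = ["low"] * 4 + ["high"] * 6
kappa = cohens_kappa(measured, judge)
```

In this hypothetical case observed agreement is .90 and chance agreement is .50, so kappa is (.90 - .50) / (1 - .50) = .80.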


Our goals for the third phase of scale development were to refine the measure and extend the range of activation assessed by the items. First, because the items in the preliminary scale calibrated only the midrange of activation (40-60), we tested items for possible inclusion that might extend the item difficulty. Second, because the items in the preliminary survey used several different response scales, we tested the items using the same response scale for all items. Thus, the items were changed from questions to statements with the respondent indicating degree of agreement (four categories of degrees of agreement). Third, we wanted to assess how well the instrument would perform with a population that did not have a chronic disease. Fourth, we wanted to collect data from a larger sample to further assess the psychometric properties of the measure. Finally, we evaluated the use of a self-administered questionnaire.


A convenience sample of 486 respondents was recruited from among cardiac rehabilitation patients (n = 120) and employees of a large health system in a second community (n = 366). The employee sample responded to a web-based version of the survey; the clinic sample responded to a self-administered paper questionnaire. Twenty-four percent of the sample reported no chronic disease (n = 118) and the remainder reported from 1 to 8 chronic illnesses.


A Rasch rating scale model (Andrich 1978; Wright and Stone 1979) analysis yielded a 22-item measure (Figure 2). (1) Importantly, despite the slight change in item content, response categories, and the two different modes of administration, the findings confirm the item hierarchy observed in the preliminary 21-item scale. These results strongly suggest that activation is developmental in nature: the different elements of knowledge, belief, and skill that constitute activation have a hierarchical order, as shown in Figure 2.


In comparing this refined measure to our conceptual definition of activation, it appears that activation has four stages: The first involves beliefs about the importance of the patient role. The second involves the confidence and knowledge necessary to take action, including knowledge of medications and lifestyle changes, confidence in talking to health care providers and knowing when to seek help, and (at slightly higher levels of activation) confidence in following through on recommendations, knowing the nature and causes of the health condition, and different medical treatment options. The third stage involves actually taking action, including maintaining lifestyle changes, knowing how to prevent further problems, and handling symptoms on one's own. The fourth stage involves actually staying the course even when under stress. Patients who endorse these items are confident they can maintain lifestyle changes when under stress, that they can handle problems (rather than simply symptoms) on their own at home, and that they can keep their health problems from interfering with their life.

The structure of this probabilistic hierarchy of item difficulty implies that what is needed to increase activation depends on where the person is on the activation continuum. For example, those at the low end of activation may lack the belief that they have an important role to play in their health and lack elementary knowledge about their condition and their care. Respondents scoring in the mid range of the scale tend to have the necessary knowledge for self-care, but appear to lack some of the skills and confidence needed to carry through on all that is required for effective self-care. Those scoring at the higher end of the scale largely possess the necessary knowledge, skills, and confidence, but may be derailed from their course when they are under stress or encounter unexpected health events. (2)

Ordering is by difficulty calibration.

SEM: The standard error of measurement in estimation of the item difficulty. SEM is the precision of the item difficulty estimation and is shown in 0-100 units.

Infit: Infit mean square error is one of two quality control fit statistics assessing item dimensionality (the degree to which the item falls on the same single, real number line as the rest of the items). Infit is an information-weighted residual of observed responses from model expected responses and is most sensitive to item fit when the item is located near the person's scale location.

Outfit: Outfit mean square error fit statistic is most sensitive to item dimensionality when the item scale location is distant from the person's scale location.

The items have infit values between .76 and 1.32, well within the range required for a unidimensional measure. The Rasch person reliability for the 22-item measure was between .85 (real) and .88 (model). Cronbach's alpha was .91. Reliability statistics for those with and without chronic conditions are comparable.

In addition, we tested for observable mode effects. The log-odds equivalent of a Mantel-Haenszel differential item functioning analysis was conducted in Winsteps (Linacre 2002), comparing item calibrations from the web-based questionnaire with those from the paper questionnaire. No significant differences in item calibrations could be attributed to administration method.



This stage of the research evaluated the measure in a heterogeneous national probability sample to evaluate the performance of the measure across diverse groups and assess the construct validity of the measure.

Study Sample

A national probability sample (N = 1,515) of people 45 years and older was included in the telephone survey. Respondents were selected via a random digit dial selection and a screening question to determine age eligibility. No other eligibility requirement was employed. A 48 percent response rate was achieved with a protocol of a minimum of 12 call-backs. Many "no answer" or "busy" numbers had in excess of 20 attempts. Respondents ranged in age from 45 to 97, with 66 percent of the sample under the age of 65. Half the sample had a high school education or less and 32 percent had a household income of less than $25,000. Seventy-nine percent of the sample reported at least one chronic disease. (3) Among those with a chronic condition, 73 percent reported 2 or more conditions. Table 2 shows the distribution of the sample on other health and demographic characteristics.

The national sample largely mirrors census data for this age group. Differences between the sample and the census data are in gender distribution (census data 54 percent female, our sample 63 percent) and in the distribution on race (census data 83 percent white, our sample 88 percent).


The Rasch analysis of items from the national survey replicated the results obtained with the stage three pilot survey, showing the same developmental hierarchy of items and that the items maintain this same difficulty structure for both those with and without chronic illness.


Assessments of the 22-item PAM using national sample data show a high level of reliability with infit values ranging from .71 to 1.44. All but one of the outfit statistics are between .80 and 1.34.

The Rasch person reliability statistics for the measure are shown in Table 2 for the entire sample and meaningful subsamples. The consistency of performance of the measure is apparent in the reliability coefficients across subsamples. The high-reliability estimates indicate that the measure is appropriate for individual-level use, such as designing a care plan for an individual patient.

Some other notable characteristics of the measure are apparent in Table 2. First, the measure performs well both for those with a chronic condition and for those with no chronic condition. It is also stable across differing levels of health status. Reliability is also stable across gender and different age groups, with a slight decline in the oldest group (85+ years). Finally, the measurement precision is stable across the several different chronic illnesses represented in the sample. This suggests that the measure can be reliably used to assess activation across a variety of subgroups in the population.


To assess construct and criterion validity of the 22-item PAM, variables believed to be conceptually related to activation were examined for their relationship to measured activation. In addition, outcomes that are hypothesized to be a result of activation levels were examined, such as health behaviors and health functioning. Validity was assessed for the sample as a whole and for those with specific chronic illnesses. It was hypothesized that those with higher activation would be more likely to engage in specific self-care and preventive behaviors. Further, those with higher activation who have a specific chronic disease should be more likely to engage in the self-care behaviors specific to their condition (e.g., exercising to control arthritis pain). Similarly, it was hypothesized that those with higher measured activation should engage in other health "consumeristic" behaviors, such as seeking relevant health care information, being persistent in getting clear answers from providers, and using comparative performance information to make health care choices. We further expected that those with greater activation would have better health and functioning and lower rates of health care utilization. Finally, because being activated implies having a sense of control over one's health, an item that is intended to measure "health fatalism" (4) was included. We hypothesized that those with more activation would indicate less fatalism about their future health.

The results indicate considerable evidence for the construct validity of the PAM. Those with higher activation report significantly better health as measured by the SF-8 (r = .38, p<.001), and have significantly lower rates of doctor office visits, emergency room visits, and hospital nights (r = -.07, p<.01). Those with higher activation are significantly more likely to exercise regularly, follow a low-fat diet, eat more fruits and vegetables, and not smoke (Table 3). In addition, those with higher activation are significantly more likely to engage in consumeristic health behaviors, such as finding out about a new provider's qualifications. Self-management behaviors associated with specific conditions are also significantly associated with measured activation levels. For instance, diabetics with higher activation are more likely to keep a glucose journal, more-activated arthritics are more likely to exercise, and among those with high cholesterol, those with higher activation are more likely to follow a low-fat diet. Finally, those with higher activation indicate a lower degree of fatalism about their health.

These findings indicate that the measure has a high degree of construct and criterion validity. Future work is needed to determine the predictive validity of the measure, its sensitivity to detect changes in underlying behavior, and the types of interventions that help people move up the activation scale. Research is underway to assess predictive validity and sensitivity to changes in self-care behaviors.


There is wide agreement that engaging patients to be an active part of the care process is an essential element of the quality of care. Any serious attempt to improve this aspect of care will require three essential steps: (1) the development of a measure to assess patient activation; (2) the identification and use of evidence-based interventions to increase patient activation; and (3) a method to hold providers and delivery systems accountable for supporting and increasing patient activation. The first step of developing a measure is necessary before the other two steps can be attempted.

The Patient Activation Measure (PAM) appears to be a valid and reliable instrument to measure activation. The measure has strong psychometric properties and appears to tap into the developmental nature of activation. Because the measure is highly reliable at the person level, it is possible to use it on an individual patient basis to diagnose activation and individualize care plans. Moreover, because the measure maintains precision across different demographic and health status groups, it can also be used at the aggregate level to evaluate and compare the efficacy of interventions and health care delivery systems.

It is not unreasonable to expect that providers delivering high-quality care would have, over time, more-activated patients. Changes in the activation levels of patient populations might be used as an indicator of the performance of providers or delivery systems, and be employed for quality assessment and public accountability purposes. Consumers will likely want to know which providers and systems are performing well in this area and comparative data could drive purchaser and consumer choices.

The PAM may be useful both for designing interventions and for evaluating them. The measure can be used in a clinical setting to assess individual patients and to develop care plans tailored to each patient and integrated into the processes of their care. Because the measure is developmental, interventions could be tailored to the individual's stage of activation. For example, those at early stages of activation would need interventions designed to increase knowledge about their condition and their treatments. Patients at later stages would need interventions designed to increase their skills and confidence in the different self-management tasks. As patients advance in activation, the type of interventions that will be helpful to them will also change. The approach is economical because it is targeted rather than omnibus. Employers could also use the measure to assess interventions designed to increase engagement and activation among their employees. In summary, wide use of a precise, valid, and useful measure is the first step toward the goal of informed and engaged patients and ultimately to more effective and efficient delivery systems. The measurement properties of the Patient Activation Measure (PAM), when assessed using the stringent Rasch model, suggest that it could fulfill that role.

Having a valid and reliable measure is the very first step in understanding patient activation and its role in health care quality, outcomes, and cost containment. Of course, the validity of the measure is limited by our current level of understanding of activation. As our understanding of the construct increases through the use of the measure, it should be anticipated that refinement of the measure will be necessary.

Table 1: Preliminary (from Stage 2) 21-Item Activation Measure with

Item                                    Calibration  SEM  Infit  Outfit

How much do you know about why you are     40.3      1.4   1.12     1.2
  supposed to take each of your
  prescribed medicines?
Taking an active role in my own care       41.0      1.5   1.15    1.11
  is the most important factor in
  determining my health and ability to
How much do you know about the             42.4      1.4   1.33    1.14
  lifestyle changes, like diet and
  exercise, that are recommended for
  your condition?
How much do you know about the nature      44.3      1.4   1.28    1.28
  and causes of your health
How confident are you that you can         45.9      1.3   1.40    1.33
  tell your health care provider
  concerns you have even when he/she
  does not ask?
How much do you know about how to          46.2      1.3   0.90    0.82
  prevent further problems with your
Even if I make the changes in diet and     47.0      1.3   1.13    1.13
  exercise recommended for my condition,
  it won't make any difference to my
How much do you know about self-           47.9      1.3   1.20    1.06
  treatment approaches for your
How much do you know about the medical     48.9      1.3   1.20    1.12
  treatment options available for your
How confident are you that you can         48.9      1.2   1.10    1.03
  find trustworthy sources of
  information about your health
  condition and your health choices?
How confident are you that you can         50.0      1.2   0.87    0.81
  follow through on medical treatments
  you need to do at home?
How confident are you that you can         50.2      1.2   0.92    1.10
  identify when it is necessary to get
  medical care and when you can handle
  the problem yourself?
How confident are you that you can         51.2      1.2   0.92    0.88
  take actions that will help prevent
  or minimize some symptoms or problems
  associated with your condition?
How confident are you that you can         52.9      1.2   0.88    0.90
  follow through on medical
  recommendations your health care
  provider makes such as changing your
  diet or doing regular exercise?
To what extent are you able to handle      54.4      1.2   1.02    1.01
  symptoms on your own at home?
How well have you been able to             55.2      1.2   0.73    0.74
  maintain these lifestyle changes?
To what extent have you made the           56.4      1.2   0.74    0.73
  changes in your lifestyle, like diet
  and exercise, that are recommended for
  your condition?
Maintaining the lifestyle changes that     57.0      1.1   0.76    0.76
  have been recommended for my
  condition is too hard to do on a
  daily basis.
Even if I'm dissatisfied, it is            57.7      1.1   1.04    1.12
  usually too much of a hassle to
  change health care providers.
How confident are you that you can         57.7      1.1   0.74    0.73
  figure out solutions when new
  situations or problems arise with
  your condition?
How confident are you that you can         59.5      1.1   1.02    1.04
  keep the symptoms of your disease
  from interfering with the things
  you want to do?

Ordering is by difficulty calibration.

SEM: SEM is the standard error of measurement in estimation of the item
difficulty. SEM is the precision of the item difficulty estimation and
is shown in 0-100 units.

Infit: Infit mean square error is one of two quality control fit
statistics assessing item dimensionality (the degree to which the item
falls on the same single, real number line as the rest of the items).
Infit is an information-weighted residual of observed responses from
model expected responses and is most sensitive to items located near
the person's scale location.

Outfit: Outfit mean square error fit statistic is most sensitive to
item dimensionality when the item scale location is distant from the
person's scale location.
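
The two fit statistics defined in the notes above can be sketched for a single item as follows. This is a simplified illustration that assumes the dichotomous Rasch model (the PAM items use ordered rating categories, which require the polytomous form), and the function names and inputs are hypothetical.

```python
import math


def rasch_prob(theta, delta):
    """Probability of endorsing an item: person measure theta, item difficulty delta (logits)."""
    return 1.0 / (1.0 + math.exp(-(theta - delta)))


def item_fit(responses, thetas, delta):
    """Return (infit, outfit) mean squares for one dichotomous item.

    responses : 0/1 observed responses, one per person
    thetas    : person measures in logits, same order as responses
    """
    sq_resid, variances = [], []
    for x, theta in zip(responses, thetas):
        p = rasch_prob(theta, delta)
        variances.append(p * (1.0 - p))   # model variance (information) of the response
        sq_resid.append((x - p) ** 2)     # squared residual from the model expectation
    # Outfit: unweighted mean of squared standardized residuals.
    outfit = sum(r / w for r, w in zip(sq_resid, variances)) / len(sq_resid)
    # Infit: squared residuals weighted by information.
    infit = sum(sq_resid) / sum(variances)
    return infit, outfit


# Hypothetical data: five people answering one item of difficulty 0.0 logits.
print(item_fit([1, 0, 1, 1, 0], [1.2, -0.8, 0.3, 2.0, -1.5], 0.0))
```

Because outfit divides each squared residual by its own variance, an unexpected response from a person far from the item's location (where the variance is small) inflates it sharply, whereas infit is dominated by people located near the item.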


The authors wish to acknowledge The Robert Wood Johnson Foundation, which provided funding for this work. Also acknowledged are Sarah Jane Satre and Summer Meyer of the PeaceHealth Methods, Outcomes Measurement and Statistics Team for conducting data collection, and the eighteen experts who participated in the consensus process.


(1.) For the changes in items between phase 2 and phase 3, see online-only Appendix A, Note 1. The Appendix is available at

(2.) The PAM can be scored using a Rasch score table that converts curvilinear summated raw scores to linear, interval scores. This is essential to obtain accurate scores. To obtain a copy of the score table and instructions, contact the first author.

(3.) For explanation of differences in item wording depending on chronic disease status, see online-only Appendix A, Note 2.

(4.) For exact wording of fatalism item, see online-only Appendix A, Note 3.


Andrich, D. 1978. "A Rating Formulation for Ordered Response Categories." Psychometrika 43 (4): 561-73.

Bandura, A. 1991. "Self-Efficacy Mechanism in Physiological Activation and Health-Promoting Behavior." In Adaption, Learning and Affect, edited by J. Madden, S. Matthysse, and J. Barchas, pp. 226-69. New York: Raven Press.

Bodenheimer, T., K. Lorig, H. Holman, and K. Grumbach. 2002. "Patient Self-Management of Chronic Disease in Primary Care." Journal of the American Medical Association 288 (19): 2469-75.

Bond, T., and C. Fox. 2001. Applying the Rasch Model: Fundamental Measurement in the Human Sciences. Mahwah, NJ: Erlbaum.

Day, J., C. W. Bodmer, and O. M. Dunn. 1996. "Development of a Questionnaire Identifying Factors Responsible for Successful Self-management of Insulin-Treated Diabetes." Diabetic Medicine 13 (6): 564-73.

DiClemente, C. C., J. O. Prochaska, S. K. Fairhurst, W. F. Velicer, M. M. Velasquez, and J. S. Rossi. 1991. "The Process of Smoking Cessation: An Analysis of Precontemplation, Contemplation, and Preparation Stages of Change." Journal of Consulting and Clinical Psychology 59 (2): 295-304.

Gabel, J. R., A. T. Lo Sasso, and T. Rice. 2002. "Consumer-Driven Health Plans: Are They More Than Talk Now?" Health Affairs Web exclusive, available at Exclusives/2201Gabel.pdf.

Glasgow, R. E., M. M. Funnell, A. E. Bonomi, C. Davis, V. Beckham, and E. H. Wagner. 2002. "Self-Management Aspects of the Improving Chronic Illness Care Breakthrough Series: Implementation with Diabetes and Heart Failure Teams." Annals of Behavioral Medicine 24 (2): 80-7.

Glasgow, R. 2002. "Technology and Chronic Care." Paper presented at the Congress on Improving Chronic Care: Innovations in Research and Practice. September 8-10, Seattle, Washington.

Greenfield, S., S. Kaplan, and J. E. Ware. 1985. "Expanding Patient Involvement in Care. Effects on Patient Outcomes." Annals of Internal Medicine 102 (4): 520-8.

Greenfield, S., S. Kaplan, J. E. Ware, E. M. Yano, and H. J. Frank. 1988. "Patients' Participation in Medical Care: Effects on Blood Sugar Control and Quality of Life in Diabetes." Journal of General Internal Medicine 3 (5): 448-57.

Grembowski, D. E., D. L. Patrick, P. Diehr, M. Durham, S. Beresford, E. Kay, and J. Hecht. 1993. "Self-Efficacy and Health Behavior among Older Adults." Journal of Health and Social Behavior 34 (2): 89-104.

Hibbard, J. H., J. J. Jewett, S. Engelmann, and M. Tusler. 1998. "Can Medicare Beneficiaries Make Informed Choices?" Health Affairs 17 (6): 181-93.

Hibbard, J. H., M. Greenlick, H. Jimison, J. Capizzi, and L. Kunkel. 2001. "The Impact of a Community-Wide Self-Care Information Project on Self-Care and Medical Care Utilization." Evaluation and the Health Professions 24 (4): 404-23.

Isaacs, S. L. 1996. "Consumer's Information Needs: Results of a National Survey." Health Affairs 15 (4): 31-41.

Kahn, D. A., J. P. Docherty, D. Carpenter, and A. Frances. 1997. "Consensus Methods in Practice Guideline Development: A Review and Description of a New Method." Psychopharmacology Bulletin 33 (4): 631-9.

Kaplan, S., S. Greenfield, and J. E. Ware. 1989. "Assessing the Effects of Physician-Patient Interactions on the Outcomes of Chronic Disease." Medical Care 27 (3, supplement): S110-27.

Linacre, J. M. 2002. Winsteps Manual. Chicago: Winsteps.

Lorig, K. 1996. Outcome Measures for Health Education and Other Health Care Interventions. Thousand Oaks, CA: Sage.

Lorig, K. R., D. S. Sobel, A. L. Stewart, B. W. Brown, A. Bandura, P. Ritter, V. M. Gonzalez, D. D. Laurent, and H. R. Holman. 1999. "Evidence Suggesting That a Chronic Disease Self-Management Program Can Improve Health Status While Reducing Hospitalization: A Randomized Trial." Medical Care 37 (1): 5-14.

Marshall, M. N., P. G. Shekelle, R. H. Brook, and S. Leatherman. 2000. "Use of Performance Data to Change Physician Behavior." Journal of the American Medical Association 284 (9): 1079.

Massof, R. W. 2002. "The Measurement of Vision Disability." Optometry and Vision Science 79 (8): 516-52.

O'Leary, A. 1985. "Self-Efficacy and Health." Behaviour Research and Therapy 23 (4): 437-51.

Prochaska, J. O., C. A. Redding, and K. E. Evers. 1997. "The Transtheoretical Model and Stages of Change." In Health Behavior and Health Education, 2d ed., edited by K. Glanz, F. M. Lewis, and B. K. Rimer, pp. 60-84. San Francisco: Jossey-Bass.

Rasch, G. 1960. Probabilistic Models for Some Intelligence and Attainment Tests (reprint, with Foreword and Afterword by B. D. Wright, Chicago: University of Chicago Press, 1980). Copenhagen, Denmark: Danmarks Paedogogiske Institut.

Smith, R. M. 1996. "Polytomous Mean-Square Fit Statistics." Rasch Measurement Transactions 10 (3): 516-7.

Thorndike, R. M., and E. P. Hagen. 1991. Measurement and Evaluation in Psychology and Education. New York: Macmillan.

Von Korff, M., J. E. Moore, K. Lorig, D. C. Cherkin, K. Saunders, V. M. Gonzalez, D. Laurent, C. Rutter, and F. Comite. 1998. "A Randomized Trial of a Lay Person-Led Self-Management Group Intervention for Back Pain Patients in Primary Care." Spine 23 (23): 2608-15.

Von Korff, M., J. Gruman, J. Schaefer, S. J. Curry, and E. H. Wagner. 1997. "Collaborative Management of Chronic Illness." Annals of Internal Medicine 127 (12): 1097-102.

Von Korff, M., W. Katon, T. Bush, E. H. Lin, G. E. Simon, K. Saunders, E. Ludman, E. Walker, and J. Unutzer. 1998. "Treatment Costs, Cost Offset, and Cost-Effectiveness of Collaborative Management of Depression." Psychosomatic Medicine 60 (2): 143-9.

Wallston, K. A., M. J. Stein, and C. A. Smith. 1994. "Form C of the MHLC Scales: A Condition-Specific Measure of Locus of Control." Journal of Personality Assessment 63 (3): 534-53.

Wasson, J. H., T. A. Stukel, J. E. Weiss, R. D. Hays, A. M. Jette, and E. C. Nelson. 1999. "A Randomized Trial of the Use of Patient Self-Assessment Data to Improve Community Practices." Effective Clinical Practice 2 (1): 1-10.

Winsteps. 2002. Winsteps: Rasch Model Statistical Software (Version 337). Chicago: Winsteps.

Wright, B. D., and G. Masters. 1982. Rating Scale Analysis. Chicago: Mesa Press.

Wright, B. D., and M. H. Stone. 1979. Best Test Design. Chicago: Mesa Press.

Address correspondence to Judith Hibbard, Dr.P.H., Professor of Health Policy, University of Oregon-1209, Department of Planning, Public Policy, and Management, 119 Hendricks Hall, Eugene, OR 97403-1209. Jean Stockard, Ph.D., is a Professor at the University of Oregon. Eldon Mahoney, Ph.D., is Director, Survey Research and Development, PeaceHealth, Bellevue, Washington. Martin Tusler, M.S., is a data analyst at the Department of Planning, Public Policy, and Management, University of Oregon.
COPYRIGHT 2004 Health Research and Educational Trust
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2004 Gale, Cengage Learning. All rights reserved.

Title Annotation:Methods
Author:Hibbard, Judith H.; Stockard, Jean; Mahoney, Eldon R.; Tusler, Martin
Publication:Health Services Research
Geographic Code:4EUUK
Date:Aug 1, 2004