Printer Friendly
The Free Library
5,677,471 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Reliability of clinical pressure-pain algometric measurements obtained on consecutive days.


Algometry is a method of quantifying soft

tissue tenderness. An algometer registers the

force (in kilograms per square centimeter centimeter (sĕn`tĭmē'tər), abbr. cm, unit of length equal to 0.01 meter, the basic unit of length in the metric system. The centimeter is the unit of length in the cgs system. It is approximately equal to 0. )

that is applied to the tissues via a small

rubber footplate footplate /foot·plate/ (-plat) the flat portion of the stapes, which is set into the oval window on the medial wall of the middle ear.

foot·plate
n.
1. See base of stapes.

2.
. The force that is recorded is usually

the amount of pressure that causes pain, called the

pressure-pain threshold (PPT). Normal ranges of PPT have

been established for some muscles and for the sites of

some bony prominences.[1] The PPT has been used with

individuals without pain to assess the hypoalgesic effect

of physical therapy modalities Modalities
The factors and circumstances that cause a patient's symptoms to improve or worsen, including weather, time of day, effects of food, and similar factors.
. For instance, laser

therapy has been applied to normal peripheral sensory

nerves, and PPT was compared before and after the

intervention.[2]

Algometry in Delayed-Onset Muscle Soreness

Algometers have been used to measure tenderness

associated with inflammatory conditions.[1] Accumulation of

fluid in intracellular intracellular /in·tra·cel·lu·lar/ (-sel´u-ler) within a cell or cells.

in·tra·cel·lu·lar
adj.
Occurring or situated within a cell or cells.
 or extracellular extracellular /ex·tra·cel·lu·lar/ (-sel´u-lar) outside a cell or cells.

ex·tra·cel·lu·lar
adj.
Located or occurring outside a cell or cells.
 spaces as a

consequence of injury raises tissue pressure and lowers PPT.[3,4]

Delayed-onset muscle soreness (DOMS DOMS Director of Military Support
DoMS Department of Management Studies
DOMS Delayed Onset Muscular Soreness
DOMS Directorate Of Military Support
DOMS Digital Objects Management System
DOMS Diploma in Ophthalmic Medicine & Surgery
)[3,5] is a condition

that occurs when untrained muscles perform strenuous stren·u·ous  
adj.
1. Requiring great effort, energy, or exertion: a strenuous task.

2. Vigorously active; energetic or zealous.


exercise. This condition develops between 24 and 48

hours after exercise, and it can be recognized by the

presence of pain on stretching, loss of force, stiffness,

and tenderness in the affected muscles. Blood enzyme

analysis[3] and muscle biopsy In medicine, a muscle biopsy is a procedure in which a piece of muscle tissue is removed from an organism and examined microscopically. A biopsy needle is usually inserted into a muscle, wherein a small amount of tissue remains. [6] reveal that there are

morphological mor·phol·o·gy  
n. pl. mor·phol·o·gies
1.
a. The branch of biology that deals with the form and structure of organisms without consideration of function.

b.
 and biochemical changes biochemical changes (bī·ō·keˈmik·  in untrained

muscles following strenuous eccentric eccentric, in mechanics, device for changing rotary to back-and-forth motion. A disk is mounted off center on a shaft. One flat, open, circular end of a rod fits around the edge of the disk; the other end is usually attached to a block that slides in a slot.  exercise. Disruption disruption /dis·rup·tion/ (dis-rup´shun) a morphologic defect resulting from the extrinsic breakdown of, or interference with, a developmental process.

of myofibrils as well as supporting connective connective - An operator used in logic to combine two logical formulas. See first order logic.  tissue has

been noted.[7] Some authors[7] have noted the presence of

cellular infiltrates Infiltrates
Cells or body fluids that have passed into a tissue or body cavity.

Mentioned in: Eosinophilic Pneumonia
, including neutrophils neutrophils (ner·ō·trōˑ·filz),
n.pl white blood cells with cytoplasmic granules that consume harmful bacteria, fungi, and other foreign materials.
, macrophages Macrophages
White blood cells whose job is to destroy invading microorganisms. Listeria monocytogenes avoids being killed and can multiply within the macrophage.
,

and other inflammatory mediators. Tenderness is

thought to be due to swelling swelling /swell·ing/ (swel´ing)
1. transient abnormal enlargement of a body part or area not due to cell proliferation.

2. an eminence, or elevation.
 in the myofibrils and

extracellular space.

Delayed-onset muscle soreness can be induced for

experimental purposes. The premise for using DOMS in

research is that it results in a controlled injury being

imposed on the muscles. This control allows the time

course of the response to be examined, either under

different exercise conditions or under different

treatment conditions. We believe that algometers are quick

and safe to administer and are preferred over invasive

procedures as a daily measure of the effects of strenuous

exercise. The absence of any abnormal tenderness in the

muscle prior to inducing DOMS is a prerequisite pre·req·ui·site  
adj.
Required or necessary as a prior condition: Competence is prerequisite to promotion.

n.
 of the

model.

Algometry has been used to monitor symptoms of

experimental DOMS. Jones et al[3] induced DOMS in the biceps

brachii muscle and used PPT and goniometry goniometry /go·ni·om·e·try/ (go?ne-om´e-tre) the measurement of angles, particularly those of range of motion of a joint.

goniometry

the measurement of range of motion in a joint.
 to

measure tenderness and stiffness, respectively, following the

exercise. Pressure-pain threshold has also been used to

assess differences in development of DOMS. Newham et

al,[5] for example, induced DOMS in the quadriceps quadriceps /quad·ri·ceps/ (kwod´ri-seps) having four heads.

quad·ri·ceps
n.
The large four-part extensor muscle at the front of the thigh.

adj.


femoris muscle and used PPT to compare the

distribution of tenderness and the degree of tenderness induced

by two different exercise protocols. Pressure-pain

threshold also has been used to assess the effect of treatment

on DOMS. Hasson and colleagues,[4,8] for example,

investigated the effect of ultrasound ultrasound or sonography, in medicine, technique that uses sound waves to study and treat hard-to-reach body areas. In scanning with ultrasound, high-frequency sound waves are transmitted to the area of interest and the returning echoes recorded  and dexamethasone dexamethasone /dex·a·meth·a·sone/ (dek?sah-meth´ah-son) a synthetic glucocorticoid used primarily as an antiinflammatory in various conditions, including collagen diseases and allergic states; it is the basis of a screening test in the

iontophoresis iontophoresis /ion·to·pho·re·sis/ (i-on?to-fah-re´sis) the introduction of ions of soluble salts into the body by means of electric current.iontophoret´ic

i·on·to·pho·re·sis
n.
 on DOMS in the quadriceps femoris Noun 1. quadriceps femoris - a muscle of the thigh that extends the leg
musculus quadriceps femoris, quadriceps, quad

extensor, extensor muscle - a skeletal muscle whose contraction extends or stretches a body part


muscle. They used PPT and a pain measure to evaluate

DOMS over a 48-hour period after exercise. Intervention

with ultrasound reduced the symptoms of DOMS, as

evidenced by less pain being reported and higher PPT

values in a treatment group than in a control group.

Dexamethasone iontophoresis was found to be

ineffective for the treatment of DOMS using the same outcome

measures. Jones et al,[3] Hasson et al,[4] and Newham et al[5]

reported changes in PPT measurements obtained after

exercise as compared with PPT measurements obtained

before exercise.

In none of the studies discussed was a non-DOMS

control group used to examine whether PPT changed as

a result of measurement procedures alone.

Pressure-pain threshold testing involves forcible forc·i·ble  
adj.
1. Effected against resistance through the use of force: The police used forcible restraint in order to subdue the assailant.

2. Characterized by force; powerful.
 probing of the

muscle surface. In individuals without pain, the PPT in

muscle may be as high as 11 kg/[cm.sup.2]. We have observed

that this amount of pressure may cause bruising bruising

discoloration and actual hemorrhage at the site of injury, and a serious disadvantage in the meat trade. In the first 12 hours after injury the bruise is bright red, at 24 hours it is dark red, at 24 to 36 hours it loses its firm consistency and becomes watery and at 3 or
. We

wondered, therefore, whether algometry at high

pressures over the same site daily might lead to progressive

lowering of PPT. We have not found studies addressing

the reliability of algometric measurements over

consecutive days. Thus, the reliability of PPT as an outcome

measure of DOMS has not been established.

Reliability Issues in Algometry

Fischer[1] studied the reliability of algometric

measurements in 10 muscles of 50 subjects without pain on a

single occasion. On the basis that there was no

difference between single measurements of corresponding

muscles on opposite sides of the body, Fischer

concluded that PPT was reproducible re·pro·duce  
v. re·pro·duced, re·pro·duc·ing, re·pro·duc·es

v.tr.
1. To produce a counterpart, image, or copy of.

2. Biology To generate (offspring) by sexual or asexual means.
 and proposed a range

of normal values normal values
pl.n.
A set of laboratory test values used to characterize apparently healthy individuals, now replaced by reference values.
. He noted that PPT varied between

individual muscles. The quadriceps femoris and biceps

brachii muscles are the muscles that are examined most

often in DOMS research. Fischer studied the quadriceps

femoris muscle, but we have not found any investigation

of normal PPT in the biceps brachii muscle
For other uses, see biceps.


In human anatomy, the biceps brachii is a muscle located on the upper arm. The biceps has several functions, the most important simply being to flex the elbow and to rotate the forearm.
. Abnormal

tenderness is an exclusion criterion for studies involving

DOMS.

Some authors[9-12] tested the reliability of repeated

measurements of PPT. They demonstrated that, although

measurements were not precise, differences between

trials did not exist. Reliability was confirmed by different

authors for several patterns of repetition of PPT,

including 10 to 50 consecutive measurements,[9,11] trials 45

minutes apart,[9] trials 1 hour apart,[10] and trials 1 week

apart.[11] Marking test sites was thought to be one method

of improving the reliability of PPT measurements.[10] The

reaction time of the examiner and variation in the rate

of pressure increase were other factors that affected

reliability.[9]

Nonelectronic algometers, such as the Fischer

algometer,(*) depend on the operator to control the rate of

pressure increase. Fischer[1] recommended a rate of

1 kg/[cm.sup.2]/s. Jensen et al[9] emphasized the importance

of increasing pressure at a standardized standardized

pertaining to data that have been submitted to standardization procedures.


standardized morbidity rate
see morbidity rate.

standardized mortality rate
see mortality rate.
 rate, based on

their finding that higher PPT scores were recorded at

higher application rates. Some authors[9-l3] used

electronic algometers to reduce variation in the rate of

pressure increase; the electronic tool provides

examiners with visual cues to improve their timing. Another

advantage of an electronic algometer is that the reaction

time of the examiner is eliminated; on reaching the pain

threshold, the subject activates a button to release

pressure. Jensen et al[9] thought that measurements of PPT

were most reliable when the measurement site was flat,

broad, and bony as opposed to a soft tissue site where the

footplate might slide off the target.

Kosek et al[12] used an electronic algometer and studied

three trials of PPT. In contrast to other authors, they

found a decrease in PPT between trials done 10 seconds

apart and an increase in PPT between trials done 20 to

30 minutes apart. The mean PPT of the three trials,

however, was not different from the mean PPT of three

trials after a 1-week interval. Other investigators[11,13] also

used the mean of multiple trials as a criterion score to

reduce variation across occasions.

Ohrbach and Gale[13] carried out a study to determine the

number of measurements that gave the best estimate of

PPT. They used an electronic algometer and measured

facial muscles facial muscles,
n See muscles, facial.
 five times each, at 4- to 5-minute intervals.

They found that PPT increased and decreased

unsystematically Adv. 1. unsystematically - in an unsystematic manner; "his books were lined up unsystematically on the shelf"
consistently, systematically - in a systematic or consistent manner; "they systematically excluded women"
 from trial to trial but that there was a correlation

between pairs of trials (Pearson r=.81-.91). Combining

trials showed that the mean of trials 1 and 2 provided a

more reliable estimate of PPT than either trial alone,

and the authors reported that more than three trials was

not justified by their data.

Merskey and Spear[14] investigated PPT using a

non-electronic algometer. They measured PPT twice on two

separate occasions. There were no differences across the

four trials. Their results, however, appear to support the

idea that an electronic algometer provides more reliable

measurements, because they reported a lower

between-trial correlation (Pearson r=.65) than that reported by

Orbach and Gale,[13] who used an electronic instrument.

In the same work, Merskey and Spear[14] examined the

interrater reliability of PPT measurements. They reported

that there was no difference between examiners, although

there was a tendency for one examiner to score higher

than the other examiner. The correlation between

examiners was reported as Pearson r=.59. In spite of the low

Pearson correlation coefficient Correlation Coefficient

A measure that determines the degree to which two variable's movements are associated.

The correlation coefficient is calculated as:
, the authors stated that the

degree of reliability in their study supported the use of PPT

for investigation of the efficacy of analgesia analgesia /an·al·ge·sia/ (an?al-je´ze-ah)
1. absence of sensibility to pain.

2. the relief of pain without loss of consciousness.
.

Delaney and McKee[15] also examined the interrater and

intrarater reliability of PPT measurements. They used a

Fischer algometer. The results of their preliminary work

showed lower correlation between examiners (Pearson

r [is less than] .28) than that reported by Merskey and Spear.[14] They

attributed the finding to a difference in rate of pressure

increase, and they addressed the problem prior to

another study by training examiners to apply pressure

while being timed. Pain-pressure threshold was then

measured by two examiners alternately, at 5-minute

intervals, for a total of four trials per point.

Standardizing the timing of force application appears to have been

an effective strategy because high interrater and

intrarater reliability were reported in their final study

(intraclass correlation In statistics, the intraclass correlation (or the intraclass correlation coefficient[1]) is a measure of correlation, consistency or conformity for a data set when it has multiple groups.  coefficients [ICCs]=.80-.92).

In summary, algometric measurements have been shown

to have good interrater and intrarater reliability when

the measurements were performed once or repeatedly

(2-50 repetitions) on a single day, at weekly intervals

(1-5 weeks), and at longer intervals (8-12 weeks).[8-14]

The reliability of measurements taken over consecutive

days has not been studied. Investigators have suggested

that electronic algometers provide more reliable

measurements than do nonelectronic algometers.[9-12] The

latter type of instrument, however, is more convenient to

use and is more commonly available.

Our study was designed to (1) examine the range of

"normal" PPT in the biceps brachii muscle, (2)

reexamine re·ex·am·ine also re-ex·am·ine  
tr.v. re·ex·am·ined, re·ex·am·in·ing, re·ex·am·ines
1. To examine again or anew; review.

2. Law To question (a witness) again after cross-examination.
 the intertrial and interrater reliability of PPT

measurements using a nonelectronic algometer on

asymptomatic a·symp·to·mat·ic
adj.
Exhibiting or producing no symptoms.


Asymptomatic
Persons who carry a disease and are usually capable of transmitting the disease but, who do not exhibit symptoms of the disease are said to be
 muscle over 3 consecutive days, and (3) establish

the number of measurements needed for the best

estimate of PPT.

Method and Materials

Two examiners participated in the study. They were

physical therapists with many years of clinical experience

but no prior experience using an algometer. One week

prior to the study, the examiners practiced using an

algometer while being timed. The standard was to

increase pressure linearly to 5 kg/[cm.sup.2] over 5 seconds

according to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 the method recommended by Fischer.[1] Ten

practice trials were performed by each examiner. A

Fischer algometer was used for the practice and test

trials. The instrument has a 1-[cm.sup.2] rubber footplate and

a scale marked from 2 to 20 kg/[cm.sup.2], in increments of

0.2 kg/[cm.sup.2]. A new instrument was acquired for the

purpose of the study. No calibration calibration /cal·i·bra·tion/ (kal?i-bra´shun) determination of the accuracy of an instrument, usually by measurement of its variation from a standard, to ascertain necessary correction factors.  was performed.

Thirty-five subjects without complaints of pain

volunteered and gave informed consent to participate in the

study. There was an imbalance imbalance /im·bal·ance/ (im-bal´ans)
1. lack of balance, such as between two opposing muscles or between electrolytes in the body.

2. dysequilibrium (2).
 of female subjects in the

sample (Tab. 1). The PPT of the biceps brachii muscle in

the nondominant arm of each subject was measured on

3 consecutive days by each examiner.

[TABULAR tab·u·lar
adj.
1. Having a plane surface; flat.

2. Organized as a table or list.

3. Calculated by means of a table.



tabular

resembling a table.
 DATA 1 NOT REPRODUCIBLE IN ASCII ASCII or American Standard Code for Information Interchange, a set of codes used to represent letters, numbers, a few symbols, and control characters. Originally designed for teletype operations, it has found wide application in computers. ]

Subjects were seated with their test arm positioned on a

padded support in 90 degrees of horizontal abduction Abduction
Balfour, David

expecting inheritance, kidnapped by uncle. [Br. Lit.: Kidnapped]

Bertram, Henry

kidnapped at age five; taken from Scotland. [Br. Lit.
,

with full elbow extension and forearm forearm /fore·arm/ (for´ahrm) antebrachium; the part of the arm between elbow and wrist.

fore·arm
n.
The part of the arm between the wrist and the elbow.
 supination supination /su·pi·na·tion/ (soo?pi-na´shun) [L. supinatio ] the act of assuming the supine position, or the state of being supine. . The

upper arm was measured, and the skin overlying overlying

suffocation of piglets by the sow. The piglets may be weak from illness or malnutrition, the sow may be clumsy or ill, the pen may be inadequate in size or poorly designed so that piglets cannot escape.
 the

biceps muscle belly was marked with indelible ink, at a

point one fourth of the distance from the elbow crease crease (kres) a line or slight linear depression.

flexion crease , palmar crease


to the lateral border of the acromion acromion /acro·mi·on/ (ah-kro´me-on) the lateral extension of the spine of the scapula, forming the highest point of the shoulder.

a·cro·mi·on
n.
. This mark

established the site for all testing. Each subject's non-test arm

was similarly measured and marked for the purpose of a

practice session to familiarize subjects with the sensation

of PPT.

Standardized instruction was given prior to each trial on

all occasions. Subjects were instructed to "report as soon

as the sensation of pressure changes to pain by saying

`pain,' and I will stop." The footplate of the algometer

was held perpendicular to the muscle belly with the

gauge turned away from the subject and the examiner.

Pressure was increased at a rate of approximately 1

kg/[cm.sup.2]/s until the subject reported "pain." The examiner

then released the pressure and lifted the algometer off

the muscle to read the gauge and record the

measurement. The needle on the gauge was returned to baseline

before each trial using the pressure-release button on

the algometer. Subjects were kept uninformed of their

scores throughout the study to prevent subject bias from

influencing the results.

The first examiner did three practice trials on each

subject's non-test arm. The practice trials were followed

by three trials of measuring PPT on the subject's test

arm, with 10-second intervals between trials. After a

20-minute interval, the procedure was repeated by the

second examiner, using the marked sites on the non-test

arm followed by the marked sites on the test arm. On

days 2 and 3, procedures were repeated on the test arm

only, using the same sequence and timing. Thus, each

subject's test arm was measured three times on each of 3

days (9 trials) by each examiner for a total of 18 trials.

For 20 subjects, the order of daily testing was examiner A

followed by examiner B. The order of examiners was

reversed for 15 subjects. During the study, examiners did

not have access to each other's scores or to their own

scores of previous days. No analysis was done until data

collection was complete.

Data Analysis

Intraclass correlational analyses (Shrout and Fleiss

formula, ICC ICC

See: International Chamber of Commerce
[2,1], a two-way random-effects layout[16]) were

used to estimate interrater, trial-to-trial, and day-to-day

reliability. Interrater reliability was estimated for each of

the nine trials and for scores derived from the mean

score of various combinations of trials.

Trial-to-trial reliability was estimated by correlating the

trial 1 and trial 2 scores and the trial 2 and trial 3 scores

each day, as well as by computing computing - computer  the correlation among

all three trials on each day. Day-to-day reliability was

estimated by computing the correlation between single

trials of like number in the sequence of daily trials.

Correlations were also calculated for day-to-day scores

derived from the mean score of a combination of

like-numbered trials in the sequence of daily trials.

A plot of the data showing the relationship in scores

between trials and between examiners suggested that the

data varied considerably from the line of equality. We

considered it necessary, therefore, to further analyze the

data using graphical techniques, as recommended by

Bland and Altman[17] and as described.

Graphical analysis: interrater reliability of single trials.

A subject's scores in a single trial recorded by each of the

two examiners were paired for comparison. The mean of

each pair was plotted against their difference. The

overall (n=35) mean difference (d) and standard

deviation DEVIATION, insurance, contracts. A voluntary departure, without necessity, or any reasonable cause, from the regular and usual course of the voyage insured.
     2.
 of the differences were calculated for each of the

nine trials.

Using the SAS (1) (SAS Institute Inc., Cary, NC, www.sas.com) A software company that specializes in data warehousing and decision support software based on the SAS System. Founded in 1976, SAS is one of the world's largest privately held software companies. See SAS System.  Univariate procedure,(dagger) differences were

found to be normally distributed. Most of the differences

(95%) could, therefore, be expected to lie between the

mean difference and approximately two standard

deviations (d [+ or -] 1.96SD), which was interpreted as the "limits

of agreement."[17]

Graphical analysis: interrater reliability of repeated

measurements. Each examiner calculated each

subject's mean score for two consecutive trials within the

same day. Means were computed for trials 1 and 2 and

trials 2 and 3 daily. Each subject's mean scores, derived

from like-numbered trials by each examiner, were paired

for comparison. The mean of the paired scores was

plotted against their difference. Overall mean difference

(n=35) and limits of agreement were calculated as

described previously.

A correction, according to the method of Bland and

Altman,[17] was applied to calculate the standard deviation In statistics, the average amount a number varies from the average number in a series of numbers.

(statistics) standard deviation - (SD) A measure of the range of values in a set of numbers.


of the differences between paired scores, which were

derived from the means of multiple measurements. The

correction was to compensate for removal of some of the

measurement error. The graphical technique was not

used for analysis of the mean of more than two repeated

measurements because of the risk of underestimation of

the standard deviation of the differences.

Graphical analysis: trial-to-trial and day-to-day reliability

of single trials. The scores of each examiner were

analyzed an·a·lyze  
tr.v. an·a·lyzed, an·a·lyz·ing, an·a·lyz·es
1. To examine methodically by separating into parts and studying their interrelations.

2. Chemistry To make a chemical analysis of.

3.
 separately. A subject's score in one trial was

paired for comparison with the score in the subsequent

trial on the same day to assess trial-to-trial reliability and

with the score in the like-numbered trial in the sequence

of trials on the subsequent day to assess day-to-day

reliability. Means and differences were plotted, and the

overall mean difference (n=35) and limits of agreement

were calculated as described previously.

Effect of order of examiners. A one-way analysis of

variance (ANOVA anova

see analysis of variance.

ANOVA Analysis of variance, see there
) on the mean difference between

examiners in trial 1 was used to assess whether the order

of examiners affected the results.

Results

A total of 630 PPT scores were collected. The mean PPT

and standard deviation are shown for each trial and each

examiner in Table 2. Some subjects had bruising at the

measurement site by the third day of the study.

Table 2.

Repeated Measurements of Pressure-Pain Threshold (PPT) for

35 Subjects

        PPT (kg/[cm.sup.2])

        Examiner A            Examiner B

Trial   X      SD             X      SD



Day 1

  1     3.41   0.98           3.27   1.37

  2     3.50   1.09           3.23   1.36

  3     3.54   1.23           3.26   1.46

Day 2

  1     3.41   1.32           3.05   1.16

  2     3.39   1.38           2.98   1.24

  3     3.39   1.44           3.12   1.48

Day 3

  1     3.39   1.28           3.01   1.15

  2     3.41   1.31           3.00   1.37

  3     3.44   1.34           3.09   1.52





Normative nor·ma·tive  
adj.
Of, relating to, or prescribing a norm or standard: normative grammar.



nor
 PPT Values for the Biceps Brachii Muscle

The mean PPT in the biceps brachii muscle was 4.63

kg/[cm.sup.2] (range=2.04-10.32) for the male subjects (90

scores) and 3.05 kg/[cm.sup.2] (range=1.81-6.80) for the

female subjects (540 scores).

Interrater Reliability

Examiner A recorded higher scores than those recorded

by examiner B in about 70% of the paired

measurements. As mean PPT increased, however, examiner A

tended to score increasingly lower than examiner B did.

At a mean PPT of approximately 7.0 kg/[cm.sup.2], examiner

A recorded a score that was 2.5 kg/[cm.sup.2] lower than the

score recorded by examiner B. On balance, however, the

mean difference between examiners (0.14 kg/[cm.sup.2] in

trial 1) was small.

Table 3 shows interrater ICCs (2,1) for single trials and

for mean scores derived from various combinations of

trials; all correlations were significant at P [is less than] .0001. Each

day reliability was lowest for the first of the single trials

and highest for the third of the single trials (ICC=.74-.89).

Table 3.

Intraclass Correlation Coefficients (ICC[2,1]) Between Two Examiners

for Measurements of Pressure-Pain Threshold for Single Trials or for

Scores Derived From the Mean of Multiple Trials for 35 Subjects

        Single   Interrater                     Interrater

        Trials   ICC          Multiple Trials   ICC



Day 1   1        .74           1, 2    (mean)   .81

        2         84           1, 2, 3 (mean)   .85

        3        .89           2, 3    (mean)   .88



Day 2   1        .75           1, 2    (mean)   .82

        2        .84           1, 2, 3 (mean)   .84

        3        .84           2, 3    (mean)   .86



Day 3   1        .78           1, 2    (mean)   .82

        2        .82           1, 2, 3 (mean)   .84

        3        .86           2, 3    (mean)   .85





Reliability improved when the mean

score of the three daily trials was used

rather than the score of the first or

second trial of the day. The highest

reliability, however, was seen when the

score of the first trial of each day was

omitted and the mean of the second

and third trials of the day

(ICC=.85-.88), or the score of the third trial alone

(ICC=.84-.89), was used as the

criterion score.

Figure 1 shows the relationship

between examiners of the scores

recorded in trial 1. The line of equality

is shown on which all points would lie if

the two examiners recorded identical

scores. The variation from the line of

equality shown in Figure 1 is typical of

the results of all the single trials and

prompted the additional analyses using

Bland and Altman's methods.[17]

[FIGURE 1, GRAPH OMITTED]

Figures 2 and 3 show the agreement

between examiners using Bland and

Altman's methods.[17] Each subject is

represented by a point (n=35) that

shows the difference in scores between

examiner's against their mean score.

Points on the zero line show perfect

agreement. The overall mean

difference between examiners is shown by a

broken line. The limits of agreement

are also shown (d [+ or -] 1.96SD).

[FIGURE 2 & 3, GRAPH OMITTED]

Figure 2 illustrates the added benefit of

using Bland and Altman's methods[17]

for the data obtained in trial 1. The

mean difference between examiners in

this trial was 0.14 kg/[cm.sup.2] (SD=0.86).

From the limits of agreement, it can be

projected that if one examiner

measures PPT once, a second examiner

would score the same subjects within

1.55 kg/[cm.sup.2] below and 1.83 kg/[cm.sup.2]

above the first examiner's

measurement 95% of the time. Agreement

between examiners was better in some

later trials than in trial 1. The limits of

agreement for day 2 of trial 3 for

example, were from -1.22 to +1.78 kg/[cm.sup.2].

There was little change in agreement between examiners

when the measure was derived from the mean score of

the first two trials rather than the scores of the first trial

each day. For example, when the measurement was

based on subjects' mean score for day 1 of trials 1 and 2,

the limits of agreement between examiners were from

-1.29 to +1.71 kg/[cm.sup.2].

On all 3 days, however, agreement between examiners

was best when the measurement was derived from the

subjects' mean score of trials 2 and 3. Figure 3 shows the

results for day 1; the limits of agreement lie between

-0.97 and +1.47 kg/[cm.sup.2].

Trial-to-Trial Reliability

Tables 4 and 5 show the trial-to-trial and day-to-day

reliability (ICC[2,1]) of measurements of PPT. With the

exception of day 1, trial-to-trial reliability was higher

between trials 2 and 3 than between trials 1 and 2.

Day-to-day reliability for a single measurement of PPT

was highest in trial 3, and day-to-day reliability for a

measurement derived from the mean of multiple trials

was highest for the mean of trials 2 and 3.

Table 4.

Trial-to-trial intraclass Correlation Coefficients (ICC[2,1]) for

Pressure-Pain Threshold,(a) Rated by Examiner B, for 35 Subjects

        Trials    ICC   Trials    ICC   Trials             ICC



Day 1   1 and 2   .98   2 and 3   .96   All trials (1-3)   .96

Day 2   1 and 2   .94   2 and 3   .95   All trials (1-3)   .93

Day 3   1 and 2   .94   2 and 3   .98   All trials (1-3)   .95





(a) Correlations are between single trials on the same day.

Table 5.

Intraclass Correlation Coefficients (ICC[2,1]) for Day-to-Day

Pressure-Pain Threshold,(a) Rated by Examiner B, for 35 Subjects

Trial                            ICC



Trial 1 x 3 days                 .88

Trial 2 x 3 days                 .88

Trial 3 x 3 days                 .89

Trials 2 and 3 (mean) x 3 days   .90





(a) Correlation are between single trials of like number in the sequence of

trials on different days or scores derived from the mean of multiple trials on

one day correlated cor·re·late  
v. cor·re·lat·ed, cor·re·lat·ing, cor·re·lates

v.tr.
1. To put or bring into causal, complementary, parallel, or reciprocal relation.

2.
 with the mean of multiple trials of like number in the

sequence of trials on subsequent days.

Figure 4 shows the trial-to-trial agreement of the scores

recorded by examiner B on day 1 of trials 2 and 3, using

Bland and Altman's method of graphical analysis.[17] The

results were similar on days 2 and 3. A point is plotted for

each subject (n=35) showing the difference in scores

between trials 2 and 3 against their mean. Perfect

agreement (zero line) and overall mean difference

(-0.03 kg/[cm.sup.2], SD=0.42) between the two trials are

shown. The limits of agreement lie between -0.86 and

+0.79 kg/[cm.sup.2]. Thus, the PPT in the third trial was

between 0.86 kg/[cm.sup.2] below and 0.79 kg/[cm.sup.2] above the

PPT in the second trial 95% of the time.

[FIGURE 4, GRAPH OMITTED]

The same methods of analysis for the scores recorded by

examiner A in trials 2 and 3 showed an overall mean

difference between trials of -0.04 kg/[cm.sup.2] (SD=0.35).

Limits of agreement were between -0.72 and +0.64

kg/[cm.sup.2].

Order of Examiners

The ANOVA revealed that the order in which the

examiners measured PPT had no effect on the

differences between their scores (P=.33).

Discussion

The purpose of our study was to establish the normal

range of PPT values in the biceps brachii muscle because

this muscle is frequently used in studies of

experimentally induced DOMS and to examine interrater,

trial-to-trial, and day-to-day reliability of algometric

measurements in healthy muscle. If PPT proved to be a stable

measure in the absence of pathology pathology, study of the cause of disease and the modifications in cellular function and changes in cellular structure produced in any cell, organ, or part of the body by disease. , then it could be

used as an outcome measure in studies of experimentally

induced DOMS.

We observed PPT in a group of individuals without pain

whose ages were within a fairly restricted range. The

group included a disproportionate dis·pro·por·tion·ate  
adj.
Out of proportion, as in size, shape, or amount.



dispro·por
 number of female

subjects. These factors might limit the applicability of

our results. The height and weight of the subjects were

fairly representative of the adult population. All subjects

tolerated 18 measurements over 3 days, although some

subjects showed bruising at the measurement site.

Fischer[1] suggested that PPT is reproducible between

individual subjects. He calculated PPT from the mean of

two measurements taken on contralateral contralateral /con·tra·lat·er·al/ (-lat´er-al) pertaining to, situated on, or affecting the opposite side.

con·tra·lat·er·al
adj.
 sides of the

body and examined the distribution of values in a study

of nine healthy muscles. Based on the standard deviation

from the average logarithmic logarithmic

pertaining to logarithm.


logarithmic relationship
when the logs of two variables plotted against each other create a straight line.
 values of the PPT findings

in his study, Fischer proposed that for diagnostic

purposes and for quantifying pain, 84.1% of mean PPT

should be considered a cutoff value for "normal."

Fischer did not study PPT in the biceps brachii muscle.

In our study, mean PPT for the biceps brachii muscle in

the female subjects was 3.05 kg/[cm.sup.2]. Using 84.1% of

mean PPT as a cutoff, as recommended by Fischer, the

lowest normal value for the biceps brachii muscle in

female subjects would be 2.44 kg/[cm.sup.2]. This value should

be taken into consideration when screening subjects for

admission to studies of DOMS involving the biceps

brachii muscle.

Intraclass correlation coefficients appear frequently in

the literature as an index of reliability. The examiners in

our study were not experienced in algometric

techniques. Thus, they represent examiners as broadly

defined, not a particular set of examiners, and we believe

that the random-effects model (ICC[2,1]) applies.[16]

Using a measurement derived from the

mean score of trials 2 and 3 daily, PPT

appears to yield reliable measurements

of muscle tenderness over a 20-minute

period and over 3 consecutive days,

according to ICC analyses. Moreover,

the ICCs suggest that two examiners

could be used interchangeably INTERCHANGEABLY. Formerly when deeds of land were made, where there Were covenants to be performed on both sides, it was usual to make two deeds exactly similar to each other, and to exchange them; in the attesting clause, the words, In witness whereof the parties have hereunto  to

measure PPT.

Delaney and McKee,[15] using the

Fischer algometer on muscle, also

reported lower reliability for the first of

two trials. They considered their

examiners to be experienced in algometric

techniques, and they timed their

examiners' rate of pressure application in an

attempt to improve reliability. The

reported reliability (ICC=.80-.92) was

similar to ours, and our examiners were

not timed during the testing. Ohrbach

and Gale[13] similarly concluded that

measurements obtained in their first

trial did not agree well with the mean

of five trials of PPT. On the basis of the

95% confidence interval confidence interval,
n a statistical device used to determine the range within which an acceptable datum would fall. Confidence intervals are usually expressed in percentages, typically 95% or 99%.
 around the mean of five trials,

they recommended the use of data recorded during trial

2 or the mean of trials 1 and 2 to estimate PPT. Our

results did not support the use of trial 2 alone or the

mean score of trials 1 and 2.

Some authors[9-12] have argued that the reliability of

measurements of PPT was improved by the use of

electronically controlled instruments. In our study, we

used a nonelectronic instrument. In spite of the

difficulty of maintaining the recommended rate of pressure

increase of 1 kg/[cm.sup.2]/s using our type of algometer,

there was still a positive correlation Noun 1. positive correlation - a correlation in which large values of one variable are associated with large values of the other and small with small; the correlation coefficient is between 0 and +1
direct correlation
 between trials in our

study, and our reliability was comparable to that

obtained with electronic algometers.

The use of correlation coefficients to assess the repeatability

of measurements is misleading, according to Bland and

Altman.[17] These authors noted that correlation coefficients

measure the relationship between two measurements, not

the agreement between them. Because the examiners in

our study measured the same subjects, and measured them

repeatedly, we would expect the scores between examiners

and between trials to be strongly related. In accordance Accordance is Bible Study Software for Macintosh developed by OakTree Software, Inc.[]

As well as a standalone program, it is the base software packaged by Zondervan in their Bible Study suites for Macintosh.


with the recommendations of Bland and Altman,[17] we

plotted the measurements of one examiner against those

of the other examiner to assess visually whether the data

varied from the line of equality. Bland and Altman[17] noted

that two sets of measurements that agree perfectly lie on

the line of equality; measurements that are highly correlated

lie along any straight line. A plot of our data (Fig. 1)

showed considerable variation around the line of equality,

especially at higher values of PPT. We believed, therefore,

that additional methods of analysis were indicated.

When we used graphical methods This is a list of graphical methods with a mathematical basis. Included are diagram techniques, chart techniques, plot techniques, and other forms of visualization.

There is also a list of computer graphics and descriptive geometry topics.
 to assess agreement,

our findings supported the opinion of Bland and

Altman[17] that high correlations can exist with concurrent

lack of good agreement between measurements (Figs.

1-4). Graphical analysis of the data provided

information about trials and raters that was not

obvious from the ICCs.

From the distribution of the

measurements around zero, it was apparent that

although the examiners agreed well on

average (ie, small mean difference),

there were quite large differences

between them for individual subjects

(Figs. 2-4). Furthermore, examination

of the relationship between differences

and means showed that the differences

were affected by the size of the

measurement, in that differences between

examiners were larger and in an

opposite direction, at high mean values of

PPT than at low mean values. The

change of direction of differences

between the examiners as mean PPT

increased was unexpected and warrants

some explanation. We speculated that

at low mean PPT, examiner A used a

faster rate of pressure application than

examiner B used. Jensen et al[9] noted

that higher rates tended to produce a

higher PPT. We think that the reason for this finding is

that examiner reaction time is slowed by high rates of

pressure increase, leading to overestimation o·ver·es·ti·mate  
tr.v. o·ver·es·ti·mat·ed, o·ver·es·ti·mat·ing, o·ver·es·ti·mates
1. To estimate too highly.

2. To esteem too greatly.
 of PPT. Why

then did examiner A not overestimate o·ver·es·ti·mate  
tr.v. o·ver·es·ti·mat·ed, o·ver·es·ti·mat·ing, o·ver·es·ti·mates
1. To estimate too highly.

2. To esteem too greatly.
 at high levels of

PPT? A self-limiting factor might have been that

examiner A was not able to maintain the high rate of pressure

increase at the highest levels of force encountered in our

study. This explanation would account for the change of

direction of differences between examiners.

Graphical methods, in contrast to ICCs, demonstrate

measurement error in units that are clinically

meaningful (eg, kilograms per square centimeter) so that the

consequence of differences between methods can be

assessed. For example, Figure 2 shows that the limits of

agreement between examiners based on a single rating

of PPT were -1.5 to +1.8 kg/[cm.sup.2]. Figure 3 shows that by

using the mean of multiple measurements, the

measurement error between examiners is reduced to -1.0 to

+1.5 kg/[cm.sup.2]. In our opinion, however, a difference

between examiners of up to 1.5 kg/[cm.sup.2] on a

measurement that has a "true" value (mean measurement of two

examiners) of 3 to 5 kg/[cm.sup.2] is large, especially in view of

our finding that the measurement error between trials

by one examiner was from -0.9 to +0.8 kg/[cm.sup.2] (Fig. 4).

Our findings from the graphical analysis suggest that

one examiner should perform all measurements of PPT.

Graphical analysis of trial-to-trial agreement (Fig. 4)

showed that there was no systematic bias of one trial

relative to another trial (consistent upward or downward

shift in PPT). Because the results were similar for all

trial-to-trial comparisons, we conclude that there was no

effect of repeated use of the algometer. There was also

no effect of size of measurement on the size or direction

of trial-to-trial differences. Trial-to-trial results were

similar for both examiners. Thus, we conclude that

examiners A and B were consistent in their individual

techniques, even though there were differences between

their scores.

Our study lends support to previous work that has shown

that measurements of PPT are highly reliable in

individuals without pain.[9-13] Reliability improves when three

trials are performed and data from the last two trials are

used to determine PPT. Variation in rate of pressure

increase may be the factor most affecting reliability. To

minimize this effect, we believe that testing should be

performed by one examiner.

The six PPT ratings that we performed daily for 3 days,

in two sets of three with a 20-minute interval each day is

typical of measurement in intervention studies intervention studies,
n.pl the epidemiologic investigations designed to test a hypothesized cause and effect relation by modifying the supposed causal factor(s) in the study population.
 involving

DOMS. Our testing procedure did not, in itself, effect a

change in PPT. We have shown that PPT can be used as

an outcome measure in the treatment of persons with

DOMS.

Conclusion

Measurements of PPT in healthy muscle obtained with a

simple nonelectronic algometer were reliable from trial

to trial within the same day and from day to day over 3

consecutive days. Measurements by one examiner were

more reliable than measurements between examiners.

We have demonstrated that reliability is improved when

the first of three trials is excluded for estimating the

"true" PPT. The algometer appears to have potential for

measuring day-to-day changes in soft tissue tenderness in

persons with DOMS.

(*) Pain Diagnostics and Thermography thermography (thûr'mŏg`rəfē), contact photocopying process that produces a direct positive image and in which infrared rays are used to expose the copy paper.  Inc, 233 E Shore Rd, Suite 106, Great

Neck, NY 11023.

(dagger) SAS Institute SAS Institute Inc., headquartered in Cary, North Carolina, USA, has been a major producer of software since it was founded in 1976 by Anthony Barr, James Goodnight, John Sall and Jane Helwig.  Inc, SAS Campus Dr, Cary, NC 27513.

References

[1] Fischer A. Pressure algometry over normal muscles: standard values,

validity, and reproducibility reproducibility Lab medicine  The degree of agreement among repeated measurements of a particular parameter, presented in terms of a standard deviation or coefficient of variation of the results in a set of measurements  of pressure threshold. Pain.

1987;30:115-126.

[2] Wylie L, Baxter GD, Walsh DM, Robinson L. The hypoalgesic effects

of low-intensity infrared An invisible band of radiation at the lower end of the visible light spectrum. With wavelengths from 750 nm to 1 mm, infrared starts at the end of the microwave spectrum and ends at the beginning of visible light.  laser therapy upon mechanical pain

threshold. Lasers Surg Med Suppl. 1995;17:9.

[3] Jones DA, Newham DJ, Clarkson PM. Skeletal skeletal /skel·e·tal/ (skel´e-t'l) pertaining to the skeleton.

skeletal

pertaining to the skeleton. See also skeletal muscle.
 muscle stiffness and

pain following eccentric exercise of the elbow flexors. Pain.

1987;30:233-242.

[4] Hasson S, Mundorf R, Barnes W, et al. Effect of pulsed ultrasound

versus placebo placebo (pləsē`bō), inert substance given instead of a potent drug. Placebo medications are sometimes prescribed when a drug is not really needed or when one would not be appropriate because they make patients feel well taken care of.  on muscle soreness perception and muscular muscular /mus·cu·lar/ (mus´ku-lar)
1. pertaining to or composing muscle.

2. having a well-developed musculature.


mus·cu·lar
adj.
1.


performance. Scand J Rehabil Med. 1990;22:199-205.

[5] Newham DJ, Mills KR, Quigley BM, Edwards RHT RHT Reinforced Heel and Toe (stockings)
RHT Richtig Hartes Training
RHT Atlantic Sharpnose Shark (FAO fish species code)
RHT Retractable Hard Top (convertible autos) 
. Pain and fatigue

after concentric Coming from the center, or circles within circles. For example, tracks on a hard disk are concentric. Tracks on optical media are concentric or spiral shaped (in a coil) depending on the type.  and eccentric muscle contractions Noun 1. muscle contraction - (physiology) a shortening or tensing of a part or organ (especially of a muscle or muscle fiber)
contraction, muscular contraction

shortening - act of decreasing in length; "the dress needs shortening"
. Clin Sci.

1983;64:55-62.

[6] Stauber WT, Clarkson PM, Fritz fritz  
n. Informal
A condition in which something does not work properly: Our television is on the fritz.



[Perhaps from German Fritz
 VK, Evans WJ. Extracellular matrix extracellular matrix (eksˈ·tr·selˑ·y

disruption and pain after eccentric muscle action. J Appl Physiol.

1990;69:868-874.

[7] MacIntyre DL, Reid WD, McKenzie DC. Delayed muscle soreness: the

inflammatory response to muscle injury and its clinical implications.

Sports Med. 1995;20:24-40.

[8] Hasson S, Wible C, Reich M, et al. Therapeutic effect of

iontophoretically delivered dexamethasone on delayed muscle soreness. Phys Ther.

1989;69:389. Abstract.

[9] Jensen K, Andersen HO, Olesen J, Lindblom U. Pressure-pain

threshold in human temporal Having to do with time. Contrast with "spatial," which deals with space.  region: evaluation of a new pressure

algometer. Pain. 1986;25:313-323.

[10] Vatine J, Shapira SC, Magora F, et al. Electronic pressure algometry

of deep pain in healthy volunteers. Arch Phys Med Rehabil.

1993;74:526-530.

[11] Brennum J, Kjeldsen M, Jensen K, Jensen T. Measurements of

human pressure-pain thresholds on fingers and toes Fingers and Toes
See also anatomy; body, human; hands.

adactyly

a birth defect in which one or more fingers or toes are missing.

dactyl

a digit; a finger or toe. See also measurement.
. Pain.

1989:38:211-217.

[12] Kosek E, Ekholm J, Nordemar R. A comparison of pressure-pain

thresholds in different tissues and body regions. Scand J Rehabil Med.

1993;25:117-124.

[13] Ohrbach R, Gale EN. Pressure-pain thresholds in normal muscles:

reliability, measurement effects, and topographic topographic

describing or pertaining to special regions.
 differences. Pain.

1989;37:257-263.

[14] Merskey H, Spear FG. The reliability of the pressure algometer.

British Journal of Social and Clinical Psychology. 1964;3:130-136.

[15] Delaney GA, McKee AC. Inter- and intra-rater reliability of the

pressure threshold meter in measurement of myofascial trigger point myofascial trigger point Internal medicine A self-sustaining hyperirritative focus that may occur in any skeletal muscle after strain produced by acute or chronic overload; MTPs produce a referred pain pattern characteristic for that individual muscle; each pattern

sensitivity. Am J Phys Med Rehabil. 1993;72:136-139.

[16] Shrout P, Fleiss JL. Intraclass correlations: uses in assessing rater rat·er  
n.
1. One that rates, especially one that establishes a rating.

2. One having an indicated rank or rating. Often used in combination: a third-rater; a first-rater. 


reliability. Psychol Bull. 1979;86:420-428.

[17] Bland JM, Altman DG. Statistical methods for assessing agreement

between two methods of clinical measurement. Lancet lancet /lan·cet/ (lan´set) a small, pointed, two-edged surgical knife.

lan·cet
n.
. February 8,

1986:307-310.
COPYRIGHT 1998 American Physical Therapy Association, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 1998, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Downes, Laurie
Publication:Physical Therapy
Date:Feb 1, 1998
Words:6074
Previous Article:Age-related differences in movement patterns used by toddlers to rise from a supine position to erect stance.
Next Article:Investigation of the effects of a model of physical therapy on mother-child interactions and the motor behaviors of children with motor delay.
Topics:



Related Articles
Generalizability of grip strength measurements in patients with tennis elbow.
Intertrial reliability of work measurements recorded during concentric isokinetic knee extension and flexion in subjects with and without meniscal...
Obtaining reliable measurements of knee extensor torque produced during maximal voluntary contractions: an experimental investigation. (includes...
Facioscapulohumeral dystrophy natural history study: standardization of testing procedures and reliability of measurements.
Reliability of pain and stiffness assessments in clinical manual lumbar spine examination. (includes commentary and author response)
Effectiveness of a Home Program of Ischemic Pressure Followed by Sustained Stretch for Treatment of Myofascial Trigger Points.
Hypoalgesic and sympathoexcitatory effects of mobilization with movement for lateral epicondylalgia. (Research Report).
An electromyographic analysis of the deep cervical flexor muscles in performance of craniocervical flexion.(Research Report)
A new method of isometric dynamometry for the craniocervical flexor muscles.
A comparison of pressure pain detection thresholds in people with chronic low back pain and volunteers without pain.(physical examination)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles