Printer Friendly
The Free Library
14,495,747 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Use of the standard error as a reliability index of interest: an applied example using elbow flexor strength data.


A review of the physical examination literature reveals a plethora plethora /pleth·o·ra/ (pleth´ah-rah)
1. an excess of blood.

2. by extension, a red florid complexion.pletho´ric


pleth·o·ra
n.
1.
 of competing tests for most components of the assessment.[1-5] For example, numerous devices, methods, and protocols exist for assessing range of motion, muscle strength, and joint stability.[1-5] The goal of many clinical measurement studies has been to aid in selecting the best measure for a specific purpose. One type of measurement study reported frequently is the reliability study, and numerous design choices are available to investigators who are interested in conducting such studies. Several designs reported often in the physical therapy literature include the simple replication design and the interrater design (equivalent to the intertrial or inter-occasion designs),(*) where raters, trials, or occasions have been considered to be either fixed or random.[6,7] A simple replication study replication study Internal medicine A clinical study that seeks to verify data from a prior study  exists when multiple measurements are taken on a sample of subjects and there is no structure linking the replicate rep·li·cate
v.
1. To duplicate, copy, reproduce, or repeat.

2. To reproduce or make an exact copy or copies of genetic material, a cell, or an organism.

n.
A repetition of an experiment or a procedure.
 measures to each other. Different raters assess different subjects, and there is no need for the number of replicate measurements to be the same for each subject. Presuming pre·sum·ing  
adj.
Having or showing excessive and arrogant self-confidence; presumptuous.



pre·suming·ly adv.
 that raters are randomly allocated to subjects, within an analysis-of-variance (ANOVA anova

see analysis of variance.

ANOVA Analysis of variance, see there
) context, this represents a one-way random effects model In statistics, a random effect(s) model, also called a variance components model is a kind of hierarchical linear model. It assumes that the data describe a hierarchy of different populations whose differences are constrained by the hierarchy. .

A potential limitation of this design is that all components of error are included in a single error term. An equally popular and slightly more complex design is the interrater reliability study, where all raters evaluate all subjects (de, raters are crossed with subjects). Data from this design can be analyzed an·a·lyze  
tr.v. an·a·lyzed, an·a·lyz·ing, an·a·lyz·es
1. To examine methodically by separating into parts and studying their interrelations.

2. Chemistry To make a chemical analysis of.

3.
 using a two-way ANOVA.[6,7] This approach allows partitioning To divide a resource or application into smaller pieces. See partition, application partitioning and PDQ.  of the variance terms into patients, raters, and error. With this design, the investigator must determine whether inferences are to be unique to only those raters taking part in the study or whether the raters are intended to represent a sample from a larger population of raters. In the first case, the rater rat·er  
n.
1. One that rates, especially one that establishes a rating.

2. One having an indicated rank or rating. Often used in combination: a third-rater; a first-rater. 
 effect is considered to be fixed, and in the second case, the rater effect is random.

Results from reliability studies, where the measure is quantified on an interval or ratio scale, are usually expressed by reporting the intraclass correlation In statistics, the intraclass correlation (or the intraclass correlation coefficient[1]) is a measure of correlation, consistency or conformity for a data set when it has multiple groups.  coefficient (ICC ICC

See: International Chamber of Commerce
) or the standard error of measurement (SEM).[1-5,8,9] The ICC provides information about a measure's ability to differentiate among patients, and it is obtained by dividing the true variance In statistics, the term true variance is often used to refer to the unobservable variance of a whole finite population, as distinguished from an observable statistic based on a sample.  by the total variance.[10,11] When patients are the object of measurement, the true variance represents the variance among patients. The SEM expresses measurement error in the same units as the original measurement, and it is not influenced by variability among patients.[10,11]

Relationship Between the SEM and the ICC

The relationship between the SEM and ICC is defined by the following expression:

[MATHEMATICAL EXPRESSION A group of characters or symbols representing a quantity or an operation. See arithmetic expression.  NOT REPRODUCIBLE IN ASCII ASCII or American Standard Code for Information Interchange, a set of codes used to represent letters, numbers, a few symbols, and control characters. Originally designed for teletype operations, it has found wide application in computers. ]

where [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] represents the total variance and [Rho] is the ICC. Using a bit of algebra algebra, branch of mathematics concerned with operations on sets of numbers or other elements that are often represented by symbols. Algebra is a generalization of arithmetic and gains much of its power from dealing symbolically with elements and operations (such as , it can be shown that:

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

where [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is the error variance and equals the mean square error term from an ANOVA. Although the SEM and ICC are related, they do not convey the same information.[12] This point is highlighted in the results presented by Hayes and colleagues,[8] who studied the test-retest reliability test-retest reliability Psychology A measure of the ability of a psychologic testing instrument to yield the same result for a single Pt at 2 different test periods, which are closely spaced so that any variation detected reflects reliability of the instrument  of measurements of knee joint range of motion in patients with osteoarthritis osteoarthritis
 or osteoarthrosis or degenerative joint disease

Most common joint disorder, afflicting over 80% of those who reach age 70. It does not involve excessive inflammation and may have no symptoms, especially at first.
. Tables 1 and 2 present the ANOVA results for extension and flexion flexion /flex·ion/ (flek´shun) the act of bending or the condition of being bent.

flex·ion
n.
1. The act of bending a joint or limb in the body by the action of flexors.

2.
 measurements, respectively. The ICCs for extension and flexion are .71 and .96, respectively, and the SEMs for extension and flexion are 3.23 and 5.03 degrees, respectively. Notice that the ICC shows the flexion measurements to be more reliable, whereas the SEM indicates that the extension measurements are more reliable. The interpretation of this paradox is that, within the context of Hayes and colleagues' study design, knee flexion measurements are more useful for discriminating dis·crim·i·nat·ing  
adj.
1.
a. Able to recognize or draw fine distinctions; perceptive.

b. Showing careful judgment or fine taste:
 among patients, whereas the magnitude of error (expressed in degrees) is less for knee extension measurements. A second important point is that both the ICC and SEM represent point estimates of the population values for these variables. Although considerable attention has been provided to reporting ICCs (eg, stating the type and confidence interval confidence interval,
n a statistical device used to determine the range within which an acceptable datum would fall. Confidence intervals are usually expressed in percentages, typically 95% or 99%.
 are usual requirements),[13,14] the same thoughtfulness has not been afforded to reporting SEMs. The goals of this article are to illustrate a process for obtaining an interval estimate of the SEM, to show how this process can be used to estimate sample size, and to illustrate how error estimates can be compared statistically. To illustrate these analyses, we use actual strength assessment data.
Table 1.
Analysis-of-Variance Results for Knee Extension Range of Motion(a)

                                        Variance
Source           df   SS        MS      Estimate

Between people   21   1264.91   60.23   24.91
Within people    22   229.00    10.41   10.41
Total            43   1493.91           35.32




(a) Adapted from the work of Hayes et al[8] (Tab. 6, Examiner 1).
Table 2.
Analysis-of-Variance Results for Knee Flexion Range of Motion(a)

                                               Variance
Source            df    SS          MS         Estimate

Between people    21    28219.73    1343.80     659.11
Within people     22    563.00        25.59      25.29
Total             43    28782.73                684.40




(a)Adapted from the work of Hayes et al[8] (Tab. 7, Examiner 1).

Example Study

The data represent make and break tests performed on the right elbow flexors of 30 female volunteers. Two make tests and two break tests were performed on all subjects at the same test session by the same examiner. The order of testing methods was balanced across subjects. The data are shown in the Table 3, and summary ANOVA tables for the tests are presented in Tables 4 and 5. The error variances are equal to the mean square error (MSE MSE Mouse (computer)
MSE Materials Science & Engineering
MSE Mean Squared Error
MSE Mean Square Error
MSE Master of Science in Engineering
MSE Manufacturing Systems Engineering
MSE Mechanically Stabilized Earth
) terms, and the SEMs are the square roots of these terms. Accordingly, the error variance is 0.76 (SEM=0.87 kgf) for the make test and 1.51 (SEM=1.23 kgf) for the break test. The difference scores for each method, shown in Table 3 and reproduced in Table 6, were obtained by subtracting the second trial score from the first trial score (eg, trial 1-trial 2). The correlation between the difference scores for the make and break tests is 0.21, suggesting a modest degree of dependency.
Table 3.
Summary Data for the Make and Break Tests (in Kilograms of Force)

         Make Test      Break Test

Subject                   Differ-                     Difference
No.        1      2       ence       1        2

1       19.1      16.2    2.9        25.4     22.3    3.1
2       18.2      18.3   -0.1        17.7     20.3   -2.6
3       19.5      19.2    0.3        19.6     18.8    0.8
4       23.7      24.5   -0.8        29.2     23.7    5.5
5       27.1      26.2    0.9        27.2     26.3    0.9
6       20.1      22.7   -2.6        23.0     23.9   -0.9
7       21.1      20.3    0.8        20.9     19.2    1.7
8       25.4      25.7   -0.3        26.2     27.6   -1.4
9       17.8      18.9-   1.1        21.5     21.9   -0.4
10      17.8      17.4    0.4        20.4     20.4    0.0
11      22.8      22.6    0.2        22.1     21.2   -1.1
12      16.7      15.2    1.5        19.5     18.3    1.2
13      20.3      19.6    0.7        20.1     21.2   -1.1
14      15.3      16.3   -1.0        19.6     16.6    3.0
15      23.0      21.8    1.2        23.5     19.3    4.2
16      14.8      15.7   -0.9        18.9     18.2    0.7
17      17.6      16.2    1.4        18.4     18.6   -0.2
18      19.1      17.2    1.9        19.2     20.3   -1.1
19      31.7      29.4    2.3        31.8     31.4    0.4
20      16.5      16.3    0.2        18.9     18.9    0.0
21      19.7      19.4    0.3        21.5     20.5    1.0
22      23.8      22.7    1.1        22.7     21.2    1.5
23      22.8      23.8   -1.0        23.1     22.8    0.3
24      19.3      18.1    1.21       21.4     20.5    0.9
25      15.1      15.2   -0.1        17.7     19.2   -1.5
26      16.2      16.8   -0.6        19.8     18.8    1.0
27      17.7      17.9   -0.2        17.5     16.9    0.6
28      16.7      17.6   -0.9        19.7     19.1    0.6
29      16.2      16.3   -0.1        15.8     15.0    0.8

30      16.4      18.5   -2.1        19.2     20.8   -1.6
X       19.72     19.53   0.18       21.38    20.83   0.56
SD      3.91       3.68   1.23        3.61     3.35   1.74
Table 4.
Analysis-of-Variance Results for Make Test

                                            Variance
Source             df     SS        MS      Estimate

Between subjects   29     814.31    28.08   13.66
Trials              1       0.50     0.50    0.00
Within subjects    29      22.06     0.76    0.76
Total              59     836.87            14.42
Table 5.
Analysis-of-Variance Results far Break Test

                                           Variance
Source              df   SS       MS       Estimate

Between subjects    29   659.22   22.731   0.61
Trials               1     4.65    4.65    0.10
Error               29    43.80    1.51    1.51
Total               59   707.67           12.12
Table 6.
Difference Scores for Make and Break Tests

Subject    Make Test    Break Test
No.        Difference   Difference      Sum     Difference
1           2.9          3.1            6.0     -0.2
2          -0.1         -2.6           -2.7      2.5
3           0.3          0.8            1.1     -0.5
4          -0.8          5.5            4.7     -6.3
5           0.9          0.9            1.8      0.0
6          -2.6         -0.9           -3.5     -1.7
7           0.8          1.7            2.5     -0.9
8          -0.3         -1.4           -1.7      1.1
9          -1.1         -0.4           -1.5     -0.7
10          0.4          0.0            0.4      0.4
11          0.2         -1.1           -0.9      1.3
12          1.5          1.2            2.7      0.3
13          0.7         -1.1           -0.4      1.8
14         -1.0          3.0            2.0     -4.0
15          1.2          4.2            5.4     -3.0
16         -0.9          0.7           -0.2     -1.6
17          1.4         -0.2            1.2      1.6
18          1.9         -1.1            0.8      3.0
19          2.3          0.4            2.7      1.9
20          0.2          0.0            0.2      0.2
21          0.3          1.0            1.3     -0.7
22          1.1          1.5            2.6     -0.4
23         -1.0          0.3           -0.7     -1.3
24          1.2          0.9            2.1      0.3
25         -0.1         -1.5           -1.6      1.4
26         -0.6          1.0            0.4     -1.6
27         -0.2          0.6            0.4     -0.8
28         -0.9          0.6           -0.3     -1.5
29         -0.1          0.8            0.7     -0.9
30         -2.1         -1.6           -3.7     -0.5




Estimating a Confidence Interval for the Standard Error of Measurement

Variances of independent samples drawn randomly from a population with a normal distribution have a chi-square distribution chi-square distribution

in statistical terms this is said of a variable with K degrees of freedom if it is distributed like the sum of the squares of K independent random variables each of which has a normal distribution with mean zero and variance of 1.
.[15] Specifically, the relationship is defined as

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

where [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] is the chi-square distribution on n-1 degrees of freedom, [s.sup.2] is the sample variance, [Sigma.sup.2] is the population variance, and the symbol ~ represents [Sigma.sup.2] is distributed as." Using the approach outlined by Armitage and Berry,(16)(PP221-223) an illustration of a confidence interval calculation based on the sample error variance for the make test, 0.76 from Table 4, is shown. Specifically, a two-sided 95VO confidence interval for the error variance associated with knee extensor extensor /ex·ten·sor/ (-ser) [L.]
1. causing extension.

2. a muscle that extends a joint.


ex·ten·sor
n.
A muscle that extends or straightens a limb or body part.
 measurements is defined as

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

where SSE (1) An earlier full-screen editor in OS/2.

(2) (Streaming SIMD Extensions) A series of additional instructions built into Pentium CPU chips for improved multimedia performance by performing mathematical operations on multiple sets of data at the
 is the sum of squares error (22.06) in the ANOVA table, [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] represents the chi-square value for the probability level CY, and dfe is the degrees of freedom associated with SSE (ie, 29). Taking the square root of the confidence interval values for [Sigma.sup.2] yields a 95% confidence interval on the SEM of 0.69 to 1.17 degrees.

Sample Size Estimation

This section illustrates a method for determining sample size when an investigator is interested in designing a study to estimate the magnitude of the SEM. Consider the following scenario. A researcher is interested in conducting a study to determine the reliability of make tests for the elbow flexors in patients with osteoarthritis. Based on a pilot study on subjects with no known history of osteoarthritis, the researcher expects the subsequent investigation using an enhanced protocol on patients with osteoarthritis to yield a SEM of 0.71 kgf; however, the researcher believes that a SEM as high as 0.95 kgf would be acceptable. Rephrasing re·phrase  
tr.v. re·phrased, re·phras·ing, re·phras·es
To phrase again, especially to state in a new, clearer, or different way.

Noun 1.
 this last sentence yields the following hypotheses:

Null hypothesis null hypothesis,
n theoretical assumption that a given therapy will have results not statistically different from another treatment.

null hypothesis,
n
: The measurement protocol of interest in

patients with osteoarthritis is reliable

(SEM [is greater than or equal to] 0.95 kgf.

Alternate hypothesis The alternate hypothesis (or maintained hypothesis or research hypothesis) and the null hypothesis are the two rival hypotheses whose likelihoods are compared by a statistical hypothesis test. : The measurement protocol of interest

in patients with osteoarthritis is not reliable (SEM [is greater than] 0.95

kgf).

Moreover, the researcher wants to evaluate these hypotheses at a one-sided 100(1-[Alpha])% confidence level of 95% when the sample SEM is 0.71. The first step is to square the SEM values to produce variances. Thus, the researcher expects to find an error variance of 0.50 (ie, 0.712) with an upper acceptable variance limit of 0.90 (de, 0.952). Once again, we make use of the relationship

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

where [s.sup.2] is the expected study variance and [Sigma.sup.2] is the maximal max·i·mal
adj.
1. Of, relating to, or consisting of a maximum.

2. Being the greatest or highest possible.
 acceptable value for the population variance (ie, the upper 95% confidence limit is set to this value).

Using the approach outlined by Armitage and Berry,(16)(PP221-223) n-1 is replaced by the degrees of freedom associated with the error variance (dfe) and this term is isolated

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

Using an iterative it·er·a·tive  
adj.
1. Characterized by or involving repetition, recurrence, reiteration, or repetitiousness.

2. Grammar Frequentative.

Noun 1.
 process, it is found that a chi-square value of 11.59 on 21 degrees of freedom produces the desired one-sided confidence level. Thus, 22 patients are required, provided two measurements are performed on each patient. Often, an investigator may want to obtain more than two measurements on each patient, and this will influence the number of patients required for the study. Table 7 supplies sample size estimates for various combinations of [Sigma.sup.2]/[s.sup.2] ratios and number of measurements per patient. For the example mentioned previously, [Sigma.sup.2]/[s.sup.2] is equal to 1.8 (0.90/0.50). By referring to Tables 4 and 5, for a [Sigma.sup.2]/[s.sup.2] ratio of 1.8, it is evident that 22 patients are required when two measurements are performed on each patient, 12 patients are needed when three measurements are performed on each patient, and so on.

Table 7. Approximate Sample Sizes for o One-Sided Upper 95% Confidence Limit for Population Variance
No of           Ratio of Population Variance to
Measurements    Sample Variance
per Subject     1.3   1.4   1.5   1.6   1.7   1.8   1.9   2.0

2               91    59    42    33    27    22    20    17
3               46    30    22    17    14    12    10     9
4               31    20    15    11    9      8     7     6
5               24    16    11     9    7      6     6     5
6               19    12     9     7    6      5     4     4




Comparing Error Estimates

Researchers and clinicians are constantly in search of strategies, techniques, and protocols that minimize measurement error. In pursuing this goal, it is necessary to compare the methods or measures of interest by estimating the error associated with the competing measures. Estimates of measurement error for the competing measures can be obtained on different samples or on the same sample. In the case where one group of subjects is used to estimate the error associated with method A and a second group of subjects is used to approximate the error for method B, the measurements are usually considered to be independent. When both methods are evaluated on the same group of subjects, however, the measurements are likely to be dependent. The intent of the following section is to show several statistical approaches for comparing error variances, and ultimately their square roots, the SEMs. The data for the make and break tests will be used to demonstrate the analyses.

Independent Samples: Variance Ratio Approach

Variance estimates from two independent samples can be compared using the ratio of the variance estimates.[16(PP115-117)] Variance ratios are distributed as an F distribution. Specifically, the larger variance is divided by the smaller variance, and this value is converted to a probability by referring to the summary table of F values for the appropriate degrees of freedom.

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

[F.sub.(29,29)] = 1.51/0.76

[F.sub.(29,29)] = 1.98

P = .035

This analysis suggests that the break test variance is significantly greater than the make test variance.

Dependent Samples: Paired Approach

This approach is discussed by Armitage and Berry[16] and Snedecor and Cochran.[17] It is appropriate when pairs of measurements are being compared. To use this technique to compare within-subject error variances, we have added an intermediate step. This step requires calculation of the difference between trials (ie, trial 1-trial 2) for each of the assessment methods. Thus, we are actually testing whether the variance of the difference scores for the two assessment methods (ie, make test difference and break test difference) differs, rather than whether the error variance associated with a single measurement differs. Given that the variance of a difference score is equal to the variance of a single measure times two, the statistical test provides valid information about the extent to which the error variances, and ultimately the SEMs of the make and break tests, differ. Table 6 provides the information necessary to formally evaluate the variances. The first step involves calculating the sum (make test+break test) and difference (make test--break test) for the difference scores shown in Table 6. The variances are compared by calculating the correlation between the sum and difference scores and testing whether the magnitude of the correlation coefficient Correlation Coefficient

A measure that determines the degree to which two variable's movements are associated.

The correlation coefficient is calculated as:
 differs from zero. In our example, the correlation between the sum and difference scores is 0.34 (F=3.58; df=1,28; P-.069), suggesting that there is a marginal difference associated with the variances of the make and break tests.

Dependent Samples: Bootstrap See boot.

(operating system, compiler) bootstrap - To load and initialise the operating system on a computer. Normally abbreviated to "boot". From the curious expression "to pull oneself up by one's bootstraps", one of the legendary feats of Baron von Munchhausen.
 Approach

The paired method mentioned above is restricted to two observations (trials or judges) per subject for each measure being assessed. An alternate approach that avoids this shortcoming short·com·ing  
n.
A deficiency; a flaw.


shortcoming
Noun

a fault or weakness

Noun 1.
 is the bootstrap method. Although reviewing this technique in detail is beyond the scope of this article, a brief explanation of its essential features will be offered. Specifically, the bootstrap method uses the sample data to estimate the distribution of the parent population.[17-19] This estimation is accomplished by drawing a relatively large number of random draws with replacement.[17-19] The random draws are used to construct the estimated population distribution, and the confidence interval of interest can be determined.

This approach was used to estimate the distribution of the difference in make and break test error variances. Specifically, 1,000 bootstrap samples, each with a sample size of 29 drawn with replacement, were obtained. This was accomplished by writing a macro program using MINITAB statistical software.[dagger] The one-tailed probability associated with the observed error variance difference of 0.75 (de, 1.51 - 0.76 from Tabs. 4 and 5) was P=.053. Again, the magnitude of this value suggests that the error variance for the break test may be marginally greater than for the make test.

Summary

This article shows that although the ICC and SEM are related, they convey different information about the reliability of a measure. Specifically, the ICC is influenced by multiple sources of variation (eg, subjects, raters, trials) and by error, whereas the SEM is affected by error variation only. A method for estimating a confidence interval for the SEM is illustrated, and how an a priori a priori

In epistemology, knowledge that is independent of all particular experiences, as opposed to a posteriori (or empirical) knowledge, which derives from experience.
 specification of confidence interval width can be used to estimate sample size is discussed. Finally, several approaches used to compare error variances are discussed.

(*) The repeated-measures factor could also represent trials or occasions; however, we will sue the word "raters" in the text.

[dagger] MINITAB Inc, 3081 Enterprise Dr, State College, PA 16801-3008.

References

[1] Bohannon RW. Make tests and break tests of elbow flexor flexor /flex·or/ (flek´ser)
1. causing flexion.

2. a muscle that flexes a joint.


flexor retina´culum  see entries under retinaculum.
 muscle strength. Phys Ther. 1988;68:193-194.

[2] Malouin F, Boiteau M, Bonneau C, et al. Use of a hand-held dynamometer dynamometer /dy·na·mom·e·ter/ (di?nah-mom´e-ter) an instrument for measuring the force of muscular contraction.

dy·na·mom·e·ter
n.
An instrument for measuring the degree of muscular power.
 for the evaluation of spasticity spasticity /spas·tic·i·ty/ (spas-tis´i-te) the state of being spastic; see spastic (2).

spas·tic·i·ty
n.
1. A spastic state or condition.

2. Spastic paralysis.
 in a clinical setting: a reliability study. Physiotherapy physiotherapy: see physical therapy.  Canada. 1989;41:126-134.

[3] Helewa A, Goldsmith CH, Smythe A. The modified sphygmomanometer--an instrument to measure muscle strength: a validation study. J Chronic Dis. 1981;34:353-361.

[4] Stratford PW, Miseferi D, Ogilvie R, et al. Assessing the responsiveness of five KT1000 knee arthrometer measures used to evaluate anterior anterior /an·te·ri·or/ (an-ter´e-or) situated at or directed toward the front; opposite of posterior.

an·te·ri·or
adj.
1. Placed before or in front.

2.
 laxity laxity /lax·i·ty/ (lak´si-te)
1. slackness or looseness; a lack of tautness, firmness, or rigidity.

2. slackness or displacement in the motion of a joint.lax´


laxity

looseness.
 at the knee joint. Clin J Sports Med. 1991;1:225-228.

[5] LaStayo PC, Wheeler DL. Reliability of passive wrist flexion and extension goniometric go·ni·om·e·ter  
n.
1. An optical instrument for measuring crystal angles, as between crystal faces.

2. A radio receiver and directional antenna used as a system to determine the angular direction of incoming radio signals.
 measurements: a multicenter study. Phys Ther. 1994;74:162-176.

[6] Fleiss JL. The Design and Analysis of Clinical Expenments. New York New York, state, United States
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of
, NY: John Wiley John Wiley may refer to:
  • John Wiley & Sons, publishing company
  • John C. Wiley, American ambassador
  • John D. Wiley, Chancellor of the University of Wisconsin-Madison
  • John M. Wiley (1846–1912), U.S.
 & Sons Inc; 1986:1-32, 291-300.

[7] Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol BulL 1979;86:420-428.

[8] Hayes KW, Peterson C, Falconer Falconer

prison where former professor Farragut, who had killed his brother, witnesses the torments and chaos of the penal system. [Am. Lit.: Cheever Falconer in Weiss, 151]

See : Imprisonment
 J. An examination of Cyriax's passive motion tests with patients having osteoarthritis of the knee. Phys Ther. 1994;74:697-709.

[9] Eliasziw M, Young SL, Woodbury MG, Fryday-Field K Statistical methodology for the concurrent assessment of interrater and intrarater reliability: using goniometric measurements as an example. Phys Ther. 1994;74:777-788.

[10] Streiner DL, Norman GR. Health Measurement Scales: A Practical Guide to Their Development and Use. New York, NY: Oxford University Press; 1989:79-89.

[11] Domholdt E. Physical Therapy Research: Principles and Applications. Philadelphia, Pa: WB Saunders Co; 1993:153-157.

[12] Stratford PW. Reliability: consistency or differentiating among subjects? Phys Ther. 1989;69:299-300. Letter to the editor.

[13] Krebs DE. Declare your ICC type. Phys Ther. 1986;66:1431. Letter to the editor.

[14] Stratford PW. Confidence limits for your ICC. Phys Ther. 1989;69: 237-238. Letter to the editor.

[15] Kleinbaum DG, Kupper LL, Muller Mul·ler , Hermann Joseph 1890-1967.

American geneticist. He won a 1946 Nobel Prize for the study of the hereditary effect of x-rays on genes.



Mül·ler , Johannes Peter 1801-1858.
 KE. Applied Regression Analysis In statistics, a mathematical method of modeling the relationships among three or more variables. It is used to predict the value of one variable given the values of the others. For example, a model might estimate sales based on age and gender.  and Other Multivariable Methods. 2nd ed. Boston, Mass: PWS-Kent Publishing Co; 1988:23.

[16] Armitage P, Berry G. Statistical Methods in Medical Research. 3rd ed. Boston, Mass: Blackwell Scientific Publications; 1994:115-117, 221-223.

[17] Snedecor GW, Cochran WG. Statistical Methods. 6th ed. Ames, Iowa Ames is a city located in the central part of the U.S. state of Iowa, about 30 miles north of Des Moines in Story County. It is the principal city of the 'Ames, Iowa Metropolitan Statistical Area' which encompasses all of Story County, Iowa and which, when combined with the : Iowa State University Academics
ISU is best known for its degree programs in science, engineering, and agriculture. ISU is also home of the world's first electronic digital computing device, the Atanasoff–Berry Computer.
 Press; 1967:195-197.

[18] Stine RA. An introduction to bootstrap methods: examples and ideas. Sociological Methods Research. 1989;18:243-291.

[19] Efron B, Gong G. A leisurely look at the bootstrap, the jackknife jack·knife  
n.
1. A large clasp knife.

2. Sports A dive in the pike position, in which the diver straightens out to enter the water hands first.

v.
, and cross-validation. American Statistician. 1983;37:36-48.

PW Stratford, PT, is Assistant Professor, School of Rehabilitation rehabilitation: see physical therapy.  Science, and Associate Member, Department of Clinical Epidemiology epidemiology, field of medicine concerned with the study of epidemics, outbreaks of disease that affect large numbers of people. Epidemiologists, using sophisticated statistical analyses, field investigations, and complex laboratory techniques, investigate the cause  and Biostatistics biostatistics /bio·sta·tis·tics/ (-stah-tis´tiks) biometry.

bi·o·sta·tis·tics
n.
The science of statistics applied to the analysis of biological or medical data.
, McMaster University McMaster University, at Hamilton, Ont., Canada; nondenominational; founded 1887. It has faculties of humanities, science, social sciences, business, engineering, and health sciences, as well as a school of graduate studies and a divinity college. , Hamilton, Ontario, Canada. Address all correspondence to Mr Stratford at Faculty of Health Sciences, School of Rehabilitation Science, McMaster University, OT/PT OT/PT Occupational/Physical Therapy (medical)  Bldg T-16, 1280 Main St W, Hamilton, Ontario, Canada LOS 4K1 (stratfor@mcmaster.ca).

CH Goldsmith, PhD, is Professor of Clinical Epidemiology and Biostatistics, McMaster University, and Honorary Professor of Physical Therapy, Department of Physical Therapy, University of Western Ontario Western is one of Canada's leading universities, ranked #1 in the Globe and Mail University Report Card 2005 for overall quality of education.[2] It ranked #3 among medical-doctoral level universities according to Maclean's Magazine 2005 University Rankings. , London, Ontario, Canada.

This article was submitted June 2, 1996, and was accepted January 7, 1997.
COPYRIGHT 1997 American Physical Therapy Association, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 1997, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Goldsmith, Charlie H.
Publication:Physical Therapy
Date:Jul 1, 1997
Words:3983
Previous Article:Patient sexual behaviors and sexual harassment: a national survey of physical therapists.
Next Article:Multidimensional assessment of motor function in a child with cerebral palsy following intrathecal administration of baclofen.
Topics:



Related Articles
Effects of upper limb immobilization on isometric muscle strength, movement time, and triphasic electromyographic characteristics.
Low-load, prolonged stretch in treatment of elbow flexion contractures secondary to head trauma: a case report.
The application of generalizability theory to reliability assessment: an illustration using isometric force measurements. (includes commentaries and...
Effects of exercise on hip range of motion, trunk muscle performance, and gait economy.
Reliability of passive wrist flexion and extension goniometric measurements: a multicenter study. (includes commentary and author response)
The relationships among knee extensor torques produced during maximal voluntary contractions under various test conditions.
Statistical methodology for the concurrent assessment of interrater and intrarater reliability: using goniometric measurements as an example.
Relationship of plantar-flexor peak torque and dorsiflexion range of motion to kinetic variables during walking.
Determining consistency of elbow joint threshold angle in elbow flexor muscles with spastic hypertonia.
Passive Ankle Stiffness in Subjects With Diabetes and Peripheral Neuropathy Versus an Age-Matched Comparison Group.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles