Berg balance scale: intrarater test-retest reliability among older people dependent in activities of daily living and living in residential care facilities.The maintenance of balance function is essential to stay physically active in life. (1) Due to aging processes
1. A reduction in a company's stated capital. 2. The total capital that is less than the par value of the company's capital stock. Notes: 1. This is usually reduced because of poorly estimated losses or gains. 2. can lead to dramatic consequences such as dependency in activities of daily living (ADL), admission to nursing homes, and fails and fractures. (2-4) Training of static and dynamic balance control, therefore, is of great importance in rehabilitation rehabilitation: see physical therapy. for older people, and there is a need for valid and reliable instruments for evaluating the effects of treatment. (5) It is crucial for the clinician clinician /cli·ni·cian/ (kli-nish´in) an expert clinical physician and teacher. cli·ni·cian n. to know whether a change in scores on functional tests is due to a real change in functioning or to measurement error. (5) Reliability (ie, when repeated measurements of an individual's performance are consistent from one time to another) (5,6) has been described as relative or absolute. (5) Relative reliability examines the relationship between 2 or more measurements and the consistency of an individual's position within the group. Absolute reliability examines variability in scores in repeated measurements. The intraclass correlation In statistics, the intraclass correlation (or the intraclass correlation coefficient[1]) is a measure of correlation, consistency or conformity for a data set when it has multiple groups. coefficient (ICC ICC See: International Chamber of Commerce ) is commonly used to evaluate relative reliability. (7) However, the ICC value is of limited use to the clinician because it is not related to the actual scale of measurement and is dependent on the range of the individuals' performances. (7) If the individuals' range of scores is low, the ICC value often will show poor reliability, and vice versa VICE VERSA. On the contrary; on opposite sides. . (7,8) This means that the clinician cannot be sure whether a high ICC value for an instrument actually means low variability at the individual level. A more appropriate way of investigating the reliability of an instrument intended for use in a clinical setting seems to be to examine absolute reliability. (9) When using absolute reliability, the assessor receives information about how much variability caused by measurement error can be expected in scores for an individual. (5) In 1996, Bland and Altman (9) presented a way of calculating absolute reliability, which they referred to as the "repeatability." Other methods also are available for estimating absolute reliability; however, it appears that the same calculations are used but the outcome is given different names, thereby creating confusion. The equation used by Bland and Altman also has been referred to as the "smallest detectable difference" (SDD (Software Design Description) The architecture of an information system. See IDD. ), (10) the "minimum detectable change" (MDC (1) (Mobile Daughter Card) See riser card. (2) See Meta Data Coalition. ), (11) and the "smallest real difference" (SRD SRD Suriname Dollar (ISO currency code) SRD Sustainable Resource Development (Alberta, Canada) SRD Short Range Devices (wireless networking) SRD System Reference Document ). (12) Reliability can be tested either by having 2 different observers independently assessing the same individual (interrater reliability) or by having the same assessor performing the assessments (intrarater reliability). (5,13) When investigating variability within the assessed individual, intrarater reliability seems preferable because the variability from the assessor is minimized. (5) The Berg Balance Scale (BBS (1) (Bulletin Board System) A computer system used as an information source and forum for a particular interest group. They were widely used in the U.S. ) was developed to measure balance among older people with impairment in balance function by assessing the performance of functional tasks. (13-15) It is a valid instrument used for evaluation of the effectiveness of interventions and for quantitative descriptions of function in clinical practice and research. The BBS has been evaluated in several reliability studies, of which 3 were concerned with intrarater reliability among older people. (10,13,14) Two of these studies, (13,14) which evaluated relative reliability only (ICC=.97 and .99, respectively), were performed among older people who were living in the community or in a senior citizens' residence or who were inpatients and among patients with a recent stroke. To our knowledge, the absolute intrarater reliability of BBS scores has been evaluated in only one study, (10) which was performed among people with Parkinson disease Parkinson Disease Definition Parkinson disease (PD) is a progressive movement disorder marked by tremors, rigidity, slow movements (bradykinesia), and posture instability. (n=26) who were living in the community. The ICC was .87, and the absolute reliability was 2.8 BBS points (95% confidence level). Thus, there is lack of absolute intrarater reliability studies of the BBS among older people, and absolute reliability has never been evaluated among older people in residential care facilities. This is a group of people who are commonly seen for rehabilitation. They have a high prevalence of diseases and cognitive and physical impairments, (16) which may cause a multisystem reduction in reserve capacity. (17) This makes these older people sensitive to internal and external disturbances, such as pain or environmental stressors, and may result in markedly day-today fluctuations in function. (17) Cognitive impairment may cause difficulties in processing information and making judgments, and the reliability of assessments, therefore, may be affected due to difficulties in understanding instructions. (18,19) Thus, our hypothesis was that, because of the health status of these individuals, the variability between repeated assessments would be large. The purpose of this study was to examine the absolute and the relative intrarater test-retest reliability test-retest reliability Psychology A measure of the ability of a psychologic testing instrument to yield the same result for a single Pt at 2 different test periods, which are closely spaced so that any variation detected reflects reliability of the instrument of BBS scores among older people who are dependent in ADL and living in residential care facilities. Method Setting This study was performed in September 2004 at 3 residential care facilities in Umea, Sweden. It was part of a larger study with the purposes of evaluating reliability and validity for a number of measures of physical function and monitoring change in physical function over a period of 6 months (from March to September 2004) among older people who were dependent in ADL. All facilities comprised private apartments with access to a common dining room, alarms, and on-site nursing and care. In Sweden, a person needs a decision from the municipality MUNICIPALITY. The body of officers, taken collectively, belonging to a city, who are appointed to manage its affairs and defend its interests. to move into this kind of facility and all residents have cognitive or physical impairments and require supervision, functional support, or nursing care. These facilities seem similar to the "assisted living as·sist·ed living n. A living arrangement in which people with special needs, especially older people with disabilities, reside in a facility that provides help with everyday tasks such as bathing, dressing, and taking medication. " facilities in the United States United States, officially United States of America, republic (2005 est. pop. 295,734,000), 3,539,227 sq mi (9,166,598 sq km), North America. The United States is the world's third largest country in population and the fourth largest country in area. . One facility also comprised units for people with dementia, where they have private rooms with staff on-hand. Participants Inclusion criteria
Inclusion criteria are a set of conditions that must be met in order to participate in a clinical trial. were: aged 65 years or over, dependent on assistance in one or more personal ADL according to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. the Katz Index, (20) able to stand up from a chair with armrests with help from no more than one person, a Mini Mental State Examination (MMSE MMSE Mini Mental State Examination MMSE Minimum Mean Squared Error MMSE Mini-Mental Status Examination MMSE Multiuse Mission Support Equipment MMSE Multimission Support Equipment MMSE Multi Media Service Environment ) (21) score of 10 or higher, and approval from the resident's physician. Screening and Inclusion Process All participants underwent a screening process carried out by a physical therapist. They were given oral and written information about the study. The participants themselves, or the relatives of those with severe cognitive impairment, gave their informed oral consent. One hundred four people met the inclusion criteria. Forty-five of those individuals completed 2 testing sessions with the BBS in September and were included in this study (Fig. 1). [FIGURE 1 OMITTED] Katz ADL scores and proportion of men and women did not differ between the participants (n=45) and those who met the inclusion criteria but were not included in the study (n=59). However, age differed significantly between the 2 groups; the participants were younger than those who were not included in the study (mean age [[+ or -] SD] of 82.3 [+ or -] 6.6 years versus 86.3 [+ or -] 7.5 years, respectively; P=.006). Assessments A member of the staff who knew the participant well was interviewed about the participant's ability to manage ADL according to the Barthel Index Barthel index, n.pr standard, well-validated assessment that measures functional outcomes, including independence in mobility and self-care. Commonly used in rehabilitation medicine. , which has scores ranging from 0 to 20. (22,23) Cognitive function cognitive function Neurology Any mental process that involves symbolic operations–eg, perception, memory, creation of imagery, and thinking; CFs encompasses awareness and capacity for judgment was screened using the MMSE, (21) which has scores ranging between 0 and 30. A score of 18 to 23 indicates mild cognitive impairment mild cognitive impairment (MCI), n memory loss generally associated with aging; does not affect normal independent functioning of an individual. , and a score of [less than or equal to] 17 indicates severe cognitive impairment. (24) The Geriatric Depression Scale The Geriatric Depression Scale (GDS) is a 30-item self-report assessment used to identify depression in the elderly. Description The GDS questions are answered "yes" or "no", instead of a five-category response set. (GDS GDS Global Distribution System GDS Google Desktop Search (Google) GDS Goodie Domain Service (Vienna University of Technology, Austria) GDS Guards ) was used to measure symptoms of depression. (25) The scores on the GDS range from 0 to 15, with a score of 5 to 9 indicating mild depression and a score of 10 or higher indicating moderate to severe depression. (26) The Functional Ambulation am·bu·late intr.v. am·bu·lat·ed, am·bu·lat·ing, am·bu·lates To walk from place to place; move about. [Latin ambul Categories (FAC FAC - Functional Array Calculator. An APL-like language, but purely functional and lazy. It allows infinite arrays. ["FAC: A Functional APL Language", H.-C. Tu and A.J. Perlis, IEEE Trans Soft Eng 3(1):36-45 (Jan 1986)]. ) were used to measure walking ability with or without a walking aid at 6 levels (0-5). (23,27) A score of 3 (verbal supervision or standby help without physical contact) or less was chosen as the indicator of severe physical impairment. The Mini Nutritional Assessment nutritional assessment Oncology The profiling of a Pt's current nutritional status and risk of malnutrition and cancer cachexia. See Cachexia, Malnutrition. was used to describe nutritional status nutritional status, n the assessment of the state of nourishment of a patient or subject. . (28) The scores range from 0 to 30, with scores between 17 and 23.5 indicating risk for malnutrition malnutrition, insufficiency of one or more nutritional elements necessary for health and well-being. Primary malnutrition is caused by the lack of essential foodstuffs—usually vitamins, minerals, or proteins—in the diet. and scores of less than 17 indicating malnutrition. (29) In March 2004, a registered nurse from each facility completed a questionnaire regarding diagnoses and prescribed drugs. This information, together with assessments and measurements, was evaluated by a specialist in geriatric geriatric /ger·i·at·ric/ (jer?e-at´rik) 1. pertaining to elderly persons or to the aging process. 2. pertaining to geriatrics. ger·i·at·ric adj. 1. medicine before completion of the final diagnoses. Dementia and depression was diagnosed using the DSM-IV DSM-IV Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV). This reference book, published by the American Psychiatric Association, is the diagnostic standard for most mental health professionals in the United States. criteria. (30) The BBS consists of 14 tasks of varied difficulty, all graded on a 5-point ordinal scale ordinal scale (or´d tr.v. re·test·ed, re·test·ing, re·tests To test again. n. A second or repeated test. (Berg B) were done with 1 to 3 days in between tests and were started at the same time of the day ([+ or -] 1 hour). After Berg A, date and time of day were noted and the test protocol for Berg A was placed in an envelope. Thus, only the date and time of Berg A was provided at the administration of Berg B. Test Procedure The participants were interviewed and tested in their own homes or in the corridor outside their rooms. They were informed that they could stop the testing session whenever they wanted and that they were allowed to rest between tasks, ff necessary. They also were told to wear stable and comfortable shoes. There were 4 assessors--2 physical therapists and 2 physical therapist students. All of the assessors were given education before the study. The assessors received the test manual before the education, which consisted of a half-day session during which they could ask questions and a practice assessment was made on a geriatric patient. In addition, the physical therapists had performed the same assessments in 2 previous data collections. For practical reasons, the assessor for each participant was not randomized ran·dom·ize tr.v. ran·dom·ized, ran·dom·iz·ing, ran·dom·iz·es To make random in arrangement, especially in order to control the variables in an experiment. . Data Analysis To investigate the incidence of outliers, a box plot of the distribution of the absolute differences between tests (Berg A and Berg B) was used. An outlier outlier /out·li·er/ (out´li-er) an observation so distant from the central mass of the data that it noticeably influences results. outlier an extremely high or low value lying beyond the range of the bulk of the data. was defined as a participant with a difference of 1.5-3 x interquartile range In descriptive statistics, the interquartile range (IQR), also called the midspread, middle fifty and middle of the #s, is a measure of statistical dispersion, being equal to the difference between the third and first quartiles. (IQR IQR Interquartile Range (statistics) IQR Internet Quick Reference IQR Individual Qualification Record IQR Internal Quality Review ) from the upper or lower edge of the box. An extreme outlier was defined as >3 x IQR. Two subjects were considered to be outliers according to the box plot, but because they were not extreme outliers, their scores were included in the analyses. Their absolute differences were 9 and 11 BBS points, respectively. In Berg A, they had 7 and 48 BBS points, respectively, and their MMSE scores were 12 and 13, respectively. The absolute reliability--also referred to as "repeatability"--was calculated using an analysis of variance (ANOVA anova see analysis of variance. ANOVA Analysis of variance, see there ), according to the procedure described by Bland and Altman. (9) In the one-way ANOVA, the square root of the within-people residual mean square The introduction to this article provides insufficient context for those unfamiliar with the subject matter. Please help [ improve the introduction] to meet Wikipedia's layout standards. You can discuss the issue on the talk page. is the within-subject standard deviation In statistics, the average amount a number varies from the average number in a series of numbers. (statistics) standard deviation - (SD) A measure of the range of values in a set of numbers. ([s.sub.w]), which enables the size of the measurement error to be calculated. (9) The repeatability is calculated according to the equation: [square root of (2)] x 1.96 [s.sub.w] or 2.77 [s.sub.w]. For 95% of pairs of observations, the measurement error is expected to be less than 2.77 [s.sub.w]. To be certain that a change in score is due to a change in function and not just to measurement error, the difference has to be [greater than or equal to] 2.77 [s.sub.w]. Likewise, measurement errors were calculated with 90% and 80% confidence levels ([square root of (2)] x 1.645 [s.sub.w] and [square root of (2)] x 1.28 [s.sub.w], respectively). The exception to the use of this method is when heteroscedasticity occurs (ie, when the measurement error is dependent on the size of the measurement). The occurrence of heteroscedasticity was investigated graphically by plotting each individual's absolute difference against his or her mean and by calculating a rank correlation In statistics, rank correlation is the study of relationships between different rankings on the same set of items. It deals with measuring correspondence between two rankings, and assessing the significance of this correspondence. coefficient using the Kendall Tau-B statistic statistic, n a value or number that describes a series of quantitative observations or measures; a value calculated from a sample. statistic a numerical value calculated from a number of observations in order to summarize them. . (9) The relative reliability was analyzed using the ICC. The ICC (version 3,1) was calculated using a 2-way mixed-effects model for a single measure, and the ICC (version 1,1) was calculated using a 1-way random-effects model for a single measure. All error is assumed to be random measurement error with ICC(1,1). With ICC(3,1), it is assumed that systematic error is not part of the measurement error. When ICC(1,1) equals ICC(3,1), no systematic error is present. (32) The ICC ranges from 0 to 1, where 1 indicates perfect agreement and 0 indicates no agreement. Intraclass correlation coefficients of .90 or higher are generally considered high. (6) Additional analyses also were performed. The variation in each item of the BBS was analyzed by weighted kappa Kappa Used in regression analysis, Kappa represents the ratio of the dollar price change in the price of an option to a 1% change in the expected price volatility. Notes: Remember, the price of the option increases simultaneously with the volatility. . Suggested interpretation of the results of the weighted kappa analysis is: .41-.60=fair agreement, .61-.80=good agreement, .81-.92= very good agreement. (33) The effect of the participant's cognitive function on the difference in total score between tests was evaluated using linear regression Linear regression A statistical technique for fitting a straight line to a set of data points. analysis. The dependent variable was absolute difference in BBS score. The independent variables were the MMSE score and the MMSE score dichotomized into participants with severe cognitive impairment (MMSE score of [less than or equal to] 17) and those without severe cognitive impairment, respectively. Likewise, analyses were performed evaluating the effect of the participant's age as well as evaluating differences between assessments performed by physical therapists and those performed by physical therapist students. All additional regression analyses (concerning participants' cognitive function, age, and physical therapist assessment versus physical therapist student assessment) were adjusted for initial BBS score (Berg A) by adding this as an independent variable. Analyses were performed using SPSS A statistical package from SPSS, Inc., Chicago (www.spss.com) that runs on PCs, most mainframes and minis and is used extensively in marketing research. It provides over 50 statistical processes, including regression analysis, correlation and analysis of variance. software, version 10.0. * A P value of <.05 was considered to indicate statistical significance. Results A description of the characteristics of the 45 participants is presented in Table 1. Four participants had declined in cognitive function to an MMSE score of < 10 since inclusion in March 2004, but their data were still included in the analyses. Thirty-six participants (80%) had 1 day in between the 2 BBS testing occasions, 8 participants (18%) had 2 days in between tests, and 1 participant (2%) had 3 days in between tests. The difference in the time of the day when the test started between Berg A and Berg B ranged from 0 to 60 minutes. The participants' mean BBS score was 30.1 points (SD = 15.9, range=3-53) for Berg A and 30.6 points (SD=15.6, range=4-54) for Berg B. The distribution of the participants' differences in BBS scores between the 2 test occasions is shown in Figure 2. The absolute differences in the BBS scores ranged from 0 to 11 (Tab. 2). Eight participants (18%) showed no difference between the test occasions, 18 participants (40%) had a difference of 1 BBS point or fewer, and 25 participants (56%) had a difference of 2 points or fewer. The mean absolute difference was 2.8 points (SD=2.7, range=0-11), and the median was 2 points. The absolute differences for the 4 participants with an MMSE of <10 were 1, 3, 4, and 8 BBS points, respectively. [FIGURE 2 OMITTED] Absolute Reliability No heteroscedasticity was found, either graphically or statistically (P=.905). The variation analysis, performed according to the procedure described by Bland and Altman, (9) gave a residual mean square of 7.6278. The equation of repeatability gives 2.77 x [square root of (7.6278)] = 7.7 for a 95% confidence level. This implies that a change of 7.7 BBS points must occur to reveal a genuine change in function for a participant. Corresponding figures for 90% and 80% confidence levels were 6.4 and 5.0 BBS points, respectively. When performing the analyses without the participants who had declined in cognitive function below initial inclusion criteria (MMSE score of <10), the results for absolute reliability were 7.4 BBS points (95% confidence level), 6.2 BBS points (90% confidence level), and 4.8 BBS points (80% confidence level). Relative Reliability Both the ICC(3,1) and the ICC(1,1) were calculated to .97. Additional Analyses Twelve of the 14 items showed a good or very good agreement, using weighted kappa (Tab. 3). The variation ranged from .55 (item 14) to .83 (item 5). Regarding absolute difference in scores between Berg A and Berg B, participants with severe cognitive impairment had a mean ([+ or -] SD) difference of 3.2 [+ or -] 3.0 BBS points compared with 2.6 [+ or -] 2.4 BBS points for participants without severe cognitive impairment (P=.498). There was no significant association between the MMSE scores and absolute difference in BBS scores between tests (P=.882). In addition, the absolute difference was not significantly associated with the participant's age (P=.286). The physical therapists assessed 28 participants, and the physical therapist students assessed 17 participants. The absolute differences in BBS scores between tests were a mean ([+ or -] SD) of 2.5 [+ or -] 2.2 BBS points for the physical therapists and 3.5 [+ or -] 3.4 BBS points for the physical therapist students (P=.233). Discussion The absolute reliability indicated large variability on an individual level where a change of 8 BBS points must occur before a change in function can be detected, using a 95% confidence interval confidence interval, n a statistical device used to determine the range within which an acceptable datum would fall. Confidence intervals are usually expressed in percentages, typically 95% or 99%. , among older people who are dependent in ADL and living in residential care facilities. Contrary to the absolute value, the relative intrarater reliability for the BBS was high (ICC=.97). The variability between the 2 test occasions was not significantly associated with the participants' cognitive function, age, or whether a physical therapist or a physical therapist student performed the assessments. The variability found in this study of absolute intrarater reliability was higher than in a study of patients with Parkinson disease, (10) where absolute reliability was 2.8 BBS points. However, in an absolute interrater reliability study by Stevenson (11) of patients who had a stroke, the variability was 6.9 BBS points (95% confidence interval). Stevenson also calculated an absolute reliability of 6.2 BBS points (95% confidence interval) in data from patients with a recent stroke from an interrater reliability study by Berg et al. (13) Thus, large variability also has been seen in other studies, even though a direct comparison with these 2 studies (11,13) is not applicable because they used interrater reliability and the study by Stevenson used the best performance of 3 attempts at each item. One important reason for the large variability might be difficulties in measuring physical function. (23) However, the main reason for the high variability in the present study is most likely the population's fluctuating fluc·tu·ate v. fluc·tu·at·ed, fluc·tu·at·ing, fluc·tu·ates v.intr. 1. To vary irregularly. See Synonyms at swing. 2. To rise and fall in or as if in waves; undulate. v. function, which is indicated by a high prevalence of diseases and cognitive and physical impairments. This is supported by the large variability in an instrument that evaluates basic mobility and balance--the Timed "Up & Go" Test--when evaluated in a similar population. (34) In addition, the mean value for the BBS was rather low in the present study, which may reflect that many of the participants performed their maximum physical capacity in a majority of the items. When assessing maximum physical capacity, one could assume that minor day-to-day fluctuations in function also would lead to considerable variation in repeated measurements. Cognitive function was not found to be significantly related to the variability in BBS scores in the present study. However, because the great majority of the participants had cognitive impairment, this might have contributed to the large variability found. In earlier studies of reliability of BBS scores, (10,11,13) the participants seem to have had a higher level of cognitive function than those in the present study. The high ICC value in this study supports earlier intrarater reliability studies of the BBS where ICCs ranged from .87 to .99. (10,14) However, this finding contradicts the interpretation of the result of the absolute reliability. The limitations with ICC, where the ICC value might be an effect of the range of scores among individuals, also could be demonstrated from earlier reliability studies of the BBS. In the study by Berg et al, (13) with a large range among individuals' scores (mean BBS score [[+ or -] SD]=37.1 [+ or -] 17.2), ICC was calculated as .98 and absolute reliability was 6.2 points (11,13) compared with an ICC value of .87 and an absolute reliability value of 2.8 points in a study by Lim et al (10) with a lower range of participants' scores (mean BBS score [[+ or -] SD]=53.8 [+ or -] 2.0). Although absolute reliability showed less variability in the study by Lim and colleagues, the ICC value indicated the opposite. Several authors (7,8,11,12,35) have discussed the limited value of ICCs when interpreting results from reliability studies, and, according to Finch finch, common name for members of the Fringillidae, the largest family of birds (including over half the known species), found in most parts of the world except Australia. , (36) relative reliability is used to provide information about whether a scale can discriminate among individual performances. This means that high ICC values show that the individuals tested get different scores along the scale, indicating that the scale can differentiate among individual functions. One limitation in this study is that the sample size did not reach the recommended 50 people for using Bland and Altman's statistical method. (8) However, the 45 people included in the present study approached the recommended number. Even though the interval may have become slightly smaller with a larger sample, the results still indicate wide variability in balance performance for these individuals. The main result in this study was calculated with a 95% confidence level, but one could question whether this confidence level is always necessary in the clinical setting when interpreting results from an assessment such as the BBS. Stevenson (11) also has presented lower levels of confidence, which seems useful as an alternative when clinically evaluating an individual's function. Two people were excluded before the second test because of acute illness. However, a more thorough assessment prior to the test occasions (eg, assessment of whether the participant had slept badly or had increased pain) might have provided additional information that may explain some of the variability between the test occasions. The exclusion at baseline of those people who were independent in ADL and those with very low physical or cognitive function limits the external validity External validity is a form of experimental validity.[1] An experiment is said to possess external validity if the experiment’s results hold across different experimental settings, procedures and participants. . Despite this limitation, the participants in this study had a wide range of both cognitive and physical performance, making them representative of a large part of the population living in residential care facilities. Inclusion was done 6 months before the start of the study; consequently, 4 participants had declined in cognitive function to a level below the inclusion criterion. However, exclusion of those 4 participants in the analyses did not notably affect the results. The methodological strengths of the present study are that the testing environment and time of the day were standardized standardized pertaining to data that have been submitted to standardization procedures. standardized morbidity rate see morbidity rate. standardized mortality rate see mortality rate. and that adherence was high. It was considered necessary to have at least one day of rest between tests to avoid having a lower result on the retest because of fatigue. A larger number of days between tests might have increased the risk of other incidents occurring that could affect the health of these older people. Despite large variability at an individual level, the mean for the group was rather consistent between the 2 test occasions, indicating that the BBS is suitable for evaluating groups over time. In addition, the BBS is a functional way of measuring an individual's balance and can provide valuable information for clinicians designing individual exercise programs. It also is easy to administer because it does not require much time or equipment. However, it seems important for clinicians to be careful when using single assessments of the BBS to draw conclusions concerning a change in balance function in the studied population. It has been indicated that the use of the mean of repeated measurements increases the reliability for tests of walking ability. (37) It would be interesting to evaluate this with reference to the BBS. Conclusion Despite a high ICC value, the result of the absolute reliability indicates that a change of 8 BBS points is required to reveal a genuine change in function between 2 assessments using a 95% confidence level among older people who are dependent in ADL and living in residential care facilities. This knowledge is important in the clinical setting when evaluating an individual's change in balance function over time in this group of older people. All authors provided concept/idea/research design and consultation (including review of manuscript before submission). Ms Conradsson provided writing. Ms Conradsson, Ms Lindelof, Ms Malmqvist, and Dr Gustafson provided data collection. Ms Conradsson, Dr Lundin-Olsson, Ms Lindelof, Mr Littbrand, Dr Gustafson, and Dr Rosendahl provided data analysis. Dr Gustafson provided fund procurement The fancy word for "purchasing." The procurement department within an organization manages all the major purchases. , subjects, facilities/equipment, institutional liaisons, and clerical support. The authors acknowledge and thank Michael Stenvall and Halvar Sivertsson for contributing to the data collection and Hans Stenlund for excellent guidance in statistics. They also thank the Social Authorities of the municipality of Umea, the participants, and the staff at the facilities for their cooperation and participation. The study was approved by the Ethics Committee ethics committee A multidisciplinary hospital body composed of a broad spectrum of personnel–eg, physicians, nurses, social workers, priests, and others, which addresses the moral and ethical issues within the hospital. See DNR, Institutional review board. of the Medical Faculty of Umea University. Parts of the study were given as abstract presentations at the 18th Nordic Congress of Gerontology gerontology: see geriatrics. ; May 28-31, 2006; Jyvaskyla, Finland and at a European Union European Union (EU), name given since the ratification (Nov., 1993) of the Treaty of European Union, or Maastricht Treaty, to the European Community workshop; September 25-26, 2006; Oslo, Norway. This work was supported by grants from the Swedish Research Council The Swedish Research Council (Swedish: Vetenskapsrådet) is a Swedish government agency established in 2001, with the responsibility to support and develop basic scientific research. K2005-27VX15357-01A, Erik and Anne-Marie Detlof's Foundation, and the European Union (Interreg Kvarken-MittSkandia). This article was received November 10, 2006, and was accepted April 20, 2007. DOI (Digital Object Identifier) A method of applying a persistent name to documents, publications and other resources on the Internet rather than using a URL, which can change over time. : 10,2522/ptj.20060343 References (1) Shumway-Cook A. Motor Control: Theory and Practical Applications. 2nd ed. Baltimore, Md: Lippincott Williams & Wilkins; 2001. (2) Guralnik JM, Simonsick EM, Ferrucci L, et al. A short physical performance battery assessing lower extremity lower extremity n. The hip, thigh, leg, ankle, or foot. Also called inferior limb, pelvic limb. function: association with self-reported disability and prediction of mortality and nursing home admission. J Gerontol. 1994;49:M85-M94. (3) Guralnik JM, Ferrucci L, Simonsick EM, et al. Lower-extremity function in persons over the age of 70 years as a predictor of subsequent disability. N Engl J Med. 1995; 332:556-561. (4) Masud T, Morris RO. Epidemiology of falls. Age Ageing. 2001;30(suppl 4):3-7. (5) Domholdt E. Rehabilitation Research: Principles and Applications. 3rd ed. St Louis, Mo: Elsevier-Saunders; 2005. (6) Vincent WJ. Statistics in Kinesiology kinesiology Study of the mechanics and anatomy of human movement and their roles in promoting health and reducing disease. Kinesiology has direct applications to fitness and health, including developing exercise programs for people with and without disabilities, preserving . 2nd ed. Champaign, III: Human Kinetics kinetics: see dynamics. Kinetics (classical mechanics) That part of classical mechanics which deals with the relation between the motions of material bodies and the forces acting upon them. Inc; 1999. (7) Bland JM, Altman DG. A note on the use of the intraclass correlation coefficient in the evaluation of agreement between two methods of measurement. Comput Biol Med. 1990;20:337-340. (8) Rankin G, Stokes Stokes , William 1804-1878. British physician. Known especially for his studies of diseases of the chest and heart, he expanded on the observations of John Cheyne in describing the breathing irregularity now known as Cheyne-Stokes respiration. M. Reliability of assessment tools in rehabilitation: an illustration of appropriate statistical analyses. Clin Rehabil. 1998;12:187-199. (9) Bland JM, Altman DG. Measurement error. BMJ BMJ n abbr (= British Medical Journal) → vom BMA herausgegebene Zeitschrift . 1996;313:744. (10) Lim LI, van Wegen EE, de Goede CJ, et al. Measuring gaff and gaff-related activities in Parkinson's patients own home environment: a reliability, responsiveness and feasibility study The analysis of a problem to determine if it can be solved effectively. The operational (will it work?), economical (costs and benefits) and technical (can it be built?) aspects are part of the study. Results of the study determine whether the solution should be implemented. . Parkinsonism Relat Disord. 2005;11:19-24. (11) Stevenson TJ. Detecting change in patients with stroke using the Berg Balance Scale. Aust J Physiother. 2001;47:29-38. (12) Beckerman H, Roebroeck ME, Lankhorst GJ, et al. Smallest real difference, a link between reproducibility and responsiveness. Qual Life Res. 2001;10:571-578. (13) Berg KO, Wood-Dauphinee S, Williams JI. The Balance Scale: reliability assessment with elderly residents and patients with an acute stroke. Scand J Rehabil Med. 1995; 27:27-36. (14) Berg KO, Wood-Dauphinee S, Williams JI, Gayton D. Measuring balance in the elderly: preliminary development of an instrument. Physiother Can. 1989;41: 304-311. (15) Berg KO, Wood-Dauphinee SL, Williams JI, Maki B. Measuring balance in the elderly: validation of an instrument. Can J Public Health. 1992;83(suppl 2):S7-S11. (16) Jensen J, Lundin-Olsson L, Nyberg L, Gustafson Y. Fall and injury prevention in older people living in residential care facilities: a cluster randomized trial. Ann Intern intern /in·tern/ (in´tern) a medical graduate serving in a hospital preparatory to being licensed to practice medicine. in·tern or in·terne n. Med. 2002;136:733-741. (17) Campbell AJ, Buchner DM. Unstable disability and the fluctuations of frailty frailty Vox populi A state of delicacy or weakness which, which encompasses age-related fragility, in particular osteoporosis. See FICSIT, Osteoporosis. . Age Ageing. 1997;26:315-318. (18) Phillips CD, Chu CW, Morris JN, Hawes C. Effects of cognitive impairment on the reliability of geriatric assessments geriatric assessment, n the evaluation of the physical, mental, and emotional health of elderly patients. in nursing homes. J Am Geriatr Soc. 1993;41: 136-142. (19) Tappen RM, Roach roach: see cockroach. roach Common European sport fish (Rutilus rutilus) of the carp family (Cyprinidae), found in lakes and slow rivers. A high-backed, yellowish green fish with red eyes and reddish fins, the roach is 6–16 in. KE, Buchner D, et al. Reliability of physical performance measures in nursing home residents with Alzheimer's disease Alzheimer's disease (ăls`hī'mərz, ôls–), degenerative disease of nerve cells in the cerebral cortex that leads to atrophy of the brain and senile dementia. . J Gerontol A Biol Sci Med Sci. 1997;52:M52-M55. (20) Katz S, Ford AB, Moskowitz RW, et al. Studies of illness in the aged. The index of ADL: a standardized measure of biological and psychosocial psychosocial /psy·cho·so·cial/ (si?ko-so´shul) pertaining to or involving both psychic and social aspects. psy·cho·so·cial adj. Involving aspects of both social and psychological behavior. function. JAMA JAMA abbr. Journal of the American Medical Association . 1963; 185:914-919. (21) Folstein MF, Folstein SE, McHugh PR. "Mini-mental state": a practical method for grading the cognitive state Noun 1. cognitive state - the state of a person's cognitive processes state of mind interestedness - the state of being interested amnesia, memory loss, blackout - partial or total loss of memory; "he has a total blackout for events of the evening" of patients for the clinician. J Psychiatr Res. 1975; 12:189-198. (22) Mahoney FI, Barthel DW. Functional evaluation: the Barthel Index. Md State Med J. 1965;14:61-65. (23) Wade DT. Measurement in Neurological neurological, neurologic pertaining to or emanating from the nervous system or from neurology. neurological assessment evaluation of the health status of a patient with a nervous system disorder or dysfunction. Rehabilitation. Oxford, United Kingdom: Oxford University Press; 1992. (24) Tombaugh TN, McIntyre NJ. The minimental state examination: a comprehensive review. J Am Geriatr Soc. 1992;40: 922-935. (25) Sheikh sheikh or shaykh Among Arabic-speaking tribes, especially Bedouin, the male head of the family, as well as of each successively larger social unit making up the tribal structure. The sheikh is generally assisted by an informal tribal council of male elders. JI, Yesavage JA. Geriatric Depression Scale (GDS): recent evidence and development of a shorter version. Clin Gerontol. 1986;5:165-172. (26) Alden D, Austin C, Sturgeon sturgeon, primitive fish of the northern regions of Europe, Asia, and North America. Unlike evolutionarily advanced fishes, it has a fine-grained hide, with very reduced scalation, a mostly cartilaginous skeleton, upturned tail fins, and a mouth set well back on the R. A correlation between the Geriatric Depression Scale long and short forms. J Gerontol. 1989;44:P124-P125. (27) Collen FM, Wade DT, Bradshaw CM. Mobility after stroke: reliability of measures of impairment and disability. Int Disabil Stud stud 1. purebred. 2. a place, usually a farm, at which purebred animals are maintained and reproduced. stud animal an animal registered in a stud book. . 1990;12:6-9. (28) Guigoz Y, Vellas B, Garry P. Mini Nutritional Assessment: a practical assessment tool for grading the nutritional state of elderly patients. Facts Res Gerontol. 1994; 4:15-59. (29) Vellas B, Guigoz Y, Garry PJ, et al. The Mini Nutritional Assessment (MNA MNA Monitored Natural Attenuation MNA Massachusetts Nurses Association MNA Michigan Nonprofit Association MNA Mini-Nutritional Assessment MNA Mission to North America (Presbyterian Church in America outreach) ) and its use in grading the nutritional state of elderly patients. Nutrition. 1999;15: 116-122. (30) Diagnostic and Statistical Manual of Mental Disorders Diagnostic and Statistical Manual of Mental Disorders /Di·ag·nos·tic and Sta·tis·ti·cal Man·u·al of Men·tal Dis·or·ders/ (DSM) a categorical system of classification of mental disorders, published by the American Psychiatric Association, that delineates objective . 4th ed. Washington DC: American Psychiatric Association The American Psychiatric Association (APA) is the main professional organization of psychiatrists and trainee psychiatrists in the United States, and the most influential world-wide. Its some 148,000 members are mainly American but some are international. ; 1994. (31) Jensen J, Lundin-Olsson L, Lindmark B, et al. The Berg Balance Scale: evaluation of inter-rater reliability Inter-rater reliability, Inter-rater agreement, or Concordance is the degree of agreement among raters. It gives a score of how much , or consensus, there is in the ratings given by judges. [in Swedish]. Nordisk Fysioterapi. 1998;2:3-8. (32) Moe-Nilssen R. Test-retest reliability of trunk accelerometry during standing and walking. Arch Phys Med Rehabil. 1998;79: 1377-1385. (33) Byrt T. How good is that agreement? Epidemiology. 1996;7:561. (34) Nordin E, Rosendahl E, Lundin-Olsson L. Timed "Up & Go" Test: reliability in older people dependent in activities of daily living-focus on cognitive state. Phys Ther. 2006;86:646-655. (35) Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet lancet /lan·cet/ (lan´set) a small, pointed, two-edged surgical knife. lan·cet n. . 1986;1:307-310. (36) Finch E. Physical Rehabilitation physical rehabilitation See Physical therapy. Outcome Measures. 2nd ed. Hamilton, Ontario, Canada: Lippincott Williams & Wilkins; 2002. (37) Connely DM, Stevenson TJ, Vandervoort AA. Between- and within-rater reliability of walking tests in a frail elderly frail elderly, n.pl older persons (usually over the age of 75 years) who are afflicted with physical or mental disabilities that may interfere with the ability to independently perform activities of daily living. population. Physiother Can. 1996;1:47-51. * SPSS Inc, 233 S Wacker Wacker may refer to:
M Conradsson, PT, MSc, is Research Assistant, Department of Community Medicine and Rehabilitation, Geriatric Medicine, Umea University, SE-901 87 Umea, Sweden. L Lundin-Olsson, PT, PhD, is Associate Professor, Department of Community Medicine and Rehabilitation, Physiotherapy physiotherapy: see physical therapy. , Umea University. N Lindelof, PT, MSc, is a PhD Student, Department of Community Medicine and Rehabilitation, Geriatric Medicine, Umea University, and a PhD student, Physiotherapy Unit, Department of Health Sciences, Lulea University of Technology, SE-971 87. Lulea, Sweden. H Littbrand, PT, MSc, is a PhD student, Department of Community Medicine and Rehabilitation, Geriatric Medicine, Umea University. L Malmqvist, PT, BSc, is Physiotherapist physiotherapist /phys·io·ther·a·pist/ (-ther´ah-pist) physical therapist. physiotherapist physical therapist. , Department of Community Medicine and Rehabilitation, Geriatric Medicine, Umea University. Y Gustafson, MD, PhD, is Professor, Department of Community Medicine and Rehabilitation, Geriatric Medicine, Umea University. E Rosendahl, PT, PhD, holds Junior Research Position, Physiotherapy Unit, Department of Health Sciences, Lulea University of Technology. Address all correspondence to Dr Rosendahl at: erik.rosendahl@ltu.se. [Conradsson M, Lundin-Olsson L, Lindelof N, et al. Berg Balance Scale: intrarater test-retest reliability among older people dependent in activities of daily living and living in residential care facilities. Phys Ther. 2007;87:1155-1163.]
Table 1.
Characteristics of the Participants (N=45) (a)
Characteristic
Age (y), mean[+ or -]SD (range) 82.3[+ or -]6.6 (68-96)
Female sex, n (%) 36 (80)
Diagnoses and medical conditions
Depression, n (%) 29 (64)
Dementia, n (%) 30 (67)
Previous stroke, n (%) 10 (22)
Heart failure, n (%) 14 (31)
Previous fracture (during 13 (29)
the last 5 years), n (%)
Delirium episode (in 12 (27)
the last month), n (%)
Urinary tract infection 20 (44)
(in the last year), n (%)
Drugs for regular use
Analgesics, n (%) 20 (44)
No. of prescribed drugs, 9.6[+ or -]3.5 (3-17)
mean[+ or -]SD (range)
Functional assessments
Barthel ADL Index, mean 13.7[+ or -]5.0 (2-19)
[+ or -]SD (range)
Independent walker, with/ 33 (73)
without aid indoors, (b)
n (%)
FAC, median (interquartile 4 (3-4)
range)
Severe physical impairment 13 (29)
(FAC [less than or equal
to]3), n (%)
MMSE, mean[+ or -]SD 17.5[+ or -]6.3 (4-30)
(range), n=44
Severe cognitive impairment 23 (52)
(MMSE <18), n (N), n=44
Severe physical or 27 (60)
cognitive impairment,
n (%)
Geriatric Depression Scale, 3.8[+ or -]3.2 (0-13)
mean[+ or -]SD (range),
n=38
Mini Nutritional Assessment, 20.9[+ or -]3.7 (7-27)
mean[+ or -]SD (range),
n=41
(a) Functional assessments were made in connection with the
assessment of the Berg Balance Scale. Data for all other
characteristics were collected 6 months earlier. Numbers of
participants after a characteristic indicate that assessments
are missing. FAC=Functional Ambulation Categories, MMSE=Mini
Mental State Examination.
(b)Assessed with the Barthel ADL Index.
Table 2.
Absolute Difference in Total Scores of the Berg Balance Scale (BBS)
Between the 2 Test Occasions
Differences in No. of Percentage Cumulative
BBS Points Participants Percentage
(N=45)
0 8 18 18
1 10 22 40
2 7 16 56
3 6 13 69
4 4 9 78
5 3 7 84
G 2 4 89
7 1 2 91
8 2 4 96
9 1 2 98
11 1 2 100
Table 3.
Overview of the Berg Balance Scale Items and Absolute Difference in
Score for Each Item Between the 2 Test Occasions (a)
Item Total 1-Point [greater than or
Agreement, Difference, equal to] 2-Point
n (%) n (%) Difference,n (%)
1 Sit to stand 33 (73) 10 (22) 2 (4)
2 Standing unsupported 36 (80) 9 (20) 0 (0)
3 Sitting unsupported 41 (91) 4 (3) 0 (0)
4 Stand to sit 34 (76) 9 (20) 2 (4)
5 Transfers 36 (80) 7 (16) 2 (4)
6 Standing with 29 (64) 8 (18) 8 (18)
eyes closed
7 Standing with 30 (67) 10 (22) 5 (11)
feet together
8 Reaching forward 31 (69) 10 (22) 4 (9)
while standing
9 Retrieving object 37 (82) 4 (9) 4 (9)
from floor
10 Turning trunk 25 (56) 12 (27) 8 (18)
(feet fixed)
11 Turning 360[degrees] 30 (67) 10 (22) 5 (11)
12 Placing alternate 32 (71) 7 (16) 6 (13)
foot on stool
13 Tandem standing 34 (76) 8 (18) 3 (18)
14 Standing on one leg 32 (71) 10 (22) 3 (7)
Item Weighted
Kappa
(95% CI (b))
1 Sit to stand .74 (.58-.89)
2 Standing unsupported .82 (.72-.93)
3 Sitting unsupported .72 (.49-.96)
4 Stand to sit .70 (.53-.87)
5 Transfers .83 (.72-.94)
6 Standing with .63 (.46-.80)
eyes closed
7 Standing with .74 (.61-.87)
feet together
8 Reaching forward .71 (.57-.85)
while standing
9 Retrieving object .81 (.68-.95)
from floor
10 Turning trunk .58 (.40-.75)
(feet fixed)
11 Turning 360[degrees] .64 (.46-.82)
12 Placing alternate .73 (.60-.86)
foot on stool
13 Tandem standing .78 (.64-.91)
14 Standing on one leg .55 (34-.77)
(a) Each item is graded 0-4 points. Suggested interpretation of the
results of the weighted kappa analysis: .41-.60=fair agreement,
.61-.80=good agreement, .81-.92=very good agreement. (33)
(b) CI=confidence interval.
|
|
||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion