Printer Friendly
The Free Library
6,672,335 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Practical significance: the use of effect sizes in school counseling research.


In response to the increasing call by professional organizations, journal editors, and statisticians Statisticians or people who made notable contributions to the theories of statistics, or related aspects of probability, or machine learning: A to E
  • Odd Olai Aalen (1947–)
  • Gottfried Achenwall (1719–1772)
  • Abraham Manie Adelstein (1916–1992)
 to include not only derived significance levels from quantitative statistical procedures but also measures of effect sizes (ES), this article first provides a rationale rationale (rash´nal´),
n the fundamental reasons used as the basis for a decision or action.
 for school counseling-related researchers to include these key indices in their studies. We contend that with this information, readers are better able to contextualize con·tex·tu·al·ize  
tr.v. con·tex·tu·al·ized, con·tex·tu·al·iz·ing, con·tex·tu·al·iz·es
To place (a word or idea, for example) in a particular context.
 and properly interpret research findings and conclusions. Second, the three major ES families and the indices associated with each one are reviewed. Next, the influence of various research designs and differing sample sizes on the size of these effects is discussed, followed by an introduction to ES computation Computation is a general term for any type of information processing that can be represented mathematically. This includes phenomena ranging from simple calculations to human thinking. , reporting, and interpretation, particularly as these issues relate to school counseling settings. Practical examples, ES summary tables, and supplementary resources are overviewed as well.

**********

Numerous publications have advocated for additional and higher-quality school counseling research, especially in regard to comprehensive school counseling programs (e.g., American American, river, 30 mi (48 km) long, rising in N central Calif. in the Sierra Nevada and flowing SW into the Sacramento River at Sacramento. The discovery of gold at Sutter's Mill (see Sutter, John Augustus) along the river in 1848 led to the California gold rush of  School Counselor A school counselor is a counselor and educator who works in schools, and have historically been referred to as "guidance counselors" or "educational counselors," although "Professional School Counselor" is now the preferred term.  Association [ASCA ASCA American School Counselor Association
ASCA Australian Shepherd Club of America
ASCA Arab Society of Certified Accountants
ASCA American Swimming Coaches Association
ASCA American Society of Consulting Arborists
ASCA Association of State Correctional Administrators
], 2005b; Bauman Bauman is a surname and may refer to:
  • Anna Bauman
  • Christopher Bauman, Jr.
  • Eric Bauman, owner of eBaum's World
  • Neil Bauman, father and secretary of Eric Bauman
  • Joe Bauman
  • Nikolay Bauman, Russian revolutionary
  • Robert Bauman
  • Zygmunt Bauman
 et al., 2002; Brown & Trusty, 2005a, 2005b; Lapan, 2001, 2005; Sink, 2005). In fact, prominent school counselor educators (e.g., Astramovich, Coker Coker can refer to several things:
  • Coker, Alabama, United States town
  • Coker, Texas
  • East Coker, English village
  • Coker College, in the U.S. state of South Carolina
  • Charles W.
, & Hoskins Hoskins is a surname, and may refer to:
  • Albert Hoskins
  • Allen "Farina" Hoskins
  • Andrew Hoskins
  • Sir Anthony Hoskins
  • Basil Hoskins, British actor
  • Bob Hoskins, British actor
  • Brian Hoskins
  • Donald Hoskins
  • Eric Hoskins
  • George Hoskins (coach)
, 2005; Baker & Gerler, 2004; Gysbers & Henderson Henderson.

1 City (1990 pop. 25,945), seat of Henderson co., NW Ky., on the Ohio River, in an oil, coal, tobacco, corn, and livestock area; founded 1797, inc. as a city 1867.
, 2006; Myrick, 2003; Schmidt, 2002) and the authors of the ASCA National Model[R] (2005a) have suggested that accountability data must be collected to appraise appraise v. to professionally evaluate the value of property including real estate, jewelry, antique furniture, securities, or in certain cases the loss of value (or cost of replacement) due to damage.  the efficacy of program interventions, activities, and services. Moreover, ASCA's (2004) Ethical Standards for School Counselors indicate that a practitioner who acts in a professional manner "conducts appropriate research and report findings in a manner consistent with acceptable educational and psychological research practices" (F.lc). To further underscore The underscore character (_) is often used to make file, field and variable names more readable when blank spaces are not allowed. For example, NOVEL_1A.DOC, FIRST_NAME and Start_Routine.

(character) underscore - _, ASCII 95.
 this ethical declaration as well as reinforcing Bauman et al.'s sentiments, Whiston Whiston is the name of several places in England:
  • Whiston, Merseyside
  • Whiston, Northamptonshire
  • Whiston, South Yorkshire
See also
  • William Whiston
 (2003) argued that "the future of professional school counseling is at risk unless there is a shared commitment to conducting research that clearly documents how professional school counselors make a positive difference in students' lives" (p. 447).

One of the major challenges with school counseling research is determining when an investigation's statistically significant findings are actually useful to practitioners. In other words Adv. 1. in other words - otherwise stated; "in other words, we are broke"
put differently
, how do school counselors know when the results of a statistical analysis (e.g., t test or correlational analysis Noun 1. correlational analysis - the use of statistical correlation to evaluate the strength of the relations between variables
statistics - a branch of applied mathematics concerned with the collection and interpretation of quantitative data and the use of
) have some real-world application? This practical "applicability" problem is chiefly an issue in quantitative studies, that is, those investigations that attempt to understand and summarize sum·ma·rize  
intr. & tr.v. sum·ma·rized, sum·ma·riz·ing, sum·ma·riz·es
To make a summary or make a summary of.



sum
 school counseling trends using numerical data Numerical data (or quantitative data) is data measured or identified on a numerical scale. Numerical data can be analysed using statistical methods, and results can be displayed using tables, charts, histograms and graphs. .

Take, for example, a hypothetical Hypothetical is an adjective, meaning of or pertaining to a hypothesis. See:
  • Hypothesis
  • Hypothetical
  • Hypothetical (album)
 quantitative study that includes five urban school districts and nearly 150 middle school students who were referred more than 10 times over a period of 2 years to school principals for inappropriate classroom behavior. A school counseling researcher wants to determine the effectiveness of three individual counseling methods (the independent variable called counseling method)--including Glasser's reality therapy (RT), Rogerian counseling (RC), and solution-focused counseling (SFC SFC
abbr.
sergeant first class
)--on the mean (M) recidivism recidivism: see criminology.  rate of inappropriate classroom behavior (the dependent variable). The researcher then computes an analysis of variance The discrepancy between what a party to a lawsuit alleges will be proved in pleadings and what the party actually proves at trial.

In Zoning law, an official permit to use property in a manner that departs from the way in which other property in the same locality
 (ANOVA anova

see analysis of variance.

ANOVA Analysis of variance, see there
) to statistically compare potential differences among the M recidivism scores. A statistically significant difference is found among the three individual counseling methods. The derived statistical output from this analysis includes the following: F(2, 142) = 15.89, p < .05 ([M.sub.Recidivism] for SFC = 5; [M.sub.Recidivism] for RT = 10; [M.sub.Recidivism] for RC = 20, with the smaller the M, the lower the recidivism rate for behavior problems).

Interpreting these results, and others not reproduced here, the researcher demonstrates that SFC yielded the greatest amount of behavior change Behavior change refers to any transformation or modification of human behavior. Such changes can occur intentionally, through behavior modification, without intention, or change rapidly in situations of mental illness.  (i.e., lowest recidivism rate of inappropriate classroom behavior) in middle school students. However, there is the question of practical applicability even though the above ANOVA result is statistically significant. Is it important enough for school counselors in the five districts to take seriously and change their individual counseling approach when working with students with major classroom behavior problems? Or more critically, should school counselors use SFC with middle school students more frequently than Roger's feeling-focused approach and Glasser's RT based on this statistical finding? A good way to estimate the applicability of research findings to school counseling practice is to calculate an effect size (ES).

In this article, we focus on the most widely used parametric ES See parametric modeling, parametric symbol and PTC.  measures and how they can be used in school counseling outcomes research. (Parametric data See parametric symbol.  are data--e.g., test scores--that approximate a bell-shaped Bell´-shaped`   

a. 1. Having the shape of a wide-mouthed bell; campanulate.

Adj. 1. bell-shaped - having a convex shape that resembles a bell
bulging, convex - curving or bulging outward
 or normal curve.) In particular, we (a) discuss the rationale for reporting ESs, (b) overview the primary ES categories and the ESs associated with each, (c) explore how disparate research designs and sample sizes affect the magnitude of ESs, (d) highlight ES computations, (e) consider how these indices should be reported and interpreted in school counseling settings, and (f) provide additional ES resources to consult. Before discussing the two principal types of ESs, a brief caveat is in order. Given limitations on article length, we cannot explicate fully much of the technical material presented below. Before applying ESs to the results of a small or large study, readers are encouraged to first read up on the topic (see below for recommendations) and consult with a research-oriented counselor educator at a local university.

WHY SHOULD EFFECT SIZES BE REPORTED?

Although the value of ESs in counseling, educational, and social science research has certainly received more attention in recent years, the Years, The

the seven decades of Eleanor Pargiter’s life. [Br. Lit.: Benét, 1109]

See : Time
 idea is not new. Ronald Fisher suggested more than 75 years ago that researchers include a correlation ratio In statistics, the correlation ratio is a measure of the relationship between the statistical dispersion within individual categories and the dispersion across the whole population or sample.  or an eta squared ([[eta].sup.2]) with their ANOVA findings to indicate the magnitude of association between the independent and dependent variables (Huberty, 2002). Since then, more than 40 "effect magnitude" indices have been suggested (Kirk, 1996). What is new or at least what has become more pressing, given the No Child Left Behind (U.S. Department of Education, 2001) legislation, is the need for school counseling researchers to present data and findings that reflect both the statistical and practical significance of their work. Cohen cohen
 or kohen

(Hebrew: “priest”) Jewish priest descended from Zadok (a descendant of Aaron), priest at the First Temple of Jerusalem. The biblical priesthood was hereditary and male.
 (1990), in fact, adamantly ad·a·mant  
adj.
Impervious to pleas, appeals, or reason; stubbornly unyielding. See Synonyms at inflexible.

n.
1. A stone once believed to be impenetrable in its hardness.

2. An extremely hard substance.
 stated that the point of research should be to measure the magnitude of an effect rather than simply its statistical significance.

Why, then, should school counseling researchers and those publishing in Professional School Counseling report both a p value indicating the research finding's statistical significance and an ES index as an estimate of its practical significance (Kirk, 1996)? In this section, we attempt to make the case for including ESs in research studies and program evaluations Program evaluation is a formalized approach to studying and assessing projects, policies and program and determining if they 'work'. Program evaluation is used in government and the private sector and it's taught in numerous universities. . First, when investigators only report a statistical test's derived significance level (i.e., its p value), readers are left without a useful statistical tool to determine the importance or practicality of the finding for their own work settings. Suppose, for instance, an instructional method used during an elementary school elementary school: see school.  classroom guidance lesson increased the mean self-concept self-concept
n.
An individual's assessment of his or her status on a single trait or on many human dimensions using societal or personal norms as criteria.
 score (the dependent variable) of third graders in the experimental group by 5 points over those students in the control group. This fairly modest mean difference actually may be statistically significant (p < .05), especially if the sample size is large enough. In other words, "whether or not such a 5-point difference (i.e., magnitude of effect) between the groups is meaningful from an instructional standpoint The Standpoint is a newspaper published in the British Virgin Islands. It was originally published under the name Pennysaver, largely as a shopping-coupon promotional newspaper, but since emerged as one of the most influential sources of journalism in the  depends on many factors besides the statistically significant p value" (Snyder Snyder, city (1990 pop. 12,195), seat of Scurry co., NW Tex., in a prairie and mesquite region; inc. 1907. Oil production is the city's main industry; natural gas is also refined and processed.  & Lawson The name Lawson can refer to a number of different things: People
  • Alfred Lawson - 20th century American aviator, reformer, utopian and religious leader
  • Alfred Lawson, Jr.
, 1993, p. 335).

Vasquez Vázquez (or Vásquez, Vásques) is a Galician-Spanish surname.
  • Andres Vasquez, a Peruvian-born Swedish football midfielder.
  • Andrew Vasquez, Native American flute player
  • Catalina Vasquez Villalpando, treasurer of the United States
, Gangstead, and Henson Hen·son   , Jim 1936-1990.

American puppeteer and creator of the Muppets, a troupe of puppets including Kermit the Frog, Ernie and Bert, and Miss Piggy.

Noun 1.
 (2000) further elaborated on how statistically derived p values are sharply influenced by the sample size:
   Small differences/relationships can be interpreted
   as statistically significant based upon
   the presence of large sample sizes, or conversely,
   large differences/relationships can be
   declared statistically non-significant due to
   small sample sizes.... A relatively small p
   value does not necessarily mean that there is a
   strong relationship between the independent
   and dependent variables of interest in a study.
   (pp. 4-5)


Other writers have suggested that a single study resulting in a decision to reject or accept a null hypothesis null hypothesis,
n theoretical assumption that a given therapy will have results not statistically different from another treatment.

null hypothesis,
n
 based on an a priori a priori

In epistemology, knowledge that is independent of all particular experiences, as opposed to a posteriori (or empirical) knowledge, which derives from experience.
 alpha level does little to advance the development of theory, whereas the reporting of an ES allows the comparison of research studies and the contextualization Contextualization of language use
Contextualization is a word first used in sociolinguistics to refer to the use of language and discourse to signal relevant aspects of an interactional or communicative situation.
 of findings (e.g., Huberty, 1987). In short, if researchers fail to report ESs, and only include the research findings' derived significance levels, key information is missing that assists in understanding the practical value of the results.

A second reason for reporting ESs comes from the American Psychological Association's (APA (All Points Addressable) Refers to an array (bitmapped screen, matrix, etc.) in which all bits or cells can be individually manipulated.

APA - Application Portability Architecture
) Publication Manual (2001; see also Fidler, 2002, for an overview), the most widely accepted writing style manual for school counseling, psychological, and educational researchers. It certainly encourages the use of ESs:
   For the reader to fully understand the importance
   of your findings, it is almost always necessary
   to include some index of effect size or
   strength of relationship in your Results section.
   You can estimate the magnitude of the
   effect or the strength of the relationship with
   a number of common effect size estimates....
   The general principle to be followed ... is to
   provide the reader not only with information
   about statistical significance but also with
   enough information to assess the magnitude
   of the observed effect or relationship. (APA,
   pp. 25-26)


According to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 Thompson Thompson, city, Canada
Thompson, city (1991 pop. 14,977), central Man., Canada, on the Burntwood River. A mining town, it developed after large nickel deposits were discovered in the area in 1956.
 (2006), 24 journals have heeded APA's suggestion (see Wilkinson Noun 1. Wilkinson - English chemist honored for his research on pollutants in car exhausts (born in 1921)
Sir Geoffrey Wilkinson
 & APA Task Force on Statistical Inference Inferential statistics or statistical induction comprises the use of statistics to make inferences concerning some unknown aspect of a population. It is distinguished from descriptive statistics. , 1999) and now require or strongly recommend that ESs be reported in manuscripts submitted for publication, including, for example--

* Career Development Quarterly

* Contemporary Educational Psychology

* Counseling &. Values

* Educational and Psychological Measurement

* Exceptional Children

* Journal of Applied Psychology Journal of Applied Psychology is a publication of the APA. It has a high impact factor for its field. It typically publishes high quality empirical papers.

www.apa.


* Journal of Community Psychology

* Journal of Consulting and Clinical Psychology The Journal of Consulting and Clinical Psychology (JCCP) is a bimonthly psychology journal of the American Psychological Association. Its focus is on treatment and prevention in all areas of clinical and clinical-health psychology and especially on topics that appeal to a broad

* Journal of Counseling & Development

* Journal of Educational and Psychological Consultation

* Measurement and Evaluation in Counseling and Development

* Research in the Schools.

Thompson (2000) provided additional reasons why reporting ESs is important, because by doing so it (a) facilitates higher-quality meta-analytic research reviews, (b) assists future researchers to devise more specific study parameters and outcome expectations, and (c) aids with evaluating how a study's results fit within the context of previous research. (That is, how similar are these results compared to previous studies? What features of the research contributed to these similarities and/or and/or  
conj.
Used to indicate that either or both of the items connected by it are involved.

Usage Note: And/or is widely used in legal and business writing.
 differences?)

To recap re·cap 1  
tr.v. re·capped, re·cap·ping, re·caps
1. To replace a cap or caplike covering on: recapped the bottle.

2.
, the inclusion of ESs above and beyond the reporting significance level is increasingly encouraged, and in many cases a requirement, for the publication of research in reputable rep·u·ta·ble  
adj.
Having a good reputation; honorable.



repu·ta·bil
, peer-reviewed journals peer-reviewed journal Refereed journal Academia A professional journal that only publishes articles subjected to a rigorous peer validity review process. Cf Throwaway journal. . Effect sizes provide an indication of the magnitude of an effect and offers "comparative standards with past and present research ... assist[ing] the researcher in identifying important characteristics for subsequent follow-up follow-up,
n the process of monitoring the progress of a patient after a period of active treatment.


follow-up

subsequent.


follow-up plan
 research" (Vasquez et al., 2000, p. 4). Effect sizes allow investigators to say something more definitive about the practical strength of the findings than merely reporting the derived p value (Thompson, 1998, 2006).

FAMILIES OF EFFECT SIZES

Effect sizes are referred to in numerous ways in counseling, educational, and social science research. They often are interchangeably INTERCHANGEABLY. Formerly when deeds of land were made, where there Were covenants to be performed on both sides, it was usual to make two deeds exactly similar to each other, and to exchange them; in the attesting clause, the words, In witness whereof the parties have hereunto  called ES estimates, ES indices, or ES measures. Kline (2004) indicated that the most commonly used ESs fall into one of two general families: (a) standardized standardized

pertaining to data that have been submitted to standardization procedures.


standardized morbidity rate
see morbidity rate.

standardized mortality rate
see mortality rate.
 mean differences (also called group difference indices; e.g., Cohen's d), or (b) strength of association (also referred to as relationship variance-accounted-for or variance-explained ESs; e.g., Pearson Pear·son   , Lester Bowles 1897-1972.

Canadian politician who served as prime minister (1963-1968). He won the 1957 Nobel Peace Prize for his role in the negotiation of a solution to the Suez crisis (1956).
 [r.sup.2], [[eta].sup.2]). VachaHaase and Thompson (2004) added a third class of ES called "corrected" effect sizes. These three categories are briefly explained next.

Tables 1 and 2 summarize sample standardized mean differences and strength of association ES indices. (Whereas the standardized mean difference ESs are reported in a nonsquared, standardized score metric, the relationship variance-accounted-for ESs are reported in a squared metric--e.g., [r.sup.2] and [R.sup.2] [Thompson, 2002].) The mean difference ESs are computed when the focus of the statistical analysis uses mean outcome scores to compare potential group differences. For instance, attest To solemnly declare verbally or in writing that a particular document or testimony about an event is a true and accurate representation of the facts; to bear witness to. To formally certify by a signature that the signer has been present at the execution of a particular writing so as  may be computed to determine whether there was a significant (p < .05, two-tailed) difference between girls' and boys' (group is the independent variable) mean math test scores (outcome or dependent variable). The means for each group also could be compared and reported, for example, using a Cohen's d ES.

Strength of association ES indices are calculated across a wide variety of research designs (e.g., ex post facto ex post facto adj. Latin for "after the fact," which refers to laws adopted after an act is committed making it illegal although it was legal when done, or increases the penalty for a crime after it is committed. Such laws are specifically prohibited by the U. S.  or causal causal /cau·sal/ (kaw´z'l) pertaining to, involving, or indicating a cause.

causal

relating to or emanating from cause.
 comparative, experimental, quasi-experimental, prediction [regression regression, in psychology: see defense mechanism.
regression

In statistics, a process for determining a line or curve that best represents the general trend of a data set.
 and correlational]) using the general linear model (GLM GLM Global Language Monitor
GLM Global Marine (stock symbol)
GLM Graduated Length Method (ski instruction)
GLM Good Looking Mom (used in pediatric practices)
GLM God Loves Me
; for detailed explanations, see Rutherford Rutherford (rŭth`ərfərd), borough (1990 pop. 17,790), Bergen co., NE N.J., a residential suburb of the New York City–N New Jersey metropolitan area; inc. 1881. Several pre-Revolutionary houses remain there. , 2001; Trochim, 2005) for the statistical analyses. Strength of association ESs indicate how much of the variance associated with the dependent variable (e.g., students' English test scores) can be accounted for (or explained) by the independent variable(s) (e.g., gender, ethnicity ethnicity Vox populi Racial status–ie, African American, Asian, Caucasian, Hispanic , method of counseling; Snyder & Lawson, 1993; Vacha-Haase & Thompson, 2004).

In other words, the GLM statistical procedure deployed to compute To perform mathematical operations or general computer processing. For an explanation of "The 3 C's," or how the computer processes data, see computer. , for example, a factorial factorial

For any whole number, the product of all the counting numbers up to and including itself. It is indicated with an exclamation point: 4! (read “four factorial”) is 1 × 2 × 3 × 4 = 24.
 ANOVA produces strength of association or variance-accounted-for ESs. These are based on the ratio of between the specific variance that is related to the independent variable(s) (e.g., variance due to the differential effect of the intervention A procedure used in a lawsuit by which the court allows a third person who was not originally a party to the suit to become a party, by joining with either the plaintiff or the defendant.  on group [independent variable]) to total variance (the combined variance relating to relating to relate prepconcernant

relating to relate prepbezüglich +gen, mit Bezug auf +acc 
 the independent and dependent variables as well as to residual or error variance). These ESs also can be reported as (a) coefficients of determination, where [r.sup.2] represents the squared bivariate bi·var·i·ate  
adj.
Mathematics Having two variables: bivariate binomial distribution.

Adj. 1.
 Pearson correlation coefficients Correlation Coefficient

A measure that determines the degree to which two variable's movements are associated.

The correlation coefficient is calculated as:
 and [R.sup.2] symbolizes the squared multiple correlation Noun 1. multiple correlation - a statistical technique that predicts values of one variable on the basis of two or more other variables
multiple regression
 among three or more variables; and (b) squared etas ([[eta].sup.2]s) generated from a statistical analysis with perhaps several independent and dependent variables.

Both ES families can be further categorized cat·e·go·rize  
tr.v. cat·e·go·rized, cat·e·go·riz·ing, cat·e·go·riz·es
To put into a category or categories; classify.



cat
 into biased (uncorrected) and unbiased (corrected) measures (Fan, 2001; Roberts & Henson, 2002; Snyder & Lawson, 1993; Thompson, 1999, 2002, 2006; Vacha-Haase & Thompson, 2004; see Tables 1 and 2). The former relates to "sampling error" and how the ESs derived from statistical procedures computed with smaller groups of participants (samples) drawn from a larger population will be higher than would be found either in the original population or in participant samples studied at a later date. For example, assume a school counseling researcher randomly selects a sample of 15 girls and 15 boys from a 10th-grade physical education class in one high school. This 30-person sample then is assumed to represent the entire population of male and female students attending all Grade 10 physical education classes across a district's five high schools. Because there is always sampling error (i.e., problems with selecting the students to participate in the research study), uncorrected ESs (e.g., Cohen's d, Glass' [DELTA], [eta.sup.2], [R.sup.2]) will be higher (i.e., positively biased or overestimated) for the 30 students than if the researcher actually could have included in the study every 10th-grade physical education student across all five high schools.

In contrast to biased ESs, unbiased ones (e.g., adjusted [R.sup.2], Hays' [omega.sup.2], [epsilon.sup.2], and Wherry formulas; see Tables 1 and 2) statistically correct for this positive bias by calculating the strength of association indices so that they better reflect the true ESs for the entire population and those calculated in future samples (e.g., see the Herzberg and Lord formulas in Tables 1 and 2). Because these formulas must adjust for sampling error occurring both in the present study and in subsequent investigations, ES estimates for future research samples result in more shrinkage Shrinkage

The amount by which inventory on hand is shorter than the amount of inventory recorded.

Notes:
The missing inventory could be due to theft, damage, or book keeping errors.
 (i.e., are smaller or more conservative) than those ES estimates based on an entire population (Snyder & Lawson, 1993). Therefore, interpretation of ESs is often less than straightforward and must be exercised with caution.

To summarize, parametric ES indices are available both for mean comparison statistical designs (standardized mean difference ESs) and for correlational and GLM statistical designs (strength of association or relationship variance-accounted-for ESs). Moreover, some ESs are uncorrected or biased, while others are corrected or unbiased. Statistical software can now be used to compute ESs and many of these can be found online at various Web sites.

HOW DO DIFFERENT RESEARCH DESIGNS AND SAMPLE SIZES INFLUENCE ES?

The type of research design selected by the school counseling investigator can determine whether to use, for example, a Cohen's d or an [eta.sup.2] ES. As alluded to previously, if the research design, for instance, calls for a statistical comparison between pretest pre·test  
n.
1.
a. A preliminary test administered to determine a student's baseline knowledge or preparedness for an educational experience or course of study.

b. A test taken for practice.

2.
 and posttest post·test  
n.
A test given after a lesson or a period of instruction to determine what the students have learned.
 mean scores for a group of students receiving a school counseling intervention versus those who are not (the control group), the ES to use is a standardized differences index. However, if the research design examines the relationship among variables (i.e., correlational design) and calls for a statistical procedure that uses GLM (e.g., ANOVA), then relationship variance-accotmted-for ESs should be reported.

Numerous factors in research can influence the amount of ES bias (O'Grady, 1982; Olejnik & Algina, 2000), including (a) limited reliability of the outcome or measurement instruments (generally, the better the reliability of the tests used, the higher the ESs); (b) sample size (larger sample sizes generally will produce less biased results); (c) number of independent variables to number of participants (the more participants per variable, the less bias); (d) heterogeneity het·er·o·ge·ne·i·ty
n.
The quality or state of being heterogeneous.



heterogeneity

the state of being heterogeneous.
 of the study sample (more homogenous homogenous - homogeneous  samples generally yield smaller effect sizes); and (e) type of research design (experimental studies generally produce smaller ESs). A well-designed and implemented study will, therefore, diminish the size of the ES bias and make it less likely that a corrected ES will be needed. If the research design is far less than optimal, always report the corrected ESs.

When larger samples (n > 50) are used, the magnitude of biased (uncorrected) and unbiased (corrected) ESs will be essentially the same (Snyder & Lawson, 1993). However, statistical corrections tend to be larger with small sample sizes (n > 30) and produce smaller ESs (Thompson, 1990, 1997). Tables 3 and 4 exemplify ex·em·pli·fy  
tr.v. ex·em·pli·fied, ex·em·pli·fy·ing, ex·em·pli·fies
1.
a. To illustrate by example: exemplify an argument.

b.
 two different group sizes and their resulting ESs using various measures of strength of association. Sample 1 is relatively undersized undersized

see dwarfism, runt.
 (n = 24) and produces a small ES [[eta].sup.2]=.17), whereas sample 2 is large (n = 150) and yields a moderate-level ES ([[eta].sup.2]=.44). These examples also illustrate how sample size can influence how much strength of association indices can vary across different ES formulas. With a small sample size, the ESs range from .02 to .17. In contrast, with a large sample, the ES indices vary little, ranging from .42 to .44. In short, a larger sample size tends to produce more stable ESs.

HOW ARE EFFECT SIZES COMPUTED?

In general, standardized mean difference ESs are relatively easy to calculate. In the numerator numerator

the upper part of a fraction.


numerator relationship
see additive genetic relationship.


numerator Epidemiology The upper part of a fraction
, one computes the difference between the control (or comparison) and experimental groups' average scores, and this figure then is divided by an estimate of the groups' standard deviation In statistics, the average amount a number varies from the average number in a series of numbers.

(statistics) standard deviation - (SD) A measure of the range of values in a set of numbers.
. Strength of association ESs, generally speaking, are derived using the ratio of between-group (or independent variable) variance to total variance. Because strength of association measures are based on a regression model (GLM), most can be interpreted as the percent of variance accounted for in the dependent variable. For example, an [eta.sup.2] of .26 means that 26% of the variance in the dependent variable can be accounted for by the independent variable(s). In GLM statistical analyses, most statistical programs (e.g., SPSS A statistical package from SPSS, Inc., Chicago (www.spss.com) that runs on PCs, most mainframes and minis and is used extensively in marketing research. It provides over 50 statistical processes, including regression analysis, correlation and analysis of variance. , 2005) will include a printout (PRINTer OUTput) Same as hard copy.  with a "partial eta squared" if one selects "estimates of effect sizes" under the "options" on the GLM tab (this is equal to a "R squared," as they are both calculated using the same formula). Most ES measures can be calculated by hand as well using the formulas provided in Tables 1 and 2. For additional information, Coe (2002) and Thompson (1997) have provided a useful overview of ES calculations.

HOW SHOULD EFFECT SIZES BE REPORTED AND INTERPRETED?

Reporting

There are key ES reporting guidelines guidelines,
n.pl a set of standards, criteria, or specifications to be used or followed in the performance of certain tasks.
 in counseling research (Vacha-Haase & Thompson, 2004). First, when including ESs in the results, it is vital to clearly indicate what type of ES is being reported. Without the name of the ES, readers cannot evaluate its strength. A Cohen's d of .75 means something quite different in practical terms than a [R.sup.2] of .75. Second, report the derived ESs in context to the normalcy nor·mal·cy  
n.
Normality.

Noun 1. normalcy - being within certain limits that define the range of normal functioning
normality
 of data used to calculate them (Huberty & Lowman, 2000; Vacha-Haase & Thompson). As mentioned earlier, ESs, like other statistics (e.g., t, F, r), are subject to the parametric assumptions underlying them. When "distribution or homogeneity Homogeneity

The degree to which items are similar.
 assumptions are severely violated vi·o·late  
tr.v. vi·o·lat·ed, vi·o·lat·ing, vi·o·lates
1. To break or disregard (a law or promise, for example).

2. To assault (a person) sexually.

3.
, F and p calculated values may be compromised, but so too will be the effect estimates" (Vacha-Haase & Thompson, p. 477). Another interpretation practice that is strongly encouraged in the ES literature is to formulate formulate /for·mu·late/ (for´mu-lat)
1. to state in the form of a formula.

2. to prepare in accordance with a prescribed or specified method.
 confidence intervals confidence interval,
n a statistical device used to determine the range within which an acceptable datum would fall. Confidence intervals are usually expressed in percentages, typically 95% or 99%.
 around a derived ES estimate (Bird, 2002; Thompson, 2002; Vacha-Haase & Thompson). A confidence interval provides a "band" around the derived ES that indicates how much the ES could fluctuate based on sampling error and, thus, vary from one sample or group of students to the next sample (Thompson).

Interpreting

Cohen (1992) published general guidelines for assessing and interpreting the magnitude of standardized difference effect sizes. As a rule, the stronger (or higher) the ESs, the more compelling the evidence that the statistically significant results are useful for school counseling practice. Table 1 suggests the threshold numbers (cutoffs) for labeling the magnitude of each type of ES. For example, small, medium, and large ds are the absolute values, .2, .5, and .8, respectively. However, the final interpretation of the practical or clinical significance of all ESs ultimately remains with the counseling researcher (Thompson, 2002; Vacha-Haase & Thompson, 2004). Any ES should always be considered within the context of previous related research, the design of the study, and the educational impact of the findings. When comparing ESs across school counseling studies, therefore, make sure to attend to the differences in research designs and how they may influence the size of the reported effects.

Thompson (1999, 2002) also cautioned against applying ES cutoffs with the same rigidity rigidity /ri·gid·i·ty/ (ri-jid´i-te) inflexibility or stiffness.

clasp-knife rigidity
 that has typically been applied in statistical significance testing, noting that Cohen only intended the cutoffs for small, medium, and large ESs as broad, general guidelines, not as inflexible universal standards. More succinctly suc·cinct  
adj. suc·cinct·er, suc·cinct·est
1. Characterized by clear, precise expression in few words; concise and terse: a succinct reply; a succinct style.

2.
, Thompson (2002) suggested that researchers never use these guidelines (the ES cutoffs) blindly "with the same rigidity that the [alpha] = .05 criterion has been used, we would merely be being stupid in a new metric" (p. 68).

SUPPLEMENTARY RESOURCES ON EFFECT SIZES

Because there are numerous Web sites that address the basics and the nuances of ES usage, listing one or two here would not cover the wide variety of Web-based resources available to readers. An Internet Internet

Publicly accessible computer network connecting many smaller networks from around the world. It grew out of a U.S. Defense Department program called ARPANET (Advanced Research Projects Agency Network), established in 1969 with connections between computers at the
 search (e.g., using "Google (Google, Mountain View, CA, www.google.com) The largest search engine on the Web, founded by Larry Page and Sergey Brin, two Stanford University students. In 1996, they developed their "BackRub" search engine, named after its unique page ranking method (explained below). ") for the term effect sizes would produce a substantial number of "hits." Some Web sites and their documents are straightforward and readily comprehensible com·pre·hen·si·ble  
adj.
Readily comprehended or understood; intelligible.



[Latin compreh
 (see, e.g., Coe, 2002), though others are less so. Google also will lead searchers to simple-to-use ES calculators. If readers would prefer helpful texts, we recommend four relatively recent books by Green and Salkind (2004), Grisson and Kim (2005), Kline (2004), and Rosenthal, Rosnow, and Rubin (2000). Many of the publications cited in this article are useful to consult as well.

CLOSING REMARKS

The importance of ES reporting and interpretation is widely recognized in the educational and counseling literature. Effect size reporting is increasingly a preference--if not a requirement--by many educational and counseling journals (e.g., Journal of Counseling & Development) and APA's (2001) Publication Manual. It is imperative that both consumers of research and those interested in conducting school counseling studies understand the value of ESs as a measure of practical significance. Even though this article is by no means an exhaustive discussion of ESs and their uses, we hope that it will serve as a valuable introduction and resource for school counselor educators, practitioners, and graduate students to consult as they attempt to make better sense of statistical findings they produce or read. With the reporting of ESs, administrators and policymakers as well should be more equipped to interpret whether the school counseling research outcomes they are reviewing are applicable to their schools.

References

American Psychological Association The American Psychological Association (APA) is a professional organization representing psychology in the US. Description and history
The association has around 150,000 members and an annual budget of around $70m.
. (2001). Publication manual of the American Psychological Association (5th ed.). Washington, DC: Author.

American School Counselor Association. (2004). Ethical standards for school counselors. Alexandria, VA: Author. Retrieved December 1, 2005, from http://www.schoolcounselor.org/content.asp?contentid=173

American School Counselor Association. (2005a). The ASCA national model: A framework for school counseling programs (2nd ed.). Alexandria, VA: Author.

American School Counselor Association. (2005b). Foundations and basics. Alexandria, VA: Author.

Astra movich, R. L., Coker, J. K., & Hoskins, W. J. (2005).Training school counselors in program evaluation. Professional School Counseling, 9, 49-54.

Baker, S. B., & Gerler, E. R. (2004). School counseling in the twenty-first century (4th ed.). Upper Saddle River Saddle River may refer to:
  • Saddle River, New Jersey, a borough in Bergen County, New Jersey
  • Saddle River (New Jersey), a tributary of the Passaic River in New Jersey
, NJ: Pearson.

Bauman, S., Siegel, J.T., Davis, A., Falco Falco

a genus of the family Falconidae (birds of prey). Includes F. biarmicus—lanner falcon, F. columbaris—kestrel, pigeon hawk or merlin, F. mexicanus—prairie falcon, F. peregrinus—peregrine falcon, F. rusticolus—gyrfalcon, F.
, L. D., Seabolt, K., & Szymanski, G. (2002). School counselors' interest in professional literature and research. Professional School Counseling, 5, 346-352.

Bird, K. D. (2002). Confidence intervals for effect sizes in analysis of variance. Educational & Psychological Measurement, 62, 197-226.

Brown, D., & Trusty, J. (2005a). School counselors, comprehensive school counseling programs, and academic achievement: Are school counselors promising more than they can deliver? Professional School Counseling, 9, 1-8.

Brown, D., & Trusty, J. (2005b). The ASCA national model, accountability, and establishing causal links between school counselors' activities and student outcomes: A reply to Sink. Professional School Counseling, 9, 13-15.

Coe, R. (2002, September). It's the effect size, stupid: What effect size is and why it is important. Paper presented at the annual conference of the British Educational Research Association, University of Exeter, England. Retrieved December 13, 2005, from http://www.leeds.ac.uk/educol/documents/00002182.htm

Cohen, J. (1990).Things I have learned (so far). American Psychologist The American Psychologist is the official journal of the American Psychological Association. It contains archival documents and articles covering current issues in psychology, the science and practice of psychology, and psychology's contribution to public policy. , 45, 1304-1312.

Cohen, J. (1992). A power primer prim·er
n.
A segment of DNA or RNA that is complementary to a given DNA sequence and that is needed to initiate replication by DNA polymerase.
. Psychological Bulletin, 112, 155-159.

Fan, X. (2001). Statistical significance and effect size in education research: Two sides of a coin. Journal of Educational Research, 94, 275-282.

Fidler, F. (2002).The fifth edition of the APA publication manual: Why its statistical recommendations are so controversial. Educational and Psychological Measurement, 62, 749-770.

Green, S. B., & Salkind, N. J. (2004). Using SPSS for Windows and Macintosh:Analyzing and understanding (4th ed.). Upper Saddle River, N J: Prentice Hall Prentice Hall is a leading educational publisher. It is an imprint of Pearson Education, Inc., based in Upper Saddle River, New Jersey, USA. Prentice Hall publishes print and digital content for the 6-12 and higher education market. History
In 1913, law professor Dr.
.

Grisson, R. J., & Kim, J. J. (2005). Effect sizes for research. Mahwah, N J: Erlbaum.

Gysbers, N. C., & Henderson, P. (2006). Developing and managing your school guidance program (4th ed.). Alexandria, VA: American Counseling Association The American Counseling Association (ACA) is a non-profit, professional organization that is dedicated to the counseling profession. ACA is the world's second largest association exclusively representing professional counselors. .

Huberty, C. J. (1987). On statistical testing. Educational Researcher, 16, 4-9.

Huberty, C. J. (2002). A history of effect size indices. Educational and Psychological Measurement, 62, 227-240.

Huberty, C. J., & Lawman, L. L. (2000). Group overlap o·ver·lap
n.
1. A part or portion of a structure that extends or projects over another.

2. The suturing of one layer of tissue above or under another layer to provide additional strength, often used in dental surgery.

v.
 as a basis for effect size. Educational & Psychological Measurement, 60, 543-563.

Kirk, R. E. (1996). Practical significance: A concept whose time has come. Educational & Psychological Measurement, 55, 746-759.

Kline, R. B. (2004). Beyond significance testing. Washington, DC: American Psychological Association.

Lapan, R.T. (2001). Results-based comprehensive guidance and counseling guidance and counseling, concept that institutions, especially schools, should promote the efficient and happy lives of individuals by helping them adjust to social realities.  programs: A framework for planning and evaluation. Professional School Counseling, 4, 289-299.

Lapan, R.T. (2005). Evaluating school counseling programs. In C. A. Sin k (Ed.), Contemporary school counseling: Theory, research, and practice (pp. 257-293). Boston: Houghton-Mifflin. Myrick, R. D. (2003). Developmental guidance and counseling (4th ed.). Minneapolis, MN: Educational Media.

O'Grady, K. E. (1982). Measures of explained variance Explained variance is part of the variance of any residual that can be attributed to a specific condition (cause). The other part of variance is unexplained variance. The higher the explained variance relative to the total variance, the stronger the statistical measure used. : Cautions and limitations. Psychological Bulletin, 92, 766-777.

Olejnik, S., & Algina, J. (2000). Measures of effect size for comparative studies: Application, interpretations, and limitations. Contemporary Educational Psychology, 25, 241-286.

Roberts, J. K., & Henson, R. K. (2002). Correction for bias in estimating effect sizes. Educational and Psychological Measurement, 62, 241-253.

Rosenthal, R., Rosnow, R. L., & Rubin, D. B. (2000). Contrasts and effect sizes in behavioral behavioral

pertaining to behavior.


behavioral disorders
see vice.

behavioral seizure
see psychomotor seizure.
 research: A correlational approach. Cambridge, UK: Cambridge University Press Cambridge University Press (known colloquially as CUP) is a publisher given a Royal Charter by Henry VIII in 1534, and one of the two privileged presses (the other being Oxford University Press). .

Rutherford, A. (2001). Introducing ANOVA and ANCOVA ANCOVA Analysis of Covariance : A GLM approach. Thousand Oaks Thousand Oaks, residential city (1990 pop. 104,352), Ventura co., S Calif., in a farm area; inc. 1964. Avocados, citrus, vegetables, strawberries, and nursery products are grown. , CA: Sage.

Schmidt, J. (2002). Counseling in schools: Essential services and comprehensive programs (4th ed.). Boston: Allyn and Bacon.

Sink, C. A. (2005). Comprehensive school counseling programs and academic achievement--A rejoinder The answer made by a defendant in the second stage of Common-Law Pleading that rebuts or denies the assertions made in the plaintiff's replication.

The rejoinder allows a defendant to present a more responsive and specific statement challenging the allegations made
 to Brown and Trusty. Professional School Counseling, 9, 9-12.

Snyder, P., & Lawson, S. (1993). Evaluating results using corrected and uncorrected effect size estimates. Journal of Experimental Education, 61, 334-349.

SPSS. (2005). Data analysis with comprehensive statistics software. Retrieved December 10, 2005, from http://www.spss.com/spss/

Thompson, B. (1990). Finding a correction for the sampling error in multivariate The use of multiple variables in a forecasting model.  measures of relationship: A Monte Carlo Monte Carlo (môNtā` kärlō`), town (1982 pop. 13,150), principality of Monaco, on the Mediterranean Sea and the French Riviera.  study. Educational and Psychological Measurement, 50, 15-31.

Thompson, B. (1997). Computing computing - computer  effect sizes. College Station, TX: Texas A & M University. Retrieved on December 2, 2005, from http://www.coe.tamu.edu/~bthompson/effect.html

Thompson, B. (1998). Statistical significance and effect size reporting: Portrait of a possible future. Research in the Schools, 5, 33-38.

Thompson, B. (1999). If statistical significance tests are broken/misused, what practices should supplement or replace them? Theory & Psychology, 9, 165-181.

Thompson, B. (2000). A suggested revision to the forthcoming 5th edition of the APA publication manual Retrieved on December 2, 2005, from http://www.coe.tamu.edu/~bthompson/apaeffec.htm

Thompson, B. (2002)."Statistical," "practical," and "clinical": How many kinds of significance do counselors need to consider? Journal of Counseling & Development, 80, 64-71.

Thompson, B. (2006).The role of effect sizes in contemporary research in counseling. Counseling and Values, 50, 176-186.

Trochim, W. M. K. (2005). General linear model Retrieved December 2, 2005, from http://www.socialresearchmethods.net/kb/genlin.htm

U.S. Department of Education. (2001). No Child Left Behind Act The No Child Left Behind Act of 2001 (Public Law 107-110), commonly known as NCLB (IPA: /ˈnɪkəlbiː/), is a United States federal law that was passed in the House of Representatives on May 23, 2001  (Pub. L No. 107-110). Retrieved January 6, 2006, from http://www.ed.gov/nclb/overview/intro/index.html

Vacha-Haase, T., & Thompson, B. (2004). How to estimate and interpret various effect sizes. Journal of Counseling and Psychology, 51, 473-481.

Vasquez, L. M., Gangstead, S. K., &Henson, R. K. (2000, January). Understanding and interpreting effect size measures in general linear model analyses. Paper presented at the Annual Meeting of the Southwest Educational Research Association, Dallas, TX.

Whiston, S. C. (2003). Outcomes research on school counseling services. In B.T. Erford (Ed.), Transforming the school counseling profession (pp. 435-447). Upper Saddle River, N J: Merrill Prentice-Hall.

Wilkinson, L., & American Psychological Association Task Force on Statistical Inference. (1999). Statistical methods in psychology journals: Guidelines and explanations. American Psychologist, 54, 594-604.

Christopher A. Sink, Ph.D., NCC NCC

See National Clearing Corporation (NCC).
, LMHC LMHC Licensed Mental Health Counselor
LMHC Lockheed Martin Hanford Corporation
LMHC Lakeview Manor Healthcare Center (Tawas City, Michigan)
LMHC Low Mass High Cost
, is a professor and chair with the School of Education, Seattle Pacific University External links
  • Seattle Pacific University official web site
  • IMAGE Comes to SPU
  • KSPU College Radio
  • The Falcon Online


    
, WA. E-mail: csink@spu.edu

Heather R. Stroh, Ed.D., is a research analyst with the Washington School Many schools are named Washington School including:
  • Washington School (Appleton, Wisconsin), listed on the National Register of Historic Places
  • Washington School (Mississippi), Greenville, Mississippi
 Research Center, Lynnwood, WA.
Table 1. Effect Size Matrix

ES Index             Symbol         Formula

Standardized Difference

Biased

Cohen's d               d           [MATHEMATICAL EXPRESSION
                                    NOT REPRODUCIBLE IN ASCII]

Eta squared       [[eta].sup.2]     [t.sup.2]/[t.sup.2] +
                                    ([N.sub.1] + [N.sub.2] - 2)

Eta squared       [[eta].sup.2]     [t.sup.2]/[t.sup.2] + N - 1

Glass' delta         [DELTA]        [MATHEMATICAL EXPRESSION
                                    NOT REPRODUCIBLE IN ASCII]

Unbiased

Hedges g                g           [[bar.X].sub.A] - [[bar.X].sub.B]/
                                    [square root of ([MS.sub.w])]

Variance-Accounted-for

Biased

Pearson r               r           [square root of ([t.sup.2]/
                                    [t.sup.2] + [df.sub.within])]

R squared           [R.sup.2]       [SS.sub.regression]/
                                    [SS.sub.total]

Univariate        [[eta].sup.2]     [SS.sub.between]/
eta squared                         [SS.sub.total]

Multivariate      [[eta].sup.2]     1 - [[LAMBDA].SUP.1/s]
eta squared

Eigenvalue           [delta]        [SS.sub.between]/
                                    [SS.sub.within]

Unbiased

Adjusted R          [R.sup.2]       1 - (1 -[R.sup.2])
squared                             N - 1/N - k - 1

Partial eta      [[eta].sup.2]P     [SS.sub.between]/
squared                             [SS.sub.between] + [SS.sub.error]

Has'             [[omega].sup.2]    [SS.sub.between] - (v - 1)
[omega.sup.2]                       [MS.sub.within]/[SS.sub.total]
                                    + [MS.sub.within]

Epsilon         [[epsilon].sup.2]   [SS.sub.between] - (v - 1)
squared                             [MS.sub.within]/[SS.sub.total]

Wherry                none          1 - [(n - 1)/(n - k - 1)]
                                    (1 - [R.sup.2])

Herzberg            [R.sup.2]       1 - [(n - 1)/(n - k - 1)]
                                    [(n + k + 1)/n](1 - [R.sup.2])

Lord                  none          1 - (1 - [R.sup.2])
                                    [(n + k + 1)/(n - k -1)]

ES Index       Procedures       Interpretation (a)
               Used With

Standardized Difference

Biased

Cohen's d      Independent-     .2 [approximately equal to] small
               samples          .5 [approximately equal to] medium
               t tests (2       .8 [approximately equal to] large
               groups)

Eta squared    Independent-     .01 [approximately equal to] small
               samples t        .06 [approximately equal to] medium
               tests            .14 [approximately equal to] large

Eta squared    Paired-          .01 [approximately equal to] small
               samples (or      .06 [approximately equal to] medium
               dependent)       .14 [approximately equal to] large
               t tests (e.g.,
               pretest and
               posttest
               scores)

Glass' delta   Meta-            .2 [approximately equal to] small
               analyses         .5 [approximately equal to] medium
                                .8 [approximately equal to] large

Unbiased

Hedges g       Independent-     .2 [approximately equal to] small
               samples t        .5 [approximately equal to] medium
               tests            .8 [approximately equal to] large

Variance-Accounted-for

Biased

Pearson r      Bivariate        [absolute value of .1]
               linear and       [approximately equal to] small
               regression,      [absolute value of .3]
               partial          [approximately equal to] medium
               correlations     [absolute value of .5]
                                [approximately equal to] large

R squared      GLM:             .01 [approximately equal to] small
               Multiple         .06 [approximately equal to] medium
               regression       .14 [approximately equal to] large

Univariate     Univariate       .01 [approximately equal to] small
eta squared    GLM:             .06 [approximately equal to] medium
               ANOVA,           .14 [approximately equal to] large
               ANCOVA,
               repeated
               measures
               ANOVA

Multivariate   Multivariate     .01 [approximately equal to] small
eta squared    GLM:             .06 [approximately equal to] medium
               MANOVA,          .14 [approximately equal to] large
               MANCOVA,
               repeated
               measures
               MANOVA

Eigenvalue     GLM:
               Discriminant
               analysis

Unbiased

Adjusted R     GLM:             .01 [approximately equal to] small
squared        Multiple         .06 [approximately equal to] medium
               regression       .14 [approximately equal to] large

Partial eta    Univariate       .01 [greater than or equal to] small
squared        GLM:             .06 [greater than or equal to] medium
               ANOVA,           .14 [greater than or equal to] large
               ANCOVA,
               repeated
               measures
               ANOVA

Has'           Univariate       .01 [approximately equal to] small
[omega.sup.2]  GLM:             .06 [approximately equal to] medium
               ANOVA,           .14 [approximately equal to] large
               ANCOVA,
               repeated
               measures
               ANOVA

Epsilon        Univariate       .01 [approximately equal to] small
squared        GLM:             .06 [approximately equal to] medium
               ANOVA,           .14 [approximately equal to] large
               ANCOVA,
               repeated
               measures
               ANOVA

Wherry         Univariate       .01 [approximately equal to] small
               GLM:             .06 [approximately equal to] medium.
               ANOVA,           .14 [approximately equal to] large
               ANCOVA,
               repeated
               measures
               ANOVA

Herzberg       Univariate       .01 [approximately equal to] small
               GLM:             .06 [approximately equal to] medium
               ANOVA,           .14 [approximately equal to] large
               ANCOAA,
               repeated
               measures
               ANOVA

Lord           Univariate       .01 [approximately equal to] small
               GLM:             .06 [approximately equal to] medium
               ANOVA,           .14 [approximately equal to] large
               ANCOVA,
               repeated
               measures
               ANOVA

ES Index       Notes

Standardized Difference

Biased

Cohen's d      Ranges in value from 0 to 1. One of the most
               widely used and commonly known ESs.

Eta squared    Ranges in value from 0 to 1. 0 indicates that the
               mean of the differences in scores is equal to zero.
               1 indicates that the differences in scores in the
               sample are all the same nonzero value.

Eta squared    Ranges in value from 0 to 1. 0 indicates that the
               mean of the differences in scores is equal to zero.
               1 indicates that the differences in scores in the
               sample are all the same nonzero value.

Glass' delta   Ranges in value from 0 to 1. Difference between
               group means standardized using the SD of the
               control/comparison group.

Unbiased

Hedges g       Ranges in value from 0 to 1. Difference between
               group means standardized using the pooled
               variance estimate.

Variance-Accounted-for

Biased

Pearson r      Range from -1 to +1; + means that as X
               increases, Y increases; - means that as X
               increases, Y decreases. Values closer to +/-1
               indicate stronger linear relationships. [r.sup.2] = the
               amount of variance accounted for in the criterion
               variable by the predictor variable

R squared      Ranges from 0 to 1. 0 = no linear relationship; 1=
               perfect linear relationship. Indicates the amount
               of variance accounted for in the criterion variable
               by the predictor variable(s). Shows greater bias
               when the sample size is small and the number of
               predictors is larvae.

Univariate     Ranges from 0 to 1.0 = no association; 1 =
eta squared    perfect association. Same as [R.sup.2]. Estimates the
               degree of association for the sample.

Multivariate   Ranges from 0 to 1.0 = no association; 1 =
eta squared    perfect association. Unclear what should be
               considered as small, medium, and large effects;
               interpret similar to univariate [eta.sup.2].

Eigenvalue     Value is [greater than or equal to] 0. Eigenvalue has
               no upper limit. Difficult to interpret. See Green and
               Salkind (2004) for more information.

Unbiased

Adjusted R     Ranges from 0 to 1.0 = no linear relationship; 1=
squared        perfect linear relationship. Indicates the amount
               of variance accounted for in the criterion variable
               by the predictor variable(s). Adjusted for
               overestimation population [R.sup.2]. Use when sample
               size is small and number of
               predictors/independent variables is large.

Partial eta    Ranges from 0 to 1. Values are usually smaller
squared        than those from an [eta.sup.2]. Small, medium, and
               large cutoffs for [eta.sup.2] are probably too large,
               so interpret with caution.

Has'           Estimate of the degree of association in the
[omega.sup.2]  population. It is unclear what should be
               considered as small, medium, and large effects,
               so interpret with caution.

Epsilon        Estimate of the degree of association in the
squared        population. It is unclear what should be
               considered as small, medium, and large effects,
               so interpret with caution.

Wherry         Estimate of the degree of association in the
               population. It is unclear what should be
               considered as small, medium, and large effects,
               so interpret with caution.

Herzberg       Corrects for estimates potentially realized in
               future samples. It is unclear what should be
               considered as small, medium, and large effects,
               so interpret with caution.

Lord           Corrects for estimates potentially realized in
               future samples. It is unclear what should be
               considered as small, medium, and large effects,
               so interpret with caution.

Note. SS = sum of squares; MS = mean square; n = number of
participants in sample; v = number of levels in independent variable
(factor); [bar.X] = mean (average score); SD = standard deviation;
[SD.sub.pooled] = average within groups SD; k = number of predictor
variables; df = degrees of freedom; t = t statistic; s = number of
levels of the factor minus 1, or number of dependent variables
(whichever is smaller); GLM = general linear model; ANOVA = analysis
of variance; ANCOVA = analysis of covariance; MANOVA = multivariate
analysis of variance.

(a) These are general guidelines from Green and Salkind (2004).

Table 2. Effect Size Measures by Statistical Procedure

Statistical Procedure   Effect Size Estimate      Symbol
Used

                        Biased     Cohen's d      d

        Independent                Eta squared    [[eta].sup.2]
        samples

t test                  Unbiased   Hedges g       g

        One-sample      Biased     Glass' delta   [DELTA]

        Dependent       Biased     Eta squared    [[eta].sup.2]
        (paired)
        samples

Meta-analysis           Biased     Glass' delta   [DELTA]

Bivariate linear        Biased     Pearson r      r
regression and
partial correlation

Multiple regression     Biased     R squared      [R.sup.2]

                        Unbiased   Adjusted R     [R.sup.2]
                                   squared

ANOVA (univariate),     Biased     Univariate     [[eta].sup.2]
ANCOVA, or                         eta squared
ANOVA with repeated
measures (or within-    Unbiased   Partial eta    Partial
subjects ANOVA)                    squared        [[eta].sup.2]

                                   Hays' omega    [[omega].sup.2]

                                   Epsilon        [[epsilon].sup.2]
                                   squared

                                   Wherry         none

                                   Herzber        [R.sup.2]

                                   Lord           none

MANOVA,                 Biased     Omnibus        [[eta].sup.2]
MANCOVA, or                        multivariate
MANOVA with repeated               eta squared
measures (or within-
subjects MANOVA)

Discriminant analysis   Biased     Eigenvalue     [lambda]

Statistical Procedure   Effect Size Estimate
Used

                        Biased     Cohen's d      [[bar.X].sub.A] -
                                                  [[bar.X].sub.B]/
                                                  [S.sub.pooled]

        Independent                Eta squared    [t.sup.2]/[t.sup.2]
        samples                                   + ([N.sub.1] +
                                                  [N.sub.2] -2)

t test                  Unbiased   Hedges g       [[bar.X].sub.A] -
                                                  [[bar.X].sub.B]/
                                                  [square root of
                                                  ([MS.sub.w])]

        One-sample      Biased     Glass' delta   ([[bar.X].sub.E] -
                                                  [[bar.X].sub.C])/
                                                  [SD.sub.control]

        Dependent       Biased     Eta squared    [t.sup.2]/[t.sup.2]
        (paired)                                  + N -1
        samples

Meta-analysis           Biased     Glass' delta   ([[bar.X].sub.E] -
                                                  [[bar.X].sub.C])/
                                                  [SD.sub.control]

Bivariate linear        Biased     Pearson r      [square root of
regression and                                    ([t.sup.2]/[t.sup.2]
partial correlation                               + [df.sub.within])]

Multiple regression     Biased     R squared      [SS.sub.regression]/
                                                  [SS.sub.total]

                        Unbiased   Adjusted R     1 - (1 - [R.sup.2])
                                   squared        N - 1/N - k - 1

ANOVA (univariate),     Biased     Univariate     [SS.sub.between]/
ANCOVA, or                         eta squared    [SS.sub.total]
ANOVA with repeated
measures (or within-    Unbiased   Partial eta    [SS.sub.between]/
subjects ANOVA)                    squared        [SS.sub.between] +
                                                  [SS.sub.error]

                                   Hays' omega    [SS.sub.between] -
                                                  (v - 1)
                                                  [MS.sub.within]/
                                                  [SS.sub.total] +
                                                  [MS.sub.within]/

                                   Epsilon        [SS.sub.between] -
                                   squared        (v - 1)
                                                  [MS.sub.within]/
                                                  [SS.sub.total] +

                                   Wherry         1 - [(n -1)/
                                                  (n - k -1)] (1 -
                                                  [R.sup.2])

                                   Herzber        1 - [(n -1)/
                                                  (n - k -1)] (n + k +
                                                  1)/n (1 - [R.sup.2])

                                   Lord           1 - (1 - [R.sup.2])
                                                  [(n + k + 1)/
                                                  (n - k -1)]

MANOVA,                 Biased     Omnibus        1 -
MANCOVA, or                        multivariate   [[LAMBDA].sup.1/2]
MANOVA with repeated               eta squared
measures (or within-
subjects MANOVA)

Discriminant analysis   Biased     Eigenvalue     [SS.sub.between]/
                                                  [SS.sub.within]/

Table 3. Sample 1 Showing Results of GLM ANOVA (Test of
Between-Subjects Effect), with Group as Independent Variable
and Self-Concept Scale Score as Dependent Variable, and ES
Computations (N = 24)

Source                Sum of Squares   df   Mean Square     F

Corrected model (b)
                          27.75        2       13.88        2.14
Intercept
                          759.38       1      759.38      117.36
Group
                          27.75        2       13.88        2.14
Error
                          135.88       21      6.47
Total
                          923.00       24
Corrected total
                          163.63       23

Source                p (a)    Partial [Eta.sup.2]

Corrected model (b)
                       .14           .17
Intercept
                       .00           .85
Group
                       .14           .17
Error

Total

Corrected total

                        ES Computations

[R.sup.2]         [R.sup.2] = [SS.sub.regression]/[SS.sub.total]

                  = 27.75/163.625

                  = .17

[[Eta].sup.2]     [[eta].sup.2] = [SS.sub.between]/[Ss.sub.total]

                  = 27.75/163.625

                  = .17

Hays'             [[omega].sup.2] = [SS.sub.between] - (v -1)
[omega.sup.2]     [MS.sub.within]/[SS.sub.total] + [MS.sub.within]

                  = 27.75 - (3 - 1) 6.47/163.625 + 6.47

                  = 27.75 - 12.94/170.095

[Epsilon.sup.2]   [[epsilon].sup.2] = [SS.sub.between] - (v - 1)
                  [MS.sub.within]/[SS.sub.total]

                  = 27.75 - (3 - 1) 6.47/163.625

                  = 27.75 - 12.94/163.625

                  = .09

Wherry            1 - [(n - 1)/(n - k 1)] (1 - [R.sup.2])

                  = 1 - [(24 -1)/(24 - 1 - 1)] (1 - .17)

                  = 1 - (1.045)(.83)

                  = .13

Herzberg          [R.sup.2] = 1 - [(n - 1)/(n - k - 1)][n + k + 1)/n]
                  (1 - [R.sup.2])

                  = 1 - [(24 - 1)/(24 - 1 - 1)][24 + 1 + 1)/24]
                  (1 - .17)

                  = 1 - (1.045)(1.083)(.83)

                  = .06

Lord              1 - (1 - [R.sup.2]) [n + k + 1)/(n - k - 1)]

                  = 1 - (1 - .17) [(24 + 1 + 1)/(24 - 1 - 1)]

                  = 1 - (.83)(1.18)

                  = .02

(a) Computed using alpha = .05. (b) [R.sup.2] = .21 (adjusted
[R.sup.2] = .17).

Table 4. Sample 2 Showing Results of GLM ANOVA (Test of
Between-Subjects Effect), with Group as Independent Variable
and Self-Concept Scale Score as Dependent Variable, and ES
Computations (N = 150)

Source                Sum of Squares   df    Mean Square      F

Corrected model (b)
                          449.92        2      224.96        54.27
Intercept
                         4118.64        1      4118.64     1048.49
Group
                          449.92        2      224.96        57.27
Error
                          577.44       147      3.93
Total
                         5146.00       150
Corrected total
                         1027.36       149

Source                p (a)   Partial [Eta.sup.2]

Corrected model (b)
                       .00            .44
Intercept
                       .00            .88
Group
                       .00            .44
Error

Total

Corrected total

                           ES Computations

[R.sup.2]         [R.sup.2] = [SS.sub.regression]/[SS.sub.total]

                  = 449.92/1027.36

                  = .44

[[Eta].sup.2]     [[eta].sup.2] = [SS.sub.between]/[Ss.sub.total]

                  = 449.92/1027.36

                  = .44

Hays'             [[omega].sup.2] = [SS.sub.between] - (v -1)
[omega.sup.2]     [MS.sub.within]/[SS.sub.total] + [MS.sub.within]

                  = 449.92 -(3 - 1)3.928/1027.36 + 3.928

                  = 449.92 - 7.856/1031.28

                  = .43

[Epsilon.sup.2]   [[epsilon].sup.2] = [SS.sub.between] - (v - 1)
                  [MS.sub.within]/[SS.sub.total]

                  = 449.92 - (3 - 1)3.928/1027.36

                  = 449.92 - 7.856/1027.36

                  = .43

Wherry            1 - [(n - 1)/(n - k 1)] (1 - [R.sup.2])

                  = 1 - [(150 - 1)/(150 - 1 - 1)] (1 - .44)

                  = 1 - (1.01)(.56)

                  = .43

Herzberg          [R.sup.2] = 1 - [(n - 1)/(n - k - 1)][n + k + 1)/n]
                  (1 - [R.sup.2])

                  = 1 - [(150 - 1)/(150 - 1 - 1)][150 + 1 + 1)/150]
                  (1 - .44)

                  = 1 - (1.01)(1.01)(.56)

                  = .43

Lord              1 - (1 - [R.sup.2]) [n + k + 1)/(n - k - 1)]

                  = 1 - (1 - .44) [(150 + 1 + 1)/(150 - 1 - 1)]

                  = 1 - (.56)(1.03)

                  = .42

(a) Computed using alpha = .05. (b) [R.sup.2] = .44 (adjusted
[R.sup.2] = .43).
COPYRIGHT 2006 American School Counselor Association
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2006, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Stroh, Heather R.
Publication:Professional School Counseling
Geographic Code:1USA
Date:Jun 1, 2006
Words:7340
Previous Article:Recent innovations in small-N designs for research and practice in professional school counseling.
Next Article:The ASCA National School Counseling Research Center: a brief history and agenda.(American School Counselor Association)
Topics:



Related Articles
Raising achievement test scores of early elementary school students through comprehensive school counseling programs.
Helping students improve academic achievement and school success behavior.
Play therapy practices among elementary school counselors.
Linking school counselors and student success: a replication of the student success skills approach targeting the academic and social competence of...
Collaborative action research and school counselors.
School counselors as resource brokers: the case for including teacher efficacy in data-driven programs.
How collaboration and research can affect school counseling practices: the Minnesota story.
Professional school counselors' approaches to technology.(PERSPECTIVES FROM THE FIELD)
Using comparison groups in school counseling research: a primer.
Research methods in school counseling: a summary for the practitioner.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles