Printer Friendly
The Free Library
14,763,846 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Debunking the myth of the "highly significant" result: effect sizes in gifted education research.


The use of statistical effect sizes in educational research is frequently recommended, especially with the recent emphasis on meta-analysis of research results. The purpose of this article is to describe the utility of effect size reporting and interpretation and to investigate the use of statistical effect sizes in research on giftedness. Research articles published in a five-year span in Journal for the Education of the Gifted, Roeper Review, Gifted Child gifted child

Child naturally endowed with a high degree of general mental ability or extraordinary ability in a specific domain. Although the designation of giftedness is largely a matter of administrative convenience, the best indications of giftedness are often those
 Quarterly, and a collection of journals not directly associated with gifted education Gifted education is a broad term for special practices, procedures and theories used in the education of children who have been identified as gifted or talented. Programs providing such education are sometimes called Gifted and Talented Education (GATE) or  were content analyzed an·a·lyze  
tr.v. an·a·lyzed, an·a·lyz·ing, an·a·lyz·es
1. To examine methodically by separating into parts and studying their interrelations.

2. Chemistry To make a chemical analysis of.

3.
 with respect to the reporting of effect sizes. Results suggest that effect sizes generally are not included in research articles, with results consistent across journals.

Educators frequently hear or read that the results of a particular research project were "highly significant" or "approached significance" (Thompson, 1988). In fact, the significance level of a statistical test (the p value) is routinely interpreted as the only important statistic statistic,
n a value or number that describes a series of quantitative observations or measures; a value calculated from a sample.


statistic

a numerical value calculated from a number of observations in order to summarize them.
 in educational research. Since the p level is directly influenced by the number of subjects in the sample (Snyder & Lawson, 1993; Thompson, 1988, 1989b), a study with many subjects may produce statistically significant results without the existence of a meaningful relationship between independent and dependent variables. Conversely con·verse 1  
intr.v. con·versed, con·vers·ing, con·vers·es
1. To engage in a spoken exchange of thoughts, ideas, or feelings; talk. See Synonyms at speak.

2.
, a study with a small sample may produce a relatively high p value in the presence of a large, meaningful relationship among variables. For example, remarking that a test result is "highly significant" implies that the difference between groups is large, when that may not be the case. Overreliance and misinterpretation of the complexities of statistical significance (Cohen cohen
 or kohen

(Hebrew: “priest”) Jewish priest descended from Zadok (a descendant of Aaron), priest at the First Temple of Jerusalem. The biblical priesthood was hereditary and male.
, 1994; Huberty, 1987, 1993; Thompson, 1989a, 1989b) lead researchers into misinterpretation or under-interpretation of their results.

This situation can be conceptualized in non-research terms: Imagine that you are standing at the brink of a deep chasm. Since you need to cross the chasm in order to proceed on your journey, you would like to determine if the chasm is too wide for you to jump across. The fact that the chasm exists (i.e., the p level) is helpful, for without it you might not notice the hole in the ground. However, your primary concern is the size of the chasm (i.e., the effect size). Is it 10 inches wide or 10 feet wide? Either width is significant enough to notice, but attempting to cross one will probably be met with more success than the other. Determining the chasm's existence is important, but doing so does not provide any information about the size of the chasm.

In statistical terms, the calculation and reporting of effect sizes allow both researchers and research consumers to determine the practical significance of statistical tests (Harris, 1991). Effect sizes represent the size of group differences and/or relationships as opposed to the existence of such differences and relationships (i.e., the p level).

The effect size tells us something

very different from the p level. A

result that is statistically significant

is not necessarily practically

significant as judged by the magnitude

of the effect. Consequently,

highly significant p values should

not be interpreted as automatically

reflecting large effects. (Resnow

& Rosenthal, 1989, p. 1276)

The importance of effect size estimation and interpretation is enhanced by the current emphasis upon meta-analysis of educational research (e.g., Bus, Van ljzendoorn, & Pellegrini, 1995; Cameron & Pierce, 1994). Meta-analysis, which involves the combination of results across studies (e.g., Kulik & Kulik, 1982, 1992; Rogers, 1993), is often preferable to the traditional review of literature, which stresses the combination of conclusions across studies. For example, Winne (1979) -- via the traditional review of literature -- concluded that teachers' use of higher order questioning had little effect upon student achievement. In stark contrast, Redfield and Rousseau (1981) used meta-analysis to analyze effect sizes from various studies to investigate the same topic and found that teacher use of higher order questioning had a substantial impact upon student achievement. On average, students whose teachers asked higher order questions could be expected to score at the 77th percentile percentile,
n the number in a frequency distribution below which a certain percentage of fees will fall. E.g., the ninetieth percentile is the number that divides the distribution of fees into the lower 90% and the upper 10%, or that fee level
 on an achievement test versus the 50th percentile if not subjected to higher order questions. By using effect sizes to analyze several studies simultaneously, Redfield and Rousseau (1981) convincingly refuted a review of literature with conclusions that could not have otherwise been contradicted.

To facilitate both recta-analysis and the interpretation of results, researchers in the social sciences are encouraged to provide either effect size information or test statistics that allow the calculation of effect sizes (APA (All Points Addressable) Refers to an array (bitmapped screen, matrix, etc.) in which all bits or cells can be individually manipulated.

APA - Application Portability Architecture
, 1994; Asher, 1986; Cohen, 1994; Huberty, 1993; Rosnow & Rosenthal, 1989). The recommendation is also pertinent to research within the field of gifted education. Researchers and consumers of research need to be familiar with the situations in which specific estimates are appropriate, how the estimates can be calculated for common statistical tests, and how effect size estimates can be used during interpretation of statistical analyses.

However, considering that a comprehensive study of effect size usage in gifted education research has yet to be conducted, individuals planning to provide researchers with information about effect size estimates do not have a solid research base with which to focus their efforts. Because books can be (and have been) written on the calculation of effect size estimates (Cohen, 1988; Glass, McGaw, & Smith, 1981; Hedges & Olkin, 1985; Hedges, Shymansky, & Woodworth, 1989; Hunter & Schmidt, 1990; Rosenthal, 1984), detailed instructions for doing so are not included in this article. Rather, the primary purpose of this article is to investigate effect size usage in gifted education research in order to provide a focus for education efforts within gifted education research. In the remainder of this introduction, an attempt is made to familiarize researchers and research consumers with the basic types of effect size estimates and provide several examples of effect size estimates' usefulness when applied to research on giftedness.

Types of Effect Sizes

Researchers use several different terms when they refer to effect size estimates: effect size, percent of variance accounted for, strength of association, measure of association, and magnitude of effect. Some commonly used effect sizes are summarized in Table 1. Several estimates are statistically identical but conceptually are often taught as separate entities. For example, many researchers are quite familiar with correlation coefficients Correlation Coefficient

A measure that determines the degree to which two variable's movements are associated.

The correlation coefficient is calculated as:
 ([r.sup.2]), the variance accounted for in a regression analysis In statistics, a mathematical method of modeling the relationships among three or more variables. It is used to predict the value of one variable given the values of the others. For example, a model might estimate sales based on age and gender.  ([R.sup.2]), and the variance accounted for in an analysis of variance ([eta.sup.2]), yet many people do not realize that these estimates provide the exact same information in most cases. Effect size estimates can generally be classified as either measures of association or measures of mean difference(2).
Selected Effect Sizes and Uses

                       Type of
Estimate               Statistic

r                      parametric
[eta.sup.2]            parametric
[R.sup.2]              parametric
[R.sup.2] adjusted     parametric
[omega.sup.2]          parametric
[epsilon.sup.2]        parametric
phi                    nonparametric
contingency            nonparametric
coefficient
Cramer's V             nonparametric
BESD                   parametric
g                      parametric
d                      parametric
canonical              parametric
correlation

                       Nature of
Estimate               Statistic(a)   Bias(b)

r                      MA             B
[eta.sup.2]            MA             B
[R.sup.2]              MA             B
[R.sup.2] adjusted     MA             U
[omega.sup.2]          MA             U
[epsilon..sup.2]       MA             U
phi                    MA             U(c)
contingency            MA             B
coefficient
Cramer's V             MA             U
BESD                   MA             U
g                      MD             B
d                      MD             U
canonical              MA             B
correlation

Estimate               Common Use

r                      correlational studies
[eta.sup.2]            ANOVA, regression, MANOVA
[R.sup.2]              regression
[R.sup.2] adjusted     regression
[omega.sup.2]          ANOVA, regression
[epsilon..sup.2]       ANOVA, regression
phi                    chi square
contingency            chi square
coefficient
Cramer's V             chi square
BESD                   meta-analysis, experimental design
g                      meta-analysis
d                      meta-analysis
canonical              multivariate analyses
correlation


(a) measure of association (MA) or mean difference (MD)

(b) biased (B) or unbiased (U)

(c) unbiased only for 2 X 2 contingency tables contingency table
n.
A statistical table that shows the observed frequencies of data elements classified according to two variables, with the rows indicating one variable and the columns indicating the other variable.
 

Table 1

Measures of association involve the relationship between two variables and are usually expressed in terms of correlation or shared variance. Common measures include eta squared ([[Eta].sup.2]), R-squared ([R.sup.2]), adjusted R-squared ([R.sup.2]adj), and omega squared (Snyder & Lawson, 1993). Comparable estimates for use with chi square chi square (kī),
n a nonparametric statistic used with discrete data in the form of frequency count (nominal data) or percentages or proportions that can be reduced to frequencies.
 and other nonparametric tests include the phi coefficient Noun 1. phi coefficient - an index of the relation between any two sets of scores that can both be represented on ordered binary dimensions (e.g., male-female)
fourfold point correlation, phi correlation
, contingency coefficient coefficient /co·ef·fi·cient/ (ko?ah-fish´int)
1. an expression of the change or effect produced by variation in certain factors, or of the ratio between two different quantities.

2.
, and Cramer's V (Gibbons Famous people named Gibbons include:
  • Beth Gibbons (born 1965), British singer
  • Billy Gibbons, guitarist for ZZ Top
  • Cedric Gibbons (1893–1960), American art director
  • Christopher Gibbons (1615 - 1676), English composer, son of Orlando
, 1976, 1993; Gibbons & Chakraborti, 1992; Norusis, 1990; Siegel, 1956). In addition, various correlation coefficients (e.g., the Pearson correlation coefficient, Spearman spear·man  
n.
A man, especially a soldier, armed with a spear.
 rho) are occasionally interpreted as effect sizes. These estimates generally range between 0 and 1.

Estimates of mean differences provide an estimate of the difference between two groups on a particular test or measure and are commonly used in meta-analysis (Hedges & Olkin, 1985; Hedges, Shymansky, & Woodworth, 1989; Seifert, 1991) and power analysis (Cohen, 1988). The most well-known estimates of mean difference, which measure group differences in terms of standard deviation In statistics, the average amount a number varies from the average number in a series of numbers.

(statistics) standard deviation - (SD) A measure of the range of values in a set of numbers.
 units, are g (Glass, McGaw, & Smith, 1981), the standardized standardized

pertaining to data that have been submitted to standardization procedures.


standardized morbidity rate
see morbidity rate.

standardized mortality rate
see mortality rate.
 mean difference, and d (see Cohen, 1988), the unbiased standardized mean difference. Unlike many other effect size estimates, if the difference between group means is large relative to their standard deviations, the effect size can exceed 1 or even 2. However, general guidelines guidelines,
n.pl a set of standards, criteria, or specifications to be used or followed in the performance of certain tasks.
 for interpretation of effect sizes are still applicable.

Characteristics of Effect Sizes

Quantitative researchers recognize eta squared and R2 as representing the "percentage of variance accounted for" by a particular effect. However, these statistics capitalize on Cap´i`tal`ize on`   

v. t. 1. To turn (an opportunity) to one's advantage; to take advantage of (a situation); to profit from; as, to capitalize on an opponent's mistakes s>.
 sampling error and as a result rarely equal zero in the absence of a relationship between the variables of interest (Huberty, 1994; Pedhazur, 1982; Snyder & Lawson, 1993). Other statistics correct for this bias and, therefore, are unbiased estimators of effect size (see Table 1). The unbiased estimators range from 0 (signifying Signifyin' (slang) is an African-American rhetorical device featuring indirect communication or persuasion and the creating of new meanings for old words and signs. Signifying, in this sense, includes repetition and difference, implication and association, combining words and  a complete lack of relationship) to 1 (direct relationship).

Interpretation of effect size estimates is rather arbitrary, although the following guidelines are generally suggested: 0 to .2 representing a small effect, .3 to .4 representing a moderate effect, and .4 or larger signifying the presence of a large effect. Interpretation guidelines vary by discipline. For example, an effect size of .001 in a large medical study may represent the saving of one person's life, even though the effect is very small. However, conceiving Conceiving may refer to:
  • Conceiving a child
  • Conceiving an idea
See also
  • Conception (disambiguation)
 of such an ethically and morally serious scenario in the social sciences is difficult. The best guide for interpretation of effect sizes is a review of similar studies to determine the range of effect sizes usually found.

Examples of Effect Size Usage

The following three examples illustrate the utility of effect size use and interpretation. In the first, an example of the recommended use and interpretation of nonparametric effect sizes is provided by Westberg, Archambault, Dobyns, and Salvin (1993), who reported

The obtained chi square statistic

indicates a significant difference;

namely, target gifted and talented

students were provided with less

wait time than target average students.

However, the phi coefficient

indicates that the strength of association

between the wait time and

target student variables is low.

(p. 135)

Without the use of an effect size estimate (i.e., phi), this particular result would have been reported much differently: Observed gifted students are given less wait time when questions are asked of them compared to their nongifted peers. The use of an effect size, however, provides evidence that this difference is predominantly statistical and not practically important.

The second example involves the calculation of effect sizes without proper interpretation. The researchers investigated differences among gifted and non-gifted students' social, emotional, and behavioral adjustment using a longitudinal lon·gi·tu·di·nal
adj.
Running in the direction of the long axis of the body or any of its parts.
 data set. Effect sizes are reported for all chi square tests and ANOVAS, although the types of effect sizes are not mentioned. The authors report effect sizes ranging from .09 to .27 (with most below .20), indicating that most differences among groups lack practical significance. However, the authors still discuss implications for the reported group differences as though the effect sizes were quite large. The lack of practical importance is not revisited after the effect sizes are presented.

In the third example, researchers found an unexpected result and could have benefited from inclusion of effect size estimates. The researchers report 20 chi square tests of independence and 14 t-tests without effect sizes. Using the reported means and standard deviations, I calculated d as an effect size for the t-tests. The average d was .327 (ranging from .005 to .872), and the average d for the results reported as statistically significant at p [is less than or equal to] .01 was .437 (ranging from .166 to .872). These values could have aided the researchers in their interpretation of results. For example, at one point the authors remark that "surprisingly, [gifted] underachievers earned a higher score on [the] Natural Sciences" subtest of the ACT than gifted high achievers, reporting t(257, 30347) = 2.68, p [is less than or equal to] .01. The effect size for this test is d=.17, indicating that the observed statistical difference has little practical meaning. The "surprising" result actually lacks practical importance and is conceptually similar to the other results reported in the study.

Method

A content analysis of major journals in gifted education was conducted to identify the extent of effect size estimate usage. Analysis of the quarterly journals issues from the summer of 1992 to the most recent issue at the time of analysis: volume 36(3) to 39(1) of Gifted Child Quarterly; 15(4) to 18(2) of Journal for the Education of the Geed; and 14(4) to 17(3) of Roeper Review. Since gifted education research published outside of these major journals may be held to a different standard of quality, a comparison group was created. The 40 most recent research articles listed in the ERIC CD-ROM CD-ROM: see compact disc.
CD-ROM
 in full compact disc read-only memory

Type of computer storage medium that is read optically (e.g., by a laser).
 database with "gifted" in the title and not published in the three gifted education journals listed above were analyzed. In the process of analyzing the non-gifted education journals, a special issue of the Journal of Educational Psychology on the topic of giftedness was located. Given the reputation of this journal as a publisher of high quality educational research, all articles in this issue were also analyzed and added to the non-gifted education journal subsample sub·sam·ple  
n.
A sample drawn from a larger sample.

tr.v. sub·sam·pled, sub·sam·pling, sub·sam·ples
To take a subsample from (a larger sample).
 (i.e., in order to create a consistently high standard to which gifted education journals could be compared). All studies were classified via the dichotomous di·chot·o·mous  
adj.
1. Divided or dividing into two parts or classifications.

2. Characterized by dichotomy.



di·chot
 key included in Appendix A. A list of the articles analyzed for this study is available from the author.

Studies increasingly contain a mix of qualitative and quantitative research Quantitative research

Use of advanced econometric and mathematical valuation models to identify the firms with the best possible prospectives. Antithesis of qualitative research.
, and many quantitative studies include reports of univariate, bivariate bi·var·i·ate  
adj.
Mathematics Having two variables: bivariate binomial distribution.

Adj. 1.
, and multivariate statistics Multivariate statistics or multivariate statistical analysis in statistics describes a collection of procedures which involve observation and analysis of more than one statistical variable at a time. Sometimes a distinction is made between univariate (e.g. . To simplify the content analysis, each research technique was classified as a block. Studies that included only descriptive statistics descriptive statistics

see statistics.
 were coded as descriptive blocks, although reports of research containing both descriptive and inferential statistics inferential statistics

see inferential statistics.
 were coded according to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 the level of statistics employed (i.e., univariate or multivariate The use of multiple variables in a forecasting model. ). For example, if a study contained case studies, ANOVAS, and a discriminant dis·crim·i·nant  
n.
An expression used to distinguish or separate other expressions in a quantity or equation.
 analysis, the study was coded as having one qualitative research Qualitative research

Traditional analysis of firm-specific prospects for future earnings. It may be based on data collected by the analysts, there is no formal quantitative framework used to generate projections.
 block, one univariate block, and one multivariate block. The author used the chi square test to analyze the data with Cramer's V reported as an effect size. Cramer's V was chosen because it is an unbiased estimator with values that range from 0 (no effect) to 1 (maximum possible effect). Values below .2 were interpreted as small, and values in excess of .2 as moderate.

The author analyzed differences in effect size reporting among journals to determine whether trends in effect size usage generalized gen·er·al·ized
adj.
1. Involving an entire organ, as when an epileptic seizure involves all parts of the brain.

2. Not specifically adapted to a particular environment or function; not specialized.

3.
 across research on gifted education, while similar analyses by year and by type of statistical analysis (i.e., univariate vs. multivariate) allowed determination of trends in reporting over time and of use based on complexity of statistical analysis, respectively.

Results

The total number of quantitative research blocks and blocks containing effect size information is reported in Figure 1 for each journal. The lack of effect size information is consistent across journals, and a chi square test of independence produced no evidence of a significant difference among journals ([chi square] [3]=5.01, p=. 17, V=. 18). The percentage of effect size blocks in the gifted journals was similar to that in the nongifted journals ([chi square][1]=.823, p=.367, V=.07).

[Figure 1 ILLUSTRATION OMITTED]

Figure 2 contains a breakdown of Effect size reporting by years. For simplification purposes, the two time periods are 1992-1993 and 1994-1995. Due to the relative infrequency of articles dealing with gifted education in the non-gifted journals, a breakdown by year was not possible. Chi square tests for the entire sample ([chi square] [1]=1.63, p=.20, V=.12) and for Journal for the Education of the Gifted ([chi square] [1]= 1.29, p=.26, V=. 18) and Roeper Review ([chi square] [1]=.004, p=.95, V=.01) provide evidence that there is neither statistically nor practically significant differences between the time periods with respect to effect size reporting. However, chi square test results for Gifted Child Quarterly ([chi square] [1]=2.07, p=. 15, V=.25), with a moderate p level and moderate effect size, suggest that this particular test lacked statistical power. With a larger sample size, a small increase in the percentage of articles published with effect sizes in GCQ GCQ Gauss-Chebyshev Quadrature (numerical method)
GCQ Generic Quality of Life
GCQ Generalized Cascaded Quadruplet
 may be apparent over time.

[Figure 2 ILLUSTRATION OMITTED]

A third analysis was conducted to determine whether multivariate blocks contained effect size information in roughly the same percentage as univariate blocks (Figure 3). Results indicate that multivariate blocks included effect sizes information more frequently than univariate blocks ([chi square] [1]=20.92, p [is less than] .001, V=.365).

[Figure 3 ILLUSTRATION OMITTED]

This phenomenon could be due to a number of factors. With respect to the classification of research blocks, MANOVAS with multiple ANOVAS as a post hoc post hoc  
adv. & adj.
In or of the form of an argument in which one event is asserted to be the cause of a later event simply by virtue of having happened earlier:
 test were classified as having effect size information if effect sizes were reported for the ANOVAS. This classification increased the percentage of multivariate effect size blocks, in spite of the fact that the use of ANOVAS to analyze MANOVA MANOVA Multivariate Analysis of the Variance  results is strongly discouraged (Huberty & Morris, 1989) and that more appropriate multivariate effect sizes are available for MANOVA (e.g., multivariate etasquared, canonical correlation In statistics, canonical correlation analysis, introduced by Harold Hotelling, is a way of making sense of cross-covariance matrices. Definition
Given two column vectors and
).

Another explanation is that the researchers most likely to use multivariate techniques are also most likely to know how to use, interpret, and report them as recommended in the literature. As a result, multivariate blocks include more statistical sophistication so·phis·ti·cate  
v. so·phis·ti·cat·ed, so·phis·ti·cat·ing, so·phis·ti·cates

v.tr.
1. To cause to become less natural, especially to make less naive and more worldly.

2.
 and more effect size information. However, in light of other investigations of the use of multivariate statistics in gifted education research (Plucker pluck  
v. plucked, pluck·ing, plucks

v.tr.
1. To remove or detach by grasping and pulling abruptly with the fingers; pick: pluck a flower; pluck feathers from a chicken.
, 1995; Pyryt, 1995) and the possible influence of the present study's design, this hypothesis would not appear to be valid.

Discussion

Inclusion of effect size estimates in research articles is generally infrequent in·fre·quent  
adj.
1. Not occurring regularly; occasional or rare: an infrequent guest.

2.
, with multivariate effect sizes reported more frequently than for univariate tests. The content analysis yields other findings pertaining per·tain  
intr.v. per·tained, per·tain·ing, per·tains
1. To have reference; relate: evidence that pertains to the accident.

2.
 to the use of effect sizes. First, most reports of effect sizes do not contain specifications of the type of estimate used. Considering the variety of available estimates and their strengths and weaknesses, this lack of information makes interpretation of results difficult. For example, one study stated that effect sizes ranged from approximately .30 to .60. Without knowing which type of effect size estimate was used (e.g., d, eta-squared, phi), interpreting the effect sizes is impossible.

Second, when researchers report a specific effect size, they usually employ a biased estimator (i.e., one which does not equal zero in the absence of an effect). In fact, not one univariate block in this study included an unbiased estimator, and only one multiple regression Multiple regression

The estimated relationship between a dependent variable and more than one explanatory variable.
 included a value for R-squared adjusted. The use of biased estimators was consistent across both gifted and non-gifted journals, as were all other trends observed in this study. These results are similar to those recently reported for research in the psychological sciences (e.g., Kirk, 1996).

The findings of this study hold implications primarily for reviewers of gifted education journals, instructors who are responsible for training future researchers in gifted education, and anyone who reads research about giftedness and gifted education. Instructors should include coverage of effect sizes in their students' quantitative coursework coursework
Noun

work done by a student and assessed as part of an educational course

Noun 1. coursework - work assigned to and done by a student during a course of study; usually it is evaluated as part of the student's
, emphasizing that a wide variety of effect size estimates is available, each estimate with its own strengths and weaknesses. Future researchers should be aware of the range of unbiased estimators that are applicable to the statistical situation in which they are involved and should interpret results accordingly.

Reviewers should make certain that when research results are reported by authors, the type of effect size is reported, the possible range of values for that effect size are described, and effect sizes are used in the interpretation of the statistical results. Doing so will increase both the accuracy, interpretation, and, therefore, the usability of research on giftedness and gifted education. In a related vein, people who read research studies will find the information to be much more valuable if they know how to interpret effect size estimates.

Analysis of effect sizes is often seen as a way to determine if a given piece of research is "bad." This misinterpretation is unfortunate. Rather than be viewed in such a negative light, effect size estimates are useful tools that aid both researchers and consumers of research in the interpretation of results produced by statistical analyses. Rogers (1989) recently noted that

researchers [concerned with gifted

education], except for a select

few .... have not kept current in

their selections of designs to

answer questions or test hypotheses

dealing with 1980's issues in

gifted education (p. 86)

In combination with the research of Rogers (1989), the results of this study suggest that gifted education researchers are moving in the right direction with respect to the sophistication of statistical analyses employed (e.g., more frequent use of multivariate designs), but that considerable effort is still necessary to expose researchers to the advantages of calculating and reporting effect sizes in their work.

Author Notes

This is one possible organization of effect size estimates. For example, Kirk (1996) adds a third category ("other") which includes odds ratios, success rates, and relative risk.

REFERENCES

American Psychological Association The American Psychological Association (APA) is a professional organization representing psychology in the US. Description and history
The association has around 150,000 members and an annual budget of around $70m.
. (1994). Publication manual of the American Psychological Association (4th ed.). Washington, DC: Author.

Asher, W. (1986). Conducting research with meta-analysis: A new direction for gifted education. Gifted Child Quarterly, 30, 7-10.

Bus, A. G., Van Ijzendoom, M. H., & Pellegrini, A. D. (1995). Joint book reading makes for success in learning to read: A meta-analysis on intergenerational in·ter·gen·er·a·tion·al  
adj.
Being or occurring between generations: "These social-insurance programs are intergenerational and all
 transmission of literacy. Review of Educational Research, 65, 1-21.

Cameron, J., & Pierce, W. D. (1994). Reinforcement reinforcement /re·in·force·ment/ (-in-fors´ment) in behavioral science, the presentation of a stimulus following a response that increases the frequency of subsequent responses, whether positive to desirable events, or , reward, and intrinsic motivation: A meta-analysis. Review of Educational Research, 64, 363-423.

Cohen, J. (1988). Statistical power analysis for the behavioral sciences behavioral sciences,
n.pl those sciences devoted to the study of human and animal behavior.
 (2nd ed.). Hillsdale, NJ: Lawrence Eftbaum.

Cohen, J. (1994). The earth is round (p [is less than] .05). American Psychologist The American Psychologist is the official journal of the American Psychological Association. It contains archival documents and articles covering current issues in psychology, the science and practice of psychology, and psychology's contribution to public policy. , 49, 997-1003.

Gibbons, J. D. (1976). Nonparametric methods for quantitative analysis Quantitative Analysis

A security analysis that uses financial information derived from company annual reports and income statements to evaluate an investment decision.

Notes:
. New York New York, state, United States
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of
: Holt holt  
n. Archaic
A wood or grove; a copse.



[Middle English, from Old English.]

holt
Noun

the lair of an otter [from
, Rinehart, & Winston.

Gibbons, J. D. (1993). Nonparametric measures of association. Newbury Park, CA: Sage.

Gibbons, J. D., & Chakraborti, S. (1992). Nonparametric statistical inference Inferential statistics or statistical induction comprises the use of statistics to make inferences concerning some unknown aspect of a population. It is distinguished from descriptive statistics.  (3rd. ed Rd.
abbr.
road
.). New York: Marcel Dekker Marcel Dekker is a well-known encyclopedia publishing company with editorial boards found in New York, New York. They are part of the Taylor and Francis publishing group.

Initially a textbook publisher, they went to encyclopedia publishing in the late 1990's.
.

Glass, G. V., McGaw, B., & Smith, M. L. (1981). Metaanalysis in social research. Beverly Hills Beverly Hills, city (1990 pop. 31,971), Los Angeles co., S Calif., completely surrounded by the city of Los Angeles; inc. 1914. The largely residential city is home to many motion-picture and television personalities. , CA: Sage.

Harris, M. J. (1991). Significance tests are not enough: The role of effect-size estimation in theory corroboration. Theory and Psychology, 1, 375-382.

Hedges, L. V., & Olkin, I. (1985). Statistical methods for meta-analysis. New York: Academic Press.

Hedges, L. V., Shymansky, J. A., & Woodworth, G. (1989). Modern methods of meta-analysis. Washington, DC: National Science Teachers Association.

Huberty, C. J. (1987). On statistical testing. Educational Researcher, 16(8), 4-9.

Huberty, C. J. (1993). Historical origins of statistical testing practices: The treatment of Fisher versus Neyman-Pearson views in textbooks. Journal of Experimental Education, 61, 317-333.

Huberty, C. J. (1994). A note on interpreting an R2 value. Journal of Educational and Behavioral Statistics, 19, 351-356.

Huberty, C. J., & Morris, J. D. (1989). Multivariate analysis multivariate analysis,
n a statistical approach used to evaluate multiple variables.

multivariate analysis,
n a set of techniques used when variation in several variables has to be studied simultaneously.
 versus multiple univariate analyses. Psychological Bulletin, 105, 302-308.

Hunter, J, E., & Schmidt, F. L. (1990). Methods of metaanalysis. Newbury Park, CA: Sage.

Kirk, R. E. (1996). Practical significance: A concept whose time has come. Educational and Psychological Measurement, 56, 746-759.

Kulik, C.-L. C., & Kulik, J. A. (1982). Effects of ability grouping ability grouping
n.
1. The practice of placing students with others with comparable skills or needs, as in classes or in groups within a class.

2. See tracking.
 on secondary school students: A rectaanalysis of evaluation findings. American Educational Research Journal, 19, 415-428.

Kulik, J. A., & Kulik, C.-L. C. (1992). Meta-analytic findings on grouping programs. Gifted Child Quarterly, 36, 73-77.

Norusis, M. J. (1990). SPSS/PC+ 4.0 base manual. Chicago: SPSS A statistical package from SPSS, Inc., Chicago (www.spss.com) that runs on PCs, most mainframes and minis and is used extensively in marketing research. It provides over 50 statistical processes, including regression analysis, correlation and analysis of variance.  Inc.

Pedhazur, E. J. (1982). Multiple regression in behavioral research (2nd ed.). New York: Holt, Rinehart and Winston.

Plucker, J. A. (1995, April). Application of discriminant analysis in research with gifted students. Paper presented at the Annual Meeting of the American Educational Research Association The American Educational Research Association, or AERA, was founded in 1916 as a professional organization representing educational researchers in the United States and around the world. , San Francisco San Francisco (săn frănsĭs`kō), city (1990 pop. 723,959), coextensive with San Francisco co., W Calif., on the tip of a peninsula between the Pacific Ocean and San Francisco Bay, which are connected by the strait known as the Golden , CA.

Pyryt, M. (1995, April). MANOVA in research with gifted students. Paper presented at the Annual Meeting of the American Educational Research Association, San Francisco, CA.

Redfield, D. L., & Rousseau, E. W. (1981). A recta-analysis of experimental research on teacher questioning behavior. Review of Educational Research, 51, 237-245.

Rogers, K. B. (1989). A content analysis of the literature on giftedness. Journal for the Education of the Gifted, 13, 78-88.

Rogers, K. B. (1993). Grouping the gifted and talented: Questions and answers. Roeper Review, 16, 8-12.

Rosenthal, R. (1984). Meta-analytic procedures for social research. Beverly Hills, CA: Sage.

Rosnow, R. L., & Rosenthal, R. (1989). Statistical procedures and the justification of knowledge in psychological science. American Psychologist, 44, 1276-1284.

Seifert, T. L. (1991). Determining effect sizes in various experimental designs. Educational and Psychological Measurement, 51, 341-347.

Siegel, S. (1956). Nonparametric statistics Noun 1. nonparametric statistics - the branch of statistics dealing with variables without making assumptions about the form or the parameters of their distribution  for the behavioral sciences. New York: McGraw-Hill.

Snyder, P., & Lawson, S. (1993). Evaluating results using corrected and uncorrected effect size estimates. Journal of Experimental Education, 61, 334-349.

Thompson, B. (1988). A note about significance testing. Measurement and Evaluation in Counseling and Development, 21, 146-148.

Thompson, B. (1989a). Asking "what if" questions about significance tests. Measurement and Evaluation in Counseling and Development, 22, 66-68.

Thompson, B. (1989b). Statistical significance, result importance, and result generalizability: Three noteworthy but somewhat different issues. Measurement and Evaluation in Counseling and Development, 22, 2-5.

Westberg, K. L., Archambault, F. X., Jr., Dobyns, S. M., & Salvin, T. J. (1993). The classroom practices observation study. Journal for the Education of the Gifted, 16, 120-146.

Winne, P. H. (1979). Experiments relating teachers' use of higher cognitive questions to student achievement. Review of Educational Research, 49, 13-50.

Jonathan A. Plucker is an assistant professor of learning, cognition cognition

Act or process of knowing. Cognition includes every mental process that may be described as an experience of knowing (including perceiving, recognizing, conceiving, and reasoning), as distinguished from an experience of feeling or of willing.
, and instruction at Indiana University Indiana University, main campus at Bloomington; state supported; coeducational; chartered 1820 as a seminary, opened 1824. It became a college in 1828 and a university in 1838. The medical center (run jointly with Purdue Univ. .

Manuscript submitted November, 1996. Revision accepted June, 1997.
COPYRIGHT 1997 The Roeper School
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 1997, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Plucker, Jonathan A.
Publication:Roeper Review
Date:Dec 1, 1997
Words:4356
Previous Article:Parent and teacher views of gifted children's social abilities.
Next Article:Word Class: a way to modify spelling instruction for gifted learners.
Topics:



Related Articles
Self-Regulation, Goal Orientation, Self-Efficacy, Worry, and High-Stakes Math Achievement for Mathematically Gifted High School Students(1,2).
Recent Doctoral Dissertation Research on Gifted.(gifted students)
Grouping the gifted and talented: questions and answers.(Ability Grouping and Acceleration)
In search of reality: unraveling the myths about tracking, ability grouping, and the gifted.(Ability Grouping and Acceleration)
A comparison study of student attitudes and perceptions in homogeneous and heterogeneous classrooms.(Ability Grouping and Acceleration)
Twenty-five years later--spinning our wheels or moving forward?
Ability grouping is not just tracking anymore.(On Gifted Students in School)
Two steps forward, one step back: effect size reporting in gifted education research from 1995-2000.(An Evolving Field)
Giftedness in early childhood: the search for complexity and connection.(The Young Gifted Child)
A comparison between intellectually gifted and typical children in their coping responses to a school and a peer stressor.(Social and Emotional...

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles