Printer Friendly
The Free Library
14,757,922 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Getting ahead by staying behind: an evaluation of Florida's program to end social promotion.


Of the many entrenched en·trench   also in·trench
v. en·trenched, en·trench·ing, en·trench·es

v.tr.
1. To provide with a trench, especially for the purpose of fortifying or defending.

2.
 school customs that have been reconsidered and reformed over the past decade, social promotion has been among the most resistant to change. Holding children back in the same grade has long been frowned upon Frowned Upon is an intergender comedy duo made up of Devon T. Coleman and D'Arcy Erokan. Their base of operations is New York City. For the most part, their sketches are a complex analysis of their strange relationship. , and a large body of research seems to support that point of view: retained students tend to have lower test scores and are allegedly more likely to drop out than students who initially performed at an equally low level but were nevertheless promoted.

[ILLUSTRATION OMITTED]

Despite the old habits and the old research, however, school districts across the nation have been slowly but steadily bucking bucking Respiratory therapy Violent resistance by a Pt to intubated ventilation that may cause asynchronous breathing, ergo V/Q mismatching and risk of barotrauma, cardiac arrhythmia, and ↑ intracranial pressure; the newer ventilatory support devices rarely  convention. Several large systems, including Chicago (beginning in 1996), New York New York, state, United States
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of
 (2004), and Philadelphia (2005), now require students in particular grades to demonstrate a benchmark level of mastery in basic skills on a standardized test A standardized test is a test administered and scored in a standard manner. The tests are designed in such a way that the "questions, conditions for administering, scoring procedures, and interpretations are consistent" [1]  before they can be promoted. Florida (2002) and Texas (2002) have taken the lead among states in forbidding social promotions. In 2000, the most recent year for which national enrollment data are available, these five school systems alone enrolled nearly 20 percent of the nation's 3rd-grade students. (For more on Chicago's policy, see Alexander Russo, "Retaining Retention," features, Winter 2005; and Robin Tepper Jacob Jacob (jā`kəb), in the Bible, ancestor of the Hebrews, the younger of Isaac and Rebecca's twin sons; the older was Esau. In exchange for a bowl of lentil soup, Jacob obtained Esau's birthright and, with his mother's help, received the blessing  and Susan Stone, "Teachers and Students Speak," features, Winter 2005.)

But is this new approach to grade promotion effective? And what about those studies that say retention doesn't work? Proponents of the new programs believe that schools do students no favor by promoting them if they don't have the skills to succeed at a higher level. But because these arguments, however plausible, have little research to support them, we set out to determine if they have scientific merit. Our findings from Florida suggest that the use of standardized testing policies to end social promotion can help low-performing students make modest improvements in reading and substantial improvements in math.

Florida's Program to End Social Promotion

Over the past several years Florida has attempted substantial reforms of its struggling public school system, the fourth-largest in the country and one that consistently ranks close to the bottom on academic indicators, including high-school graduation Graduation is the action of receiving or conferring an academic degree or the associated ceremony. The date of event is often called degree day. The event itself is also called commencement, convocation or invocation.  rates and scores on the National Assessment of Educational Progress The National Assessment of Educational Progress (NAEP), also known as "the Nation's Report Card," is the only nationally representative and continuing assessment of what America's students know and can do in various subject areas.  (NAEP NAEP National Assessment of Educational Progress
NAEP National Association of Environmental Professionals
NAEP National Association of Educational Progress
NAEP National Agricultural Extension Policy
NAEP Native American Employment Program
). The Sunshine State had instituted school voucher A school voucher, also called an education voucher, is a certificate by which parents are given the ability to pay for the education of their children at a school of their choice, rather than the public school (UK state school) to which they were assigned.  programs, increased the number of charter schools, and devised a sophisticated accountability system that evaluates schools on the basis of their progress as measured by the Florida Comprehensive Assessment Test The Florida Comprehensive Assessment Test, or the FCAT, is the standardized test used in the primary and secondary public schools of Florida. First administered statewide in 1998[1], it replaced the State Student Assessment Test (SSAT) and the High School  (FCAT FCAT Florida Comprehensive Assessment Test (statewide standardized test for Florida school children) ). But in May 2002, the state legislature A state legislature may refer to a legislative branch or body of a political subdivision in a federal system.

The following legislatures exist in the following political subdivisions:
 made one of its boldest moves, revising the School Code, the state's education law, to require 3rd-grade students to score at the Level-2 benchmark or above on the reading portion of the FCAT in order to be promoted to 4th grade.

The hurdle HURDLE, Eng. law. A species of sledge, used to draw traitors to execution.  created for students was not terribly high. The state's department of education describes a student who scores at Level 2 (of five levels) as having "limited success" against the state standards; only students who score at Level 3 or above are considered to be proficient pro·fi·cient  
adj.
Having or marked by an advanced degree of competence, as in an art, vocation, profession, or branch of learning.

n.
An expert; an adept.
 for the purposes of evaluating schools under No Child Left Behind. Even so, roughly 24 percent of 3rd graders tested in Florida in 2001-02, the year before the retention policy was introduced, performed below Level 2. This number fell slightly, to 22 percent, in the 2002-03 academic year.

Not all these students were retained, however, even after the policy change. The law allowed for exceptions to the retention policy if a student had limited English proficiency pro·fi·cien·cy  
n. pl. pro·fi·cien·cies
The state or quality of being proficient; competence.

Noun 1. proficiency - the quality of having great facility and competence
 or a severe disability, scored above the 51st percentile percentile,
n the number in a frequency distribution below which a certain percentage of fees will fall. E.g., the ninetieth percentile is the number that divides the distribution of fees into the lower 90% and the upper 10%, or that fee level
 on the Stanford-9 standardized test, had demonstrated proficiency through a performance portfolio, or had already been held back for two years. Altogether, roughly 40 percent of the 3rd-grade students who scored below the Level-2 threshold in 2002-03 were promoted.

The Problem with Earlier Studies

Traditionally, the retention of a student, uncommon as it was, resulted from an individual teacher's assessment of the student's ability to succeed at the next level. But such teacher discretion, while arguably ar·gu·a·ble  
adj.
1. Open to argument: an arguable question, still unresolved.

2. That can be argued plausibly; defensible in argument: three arguable points of law.
 desirable as a matter of policy, is the primary reason earlier studies of social promotion are flawed flaw 1  
n.
1. An imperfection, often concealed, that impairs soundness: a flaw in the crystal that caused it to shatter. See Synonyms at blemish.

2.
. We must assume from studying those retention programs, which are still the predominant pre·dom·i·nant  
adj.
1. Having greatest ascendancy, importance, influence, authority, or force. See Synonyms at dominant.

2.
 practice in schools throughout the United States United States, officially United States of America, republic (2005 est. pop. 295,734,000), 3,539,227 sq mi (9,166,598 sq km), North America. The United States is the world's third largest country in population and the fourth largest country in area. , that students who were held back were fundamentally different from students who were promoted. Because teachers were considering intangible factors, even when race, gender, family income, and academic achievement are the same, there was no way to isolate isolate /iso·late/ (i´sah-lat)
1. to separate from others.

2. a group of individuals prevented by geographic, genetic, ecologic, social, or artificial barriers from interbreeding with others of their kind.
 the effect of being held back, much less to make reasonable conclusions about the effects of retention on a student's academic achievement or the probability of his dropping out of high school. Are students who were retained less likely to graduate because they were retained? Or were they retained because of characteristics that also predisposed pre·dis·pose  
v. pre·dis·posed, pre·dis·pos·ing, pre·dis·pos·es

v.tr.
1.
a. To make (someone) inclined to something in advance:
 them to drop out? Because the retention policies were subjective, we will simply never know.

There are also reasons to believe that subjective retention policies affect students differently than policies that use promotion criteria like performance on standardized tests. If promotion depends on an individual teacher's assessment of a child, then that child is not likely to know what he or she must do to avoid being held back. Also, if few students were being held back, then those students might perform worse because they felt excluded and inferior INFERIOR. One who in relation to another has less power and is below him; one who is bound to obey another. He who makes the law is the superior; he who is bound to obey it, the inferior. 1 Bouv. Inst. n. 8. . A policy that holds back thousands of students might dilute di·lute
v.
To reduce a solution or mixture in concentration, quality, strength, or purity, as by adding water.

adj.
Thinned or weakened by diluting.
 this sense of being singled out. Finally, subjective assessments of students are vulnerable to inappropriate influences, including teachers' prejudices and pressure brought by parents, in ways that objective criteria of performance might inhibit inhibit /in·hib·it/ (in-hib´it) to retard, arrest, or restrain.

in·hib·it
v.
1. To hold back; restrain.

2.
.

Implementing objective standards, even if they were accompanied by subjective exemptions, might significantly change the effects of retention in ways that previous research could not anticipate or measure. For research purposes, objective retention policies also create a useful comparison group of students not subject to retention. In the case of Florida's program to end social promotion, for example, we can compare students who were subject to the threat of retention with students who would have been had they been born a year later.

What a Difference a Year Makes

To determine the impact of ending social promotion for 3rd graders in Florida, we compared low-scoring 3rd graders in 2002, the first students to be subject to the program, with low-scoring 3rd graders from the previous year. Of the 43,996 3rd graders in 2002 for whom we have valid test scores on both FCAT math and reading assessments, 60 percent were actually retained. By contrast, of the 45,401 3rd graders in 2001 for whom we have valid test scores, only 9 percent were retained. Our analysis assumes that the students from the two school years should be similar in all respects except for the year in which they happened to have been born. We analyzed an·a·lyze  
tr.v. an·a·lyzed, an·a·lyz·ing, an·a·lyz·es
1. To examine methodically by separating into parts and studying their interrelations.

2. Chemistry To make a chemical analysis of.

3.
 the test-score improvements made between each student's first 3rd-grade year and the following year on both the state's own accountability exam and the Stanford-9, a nationally normed exam administered at the same time as the FCAT but not used for accountability purposes.

We measure FCAT performance using developmental-scale scores, which allow us to compare the test-score gains of all the students in our study, even though they took tests designed for different grade levels. Developmental-scale scores are designed to measure academic proficiency on a single scale for students of any grade and in any year. For example, a 3rd grader A grader, also commonly referred to as a blade or a motor grader, is an engineering vehicle with a large blade used to create a flat surface. Typical models have three axles, with the engine and cab situated above the rear axles at one end of the vehicle and a third  with a developmental-scale score of 1,000 and a 4th grader with a developmental-scale score of 1,000 have the same level of academic achievement; if a student gets a developmental-scale score of 1,000 in 2001 and then gets the same score of 1,000 in 2002, this indicates that the student has not made any academic progress in the intervening in·ter·vene  
intr.v. in·ter·vened, in·ter·ven·ing, in·ter·venes
1. To come, appear, or lie between two things: You can't see the lake from there because the house intervenes.

2.
 year. The developmental-scale scores required to reach Level 2 on the FCAT reading test were consistent for each year's cohort cohort /co·hort/ (ko´hort)
1. in epidemiology, a group of individuals sharing a common characteristic and observed over time in the group.

2.
.

We began by measuring the effect on all low-scoring 3rd graders of simply having been subject to the new policy. That is, we did not distinguish in our initial analysis between students who were actually retained and those who received an exemption and were promoted to the next grade. This analysis provides an estimate of the average impact of the policy change on all students in the state performing below the Level-2 benchmark. It also allows for the possibility that exempted students enjoyed spillover spill·o·ver  
n.
1. The act or an instance of spilling over.

2. An amount or quantity spilled over.

3. A side effect arising from or as if from an unpredicted source:
 benefits from the retention policy, since they were now being instructed in a system in which fewer students in 4th grade were unprepared to do grade-level work.

To identify the policy's average impact, we compared the gains in developmental-scale scores made by students who first entered 3rd grade in 2002 and scored below the FCAT benchmark with gains made by students who first entered 3rd grade in 2001 and scored below the FCAT benchmark. In making this comparison, we took into account other factors that could affect achievement gains, such as the student's race, whether the student received a free or reduced-price school lunch, whether the student was deemed Limited English Proficient, and the student's precise test score during his first 3rd-grade year. With these differences accounted for, the only distinction between the two groups of students was assumed to be that the former group entered the school system a year later and was therefore subject to the new policy in 3rd grade.

As discussed above, however, many low-scoring 3rd graders were granted exemptions and promoted to the 4th grade even under the new policy. We therefore also evaluated the effect of actually being retained, again controlling for race, eligibility for free or reduced-price lunch, English proficiency, and baseline test baseline test Clinical practice Any test than measures current or pre-treatment parameters, including chemistries, cell counts, enzyme levels and so on, against which response(s) to therapy, if any, is evaluated  scores. In conducting this analysis, we also needed to account for the fact that the students who were held back were a select group of students who could differ in important ways from the promoted students. Presumably pre·sum·a·ble  
adj.
That can be presumed or taken for granted; reasonable as a supposition: presumable causes of the disaster.
, teachers and other decisionmakers expected these students, unlike promoted students, to benefit from an additional year as 3rd graders. Fortunately, the fact that simply having entered school a year later increased the probability of retention for all low-scoring students again provides a way around this obvious selection problem. In essence, the statistical method we use compares those retained students that our data suggest would not have been retained the previous year with a comparable group of students who were not retained. Our results therefore indicate the effect of retention on those students who were held back as a result of the new policy.

During this time, Florida was engaged in other education reforms as well: instituting several school-voucher programs, increasing the number of charter schools in the state, and improving the system used to assign grades to schools based on the FCAT. However, it is reasonable to assume that whatever effect these other policies have on our analyses is minor. In order for the existence of another policy to affect our results significantly, we would have to believe that the program substantially improved the education of the 3rd graders in 2002-03 without having a similar effect on the previous year's cohort. Moreover, while a sudden policy change could conceivably con·ceive  
v. con·ceived, con·ceiv·ing, con·ceives

v.tr.
1. To become pregnant with (offspring).

2.
 explain the overall improvements between the two cohorts, it is difficult to see how such a change could cause substantially larger gains among those students actually retained.

Retention Works

Our fundamental findings from an analysis of the 3rd- and 4th-grade data for these two years indicate that the performance of students identified for retention, regardless of whether they were retained or exempted and promoted, exceeded the performance of low-performing students from the previous year who were not subject to the retention policy; and students who were actually retained made the larger relative gains.

Students identified for retention by the Florida policy gained 0.06 of a standard deviation In statistics, the average amount a number varies from the average number in a series of numbers.

(statistics) standard deviation - (SD) A measure of the range of values in a set of numbers.
 in reading on both the FCAT and Stanford-9 over equally low-performing 3rd graders from the previous school year (see Figure 1). In math, students identified for retention surpassed low performers who were not subject to the policy by 0.15 standard deviations (4.8 percentiles) on the FCAT and 0.14 standard deviations (4.4 percentiles) on the Stanford-9.

Students who were actually retained experienced even larger relative improvements (see Figure 2). Retained students performed better than low-scoring students who were promoted by 0.13 standard deviations (4.10 percentiles) on the FCAT and 0.11 standard deviations (3.45 percentiles) on the Stanford-9 in reading. In math retained students improved 0.30 standard deviations (10.0 percentiles) on the FCAT and 0.28 standard deviations (9.3 percentiles) on the Stanford-9 over promoted students.

Some critics of the new retention policies argued that teachers and schools would respond to them by manipulating test scores, either directly by cheating or indirectly by teaching students skills that would help them to improve their test scores but would not provide real academic proficiency. This argument would have merit only if we found strong gains on the high-stakes FCAT and no similar gains on the low-stakes Stanford-9, for which there is no incentive to manipulate manipulate

To cause a security to sell at an artificial price. Although investment bankers are permitted to manipulate temporarily the stock they underwrite, most other forms of manipulation are illegal.
 scores. But our results are consistent between the FCAT and the Stanford-9, indicating that there have been no serious manipulations of the high-stakes testing A high-stakes test is an assessment which has important consequences for the test taker. If the examinee passes the test, then the examinee may receive significant benefits, such as a high school diploma or a license to practice law.  system. If teachers are in fact changing their curricula with the intent to "teach to" the FCAT, they are doing so in ways that also contribute to gains on the highly respected Stanford-9. This would indicate that teachers have made changes resulting in real increases in students' proficiency.

An unexpected benefit of the retention policy is the improvement in math scores. This might seem odd, given that it is the reading portion of the FCAT that students must pass to earn promotion and that the rhetoric supporting Florida's retention program emphasizes that it will improve student literacy. Of course, the math gains could simply reflect the fact that math skills are learned primarily in schools, while reading is practiced both in and outside of school. For this reason, evaluations of school reforms frequently find stronger effects in math than in reading. Alternatively, it may be that students who were retained specifically because of their poor reading skills are particularly poor in that subject and that this limits their room for improvement.

We also explored the possibility that the objective retention program could have different effects on students of different races. Our results show gains of similar sizes by the three racial groups for which we have an adequate sample size to have reasonable confidence in our findings: white, black, and Hispanic Hispanic Multiculture A person of Mexican, Puerto Rican, Cuban, Central or South American, or other Spanish culture or origin, regardless of race Social medicine Any of 17 major Latino subcultures, concentrated in California, Texas, Chicago, Miam, NY, and elsewhere . The exception is for whites' performance on the FCAT reading test. It is difficult for us to interpret why white students would fail to benefit from the retention policy as measured by the FCAT reading test but would be shown to benefit as measured by the Stanford-9 reading test.

Our results also suggest that low-scoring Florida 3rd graders who were given an exemption and promoted might have benefited from another year in the 3rd grade. This does not mean that it would be wise to eliminate all exemptions to the testing requirement. There are certainly students for whom testing is either inappropriate or whose performance on other academic measures could reasonably indicate that they would be better served by moving on to the next grade. However, our findings do indicate that teachers and school systems should be cautious when granting exemptions.

What It Means

At first glance our findings seem inconsistent with evaluations of Chicago's program ending social promotion, to our knowledge the only similarly designed retention policy to be evaluated using comparable methods. In Chicago, students in the 3rd, 6th, and 8th grades must exceed benchmarks on the Iowa Test of Basic Skills The Iowa Test of Basic Skills (ITBS) are a set of standardized tests given annually to school students in the United States. These tests are given to students beginning in kindergarten and progressing until Grade 8 to assess educational development.  (ITBS ITBS Iowa Test of Basic Skills
ITBS Iliotibial Band Syndrome
ITBS Industrial Technologies Business Solutions
), a respected standardized test, in order to be promoted to the next grade. In a study conducted in 2004 by scholars at the Consortium on Chicago School Chicago School

Group of architects and engineers who in the 1890s exploited the twin developments of structural steel framing and the electrified elevator, paving the way for the ubiquitous modern-day skyscraper.
 Research, the performance of 3rd- and 6th-grade students who scored just below the benchmark on the ITBS, most of whom were retained because of the mandate, was compared with the performance of students who scored just above the benchmark, most of whom were promoted. The Chicago researchers were able to measure test-score performance for two years after implementation of the program. They found benefits from the program after one year, similar to what we found in Florida, but discovered that those benefits went away after the second year. Third-grade students were not affected, and 6th-grade students were negatively affected by the policy in their performance on the ITBS reading test. The findings on the Chicago retention program emphasize the importance of following the progress of retained students in Florida over time.

Still, the Chicago policy differs from Florida's in some respects. In 1999 the Chicago policy stopped allowing students to be retained twice, which Florida's policy does allow. This difference might reduce teachers' motivation to work with already retained students, whom they now can expect to be promoted the next year regardless of their performance. Other programs with different and more stable retention policies might show different results.

Finally, while our study provides valuable information about the effectiveness of Florida's policy to end social promotion, it does not offer a full catalog catalog, descriptive list, on cards or in a book, of the contents of a library. Assurbanipal's library at Nineveh was cataloged on shelves of slate. The first known subject catalog was compiled by Callimachus at the Alexandrian Library in the 3d cent. B.C.  of the policy's benefits or of its potential costs. It will be some time before we can examine whether retention increased or reduced the probability of dropping out of school later on. Most important, it does not provide any information about the program's effects on students' academic progress the first time they were in 3rd grade. The policy's greatest benefits could result not from retention itself, but rather from increased efforts on the part of teachers and even students to avoid being retained in the first place.

Jay P. Greene is professor and head of the Department of Education Reform, the University of Arkansas The University of Arkansas strives to be known as a "nationally competitive, student-centered research university serving Arkansas and the world." The school recently completed its "Campaign for the 21st Century," in which the university raised more than $1 billion for the school, used ; he is also a senior fellow at the Manhattan Institute The Manhattan Institute for Policy Research is a self-described "free market think tank" established in New York City in 1978, with its headquarters on Vanderbilt Avenue in Midtown Manhattan. . Marcus A. Winters is a doctoral fellow, the University of Arkansas and a senior research associate at the Manhattan Institute.
A Productive Policy (Figure 1)

Low-scoring 3rd graders subjected to Florida's new retention policy in
2003 made larger test-score gains the following year than did comparable
students not subjected to the policy who entered 3rd grade in 2002.

Change in Test-Score Gains of Low-Performing Students due to the
Retention Policy

                     Percent of a Standard Deviation

FCAT reading                      6
Stanford-9 reading                6
FCAT math                        15
Stanford-9 math                  14

Note: All effects are statistically significant at the 0.001 level and
control for differences in race, free or reduced-price lunch status,
Limited English Proficiency status, and prior test scores.
SOURCE: Authors' calculations from Florida Department of Education data

Note: Table made from bar graph.

Retention Works (Figure 2)

Students retained in 2003 as a result of the new policy made
substantially more progress in reading and, especially, in math than
comparable students who were not retained.

Change in Test-Score Gains of All Students Who Were Retained

                     Percent of a Standard Deviation

FACT reading                     13
Stanford-9 reading               11
FCAT math                        30
Stanford-9 math                  28

Note: All effects are statistically significant at the 0.001 level and
are adjusted for differences in race, free or reduced-price lunch
status, Limited English Proficiency status, and prior test scores.
SOURCE: Authors' calculations from Florida Department of Education data

Note: Table made from bar graph.
COPYRIGHT 2006 Hoover Institution Press
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2006, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:research
Author:Winters, Marcus A.
Publication:Education Next
Geographic Code:1USA
Date:Mar 22, 2006
Words:3207
Previous Article:When principals rate teachers: the best--and the worst--stand out.(research)
Next Article:Savage exaggerations: worshiping the cosmology of Jonathan Kozol.(The Shame of the Nation: The Restoration of Apartheid Schooling in...
Topics:



Related Articles
Retention vs. Social Promotion.
PASS OR FAIL?; PLAN TO END SOCIAL PROMOTION DEBATED.(NEWS)
Hutchinson study, gold standard or spruce goose: an epistemological view of prevention research.
Report on a formative evaluation conducted for the youth against tobacco counter marketing campaign.
Building social policy evaluation capacity.
Positioning social marketing as a planning process for health education.
The road not traveled: promotion or retention? Struggling students fare better in the largely uncharted gray area between the two.
Retention or promotion? Wrong question.(Research corner: essentials on education data and analysis from research authority AEL)
Effects of Kindergarten Retention Policy on Children's Cognitive Growth in Reading and Mathematics.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles