The gentleman's "A": new evidence that tough-grading teachers elicit better student performance.WITH REPORTS THAT SOME OF THE NATION'S FINEST UNIVERSITIES have been handing out A's like lollipops at Halloween Halloween (hăl'əwēn`, häl'–), Oct. 31, the eve of All Saints' Day, observed with traditional games and customs. The word comes from medieval England's All Hallows' eve (Old Eng. hallow="saint"). , the lowering of standards in higher education higher education Study beyond the level of secondary education. Institutions of higher education include not only colleges and universities but also professional schools in such fields as law, theology, medicine, business, music, and art. has become a hot topic. But grading standards in primary and secondary education have received remarkably less attention. There are two major questions related to grading standards. First, to what degree do the grades distributed by schools and teachers correspond to their students' performance on state and national exams? Second, and more important, how does "tough" or "easy" grading affect students' learning? The literature on these questions is extremely thin. In fact, to our knowledge, the analysis presented here represents the first study to examine the grading standards of individual teachers and how those standards affect students' performance on independent exams. Our data set enabled us to examine the test-score gains of individual students from grade to grade across three school years. Thus we can see how individual students perform on nationally normed exams as they move from "tough" to "easy" grading teachers and vice versa VICE VERSA. On the contrary; on opposite sides. . Our results suggest that elementary-school students learn more with "tough" teachers, with the effects varying depending on students' initial performance levels and on the overall performance level of their classrooms. Measuring Grading Standards For this study, we analyzed an·a·lyze tr.v. an·a·lyzed, an·a·lyz·ing, an·a·lyz·es 1. To examine methodically by separating into parts and studying their interrelations. 2. Chemistry To make a chemical analysis of. 3. confidential data provided by the school board of Alachua County, Florida Coordinates: Alachua County is a county located in the U.S. state of Florida. The U.S. Census Bureau 2006 estimate for the county is 227,120.[1] Its county seat is Gainesville, FloridaGR6. , which includes the city of Gainesville Gainesville. 1 City (1990 pop. 84,770), seat of Alachua co., N central Fla.; inc. 1869. The Univ. of Florida is a major source of employment in the city. Agriculture and the manufacture of electronic equipment add to the economy. . The data consist of observations on almost every 3rd, 4th, and 5th grader A grader, also commonly referred to as a blade or a motor grader, is an engineering vehicle with a large blade used to create a flat surface. Typical models have three axles, with the engine and cab situated above the rear axles at one end of the vehicle and a third in the school system between the 1995-96 and 1998-99 school years, allowing us to follow two cohorts with three years of data each. Alachua County Public Schools is a relatively large district (by national standards), averaging about 1,800 test-taking students per grade, per year. The county is racially diverse, with a student population that is 60 percent white, 34 percent African-American, 3 percent Hispanic Hispanic Multiculture A person of Mexican, Puerto Rican, Cuban, Central or South American, or other Spanish culture or origin, regardless of race Social medicine Any of 17 major Latino subcultures, concentrated in California, Texas, Chicago, Miam, NY, and elsewhere , and 2 percent Asian. Nearly half of all students are eligible for subsidized sub·si·dize tr.v. sub·si·dized, sub·si·diz·ing, sub·si·diz·es 1. To assist or support with a subsidy. 2. To secure the assistance of by granting a subsidy. lunches, while 19 percent are identified as gifted, 8 percent as learning disabled, and less than 1 percent as English 1. English - (Obsolete) The source code for a program, which may be in any language, as opposed to the linkable or executable binary produced from it by a compiler. The idea behind the term is that to a real hacker, a program written in his favourite programming language is learners. [ILLUSTRATION OMITTED] Alachua County provides a unique advantage for a study of this nature because it administers both the Iowa Test of Basic Skills The Iowa Test of Basic Skills (ITBS) are a set of standardized tests given annually to school students in the United States. These tests are given to students beginning in kindergarten and progressing until Grade 8 to assess educational development. (ITBS ITBS Iowa Test of Basic Skills ITBS Iliotibial Band Syndrome ITBS Industrial Technologies Business Solutions ), a nationally normed exam, and the Florida Comprehensive Assessment Test The Florida Comprehensive Assessment Test, or the FCAT, is the standardized test used in the primary and secondary public schools of Florida. First administered statewide in 1998[1], it replaced the State Student Assessment Test (SSAT) and the High School (FCAT FCAT Florida Comprehensive Assessment Test (statewide standardized test for Florida school children) ). The FCAT was designed to measure the degree to which students are meeting the Sunshine State Standards, which are also supposed to be the basis for students' letter grades. Today, the state uses the FCAT both to rate public schools as part of the "A+" accountability plan and to determine whether students in certain grades are promoted to the next grade. During the period covered by our data, however, these tests were used for informative and diagnostic purposes only (though they were publicly reported each year at the school level). Students receive scores on the FCAT ranging from 1 (lowest) to 5 (highest), with the thresholds for each performance level designed to correspond with the letter grades A through F. Thus results from the FCAT are ideal for developing a measure of how generous individual teachers' grading policies are. We can then examine the relationship between teachers' grading standards and their students' performance gains on a different test, the ITBS. Our primary measure of teachers' grading standards is the average gap between the letter grades given by particular teachers and the FCAT scores attained at·tain v. at·tained, at·tain·ing, at·tains v.tr. 1. To gain as an objective; achieve: attain a diploma by hard work. 2. by their students. Students take the FCAT math exam in 5th grade and the FCAT reading exam in 4th grade. Consequently, this measure of grading standards is calculated using the math grades and test scores of 5th-grade teachers, and the reading grades and test scores of 4th-grade teachers. Examining students' performance on the ITBS in the 3rd, 4th, and 5th grades enables us to see how their gains in reading from 3rd to 4th grade, and in math from 4th to 5th grade, were affected by their teachers' grading standards that academic year. Grade Inflation It turns out that the grades teachers assign are highly correlated cor·re·late v. cor·re·lat·ed, cor·re·lat·ing, cor·re·lates v.tr. 1. To put or bring into causal, complementary, parallel, or reciprocal relation. 2. with students' ITBS and FCAT scores. But teachers also tend to grade far less stringently than the state standards indicate they should (see Figure 1). For instance, just 9 percent of students who were awarded A's by their teachers attained a score of 5 on the FCAT. In fact, just 50 percent attained even a 4. Only 11 percent of students awarded B's by their teachers attained level 4 or above, and a mere 39 percent attained level 3 or above. And of students awarded C's, only 14 percent attained level 3 or above, and only 39 percent attained level 2 or above. Put differently Adv. 1. put differently - otherwise stated; "in other words, we are broke" in other words , 86 percent of "C students" failed to achieve the minimum level of competency COMPETENCY, evidence. The legal fitness or ability of a witness to be heard on the trial of a cause. This term is also applied to written or other evidence which may be legally given on such trial, as, depositions, letters, account-books, and the like. 2. accepted (Level 3) on the Florida Florida, state, United States Florida (flôr`ĭdə, flŏr`–), state in the extreme SE United States. A long, low peninsula between the Atlantic Ocean (E) and the Gulf of Mexico (W), Florida is bordered by Georgia and standards, along with 61 percent of "B students" and 17 percent of "A students." Yet not all teachers are so lax LAX - LAnguage eXample. A toy language used to illustrate compiler design. ["Compiler Construction", W.M. Waite et al, Springer 1984]. : Among the top half of teachers as ranked by their grading standards, 65 percent of A students attained level 4 or above while just 5 percent attained level 2 or below. In short, teachers vary considerably in their grading standards, even within a single school district. In fact, teachers' grading standards often vary as much within a single school as within the school district as a whole. For instance, during the 1997-98 school year, the district-wide standard deviation In statistics, the average amount a number varies from the average number in a series of numbers. (statistics) standard deviation - (SD) A measure of the range of values in a set of numbers. in teacher-level grading standards was 0.68, while the mean within-school standard deviation in grading standards was 0.60. Tough-grading teachers, it appears, often teach alongside teachers with far lower standards. For research purposes, this is reassuring re·as·sure tr.v. re·as·sured, re·as·sur·ing, re·as·sures 1. To restore confidence to. 2. To assure again. 3. To reinsure. , since our empirical strategy relies mainly on within-school variation in teachers' grading standards to isolate isolate /iso·late/ (i´sah-lat) 1. to separate from others. 2. a group of individuals prevented by geographic, genetic, ecologic, social, or artificial barriers from interbreeding with others of their kind. the effects of those standards. A Stable Trait trait (trat) 1. any genetically determined characteristic; also, the condition prevailing in the heterozygous state of a recessive disorder, as the sickle cell trait. 2. a distinctive behavior pattern. Estimating the effect of individual teachers' grading standards on their students' achievement gains assumes that these standards remain relatively consistent over time, that they are not unduly influenced by the composition of their class, and that they are not a reflection of some other observable ob·serv·a·ble adj. 1. Possible to observe: observable phenomena; an observable change in demeanor. See Synonyms at noticeable. 2. characteristic that might account for any effects we observe. Fortunately, our data provide evidence in support of each of these assumptions. To see whether teachers' grading standards remained stable over time, we divided the full sample of teachers into thirds according to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. their grading standards each year and examined how the position of individual teachers changed from year to year. For instance, we found that 75 percent of the teachers whose standards put them in the "easy" category (on a scale from "easy" to "moderate" to "tough") in one year remained in that category the following year, while just 6 percent evolved from easy to tough graders in a single year. This trend was essentially the same across the three categories, with very little movement between groups. Nor does it appear that teachers' grading standards are influenced by the ability level of their students. To gauge this, we compared teachers who taught a higher ability class, as measured by their average 3rd-grade test scores, in 1998-99 than in the previous year, and vice versa. We found that even large changes in the ability level faced by teachers do not seem to affect their grading standards. Turning finally to the relationship between other observable teacher characteristics and grading standards, we found that relatively tough graders are in fact slightly more experienced and slightly less likely to have attended a selective or highly selective undergraduate institution, though these differences are not statistically significant. However, tough graders are significantly more likely to hold Master's degrees master's degree n. An academic degree conferred by a college or university upon those who complete at least one year of prescribed study beyond the bachelor's degree. Noun 1. . In any case, our analysis below controls for each of these measures of teachers' qualifications in order to rule out the possibility that teachers' observed characteristics drive the estimated effects of grading standards on student outcomes. Classroom Assignment The method by which students are assigned as·sign tr.v. as·signed, as·sign·ing, as·signs 1. To set apart for a particular purpose; designate: assigned a day for the inspection. 2. to teachers could also cause problems for an analysis of the effect of grading standards on performance gains. The fact that students are not assigned to teachers randomly would be especially troublesome in a cross-sectional analysis Cross-sectional analysis Assessment of relationships among a cross-section of firms, countries, or some other variable at one particular time. , in which one compares one classroom with another in the same year. For instance, looking in cross-section cross section also cross-sec·tion n. 1. a. A section formed by a plane cutting through an object, usually at right angles to an axis. b. A piece so cut or a graphic representation of such a piece. 2. across our own data set reveals that teachers with high standards also have students who are more likely to be white or gifted and less likely to be low-income low-in·come adj. Of or relating to individuals or households supported by an income that is below average. or learning disabled. This is true even within a given school. Hence it is unclear whether the outcomes associated with high standards are actually due to the standards themselves. But our analysis looks at year-to-year changes in the grading standards faced by a student, making this less of a concern. We found that students are nearly as likely to move to a teacher with different standards as to experience the same grading standards from year to year. For instance, just 57 percent of students with teachers whose grading standards are below the median within their own school continue to have below-median teachers the next year. Likewise, 54 percent of students with above-median teachers continue to have above-median teachers the next year. This indicates that year-to-year differences in grading standards within schools are close to random. Similar patterns are observed for most subgroups--black and white students are approximately equally likely to move between groups, as are students who are eligible and ineligible in·el·i·gi·ble adj. 1. Disqualified by law, rule, or provision: ineligible to run for office; ineligible for health benefits. 2. for a free lunch. It is the case that gifted students (around 12 percent of the sample), no matter where they start out, are considerably more likely to be placed with a high-standards teacher the next year than are nongifted students. Nevertheless, the vast majority of students are almost as likely to move between low-standards and high-standards teachers as to experience the same level of standards across years. And the results presented below do not change materially when gifted students are excluded from the analysis. Empirical Results For our primary analysis, we controlled for the average annual gain made by all students in the relevant school during the period of analysis; for such classroom characteristics as the share of white students, the share eligible for free lunches, and the students' average math score in 3rd grade; and for the teacher's years of experience, education level, and the selectivity selectivity /se·lec·tiv·i·ty/ (se-lek-tiv´i-te) in pharmacology, the degree to which a dose of a drug produces the desired effect in relation to adverse effects. selectivity 1. of his or her undergraduate institution. In the end, we were interested in the effects on ITBS scores of changing a student from one level of grading standards to another. Using this strategy, we found statistically significant improvements in test scores associated with higher standards. An increase in measured grading standards of 1 standard deviation is associated with about one-fifth of a year of schooling's worth of gains in test scores--a large effect relative to many other interventions. (A year of test-score gain is measured as the average gain from one year to the next in Alachua County Public Schools. Because Alachua County's gain scores tend to be larger than the national average, these are more conservative estimates of years of gain than are those based on national grade equivalents.) While the average effects of grading standards are important, the theoretical literature on grading standards suggests that higher standards could produce both winners and losers. Students who achieve a given standard may be made better off because the standard becomes a more meaningful accomplishment. But those students who are not able to achieve the standard precisely because it is now more rigorous are made worse off. To study this issue, we tested whether the effect of high grading The term high grading has uses in forestry, mining, and fishing relating to selectively harvesting goods. Also known as “cutting the best and leaving the rest” Mining standards varied for students with different initial test scores in 3rd grade. We found that the positive effects of higher grading standards were restricted to those students who were no more than 0.8 and 0.9 standard deviation below the average score in reading and math, respectively. For students performing below this threshold, grading standards had no detectable effect, either positive or negative, on performance. We also found that higher-achieving classes, as measured by their average 3rd-grade test score in the relevant subject, may fare somewhat better than lower-achieving classes under teachers with tough grading standards. Equally interesting, however, are the distributional effects of tough grading standards on performance gains within a given classroom. Put differently, are the benefits of high standards uniform within a class, or do some children benefit more than others? We found that high-achieving students benefit most from tough grading standards when they are placed in classrooms where the overall level of achievement is relatively low (see Figure 3). The opposite is also true: tough grading standards elicit e·lic·it tr.v. e·lic·it·ed, e·lic·it·ing, e·lic·its 1. a. To bring or draw out (something latent); educe. b. To arrive at (a truth, for example) by logic. 2. the most improvement from low-achieving students when they are in classrooms with relatively high overall achievement. This result has intuitive appeal. Since the grades assigned vary much less across classrooms than does students' performance on standardized tests A standardized test is a test administered and scored in a standard manner. The tests are designed in such a way that the "questions, conditions for administering, scoring procedures, and interpretations are consistent" [1] , high-achieving students should be more likely to earn high grades in classrooms where the other students, on average, do not perform well on external assessments. Likewise, low-achieving students in classes with many strong students run a greater risk of receiving a low grade than they do when in low-achieving classes. So it seems only sensible that initially high-achieving students benefit most from tough teachers when they are among the strongest members of a class. Similarly, initially low-achieving students are challenged more to get a good grade with tough teachers, but particularly when they are among the weakest members of a class. Parental Involvement What might explain the positive effects of higher grading standards? One possibility, of course, is that high standards motivate students to work harder. A second possibility is that parents may devote more attention to their children's schoolwork if their grades suggest that they are struggling, as they might with a tough-grading teacher. To assess the latter possibility, in the spring of 2001 we conducted a survey of parents with students in both 4th and 5th grades in Alachua County. We asked the responsible parent to report on how much time he or she spends weekly helping each of the two children with their homework. This allowed us to control for factors, such as parental motivation, that might be common to both siblings siblings npl (formal) → frères et sœurs mpl (de mêmes parents) in a household. We found that, holding constant the child's grade level, 3rd-grade test scores, and the average 3rd-grade test score in the child's class, parents spend more time helping the child with the tougher teacher with homework than they do helping the sibling sibling /sib·ling/ (sib´ling) any of two or more offspring of the same parents; a brother or sister. sib·ling n. with the easier teacher. These results do not appear to be due to tougher teachers' assigning as·sign tr.v. as·signed, as·sign·ing, as·signs 1. To set apart for a particular purpose; designate: assigned a day for the inspection. 2. more homework; parental reports suggest that the typical tough teacher assigns Individuals to whom property is, will, or may be transferred by conveyance, will, Descent and Distribution, or statute; assignees. The term assigns is often found in deeds; for example, "heirs, administrators, and assigns to denote the assignable nature of just 10 percent more homework than the typical easy teacher. [ILLUSTRATION OMITTED] Another intriguing in·trigue n. 1. a. A secret or underhand scheme; a plot. b. The practice of or involvement in such schemes. 2. A clandestine love affair. v. finding from this survey is that parents do not perceive tougher teachers to be better teachers. We asked parents to grade their children's teachers from A to F. While there is relatively little variation in these grades (in their own form of grade inflation, two-thirds of the parents gave their children's teachers A's), the results suggest that, if anything, parents view tough teachers less favorably fa·vor·a·ble adj. 1. Advantageous; helpful: favorable winds. 2. Encouraging; propitious: a favorable diagnosis. 3. than they view easier teachers. Parents were 50 percent more likely to assign a grade of B or below to a tough teacher than to a relatively easy teacher, after adjusting for the same controls as above. This result suggests that our measure of grading standards is not merely reflecting some other attribute of a teacher that is viewed as desirable to parents. It also bolsters our argument that it is high grading standards rather than some unobserved measure of teacher quality that is responsible for the positive effects on students' performance gains. Conclusion Our results indicate that students benefit academically from higher grading standards. However, these results were not uniform: high-ability students appear to benefit more than low-ability students from high grading standards. Moreover, initially low-performing students appear to benefit more from high grading standards when they are placed in high-achieving classrooms. Likewise, high-performing students appear to react best to high grading standards when placed in low-achieving classrooms. It is, however, premature to conclude that high grading standards are unambiguously desirable. We cannot yet speak to the consequences of teacher-level grading standards at the secondary level, where the same effects might not hold and where high grading standards might even lead more students to drop out of school altogether. In addition, our results do not tell us anything about how to raise the grading standards of teachers whose standards are currently low. Before we can recommend a general policy of higher standards, we must understand the distributional consequences at all levels and know how to implement a policy of high standards. As the yawning yawning a deep, involuntary inspiration with the mouth open, often accompanied by the act of stretching. Repeated yawning in the presence of other signs, may accompany signs of chronic abdominal pain or hepatic disease. gaps between the grades teachers presently give out and their students' scores on state tests suggest, it will be no easy task to change teachers' grading habits.
Grade Inflation (Figure 1)
Overall, students who receive high grades tend to attain higher scores
than their peers on the Florida Comprehensive Assessment Test (FCAT).
However, just 50 percent of "A students" and 11 percent of "B students"
received scores of 4 or higher on the FCAT.
Classroom Letter Grade % of Students Scoring a 4 or 5 on the FCAT
A Students 50%
B Students 11%
Note: FCAT scores are designed to correspond to letter grades, e.g.,
5 = A, 4 = B.
SOURCE: Authors
Note: Table made from bar graph.
"Tough" versus "Easy" Teachers (Figure 2)
Students who earn an A from a teacher with high grading standards are
more than twice as likely to score a 4 or better on the FCAT as are
students who receive A's from an easy-grading teacher.
Percentage of "A Students" Scoring a 4 or Above or a 2 or Below on the
FCAT in Classrooms with Different Grading Standards
% of Students
FCAT Score of 5 or 4 FCAT Score of 2 or 1
Tough Teachers* 65% 5%
Easy Teachers** 28% 32%
* Teachers whose grading standards are above the median.
** Teachers whose grading standards are below the median.
Note: FCAT scores are designed to correspond to letter grades, e.g.,
5 = A, 4 = B.
SOURCE: Authors
Note: Table made from bar graph.
Benefits from Higher Grading Standards Vary (Figure 3)
For high-achieving students, the positive effects of tough grading
standards are larger when they are taught alongside relatively
low-achieving peers. The opposite is true for low-achieving students.
Impact of Higher Grading Standards on High-Achieving Students*
Years of Extra Growth in
ITBS Reading Test Score
Low-Achieving Classroom 0.71
High-Achieving Classroom 0.18
* Student scoring half a standard deviation above the mean on ITBS
reading test in the 3rd grade. Results significant at the .01 level.
Impact of Higher Grading Standards on Low-Achieving Students*
Years of Extra Growth in
ITBS Math Test Score
Low-Achieving Classroom 0.07
High-Achieving Classroom 0.28
* Students scoring half a standard deviation below the mean on ITBS math
test in the 3rd grade. Results significant at the .1 level.
Note: Graphs present the estimated impact of increasing teachers'
grading standards by one standard deviation. The average test scores of
classrooms labeled "low achieving" were 1.5 standard deviations below
the mean for all classrooms. The average test scores in high-achieving
classrooms were 1.5 standard deviations above the mean.
SOURCE: Authors
Note: Table made from bar graph.
--David N. Figlio is a professor of economics at the University of Florida University of Florida is the third-largest university in the United States, with 50,912 students (as of Fall 2006) and has the eighth-largest budget (nearly $1.9 billion per year). UF is home to 16 colleges and more than 150 research centers and institutes. and a research associate of the National Bureau of Economic Research The National Bureau of Economic Research (NBER) is a "private, nonprofit, nonpartisan research organization" dedicated to studying the science and empirics of economics, especially the American economy. . Maurice E. Lucas is director of research and assessment for the school board of Alachua County, Florida. |
|
||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion