Who's really #1?It happens every fall. Fierce arguments erupt over which college football team is the best in the country, As the season progresses, this frenzy of head scratching and navel gazing mounts until the climactic bowl games on New Year's Day New Year's Day, among ancient peoples the first day of the year frequently corresponded to the vernal or autumnal equinox, or to the summer or winter solstice. In the Middle Ages it was celebrated among Christians usually on Mar. 25. finally settle the issue. Or do they? Even as the mud on the fields of battle begins to congeal con·geal v. con·gealed, con·geal·ing, con·geals v.intr. 1. To solidify by or as if by freezing: "My aim . . . was to take the Hill by storm before . . . , select sportswriters and coaches vote to rank the top 25 U.S. college teams and award the unofficial national championship. But in most years, when the final polls appear, grumbling echoes throughout the land. Allegations of bias, unfairness, cronyism Cronyism Tammany Hall Manhattan Democratic political circle notorious for spoils system approach. [Am. Hist.: Jameson, 492] , and petty politicking taint taint an unpleasant odor and flavor in a human foodstuff of animal origin. Caused by the ingestion of the substance, commonly a plant such as Hexham scent, or while in storage, e.g. milk stored with pineapples, or as a result of animal metabolism, e.g. boar taint. the result. This season, those cries of anguish and disgust sounded early, In an epic confrontation on Nov. 13, Notre Dame Notre Dame IPA: [nɔtʁ dam] is French for Our Lady, referring to the Virgin Mary. In the United States of America, Notre Dame defeated Florida State, 31 to 24, to claim the top spot in the national polls. The Associated Press Associated Press: see news agency. Associated Press (AP) Cooperative news agency, the oldest and largest in the U.S. and long the largest in the world. media poll dropped Florida State from first to second place, while the CNN/USA Today coaches poll put undefeated Nebraska second and Florida State third. The media voting reflected a fanatic desire to see a rematch between Notre Dame and Florida State in the season-ending Fiesta Bowl The Fiesta Bowl, now sponsored by Tostitos tortilla chips (a Frito-Lay product), is a United States college football game played annually since 1971. Originally, the game was hosted in Tempe, Arizona at Sun Devil Stadium where it remained until 2006. , and that could occur only if Florida State retained its second-place standing. But such obvious maneuvering was enough to trigger outbursts of outrage from a number of commentators. "Voters are no longer rating the best teams. They are creating their idea of the best game," puffed Malcolm Moran Malcolm Moran is an American sportswriter who has written for four newspapers over 30 years – Newsday, The New York Times, Chicago Tribune and USA Today. of the New York New York, state, United States New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of Times. "Credibility will always be a problem in a business of self-interest and mutual distrust," he concluded. "But a computerized power rating... would restore a sense of fairness." It was a direct appeal for a clean, mathematical answer to a messy, human situation. Similar thoughts had occurred to mathematician James P. Keener of the University of Utah The University of Utah (also The U or the U of U or the UU), located in Salt Lake City, is the flagship public research university in the state of Utah, and one of 10 institutions that make up the Utah System of Higher Education. in Salt Lake City after the 1984 college football season. That was the year sportswriters and coaches voted Utah rival Brigham Young University Brigham Young University, at Provo, Utah; Latter-Day Saints; coeducational; opened as an academy in 1875 and became a university in 1903. It is noted for its law and business schools. (BYU BYU Brigham Young University BYU Bayou BYU Bob's Your Uncle BYU Bayreuth, Germany - Bindlacher Berg (Airport Code) BYU Beyond Your Understanding ) the national title on the strength of its undefeated season. But these victories had come against generally undistinguished un·dis·tin·guished adj. 1. a. Marked by no peculiar quality; not distinguished; ordinary: an undistinguished appearance. b. opponents. Did this BYU team, despite its record, deserve to be number one? The voters in the polls had said yes. Perturbed per·turb tr.v. per·turbed, per·turb·ing, per·turbs 1. To disturb greatly; make uneasy or anxious. 2. To throw into great confusion. 3. by this result, Keener set out to see whether a mathematical scheme, which automatically takes into account the strength of a team's opponents, would provide a more satisfactory answer. "My aim was to get some kind of fair, mathematically based ranking system that removes the subjectivity," Keener says. Ranking places teams in an order from first to last, while rating assigns numerical values to their relative strengths. Once you have ratings, you can easily generate rankings. Keener turned to a ranking system suggested a few years earlier by Joseph B. Keller Joseph B. Keller is an American mathematician who specializes in applied mathematics. He is famous for his work on the "Geometrical Theory of Diffraction". He is also known for his contribution to the Einstein-Brillouin-Keller method (EBK), which provided new methods to compute , an applied mathematician at Stanford University Stanford University, at Stanford, Calif.; coeducational; chartered 1885, opened 1891 as Leland Stanford Junior Univ. (still the legal name). The original campus was designed by Frederick Law Olmsted. David Starr Jordan was its first president. , Keller had devised his ranking strategy while serving a term on the board of trustees board of trustees Politics The posse of thugs who oversee an institution's administration. See Board of directors. of the Society for Industrial and Applied Mathematics
The Society for Industrial and Applied Mathematics (SIAM) was founded by a small group of mathematicians from academia and industry who met in Philadelphia in 1951 to start an organization (SIAM). He had become interested in ascertaining how well SIAM's published journals stacked up against others in the same field. He decided he might rank SIAM journals by counting how many times they were cited by other publications. But this wasn't good enough. There had to be a way of adjusting the scores so that being cited in a prestigious, influential journal counted for more than being cited in some "garbage" journal, Keller noted at the time. This sounds like a chicken-and-egg situation. It's hard to see how one can determine which journals are more influential without knowing how they rank, and ranks can't be determined without knowing how influential the journals are. Remarkably, a relatively obscure mathematical result known as the Perron-Frobenius theorem furnishes a recipe for calculating just such a ranking. "The theorem shows that this circular reasoning actually has a solution," Keener says. "It also hints at how to arrive at that solution." Using this recipe and given the right data, a computer can establish the rankings in a matter of seconds. Keller never got a chance to apply his scheme to journals. He did try it on baseball teams, however, using the number of times one team beat another team as the entry in the table of values at the heart of his computation of rankings. In the 1984 National League baseball season, the Eastern Division had proved stronger than the Western Division, but teams within a division play more games against each other than against teams in the other division. Keller's scheme provided a way of ranking all 12 teams while taking into account the fact that the six teams in the stronger division faced tougher opponents more often and deserved greater credit for their victories. College football, however, involves many more teams, which individually play far fewer games than professional baseball teams. The issue comes down to what value should go into the space reserved for the outcome of a matchup between each pair of teams in the table used for computing rankings. In the simplest possible scheme, one can assign a single point for a win, half a point for a draw, and zero for a loss and calculate rankings on this basis. But unlike baseball teams, football teams rarely play each other more than once during a season. Moreover, all the credit goes just to the winner, whether the score is close or lopsided. Utah's Keener chose to allocate the value per game between the two competing teams on the basis of the game score, and he explored various ways of making this distribution. Each method he looked at showed certain biases, rewarding or penalizing close defensive contests, wild shootouts, blowouts, and other types of game outcomes in different ways. It's this aspect that bothers Stanford's Keller. "With baseball, it's clear what the entries ought to be - the number of times one team beats another," Keller says. "In football, you have to make up some other way to decide what number goes in as an entry, It's a little too arbitrary" Nonetheless, once the ground rules however arbitrary -- are set, Keener's scheme produces the required rankings. No amount of politicking or favoritism can budge the results. Inevitably, coaches and fans who saw Keener's computer-generated rankings found objectionable features. For example, one of his schemes clearly penalizes a team that plays a weak schedule because such a team can never earn enough points playing weaker opponents to increase its own earned score. "Of course, this is not all bad," Keener responds. "Simply because a team is undefeated does not mean it should have the highest rank, particularly if it did not play a difficult schedule." Indeed, when Keener took a look at Brigham Young's 1984 season, the team finished out of the top 10. "There's no objective ranking scheme that I know of that makes BYU number one or even number two," he claims. Keener is not the only mathematician or statistician with a strong interest in the sporting world. Many have dabbled dab·ble v. dab·bled, dab·bling, dab·bles v.tr. To splash or spatter with or as if with a liquid: "The moon hung over the harbor dabbling the waves with gold" in the art of ranking, not just in football but also in tennis, basketball, hockey, soccer, racquetball racquetball, sport played indoors by two or four players, combining elements of court handball and such racket games as squash racquets. It is played on a standard handball court 40 ft (12.2 m) long, 20 ft (6. (SN: 10/24/87, p. 269), and other sports. Some have also delved into rating teams or players in order to predict the outcome of future games. The statistics literature, for example, includes a 1976 paper titled "Predicting the outcome of football games, or can you make a living working one day a week?" Moreover, the New York Times and USA Today USA Today National U.S. daily general-interest newspaper, the first of its kind. Launched in 1982 by Allen Neuharth, head of the Gannett newspaper chain, it reached a circulation of one million within a year and surpassed two million in the 1990s. publish separate team rankings and ratings based on proprietary mathematical formulas. The ratings provide a numerical measure of a team's relative strength and have some value in predicting by how much one team would beat another if they played at a neutral site. The fine print accompanying the newspaper charts notes that these analyses are based on each team's scores, with the emphasis on who won, by what margin, and against what quality of opposition. The New York Times computer model also "collapses" runaway scores, takes note of home-field advantage, and counts recent games more than earlier games. The USA Today computer ratings, compiled by Jeff Sagarin Jeff Sagarin is an American sports statistician well-known for his development of a methodology for ranking and rating sports teams in a variety of sports. His ratings have been a regular feature in the USA Today sports sections since 1985, have been used by the NCAA Tournament , incorporate similar adjustments. Harvard statistician Hal Stern has played with the data to see if he can tease out the mathematical procedures underlying these computer ratings. Neither appears to be based on a method as straightforward and elegant as Keener's. "The more mathematical stuff that Keener has done is interesting and helpful if you want a mathematical foundation for ranking," Stern says. "But from a statistician's perspective, the question is which model predicts best." Gamblers, for example, want to predict the outcomes of games. To them, a team's rating is a far more valuable piece of information than its rank because the rating suggests the potential margin of victory over a given opponent. On the basis of his own experiments, Stern suspects that the USA Today computer rating is based on a "least squares" method. When one team plays at home against another, the game's outcome is predicted by calculating the difference between the two teams' ratings and adding some factor to account for the home-field advantage. A team's rating for a single season is obtained by minimizing the sum of the squares of the differences between the actual outcomes of games and the predicted outcomes. "I've used a fairly standard least squares method least squares method Statistical method for finding a line or curve—the line of best fit—that best represents a correspondence between two measured quantities (e.g., height and weight of a group of college students). , and I come close to what's in USA Today," Stern says. "The small differences, I think, probably have to do with weighting games differently as the season progresses or using information from previous seasons." But most people can't quite bring themselves to trust the computer-generated rankings. After the Notre Dame-Florida State game, the New York Times rankings left Florida State, despite its loss, in first place and relegated winning Notre Dame to third. The USA Today computer poll ranked Florida State first and Notre Dame second. Things got even more confused in subsequent weeks. Fans find such counterintuitive coun·ter·in·tu·i·tive adj. Contrary to what intuition or common sense would indicate: "Scientists made clear what may at first seem counterintuitive, that the capacity to be pleasant toward a fellow creature is ... results a little hard to swallow. Besides, the computer polls don't even agree among themselves, they complain. College football is not the only sport in which rankings and ratings play a role. When the National Collegiate Athletic Association National Collegiate Athletic Association (NCAA) Organization that administers U.S. intercollegiate athletics. It was formed in 1906 but did not acquire significant powers to enforce its rules until 1942. Headquartered at Indianapolis, Ind. (NCAA NCAA abbr. National Collegiate Athletic Association ) chooses the participants for its annual basketball tournament, it uses a rating scheme to help determine which teams should fill unclaimed spots in the field of 64. "They have a couple of fudge factors, and they play with those factors to make it come out right," Stern says. "The problem is that they now seem to be using the same fudge factors in other sports." Those factors may distort the selection process. Responding to complaints from Harvard's hockey coach, Stern is trying to obtain data to assess whether the same fudge factors that apply to basketball teams would also work for hockey. "It's a very tricky problem," Stern admits. "The key point is to try and define carefully what you would like to do [with the ratings]? For example, if the ratings are to be used to choose teams for participation in a championship playoff, it's inappropriate to use information from prior seasons, he argues. If prediction of future games is the ultimate goal, then information from prior seasons may be useful. In the end, when it comes to rating or ranking teams, the value of any particular method resides in the mind of the beholder. "There is no unique way to devise a ranking scheme," Keener says. "The different ranking schemes give different rankings because they weigh important factors differently," The same conclusion applies to rating methods. And there's always the temptation to fiddle with Verb 1. fiddle with - manipulate, as in a nervous or unconscious manner; "He twiddled his thumbs while waiting for the interview" twiddle manipulate - hold something in one's hands and move it the mathematics to obtain a result more in line with human thinking, in effect enshrining preconceived notions of how things should turn out. "Each of the schemes proposed... has strengths and weaknesses," Keener adds. "But invariably in·var·i·a·ble adj. Not changing or subject to change; constant. in·var i·a·bil when a method is
tweaked See tweak. to get rid of some 'undesirable' feature, another
'counterintuitive' result shows up."
Nonetheless, mathematical ranking schemes do remove some- though not all -- of the subjectivity that plagues the national polls. And there's a lot of entertainment value in these methods. "I find that the few minutes I spend each week during the fall collecting and entering data for my computer program are justified by the increased student interest in the mathematics of the methods." Keener writes in the March SIAM Review. "It is not difficult for students to write their own computer programs to test some of these ideas on their favorite competition." So let the games begin, and may the best numbers win. [TABULAR DATA OMITTED] |
|
||||||||||||||||||

i·a·bil
Printer friendly
Cite/link
Email
Feedback
Reader Opinion