Computers and proof: applying automated reasoning to prove mathematical theorems.The late Paul Erdos liked to describe a mathematician as "a device for turning coffee into theorems." To Erdos, however, it wasn't enough to cobble together cobble together Verb [-bling, -bled] to put together clumsily: a coalition cobbled together from parties with widely differing aims Verb 1. any old proof to establish the truth of a conjecture. He subscribed to the notion that God has a book containing all the theorems of mathematics, together with their most elegant and beautiful proofs. The goal of mathematicians was to uncover these sublime instances of logical reasoning The three methods for logical reasoning, deduction, induction and abduction can be explained in the following way: [1] Given preconditions α, postconditions β and the rule R1: α ∴ β (α therefore β). . When Erdos wanted to express particular appreciation of a proof, he would exclaim ex·claim v. ex·claimed, ex·claim·ing, ex·claims v.intr. To cry out suddenly or vehemently, as from surprise or emotion: The children exclaimed with excitement. v. , "This is one from the book!" One result unlikely to qualify for "the book" is last fall's proof of the so-called Robbins conjecture, which concerns an aspect of the basic rules of logic. Nonetheless, the proof is noteworthy because it was found by a computer program, succeeding spectacularly where mathematicians had failed. Originally proposed in the 1930s by Herbert Robbins Herbert Ellis Robbins (born January 12, 1915 in New Castle, Pennsylvania; died February 12, 2001 in Princeton, New Jersey) was a mathematician and statistician who did research in topology, measure theory, statistics, and a variety of other fields. , now at Rutgers University Rutgers University, main campus at New Brunswick, N.J.; land-grant and state supported; coeducational except for Douglass College; chartered 1766 as Queen's College, opened 1771. Campuses and Facilities Rutgers maintains three campuses. in New Brunswick New Brunswick, province, Canada New Brunswick, province (2001 pop. 729,498), 28,345 sq mi (73,433 sq km), including 519 sq mi (1,345 sq km) of water surface, E Canada. , N.J., the problem had stumped everyone who tackled it over the years. It finally succumbed to a computer program designed to reason in a general way rather than a program designed to solve a particular problem, whether an equation or the moves of a chess game. Called EQP EQP Equipment EQP Environment(al) Quality Program EQP End Quantity Pricing EQP Elder's Quorum President EQP Education Quality Partner EQP Equipment Object EQP Equalizer Presets for "equational prover," the automated reasoning Automated reasoning is an area of computer science dedicated to understanding different aspects of reasoning in a way that allows the creation of software which allows computers to reason completely or nearly completely automatically. software was developed by computer scientist William McCune of the Argonne National Laboratory Argonne National Laboratory, research center, based in Argonne, Ill., 27 mi (43 km) SW of downtown Chicago, with other facilities at the Idaho National Engineering Laboratory, 50 mi (80 km) W of Idaho Falls, Idaho. Founded in 1946 by the U.S. in Illinois. "I was surprised that the problem finally yielded to the program because some powerful minds had tried to solve it and hadn't succeeded," says mathematician Larry Wos, a colleague of McCune's at Argonne. "It's an impressive result. We've made a lot of progress in automated theorem proving Automated theorem proving (ATP) or automated deduction, currently the most well-developed subfield of automated reasoning (AR), is the proving of mathematical theorems by a computer program. in the last few years." "It's a real breakthrough," says mathematician Stanley N. Burris of the University of Waterloo The University of Waterloo (also referred to as UW, UWaterloo, or Waterloo) is a medium-sized research-intensive public university in the city of Waterloo, Ontario, Canada. The school was founded in 1957. in Ontario. "I had no expectations that a proof would emerge at this time." McCune's computer-generated proof is to be published in a future issue of the Journal of Automated Reasoning. Computers are not new to the business of proving theorems. In the past, however, they generally served as bookkeepers or assistants to check the large number of special cases needed to establish a theorem (SN: 12/24&31/88, p. 406). Mathematicians or computer scientists outlined the necessary steps, wrote the special-purpose software, and specified the calculations required for a proof. "What we do is quite different," McCune says. Though the computer's search has constraints, neither the path to a result nor the ultimate outcome is determined beforehand. Indeed, because typically no one knows any route to a solution, the search can't be guided in a meaningful way. Wos and his coworkers began developing software for automated reasoning in the 1960s. They focused on how to describe a mathematical problem Mathematical problem may mean two slightly different things, both closely related to mathematical games:
"We never tried to imitate what people do," Wos insists. The idea was to give an automated reasoning system the statement of a conjecture along with a few rules of thumb constraining the search among the infinite number infinite number a number so large as to be uncountable. Represented by 8, frequently obtained by 'dividing' by zero. of possible logical paths to a proof, then allow it to proceed on its own. Advances came in the form of improved strategies for keeping the computer from getting entangled en·tan·gle tr.v. en·tan·gled, en·tan·gling, en·tan·gles 1. To twist together or entwine into a confusing mass; snarl. 2. To complicate; confuse. 3. To involve in or as if in a tangle. in lengthy chains of reasoning that apparently led nowhere. Those improvements, along with faster computers and better software engineering, eventually produced an agile reasoning system called Otter, aimed at questions in abstract algebra
Abstract algebra is the subject area of mathematics that studies algebraic structures, such as groups, rings, fields, modules, and formal logic. Using Otter, McCune and his coworkers have in recent years solved a wide variety of mathematical problems, generating original proofs of theorems in logic, algebraic geometry algebraic geometry, branch of geometry, based on analytic geometry, that is concerned with geometric objects (loci) defined by algebraic relations among their coordinates (see Cartesian coordinates). , group theory, and other areas of mathematics. A few years ago, McCune decided to make a fresh start and develop a "daughter of Otter," incorporating techniques and strategies that researchers had found useful while experimenting with Otter during the previous decade. "Otter was just getting too big and too patched-up," McCune explains. "I wanted a new program that was easier to modify to try things out." The result was EQP. The program incorporated a number of strategies for selecting possible search paths, including simply following the path defined by the smallest equation or formula at certain steps or side-stepping expressions containing more than, say, 100 logical relationships. Initial tests of EQP were encouraging, and the program succeeded in proving, sometimes for the first time, several theorems of special interest to a few logicians and other experts. These results, however, were of limited value to most mathematicians. "The trouble was that [McCune and his coworkers] had never really done a mainstream problem of sufficiently wide interest," Burris remarks. Until that happened, "mathematicians would have no appreciation of how powerful automated theorem proving could be." As a benchmark test of his system, McCune turned to the Robbins conjecture, which states that a set of three equations in logic is equivalent to a Boolean algebra Boolean algebra (b `lēən), an abstract mathematical system primarily used in computer science and in expressing the relationships between sets (groups of objects or concepts). (see box). In other words Adv. 1. in other words - otherwise stated; "in other words, we are broke"put differently , the problem is to determine whether this particular set of equations incorporates the basic laws of Boolean logic The "mathematics of logic," developed by English mathematician George Boole in the mid-19th century. Its rules govern logical functions (true/false) and are the foundation of all electronic circuits in the computer. , which specify the different ways in which collections of objects can be combined or manipulated using logic operations such as "and," "or," and "not." Such logic underlies the workings of today's digital computers and is often used to facilitate database searches. Robbins himself worked on the problem, and it was later taken up by others, including the late Alfred Tarski Alfred Tarski (January 14, 1902, Warsaw, Russian-ruled Poland – October 26, 1983, Berkeley, California) was a logician and mathematician who spent four decades as a professor of mathematics at the University of California, Berkeley. , a prominent logician at the University of California, Berkeley The University of California, Berkeley is a public research university located in Berkeley, California, United States. Commonly referred to as UC Berkeley, Berkeley and Cal . When Tarski failed to make any headway, he handed the problem out as a challenge to graduate students and visitors. Wos first came across the problem in 1979, when he helped a student develop a novel way of attacking it by working backwards-that is, by finding conditions that, if true, would prove the theorem. That promising start, however, did not lead to an immediate proof. At the same time, the problem had characteristics ideal for an automated attack. Embodied in just three equations, the Robbins conjecture could be stated succinctly in a form a computer could understand. Moreover, researchers could identify a number of conditions that, if true, would each point to a complete proof. "We tried it many times using Otter and other programs," Wos says. No luck. Last fall, McCune's EQP was ready for a new challenge. McCune let the prover software grapple continuously with the Robbins conjecture. He checked the computer every day for progress. On the eighth day, the computer found the answer and stopped. It had proved that the Robbins conjecture is true. Fresh out of the computer, the proof consisted of a cryptic chain of lengthy, practically unreadable statements of logic. McCune then refined EQP's procedures to obtain somewhat more streamlined proofs. In the meantime Adv. 1. in the meantime - during the intervening time; "meanwhile I will not think about the problem"; "meantime he was attentive to his other interests"; "in the meantime the police were notified" meantime, meanwhile , several mathematicians independently checked the computer proof and found it correct. Burris, for one, converted the proof's long statements into simpler forms that could be more readily understood by humans. "I ended up with a lot of little equations," he says. "You could easily sit on a bus and go through the hundred or so steps of the proof." Philosophy graduate student Branden E. Fitelson of the University of Wisconsin-Madison “University of Wisconsin” redirects here. For other uses, see University of Wisconsin (disambiguation). A public, land-grant institution, UW-Madison offers a wide spectrum of liberal arts studies, professional programs, and student activities. also worked through the result. "It's a very complicated proof, not at all elegant or conceptual," he comments. "It's a perfect example of the type of proof that a machine can find but we can't because of its complexity." EQP's success raises some interesting questions about mathematical creativity. If a mathematician had proved the conjecture, the achievement would have been heralded as a significant accomplishment. Yet it's highly unlikely that a mathematician would have come up with the kind of proof that EQP found. At the same time, mathematicians generally have no way of telling beforehand-just by looking at a problem or a conjecture-what kind of attack will subdue it. In fact, one of the attractions of the Robbins conjecture was that it looked vulnerable to techniques mathematicians had used to prove similar theorems, differing only in what seemed like minor details. "There's no general theory of when [various methods] are not going to work," Fitelson says. "That makes life interesting, but it's also hard to predict how things are going to work out." No one knows whether the computer proof generated by EQP represents a fluke success or the first of many triumphs. "That's something we'll be able to judge only in hindsight," Burris says. "Though we have strong theorem provers, we don't yet know what power is required to be really competitive with human beings." Nonetheless, he adds, "we now know [automated theorem proving] can do something significant, and we didn't know that a year ago." McCune, Wos, and others are taking a closer look at EQP's performance to see if they can learn something that could help with comparable problems. "What was it doing during those 8 days?" Wos wonders. "Is there some kind of strategy that would have tightened up the reasoning?" The researchers are also interested in extending the kind of deductive reasoning Deductive reasoning Using known facts to draw a conclusion about a specific situation. an automated theorem prover can apply from relatively simple logical relationships involving equality to more complicated expressions. They would like to introduce mathematical induction, which would allow the computer to make inferences. "Our programs do not learn, do not make judgments, and they do not invent concepts," Wos notes. Nonetheless, it may be possible to program them to be self-analytical-to recognize in a mechanical sense when searches are not going well and to shift strategies. McCune and Wos are also trying to interest mathematicians in suggesting problems that could be tried out on EQP or Otter. McCune, for example, is already working with R. Padmanabhan of the University of Manitoba Location The main Fort Garry campus is a complex on the Red River in south Winnipeg. It has an area of 2.74 square kilometres. More than 60 major buildings support the teaching and research programs of the university. in Winnipeg, using reasoning software to establish a large number of simple, unproven conjectures as part of a larger effort. "Those conjectures are the kind of thing that might take a mathematician an hour to do by hand but only a few seconds with a theorem prover," McCune says. Wos and McCune envision offering mathematicians the option of using a computer to free them from the drudgery of finding proofs, allowing them to spend more time on harder, more interesting problems. "Automated reasoning could complement your reasoning, so you could freelance more and use your intuition more," Wos argues. Programs such as Otter and EQP already work well in specialized areas of pure mathematics, such as the theory of algebraic 1. (language) ALGEBRAIC - An early system on MIT's Whirlwind. [CACM 2(5):16 (May 1959)]. 2. (theory) algebraic - In domain theory, a complete partial order is algebraic if every element is the least upper bound of some chain of compact elements. systems. "I think that computer-generated proofs will have a significant influence in the near future [in those fields]," says mathematician Kenneth Kunen Kenneth Kunen is a professor of mathematics at the University of Wisconsin-Madison who works in set theory and its applications to various areas of mathematics, such as set-theoretic topology and measure theory. of the University of Wisconsin-Madison. Similar approaches could help engineers design circuits or allow computer programmers to verify the correctness of their software. Indeed, research groups throughout the world are working on such applications of automated reasoning. Computers have yet to come up with a proof that belongs in "the book," and they can't turn coffee into theorems. However, the power of theorem-proving programs continues to increase. "Can machines reason at the same level as humans?" Burris asks. "In the long run, I would bet on it, but it's unlikely I'll see it in my lifetime." RELATED ARTICLE: Boolean logic Boolean algebra is a mathematical model of some of the basic rules of logic. It includes such laws as "for any proposition P, the negation of the negation of P means the same thing as P" or "for any two propositions P, Q, the conjunction of P and Q is false if and only if one or both of them is false." The Robbins problem is equivalent to proving that the equation not(not(P)) = P can be derived from the following three equations: P or Q = Q or P; (P or Q) or R = P or (Q or R); not(not(P or Q) or not(P or not(Q))) = P. |
|
||||||||||||||||||

`lēən)
Printer friendly
Cite/link
Email
Feedback
Reader Opinion