Exceptions to the rule; at least two human languages contain grammatical features that put them outside some well-known grammars.In the African language Bambara, the phrase wuluniyninanyinina o wulunyininanyinina means "whoever searches for dog serchers." Although this expression doesn't slip smoothly into typical cocktail party chatter, the recent recognition of this type of grammatical construction Noun 1. grammatical construction - a group of words that form a constituent of a sentence and are considered as a single unit; "I concluded from his awkward constructions that he was a foreigner" construction, expression settles a longstanding linguistic dispute. At issue is the nature of the simplest grammer that encompasses all human languages. "Striking discoveries about human languages are not very common in science," says linguist lin·guist n. 1. A person who speaks several languages fluently. 2. A specialist in linguistics. [Latin lingua, language; see Geoffrey K. Pullum of the University of California The University of California has a combined student body of more than 191,000 students, over 1,340,000 living alumni, and a combined systemwide and campus endowment of just over $7.3 billion (8th largest in the United States). at Santa Cruz Santa Cruz, city, United States Santa Cruz (săn`tə kr z), city (1990 pop. 49,040), seat of Santa Cruz co., W Calif., on the north shore of Monterey Bay; inc. 1866. . "But this is a nice instance of a genuinely new discovery about natural languages." The finding may even have implications for programming computers to understand human conversation (SN:7/27/85, p. 53). "It's now [recognized to be] a more demanding task," says Pullum. For decades, linguists A linguist in the academic sense is a person who studies linguistics. Ambiguously, the word is sometimes also used to refer to a polyglot (one who knows more than 2 languages), or a grammarian, but these two uses of the word are distinct. have delved into words and senteces, pondering the subtle interplay of position, meaning and pronunciation that adds up to a meaningul, grammatical sentence. Part of this study has been an attempt to pick out the hidden framework that underlies human languages and allows them to work as effective message carriers. In any human language, there are certain things that native speakers, even without the prompting of a schoolteacher, naturally and readily identify as grammatical. These expressions are part of the language, while other, "ungrammatical un·gram·mat·i·cal adj. 1. Not in accord with the rules of grammar. 2. Not in accord with standard or socially prestigious linguistic usage. un " sentences clearly don't belong. This ability to separate grammatical from ungrammatical sentences provides some of the data that go into the study of language structure. But it isnht enough just to observe what people say and write. Linguists also try to make some sense out the observations and to explain why languages seem to be structured in particular ways. "You wind up hypothesizing a lot of theoretical machinery that's not directly reflected in any way in the observed data," says P. Stanley Peters Jr. of Stanford University Stanford University, at Stanford, Calif.; coeducational; chartered 1885, opened 1891 as Leland Stanford Junior Univ. (still the legal name). The original campus was designed by Frederick Law Olmsted. David Starr Jordan was its first president. . As a result, linguists have developed an array of competing grammars, all trying to shed light on how human languages work. One of the earliest grammars to be postulated pos·tu·late tr.v. pos·tu·lat·ed, pos·tu·lat·ing, pos·tu·lates 1. To make claim for; demand. 2. To assume or assert the truth, reality, or necessity of, especially as a basis of an argument. 3. and studied was based on the observation that in grammatical sentences, certain words tend to appear in clusters. "they are grouped into phrases, and those phrases are broken down into subphrases and so on," says Peters. "Phrase structure is somehow an important part of the language." These word clusters can be put into categories: noun phrases, verb phrases, adjective phrases and perhaps a few dozen other types. The idea is that all phrases of the same type ought to be completely interchangeable. In such a phrase-structure grammar," anywhere you have a grammatical sentence with one phrase of a certain type," says Peters, "you can just substitute for it any other phrase of the same type, and the result will be a grammatical sentence." This concept leads to a grammar that is especially simple mathematically. Known as a "context-free phrase-structure grammar," it consists of a finite list of phrase types and a finite set In mathematics, a set is called finite if there is a bijection between the set and some set of the form where n is a natural number. (The value n = 0 is allowed; that is, the empty set is finite.) An infinite set is a set which is not finite. of rules for piecing together sentences. "You're off to the races," says Peters. "You can generate an infinite set (mathematics) infinite set - A set with an infinite number of elements. There are several possible definitions, e.g. (i) ("Dedekind infinite") A set X is infinite if there exists a bijection (one-to-one mapping) between X and some proper subset of X. of sentences with a finite set of rules." But does this simple structure really account for all grammatical features of all human languages? Early on, researchers ran into problems. In practice, for example, it turns out that not dozens but thousands of phrase types have to be identified. In addition, this grammar sometimes requires an unwieldy number of complicated rules. Into this confusing situation stepped Noam Chomsky Noun 1. Noam Chomsky - United States linguist whose theory of generative grammar redefined the field of linguistics (born 1928) A. Noam Chomsky, Chomsky of the Massachusetts Institute of Technology Massachusetts Institute of Technology, at Cambridge; coeducational; chartered 1861, opened 1865 in Boston, moved 1916. It has long been recognized as an outstanding technological institute and its Sloan School of Management has notable programs in business, . He argued in the 1950s that many of the problems can be overcome by defining a small core of phrase types and adopting new rules allowing "syntactic transformations." These transformations generate other forms of sentences that make up the rest of the language. Chomsky's theory "overcame some apparent shortcomings A shortcoming is a character flaw. Shortcomings may also be:
The basic constituent particles of which elementary particles are understood to be composed. Theoretical models built on the quark concept have been very successful in understanding and predicting many phenomena in the physics of elementary particles. instead of three, he says. The theory predicts all sorts of funny combinations that don't actually show up. Chomsky also conjectured that human languages are too complex to fit within either finite-state or context-free grammars, the two simplest types. In the case of a finite-state grammar, this was easy to prove. In such a grammar, sentences are generated by a series of choices made one after the other. To generate a sentence, one picks the first word, then the second, then the third, and so on until the sentence is complete. Chomsky said that a grammar based on such a system can't handle English sentences like "The bananas in the crate...are yellow." The choice of the word "are" depends not on the words immediately before it but on the words "bananas," which may be separated from "are" by an indefinite amount of material. Similarly, Chomsky contended that while context-free phrase-structure grammars may succeed in distinguishing the grammatical sentences of some languages, they do so only with unnecessarily complex rules. Although this contention wasn't rigorously proven, most linguistic scholars accepted it and began to focus on transformational grammars. However, as transformational grammars themselves grew more and more complex, a few linguists began to feel that "some truth had been thrown overboard with the falsehood," says Peters. In particular, several researchers began to look again at phrase-structure grammars to see if they could come up with a theory that was mathematically neater and more tightly constrained than transformational theory. In 1982, Pullum and Gerald Gazdar Gerald Gazdar (born February 24, 1950) is a linguist and computer scientist. He graduated from the University of East Anglia in 1970, and completed his master's degree in 1972 at the University of Reading, where he also received his PhD in 1976. of the University of Sussex in Brighton, England, published a paper reviewing all of the important efforts during the previous 25 years to prove that some languages, including Dutch, English and Mohawk, are not context-free. They concluded that none of the arguments was valid. "This made it once again a live theoretical issue," says Pullum. These conclusions encouraged Pullum and others to examine context-free phrase-structure grammars more closely. "This theory had survived some pretty powerful empirical testing," says Peters. It was attractive, says Pullum, because "a context-free language The introduction to this article provides insufficient context for those unfamiliar with the subject matter. Please help [ improve the introduction] to meet Wikipedia's layout standards. You can discuss the issue on the talk page. can always be handled fairly efficiently. It's easy to determine whether a sentence is grammatical." This makes such a language amenable to computer processing. "If you go much beyond context-freeness," says D. Terence Langendoen of the City University of New York The City University of New York (CUNY; acronym: IPA pronunciation: [kjuni]), is the public university system of New York City. Graduate Center, "you get into classes of grammars that are very hard to understand formally. They've quite obscure and unpleasant in a mathematical sense." Recently, the picture changed. In the August LINGUISTICS AND PHILOSOPHY, two papers show for the first time that there are human languages that cannot possibly be described by a context-free phrase-structure grammar. Stuart M. Shieber of SRI International (company) SRI International - One of the world's largest contract research firms. Founded in 1946 in conjuction with Stanford University as the Stanford Research Institute, they later became fully independent and were incorporated as a non-profit organisation under U.S. in Menlo Park Menlo Park. 1 Residential city (1990 pop. 28,040), San Mateo co., W Calif.; inc. 1874. Electronic equipment and aerospace products are manufactured in the city. Menlo College and a Stanford Univ. research institute are there. 2 Uninc. , Calif., shows it for a Swiss dialect of German, and Christopher Culy, now with the U.S. Peace Corps in Africa, shows it for Bambara, a language spoken in Mali. In general, says Pullum, context-free phrase-structure grammars can't define languages in which a "string" of arbitrary length in one part of a sentence must have an exact duplicate elsewhere. Both Shieber and Culy found such a structure or its analog in Swiss German Swiss German adj → suisse-allemand(e) Swiss German Swiss adj → deutsch-schweizerisch Swiss German adj → and in Bambara. Shieber concentrates on a class of subordinate clauses in which a string of verbs may be indefinitely distant from their direct objects. These objects, depending on the verb, may be in the accusative accusative (əky `zətĭv') [Lat.,=accusing], in grammar of some languages, such as Latin, the case typically meaning that the noun refers to the entity directly affected by an or the dative case Noun 1. dative case - the category of nouns serving as the indirect object of a verbdative oblique, oblique case - any grammatical case other than the nominative . The important point is that when such clauses are constructed, some mental bookkeeping bookkeeping, maintenance of systematic and convenient records of money transactions in order to show the condition of a business enterprise. The essential purpose of bookkeeping is to reveal the amounts and sources of the losses and profits for any given period. is needed to ensure that there are exactly enough of each type of verb and object. Something in one part of a sentence must correspond exactly to something elsewhere in the sentence. This example from Swiss German illustrates the principle involved. The clause ... mer d'chind em Hans es huus haend wele laa halfe aastriiche can be translated literally as: "... we the children [accusative] Hans [dative dative (dā`tĭv) [Lat.,=giving], in Latin grammar, the case typically used to refer to an indirect object, i.e., a secondary recipient of an action. For example, him in I gave him a book is translated in Latin by a dative case. ] the house [accusative] have wanted let help paint." A smoother English translation turns the clause into "...we have wanted to let the children help Hans paint the house." Theoretically, in Swiss German there's no limit to the number of objects that must be attached to specific verbs elsewhere in the clause. "Natural languages can indeed cross the context-free barrier," concludes Shieber. Culy's demonstration turns on a doubling pattern. In Bambara, certain compound words contain a noun stem repeated twice. It must be the same noun stem each time, and the noun stem may be arbitrarily long. The word wulu, for instance, means "dog." The construction wulu o wulu, in which the noun is repeated exactly, means "whichever dog." Bambara also contains the following construction: wulu + nyini + la = wulunyinina. This translates as "one who searches for dogs" or as dog searchers." The construction can be "doubled" to produce wulunyininanyinina, which means "one who searches for dog searchers." Combining this construction with the one described in the previous paragraph produces the example that started off this article. The point is that, theoretically, a speaker can build up words of arbitrary length by way of a doubling scheme. Yet if such words are used in the "o" construction, the nouns on each side of "o" must be the same. "This very free process of redoubling causes the vocabulary of Bambara to be non-context-free," says Culy. Linguists are still debating whether English contains a similar feature. The sentence "Expert or no expert, I think he's wrong" may fill the bill. The noun "expert" some linguists argue, can be replaced by an arbitrarily long noun phrase. But because the same words must appear on either side of "or no," the situation is similar to that found in Bambara, or Swiss German. Not everyone, however, is convinced that this argument is valid. Nevertheless, to Pullum and others, the evidence now clearly points to the fact that at least two human languages are generally somewhat more complex than the simplest grammars would allow. In the end, Chomsky was right. "It seems now that we have sufficient grounds to believe that this difficult, and for 20 years controversial, issue has been resolved," says Langendoen. But context-free phrase-structure grammars are hard to give up. Because of their straightforward mathematical structure In mathematics, a structure on a set, or more generally a type, consists of additional mathematical objects that in some manner attach to the set, making it easier to visualize or work with, or endowing the collection with meaning or significance. , context-free languages are easy to analyze. Well-known and highly efficient algorithms do the job nicely. If the recently discovered exceptions prove to be minor, then something may still be salvaged. "The context-free grammars come very close to describing all structure of all languages, says Pullum. "In fact, the tricky patterns that make for the non-context-freeness, when studied in their own terms, turn out to be fairly easy to describe and likely to be tractable tractable easy to manage; tolerable. as well. "Although you can't deal with a language that has redoubling and duplication using a context-free grammar, and therefore there are certain parsing See parse. parsing - parser techniques, familiar in computer scient, that won't work on a language like that," he says, "nevertheless, you can easily devise another technique that will." The emphasis on grammar is important, says Langendoen. "To get a system to work effectively with language as we understand it," he says, "we're going to have to have pretty precise models of grammar in place." A computer needs to known more than just the meanings of words. "I still resist the conclusion that there's anything about natural languages that makes them wild and woolly," insists Pullum. "I think this kind of mathematical attention to the details of language structure is leading to the conclusion that we can deal with the structures of these languages very successfully using computers. I think the age of true natural language processing Natural language processing Computer analysis and generation of natural language text. The goal is to enable natural languages, such as English, French, or Japanese, to serve either as the medium through which users interact with computer systems such as is on the horizon." But there are deeper linguistic questions. "The mind is a black box," says Barbara Partee Barbara Hall Partee (born June 23, 1940 in Englewood, New Jersey) is a Distinguished University Professor of Linguistics and Philosophy Emerita at UMass-Amherst. Partee is the primary creator of modern formal semantics. External links
"You say something to me," says Peters. "I say something to you." Within a fraction of a second, he notes, people do what appears to be an unbelievably complicated analysis involving the linguistic structure. "One of the things we would like to understand about language," he says, "is how we're able to make the mental computations that are involved in going from the perception of a blast of noise...to grasping the information that the blast of noise actually represents." One appeal of context-free phrase-structure grammars was the ease with which they could be processed. "If something like this could really be made to work as a theory of language," says Peters, "then it would also give you a good start on answering the question of how it is all computationally feasible." That was the hope; the reality is somewhat different. The Culy and Shieber discoveries now show that developing the right grammar is more complicated than it had seemed. |
|
||||||||||||||||||

z)
Printer friendly
Cite/link
Email
Feedback
Reader Opinion