Diagramming sentences can save your company billions: this isn't some sort of cryptic plea to hire hungry, unemployed English majors. Rather, a software company--with CIA ties--has developed a means by which problem identification can be performed so that actual problems can be solved before the potential of recalls enter the picture--or text, as it were.In early May, the National Highway Traffic Safety Administration The National Highway Traffic Safety Administration (NHTSA, often pronounced "nit-suh") is an agency of the Executive Branch of the U.S. Government, part of the Department of Transportation. (NHTSA NHTSA National Highway Traffic Safety Administration (US government) ) released the March 2004 listing of vehicle recalls. Included in the report are large numbers of "potentially involved" vehicles. As in as many as 938,789 2000-03 Taurus/Sable models ("to correct a problem with a malfunctioning mal·func·tion intr.v. mal·func·tioned, mal·func·tion·ing, mal·func·tions 1. To fail to function. 2. To function improperly. n. 1. Failure to function. 2. stop lamp switch and/or associated wiring. This malfunctioning could render the stop lamps inoperable inoperable /in·op·er·a·ble/ (in-op´er-ah-b'l) not susceptible to treatment by surgery. in·op·er·a·ble adj. Unsuitable for a surgical procedure. or cause them to stay on all the time....") As many as 3,662,211 GM '00 to '04 Silverados, Sierras, Avalanches, and Escalade es·ca·lade n. The act of scaling a fortified wall or rampart. [French, from Italian scalata, ultimately from Latin sc EXTs ("the galvanized gal·va·nize tr.v. gal·va·nized, gal·va·niz·ing, gal·va·niz·es 1. To stimulate or shock with an electric current. 2. steel tailgate A conversion layer that lets IDE devices connect to the IEEE 1394 Firewire interface. support cables that retain the tailgate in the full open (horizontal) position may corrode cor·rode v. cor·rod·ed, cor·rod·ing, cor·rodes v.tr. 1. To destroy a metal or alloy gradually, especially by oxidation or chemical action: acid corroding metal. , weaken, and eventually fracture....") There are more. Plenty more. Chryslers and Bentleys. Maseratis and Winnebagos. Millions of cars. On an annual basis, billions of dollars. To say nothing of all of the people who may lose their interest in a particular vehicle manufacturer's products and spend their dollars elsewhere. Recalls are a non-trivial problem. A company that has its roots at the University of Utah The University of Utah (also The U or the U of U or the UU), located in Salt Lake City, is the flagship public research university in the state of Utah, and one of 10 institutions that make up the Utah System of Higher Education. may have a means to address vehicle problems that crop up before they aggregate to the tipping point The point in time in which a technology, procedure, service or philosophy has reached critical mass and becomes mainstream. See network effect. See also tip and ring. of recalls. As Craig Norris Craig Norris is a Canadian rock singer and radio personality. He is the lead singer for The Kramdens, and is also a host on CBC Radio 3, including the network's weekly record chart show The R3-30. External links
It's linguistics meets computer science. And the result is something known as "natural language processing Natural language processing Computer analysis and generation of natural language text. The goal is to enable natural languages, such as English, French, or Japanese, to serve either as the medium through which users interact with computer systems such as ." Which is simply (well, simple to say, anyway) a method that uses computers to process written language--such as the "comments" field in a warranty claim--for a useful purpose. Like finding problems fast. And taking action. Realize that when you look at those numbers of vehicles involved in recalls that there are a whole lot of warranty claims that may need to be read--and which probably aren't, given the vast numbers. But the Attensity approach handles this via silicon, not eyeballs The number of users. "There are 110 eyeballs" means there are 110 users currently online. See eyeball hang time. . And while this technology may sound as though it is something that you'd have to bust out "Bust Out" is the twenty-third episode of the HBO original series The Sopranos and the tenth of the show's second season. It was written by Frank Renzulli, Robin Green and Mitchell Burgess, directed by John Patterson and originally aired on Sunday March 19 2000. the supercomputers to handle, Attensity's Relational Extraction Server can run on a PC with a Pentium 4 and Windows XP The previous client version of Windows. XP was a major upgrade to the client version of Windows 2000 with numerous changes to the user interface. XP improved support for gaming, digital photography, instant messaging, wireless networking and sharing connections to the Internet. . NOT JUST A CUSTOMER. AN INVESTOR. How encouraging this is depends entirely on your point of view: One of the investors in Attensity is In-Q-Tel. Which is an investment firm established by the CIA CIA: see Central Intelligence Agency. (1) (Confidentiality Integrity Authentication) The three important concerns with regards to information security. Encryption is used to provide confidentiality (privacy, secrecy). . (Apparently that's not a secret.) One of the examples that's used to explain the function of the company's Relational Extraction Server is the following sentence: John Doe bought C4 from John Smith in Cairo on October 4, 2000. Given the amount of information that outfits like the CIA must process, having a natural language processor can certainly be advantageous. Which explains the CIA's interest. But while we're on the subject of sentences, here's one that Norris provides that comes from the auto industry: The bolt on the underside of the transmission cracked due to heat. He explains that a company Attensity is working with had been using another language processor. That language processor came to the conclusion that the transmission cracked, not the bolt, because the two words are next to each other. Attensity's algorithms came up with the right interpretation: the bolt cracked. That's because they do something that other systems apparently don't do. THIS WOULD HAVE MADE ENGLISH CLASS A WHOLE LOT EASIER. Simply stated, Attensity's product diagrams sentences. Remember the whole subject, verb, object, modifiers thing? That's what they do. Norris: "We're able to diagram sentences very fast: We can do Moby Dick Moby Dick pursued by Ahab and crew of Pequod. [Am. Lit.: Moby Dick] See : Quarry Moby Dick white whale pursued relentlessly by Captain Ahab; “It was the whiteness of the whale that above all things appalled me. in less than five seconds on a simple one-process or machine." So what difference does that make to building better cars, trucks, and the like? Plenty. TALKING IN CODES. The good news about language--the regular lingua franca lingua franca (lĭng`gwə frăng`kə), an auxiliary language, generally of a hybrid and partially developed nature, that is employed over an extensive area by people speaking different and mutually unintelligible tongues in order to that we use--is that it is so flexible. Which is also the bad news about language: It is open to interpretation. So when it comes to things like warranty forms, instead of using discursive language, code is deployed. One of the first companies in vehicle manufacturing that Attensity worked with was John Deere. Recalling what they learned early on in the engagement, Bart O'Brien, Attensity vp of Business Development, says, "We were surprised at how useless the codes were for actually identifying problems." And there are plenty of reasons why. For one thing, he says, "Codes have a fairly distant relationship to the cause of the failure unless the failure is strictly caused by a component failing." He explains, "The reporting system is component-oriented, so it almost assumes that all problems are component failures. But given the advent of things like Six Sigma Not to be confused with Sigma 6. Six Sigma is a set of practices originally developed by Motorola to systematically improve processes by eliminating defects.[1] A defect is defined as nonconformity of a product or service to its specifications. , the components are probably the least likely source of failure." For another: "Codes tend to disguise failures that have happened for other reasons--particularly assembly-related reasons that have no part number." PRIME NUMBERS There are infinitely many prime numbers. The first 500 are listed below, followed by lists of the first prime numbers of various types in alphabetical order. The first 500 prime numbers 2 3 5 7 11 13 17 19 23 29 31 37 41 43 47 53 59 61 67 71 . "You can have one problem that's coded in multiple different ways. And for logical reasons," says Norris. One reason is that there may not be a code that describes the actual problem, so they have to make do with what's available. (Or there could be a code, but it is so uncommon that the technician would have to look it up, and he's not paid to do that, so he goes with what he knows for the sake of expedience ex·pe·di·ence n. Expediency. Noun 1. expedience - the quality of being suited to the end in view expediency .) Another reason is that since the parts often flow through the warranty system, the problem is identified as being with the most expensive part involved since they're looking for Looking for In the context of general equities, this describing a buy interest in which a dealer is asked to offer stock, often involving a capital commitment. Antithesis of in touch with. reimbursement, so the prime part ID'd tends to be that costly one. (The stories that suppliers can tell about 4receiving parts back that are ostensibly os·ten·si·ble adj. Represented or appearing as such; ostensive: His ostensible purpose was charity, but his real goal was popularity. "faulty" but that work just fine are undoubtedly legion.) It's not that the technicians aren't trying to do the right thing. According to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. O'Brien, "We did an analysis and found the technicians were doing a good job of picking the best codes of those offered to them." But what's being offered to them may not describe the real problem. They cite a situation with a Tier One seating supplier. Heated seats weren't heating. The codes used to describe the problem indicated that there were electrical accessory failures. The real issue was that the wrong seat backs were being installed. But there was no code for that. "There are certain problems, especially assembly problems, that get spread over many different part numbers," O'Brien says. This is because there's no readily available code to describe that problem. "It forces what we call statistical dilution. People are struggling to describe something that can't be described in code." So they pick various part numbers, and the problems consequently get spread around, or diluted. "Look at the universe of problems," O'Brien posits. Three categories: Component failure. Design problems. Assembly problems. "Design and assembly problems can mask themselves as a component failure: If I miswire the fuse box in a tractor, it's going to blow the alternator alternator: see generator. alternator Source of direct electric current in modern vehicles for ignition, lights, fans, and other uses. The electric power is generated by an alternator mechanically coupled to the engine, with a rotor field coil . So that part needs to be replaced." And that part is undoubtedly going to be the one identified as the one that there was a problem with. The wiring issue gets disguised. "The reporting system itself is component-oriented, so it almost assumes all problems are component problems. It's not going to be reported to be spoken of; to be mentioned, whether favorably or unfavorably. See also: Report as an assembly problem." At least not in terms of filling in the Complaint, Cause, Correction fields on the typical form. [ILLUSTRATION OMITTED] KEEPING UP WITH THE READING. When the problems are complex--which many problems tend to be nowadays--and the number of vehicles and claims are huge, there are exceedingly complex issues to be resolved. And some of them can be resolved with natural language processing. So why don't people just read what the technicians write in their comments? For one thing, there is a limited number of people who are available to do the reading and a seemingly increasing number of forms to read. What's more, when technicians from around the country write about the very same problem, they do so in a variety of styles with a variety of jargon. This is what's called "unstructured data Data that does not reside in fixed locations. Free-form text in a word processing document is a typical example. Contrast with structured data. See free-form database. ." It may be rich in information, but from the point of view of a person, there are issues. For one thing, even if you had a person reading all of the warranty forms, she wouldn't know what she was looking for. "That's a problem," O'Brien says, "because the things you're trying to solve are happening at typically less than a 1% incidence level, and the human mind cannot really pick up things at such a low level of incidence, especially when they're coming in over time." Which is where Attensity comes in. "Our belief," O'Brien intones, "is that only by doing a consistent mass conversion of this rich, unstructured form of data into a structured form that can be graphed and analyzed can you start picking up lowlevel problems that you want to solve." He's talking about picking them up when they're at a level of 0.1%. BACK TO THE DIAGRAMMING. Essentially, the Attensity products take the forms that come in and diagram the sentences in the texts produced by the technicians. This allows them to extract the who, what, why, when, where of the situation. This information is then "relationalized"--put into rows and columns. Because the system diagrams the sentences the right words go in the appropriate categories. For example, back to the cracked transmission/bolt sentence. Using the Attensity linguistic extraction and monitoring tools, the note "The bolt on the underside of the transmission case cracked due to the heat" can be organized: Event type Part affected Area on vehicle Cause* Crack Bolt Transmission case Heat *For those of you who are wondering about the way that "John Doe bought C4 from John Smith in Cairo on October 4, 2000" was handled: Event type Sub-type Potential Bomber Terrorism Bomb making John Doe Material Place of Purchase Date of Purchase C4 Cairo 10-04-02 Norris says that once the information is set into a structured format, executives and engineers can gain a better understanding of what's occurring and they can apply existing analytic tools to focus their people on the real cause of the problems: "There is always a tension between the number of problems that I have and know about and my resource capability to assign to try to solve them." If the existing part-centric approach holds sway, and if there is statistical dilution as various technicians indicate various parts as being the cause, then the likelihood of recalls continuing is high. Consider the aforementioned alternator/wiring problem. People responsible for the alternator are tasked with solving the problem. They determine that there is nothing wrong with the alternator. So they star reading through the warranty claims. They read the verbatim reports and realize that it is actually a wiring harness problem. So they send it to that group. Norris suggests that this is too time-consuming. "We take the verbatim information that's used at the end of the process and move it to the front end. Let's ID where the problem is likely to be. If I know that we've got a specific problem, then we can send it to the factory, and they can simply walk out on the floor and ask what's going on What's Going On is a record by American soul singer Marvin Gaye. Released on May 21, 1971 (see 1971 in music), What's Going On reflected the beginning of a new trend in soul music. with the wiring harness." Maybe the problem is with the design or the assembly--not the harness itself. And certainly not the alternator. THE IMPORTANCE OF RELATIONALIZED INFORMATION. Norris estimates that as much as 85% of the information that's available to corporations is not in rows and columns. It's not relationalized. It's in text. And there's an increasing amount of information out there. So what's happening? Decisions are being made about things without complete information. Norris tells a little story about the consequences of this. Say your arm hurt. You go to a doctor. She does tests and tells you to come back in a week. You come back in a week. She says that your arm is to be amputated. ("My first thought would be that my arm feels better," Norris says.) You ask how she came to that conclusion. She answers that she used 15% of the information from the test and ignored the rest. Chances are, you'd ask her to look at the other 85%. By Gary S. Vasilash, Editor-in-Chief |
|
||||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion