
Computing tools to glean data efficiently.

In our highly digitized world, the average person generates enough information to fill several CDs a year. Now consider how much data a large organization, such as a governmental agency, produces on a daily basis. The emails, presentations, spreadsheets and other products multiply at an exponential rate.

All of that information is stored, but if you ever had to go find a particular bit of data, how would you begin to sift through the meaningless zeroes and ones to get to the proverbial needle in a haystack?

The problem is not a new one, but it is becoming more critical as the amount of information being produced, collected and stored far exceeds the capacity to process and analyze it. Developments in data mining software to help analysts sort through the avalanche of information cannot keep pace with innovations in data storage devices that accommodate thousands of gigabytes. In the intelligence and defense weapons testing communities in particular, the lack of analytical tools to search through and make sense of that sea of information is felt acutely. Both communities sop up voluminous quantities of data daily and face similar challenges in searching for the diamonds in the rough.

Military testers, analysts and engineers encounter haystacks full of needles, but with current data mining methods, which require a lot of human intervention, correlations can be difficult to ascertain.

"The human is definitely the choke point," says Dr. James A. Wall, director of the computing and information technology division in the Texas Center for Applied Technology--the research arm of Texas A&M University's Texas Engineering Experiment Station.

"We can collect and store information at rates we never have. But if we can't take advantage of it, it's a limiting factor," he says.

Before building new weapons technologies, the Defense Department virtually constructs and tests concepts in simulations. In a recent test event for the Future Combat Systems, the Army's Operational Test Command at Fort Hood collected more than 23 terabytes of network data.

A terabyte is 1,000 gigabytes. Or put another way, a terabyte of the letter "A" typed consecutively in 12-point Courier font would form a chain long enough to circumnavigate the Earth's equator 63 times, says Wall.
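
A quick back-of-the-envelope calculation bears out Wall's figure, assuming one byte per character, the standard 10-characters-per-inch spacing of 12-point Courier, and an equatorial circumference of roughly 40,075 kilometers:

```python
# Rough check of the "63 times around the equator" figure.
# Assumptions: 1 terabyte = 1e12 bytes, one byte per character,
# and 12-point Courier prints 10 characters per inch (0.1 in each).

TERABYTE_CHARS = 1e12            # characters in one terabyte of text
CHAR_WIDTH_M = 0.1 * 0.0254      # 0.1 inch per character, in meters
EQUATOR_M = 40_075_000           # Earth's equatorial circumference in meters

chain_length_m = TERABYTE_CHARS * CHAR_WIDTH_M
laps = chain_length_m / EQUATOR_M
print(f"{laps:.0f} trips around the equator")  # prints roughly 63
```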

When FCS--the Army's digitally connected fleet of combat systems--goes into full testing, it could generate up to 100 terabytes of data each month.

"That's a lot of data," says Wall. Before the Army can begin to construct the system, it must analyze the test information to look for design flaws and other problems. Culling through so much data with available software could take years--a luxury the service does not have.

To help solve the problem, a team in Wall's division is working with the Army to build a framework for collecting data, organizing it and tying it to new data visualization methods to glean more information.

"By improving the capability to navigate and interactively manipulate and explore data, we can enable the analyst to engage in a 'discourse' with the data," write J.J. Thomas and K.A. Cook in an article on visual analytics, a science of analytical reasoning supported by visual, and often interactive, interfaces.

Supporters of visual analytics argue that by improving the state of data visualization tools in these areas, scientists can increase the likelihood that important pieces of information buried within massive databases will be recognized in time.

Wall's team is in its initial year of a potentially three-year project with the test command to deal with the data proliferation and mining issue. It will provide the testing community with software tools and an architecture that will allow for the insertion of visual analytic software as needed in the future.

"This software will not only allow data collectors and analysts to identify and integrate new visualization methods customized for individual test requirements, but will also provide an environment in which users can collaborate by sharing visual products from a given dataset," says the team's report.

Initially, the software will provide the command with interactive visualization of the large amounts of network traffic data generated during the testing of FCS components.
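
One common way to make interactive visualization of that much traffic tractable is to pre-aggregate the raw capture into coarse bins that a front end can render and let analysts drill into. The sketch below assumes a simple CSV log of timestamps and byte counts, purely for illustration:

```python
# Hypothetical sketch: reduce a large network-traffic log to per-minute
# byte counts so an interactive front end never loads the raw capture.
# The file format and field layout here are assumptions for illustration.

from collections import defaultdict
import csv

def bin_traffic_by_minute(path: str) -> dict:
    """Stream a CSV of (timestamp_seconds, bytes) rows into minute bins."""
    bins = defaultdict(int)
    with open(path, newline="") as f:
        for row in csv.reader(f):
            timestamp, size = float(row[0]), int(row[1])
            bins[int(timestamp // 60)] += size   # minute index -> total bytes
    return dict(bins)

# A plotting layer could then render these bins and zoom in on anomalies.
```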

"This technology has great applicability to any high data volume environment," the report says.

In the future, quantum computing may solve many of the large data processing issues, says Wall. While the technology has so far seen only limited application, a few rudimentary small-scale prototypes have been built to prove some of its claims, he adds.


Please email your comments to GJean@ndia.org
