Computing tools to glean data efficiently.
All of that information is stored, but if you ever had to go find a particular bit of data, how would you begin to sift through the meaningless zeroes and ones to get to the proverbial needle in a haystack?
The problem is not a new one, but it is becoming more critical as the amount of information being produced, collected and stored far exceeds its capabilities to be processed and analyzed. Developments in data mining software to help analysts sort through the avalanche of information cannot keep pace with innovations in data storage devices that can accommodate thousands of giga-bytes. In the intelligence sector and the defense weapons testing communities in particular, the lack of analytical tools to search through and understand the sea of information is being felt pointedly. Both industries sop up voluminous quantities of data daily and have similar challenges in searching for the diamonds in the rough.
Military testers, analysts and engineers encounter haystacks full of needles, but with current data mining methods, which require a lot of human intervention, correlations can be difficult to ascertain.
"The human is definitely the choke point," says Dr. James A. Wall, director of the computing and information technology division in the Texas Center for Applied Technology--the research arm of Texas A&M University's Texas Engineering Experiment Station.
"We can collect and store information at rates we never have. But if we can't take advantage of it, it's a limiting factor," he says.
Before building new weapons technologies, the Defense Department virtually constructs and tests concepts in simulations. In a recent test event for the Future Combat Systems, the Army's Operational Test Command at Fort Hood collected more than 23 terabytes of network data.
A terabyte is 1,000 gigabytes. Or put another way, a terabyte of the letter "A" typed consecutively in 12-point Courier font would form a chain long enough to circumnavigate the Earth's equator 63 times, says Wall.
When FCS--the Army's digitally connected fleet of combat systems--goes into full testing, it could generate up to 100 terabytes of data each month.
"That's a lot of data," says Wall. Before the Army can begin to construct the system, it must analyze the test information to look for design flaws and other problems. Culling through so much data with available software could take years--a luxury the service does not have.
To help solve the problem, a team in Wall's division is working with the Army to build a framework for collecting data, organizing it and tying it to new data visualization methods to glean more information.
"By improving the capability to navigate and interactively manipulate and explore data, we can enable the analyst to engage in a 'discourse' with the data," write J.J. Thomas and K.A. Cook in an article on visual analytics, a science of analytical reasoning supported by visual, and often interactive, interfaces.
Supporters of visual analytics argue that by improving the state of data visualization tools in these areas, scientists can increase the likelihood that important pieces of information buried within massive databases will be recognized in time, they say.
Wall's team is in its initial year of a potentially three-year project with the test command to deal with the data proliferation and mining issue. It will provide the testing community with software tools and an architecture that will allow for the insertion of visual analytic software as needed in the future.
"This software will not only allow data collectors and analysts to identify and integrate new visualization methods customized for individual test requirements, but will also provide an environment in which users can collaborate by sharing visual products from a given dataset," says the team's report.
Initially, the software will provide the command with interactive visualization of the large amounts of network traffic data generated during the testing of FCS components.
"This technology has great applicability to any high data volume environment," the report says.
In the future, quantum computing may solve many of the large data processing issues, says Wall. While quantum technology has only seen limited applications, a few rudimentary small-scale prototypes have been built to prove some of the claims, he adds.
Please email your comments to GJean@ndia.org
|Printer friendly Cite/link Email Feedback|
|Title Annotation:||INSIDE Science AND Technology|
|Comment:||Computing tools to glean data efficiently.(INSIDE Science AND Technology)|
|Date:||Sep 1, 2007|
|Previous Article:||Critical infrastructure plans making progress, GAO says.|
|Next Article:||Arms sales: U.S.-U.K. defense technology pact likely to draw fire.|
|Info Ag 2007.|
|Building successful partnerships.|
|Unveiling Your Hidden Power.|
|Beneficial disruption: smart insurers use predictive modeling to boost strategic and tactical decision-making.|