Printer Friendly
The Free Library
14,529,525 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Reduce the cost of compliance: database archiving and Information Lifecycle Management.


Database growth and storage issues are a top priority for large and small enterprises worldwide. Companies across industries rely on large, complex relational databases relational database

Database in which all data are represented in tabular form. The description of a particular entity is provided by the set of its attribute values, stored as one row or record of the table, called a tuple.
 to support mission-critical applications. According to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 META Group, relational databases are growing at a rate of 125% annually. Compounding these challenges, data is stored across a variety of different database management systems, operating systems Operating systems can be categorized by technology, ownership, licensing, working state, usage, and by many other characteristics. In practice, many of these groupings may overlap. , hardware platforms Each hardware platform, or CPU family, has a unique machine language. All software presented to the computer for execution must be in the binary coded machine language of that CPU. Following is a list of the major hardware platforms in existence today. See platform.  and storage systems. These heterogeneous environments Using hardware and system software from different vendors. Organizations often use computers, operating systems and databases from a variety of vendors. Contrast with homogeneous environment.  provide open, selective or restrictive access to application data for hundreds or thousands of users.

Experience shows that larger databases negatively impact application performance and take longer to load, unload To remove a program from memory or take a tape or disk out of its drive. , search, reorganize re·or·gan·ize  
v. re·or·gan·ized, re·or·gan·iz·ing, re·or·gan·iz·es

v.tr.
To organize again or anew.

v.intr.
To undergo or effect changes in organization.
 and recover. Response time can slow to the point where maintaining service levels can become nearly impossible. Batch processing (1) Performing a particular operation automatically on a group of files all at once rather than manually opening, editing and saving one file at a time. For example, graphics software that converts a selection of images from one format to another would be a batch processing utility.  windows expand, processing times overlap, and the start of routine operations is often delayed.

So, what are the alternatives? Database tuning Database tuning describes a group of activities used to optimize and homogenize the performance of a database. It usually overlaps with query tuning, but refers to configuration of the database files, the database management system (DBMS), and the operating system and hardware the ? Expensive capacity upgrades? More storage? These options may abate abate v. to do away with a problem, such as a public or private nuisance or some structure built contrary to public policy. This can include dikes which illegally direct water onto a neighbors property, high volume noise from a rock band or a factory, an improvement  the problems in the short term, but only offer diminishing returns because they only address the symptoms, while databases continue to grow. Data retention requirements and the need to manage and store information for longer periods are resulting in increased operational costs. Companies want to reduce IT costs and gain a greater return on their data management and storage investment. That is why Information Lifecycle Management Information Lifecycle Management refers to a wide-ranging set of strategies for administering storage systems on computing devices. Specifically, four categories of storage strategies may be considered under the auspices of ILM.  (ILM) is considered an overall "best practice" approach, and database archiving is an essential component of ILM for managing database growth over the long term.

What is Active Archiving Moving data to a secondary storage medium that can be readily accessed if required. Active archiving enhances the performance of production databases by eliminating records that are not accessed daily, but may be needed for reference from time to time. ?

Unique to database archiving, active archiving is a proven strategy that allows organizations to archive rarely accessed data from application databases and manage the data efficiently, while providing easy access to data on demand. Archived data, business context data and metadata (1) (meta-data) Data that describes other data. The term may refer to detailed compilations such as data dictionaries and repositories that provide a substantial amount of information about each data element.  are saved to an Archive File See archive.  (referentially intact and complete) and these files can be easily stored on the most convenient and cost-effective storage medium. Once archived, the data can be safely removed from application databases, improving the performance and availability of critical systems immediately. However, when needed, the archived data can be easily accessed, researched and selectively restored.

By separating essential data from nonessential non·es·sen·tial
adj.
Being a substance required for normal functioning but not needed in the diet because the body can synthesize it.
 data, many organizations can safely reduce the size of overloaded o·ver·load  
tr.v. o·ver·load·ed, o·ver·load·ing, o·ver·loads
To load too heavily.

n.
An excessive load.

Adj. 1.
 databases by up to 50% or more during the initial archive. Ongoing active archiving (daily, weekly or monthly) enables organizations to maintain mission-critical databases at a size that fits comfortably within the desired processing and disaster recovery windows. Routine backup and database maintenance processes take much less time. Mission-critical data is available at peak performance.

Most importantly Adv. 1. most importantly - above and beyond all other consideration; "above all, you must be independent"
above all, most especially
, active archiving ensures data quality and referential integrity A database management safeguard that ensures every foreign key matches a primary key. For example, customer numbers in a customer file are the primary keys, and customer numbers in the order file are the foreign keys. . Active archiving understands and remembers data relationships with 100% accuracy and saves not only the data, but also the metadata describing the tables, columns and relationships used to create the Archive File. Users continue to have real-time access to archived data using comprehensive search, browse and reporting capabilities without restoring a single row.

[FIGURE 1 OMITTED]

With active archiving, users no longer need to restore all archived data for the sake of restoring a small subset A group of commands or functions that do not include all the capabilities of the original specification. Software or hardware components designed for the subset will also work with the original.  of the data. Archived data can be restored selectively by specifying the criteria that identifies the data to be restored, even if it differs from the original criteria used to archive the data. Active archiving also ensures that data can be restored to a new or existing database, even if the data model has changed over time. So, how does active archiving complement ILM and reduce the cost of compliance?

Managing the Information Lifecycle

All information has a lifecycle, from acquisition through disposal (see Figure 1). During this period, the data access requirements range from heavily accessed to rarely accessed. At each stage in the lifecycle, the business value of the information, as well as the access rates and performance requirements will vary. In general, for any given type of information, you should be able to compare the business value against the cost to store and access it in a timely manner.

You can reduce the storage and operational costs associated with data retention, and reduce the cost of compliance, by matching the storage medium to the access and performance requirements corresponding to the business value of the data. In other words Adv. 1. in other words - otherwise stated; "in other words, we are broke"
put differently
, to achieve an optimal return on investment, you should manage and store data in a way that matches its business value.

Basically, data is most valuable once it is created, or soon thereafter. As data ages, the relative value and the corresponding access rates decrease. There comes a point of diminishing returns where the cost to provide optimal production-level access and performance outweighs the actual business value derived from the data. At this point, it makes sense to move your data out of the high-cost, fast response system and into lower-cost, slower response systems to better match its business value. Although ILM is a simple concept, implementing ILM can be highly challenging, especially for managing complex relational data. Database archiving is the critical component within any ILM framework that enables companies to manage relational data according to its business value.

For example, as shown in Figure 2, while storing the initial two years of current financial data in the production database, you can still provide optimal performance for financial processing and yearly comparison reporting. To maintain performance levels, data that is two to five years old can be moved into an archive database that provides full data access when needed.

Data that is five to seven years old can be saved to an Archive File to take advantage of more cost-effective storage options. If you need access to data stored in this online archive, you can still access the data directly, and/or selectively restore some of the data to the archive database. Finally, for the older financial data that must be saved for a decade or longer, the capability to store these Archive Files on an offline archive tape further reduces storage costs.

Of course, this entire information lifecycle approach could easily be adjusted to meet your company's unique requirements. It is important to remember that every company values information differently; make sure you understand how your information is valued and spend as much, and only as much, as necessary to provide the appropriate level of access.

Selecting Database Archiving Technology

If you decide to research vendor offerings to purchase archiving technology and software, knowing the appropriate questions to ask is critical to selecting the ideal archiving solution for your environment. The ideal database archiving solution must meet several critical requirements:

* Provide out-of-the-box archiving capabilities to ensure a fast and easy implementation

* Preserve the business context and referential integrity of the data

* Provide predefined archive templates that can be easily customized to meet your database archiving requirements

* Reduce downtime The time during which a computer is not functioning due to hardware, operating system or application program failure.  during application upgrades and ensure access to the archived data after the upgrade is completed

[FIGURE 2 OMITTED]

* Allow business users to access archived data easily using standard forms and reports

* Offer intuitive or integrated functionality that is easy to learn and use Easy to learn refers to software that is well designed and capable of being used right away without having to wade through documentation. If you make the program work with little effort, it is easy to learn.  without extensive training

* Address data retention regulations and audit requirements

* Support information lifecycle and storage management strategies

* Enable your company to defer de·fer 1  
v. de·ferred, de·fer·ring, de·fers

v.tr.
1. To put off; postpone.

2. To postpone the induction of (one eligible for the military draft).

v.intr.
 or eliminate future hardware expenditures to accommodate data growth

* Provide proven technology that has been in operation at customer sites around the world for several years.

Putting Database Archiving Technology to the Test

In general, to obtain an accurate assessment in the shortest period of time, a Proof of Concept that focuses on a particular application can really put the database archiving solution to the test. For that reason, the Proof of Concept design should verify the application and database issues that are of greatest concern. The success criteria for the Proof of Concept must be thoughtfully defined to verify that specific capabilities satisfy the application archiving and retrieval requirements. Familiarity with the application functionality and the database architecture is critical for success. At a minimum, the Proof of Concept should demonstrate capabilities that:

* Archive precise subsets of related data and store archived data on the most appropriate and cost-effective storage medium

* Accurately remove precise subsets of related data from the application database based on their specifications, leaving all other data referentially intact

* Locate and browse a specific instance of a set of related data in an Archive File quickly and easily

* Selectively restore data to a separate archive database, mirrored archive tables or back into the production database environment, even if the data model has changed

* Provide business users with easy access to archived data upon request

* Demonstrate that basic application functions are unaffected by removing the historical data. In some cases, it might be necessary to make adjustments, either in the application or in the business processes, to resolve any effects of removing specific data

* Once the Proof of Concept success criteria are defined, all that remains is developing a Proof of Concept verification plan and setting up the test environment. In most cases, this effort takes just a few days.

Benefits of Active Archiving

Upon completing the Proof of Concept, the results will clearly indicate whether active archiving meets all the criteria and provides needed capabilities. Some expected benefits include the ability to:

* Archive almost 50% of the application database, resulting in shorter batch processing and maintenance windows, and significant improvements in performance and availability

* Store archived data to a near-line system and store some data in a separate archive database. This approach allows for faster access to archived data when needed and reduces the impact on the online system when accessing historical data in the production environment

* Store archived historical data on the most cost-effective and appropriate medium (online in an archive database, near-line on a file server, on optical devices, offline to tape, WORM devices, such as EMC (1) (EMC Corporation, Hopkinton, MA, www.emc.com) The leading supplier of storage products for midrange computers and mainframes. Founded in 1979 by Richard J. Egan and Roger Marino, EMC has developed advanced storage and retrieval technologies for the world's largest companies.  Centera or NetApp's SnapLock) and enhance the value of SAN, NAS (1) See network access server.

(2) (Network Attached Storage) A specialized file server that connects to the network. A NAS device contains a slimmed-down operating system and a file system and processes only I/O requests by supporting the popular
, HSM (1) (Hierarchical Storage Management) The automatic movement of files from hard disk to slower, less-expensive storage media. The typical hierarchy is from magnetic disk to optical disc to tape.  and other storage strategies

* Defer or eliminate the need to spend millions of dollars for the additional high-end processing units and reduce costs.

Selecting a proven database archiving solution eliminates the costs of developing, testing and maintaining an in-house solution, a much more complex and costly project. The Proof of Concept provides an effective way to ensure that the archiving software can perform the tasks required for a specific database environment. Once the application success criteria are defined, this "show me" technique puts the selected software to the test, conclusively con·clu·sive  
adj.
Serving to put an end to doubt, question, or uncertainty; decisive. See Synonyms at decisive.



con·clusive·ly adv.
 proving the value of archiving to the organization.

The benefits of active archiving are clear. Removing rarely accessed historical data from application databases improves the performance and availability and can significantly reduce the costs associated with additional capacity upgrades. Most importantly, active archiving works within the framework of existing storage strategies to offer a value added Value Added

The enhancement a company gives its product or service before offering the product to customers.

Notes:
This can either increase the products price or value.
 proposition. Having the flexibility to store data cost effectively on the most appropriate medium offers advantages, and selecting a storage medium based on the types of data to be stored makes it easier to comply with data retention requirements throughout the information lifecycle.

Jim Lee is vice president, product marketing, at Princeton Softech (Princeton, NJ)

www.princetonsoftech.com
COPYRIGHT 2003 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2003, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Storage Management
Author:Lee, Jim
Publication:Computer Technology Review
Geographic Code:1USA
Date:Dec 1, 2003
Words:1825
Previous Article:Will DAM flood the media? For cost-effective ways to manage and leverage content assets, here's digital asset management. But is it the...
Next Article:Pervasive data volume reduction: a high-value alternative to aging tape-based backup technologies; A paradigm shift in backup technology can cut...
Topics:



Related Articles
Princeton Softech's database archiving technology supports EMC's Centera Compliance Edition.(Archive for Servers 2.0)
StorageTek enhances Email Xcelerator suite with new offerings.(Top Technology Showcase)
DATABASE ARCHIVING SOLUTION SUPPORTS EMC CENTERA MAINFRAME.
The case for compliance profiling.(Regulatory Compliance)
New ILM solutions for regulatory compliance: case study on how a customer achieves both financial and operational efficiencies.(Regulatory...
IBM, EMC, HP, CA, Oracle and Iron MTN executives gather to define lifecycle MGMT.(Computer Associates)(EMC Software Group)(management)
PeopleSoft announces expanded agreements with Princeton Softech, Mercury Interactive and Quest Software.
Not Information Lifecycle Management but Information Value Management.(Special ILM Issue)
Archiving has nasty sting in tail.(STORAGE)
Data lifecycle management: hard drives are not enough.(Disaster Recovery & Backup/Restore)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles