Data protection: the #1 storage priority; There's no ILM process without it.Data protection has become a critical piece of the ILM (Information Lifecycle Management Information Lifecycle Management refers to a wide-ranging set of strategies for administering storage systems on computing devices. Specifically, four categories of storage strategies may be considered under the auspices of ILM. ) process. There are four fundamental stages in the lifecycle of digital data: data creation, data access, data archive, and data deletion/destruction. The deletion/destruction phase no longer applies to all data types as some data is essentially being stored indefinitely, if not forever. The overarching o·ver·arch·ing adj. 1. Forming an arch overhead or above: overarching branches. 2. Extending over or throughout: "I am not sure whether the missing ingredient . . . goal of data protection is to protect information that cannot be easily replaced or replaced at all. [ILLUSTRATION OMITTED] Different levels of data protection exist and, as expected, higher levels of data protection cost more to implement. If the MTBF (Mean Time Between Failure) The average time a component works without failure. It is the number of failures divided by the hours under observation. MTBF - Mean Time Between Failures (Mean Time Between Failure) of hardware devices would have been 100% for the early years of the IT industry, businesses would have only needed to engage in straightforward backup and recovery processes. Software errors, human errors, natural disasters, power failures, building damages, and intrusions such as worms and viruses have turned data protection into a complex process. Better data protection and security has evolved over the years from simply improving the MTBF of devices to implementing local backup, remote backup and campus and remote hot sites. Three objectives are now routinely being used and have been defined by SNIA (Storage Networking Industry Association, San Francisco, CA, www.snia.org) An organization devoted to the advancement of mission critical storage systems. Founded in 1997, its goal is to determine the standards that must be developed to allow hosts and storage systems to interact via (Storage Networking Industry Association An association of producers and consumers of storage networking products, whose goal is to further storage networking technology and applications. The Storage Networking Industry Association, or SNIA ) to form the criteria needed to build an optimal data protection strategy. RPO RPO Recruitment Process Outsourcing RPO Recovery Point Objective (disaster recovery) RPO Royal Philharmonic Orchestra RPO Rochester Philharmonic Orchestra RPO Representative Poetry Online RPO Railway Post Office (Recovery Point Objective): The desired amount of time between data protection events. RTO (Recovery Time Objective) The amount of time a computer system or application can stop functioning before it is considered intolerable to the enterprise. It can be computed to be from seconds to days, depending on how critical the application is to the organization. (Recovery Time Objective): The time needed to recover from a data loss event and return to service. In other words Adv. 1. in other words - otherwise stated; "in other words, we are broke" put differently , this requires classifying data or an application by its criticality or value to the business and determining how long the business can survive without this data. DPW DPW n abbr (US) (= Department of Public Works) → ministerio de obras públicas (Data Protection or Backup Window): This is the acceptable amount of time available for an application to be interrupted while data is copied to another physical location for backup purposes. Note: Replication differs from backup in that replication moves data to another location and the data is accessible from both locations. A replicated image is ready to go almost immediately whereas a backup copy A disk, tape or other machine readable copy of a data or program file. Making backup copies is a discipline most computer users learn the hard way-- after months of work is lost. See backup and LAN free backup. requires a move to become accessible again. A replicated image improves the RTO significantly. A backed copy of data has to be restored by backup software See backup program. (tool, software) backup software - Software for doing a backup, often included as part of the operating system. Backup software should provide ways to specify what files get backed up and to where. or intelligence in the fabric. Measuring Availability Most businesses have developed multiple strategies to reduce and try to eliminate downtime The time during which a computer is not functioning due to hardware, operating system or application program failure. . Hourly costs of computer downtime can cost $50,000 to nearly $3 million, depending on the business and application. Minimizing downtime is potentially the most critical IT activity. Businesses that haven't measured the cost of downtime should do so in order to establish their own value of data metries. The computer industry has made significant strides in reducing the failure rate of the IT infrastructure by providing enhancements such as RAID, a variety of replication capabilities, hot (non-disruptive) code loads, and many redundancy features. Natural disasters require careful development of contingency plans A plan involving suitable backups, immediate actions and longer term measures for responding to computer emergencies such as attacks or accidental disasters. Contingency plans are part of business resumption planning. while energy failures mandate the expensive provision for alternative sources. All of these failures can be managed with enough financial resources. While devices have become significantly more reliable in protecting against device and component failures, valuable data is now being exposed to even higher risks as a result of destructive worms, viruses and spam as the wave of hackers and terrorists worldwide gains momentum. Recovery from an intrusion is difficult and the impact of an intrusion is destructive as permanent data loss frequently results unless special and often complex procedures are implemented. The looming threat to delivering high data availability Refers to the degree to which data can be instantly accessed. The term is mostly associated with service levels that are set up either by the internal IT organization or that may be guaranteed by a third party datacenter or storage provider. is now the "intrusion factor" and storage security has become the newest storage management discipline. In reality, there is no silver bullet No Silver Bullet - essence and accidents of software engineering is a well-known paper on software engineering written by Fred Brooks in 1986. Brooks argues that there will be no more technologies or practices that will serve as "silver bullets" and create a twofold in place yet to implement a bulletproof Refers to extremely stable hardware and/or software that cannot be brought down no matter what unusual conditions arise. See industrial strength. bulletproof - Used of an algorithm or implementation considered extremely robust; lossage-resistant; capable of correctly and secure IT infrastructure. Planned downtime is unpleasant but often occurs as a business choice. The most common causes of planned downtime are maintenance, hardware and software upgrades, and database backup and are presently unavoidable--but many non-disruptive capabilities are in development. The downtime required to perform database backup is more challenging and requires the database to completely stop service or be placed in a read-only mode. The Value of Uptime Businesses often calculate availability indexes for key applications in terms of "the number of nines." The estimated average costs of system failures would be nearly fatal to some companies and can range to nearly $3 million/hour of downtime in certain industries. A server that is 99% available may seem highly available but will actually be unavailable 5,000 minutes per year! Availability figures range from approximately 99% for Windows-based servers to 99.999+% (or five-9s) for enter-prise z/Series mainframe servers. Revenues lost per hour of outage out·age n. 1. A quantity or portion of something lacking after delivery or storage. 2. A temporary suspension of operation, especially of electric power. reflect on the criticality of the IT function to a particular business. Higher availability systems normally cost more but are often justified based on reducing the amount of revenue lost when an outage occurs. The number of minutes per year of unavailability or availability index is a good starting point Noun 1. starting point - earliest limiting point terminus a quo commencement, get-go, offset, outset, showtime, starting time, beginning, start, kickoff, first - the time at which something is supposed to begin; "they got an early start"; "she knew from the to measure availability--but it can be misleading. Beyond the number of 9s, a new set of metrics has emerged defining the impact of lost availability on the level of service delivered. QoS, or Quality of Service, is a better way to look at the type of service being delivered when a failure actually occurs. QoS takes the availability percentages to the next level and begins to add meaning to the real impact of an outage. A variety of data recovery architectures provide increasing levels of availability at a corresponding higher cost. The path to the "high 9s" describes new computing architectures that will ultimately implement advanced self-healing capabilities using embedded Inserted into. See embedded system. nanotechnology components. Data Protection Options Backup/restore is the most traditional disaster recovery method moving data, usually a complete file or full volume, from primary disk to either disk or tape for backup. The backed copy is not executable and must be restored to become accessible. In most cases, traditional backup causes the application being backed up to be impacted or even stop. Tradeoffs exist when choosing an effective backup strategy. Backing up full disk volumes or files can become very time consuming and may be difficult to schedule. In addition to full backups See backup types. , incremental Additional or increased growth, bulk, quantity, number, or value; enlarged. Incremental cost is additional or increased cost of an item or service apart from its actual cost. and differential backups See backup types. (operating system) differential backup - A kind of backup that copies all files that have changed since the last full backup. Each differential backup will include all files in previous differential backups since the full backup so to restore a version of represent further options. In a differential backup, the same data that was backed up on the previous differential backup is also backed up on the next differential backup. That's why differentials often grow each day in size between full backups. Daily backups get gradually larger, but the restore time is minimized compared to full or incremental backups See backup types. (operating system) incremental backup - A kind of backup that copies all files which have changed since the date of the previous backup. The first backup of a file system should include all files - a "full backup". Call this level 0. . A full restore only requires the last full backup and the last differential copy. For incremental backups, only the data that has changed since the last incremental backup is backed up. This minimizes the amount of data backed up and therefore reduces the time needed for the "backup window" making it different from differential backup. A full restore takes longer as each incremental backup will have to be restored to get all files to their last known state and is generally a more complex process. Often a full backup will be performed weekly, while an incremental backup is performed daily. Incremental backup minimizes the backup time and differential backup minimizes the restore time and the specific application may require one or the other. Mirroring is implemented as a block-for-block replica of a file, a logical unit, or a physical disk volume normally using disks for all copies. Once the mirrored data element is established by copying the original data element, the mirror is maintained by replicating all write operations in two (or more) places creating identical copies. Mirroring eliminates the backup window but doubles the amount of disk storage required adding expense. Storage administrators must choose to implement asynchronous Refers to events that are not synchronized, or coordinated, in time. The following are considered asynchronous operations. The interval between transmitting A and B is not the same as between B and C. The ability to initiate a transmission at either end. or synchronous mirroring and tradeoffs exist for each case. Synchronous mirroring is frequently used in z/OS (mainframe) environments, given the critical nature of its applications. In synchronous mirroring, both the source and the target devices must acknowledge the write is completed before the next write can occur. This degrades application performance but keeps the mirrored elements synchronized syn·chro·nize v. syn·chro·nized, syn·chro·niz·ing, syn·chro·niz·es v.intr. 1. To occur at the same time; be simultaneous. 2. To operate in unison. v.tr. 1. as true mirror images of each other. For asynchronous mirroring, the source and target devices do not have to synchronize See synchronization. their writes and the second and subsequent writes occur independently. Therefore, asynchronous mirroring is faster than synchronous mirroring but the secondary copies are slightly out-of-sync with the primary copy. This is sometimes referred to as a fuzzy copy. Asynchronous mirroring is often used with an IP storage protocol to replicate data to locations hundreds of miles away. In reality, the secondary data element is usually no more than one minute behind or out-of-sync with the primary copy. This can be a significant exposure for write-intensive applications. Mirroring is used for many mission-critical applications and it is the fastest way to recover data from a device or subsystem failure, since restore operations can occur in no more than a few seconds by switching to a mirrored copy. Mirroring does not help protect against a data corruption Data corruption refers to errors in computer data that occur during transmission or retrieval, introducing unintended changes to the original data. Computer storage and transmission systems use a number of measures to provide data integrity, the lack of errors. problem (hacker A person who writes programs in assembly language or in system-level languages, such as C. The term often refers to any programmer, but its true meaning is someone with a strong technical background who is "hacking away" at the bits and bytes. , worm, virus, intrusion, human or software error) as it produces two or more copies of corrupted data. For best practices, mirroring should always be accompanied by point-in-time copies for data that can permit a restore to occur from clean data that existed before the corruption occurred. Mirroring is defined, and also commonly referred to, as RAID-1. Snapshot copy presents a consistent point-in-time view of changing data. There are many variations of snapshot copy. When using snapshot copy and write operations occur, the changed areas (writes) are saved in a separate area or partition on disk of disk storage specifically reserved for snapshot activity. Here, the old value of the affected area or block can be saved in case the new block(s) are corrupted or to permit a fuzzy data image that can be used for a non-disruptive backup. Snapshots provide data protection from intrusion and data corruption but not from a device failure. PIT (Point-In-Time) copy provides an executable image of data at a specific point-in-time. Like a series of still images, PIT copies are complete data images are taken at specified points in time. PIT copies enable an administrator to go back in time to restore data from a stable state prior to when a corruption or other disruption occurred. This represents the most complete method to protect from human errors, software problems, hardware viruses and intrusions and data corruption and should accompany any mirroring implementation. Again, tradeoffs exist. The more frequently the PIT copy is taken, the more storage is required and the more time it takes to determine which copy is the correct one to restore from. Journaling is another method to enable data recovery where every write and update operation is continuously written to another device that may or may not be the same as the primary device. Unlike mirroring however, the secondary copy is a sequential history of write events. All write operations are queued to the secondary device, or the journal device, which may be disk or tape. Journals are typically kept as a continuous history for 2-4 days covering the period of maximum likelihood for a data recovery action to occur. Journals are especially good for protecting from intrusion and data corruption enabling restores to go back in time to a point before the corruption occurred. Conclusion Data protection is a critical IT discipline and businesses often choose the simplest approach after sustaining three years of downsizing (1) Converting mainframe and mini-based systems to client/server LANs. (2) To reduce equipment and associated costs by switching to a less-expensive system. (jargon) downsizing and cutbacks. However the simplest approach may not provide the highest availability and severe business impact may occur. Today's IT environments are demanding a more comprehensive strategy for data protection, security and high-availability than ever before. Replication options must match the applications' specific business requirements in order to yield the highest probability of success. The data protection solutions are now available to deliver ultra-high availability, thus increasing the probability for a business to survive most all types of outages. This is critical since most all businesses will not survive without IT. Resilient to machine and human imperfections such as intrusions, mistakes, accidents and cyber-terrorism, the price to pay for implementing data protection is not an option. |
|
||||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion