Measuring The Impact Of Fragmentation On NT System Performance.For many years, Microsoft remainded ambivalent about the impact of fragmentation on NT. Not anymore. The company now openly advocates regular defragmentation See defragment. as an NT performance necessity and has included a "lite" defragmenter in Windows 2000. "Disk fragmentation can cause performance problems," says the Microsoft Windows See Windows. (operating system) Microsoft Windows - Microsoft's proprietary window system and user interface software released in 1985 to run on top of MS-DOS. Widely criticised for being too slow (hence "Windoze", "Microsloth Windows") on the machines available then. NT Server Resource Guide. "You should consider running a defragmentation program on a regular basis." So what's behind this dramatic shift of position? A wealth of new research has made it clear that fragmentation exerts a serious toll on system performance. This article discusses these research findings, the latest defragmentation techniques, and why regular defragmentation is an integral part of a smoothly running NT network. Fragmentation In Court The U.S. District Court in Portland, Maine Portland is the largest city in the U.S. state of Maine, with a 2004 population of 63,882. Portland is Maine's cultural, social and economic capital. Tourists are drawn to Portland's historic Old Port district along Portland Harbor, which is at the mouth of the Fore River and part serves as a striking example of how fragmentation can cripple any NT system. This site originally relied on a Dual Pentium A PC with a motherboard that contains two Pentium CPUs. Such machines are designed for use with an operating system that supports symmetric multiprocessing (SMP), such as Windows NT or 2000. See SMP. Pro 200 server with three 4GB drives to run ARCServe, Exchange, and several other applications. Yet after a few months of operation, NT performance significantly deteriorated. Some server operations took 25 times longer than normal and backup times lengthened considerably. In response, four NT servers were added over a three-year period. Despite these hardware upgrades, however, user complaints began to mount up due to file access delays. Some directory listings and file opens, for instance, were taking ten seconds or more and server response again became sluggish. "One server took 20 minutes to shut down during reboot To reload the operating system, which restarts the computer. See boot. (operating system) reboot - (From boot) A boot with the implication that the computer has not been down for long, or that the boot is a bounce intended to clear some state of wedgitude. See warm boot. ," says senior automation manager Kevin Beaulieu. "System deterioration over time was the reality of the NT world as far as we were concerned." The U.S. District Court then introduced a Dual Pentium II 450 with 52GB online storage (RAID 5) and 512MB of RAM. Yet after an initial period of exceptional performance, even this server began to toil noticeably. "By this time, I'd realized that hardware was not the problem," reports Beaulieu. "Finally, I tracked down the true villain-- disk fragmentation." At this site, Executive Software's network defragmenter Diskeeper 5.0 was installed across every server and Windows 95, 98, and NT box on the network. Instead of 20 minutes to down a server, it took 45 seconds. File access times dropped from 10 seconds to one to three seconds and many hours were saved through faster backups. Seeing Is Believing Seeing is believing is an idiom first recorded in this form in 1639 that means "only physical or concrete evidence is convincing".[1] Seeing is Believing may refer to:
For those who consider the experience of the U.S. District Court to be no more than a freak occurrence, try this experiment: Install NT and MS Office on a new machine, then run an analysis of fragmentation (Diskeeper Lite, which includes a fragmentation analysis utility, can be downloaded free from www.execsoft/dklite/download.htm). You'll be shocked at how much fragmentation exists-- even before the system has had a chance to operate. "Don't assume you have no fragmentation just because you have a lot of free disk space," says software developer and NTFS (NT File System) An optional file system for Windows NT, 2000 and XP operating systems. NTFS is the more advanced file system, compared to FAT32. It improves performance and is required in order to implement numerous security and administrative features in the OS. expert, Larry Seltzer. "Even a freshly installed NT or Windows 2000 system has a large number of fragmented files." The results of that experiment are supported by a recent fragmentation study conducted on 100 companies (ranging from small to large) by La Crescenta, CA-based research firm BNI BNI Business Network International BNI Business Networking International BNI Bank Negara Indonesia BNI Bechtel National, Inc. BNI British Nursing Index BNI Barrow Neurological Institute (Phoenix, AZ) . This survey revealed that 56 percent of NT workstations had files fragmented between 1,050 and 8,162 pieces. One in four reported finding files with as many as 10,000 to 51,222 fragments. For servers, an even greater degree of fragmentation existed. Half of the respondents discovered 2,000 to 10,000 fragments and another 33 percent had files fragmented into 10,333 to 95,000 pieces. Although rare, it is possible to find the occasional file splintered into millions of pieces. With such levels of fragmentation existing as the norm, valuable hours can be squandered squan·der tr.v. squan·dered, squan·der·ing, squan·ders 1. To spend wastefully or extravagantly; dissipate. See Synonyms at waste. 2. during backups due to the time it takes to read the disk before relaying the entire file to the storage medium. In extreme cases, the backup device See backup storage. may even come to a halt while the host computer assembles the file. Defragmentation Performance Testing Performance Testing covers a broad range of engineering or functional evaluations where a material, product, or system is not specified by detailed material or component specifications: Rather, emphasis is on the final measurable performance characteristics. Surprisingly, defragmentation had never been independently benchmarked, an omission rectified earlier this year by National Software Testing Lab See NSTL. (NSTL (National Software Testing Lab, Blue Bell, PA, www.nstl.com) An independent organization established in 1983 that evaluates computer hardware and software in the PC, mobile and gaming markets. It adheres to controlled testing methods to ensure objective results. ) of Conshohocken, PA. NSTL conducted performance tests on two of the most common NT configurations, determined through a survey of 6,000 system managers running Windows NT (Windows New Technology) A 32-bit operating system from Microsoft for Intel x86 CPUs. NT is the core technology in Windows 2000 and Windows XP (see Windows). Available in separate client and server versions, it includes built-in networking and preemptive multitasking. . A Pentium II The successor to the Pentium Pro from Intel. Pentium II refers to the CPU chip or the PC that uses it. Code named "Klamath," the Pentium II was a Pentium Pro with MMX multimedia instructions. 266 workstation with 96MB of memory and a 2GB IDE hard drive running Outlook and Excel, showed a performance leap of 74.5 percent with a defragmented drive. On a PII See Pentium II. 400MHz (MegaHertZ) One million cycles per second. It is used to measure the transmission speed of electronic devices, including channels, buses and the computer's internal clock. A one-megahertz clock (1 MHz) means some number of bits (16, 32, 64, etc. workstation with 228MB RAM and a 4.2GB HD, the improvement rose to 80.6 percent. Two popular server configurations also underwent testing. A dual Pentium PRO 200 with 128MB of memory on five, 4gig SCSI SCSI in full Small Computer System Interface Once common standard for connecting peripheral devices (disks, modems, printers, etc.) to small and medium-sized computers. SCSI has given way to faster standards, such as Firewire and USB. hard drives, running RAID 5, Exchange Server, and SQL Server An earlier relational DBMS from Sybase and from Microsoft. Sybase introduced SQL Server in 1988 for various Unix versions. In that same year, with help from IBM, Sybase created an OS/2 version that Microsoft licensed and branded as Microsoft SQL Server. 7.0, recorded an increase of 19.6 percent on a defragmented drive. On a Pentium PRO The sixth generation of the Intel x86 family of CPU chips. The term may refer to the chip or to a PC that uses it. Introduced in 1995 as the successor to the Pentium, models from 150 MHz to 200 MHz were released. 200 with 64MB of RAM, two 4GB SCSI HDs running Exchange and SQL SQL in full Structured Query Language. Computer programming language used for retrieving records or parts of records in databases and performing various calculations before displaying the results. 7.0, performance rose by a hefty 56.1 percent. NSTL tested the effects of fragmentation on files of all types and sizes. The Microsoft Outlook For the e-mail and news client bundled with certain versions of Microsoft Windows, see . Microsoft Outlook or Outlook (full name Microsoft Office Outlook tests, for example, included: opening 50 messages simultaneously; moving messages from the inbox to a separate folder; opening (and displaying to: from: subject: and date:) a large subfolder; a full text search of all messages in a folder for a specific string; and a filter that displayed all messages in a folder that contained an attachment. Each of these tests was executed on the system when the personal folder was fragmented and defragmented. The SQL, Exchange, and Excel tests were carried out in a similar manner. The results of this testing might exceed what some would have believed possible from defragmenting, but even if the real-world improvements are only half as good as NSTL recorded under lab conditions, it represents performance numbers to rival many hardware upgrades. (NSTL's "Final Report on Defragmentation Performance Testing," and white paper "System Performance and File Fragmentation in Windows NT" can be read in full at www.execsoft.com). How Defragmenters Work With fragmentation now proven to exact such a severe penalty on NT performance, safe and reliable methods of dealing with the problem are an absolute must. Fortunately, several recent advances have made it possible for system managers to eliminate file fragmentation as a reason for system slows. Here is how a modem network defragmenter functions. The program checks each file on a disk to determine which files need to be defragmented and which files should be moved to another location to provide more contiguous free space. If a file is to be moved online, then the defragmenter uses special APIs (known as IOCTLs--input output controls) that work in harmony with the file system to accomplish defragmentation safely. This special MoveFile API makes a contiguous copy of the file on the disk to the location specified by the defragmenter. Next, the MoveFile API changes the pointers to that file to point to the new contiguous copy of the file. Lastly, the original fragmented file has its disk space de-allocated, but only after successful completion of the prior cycles. All this is safely conducted online by current-day defragmenters. Being online, files can be defragmented in the background, usually running at low priority so as not to tax overhead. As a result, the system doesn't have to be shut down to handle the disk. However, there are two file types the 0/S will not allow to be defragmented online using the special APIs or "hooks." These are the paging file and, on NTFS, the Master File Table (MFT (1) (Master File Table) A list of files in an NTFS volume. It contains the name, size, time and date, etc. for each file. See NTFS and FAT. (2) (Managed File T ). These critical NT files can be safely defragmented at boot time The time it takes for a device to be ready to operate after the power has been turned on. See boot. and should be, as they are especially vulnerable to fragmentation. Allowing them to become fragmented is guaranteed to decrease overall system performance. (Note: On NT 4.0, the directories also cannot be defragmented online. In Windows 2000, though, the APIs have been enhanced to allow the directories to be moved safely online during defragmentation.). Preventing MFT Fragmentation: On NTFS, the MFT is the "map" of each and every file on the volume and it is itself a file. Every time a new file is made, a new record in the MFT file is created. As more files are added, the MFT expands. Unfortunately, files that are constantly growing such as the MFT are most susceptible to the extremes of fragmentation. Since the MFT is such an important file to combat any tendency to fragment, Microsoft reserved space on the disk immediately after the MFT called the MFT Zone. Approximately one eighth of a NTFS volume is reserved for the MFT Zone. The theory is that the MFT Zone is reserved space on the disk into which the MFT expands, thus, preventing/minimizing MFT fragmentation. Despite this precaution, the MFT does fragment due to other files being written into the MFT Zone under certain circumstances. For example, when a disk is full, files are stored in the MFT Zone. Now, let's say that some files get deleted from this volume--but NOT the ones stored in the MFT Zone. Despite having space on the disk, files reside in the MFT Zone. When more files are stored onto this disk, the MFT must expand to store these new files, but the files in the MFT Zone block the way. As a result, the MFT becomes fragmented, adversely affecting both system performance and backup velocity. Several solutions exist to MFT fragmentation. One new method is to bypass the Windows NT 0/S and the special APIs in order to defragment To reorganize the disk by putting files into contiguous order. Because the operating system stores new data in whatever free space is available, data files become spread out across the disk as they are updated. the MFT online (Microsoft technical authorities consider this approach dangerous), Alternatives include offline defragmentation during rebooting (less convenient, but safe) and the latest approach of Diskeeper 5.0--preventing the MFT from becoming fragmented in the first place. Here's how it works. When the defragmentation software is first installed on a machine, it is quickly defragmented at boot time if the MFT is fragmented. From that point on, an online monitoring process ensures that the MFT does not again become fragmented, thus eliminating the need to defragment it. As this form of fragmentation greatly increases head movement, a fragment-free MFT can make a considerable difference to backup duration. Preventing Pagefile Fragmentation The purpose of the paging file is to store over-committed system memory. As an active paging file is held open for the exclusive use of the NT O/S, online defragmenters cannot access it. This problem, however, can be handled either online (by going underneath the operating system operating system (OS) Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs. ), offline, or in the same preventive fashion as MFT defragmentation. By monitoring the area on the disk at the end of the paging file and ensuring that there is enough space for it to expand into, fragmentation can usually be prevented. The Smarter Course Performance degradation is a complex topic. Any system slowdown is potentially attributable to a slow processor, not enough memory, or even an old device driver. Where performance has tended to deteriorate over time, however, fragmentation is the probable cause Apparent facts discovered through logical inquiry that would lead a reasonably intelligent and prudent person to believe that an accused person has committed a crime, thereby warranting his or her prosecution, or that a Cause of Action has accrued, justifying a civil lawsuit. . Yet waiting for a slump in I/O (Input/Output) The transfer of data between the CPU and a peripheral device. Every transfer is an output from one device and an input to another. See PC input/output. I/O - Input/Output pace or sluggish backup performance before acting is asking for irate calls from dissatisfied users and customers. The smarter course is to take fragmentation out of the equation entirely by installing a defragmenter on every single NT, 95, and 98 box across the network. Drew Robb is the president of Robb Editorial (Tujunga, CA). Fragmentation Worsens RAID Performance According to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. popular mythology, RAID devices do not need defragmenting. Yet in actual practice, fragmentation degrades stripe set stripe set - data striping performance more than it degrades single volumes. "RAID systems, using both hardware RAID and Windows NT Server's support for software RAID, are also susceptible to file fragmentation and need defragmentation," states NSTL. Let's take as an example a RAID array of four physical disks. When data is written to the array, it is written more or less equally to all four devices. This means writing and reading can be almost four times as fast as to a single disk because all four devices are active simultaneously. If it takes 1000ms (milliseconds) to write a file to a single disk, the same write will take about 250 to 300ms to write to this RAID array. Reads are similarly faster. If the data on one of the disks is in two fragments, the read will not take twice as long, but it will take longer than usual. The read/write head A device that reads (senses) and writes (records) data on a magnetic disk or tape. For writing, the surface of the disk or tape is moved past the read/write head. By discharging electrical impulses at the appropriate times, bits are recorded as tiny, magnetized spots of positive or will have to perform an additional seek (seek = movement of the read/write head from where it is to the track where the data is) to get to the second fragment and wait for the disk to turn until the data comes under the head. How long this takes depends on the hard disk, but it typically averages about 9ms. If this file contains one extra fragment, it will take about 3% longer to read (about 9ms extra time added to a 250ms read). If it has ten extra fragments (all on the same disk), it will take about 30% longer to read (90ms added to 250ms). Now, look at the same file if it is on a plain, single disk partition See partition. instead of a stripe set. It takes 1,000ms to read and each fragment adds 9ms. One fragment extends the read time only 9%; ten fragments extend the time only 9%. As we are adding the same amount of time in milliseconds whether the file is on a stripe set or not, the added percentage of time is much worse on a stripe set. Of course, the stripe set figures assume all of the fragmentation is on one disk, but in the real world, fragmentation will be spread across all of the disks. If it is spread equally (best case), the percentage of slowdown will be the same as for a single disk. Any other distribution will be worse. So, at best, the stripe set will be no better off than a single disk. In most cases, though, the effects of fragmentation will be greater. A properly designed online defragmenter defragments RAID arrays without defeating the striping Interleaving or multiplexing data to increase speed. See disk striping. striping - data striping effect. It does this by treating the segments of the file as separate files. That is, it consolidates all of the fragments on one member disk into a contiguous segment, but it never moves data from one member disk to another. The History Of Fragmentation Fragmentation first appeared about thirty years ago, right after that dark age when computers existed without disks or operating systems Operating systems can be categorized by technology, ownership, licensing, working state, usage, and by many other characteristics. In practice, many of these groupings may overlap. . By the late Sixties, disks began appearing, able to store thousands of bytes-a revolutionary concept at the time. One early computer, the PDP- 11, had an 0/S called RT-11 that introduced the concept of storing files in a formal file structure. The downside--all files had to be contiguous Disks with plenty of space, but lacking one free space large enough to accommodate a new file, were "full." With frequent file deletions, it wasn't unusual to have disk reach the point where no more files could be created, even though the disk was little over half full. The solution became the SQUEEZE command that compacted files at the beginning of the disk. When a new 0/S (RSX-11) came along that allowed multiple simultaneous users of the same PDP (1) (Plasma Display Panel) See plasma display. (2) (Policy Decision Point) See COPS and XACML. (3) (Programmed Data P 11 computer, SQUEEZE users became a problem, as using it meant all users had to stop working while it ran. That led to the creation of a file structure that could locate different parts of a file in different places on the disk. Each file had a header that gave the location and size of each section of the file, so the file could be in pieces scattered around the disk. Thus, fragmentation became a feature, not a bug in these early systems. The RSX-11's fragmentation approach was carried over into the Open VMS (1) (Virtual Memory System) A multiuser, multitasking, virtual memory operating system for the VAX series from Digital. VMS applications run on any VAX from the MicroVAX to the largest unit. See OpenVMS. 0/S and, when its principal designers moved over to Microsoft, they built the NT file on systems this same fragmentation model. The problem, though, is that as hard drive capacities and CPU speeds grew exponentially, disk speeds didn't keep pace. In today's, client/server world where thousands of files are being written and deleted from disks repeatedly, the end product is files split into thousands of pieces that exertra significant toll on system I/O. |
|
||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion