Digital archiving in the twenty-first century: practice at the national library of the Netherlands.
Research journals are increasingly being published digitally. The advantage of digital publishing is obvious: immediate accessibility anywhere. Gradually a disadvantage is also becoming clear: digital publishing endangers the continuity of research information. As a consequence of the obsolescence ob·so·les·cent
1. Being in the process of passing out of use or usefulness; becoming obsolete.
2. Biology Gradually disappearing; imperfectly or only slightly developed. of formats, hardware, software, and carriers, digital information will be lost unless we act. Digital publishing is also causing a shift in the roles and responsibilities of publishers and libraries concerned with archiving digital publications for future use. Archiving digital publications requires a major turnaround in the policy and practice of national libraries. Although some actions have been taken, digital preservation research and implementation are still in their infancy. National libraries will need substantial funding for venture research activities and development of archival infrastructures. They will also have to work together more closely to successfully organize digital archiving in the twenty-first century.
In 1994 the Koninklijke Bibliotheek Koninklijke Bibliotheek (Royal Library in English) can stand for:
athenaeum, atheneum - a place where reading materials are available . For this purpose the KB designed the "Safe Place Strategy," a strategy ensuring the transfer of digital publications from their publishing environment to a dedicated archiving environment. Through a series of experiments and projects the KB developed skills, procedures, organization, and infrastructure for digital archiving. The digital archive of the KB, the e-Depot, has been fully operational since early 2003. The technical core of the e-Depot is the Digital Information Archiving System (DIAS), a system jointly developed by the KB and IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries) . DIAS is the first archiving system available on the market that is compliant with the Open Archival Information System An Open Archival Information System (or OAIS) is an archive, consisting of an organization of people and systems, that has accepted the responsibility to preserve information and make it available for a Designated Community. (OAIS OAIS Open Archival Information System (library and information science)
OAIS Officer Assignment Information System
OAIS Opinion, Attitude, and Interest Survey ) reference model standard. Elsevier and several other international scientific publishers have entered into a formal archiving relationship with the KB to guarantee future availability of their digital publications. The KB is researching and testing digital preservation methods and solutions that will guarantee permanent access to the archived material.
ARCHIVING IN THE DIGITAL ERA
The Impact of Digital Publishing
In the past decade journal publishing has increasingly turned digital, a change led by publishers in the field of Science, Technology, and Medicine (STM (Scanning Tunneling Microscope) A microscope that can image down to the atomic level. An STM uses a piezoelectric tube with a tiny sharp tip at the end that is moved within nanometers of the object being sampled. ). Generally, all STM publishers now publish their journals digitally with printed derivatives only produced as a secondary format. Without doubt the pioneer of digital publishing is Elsevier (Elsevier, n.d.). Early work on digital publishing began in 1979 with project ADONIS on CDROM See CD-ROM. and, in 1991, with project TULIP for desktop delivery of e-journals. In 1995 the TULIP project was followed by ScienceDirect OnSite and in 1997 by the Web-based service ScienceDirect. New publishing models are under development, such as SPARC (Scalable Performance ARChitecture) A family of RISC CPUs from Sun that runs mostly under Sun's Solaris, but also under Linux and BSD operating systems. After development began in the mid-1980s by David Patterson of the University of California at Berkeley and Bill and BioMed Central BioMed Central (BMC) is a UK-based scientific publisher specializing in open access publication. BMC publishes over 180 scientific journals, including Arthritis Research & Therapy, Breast Cancer Research, Critical Care, Genome Biology . (1) Even though the final outcome of these experiments is still uncertain (Elsevier, 2004), it is obvious that digital publishing is a success and here to stay. This is evidenced by the increasing number of scientists who use the electronic services offering digital publications.
For publishers, digital publishing has shifted the emphasis from the production and selling of publications toward creating and providing enhanced services Enhanced service is service offered over commercial carrier transmission facilities used in interstate communications, that employs computer processing applications that act on the format, content, code, protocol, or similar aspects of the subscriber's transmitted information; for searching, linking, and retrieving digital information. Despite the advantages, however, digital publishing poses a serious threat to the continuity of the "Record of Science." In the past, libraries had assumed the responsibility for archiving the printed journals they bought and collected. Now, with digital resources, libraries do not own the publications but license them from the publishers. Therefore, libraries no longer compile a collection, and the digital publications remain with the publisher. As a consequence the archiving of printed publications implicitly offered by library collections no longer exists for e-journals.
Changing Roles Subsequently, archiving needs to be organized explicitly for e-journals. To accomplish this, some institutions have taken the initiative to investigate and develop the archiving of digital publications. There are several significant examples of such initiatives, (2) and recently several national programs (3) have been set up to encourage the development of digital archiving.
Digital publishing not only affects the archiving role of libraries but also the role of publishers. To cover their share of responsibility to maintain the Record of Science, it is no longer sufficient for publishers to only produce, market, and sell the publications. An active role is required to plan and organize the archiving of the published information. Scientific publishers have become aware of this change and are rising to the challenge by implementing archiving policies. (4)
Archiving Digital Information Libraries need to encourage the publishers to deposit their digital publications for archiving. However, this is only the first part of the digital archiving story. To understand why archiving of digital information is more complicated than archiving printed information, it is helpful to compare printed publications with digital publications.
A printed publication is a physical object. A digital publication, on the contrary, is not a physical but a logical object stored on a physical medium. Another difference is that a printed publication presents the information so that it is immediately accessible to the human eye and can be read directly. To read or view the information in a digital publication, specific functionality enabled by software and hardware is required. Journalist Richard Poynder described this difference as follows:
unlike paper or microfilm where the meaning is transparently inscribed on the surface of the medium--digital documents are opaque bit streams only understandable to humans when interpreted by a machine. The hardware and software to do this interpretation, however, is constantly superseded. There have, for instance, been more than 200 digital storage formats alone deployed since the 1960s, with none lasting more than 10 years. (Poynder, 2003)
Another difference is that publications on paper or vellum vellum: see parchment. can survive "by accident" and remain readable for a very long time without any specific actions. Visualize, for instance, a manuscript such as the Book of Hours book of hours, form of prayer book developed in the 14th cent. from the prayers of clerics appended to the main service. The subjects of the miniature illustrations (see miniature painting) were frequently derived from the appendix of the Psalter. from the fifteenth century, once owned by the princes of Trivulzio living in Milan. The KB received this beautiful illuminated manuscript illuminated manuscript
Handwritten book decorated with gold or silver, brilliant colours, elaborate designs, or miniature paintings. “Illumination” originally denoted embellishment of text with gold or silver, which gave the impression that the page had been (5) as a valuable gift, worth at least six million Euro, from an anonymous donor. It was assumed to be lost until the family of the donor acquired the manuscript through an auction some time ago. Would a digital publication survive left sitting on a shelf for hundreds of years? Considering the threats to digital publications, the answer is definitely negative.
Threats to Digital Information One threat to digital information is that the physical carrier of a digital publication will deteriorate much faster than paper or vellum. The format of the digital object can be damaged or lost and may no longer be intact or retrievable. But even before that happens, the technology used to store the publication is likely to become obsolete. Another threat is the loss of the functionality needed to interpret, display, and use the information contained in the digital object. Without this functionality, provided by specific hardware and software, the information will not be available even if the bitstream of the digital object has been preserved. When we are able to address all the threats to a digital publication, we can successfully keep it for future use. If we cannot, sooner or later we will lose the digital publication itself or the access to the information it contains.
Permanent Access Solutions To guarantee permanent access, a variety of solutions will be required (migration, normalization In relational database management, a process that breaks down data into record groups for efficient processing. There are six stages. By the third stage (third normal form), data are identified only by the key field in their record. , emulation, or others). The choice of which techniques to use will depend on the nature of the digital publication (for example, what is the format, is it static or executable, etc.) and also on the requirements of the user (will they want to simply view or also process the information).
Migration (sometimes referred to as conversion) is a commonly applied technique in computer science. However, migration in essence makes changes to the original and implies a risk of damaging the information, a risk that increases if a sequence of migrations is applied. This could potentially mean the loss of the information. Normalization is migration of the information to a specifically chosen format before it is accepted into the archive. This approach explicitly accepts the possibility of loss of specific characteristics and of part of the digital information. However, in some cases it might be useful to hold only a limited number of formats in an archive. Emulation is a totally different approach aimed at rendering the digital information as authentically as possible. The risk of technology obsolescence is addressed by replacing the hardware with a computer written in software (an emulator). The use of emulation as a permanent access solution was suggested a decade ago (Rothenberg, 1995).
Sometimes a combination of emulation and controlled migration can be used, an attempt to combine the best of two worlds. An example of such an approach recently developed in practice is the Universal Virtual Computer (UVC UVC ultraviolet C; see ultraviolet.
UVC Umbilical vein catheter, see there ) approach from Raymond Lorie, a researcher at IBM (Lorie, 2002, 2004). The UVC is a program for a virtual computer that performs all the essential functions of a regular computer. The UVC program is available for testing and evaluation at Alphaworks. (6)
Application of the UVC The KB and IBM have applied the UVC to offer permanent access for selected formats. The approach requires that an easily readable technical description of the UVC is preserved. Using this description, an UVC emulator can be created at any time for any given platform. The UVC emulator will run the programs that have been written for the UVC and that have also been preserved. In figure 1 the components of such a permanent access solution are shown. In the archiving phase, the original image, the UVC Specification, the Decoder A hardware device or software that converts coded data back into its original form. See decode and MPEG decoder. , and the Logical Data Schema are stored. In the future, in the delivery phase the stored elements will be used to build an UVC emulator to run the Decoder that generates the Logical Data View. Finally, a Viewer is built that will restore the original image. The KB and IBM have recently developed and tested the first operational permanent access UVC tool for JPEG JPEG
in full Joint Photographic Experts Group
Standard computer file format for storing graphic images in a compressed form for general use. JPEG images are compressed using a mathematical algorithm. (Wijngaarden & Oltmans, 2004).
[FIGURE 1 OMITTED]
Long-Term Preservation Research
Achieving Permanent Access As was explained before, digital preservation requires both the permanent maintenance of the bitstream of the digital publication as well as the maintenance of functionality to render the information from the bitstream now and in the future. Working on permanent access solutions is complicated and costly. Hence, as information technology (IT) continues to develop, ongoing research is needed to address the obsolescence of formats, software, and hardware. It is essential that institutions researching permanent access solutions cooperate closely and exchange experience and results. To support this, a joint terminology and joint requirements for interoperability would be very useful, as well as a joint framework for the research.
A Framework for Permanent Access Research The idea to create a joint framework for permanent access research was introduced by a consortium (7) in a project proposal to the European Commission European Commission, branch of the governing body of the European Union (EU) invested with executive and some legislative powers. Located in Brussels, Belgium, it was founded in 1967 when the three treaty organizations comprising what was then the European Community , under the title Permanent Access Toolbox See toolkit and toolbar. for Digital Cultural Heritage, or PATCH. The Permanent Access Toolbox, or PATbox, is a concept representing the functional and technical requirements to which permanent access solutions should comply (see figure 2). The aim of the PATbox is to promote interoperability and increase the usefulness of the results of permanent access research, regardless of where or by whom it was performed. In addition to manifesting a framework of requirements, the PATbox also represents the collection of generally available permanent access tools. Access tools may be based on different techniques or approaches, such as conversion, emulation, or other technical solutions.
[FIGURE 2 OMITTED]
ORGANIZING ARCHIVING IN THE TWENTY-FIRST CENTURY
Digital preservation of and permanent access to digital information are complex problems that can only be addressed successfully by the cooperation of several parties. National libraries have come forth as a major stakeholder stakeholder n. a person having in his/her possession (holding) money or property in which he/she has no interest, right or title, awaiting the outcome of a dispute between two or more claimants to the money or property. . Their core business is to keep published information available in the long term. Another integral party is comprised of the governments and international political bodies who are needed to support national libraries with appropriate policies and legislation and who should also assign the resources needed to implement digital archiving in practice. Some important examples are discussed later. The third party involved is the publishers. In contrast to the world of printed publications, archiving digital publications requires the active involvement of publishers (Steenbakkers, 2004). Permanent archiving will also require close cooperation with key players in the IT industry to develop new technological solutions and new standards. Their support will be essential to organize the registration of file formats (8) and software versions and to realize global software repositories A software repository (sometimes abbreviated as a repo) is a storage location from which software packages may be retrieved and installed on a computer. Many software publishers and other organisations maintain servers on the Internet for this purpose, either free of . Hopefully, academic research institutions will also contribute to the development of durable digital information as this problem is not restricted to publications but also concerns digital research data in general. Not only will the parties directly involved benefit from these efforts, but also society as a whole. Digital information, either for business or for private use, needs to become durable.
Academic and national libraries have always collected and preserved publications, but archiving of digital publications requires a new, proactive approach. Considering the apparent progress in archiving technology, it is feasible that more and more academic libraries will be able to keep digital information in institutional repositories An Institutional Repository is an online locus for collecting, preserving, and disseminating -- in digital form -- the intellectual output of an institution, particularly a research institution. for the (relatively) short term. Commitment to long-term digital preservation, however, can only be expected from institutions like national libraries, which are dedicated to permanent safeguarding of information.
The national libraries at least will have to take responsibility for archiving the digital publications from their own country, often written in the national language. Just a few years ago the preservation of digital information was still considered to be so complicated and costly that making prints of digital texts and images on acid-free paper was suggested to be an appropriate solution (Gorman, 2001). As off-the-shelf archiving systems are becoming available today (Hodge & Frangakis, 2004, pp. 41-44), it is feasible that national libraries can start implementing an archive of their national digital publications and contribute to the universal control of digital content in the twenty-first century. Under the umbrella of the IFLACDNL Alliance for Bibliographic Standards (ICABS ICABS IFLA-CDNL Alliance for Bibliographic Standards ), a group of national libraries is looking into the matter of bibliographic and resource control in the digital era (ICABS, 2004).
For international digital publications, such as those that are part of the international Record of Science, preservation will have to be organized beyond national boundaries at a global level. One solution being developed is for publishers to establish a formal archiving relationship to work with a limited number of institutions distributed across the world that qualify as international archiving centers (Hunter, 2002). A substantial number of major international science publishers are already following this model and archiving their e-journals at the KB.
To achieve permanent archiving of electronic publications, the involvement of political bodies is essential. Support is needed not only from the local or national government but also from international bodies. Several of them are already involved. From the European perspective, two initiatives from 2002 are particularly relevant. One of them was initiated by the European Commission, which produced the experts report "Preserving Tomorrow's Memory" (Smith, 2003), and the other was initiated by the European Council European Council, a consultative branch of the governing body of the European Union (EU). It is composed of the heads of government of the EU nations and their foreign ministers, in conjunction with the president and two additional members from the European , resulting in an EU resolution (Council of the European Union Council of the European Union, branch of the governing body of the European Union (EU) that has the final vote on legislation proposed by the European Commission and deliberated by the European Parliament. , 2002). The European Commission has made digital preservation one of the five themes of the last call for proposals of its Sixth Framework Programme The Sixth Framework Programme (abbreviated FP6) was the Framework Programme for Research and Technological Development from 2002 till 2006 set up by the European Union (EU) in order to fund and promote European research and technological development. (2002-06). (9) An example of political activity at a global level is the UNESCO UNESCO: see United Nations Educational, Scientific, and Cultural Organization.
in full United Nations Educational, Scientific and Cultural Organization initiative, which resulted in a charter on digital preservation containing recommendations to governments. In addition, UNESCO has published the Guidelines for the Preservation of the Digital Heritage (UNESCO, 2003).
DIGITAL PRESERVATION IN PRACTICE
Digital Archiving in the Netherlands
The national library of the Netherlands, the Netherlands, The
officially Kingdom of The Netherlands byname Holland
Country, northwestern Europe. Area: 16,034 sq mi (41,528 sq km). Population (2005 est.): 16,300,000. Capital: Amsterdam. Seat of government: The Hague. Most of the people are Dutch. Koninklijke Bibliotheek, was founded in 1798. The organization comprises the equivalent of 290 full-time staff, and the annual budget is 38 million Euro. The KB is located in The Hague, the Hague, The (hāg), Du. 's Gravenhage or Den Haag, Fr. La Haye, city (1994 pop. 445,279), administrative and governmental seat of the Kingdom of the Netherlands, capital of South Holland prov., W Netherlands, on the North Sea. residence of the government of the Netherlands. One of the key tasks of the KB is to act as the depository library for publications. The aim is to collect the publications, preserve them, and provide access to them, now as well as in the future. Depositing of publications at national libraries can either be regulated by legislation or be arranged by voluntary agreements. In the Netherlands deposit is based on voluntary arrangements between the KB and publishers.
Traditionally the depository The place where a deposit is placed and kept, e.g., a bank, savings and loan institution, credit union, or trust company. A place where something is deposited or stored as for safekeeping or convenience, e.g., a safety deposit box. task of the KB concerned only printed material, like journals, books, and newspapers. In 1994, as more and more publications became digital, the KB decided to start including digital publications in its depository collection. To implement this the KB officially extended the depository task to include digital publications; began cooperation with publishers and IT-partners; and established research and organization of digital archiving in practice.
Breaking New Ground
In the early 1990s when the first digital publications were deposited, the KB treated them as printed books and shelved them in the book stacks. They were "offline" or "handheld" digital publications and were catalogued on the basis of information on the wrapper A data structure or software that contains ("wraps around") other data or software, so that the contained elements can exist in the newer system. The term is often used with component software, where a wrapper is placed around a legacy routine to make it behave like an object. or box of the publication. In order to learn to handle digital publications appropriately, the KB had to break new ground in a variety of areas. The library staff had to develop completely new procedures and acquire new skills. The KB also had to develop close cooperation with IT partners and had to obtain a dedicated IT infrastructure. Last but not least, the KB had to build the trust of publishers to cooperate in experiments to archive their digital publications (Adams, 2004). A similar tripartite TRIPARTITE. Consisting of three parts, as a deed tripartite, between A of the first part, B of the second part, and C of the third part. cooperation of libraries, IT companies, and publishers provided the basis for the success of the European project NEDLIB NEDLIB Networked European Deposit Library , which is discussed later. To acquire more experience in handling digital publications in practice, the KB started to experiment with "online" digital publications, using samples of e-journals kindly provided by Elsevier and Kluwer Academic. The KB experimented with a number of offline publications, trying to install them in order to check the content (Noordermeer, 1997).
Archiving Dutch e-Journals
The publishing company Elsevier has been involved in the archiving experiments of the KB since 1993. In 1995 the KB and Elsevier discussed the possibility of depositing the e-journals with a Dutch imprint (that is, those originally printed in the Netherlands) with the library. In 1996 a preliminary agreement was signed, and Elsevier started to deposit its e-journals. In 1999, when a similar arrangement was reached with the Dutch Publishers Association, the deposit of digital publications was extended to Dutch publishers in general. Both offline and online digital publications are deposited. Another archiving relationship developed by the KB dates from 2003 when the Dutch universities Dutch Universities are supported by state funding (with the exception of University Nyenrode) so that universities do not have to rely on private funding to facilitate tuition. All citizens of the Netherlands who complete high school on the pre-academic level (VWO) or have the Dutch jointly started the Digital Academic Repositories (DARE) project (10) to create institutional repositories. The KB will archive and ensure the persistency of the digital information published through these institutional repositories.
Archiving International e-Journals
While developing the new digital archiving practices, the KB recognized that imprint (that is, place of publication) was no longer a feasible selection criterion for depositing digital publications by international publishers like Elsevier. Therefore, the KB investigated the adoption of a wider responsibility for archiving all the e-journals published by Elsevier irrespective of irrespective of
Without consideration of; regardless of.
preposition despite their imprint. The issue was discussed with the Ministry of Education, Science and Culture The Ministry of Education, Science and Culture can refer to:
* A formal archiving relationship to ensure permanent archiving of the publisher's publications
* To guarantee integrity of the digital information and to ensure permanent availability of the publications
* To act as host to provide access for (former) customers of the publisher and also to provide access as an emergency backup for the publisher
Following the archiving agreement with Elsevier, the KB signed agreements with other international publishers such as Kluwer Academic Publishers, BioMed Central, Blackwell Publishing, the Taylor and Francis Group, Oxford University Press, Springer springer
a North American term commonly used to describe heifers close to term with their first calf. , and Brill Brill or Bril, Flemish painters, brothers.
Mattys Brill (mä`tīs), 1550–83, went to Rome early in his career and executed frescoes for Gregory XIII in the Vatican. Academic Publishers. In these archiving agreements the digital publications archived by the KB may be used as follows:
* Bibliographic metadata about the publications may be included in the KB's online public catalogue and in the National Bibliography.
* Publications may only be used onsite at the KB and only by persons authorized by the KB.
* In the case of Open Access publishers and nonprofit publishers, the onsite restriction does not apply.
* For authorized KB staff, both onsite and remote use is allowed.
* The archived digital publications may be used as a source for print or fax copies for interlibrary in·ter·li·brar·y
Existing or occurring between or involving two or more libraries: an interlibrary loan; an interlibrary network. loan within the Netherlands.
* Sending or transferring the electronic files outside the library by any means is not allowed.
Safe Place Strategy
KB's Archiving Strategy Over the years the KB has developed an archiving strategy (Steenbakkers, 2002) that is referred to as the "Safe Place Strategy." This strategy consists of three steps. The first step is to create an archiving environment to which any publication that has to be archived will be transferred. The technical core of the archiving environment is a deposit system that has a similar function to the physical stacks of a library. The archiving environment offers specific and controlled conditions for storage, maintenance, and management of the publications.
The second step is to organize and execute "perfect" copying of the digital object in order to refresh (1) To continuously charge a device that cannot hold its content. CRTs must be refreshed, because the phosphors hold their glow for only a few milliseconds. Dynamic RAM chips require refreshing to maintain their charged bit patterns. See vertical scan frequency and redraw. the storage medium. This has to be done before the old storage medium deteriorates or becomes technically obsolete. Because a deposit collection will continually grow, in the end copying will not be a trivial task due to the large size of the collection (Diessen & Rijnsoever, 2002).
The third and most complicated step is to ensure that the publications can continue to be used in the future. To achieve this, we will have to register, preserve, or replace the functionality for rendering the digital information. These activities will require ongoing research and development as information technology is rapidly and continuously developing.
IT Solutions To manage the online digital publications, the KB recognized that a specific computer system was needed and therefore looked for an IT partner who was willing to provide a pilot system. In 1995 the KB succeeded in teaming up with AT&T Solutions and its Bell Laboratories. AT&T provided the KB with a system called Right Pages, which is designed to handle a modest amount of e-journal articles. Besides creating the workflow for the online digital publications, the KB and AT&T jointly investigated the potential to scale-up the Right Pages system in order to manage larger amounts of digital publication.
The KB and AT&T project made good progress, but in 1997 AT&T closed down the European division that was developing and marketing Right Pages and the KB was forced to look for another IT partner. After a search for alternative products, IBM's Digital Library was selected as a replacement. The implementation of Digital Library at the KB required major effort from both IBM and the KB. This second pilot system became operational in January 1998, and it was substantially larger and contained basic functionality for handling and managing digital publications (Steenbakkers, 1999). The system, referred to as the Depot voor Nederlandse Electronische Publicaties, or DNEP-system, contained about 1.9 Tb of storage capacity. The DNEP-system was not only used to load and maintain digital publications but also to provide access to the digital content for library visitors and staff.
As described before, in 1994 the KB adapted its deposit policy to include digital publications and experimented with pilot deposit systems to store and handle them. In 1998 a comprehensive pilot deposit system was installed. In January of the same year the international project NEDLIB (11) started; it aimed to define the functional, technical, and organizational requirements for an operational electronic deposit system.
NEDLIB NEDLIB stands for the Networked European Depository Library project and was launched in January 1998 with the aim to define and test the architecture and procedures for capturing, preserving, and accessing digital publications. The NEDLIB project was initiated by COBRA+, a cooperation of a number of national libraries operating under the umbrella of the Conference of European National Librarians (CENL CENL Conference of European National Librarians
CENL Conference of European National Libraries ). The project was cofunded by the Commission of the European Communities European Community: see European Union.
European Community (EC)
Organization formed in 1967 with the merger of the European Economic Community, European Coal and Steel Community, and European Atomic Energy Community. within the Telematics for Libraries Programme. The project operated over three years, 1998-2000, and had a total budget of 1,760,195 Euro. The participants consisted of eight national libraries, one national archive A national archive is a central archive maintained by a nation. List of national archives
The objectives of NEDLIB were to develop a functional specification and an overall design for a depository for electronic publications and to address the issue of long-term preservation and permanent access. Also within this process a technical description and a prototype of a depository of digital publications had to be delivered (Werf-Davelaar, 1999). By the time NEDLIB had created the first draft of a generic model for a depository for electronic publications, the OAIS Reference Model White Book 1998 was published. After matching the NEDLIB model with the OAIS model, the NEDLIB partners decided to adopt the OAIS model and to contribute to its further development, as well as to apply it for use in national libraries and national archives National Archives, official depository for records of the U.S. federal government, established in 1934 by an act of Congress. Although displeasure concerning the method of keeping national records was voiced in Congress as early as 1810, the United States continued .
The results of NEDLIB have been published in a series of six reports (NEDLIB, n.d.). These reports in turn provided valuable input for the development and implementation of the KB's operational electronic depository. Through the NEDLIB project the European national libraries were also able to contribute to the development of the OMS OMS - Opportunity Management System archiving standard.
Acquiring the Deposit System Based on experience with the pilot system, the NEDLIB Guidelines (Steenbakkers, 2000), and the model for a Deposit System for Electronic Publications (Werf, 2000), the KB defined the requirements for its deposit system. In 1999 a scan of the IT market was done through a "Request for Information." The results from the request indicated that a deposit system could not be bought off the shelf. It also showed that the IT market seemed to be interested in designing and developing a deposit system. Encouraged by this, at the end of 1999 the KB decided to move ahead and publish the "Call for Tender Depot van Nederlandse Publicaties" (National Library of the Netherlands, 1999).
Requirements for the Deposit System The overall requirement for the deposit system was that it should offer a controlled archiving environment. It should support the maintenance of the digital publications in such a way that no data would be lost or mutilated mu·ti·late
tr.v. mu·ti·lat·ed, mu·ti·lat·ing, mu·ti·lates
1. To deprive of a limb or an essential part; cripple.
2. To disfigure by damaging irreparably: mutilate a statue. and that access to the information was guaranteed, now as well as in the future. The detailed requirements were published in the tender document (National Library of the Netherlands, 1999). In summary, the requirements were as follows:
* The deposit system should be designed to handle a constantly increasing variety and amount of digital publication
* The system should be durable, in the sense that it can be technically updated continuously, without affecting the reliability of the archiving process and without endangering the archived content
* The functional design of the deposit system should be in accordance with OAIS (2002) Reference Model ISO (1) See ISO speed.
(2) (International Organization for Standardization, Geneva, Switzerland, www.iso.ch) An organization that sets international standards, founded in 1946. The U.S. member body is ANSI. 14721:2002
* The deposit system should be a separate system interfacing with the digital library infrastructure that will offer traditional functions like Cataloguing, Search & Retrieval, etc.
* The interfaces to its environment should be well defined and easy to maintain
* As much as possible, the system should be constructed with proven technology and off-the-shelf building blocks with a large installed base
Tender Result The result of the European call for tender was a short list of five bidders, from which IBM was selected as the best technology partner and was contracted in October 2000. It took several months of intense but constructive negotiations before the contract could be signed.
The major problem to be solved during the contract negotiation was that the KB wanted an operational, comprehensive, OAIS-compliant deposit system, including full functionality for digital preservation (planning, management, and permanent access). At that time, however, the KB could not define the requirements for the preservation functionality precisely enough to demand its development and delivery. To solve this problem the contract was divided into the development and delivery of an operational deposit system and an additional study to help define the requirements for long-term preservation and permanent access. This research effort was implemented as the "Long-Term Preservation Study," which ran parallel to the creation of the deposit system. IBM used the results of the study for the design and development of the Digital Information Archiving System (DIAS). The results were also published in December 2002 in a series of six reports. (12) It is interesting to note that in the contract negotiation the KB decided to grant IBM the full intellectual property of the archival system. The KB hoped that this would be an incentive for IBM to brand, market, and update the archival system as an IBM product.
IBM's Digital Information Archiving System (DIAS) Even though IBM developed DIAS in partnership with the KB, it is not a system specific to the KB nor even to libraries but rather is a solution for digital archiving in general. (13) DIAS is, as required, based on the Depository System for Electronic Publications (DSEP DSEP Defense Science and Engineering Program ) model as published by NEDLIB, making it the first concrete application of the OMS Reference Model (Werf, 2000). Compliance with the OMS archival standard (see figure 3) recommends inclusion of the following functions:
[FIGURE 3 OMITTED]
* Delivery and Capture: services and functions to receive from the publishers the digital publications and check their quality and to produce SIPs (Submission Information Packages) and present them to Ingest in·gest
tr.v. in·gest·ed, in·gest·ing, in·gests
1. To take into the body by the mouth for digestion or absorption. See Synonyms at eat.
* Ingest: the services and functions to check and accept the SIPs
* Archival Storage: services and functions for the storage, maintenance, and retrieval of MPs (Archival Information Packages)
* Data Management: services and functions for keeping, maintaining, and accessing descriptive information for the archived publications and other administrative data
* Administration: services and functions for controlling the daily operations
* Preservation: services and functions for planning, monitoring, and executing preservation strategies and actions
* Access: services and functions to locate and retrieve the archived information and produce the DIPs (Dissemination Information Packages)
* Packaging and Delivery: services and functions for preprocessing A preliminary processing of data in order to prepare it for the primary processing or for further analysis. The term can be applied to any first or preparatory processing stage when there are several steps required to prepare data for the user. the information packaged in a DIP and delivering it to the user
* Monitoring and Logging: services and functions for registering of and reporting on actions
Further information and explanation of the OAIS model can be found in other publications such as Sawyer et al. (2002) and Cornell University Library The Cornell University Library is the library system of Cornell University. In 2005 it held 7.5 million printed volumes in open stacks, 8.2 million microfilms and microfiches, and a total of 440,000 maps, motion pictures, DVDs, sound recordings, and computer files in its (2003).
A simplified model of DIAS (14) is shown in figure 4. The DIAS Core contains the functions shown in the OMS model above, except the Delivery and Capture and the Packaging and Delivery, which are shown in more detail on either side.
[FIGURE 4 OMITTED]
IBM used off-the-shelf components as well as specifically developed components for constructing DIAS. Off-the-shelf components included Content Manager, Tivoli Storage Manager, DB2 database, and Business Objects. The Delivery and Capture functions (manual or automated capture) and Packaging and Delivery function (delivery through the network or on a specific workstation [such as RefWS] (15)) were especially created for DIAS. The Preservation function is under construction.
The Preservation Subsystem The current version of DIAS already contains basic preservation functionality. However, the KB and IBM are developing a Preservation Subsystem for DIAS that will support the registering of technical metadata and will provide other functionality needed for preservation. In figure 5 the model of DIAS is shown with the planned Preservation Subsystem in place. The subsystem consists of three components. In the lower right of the subsystem box is the Preservation Manager, a component that is used to register the technical metadata. At the top of the subsystem is the Permanent Access Toolbox or PATbox, as described earlier. At the bottom left is the Preservation Processor, used to execute the preservation actions.
[FIGURE 5 OMITTED]
Technical Metadata An important part of the preservation metadata (16) is the metadata for rendering, or in OAIS terminology, "Representation Information." Metadata has to be registered for each archived digital publication. The OAIS concept of Representation Information was further developed by NEDLIB into a specific layered model (Lupovici & Masanes, 2000). In order to apply the layered model in practice, IBM developed it further into the Preservation Layer Model (PEM (Privacy Enhanced Mail) A standard for secure e-mail on the Internet. It supports encryption, digital signatures and digital certificates as well as both private and public key methods. Not widely used, work on PEM later evolved into S/MIME. See MIME. ) (Diessen, 2002), which is shown in figure 6.
[FIGURE 6 OMITTED]
The PLM (Product Life cycle Management) A comprehensive information system that coordinates all aspects of a product from initial concept to its eventual retirement. Sometimes called the "digital backbone" of a product, it includes the requirements phase, analysis and design represents the template structures used by the Preservation Subsystem to register, maintain, and manage the technical metadata. A concrete PLM for a specific data format is called a "View Path." For example, a View Path for PDF (Portable Document Format) The de facto standard for document publishing from Adobe. On the Web, there are countless brochures, data sheets, white papers and technical manuals in the PDF format. 1.3 data format in practice may contain the following elements: reference platform = Intel Pentium, operating system operating system (OS)
Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs. = NT, viewer application = Acrobat Reader The former name of Adobe Reader. See PDF. 3.0 . The policy of the KB is to guarantee at least two View Paths for every digital publication archived.
The Preservation Manager The central component of the Long-Term Preservation function is the Preservation Manager, which supports the managing of the technical metadata. It registers the View Paths for every digital publication and monitors their viability. The Preservation Manager has recently been developed by IBM and has been tested by the KB. It will be integrated into the next version of DIAS. (Diessen, Oltmans, & Wijngaarden, 2004). The purpose of the Preservation Manager in detail is to support registration and management of the metadata for rendering; monitor the availability of the technical prerequisites for accessing the information; and aid the planning of preservation actions, for example, migration or emulation.
The e-Depot: The Operational KB Archive
The e-Depot's Infrastructure The "e-Depot" is the name of the KB's infrastructure and organization for archiving digital publications, including the workflow and procedures for handling and archiving digital publications. The name "e-Depot" also applies to the archiving service offered by the KB to producers and users of digital information. The e-Depot's infrastructure consists of DIAS with some smaller supporting systems and of other systems offering the usual digital library functions. In figure 7 the technical infrastructure of the e-Depot is schematically sche·mat·ic
Of, relating to, or in the form of a scheme or diagram.
A structural or procedural diagram, especially of an electrical or mechanical system. presented.
[FIGURE 7 OMITTED]
DIAS is the technical core of the e-Depot, offering dedicated archiving functions. In the schema the functions on the left are for receiving and loading the digital publications: EPO EPO
EPO Erythropoietin, see there is the Electronic Post Office, BER (1) (Basic Encoding Rules) A set of encoding rules for ASN.1 notation, which is a method for defining data structures. See ASN.1.
(2) (Bit Error Rate) The average number of bits transmitted in error. See BERT.
1. is the Basic Error Recovery, and NBN NBN National Broadband Network (Philippines)
NBN National Biodiversity Network (UK nature conservation)
NBN Nefesh B'Nefesh (Jewish Souls United)
NBN National Broadcasting Network is the National Bibliographic Number generator for unique identifiers With reference to a given (possibly implicit) set of objects, a unique identifier is any identifier which is guaranteed to be unique among all identifiers used for those objects and for a specific purpose. . The functions on the right are for cataloguing, search, retrieval, and delivery: GGC GGC Girl Guides of Canada
GGC Greenwood Genetic Center (South Carolina)
GGC Gwasanaeth Gwaed Cymru (Welsh Blood Service)
GGC Generalized Goppa Code
GGC Grosvenor Gallery Company is the national central cataloguing system of Pica/OCLC, KB-TITEL is the local overall catalogue database of the KB, and IAA IAA
Noun 1. IAA - a plant hormone promoting elongation of stems and roots
auxin - a plant hormone that promotes root formation and bud growth
2. is the function for Identification, Authentication (1) Verifying the integrity of a transmitted message. See message integrity, e-mail authentication and MAC.
(2) Verifying the identity of a user logging into a network. , and Authorization of end users.
The e-Depot's Organization Within the KB three divisions are jointly responsible for running the service and developing the infrastructure of the e-Depot:
1. The Acquisition and Processing Division is in charge of the day-to-day operations of obtaining, checking, and loading the publications, including their metadata.
2. The Division for Information and Communication Technology is responsible for the technical maintenance of the infrastructure of the e-Depot. This task includes the maintenance of DIAS and expanding its storage capability, guaranteeing backup, and providing media migration. This division also manages integration of the deposit system within the general digital library infrastructure.
3. The Research and Development Division performs studies and experiments to further develop the functionality of the e-Depot. These activities are usually joint projects with the two divisions mentioned above. External technology partners are often involved. The Research and Development Division also organizes or participates in international activities, like development of standards, preservation studies, projects, and conferences. For these activities a dedicated Digital Preservation research unit has been created.
The e-Depot is a strategic activity that has a great impact on the KB's policy and organization. To coordinate the activities and policy development concerning the e-Depot, the KB has implemented an e-Depot Steering Board. In addition to the three divisions already mentioned, the User Services Division also participates on the board. This division is in charge of providing access to the digital publications in accordance with conditions specified in the publishers' archiving agreements.
The e-Depot as a Trusted Repository The aim of the KB is to develop the e-Depot to become a "Trusted Digital Repository" (TDR TDR - time domain reflectometer ). The concept of the Trusted Digital Repository was introduced in 2002 by the Research Libraries Group/Online Computer Library Center (RLG/OCLC) Working Group on Digital Archive Attributes A file classification that indicates whether the file has been updated since the last backup. A bit is set in the file directory to indicate the archive status. When a file is created or saved, the bit is turned on. When it is backed up, the bit is turned off. See file attribute. (Research Libraries Group, 2002). In its report the working group gives the following definition: "A trusted digital repository is one whose mission is to provide reliable, long-term access to managed digital resources to its designated community, now and in the future" (Research Libraries Group, 2002, i).
Other archival concepts relevant to the e-Depot are the OAIS reference model and the NEDLIB model for DSEE DSEE - Domain Software Engineering Environment Figure 8 depicts the relationship of the KB's e-Depot to the following concepts and models:
[FIGURE 8 OMITTED]
* The e-Depot is the operational digital archive of the KB, dedicated to long-term archiving and permanent access. "e-Depot" refers to all aspects involved in operational digital archiving--policy, infrastructure, organization, and services--and encompasses a significant proportion of all the other concepts represented here.
* TDR is a concept describing the various requirements an institution has to meet in order to be considered a trusted repository that is able to archive digital information for the long term. In addition to the functions described in the OAIS model, the TDR concept includes functions like administrative responsibility administrative responsibility Any task or duty related to managing an institution; non-Pt management-related responsibilities of physicians include chart review, participation in the tumor board or tissue committee, etc. Cf Clinical responsibility. , organizational viability, financial sustainability, system security, technical and procedural suitability, and procedural accountability (Research Libraries Group, 2002).
* OAIS is an International Organization for Standardization International Organization for Standardization (ISO)
Organization for determining standards in most technical and nontechnical fields. Founded in Geneva in 1947, its membership includes more than 100 countries. (ISO) standard describing the functional, organizational, and procedural requirements for a digital archive (Sawyer et al., 2002).
* DSEP is the NEDLIB process model that details the functional aspects of the OAIS in a deposit system model.
* DLI DLI Department of Land Information (Western Australia)
DLI Donor Lymphocyte Infusion
DLI Defense Language Institute
DLI Durham Light Infantry (Regiment of the British Army) is not a specific concept for archiving. It represents the overall IT infrastructure offering the "traditional" digital library functions, like cataloguing processes, access to the catalogue, user registration and authentication, etc.
The Success of e-Depot
In October 2002 DIAS was handed over to the KB. Once DIAS was implemented and the workflow and procedures for handling the digital publications were in place, the loading of digital publications started. The DNEP pilot system was closed down at the end of 2004 as all its functions had been taken over by the e-Depot. The total budget available for the KB to create and obtain an operational deposit system; for professional project management support; for all hardware, software, and twelve Tb (terabytes) storage and seventeen Tb backup, was 5.5 million Euro. IBM developed the system at the KB's premises, for the costs agreed and according to according to
1. As stated or indicated by; on the authority of: according to historians.
2. In keeping with: according to instructions.
3. the project planning project planning - project management .
In the past few years the stakeholders--either producers, brokers, and users of digital publications or IT producers and governmental bodies--became increasingly aware of the problem of preserving digital publications. More and more stakeholders Stakeholders
All parties that have an interest, financial or otherwise, in a firm-stockholders, creditors, bondholders, employees, customers, management, the community, and the government. have started to address this threat to the human digital record. Several publishers have developed an archiving policy and are acting accordingly. National and international governmental bodies are raising awareness Raising awareness is a common phrase advocacy groups use to justify a particular event, brochure or even the entire organization. Raising awareness refers to alerting the general public that a certain issue exists and should be approached the way the group desires. of the problem of digital preservation and are creating regulations and legislation to support digital archiving. The scientific research community also has recently noticed the problem of digital preservation. The IT sector is increasingly becoming interested in studying and tackling the problem of digital preservation, as solving the digital preservation issues will have a long-term return on investment.
National libraries such as the KB in the Netherlands were amongst the first institutions not only to raise awareness of the problem of digital preservation but also to address it in practice. In this process the national libraries found the lack of funding the most serious obstacle, which has inhibited the progress in digital preservation research and development. Funding agencies, both in Europe and in the United States United States, officially United States of America, republic (2005 est. pop. 295,734,000), 3,539,227 sq mi (9,166,598 sq km), North America. The United States is the world's third largest country in population and the fourth largest country in area. , usually do not provide funding for cultural sector venture research projects. They prefer to fund "safe" activities, like performing another study or survey or drafting another report or research agenda, rather than funding the research and development itself. Nevertheless, national libraries have undeniably taken the lead and emerged as key players in the field of digital preservation. The International Publishers Association (IPA IPA - International Phonetic Alphabet ) and the International Federation of Library Associations and Institutions
The International Federation of Library Associations and Institutions (IFLA IFLA International Federation of Library Associations and Institutions
IFLA International Federation of Landscape Architects
IFLA Instituto Forestal Latinoamericano (Venezuela)
IFLA Israel Free Loan Association ) have acknowledged the position of the national libraries (Hodge & Frangakis, 2004, pp. 17-18).
In the past the lack of resources has frustrated frus·trate
tr.v. frus·trat·ed, frus·trat·ing, frus·trates
a. To prevent from accomplishing a purpose or fulfilling a desire; thwart: the progress of research and development of solutions for a problem that will have a great impact on both our private lives and our business activities. It is about time that the national libraries and the national archives, together with partners in technology and publishing, receive substantial support to move the practice of digital preservation forward.
I would like to thank Deborah Woodyard-Robinson not only for inviting me to contribute to this special issue of Library Trends but also for the valuable advice she gave me while writing the article. I would also like to thank Hilde van Wijngaarden, Erik Oltmans, and Raymond van Diessen for the information provided.
Adams, G. (2004). Partners go Dutch Dutch
a. Of or relating to the Netherlands or its people or culture.
b. Of or relating to the Dutch language.
b. to preserve the minutes of science. Research Information, 13, 18-22. Retrieved February 15, 2005, from http://www.researchinformation.info/risepoct04archiving.html.
Cornell University Library. (2003). Digital preservation management: Implementing short-term strategies for long-term problems [Online tutorial]. Retrieved February 15, 2005, from http://www .library.cornell.edu/iris/tutorial/dpm/index.html.
Council of the European Union. (2002). Council resolution: On preserving tomorrow's memory--Preserving digital content for future generations. Official Journal of the European Communities, 162, 1-2. Retrieved August 15, 2005, from http://europa.eu.int/ eur-lex/pri/en/oj/dat/2002/c_162/c_16220020706en00040005.pdf.
Diessen, R.J. van. (2002). Preservation requirements in a deposit system (IBM/KB Long-Term Preservation Study Report Series 3). Retrieved February 15, 2005, from http://www.kb.nl/hrd/dd/dd_onderzoek/reports/3-preservation.pdf.
Diessen, R.J. van, Oltmans, E., & Wijngaarden, H. van. (2004). Preservation functionality in a digital archive. In Proceedings of the 4th ACM/IEEE-CS Joint Conference on Digital Libraries, Tucson, Arizona Tucson (pronounced /ˈtusɑn/, Spanish: Tucsón [tuk'son] (pp. 279-86). New York New York, state, United States
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of : ACM (Association for Computing Machinery, New York, www.acm.org) A membership organization founded in 1947 dedicated to advancing the arts and sciences of information processing. In addition to awards and publications, ACM also maintains special interest groups (SIGs) in the computer field. Press. Retrieved February 15, 2005, from http://portal.acm.org/citation.cfm?id=996350.996416.
Diessen, R.J. van, & Rijnsoever, B.J. van. (2002). Managing media migration in a deposit system (IBM/KB Long-Term Preservation Report Series 5). Retrieved February 15, 2005, from http://www.kb.nl/hrd/dd/dd_onderzoek/reports/5-mediamigration.pdf.
Elsevier. (n.d.). E-Journals at Elsevier: Over two decades of experimentation and development. Retrieved February 15, 2005, from http://www.elsevier.com/wps/find/authored_newsitem .cws_home/companynews05_00021.
--. (2002). Elsevier and Koninklijke Bibliotheek finalise Verb 1. finalise - make final; put the last touches on; put into final form; "let's finalize the proposal"
finalize, nail down, settle
terminate, end - bring to an end or halt; "She ended their friendship when she found out that he had once been convicted of major archiving agreement. Retrieved February 15, 2005, from http://www.elsevier.com/wps/find/ authored_newsitem.librarians/companynews05_00020.
--. (2004). Elsevier's comments on evolutions in scientific, technical and medical publishing and reflections on possible implications of Open Access journals for the UK. Retrieved February 15, 2005, from http://www.elsevier.com/authored_news/corporate/images/ UKST UKST United Kingdom Schmidt Telescope 1Elsevier_position_paper_on_stm_in_UK.pdf.
Gorman, M. (2001). Bibliographic control or chaos: An agenda for national bibliographic services in the 21st century. In Libraries and librarians: Making a difference in the knowledge age: 67th IFLA Council and General Conference, August 16-25, 2001, Boston, USA. Retrieved February 15, 2005, from http://www.ifla.org/IV/ifla67/papers/134-133e.pdf.
Hodge, G., & Frangakis, E. (2004). Digital preservation and permanent access to scientific information: The state of the practice. A report sponsored by the International Council for Scientific and Technical Information (ICSTI ICSTI International Council for Scientific and Technical Information (Conseil International pour l'Information Scientifique et Technique) ) and U.S. Federal Information Managers Group (CENDI CENDI Commerce, Energy, NASA, NLM, Defense and Interior ). Retrieved February 15, 2005, from http://cendi.dtic.mil/pubarc.html.
Hunter, K. (2002). STM members and digital archiving. Information Services See Information Systems. & Use, 22(2/3), 83-88.
ICABS. (2004). IFLA Core Activity: IFLA-CDNL Alliance for Bibliographic Standards (ICABS). Retrieved February 15, 2005, from http://www.ifla.org/VI/7/icabs.htm.
Information Society Technologies in the 6th Framework Programme. (2002-06.) Thematic priority under the specific programme "Integrating and strengthening the European research area The European Research Area (ERA) is a system of scientific research programmes integrating the European Union's scientific resources. Since its creation in 2000, the structure has been concentrated on multi-national co-operation in the fields of medical, environmental, industrial ." Retrieved August 15, 2005, from http://ica.cordis.lu/search/index.cfm?fuseaction=prog v. i. 1. To wander about and beg; to seek food or other supplies by low arts; to seek for advantage by mean shift or tricks.
imp. & p. p. os>
( ) r>.
p. pr. & vb. n. os>
CFID Cold Fusion Id =3981689&CFTOKEN=15082814.
Lorie, R. (2002). The UVC: A method for preserving digital documents: Proof of concept (IBM/KB Long-Term Preservation Study Report Series 4). Retrieved February 15, 2005, from http:// www.kb.nl/hrd/dd/dd_onderzoek/reports/4-uvc.pdf.
--. (2004). Preserving digital documents for the long-term (pp. 88-92). Proceedings of the Imaging Science & Technology 2004 Archiving Conference, San Antonio, Texas “San Antonio” redirects here. For other uses, see San Antonio (disambiguation).
San Antonio is the second most populous city in Texas, the third most populous metropolitan area in Texas, and is the seventh most populous city in the United States. As of the 2006 U.S. . Springfield, VA: Society for Imaging Science and Technology The Society for Imaging Science and Technology (IS&T) is a research and education organization in the field of imaging. Founded in 1947, as the Society of Photographic Science and Engineering (SPSE), it is headquartered in Washington DC. .
Lupovici, C., & Masanes, J. (2000). Metadata for long term preservation of electronic publications (NEDLIB Report Series 2). Retrieved February 15, 2005, from http://www.kb.nl/ coop/nedlib/results/NEDLIBmetadata.pdf.
National Library of the Netherlands. (1999). Call for tender Depot van Nederlandse Elektronische Publicaties [published in accordance with the European procedure under number 99/ S 177-126796/NL]. Retrieved August 15, 2005, from http://www.kb.nl/dnp/ e-depot/dm/callfortender.pdf.
NEDLIB. (n.d.). NEDLIB Web site, Results section. Retrieved February 15, 2005, from http:// www.kb.nl/coop/nedlib/.
Noordermeer, T. (1997). Deposit for Dutch electronic publications: Research practice in the Netherlands. In C. Peters & C. Thanos (Eds.), Research and advanced technology for digital libraries: 1st European Conference on Digital Libraries, ECDL ECDL European Computer Driving License (computer skills certification programme; European Computer Driving Licence Foundation Ltd.)
ECDL European Conference on Research and Advanced Technology for Digital Libraries 1997, Pisa, Italy, September 1-3, 1997 (Lecture Notes in Computer Science Lecture Notes in Computer Science (LNCS) is a computer science series published by Springer Science+Business Media. , 1324, pp. 361-73). London: Springer-Verlag.
Poynder, R. (2003). Elephants and dung-trucks. Information Today, 20(8). Retrieved February 15, 2005, from http://dspace.dial.pipex.com/town/parade/df04/elephants and dung DUNG. Manure. Sometimes it is real estate, and at other times personal property. When collected in a heap, it is personal estate; when spread out on the land, it becomes incorporated in it, and it is then real estate. Vide Manure. .htm.
Reference Model for an Open Archival Information System (OAIS). (1998). Consultative Committee for Space Data Systems The Consultative Committee for Space Data Systems (CCSDS) was formed in 1982 by the major space agencies of the world to provide a forum for discussion of common problems in the development and operation of space data systems. , Recommendation concerning space data systems standards, CCSDS CCSDS Consultative Committee for Space Data Systems
CCSDS Consultative Committee for Space Data System (NASA)
CCSDS Consultative Committee on Space Data Standards
CCSDS Consultative Committee for Standard Data Services 650.0-W-3.0, White Book. Retrieved August 15, 2005, from ftp://nssdcftp .gsfc.nasa.gov/standards/nost/nost/isoas/us12/CCSDS-650.0-W-3.pdf.
--. (2002). Consultative Committee for Space Data Systems, Recommendation for space data system standards, CCSDS 650.0-B-1, Blue Book. Retrieved August 15, 2005, from http://ssdoo.gsfc.nasa.gov/nost/wwwclassic/documents/pdf/ CCDS-650.0-B-1.pdf.
Research Libraries Group. (2002). Trusted digital repositories: Attributes and responsibilities. An RLG-OCLC Report. Retrieved February 15, 2005, from http://www.rlg.org/ longterm/repositories.pdf.
Rothenberg, J. (1995). Ensuring the longevity of digital documents. Scientific American Scientific American
U.S. monthly magazine interpreting scientific developments to lay readers. It was founded in 1845 as a newspaper describing new inventions. By 1853 its circulation had reached 30,000 and it was reporting on various sciences, such as astronomy and , 272, 24-29.
Sawyer, D., Reich, L., Giaretta, D., Mazal, P., Huc, C., Nonon-Lapatie, M., et al. (2002). The Open Archival System (OAIS) reference model and its usage. Retrieved February 15, 2005, from http://www.ccsds.org/documents/SO2002/SPACEOPS02_P_T5_39.PDF.
Smith, B. (2003). Preserving tomorrow's memory: Preserving digital content for future generations. International Preservation News, 29, 4-10. Retrieved August 15, 2005, from http://www .ifla.org/VI/4/news/ipnn29.pdf.
Steenbakkers, J. (1999). Developing the depository of Netherlands electronic publications. Alexandria, 11(2), 93-99.
--. (2000). The NEDLIB guidelines: Setting up a deposit system for electronic publications (NEDLIB Report Series 5). Retrieved February 15, 2005, from http://www.kb.nl/coop/ nedlib/results/NEDLIBguidelines.pdf.
--. (2002). Preserving electronic publications. Information Services & Use, 22, 89-96.
--. (2004). Digital archiving: A necessary evil or new opportunity? Serials Review, 30, 29-32.
UNESCO. (2003). Guidelines for the Preservation of the Digital Heritage. Retrieved February 15, 2005, from http://unesdoc.unesco.org/images/0013/001300/130071e.pdf.
Werf, T. van der (2000). A process model: The deposit system for electronic publications (NEDLIB Report Series 6). Retrieved February 15, 2005, from http://www.kb.nl/coop/nedlib/ results/DSEPprocessmodel.pdf/.
Werf-Davelaar, T. van der. (1999). Long-term preservation of electronic publications: The NEDLIB project. D-Lib Magazine D-Lib Magazine is an on-line magazine dedicated to digital library research and development. Content of current and past issues are available free of charge. The publication is financially supported by the Defense Advanced Research Projects Agency (as part of the Digital , 5(9). Retrieved February 15, 2005, from http://www.dlib.org/dlib/september99/vanderwerf/09vanderwerf.html.
Wijngaarden, H. van, & Oltmans, E. (2004). Digital preservation and permanent access: The UVC for images (pp. 254-58). Proceedings of the Imaging Science & Technology 2004 Archiving Conference, San Antonio, Texas. Springfield, VA: Society for Imaging Science and Technology.
(1.) Information about SPARC is available at http://www.arl.org/sparc/; information about BioMed Central is available at http://www.biomedcentral.com/.
(2.) Examples of initiatives in digital archiving are PANDAS PANDAS Pediatric Autoimmune Neuropsychiatric Disorders Associated with Streptococcal Infections (http://www.nla.gov.an/padi/); JSTOR JSTOR Journal Storage (http://www.jstor.org); YEA (http://www.library.yale.edu/~okerson/yea/ frontmatter.pdf); NEDLIB (http://www.kb.nl/coop/nedlib/); PubMed Central PubMed Central is a free digital database of full-text scientific literature in biomedical and life sciences. It can be reached at .
It grew from the online Entrez PubMed biomedical literature search system. PubMed Central was developed by the U.S. (http://www. pubmedcentral .nib.gov/); DNEP and e-Depot (http://www.kb.n1/kb/resources/ frameset The HTML tag that divides a Web page into sections (frames). The following HTML divides the screen page into two horizontal frames with the top frame having 25% of the screen and the bottom frame having 75%. See frames.
<frameset rows=25%, 75%"> </frameset> re> _kenniscentrum .html); LOCKSS LOCKSS Lots of Copies Keep Stuff Safe (Stanford University) (http://lockss.stanford.edu/); D-Space (http://dspace.org/ index.html); E-Journal Archiving Program See archive program. (USA) (http://www.diglib.org/ preserve/ejp.htm); DOM Programme (British Library British Library, national library of Great Britain, located in London. Long a part of the British Museum, the library collection originated in 1753 when the government purchased the Harleian Library, the library of Sir Robert Bruce Cotton, and groups of manuscripts. ) (http://vincent.bl.uk/cgi-bin/htm_h1? DB=website&STEMMER=en&WORDS=digit+object+manag+&COLOUR=Olive&STYLE=s& URL URL
in full Uniform Resource Locator
Address of a resource on the Internet. The resource can be any type of file stored on a server, such as a Web page, a text file, a graphics file, or an application program. =http://www.bl.uk/about/policies/etmeetjul2003.html#muscat Muscat, Maskat, or Masqat (all: mŭs`kăt, mŭs`kət), city (1993 pop. 533,774), capital of Oman, SE Arabia, on the Gulf of Oman. It is flanked by rugged mountains. _highlighter high·light·er
1. A usually fluorescent marker used to mark important passages of text.
2. A cosmetic for emphasizing areas of the face, such as the eyes or cheekbones. first match); KOPAL (Die Deutsche Bibliothek) (http://www.langzeitarchivierung.de/); E-Archive (http://www.ithaka.org/e-archive/index.htm); and New Zealand New Zealand (zē`lənd), island country (2005 est. pop. 4,035,000), 104,454 sq mi (270,534 sq km), in the S Pacific Ocean, over 1,000 mi (1,600 km) SE of Australia. The capital is Wellington; the largest city and leading port is Auckland. (http://www.natlib.govt.nz/bin/news/pr?item=1085888952).
(3.) National programs to promote digital archiving include NDIIP NDIIP National Digital Information Infrastructure and Preservation , of the United States (http:// www.digitalpreservation.gov/); DPC DPC Department of Premier and Cabinet (Victoria, Australia)
DPC Dutch Power Cows
DPC Deferred Procedure Calls (Microsoft Windows NT 4. , of the United Kingdom (http://www.dpconline.org/graphics/index.html); and NESTOR Nestor, in Greek mythology
Nestor (nĕs`tər), in Greek mythology, wise king of Pylos; son of Neleus and father of Antilochus. , of Germany (http://www.langzeitarchivierung.de/).
(4.) A number of publishers have published their archiving policies online, including Elsevier (http://www.elsevier.com/wps/find/librariansinfo.librarians/ libr_policies#sdarchiving) ; Kluwer Academic Publishers (http://www.wkap.nl/prod/a/newsaboutkluwer#190503); BioMed Central Journal (http://www.biomedcentral.com/info/libraries/archive); Blackwell Publishing (http://www.blackwellpublishing.com/press/pressitem.asp? ref=83&site=1) ; and Taylor & Francis Group (http://www.taylorand francisgroup.com); and Oxford University Press (http://www.oup.co.uk/).
(5.) For information about the Trivulzio manuscript, see http://www.kb.nl/webexpo/trivulzioen.html.
(6.) The UVC can be downloaded via Alphaworks at http://www.alphaworks.ibm.com/tech/uvc.
(7.) The PATCH Consortium is comprised of Koninklijke Bibliotheek (The Netherlands), consortium leader; British Library (United Kingdom); UK National Archives (United Kingdom); Centre National d'Etudes Spatiales (France); Bibliotheque National de France (France); University of Leeds Organisation
The various schools, institutes and centres of the University are arranged into nine faculties, each with a dean, pro-deans and central functions:
(8.) See the important work of the project PRONOM on File Format Registry at http://www .nationalarchives.gov.uk/PRONOM/default.htm.
(9.) See http://www.digicult.info/pages/drr_themes.php.
(10.) Information about DARE can be found at http://www.darenet.nl/en/.
(11.) Information about NEDLIB can be found at http://www.kb.nl/coop/ nedlib/.
(12.) See http://www.kb.nl/hrd/dd/dd_onderzoek/dnep_ltp_study-en.html for more information.
(13.) The national library of Germany, Die Deutsche Bibliothek, is actually implementing DIAS for their digital archive: see http://www-306.ibm.com/software/success/ cssdb.nsf/CS/MCAG-65VEUX?OpenDocument&Site=default.
(14.) See http://www-5.ibm.com/nl/dias/for more information about DIAS.
(15.) The Reference Workstation (RefWS) or Reference Platform is a standard computer system configuration designated by the deposit library for installing and running electronic publications. The RefWS is suited for most publications appearing on the consumer market during a given period of time (Werf, 2000, p. 26). The KB considers the use of RefWS a temporary solution for guaranteeing access to archived CD-ROM CD-ROM: see compact disc.
in full compact disc read-only memory
Type of computer storage medium that is read optically (e.g., by a laser). publications.
(16.) For a comprehensive framework for Preservation Metadata (or PREMIS PREMIS Pesticide Residue Elimination Management Information Service
PREMIS Professional Real Estate Management Information System ), see http://www .oclc.org/research/projects/pmwg/.
Johan F. Steenbakkers, Director of e-Strategy and Property Management, Koninklijke Bibliotheek, National Library of the Netherlands The Koninklijke Bibliotheek (Dutch name; KB) is the national (royal) library of The Netherlands, based in The Hague. It was founded in 1798, and was given its current name by king Lodewijk Napoleon in 1806. , Prins Willem-Alexanderhof 5, P.O. Box 90407, NL-2509 LK, The Hague, The Netherlands, firstname.lastname@example.org. Johan F. Steenbakkers is Director of e-Strategy and Property Management at the Koninklijke Bibliotheek (KB), the national library of the Netherlands. In 1969 Johan graduated cum laude cum lau·de
adv. & adj.
With honor. Used to express academic distinction: graduated cum laude; 25 cum laude graduates. in Biology and Biochemistry at the University of Utrecht, the Netherlands. Until 1973 he performed scientific research on bio-membranes. He then started his career in the field of libraries and documentation. Johan joined the KB in 1987 and was involved in restructuring the library's organization. He also renovated the IT infrastructure of the KB and initiated several strategic digital library projects. He was the project coordinator of the European project NEDLIB (1998-2000), which focused on the functional and technical aspects of handling and preservation of electronic publications. Johan initiated the e-Depot of the KB, the infrastructure and organization for preserving digital information. He organized and managed the joint KB/IBM project (2000-2002) for the development of the deposit system DIAS, the first OAIS compliant archival system. He has published several articles about the development of the e-Depot and digital preservation in general. He is a member of the Task Force on Digital Repository Certification, initiated by RLG and the National Archives and Records Administration (NARA Nara (nä`rä), city (1990 pop. 349,349), capital of Nara prefecture, S Honshu, Japan. An ancient cultural and religious center, it was founded in 706 by imperial decree and was modeled after Chang'an (see Xi'an), the capital of T'ang China. ) in the U.S.