Printer Friendly
The Free Library
14,506,104 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Selecting Research Collections for Digitization: Applying the Harvard Model.


ABSTRACT

ONE OF THE MOST IMPORTANT CHALLENGES facing digital library planners is the

selection of research collections for digitization dig·i·tize  
tr.v. dig·i·tized, dig·i·tiz·ing, dig·i·tiz·es
To put (data, for example) into digital form.



dig
. The costs associated with creating digital resources are significant. Planners must develop selection criteria and procedures in order to ensure that limited time and resources are committed to projects to digitize To convert an image or signal into digital code by scanning, tracing on a graphics tablet or using an analog to digital conversion device. 3D objects can be digitized by a device with a mechanical arm that is moved onto all the corners.  the most significant collections with the highest probability of successful completion. Librarians This is a list of people who have practised as a librarian and are well-known, either for their contributions to the library profession or primarily in some other field.  at many academic libraries have developed selection criteria for the creation of digital collections. These criteria consider many of the same factors that go into the decision to license or purchase information resources (1) The data and information assets of an organization, department or unit. See data administration.

(2) Another name for the Information Systems (IS) or Information Technology (IT) department. See IT.
. However, there are additional considerations. Librarians at Harvard University Harvard University, mainly at Cambridge, Mass., including Harvard College, the oldest American college. Harvard College


Harvard College, originally for men, was founded in 1636 with a grant from the General Court of the Massachusetts Bay Colony.
 have written the most comprehensive guide to selecting research collections for digitization. In this article, the author applies the Harvard Model to a digitization project at Indiana University Indiana University, main campus at Bloomington; state supported; coeducational; chartered 1820 as a seminary, opened 1824. It became a college in 1828 and a university in 1838. The medical center (run jointly with Purdue Univ.  in order to evaluate the appropriateness of the model for use at another institution and to adapt the model to local needs.

INTRODUCTION

Indiana University's Bloomington Libraries launched its first digital initiatives in the early 1990s, but it was not until November 1997 that a coalition of university partners created the Digital Library Program (http://www.dlib.indiana.edu). The Indiana University Digital Library Program is dedicated to the selection, production, and maintenance of a wide range of high-quality networked resources for scholars and students at Indiana University and elsewhere. Building on a previous partnership with University Information Technology Services, it is a collaborative effort of the Indiana University Libraries, the Office of the Vice President for Information Technology, and the School of Library and Information Science A School of Library and Information Science (SLIS) is a university-based institution that provides a Master's degree or other advanced degrees associated with Library science, Information Science, or a combination of the two. . The goal of this collaboration is to capitalize on Cap´i`tal`ize on`   

v. t. 1. To turn (an opportunity) to one's advantage; to take advantage of (a situation); to profit from; as, to capitalize on an opponent's mistakes s>.
 the institutional capabilities of this university, focusing university resources on digital library projects that support the teaching and research of IU faculty, support the learning and research of IU students, and foster research about digital libraries. Although one objective of the program is to support existing digital initiatives, such as the VARIATIONS Project in the Music Library, another is to encourage new digital initiatives, including projects to digitize portions of the research collections throughout the eight campuses of Indiana University. In the two years since the DLP (Digital Light Processing) A data projection technology from TI that produces clear, readable images on screens in lit rooms. DLP is used in all types of projection devices, from data projectors that weigh only a few pounds to large rear-projection TVs to electronic  was created, we have begun digitizing "Digitizer" redirects here. For the computer device, see Digitizing tablet. For the digitizer in Tablet PC's, see Tablet PC.

Digitizing or digitization
 four research collections, two with internal funding and two with external funding, and are currently in the planning stages of a fifth project with partners from the Committee on Institutional Cooperation (CIC CIC

circulating immune complexes.

CIC Circulating immune complexes. See Immune complexes.
). We support digital operations to provide resources exclusively to affiliates of Indiana University, such as VARIATIONS and the DIDO Dido (dī`dō), in Roman mythology, queen of Carthage, also called Elissa. She was the daughter of a king of Tyre. After her brother Pygmalion murdered her husband, she fled to Libya, where she founded and ruled Carthage.  Image Bank and digital collections offered via the Web.

All academic institutions that are planning and implementing digitization projects confront issues related to selecting collections for digitization. With limited time and resources, libraries can only undertake a limited number of digitization projects, based on wise and expeditious ex·pe·di·tious  
adj.
Acting or done with speed and efficiency. See Synonyms at fast1.



ex
 choices. A number of academic libraries have developed criteria and models for selecting research collections for digitization, including Columbia (1998), Harvard (1998), University of California The University of California has a combined student body of more than 191,000 students, over 1,340,000 living alumni, and a combined systemwide and campus endowment of just over $7.3 billion (8th largest in the United States).  (1997), and Oxford University (Lee, 1999). The most comprehensive model is the work of Dan Hazen, Jeffrey Horrell, and Jan Merrill-Oldham, of Harvard University, published in the CLIR CLIR Council on Library and Information Resources
CLIR cross-language information retrieval
CLIR Connected Line Identification Restriction
CLIR Calling Line Identity Restriction
CLIR cross-lingual information retrieval
CLIR Calling Line Identification Restriction
 monograph mon·o·graph  
n.
A scholarly piece of writing of essay or book length on a specific, often limited subject.

tr.v. mon·o·graphed, mon·o·graph·ing, mon·o·graphs
To write a monograph on.
, Selecting Research Collections for Digitization (1998), referred to throughout this article as the Harvard Model. One of the attractive features of this monograph is that it includes a graphical matrix for decision making, summarizing the steps and questions outlined in the essay (see Figure 1). In order to evaluate this model, the author used it to reconsider re·con·sid·er  
v. re·con·sid·ered, re·con·sid·er·ing, re·con·sid·ers

v.tr.
1. To consider again, especially with intent to alter or modify a previous decision.

2.
 the first DLP digitization project, an internally-funded project to digitize the Lilly Library's Frank M. Hohenberger Photograph Collection.

[Figure 1 ILLUSTRATION OMITTED]

The Web site http://www.dlib.indiana.edu/collections/lilly/ hohenberger/index.html, referred to throughout this article, contains a part of the Hohenberger Collection. The purpose of this evaluation was to answer the following questions: Would using the Harvard Model have led to the decision to digitize this collection? Does the Harvard Model include the major factors that were actually used to reach the decision? How might the model be customized to provide more reliable guidance to project planners in the DLP?

COSTS OF DIGITIZATION

The decision to digitize a collection is similar to the decision to acquire an information resource through purchase or license agreement, but there are significant differences. For any selection, the selector (programming) selector - 1. In Smalltalk or Objective C, the syntax of a message which selects a particular method in the target object.

2. An operation that returns the state of an object but does not alter that state.
 evaluates the significance of the resource, its potential use, its relationship to other resources in the collection, its format, and its cost. For acquired resources, it is not necessary to include the costs of cataloging and overhead. However, these costs must be considered for any digitization project, and it is difficult to estimate the complete costs for a digitization project.

In a recent article in RLG RLG Research Libraries Group, Inc. (Dublin, OH)
RLG Ring Laser Gyro
RLG RedLightGreen Project
RLG Royal Laotian Government
RLG Resident Love Goddess
RLG Right, Let's Go
 DigiNews, Steve Puglia (1999) analyzes cost data from many digital projects. He found that the actual production costs differ significantly, depending on the category of material being digitized and whether or not textual tex·tu·al  
adj.
Of, relating to, or conforming to a text.



textu·al·ly adv.
 material will be processed to become keyword searchable in addition to image scanned. He also found that, for all projects, there are hidden costs. For image projects, without text encoding See encode. , the overall average production costs of $17.65 per image breaks down as follows:

1/3 the cost is digital conversion (32 percent overall)--adjusted average $6.15 per image.

Slightly less than 1/3 the cost is in metadata (1) (meta-data) Data that describes other data. The term may refer to detailed compilations such as data dictionaries and repositories that provide a substantial amount of information about each data element.  creation, including cataloging, description, and indexing (29 percent overall)--adjusted average $7.00 per image.

Slightly more than 1/3 the cost is in other activities, such as administration and quality control (39 percent overall)--adjusted average $10.10 per image.

However, the most illuminating il·lu·mi·nate  
v. il·lu·mi·nat·ed, il·lu·mi·nat·ing, il·lu·mi·nates

v.tr.
1. To provide or brighten with light.

2. To decorate or hang with lights.

3.
 figures might be the range in projected production costs reported for individual projects: digitizing--$0.25-$16.65; metadata creation--$0.75-$17.25; other--$0.45-$28.15; total--$1.85-$42.45. These figures do not include the cost of maintaining digital resources. Few sources exist for these data and Puglia found that costs vary widely. Puglia notes: "Often major IT infrastructure costs are budgeted separately from digitizing projects, and therefore the network upgrade and database development costs were not factored into the estimates for long-term maintenance for the digital images." One estimate of the cost of maintaining master files and online access files was $1.70 to $4.70 per image for the first ten years or 14 percent to 38 percent of the initial cost per image.

On the basis of cost alone, it is essential that academic institutions choose digital projects carefully based on an established set of criteria within a well-planned procedure. Even for institutions with ample funding for the creation of digital resources, there are limitations of time, technology, and expertise.

HOHENBERGER PHOTOGRAPH COLLECTION

In discussions with librarians and archivists at Indiana University about the new Digital Library Program (DLP), many of the first questions dealt with digitization of research collections. DLP planners realized the need for selection criteria for digitization as well as a procedure for making the actual selection. Many libraries and archives within the institution have suitable collections, so the task was to prioritize pri·or·i·tize  
v. pri·or·i·tized, pri·or·i·tiz·ing, pri·or·i·tiz·es Usage Problem

v.tr.
To arrange or deal with in order of importance.

v.intr.
 projects. In order to frame the discussion, participants were asked to identify their most significant collections, preferably pref·er·a·ble  
adj.
More desirable or worthy than another; preferred: Coffee is preferable to tea, I think.



pref
 ones in the public domain or with Indiana University-held copyrights. DLP staff posed series of questions about the collections and their users. These questions focused on the copyright status of the collection; its size; its popularity; its use; its physical condition; the formats included in the collection--i.e., text, images, audio, film; the existence of electronic finding aids; and more. One outcome of these meetings was to begin writing project proposals for the most promising collections. The goal was to have some projects in development when suitable external funding opportunities became available. The staff used the LC/ Ameritech Competition proposal outline in order to evaluate collections for digitization (Library of Congress, 1999).

As a result of these discussions and further evaluation, DLP project planners selected a collection for an internally funded project, images from the Frank M. Hohenberger Photograph Collection. Frank Hohenberger was a nationally-recognized photographer who lived and worked in the small town of Nashville, Indiana Nashville is a town in Brown County, Indiana, United States. The population was 826 at the 2000 census. The town is the county seat of Brown CountyGR6. The town is best known as the center of the Brown County Art Colony with a variety of specialty shops. , from 1917 until his death in 1963. Upon his death, Hohenberger bequeathed his entire photograph collection, totaling more than 9,000 images and personal papers, to the Lilly Library. Based on past use, a Lilly Library curator CURATOR, persons, contracts. One who has been legally appointed to take care of the interests of one who, on account of his youth, or defect of his understanding, or for some other cause, is unable to attend to them himself.
     2.
 selected 400 photographs, then an additional 100, for digitization. The digitization project began in spring 1998, and there are now 500 photographs on the site. The current plan is to outsource the digitization of the remainder of the photographs. The site also includes biographical bi·o·graph·i·cal   also bi·o·graph·ic
adj.
1. Containing, consisting of, or relating to the facts or events in a person's life.

2. Of or relating to biography as a literary form.
 information about Hohenberger and a 1933 article published about him in American Magazine The American Magazine was a periodical publication founded in June of 1906, stemming from failed publications purchased a few years earlier from publishing mogul Miriam Leslie. .

THE HARVARD MODEL

The model described in Selecting Research Collections for Digitization grew out of Hazen, Horrell, and Merrill-Oldham's work at Harvard University. A task force was appointed in late 1995, charged with drafting a white paper to help Harvard's librarians and curators plan digital projects. The author of this article was impressed im·press 1  
tr.v. im·pressed, im·press·ing, im·press·es
1. To affect strongly, often favorably:
 by the comprehensiveness of their model and their placement of the selection process into "the larger framework of collection building by focusing first on the nature of the collections and their use, and second, on the realities of the institutional context in which these decisions are made" (Hazen, Horrell, & Merrill-Oldham, 1998). Hazen, Horrell, and Merrill-Oldham note that the decision to digitize a collection is similar to the decision to purchase, microfilm A continuous film strip that holds several thousand miniaturized document pages. See micrographics.


Microfilm and Microfiche
, and withdraw library materials. However, this process is further complicated by a range of procedures and technologies with widely varying implications and costs. They concluded that the judgments made in selecting collections for digitization involve the following factors: "the intellectual and physical nature of the source materials Noun 1. source materials - publications from which information is obtained
source - a document (or organization) from which information is obtained; "the reporter had two sources for the story"
; the number and location of current and potential users; the current and potential nature of use; the format and nature of the proposed digital product and how it will be described, delivered, and archived; how the proposed product relates to other digitization efforts; and projections of costs in relation to benefits."

Overall, the Harvard Model seems to be more a planning model than a selection model, but this is by design. The authors note in their conclusion: "The process of deciding what to digitize anticipates all the major stages of implementation." By asking difficult questions about the collection and the proposed digitization project, librarians minimize the chances of making a costly error in judgment.

APPLYING THE HARVARD MODEL TO THE HOHENBERGER PROJECT

Hazen, Horrell, and Merrill-Oldham (1998) pose a series of questions in their essay. These questions are represented in their model, "Selection for Digitizing: A Decision-Making Matrix." The questions posed in the essay lay the groundwork for a plan of work should the collection under consideration be selected for digitization. The matrix includes nine broad questions that require a "yes" or "no" answer. Answering "no" to any question should stop the evaluation process for a given collection. Patently unsuitable collections will be eliminated early in the process, saving the effort of answering all other questions. Answering these questions requires considerable thought and investigation. One would want to apply it to a collection only after careful pre-selection.

One important issue is copyright. Formally, it lies outside the model, yet the authors call it "the place to begin." Following their advice, the author began with copyright, then used the questions posed in the essay to assess the decision to digitize the Hohenberger Collection.

Copyright

It was easy to resolve questions of copyright with regard to the Hohenberger Collection. The majority of the photographs are still covered by copyright, having been made between 1906 and the early 1960s. However, when Hohenberger bequeathed his collection to the Lilly Library, he assigned the copyrights to the Indiana University Foundation, which subsequently transferred them to the university. Thus, Indiana University could digitize the photographs and offer them on the Web without seeking permission. Of course, DLP staff now want to protect the university's interests, which is accomplished by offering only relatively low-resolution images on the Web and providing access to the high-resolution TIF files A file extension used for TIFF files; for example, image1.tif. See TIFF and extension.  only at the Lilly Library. Instead of worrying about seeking permissions, Lilly Library is concerned about infringements.

The Intellectual Nature of the Source Materials

The model begins with questions requiring highly subjective judgments, proceeds to more measurable judgments, then, with cost benefits, it becomes more subjective again. The first factor in the decision-making process requires a subjective judgment concerning the intellectual value of the collection under consideration. In making this determination, the authors pose a series of questions. Throughout this article, Hazen, Horrell, and Merrill-Oldham's specific questions will be given in italics.

Does the intellectual quality of the source material warrant the level of access made possible by digitizing? For the Hohenberger Collection project, planners relied on the .judgment of the curators at Lilly Library. At the first meeting of DLP staff and Lilly staff, this was one of three collections presented for consideration. Frank Hohenberger and his work as both a photographer and a journalist have been the subject of scholarly publications including articles in the Journal of Indiana History and two books published by the Indiana University Press Indiana University Press, also known as IU Press, is a publishing house at Indiana University that engages in academic publishing, specializing in the humanities and social sciences. It was founded in 1950. Its headquarters are located in Bloomington, Indiana. . In addition to the photographs, Hohenberger kept work-related diaries, recording the subjects of his photographs and anecdotes about them. These anecdotes formed the basis of character studies that he published in a column in the Sunday Indianapolis Star called "Down in the Hills `o Brown County" from 1923 to 1932 and from 1936 to 1954. For this work, he was posthumously post·hu·mous  
adj.
1. Occurring or continuing after one's death: a posthumous award.

2. Published after the writer's death: a posthumous book.

3.
 voted into the Indiana Journalism Hall of Fame in 1976.

Will digitization enhance the intellectual value of the material? The Web site with the digital images of the photographs allows users to scan the thumbnails or view higher-resolution images. It also provides captions with the images instead of separating them in a finding aid. The TIF files allow users to zoom To change from a distant view to a more close-up view (zoom in) and vice versa (zoom out). An application may provide fixed or variable levels of zoom. A display adapter may also have built-in zoom capability.  in on the image, revealing details that are impossible to detect in the photographic prints without intense magnification Magnification

A measure of the effectiveness of an optical system in enlarging or reducing an image. For an optical system that forms a real image, such a measure is the lateral magnification m
.

Will electronic access to a body of information add significantly to its potential to enlighten en·light·en  
tr.v. en·light·ened, en·light·en·ing, en·light·ens
1. To give spiritual or intellectual insight to:
, or are the original books, manuscripts, photographs, or paintings sufficient to the task? The first part of the Hohenberger Project was to mount an EAD-encoded finding aid. Previously, users were limited to using a printed finding aid. Even having the keyword-searchable finding aid without the digital images (and the site does not offer a significant percentage of the collection at this time) provides users with a valuable research tool. The collection is heavily used by genealogical ge·ne·al·o·gy  
n. pl. ge·ne·al·o·gies
1. A record or table of the descent of a person, family, or group from an ancestor or ancestors; a family tree.

2. Direct descent from an ancestor; lineage or pedigree.
 researchers looking for Looking for

In the context of general equities, this describing a buy interest in which a dealer is asked to offer stock, often involving a capital commitment. Antithesis of in touch with.
 photographs of their ancestors Ancestors
See also father; heredity; mother; origins; parents; race.

archaism

an inclination toward old-fashioned things, speech, or actions, especially those of one’s ancestors. Also archaicism. — archaist, n.
 or ancestral homes The Ancestral Home (Dom Ojczysty) is a political party in Poland, founded after the elections. It is a splinter of the League of Polish Families and led by Piotr Krutul. . The Web site allows searching by name and date, providing dramatically improved access to the collection over the paper inventory. Furthermore, catalogers are enhancing the inventory by adding Library of Congress subject headings The Library of Congress Subject Headings (LCSH) comprise a thesaurus (in the information technology sense) of subject headings, maintained by the United States Library of Congress, for use in bibliographic records.  to each photograph, thereby improving access to the collection.

To what extent will the combination or aggregation of original sources increase their value? Although Indiana University owns and plans to digitize Hohenberger's entire body of photographic work for this Web site, other images may be located in other collections. However, the DLP also plans to digitize other complementary photograph collections. The DLP is currently digitizing a collection of U.S. Steel The United States Steel Corporation (NYSE: X) is an integrated steel producer with major production operations in the United States and Central Europe. The company is the world's seventh-largest steel producer ranked by sales (see list of steel producers).  photographs from the Calumet Calumet, region, United States
Calumet (kăl`ymĕt'), industrialized region of NW Ind. and NE Ill., along the south shore of Lake Michigan.
 Regional Archive at Indiana University Northwest Academics
As of 2003, there were about 5,100 undergraduate and graduate students at IUN and about 360 full-time faculty. The university offers Indiana University degrees in more than 30 different academic programs.
 in Gary, Indiana. Steelmaker-Steeltown: U.S. Steel Photograph Collection, 1906-1941 consists of 1,900 photographs made between 1906 and 1941, documenting the early years of Gary, Indiana, and the steelworks there. The DLP hopes to work with libraries and archives throughout the state, such as the Indiana Historical Society The Indiana Historical Society is one of the nation's oldest and largest historical societies and describes itself as "Indiana’s storyteller". The Indiana Historical Society is located at 450 West Ohio St. Indianapolis, IN 46202. , to digitize other complementary collections and provide access to federated Connected and treated as one. See federated database and federated directories.  digital collections of images depicting people, places, and events across Indiana Across Indiana is a weekly 30 minute long documentary-style television program which covers places, people, history and culture across Indiana.

Hosted by Michael Atwood, Across Indiana
.

Current and Potential Users

Indiana University purchases resources to meet the needs of scholars across a broad range of disciplines. Bibliographers rarely make purchase decisions based on the number of potential users. However, the creation of digital resources varies in this regard. It is imperative to focus resources on collections that have the largest potential audience.

Are scholars now consulting the proposed source materials? Are the materials being used as much as they might be? The Hohenberger photographs are among the most heavily used resources in the Lilly Library. The curators identified this collection as a possible candidate for digitization because it is the subject of both scholarly and popular use. Many members of the general public request access to the collection. As a public university, providing information resources to the residents of Indiana is an important part of our mission. Project planners were convinced that providing digital access to the collection would increase its popularity.

Is current access to the proposed materials so difficult that digitization will create a new audience? The Lilly Library provides service to all users, but there are several obstacles to accessing the Hohenberger Collection. First, the library operates a limited number of hours and users must visit the library to see the Hohenberger photographs. Online access allows browsing See browse. , searching, and viewing digital surrogates twenty-four hours a day, seven days a week. Second, before creation of the online finding aid, users had to visit the library to use the finding aid, which provides inadequate and inconsistent access to individual images anyway. The online finding aid is more accessible and more useful. Third, users had to know that the collection exists and that it is at the Lilly Library. Through the Web, many people have found the photographs who were completely unaware of their existence. Since September 1998, the site has received an average of 1,600 hits per month from all over the world. Fourth, the site is reaching young people, including students of Indiana history, who would never have considered visiting the Lilly Library even if they knew that the Hohenberger photographs are available there.

Does the physical condition of the original materials limit their use? Because the collection is so popular, many of the photographic prints had begun to show signs of wear. The Lilly Library has negatives for most of the photographs, so it is possible to make new prints. However, the digital surrogates have reduced handling of the prints. They are still available for examination, but most users are content with the Web versions.

Are related materials so widely dispersed dis·perse  
v. dis·persed, dis·pers·ing, dis·pers·es

v.tr.
1.
a. To drive off or scatter in different directions: The police dispersed the crowd.

b.
 that they cannot be studied in context? Lilly Library curators believe that the library holds the most significant Hohenberger materials.

Will the proposed digital files be of manageable size and format? There are numerous standards for the digitization and delivery of photographs on the Web. Project technical staff consulted the guidelines guidelines,
n.pl a set of standards, criteria, or specifications to be used or followed in the performance of certain tasks.
 that the Library of Congress published for the American Memory American Memory is an Internet-based archive for public domain image resources, as well as audio, video, and archived Web content. It is published by the Library of Congress. The archive came into existence on October 13, 1994 after $13,000,000 was raised in donations.  Project and quickly concluded that the Hohenberger photographs would present no storage problems.

Will digitization address the needs of local students and scholars? The Lilly librarians assured DLP planners that the Hohenberger photographs would be of interest to students and faculty of Indiana University as well as the general public. However, it is probable that the digital collection is of greater interest to non-IU affiliates. For a public university, that may be a positive factor.

Actual and Anticipated Nature of Use

How do scholars use the existing source materials? What approach to digitization will facilitate their work? Project planners considered this project after viewing numerous photograph collections on the Web. Technical staff were convinced that the Web, coupled with the high-resolution images available via the campus network, would provide high enough fidelity to the originals to satisfy the needs of most users. Moreover, the thumbnails provide users with a quick way to browse (1) To view the contents of a file or a group of files. Browser programs generally let you view data by scrolling through the documents or databases. In a database program, the browse mode often lets you edit the data. See Web browser.  the available digital images, while the complete finding aid provides information about the other photographs available at the Lilly Library. One service option might be digitizing on demand for remote users who would like to see an image that has not yet been digitized.

Will digitization increase the utility of the source materials? Will it enable new kinds of teaching or research? Do scholars agree that the proposed product will be useful? One of the goals of the project is to increase use of the collection in teaching. An intern intern /in·tern/ (in´tern) a medical graduate serving in a hospital preparatory to being licensed to practice medicine.

in·tern or in·terne
n.
 from the School of Library and Information Science has created learning activities designed for grades 4-8 using the Hohenberger photographs to teach visual literacy Visual literacy is the ability to interpret, negotiate, and make meaning from information presented in the form of an image. Visual literacy is based on the idea that pictures can be “read” and that meaning can be communicated through a process of reading. . These activities will be tested with students in 2000. Project planners also considered the possibility of working with a photography professor on campus to use the images with his beginning photography students. However, project planners did not consult with teachers or faculty while considering the collection for digitization. Perhaps this should have been a factor in the selection process. Instead, planners relied on the proven popularity of the collection.

Are there other scholars, librarians, and archivists who can collaborate to create a useful product? Given the broad appeal of the collection, scholars, librarians, and archivists throughout the state and perhaps regionally could contribute to the project.

The Format and Nature of the Digital Product

What critical features of the source material must be captured in the digital product? Are very high resolution copies, accurate rendition ren·di·tion  
n.
1. The act of rendering.

2. An interpretation of a musical score or a dramatic piece.

3. A performance of a musical or dramatic work.

4. A translation, often interpretive.
 of color not of the white race; - commonly meaning, esp. in the United States, of negro blood, pure or mixed.

See also: Color
, a seamless combination of images and text, or other qualities considered essential? Project planners did not anticipate that the Hohenberger photographs would present special digitization problems. The DLP Visual Resources Specialist confirmed this initial assessment.

If the original sources are to be retained, can they withstand the digitization process? This was not an issue in the Hohenberger project. Upon the recommendation of the Lilly staff, project members planned to digitize duplicate negatives A negative reproduced from a negative or diapositive.  that are in good condition.

What type of hardware should be used for conversion? All photographs in the collection are black and white. There are established standards for digitizing black and white photographs. The Visual Resources Specialist identified possible scanners during the early planning stages.

Will a digitized sample meet users' needs? If so, how should the sample be constructed? The goal of the Hohenberger project is to digitize the entire collection of photographs and allow users complete access. However, a Lilly Library curator selected 500 images for a pilot project. This group constitutes highlights from Hohenberger's body of work, including representative and popular images.

Will the information resources upon which the project is based continue to grow? No, the collection is believed to be complete.

How will users navigate (1) "Surfing the Web." To move from page to page on the Web.

(2) To move through the menu structure in a software application.
 within and among digital collections? The finding aid is based on Hohenberger's organization scheme, which has value as well as drawbacks. Hohenberger organized his photographs for retrieval by size. This scheme is not particularly informative to users, so the finding aid offers keyword searching as well as browsing by Hohenberger's categories. Some of the categories are topical topical /top·i·cal/ (top´i-k'l) pertaining to a particular area, as a topical antiinfective applied to a certain area of the skin and affecting only the area to which it is applied.

top·i·cal
adj.
 but most are not, so project planners proposed subject enhancements to the finding aid. It has been a challenge to retain important information from the artist while providing additional information that will improve access to individual images.

Describing, Delivering, and Retaining the Digital Product

How will users know the digital file exists? The metadata specialist on the Digital Library Program team created a plan for providing access to the collection. She created a collection-level MARC record for our online catalog Similar to an online library or databases in the information storage respect, ‘’’online catalogs’’’ allow potential customers to browse a company’s items for sale from a different location using the internet.  with PURLs for the collection home page and the finding aid.

How can the digital product best be delivered to users? The project was designed for Web delivery from the beginning. Project planners never considered CD-ROM CD-ROM: see compact disc.
CD-ROM
 in full compact disc read-only memory

Type of computer storage medium that is read optically (e.g., by a laser).
, near-line, or off-line storage. However, once the entire collection has been digitized, project staff may learn that only a small percentage of the images are actually being used regularly; this issue could be reexamined.

Who will be authorized au·thor·ize  
tr.v. au·thor·ized, au·thor·iz·ing, au·thor·iz·es
1. To grant authority or power to.

2. To give permission for; sanction:
 to use the digital resource and under what circumstances CIRCUMSTANCES, evidence. The particulars which accompany a fact.
     2. The facts proved are either possible or impossible, ordinary and probable, or extraordinary and improbable, recent or ancient; they may have happened near us, or afar off; they are public or
? The university always intended to provide access to this resource free of charge on the Web. However, the copyright statement specifies how the images may be used. They are not intended for publication, but the university relies on their resolution to prevent misuse. The Lilly Library sells reprints of the images for private use and licenses them for publication. This information is on the Web site accompanied by an order form.

How will the integrity of the digitized data be ensured? Project planners did not consider this factor while considering the collection for digitization or during project planning project planning - project management .

Particularly for digital products created to meet local demand, is the existing technology infrastructure adequate? This was not an issue with the Hohenberger Project, since the Digital Library Program has the necessary infrastructure to support Web delivery of this collection.

What are the long-term intentions for the digital file? Indiana University intends to retain the files indefinitely in·def·i·nite  
adj.
Not definite, especially:
a. Unclear; vague.

b. Lacking precise limits: an indefinite leave of absence.

c.
 and will create a plan for the longevity longevity (lŏnjĕv`ĭtē), term denoting the length or duration of the life of an animal or plant, often used to indicate an unusually long life.  of all Digital Library Program collections.

Is the long-term preservation of deteriorated materials a project goal? Protecting the prints and negatives is a goal of the project. It was anticipated that the digital files would reduce handling of the originals and that most users would not require access to them for their information needs.

Relationships to Other Digital Efforts

Have the materials proposed for digitization already been converted to electronic form? No one had digitized a significant number of Hohenberger photographs.

Can cooperative digitization efforts bring together a cohesive cohesive,
n the capability to cohere or stick together to form a mass.
 body of material that would otherwise remain disassociated? Not in this case but, as noted above, DLP planners are interested in creating complementary collections of photographs, working in cooperation with other institutions.

Costs and Benefits

Project planners did not conduct an extensive cost/benefit analysis for the Hohenberger Project. However, technical staff knew from preliminary investigations that the cost of storage for the grayscale In computing, a grayscale or greyscale digital image is an image in which the value of each pixel is a single sample. Displayed images of this sort are typically composed of shades of gray, varying from black at the weakest intensity to white at the strongest, though in  images at the proposed resolutions would not be especially high.

Who will benefit from the proposed digital product? One of the important beneficiaries of the proposed digital project would be Lilly Library staff. At times they have been overwhelmed o·ver·whelm  
tr.v. o·ver·whelmed, o·ver·whelm·ing, o·ver·whelms
1. To surge over and submerge; engulf: waves overwhelming the rocky shoreline.

2.
a.
 with requests for access to the Hohenberger photographs. Project planners were interested in providing relief for the staff as well as improving access to users and protecting the photographic prints.

Is the intellectual value of the proposed product commensurate com·men·su·rate  
adj.
1. Of the same size, extent, or duration as another.

2. Corresponding in size or degree; proportionate: a salary commensurate with my performance.

3.
 with the expense? This is a difficult judgment to make. Project evaluators are asked to decide whether this project will have the same impact that another project might have. It is impossible to know whether this is the best possible project, so evaluators must be satisfied with the judgment that it will have a significant impact on users and the condition of an important collection. In some cases, alternative methods of reformatting would be considered here. With regard to the Hohenberger Collection, digitization is the only logical reformatting option for delivery via the Web.

Could an acceptable product be created at lower cost? The digitization plan is as inexpensive as possible given the nature of the collection. Because the project plan relies on the prints and negatives for back-up, image technicians are scanning for access only. For another project to digitize the university's Hoagy Carmichael Noun 1. Hoagy Carmichael - United States songwriter (1899-1981)
Carmichael, Hoagland Howard Carmichael
 collection, technicians are scanning twice, once for archival purposes with standard settings and once for access, manipulating the image to improve its appearance on the Web if necessary. To reduce costs, planners eliminated the archival scan from the Hohenberger Project.

How will the proposed project address the long-term costs associated with digital files? This question asks project evaluators to predict whether a digitization project might result in cost savings in another part of the institution. With regard to the Hohenberger Project, this is difficult to predict. However, planners anticipate that a digital collection might save staff costs in the Lilly Library and produce revenues by publicizing pub·li·cize  
tr.v. pub·li·cized, pub·li·ciz·ing, pub·li·ciz·es
To give publicity to.

Noun 1. publicizing - the business of drawing public attention to goods and services
advertising
 the collection. The Lilly Library sells photographic prints from the collection and licenses them for publication. The digital collection might lead to increased orders for reprints or reproduction and the digital files enable those prints to be made more easily and more cost-effectively. Staff can send the digital file (s) to Photographic Services, saving time as well as wear to the negative.

Can external funding be secured to support the proposed project? Although some digitization projects are so large that they require external funding, the Hohenberger Project was designed for internal funding. The concept was to select a small percentage of the photographs in order to learn the process of digitizing photographs for Web delivery, then consider a grant-funded project to digitize the remainder of the collection. Options for grant opportunities figured into the selection decision. If the collection had no potential for outside funding, it is doubtful that the DLP would have undertaken the pilot project.

ACTUAL DECISION-MAKING PROCESS

To evaluate the Hohenberger Photograph Collection for digitization, project planners actually used a process similar to the Harvard Model based on the proposal outline for the Library of Congress/Ameritech Competition (http://memory.loc.gov/ammem/award/index.html). The DLP staff had recently submitted a proposal to the competition and were familiar with its content. The three-year program ended with the 1998/99 competition, but the LC/Ameritech Competition used a two-part evaluation process that provides a framework for planning a digitization project, especially one with preservation goals. For the competition, the first group of evaluators assessed content issues:

"Significance of the collection's content for understanding United States United States, officially United States of America, republic (2005 est. pop. 295,734,000), 3,539,227 sq mi (9,166,598 sq km), North America. The United States is the world's third largest country in population and the fourth largest country in area.  history and culture, as well as its breadth of interest and utility to students and the general public."

"Availability and usability How easy something is to use. Both software and Web sites can be tested for usability. Considering how difficult applications are to use and Web sites are to navigate, one would wish that more designers took this seriously. See user interface and usability lab.  of aids to intellectual access that can be integrated into the American Memory resource."

The second group of evaluators assessed technical issues:

"Technical and administrative viability of the project's plan of work in relation to the scope of the project."

Although project planners did not intend to submit a proposal to the LC/ Ameritech competition to digitize the Hohenberger photographs, using that outline for the content evaluation would provide a framework for the collection selection and preliminary project planning. In addition to collection factors, project planners also considered intellectual access and institutional factors. With regard to intellectual access, the primary considerations were that we had a finding aid in electronic form (a word processing word processing, use of a computer program or a dedicated hardware and software package to write, edit, format, and print a document. Text is most commonly entered using a keyboard similar to a typewriter's, although handwritten input (see pen-based computer) and  file) and that we have expertise in creating EAD-encoded finding aids. Although it would have been possible to create the finding aid from a paper inventory, this step would have slowed the project timeline considerably. Speed was a factor in selecting a collection for digitization. Of course, the additional work would have also resulted in additional costs.

The factors that project planners considered in selecting the Hohenberger photographs for digitization can be summarized as follows:

Collection Factors

* Description--How large is the collection? What is its content?

* Significance--Why is it important?

* Audience and Users--Who uses it now? Who might use it on the Web?

* Preparation and Preservation of the Collection--What preparation would the collection require for digitization? What are the preservation issues?

* Ownership, Privacy, and Copyright Issues--Who holds the copyright? How does the university want to protect its copyright?

* Intellectual Access--What does the finding aid include? How difficult would it be to convert to an EAD EAD Ensino A Distancia (Brazil)
EAD Encoded Archival Description (DTD for SGML)
EAD Employment Authorization Document (US INS)
EAD Exposure At Default
 finding aid?

Institutional Factors--Added to the LC/Ameritech outline

* Collection Format--Photographs--The Digital Library Program wanted to digitize all or a selection from an archival photograph collection to gain the experience and develop expertise.

* Indiana Connection--The collection is the work of a local artist with a national reputation. Indiana University owns the collection and controls the copyrights. We are the logical creators of a digital collection of Hohenberger's work.

* Benefits to Lilly Library Staff--The online finding aid and access to the digital files would significantly ease the workload of Lilly Library staff in providing access to the collection.

* Context--Ample supplemental material is available to add context to the Web site. Text from secondary sources could be added to the site, or additional archival material, such as selections from Hohenberger's diaries and his newspaper column, contributing richness to the resource.

The LC/Ameritech proposal outline worked well for both the selection process and the preliminary planning. The Hohenberger Project proceeded as planned and has met or exceeded expectations with regard to user response.

EVALUATING THE HARVARD MODEL

The Harvard Model would have resulted in the DLP staff making the decision to digitize the Hohenberger Collection. The answers to the nine major questions in the model matrix were "yes." The more detailed questions posed in the essay would have produced more detailed information and required more careful thought.

The Harvard Model proved to be a valuable evaluative tool. However, the author encountered two problems in using it. First, Selecting Research Collections for Digitization (Hazen, Horrell, & Merril-Oldham, 1998) concludes with a one-page graphical summary of the process, "Selection for Digitization: A Decision-Making Matrix." This matrix presents the nine major questions and decision points, but it is impossible to align align (līn),
v to move the teeth into their proper positions to conform to the line of occlusion.
 the issues and questions in the text of the essay with this graphical representation. The steps were in a different order and the questions in the matrix did not correspond to the questions in the text. The author began working with the matrix but soon abandoned it for the questions in the text. Part of the difficulty may be that the matrix attempts to pose "yes" and "no" questions that would stop the project at any step. In reality, the questions are more likely to result in "maybe" rather than "yes" or "no" responses.

The second problem relates to the level of detail in the model. For many projects, the actual selection process would probably stop with an analysis of the content and finding aid issues. It would be necessary to proceed to the technical planning of a complex project in order to determine the cost-effectiveness of the proposed project. However, the Hohenberger Project did not require this level of analysis. The questions related to copyright, the intellectual nature of the source materials, current and potential users, and actual and anticipated nature of use would have provided project planners with enough information to make a decision. However, with a more complex project, selectors might have needed the additional input that only technical inquiry can provide. One recommendation might be to develop a two-tiered decision-making process, one for simpler projects that involve formats and technologies with which the institution has experience, and one for more complex projects that involve multiple formats and unfamiliar technologies. However, for any project, it is difficult to make a final decision without undertaking preliminary planning.

CONCLUSION

The vast number of worthy collections that should be considered for digitization require that digital library program planners establish criteria and procedures for selecting research collections for digitization. Each institution must develop its own criteria for selecting collections for digitization based on a standard set of criteria with adjustments and additions based on local needs. The Harvard Model provides an excellent foundation for creating a local adaptation. It is comprehensive, yet flexible. At Indiana University there may exist a need for a simpler version and a more complex version with a graphical flowchart representation. There may be decision points along the way, with a major decision point coming before preliminary technical planning and one coming afterward af·ter·ward   also af·ter·wards
adv.
At a later time; subsequently.

Adv. 1. afterward - happening at a time subsequent to a reference time; "he apologized subsequently"; "he's going to the store but he'll be back here
. A collection may pass the content test and fail the technical test. Additional selection criteria would emphasize Indiana history and culture; local technical expertise, such as the digitization of music; and local language expertise, such as Russian and Tibetan. By customizing an existing model for selecting research collections for digitization, institutions can maximize the probabilities of spending resources wisely, preserving valuable collections, and making them more accessible to users at home and throughout the world.

REFERENCES

Columbia University Columbia University, mainly in New York City; founded 1754 as King's College by grant of King George II; first college in New York City, fifth oldest in the United States; one of the eight Ivy League institutions. . (1998). Selection criteria for digital imaging projects. Retrieved January 31, 2000 from the World Wide Web: http://www.columbia.edu/cu/libraries/digital/criteria.htm.

Harvard University. (1998). Selection for digitization. Retrieved January 31, 2000 from the World Wide Web: http://preserve.harvard.edu/resources/digitization/selection.html.

Hazen, D.; Horrell, J.; & Merrill-Oldham, J. (1998). Selecting research collections for digitization. Washington, DC: Council on Library and Information Resources. Retrieved January 31, 2000 from the World Wide Web: http://www.clir.org/pubs/reports/hazen/pub74.html.

Lee, S. D. (1999). Assessment criteria for digitization. Oxford, England: Oxford University. Retrieved January 31, 2000 from the World Wide Web: http://www.bodley.ox.ac.uk/scoping/assessment.html.

Library of Congress. (1998). The Library of Congress/Ameritech 1998/99 guidelines, National Digital Library competition. Washington, DC: LC. Retrieved January 31, 2000 from the World Wide Web: http://memory.loc.gov/ammem/award/guide98.html.

Puglia, S. (1999). The costs of digital imaging projects. RLG DigiNews, 3(5). Retrieved January 31, 2000 from the World Wide Web: http://www.rlg.org/preserv/diginews/diginews35.html#feature.

University of California. (1997). University of California selection criteria for digitization. Retrieved January 31, 2000 from the World Wide Web: http://www.libnet.sh.cn/diglib/digselec.htm.

Kristine R. Brancolini, Digital Library Program, Indiana University Libraries, Main Library E170, 1320 E. Tenth Street, Bloomington, IN 47405

KRISTINE R. BRANCOLINI is Associate Director, Digital Libraries Program, and Librarian (1) A person who works in the data library and keeps track of the tapes and disks that are stored and logged out for use. Also known as a "file librarian" or "media librarian." See data library.

(2) See CA-Librarian.
 for Digital Collections and Services at Indiana University--Bloomington. She has also been the Librarian for Media and Film Studies since 1983. Recent digital projects include The Hoagy Carmichael Collection <http://www.dlib.indiana.edu/collections/hoagy>, a multimedia project funded by a National Leadership grant from the Institute of Museum and Library Services The Institute of Museum and Library Services is an independent agency of the United States federal government. It is the main source of federal support for libraries and museums within the United States.  (IMLS IMLS Institute of Museum and Library Services
IMLS Institute for Museum and Library Services (US)
IMLS Institute of Medical Laboratory Sciences
), and Steelmaker-Steeltown: U.S. Steel Photograph Collection, 1906-1941 <http://www.dlib.indiana.edu/new/ steel.html>. Her other research interests include media resources and multimedia collections and services in academic libraries. Recent papers include "A Bibliographic bib·li·og·ra·phy  
n. pl. bib·li·og·ra·phies
1. A list of the works of a specific author or publisher.

2.
a.
 Essay on Using Film to Teach about Peace Studies and Structural Violence," in Insights from Film into Violence and Oppression The offense, committed by a public official, of wrongfully inflicting injury, such as bodily harm or imprisonment, upon another individual under color of office.

Oppression, which is a misdemeanor, is committed through any act of cruelty, severity, unlawful exaction, or
: Shattered Dreams "Shattered Dreams" is a single by Russian pop star Sergey Lazarev.

This is his first solo thing outside Soviet zone and it was the first international single released in UK properly. This is also a cover of Johnny Hates Jazz and it is rearranged.
 of the Good Life (Wesport, CT: Praeger Publishers, 1998), and "Media Reference Sources in Academic Libraries," in Reference Librarian (Number 65, 1999).
COPYRIGHT 2000 University of Illinois at Urbana-Champaign
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2000, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:BRANCOLINI, KRISTINE R.
Publication:Library Trends
Geographic Code:1USA
Date:Mar 22, 2000
Words:6244
Previous Article:Collecting Full-Text CD-ROMs in Literature: Theory, Format, and Selection.(humanities library purchasing)
Next Article:The Impact of Electronic Resources on Serial Cancellations and Remote Storage Decisions in Academic Research Libraries.
Topics:



Related Articles
Forty-five years after Lamont: the university undergraduate library in the 1990s.(The Library and Undergraduate Curriculum)
Introduction.(Letter Thomas Nisonger, professor)
The Impact of Electronic Resources on Serial Cancellations and Remote Storage Decisions in Academic Research Libraries.
Temple dancing in Philadelphia. (News).(Mary E. Edsall, Philadelphia Dance Collection at Temple University)
The economic behavior of academic research libraries: toward a theory. (Academic Libraries).(Brief Article)
An integrated framework for developing e-business projects.
Yes, let's get our libraries online.(My View)
Introduction.
GIS collection development within an academic library.
IT news and products; Kirtas Technologies signs digitization contract with Microsoft.(SOFTWARE WORLD DIGEST)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles