Intellectual Access to Images.ABSTRACT CONVENIENT IMAGE CAPTURE TECHNIQUES, inexpensive storage, and widely available dissemination methods have made digital images a convenient and easily available information format. This increased availability of images is accompanied by a need for solutions to the problems inherent in indexing them for retrieval. Unfortunately, to date, very little information has been available on why users search for images, how they intend to use them, as well as how they pose their queries, though this situation is being remedied as a body of research begins to accumulate. New image indexing methods are also being explored. Traditional concept-based indexing uses controlled vocabulary Controlled vocabularies are used in subject indexing schemes, subject headings, thesauri and taxonomies. Controlled vocabulary schemes mandate the uses of predefined, authorised terms that have been preselected by the designer of the controlled vocabulary as opposed to natural or natural language to express what an image is or what it is about. Newly developed content-based techniques rely on a pixel-level interpretation of the data content of the image. Concept-based indexing has the advantage of providing a higher-level analysis of the image content but is expensive to implement and suffers from a lack of interindexer consistency due to the subjective nature of image interpretation. Content-based indexing is relatively inexpensive to implement but provides a relatively low level of interpretation of the image except in fairly narrow and applied domains. To date, very little is known about the usefulness of the access provided by content-based systems, and more work needs to be done on user needs and satisfaction with these systems. An examination of a number of image database systems shows the range of techniques that have been used to provide intellectual access to image collections. INTRODUCTION With the rapid development of computing technologies, particularly in storage, display, and telecommunications, access to digital images has become widespread. At the same time, the ease with which images can be incorporated into software packages for display, publication, and dissemination has increased the perceived information need of users for images. This greatly increased need for, and access to, images has focused attention on the problems inherent in image description, particularly from the perspective of image indexing and retrieval. Researchers in the fields of library and information science, computer science, medical informatics medical informatics, n the field of information science concerned with the analysis and dissemination of medical data through the application of computers to various aspects of health care and medicine. , cognitive science cognitive science Interdisciplinary study that attempts to explain the cognitive processes of humans and some higher animals in terms of the manipulation of symbols using computational rules. , and so on, have brought their different points of view to address the problems inherent in image indexing. The development and use of controlled vocabularies for image indexing has always been an area of interest, and the exploration of natural language for image description is an area of ongoing research. A relatively new research area, drawing on the pixel-level data that comprise digital images, is content-based retrieval, which automatically extracts index features such as color, texture, and shape from the image file. Other researchers are examining the potential of combined sources of evidence using natural language text, such as captions, to assist in the automatic interpretation of digital images. A welcome development in the study of image access is the focus of a number of researchers on questions underlying users' access to images-i.e., how images are perceived and described, what information needs exist, and how users of pictorial information determine what is useful to them. The answers to these questions will inform the design of a new generation of image retrieval An image retrieval system is a computer system for browsing, searching and retrieving images from a large database of digital images. Most traditional and common methods of image retrieval utilize some method of adding metadata such as captioning, keywords, or descriptions to the systems that will better meet the needs of users by employing technologies in useful and creative ways. The discussion will focus on the problems inherent in image description and access, with a perspective on traditional and new solutions. Recent developments in intellectual access to images will be surveyed and contrasted with software-based analysis of image content. A more detailed survey of this research is given by Rasmussen (1997). Lancaster (1998) extends the discussion of indexing to multimedia sources. CHALLENGES IN IMAGE INDEXING Images bring with them problems of description and access more complex than those of text. While text can be indexed manually, it can also be retrieved directly using, as access points, the natural language that it contains. While this retrieval is imperfect, it does provide a means of access independent of human indexing. Digital images are composed of pixels arranged in an infinite variety of patterns and, in general, it is impossible to predict the particular pattern that would match an information need. At present, only relatively low-level attributes of images can be queried directly (for instance, color and texture), and these attributes do not carry the meaning of the image with them. Even where human indexing of the image is undertaken, it is difficult to reach agreement on the content and meaning of the image, or on what aspects are appropriate for indexing. The same image may mean different things to different people and may be used to project a different meaning at different times depending on the way it is used or the aspect that is the focus of attention or the context it is chosen to illustrate. In general, it is easier to determine a picture's content than to interpret what it is about, and this distinction has engaged many scholars. Krause (1998) distinguishes between "hard" indexing (the description of what an indexer can see in the frame), and "soft" indexing ("aboutness," the image as stimulus). He says: We know that pictures provoke reaction, stimulate ideas, rekindle memories. They are powerful instruments in story telling, teaching, propaganda, and numerous other fields. Therefore, it is important that libraries provide access to images which illustrate ideas, even abstract ones like hunger, or the experience of hunger.... If we can index this aspect of the picture, we make it easily available to users requiring such an image; we make our collection more accessible. (pp. 73-74) A number of authors (such as Shatford, 1986) have based their analysis of image indexing on the theories of the art historian Panofsky (1939), who identified three levels of meaning in works of art. At the first, or pre-iconographic, level, subject matter was designated as factual ("ofness") or expressional ("aboutness"), and based on the objects and events in an image as it could be interpreted through everyday experience. At the second, or iconographic i·co·nog·ra·phy n. pl. i·co·nog·ra·phies 1. a. Pictorial illustration of a subject. b. The collected representations illustrating a subject. 2. , level, interpretation requires some cultural knowledge of themes and concepts (not "a sailor" but "Ulysses"). The third or iconological level requires interpretation at a sophisticated level using world and cultural knowledge plus a deeper understanding of the history and background of the work. Shatford (1986) suggests that this third level cannot be indexed with any degree of consistency. Svenonius (1994) points out that "indexing aboutness at the iconographic level is equally problematic" (p. 603), since what is symbolized is not always evident, nor is there always a simple referent ref·er·ent n. A person or thing to which a linguistic expression refers. Noun 1. referent - something referred to; the object of a reference to it. Shatford (1986) uses Panofsky's levels of meaning to explore the kinds of subjects an image might have, proposing "Generic Of, "Specific Of," and "About" with facets answering the questions Who? What? When? and Where? Interestingly, a preliminary attempt in the Hulton study (described later in this article) to categorize cat·e·go·rize tr.v. cat·e·go·rized, cat·e·go·riz·ing, cat·e·go·riz·es To put into a category or categories; classify. cat queries posed to an image database according to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. Panofsky's levels of meaning was not successful, suggesting that they did not translate well from the area of Renaissance art to a more general domain (Enser & McGregor, 1993). Markey (1984) studied interindexer consistency by nonspecialists using a free vocabulary to index pictorial works of art, finding terminology consistency scores of 7 percent and concept consistency scores of 13 percent. While interindexer consistency has always been problematic, even in text, these figures do serve to illustrate the imperfect level of agreement in subject analysis of images. Clearly, image analysis can be carried out at many levels, from the primitive (What colors are present? What shapes?) to more analytical but general (What objects appear in the image? What is this a picture of?.) to a more culturally dependent interpretation (What specific individual or thing is portrayed? What is the mood? What metaphor or lesson is presented?). Choosing an approach to image indexing may require a compromise based on what the system is capable of delivering and what the users of the system would like in an optimal retrieval environment. The question of user need for images is at present relatively little studied. STUDIES OF USERS' IMAGE QUERIES Before considering how image access has been provided, it is worth considering what we know about users' information needs and how users present queries to image databases. Probably the most extensive study to address this question is the "Hulton Study," Enser and McGregor's (1993) examination of the queries addressed to the Hulton Deutsch picture collection. The Hulton Deutsch Collection is a major picture archive of news and current affairs current affairs npl → (noticias fpl de) actualidad f current affairs current npl → (questions fpl d')actualité f , historical landscapes and portraits, and other collections used primarily by the press. Enser and McGregor examined 2,722 requests and found that they could be mapped into four categories along two dimensions: unique ("Kenilworth Castle Kenilworth Castle is in Kenilworth, Warwickshire, England. Historically the Castle was contained within the Forest of Arden. A fortification has existed on the site from Saxon times. ") or non-unique ("dinosaurs") and refined (e.g., specified by activity, time period, and so on) or nonrefined. An example of a query in the unique refined category is "Edward VIII Edward VIII, 1894–1972, king of Great Britain and Ireland (1936), known in later years as the duke of Windsor; eldest son of George V. He attended the naval colleges at Osborne and Dartmouth and Magdalen College, Oxford. In 1911 he was made prince of Wales. looking stupid" and in the nonunique refined category is "couples dancing the Charleston." Interestingly, only requests for unique unrefined subjects were easily satisfied by the Gibbs-Smith classification scheme being used by the picture archive (Enser, 1995). The Hulton Study was subsequently extended to seven additional picture libraries/ archives in the United Kingdom, five of which were concerned with still images (Armitage & Enser, 1997). They arrived at a mode and facet analysis adopted from Panofsky (1939) and refined by Shatford (1986). In a smaller-scale study, Hastings (1995) examined the queries of a specific user group--i.e., art historians--to a collection of Caribbean art images. She identified queries at four levels of complexity, ranging from simple level one queries for who, what, where to level four queries for meaning, subject, and why? Some of the simpler queries could be answered without images while, at the most complex level, text and image alone was sometimes not sufficient to answer the queries. Another interesting study reviewed queries presented to NLM's Prints and Photographs Collection. Keister (1994) found that descriptions of concrete image elements made up a significant proportion of picture requests, and these elements were worth cataloging in some detail. However, she cites examples in which images are described in terms of the visual message of the picture--e.g., a "warm picture of a nurse, mother, and baby" (p. 10). Word-images based on a particular communication need arose frequently, and users often described and used images in ways different from their original intent. While there begins to be a body of research addressing the question of image information needs, the studies are fragmented. The Hulton Study remains the only study of its scale to examine information needs in a nondomain-specific environment. IMAGE ATTRIBUTES There is as yet no general agreement on what attributes of an image should be indexed. Shatford (1986) indicates that it is much easier toindex an image for a collection with some specific use than one for use by a heterogeneous group. In the latter case, the subject orientation of users and the information need that will lead them to pose queries to the collection cannot be anticipated, and hence the dimensions along which the collection should be indexed cannot be predicted. Research by Jorgensen (1998), in which participants were asked to write descriptions of color not of the white race; - commonly meaning, esp. in the United States, of negro blood, pure or mixed. See also: Color images, suggested four perceptual classes as a minimal framework for image indexing: objects (the largest set in her study), people, color, and location. Content/story attributes were also identified as significant for image description. Jorgensen (1998) points out the need to include interpretive as well as perceptual attributes, a conclusion supported by her previous research (1995). She indicates that "the disjunction disjunction /dis·junc·tion/ (-junk´shun) 1. the act or state of being disjoined. 2. in genetics, the moving apart of bivalent chromosomes at the first anaphase of meiosis. between these results and those attributes typically addressed in traditional image indexing systems suggest revisiting assumptions upon which image indexing and retrieval systems are being created" (p. 172). In order to determine what image attributes should be used to provide access, Layne (1994) proposes four categories: (1) "biographical" attributes that deal with the images' origin and provenance prov·e·nance n. 1. Place of origin; derivation. 2. Proof of authenticity or of past ownership. Used of art works and antiques. ; (2) subject attributes (the "most problematic and least objective" [p. 584]); (3) exemplified attributes that seem to be physical characteristics such as medium, and (4) relationship attributes (relationship to other images or texts). It is the subject attributes which are addressed here; the two main approaches to image indexing, concept-based and content-based, differ in the level of interpretation that they bring to the indexing process and will be discussed separately. CONCEPT-BASED IMAGE INDEXING AND RETRIEVAL Concept-based retrieval refers to retrieval from text-based indexing of images, which may use a controlled vocabulary or natural language text or captions, and range from the purely descriptive ("Winston Churchill," "a duck on a pond") to the abstract or subjective ("poverty, .... despair"). Controlled vocabularies have been developed for use in specific collections such as Western art or images of historical costume. A particularly ambitious undertaking is ICONCLASS, an early classification system for Western art developed in the 1940s by van de Waal
mĕr`ĭk) or alphameric (ăl'fəmĕr`ĭk), the set of letters and numbers. notation covered in seven volumes of
subject headings. ICONCLASS has been used for DIAL (Decimal Index to the
Art of the Lowlands), the Marburger Index to works of art in Germany,
van Straten's index of Italian Prints, and American paintings in
the Courtauld Institute (Roberts, 1988).Two controlled vocabularies that were developed relatively recently are the Art &Architecture Thesaurus (AAT Alpha-1-antitrypsin (AAT) A blood component that breaks down infection-fighting enzymes such as elastase. Mentioned in: Chronic Obstructive Lung Disease ) (Oxford University Press, 1990) and the Library of Congress Thesaurus for Graphic Materials (Library of Congress, 1995). The Art & Architecture Thesaurus (AAT) covers the history and making of the visual arts visual arts npl → artes fpl plásticas visual arts npl → arts mpl plastiques visual arts npl → and is geographically and historically comprehensive but lacks coverage of iconographical themes (Petersen, 1990). The vocabulary of nearly 120,000 terms is structured under seven facets (e.g., physical attributes, styles and periods, activities) which are subdivided into thirty-three sub-facets or hierarchies. It is currently supported by the Getty Information Institute (see their Web page at http:// www.gii.getty.edu/vocabulary/aat.html). The Thesaurus for Graphic Materials is in two parts: TGMI: Subject Terms and TGMII: Genre and Physical Characteristic Terms. TGMI is less structured than AAT, lacking its faceted and highly hierarchical arrangement, though it does follow standard thesaural guidelines. TGMI provides a broader, though smaller, vocabulary than AAT, suitable for a general subject description of images. A detailed comparison of these two vocabularies is provided by Greenberg (1993). Other controlled vocabularies have been developed for specific collections but, for many collections of images, particularly those on the Web, natural language indexing is preferred. Natural language may be in the form of text in which the image is embedded Inserted into. See embedded system. (newsphotos in newspapers, for instance), descriptions or captions accompanying it, or hypermedia hypermedia: see hypertext. The use of hyperlinks, regular text, graphics, audio and video to provide an interactive, multimedia presentation. All the various elements are linked, enabling the user to move from one to another. links. For instance, Guglielmo and Rowe (1996) used natural language requests to query a database of historical images of aircraft and weapon projects captioned with natural language text. By parsing See parse. parsing - parser and matching queries and captions, they were able to use natural language processing Natural language processing Computer analysis and generation of natural language text. The goal is to enable natural languages, such as English, French, or Japanese, to serve either as the medium through which users interact with computer systems such as and inferencing techniques to answer queries such as "training missiles on a skyhawk." In some contexts it seems logical to use images as surrogates for text in retrieval. A project at NASA's Johnson Space Center used a visual thesaurus to provide access to images from the Manned Space Flight Program, using images corresponding to those in a subject-oriented linguistic thesaurus (Seloff, 1990). This and other examples of visual thesauri are discussed by Hogan et al. (1991), who extend the concept of the visual thesaurus to the hypermedia environment, supporting browsing and searching through direct image links. This type of access corresponds to what Enser (1995) refers to as image retrieval in the VV mode--visual query, visual search. CONTENT-BASED INDEXING OF IMAGES Content-based information retrieval (image, algorithm) content-based information retrieval - (CBIR) A general term for methods for using information stored in image archives. [IEEE Computer, September 1995]. (CBIR (image) CBIR - content-based information retrieval. ) refers to retrieval based on computer analysis of image content at the pixel level, automatically extracting such features as colin, texture, and shape, locally or globally, from digital images. The CBIR systems currently available provide powerful retrieval engines retrieval engine n. A search engine. for certain classes of query, although the developers have sometimes oversold Oversold In technical analysis, it is a market in which the volume of selling that has occurred is greater than the fundamentals justify. Notes: It is the opposite of overbought. their abilities, arguing that, since human indexing of image subject is prohibitively expensive, they propose to replace it by automatic indexing by color and texture. These systems are useful in some situations and no doubt will become more useful as their powers of interpretation become more sophisticated. The query categories proposed for them by Gudivada and Raghavan (1995) are color, texture, sketch, shape, volume, spatial constraints, browsing, objective attributes, subjective attributes, motion, text, and domain categories. Perhaps the capabilities that are currently best developed are retrieval by color, texture, and overall image similarity. Shape :retrieval is most effective where solid images (clip art A set of canned images used to illustrate word processing and desktop publishing documents. , trademarks) are queried. Domains where automatic indexing and retrieval have proven effective include face retrieval and fingerprints. A realistic assessment of the state of the art of what they name visual information retrieval information retrieval Recovery of information, especially in a database stored in a computer. Two main approaches are matching words in the query against the database index (keyword searching) and traversing the database using hypertext or hypermedia links. (VIR VIR Virtual VIR Virgin Islands (ISO Country code) VIR Virginia International Raceway VIR Vascular and Interventional Radiology VIR Vehicle Inspection Report VIR Virtual Interface (Alteon) ) is given by Gupta and Jain (1997). They indicate that systems providing information extraction In natural language processing, information extraction (IE) is a type of information retrieval whose goal is to automatically extract structured information, i.e. categorized and contextually and semantically well-defined data from a certain domain, from unstructured from images still require some human image interpretation. The relative merits of concept-and content-based indexing are weighed by Flickner et al. (1995): Perceptual organization--the process of grouping image features into meaningful objects and attaching semantic descriptions to scenes through model matching--is an unsolved problem in image understanding. Humans are much better than computers at extracting semantic descriptions from pictures. Computers, however, are better than humans at measuring properties and retaining these in long-term memory. (p. 23) Probably the best known such system, the Query by Image Content (QBIC QBIC Query By Image Content QBIC queries based on image content QBIC Cubic Format ) system developed by IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries) , is commercially available and widely used (Flickner et al., 1995) (http://www.qbic.almaden.ibm.com). Image features are automatically extracted and stored in a database. Because of the problems in automatically outlining objects, manual and user-assisted techniques are used to identify shapes, though automated methods are available in some domains. Queries, which may be color and texture, user-drawn outlines, or sample images, are posed by sketching, selecting from a color palette Also called a "color lookup table," "lookup table," "index map," "color table" or "color map," it is a commonly used method for saving file space when creating 8-bit color images. , or selecting an image from a retrieved set as a further query. The QBIC system is currently being tested by the Department of Art and Art History at the University of California The University of California has a combined student body of more than 191,000 students, over 1,340,000 living alumni, and a combined systemwide and campus endowment of just over $7.3 billion (8th largest in the United States). at Davis (Holt & Hartwick, 1994; Holt et al., 1997). They report better success with color and texture searches than with shape for content-based retrieval, with text searches preferred when artist or image is already known. Other similar systems include Virage (http://www.virage.com) and VisualSEEK (http://www.ctr.columbia.edu/~jrsmith/VisualSEEK/). One of the more interesting developments in image indexing is the integration of concept- and content-based approaches using the information in descriptive text or captions to assist in the interpretation of the image. For instance, work by Srihari (1995) examines retrieval from a database of captioned newspaper photographs. Captions place constraints on the photographs, which help in identifying their content and the location in the image of the objects or individuals; however this information can often be interpreted only in the context of world knowledge, since human viewers are expected to recognize, for instance, President Clinton, or differentiate between Mr. and Mrs. Smith without spatial information. For example, it is hard to imagine the need for a caption as specific as "President Clinton (left) dancing with Hillary Clinton (right) at the Inaugural Ball." Research on the combination of textual and image sources of evidence for retrieval holds some promise in overcoming some of the disadvantages of text or image-based retrieval alone. IMAGE INDEXING ENVIRONMENTS--CASE STUDIES An examination of image collections on the WWW shows that it is not uncommon for access to be limited to a simple browsing approach. However, there are many collections in which indexing was used to improve access, either through a concept- or content-based technique. A few case studies will serve to illustrate the range of solutions that have been applied. The Promenade system (McLean et al., 1994) was designed to provide access to a series of botanical images published in Curtis Botanical Magazine, an eighteenth-century compilation of images and text describing botanical samples collected by captains on voyages of exploration. The original intent was to use the natural language text of the descriptions as the index information, but the early printing, irregular typefaces This is a list of typefaces. Serif Here you can find a graphical version of this table.
adj. 1. Varying from or not adhering to the standard: nonstandard lengths of board. 2. abbreviations made the text error-prone for OCR OCR in full optical character recognition Scanning and comparison technique intended to identify printed text or numerical data. It avoids the need to retype already printed material for data entry. or human transcription, and a vocabulary and procedures for human indexing were selected. Since no existing vocabulary seemed well-suited to the historical and botanical nature of the image and textual materials, one was tailored to the image collection. This was an expensive solution, and the project could have benefited from content-based indexing techniques, then in their infancy, since retrieval by color was significant, and the clearly delineated de·lin·e·ate tr.v. de·lin·e·at·ed, de·lin·e·at·ing, de·lin·e·ates 1. To draw or trace the outline of; sketch out. 2. To represent pictorially; depict. 3. botanical images would have allowed some degree of shape matching. Two image database projects using controlled vocabulary are ELISE ELISe Electron-Ion Scattering in a Storage Ring (Facility for Antiproton and Ion Research) ELISE Enabling Library Information Skills for Everyone ELISE European Network for the Exchange of Information on Local Initiatives for the Creation of Employment and Deja Vu See DjVu. . The ELISE Project, funded by the European Commission European Commission, branch of the governing body of the European Union (EU) invested with executive and some legislative powers. Located in Brussels, Belgium, it was founded in 1967 when the three treaty organizations comprising what was then the European Community , provides retrieval of full color images A (digital) color image is a digital image that includes color information for each pixel. For visually acceptable results, it is necessary (and almost sufficient) to provide three samples (color channels over a network (Black & Eyre, 1995). Initially two image collections, one from the Victoria and Albert Museum Victoria and Albert Museum, South Kensington, London, opened in 1852 as the Museum of Manufacturers at Marlborough House. It originally contained a nucleus of contemporary objects of applied art bought from the Great Exhibition of 1851 at the instigation of the and one from Tilburg University Library in the Netherlands, were made available using both full-text descriptions and controlled vocabulary with the AAT as the source. Deja Vu is an interface created for information retrieval systems in which users can browse through subject terms to find items that meet their information needs. The browsing process is facilitated by a knowledge structure in which subject terms are grouped based on the commonsense com·mon·sense adj. Having or exhibiting native good judgment: "commonsense scholarship on the foibles and oversights of a genius" Times Literary Supplement. knowledge of library users in order to provide an interconnected browsing space. For example, when a user enters a search statement, the Broader Terms (BT), Narrower Terms (NT), Related Terms (RT), Notes, some relevant knowledge, and retrieved items will be displayed. The authors used the ]Library of Congress Thesaurus for Graphic Materials in this project (Gordon & Domeshek, 1998). One of the more interesting applications of content-based retrieval systems is to databases of trademarks. While shape-based retrieval can be problematic in fine arts images with complex patterns of light and dark which make it difficult to extract individual shapes, trademarks are generally high-contrast shapes with good definition. The STAR system for trademark archiving and registration (Wu et al., 1995) is intended to allow searches for conflicting trademarks when a request is made for registration of a new image. The system ranks retrieved trademarks in order of similarity to the query trademark. There are a number of instances on the Web of databases that are searchable using the QBIC software. For example, the Fine Arts Museum of San Francisco San Francisco (săn frănsĭs`kō), city (1990 pop. 723,959), coextensive with San Francisco co., W Calif., on the tip of a peninsula between the Pacific Ocean and San Francisco Bay, which are connected by the strait known as the Golden offers a QBIC search of a portion of its database comprising 3,000 Japanese prints. The similarity measure may be based on color percentages, color layout, texture, or a search may be customized using a color palette indicating the percentages desired of up to five colors (see their Web site at http://www.thinker.org/imagebase/index-2.html). IBM offers demonstration searches of stamps, trademarks, and stock photos at their site at http://www.qbic.almaden.ibm.com/. The University of California at Davis study discussed above (Holt & Hartwick, 1994; Holt et al., 1997) can also be explored on their Web site at http://libra.ucdavis.edu/ qbic.html. Reports of the evaluation of image access systems are relatively rare in the literature. An exception is an evaluation of the Micro Gallery, a visitor information system at the National Gallery in London by Beaulieu and Melior (1995). The system allows museum visitors to search the gallery collection by artist, historical atlas A historical atlas is an atlas that includes historical maps and charts depicting the evolving geopolitical landscape. They are helpful in understanding historical context, the scope and scale of historical events and historical subjects (such as the expansion of the Roman Empire), , picture type, and general reference. A combination of data collection methods was used to examine the impact of the interface features on search behavior, including questionnaires before and after the use of the system and direct observation with a talk aloud protocol. CONCLUSION With the increased availability of images comes the problems inherent in indexing them for retrieval. In order to develop solutions to these problems, more information is needed on why users search for images and how they intend to use them as well as how they pose their queries. Two approaches to image indexing have been developed and studied-concept-based and content-based. Concept-based indexing has the advantage of providing a higher-level analysis of the image content but is expensive to implement and suffers from a lack of interindexer consistency due to the subjective nature of image interpretation. Content-based indexing is relatively inexpensive to implement but provides a relatively low level of interpretation of the image except in fairly narrow and applied domains. To date, very little is known about the usefulness of the access provided by content-based systems, and more work needs to be done on user needs and satisfaction with these systems. An examination of a number of image database systems shows the range of techniques that have been used to provide intellectual access to image collections. REFERENCES Armitage, L. H., & Enser, P. G. B. (1997). Analysis of user need in image archives. Journal of Information Science, 23(4), 287-299. Beaulieu, M., & Melior, V. (1995). The Micro Gallery: An evaluation of the hypertext system Noun 1. hypertext system - a database management system that allows strings of text (`objects') to be processed as a complex network of nodes that are linked together in an arbitrary way in the National Gallery, London. New Review of Hypermedia and Multimedia, 1, 233-260. Black, K., & Eyre, J. (1995). The ELISE Project (electronic objects). In M. Collier & K. Arnold (Eds.), ELVIRA 2 (Proceedings of the Second Electronic Library and Visual Information Research Conference, May 1995, De Montfort University De Montfort University (DMU) is a British university situated in Leicester, England. History Origins De Montfort University, which is named after Simon de Montfort who was Earl of Leicester in the 13th century, is one of two universities situated in the , Milton Keynes Milton Keynes (mĭl`tən kēnz`), town (1991 pop. 36,886) and borough, S central England. Milton Keynes was designated one of the new towns in 1967 to alleviate overpopulation in London. It is the seat of the Open Univ. , England) (pp. 70-78). London, England: Aslib. Enser, P. G. B. (1995). Pictorial information retrieval. Journal of Documentation, 51(2), 126-170. Enser, P. G. B., & McGregor, C. G. (1993). Analysis of visual information retrieval queries (British Library British Library, national library of Great Britain, located in London. Long a part of the British Museum, the library collection originated in 1753 when the government purchased the Harleian Library, the library of Sir Robert Bruce Cotton, and groups of manuscripts. R & D Report No. 6104). London, England: British Library Board. Flickner, M.; Sawhney, H.; Niblack, W.; Ashley, J.; Huang, Q.; Dom, B.; Gorkani, M.; Hafner, J.; Lee, D.; Petkovic, D.; Steele, D.; & Yanker, P. (1995). Query by image and video content: The QBIC System. Computer 28(9), 23-31. Gordon, A. S., & Domeshek, E. A. (1998). Deja Vu: A knowledge-rich interface for retrieval in digital libraries. In IUI IUI Intrauterine insemination, see there 98 (International Conference on Intelligent User Interfaces, January 6-9, 1998, San Francisco, CA) (pp. 127-134). New York New York, state, United States New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of : Association for Computing Machinery See ACM. Association for Computing Machinery - Association for Computing Press. Greenberg, J. (1993). Intellectual control of visual archives: A comparison between the Art and Architecture Thesaurus and the Library of Congress Thesaurus for Graphic Materials. Cataloging & Classification Quarterly 16(1), 85-117. Gudivada, V. N., & Raghavan, V.V. (1995). Content-based image retrieval Content-based image retrieval (CBIR), also known as query by image content (QBIC) and content-based visual information retrieval (CBVIR) is the application of computer vision to the image retrieval problem, that is, the problem of searching for systems. Computer 28(9), 18-22. Guglielmo, E. J., & Rowe, N. C. (1996). Natural language retrieval of images based on descriptive captions. ACM (Association for Computing Machinery, New York, www.acm.org) A membership organization founded in 1947 dedicated to advancing the arts and sciences of information processing. In addition to awards and publications, ACM also maintains special interest groups (SIGs) in the computer field. Transactions on Information Systems, 14(3), 237-267. Gupta, A., & Jain, R. (1997). Visual information retrieval. Communications of the ACM (publication) Communications of the ACM - (CACM) A monthly publication by the Association for Computing Machinery sent to all members. CACM is an influential publication that keeps computer science professionals up to date on developments. , 40(5), 70-79. Hastings, S. K. (1995). Query categories in a study of intellectual access to digitized art images. In T. Kinney (Ed.), ASIS 1. ASIS - Application Software Installation Server. 2. (language) ASIS - Ada Semantic Interface Specification. '95 (Proceedings of the 58th annual meeting of the American Society for Information Science, October 9-12, 1995, Chicago, IL) (pp. 38). Medford, NJ: American Society for Information Science. Hogan, M.; Jorgensen, C.; & Jorgensen, P. (1991). The visual thesaurus in a hypermedia environment: A preliminary exploration of conceptual issues and applications. In D. Bearman (Ed.), Hypermedia & interactivity in museums (Proceedings of an international conference, October 14-16, 1991, Sheraton Station Square, Pittsburgh, PA) (pp. 202221). Pittsburgh, PA: Archives and Museum Informatics Same as information technology and information systems. The term is more widely used in Europe. . Holt, B., & Hartwick, L. (1994). "Quick, who painted fish?"': Searching a picture database with the QBIC project at UC Davis. Information Services See Information Systems. & Use, 14(2), 79-90. Holt, B.; Weiss, K.; Niblack, W.; Flickner, M.; & Petkovic, D. (1997). The QBIC Project in the Department of Art and Art History at UC Davis. In C. Schwartz & M. Rorvig (Eds.), Digital collections, implications for users, funders, developers, and maintainers (Proceedings of the 60th Annual Meeting of the American Society for Information Science, November 1-6, 1997, Washington, DC) (pp. 189-195). Medford, NJ: Information Today. Jorgensen, C. (1995). Classifying images: Criteria for grouping as revealed in a sorting task. In R. P. Schwartz, C. Beghtol, E. K. Jakob, B. H. Kwasnik, & P. Smith (Eds.), Proceedings of the 6th ASIS SIG/CR classification research workshop (October 8, 1995, Chicago, IL) (pp. 65-78). Chicago, IL: ASIS. Jorgensen, C. (1998). Attributes of images in describing tasks. Information Processing information processing: see data processing. information processing Acquisition, recording, organization, retrieval, display, and dissemination of information. Today the term usually refers to computer-based operations. & Management, 34(2/3), 161-174. Keister, L. H. (1994). User types and queries: Impact on image access systems. In R. Fidel, T. B. Hahn, E.M. Rasmussen, & P J. Smith (Eds.), Challenges in indexing electronic text and images (pp. 7-22). Medford, NJ: Learned Information. Krause, M. G. (1998). Intellectual problems of indexing picture collections. Audiovisual Librarian, 14(2), 73-81. Lancaster, E W. (1998). Indexing and abstracting in theory and practice (2d ed). Urbana-Champaign: University of Illinois University of Illinois may refer to:
Library of Congress. (1995). Thesaurus for graphic materials. Washington, DC: Cataloging Distribution Service, Library of Congress. Markey, K. (1984). Interindexer consistency tests: A literature review and report of a test of consistency in indexing visual materials. Library and Information Science Research, 6(2), 155-177. McLean, S.; Rasmussen, E. M.; & Williams, J. G. (1994). Promenade: Networked query and retrieval of horticultural hor·ti·cul·ture n. 1. The science or art of cultivating fruits, vegetables, flowers, or ornamental plants. 2. The cultivation of a garden. images. In D. I. Raitt & B. Jeapes (Eds.), Online Information 94 (Eighteenth International Online Information proceedings, 6-8 December 1994, London) (pp. 457-468). Oxford. England: Learned Information. Panofsky, E. (1939). Studies in iconology i·co·nol·o·gy n. The branch of art history that deals with the description, analysis, and interpretation of icons or iconic representations. i·con : Humanistic hu·man·ist n. 1. A believer in the principles of humanism. 2. One who is concerned with the interests and welfare of humans. 3. a. A classical scholar. b. A student of the liberal arts. themes in the art of the Renaissance. New York: Oxford University Press. Rasmussen, E. M. (1997). Indexing images. Annual Review of Information Science and Technology, 32, 169-196, Roberts, H. (1988). "Do you have any pictures of ...?": Subject access to works of art in visual collections and book reproductions. Art Documentation, 7(3), 87-90. Seloff, G. A. (1990). Automated access to the NASA-JSC image archives. Library Trends, 38(4), 682-696. Shafford, S. (1986). Analyzing the subject of a picture: A theoretical approach. Cataloging & Classification Quarterly, 6(3), 39-62. Sherman, C. R. (1987). ICONCLASS: A historical perspective. Visual Resources, 4, 237246. Srihari, R. (1995). Automatic indexing and content-based retrieval of captioned images. Computer, 28(9), 49-56. Svenonius, E. (1994). Access to nonbook non·book n. A book having little or no literary merit or substance, often published to exploit a fad. adj. Of, relating to, or being something other than a book, such as microfilm or microfiche in a library. materials: The limits of subject indexing Subject indexing is the act of describing a document by index terms to indicate what the document is about or to summarize its content. The index terms are often selected from some form of controlled vocabulary. for visual and aural aural /au·ral/ (aw´r'l) 1. auditory (1). 2. pertaining to an aura. au·ral 1 adj. Relating to or perceived by the ear. languages. Journal of the American Society for Information Science, 45(8), 600-606. Wu, J. K.; Narasimhalu, A. D.; Mehtre, B. M.; Lam, C. P.; & Gao, Y. J. (1995). CORE: A content-based retrieval engine for multimedia information systems. Multimedia Systems, 3(1), 25-41. Hsin-liang Chert chert: see flint. , School of Information Sciences, University of Pittsburgh, 646 IS Building, 135 North Bellefield Avenue, Pittsburgh, PA 15260 Edie Rasmussen, School of Information Sciences, University of Pittsburgh, 646 IS Building, 135 North Bellefield Avenue, Pittsburgh, PA 15260 LIBRARY TRENDS, Vol. 48, No. 2, Fail 1999, pp. 291-302 HSIN-LIANG CHEN Chen - Peter Chen is an Assistant Professor in the School of Library and Information Science at the University of Wisconsin-Milwaukee. His research interests are image retrieval, human-computer interaction Human-computer interaction An interdisciplinary field focused on the interactions between human users and computer systems, including the user interface and the underlying processes which produce the interactions. , instructional technology There are two types of instructional technology: those with a systems approach, and those focusing on sensory technologies. The definition of instructional technology prepared by the Association for Educational Communications and Technology (AECT) Definitions and Terminology , user studies, and information literacy Several conceptions and definitions of information literacy have become prevalent. For example, one conception defines information literacy in terms of a set of competencies that an informed citizen of an information society ought to possess to participate intelligently and . EDIE RASMUSSEN is an Associate Professor in the School of Information Sciences at the University of Pittsburgh, where she teaches courses in information retrieval, image database management, and indexing and abstracting. She has also taught and researched in institutions in Canada, Malaysia, Singapore, and the United Kingdom. |
|
||||||||||||||||||

mĕr`ĭk)
Printer friendly
Cite/link
Email
Feedback
Reader Opinion