Can document-genre metadata improve information access to large digital collections?ABSTRACT WE DISCUSS THE ISSUES OF RESOLVING the information-retrieval problem in large digital collections through the identification and use of document genres. Explicit identification of genre seems particularly important for such collections because any search usually retrieves documents with a diversity of genres that are undifferentiated undifferentiated /un·dif·fer·en·ti·at·ed/ (un-dif?er-en´she-at-ed) anaplastic. un·dif·fer·en·ti·at·ed adj. Having no special structure or function; primitive; embryonic. by obvious clues as to their identity. Also, because most genres are characterized by both form and purpose, identifying the genre of a document provides information as to the document's purpose and its fit to the user's situation, which can be otherwise difficult to assess. We begin by outlining the possible role of genre identification in the information-retrieval process. Our assumption is that genre identification would enhance searching, first because we know that topic alone is not enough to define an information problem and, second, because search results containing genre information would be more easily understandable. Next, we discuss how information professionals have traditionally tackled the issues of representing genre in settings where topical representation is the norm. Finally, we address the issues of studying the efficacy of identifying genre in large digital collections. Because genre is often an implicit notion, studying it in a systematic way presents many problems. We outline a research protocol that would provide guidance for identifying Web document genres, for observing how genre is used in searching and evaluating search results, and finally for representing and visualizing visualizing, v 1., holding an image in one's mind. 2., forming an image of a goal or destination in one's mind before undertaking it, so as to facilitate success. genres. INTRODUCTION Current computerized information-access systems face a fundamental limitation: they know what documents say but not what they mean or for what purposes they might be useful. Extracting and representing the meaning of documents is difficult and time consuming, and automatic systems still have significant limitations. We note, though, that humans rarely have to read every word of a document to understand its purpose. Instead, people take a shortcut (1) In Windows, a shortcut is an icon that points to a program or data file. Shortcuts can be placed on the desktop or stored in other folders, and double clicking a shortcut is the same as double clicking the original file. : they start by identifying the kinds of documents they are faced with (i.e., the document's genre), and then use different types of documents in appropriate ways. For example, a grant proposal is used differently from a syllabus A headnote; a short note preceding the text of a reported case that briefly summarizes the rulings of the court on the points decided in the case. The syllabus appears before the text of the opinion. , a product brochure, or a bank statement. Accordingly, differences in an information situation are often reflected in the kind of document that is considered helpful (e.g., a problem set, a lesson plan, and a tutorial about mathematics are all about math but useful in different situations). Information-access systems would be more useful for many tasks if they could similarly distinguish the purpose of documents and handle them in appropriate ways. In this paper we discuss the possibility of improving information access in large digital collections through the identification and use of document genre as a facet facet /fac·et/ (fas´it) a small plane surface on a hard body, as on a bone. fac·et n. 1. A small smooth area on a bone or other firm structure. 2. of document and query representation. First, we provide some historical background on the concept of genre and the approach it provides to the problem of incorporating context into information retrieval information retrieval Recovery of information, especially in a database stored in a computer. Two main approaches are matching words in the query against the database index (keyword searching) and traversing the database using hypertext or hypermedia links. . We outline the framework of the information-retrieval problem with respect to genre and some traditional resolutions that have been attempted. Finally, we outline a research agenda that addresses some of the questions and issues that investigating genre entails. THEORY: DOCUMENT GENRE Rhetoricians since Aristotle have attempted to classify communications with similar form or purpose into types or "genres." Numerous definitions of genre, or discourse type, have been suggested (e.g., Longacre, 1983; Miller, 1984; Swales, 1990). In our discussion, we draw on the definition of genre proposed by Orlikowski and Yates (1994), who describe genre as "a distinctive type of communicative action Communicative action is a concept associated with the German philosopher Jürgen Habermas. Habermas uses this concept to describe agency in the form of communication, which under his understanding is restricted to deliberation, i. , characterized by a socially recognized communicative com·mu·ni·ca·tive adj. 1. Inclined to communicate readily; talkative. 2. Of or relating to communication. com·mu purpose and common aspects of form" (p. 543). For instance, this document is an example of the journal article genre. It has a form familiar to most researchers and practitioners and is monitored by the journal's editorial policies as well as the profession's communication practices. There are many document genres: some common, such as a report or a newsletter, and others restricted to specific domains, such as the course syllabus or a problem set in higher education higher education Study beyond the level of secondary education. Institutions of higher education include not only colleges and universities but also professional schools in such fields as law, theology, medicine, business, music, and art. . Genre is applicable to electronic as well as physical documents. For example, in a study of Web documents, Crowston and Williams (2000) were able to identify documents of many familiar genres and of a few genres that seemed to be new to the Web, such as the home page (Dillon & Gushrowski, 2000) or the hotlist A listing of the best of something. It typically refers to the most popular Web sites. (World-Wide Web) hotlist - (From hypertext "hot spot") A document on the World-Wide Web or a user's browser configuration file containing hypertext links, often unorganised and . Genre is useful because it makes documents more easily recognizable and understandable to recipients, thus reducing the cognitive load Cognitive Load is a term (used in Educational psychology and other fields of study) that refers to the load on working memory during problem solving, thinking and reasoning (including perception, memory, language, etc.). of processing them (Bartlett, 1932 [1967]).Yates and Sumner (1997) argue that, on the Web genres help in both the production and consumption of documents because genre adds "fixity fix·i·ty n. pl. fix·i·ties 1. The quality or condition of being fixed. 2. Something fixed or immovable. " in a medium that does not otherwise distinguish very well between text types (say, a book and a Post-it). In our preliminary studies of people searching the Web (Roussinov, Crowston, Nilan, Kwasnik, Liu, & Cai, 2001), we observed that the genre of the document was one of the clues used in assessing document relevance, value, quality, and usefulness. The Problems of Information Access To explain how genre can be useful, we will first briefly review the problems faced by an information-access system. An information-access system has three components: 1. the users, who approach the system with contextually based information needs; 2. a store of information (e.g., the documents or databases); and 3. an intermediating mechanism to connect needs and information. The intermediary may be a person, a search algorithm In computer science, a search algorithm, broadly speaking, is an algorithm that takes a problem as input and returns a solution to the problem, usually after evaluating a number of possible solutions. , a browsing environment, or a summarizer, among others. The basic process of matching users' needs to potentially useful information in the system is complicated by many factors. First, problems may occur due to improper or incomplete representations of the information itself. When the information-access system is created, the documents or texts must be represented in such a way that they can be retrieved again as needed as needed prn. See prn order. . Librarianship has occupied itself for over a century with systematic approaches to organizing and representing information in systems. In creating bibliographic records, we call this process cataloging; in organizing actual documents or topics for meaningful retrieval, we call it classification; in providing access to bibliographic databases For computer programs to manage an individual's bibliographic references, see Reference management software A bibliographic or library database is a database of bibliographic information. , we call the representation process indexing. There are similar processes of information representation on the Web and in many other applications in which large stores of information are prepared for eventual use in the future. Many of the schemes are adaptations of traditional schemes, such as that used on Yahoo.com, the Dublin Core A set of meta-data descriptions about resources on the Internet. Used for resource discovery, it contains data elements such as title, creator, subject, description, date, type, format and so on. Dublin Core descriptions are often included in HTML meta tags. Project, or the GEM Metadata Project for educational materials (http://www.thegateway.org/). Others comprise grassroots, emergent emergent /emer·gent/ (e-mer´jent) 1. coming out from a cavity or other part. 2. pertaining to an emergency. emergent 1. coming out from a cavity or other part. 2. coming on suddenly. sorts of organization and representations, such as the evolving classification on eBay.com or amazon.com (Kwasnik & Liu, 2000; Kwasnik, 2002). An increasingly popular approach relies solely on the full text of the documents. Another problem that may arise is that the process itself of matching users' queries to the document representations may be inadequate or faulty. Much effort on the part of information scientists has been spent in developing and perfecting search strategies, including various matching algorithms, probabilistic (probability) probabilistic - Relating to, or governed by, probability. The behaviour of a probabilistic system cannot be predicted exactly but the probability of certain behaviours is known. Such systems may be simulated using pseudorandom numbers. techniques, citation mapping, and natural language processing Natural language processing Computer analysis and generation of natural language text. The goal is to enable natural languages, such as English, French, or Japanese, to serve either as the medium through which users interact with computer systems such as . These efforts struggle with many obstacles--among them the difficulties of evaluating search results in real environments, as well as problems of scaling, reliability, and the representation bottleneck A lessening of throughput. It often refers to networks that are overloaded, which is caused by the inability of the hardware and transmission lines to support the traffic. It can also refer to a mismatch inside the computer where slower-speed peripheral buses and devices prevent the CPU . On the user side, we have people in need of information. Often, though, users are unable to precisely specify what it is they need and, even if they do, the ways in which humans articulate their needs produces a great variety of expression. The problem of appropriate representation of users' queries is not just a question of finding the correct representation according to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. some absolute criteria. Because information use is situated in specific contexts, there is also the need to be able to represent the information in such a way that a match can be made not only on the level of physical description and topic, for example, but also in terms of matching the information with a potential use. For example, consider a person approaching a system with the query, "I want to prepare a Passover dinner." At a certain level we can see that there is a need for concrete information in the form of actual recipes. We might even interpret this as a "known item search." However, recipes may satisfy the need only partially, since the person may want to know much more about the rituals and meanings of a Passover dinner and not just the food itself. The information need may be either broader than what is asked for or much narrower and more specific. Furthermore, we know that people ask for what they expect they can get that most closely matches what they really want, and thus their requests are often presented in a compromised form. Thus, we can see that topic alone is not enough to define an information problem because different users may require different solutions to seemingly similar information problems. Indeed, even the same user may require different information at different times. These different needs arise because the situation (or context) of a user determines not only what topics are requested and what strategies are invoked in searching and evaluating output but also what types of resources are considered relevant and useful. While we know that it is important to understand the situation of the user, the representation of the situation and then its implementation in a system is a difficult problem. Our efforts to create user profiles, universal situation grammars, and so on suffer from limitations of scope to specific domains and lack of extensibility and flexibility. Why We Think Identification of Genre Would Be Useful We suggest that enhancing document representations by incorporating nontopical characteristics of the documents that signal their purpose--that is, their genre--will enrich document (and query) representations in such a way that they resonate res·o·nate v. res·o·nat·ed, res·o·nat·ing, res·o·nates v.intr. 1. To exhibit or produce resonance or resonant effects. 2. more truly with the information need of a user as situated in a particular context. Because most genres are characterized by both form and purpose, identifying the genre of a document provides information as to the document's pro-pose and its fit to the user's situation, which can be otherwise difficult to assess. For instance, a university professor looking for Looking for In the context of general equities, this describing a buy interest in which a dealer is asked to offer stock, often involving a capital commitment. Antithesis of in touch with. information about computer database systems for the class that she teaches would most likely be interested in documents of educational genres (e.g., syllabi syl·la·bi n. A plural of syllabus. , assignments, class notes). On the other hand, when working on a research paper in the database area, the same professor would more likely appreciate scholarly work (e.g., papers, annotated bibliographies An annotated bibliography is a bibliography that gives a summary of the research that has been done. It is still an alphabetical list of research sources. In addition to bibliographic data, an annotated bibliography provides a brief summary or annotation. , calls for papers). The relevant documents for these two searches would be quite different, even though the topic and query keywords might be nearly the same. Explicit identification of genre seems particularly important for large digital collections because any search of these collections usually retrieves documents with a diversity of genres and, what is worse, these genres are undifferentiated by obvious clues to their identity. This is in contrast to nondigital information-seeking situations in which the searcher generally has an idea of what sorts of documents exist in the collection. Even if he or she does not, clues of physical form and location increase the chances that a document's genre is recognized. For example, a user searching in a library can visually distinguish CDs from monographs, encyclopedias This article contains a list of encyclopedias, including projects to create new works. Because the number of works that can be considered encyclopedias is very large, this list does not attempt to be comprehensive. , or newspapers. Similarly, a user searching a database containing only journal articles has already implicitly restricted the search to that genre of documents. On the Web, however, a search of a large and diverse document collection will usually retrieve some documents of relevant genres along with many documents of irrelevant genres--a low-precision result--even if all retrieved documents conform to Verb 1. conform to - satisfy a condition or restriction; "Does this paper meet the requirements for the degree?" fit, meet coordinate - be co-ordinated; "These activities coordinate well" search specifications regarding the topical content of the document. Recognition of genre also has implications for automated methods of representing documents, such as automated summarization sum·ma·rize intr. & tr.v. sum·ma·rized, sum·ma·riz·ing, sum·ma·riz·es To make a summary or make a summary of. sum and indexing. A one-size-fits-all approach to summarizing or evaluating Web documents without regard for their form and function is likely to misrepresent mis·rep·re·sent tr.v. mis·rep·re·sent·ed, mis·rep·re·sent·ing, mis·rep·re·sents 1. To give an incorrect or misleading representation of. 2. many of them. For example, a newspaper article can be summarized by the first few sentences of the document, but such an approach will not work for a home page or a frequently-asked-questions document (FAQ (Frequently Asked Questions) A group of commonly asked questions about a subject along with the answers. Vendors often display them on their Web sites for use as troubleshooting guidelines. ) (Marcu, 1997). When medical information is sought, it makes a difference to the evaluation whether the document retrieved is a newsletter, a personal home page, or a hospital's patient information site. How Librarians Have Addressed the Notion of Genre in Library Information Systems We do not mean to imply that information science has never addressed the notion of genre, or that genre has not been incorporated into any information-representation schemes. Indeed, several classification systems allow some articulation articulation In phonetics, the shaping of the vocal tract (larynx, pharynx, and oral and nasal cavities) by positioning mobile organs (such as the tongue) relative to other parts that may be rigid (such as the hard palate) and thus modifying the airstream to produce speech of genre, and many metadata standards, including the Dublin Core, include a field for genre. The treatment of genre is limited or not very well defined, however. Our understanding of the nature and role of document genre is still nascent nascent /nas·cent/ (nas´ent) (na´sent) 1. being born; just coming into existence. 2. just liberated from a chemical combination, and hence more reactive because uncombined. , and so the use of this kind of information is underdeveloped un·der·de·vel·oped adj. Not adequately or normally developed; immature. in information-retrieval systems. Furthermore, it is not clear whether the extension of genre designations originally designed for physical collections will export well to digital ones. Historically, most library information systems took genre for granted since most collections contained only a limited array of document types. The exceptions are literary genres Noun 1. literary genre - a style of expressing yourself in writing writing style, genre drama - the literary genre of works intended for the theater prose - ordinary writing as distinguished from verse (such as poetry) and publication types (such as almanacs Almanacs See also astronomy; calendar almanagist a person who compiles almanacs. ephemeris an astronomical almanac giving, as an aid to the astronomer and navigator, the locations of celestial bodies for each day of the year. or newspapers), which have had a lively existence in explicit document representation for several centuries. Aside from these, the primary facets of access to documents in traditional systems are descriptive components and subject, while genre is relatively rare. The descriptive access points derive from traditional ways of talking about books and book-like documents, and include: title, author, place and name of publisher, edition, date, series, physical description in terms of pages, size, volumes, and sometimes information about components, which are called analytics. The subject analysis of a document captures what a document is about--that is, its topic. Librarians and information scientists have recognized that the topical approach is extremely important but insufficient in some situations and completely inappropriate in others. Not every document is necessarily about something. Sometimes the document's nature as a document represents the most important or useful aspect of it. For instance, on the one hand we can say that a book may be about symphonies--their history or structure--but what is Beethoven's Fifth about? It simply is. A symphony has a form and identifiable characteristics but it does not have a readily identifiable topic, per se, except that which can be attributed to it through subtle and nonconsensual processes of interpretation. As the notion of document becomes broader and more diverse, as it does in the environment of the Web, we can see how the concept of subject does not stretch very well to cover all types of information. In response to the need to identify a document's form or genre in addition to its subject, librarians have created auxiliary tools in the form of tables and subdivisions to be used with existing topically based classification and subject-heading schedules. Here are a few examples: The Dewey Decimal Classification Dewey Decimal Classification or Dewey Decimal System System for organizing the contents of a library based on the division of all knowledge into 10 groups. Each group is assigned 100 numbers. (DDC See VESA DDC. ) (Dewey, Mitchell, Beall, Matthews, & New, 1996) provides several ways to denote de·note tr.v. de·not·ed, de·not·ing, de·notes 1. To mark; indicate: a frown that denoted increasing impatience. 2. a document's form or genre. The first is to incorporate a designation in the number itself. This is used in the 800s, which cover belles lettres Noun 1. belles lettres - creative writing valued for esthetic content belles-lettres literary composition, literary work - imaginative or creative writing . The first part of the number designates country/language, and the final digits represent the genre--1 for poetry, 2 for drama, 3 for fiction, 5 for speeches ... 7 for humor humor, according to ancient theory, any of four bodily fluids that determined man's health and temperament. Hippocrates postulated that an imbalance among the humors (blood, phlegm, black bile, and yellow bile) resulted in pain and disease, and that good health was and satire, and so on (Table 1). These genre designations are limited to the genres generally accepted by Western literary scholars and do not necessarily do a good job of describing emerging, culturally diverse, or hybrid genres. Still, it is a way of privileging genre in the organization of literary works. It is interesting to note, however, that most public libraries do not make use of this formal system for fiction and arrange such works by author, with the ad hoc For this purpose. Meaning "to this" in Latin, it refers to dealing with special situations as they occur rather than functions that are repeated on a regular basis. See ad hoc query and ad hoc mode. tradition of separating out popular genres into separate sections for easy access and browsability: Mysteries, Romances, Science Fiction, and so on. Another technique in the DDC is to use suffixes from the Tables. The number for the topic is established, and then suffixes from some Table are added to denote the form or genre. For example: In physical collections, the suffixes serve to distinguish materials on the same topic hut in different publication formats. This notion of form/genre evolved from the physical distinctions of publication and document types and thus is grounded in publishing practices and realities. The further interpretation of how such documents will be used remains implicit in Adj. 1. implicit in - in the nature of something though not readily apparent; "shortcomings inherent in our approach"; "an underlying meaning" underlying, inherent the nature of the forms themselves but has practical implications for collections. For instance, many dictionaries and encyclopedias comprise the noncirculating reference collection, magazines are indexed and stored differently than are books, and so on. In terms of digital collections, however, where the physical clues of publication format are largely absent, these suffixes might provide useful indicators for sorting and filtering search results. Another way in which subject is indicated on the bibliographic record is through the use of subject terms from a thesaurus or list, such as the Library of Congress Subject Headings The Library of Congress Subject Headings (LCSH) comprise a thesaurus (in the information technology sense) of subject headings, maintained by the United States Library of Congress, for use in bibliographic records. (LCSH LCSH Library of Congress Subject Headings LCSH Lee County Senior High (Sanford, NC, USA) ). The LCSH comprises an evolving list of terms used by catalogers to assign subject designations to a work. Terms can denote topics, such as "sonnets," in which case this would be a work about sonnets, not the sonnets themselves. Proper names may also be subjects. For example, a document about William Shakespeare will be assigned Shakespeare's name as a subject, while a work by William Shakespeare would not. Modern cataloging practices abound in confusions about topic, creative responsibility, and genre/form, since in many documents these three are inextricably in·ex·tri·ca·ble adj. 1. a. So intricate or entangled as to make escape impossible: an inextricable maze; an inextricable web of deceit. b. fused. This confusion extends to searchers as well, who do not realize that searching for a genre using LCSH is problematic at best. This distinction of reserving subject headings for topics/subjects only is somewhat moderated by the addition of a subdivision. There are several kinds of subdivisions that can be used to "subdivide TO SUBDIVIDE. To divide a part of a thing which has already been divided. For example, when a person dies leaving children, and grandchildren, the children of one of his own who is dead, his property is divided into as many shares as he had children, including the deceased, and the share " a subject by time, geographical location, and further topical aspects. For example: Witchcraft--Sweden Witchcraft--15th Century Witchcraft--Biblical teaching The subdivisions of interest here, though, are the ones from the Form Subdivisions list. This type of subdivision allows the cataloger cat·a·log or cat·a·logue n. 1. a. A list or itemized display, as of titles, course offerings, or articles for exhibition or sale, usually including descriptive information or illustrations. b. to further describe a work by its form or literary genre. This list is limited to several hundred well-established types. The genres included have literary warrant, since every subject heading and division in the LCSH was developed for an existing, rather than a hypothetical, work. Witchcraft--Bibliography Witchcraft--Case studies Witchcraft--Dictionaries Witchcraft--Handbooks, manuals, etc. Witchcraft--Periodicals Witchcraft--Poetry The fact remains, however, that form and genre are not, as a rule, an important finding aid in traditional systems. For instance, the work Final Environmental Impact Statement for the Green Mountain and Finger Lakes National Forests The Finger Lakes National Forest encompasses 16,032 acres (65.9 km²), nestled between Seneca Lake and Cayuga Lake in the Finger Lakes Region of New York State in the United States of America. Land Resource Management Plan is assigned the following subject headings from the LCSH: Forest reserves--Vermont Green Mountain National Forest Green Mountain National Forest is a national forest located in Vermont. It is roughly 300,000 acres (0 km), of various types of forest supporting many species of wildlife. Forest reserves--New York (State)--Finger Lakes National Forest Forest management--Vermont Green Mountain National Forest Forest management--New York (State)--Finger Lakes National Forest Green Mountain National Forest (Vt.) Finger Lakes National Forest (N.Y.) Thus, this work can be retrieved by the names of either of the two national forests covered in the report and by two topics: forest reserves and forest management. It is not possible to retrieve this work as an environmental impact statement except for the coincidence that the terms appear in the title and would come up on a keyword search. There are many genres such as this one that serve a useful purpose as templates and are of interest in their own right, aside from the specific topic, but since this work is an example of an environmental impact statement, rather than a work about one, there is no subject heading assigned for this important aspect of the document. Some libraries recognize that genre and form are often perceived as "topical" and have made some additional access points to accommodate this. For instance, the Rare Book, Manuscript, and Special Collections In library science, special collections (often abbreviated to Spec. Coll. or S.C.) is the name applied to a specific repository within a library which stores materials of a "special" nature. Library at Duke University (http://scriptorium.lib.duke.edu.genre-headings.html) has an interesting set of auxiliary tools for searching its collection. One of these is a Genre/Form list from which genre terms can be used for searching as if they were topics. Here is a sample of terms from that list: Accounts Business letters Manuscripts Official reports Pattern books Petitions Recipes Seals Subliterary sub·lit·er·ar·y adj. 1. Of, relating to, having the qualities of, or producing subliterature. 2. Not written as or intended to be literature: subliterary works such as letters and diaries. papyri Tax returns Vouchers It is immediately obvious how very helpful such a list might be in studying the communicative forms of the cultures represented in the collection. HOW TO STUDY GENRE Having presented our case for understanding more about document genres in order to enhance retrieval of information from large digital collections, we turn now to the issues of precisely how we might study this phenomenon. Because genre is often an implicit and subtle notion, studying it in a systematic way presents many problems. Our overarching o·ver·arch·ing adj. 1. Forming an arch overhead or above: overarching branches. 2. Extending over or throughout: "I am not sure whether the missing ingredient . . . question is, Would identification of document genres improve information access technologies in large digital collections such as digital libraries and the Web? This question cannot be answered directly, given the current understanding of genre or of genre's role in information retrieval. Thus, we envision a research agenda for investigating genre that proceeds through a series of componential studies, each of which we see as necessary for a full understanding. Thus, in answering the central question with respect to genre, it is necessary to investigate the following: * The identification of Web document genres from the users' perspective and articulated in the users' own terms; * The creation of a faceted (i.e., multidimensional mul·ti·di·men·sion·al adj. Of, relating to, or having several dimensions. mul ti·di·men ) classification
of these genres that can be used for controlled investigations in later
stages of study;* An investigation of how users integrate genre metadata into their own searching, evaluation, and use of documents; * An evaluation of the degree to which incorporation of genre metadata in information-access systems makes a difference to the effectiveness of searching, sorting, ranking, and eventual use of documents; and * An evaluation of various interfaces for visualizing and presenting genre metadata once it has been identified. We also recognize that studying genre cannot be a once-and-fox-all endeavor, since new genres are emerging all the time and old ones are being used in ways that are different than originally conceived. Thus, we propose that any study of genre must also establish a conceptual framework For the concept in aesthetics and art criticism, see . A conceptual framework is used in research to outline possible courses of action or to present a preferred approach to a system analysis project. from within which to design continuing investigations. That is, we need a set of working hypotheses based on what we know about genre as a social construct. How are genres recognized? How do they evolve and change? How are they used and understood? Studying Genre from the Users' Perspective We take it as a given that studies of genres must be based in real situations with real users. Since traditional designations of document genres will probably not adequately or accurately describe all the document types present and emerging on the Web and in digital libraries, it would make no sense to use such designations as a checklist against which digital genres are compared. Such a comparison would inevitably miss genres new or unique to the Web or, even more confusing, mistake traditional genres that have been adapted to new uses on the Web. Thus, we see the first step as a descriptive phase of inductively in·duc·tive adj. 1. Of, relating to, or using logical induction: inductive reasoning. 2. Electricity Of or arising from inductance: inductive reactance. extracting from what people say about their terminology and sense of genres. At the same time, we recognize that studying the entire range of possible document genres and the tasks for which identification might be useful is not realistic. Furthermore, genres can be specific to a particular discourse community The term discourse community links the terms discourse, a concept describing all forms of communication that contribute to a particular, institutionalized way of thinking; and community, which in this case refers to the people who use, and therefore help create, a particular , so too broad a scope may make it difficult to identify a useful set of genres in a manageable time with limited resources. Thus, as a first step we suggest that document genres be studied for a particular set of users, such as lawyers, educators, city planners, real estate agents, and so on. If the right population is chosen, limiting the scope does not necessarily mean that only a small subset of genres will emerge. For instance, teachers can potentially search for a wide range of topics and utilize documents of many genres, including a number that are particular to education, e.g., "lesson plans" and "academic standards." This diversity means there would be a wide range of potential document types to study. On the other hand, in the domain of education there is a core set of tasks that provides a base on which to study the impact of genre identification. Such tasks and situations for teachers include, for instance, writing a lesson plan, creating a reading list, adapting an existing class, or developing tests and assignments. For administrators it might include conforming to educational standards, managing human resources The fancy word for "people." The human resources department within an organization, years ago known as the "personnel department," manages the administrative aspects of the employees. , writing reports, and communicating with students and parents. Limiting the domain of inquiry will help focus a study, establish a reasonable scope, and provide a manageable set of situations with which to work and on which to test the impact of genre identification. We realize that limiting a study in this way also limits the ability to generalize generalize /gen·er·al·ize/ (-iz) 1. to spread throughout the body, as when local disease becomes systemic. 2. to form a general principle; to reason inductively. , but initially the aim is to show that genre identification is of value for certain tasks. Having demonstrated the basic concept for educators, for instance, we expect that it would be possible to then extend the principles beyond the domain of education. Identification of Genres In order to design effective empirical studies Empirical studies in social sciences are when the research ends are based on evidence and not just theory. This is done to comply with the scientific method that asserts the objective discovery of knowledge based on verifiable facts of evidence. to investigate people's use of genre, it is necessary to identify, describe, and categorize cat·e·go·rize tr.v. cat·e·go·rized, cat·e·go·riz·ing, cat·e·go·riz·es To put into a category or categories; classify. cat the range of document genres used by the target population and the tasks associated with these documents. There is a substantial body of work on analyzing genre in printed documents and some work studying them on the Web (e.g., Bretan, Dewe, Hallberg, & Wolkert, 1998; Crowston & Williams, 2000; Dillon & Gushrowski, 2000; Furuta & Marshall, 1996; Karlgren, Bretan, Dewe, Hallberg, & Wolkert, 1998; Stamatatos, Fakotakis, & Kokkinakis, 2000). However, these studies have typically been top-down, that is, they analyzed a set of documents based on theoretical principles or according to a priori a priori In epistemology, knowledge that is independent of all particular experiences, as opposed to a posteriori (or empirical) knowledge, which derives from experience. classifications. For example, Crowston and Williams (2000) based their classification on the Art and Architecture Thesaurus (Petersen, 1994), and a number of studies used the categories of the Brown Corpus The Brown Corpus of Standard American English (or just Brown Corpus) was compiled by Henry Kucera and W. Nelson Francis at Brown University, Providence, RI as a general corpus (text collection) in the field of corpus linguistics. . A top-down approach Top-down approach A method of security selection that starts with asset allocation and works systematically through sector and industry allocation to individual security selection. to genre is problematic for two reasons. First, genres are socially constructed, so different social groups using documents with similar structural features may think about them and describe them differently. A document may be unfamiliar and difficult to understand for someone outside of the community in which the genre is used. Therefore, it is important to capture the users' own language and understanding of these genres. Second, it is imperative to extend any investigation to genres that are not necessarily vetted by traditional schemes, such as those that come out of domain-specific work (e.g., "block-scheduled curriculum plans"). As pointed out by Dillon and Gushrowski (2000, p. 202), genres are no longer necessarily "slow-forming, often emerging only over generations of production and consumption." Thus, we assume that a traditional typology typology /ty·pol·o·gy/ (ti-pol´ah-je) the study of types; the science of classifying, as bacteria according to type. typology the study of types; the science of classifying, as bacteria according to type. of genre or document forms will not be sufficient to describe the emerging and dynamic genres identifiable by users in general and our community in particular. Some researchers have attempted to identify genres bottom-up through relatively small-scale user studies (e.g., Dewe, Karlgren, & Bretan, 1998; Nilan, Pomerantz, & Paling, 2001). However, we do not as yet have a fully articulated set of data that reveals what genres people recognize nor for what tasks they find documents of specific genres useful. In investigating the range of genres identified by users, we suggest that the following questions should be addressed: * How do people talk about the genre of documents? Does the naming and identification of digital-document genres correspond to the naming of traditional nondigital genres? * How do people understand and make use of new, unnamed, emerging, and "colonized Colonized This occurs when a microorganism is found on or in a person without causing a disease. Mentioned in: Isolation " genres (Beghtol, 2000) in digital collections? * What clues do people use to identify genre when engaged in information-access activities? * What facets (basic attributes) of genre do people perceive? Creation of a Faceted (i.e., Multidimensional) Classification of Genres If genre is to be used as another facet of description for documents and queries, we are still left with the issue of how to describe genre itself in such a way that it can be implemented in a system. Genre itself is a multidimensional phenomenon, incorporating form, function, and the numerous clues and components that allow us to discriminate one genre from another. Toward this end we see the need to create a rich and flexible description of document genres that will do justice to their complexity while at the same time providing a structured tool for systematic inquiry. One way to achieve this is through a faceted classification A faceted classification system allows the assignment of multiple classifications to an object, enabling the classifications to be ordered in multiple ways, rather than in a single, pre-determined, taxonomic order. . A classification will help determine the level of granularity The degree of modularity of a system. More granularity implies more flexibility in customizing a system, because there are more, smaller increments (granules) from which to choose. that can be achieved in genre identification. Genre complexity can be managed by organizing the genres in a classification from more general to more specific. By picking appropriate levels of specificity it might be possible to avoid having to identify hundreds of detailed genres, while still providing a basic level of distinction in areas of particular interest. Most organized lists of genres are structured as hierarchies. The criticism of traditional hierarchies is that they rely on a single organizing principle, which may not be useful for all cases. To overcome this problem we suggest using the facet analysis approach. Facet analysis identifies multiple fundamental dimensions along which objects, such as genres, can be described and then clustered. For example, a genre such as a "lesson plan" can be identified by its source, its purpose, its structural features, and so on. Each facet, or basic dimension, can be articulated following its own logic and subsequently can be used for its own type of clues for classification. In suggesting the use of facet analysis we follow the example of previous genre-identification studies such as Paivarinta (1999), Tyrvainen and Paivarinta (1999), and Karjalainen, Paivarinta, Tyrvainen, and Rajala (2000), who looked at the management of enterprise documents, and Kessler, Nunberg, and Schuetze (1997), who sought to identify a limited set of facets for communicative purposes. A faceted approach to classification is pragmatic and not dependent on any one conceptual perspective. It allows for the development of description and clustering using a number of fundamental dimensions, rather than just one. The results of this process yield a classification that is flexible, expressive, and hospitable hos·pi·ta·ble adj. 1. Disposed to treat guests with warmth and generosity. 2. Indicative of cordiality toward guests: a hospitable act. 3. to new genres and genre combinations. It also allows a view of genres at a variety of conceptual levels, from the general and inclusive to the very specific, which will be useful in simulations in later phases of inquiry. How would a faceted approach to genres work? In principle, this approach requires several passes. The first pass identifies and labels facets that seem to be important. These might include form, content, style, implied use, and the relationship of that document to others. These facets serve as starting points Noun 1. starting point - earliest limiting point terminus a quo commencement, get-go, offset, outset, showtime, starting time, beginning, start, kickoff, first - the time at which something is supposed to begin; "they got an early start"; "she knew from the , and new facets may emerge. After identifying the basic facets, one must again review the entire corpus repeatedly to see the range of categories on which these facets are revealed--for instance, what do people use to describe "source"? The process continues until saturation saturation, of an organic compound saturation, of an organic compound, condition occurring when its molecules contain no double or triple bonds and thus cannot undergo addition reactions. is reached (i.e., no new categories emerge). If necessary, more data are collected. Once the Web genres are identified, it might be interesting to compare them to the more traditional sources of genres for overlaps of structure and coverage. We expect that there will be a significant amount of redundancy among the genres identified in this way. The aim is to generate a classification that reflects not only currently identified genres but will also flexibly accommodate identification of emerging and future genres (Beghtol, 2000; Kwasnik, 1999), thus providing a basis for future work in this area. How Users Integrate Genre Metadata into Their Own Searching, Evaluation, and Use of Documents Besides identifying genres and their attributes, any study of genre effectiveness must also establish how people in fact utilize genre in searching, in generating queries, and in their evaluation of documents. To accomplish this, further observations are necessary in order to answer the following questions: * In what contexts and for what tasks is the identification of genre useful? * To what extent are documents of various genres specific to certain tasks as opposed to being generally useful? * To what extent are people interested in documents of genres specific to their domain and environment vs. those used more widely? In summary, the results of the necessary initial phases of studying genres on the Web would be a better understanding of how users in the community of interest describe the genre of documents and how they use genre when they work with documents to solve information problems. This phase should also provide a database of documents categorized cat·e·go·rize tr.v. cat·e·go·rized, cat·e·go·riz·ing, cat·e·go·riz·es To put into a category or categories; classify. cat according to their user-specified genre, genre features, user evaluations, and related information tasks. Such a database can be used as the baseline for simulations and evaluation studies in subsequent phases. It would also provide an inductively derived faceted classification scheme for usefully clustering genres and features and for determining granularity. Evaluating the Effectiveness of Genre Identification Using the basic information discovered in the preliminary phases, it should then be possible to carry out controlled user studies whereby various aspects of genre use can be manipulated to see the differences in retrieval effectiveness associated with each manipulation. Thus, in this phase one can study how genre metadata can best be utilized in information-access tasks. By "best" we mean initially improving users' performance (e.g., time, accuracy, or perceived usability) in information searching, filtering, and evaluation tasks. Ultimately, of course, "best" means improving the performance in the kinds of information tasks people face in their day-to-day work lives. This phase in the research plan must address several questions: * How best to use genre metadata in information-access systems? * To what extent does providing genre metadata improve performance and utility? * Which specific facets of genre improve performance most? * To what extent does using genre metadata to cluster and/or rank documents improve performance and utility? * Can genre metadata be used to inform other aspects of the process of searching, summarizing, or evaluating documents? To answer these questions, one approach would be to develop simulated information-access system interfaces that incorporate genre metadata and then to conduct evaluations of the efficacy of these interfaces in controlled laboratory experiments. Depending on resources and available design expertise, one could implement these prototypes in various ways, starting with storyboards and paper mockups, scripted interfaces and, if possible, by implementing them on a real search engine. The following are some suggestions for scenarios in which genre information could be implemented in order to test its efficacy: * Provide aids for query construction using genre metadata; * Cluster documents based on genre metadata (explicitly labeled vs. labeled using other techniques vs. unlabeled); * Show genre information in combination with other metadata; * Raise or lower a document's rank based on genre metadata; or * Incorporate genre-specific information-access processes. The method of presentation of genre information will inevitably influence its efficacy. For this reason, before any retrieval evaluation can take place, this aspect of genre must also be investigated. Specifically, studies should investigate: * How best to represent and display genre metadata to the user and receive feedback? * What level of granularity of genre metadata improves performance most (i.e., how specific should the description of genre be)? Is there, perhaps, a basic level of genre that is neither too general and abstract, nor too specific? * To what extent does combining genre metadata with other kinds of metadata (e.g., subject) improve performance? For example, is it more useful to identify documents as "fifth-grade science lesson plans" or just as "lesson plans"? * To what extent is the user's performance degraded de·grad·ed adj. 1. Reduced in rank, dignity, or esteem. 2. Having been corrupted or depraved. 3. Having been reduced in quality or value. by miscategorization of documents based on genre? In answering these questions there are some general interface design issues that must also be addressed. For example, it is important to decide on: * The choice between opaque and transparent modes of presentation (Roussinov et al., 2001). In transparent mode, the system will expose its identification of genre to the user by labeling or otherwise identifying it. In opaque mode, results will make use of genre metadata, e.g., for ranking, but the user will not be made explicitly aware of it; * If the transparent mode is used, then one must make a choice between interfaces that explicitly name genres and those that use other methods of labeling (e.g., by providing example documents). A typical experiment might for instance contrast two interfaces. Participants could use one interface in the first half and a different interface in the second half of the experiment, with the order of presentation counterbalanced coun·ter·bal·ance n. 1. A force or influence equally counteracting another. 2. A weight that acts to balance another; a counterpoise or counterweight. tr.v. across subjects. An example task might be to find a relevant Web page for a given problem scenario. ANSWERING THE BIG QUESTION: DOES GENRE HELP? Identifying Web document genres, observing how genre is used in searching and evaluation of search results, the multifaceted mul·ti·fac·et·ed adj. Having many facets or aspects. See Synonyms at versatile. Adj. 1. multifaceted - having many aspects; "a many-sided subject"; "a multifaceted undertaking"; "multifarious interests"; "the multifarious representation of genres, and finally the design of presentation and visualization Using the computer to convert data into picture form. The most basic visualization is that of turning transaction data and summary information into charts and graphs. Visualization is used in computer-aided design (CAD) to render screen images into 3D models that can be viewed from all techniques allows a systematic exploration of the overall effectiveness and utility of genre information for Web documents. We know that people use genre information and that for many applications it is perhaps the single most important piece of information that can be provided. Nevertheless, the extent to which the general inclusion of genre information will enhance information access on the Web is an open question. There is much to understand before such information can be practically implemented. On the other hand if, as we suspect, genre information is helpful, then studying it in a systematic way, as we propose above, can provide the initial baseline understanding that must precede any automatic implementation.
Table 1.
English poetry 821
English drama 822
English fiction 823
English speeches 825
English humor and satire 827
Bulgarian poetry 891.81
Bulgarian drama 891.82
Bulgarian fiction 891.83
Bulgarian speeches 891.85
Bulgarian humor and satire 891.87
Table 2.
Middle Eastern Cooking 641.5956
Middle Eastern Cooking Encyclopedia 641.5956+03
(dictionaries & encyclopedias)
Middle Eastern Cooking Magazine 641.5956+05
(serial publication)
REFERENCES Barrett, F. (1932 [1967]). Remembering: A study in experimental and social psychology. Cambridge: Cambridge University Press Cambridge University Press (known colloquially as CUP) is a publisher given a Royal Charter by Henry VIII in 1534, and one of the two privileged presses (the other being Oxford University Press). . Beghtol, C. (2000). The concept of genre and its characteristics. Bulletin of the American Society for Information Science & Technology, 26(5). Bretan, I., Dewe, J., Hallberg, A., & Wolkert, N. (1998). Web-specific genre visualization. Paper presented at the WebNet 1998 Conference. Orlando, FL. Crowston, K., & Williams, M. (2000). Reproduced and emergent genres of communication on the World Wide Web. Information Society, 16(3), 201-216. Dewe, J., Karlgren, J., & Bretan, I. (1998). Assembling a balanced corpus from the Internet. Paper presented at the 11th Nordic Computational Linguistics computational linguistics (CL) Use of digital computers in linguistics research. The simplest examples are the use of computers to scan text and produce such aids as word lists, frequency counts, and concordances. Conference. Copenhagen, Denmark. Dewey, M. A., Mitchell, J. S. E., Beall, J. E., Matthews, W. E. J. E., & New, G. R. E. (Eds.). (1996). Dewey Decimal Classification and Relative Index Set, 21st ed. Dublin: OCLC OCLC - Online Computer Library Center Forest Press. Dillon, A., & Gushrowski, B. (2000). Genres and the Web: Is the personal home page the first uniquely digital genre? Journal of the American Society for Information Science, 5(12), 202-205. Furuta, R., & Marshall, C. C. (1996). Genre as reflection of technology in the World Wide Web. [Technical Report]. College Station, TX: Hypermedia hypermedia: see hypertext. The use of hyperlinks, regular text, graphics, audio and video to provide an interactive, multimedia presentation. All the various elements are linked, enabling the user to move from one to another. Research Lab, Texas A&M. Karjalainen, A., Paivarinta, T, Tyrvainen, P., & Rajala, J. (2000). Genre-based metadata for enterprise document management. In Proceedings of the 33rd Annual Hawaii International Conference on System Sciences The Hawaii International Conference on System Sciences (HICSS) is an annual conference for Information Systems and Information Technology academics and professionals sponsored by the University of Hawaii at Manoa. . Los Alamos Los Alamos (lôs ăl`əmōs', lŏs), uninc. town (1990 pop. 11,455), seat of Los Alamos co., N central N.Mex. It is on a long mesa extending from the Jemez Mts. The U.S. , CA: IEEE Computer Society (body) IEEE Computer Society - The society of the IEEE which publishes the journal "Computer". http://computer.org/. Press. Kevin Crowston and Barbara H. Kwasnik, Syracuse University Syracuse University, main campus at Syracuse, N.Y.; coeducational; chartered 1870, opened 1871. Syracuse is noted for its research programs in government and industry; facilities include the Center for Science and Technology, the Newhouse Communications Center, and School of Information Studies, 4-206 Centre for Science and Technology, Syracuse, NY 13244-4100 Karlgren, J., Bretan, I., Dewe, J., Hallberg, A., & Wolkert, N. (1998). Iterative it·er·a·tive adj. 1. Characterized by or involving repetition, recurrence, reiteration, or repetitiousness. 2. Grammar Frequentative. Noun 1. information retrieval using fast clustering and usage-specific genres. Paper presented at the Eighth DELOS Workshop: User Interface in Digital Libraries. Stockholm, Sweden. Kessler, B., Nunberg, G., & Schuetze, H. (1997). Automatic detection of text genre. In Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (body) Association for Computational Linguistics - (ACL) The international scientific and professional society for people working on problems involving natural language and computation. and the 8th Meeting of the European Chapter of the Association for Computational Linguistics EACL is the abbreviation for the European chapter of the Association for Computational Linguistics and of the annual conference of the chapter. The EACL is the primary professional association for computational linguistics and natural language processing research in (pp. 32-38). San Francisco San Francisco (săn frănsĭs`kō), city (1990 pop. 723,959), coextensive with San Francisco co., W Calif., on the tip of a peninsula between the Pacific Ocean and San Francisco Bay, which are connected by the strait known as the Golden : Morgan Kaufmann Publishers. Kwasnik, B. H. (1999). The role of classification in knowledge representation and discovery. Library Trends, 48(1), 22-47. Kwasnik, B. H. (2002). Commercial Websites and the use of classification schemes: The case of amazon.com. In M.J. Lopez-Huertas (Ed.), Challenges in knowledge representation and organization for the 21st century: Integration of knowledge across boundaries. Proceedings of the Seventh International ISKO ISKO International Society for Knowledge Organization Conference (pp. 279-285). Wurzburg: Ergon Verlag. Kwasnik, B. H., Crowston, K., Nilan, M., & Roussinov, D. (2000). Identifying document genre to improve Web search effectiveness. Bulletin of the American Society for Information Science and Technology The American Society for Information Science and Technology (also referred to as ASIST or ASIS&T) is an organization of information professionals. Established in 1937, the organization sponsors an annual conference and publishes proceedings from this conference under , 27(2), 23-26. Kwasnik, B. H., & Liu, X. (2000). Classification structures in the changing environment of active commercial Websites: The case of eBay.com. In C. Beghtol, L. C. Howarth, & N. J. Williamson (Eds.), Dynamism and stability in knowledge organization. Proceedings of the Sixth International ISKO Conference, 7(10-13), 372-377. Toronto: Advances in Knowledge Organization. Longacre, R. (1983). The grammar of discourse. New York New York, state, United States New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of : Plenum In a building, the space between the real ceiling and the dropped ceiling, which is often used as an air duct for heating and air conditioning. It is also filled with electrical, telephone and network wires. See plenum cable. Press. Marcu, D. (1997). From discourse structures to text summaries. Paper presented at the 14th National Conference on Artificial Intelligence (AAAI-97). Miller, C. R. (1984). Genre as social action. Quarterly Journal of Speech, 70, 151-167. Nilan, M. S., Pomerantz, J., & Paling, S. (2001). Genres from the bottom up: What has the Web brought us? In T. B. Hahn (Ed.), Information in a networked world: Harnessing the flow. Proceedings of the ASIST ASIST Cardiology A clinical trial–Atenolol Silent Ischemia Study that evaluated the effect of atenolol on M&M in Pts with CAD and/or silent myocardial ischemia. See Atenolol, Coronary artery disease, Silent ischemia. 2001 Annual Meeting. Washington, DC. Orlikowski, W.J., & Yates, J. (1994). Genre repertoire: The structuring of communicative practices in organizations. Administrative Sciences Administrative Science may refer to:
Paivarinta, T. (1999). A genre approach to applying critical social theory to information systems development. In C. H.J. Gilson, I. Grugulis, & H. Willmott (Eds.), Proceedings of the 1st Critical Management Studies Conference, information technology and critical theory stream. Hamilton, New Zealand Hamilton (Kirikiriroa in Māori) is the centre of New Zealand's fourth largest urban area, and is the country's seventh largest city. It is in the Waikato region of the North Island, approximately 130 km south of Auckland. : University of Waikato In 2002 over 14,000 students were enrolled at the university. More than a quarter of students were aged over 25, and over half were women. It has the highest proportion of Māori students on any campus in New Zealand. Management School. Petersen, T. (1994). Art and architecture thesaurus. New York: Oxford University Press. Roussinov, D., Crowston, K., Nilan, M., Kwasnik, B., Liu, X., & Cai, J. (2001). Genre-based navigation on the Web. Paper presented at the 34th Hawaii International Conference on Systems Science (HICSS-34). Maui, HI.. Stamatatos, E., Fakotakis, N., & Kokkinakis, G. (2000). Automatic text categorization in terms of genre and author. Computational Linguistics, 26(4), 471-498. Swales, J. M. (1990). Genre analysis: English in academic and research settings. New York: Cambridge University Press. Tyrvainen, P., & Paivarinta, "12 (1999). On rethinking organizational document genres for electronic document management. In Proceedings of the 32nd Annual Hawaii International Conference on System Sciences. Los Alamos, CA: IEEE Computer Society Press. Yates, J., & Orlikowski, W.J. (1992). Genres of organizational communication Organizational communication, broadly speaking, is: people working together to achieve individual or collective goals. [1] Discipline History The modern field traces its lineage through business information, business communication, and early mass communication : A structurational approach to studying communications and media. Academy of Management Review, 17(2), 299-326. Yates, S. J., & Sumner, T. (1997). Digital genres and the new burden of fixity. In Hawaiian International Conference on System Sciences (HICCS 30). Los Alamos, CA: IEEE Computer Society Press. KEVIN CROWSTON is an Associate Professor in the School of Information Studies at Syracuse University and Director of the Ph.D. program. He joined the school in 1996. He received his A.B. (1984) in Applied Mathematics (Computer Science) from Harvard University Harvard University, mainly at Cambridge, Mass., including Harvard College, the oldest American college. Harvard College Harvard College, originally for men, was founded in 1636 with a grant from the General Court of the Massachusetts Bay Colony. and a Ph.D. (1991) in Information Technologies from the Sloan School of Management, Massachusetts Institute of Technology Massachusetts Institute of Technology, at Cambridge; coeducational; chartered 1861, opened 1865 in Boston, moved 1916. It has long been recognized as an outstanding technological institute and its Sloan School of Management has notable programs in business, (MIT MIT - Massachusetts Institute of Technology ). Before moving to Syracuse he was a founding member of the Collaboratory for Research on Electronic Work at the University of Michigan (body, education) University of Michigan - A large cosmopolitan university in the Midwest USA. Over 50000 students are enrolled at the University of Michigan's three campuses. The students come from 50 states and over 100 foreign countries. and of the Center for Coordination Science at MIT. BARBARA H. KWASNIK is a Professor in the School of Information Studies at Syracuse University in Syracuse, New York
Syracuse (IPA: , where she has been teaching in the areas of organization of information, theory of classification, and information science. Her current research interests are in developing methodologies for studying information-related behavior and in the cognitive processes Cognitive processes Thought processes (i.e., reasoning, perception, judgment, memory). Mentioned in: Psychosocial Disorders of browsing and the structure of classificatory systems, particularly the nature of unspecified term relationships. She is founder and co-coordinator of annual workshops sponsored by SIG/CR (ASIST) and coeditor or program committee member for the proceedings. Dr. Kwasnik is also active in the International Society for Knowledge Organization and is a member of the American Society of Indexers. |
|
||||||||||||||

ti·di·men
Printer friendly
Cite/link
Email
Feedback
Reader Opinion