Printer Friendly
The Free Library
6,672,335 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Information Retrieval Beyond the Text Document.


ABSTRACT

WITH THE EXPANSION OF THE INTERNET, searching for information goes beyond the boundary of physical libraries. Millions of documents of various media types--such as text, image, video, audio, graphics, and animation-are available around the world and linked by the Internet. Unfortunately, the state of the art of search engines for media types other than text lags far behind their text counterparts. To address this situation, we have developed the Multimedia Analysis and Retrieval System (MARS). This article reports some of the progress made over the years toward exploring information retrieval information retrieval

Recovery of information, especially in a database stored in a computer. Two main approaches are matching words in the query against the database index (keyword searching) and traversing the database using hypertext or hypermedia links.
 beyond the text domain. In particular, the following aspects of MARS are addressed in the article: visual feature extraction In pattern recognition and in image processing, Feature extraction is a special form of dimensionality reduction.

When the input data to an algorithm is too large to be processed and it is suspected to be notoriously redundant (much data, but not much information) then the
, retrieval models, query reformulation techniques, efficient execution speed performance, and user interface considerations. Extensive experimental results are reported to validate the proposed approaches.

INTRODUCTION

Huge amounts of digital data are being generated daily. Scanners convert the analog/physical data into digital form; digital cameras and camcorders directly generate digital data at the production phase. Owing to owing to
prep.
Because of; on account of: I couldn't attend, owing to illness.

owing to prepdebido a, por causa de 
 all these multimedia devices, presently information is in all media types, including graphics, images, audio, and video in addition to the conventional text media type. Not only is multimedia information being generated at an ever-increasing rate, it is transmitted worldwide due to the expansion of the Internet. Experts say that the Internet is the largest library that ever existed; it is, however, also the most disorganized dis·or·gan·ize  
tr.v. dis·or·gan·ized, dis·or·gan·iz·ing, dis·or·gan·iz·es
To destroy the organization, systematic arrangement, or unity of.
 library ever.

Textual document retrieval The ability to search for documents by keywords and other attributes such as date and author. It implies that the documents have been indexed on all pertinent fields and that keywords have been chosen based upon title and textual content. See document imaging and document management system.  has achieved considerable progress over the past two decades. Unfortunately, the state of the art of search engines for media types other than text lags far behind their text counterparts. Textual indexing of nontextual media, although common practice, has some limitations. The most notable limitations include the human effort required and the difficulty of describing accurately certain properties humans take for granted while having access to the media. Consider how human indexers would describe the ripples on an ocean; these could be very different under situations such as calm weather or a hurricane. To address this situation, we undertook the Multimedia Analysis and Retrieval System (MARS) project to provide retrieval capabilities to rich multimedia data. Research in MARS addresses several levels including the multimedia features extracted, the retrieval models used, query reformulation techniques, efficient execution speed performance, and user interface considerations.

This article reports some of the progress made over the years toward exploring information retrieval (IR) beyond the text domain. In particular, the discussion will concentrate on visual information retrieval (VIR VIR Virtual
VIR Virgin Islands (ISO Country code)
VIR Virginia International Raceway
VIR Vascular and Interventional Radiology
VIR Vehicle Inspection Report
VIR Virtual Interface (Alteon) 
) concepts as opposed to implementation issues In the Business world, companies frequently set-up a connection between which they transfer data. When the connection is being set-up, it is referred to as implementation. When issues occur during this phase, they are known as implementation issues. . MARS explores many different visual feature representations. A review of these features appears in the next section ("Visual Feature Extraction"). These visual features are analogous to keyword features in textual media. Another section ("Retrieval Models Used in MARS") describes two broad retrieval models we have explored: the Boolean and vector models and the incorporated enhancements to support visual media retrieval such as relevance feedback Relevance feedback is a feature of some information retrieval systems. The idea behind relevance feedback is to take the results that are initially returned from a given query and to use information about whether or not those results are relevant to perform a new query. . Results are given in a later section ("Experimental Results"). The last section provides remarks summarizing the overall discussion ("Conclusion").

VISUAL FEATURE EXTRACTION

The retrieval performance of any IR system is fundamentally limited by the quality of the "features" and the retrieval model it supports. This section sketches the features obtained from visual media. In text-based retrieval systems, features can be keywords, phrases, or structural elements Structural elements are used in structural analysis to simplify the structure which is to be analysed.

Structural elements can be linear, surfaces or volumes.

Linear elements:
  • Rod - axial loads
  • Beam - axial and bending loads
. There are many techniques for reliably extracting, for example, keywords from text documents. The visual counterparts to textual features in visual based systems are features such as color, texture, and shape.

For each feature, there are several different techniques for representation. The reason for this is twofold: (1) the field is still under development and, more importantly, (2) features are perceived differently by people and thus different representations cater to various preferences. Image features are generally considered as orthogonal At right angles. The term is used to describe electronic signals that appear at 90 degree angles to each other. It is also widely used to describe conditions that are contradictory, or opposite, rather than in parallel or in sync with each other.  to each other. The idea is that a feature will capture some dimension of the content of the image, and different features will effectively capture different aspects of the image content. In this way, two images closely related in one feature could be very different in another feature. A simple example of this are two images, one of a deep blue sky and the other of a blue ocean. These two images could be very similar in terms of just color; however, the ripples caused by waves in the ocean add a distinctive pattern that distinguishes the two images in terms of their texture. Rui et al. (1999) give a detailed description of the visual features, and the following paragraphs emphasize the important ones.

The color feature is one of the most widely used visual features in VIR. This feature captures the color content of images. It is relatively robust to background complication and independent of image size and orientation. Some representative studies of color not of the white race; - commonly meaning, esp. in the United States, of negro blood, pure or mixed.

See also: Color
 perception and color spaces can be found in McCamy et al. (1976) and Miyahara (1988). In VIR, color histograms (Swain & Ballard, 1991), color moments (Stricker & Orengo, 1995), and color sets (Smith & Chang, 1995) are the most used representations.

Texture refers to the visual patterns that have properties of homogeneity Homogeneity

The degree to which items are similar.
 that do not result from the presence of only a single color or intensity. It is an innate property of virtually all surfaces, including clouds, trees, bricks, hair, fabric, and so on. It contains important information about the structural arrangement of surfaces and their relationship to the surrounding environment (Haralick et al., 197.s;). Co-occurrence matrix A co-occurrence matrix, also referred to as a co-occurrence distribution, is defined over an image to be the distribution of co-occurring values at a given offset. Mathematically, a co-occurrence matrix C is defined over an n x m image I, parameterized by an offset  (Haralick et al., 1973), Tamura texture (Tamura et al., 1978), and Wavelet (mathematics) wavelet - A waveform that is bounded in both frequency and duration. Wavelet tranforms provide an alternative to more traditional Fourier transforms used for analysing waveforms, e.g. sound.  texture (Kundu & Chen, 1992) are the most popular texture representations.

In general, the shape representations can be divided into two categories: boundary-based and region-based. The former uses only the outer boundary of the shape while the latter uses the entire shape region (Rui et al., 1996). The most successful representatives for these two categories are Fourier Descriptor (1) A word or phrase that identifies a document in an indexed information retrieval system.

(2) A category name used to identify data.

(operating system) descriptor
 and Moment Invariants. Some recent work in shape representation and matching includes the Finite Element See FEA.  Method (FEM FEM Female
FEM Finite Element Method
FEM Feminine
FEM Finite Element Model
FEM Fédération Européenne des Métallurgistes (European Metalworkers' Federation)
FEM Faculdade de Engenharia Mecânica (Brasil) 
) (Pentland et al., 1996), Turning Function (Arkin et al., 1991), and Wavelet Descriptor (Chuang & Kuo, 1996).

RETRIEVAL MODELS USED IN MARS

With the large number of retrieval models proposed in the IR literature, MARS attempts to exploit this research for content-based retrieval over images. The retrieval model comprises the document or object model (here a collection of feature representations), a set of feature similarity measures, and a query model.

The Object Model

We first need to formalize how an object is modeled (Rui et al., 1998b). We will use images as an example, even though this model can be used for other media types as well. An image object O is represented as:

(1) O = O(D, F, R)

* D is the raw image data--e.g., a jpeg image.

* F = {[f.sub.i]} is a set of low-level visual features associated with the image object, such as color, texture, and shape.

* R = {[r.sub.ij]} is a set of representations for a given feature [f.sub.i]--e.g., both color histogram and color moments are representations for the color feature (Swain & Ballard, 1991).

Note that, each representation [r.sub.ij] itself may be a vector consisting of multiple components, that is:

(2)] [r.sub.ij] = [[r.sub.ij1], ... [.r.sub.ijk], ... [r.sub.ijk]

where K is the length of the vector.

Figure 1 shows a graphic representation of the object (image) model. The proposed object model supports multiple representations to accommodate the rich content in the images. An image is thus represented as a collection of low-level image feature representations (see section entitled "Visual Feature Extraction") extracted automatically using computer vision methods as well as a manual text description of the image.

[Figure 1 ILLUSTRATION OMITTED]

Each feature representation is associated with some similarity measure. All these similarity measures are normalized to lie within [0,1] to denote the degree to which two images are similar in regard to the same feature representation. A value of 1 means that they are very similar and a value of 0 means that they are very dissimilar. Revisiting our blue sky and ocean example from the early section ("Visual Feature Extraction"), the sky and ocean images may have a similarity of 0.9 in the color histogram representation of color and 0.2 in the wavelet representation of texture. Thus the two images are fairly similar in their color content but very different in their texture content. This mapping M = { <feature representation, similarity [measure.sub.i] [is greater than], ...} together with the object model O, forms (D, E R, M), a foundation on which query models can be built.

Query Models

Based on the object model and the similarity measures defined above, query models that work with these raw features are built. These query models, together with the object model, form complete retrieval models used for VIR.

We explore two major models for querying. The first model is an adaptation of the Boolean retrieval model to visual retrieval in which selected features are used to build predicates used in a Boolean expression A statement using Boolean operators that expresses a condition that is either true or false. See Boolean search. . The second model is a vector (weighted summation summation n. the final argument of an attorney at the close of a trial in which he/she attempts to convince the judge and/or jury of the virtues of the client's case. (See: closing argument) ) model where all the features of the query object play a role in retrieval. The section on Boolean retrieval describes the Boolean model and the section on the "Vector Model" describes that model.

Boolean Retrieval

A user may not only be interested in more than a single feature from a single image. It is very likely that the user may choose multiple features from multiple images. For example, using a point-and-click interface, a user can specify a query to retrieve images similar to an image A in color and similar to an image B in texture. To cope with composite queries, a Boolean retrieval model is used to interpret the query and retrieve a set of images ranked based on their similarity to the selected feature.

The basic Boolean retrieval model needs a pre-defined threshold, which has several potential problems (Ortega et al., 1998b). To overcome these problems, we have adopted the following two extensions to the basic Boolean model to produce a ranked list of answers:

* Fuzzy Boolean Retrieval. The similarity between the image and the query feature is interpreted as the degree of membership of the image to the fuzzy set Fuzzy sets are sets whose elements have degrees of membership. Fuzzy sets have been introduced by Lotfi A. Zadeh (1965) as an extension of the classical notion of set. In classical set theory, the membership of elements in a set is assessed in binary terms according to a bivalent  of images that match the query feature. Fuzzy set theory is used to interpret the Boolean query, and the images are ranked based on their degree of membership in the set.

* Probabilistic (probability) probabilistic - Relating to, or governed by, probability. The behaviour of a probabilistic system cannot be predicted exactly but the probability of certain behaviours is known. Such systems may be simulated using pseudorandom numbers.  Boolean Retrieval. The similarity between the image and the query feature is considered to be the probability that the image matches the user's information need. Feature independence is exploited to compute the probability of an image satisfying the query which is used to rank the images.

In the discussion below, we will use the following notations. Images in the collection are denoted by [I.sub.1] [I.sub.2], ... [I.sub.m]. Features over the images are denoted by [F.sub.1], [F.sub.2], ... [F.sub.r], where [F.sub.i] denotes both the name of the feature as well as the domain of values that the feature can take. The [j.sup.th] instance of feature [F.sub.i] corresponds to image [I.sub.j] and is denoted by [f.sub.ij] For example, say [F.sub.1] is the color feature which is represented in the database using a histogram histogram
 or bar graph

Graph using vertical or horizontal bars whose lengths indicate quantities. Along with the pie chart, the histogram is the most common format for representing statistical data.
.

In that case, [F.sub.1] is also used to denote the set of all the color histograms, and [f.sub.1,5] is the color histogram for image 5. Query variables are denoted by [v.sub.1], [v.sub.2], ... [v.sub.n] |[v.sub.k] [element of] [F.sub.i] so each [v.sub.k] refers to an instance of a feature [F.sub.i] (an [f.sub.ij]). Note that [F.sub.i] ([I.sub.j] = [f.sub.ij] During query evaluation, each [v.sub.k] is used to rank images in the collection based on the feature domain off [f.sub.i] ([F.sub.i]), that is [v.sub.k]'s domain. Thus, [v.sub.k] can be thought of as being a list of images from the collection ranked based on the similarity of [v.sub.k] to all instances of [F.sub.i]. For example, say [F.sub.2] is the set of all wavelet texture vectors in the collection, if [v.sub.k] = [f.sub.2,5], then [v.sub.k] can be interpreted as being both the wavelet texture vector corresponding to image 5 and the ranked list of all [is less than] I, [S.sub.F2] ([F.sub.2] (I),[f.sub.2,5]) [is greater than] with [S.sub.F2] being the similarity function that applies to two texture values.

A query Q([v.sub.1], [v.sub.2], ... [v.sub.n]) is viewed as a query tree whose leaves correspond to single feature variable queries. Internal nodes of the tree correspond to the Boolean operators. Specifically, nonleaf nodes are one of three forms: ([v.sub.1], [v.sub.2] ... [v.sub.n]): a conjunction of positive literals; ([v.sub.1], [v.sub.2], ... [v.sub.p] [v.sub.p+1], ... [v.sub.n]), a conjunction consisting of both positive and negative literals; and ([v.sub.1] [v.sub.2], ... [v.sub.n]), which is a disjunction disjunction /dis·junc·tion/ (-junk´shun)
1. the act or state of being disjoined.

2. in genetics, the moving apart of bivalent chromosomes at the first anaphase of meiosis.
 of positive literals. The following is an example of a Boolean query: Q ([v.sub.1], [v.sub.2]) = ([v.sub.1] = [f.sub.1,5]) [conjunction] ([v.sub.2] = [f.sub.2,6]) is a query where [v.sub.1] has a value equal to the color histogram associated with image [I.sub.5], and [v.sub.2] has a value of the texture feature associated with [I.sub.6]. Thus, the query Q represents the desire to retrieve images whose color matches that of image [I.sub.5] and whose texture matches that of image [I.sub.6]. Figure 2 shows an example query Q ([v.sub.1], [v.sub.2], [v.sub.3], [v.sub.4]) = (([v.sub.1] = [f.sub.1,4] [conjunction]) ([v.sub.2] = [f.sub.2,8])) [disjunction](([v.sub.3] = [f.sub.3,8]) [conjunction] ?? (v.sub.4] = [f.sub.1,9])) in its tree representation.

[Figure 2 ILLUSTRATION OMITTED]

Weighting in the Query Tree

In a query, one feature can receive more importance than another according to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 the user's perception. The user can assign the desired importance to any feature by a process known as feature weighting. Traditionally, retrieval systems (Flickner et al., 1995; Bach et al., 1996) use a linear scaling factor as feature weights. Under our Boolean model, this is not desirable. Fagin and Wimmers (1997) noted that such linear weights do not scale to arbitrary functions used to compute the combined similarity of an image. The reason is that the similarity computation for a node in a query tree may be based on operators other than a weighted summation of the similarity of the children. Fagin and Wimmers (1997) present a way to extend linear weighting to the different components for arbitrary scoring functions as long as they satisfy certain properties. We are unable to use their approach since their mapping does not preserve orthogonality orthogonality

In mathematics, a property synonymous with perpendicularity when applied to vectors but applicable more generally to functions. Two elements of an inner product space are orthogonal when their inner product—for vectors, the dot product (see
 properties on which our algorithms rely (Ortega et al., 1998b). Instead, we use a mapping function map·ping function
n.
A mathematical formula that relates distances on a gene map to recombination frequencies; its graphic rendering shows that the recombination value of two genes is never greater than 50 percent regardless of how far apart the genes
 from [0,1] [right arrow] [0,1] of the form:

(3) [MATHEMATICAL EXPRESSION A group of characters or symbols representing a quantity or an operation. See arithmetic expression.  NOT REPRODUCIBLE IN ASCII ASCII or American Standard Code for Information Interchange, a set of codes used to represent letters, numbers, a few symbols, and control characters. Originally designed for teletype operations, it has found wide application in computers. ]

which preserves the range boundaries [0,1] and boosts or degrades the similarity in a smooth way. Sample mappings are shown in Figure 3. This method preserves most of the properties explained in Fagin and Wimmers (1997), except it is undefined for a weight of 0. In Fagin and Wimmers, a weight of 0 means the node can be dismissed. Here, [lim lim
abbr.
Mathematics limit
.sub.weight [right arrow] 0] similarity' = 0 for similarity [element of] [0,1). A perfect similarity of 1 will remain at 1. This mapping is performed at each link connecting a child to a parent in the query tree.

[Figure 3 ILLUSTRATION OMITTED]

Figure 4a shows how the fuzzy model would work with our running example of blue sky and blue ocean images. Figure 4b shows how the probabilistic model would work with our running example of blue sky and blue ocean images.

[Figure 4 ILLUSTRATION OMITTED]

Computing Boolean Queries

Fagin (1996) proposed an algorithm to return the top k answers for queries with monotonic monotonic - In domain theory, a function f : D -> C is monotonic (or monotone) if

for all x,y in D, x <= y => f(x) <= f(y).

("<=" is written in LaTeX as \sqsubseteq).
 scoring functions that has been adopted by the Garlic multimedia information system under development at the IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries)  Almaden Research Center The IBM Almaden Research Center, located near San Jose, California, is one of IBM's largest research centers, specializing in both basic research in material science and applied research in computer storage, where many refinements and improvements were made in hard disc drive  (Fagin & Wimmers, 1997). A function F is monotonic if F([x.sub.1], ... [x.sub.m]) [is less than or equal to] F([x'.sub.1], ... [x'.sub.m]) for [x.sub.i] [is less than or equal to] [x'.sub.i] for every i. Note that the scoring functions for both conjunctive CONJUNCTIVE, contracts, wills, instruments. A term in grammar used to designate particles which connect one word to another, or one proposition to another proposition.
     2.
 and disjunctive dis·junc·tive  
adj.
1. Serving to separate or divide.

2. Grammar Serving to establish a relationship of contrast or opposition. The conjunction but in the phrase poor but comfortable is disjunctive.
 queries for the fuzzy and probabilistic Boolean models satisfy the monotonicity property. This algorithm relies on reading a number of objects from each branch in the query tree until it has k objects in the intersection. Then it falls back on probing to enable a definite decision. In contrast, our algorithms (Ortega et al., 1998b) are tailored to specific functions that combine object scoring (here called fuzzy and probabilistic models).

Another approach to optimizing query processing over multimedia repositories has been proposed in Chaudhari and Gravano (1996). It presents a strategy to optimize queries when users specify thresholds on the grade of match of acceptable objects as filter conditions. It uses the results in Fagin (1996) to convert top-k queries to threshold queries and then process them as filter conditions. It shows that, under certain conditions (uniquely graded repository), this approach is expected to access no more objects than the strategy in Fagin (1996). Furthermore, while the above approaches have mainly concentrated on the fuzzy Boolean model, we consider both the fuzzy and probabilistic models in MARS. This is significant since the experimental results illustrate that the probabilistic model outperforms the fuzzy model in terms of retrieval performance, which is discussed in a later section ("Experimental Results").

Vector Model

An information retrieval model consists of a document model, a query model, and a model for computing similarity between the documents and the queries. One of the most popular IR models is the vector model (Buckley & Salton, 1995; Salton & McGill, 1983; Shaw, 1995). Various effective retrieval techniques have been developed for this model. Among these, term weighting and relevance feedback are of fundamental importance.

Term weighting is a technique for assigning different weights for different keywords (terms) according to their relative importance to the document (Shaw, 1995; Salton & McGill, 1983). If we define [w.sub.ik] to be the weight for term [t.sub.k], k = 1, ..., N, in document i ([D.sub.i]), where N is the number of terms. Document i can be represented as a weight vector in the term space:

(4) [D.sub.i] = [[w.sub.il], ... [w.sub.ik], ... [w.sub.iN]]

Experiments have shown that the product of tf (term frequency) and idf (inverse document frequency) is a good estimation of the weights (Buckley & Salton, 1995; Salton & McGill, 1983; Shaw, 1995). The query Q has the same model as that of a document D-i.e., it is a weight vector in the term space:

(5) Q = [[W.sub.q1], ... [w.sub.qk], ... [w.sub.qN]]

The similarity between D and Q is defined as the Cosine distance.

(6) similarity (D, Q) = D x Q/||D ||x | |Q ||

where || || denotes norm-2.

As we can see from the previous subsection ("Computing Boolean Queries"), in the vector model, the specification of [w.sub.qk]'s in Q is very critical, since the similarity values (similarity (D, Q)'s) are computed based on them. However, it is usually difficult for a user to map precisely his information need into a set of terms. To overcome this difficulty, the technique of relevance feedback has been proposed (Buckley & Salton, 1995; Salton & McGill, 1983; Shaw, 1995). Relevance feedback is the process of automatically adjusting an existing query using information feedback by the user about the relevance of previously retrieved documents. Term weighting and relevance feedback are powerful techniques in IR. We next generalize generalize /gen·er·al·ize/ (-iz)
1. to spread throughout the body, as when local disease becomes systemic.

2. to form a general principle; to reason inductively.
 these concepts to VIR.

Vector Query Model and Integration of Relevance Feedback to VIR

As discussed in a previous section ("The Object Model"), an object model O(D,F,R), together with a set of similarity measures M = {[m.sub.ij]}, provides the foundation for retrieval (D,F,R,M). The similarity measures are used to determine how similar or dissimilar two objects are. Different similarity measures may be used for different feature representations. For example, Euclidean distance In mathematics, the Euclidean distance or Euclidean metric is the "ordinary" distance between two points that one would measure with a ruler, which can be proven by repeated application of the Pythagorean theorem.  is used for comparing vector-based representations, while Histogram Intersection is used for comparing color histogram representations (see the earlier section on "Visual Feature Extraction").

The query model is shown in Figure 5. The query has the same form as an object, except it has weights at every branch at all levels. [W.sub.i], [W.sub.ij], and [W.sub.ijk] are associated with features [f.sub.i], representations [r.sub.ij], and components [r.sub.ijk] respectively. The purpose of the weights is to reflect as closely as possible the combination of feature representations that best express the user's information need. The process of relevance feedback described below aims at updating these weights to form the combination of features that best captures the user's information need.

[Figure 5 ILLUSTRATION OMITTED]

Intuitively, the similarity between query and object feature representations is computed, and then the feature similarity computed as the weighted sum of the similarity of the individual feature representations. This process is repeated one level higher when the overall similarity of the object is the weighted sum over all the feature similarities. The weights at the lowest level, the component level, are used by the different similarity measures internally. Figure 6 traces this process for our familiar example of a blue sky image as a query and a blue ocean image in the collection.

[Figure 6 ILLUSTRATION OMITTED]

Based on the image object model and the set of similarity measures, the retrieval process can be described as follows. At the initial query stage, equal weights are associated with the features, representations, and components. Best matches are then displayed back to the user. Depending on his true information need, the user will mark how good the returned matches are (degree of relevance). Based on the user's feedback, the retrieval system will automatically update weights to match the user's true information need. This process i,; illustrated in Figure 5. In Figure 5, the information need embedded in Q flows up while the content of O's flows down. They meet at the dashed line where the similarity measures [m.sub.ij] are applied to calculate the similarity values S([r.sub.ij])'s between Q and O's.

Based on the intuition that important representations or components should receive more weight, we have proposed effective algorithms for updating these two levels' weights. Due to page limitation, we refer the readers to Rui et al. (1998b).

EXPERIMENTAL RESULTS

In the experiments reported here, we test our approaches over the image collection from the Fowler Museum of Cultural History at the University of California--Los Angeles. It contains 286 ancient African and Peruvian artifacts artifacts

see specimen artifacts.
 and is part of the Museum Educational Site Licensing Project (MESL MESL Museum Educational Site Licensing
MESL Museum of Eastern Shore Life (Centreville, MD)
MESL Mission Essential Subsystem List
MESL Mission Event Synchronization List
MESL Men's Evangelical Softball League
) sponsored by the Getty Information Institute. The size of the MESL test set is relatively small, but it allows us to explore all the color, texture, and shape features simultaneously in a meaningful way. More extensive experiments with larger collections have been performed and reported in Ortega et al. (1998b) and Rui et al. (1998b).

In the following experiments, the visual features used are color, texture, and shape of the objects in the image. The representations used are color histogram and color moments (Swain & Ballard, 1991), for the color feature Tamura (Tamura et al., 1978; Equitz & Niblack, 1994), and co-occurrence matrix (Haralick et al., 1973; Ohanian & Dubes, 1992) texture representations for the texture feature, and Fourier descriptor and chamfer chamfer (cham´fr),
n in extracoronal cavity preparations, a marginal finish that produces a curve from an axial wall to the cavosurface.
 shape descriptor (Rui et al., 1997b) for the shape feature.

Boolean Retrieval Model Results

To conduct the experiments, we chose several queries and manually determined the relevant set of images with the help of experts in librarianship as part of a seminar in multimedia retrieval. With the set of queries and relevant answers for each of them, we constructed precision-recall curves (Salton & McGill, 1983). These are based on the well-known precision and recall metrics. Precision measures the percentage of relevant answers, and recall measures the percentage of relevant objects returned to the user. The precision/recall graphs are constructed by measuring the precision for various levels of recall.

We conducted experiments to verify the role of feature weighting in retrieval. Figure 7 (a) shows results of a shape or color query--i.e., to retrieve all images having either the same shape or the same color as the query image. We obtained four different precision/recall curves by varying the feature weights. The retrieval performance improves when the shape feature receives more emphasis.

[Figure 7 ILLUSTRATION OMITTED]

We also conducted experiments to observe the impact of the retrieval model used to evaluate the queries. We observed that the fuzzy and probabilistic interpretations of the same query yield different results. Figure 7(b) shows the performance of the same query (a texture or color query) in the two models. The result shows that neither model is consistently better than the other in terms of retrieval.

Figure 7(c) shows a complex query (shape ([I.sub.i]) and color ([I.sub.i]) or shape ([I.sub.j]) and layout ([I.sub.j])) with different weightings. The three weightings fared quite similarly, which suggests that complex weightings may not have a significant effect on retrieval performance. We used the same complex query to compare the performance of the retrieval models. The result is shown in Figure 7(d). In general, the probabilistic model outperforms the fuzzy model.

Vector Retrieval Model with Relevance Feedback Results

There are two sets of experiments reported here. The first set of experiments is on the efficiency of the retrieval algorithm--i.e., how fast the retrieval results converge to the true results. The second set of experiments is on the effectiveness of the retrieval algorithm--i.e., how good the retrieval results are subjectively.

Efficiency of the Algorithm

As we have discussed in the section "The Object Model," the image object is modeled by the combinations of representations with their corresponding weights. If we fix the representations, then a query can be completely characterized by the set of weights embedded in the query object Q. Obviously, the retrieval performance is affected by the offset of the true weights from the initial weights. We thus classify the test into two categories--i.e., moderate offset and significant offset--by considering how far away the true weights are from the initial weights. The convergence ratio (recall) for these cases is summarized in Figure 8. Based on the curves, some observations can be made:

[Figure 8 ILLUSTRATION OMITTED]

* In all the cases, the convergence ratio (CR) increases the most in the first iteration One repetition of a sequence of instructions or events. For example, in a program loop, one iteration is once through the instructions in the loop. See iterative development.

(programming) iteration - Repetition of a sequence of instructions.
. Later iterations only result in minor increases in CR. This is a very desirable property, which ensures that the user gets reasonable results after only one iteration of feedback.

* CR is affected by the degree of offset. The lower the offset, the higher the final absolute CR. However, the more the offset, the higher the relative increase of CR.

Effectiveness of the Algorithm

Extensive experiments have been carried out. Users from various disciplines, such as computer vision, art, library science, and so on, as well as users from industry, have been invited to judge the retrieval performance of the proposed interactive approach. A typical retrieval process on the MESL test set is given in Figures 9 and 10.

[Figures 9-10 ILLUSTRATION OMITTED]

The user can browse through the image database. Once the user finds an image of interest, that image is submitted as a query. In Figure 9, the query image is displayed at the upper-left corner as well as the best eleven retrieved images. The top eleven best matches are displayed in order from top to bottom and from left to right. The retrieved results are obtained based on their overall similarities to the query image, which are computed from all the features and all the representations. Some retrieved images are similar to the query image in terms of the shape feature while others are similar to the query image in terms of the color or texture feature.

Assume the user's true information need is to "retrieve similar images based on their shapes." In the proposed retrieval approach, the user is no longer required to explicitly map his or her information need to low-level features, but rather the user can express the intended information need by marking the relevance scores of the returned images. In this example, images 247, 218, 228, and 164 are marked highly relevant. Images 191,168, 165, and 78 are marked highly non-relevant. Images 154, 152, and 273 are marked no-opinion.

Based on the information fed back by the user, the system dynamically adjusts the weights, putting more emphasis on the shape feature, possibly even more emphasis to one of the two shape representations which better matches the user's subjective perception of shape. The improved retrieval results are displayed in Figure 10. Note that our shape representations are invariant (programming) invariant - A rule, such as the ordering of an ordered list or heap, that applies throughout the life of a data structure or procedure. Each change to the data structure must maintain the correctness of the invariant.  to translation, rotation, and scaling. Therefore, images 164 and 96 are relevant to the query image.

CONCLUSION

This article discussed techniques to extend information retrieval beyond the textual domain. Specifically, it discussed how to extract visual features from images and video; how to adapt a Boolean retrieval model (enhanced with fuzzy and probabilistic concepts) for VIR systems; and how to generalize the relevance feedback technique to VIR.

In the past decade, two general approaches to VIR emerged. One is based on text (tides, keywords, and annotation 1. (programming, compiler) annotation - Extra information associated with a particular point in a document or program. Annotations may be added either by a compiler or by the programmer. ) to search for visual information indirectly. This paradigm requires much human labor and suffers from vocabulary inconsistency problems across human indexers. The other paradigm seeks to build fully automated systems by completely discarding the text information and performing the search on visual information only. Neither paradigm has been very successful. In our view, these two paradigms both have their advantages and disadvantages and sometimes are complimentary to each other. For example, in the MESL database, it will be much more meaningful if we first do a text-based search to confine the category and then use a visual feature-based search to refine the result. Another promising research direction is the integration of the human user into the retrieval system loop. A fundamental difference between an old pattern recognition system and today's VIR system is that the end-user of the latter is human. By integrating human knowledge into the retrieval process, we can bypass the unsolved problem of image understanding. Relevance feedback is one technique designed to deal with this problem.

ACKNOWLEDGMENTS

This work was supported by NSF NSF - National Science Foundation  CAREER award IIS-9734300; in part by NSF CISE CISE Center for Information and Systems Engineering (Boston University)
CISE Construction Industry Safety Excellence (Award)
CISE Curriculum, Instruction and Special Education
 Research Infrastructure Grant CDA-9624396; and in part by the Army Research Laboratory under Cooperative Agreement No. DAAL01-96-0003. Michael Ortega is supported in part by CONACYT CONACYT Consejo Nacional de Ciencia y Tecnología (National Board of Science and Technology; Mexico, Bolivia, Paraguay)  Grant 89061 and an IBM Fellowship. Some example images used in this article are used with permission from the Fowler Museum of Cultural History at the University of California--Los Angeles.

REFERENCES

Arkin, E. M.; Chew, L.; Huttenlocher, D.; Kedem, K.; & Mitchell, J. (1991). An efficiently computable metric for comparing polygonal pol·y·gon  
n.
A closed plane figure bounded by three or more line segments.



po·lygo·nal adj.
 shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence The IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) is a monthly journal published by the IEEE Computer Society. It presents the most important research results in areas including all traditional areas of computer vision and image understanding, all , 13(3), 209-216.

Bach, J. R.; Fuller, C.; Gupta, A.; Hampapur, A.; Horowitz, B.; Humphrey, R.; Jain, R.; & Shu, C-F. (1996). The Virage image search engine: An open framework for image management. In Storage and retrieval for image and video databases IV (Proceedings held February 1-2, 1996, San Jose San Jose, city, United States
San Jose (sănəzā`, săn hōzā`), city (1990 pop. 782,248), seat of Santa Clara co., W central Calif.; founded 1777, inc. 1850.
, CA) (pp. 76-87). Bellingham, WA: SPIE SPIE International Society for Optical Engineering
SPIE Society of Photo-Optical Instrumentation Engineers
SPIE Source Path Isolation Engine
SPIE Special Purpose Insertion Extraction
SPIE Software Process Improvement Experimentation
SPIE Standard Protocols in Effect
 

Buckley, C., & Salton, G. (1995). Optimization of relevance feedback weights. In SIGIR SIGIR Special Interest Group on Information Retrieval (Association for Computing Machinery)
SIGIR Special Inspector General for Iraq Reconstruction
 '95 (Proceedings of the 18th Annual International ACM (Association for Computing Machinery, New York, www.acm.org) A membership organization founded in 1947 dedicated to advancing the arts and sciences of information processing. In addition to awards and publications, ACM also maintains special interest groups (SIGs) in the computer field.  SIGIR Conference on Research and Development in Information Retrieval, July 9-13, 1995, Seattle, WA) (pp. 351357). New York New York, state, United States
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of
: Association for Computing Machinery See ACM.

Association for Computing Machinery - Association for Computing
 Press.

Chaudhari, S., & Gravano, L. (1996). Optimizing queries over multimedia repositories. In Proceedings of the 1996 ACM SIGMOD SIGMOD Special Interest Group On Management of Data  International Conference on Management of Data (June 4-6, 1996, Montreal, Quebec, Canada) (pp. 91-102). New York: Association for Computing Machinery Press.

Chuang, G. C-H., & Kuo, C-C C-C Carbon-Carbon
C-C Carotid-Cavernous (relating to the carotid artery and the sinuses) 
. J. (19961,. Wavelet descriptor of planar A technique developed by Fairchild Instruments that creates transistor sublayers by forcing chemicals under pressure into exposed areas. Planar superseded the mesa process and was a major step toward creating the chip.  curves: Theory and applications. IEEE (Institute of Electrical and Electronics Engineers, New York, www.ieee.org) A membership organization that includes engineers, scientists and students in electronics and allied fields.  Transactions of Image Processing image processing

Set of computational techniques for analyzing, enhancing, compressing, and reconstructing images. Its main components are importing, in which an image is captured through scanning or digital photography; analysis and manipulation of the image, accomplished
, 5(1), 56-70.

Equitz, W., & Niblack, W. (1994). Retrieving images from a database using texture-algorithms from the QBIC QBIC Query By Image Content
QBIC queries based on image content
QBIC Cubic Format
 system (IBM Computer Science Tech. Rep. No. RJ 9805). San Jose, CA: IBM.

Fagin, R. (1996). Combining fuzzy information from multiple systems. In Proceedings of the Fifteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS 1996, conference held June 3-5, 1996, Montreal, Canada) (pp. 216-226). New York: Association for Computing Machinery Press.

Fagin, R., & Wimmers, E. L. (1997). Incorporating user preferences in multimedia queries. In F. N. Afrati (Ed.), Database Theory-ICDT '97 (Proceedings of the 6th International Conference, January 8-10, 1997, Delphi, Greece) (pp. 247-261). Berlin, Germany: Springer.

Flickner, M.; Sawhney, H.; Niblack, W.; Ashley, J.; Huang, Q.; Dom, B.; Gorkani, M.; Hafine, J.; Lee, D.; Petkovic, D.; Steele, D.; & Yanker, P. (1995). Query by image and video content: The QBIC system. Computer, 28(9), 23-32.

Haralick, R. M.; Shanmugam, K.; & Dinstein, I. (1973). Texture features for image classification. IEEE Transactions on Systems, Man, and Cybernetics cybernetics [Gr.,=steersman], term coined by American mathematician Norbert Wiener to refer to the general analysis of control systems and communication systems in living organisms and machines. , 3(6), 610-621.

Kundu, A., & Chen, J-L. (1992). Texture classification using QMF (Query Management Facility) An IBM fourth-generation language for end-user interaction with DB2.  bank-based subband decomposition decomposition /de·com·po·si·tion/ (de-kom?pah-zish´un) the separation of compound bodies into their constituent principles.

de·com·po·si·tion
n.
1.
. Graphical Models and Image Processing, 54(5), 369-384.

McCamy, C. S.; Marcus, H.; & Davidson J. G. (1976). A color-rendition chart. Journal of Applied Photographic Engineering, 2(';), 95-99.

Miyahara, M. (1988). Mathematical transform of (R,G,B) color data to munsell (H,S,V) color data. In R. Hsing (Ed.), Proceedings of SPIE Proceedings of SPIE is proceeding of The International Society for Optical Engineering; see the homepage of the proceefing at [1]. This publication does not match the requirements of a scientific journal, some papers in the proceedings of SPIE : The Visual Society for Optical Engineering, Vol. 1001 (Visual Communications and Image Processing '88, November 9-11, 1988, Cambridge, MA) (pp. 650-657). Bellingham, WA: SPIE.

Ortega, M..; Rui, Y.; Chakrabarti, K.; Porkaew, K.; Mehrotra, S.; & Huang, T. S. (1998). Supporting ranked Boolean similarity queries in MARS. IEEE Transactions on Knowledge and Data Engineering, 10(6), 905-925.

Pentland, A.; Picard, R. W.; & Sclaroff, S. (1996). Photobook: Content-based manipulation of image databases. International Journal of Computer Vision, 18(3), 233-254.

Rui, Y.; She, A. C.; & Huang, T. S. (1996). Modified Fourier descriptors for shape representation--a practical approach. In A. Smeulders & R. Jain (Eds.), Image databases and multi media search (pp. 165-180). River Edge, NJ: World Scientific.

Rui, Y.; Huang, T. S.; Ortega, M.; & Mehrotra, S. (1998). Relevance feedback: A power tool in interactive content-based image retrieval Content-based image retrieval (CBIR), also known as query by image content (QBIC) and content-based visual information retrieval (CBVIR) is the application of computer vision to the image retrieval problem, that is, the problem of searching for . IEEE Transactions on Circuits and Systems for Video Technology, 8(5), 644-655.

Rui, Y.; Huang, T. S.; & Chang, S-F. (1999). Image retrieval An image retrieval system is a computer system for browsing, searching and retrieving images from a large database of digital images. Most traditional and common methods of image retrieval utilize some method of adding metadata such as captioning, keywords, or descriptions to the : Current techniques, promising directions, and open issues. Journal of Visual Communication and Image Representation, 10(1), 39-62.

Salton, G., & McGill, M. J. (1983). Introduction to modern information retrieval. New York: McGraw-Hill Book Company.

Shaw, W. M. (1995). Term-relevance computations and perfect retrieval performance. Information Processing information processing: see data processing.
information processing

Acquisition, recording, organization, retrieval, display, and dissemination of information. Today the term usually refers to computer-based operations.
 and Management, 31(4), 491-498.

Smith, J. R., & Chang, S-E S-E Spheno Ethmoidectomy  (1996). Tools and techniques for color image A (digital) color image is a digital image that includes color information for each pixel.

For visually acceptable results, it is necessary (and almost sufficient) to provide three samples (color channels
 retrieval. In Storage & retrieval for image and video databases IV (Proceedings of the International Society for Optical Engineering, vol. 2670) (pp. 426-437). Bellingham, WA: SPIE.

Stricker, M., & Orengo, M. (1995). Similarity of color images. In W. Niblack & R. C. Jain (Eds.), Storage and retrieval for image and video database III (Proceedings of the International Society for Optical Engineering, vol. 2420) (pp. 381-392). Bellingham, WA: SPIE.

Swain, M., & Ballard, D. (1991). Color indexing. International Journal of Computer Vision, 7(1), 11-32.

Tamura, H.; Mori, S.; & Yamawaki, T. (1978). Texture features corresponding to visual perception. IEEE Transactions on Systems, Man, and Cybernetics, 8(6), 460-473.

ADDITIONAL REFERENCES

Hu, M. K. (1962). Visual pattern recognition by moment invariants, computer methods in image analysis. In IRE Transactions on Information Theory (316 p.). New York: Institute of Radio Engineers Following several attempts to form a technical organization of wireless practitioners in 1908-1912, the Institute of Radio Engineers (IRE) was finally established in 1912 in New York. .

Ortega, M.; Chakrabarti, K.; Porkaew, K.; & Mehrotra, S. (1998). Cross media validation in a multimedia retrieval system. Unpublished paper presented at the ACM Digital Libraries '98 Workshop on Metrics in Digital Libraries.

Ortega, M.; Rui, Y.; Chakrabarti, K.; Mehrotra, S.; & Huang, T. S. (1997). Supporting similarity queries in MARS. In Proceedings of ACM Multimedia '97 (November 9-13, 1997, Seattle, WA) (pp. 403-413). New York: Association for Computing Machinery.

Rui, Y.; Huang, T. S.; & Mehrotra, S. (1997). Content-based image retrieval with relevance feedback in MARS. In Proceedings of the International Conference on Image Processing (October 26-29, 1997, Santa Barbara Santa Barbara (săn'tə bär`brə, –bərə), city (1990 pop. 85,571), seat of Santa Barbara co., S Calif., on the Pacific Ocean; inc. 1850. , CA) (pp. 815-818). Los Alamitos Los Alamitos (lôs ăləmē`təs, lŏs), city (1990 pop. 11,676), Orange co., NE of Long Beach, S Calif., in a suburban area; inc. 1960. Los Alamitos Racetrack and U.S. military installations are nearby. , CA: IEEE Computer Society (body) IEEE Computer Society - The society of the IEEE which publishes the journal "Computer".

http://computer.org/.
.

Rui, Y.; Huang, T. S.; & Mehrotra, S. (1998). Exploring video structure beyond the shots. In Proceedings of the International Conference on Multimedia Computing and Systems (June 28-July 1, 1998, Austin, TX) (pp. 237-240). Los Alamitos, CA: IEEE Computer Society.

Yong Rui, Microsoft Research Microsoft Research (MSR) is a division of Microsoft created in 1991 for researching various computer science topics and issues. Overview
Microsoft Research (MSR) is one of the top research centers worldwide, currently employing Turing Award winners, C.A.R.
, One Microsoft Way, Redmond, WA 98052 Michael Ortega, 444 Computer Science, University of California The University of California has a combined student body of more than 191,000 students, over 1,340,000 living alumni, and a combined systemwide and campus endowment of just over $7.3 billion (8th largest in the United States). , Irvine, CA 92697-3425 Thomas S. Huang, Beckman Institute for Advanced Science and Technology, University of Illinois University of Illinois may refer to:
  • University of Illinois at Urbana-Champaign (flagship campus)
  • University of Illinois at Chicago
  • University of Illinois at Springfield
  • University of Illinois system
It can also refer to:
, Urbana, IL 61801 Sharad Mehrotra, Department of Information and Computer Science, University of California, Irvine, CA 92697-3425 LIBRARY TRENDS, Vol. 48, No. 2, Fall 1999, pp. 455-474

YONG RUI is currently a researcher at Microsoft Research in Redmond, Washington Redmond is a city in King County, Washington, USA. It is situated on the eastern edge of the Seattle urban area, in what is known as the Eastside. In 2003 the Census Bureau estimated the city population was 46,391. . His research interests include multimedia information retrieval, multimedia signal processing See DSP. , computer vision, and artificial intelligence. He has published over thirty technical papers in these areas. He is a 1989-1990 Huitong University Fellowship recipient, a 1992-1993 Guanghua University Fellowship recipient, and a 1996-1998 CSE (Certified Systems Engineer) See Microsoft certification.  Engineering College Fellowship recipient.

MICHAEL ORTEGA is currently pursuing his graduate studies at the University of Illinois at Urbana-Champaign Early years: 1867-1880
The Morrill Act of 1862 granted each state in the United States a portion of land on which to establish a major public state university, one which could teach agriculture, mechanic arts, and military training, "without excluding other scientific
. He received a Fulbright/ CONACYT/Garcia Robles Robles is a common surname in the Spanish language meaning oaks, and may refer to:
  • Alfonso García Robles (1911-1991), Mexican diplomat and politician
  • Aurora Robles (born 1980), Mexican fashion model
  • Charlie Robles (born 1943), Puerto Rican musician
 scholarship to pursue graduate studies as well as the Mavis Award at the University of Illinois and is a member of the Phi Kappa Phi The Honor Society of Phi Kappa Phi (or simply Phi Kappa Phi) is the oldest, largest and most selective all-discipline honor society for land-grant and public colleges in the United States.  honor society honor society
n.
An organization to which students are admitted in recognition of academic achievement.
, the IEEE computer society, and member of the ACM. His research interests include multimedia databases, database optimization for uncertainty support, and content-based multimedia information retrieval.

THOMAS S. HUANG joined the University of Illinois at Urbana-Champaign in 1980, where he is now William L. Everitt Distinguished Professor of Electrical and Computer Engineering, Research Professor at the Coordinated Science Laboratory, and Head of the Image Formation and Processing Group at the Beckman Institute for Advanced Science and Technology. He was on the Faculty of the Department of Electrical Engineering electrical engineering: see engineering.
electrical engineering

Branch of engineering concerned with the practical applications of electricity in all its forms, including those of electronics.
 at MIT MIT - Massachusetts Institute of Technology  from 1963 to 1973 and on the faculty of the School of Electrical Engineering and Director of its Laboratory for Information and Signal Processing at Purdue University Purdue University (pərdy`, -d`), main campus at West Lafayette, Ind.  from 1973 to 1980. Dr. Huang's professional interests lie in the broad area of information technology, especially the transmission and processing of multidimensional mul·ti·di·men·sion·al  
adj.
Of, relating to, or having several dimensions.



multi·di·men
 signals. He has published twelve books and over 300 papers on network theory, digital filtering, image processing, and computer vision. He received the IEEE Acoustics, Speech, and Signal Processing Society's Technical Achievement Award in 1987 and the Society Award in 1991. He is a Founding Editor of the International Journal of Computer Vision, Graphics, and Image Processing and editor of the Springer Series in Information Sciences published by Springer Verlag.

SHARAD MEHROTRA is an Assistant Professor in the Computer Science Department at the University of Illinois at Urbana-Champaign since 1994. He has subsequently worked at MITL MITL Man-In-The-Loop
MITL Magnetically-Insulated Transmission Line
MITL Meet In The Lobby
, Princeton, as a scientist from 1993 to 1994. He specializes in the areas of database management, distributed systems Distributed systems (computers)

A distributed system consists of a collection of autonomous computers linked by a computer network and equipped with distributed system software.
, and information retrieval. His current research projects are on multimedia analysis, content-based retrieval of multimedia objects, multidimensional indexing, uncertainty management in databases, and concurrency Operations that are performed simultaneously within the computer. For example, dual-core CPUs provide complete overlapping of two independent processes. See dual core, hyperthreading, multiprocessing, multitasking, multithreading, SMP and MPP.

concurrency - multitasking
 and transaction management. Dr. Mehrotra is an author of over fifty research publications in these areas. He is the recipient of the NSF Career Award and the Bill Gear Outstanding junior faculty award in 1997.
COPYRIGHT 1999 University of Illinois at Urbana-Champaign
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 1999, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:MEHROTRA, SHARAD
Publication:Library Trends
Date:Sep 22, 1999
Words:6623
Previous Article:Evaluation of Image Retrieval Systems: Role of User Feedback.
Next Article:Precise and Efficient Retrieval of Captioned Images: The MARIE Project.
Topics:



Related Articles
Finding needles in database haystacks. (new method for text retrieval)
Introduction.(examination of knowledge discovery in databases)
Exploiting Multimodal Context in Image Retrieval.
Author Guidelines for Electronic References.
NOVEMBER CONFERENCE TACKLES TEXT RETRIEVAL SYSTEMS.(Brief Article)
Document Management for the Enterprise.(Book Review)
LaserFiche offers new applications.(Market place: new products & industry announcements)
Scan and search more comprehensively.(Doculex Search version 6)(Product/Service Evaluation)
The most influential paper Gerard Salton never wrote.
Improving performance support systems through information retrieval evaluation.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles