Printer Friendly
The Free Library
14,495,914 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

How to find it on the Net.


True or false: (1) The Internet is an incredible research resource overflowing o·ver·flow  
v. o·ver·flowed, o·ver·flow·ing, o·ver·flows

v.intr.
1. To flow or run over the top, brim, or banks.

2. To be filled beyond capacity, as a container or waterway.

3.
 with valuable information. (2) The Internet has become a labyrinth labyrinth (lăb`ərĭnth), intricate building of chambers and passages, often constructed so as to perplex and confuse a person inside.  of hard-to-find data and dead-end home pages replete re·plete  
adj.
1. Abundantly supplied; abounding: a stream replete with trout; an apartment replete with Empire furniture.

2. Filled to satiation; gorged.

3.
 with redundant, often useless information.

If you answered "true" to both, you're right-even though the two answers seem to contradict con·tra·dict  
v. con·tra·dict·ed, con·tra·dict·ing, con·tra·dicts

v.tr.
1. To assert or express the opposite of (a statement).

2. To deny the statement of. See Synonyms at deny.
 each other. While the Net is a labyrinth with dead ends and lots of junk, software tools are available to bypass the useless and deliver the essential information you're looking for Looking for

In the context of general equities, this describing a buy interest in which a dealer is asked to offer stock, often involving a capital commitment. Antithesis of in touch with.
. All this can be done with a few mouse clicks; the trick is to know what to click on.

Search tools (also known as search engines) are available free on the Net, and this article helps you pick the right one for your particular research. All search tools are linked to databases that essentially are giant indexes of much of the information available on the Net. Accessing such indexes is easy: Just click on the Web site of the search tool; there is no software to download or configure See configuration.

(software) configure - A program by Richard Stallman to discover properties of the current platform and to set up make to compile and install gcc.

Cygnus configure was a similar system developed by K.
.

While most Internet providers Internet provider - Internet Service Provider  usually default to one search tool when you evoke e·voke  
tr.v. e·voked, e·vok·ing, e·vokes
1. To summon or call forth: actions that evoked our mistrust.

2.
 a search, you can always change that default to any other search engine. And if you want to locate other search tools, just type "search tool" as your key words in whatever engine your software defaults to and it will find the others.

Two ingredients go into making a good search engine: a large, up-to-date and comprehensive index and a database search method that is fast, customizable and designed so it doesn't bring; up too many sites not relevant to your search. Be aware, however, that no search engine is perfect: Nearly every search will generate some irrelevant data, but the better engines are more discriminating dis·crim·i·nat·ing  
adj.
1.
a. Able to recognize or draw fine distinctions; perceptive.

b. Showing careful judgment or fine taste:
. As you can imagine, indexing is a herculean task: thousands upon thousands of addresses (also called URLs, pronounced either U-R-Ls or earls) plus millions of pages of data must be scanned, formatted and incorporated into the database index. With rare exceptions, the job is done exclusively by computer; as a result, sometimes mistakes are made and words are indexed incorrectly. But considering the millions of words involved, all the search tools generally do the job relatively well.

The most popular search tools and their Web addresses are
AltaVista         http://altavista.digital.com
Cyber411          http://www.cyber411.com
Excite            http://www.excite.com
InfoSeek          http://info.infoseek.com
InfoSeek Ultra    http://ultra.infoseek.com
Lycos             http://www.lycos.com
Magellan          http://magellan.mckinley.com
WebCrawler        http://www.webcrawler.com
Yahoo             http://www.yahoo.com




And now for a closer look at these popular search tools.

ALTAVISTA

This service claims its index has over 50 million pages on 476,000 servers and 4 million articles from 14,000 Usenets (Internet jargon jargon, pejorative term applied to speech or writing that is considered meaningless, unintelligible, or ugly. In one sense the term is applied to the special language of a profession, which may be unnecessarily complicated, e.g., "medical jargon.  for newsgroups This is a list of newsgroups that are significant for their popularity or their position in Usenet history.

As of October 2002, there are about 100,000 Usenet newsgroups, of which approximately a fifth are active.
). AltaVista says users access the service more than 28 million times every weekday. Be aware that some hyperbole hyperbole (hīpûr`bəlē), a figure of speech in which exceptional exaggeration is deliberately used for emphasis rather than deception.  is common among search engine vendors, and many of their claims are hard, if not impossible, to substantiate To establish the existence or truth of a particular fact through the use of competent evidence; to verify.

For example, an Eyewitness might be called by a party to a lawsuit to substantiate that party's testimony.
 because the Net is free form and uncontrolled. But it's clear that AltaVista does have a large index and it's a popular service.

Unlike Yahoo (the oldest and probably the best known search engine), which indexes the key descriptive words of home pages, AltaVista indexes the complete text within pages. This is both an advantage and disadvantage.

Advantage: You can complete an exhaustive search, including words in a certain order. For example, if you want items mentioning Great Plains Software, most search tools will return finds with both great and plains. AltaVista allows you to specify the order of the words--in effect, acting as a filter.

Disadvantages: Unless you are very specific, you probably will be flooded with data. And because AltaVista's indexing is so comprehensive, it needs more time to index new home pages on the Net; as a result, some very new ones don't get into the database immediately.

CYBER411

With so many search engines available, it was bound to happen: a search engine that searches the other search engines. Cyber411 is what's called a "parallel" search engine, which means it contacts 15 other search engines to do the actual work and then collates the results. Cyber411 also eliminates duplicate entries from the various search engines.

Each of its finds is hyperlinked to related sites and is earmarked by the engine that generated the location. However, unlike most of the other engines, it does not provide additional summary information about the found sites.

When we tested the site with a word string query, we found that some of the sites Cyber411 found didn't contain that word string--in other words, it located information we did not want. While such errors are annoying, it does not take away from the comprehensive search power of Cyber411.

EXCITE

Excite also says it has 50 million Web pages indexed in its database. When you type a description of what you're looking for in an Excite search form, it scrutinizes the full text of 50 million Web pages. In addition to listing finds, Excite is able to estimate how relevant each one is to your initial request. And then it goes a step further: If you click on a button that says "More like this," you can track down more sites that are similar to any of the sites just found. That's particularly handy because a first-time search usually discloses a smattering of information a bit off the mark. But with this feature, if one of the search results looks like the closest match, then the "More like this" option delivers more results similar to that near hit.

Excite claims to work at twice the speed of its competition, but, while it is fast, our tests couldn't confirm the specific speed claim. Like most other search engines, Excite lists about 10 search results at a time, in decreasing order of confidence that it has matched your request. Each result consists of a title, a URL URL
 in full Uniform Resource Locator

Address of a resource on the Internet. The resource can be any type of file stored on a server, such as a Web page, a text file, a graphics file, or an application program.
 and a brief summary of the found page. By adding a symbol and a number value to the end of a word, a user can command the engine to focus its search more intensely on that word in a string of search words; in general, that usually results in a higher success rate. Here's how that works: Say you want to search the following: dog care grooming Combining, consolidating and segregating network traffic using devices such as digital cross-connects, add/drop multiplexers and SONET switches. Grooming is a telephone term that typically refers to managing high-capacity lines between central offices, carriers, ISPs and very large . By adding 3 after the word grooming, you will order the search engine to give three times more emphasis to it than to the other words in the string. So while the words dog and care remain search targets, they do not get as much attention from the search engine as grooming.

Something new is Excite Live (live.excite.com). Evoke that page and Excite will pop up a series of questions about what news you want to track. Once you profile your interests, Excite Live creates a summary of just that news. Each summary is hyperlinked, so if you want to dig deeper, one click will take you to the full story.

You can make Excite the default search tool with a simple click on a button on its home page.

INFOSEEK

In its May 1996 issue, the magazine Internet World tested many search engines and praised Infoseek as delivering the most relevant results. Fortune also rated Infoseek highly.

Infoseek has a unique indexing system: It keeps track of the order in which words appear on pages and in articles, so you can specify phrases or groups of words in exact order or just in proximity. You also can interact with the search engine to improve your results. When you find a search result that's interesting to you, just click the Similar Pages link. Infoseek will scour scour, scours

1. the chemical and physical cleaning of fleece wool.

2. diarrhea.


dietetic scour
see dietary diarrhea.

peat scour
see secondary nutritional copper deficiency.
 the Internet for pages with similar contents.

INFOSEEK ULTRA

One of the newer search engines, Infoseek Ultra claims to be the next generation of search technology. It can perform 1,000 queries per second on a database of tens of millions of documents. The engine uses a proprietary search formula that can merge results from multiple searches but with more accuracy than Cyber411.

Infoseek Ultra has an enormous index. It lists over 80 million URLs and claims to have indexed the full texts of over 50 million. More important, Infoseek Ultra indexes only what its programmers consider to be meaningful URLs. With today's search technology, you often have to deal with dead hyperlinks (pages that contain no data) or duplicate pages--making the search difficult and frustrating frus·trate  
tr.v. frus·trat·ed, frus·trat·ing, frus·trates
1.
a. To prevent from accomplishing a purpose or fulfilling a desire; thwart:
. Infoseek Ultra is relatively effective in eliminating these pages.

The engine also uses automatic name recognition, which means it can recognize that a word in a query is a person's name and finds only documents containing the name. Many other engines require that you use special typographic See typography.  symbols, such as quotation marks quotation marks
Noun, pl

the punctuation marks used to begin and end a quotation, either `` and '' or ` and '

quotation marks nplcomillas fpl

 or capitalization capitalization n. 1) the act of counting anticipated earnings and expenses as capital assets (property, equipment, fixtures) for accounting purposes. 2) the amount of anticipated net earnings which hypothetically can be used for conversion into capital assets. , to after for name recognition.

LYCOS

This search engine is similar to Yahoo (listed below) but with a few extras. For example, if the user initiates a search for an accounting standard, say, Lycos will not only find the specific site in which the standard is listed but it also will search the text for any mention of the standard in other sites--in effect, doing a global search for the standard.

Lycos publishes a list of what its managers consider the best sites on the Net--a handy reference for new surfers. It also maintains a multimedia catalogue of the Web for sounds and pictures (all of which can be imbedded imbedded,
adj See embedded.
 in files for illustration or emphasis), hyperlinks to information by city and even the addresses of domain servers (the specific computers that serve Internet addresses There are two kinds of addresses that are widely used on the Internet. One is a person's e-mail address, and the other is the address of a Web site, which is known as a URL. Following is an explanation of Internet e-mail addresses only. For more on URLs, see URL and Internet domain name. , such as the American Institute of CPAs home page: www.aicpa.org).

When a user enters key words for searches, Lycos not only displays the addresses located but also provides a summary of what's available in each site. In addition, the engine estimates how relevant a located site is to your search; however, sometimes its estimates are not quite on the mark. Further, if in reading the summary it's clear that the listed results are nearly, but not quite, on target, you can use Lycos to search for related sites by effectively narrowing the search field; in that case, Lycos will not repeat URLs derived from the first search--a time-saving bonus.

Lycos has a handy feature called Remote Control. When you select it, a small window appears on the screen and remains active during your search. This feature allows you to work within the search engine while viewing Web sites.

Another nice feature is Road Maps. Not only can the feature deliver driving directions and a road map between any two U.S. locations but also if you provide it with the address of a domain server--say, www.aicpa.org--the following appears on your screen:

MAGELLAN

This online guide includes original editorial content, a directory of Internet sites that Magellan's staff has reviewed and rated, a vast database of yet-to-be-reviewed sites and a powerful search engine. If you browse the Magellan topics listed on its home page or perform a search, a list of sites that matches your area of interest pops up.

Magellan does a good job of keeping up with the expanding Web. It reviews and rates thousands of new sites each week. Magellan has been translated into French and German and soon will become available in other languages as well.

WEBCRAWLER

This search engine is owned by America Online See AOL. , which uses it as its default search tool. WebCrawler has some unique features. For example, it can use Boolean search A search for specific data. It implies that any condition can be searched for using the Boolean operators AND, OR and NOT. For example, the English language request: "Search for all Spanish and French speaking employees who have MBAs would be expressed as follows.  syntax--a high-tech language used by advanced database users. If you're familiar with the syntax syntax: see grammar.
syntax

Arrangement of words in sentences, clauses, and phrases, and the study of the formation of sentences and the relationship of their component parts.
, you get a bit more control over the search operation. It's possible, for example, to limit search results to just home page sites or just sites with summaries. Also, the number of search results can be customized to any number you want; that's important, because some searches may overwhelm o·ver·whelm  
tr.v. o·ver·whelmed, o·ver·whelm·ing, o·ver·whelms
1. To surge over and submerge; engulf: waves overwhelming the rocky shoreline.

2.
a.
 you by targeting hundreds of sites.

WebCrawler has another feature that is useful to users with their own home pages: a backward search function, which reports on who linked to your home page so you can keep track of visitors as a marketing tool.

YAHOO

The oldest and the best known Internet search engine is Yahoo. It presents a catalog catalog, descriptive list, on cards or in a book, of the contents of a library. Assurbanipal's library at Nineveh was cataloged on shelves of slate. The first known subject catalog was compiled by Callimachus at the Alexandrian Library in the 3d cent. B.C.  of sites on the Web. Its listings are much like the yellow pages or an encyclopedia encyclopedia, compendium of knowledge, either general (attempting to cover all fields) or specialized (aiming to be comprehensive in a particular field). Encyclopedias and Other Reference Books
 --showing many sites under subject categories or key words. It excels when searching for all sites by a key word or company name. For example, if the key word is Widget Pronounced "wih-jit," for decades, the term has been a popular word for a generic "thing" when there is no real name for it. It is often used to describe examples of made-up products along with other fictitious names; for example, "10 widgets, 5 frabbits and 2 dingits.  Manufacturing Co., Yahoo can be programmed to evoke only the home page for Widget, not all other references to the word widget.

Yahoo contains buttons on its home page that, when clicked, produce some very useful information. For example, there's Yellow Pages (to find a business), People Search (to find phone numbers and e-mail addresses See Internet address.

e-mail address - electronic mail address
 of individuals), City Maps (in which you type in a street address, city, state and ZIP code zip code

System of postal-zone codes (zip stands for “zone improvement plan”) introduced in the U.S. in 1963 to improve mail delivery and exploit electronic reading and sorting capabilities.
 to generate a street map of that area), Today's News, Stock Quotes and Sports Scores.

Another handy feature is Get Local. Upon typing in a ZIP code, Yahoo retrieves information specifically related to that locality 1. locality - In sequential architectures programs tend to access data that has been accessed recently (temporal locality) or that is at an address near recently referenced data (spatial locality). This is the basis for the speed-up obtained with a cache memory.
2.
. It typically includes weather, yellow page data, local news headlines and even local sport scores. In addition, Get Local provide hyperlinks to other pages that carry information specific to that area.

THE BOTTOM LINE

Many search tools are available on the Net. All of them are good--none is perfect. Try them out and see which works best for you. It may turn out that, depending on the type of search you're conducting, some work better than others. It's best to pick two or three and get to know them very well. Take the time to read about the tools' advanced search techniques; that information is available on the home pages. This will allow you to speed your searches and make them more effective. The time you spend getting to know the tools will pay handsome dividends.

The Right Tool for the Job
If you want...                  ...try this search tool

Exhaustive research                  Cyber411

Research for the latest
information available               Infoseek Ultra

Research by string of words
within text contained in various
Web pages                           AltaVista

Advanced search techniques--
such as word weightings             Excite

Searches using the largest          AltaVista,
database                            Infoseek Ultra

Backward searches                   WebCrawler

Sites by categories                 Lycos, WebCrawler,
                                    Yahoo

Search FTP, Gopher and
Telnet sessions                     Infoseek, Magellan

Site reviews                        Lycos, Magellan

Customized news stories             Lycos

Easy hyperlinks to other
information such as city maps,
stock quotes, sport scores          Lycos, Yahoo

Quick location of a company site    Yahoo




RELATED ARTICLE: Search Tool Features
Feature                          InforSeek       AltaVista

Number of URLs indexed(1)           50+             50+
(claimed in millions)

Exact document counts(2)             X        Approximation

Automatic name recognition           X

Phrase searching                     X               X

Plain English queries(3)             X               X

Find all word variants(4)            X

Case-sensitive searching(5)          X               X

Seach on any word(6)                 X

Field-based searches(7)              X               X

Feature                           Exite     Lycos     Yahoo

Number of URLs indexed(1)          50+       50+       0.2+
(claimed in millions)

Exact document counts(2)           X          X         NA

Automatic name recognition

Phrase searching                                       Limited

Plain English queries(3)           X          X

Find all word variants(4)

Case-sensitive searching(5)                               X

Seach on any word(6)                                      X

Field-based searches(7)

Source: Inforseek.




(1) THe number of URLs indexed is widely used as a benchmark. Most engines claim in the 50 million range. However, since there are no stadnard methods of counting and reporting URLs in a search engine's database, this tends to be an unrealible comparison.

(2) Exact document counts refers to the search engine's ability to return the exact number of documents containing each word and phrase in the query, as well as the number of documents that match the entire query. This information can help you refine your query.

(3) Plain English Plain English (sometimes known, more broadly, as plain language) is a communication style that focuses on considering the audience's needs when writing. It recommends avoiding unnecessary words and avoiding jargon, technical terms, and long and ambiguous sentences.  queries means the user can type a question such as, "What are the lyrics lyrics npl [of song] → paroles fpl

lyrics lyric npl [of song] → Text m 
 to Penny Lane?" and get meaningful results.

(4) Find all word variants is the ability of the search engine to locate all variations of the query words. For example, a seach for mouse in Inforseek returns as many documents as a search for mice. Infoseek also can find other variations of a term, so when you type in Oracle-7, you also get Oracle 7 and Oracle 7.

(5) Case-sensitive searching is the ability of the seach engine to find capitalized or mixed-case terms, such as AIDS or NeXT, without getting false matches.

(6) Seach on any word refers to the ability of the search engine to find capitalized or mixed-case terms, such as AIDS or NeXT, without getting false matches.

(6) Search on any word refers to the ability of the search engine to complete a search that contains numbers and other common words. Most search engines, when faced with the phrase "to be or not to be To Be or Not to Be can refer to:
  • To be, or not to be (the soliloquy from Hamlet)
  • Two films with the same title:
  • To Be or Not to Be (1942 film)
  • To Be or Not to Be
," find all sites that contain any of those words--which is not very useful.

(7) Field-based searches refers to the search engine's ability to find words and phrases Words and Phrases®

A multivolume set of law books published by West Group containing thousands of judicial definitions of words and phrases, arranged alphabetically, from 1658 to the present.
 within a certain field of the Web page, such as in the title, URL or hyperlink A predefined linkage between one object and another. See hypertext.

hyperlink - anchor
. This enables you to determine the popularity of your URL or Web site.

RELATED ARTICLE: The Transient Internet

The good and bad news about the Internet is that it's easy to add, delete and change a Web page. The good feature is that information can be distributed and corrected with ease. The bad feature is that the Internet and its information are so impermanent im·per·ma·nent  
adj.
Not lasting or durable; not permanent.



im·perma·nence, im·per
 changes can be implemented with the click of a mouse. So, while all the services mentioned in this article existed when the article was prepared, it's possible--even likely--that some service won't be there when you look for it or new ones will be added.

WAYNE E. HARDING, CPA (Computer Press Association, Landing, NJ) An earlier membership organization founded in 1983 that promoted excellence in computer journalism. Its annual awards honored outstanding examples in print, broadcast and electronic media. The CPA disbanded in 2000. , is a vice-president of Great Plains Software, Fargo, North Dakota “Fargo” redirects here. For other uses, see Fargo (disambiguation).
Fargo is a city in Cass County, North Dakota in the United States. It is the county seat of Cass County, located in the Red River Valley region.
. A member of the American Institute of CPAs information technology research subcommittee sub·com·mit·tee  
n.
A subordinate committee composed of members appointed from a main committee.


subcommittee
Noun
, he is a former vice-president of the Colorado Society of CPAs.
COPYRIGHT 1997 American Institute of CPA's
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 1997, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Internet search tools
Author:Harding, Wayne E.
Publication:Journal of Accountancy
Date:Jun 1, 1997
Words:2999
Previous Article:Sizing up office suites. (business application software packages)
Next Article:Heavy-duty marketing. (techniques employed at Herman Yula Schwartz & Lagomarsino)
Topics:



Related Articles
The Internet comes to the nursing home.
An Internet primer for tax professionals.
The Internet - tool or toy?
Netscape Offers New Lycos Pro Search Technology.
Web Leads New Wave of User Interactivity.(wood industry)
Start your engines.(online searching services from sites such as Excite, Google, etc.)(Brief Article)
EarthLink Selects Google as Its Preferred Search Provider; Google's Highly Relevant Search Technology Available Now to EarthLink Community.
VALUE-ADDED INTERNET SEARCH TOOLS MOST POPULAR USES OF NET.
Searching for health info is a growing online activity.(Technology Notes)
Odvard Egil Dyrli on visual search tools: Grokker and related tools are transforming online research.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles