Government information on the Internet.ABSTRACT THE U.S. FEDERAL GOVERNMENT HAS BEEN A MAJOR PUBLISHER ON THE INTERNET. Its many agencies have used the Internet, and the Web most recently, to provide access to a great quantity of their information output. Several agencies such as the Library of Congress and the Government Printing Office have taken a leading role in both providing information and offering finding aids, while other endeavors such as FirstGov and subject gateways offer other avenues of access. A brief look back at the history of the government on the Web and the continuing concerns and challenges show how the government is not only a major content provider on the Internet but also a source for the organization of the content. INTRODUCTION: GOVERNMENT INFORMATION DISSEMINATION AND THE INTERNET The United States United States, officially United States of America, republic (2005 est. pop. 295,734,000), 3,539,227 sq mi (9,166,598 sq km), North America. The United States is the world's third largest country in population and the fourth largest country in area. federal government produces a great quantity of information and has been one of the largest publishers in the world. Throughout the twentieth century the amount of information from the federal government has increased enormously. Consider just the number of physical volumes published for each of the decennial de·cen·ni·al adj. 1. Relating to or lasting for ten years. 2. Occurring every ten years. n. A tenth anniversary. censuses and how with each census until the most recent the number of print volumes has grown tremendously. The rest of the government's corpus increased in a similar fashion. The ideal of the free flow of government information to the people grew into the Federal Depository Library Noun 1. depository library - a depository built to contain books and other materials for reading and study library athenaeum, atheneum - a place where reading materials are available Program (FDLP FDLP Federal Depository Library Program FDLP Federal Direct Loan Program ), with libraries in every state as a means to achieve that ideal. The FDLP certainly provided unprecedented access to government documents to a significant portion of the country's citizens. In the latter part of twentieth century, the Paperwork Reduction Act The Paper Reduction Act, officially the Paperwork Reduction Act of 1980, Pub. L. No. 96-511, 94 Stat. 2812 (Dec. 11, 1980), codified in part at Subchapter I of Chapter 35 of Title 44 of the United States Code, through , is a United States federal law enacted in 1980 that and other legislative and regulatory efforts showed a significant concern with the cost to the government of both the printing of all of these documents and the expense of disseminating them to so many libraries. Meanwhile, another government effort, ARPANET (Advanced Research Projects Agency NETwork) The research network funded by the U.S. Advanced Research Projects Agency (ARPA). The software was developed by Bolt, Beranek and Newman (BBN), and Honeywell 516 minicomputers were the first hardware used as from the Advanced Research Projects Agency (ARPA ARPA - Defense Advanced Research Projects Agency , later known as the Defense Advanced Research Projects Agency Defense Advanced Research Projects Agency (DARPA), U.S. government agency administered by the Department of Defense (see Defense, United States Department of). or DARPA DARPA: see Defense Advanced Research Projects Agency. (Defense Advanced Research Projects Agency) The name given to the U.S. Advanced Research Projects Agency during the 1980s. It was later renamed back to ARPA. ), was creating the beginnings of the Internet. Cold War fears led the researchers to look into using packet switching A network technology that breaks up a message into small packets for transmission. Unlike circuit switching, which requires the establishment of a dedicated point-to-point connection, each packet in a packet-switched network contains a destination address. technology for the network to survive nuclear bombing attacks taking out large sections of the network. As the Internet developed, the information dissemination capabilities of a large network became apparent to those involved in the research. Network developers used it themselves to communicate with each other, and electronic mail became one of the principal means of electronic communications. In addition to brief messages, researchers began sharing documents and then databases. Basically, the Internet became a way to share information. The federal government certainly has made great use of the Internet for the dissemination and organization of its publications. Since the early days of the Internet, government information resources (1) The data and information assets of an organization, department or unit. See data administration. (2) Another name for the Information Systems (IS) or Information Technology (IT) department. See IT. have grown and expanded in scope. From the Library of Congress to the Government Printing Office and many others, a great quantity of information content has been made available online, and a variety of finding aids and search engines help provide access. While there remain gaps, concerns, and challenges, the government is a major provider of quality, substantial content on the Internet. LIBRARY OF CONGRESS The Library of Congress (LC) has been organizing print resources for decades. The Library of Congress Classification Library of Congress Classification or LC Classification System of library organization developed during the reorganization of the U.S. Library of Congress. System and Subject Headings are staples of library organization. While they have not brought the same level of organization to the Internet, they have certainly contributed some major resources. A look back at the brief history of LC on the Internet provides an example on a large scale of what many other government agencies have done. Start with the April 30, 1993, announcement from LC. On that date, the Library announced its accessibility on the Internet when it made the Library of Congress Information System (LOCIS LOCIS Library of Congress Information System LOCIS Logistics Control and Information Support LOCIS Logistic Command & Information Support ) available via telnet connections. LC had joined the hundreds of other libraries who freely offer their catalogs via the Internet. Library catalogs occupy a unique role in the growth of information resources on the Internet. Internet availability on college campuses and at government research labs in the 1980s meant that telnet was widely available and that it created new possibilities for information dissemination. And recently automated libraries had freely available online databases that they were happy to share. Lynch and Preston (1990) note that, by 1989, the Colorado Association of Research Libraries (CARL) catalogs and the University of California The University of California has a combined student body of more than 191,000 students, over 1,340,000 living alumni, and a combined systemwide and campus endowment of just over $7.3 billion (8th largest in the United States). system catalog (database) system catalog - The data dictionary of a DBMS. The system catalogue stores meta-data including the schemas of the databases. It is a mini-database, and is usually stored using the DBMS itself in special tables called system tables. (MELVYL) were available on the Internet via the telnet protocol. The number of library catalogs rapidly expanded from there so that, by October 16, 1992, Billy Barron (1992) offered a listing of 482 Internet-accessible library catalogs. But with the 1993 LC launch, the largest government library provided not only its catalog but a collection of other important federal information. Notess (1993) describes the various databases that LC made available via telnet, including a database of copyright registrations and another with information about federal legislation. This became one of the first free sources for information on federal proposed and passed legislation. The database was not just for current bills. It even covered legislation back to 1973. The Library of Congress' offering of LOCIS, even back in 1993, demonstrates several trends and approaches to presenting and organizing government information on the Internet. LC data was available online even before LOCIS, but when LC opened up LOCIS to Internet users Internet user n → internauta m/f Internet user Internet n → internaute m/f , it went well beyond descriptive agency information or agency-specific databases. First of all, consider the LC catalog itself. Much of the data within the LC catalog had long been accessible online. Fee-based bibliographic utilities such as OCLC OCLC - Online Computer Library Center , RLIN RLIN Research Libraries Information Network , and WLN WLN Washington Library Network WLN Wiswesser Line Notation (notation used in organic chemistry) WLN Western Library Network WLN Little Naukati, Alaska (Airport Code) WLN Weapons Logistics Network offered LC cataloging records to subscribers. And all three utilities were Internet-accessible by 1993. More significant for the general Internet user was the Digital Research Associates' (DRA DRA Delta Regional Authority DRA Developmental Reading Assessment (educational test) DRA Division of Ratepayer Advocates (California) DRA Data Research Associates DRA Directory and Resource Administrator ) service, which was often referred to as the "LC catalog." DRA was a library automation vendor, and before the Library of Congress itself opened up LOCIS via the Internet, DRA provided telnet access (originally at dra.com and still available at lcmarc.dra.com/lcmarc) to the LCMARC bibliographic file and authority file (Rogers, 1992). Before 1993 was over, LC moved on from the telnet-based LOCIS to using the newer menu-driven gopher technology and introduced LC MARVEL (Library of Congress Machine-Assisted Realization of the Virtual Electronic Library). MARVEL prefigured much of the kind of information content that most government agencies put online even today. It included sections for information about LC and MARVEL, links to LOCIS, press releases, library hours, information on how to obtain an ISBN ISBN abbr. International Standard Book Number ISBN International Standard Book Number ISBN n abbr (= International Standard Book Number) → ISBN m or ISSN ISSN abbr. International Standard Serial Number , and congressional information ("Library of Congress Goes Online," 1993). The next year, LC announced plans to digitize To convert an image or signal into digital code by scanning, tracing on a graphics tablet or using an analog to digital conversion device. 3D objects can be digitized by a device with a mechanical arm that is moved onto all the corners. some of its collections, such as photographs, maps, pamphlets, and speeches. It then planned to make them accessible on the Internet and in particular wanted to promote education by making the collections accessible at schools and local libraries (DeLoughry, 1994). The great success of the American Memory American Memory is an Internet-based archive for public domain image resources, as well as audio, video, and archived Web content. It is published by the Library of Congress. The archive came into existence on October 13, 1994 after $13,000,000 was raised in donations. Project (http://memory.loc.gov) grew out of this laudable laud·a·ble adj. Healthy; favorable. goal. By the beginning of 1995, LC had also announced the launch of its new Congressional Web site, THOMAS, at thomas.loc.gov/ (Library of Congress, 1995). THOMAS brought the full text of legislation, the House calendar, summaries of floor proceedings, and additional resources to the growing Web-using public. This move from telnet to gopher to the Web was mirrored by many other government agencies. LC has come a long way since 1995, and its Web site is a gateway to THOMAS, American Memory, the LC Catalog, and much more. Yet this brief overview of its early history on the Internet highlights several trends seen elsewhere in online government resources including: that information is first put online by a nongovernment agency, the use of new technologies to disseminate information, and the digitization dig·i·tize tr.v. dig·i·tized, dig·i·tiz·ing, dig·i·tiz·es To put (data, for example) into digital form. dig of documents. PUSH FROM THE OUTSIDE Some of the other major government information resources were likewise first made available through the efforts and servers of nongovernmental entities before the government itself made the move. The Internet Town Hall, a nongovermnental Internet site, was an active proponent One who offers or proposes. A proponent is a person who comes forward with an a item or an idea. A proponent supports an issue or advocates a cause, such as a proponent of a will. PROPONENT, eccl. law. of making the Security and Exchange Commission's (SEC) Electronic Data Gathering, Analysis, and Retrieval Electronic Data Gathering, Analysis, and Retrieval See EDGAR (Electronic Data Gathering, Analysis, and Retrieval). (EDGAR Edgar or Eadgar (both: ĕd`gər), 943?–975, king of the English (959–75), son of Edmund, king of Wessex. In 957 the Mercians and Northumbrians rebelled against Edgar's brother Edwy and chose Edgar as their king. ) filings available for free to the public. And while the SEC was busy pushing companies to file via EDGAR, thus ensuring that the SEC would have the information in electronic format, the Internet Town Hall influenced the SEC to then make the filings available to the public (Notess, 1994). Eventually, the SEC developed its own system and process, so that it now makes all the EDGAR filings available on its Web site. Full-text U.S. patents are another well-known example. Several other nongovernmental companies also offered free access to full-text patents and related patent information (Santo Santo, New Hebrides: see Espíritu Santo. , 1995). Two private organizations, MicroPatent and Source Translation and Optimization, offered some very useful free search services with access to patent abstracts and even some full-text patents. The Internet Town Hall also offered an experimental Internet publication of patent information in 1994. It provided free access to the full-text of recent patents. Shortly before their experiment ended officially, the Patent and Trademark Office (PTO PTO abbr. 1. Parent Teacher Organization 2. or p.t.o. please turn over 3. power takeoff PTO or pto please turn over Noun 1. ) finally announced (September 26, 1995) that they would provide patent information on their Web site and that it would be available for free to the public. Yet Kaiser (1998) reports that the PTO was just announcing that the data would not be available until late in 1998 and early in 1999. GOVERNMENT PRINTING OFFICE As the LC example showed, government agencies have continued to use new technologies to disseminate, organize, and present their information. The Government Printing Office (GPO) went through a similar transformation. But it had another factor affecting its changes. The rise of the Internet and its incredible transformation in the 1990s from a technology experiment of limited interest to becoming one of the primary means of disseminating information occurred at the same time as a growing concern over government expenditures. The timing of the Internet's growth coincided with efforts to reduce the expense for the government of printing and disseminating publications and the expense of producing them. So the GPO and the Superintendent of Documents had great incentive to explore various options for transforming government publications into electronically disseminated documents. Indeed, GPO has been in the forefront of government agencies in effectively using the Internet as a dissemination medium and has made significantly more efforts at providing bibliographic control of its output than many other government agencies. The recently relaunched GPO Access (now at http://www.gpoaccess.gov) has been one of the major sources of government information on the Internet for a decade. It also provides several important tools for the organization of online government information from other agencies. GPO first began electronic dissemination by sending floppy disks and CD-ROMs to depository libraries in the 1980s. However, disks still share the same production and expense problems that print sources face. Multiple copies of each disk are produced and then sent to the depository libraries. Another approach from the 1980s was using an electronic bulletin board (BBS (1) (Bulletin Board System) A computer system used as an information source and forum for a particular interest group. They were widely used in the U.S. ) for dissemination. Data could be produced just once, placed online at the BBS, and then users would dial into the BBS via modem (and perhaps long-distance charges) to retrieve the data. Yet most BBS interfaces were not easy to use and retrieving specific data could get quite complex. The long-distance phone charge also discouraged most general interest use. With the rise in access to the Internet by the public and libraries, GPO then moved on to the Internet. Originating from the GPO Access Act, or more officially, the Government Printing Office Electronic Information Access Enhancement Act of 1993 (Public Law 103-40, 107 Stat. 112, June 8, 1993), GPO created publicly accessible and searchable access to major government publications like the Congressional Record A daily publication of the federal government that details the legislative proceedings of Congress. The Congressional Record began in 1873 and, in 1947, a feature called The Daily Digest was added to briefly highlight the daily legislative activities of each House, and the Federal Register. As Minahan (1994) notes, GPO had the documents up on the Internet by the following year. Searchable access relied on Wide Area Information Service (WAIS (Wide Area Information Server) A database on the Internet that contains indexes to documents that reside on the Internet. Using the Z39.50 query language, text files can be searched based on keywords. Information resources on the Internet are called "sources. ), a sophisticated and free full-text search A search that compares every word in a document, as opposed to searching an abstract or a set of keywords associated with the document. Word processors and text editors contain full-text search functions that let you find a word or phrase anywhere in the document. system with relevancy ranking developed by Brewster Kahle Brewster Kahle (pronounced 'kale', (IPA: /keɪl/) (b. 1960)[1] is a U.S. internet entrepreneur, activist and digital librarian. Kahle graduated from MIT in 1982 with an SB degree in Computer Science & Engineering where he was a member of the Chi Phi Fraternity. (1992) at Thinking Machines Corporation. Although the law allowed a charge for access (except at depository libraries), the GPO Access databases soon became freely available to all through GPO Access Gateway sites set up by depository libraries ("GPO Access," 1995), and then by December 1995 GPO decided to waive the fees for GPO Access (Gordon-Murnane, 1999). As of April 2003, GPO Access lists over ninety distinct databases, all accessible via the GPO Access site. Of those databases, several are particularly important in providing some level of bibliographic control over electronically published documents. The Catalog of U.S. Government Publications (http://www.gpoaccess.gov/cgp), the online successor to the venerable print Monthly Catalog of U.S. Government Publications, is a bibliographic catalog of print and electronic publications created by federal agencies from 1994 through the present. The records for online documents include links to the online full-text publications when possible, but the GPO also provides another database, New Electronic Titles (http://www.access.gpo.gov/su_docs/locators/ net), that focuses exclusively on Web-accessible federal government publications. Organized by month, New Electronic Titles actually does a specialized search in the Catalog of U.S. Government Publications for online documents. The continued cataloging by the GPO of online documents, especially those that no longer have a print version, means that there is better descriptive and subject information about the documents than there is for most Web pages. GPO also decided to use Permanent Uniform Resource Locators See URL. (World-Wide Web) Uniform Resource Locator - (URL, previously "Universal") A standard way of specifying the location of an object, typically a web page, on the Internet. Other types of object are described below. (PURLs) rather than the more standard URLs. The idea of PURLs is certainly worthy. URLs can and do change frequently. A document that used to be at http://agency.gov/ latestreport.html may soon be moved to http://agency.gov/archive/ crypticstringt.html. A GPO PURL (Persistent URL) A URL that points to another URL. PURLs are used when document pages are expected to be moved to different locations from time to time. The PURL is maintained as the official URL for that resource, and when that PURL is requested, a PURL server redirects the will look more like purl.access.gpo.gov/ GPO/LPS25. The PURL is then redirected to the appropriate URL URL in full Uniform Resource Locator Address of a resource on the Internet. The resource can be any type of file stored on a server, such as a Web page, a text file, a graphics file, or an application program. . The permanence Permanence law of the Medes and Persians Darius’s execution ordinance; an immutable law. [O.T.: Daniel 6:8–9] leopard’s spots there always, as evilness with evil men. [O.T.: Jeremiah 13:23; Br. Lit. is achieved by having a PURL resolver that always has the updated URL. For libraries, this greatly eases record maintenance work. Only the PURL resolver needs to be updated, not every single library that has the record in their catalog. On the negative side, PURLs do not provide the same level of information that a URL can. For example, the PURL for Prague, NATO NATO: see North Atlantic Treaty Organization. NATO in full North Atlantic Treaty Organization International military alliance created to defend western Europe against a possible Soviet invasion. , and European Security is http://purl.access.gpo.gov/GPO/LPS12869, which redirects to the URL of http://www.carlisle.army.mil/ssi/pubs/1996/ prague/prague.pdf. A close look at the URL shows that this is a 1996 publication, from an army site, and it is in PDF format, none of which is obvious from the PURL. In addition, even the PURLs sometimes fail to have a functioning URL in the resolver database. At least the full record in the Catalog of U.S. Government Publications usually includes the URL as well as the PURL. GOVERNMENTAL SITE ORGANIZATION Many agencies went from BBS to gopher to the Web but, once they arrived on the Web there was still plenty of development and a concern with how best to present and organize the information on their site or sites. The U.S. federal government had been closely involved with the creation of the Internet, so it was natural that the government would also want to be a savvy user of the network. As government agencies began to make systems available via telnet, FTP FTP in full file transfer protocol Internet protocol that allows a computer to send files to or receive files from another computer. Like many Internet resources, FTP works by means of a client-server architecture; the user runs client software to connect to , gopher, and eventually and most successfully on the Web, there was an obvious organizational structure To comply with Wikipedia's lead section guidelines, one should be written. already in place: the structure of the government Thus, the typical first move online for a government agency has been to set up an agency Web page arranged hierarchically just like the agency. In the mid-1990s government departments set up Web pages that were often organized to mirror their internal structure. It certainly made sense to those within the agency, who knew the structure, but Web designers soon realized that it confused most other users. Several alternative approaches have developed and, even though some Web sites are still primarily organized around the hierarchy of the agency, the government now has a wide diversity in the types of Web sites available. One of the first approaches was to build a site organized by groups of users. The Library of Congress site (http://www.loc.gov) even has a section like this now, with separate entry points for Researchers, Law Researchers, Librarians & Archivists, Teachers, Kids & Families, Publishers, Persons with Disabilities, Blind Persons, and Newcomers. NASA's site (http://www.nasa.gov) highlights four target audience groups: Kids, Students, Educators, and Media and Press. Another approach that started around 1998 was the construction of cross-agency, subject-specific sites. These subject-oriented gateways also had keyword-derived domain names rather than agency-related domains. For example, Healthfinder at http://www.healthfinder.gov was designed to assist consumers in finding government health information on the Internet. Recreation.gov offers information from all of the federal land management agencies that have recreational use on their lands. The U.S. Business Advisor (http://www.business.gov) aims to give businesses a central access point to government services, transactions, regulations, and opportunities. The FEDSTAT site (http//www.fedstats.gov) follows this approach. It tries to provide quick and easy access to the broad range of statistics offered by more than one hundred federal agencies. With topic links, statistics by geography, and a multi-Web site search function, it offers several access points to the statistics. Yet, with the huge number of statistical reports covered, it can still be difficult to identify exactly the most pertinent report without having some knowledge of the whole universe of government-produced statistics. FINDING GOVERNMENT INTERNET SOURCES The problem with the voluminous publications from the government and the corresponding voluminous number of Web sites is that, with so much information available, it can be quite difficult to know where to find an answer to a specific question. Fortunately, there are a number of finding aids and search tools that can make the task at least somewhat easier. Many search and retrieval systems focused on U.S. government information resources have come and gone. In the early era of the Web, directories of government Web sites and gopher servers were the best entryways into the online government resources because they provided some kind of hierarchical agency access. The government section of the well-known Yahoo! directory The Yahoo! Directory is a web directory which rivals the Open Directory Project in size. The directory was Yahoo!'s first offering. When Yahoo! changed to crawler-based listings for its main results in October 2002, the human-edited directory's significance dropped, but it is still was updated relatively frequently in the early years and was fairly comprehensive. It also included sections at the federal, state, local, and international government levels. While useful for finding agencies, it did not work very well at finding government information by subject. By 1997, the Federal Web Locator LOCATOR, civil law. He who leases or lets a thing to hire to another. His duties are, 1st. To deliver to the hirer the thing hired, that he may use it. 2d. To guaranty to the hirer the free enjoyment of it. 3d. from the Villanova Center for Information Law and Policy was one of the most frequently used general directories of U.S. federal government agency Web sites. It was arranged by agency hierarchies that roughly mirrored the arrangement of the U.S. Government Manual. Although it did have a basic keyword search ability, it was still primarily a directory by agency rather than by subject. It has moved around between various URLs but can now be found at http://www.infoctr.edu/fwl. Many other specialized directories have been created to help people find government Web sites. Almost all of them relied heavily on hierarchical agency access. At the same time, developments in search engines that would index all the words on Web pages and make them searchable were being developed and getting increasingly popular. On the government side of things, there were a few specialized search engines See vertical search engine. just for U.S. federal government sites. GovBot, from the Center for Intelligent Information Retrieval information retrieval Recovery of information, especially in a database stored in a computer. Two main approaches are matching words in the query against the database index (keyword searching) and traversing the database using hypertext or hypermedia links. (CIIR CIIR Catholic Institute for International Relations CIIR Center for Intelligent Information Retrieval CIIR counterintelligence information report (US DoD) CIIR Canadian International Information Resource ) at the University of Massachusetts The system includes UMass Amherst, UMass Boston, UMass Dartmouth (affiliated with Cape Cod Community College), UMass Lowell, and the UMass Medical School. It also has an online school called UMassOnline. , was one early example. GovBot used the Inquery software developed at the CIIR to search Web pages in the .gov and .mil domains. However, for some searches, general Web search engines A Web site that maintains an index and short summaries of billions of pages on the Web, Google being the world's largest. Most search engine sites are free and paid for by advertising banners, while others charge for the service. such as HotBot or AltaVista using a .gov or .mil domain limit could be more effective. GovBot lasted for several years but was eventually retired. Another interesting search engine solution was launched by Northern Light in the spring of 1999. USGOVSEARCH was the result of a partnership between the Web search engine See Web search engines. Northern Light and the Commerce Department's National Technical Information Service (NTIS NTIS - National Technical Information Service ). As Hane (1999) reports, USGOVSEARCH included over 20,000 U.S. government agency and military Web sites with almost 4 million pages, along with the 2 million abstracts from NTIS. Because the initial May 1999 announcement included access pricing details, the search engine met with several protests about the pricing structure. Yet, USGOVSEARCH offered some real advantages and features that differentiated it from other government-oriented search engines. It included the full NTIS abstracts database, which was available nowhere else on the Web for free. The advanced search included the subject limits derived from a Northern Light-created, government-oriented taxonomy taxonomy: see classification. taxonomy In biology, the classification of organisms into a hierarchy of groupings, from the general to the particular, that reflect evolutionary and usually morphological relationships: kingdom, phylum, class, order, . While it only listed the more general upper-level hierarchical terms, the more specific subject terms would be found in the folders within search results (Notess, 2000). Unfortunately, the deal with NTIS eventually ended. The NTIS database was removed. And eventually, as Northern Light itself was bought out and then abandoned by Divine, Inc., USGOVSEARCH went the way of GovBot. Within the government itself, several initiatives were underway to create a central, government-specific portal or search engine. GPO Access was one approach, but other agencies also tried developing their own. NTIS's FedWorld crossed agency boundaries. LC had its own directory. And then there was the WebGov initiative. Announced in September 1998, WebGov was supposed to be a central government-wide portal and was going to be up and running in thirty days. Due in part to fighting between agencies and in part to lack of funding, it never got off the ground (Brown, 2000). It was not until two years later, in September 2000, that FirstGov rose out of the ashes of WebGov as a live, viable site. As O'Leary (2001) describes it in an early review, the initial version did have its problems and inaccuracies but, for the first time, the government had its own portal and search engine. Certainly, one major reason for this was that Eric Brewer Eric Brewer may be:
FirstGov has become the most prominent government-sponsored, central access point for government information. And many other government sites link back to FirstGov. For a more detailed view of FirstGov, see Patricia Diamond Fletcher's article "Creating the Front Door to Government" in this issue of Library Trends. CONCERNS AND CHALLENGES One significant concern with the move away from multiple copies of print documents at many libraries to one electronic copy on an agency's Web site is that the loss of redundancy can easily lead to the permanent loss of the information content. If that one copy is removed, lost in a computer crash, or forgotten in a site upgrade, there is an increased potential that all future users will permanently lose access to it. In the same vein, if an online copy is changed, due to necessary corrections or to political leadership changes, there may not be an archived record of the original Web page. Without that, future historians, journalists, and others may not be able to identify what changes have been made and when they were made. Will a new administration in Washington try to purge Web pages from a previous administration? So far in the Internet age, we have only had the change from the Clinton administration Noun 1. Clinton administration - the executive under President Clinton executive - persons who administer the law to the Bush administration. And already there have been several efforts to remove older material. Davis (2002) reports on an internal memo in the Department of Education that called not only for the department to remove outdated pages but also to remove items that might not reflect the current administration's political philosophy. While that process is still ongoing and the eventual fate of all the old pages is yet to be determined, this is exactly the kind of situation that is likely to become more frequent as the Web ages and as administrations change. On another front, the Office of Management and Budget The Office of Management and Budget (OMB), formerly the Bureau of the Budget, is an agency of the federal government that evaluates, formulates, and coordinates management procedures and program objectives within and among departments and agencies of the Executive Branch. (OMB OMB abbr. Office of Management and Budget Noun 1. OMB - the executive agency that advises the President on the federal budget Office of Management and Budget ) proposed major changes to the requirement for government agencies to use GPO for printing publications (Procurement of Printing, 2002). Although the requirement to send publications to the Depository Library Program remains, great concerns have been raised about potential loss of many documents. Heifer HEIFER. A young cow, which has not had a calf. A beast of this kind two years and a half old, was held to be improperly described in the indictment as a cow. 2 East, P. C. 616; 1 Leach, 105. (2003) argues that the fugitive documents problem will be made worse by this weakening of the Federal Depository Library Program and GPO. And the requirements in the OMB proposal that the Superintendent of Documents would bear the costs that have been legislatively mandated to be borne by the agencies themselves certainly could exacerbate the problem of fugitive and lost documents. The nature of the Web makes it difficult for the government to keep many secrets because public information can be posted by almost anyone with a Web site. But if the information is never published or posted, that risk is averted. And even though most government agencies have moved toward putting many of the publications online, there are still other areas in which the government is reticent to publish certain documents or has actively removed them. With the greater concern about terrorism since the September 11, 2001, attack on the World Trade Center, many government sites have actually removed information that was formerly available on the Internet. OMB Watch OMB Watch is a 501(c)(3) nonprofit organization based in Washington, DC. OMB Watch was formed by Gary Bass in 1983 to lift the veil of secrecy shrouding the White House Office of Management and Budget (OMB). , another nongovernmental organization nongovernmental organization (NGO) Organization that is not part of any government. A key distinction is between not-for-profit groups and for-profit corporations; the vast majority of NGOs are not-for-profit. , has documented many such incidents on its Access to Government Information Post September 11th pages at http://www.ombwatch.org/article/archive/104/. Yet even before September 11th, the military had expressed concern. The Deputy Secretary of Defense (1998) issued a memorandum calling for a department-wide review of information vulnerability on the Web. "All DoD components that establish publicly accessible Web sites are responsible for ensuring that the information published on those sites does not compromise national security or place DoD personnel at risk" (p. 1). In another unusual situation completely separate from the Defense Department, the Department of the Interior was ordered to disconnect disconnect - SCSI reconnect its systems from the Internet by a federal judge on December 5, 1999, due to vulnerabilities in Indian trust-fund databases. This meant that many sites such as the National Park Service, the Land Management and Reclamation Bureau, the Fish and Wildlife Service, the Minerals Management Service, and the Surface Mining Office all suddenly had no information or only very abbreviated information on their sites (Dizzard, 2002). The various agencies slowly were able to get approval from the court to bring their sites back up but, as Lisagor (2002) notes, 6 percent of the department's systems were still disconnected as of November 2002. Unfortunately, no private organizations have mirrors of all the information that was on the vanished sites. Any information that was only accessible via an interactive database search is gone. But some of the more static pages have been available from services like the Internet Archive's Wayback Machine A Web site from the Internet Archive (www.archive.org) that records the content of most Web sites for each year of their existence since 1996. All of the pages in the site are generally included unless the site is password protected or is coded to explicitly refuse to be archived (see (http://www.archive.org) as long as a user knew the appropriate URL. CONCLUSION There is no doubt that the quantity and quality of government information on the Internet is a substantial resource for many kinds of users, from everyday U.S. citizens to advanced researchers of social trends. Government sites provide detailed data sets, satellite imagery Satellite imagery consists of photographs of Earth or other planets made from artificial satellites. History The first satellite photographs of Earth were made August 14, 1959 by the US satellite Explorer 6. , weather records and trends, tax forms, contractor opportunities, hazardous waste Hazardous waste Any solid, liquid, or gaseous waste materials that, if improperly managed or disposed of, may pose substantial hazards to human health and the environment. Every industrial country in the world has had problems with managing hazardous wastes. disposal pamphlets, elementary history lesson plans, consumer guides, proposed regulations, laws, court cases, speeches, testimonies, and so much more. Compared to so much else on the Web that is of widely varying quality and often more concerned with selling something than providing accurate content, government sites offer a great wealth of information. Despite some of the concerns and challenges with documents not being published or even removed, there is still a vast quantity of government information freely available on the Web that is of great importance for scholars and researchers. The desktop availability of the data in a wide variety of formats and from many different agencies is a boon for researchers who want access to data at their time of need rather than waiting for delivery of documents many days later. Finding the appropriate material can still be difficult, but search engines like FirstGov and directories and subject-oriented gateways are a great help. Through the efforts of government agencies like the Library of Congress, the Government Printing Office, and many others, the Internet public is fortunate to have a substantial body of valuable information content available at the desktop for free, and at any time of the day from all over the world. REFERENCES Brown, D. (2000). Feds take another run at governmentwide portal, Inter@ctive Week, 7, 10-11. Davis, M. R. (2002). No URL left behind? Web scrub raises concerns. Education Week, 22, 1, 26. DeLoughry, T. J. (1994). Library of Congress announces plan to digitize several of its collections. The Chronicle of Higher Education higher education Study beyond the level of secondary education. Institutions of higher education include not only colleges and universities but also professional schools in such fields as law, theology, medicine, business, music, and art. , 41, A41. Deputy Secretary of Defense. (1998). Information vulnerability and the World Wide Web [memorandum]. Washington, DC: Dept. of Defense. Retrieved April 14, 2003, from http://www.defenselink.mil/other_info/depsecweb.pdf. Dizard, W.P. (2002). Interior struggles to restore its Web presence. Government Computer News, 21, 10. Gordon-Murnane, L. (1999). The Federal Register free on GPO Access. Searcher, 7(6), 46. GPO Access--Free, kind of. (1995). Searcher 3(1), 32-33. Hane, P.J. (1999). Northern Light Technology's USGOVSEARCH begins commercial operation with revised pricing plan. Information Today, 16(7), 22. Helfer, D. S. (2003). The government battle over printing: OMB versus GPO and why it matters to libraries and the public. Searcher, 11, 53-55. Kahle, B., Morris, H., Goldman, J, Erickson, T., & Curran, J. (1992). Interfaces for distributed systems Distributed systems (computers) A distributed system consists of a collection of autonomous computers linked by a computer network and equipped with distributed system software. of information servers. Proceedings of the ASIS 1. ASIS - Application Software Installation Server. 2. (language) ASIS - Ada Semantic Interface Specification. Mid-year Meeting, 124-148. Kaiser, J. (1998). PTO to let sun shine on patents. Science, 281(3), 7. Library of Congress. (1995). New online public access to congressional information. [press release]. Retrieved April 14, 2003, from http://www.loc.gov/today/pr/1995/95-002. Library of Congress goes online. (1993). Link-Up, 10(5), 22. Lisagor, M. (2002). Still a disconnect at Interior. Federal Computer Week. Retrieved April 14, 2003, from http://www.fcw.com/fcw/articles/2002/1111/web-interior-11-11-02.asp. Lynch, C. A., & Preston, C. M. (1990). Internet access See how to access the Internet. to information resources. In M. E. Williams (ed.), Annual review of information science and technology (vol. 25, pp. 263-312). Amsterdam: Elsevier Science Publishers. Minahan, T. (1994). GPO puts Congressional Record, Federal Register on the Internet. Government Computer News, 13(12), 8. Notess, G. R. (1993). LC's debut on the Internet. Database, 16(5), 84-87. Notess, G. R. (1994). SEC EDGAR and the Internet town hall. Database, 17(4), 77-81. Notess, G. R. (2000). USGOVSEARCH: The federal Web, NTIS database, and more. Online, 24, 57-61. O'Leary, M. (2001). FirstGov arrives on fast track. Information Today, 18(1), 17-19. Procurement of printing and duplicating through the Government Printing Office (2002). Federal Register, 67(219), 68913-68918. Rogers, M. (1991). Data Research offers network to all libraries. Library Journal, 117(19), 22. Santo, B. (1995). Web proves patently useful. Electronic Engineering Times, 869, 126. Greg R. Notess, Reference Librarian and Professor, Montana State University Montana State University, at Bozeman; land-grant; coeducational; chartered 1893. It is primarily a technical institution specializing in agriculture, engineering, and applied sciences. The Museum of the Rockies is there. , Bozeman, P.O. Box 173320, MSU MSU Michigan State University MSU Mississippi State University MSU Montana State University MSU Minnesota State University MSU Morehead State University (Kentycky) MSU Montclair State University Library, Bozeman, MT 59717-3320 LIBRARY TRENDS, Vol. 52, No. 2, Fall 2003, pp. 256-257 GREG R. NOTESS is a Reference Librarian and Professor at Montana State University-Bozeman. He has been writing, speaking, and consulting about Internet information resources since 1991. A three-time Information Authorship award winner, he is the "On the Net" and "Internet Search Engine Update" columnist for ONLINE. Notess is the author of the first three editions of Government Information on the Internet and Internet Access Providers See ISP. (networking, company) Internet Access Provider - (IAP) A company or other origanisation which provides access to the Internet to businesses and/or consumers. : An International Resource Directory. An internationally known conference speaker on search engines and other Internet topics This page aims to list articles related to the Internet and Internet Protocol. This is so that those interested in the subject can monitor changes to the pages by clicking on Related changes in the sidebar and on the bottom of the page. , Notess has spoken at many national conferences and international meetings in Stockholm, London, Oslo, Montreal, Copenhagen, and Sydney, Australia. On the Web, Notess maintains SearchEngineShowdown.com, which reviews, compares, analyzes, and tracks news in the search engine industry. |
|
||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion