Printer Friendly
The Free Library
4,548,537 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Evaluating electronic texts in the humanities.


INTRODUCTION

Electronic texts have been used for scholarly research in the humanities for the past forty years or so ever since Roberto Busa Roberto Busa (born November 13, 1913) is an Italian Jesuit priest and one of the pioneers in the usage of computers for linguistic and literary analysis. He is the author of the Index Thomisticus, a complete lemmatization of the works of Saint Thomas Aquinas and of a few related  began work on his Index Thomisticus in 1949. However, it is only in the last three to four years, and particularly with the advent of the Internet, that humanities electronic texts have moved into the center of the scholarly arena as libraries begin to collect them and provide access to them. In the humanities, as in other disciplines, electronic textual resources offer many more possibilities than print, but, in general, libraries do not yet have any well-established practices for collecting and handling electronic texts as they have with print material. Shreeves (1992) discusses some of these questions from the perspective of the librarian, but there is a need also to look at what humanities scholars might want to do with the texts.

Electronic text is used here to mean primary source material in the humanities rather than journals and reference works. Such texts may be literary works (prose, verse, drama), historical papers, letters and memoranda, charters, papyri, inscriptions, and the like. The source material may be in any natural language and may be in print or manuscript form. The focus of this article is also on transcripts of text rather than digitized page or manuscript images. Images provide an exact reproduction of the original so that marginalia mar·gi·na·li·a  
pl.n.
Notes in the margin or margins of a book.



[New Latin, neuter pl. of Medieval Latin margin
, annotations, parallel texts, illustrations, and the like are readily available. They can be used for access and preservation but the text cannot be searched or otherwise manipulated. A transcript of a text allows many more novel possibilities for research and teaching and exploits more fully the capabilities of electronic materials. In the future, a combination of image and text may well form the basis of the electronic library, where it will be possible to search the text and retrieve the image.

THE PRESENT SITUATION

The picture in the early 1990s is one of many humanities texts in many different places and in many different formats. The Georgetown Catalog of Projects in Electronic Texts lists over 300 institutions which hold electronic texts but not the texts themselves. From sources such as The Humanities Computing Yearbook 1989-90: A comprehensive guide to software and other resources (Lancashire, 1991), journals, and proceedings of annual conferences on humanities computing, the number of existing electronic texts in the humanities can be estimated at many thousands. The Internet gives access to a fraction of these, and the existence of most of the others is only known from articles which describe their use in specific projects.

Most of these texts are held by individuals or by research institutes (mainly in Europe) which have compiled them for their own research purposes. Examples include the Istituto di Linguistica Computazionale in Pisa, and the Institut fur Deutsche Sprache, Mannheim. For a variety of reasons, most of the collections of these institutes are not available for others to use. The few exceptions include many of the texts which were compiled for the Tresor de la Langue langue  
n.
Language viewed as a system including vocabulary, grammar, and pronunciation of a particular community.



[French, from Old French; see language.]
 Frangaise at Nancy, which are now available from ARTFL ARTFL American and French Research on the Treasury of the French Language (University of Chicago)  (American Research on the Treasury of the French Language) in Chicago. The texts compiled for the Responsa Responsa (Latin: plural of responsum, "answers") comprise a body of written decisions and rulings given by legal scholars in response to questions addressed to them.  Project at Bar-Ilan University Bar-Ilan University (BIU, אוניברסיטת בר-אילן) is a university in Ramat Gan, Israel. Established in 1955, Bar Ilan is now Israel's second largest academic institution.  are now available on CD-ROM CD-ROM: see compact disc.
CD-ROM
 in full compact disc read-only memory

Type of computer storage medium that is read optically (e.g., by a laser).
 as the Global Jewish Database, and the collection of Early Christian Latin at Louvain-la-Neuve has now been published as the CETEDOC CD-ROM.

The Thesaurus Linguae Graecae For other uses, see TLG (disambiguation).

The Thesaurus Linguae Graecae (TLG) is a research center at the University of California, Irvine. Founded in 1972, under director Dr.
 (TLG TLG Thesaurus Linguae Graecae (project, UC Irvine, California)
TLG The LEGO Group
TLG The Lone Gunmen
TLG The Lost Guys (Counter-Strike Clan)
TLG Technical Liaison Group
) was the earliest systematic attempt to create electronic versions of the complete literature of one language (Ancient Greek Noun 1. Ancient Greek - the Greek language prior to the Roman Empire
Greek, Hellenic, Hellenic language - the Hellenic branch of the Indo-European family of languages
) and its 60 million word task is now almost finished after twenty years TWENTY YEARS. The lapse of twenty years raises a presumption of certain facts, and after such a time, the party against whom the presumption has been raised, will be required to prove a negative to establish his rights.
     2.
 of work. The Packard Humanities Institute (PHI phi
n.
Symbol The 21st letter of the Greek alphabet.


PHI,
n See health information, protected.
) has completed a complementary collection of Classical Latin Noun 1. classical Latin - the language of educated people in ancient Rome; "Latin is a language as dead as dead can be. It killed the ancient Romans--and now it's killing me"
Latin - any dialect of the language of ancient Rome
 which is about 8 million words. Both of these are distributed on CD-ROM.

The largest general purpose collection of electronic texts is the Oxford Text Archive Oxford Text Archive (OTA) is an archive of electronic texts and other literary and language resources which have been created, collected and distributed for the purpose of research into literary and linguistic topics.  (OTA (Over The Air) Refers to any wireless system such as AM/FM radio and network television that uses open space as its transmission medium. ), which was established at Oxford University Computing Services in 1976 in order to prevent texts from becoming "lost" once their compilers had finished with them. The bulk of its collection comes from donations from individual scholars. It is committed to maintaining any text which is deposited in it but does not actively pursue material to be added or correct errors within the texts. It now has some 1,200 texts in about thirty languages and makes these available at nominal cost provided that the compiler has given the appropriate permissions. Little information is known about the source of some OTA texts, and the OTA takes no responsibility for the accuracy of the texts. Some texts are available by FTP FTP
 in full file transfer protocol

Internet protocol that allows a computer to send files to or receive files from another computer. Like many Internet resources, FTP works by means of a client-server architecture; the user runs client software to connect to
 from <black.ox.ac.uk>.

It is estimated that about 95 percent of existing texts are plain text files--that is, ASCII files A file that contains data made up of ASCII characters. It is essentially raw text just like the words you are reading now. Each byte in the file contains one character that conforms to the standard ASCII code (see ASCII chart).  which are not indexed for any specific software. Those who use them must acquire or develop suitable software programs, depending on the nature of their application. Various software programs for humanities electronic texts are in widespread use, notably the Oxford Concordance concordance /con·cor·dance/ (-kord´ins) in genetics, the occurrence of a given trait in both members of a twin pair.concor´dant

con·cor·dance
n.
 Program (OCP (processor) OCP - Order Code Processor. ), Micro-OCP (a PC version); TACT and WordCruncher (interactive text retrieval programs) which all provide some basic facilities as well as more sophisticated tools tailored to the specific needs of the humanities.

The remaining 5 percent of texts are what can be called packaged products, where the text has been indexed for use with specific, of ten proprietary, software and cannot be used for any other purpose. Most of these products, at present, are on CD-ROM (e.g., the CETEDOC library of Christian Latin Texts, the WordCruncher Disc of American and English literature English literature, literature written in English since c.1450 by the inhabitants of the British Isles; it was during the 15th cent. that the English language acquired much of its modern form. , the New Oxford English Dictionary Oxford English Dictionary

(OED) great multi-volume historical dictionary of English. [Br. Hist.: Caught in the Web of Words]

See : Lexicography
 on CD-ROM, and the Global Jewish Database). Libraries which provide these packaged resources will generally find that the support costs are not insignificant. Almost all use their own query language A generalized language that allows a user to select records from a database. It uses a command language, menu-driven method or a query by example (QBE) format for expressing the matching condition.  which in most cases is not intuitive. It takes some time to gain a good understanding of the full potential of many of them. They provide complex search facilities because the texts themselves are complex and scholars want to study them in many different ways. However, most of these products do have manuals which document the source of the text as well as how to use the programs, which is more than can be said for some texts.

The Internet gives access to ARTFL and to the Dartmouth Dante Project (DDP (Distributed Data Processing) See distributed processing.

DDP - Distributed Data Processing
), which includes the text of The Divine Comedy Divine Comedy: see Dante Alighieri.

Divine Comedy

Dante’s epic poem in three sections: Inferno, Purgatorio, and Paradiso. [Ital. Lit.: Divine Comedy]

See : Epic
 and major commentaries. ARTFL uses software developed by its own team, the second version of which is based on UNIX utilities This is a list of UNIX utilities as specified by IEEE Std 1003.1-2004, which is part of the Single UNIX Specification (SUS). These utilities can be found on UNIX Operating systems and most UNIX-like operating systems. List
IEEE Std 1003.
 and is not particularly easy to use for those not familiar with UNIX UNIX

Operating system for digital computers, developed by Ken Thompson of Bell Laboratories in 1969. It was initially designed for a single user (the name was a pun on the earlier operating system Multics).
. The DDP uses BRS-Search with a user-friendly interface. It can perform flexible searches, but scholars who use it extensively will begin to see the limitations of applying a commercial text-retrieval system, which is document-oriented, to complex humanities texts, where it is not clear what constitutes a single document.

At present, it is rare for several different electronic versions of a work to exist. The Bible and Shakespeare are the exceptions. Comparisons of these can help in establishing design principles for better electronic texts. Bolton (1990) reviews three electronic versions of Shakespeare and tools to access them and gives a detailed evaluation from the perspective of a scholar in English studies English studies is an academic discipline that includes the study of literatures written in the English language (including literatures from the U.K., U.S., Ireland, Canada, Australia, New Zealand, Hong Kong, the Philippines, India, South Africa, and the Middle East, among other . This essay highlights the relevance of complex tools for what are complex texts and the need to provide good documentation for them. Most of the current electronic versions of the Bible seem to be intended more for the popular market and only one or two would be really suitable for scholars and students in religious studies. As yet, there are few comparative reviews similar to Bolton's, but soon there will be more versions of electronic texts to choose from and more evaluations are sorely needed. Likely candidates might include a comparison of the texts of J. P. Migne's Patrologia Latina The Patrologia Latina is an enormous collection of the writings of the Church Fathers and other ecclesiastical writers published by Jacques-Paul Migne between 1844 and 1855, with indices published between 1862 and 1865.  published by Chadwyck-Healey.

Given the present situation, how can a library evaluate electronic texts now? What makes a good electronic text that a research library would want to acquire or access? The popular market for electronic texts is growing fast. We are already witnessing different collections of electronic texts which are intended for popular, rather than scholarly, consumption. How can a research library ensure that a text is suitable for its collection? What do libraries need to know to make decisions on what to collect and how to provide access to their collections?

Some very basic questions which should be asked include: How are patrons going to use the text? Is it a text which requires software from elsewhere, or is it a complete package? If it requires software, what is the best program? What facilities does it provide? Are these facilities suitable for scholarly applications? What source text was used? What guarantee is there that the text is accurate? What markup (text) markup - In computerised document preparation, a method of adding information to the text indicating the logical components of a document, or instructions for layout of the text on the page or other information which can be interpreted by some automatic system.  scheme does it use? How will the electronic text be supported in the library? What can the text do for patrons that print materials cannot do? Some understanding of what is involved in creating an electronic text and of basic techniques and applications in literary computing is helpful in order to begin to answer these questions.

CREATING ELECTRONIC TEXTS

Most existing texts have been created by keyboarding in one way or another. Ones which were created many years ago, including the TLG as it is stored on the CD-ROM, are entirely in uppercase letters. Some of these have been converted to upper- and lowercase by software and thus often do not begin sentences with uppercase letters. Much keyboarding has been done by individuals who needed to create an electronic text for their own research purposes. They may have consciously or unconsciously edited the text to suit their own needs and understanding and have possibly not documented these changes. They may also genuinely have made mistakes which went unnoticed in proofreading Proofreading traditionally means reading a proof copy of a text in order to detect and correct any errors. Modern proofreading often requires reading copy at earlier stages as well. . If the text has been keyboarded professionally, it may be less likely to contain mistakes, as in the case of the TLG where very few errors exist. However, if significant cost has been incurred in creating the text, it is perhaps less likely that the text is widely available for others to use. At present, the texts which are most widely available are often those that have been created by individuals. Experience has shown that accuracy must not be assumed.

Optical character reading (OCR OCR
 in full optical character recognition

Scanning and comparison technique intended to identify printed text or numerical data. It avoids the need to retype already printed material for data entry.
) has been used to input a variety of humanities texts. The capability of OCR software has improved somewhat, but OCR is not yet able to handle early printed books, manuscripts, some types of newspapers, and other material printed on poor quality paper. A text which has been input by OCR will need thorough proofreading even if the initial scanning appears to be very good. Claims of accuracy rates of 99.9 percent in effect mean two or three errors per page, which is far more than one would expect to find in a printed book. Many existing texts which have been input via OCR have not been proofread well. Typical errors include confusion of e and c, h, and b and the number 1 and letter 1, as well as words run together or spaces inserted in the middle of words. Extraneous ex·tra·ne·ous  
adj.
1. Not constituting a vital element or part.

2. Inessential or unrelated to the topic or matter at hand; irrelevant. See Synonyms at irrelevant.

3.
 matter on the page--such as blotches on a photocopy--will be read as apostrophes or commas. But, even if the letters have been recognized accurately, it is becoming increasingly clear that optical scanning yields only part of what is needed to create a useful electronic text. It gives a physical representation of the text, which can be ambiguous without additional information. For example, a word which is in italic could be a foreign word, part of a title, or an emphasized word. These distinctions need to be made for a retrieval program to be useful, but they can only be done by adding information to a text after it has been scanned. When a text is keyboarded, this information can be embedded Inserted into. See embedded system.  in the text at the time of capture in the form of encoding See encode.  or markup tags.

As with print materials, the choice of source edition is important for the academic acceptability of a text. Many existing texts have been created from out-of-copyright editions because their compilers have not been able to obtain copyright permissions for newer editions or have not wanted to ask permission for fear of becoming embroiled em·broil  
tr.v. em·broiled, em·broil·ing, em·broils
1. To involve in argument, contention, or hostile actions: "Avoid . . .
 in legal issues. It is often the case that more recent editions have greater scholarly value and would be more appropriate for research use. It is to be hoped that these copyright issues will be tackled and resolved in the future rather than being avoided, which seems to be the case at present. Another factor which has determined the choice of source edition is its suitability for scanning. Again, the text which can be read best on the scanner may not necessarily be the edition with the best scholarly value. Good intentions to edit the text so that it conforms to a better edition are sometimes not carried out. When shown electronic texts, scholars who are skeptical about their value often voice their concern by criticizing the choice of source edition, which is, after all, something they understand from traditional scholarship. The lack of good scholarly texts in electronic form has seriously hampered the development and acceptability of full-text applications in humanities research and teaching. This situation is not being helped by various projects which use the Internet to announce, and make freely available, texts which do not appear to have any particular scholarly value.

USES OF ELECTRONIC TEXTS IN THE HUMANITIES

Certain methodologies and techniques for literary and linguistic computing have been well understood for some time. Hockey (1980) gives an overview of applications, many of which are still current. Butler's (1992) collection of essays is also a useful source. The journals Literary and Linguistic Computing and Computers and the Humanities and the proceedings of various literary and linguistic and humanities computing conferences also give some background (Miall, 1990; Hockey & Ide, 1991).

Concordances concordances,
n.pl 1. items that are in harmony.
2. homeopathic medicines with affinity to one another and therefore can be used serially during the sequence of treating an illness. This interaction was initially noted by Boenninghausen.
 and text retrieval have formed the major application areas in literary and linguistic computing. A concordance is an alphabetical list of words which shows all the instances of a particular form, allowing the scholar to examine them in fine detail. Text retrieval gives instant access to occurrences. The first and most obvious application of these is as a reference tool. In the humanities, questions such as, Does this word ever occur in this text? are as common as, Find a text about this topic. For the former, the text must be absolutely accurate otherwise the user cannot be sure whether the word exists or not.

Concordances have been used as a basis for stylistic analyses and even for studies of disputed authorship. It has been shown that the style of an author, or even a genre, can be characterized by the use of function words, that is, words which authors share in common with their contemporaries. In their study of the Federalist Papers Federalist papers
 formally The Federalist

Eighty-five essays on the proposed Constitution of the United States and the nature of republican government, published in 1787–88 by Alexander Hamilton, James Madison, and John Jay in an effort to persuade
, Mosteller and Wallace (1964) showed that the use of words such as "whilst" or "while," "enough," and "upon" in the disputed papers followed that of Machson rather than Hamilton. Burrows (1987) used a concordance program and some simple statistics to show that the thirty most common words in the novels of Jane Austen can distinguish the "idiolects" of the different characters in her novels. Kenny's (1978) work on the Aristotelian Ethics Aristotle believed that ethical knowledge is not certain knowledge (like metaphysics and epistemology) but is general knowledge. Because it is not a theoretical discipline, he thought a person had to study in order to become "good.  is another classic example of traditional literary and linguistic computing techniques where a study of particles and other common words in Greek shows that the three books which appear in both the Nicomachean Ethics Nicomachean Ethics (sometimes spelled 'Nichomachean'), or Ta Ethika, is a work by Aristotle on virtue and moral character which plays a prominent role in defining Aristotelian ethics.  and Eudemian Ethics The Eudemian Ethics (sometimes abbreviated EE in scholarly works) is a work of philosophy by Aristotle. Its primary focus is on Ethics. It is named for Eudemus of Rhodes, a pupil of Aristotle who may also have had a hand in editing the final work.  of Aristotle are more like the Eudemian Ethics. There are many other similar studies, all of which are based on common words which therefore need to be indexed.

Other computer-aided research has concentrated on the production of new critical editions in print form and now also in electronic form. Collation COLLATION, descents. A term used in the laws of Louisiana. Collation -of goods is the supposed or real return to the mass of the succession, which an heir makes of the property he received in advance of his share or otherwise, in order that such property may be divided, together with the , concordance, and statistical tools can help the scholar establish the text and provide information for the commentary and other annotations (Robinson, in press). Other studies have included programs to analyze sound patterns and correlate these with the sense as, for example, in the Divine Comedy (Robey, 1987) and in Homer (Packard, 1974). Most kinds of research, which are based on lexical analysis (programming) lexical analysis - (Or "linear analysis", "scanning") The first stage of processing a language. The stream of characters making up the source program or other input is read one at a time and grouped into lexemes (or "tokens") - word-like pieces such as keywords, , are suitable for computational techniques, provided that it is understood that the text is viewed as a sequence of graphic forms. Programs for automatic lemmatization lem`ma`tiz`a´tion   

v. t. 1. The act or process of lemmatizing; conversion into a lemma .
 (putting words under their dictionary headings), syntax, and morphological analysis For other senses of this word see morphology.
Morphological analysis or General Morphological Analysis is a method developed by Fritz Zwicky (1967, 1969) for exploring all the possible solutions to a multi-dimensional, non-quantified problem complex.
 are not yet widely available, and, in any case, those that do exist are never completely accurate and require manual verification of the results.

Hypertext applications have also become popular in the humanities (Delany & Landow, 1991). Images and sound can be linked to texts. More importantly, hypertext does not require the text and ancillary material to be constrained into a rigid structure such as a relational database relational database

Database in which all data are represented in tabular form. The description of a particular entity is provided by the set of its attribute values, stored as one row or record of the table, called a tuple.
. The data can be as flexible and extensible as needed as needed prn. See prn order. , thus allowing the scholar to add more information or reorganize re·or·gan·ize  
v. re·or·gan·ized, re·or·gan·iz·ing, re·or·gan·iz·es

v.tr.
To organize again or anew.

v.intr.
To undergo or effect changes in organization.
 existing material as he or she learns more from working with an electronic version of it. The best known humanities hypertext is Perseus, which was developed by a consortium of institutions based at Harvard (Mylonas, 1992). Perseus goes far beyond the text. It is a multimedia encyclopedia of Ancient Greek literature Ancient Greek literature refers to literature written in the Greek language until the 4th century AD. Classical and Pre-Classical Antiquity
This period of Greek literature stretches from Homer until the 4th century BC and the rise of Alexander the Great.
, archaeology, geography, history, and culture. Besides the works of major authors in Greek and English, it contains photographs of vases, sculptures, coins, buildings, and archaeological sites as well as an encyclopedia, historical overview, and Greek/English dictionary. Although Perseus is currently available on CD-ROM, the Perseus team sees the network as the future means of accessing the database and have designed it so that the individual components can easily be imported into other systems.

Retrieval and other applications on humanities texts can be complex simply because of the nature of the texts and the fine detail in which they are normally studied. A text may contain several different natural languages, some of which may be in different scripts or use different alphabets for sorting words. Examples include parallel texts of the Bible or Middle English Middle English

Vernacular spoken and written in England c. 1100–1500, the descendant of Old English and the ancestor of Modern English. It can be divided into three periods: Early, Central, and Late.
 texts which contain sections in Latin with citations of Greek or Hebrew words. Users of the texts must be able to identify which sections are in each language and to index them separately. Variant readings or spellings may be indexed. Quotations from other texts may need separate treatment. Punctuation is important in early printed books and may also need to be searchable. Studies of morphology in inflected in·flect  
v. in·flect·ed, in·flect·ing, in·flects

v.tr.
1. To alter (the voice) in tone or pitch; modulate.

2. Grammar To alter (a word) by inflection.

3.
 languages or of rhyme can benefit from a reverse index where words are alphabetized al·pha·bet·ize  
tr.v. al·pha·bet·ized, al·pha·bet·iz·ing, al·pha·bet·iz·es
1. To arrange in alphabetical order.

2. To supply with an alphabet.
 according to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 their endings.

The canonical referencing scheme or logical structure of many humanities texts is complex, yet it needs to be represented in an electronic version. Depending on the type of literature, there are many different subdivisions of verse texts (stanzas, verses, books, quatrains, and so on). In simple terms, a play is divided into acts which are themselves divided into scenes and speeches. It also has a cast list and stage directions. However, a play may also have another referencing scheme which is based on pages within a printed edition. Line numbering may be in relation to the pages or sequential throughout the text. Printed editions of early manuscripts may also have two parallel referencing schemes--pages and lines in the print version as well as folios and lines in the original. All of these should be accessible to the scholar working on the text and therefore need to be identified or encoded within the text. An overview of some of these issues and the need for encoding to handle them is in Sperberg-McQueen (1991).

MARKUP

Markup or encoding makes explicit for computer processing those features of a text which are implicit for the reader. A text without markup is like a bibliographic record which is not divided into fields. Markup is needed to identify the different elements of the referencing scheme as well as to distinguish among features which would otherwise be ambiguous and to encode (1) To assign a code to represent data, such as a parts code. Contrast with decode.

(2) To convert from one format or signal to another. See codec and D/A converter.

(3) The term is sometimes erroneously used for "encrypt.
 features of interest. The period (full stop) is also used in abbreviations or as a decimal point (character) decimal point - "." ASCII character 46. Common names are: point; dot; ITU-T, USA: period; ITU-T: decimal point. Rare: radix point; UK: full stop; INTERCAL: spot.  in numbers in numbered parts; as, a book published in numbers.

See also: Number
. Some programs delimit de·lim·it   also de·lim·i·tate
tr.v. de·lim·it·ed also de·lim·i·tat·ed, de·lim·it·ing also de·lim·i·tat·ing, de·lim·its also de·lim·i·tates
To establish the limits or boundaries of; demarcate.
 concordance citations by orthographic or·tho·graph·ic   also or·tho·graph·i·cal
adj.
1. Of or relating to orthography.

2. Spelled correctly.

3. Mathematics Having perpendicular lines.
 sentences--i.e., by all the text up to a period--and so, without additional markup, abbreviations and decimal points would be erroneously considered as sentence boundaries. Quotations from another author or text need to be identified by their source. For studies such as the Burrows's work on Jane Austen cited earlier, markup is needed to separate the dialogue from the narrative in the text and to identify the speaker for each section of dialogue. In a play, markup could encode the change of speakers and stage directions as well as the logical structure. For further discussions on markup and scholarly text processing, see Coombs Coombs can refer to:
  • Coombs test, a test for the presence of antibodies or antigens
  • Coombs reagent, the reagent used in the Coombs test
  • Coombs' method, a type of voting designed by the psychologist Clyde Coombs
 et al. (1987) and Renear et al. (1992).

A text without markup can only be used for very simple applications. One analogy is trying to perform functions such as sorting and searching on a bibliographic record which does not have field divisions. For textual analysis, this amounts to making a simple alphabetical list of words, counting the word frequencies, and performing very simple searches. None of these will be completely accurate for detailed analyses. A look at various versions of Shakespeare which are available over the Internet will immediately show the problems. Act and scene numbers are not marked up in any way and so will lead to word counts which include all the incidences of the word "Act" as the act number within those of "act" as a verb or noun used in the normal way. Roman numerals Roman numerals

System of representing numbers devised by the ancient Romans. The numbers are formed by combinations of the symbols I, V, X, L, C, D, and M, standing, respectively, for 1, 5, 10, 50, 100, 500, and 1,000 in the Hindu-Arabic numeral system.
 used as act and scene numbers are more problematic. Act I will be assumed to be an occurrence of the personal pronoun personal pronoun
n.
A pronoun designating the person speaking (I, me, we, us), the person spoken to (you), or the person or thing spoken about (he, she, it, they, him, her, them).
 I. Even the Wordcruncher CD-ROM suffers from this problem. The simple searches will retrieve one or more surrounding lines of context. With a prose text, the reader may want to reformat (1) To change the record layout of a file or database.

(2) To initialize a disk over again.
 the lines as on a word processor when the margins are changed. In verse, the lineation is fixed and must not be reformatted. When a text is entirely in verse, one can allow for this, but texts which are mixed verse and prose need to have markup to show the difference. Words which are not in the main language of the text also need to be encoded so that they can be distinguished. Examples include English "vale" and Latin "vale" (farewell), English "pain" and French "pain" (bread).

Many different markup schemes have been developed for humanities electronic texts over the last forty years. Of these, the most notable are COCOA and its variants. COCOA was first devised for an Archive of Old Scots Texts in Edinburgh in the early 1960s (Aitken & Bratley, 1967) and is described fully in the Micro-OCP manual. It provides a way of encoding the canonical referencing structure of a text including parallel referencing schemes and can also be used for other features such as stage directions, editorial comment, and so on. It is used by most of the major text-analysis programs in current use in the humanities, notably the Oxford Concordance Program (OCP) and (in extended form) TACT. The Thesaurus Linguae Graecae developed its own markup scheme, called beta code For the software release stage, see .

Beta Code is a method of representing, using only ASCII characters, characters and formatting found in ancient Greek texts (and other archaic languages).
, which has also been used by other projects in classics and religious studies. The retrieval program WordCruncher also has its own markup scheme. Many existing humanities electronic texts are encoded for use by these programs.

Typographic See typography.  markup is also needed to print or display a text so that it is more easily readable. Even simple word processing word processing, use of a computer program or a dedicated hardware and software package to write, edit, format, and print a document. Text is most commonly entered using a keyboard similar to a typewriter's, although handwritten input (see pen-based computer) and  programs include features such as italic, bold, and so on to highlight sections of a text to draw the reader's eye to them. A parallel set of markup schemes was thus developed for printing and formatting, most notably TeX, TROFF (Typesetting RunOFF) A Unix utility that formats documents for typesetters and laser printers. Using a text editor, troff codes are embedded into the text and the troff command converts the document into the required output. See nroff and TeX. , and later various word processors, such as WordPerfect, where the markup is exposed by the Reveal Codes function.

The result of this plethora of markup schemes has been described as chaos (Burnard, 1988). By the mid-1980s, experience showed clearly that markup is essential for good quality texts, but no scheme had wide acceptance. Each scheme was designed for a specific project or application. Most schemes were poorly documented and had no provision for extension or were not otherwise sufficiently flexible. Much time was wasted on converting from one format to another. None of the existing markup schemes was suitable for adoption as a standard.

In 1986, the Standard Generalized Markup Language (language, text) Standard Generalized Markup Language - (SGML) A generic markup language for representing documents. SGML is an International Standard that describes the relationship between a document's content and its structure.  (SGML SGML
 in full Standard Generalized Markup Language

Markup language for organizing and tagging elements of a document, including headings, paragraphs, tables, and graphics.
) became an international standard (van Herwijnen, 1990). SGML is not, in itself, an encoding scheme. It provides a syntactic framework within which descriptive information about an electronic text can be encoded. The principle of SGML is descriptive, not prescriptive--that is, it describes the structure of a text. It enables the word which is seen to be in italic to be described as part of a title, or a foreign word, or an emphasized word, or whatever the encoder wishes. At a very basic level, SGML views a text as being a collection of objects called elements. These may be chapters, pages, words, lines, stanzas, or whatever the user wishes. The set of elements for a particular text or group of texts and the relationship among them is defined in a document type definition (DTD (Document Type Definition) A language that describes the contents of an SGML document. The DTD is also used with XML, and the DTD definitions may be embedded within an XML document or in a separate file. ). The DTD has a formal structure. It can be read by a computer program called an SGML parser A routine that analyzes a continuous flow of text-based input and breaks it into its constituent parts. See parse.

(language) parser - An algorithm or program to determine the syntactic structure of a sentence or string of symbols in some language.
 which validates the markup in a text or by other SGML-based software which operates on the text. SGML provides a method of encoding which addresses many of the intellectual issues which previously used encoding schemes have not. A further advantage is that it also provides links to material which is not ASCII ASCII or American Standard Code for Information Interchange, a set of codes used to represent letters, numbers, a few symbols, and control characters. Originally designed for teletype operations, it has found wide application in computers.  text--e.g., sound and images--which are likely to become increasingly important. Its one disadvantage is that it views a document as a single hierarchic structure and has no easy way of dealing with the multiple parallel referencing schemes which appear in many humanities texts.

Sets of encoding or markup tags which conform to Verb 1. conform to - satisfy a condition or restriction; "Does this paper meet the requirements for the degree?"
fit, meet

coordinate - be co-ordinated; "These activities coordinate well"
 the SGML syntax are known as SGML applications. When a text is said to be in SGML, it is important to know which SGML application and to have access to the DTD. True SGML must conform to a DTD. There are many electronic texts now in existence which claim to be SGML which do not appear to have DTDs. Others, most notably the New Oxford English Dictionary, are described as SGML-like and may not necessarily be processable by all SGML software. In some cases this may not be a problem now, but it may become so in the future as SGML becomes more widely used.

The need for standardization of markup in the humanities led to the establishment of the Text Encoding Initiative (text, project, standard) Text Encoding Initiative - (TEI) A project working to establish a standard for interchanging electronic text for scholarly research. The TEI has adopted SGML and implemented the TEI standard as an SGML Document Type Definition.  (TEI 1. (communications) TEI - Terminal Endpoint Identifier.
2. (text, project) TEI - Text Encoding Initiative.
) in 1987. The TEI is sponsored by the three major text analysis computing organizations: the Association for Literary and Linguistic Computing, the Association for Computers and the Humanities, and the Association for Computational Linguistics (body) Association for Computational Linguistics - (ACL) The international scientific and professional society for people working on problems involving natural language and computation. . It has become a major international project with funding of over $1 million from North America North America, third largest continent (1990 est. pop. 365,000,000), c.9,400,000 sq mi (24,346,000 sq km), the northern of the two continents of the Western Hemisphere.  and from the Commission of the European Communities beginning in 1988. Its objectives are to define a common encoding format for electronic texts and to provide guidelines for the interchange of electronic texts. Further information about the TEI project is available from the fileserver of the listserv <tei-l[at]uicvm>.

The TEI immediately made a commitment to SGML and set up four main committees to deal with different aspects of encoding electronic texts. The documentation committee defined a method of documenting electronic texts which is stored within the text as a header. This is described in more detail in the next section. The text representation committee first looked at ways of encoding the physical description and logical structure of text and identified the components and core features of basic text types. It then set up a number of work groups to look in more detail at specific areas and text types. These included character sets, hypermedia hypermedia: see hypertext.


The use of hyperlinks, regular text, graphics, audio and video to provide an interactive, multimedia presentation. All the various elements are linked, enabling the user to move from one to another.
, textual criticism textual criticism
n.
1. The study of manuscripts or printings to determine the original or most authoritative form of a text, especially of a piece of literature.

2.
, language corpora corpora

plural form of corpus.


corpora albicantia
see corpus albicans.

corpora arenacea
sandy or gritty bodies, found in the pineal body; appear to be of glial or stromal origin; have the structure of
, formulae and tables, verse, performance texts, and literary prose. A third committee on analysis and interpretation first devised general purpose mechanisms for encoding linguistic and other analytic interpretations which are comprehensive enough to allow several different interpretations to be placed on a word or section of text. It then set up work groups to look at electronic dictionaries, spoken texts, and terminological data as well as the interpretation of historical material and further linguistic analysis. A fourth committee defined how best the TEI might use SGML. It prepared a kind of "house-style" for the TEI's use of SGML and proposed methods for dealing with multiple hierarchies.

The TEI Guidelines have been developed following a set of principles established at the planning meeting in 1987. The guidelines are intended for text in any kind of written or spoken language. They are intended for both scholars and librarians. The guidelines give recommendations both on what features to encode and how to encode them. The features discussed in the guidelines include both those which are explicitly marked and those which are the result of analyzing and interpreting the text. Although the TEI Guidelines include some 400 different encoding tags, very few indeed are absolutely required. The basic philosophy is "if you want to encode this feature, do it this way." Sufficient information is provided for the TEI DTDs to be extended by users if necessary.

The TEI Guidelines are built on the assumption that virtually all texts share a common core of features, to which can be added tags for a specific discipline, text type, or application. The encoding process is seen as incremental Additional or increased growth, bulk, quantity, number, or value; enlarged.

Incremental cost is additional or increased cost of an item or service apart from its actual cost.
, so that additional tags may be inserted in a text as new researchers work on the text. Almost all encoding implies some interpretation of a text and so the guidelines provide for multiple views of a text and multiple encodings for individual phenomena within a text. They also provide a means of documenting any interpretation so that a new user of the text can know why that interpretation is there.

A TEI conformant text consists of a TEI header followed by the text itself. The text has optional front and back matter. The body of a text is divided into units which, for convenience, the TEI has chosen to call divisions using the tag <div>. SGML attributes are used to identify the type of division--e.g., "chapter," "stanza stan·za  
n.
One of the divisions of a poem, composed of two or more lines usually characterized by a common pattern of meter, rhyme, and number of lines.



[Italian; see stance.
," "fact." Within the smallest division, the basic element is a paragraph which can contain many other elements such as lists, names, dates, abbreviations, and so on.

The first draft version of the TEI Guidelines (Sperberg-McQueen & Burnard, 1990) has been distributed extensively for comment. The second draft is being made available electronically in fascicles from the listserv <tei-l[at]uicvm> as new chapters have been completed for publication. A cumulative print version is in preparation (Sperberg-McQueen & Burnard, 1990).

DOCUMENTING ELECTRONIC TEXTS

The source from which an electronic text has been compiled is sometimes not known or is unclear. One of the major reasons for this is that, until recently, there has been no standard way of providing this information in such a way that it does not get detached from the text or lost. Most of the large text archives held in research institutes use databases which they have developed themselves for recording information about the texts. These databases of ten consist of limited information of value only to themselves. Individuals who have created texts have not of ten even provided this information, most obviously because they themselves were fully aware of it and thus did not feel the need to record it. Many existing texts have encoding within them which is not documented and, if the exact source is not known, it may be impossible to identify, for example, what a group of percent signs in the middle of a text may mean.

One of the TEI's major contributions is a set of proposals for documenting electronic texts so that users may know what they have and librarians will have the information they need to catalog the texts. The TEI header is believed to be the first systematic attempt to provide in-file documentation of an electronic text which conforms to the same syntax as the markup within the text. It consists of four major sections or SGML elements each of which contains further elements or subdivisions. The file description element is the most important. It contains a full bibliographic description of the electronic file which can be used for creating catalog entries or bibliographic citations. It must include a file statement which gives the title of the work and those responsible for its intellectual content, a publication statement which identifies the publication or distribution of an electronic text, and a source description, which is a bibliographic description of the source from which the electronic text was derived. Additional optional elements give information relating to relating to relate prepconcernant

relating to relate prepbezüglich +gen, mit Bezug auf +acc 
 one edition of a text, the approximate size of the text in whatever units are convenient, the series, if any, to which a publication belongs, and notes which provide additional descriptive information not contained in other elements.

The encoding description element provides information which the user of a text needs to know. It documents the methods and editorial principles that governed the transcription of the text, also giving the intellectual rationale for any analytic or interpretive information. Additional information which characterizes a text but does not fit easily into the other header sections is given in the profile description. This includes information about the participants in a conversation if the text is a transcript of speech as well as details of the natural languages used in the text. The fourth section, the revision history, documents any changes made to the text and provides information which is critical for working with electronic texts in which changes are made over time and where there is a need to ensure that a particular version of a text was used.

The file description contains sufficient information for a librarian to catalog the text, with indications of its source. The encoding description contains information which anyone who uses the text needs to have. The revision history provides a means of recording updates to the text. The TEI header does not yet have elements which specifically address authentication (1) Verifying the integrity of a transmitted message. See message integrity, e-mail authentication and MAC.

(2) Verifying the identity of a user logging into a network.
, but it would be a simple matter to define extra elements which would contain a time stamp See timestamp.  or other authentication codes. These may further be extended to apply to only certain SGML elements within the text, leaving the others to be modified as users exploit the text for their own purposes.

CATALOGING ELECTRONIC TEXTS

The only attempt to create a systematic catalog of electronic texts in the humanities using standard bibliographic procedures is the Rutgers Inventory of Machine-Readable Texts in the Humanities, which began in 1983 in response to a growing number of enquiries about the availability of electronic texts. Responsibility for the inventory was assumed by the Center for Electronic Texts in the Humanities (CETH CETH Center for Electronic Texts in the Humanities (Rutgers University) ) when it was established in the second half of 1991 at Rutgers and Princeton universities. The inventory is available now on the Research Libraries Information Network (RLIN RLIN Research Libraries Information Network ) and should be accessible soon in other ways as well.

Research done at CETH indicates that there is little expertise in the specific problems of cataloging electronic texts. CETH has found that most of the expertise in cataloging computer files is derived from experience with either software or social science numeric data Refers to quantities and money amounts used in calculations. Contrast with string or character data.  files. There seems to have been very little emphasis specifically on electronic text files in the humanities where it is necessary to know whether the text requires specific software in order to use it, or what encoding scheme it uses. The rules in Chapter 9 of the Anglo-American Cataloging Rules, second edition, 1988 Revision (AACR2R AACR2R Anglo-American Cataloguing Rules, 2nd Revised Edition ) (ALA, 1988), cover all kinds of computer files (programs, numeric data files, and so on) but are not especially suitable for electronic texts. CETH has now developed comprehensive guidelines for cataloging electronic texts in the humanities which ensure that sufficient information is provided in the catalog record to enable librarians to provide access to the text in a meaningful way (Hoogcarspel, 1993, 1994).

At present, the main source of information for updating the inventory is a survey which began in 1990 with funding from the Commission of the European Communities. A first questionnaire was distributed to some 5,000 people who were on the mailing lists of the Association for Literary and Linguistic Computing, the Association for Computers and the Humanities, the Association for Computational Linguistics, and twelve other sponsoring organizations. Recipients were asked to note which of the following types of data they hold: (1) speech (tape recordings), (2) single (individual) texts, (3) collections of texts, (4) corpora (collections of texts which have been put together on a principled prin·ci·pled  
adj.
Based on, marked by, or manifesting principle: a principled decision; a highly principled person.
 basis as samples which represent a specific population--e.g., the Brown Corpus The Brown Corpus of Standard American English (or just Brown Corpus) was compiled by Henry Kucera and W. Nelson Francis at Brown University, Providence, RI as a general corpus (text collection) in the field of corpus linguistics. ), (5) machine-readable dictionaries, and (6) computational lexica lex·i·ca  
n.
A plural of lexicon.
. The intention was to send a different follow-up questionnaire depending on the type of data held. CETH took over responsibility for doing the follow-up for single texts, collections of texts, and corpora.

The questionnaire reflected input from a large number of people in the field and was intended to be comprehensive in its coverage with regard to written language, spoken language, and dictionary and other lexical information. In redesigning the questionnaire, CETH took great care to ensure that bibliographic records could easily be created from the data. Examples were provided together with simple explanation. The responses have come from a variety of individuals and institutes, all of whom appear to have different procedures for documenting and cataloging their texts. None found it particularly easy to describe their material in such a way that catalog records could easily be created from it, and CETH is of ten finding it necessary to go back to the respondent for further information. Cataloging the backlog of existing collections is not going to be an easy task. Future material will be much easier to catalog once standard procedures for documenting the texts are adopted.

SOFTWARE ISSUES

Software issues are also central to the evaluation of electronic texts, the key question being how to provide access to the texts. An ASCII file with markup offers the most possibilities in that it is not restricted to any particular program. Scholars and libraries can use it for whatever purposes they like. However, they need to acquire software from somewhere else or write their own programs. Writing their own software gives the most flexibility, but in reality very few people are going to do that. The time investment is large and it is obviously not efficient for the person who just wants to look up a few words. It pays off only for the scholar who has very specific individual requirements.

Texts which are pre-indexed for some specific software are easier to begin with and can satisfy a good many purposes, but it is important then to look at the facilities provided by the software. Browsing normally is easy, but the real issue is how to find exactly what is wanted. The response to any search request is only as good as what is indexed. Most commonly used index-based retrieval programs cannot really handle many of the complex features of scholarly texts such as marginalia, the critical apparatus and parallel texts which were discussed earlier in this article. They work with document structures which are too simple. Accented characters and diacritics This article is about the academic journal. For the accent mark, see Diacritic.

diacritics is an academic journal founded in 1971 at Cornell University.
 also pose problems as decisions need to be made at the time of building the index as to whether these are to be specified in a search query. Once scholars begin to use these programs seriously and gain an understanding of basic techniques, more often than not they will want to ask questions which cannot be answered by a package, even though it is technically feasible to do so. They will find that much existing software becomes limiting in terms of what it can do. There are instances where the capability of the software is driving the kinds of research which can be done and, in some cases, restricting it. Electronic texts can transform scholarship, but the needs and requirements of scholars ought to be paramount in the design and development of new software which effects these transformations.

Traditional humanities computing software, such as Micro-OCP and TACT, leaves these decisions entirely in the hands of the scholar. He or she has specific areas of research and can work with that in mind, but as more people want to work on the same texts, it makes sense not to duplicate effort and to provide these centrally. Libraries are now beginning to index texts for many different people to use, most notably with the PAT software, a retrieval program developed by Open Text Systems of Waterloo, Canada. In effect, in some ways, they are taking on responsibility for the intellectual content of the text since they have to decide what is indexed and how it can be accessed. The Text Encoding Initiative has laid the groundwork for multipurpose mul·ti·pur·pose  
adj.
Designed or used for several purposes: a multipurpose room; multipurpose software.


multipurpose
Adjective
 text after over four years of research. It is debatable whether existing software provides the capability of building indexes which can satisfy many different purposes. A good deal more research may be needed in this area, particularly in the exploitation of SGML.

CONCLUSION

What can a library do now? Setting up an electronic text center with adequate support is a considerable investment. The greater part of the cost will involve staff who support the facility. Ensuring that staff have adequate training in the relevant tools and techniques is essential so that they can make informed judgments. Discussion groups, such as <etextctr[at]rutvml>, are intended to help librarians enter this new world. Consultation with potential users is important at all stages. Experimenting with some CD-ROM-based resources would be a good first step as well as looking at some widely available software tools. These should help libraries decide how to handle texts over the network which is so obviously the long-term future. More than anything, widespread consultation and collaboration in research will be needed in order to determine the principles and procedures for providing accesss to the multipurpose high-quality electronic text which will serve the needs of humanities scholarship in the next century.

REFERENCES

Aitken, A.J., & Bratley, P. (1967). An archive of older Scottish texts for scanning by computer. English Studies, 48(1), 60-61. American Library Association American Library Association, founded 1876, organization whose purpose is to increase the usefulness of books through the improvement and extension of library services. . (1988). Anglo-American Cataloging Rules (2d ed.). Chicago, IL: ALA. Bolton, W. (1990). The Bard in bits: Electronic editions of Shakespeare and programs to analyze them. Computers and the Humanities, 24(4), 275-287. Burnard, L. D. (1988). Report of workshop on text encoding guidelines. Literary and Linguistic Computing, 3(2), 131-133. Burrows, J. F. (1987). Computation into criticism: A study of Jane Austen's novels and an experiment in method. Oxford, England: Clarendon Press. Butler, C. S. (Ed.). (1992). Computers and written texts (Applied language Studies series). Oxford, England: Blackwell. Coombs, J. H.; Renear, A. H.; & DeRose, S. J. (1987). Markup systems and the future of scholarly text processing. Communications of the ACM (publication) Communications of the ACM - (CACM) A monthly publication by the Association for Computing Machinery sent to all members. CACM is an influential publication that keeps computer science professionals up to date on developments. , 30(11), 933-947. Delany, P., & Landow, G. P. (Eds.). (1991). Hypermedia and literary studies. Cambridge, MA: MIT MIT - Massachusetts Institute of Technology  Press. Hockey, S. (1980). A guide to computer applications in the humanities. Baltimore, MD: Johns Hopkins University Press The Johns Hopkins University Press is a publishing house and division of Johns Hopkins University that engages in publishing journals and books. It was founded in 1878 and holds the distinction of being the oldest continuously running university press in the United States. . Hockey, S. M., & Ide, N. M. (Eds.). (1991). Research in humanities computing 1: Selected papers from the ALLC/ACH Conference, Toronto, June 1989. Oxford, England: Oxford University Press. Hoogcarspel, A. (1993). Bibliographic control of electronic texts. Unpublished paper presented at ACHALLC93 Conference, Georgetown. Hoogcarspel, A. (1994). Guidelines for cataloging monographic electronic texts at the Center for Electronic Texts in the Humanities (CETH Technical Report No. 1). Charlottsville, VA: University of Virginia, CETH. Kenny, A. (1978). The Aristotelian Ethics: A study of the relationship between the Eudemian and Nicomachean ethics of Aristotle. Oxford, England: Clarendon Press. Lancashire, I. (1991). The humanities computing yearbook 1989-90. Oxford, England: Clarendon Press. Miall, D. S. (Ed.). (1990). Humanities and the computer: New directions. Oxford, England: Clarendon Press. Mosteller, F., & Wallace, D. L. (1964). Inference and disputed authorship: The Federalist fed·er·al·ist  
n.
1. An advocate of federalism.

2. Federalist A member or supporter of the Federalist Party.

adj.
1. Of or relating to federalism or its advocates.

2.
. Reading, MA: Addison-Wesley. Mylonas, E. (1992). An interface to classical Greek civilization. Journal of the American Society for Information Science, 43, 192-201, Oxford University Computing Service. (1988). Micro-OCP user manual. Oxford, England: Oxford University Press. Packard, D. W. (1974). Sound-patterns in Homer. Transactions of the American Philological Association Transactions of the American Philological Association (TAPA) is an academic journal founded in 1869 and the official publication of the American Philological Association. , 104, 289-260. Renear, A. H.; Durand, D. G.; & Mylonas, E. (1992). Overlapping hierarchies of text objects: Refining our notion of what text really is. In ALLCACH '92 conference abstracts (the 19th International Conference of the Association for Literary and Linguistic Computing and the 12th International Conference on Computers and the Humanities) (pp. 200-203). Oxford, England: Oxford University. Robey, D. (1987). Sound and sense in the Divine Comedy. Literary and Linguistic Computing, 2(2), 108-115. Robinson, P. M. W. (In Press). Collate: A program for interactive collation of large textual traditions. In S. Hockey & N. Ide (Eds.), Research in Humanities Computing 3. Oxford, England: Oxford University Press. Shreeves, E. (1992). Between the visionaries and the Luddites: Collection development and electronic resources in the humanities, Library Trends, 40(4), 579-595. Sperberg-McQueen, C. M. (1991). Text in the electronic age: Textual study and text encoding, with examples from Medieval texts. Literary 4nd Linguistic Computing, 6(1), 34-46. Sperberg-McQueen, C. M., & Burnard, L. (Eds.). (1990). Guidelines for the encoding and interchange of machine-readable texts (TEI document PI). Chicago, IL and Oxford, England: ACH-ACL-ALLC. Sperberg-McQueen, C. M., & Burnard, L. (Eds.). (1994). Guidelines for electronic text encoding and interchange (TEI document P3). Chicago, IL and Oxford, England: ACH-ACL-ALLC. van Herwijnen, E. (1990). Practical SGML. Dordrecht, The Netherlands: Kluwer Academic Publishers.
COPYRIGHT 1994 University of Illinois at Urbana-Champaign
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 1994, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Libraries and the Internet: Education, Practice & Policy
Author:Hockey, Susan
Publication:Library Trends
Date:Mar 22, 1994
Words:7601
Previous Article:Equalizing access to electronic networked resources: a model for rural libraries in the United States. (Libraries and the Internet: Education,...
Next Article:The role of computer networks in aerospace engineering. (Libraries and the Internet: Education, Practice & Policy)
Topics:



Related Articles
As the cursor blinks: electronic scholarship and undergraduates in the library.(The Library and Undergraduate Education)
Library user education: examining its past, projecting its future.(The Library and Undergraduate Education)
Becoming digital: policy implications for library youth services.(Children and the Digital Library)
The Internet: a core or value added service? (public's right to Internet at libraries)
Revolution in the Library(*).(the electronic information revolution will alter library services, but hopefully not change the central role of...
The Human Element in the Virtual Library.(a change to virtual libraries seems underway, and librarians are more important than ever as these changes...
Love's Labour's Lost: The Failure of Traditional Selection Practice in the Acquisition of Humanities Electronic Texts.
Introduction.(use of computers)
Evaluating reference services in the electronic age.
The role of research in public libraries.

Terms of use | Copyright © 2008 Farlex, Inc. | Feedback | For webmasters | Submit articles