Printer Friendly
The Free Library
5,074,106 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Building an Internet archive system for the British Broadcasting Corporation.


ABSTRACT

Since its beginnings as the British Broadcasting Corporation (company) British Broadcasting Corporation - (BBC) The non-commercial UK organisation that commissions, produces and broadcasts television and radio programmes.

The BBC commissioned the "BBC Micro" from Acorn Computers for use in a television series about using computers.
 (BBC BBC
 in full British Broadcasting Corp.

Publicly financed broadcasting system in Britain. A private company at its founding in 1922, it was replaced by a public corporation under royal charter in 1927.
) Networking Club in April 1994, the BBC's Web site has grown to over two million pages. While bbc.co.uk inarguably offers a valuable source of information, entertainment, and education for its users and provides an online arena for peer-to-peer communication, it also brings into focus the challenge of digital preservation. Apart from the sheer volume of material the site represents, the nature of that material is forever changing both to reflect editorial strategy and to benefit from new technologies and improved production techniques. To support its own internal business requirements and to satisfy external legislative requirements, the BBC's Information and Archives Department is building a Legal and Historical Internet Archive See Wayback Machine and Web archiving.  System to capture a selection of content as it is published to the "live" site. This article looks at how the design and development of that system supports the preservation of heterogeneous digital material in the wider context of archiving the BBC's new media output.

**********

The British Broadcasting Corporation's (BBC) (1) Information and Archives Department--a relatively recent amalgamation amalgamation /amal·ga·ma·tion/ (ah-mal´gah-ma´shun) trituration (3).
amalgamation (
 of research libraries, archives, and preservation services across the BBC's national and regional operational centers--manages much of the corporation's physical and electronic records and audio visual assets. One of its current projects is the introduction of a system for the automatic capture of the BBC's online services published to bbc.co.uk. This article describes the development of that system and its design and implementation in the context of the corporation's main business driver: the creation of distinctive programs and services.

Information and Archives (I&A) has always been responsible for a wide variety of material: documents, television and radio programs, and in recent years the digital and electronic equivalents. Preservation is an ongoing part of the remit To transmit or send. To relinquish or surrender, such as in the case of a fine, punishment, or sentence.

An individual, for example, might remit money to pay bills.


TO REMIT. To annul a fine or forfeiture.
     2.
; converting archived content into new formats has been an issue within the Television Archive since 1948, when nitrate nitrate, chemical compound containing the nitrate (NO3) radical. Nitrates are salts or esters of nitric acid, HNO3, formed by replacing the hydrogen with a metal (e.g., sodium or potassium) or a radical (e.g., ammonium or ethyl).  was used for film production and archivists were already aware that alternative stock would need to be developed to save material in the long term. And in the BBC archives there is a vast amount of it.

Stored in multiple sites around the United Kingdom are over 600,000 hours of complete television programs, stockshots, and unedited or untransmitted material held on film, 2-inch and 1-inch videotape videotape

Magnetic tape used to record visual images and sound, or the recording itself. There are two types of videotape recorders, the transverse (or quad) and the helical.
, and U-Matic and Beta formats. More than 300,000 hours of radio are available on wax cylinders, vinyl records, audio cassettes A 1/8" inch, analog audio tape format that has been widely used for music distribution and home recording. Although the same size housing is used, the tape thickness and length determine the recording time. Cassettes holding from 15 minutes to 60 minutes per side have been manufactured. , CDs, and DAT (1) (Dynamic Address Translator) A hardware circuit that converts a virtual memory address into a real address. See also DAT file.

(2) (Digital Audio Tape) A magnetic tape technology used for backing up data.
 or one-quarter-inch tape, with in excess of 25,000 sound effects sound effects
Noun, pl

sounds artificially produced to make a play, esp. a radio play, more realistic

sound effects nplefectos mpl sonoros

 captured on CD and vinyl. And then there are the 100 million documents held at the Written Archives Centre, 3 million photographs--hardcopy and electronic, 22 million newspaper cuttings, 1.2 million commercial music recordings, and 4 million items of sheet music. And all of that needs to be managed through processes for intake, cataloguing and indexing, research and access, and long-term preservation. So, although we are in an increasingly digital environment, the issues are not new--it is only the challenges that are different.

WHAT ARE THE DRIVERS FOR ARCHIVING AND PRESERVATION FOR THE BBC?

The BBC is beholden be·hold·en  
adj.
Owing something, such as gratitude, to another; indebted.



[Middle English biholden, past participle of biholden, to observe; see behold.
 to several pieces of often contradictory UK legislation. One of these is the Broadcasting Act This article is about a Canadian Act of Parliament. For the British laws, see Broadcasting Act 1990, or the Broadcasting Act 1980

The Broadcasting Act (long title: "
 of 1996, which dictates that "it shall be the duty of each broadcasting body to retain a recording of every television or sound programme which is broadcast by that body--

(a) where it is of a television programme, during the period of 90 days beginning with the broadcast, and

(b) where it is of a sound programme, during the period of 42 days beginning with the broadcast" (Queen's Printer The Queen's Printer (or King's Printer when the monarch is male) is a position defined by letters patent under the royal prerogative in the United Kingdom. The holder of the letters patent has the nearly exclusive right of printing, publishing and importing the King James  of Acts of Parliament, 1996a, Section 117, Part V).

In other words Adv. 1. in other words - otherwise stated; "in other words, we are broke"
put differently
, the BBC is legally required to record its TV and radio output off-air to enable the corporation to answer complaints from the listening and viewing public.

This is reiterated by the 2003 Communications Act The establishment of the Federal Communications Commission (FCC) in 1934, the regulatory body for interstate and foreign telecommunications. Its mission is to provide high-quality services at reasonable cost to everyone in the U.S. on a nondiscriminatory basis. , which pays legal lip-service to the existence of platforms other than radio and television by applying the retention periods of 90 and 42 days to "every programme service" (Queen's Printer of Acts of Parliament, 2003a, Section 334). The act also includes a requirement "to comply with any request by OFCOM (2) to produce to them for examination or reproduction a recording retained in pursuance of in accordance with; in prosecution or fulfillment of.

See also: Pursuance
 the conditions in the licence" (Queen's Printer of Acts of Parliament, 2003a, Section 334).

Despite being relatively new legislation, disappointingly the act does not take into consideration the technical challenges of capturing new media services direct from broadcast, preserving them in their original form, and being able to re-create them as required in answer to legal claims. While neither act specifically mentions the Internet--or in fact any other new media platform--the BBC does not want to set any legal precedents but would rather demonstrate best endeavors to meet the letter of the law. To that end, an internal agreement was made in 2000 between the Head of Online and the BBC's Programme Complaints Unit that bbc.co.uk would be defined as another broadcast channel and its content captured off-air as with TV and radio.

But there are other legislative reasons for retaining the BBC's broadcast output that shape archive policy. The period of liability under the Defamation defamation

In law, issuance of false statements about a person that injure his reputation or that deter others from associating with him. Libel and slander are the legal subcategories of defamation. Libel is defamation in print, pictures, or any other visual symbols.
 Act of 1996 is currently one year, which suggests that the BBC should be retaining content for longer than the Broadcasting Act's ninety-day requirement. Under section 2 of the act, any "offer to make amends AMENDS. A satisfaction, given by a wrong doer to the party injured for a wrong committed. 1 Lilly's Reg. 81.
     2. By statute 24 Geo. II. c. 44, in England, and by similar statutes in some of the United States, justices of the peace, upon being notified of an
" is an offer to "make a suitable correction of the statement complained of" and to " publish the correction and apology" (Queen's Printer of Acts of Parliament, 1996b). Consequently, at least one function of the archive is to provide a record of both the original defamatory def·a·ma·tion  
n.
The act of defaming; calumny.



de·fama·tory adj.
 material and any consequent official response broadcast on air.

Another raft of legislation centers on information and data management. Schedule 1 of the 1998 Data Protection Act outlines the data protection principles, which include that "Personal data shall be obtained only for one or more specified and lawful purposes" and that "appropriate technical and organisational measures shall be taken against unauthorised or unlawful processing of personal data" (Queen's Printer of Acts of Parliament, 1998, Schedule 1, Part 1).

BBC online publishes an increasing amount of user-generated content The production of content by the general public rather than by paid professionals and experts in the field. Mostly available on the Web via blogs and wikis, user-generated content refers to material such as the daily news, encyclopedias and other references, movie and product reviews as . "Talk" (http://www.bbc.co.uk/communicate/) calls for feedback on the BBC's output and provides a gateway to the message boards, while sites like "iCan" (http://www.bbc.co.uk/dna/ican/) and "Collective" (http://www .bbc.co.uk/dna/collective/) provide platforms for peer-to-peer debate on local and cultural issues. The personal data shared over the BBC's network needs to be managed in accordance with the requirements of the Data Protection Act; although this is more of a concern for content producers who directly receive and manipulate the data, there could be implications for data that is ingested in·gest  
tr.v. in·gest·ed, in·gest·ing, in·gests
1. To take into the body by the mouth for digestion or absorption. See Synonyms at eat.

2.
 into a long-term archiving system.

More specifically relevant to bbc.co.uk, the United Kingdom's Legal Deposit Libraries Act of 2003 is enabling legislation Noun 1. enabling legislation - legislation that gives appropriate officials the authority to implement or enforce the law
legislation, statute law - law enacted by a legislative body
 intended to extend the concept of "legal deposit" to cover electronic publications, including Web pages. The act is clear about who dictates the delivery of content to the deposit libraries: "where a work is published or made available to the public in different formats, (to) provide for the format in which any copy is to be delivered to be determined in accordance with requirements specified (generally or in a particular case) by the deposit libraries or any of them" (Queen's Printer of Acts of Parliament, 2003b, Section 6). Though daunting daunt  
tr.v. daunt·ed, daunt·ing, daunts
To abate the courage of; discourage. See Synonyms at dismay.



[Middle English daunten, from Old French danter, from Latin
, the act highlights the general desire to preserve the national heritage locked in electronic publications even if the technical problems have yet to be resolved. It also raises the question of who has responsibility for preservation.

Will the British Library--and any other institutions granted "legal deposit" status under the Legal Deposit Libraries Act--become the nation's "digital preserver"? And would that mean the BBC should hand over its content and relinquish such a role? Considering the business demands of the corporation, probably not, but there will be a need for developing a close strategic relationship with the deposit libraries not least because they will have the authority to dictate the means of delivery, which in turn could have a major financial impact on the corporation aside from the implications for content creation processes.

Part of the thinking behind the Legal and Historical Internet Archive System is that, if we can demonstrate that the BBC is responsibly managing its own collection of published Internet output, then perhaps we could work with the deposit libraries to agree upon the nature and regularity of deposits instead of having to meet the blanket requirements of the Legal Deposit Libraries Act. Deciding which URLs suit the needs of which national institutions based on their own selection criteria (for example, the National Library of Wales The National Library of Wales (Welsh: Llyfrgell Genedlaethol Cymru) is the national legal deposit library of Wales, located in Aberystwyth. It is an Assembly Sponsored Public Body.  has already expressed an interest in content from BBC Cymru'r Byd [http://www.bbc.co.uk/cymru/]) would relieve pressure on the BBC's delivery mechanisms and provide focused collections specific to local needs.

I&A already enjoys long-established relationships with external agencies, including the British Film Institute and the British Library Sound Archive The British Library Sound Archive in London, England is one of the largest collections of recorded sound in the world, including music, spoken word and ambient recordings. Opened in 1955 as the British Institute of Recorded Sound, it became part of the British Library in 1983. , the first of which receives previously agreed-upon examples of our television output, and the second provides public access to our collection. So why not establish something similar for bbc.co.uk?

The Legal Deposit Libraries Act also requires the delivery "with the copy of the work, a copy of any computer program and any information necessary in order to access the work, and a copy of any manual and other material that accompanies the work and is made available to the public" (Queen's Printer of Acts of Parliament, 2003b, Section 6). This in itself raises issues of copyright and licensing. What special dispensation DISPENSATION. A relaxation of law for the benefit or advantage of an individual. In the United States, no power exists, except in the legislature, to dispense with law, and then it is not so much a dispensation as a change of the law.  needs to be agreed to allow legal deposit libraries not only the right to archive commercial software but also to use it to provide public access to stored content? Within the BBC the situation is no simpler. Software licensing models are different for each manufacturer and none encompasses the need to retain examples in the long-term for

purposes not covered not covered Health care adjective Referring to a procedure, test or other health service to which a policy holder or insurance beneficiary is not entitled under the terms of the policy or payment system–eg, Medicare. Cf Covered.  when the original licenses were signed.

But licensing aside, discussions with the BBC's technology support departments have focused on the possibility of creating a central store of decommissioned software that might be recommissioned for the purposes of supporting archived content at least in the short term. Clearly, however, that sounds simpler than commercial relationships currently allow.

Apart from external legal requirements, the BBC has the tenets of its own Royal Charter and Agreement to adhere to adhere to
verb 1. follow, keep, maintain, respect, observe, be true, fulfil, obey, heed, keep to, abide by, be loyal, mind, be constant, be faithful

2.
, including the corporation's responsibility for preserving the national heritage as reflected in its own broadcasting history. The agreement states that the BBC is to "establish and maintain ... an archive or archives of films, sound recordings and other recorded and printed matter which is representative of the sound and television programmes and films broadcast or transmitted by the Corporation" (Department of National Heritage: Broadcasting, 1996a, Section 11). This is an expansion of one of the "Objects of the Corporation" in Section 3 of the charter to "establish and maintain libraries and archives containing material relevant to the objects of the Corporation, and to make available to the public such libraries and archives with or without charge" (Department of National Heritage: Broadcasting, 1996b).

So not only should I&A be responsible for establishing a process for capturing, storing, and preserving the BBC's online output, but it should be considering how to make that collection accessible to the public. Both are something of a challenge and both are crucial in this period of charter renewal and external review. The BBC's services are in the spotlight, and should there be any major changes to bbc.co.uk, it is imperative that I&A be ready to record them within an archive of "what was" or, at the very least, to retain descriptive metadata of the content of decommissioned pages alongside information detailing context of their removal from the live site.

In fact, the site has already begun this process of change following the publication in July 2004 of the report of the Independent Review of BBC Online (Graf, 2004) conducted on behalf of the UK's Department of Culture, Media and Sport. The report's recommendations led to the closure of a handful of sites deemed either to duplicate content available elsewhere on the Web or--though serving the public interest--to attract too little traffic to make them economically viable. But irrespective of irrespective of
prep.
Without consideration of; regardless of.

irrespective of
preposition despite 
 the external drivers, archived bbc.co.uk content has potential for reuse, provides a good source of research, and enables producers to reflect on past output in the same way as any of the BBC's broadcast material held in the Radio and Television Archives.

SO WHAT IS IT THAT NEEDS CAPTURING?

bbc.co.uk began in 1989 when it was registered as bbc.uucp ("Unix-to-Unix copy") with dialup access to the site via Brunel University Brunel University is a university situated in West London, England. History
Brunel is one of a number of UK universities created in the 1960s following the Robbins Report on higher education (often called the plate glass universities).
; it was only available to BBC Development. It was then reregistered with the UK's academic "naming" body, the Name Registration Service (NRS NRS Nevada Revised Statutes
NRS National Runaway Switchboard (Chicago, IL)
NRS Natural Reserve System (University of California)
NRS National Readership Survey
NRS National Relay Service
), as bbc.co.uk in 1991, but it was still only available inside the BBC to the renamed BBC Research and Development Department. Three years later, on April 13, 1994, the BBC Networking Club (body) BBC Networking Club - A bulletin board run by the British Broadcasting Corporation Education department from April 1994 to 30 Nov 1995.  "opened for business" at bbcnc.org.uk. Commissioned by BBC Education to support the BBC2 series The Net--the first program with an online forum via the "Auntie" bulletin board--it was aimed at introducing viewers to the Internet: what it was, how it worked, and what it had to offer. Membership cost 5 [pounds sterling] a month, which gave subscribers access to "early-adopter sites" including "Top Gear," "Tomorrow's World Tomorrow's World was a long-running BBC television series, showcasing new (and often wacky) developments in the world of science and technology. First aired in 1965, it ran for 38 years until it was axed at the beginning of 2003, ostensibly because of falling ratings. ," and Radio 4's "Woman's Hour Woman's Hour is a magazine programme broadcast on BBC Radio 4 in the United Kingdom.

Created by Norman Collins and originally presented by Alan Ivimey the programme was first broadcast on 7 October 1946 on the BBC's Light Programme (now called Radio 2).
." It also provided nine-day TV and radio schedules and a "Guide to the Internet" introduced by a character called "Babbage." The BBC Networking Club was an Internet service provider Internet service provider (ISP)

Company that provides Internet connections and services to individuals and organizations. For a monthly fee, ISPs provide computer users with a connection to their site (see data transmission), as well as a log-in name and password.
, communication facilitator, information supplier, and Web publisher. In November 1997 News Online "went live" via their in-house-developed content production system, and by March 1998 the BBC Online home page was providing a portal to the BBC's Internet services.

Since then bbc.co.uk--as is now the accepted branding for the site--has grown to at least 750GB of content with daily incremental Additional or increased growth, bulk, quantity, number, or value; enlarged.

Incremental cost is additional or increased cost of an item or service apart from its actual cost.
 updates of 3-4GB. Content includes as wide a range of file types imaginable i·mag·i·na·ble  
adj.
Conceivable in the imagination: imaginable exploits.



i·mag
, providing audio, video, animation, and text, and meeting any number of editorial criteria within the BBC's overall remit to provide "information, education and entertainment" (Department of National Heritage: Broadcasting, 1996b, Section 3). The potential of the Internet as a broadcasting platform is exploited to the full with constant experimentation of potential services that technologically dictate the means of archiving sites and pages.

Currently content is produced within genre-based departments--for example, Learning and Interactive, Drama and Entertainment--and overseen by a central editorial and technical team that also has responsibility for other new media platforms, including interactive television and mobile devices. Methods of content creation range from the use of commercial tools and software--Microsoft's FrontPage, Macromedia's Dreamweaver, Adobe's GoLive--to handcrafted hand·craft  
n.
Variant of handicraft.

tr.v. hand·craft·ed, hand·craft·ing, hand·crafts
To fashion or make by hand.



hand·craft
 HTML HTML
 in full HyperText Markup Language

Markup language derived from SGML that is used to prepare hypertext documents. Relatively easy for nonprogrammers to master, HTML is the language used for documents on the World Wide Web.
 and in-house content production and management systems.

The biggest challenge for archiving content is the degree to which it might be described as "dynamic." In other words, it should be relatively easy to collect flat HTML pages--complete pages held on Web servers--but less easy to harvest those that are created within client Web browsers The following is a list of web browsers. Historical
Historically important browsers
In order of release:
  • WorldWideWeb, February 26, 1991
  • Erwise, April 1992
  • ViolaWWW, May 1992, see Erwise
 when a particular URL URL
 in full Uniform Resource Locator

Address of a resource on the Internet. The resource can be any type of file stored on a server, such as a Web page, a text file, a graphics file, or an application program.
 is requested at a particular date and time and content is provided from backend databases.

Only a relatively small percentage of bbc.co.uk pages have been or are currently dynamically driven. The now decommissioned and only semi-dynamic "myBBC" site gathered content from specified sources based on user profiles. This content was held on the Web server in readiness for publication and distributed via the publication system in response to user requests. In comparison, the bbc.co.uk home page is updated at least twice daily with headlines and supporting images provided directly from the News Online content production system. And then there is content that appears dynamic: the constantly updated travel information is actually provided via data feed from a third-party supplier. It is then FTPed to the BBC to be transformed into HTML on a production server and published to what is called the Master Content Server (MCS) for Web distribution every thirty minutes. The MCS is key to content publication. Online producers publish new files to the "BORG2 queue"--a mechanism that chronologically makes updates to the site and that provides the means of content delivery to the Legal and Historical Internet Archive.

THE LEGAL AND HISTORICAL INTERNET ARCHIVE SYSTEM

On August 26, 1999, Sir John Birt, the BBC's director general at the time, emailed a request to the then Head of Heritage to "work out what we need to do to preserve the BBC's early work on the Internet." This led to I&A being commissioned by the head of BBC Online to devise an "Online Archive Policy," which in turn led to a series of interviews questioning producers about the type of content they were creating; what they had retained of previously published pages and sites; and what they thought was logistically and technologically possible in terms of capturing output that could be accessed and re-created in the long term.

Engaging with content producers was sometimes difficult. At the time the Internet was seen as ephemeral Temporary. Fleeting. Transitory. : content was published today and deleted or overwritten tomorrow. Why should it be archived at all? Even the legal teams argued that, if it proved too difficult a challenge, we had had few complaints about BBC Online; not having the content with which to answer claims was a defence in itself. But that was not good enough in light of the charter requirements and ignored the potential for research and reuse afforded by an Internet archive.

So what were the options? As an alternative to the existing model for television and radio with I&As managing central collections, a decentralized de·cen·tral·ize  
v. de·cen·tral·ized, de·cen·tral·iz·ing, de·cen·tral·iz·es

v.tr.
1. To distribute the administrative functions or powers of (a central authority) among several local authorities.
 BBC Online collection would mean individual production departments capturing material at point-of-creation as a process integral to their content production or media asset management systems. Local collections would then be linked to a central metadata repository managed by I&A, which in turn would be linked to databases holding relevant rights information necessary for the reuse of content. It quickly became apparent that production departments did not want archiving to encroach upon Verb 1. encroach upon - to intrude upon, infringe, encroach on, violate; "This new colleague invades my territory"; "The neighbors intrude on your privacy"
intrude on, obtrude upon, invade
 content creation. Archiving was I&A's responsibility on behalf of the BBC and, while they would advise on its development, any archiving system would have to remain centralized cen·tral·ize  
v. cen·tral·ized, cen·tral·iz·ing, cen·tral·iz·es

v.tr.
1. To draw into or toward a center; consolidate.

2.
 and independent.

An initial trawl trawl - To sift through large volumes of data (e.g. Usenet postings, FTP archives, or the Jargon File) looking for something of interest.  of the Internet and conversations with other broadcasters pointed to there being no similar projects or initiatives underway and certainly no off-the-shelf software solutions. It did seem, though, that the functionality required for an online archiving system was closest to that provided by systems designed for newspaper publishing and electronic records management. They could handle multifaceted mul·ti·fac·et·ed  
adj.
Having many facets or aspects. See Synonyms at versatile.

Adj. 1. multifaceted - having many aspects; "a many-sided subject"; "a multifaceted undertaking"; "multifarious interests"; "the multifarious
 compound objects--individual components such as text, images, graphics, and complete documents--and offered version control and metadata management. But it was the need to implement as cost-effective a solution as possible that led to the decision to design and develop an in-house system that would

* be dependent on the online production process for the delivery of content, but neither be integral to nor have any detrimental effect upon it;

* be as "future-proof' as possible in order to support the capture of new formats;

* support digital objects irrespective of the source or file type;

* have the ability to display each Web page as originally published to bbc.co.uk;

* provide archival storage for retaining Web pages in perpetuity Of endless duration; not subject to termination.

The phrase in perpetuity is often used in the grant of an Easement to a utility company.


in perpetuity adj. forever, as in one's right to keep the profits from the land in perpetuity.
.

The budget also partially determined the scope of the system. Instead of aiming to provide a source of reusable content--which would have demanded access to adequate rights information--the system would meet only the BBC's legal, charter, and historical requirements. On that basis, BBC Technology was commissioned to develop, implement, and support the Legal and Historical Internet Archive System.

To keep the first iteration One repetition of a sequence of instructions or events. For example, in a program loop, one iteration is once through the instructions in the loop. See iterative development.

(programming) iteration - Repetition of a sequence of instructions.
 of the system as simple as possible and to focus on ensuring basic delivery of content to the archive, it was decided that the system would not capture

* audio/video content--most of which would be stored in its original format in the Television or Sound Archives;

* material published via News Online's content production system including Sport, Weather, and the World Service, all of which--excluding home pages and indexes--remains on the live site;

* dynamic database-driven content, which only exists in response to user requests and cannot be captured at the point of publication.

To ensure that the BBC retained some kind of record of published output, it was also decided that the system would store in perpetuity all metadata associated with content whether in- or out-of-scope.

The aim, however, is for future iterations of the system to include currently excluded output, examples of which have obvious historical and legal value:

* Most audio/video material available on bbc.co.uk is created for television or radio transmission and reformatted for the Web, but there is a growing trend for publishing full online versions of interviews, concerts, and festivals originally edited for broadcast. Without an inclusive archive strategy, there is a risk that these will be lost along with any examples of audio/video content being commissioned specifically for the Web--another current trend.

* News Online's home pages and indexes most clearly demonstrate any design or style changes and, because of the frequency and number of changes made to those pages as they keep pace with developing news stories, they could be the most litigious litigious adj. referring to a person who constantly brings or prolongs legal actions, particularly when the legal maneuvers are unnecessary or unfounded. Such persons often enjoy legal battles, controversy, the courtroom, the spotlight, use the courts to punish  of all the BBC's output.

* Being dynamically created, message boards also fall out of scope but are historically valuable as a demonstration of the Web providing a platform for peer-to-peer communication on topics important in today's society.

That said, there is the possibility of investment in other methods of content capture to complement the system: using stand-alone personal computers to store snapshots of databases feeding dynamic sites, for example. In effect this is no different from I&A retaining and maintaining the machines capable of playing archived film or vinyl records, but it would still represent something of a "first" for digital archiving and preservation.

The working title for the system--the Legal and Historical Internet Archive--deliberately emphasizes that it is not intended as a source of reusable material. One of the main reasons for this is the lack of necessary rights information. The system will collect URLs at the point where they are published and made publicly available, which means that no copyright details will be harvested by the system since such sensitive information is deliberately not included in HTML metatags. Unfortunately, this also means that system users are technically able both to download and reuse stored files. There is currently no way of preventing this from happening, though users will be reminded at login Signing in and gaining access to a network server, Web server or other computer system. The process (the noun) is a "login" or "logon," while the act of doing it (the verb) is to "log in" or to "log on.  that they have no authorization for downloads and warned that, in the event of a legal challenge over republished material, the system will have retained a record of all transactions.

The system will also be able to quarantine quarantine (kwŏr`əntēn), isolation of persons, animals, places, and effects that carry or are suspected of harboring communicable disease.  potentially libelous In the nature of a written Defamation ,a communication that tends to injure reputation.  or defamatory content with access restricted to I&A system administrators and New Media's legal team via the use of a password. This eliminates the risk of content reuse during any period of investigation.

Content quarantined quar·an·tine  
n.
1.
a. A period of time during which a vehicle, person, or material suspected of carrying a contagious disease is detained at a port of entry under enforced isolation to prevent disease from entering a country.
 to a separate area of the system would then be reviewed and reinstated in the main system, marked for permanent exclusion, or simply deleted dependent on the outcome of the legal proceedings All actions that are authorized or sanctioned by law and instituted in a court or a tribunal for the acquisition of rights or the enforcement of remedies. . Certain content--typically that provided by a third-party supplier or independent production company--may very well not be archived because of copyright issues. Though technically in-scope, the URLs will be flagged to prevent their being captured by the system.

Because the archive needs to serve different business needs, user permissions will be dictated by relevant business processes. At the top of the access hierarchy sits the "Approver" with control over all aspects of system management, including, most significantly, the ability to delete and permanently quarantine archived content. The "Administrator" will deal with routine administrative tasks and run business and management reports, while the "Legal User" can view quarantined content and the "Restricted" or "General User" is allowed basic access to content. In order to satisfy the requirements of the "Legal Users"--New Media's team of lawyers who would have to answer any public complaints about content on bbc.co.uk--system logins will be granted to a maximum of fifty users. Compare that to the pan-BBC access to all other archive collections with the workforce now in the region of 27,000 employees. There are benefits of such a "soft launch" beyond minimizing the risk of misuse of the system with illegal reuse of downloaded content. Any problems at implementation will affect relatively few people, and management reports will provide clearly focused information on usage patterns to support decisions on the system's further development.

Continuing the theme of "simplicity," the search mechanism will be restricted to queries based on URL and/or date and/or time conducted via a Web front end integrated with the standard internal BBC desktop. To that end, captured files will be stored as "Filename file·name also file name  
n.
A name given to a computer file to distinguish it from other files, often containing an extension that classifies it by type.
.FileExtension.TimeDateStamp" and in directories mirroring those of the original Web sites. It is important that the system's file structures are also designed to support future integration with federated search Federated search is the simultaneous search of multiple online databases and is an emerging feature of automated, Web-based library and information retrieval systems. It is also often referred to as a portal, as opposed to simply a Web-based search engine.  engines providing simultaneous access to I&A's multiple online resources. Query responses will return the nearest version to the requested URL based on date and time and encourage chronological navigation through to the "next" and "previous" versions of that URL. Where users specify only a date and time but no URL, the bbc.co.uk home page will be returned, and where only a URL is cited, all versions of that URL will be retrieved. All of this will be achieved within three seconds in at least 80 percent of cases and no longer than five seconds for the remaining 20 percent. Users will also be alerted to the status of quarantined or excluded content in response to search queries and reminded of the scope of the collection where requested URLs have not been captured. It is important to remember that Web pages will be returned "as published," and users will have the option of navigating away from the requested URL to any linked content within the archive.

At conception there were two possible options for the build of the system. The first was based on UNIX architecture A Unix architecture is a computer operating system system architecture that embodies the Unix philosophy. It may adhere to standards such as the Single UNIX Specification (SUS) or similar POSIX IEEE standard. , and the plan was to add disk arrays directly to the BBC's MCS, which contains all uploaded content ready for publication to the live Web servers. The proposal was for content to be copied directly to disk, stored in perpetuity and retrievable via a Web page hosted on the MCS. The second---and agreed--option used an Intel processor and Linux operating system operating system (OS)

Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs.
 not directly integrated with the MCS. The content would be received into a dedicated archive server delivered from the MCS as it would be to any Web server and retrievable via a Web page hosted on that archive server.

The attraction of the second option was that it would allow both the archive system and New Media's content production and publishing system to develop in isolation despite the dependency of the first on the second for the delivery of content. It would also allay al·lay  
tr.v. al·layed, al·lay·ing, al·lays
1. To reduce the intensity of; relieve: allay back pains. See Synonyms at relieve.

2.
 New Media's fears of archiving encroaching on the creative environment and in any way dictating the means by which content is produced.

The only other issue for the system build was the need for the underlying data model to be compliant with the BBC-developed Standard Media Exchange Framework (SMEF SMEF Standard Media Exchange Framework (BBC)
SMEF Spontaneous Massive Existence Failure (Douglas Adams' Starship Titanic)
SMEF Spontaneous Massive Existence Failure
). (3) The framework defines terms used in the content production, distribution, and broadcast chain across all media platforms including television, radio, and the Web and was initially designed to support the sharing of data across BBC networks and via BBC systems.

Project Management

The implementation of the system was directly managed by a project team comprising one project manager from BBC Technology (the system designers, builders, and supporters), another from I&A (representing the needs of the business), and the New Media Archivist ARCHIVIST. One to whose care the archives have been confided.  (the customer). The project team was then accountable to a project board, which convened monthly for updates on progress and whose role it was to make decisions and resolve conflicts or issues. The board included the project team and was augmented by representatives from New Media (both editorial and legal) and the BBC's Business Technology Analyst responsible for the pan-BBC storage strategy.

At the outset the project team identified the major risks as follows:

* If the rate of content creation--and consequently data volumes--dramatically increases, then the solution will run out of storage capacity, originally estimated at 4TB over the initial three years of the system's implementation.

* The BBC could be in breach of legal restrictions if quarantined content was inadvertently made generally available.

* Stored content could be corrupted or lost in the event of catastrophic technical failure.

Only the last of the three was considered unavoidable, whereas ongoing monitoring of data-ingestion rates would allow for timely increase of available storage and, with rigorous data and system management, access to quarantined content should be preventable.

It had already been accepted that the agreed storage strategy--dictated by budgetary constraints--meant that, should two disks become corrupted, then data on all disks, and consequently the whole archive, would be lost. Whereas RAID 1 Mirroring was the preferred--and more secure--option, it would have doubled storage costs compared with the alternative: three disk-arrays set up using RAID 5 Strip Set with Parity, which could cope with the loss of one disk (but not more) with no effect on system performance or stored data.

Unfortunately, there were a number of initially unidentifiable Adj. 1. unidentifiable - impossible to identify
identifiable - capable of being identified
 risks that had an impact on the projected eight-month time scale:

* The sale of BBC Technology--a BBC subsidiary--led to industrial action and resulted in the project manager's resigning prior to implementation.

* The unique system-build made unusual demands of a standard application: the modified version of Apache Web server See Apache.  unexpectedly required the development of a secondary layer of code in order to provide the necessary functionality. This meant eight weeks of redevelopment work and the drafting in of a second developer.

* The hardware failed to perform as required, and it took a second storage box and several weeks of investigation before the problem was identified: the mechanism for managing the ingestion ingestion /in·ges·tion/ (-chun) the taking of food, drugs, etc., into the body by mouth.

in·ges·tion
n.
1. The act of taking food and drink into the body by the mouth.

2.
 of content could not process the 17,000 hourly transactions and so caused an unacceptable backlog in the queue for updates to be published to bbc.co.uk.

The positive consequence of the unforeseen delays was the decision to move from Network Addressed Storage (NAS (1) See network access server.

(2) (Network Attached Storage) A specialized file server that connects to the network. A NAS device contains a slimmed-down operating system and a file system and processes only I/O requests by supporting the popular
) to the BBC's newly established Storage Area Network (SAN), designed to rationalize ra·tion·al·ize
v.
1. To make rational.

2. To devise self-satisfying but false or inconsistent reasons for one's behavior, especially as an unconscious defense mechanism through which irrational acts or feelings are made to appear
 storage management across the corporation and thereby reduce costs. The benefits for the Legal and Historical Internet Archive System were twofold:

1. Content would be ingested directly into the archive via a fiber card, thereby reducing the number of transactions for processing, speeding up data delivery, and avoiding congestion The condition of a network when there is not enough bandwidth to support the current traffic load.

congestion - When the offered load of a data communication path exceeds the capacity.
 in the bbc.co.uk publishing queue.

2. Backup would be to the more resilient tape rather than disk, and the risk of data corruption Data corruption refers to errors in computer data that occur during transmission or retrieval, introducing unintended changes to the original data. Computer storage and transmission systems use a number of measures to provide data integrity, the lack of errors.  would be reduced.

But there was one caveat. While the corporate advantages of central storage were inarguable, it was important to establish the difference between the administration of "active storage" to meet day-to-day business demands and that required for the archive management of content intended for retention in perpetuity. At the very least this meant ensuring data integrity throughout its lifespan and enabling possible future emulation or migration for the purposes of digital preservation.

The Service Level Agreement with BBC Technology--which became Siemens following the sell-off in October 2004--was designed to cover the level of system support provided and thereby went some way toward safeguarding the collection through the promise of monitoring and managing firewalls, general system security, anti-virus and other protective measures, and general hardware maintenance. It was also agreed that any requirement for increasing the system's functionality over the three-year agreement would be treated as an official "Change Request."

Post-Implementation and Future Development

Implementation marked the start of an ongoing review process: Did the system fully meet the requirements for a legal and historical archive? Should the collection be subject to a "Selection and Retention Policy" or remain as comprehensive as is technologically possible? How useful is the system to content producers and how can it be made more useful? Should it be integrated with content production systems? Can it support the reuse of content and how? How far does it fit with I&A's digital archiving and preservation strategy and with the BBC's broader technology roadmap The context of product management
The existence of product managers in the product software industry indicates that software is becoming more and more commercialized as a standard product.
? The data from regular technical and management reports were designed to help answer some of those questions.

The system's basic performance--its degree of success at capturing specified content and its search response times--would be monitored by Siemens' support team as part of the Service Level Agreement and judged against agreed benchmarks. More useful for the ongoing development of the system were the patterns of usage. On a weekly basis reports include the following:

* For individual logins: number and length of search sessions; total number of searches both successful and unsuccessful; and number of and details of downloaded files

* For legally quarantined or excluded content: number of URLs placed into quarantine; length of time URLs held in quarantine; number of URLs to be quarantined in perpetuity; number of URLs reinstated into the main collection; and number of URLs excluded from the capture process and/or deleted

* For general system usage: number of concurrent users In computer science, the number of concurrent users for a resource in a location, with the location being a computing network or a single computer, refers to the total number of people using the resource at the same time.  

Aside from the development plans made based on the use and performance of the system, there are several things that need to happen if it is to fully support the archiving and preservation of bbc.co.uk. Digital collections need preservation strategies, but there are as yet no answers to the emulation versus migration question. Arguably ar·gu·a·ble  
adj.
1. Open to argument: an arguable question, still unresolved.

2. That can be argued plausibly; defensible in argument: three arguable points of law.
, neither option preserves the original integrity of archived data, but each at least assures the recording of a digital memory. Emulation retains content to be replayed, rebroadcast, or retransmitted in its original format but via an imitation of its original technical context, while migration transfers that content from one format to another or from one storage device to another, essentially altering its original characteristics. And for that reason it was made clear throughout the project that the ingestion of individual files, the design of file structures, and the adopted storage strategy must support any future digital preservation process by allowing either blanket access to large volumes of data or focused access to specific file types.

Whether as a "Change Request" during the first three years of its implementation or beyond that initial period, the following requirements have already been highlighted for the next phase of the development of the Legal and Historical Internet Archive System:

* Extending search capabilities to include multilingual mul·ti·lin·gual  
adj.
1. Of, including, or expressed in several languages: a multilingual dictionary.

2.
 free-text searching across all stored content

* Integration with I&A's intranet--research.gateway--to support federated searching across all online archives and information sources

* Retrospective ingestion of existing bbc.co.uk content from a range of sources and on a range of formats: screenshots and complete pages held on servers and/or solid media and tape archives tape archive - tar  kept as backups to bbc.co.uk

* Extended archive coverage to include content published to mobile or other Internet-based platforms, including Wireless Application Protocol (WAP (1) (Wireless Access Point) See access point.

(2) (Wireless Application Protocol) A standard for providing cellular phones, pagers and other handheld devices with secure access to e-mail and text-based Web pages.
) services; home pages and indexes from any sites published via the News Online content production system and including the World Service, Sport, and Weather; dynamic- and user-generated content including message boards; rich media where content is unique to and/or commissioned by bbc.co.uk

The other main area for development would be the standardization standardization

In industry, the development and application of standards that make it possible to manufacture a large volume of interchangeable parts. Standardization may focus on engineering standards, such as properties of materials, fits and tolerances, and drafting
, inputting, and management of metadata. The system is dependent on the delivery of content via the external publishing mechanism. This means that pages carry no metadata beyond that required by search engines for indexing purposes: it has little value for internal business use beyond the legal and historical remit of the first version of the system.

Post-production cataloguing of over two million constantly updated Web pages is clearly not feasible, but perhaps neither is mandating producers to complete metadata records at the point of content creation. An alternative might be to network the archive with content production and/or media asset management systems, which already contain metadata gathered during the creation and publication process. Is it not better to link to sources of information than to input that information a second time?

Though it does not solve the problem of standardizing metadata--I&A is far from imposing even the blandest set of required fields across quite disparate content production areas--it at least begins the process. But I&A will have control over the retrospective addition of metadata to archived content and is considering the use of Dublin Core A set of meta-data descriptions about resources on the Internet. Used for resource discovery, it contains data elements such as title, creator, subject, description, date, type, format and so on. Dublin Core descriptions are often included in HTML meta tags.  (http://dublincore .org) as the basis for developing a standard not only for bbc.co.uk but for implementation across all media. In particular, the recently approved "Provenance prov·e·nance  
n.
1. Place of origin; derivation.

2. Proof of authenticity or of past ownership. Used of art works and antiques.
" field, providing a "statement of any changes in ownership and custody of the resource since its creation," will help ensure data integrity throughout its life cycle and, it is hoped, in perpetuity.

Adopting an internationally recognized standard would not only support the internal and external sharing of content but the provision of future public access to the system where technologically possible without compromising the BBC's own network infrastructure.

NEW MEDIA ARCHIVING

Archiving and preserving bbc.co.uk is no different from archiving other New Media services: each is interactive and each comprises content from different sources in different formats with different drivers for short- and/ or long-term retention. And they each require appropriate storage and preservation strategies. Archiving interactive TV--essentially the result of a collision between broadcast and computing technologies--means capturing audio and video streams, text, graphics, presentational templates, and software applications. Mobile services--including phones and personal digital assistants (PDAs)--could mean the retention of physical devices or, if not, the acquisition of appropriate content emulators.

With such rapid change in the development of platforms for content distribution, it is essential to work as closely as possible with the creators of that content. Digital archiving and preservation are not just about what to do with material when you have it but also about facing the challenge of how to acquire it and, as far as possible, attempting to influence its creation and delivery. That said, it is unlikely that archive requirements could ever directly influence production methods, though the Internet Archive's Brewster Kahle Brewster Kahle (pronounced 'kale', (IPA: /keɪl/) (b. 1960)[1] is a U.S. internet entrepreneur, activist and digital librarian. Kahle graduated from MIT in 1982 with an SB degree in Computer Science & Engineering where he was a member of the Chi Phi Fraternity.  (Rein, 2004) believes that is exactly what should happen. That does not mean, however, that content producers should not have an understanding of those requirements and an agreement that they need to be addressed as early in the production process as possible.

The situation is not as bleak as it might sound. The BBC's New Media Department has recently initiated two archive-related projects. The "Creative Archive" will allow users to download clips from archive programs Software that compresses one or more files and folders into a single file. Many archive programs compress and decompress using the ZIP file format. See ZIP file, archive formats and data compression. , provide tools for editing those clips, and encourage peer-to-peer sharing See peer-to-peer network.  of content. According to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 a recent press release (British Broadcasting Corporation, 2004), one of the main objectives is to "pioneer a new approach to public access rights in the digital age," adopting the Creative Commons An organization that has defined an alternative to copyrights by filling in the gap between full copyright, in which no use is permitted without permission, and public domain, where permission is not required at all.  (http:// creativecommons.org) model already prevalent in the United States United States, officially United States of America, republic (2005 est. pop. 295,734,000), 3,539,227 sq mi (9,166,598 sq km), North America. The United States is the world's third largest country in population and the fourth largest country in area. . And there are internal benefits for the BBC: the greater the demand for access to archive material, the faster the growth of its digital collections. This certainly proved true for News Online's "On This Day" (http://news.bbc .co.uk/onthisday/), which not only provides public access to otherwise inaccessible news footage but, in its demand for content, supports digital preservation through the migration of analogue material to digital formats for inclusion on the site. Both projects--as well as the planned Interactive Media Player designed as a portal to the previous seven days of BBC television BBC Television is a service of the British Broadcasting Corporation which began in 1932. The British Broadcasting Corporation has operated in the United Kingdom under the terms of a Royal Charter since 1927.  and radio programing via the Web--will feed directly into the Legal and Historical Internet Archive System. Or perhaps not directly. It might be that audio/visual content of this kind could be archived prior to distribution rather than via the external Web publishing Creating a Web site and placing it on the Web server. A Web site is a collection of HTML pages with the home page typically named INDEX.HTML. Web sites are designed using Web authoring software which provides a graphical layout capability or by hand coding in HTML or both.  mechanism. This would relieve pressure on the ingestion process and ensure the simultaneous delivery of all associated metadata.

The Creative Archive is a good example of what Cory Doctorow (2004) described as the "remix culture Remix culture is a term employed by Lawrence Lessig to describe a society which allows and encourages derivative works. Such a culture would be, by default, permissive of efforts to improve upon, change, integrate, or otherwise remix the work of copyright holders. " and, to support that culture, archiving and preservation needs to provide access to decontextualized content to feed the growing demand for reuse of material in any number of ways on any number of platforms. While that is not the objective of the first iteration of the Legal and Historical Internet Archive, it is certainly on the agenda for further phases of development.

There is much that could be said in conclusion and not least about the lessons learned so far. For instance, it was crucial to continually remind the project managers of the business drivers for the system's implementation. While a lack of detailed technical knowledge meant occasional "scope creep The continual enhancement of the requirements of a project as the system is being constructed. Scope creep occurs frequently in information systems development and is often responsible for going way over budget when the changes occur in the coding and testing stages rather than in the " and infeasible demands being made on the developers, it was important to maintain a focus on the unique requirements of a server-based archive--not least the need for long-term data integrity and opportunities for as yet undefined digital preservation strategies. That said, it is apparent that traditional archive models developed for a linear, analogue, and physical world are not always appropriate in a technologically advancing one and that there are benefits to be had from embracing that technology. Resources can be directed away from more mundane collection management tasks to tackle the issues of digital curation Digital curation, broadly interpreted, is about maintaining and adding value to a trusted body of digital information for current and future use.

Digital curation encompasses all of the actions needed to maintain digitised and born-digital objects and data over their entire
 and preservation.

But decisions need to be made and policies formulated without having all the answers: Which are the best archive formats Following are popular archiving formats that are used to compress one or more files into a single file. Archive formats are typically capable of also storing folders in order to reconstruct the file/folder relationship when decompressed. See archive.

.ARC Compressed archive.
? Should content be selected for retention, or is that too labor intensive Labor Intensive

A process or industry that requires large amounts of human effort to produce goods.

Notes:
A good example is the hospitality industry (hotels, restaurants, etc), they are considered to be very people-oriented.
See also: Capital Intensive, Trading Dollars
 in a server-based environment where Moore's Law "The number of transistors and resistors on a chip doubles every 18 months." By Intel co-founder Gordon Moore regarding the pace of semiconductor technology. He made this famous comment in 1965 when there were approximately 60 devices on a chip.  dictates that storage costs will keep falling? How will issues be resolved in relation to the archiving of software and hardware? Anyone engaged in digital preservation must accept that data cannot always be saved or accessed in its original format; must be prepared to develop emulated environments or processes for frequent data migration; must work as closely as possible with information technologists and content creators See content provider. ; must openly share experiences and knowledge through national and international bodies and organizations; and must appreciate their own business context--for the Legal and Historical Internet Archive System this includes the challenges of a fast-changing broadcast environment--while remaining focused on what does not change: the need to capture digital material and archive, preserve, and make it accessible.

REFERENCES

British Broadcasting Corporation. (2004). BBC Creative Archive pioneers new approach to public access rights in digital age [Press release]. Retrieved February 28, 2005, from http://www .bbc.co.uk/pressoffice/pressreleases/stories/2004/05-may/26/ creative-archive.shtml.

Department of National Heritage: Broadcasting. (1996a). Copy of the Agreement Dated the 25th Day of January 1996 Between Her Majesty's Secretary of State for National Heritage and the British Broadcasting Corporation [Electronic version]. Retrieved February 28, 2005, from http:// www.bbc.co.uk/info/policies/charter/pdtyagreement_text.shtml.

--. (1996b). Copy of Royal Charter for the Continuance The adjournment or postponement of an action pending in a court to a later date of the same or another session of the court, granted by a court in response to a motion made by a party to a lawsuit.  of The British Broadcasting Corporation [Electronic version]. Retrieved February 28, 2005, from http://www.bbc.co.uk/info/policies/charter/pdf/charter_text.shtml.

Doctorow, C. (2004). Written testimony to Select Committee on Culture, Media and sport [Electronic version]. Retrieved February 28, 2005, from http://www.eff.org/IP/ BBC_CMSC CMSC Consortium of Multiple Sclerosis Centers
CMSC Chicago Metropolitan Ski Council
CMSC Certified Medical Staff Coordinator
CMSC Computer Memory Security Check
CMSC California Mine and Safety Code
CMSC Communication Mode Selector Control
CMSC Changan Mazda Motor Sales Co.
_testimony.php.

Graf, P. (2004). Report of the Independent Review of BBC Online [Electronic version]. Retrieved February 28, 2005, from http://www.culture.gov.uk/NR/rdonlyres/ 395685B7-0373-49E2BD1B-CE9F2E F2E Fax to E-Mail 581E99/0/BBConlinereview.doc.

Queen's Printer of Acts of Parliament. (1996a). Broadcasting Act 1996, Chapter 55 [Electronic version]. February 28, 2005, from http://www.hmso.gov.uk/acts/acts1996/ 1996055.htm.

--. (1996b). Defamation Act 1996, Chapter 31 [Electronic version]. Retrieved February 28, 2005, from http://www.hmso.gov.uk/acts/acts1996/1996031.htm.

--. (1998). Data Protection Act 1998, Chapter 29 [Electronic version]. Retrieved February 28, 2005, from http://www.hmso.gov.uk/acts/acts1998/19980029.htm.

--. (2003a). Communications Act 2003, Chapter 21 [Electronic version]. Retrieved February 28, 2005, from http://www.legislation.hmso.gov.uk/acts/acts2003/ 20030021.htm.

--. (2003b). Legal Deposit Libraries Act 2003, Chapter 28 [Electronic version]. Retrieved February 28, 2005, from http://www.legislation.hmso.gov.uk/acts/acts2003/20030028.htm.

Rein, L. (2004). Brewster Kahle on the Internet Archive and People's Technology. Retrieved February 28, 2005, from http://www.openp2p.com/pub/a/p2p/2004/01/22/kahle.html.

NOTES

(1.) The British Broadcasting Corporation is a public service broadcaster operating on the basis of a Royal Charter and Agreement and funded by a licence fee. It is consequently accountable to the British people See :
  • List of English people
  • List of Scots
  • List of Welsh people
  • List of Northern Ireland people
  • List of Cornish people
  • List of Black Britons
  • List of British Asians
  • List of British Jews
Outwith UK
British Overseas Territories
 to whom it delivers local and national television and radio output, including digital and interactive services. It also produces more than two million Web pages and via the BBC World Service
For the BBC television network, see BBC World.


The BBC World Service is one of the most widely recognised international broadcasters, transmitting in 33 languages to many parts of the world through multiple technologies.
 provides radio programming in forty-three languages (http://www.bbc.co.uk/info/).

(2) OFCOM is the regulator for the UK communications industries communications industry, broadly defined, the business of conveying information. Although communication by means of symbols and gestures dates to the beginning of human history, the term generally refers to mass communications. , with responsibilities across television, radio, telecommunications, and wireless communications wireless communications

System using radio-frequency, infrared, microwave, or other types of electromagnetic or acoustic waves in place of wires, cables, or fibre optics to transmit signals or data.
 services (http://www .ofcom.org.uk/).

(3) The SMEF data model is being marketed by the BBC as BBC AS Bø Byggecompagni As  an industry standard available without charge but governed by a no-signature license (http://www.bbc.co.uk/ guidelines/smef/).

Cathy Smith Cathy Evelyn Smith (1948 in Hamilton, Ontario[1]-) is a former backup singer, and rock star girlfriend, groupie and drug dealer, who served time in the California State Prison system in the manslaughter death of John Belushi. , New Media Archivist, Room BC3 D5, White City: Broadcast Centre, Wood Lane, London W12 7TP, cathy.smith@bbc.uk. Cathy Smith is New Media Archivist at the British Broadcasting Corporation in London, working in the Information and Archives Department. She holds an M.S. in Information Science from City University, London City University London is a British university based at Northampton Square in Finsbury, London (). Its official name is The City University.[3] The University is famous for its excellent graduate employment records. , and has been in her current position since 2000. Responsible for establishing archiving practices and procedures for the BBC's New Media output, one of her main projects is the implementation of an Internet archive system designed to capture content published to bbc.co.uk.
COPYRIGHT 2005 University of Illinois at Urbana-Champaign
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2005, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Smith, Cathy
Publication:Library Trends
Geographic Code:4EUUK
Date:Jun 22, 2005
Words:7642
Previous Article:Exploring variety in digital collections and the implications for digital preservation.
Next Article:Digital archiving in the twenty-first century: practice at the national library of the Netherlands.
Topics:



Related Articles
A Social History of British Broadcasting, vol. 1, 1922-1939: Serving the Nation.
British Broadcasting Corporation -BBC- Joins Growing Excalibur Video Advisory Council.
Teleglobe Announces Full-Time Broadcast Lease Agreement With the British Broadcasting Corporation - BBC -.
Teleglobe Expands Fiber Optic Video Network to Toronto, Vancouver, Washington, D.C.
Capitol Broadcasting Company to Provide Extensive Streaming Media Coverage of the 1999 Special Olympics World Games On the Internet.
FasTV Newsroom Goes Live With Online Video News From Capitol, Cox and Tribune Broadcasting Affiliates.
British Library to Archive E-mails.(Up front: news, trends & analysis)
Introduction.
Activism without borders.(Re-drawing the Boundaries of Activism in a New Media Environment conferance)
Cache money: L.A. firm taps Russian roots for next-generation online archives.(Media & Entertainment)(SNAPSE Inc.)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles