Video Transmission for Third Generation Wireless Communication Systems.This paper presents a twin-class unequal protected video transmission system over wireless channels. Video partitioning based on a separation of the Variable Length Coded (VLC VLC VideoLAN Client VLC Variable Length Coding VLC Very Low Cost VLC Visual Logic Controller VLC Valencia, Spain - Valencia (Airport Code) VLC Virtual Learning Collaborative VLC Variable Length Codeword VLC Very Low Clearance ) Discrete Cosine Transform See DCT. (mathematics) discrete cosine transform - (DCT) A technique for expressing a waveform as a weighted sum of cosines. The DCT is central to many kinds of signal processing, especially video compression. (DCT (Discrete Cosine Transform) An algorithm that is widely used for data compression. Similar to Fast Fourier Transform, DCT converts data (pixels, waveforms, etc.) into sets of frequencies. The first frequencies in the set are the most meaningful; the latter, the least. ) coefficients within each block is considered for constant bitrate Constant bitrate (CBR) is a term used in telecommunications, relating to the quality of service. Compare with variable bit rate. When referring to codecs, constant bit rate encoding means that the rate at which a codec's output data should be consumed is constant. transmission (CBR (1) (Computer-Based Reference) Reference materials accessible by computer in order to help people do their jobs quicker. For example, this database on disk! (2) (Constant Bit Rate) A uniform transmission rate. ). In the splitting process the fraction of bits assigned to each of the two partitions is adjusted according to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. the requirements of the unequal error protection scheme employed. Subsequently, partitioning is applied to the LTU-T H.263 coding standard. As a transport vehicle, we have considered one of the leading third generation cellular radio standards known as WCDMA (Wideband CDMA) A 3G high-speed digital data service provided by cellular carriers that use the TDMA or GSM technology worldwide, including AT&T (formerly Cingular) and T-Mobile in the U.S. . A dual-priority transmission system is then invoked on the WCDMA system where the video data, after being broken into two streams, is unequally protected. We use a very simple error correction coding scheme for illustration and then propose more sophisticated forms of unequal protection of the digitized video signals. We show that this strategy results in a significantly higher quality of the reconstructed video data when it is transmitted over time-varying multipath fading channels. Key words: multimedia communications; third generation mobile systems; WCDMA; wireless video. Accepted: November 1, 2000 Available online: http://www.nist.gov/jres 1. Introduction The wireless revolution in the 1 980s was primarily driven by market demands for mobile radio voice communications. The first generation analog wireless communication systems used old radio technologies combined with novel cellular network planning [1] to provide transparent ubiquitous mobile access to users. Initially, the cellular market was a niche market A niche market also known as a target market is a focused, targetable portion (subset) of a market sector. By definition, then, a business that focuses on a niche market is addressing a need for a product or service that is not being addressed by mainstream providers. targeted at business people. As the price of the user terminals and services dropped due to an unprecedented exponential increase in demand and related economies of scale, cellular radios became the communications tool of choice for the masses. Today, there are in the neighborhood of 300 million mobile phone users in the world. It is expected that in the next decade the number of users will reach the one billion mark. The second generation wireless systems were introduced in the 1990s and were primarily an evolutionary step towards improving the capacity of cellular systems through digitization of voice and efficient digital modulation The altering of a carrier by a digital signal. See modulation and carrier. schemes. These systems also provided additional features such as security, short messaging See SMS. , and circuit-switched data. We are now in the midst Adv. 1. in the midst - the middle or central part or point; "in the midst of the forest"; "could he walk out in the midst of his piece?" midmost of another evolution planned for deployment in the 2000s. This evolution is once again driven by the need for greater bandwidth in anticipation of further demand for voice services. However, the new wireless standards will also provide a pipeline for broadband services such as enhanced high rate data and multimedia services. The global growth of interest in the Internet and in digitized audio and video, and the demand for such services through the fixed communications networks, is an important factor. Although there are currently no applications with great mass market appeal that require broadband wireless See wireless broadband. access, it is anticipated that the popularity of these services in fixed networks will eventually impact the market for wireless communications wireless communications System using radio-frequency, infrared, microwave, or other types of electromagnetic or acoustic waves in place of wires, cables, or fibre optics to transmit signals or data. . The next generation wireless systems are required to have voice services of wireline quality and to provide high bit rate data services of 144 kbit/s to 2 Mbit/s depending on the radio environment. At the same time, they are to operate reliably in different types of environments: macro, micro, and pico cellular; urban, suburban, and rural; indoor and outdoor. In other words Adv. 1. in other words - otherwise stated; "in other words, we are broke" put differently , the next generation systems are supposed to have better quality and coverage, be more power and bandwidth efficient, and be deployed in diverse environments. These high data rates make video transmission possible for a number of applications such as video conferencing See videoconferencing. (communications) video conferencing - A discussion between two or more groups of people who are in different places but can see and hear each other using electronic communications. , emergency medical services An Emergency medical service (abbreviated to initialism "EMS" in many countries) is a service providing out-of-hospital acute care and transport to definitive care, to patients with illnesses and injuries which the patient believes constitutes a medical emergency. , and site surveys. However, since most existing video compression Encoding digital video to take up less storage space and transmission bandwidth. See video codec and data compression. video compression - Compression of sequences of images. standards [7,8,9,10,11] have been developed for relatively benign, nearly error-free environments, they cannot be directly applied in a hostile mobile domain. This is mainly due to the extensive employment of variable length coding techniques, which are efficient in bitrate reductio n terms, but are error-sensitive. A single transmission error may result in an undecodable string of bits. One effective method of protecting the compressed video compressed video - video compression signal is to split the coded video signal into a number of separate bitstreams where each can be transmitted via a separate channel having a different degree of error protection [12,13,14]. The bitstream splitting can be accomplished by taking into consideration the perceptual significance of coded video, where better protection is provided for the transmission of the perceptually more important bits. In this paper, such a strategy has been considered for one of the leading third generation cellular radio standards known as WCDMA. The paper begins with an overview of the WCDMA radio standard [2]. Then, a video partitioning scheme based on the ITU-T See ITU. ITU-T - International Telecommunications Union H.263 standard, is presented. This is followed by a very simple error protection coding scheme for the transmission of partitioned video over IMT-2000 channels. The transmission system model and its parameters are then discussed. This is followed by simulation results that evaluate the transmission of partitioned video using the downlink WCDMA physical layer as a transport vehicle. Finally, for possible future investigations, more sophisticated forms of unequal protection for the digitized video signals are discussed. 2. Review of WCDMA Radio Standards It seems that a common international standard for the third generation wireless communications systems may soon be adopted. The international standard may have three radio access schemes and is currently being harmonized har·mo·nize v. har·mo·nized, har·mo·niz·ing, har·mo·niz·es v.tr. 1. To bring or come into agreement or harmony. See Synonyms at agree. 2. Music To provide harmony for (a melody). by the Third Generation Partnership Project (3GPP GPP Government Performance Project GPP General Purpose Processor GPP General Physical Preparedness GPP Gambian People's Party GPP Good Pharmacy Practice GPP Gross Primary Productivity GPP Green Procurement Program GPP Generic Packetized Protocol ). The three radio access schemes are the Frequency-Division-Duplex (FDD (1) Abbreviation for floppy disk drive. See floppy disk. (2) (Frequency Division Duplexing) A transmission method that separates the transmitting and receiving channels with a guard band (some amount of spectrum that acts as a buffer or insulator). ) mode, the Time-Division Duplex (TDD (Time Division Duplexing) A transmission method that uses only one channel for transmitting and receiving, separating them by different time slots. No guard band is used. Contrast with FDD. See also TDD/TTY. TDD - Telecommunications Device for the Deaf ) mode and the Multi-Carrier (MC) mode. The FDD mode is a direct sequence (DS) CDMA (Code Division Multiple Access) A method for transmitting simultaneous signals over a shared portion of the spectrum. The foremost application of CDMA is the digital cellular phone technology from QUALCOMM that operates in the 800 MHz band and 1.9 GHz PCS band. scheme initially proposed by Japan's Association of Radio Industries and Businesses “ARIB” redirects here. For other uses, see ARIB (disambiguation). The Association of Radio Industries and Businesses, commonly known as ARIB (ARIB is generally pronounced "araib"), is a standardization organization in Japan. (ARIB ARIB Association of Radio Industries and Businesses ) and the European Telecommunications Standards Institute (body) European Telecommunications Standards Institute - (ETSI) A European version of the ITU-T(?). (ETSI (European Telecommunications Standards Institute, Sophia Antipolis technical park, Nice, France, www.etsi.org) A non-profit membership organization founded in 1988, dedicated to standardizing information and communication technologies (ICT) throughout Europe. ). The original proposed radio standard was called the UMTS (Universal Mobile Telecommunications System) The GSM implementation of the 3G wireless phone system. Part of IMT-2000, UMTS provides service in the 2 GHz band and offers global roaming and personalized features. Terrestrial Radio Access or UTRA UTRA UMTS Terrestrial Radio Access UTRA Universal Terrestrial Radio Access which included a TDD mode for deployment in unpaired bands. The multi-carrier mode is based on the Telecommunications Industry Association See TIA. (body, standard) Telecommunications Industry Association - (TIA) An association that sets standards for communications cabling. Cables that TIA set standards for include: EIA/TIA-568A and EIA/TIA-568B category three, four and five cable. (TIA (1) (Telecommunications Industry Association, Arlington, VA, www.tiaonline.org) A membership organization founded in 1988 that sets telecommunications standards worldwide. It was originally an EIA working group that was spun off and merged with the U.S. ) standard known as CDMA2000 or IS2000. In the next section, we discuss the physical layer parameters and spreading and modulation for the FDD mode of 3GPP radio access scheme and we will refer to it as WCDMA [2]. Please note that the information in this paper may not be consistent with the latest developments in the standard. We intend to give the reader a basic understanding of concepts and do not intend this paper to be a replacement for the detailed specifications, which may not be finalized. We then simulate the proposed video coding Video coding is the field in electrical engineering and computer science that deals with finding efficient coding formats and algorithms for digital video. Video data usually not only contains visual information but also audio. scheme over the downlink physical layer of WCDMA and report the performance results. 2.1 The WCDMA Airlink Parameters The modulation chip rate for WCDMA is 3.84 Mega Chips Per Second (Mcps). [1] The number of chips per modulation symbol is called the Spreading Factor spreading factor said of biological fluids, used in pharmaceutical preparations. See hyaluronidase. (SF). The specified pulse shaping In digital telecommunication, pulse shaping is the process of changing the waveform of transmitted pulses. Its purpose is to make the transmitted signal suit better to the communication channel by limiting the effective bandwidth of the transmission. roll off factor is 0.22. This results in an effective minimum bandwidth of 3.84(1+0.22) which is less than 5 MHz (MegaHertZ) One million cycles per second. It is used to measure the transmission speed of electronic devices, including channels, buses and the computer's internal clock. A one-megahertz clock (1 MHz) means some number of bits (16, 32, 64, etc. . The nominal channel spacing The amount of bandwidth allotted to each channel in a communications system that transmits multiple frequencies such as fiber optics. It is measured as the spacing between center frequencies (or wavelengths) of adjacent channels. See guard band. for WCDMA 5 MHz. The WCDMA carriers are specified on a grid with a resolution of 200 kHz. Although the typical separation between the carriers is 5 MHz, it may vary depending on the deployment scenario. The frame length is fixed at 10 ins, which corresponds to 38 400 chips. The number of bits or symbols in a frame may vary depending on the data rate, which is variable. The variable data rates are supported using variable spreading factors. The transmissions from all the physical channels with in a 5 MHz band are code division multiplexed and transmitted over the same band. We discuss two channels in this paper: i) Dedicated Physical Data Channels (DPDCH DPDCH Dedicated Physical Data Channel (3GPP) DPDCH Dedicated Packet Data Channel ) and, ii) Dedicated Physical Data Channels (DPCCH DPCCH Dedicated Physical Control Channel ). Table 1 summarizes the bit rates, symbol rates, and spreading factors for these channels. All channels are spread by a spreading code with a spreading factor (SF) that may vary from 4 to 512 on the downlink and 4 to 256 on the uplink, depending on the data rate of the channel. The spreading factors must be an integer power of 2. 2.2 The WCDMA Downlink The frame structure for the downlink DPCCH/DPDCH channel is shown in Fig. 1. A super frame is made up of 72, 10 ms frames. Each frame has 15 equal-length slots. As shown in Fig. 1, the DPCCH and DPDCH are time multiplexed within the same slot. The DPCCH channel consists of pilot symbols, transmit power-control (TPC (Transaction Processing Performance Council, San Francisco, CA, www.tpc.org) An organization devoted to benchmarking transaction processing systems. In order to derive the number of transactions that can be processed in a given time frame, TPC benchmarks measure the total performance of ) bits and transport format combination indicator (TFCI TFCI Transport Format Combination Indicator TFCI Transport Format Combination Indication ) bits (please note that TFCI is optional and is not used for fixed rate services). The DPDCH channels contain bearer data. The number of bits transmitted in a single slot depends on the data rate of the channel. 2.3 The Downlink Traffic Channel (Forward DPDCH/DPCCH) Figure 2 shows the downlink DPCCH/DPDCH transmit processing for one channel in a 10 ms frame. The process shown in the figure is repeated every 10 ms [2]. A DPCCH/DPDCH channel carries 2n bits of information in every 10 ms frame. The actual number of bits depends on the data rate and hence the spreading factor. The 2n bits are mapped onto n QPSK (Quadrature Phase Shift Keying) A phase modulation technique that transmits two bits in four modulation states. See PSK and phase modulation. complex symbols. The QPSK (quadrature Phase Communication signals often have the form:
SRRC Synchrotron Radiation Research Center SRRC Square Root Raised Cosine (filtering technique) ) with roll-off factor a = 0.22. The I-channel and Q-channel signals are then RF converted using quadrature quadrature, in astronomy, arrangement of two celestial bodies at right angles to each other as viewed from a reference point. If the reference point is the earth and the sun is one of the bodies, a planet is in quadrature when its elongation is 90°. (cos [omega]t and sin [omega]t) carriers, and finally transmitted through the air. The spreading codes are orthogonal At right angles. The term is used to describe electronic signals that appear at 90 degree angles to each other. It is also widely used to describe conditions that are contradictory, or opposite, rather than in parallel or in sync with each other. sequences whose elements are either 1 or -1, and so are the elements of the non-orthogonal long scrambling codes. 2.4 The WCDMA Uplink The frame structure for the uplink DPCCH/DPDCH channel is shown in Fig. 3. The basic parameters for uplink framing are the same as the downlink. The only difference is that the DPDCH and DPCCH are not time multiplexed. Figure 4 shows the uplink DPCCH/DPDCH transmit processing for one channel. Unlike the forward channel, the DPDCH and DPCCH are transmitted in parallel and are separated using different channelization chan·nel·ize tr.v. chan·nel·ized, chan·nel·iz·ing, chan·nel·iz·es 1. To make, form, or cut channels in. 2. To direct through a channel. (spreading) codes. Also, the reverse transmissions carry feedback information bits (FBI). The n information bits from the DPDCH channel are BPSK BPSK Binary Phase Shift Keying BPSK biphase shift keying (US DoD) BPSK Bi-Phase Shift Keying modulated resulting in n symbols which are then spread using a spreading code [C.sub.i] of length [L.sub.d] to produce 38 400 chips. Similarly, m DPCCH bits are mapped onto m BPSK (Binary Phase Shift Keying) symbols which are spread using a spreading code [C.sub.j] of length [L.sub.c] to produce 38 400 chips. In other words, [nL.sub.d] = [mL.sub.c] = 38 400. The chips from the DPDCH are mapped onto the in-phase axis and those from DPCCH are mapped on to the quadrature axis of the the diameter of the sphere which is perpendicular to the plane of the circle. See also: Axis signal constellation. This is shown in Fig. 4 by a j multiply followed by an add operation. The signal constellation is, in effect, a QPSK constellation. Therefore, the modulation scheme for the reverse channel is called dual-channel QPSK. The resulting chips are then scrambled by a complex scrambling code [S.sub.uplink]. The uplink scrambling code is either a complex short code which is 256 chips long or a complex long code which is 38 400 chips long. When the short code is use d, the code is repeated 150 times in order to scramble the 38 400 chip frame. The resulting chips are then passed through a square root raised cosine filter (SRRC) with roll-off factor a = 0.22, are IF/RF IF/RF Intermediate Frequency/Radio Frequency converted and then transmitted through the air. 2.5 The Spreading and Scrambling Codes There are two major types of codes specified for WCDMA channels: orthogonal spreading codes and non-orthogonal long and short scrambling codes. The orthogonal spreading codes are used for channelization and the short and long scrambling codes are used for reducing inter-cell and intra-cell interference. In this paper, we do not discuss the detailed specifications of these codes. Instead, we will give you an overview of the different families of codes and their functions. 2.5.1 Orthogonal Variable Spreading Factor Orthogonal variable spreading factor (OVSF) is an implementation of Code division multiple access (CDMA) where before each signal is transmitted, the spectrum is spread through the use of a user's code. (OVSF OVSF Orthogonal Variable Spreading Factor ) Codes These spreading codes, also known as channelization codes, are used to ensure orthogonality between the channels with different spreading factors and rates and are hence called Orthogonal Variable Spreading Factor (OVSF) codes [4]. On the downlink, the orthogonal spreading codes are used to separate the transmissions of the various traffic and control channels within a given cell. On the uplink, the use of spreading codes is not coordinated by the system. The base station and the mobile station agree only on the number of orthogonal channels and hence the number of codes, but the spreading codes used by the mobile stations is not negotiated. Therefore, two mobile stations in a given cell may use the same spreading code on the uplink. In such cases, the resulting interference is suppressed by the uplink scrambling codes which are unique to every mobile station in a given cell. The OVSF codes are generated using the code tree shown in Fig. 5. The construction of these codes is very similar to Hadamard codes. Two branches emanate em·a·nate intr. & tr.v. em·a·nat·ed, em·a·nat·ing, em·a·nates To come or send forth, as from a source: light that emanated from a lamp; a stove that emanated a steady heat. from each code in the code tree. The top emanating branch is simply the code from the mother branch repeated twice and the bottom branch is that same code followed by its negation. Not all the OVSF codes are mutually orthogonal. A given code can be used in a cell if and only if there are no other codes used on the path from that given code, to the root of the tree, or any code belonging to the sub-tree generated from that specific code. For instance, if [C.sub.4,1] is used, then [C.sub.2,1] and [C.sub.1,1] may not be used in the same cell as they are in the path to the root. Also, [C.sub.8,1] and [C.sub.8,2] and all other codes derived from [C.sub.4,1] cannot be used in the same cell. Depending on the required data rate, the length of the spreading code applied on a given DPCCH/DPDCH channel may vary from 4 to 512 on the downlink and 4 to 256 on the uplink ( see Table 1). 2.5.2 The Non-Orthogonal Scrambling Codes There are three classes of scrambling codes used in WCDMA; 1) downlink long scrambling codes, 2) uplink short scrambling codes, 3) uplink long scrambling codes. In order to reduce the interference from the channels in adjacent cells in the downlink, every base station uses a long scrambling code. The downlink long scrambling codes are from the well-known family of Gold codes [5] built by the product of two maximum linear PN (Pseudo Noise) sequences. These codes have good cross-correlation properties and are widely used in commercial spread spectrum systems. The long code used on the forward DCDPCH/DCCPCH is 38 400 chips of a [2.sup.15] - l Gold Code. The uplink short and long scrambling codes are used to suppress intercell and intracell interference from the various mobile stations at the base station receiver. The mobile stations in a given cell are each assigned a unique short scrambling code. These short scrambling codes are from the periodic extended S(2) code family of length 256. Because of the large number of these codes, it is possible to have a unique code for each mobile station in a given cell and also to make sure that the mobile stations in the neighboring cells are assigned different codes. The long code used on the reverse DCDPCH/DCCPCH is 38 400 chips of a complex [2.sup.25] - 1 Gold Code. The quadrature component of the code is a decimated and shifted version of the in-phase component of the same Gold code. The quadrature component is also multiplied by repetitions of a 1 and -1 sequence to improve the envelope properties of the signal. The WCDMA has provisions for multi-user detection where a base station can detect multiple users at the same time, hence, increasing the effective signal-to-noise-ratio (SNR See signal-to-noise ratio. SNR - signal-to-noise ratio ) of all the users. Only short scrambling codes are applied when multi-user detection is used at the base station receiver. Otherwise, only long scrambling codes are used. 3. Source Coding and Video Partitioning Given a certain total bitrate budget, use of strong error-correction coding increases the error resilience of the video transceiver scheme at the cost of reducing the number of bits available for video coding. Similarly, it is possible to increase the power of the channel codec (1) (enCOder/DECoder) A hardware circuit that performs analog-to-digital conversion (ADC) and digital-to-analog (DAC) conversion. When analog signals are entered into a computer, cellphone or other device via a microphone or video source such as VHS tape or analog TV, assigned to the protection of the perceptually most important video bits at the cost of reducing the protection of the less important video bits. However, it is intractable to directly optimize the associated partitioning. Furthermore, it is possible to assign the video bits to a high number of bit protection classes; however, in most practical cases, it is sufficient to employ two or three protection classes [13]. The partitioning process is, in general, described with the aid of the percentage of bits assigned to the individual partitions. The number of different bit-sensitivity classes and the video bits assigned to them has to be decided on the basis of the visual importance differences of the various video bits, although the rigorous formal evaluation of these sensitivities is an arduous and time-consuming task. Hence, in a somewhat simplistic sim·plism n. The tendency to oversimplify an issue or a problem by ignoring complexities or complications. [French simplisme, from simple, simple, from Old French; see simple , but plausible approach, it is often argued that in a subjective sense, the visual importance of the various video parameters is related to the spatial frequency In mathematics, physics, and engineering, spatial frequency is a characteristic of any structure that is periodic across position in space. The spatial frequency is a measure of how often the structure repeats per unit of distance. of the video features described by the parameters. This is also true in the case of interframe coding In video compression, the coding of the differences between frames. Interframe coding often provides substantial compression because in many motion sequences, only a small percentage of the pixels are actually different from one frame to another. , when the objects move at a high velocity, since the combined effects of high-velocity, high-frequency video contents cannot be accurately resolved by the human eye due to its relatively low so-called fusion frequency. The above observation is inherently exploited in sub-band [12] and Discrete Cosine Transform (DCT) [15,161 based coding, since typically a lower proportion of bits is assigned to the high-frequency video components than to their low-frequency counterparts. Hence, the partitioning of their bitstreams is relatively straightforward. The so-called inter-frame hybrid DCT coding technique has been adopted for most practical video codecs, including the existing video coding standards [7,8,9,10,11]. The partitioning of the associated video reams has also received considerable attention in recent years; e.g., in the context of Asynchronous Transfer Mode See ATM. (communications) Asynchronous Transfer Mode - (ATM, or "fast packet") A method for the dynamic allocation of bandwidth using a fixed-size packet (called a cell). See also ATM Forum, Wideband ATM. ATM acronyms. Indiana acronyms. (ATM) networks for mitigating the effects of cell loss [17,18,19], and for providing SNR scalability--a term indicating that different quality video streams corresponding to different bitrates can be generated--as provisions in the MPEG-2 [11] and H.263 standards [9]. The basic block diagram A chart that contains squares and rectangles connected with arrows to depict hardware and software interconnections. For program flow charts, information system flow charts, circuit diagrams and communications networks, more elaborate graphical representations are usually used. of an interframe hybrid DCT video encoder is depicted in Fig. 6. According to this approach, a video frame is first divided into non-over-lapping blocks of 8 X 8 pixels, where each block is then DCT transformed, quantized quan·tize tr.v. quan·tized, quan·tiz·ing, quan·tiz·es Physics 1. To limit the possible values of (a magnitude or quantity) to a discrete set of values by quantum mechanical rules. 2. (Q) and VLC coded. Except for the first video frame, which has to be intraframe coded (I-frame), the remaining frames may use a previously reconstructed frame known as the predicted or P-frame for motion prediction and compensation. At the cost of additional frame delays, both previous and future reconstructed frames may also be considered for motion prediction. This is known as bi-directional prediction, which has not been considered in our further elaborations. For interframe prediction, a larger block of typically 16 X 16 pixels consisting of four neighboring 8 X 8 luminance The amount of brightness, measured in lumens, that is given off by a pixel or area on a screen. For example, dark red and bright red would have the same chrominance, but a different luminance. DCT blocks--referred to as a macroblock (MB)--is used to perform block matching motion estimation and compensation. We note, furthermore, that a color MB also contains the so-called color difference Refers to the method of encoding color information in video/TV signals. The color difference signal designations are B-Y and R-Y, Cb and Cr, Pb and Pr, I and Q, and U and V. See YUV and YUV/RGB conversion formulas. components, which are processed at half the luminance resolution in vertical and horizontal directions. Since there are two color difference components, a MB can be viewed as though it was represented by six blocks. The estimated displacement motion vectors are multiplexed with the DCT coded data and transmitted as a part of the hierarchically ordered macroblock information. The multiplexing structure of all existing video standards is generally based on a hierarchical, self-descriptive structure of the encoded parameters. For example, in the H.263 standard the video-coded information for each frame is arranged in four hierarchical layers. The top layer is the picture layer followed by a Group of Block (GOB Gob (gŏb), in the Bible, town, SW ancient Palestine. ) layer comprising a number of consecutive macroblocks, then the Macroblock layer, and finally a block layer. Each layer is furnished with some header information that may include synchronization (1) See synchronous and synchronous transmission. (2) Ensuring that two sets of data are always the same. See data synchronization. (3) Keeping time-of-day clocks in two devices set to the same time. See NTP. bits such as picture start code, PSC (Public Service Commission) Same as PUC. , and GOB start code (for the two top layers), and that defines the nature of the information associated with each layer (e.g., inter/intra-type, quantization (1) The division of a range of values into a single number, code or classification. For example, class A is 0 to 999, class B is 1000 to 9999 and class C is 10000 and above. (2) In analog to digital conversion, the assignment of a number to the amplitude of a wave. parameter, and motion vectors). If the header information of a specific video frame is lost during transmission, the decoder will have no indication as to how the frame, GOB, or MB has been coded. Therefore, any further data received will be undecodable, until the next PSC is recognized in the received bit pattern, e.g., by invoking correlation techniques. As expected, the DCT coefficients associated with the particular video blocks are transmitted at the block layer and errors occurring in the DCT coefficients imply that the corresponding DCT coefficients are lost, since this information was variable length coded. If the transmission errors affected only higher frequency coefficients, the damage would be less catastrophic, since the more visually important low-frequency coefficients may have been recovered already. It is, however, important to protect the most error sensitive header information and as many lower frequency DCT coefficients as possible. The coding parameters hence have to be partitioned into a number of bit protection classes, in order to facilitate source-sensitivity matched error protection. We should point out, however, that partitioning could result in a significant increase in the overall bitrate. This is caused by not only the additional partitioning overhead required for synchronization, but also by the coding inefficiencies that can arise from separating the DCT coefficients prior to variable length (VL) coding. For this reason, we argue that it is advantageous if partitioning can be accomplished in terms of the VL-coded DCT coefficients. In addition, we are concerned with developing a video stream partitioning scheme based on constant bitrate (CBR) transmission. We note, further- more, that there are no fundamental requirements for the partitions to be of equal size. In fact, the partitions are often of differing sizes depending on the error protection strategy with respect to the channel conditions. 3.1 VLC-based Partitioning As mentioned earlier, the separation of the quantised DCT coefficients into different protection classes can be arranged either before or after VL-coding. In the former case, i.e., when partitioning before VL-coding, which we refer to as fixed-zone partitioning, a given number of lower frequency DCT coefficients generated by the standard zigzag scanning process [16] is earmarked for transmission over a higher-integrity subchannel, which can be constituted, for example, by the higher-integrity transmission channel. In our experiments we observed, however, that this arrangement could significantly increase bits per block, mainly due to the breakup of the run-level symbol near the DCT cutoff region [14]. Alternatively, if the block is partitioned after VLC coding, by selecting a fixed number of VLCs (i.e., VLC-based partitioning) no additional bits would be needed. The main concern with this approach, however, is that the number of DCT coefficients within the upper-zone may change from one block to the next. This is due the nature of run-level coding, where each VLC can represent a different number of DCT coefficients. Consequently, this affects the progression of noise at the receiver as the lower priority partition (second partition) is often expected to be lost. For a better clarification of the above argument, let us assume that the upper-zone, as shown in Fig. 7a, corresponds to the number of DCT coefficients m that is selected for the current coding block. In addition, suppose that, when the motion compensated block (reference block) was partitioned during the transmission of the previous frame, the number of its DCT coefficients for the first partition resulted in an "n" number of coefficients where n [less than]m. The upper-zone difference between the two blocks is shown in Fig. 7b, where the darker shaded area represents the selected upper-zone region of the reference block. Recognizing that the coefficients within the zone difference belong to the second partition of the reference block, now consider the situation when the second partition had been corrupted by errors during the transmission of the previous frame. Under such a condition, the upper zone DCT representing the first partition of the current interframe block cannot be properly reconstructed due to the loss of DCT information within the zone difference. This would, consequently, cause a drift between the local and remote decoder, regardless of how well the first partition is protected. The visual impact of such distortion depends on the number as well as the magnitudes of the non-zero coefficients that fall within the zone difference, m -- n [14]. However, the progression of such distortion and its visual effects does not appear to be of grave concern-as far as the intraframe reset can be accomplished in a reasonable period (see the results section). 3.2 Splitting Scheme In order to develop a robust partitioning scheme, the splitting mechanism should take into consideration the instantaneous variations of the coded video. Since we are dealing here with CBR transmission, this is arranged in accordance with the measure of buffer fullness, as each partition is equipped with a separate buffer. The control management of these buffers is handled by the buffer control unit (BCU BCU British Canoe Union BCU Basic Command Unit (British Police) BCU Big Close-Up BCU Bus Controller Unit BCU Battery Coolant Unit BCU Bandwidth Control Unit BCU Beer Cans Unlimited (collectors) ) [143. Its function is to calculate and compare the occupancies between the two buffers. Further, it instructs the bitstream splitter to select one of the following options for the first partition: 0) picture and GOB headers, 1) all-headers, 2) all-headers + first VLC, 3) all-headers + first two VLCs. This information, which will be referred to as "cut-off cut-off Anesthesiology The point at which elongation of the carbon chain of the 1-alkanol family of anesthetics results in a precipitous drop in the anesthetic potential of these agents–eg, at > 12 carbons in length, there is little anesthetic activity, value", is represented by 0, 1, 2, and 3, respectively, and should be included in the header of the particular transmission layer on which it will be updated. To avoid frequent buffer overflow/underflow we have observed that the cut-off value can be reasonably accomplished at the GOB level. More importantly, the cut-off value can be embedded into the Group Number (GN). The GN is a fixed length codeword of 5 bits. The bits are the binary representation of the GOB numbers in a frame. For a QCIF See CIF. QCIF - Quarter CIF format (176 pixels by 144 lines per frame) there exist altogether nine GOBs. Since the first group of blocks do not require GOB information (as it is placed immediately after the picture information), three bits would be sufficient to transmit the group number (excluding the first GOB number). In our simulation model this has been arranged by sending the group number for the first two GOBs as "0" and the remainder as sequential numbers. With this arrangement, the two most significant bits of the 5 bit GN will be free and thus, can be utilized for transmitting the cut-off values. To preserve the integrity of the H.263 syntax, in the re-assembling process (i.e., pre-decoder) the group numbers will be put back to their original format before being decoded by the H.263 standard decoder. Under this arrangement, no extra bits will be added to the bitstream [14]. The only extra information would be for frame synchronization While receiving a stream of framed data, frame synchronization is the process by which incoming frame alignment signals, i.e., distinctive bit sequences (a syncword), are identified, i.e. , such as picture start code (PSC), temporal reference code (TR), GOB start code (GBSC GBSC Group of Blocks Start Code GBSC Greater Boston Soaring Club (aviation) GBSC Great Britain Ice Hockey Supporters Club GBSC Global Biologics Supply Chain, LLC GBSC Georgian Bay Sailing Centre (Canada) ) and a GN codeword (3 bits only), which are added to the second bitstream at the beginning of each frame, including its GOBs. The decoder could use this to align the two bitstreams in order to make the second bitstream more robust to transmission errors. Next, we discuss the manner in which the two partitions are formed. For this purpose, let's consider a scenario where the cut-off value indicates that at least two VLCs should be selected for the upcoming GOB. In this case, as shown in Fig. 8, the first partition begins with the GOB header followed by the MB header and the first two VLCs from each block in the transmitting order. This process will continue until the selection value is updated at the next GOB. The remaining VLCs are subsequently transferred to the second partition in the same order. It should be noted that the second bitstream does not carry any VLCs from the blocks whose last VLCs are included in the first partition or identified as zero blocks by the macroblock header (i.e., B2, B5, B6 in Fig. 8). At the receiver, the two bitstreams are joined together to form the original H.263 bitstream via a pre-decoder unit. The pre-decoder's initial task is to read the two most significant bits of the GN number to extract the cut-off value. From the cut-off value, the pre-decoder can determine from which of the two bitstreams it should obtain the macroblock information if the option-0 (split-header) has been identified. Otherwise, the pre-decoder can determine from the cut-off value how many VLCs it should read from the first bitstream. If the last VLC is not coded as a LAST-VLC (please note that a separate codeword is assigned to the last run-level symbol in a DCT block to signify the end-of-block), the remaining VLCs will then be read from the second bitstream until the last LAST-VLC is reached. If an uncorrectable error has occurred in the second bitstream, the program is able to detect it but unable to pinpoint the exact location. Under such a condition, the second bitstream will be ignored for the entire GOB. However, to reconstruct the legal H.263 bitstream, every block within a GOB that does not contain a LAST-VLC (e.g., B1, B3, and B4 in Fig. 8) should be closed. This is done by replacing every codeword (e.g., second VLC according to Fig. 8) by its equivalent LAST-VLC codeword. If the cut-off value has identified an option-0 (split-header), all the blocks in the GOB. will be considered uncoded un·cod·ed adj. Not coded, especially not having or not showing a Zip Code. . This is done by changing the COD flag. This situation normally occurs for P-frames only. 4. The Dual-Priority WCDMA Simulation Model Figure 9 shows the system model for the multi-priority video transmission scheme over the 3GPP FDD downlink channel. The video signal, after being encoded, is partitioned into two separate bitstreams. The first stream, containing the control and header information, is channel encoded using a simple repetition scheme where each bit is repeated three times. The second stream is passed through without encoding and multiplexed with the encoded stream. The multiplexed data is then sent over the 3GPP physical layer, where the resulting bits are first mapped onto a QPSK signal, spread using an OVSF code, and scrambled using a Gold code [5]. The resulting complex baseband signal is subsequently forwarded to a Square Root Raised Cosine (SRRC) filter. The roll-off factor of the SRRC filter is 0.22, as specified for 3GPP. The filtered signal is then sent through a wideband fading channel with 6 paths, in accordance with the IMT IMT, n.pr See inspiratory muscle training. 2000 specification (the parameters of the channel will be discussed later). Noise and interference are then added. The noise and interference have been modeled as Additive White Gauassian Noise (AWGN AWGN Additive White Gaussian Noise AWGN Adaptive White Gaussian Noise ). The signal is then sent to the receiver SRRC filter, and then to a six finger Rake receiver A rake receiver is a radio receiver designed to counter the effects of multipath fading. It does this by using several "sub-receivers" each delayed slightly in order to tune in to the individual multipath components. w here the signals on all the six paths are selected and combined coherently. The combined signal is then sent for QPSK demodulation demodulation: see modulation. See demodulate. (communications) demodulation - To recover the signal from the carrier. For example, in a radio broadcast using amplitude modulation the audio signal is transmitted as the mean amplitude of a . Finally, the resulting bits are demultiplexed into two streams. The bits corresponding to the repetition encoded stream on the transmit side are sent to a majority logic (ML) decoder. The output of the ML decoder is sent for bit stream reassembly reassembly - segmentation . The bits corresponding to the uncoded stream are directly passed through for bit stream reassembly. The reassembled video stream is then sent to the video source decoder to reconstruct the video signal. 4.1 Model Parameters The simulation model for the transmission system was set up using the Signal Processing See DSP. Worksystem (SPW SPW Signal Processing Workstation SPW Shelter in Place Warning SPW Spencer, IA, USA - Spencer Municipal Airport (Airport Code) SPW Special Purpose Weapon SPW Spokane Washington (border patrol sector) ) [3]. The parameters for the simulations are discussed below. The spreading factor was set to 64, which results in 80 bearer bits (40 symbols) in each slot. We used 16 of those bits or 8 symbols as pilot symbols, and the remaining 64 bits are used for data transmission. The pilot bits are placed at the beginning of each slot. The instantaneous pilot power was set 3 dB above the data. The chips are oversampled by a factor of 8. The SRRC filter with a roll-off factor of 0.22 was implemented with a complex Finite Impulse Response (electronics, DSP) Finite Impulse Response - (FIR) A type of digital signal filter, in which every sample of output is the weighted sum of past and current samples of input, using only some finite number of past samples. (FIR) filter with 128 taps. The channel impulse response In simple terms, the impulse response of a system is its output when presented with a very brief signal, an impulse. While an impulse is a difficult concept to imagine, and an impossible thing in reality, it represents the limit case of a pulse made infinitely short in time for the vehicular IMT-2000 channel model is shown in Table 2. In our simulations, we assume ideal finger search for the Rake receiver. That is, we assume that each finger in the receiver has perfect synchronization with the corresponding path in the channel. We also assume perfect sub-chip synchronization (ideal phase locked lo op). However, we used weighted multi-slot averaging (WMSA WMSA Weighted Multi-Slot Averaging WMSA West Missouri Shooters Alliance WMSA Women in Military Service for America ) for channel estimation in each finger. The channel estimation technique uses six consecutive slots for pilot estimation and then weighs the estimate across the six slots. The weighing for WSMA WSMA Washington State Medical Association WSMA Wisconsin School Music Association WSMA Wisconsin Society of Medical Assistants WSMA Wisconsin State Music Association WSMA Web Site Management Application WSMA Washington State Muzzleloaders Association is shown in Table 3. The combining scheme is maximal ratio combining with finger selection. Only those fingers whose power is greater than 10% of the maximum power are selected for combining. The majority logic decoder is a hard decision decoder. It chooses 1 if two or more 1's are detected, and chooses 0 if 2 or more 0's are detected. The simulations were run for a mobile speed of 60 km/h. For a carrier frequency of 2 GHz this translates to a Doppler frequency of 111.2 Hz. Since a spreading factor of 64 was selected, the effective data rate used in our model was 96 kbit/s (only pilot channel consisting of eight symbols per slot was considered for DPCCH). For every one information bit (or 3 encoded bits) from stream 1, three uncoded bits of stream 2 are transmitted and the source coding rate is, therefore, 3/5 times the effective data rate. In other words, the data rate for the partitioned video signal is 57.6 kbit/s with a splitting fraction of X = 1/3 (see Fig. 9). This would result in the following bitrates for each partitioned bitstream: [R.sub.bistream-1] = XR = (1/3) (57.6) = 19.2 kbit/s [R.sub.bistream-2] = (2/3) (57.6) = 38.4 kbit/s where R is the video coded bitrate. 5. Results and Discussion 5.1 Video Partitioning Results For the above splitting factor, the ITU-T H.263 based partitioning scheme discussed in Sec. 3, was then applied to split the first 405 frames of three sequences known as "Salesman", "Claire", and "Carphone" (note that all input sequences were generated at a frame rate of 30/s). As discussed, to embed the cut-off value in the GN codeword, these sequences conformed to the QCIF format. However, to comply with the low splitting fraction of X = 1/3 for the first partition, the coding frame rate was set at 10/s [14]. Furthermore, in these experiments, after every 135 interframe coded frames, the next frame was encoded as an intraframe mode (I-frame reset). Table 4 provides the video partitioning results with a splitting factor of X = 1/3, which includes detailed values of the coding and splitting parameters (excluding the GOB synchronization). In order to provide some subjective evaluations with regards to the distortion built up, we considered a situation where the entire second bitstream was corrupted by errors. Figure 10 depicts the 136th decoded frame of the "Salesman" and "Claire" sequences encoded at 10 frames/s and a bitrate of 57.6 kbit/s, with the splitting fraction of 1/3. For the sake of comparison, this figure also displays the reconstructed frames where the second bitstream is received error free. 5.2 Transmission The final stage of our experiments was concerned with the transmission aspects of the partitioned video signal over WCDMA. The "Salesman" sequence with QCIF format and at the original frame rate of 301s, was used as an input video in these experiments. The sequence contained 405 original frames and was encoded at a rate of 10/s. For a thorough evaluation of our transmission system, the sequence was repeated a hundred times to generate longer data. After the last frame was encoded (frame 136), the first frame of the repeated sequence was intraframe coded (I-frame). Thus, resulting in the I-frame reset period of 136 frames. Figure 11 shows the bit error rate (BER (1) (Basic Encoding Rules) A set of encoding rules for ASN.1 notation, which is a method for defining data structures. See ASN.1. (2) (Bit Error Rate) The average number of bits transmitted in error. See BERT. 1. ) performance verses the signal-to-interference noise ratio (SINR SINR Signal to Interference-plus-Noise Ratio SINR Shanghai Institute of Nuclear Research SINR Signal to Interference and Noise Ratio ), when the partitioned video signal is applied to the system depicted in Fig. 9. As can be seen, using a simple repetition code Repetition code is a coding scheme that repeats the bits across a channel to achieve error free communication. can improve the SINR
for the first bitstream by almost 4 dB.In the process of reconstructing video, if errors were detected in the first bitstream, the entire data in the GOB (group of blocks) was replaced by the reconstructed GOB from the reference frame. This may, consequently, cause a drift between the local and remote decoder that tends to propagate until it reaches the next I frame. Such a drift would result in more severe distortion, particularly if errors occur on the I-frame data leading to the elimination of a portion, or even the entire, video data. In the process of reconstructing video, if errors were detected in the first bitstream, the entire data in the GOB (group of blocks) was replaced by the reconstructed GOB from the reference frame. This may consequently cause a drift between the local and remote decoder that tends to propagate until it reaches the next I frame. Such a drift would result in more severe distortion, particularly if errors occur on the I-frame data leading to the elimination of a portion, or even the entire, video data. If errors were detected in the second bitstream (please also refer to Sec. 3.2), its entire data within the corrupted GOB was eliminated. Consequently, all the blocks within the same GOB were reconstructed using the data received via the first bitstream. This was done by replacing every codeword by its equivalent LASTVLC codeword. If the cut-off value was identified as a split-header, all the blocks in the GOB were considered uncoded (i.e., by changing the COD flag). Figure 12 shows the average peak-to-peak signal to noise ratio (PSNR PSNR Peak Signal-to-Noise Ratio PSNR Peak Signal to Noise Rate ) of the reconstructed frames using different I-frame reset periods ranging from 34-to- 136 frames. As shown, with a default I-frame reset period of 136, which was imposed on the repeated sequence, the recovery of the video signal at lower SINR values is almost impossible. This is mainly due to excessive errors on the first bitstream, causing a loss of most error sensitive header information. Consequently, the distortion effect tends to propagate until reaching the next I-frame. For instance, as shown in Fig. 12, more frequent I-frame resets can enhance the PSNR of the reconstructed video. But this would be at the expense of reducing the video compression efficiency. Therefore, for the best performance, a careful balance between the level of FEC See forward error correction. FEC - Forward Error Correction protection and the I-frame reset period would be needed with respect to the transmission channel conditions. Fortunately, in this case, the FEC overhead is added to protect a third of the input data. However, with better error protection it would be possible to employ a longer I-frame reset; thereby maintaining compression efficiency. For the sake of comparison, the transmission of a non-partitioned H.263 coded bitstream was also carried out and the results are included in Fig. 12. For this case, the video was encoded at the higher bitrate of 102.4 kbit/s and transmitted via the same IMT-2000 channel. These results are presented using the same I frame reset periods ranging from 34 to 136 frames. As shown, except for higher SINR values, the recovery of the non-partitioned video is almost impossible, even at the lowest I-frame reset periods. For instance, let us compare the results of non-partitioned video coded at 96 kbit/s with the shortest I-frame reset period of 34, and that of the partitioned video coded at 57.6 kbitls, but using the longest I-frame reset period of 136. We can observe that despite its lower bitrate, the partitioned video, as well as using a much larger I-frame reset period, is more resilient to errors when transmitted over IMT-2000 channels. Finally, as the simulation results show, using this simple, hard-decision coding scheme results in a significant improvement in the recovery of the video signal compared to when no partitioning is applied. However, with more sophisticated FEC coding schemes, such as turbo codes and using better decoding schemes at the receiver, the performance can be expected to improve considerably. One obvious improvement would be to introduce an interleaver after the repetition encoder, and to use soft decision decoding to allow variable-bit-rate (VBR (1) See MP3 VBR. (2) (Variable Bit Rate) Refers to a communications or computer channel that changes its transmission speed based on any number of criteria. ) video transmission together with the multi-priority scheme at the receiver. That is, instead of doing majority logic decoding Majority logic decoding is a method to decode Repetition codes, based on the assumption that the largest number of occurrences of a symbol was the transmitted symbol. Theory If we have a binary alphabet made of , the outputs of the Rake receiver [r.sub.1], [r.sub.2], [r.sub.3], corresponding to the transmitted and repeated symbol s (i.e., "s, s, s") are combined to give r = [r.sub.1] + [r.sub.2] + [r.sub.3], and r is then sent to the QPSK demodulator See demodulate. Demodulator A device used to recover the original modulating signal from a modulated wave. A demodulator is also known as a detector. to obtain an estimate of the transmitted signal s. Another effective FEC coding technique would be to use different FEC codes for each stream. For instance, stream 1 may be coded using a rate 1/2 turbo code, while stream 2 uses a rate 3/4 turbo code. Moreover, the 3GPP physical layer allows the change of data rate from frame-to-frame, in which case different rates and coding schemes may be used for each partitioned bitstream. 6. Conclusions This paper presents a robust dual-priority video partitioning method suitable for twin-class unequal protected video transmission over IMT-2000 channels. The partitioning advocated is based on a separation of the Variable-length (VL) coded Discrete Cosine Transform (DCT) coefficients within each block. The method was shown to be suitable for constant bit rate (CBR) transmission, where the fraction of bits assigned to each of the two partitions can be adjusted according to the requirements of the unequal error protection scheme employed. The distribution of the VL-coded (VLC) information amongst the two partitions is performed adaptively. The results illustrate that if the second partition is corrupted by errors, the propagation of distortion does not cause a severe degradation to the reconstructed video. The partitioned video was then applied to one of the leading third generation cellular radio standard proposals, often referred to as the WCDMA system. A simple error correction coding scheme was employed to evaluate the performance of unequal error protection for transmission of ITU-T H.263 compressed video over IMT-2000 channels. The results were compared with non-partitioned video using more periodic I-frame encoding to reduce the propagation of distortion. Acknowledgment The authors wish to thank Dr. Kevin Mills at NIST and Professor Lajos Hanzo of Southampton University for their constructive comments on this work. About the authors: Hamid Gharavi is with the Advanced Network Technologies Division of the NIST Information Technology Laboratory. Siavash M. Alamouti is with cadence Design Systems (company) Cadence Design Systems - A company that sells electronic design automation software and services. http://cadence.com/. See also Verilog. Inc. The National institute of Standards and Technology National Institute of Standards and Technology, governmental agency within the U.S. Dept. of Commerce with the mission of "working with industry to develop and apply technology, measurements, and standards" in the national interest. is an agency of the Technology Administration, U.S. Department of Commerce. (1.) Chip is defined as a unit representing a bit after spreading every data symbol. (2.) Figure 2 describes the base station baseband processing for a single traffic channel. The transmission for multiple channels may be combined differently based on the preferred implementation of the base station transmitter. (3.) Signal Processing Work System and SPW are registered trademarks of Cadence Design Systems, Inc. Certain commercial equipment, instruments, or materials are identified in this paper to foster understanding. Such identification does not imply recommendation or endorsement by the National Institute of Standards and Technology, nor does it imply that the materials or equipment identified are necessarily the best available for the purpose. 7. References (1.) V. H. MacDonald, The cellular concept, Bell System Tech. J. 58, 15-43 (1979). (2.) 3GPP Document TS SI. 13 V2.0.0, 3GPP Technical Specification Group, Radio Access Network, Working Group 1, Spreading and Modulation (FDD), April 1999. (3.) TIA/EIA Interim Standard-95, Mobile Station-Base Station Compatibility Standard for Dual-Mode Wideband Spread Spectrum Cellular System, July 1993. (4.) F. Adachi, M. Sawahashi, and K. Okawa, Tree-structured generation of orthogonal spreading codes with different lengths for forward link of DS-CDMA DS-CDMA Direct Sequence Code Division Multiple Access mobile radio, Electron. Lett. 33 (1), 27-28 (1997). (5.) Robert Gold, Optimal binary sequences for spread spectrum multiplexing, IEEE (Institute of Electrical and Electronics Engineers, New York, www.ieee.org) A membership organization that includes engineers, scientists and students in electronics and allied fields. Trans. Inform. Theor. IT-13 (4), 619-621 (1967). (6.) W. C. Jakes, Microwave mobile communications. John Wiley John Wiley may refer to:
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of (1974). (7.) ITU-T H.261 Recommendation, Video codec for Audio Visual Services at px64 kbit/s, March 1993. (8.) ITU-T Recommendation H.263, Video Coding for Low Bitrate communication, February 1998. (9.) Draft Text of Recommendation H.263 version 2 ("H.263+") for Decision, COM-16-26, 1998. (10.) MPEG-I International standard coding of moving pictures and assisted audio for digital storage media up to about 1.5[sim]Mbit/s, Part 1-3, Video Recommendation ITU-T H.262, ISO/IEC ISO/IEC International Organization for Standardization/International Electrotechnical Commission (ITU-T M 3000) 13818-2, January 20, 1995. (11.) MPEG-2 International Standard, Video Recommendation ITU-T H.262, 1995. (12.) R. Stedman, H. Gharavi, L. Hanzo, and R. Steele, Transmission of subband coded images via mobile channels, IEEE Transactions on CIRC SYST SYST System SYST Shut Your Stinkin' Trap VID VID Video VID Vide (Latin: See) VID Vendor Id (USB PnP ID) VID Visual InterDev (Microsoft development environment) VID VLAN Identifier (IEEE 802. , 1.3 (1), 15-26, (1993). (13.) H. Gharavi and C. I. Richards, Partitioning of MPEG (Moving Pictures Experts Group) An ISO/ITU standard for compressing digital video. Pronounced "em-peg," it is the universal standard for digital terrestrial, cable and satellite TV, DVDs and digital video recorders (DVRs). coded video bitstreams for wireless transmission, IEEE Signal Processing Lett. 4 (6), 153-155 (1997). (14.) H. Gharavi, Pilot Assisted 16-level QAM (1) (Quality Assessment Measurement) A system used to measure and analyze voice transmission. (2) (Quadrature Amplitude Modulation) A modulation technique that employs both phase modulation (PM) and amplitude modulation (AM). for wireless video transmission, Accepted for publication in IEEE Transactions on CIRC SYST VID. (15.) A. K. Jain, Fundamentals of Digital Image Processing Digital image processing is the use of computer algorithms to perform image processing on digital images. Digital image processing has the same advantages over analog image processing as digital signal processing has over analog signal processing — it allows a much wider , Prentice-Hall (1989). (16.) K. Rao and P. Yip, Discrete Cosine Transform, Academic Press, New York (1990). (17.) S. Tubaro, Two Layers video coding scheme for ATM networks, Signal Processing: Image Communication, Vol. 3 (June 1991) pp. 129-141. (18.) R. Aravind, M. R. Civanlar, and A. R. Reibman, Packet loss resilience of MPEG-2 video coding algorithms, IEEE Trans. CIRC SYST VID 6 (5), 426-435 (1996). (19.) H. Sun, W. Kwok, and J. W. Zdepski, Architectures for MPEG Compressed Bitstream Scaling, IEEE Trans. CIRC SYST VID 6(2), 191-199 (1996). [Graph omitted] [Graph omitted]
Table 1. The bit rates, symbol rates, and spreading factors for
WCDMA dedicated transport channels
Spreading factor (SF) Symbol rate bit rate kbit/s
(thousand symbols per
second)
512 (downlink only) 7.5 15
256 15 30
128 30 60
64 60 120
32 120 240
16 240 480
8 480 960
4 960 1920
Table 2. Channel impulse response for the IMT2000 Vehicular channel
model
Delay (ns) Average power (dB)
0 0
310 -1.0
710 -9.0
1090 -10.0
1730 -15.0
2510 -20.0
1090 10.0
Table 3. The weighting for WMSA
Weight
1 0.3
2 0.8
3 1.0
4 1.0
5 0.8
6 0.3
Table 4. Results for Salesman, Claire, and Carphone sequences codes at
10 frames per second with a splitting fraction of 1/3
Video sequence Salesman Claire Carphone
Header bits (kbit/s) 8.64 9.18 14.3
Coefficient bits (kbit/s) 48.812 48.48 43.2
Mean quantizer 5.62 4.51 10.28
Frames with
split-header GOB's (%) 28.67 17.6 91.6
Mean cut-off 1.1 1.24 0.39
Bitstream1 (kbit/s) 18.93 19.02 19.01
Bitstream2 (kbit/s) 38.53 38.65 38.54
|
|
||||||||||||||||||

[1]
coding scheme that repeats the bits across a channel to achieve error free communication.
Printer friendly
Cite/link
Email
Feedback
Reader Opinion