Timing redundancy in telecommunication systems: a white paper.
In a typical telecommunication product, all cards are synchronized to the same clock. The failure of this clock disrupts the data traffic on all cards. To avoid this problem and increase network reliability, telecom products are designed with at least two clocks--active and redundant. If the active clock fails, the system avoids failure by switching to the redundant clock. This article outlines the importance of clock redundancy, presents two methods (parallel and serial) used to implement timing redundancy, and discusses the advantages and disadvantages of both approaches.
Highly reliable operation
Telecommunications systems must provide highly reliable operation under all network conditions. To do this, the most critical components within the system are made redundant. A typical telecommunications product is a 19-inch standard telecom rack populated by up to 18 one-inch vertically inserted cards.
As shown in Figure 1, a typical system comprises two control cards and multiple line cards that communicate over a common backplane. The two control cards are identical and run in parallel. Only one control card is active at any given time, and the other takes over if the first fails. Switching from one control card to the other should not cause any interruption or failure in the system.
[FIGURE 1 OMITTED]
The control card includes a system control processor, switching fabric and system timing. It is important to note that in more complex, larger systems, timing is implemented on separate cards to further increase the flexibility of the product. This article covers only the timing aspects of telecommunication systems.
Timing card architecture
Having two timing cards protects against an internal failure where one of the cards fails. To protect from external clock reference failures, the timing cards are designed to be able to synchronize to more than one reference.
A timing card accepts references from multiple sources, selects one, cleans it of phase noise with a digital phase-locked loop (DPLL), and distributes it to the line cards via the backplane. The DPLL is the most important part of the timing card. Depending on the targeted application of the product and region of deployment, the DPLL needs to be compliant with the appropriate timing specifications, such as Telcordia GR-1244-CORE, Telcordia GR-253-CORE or ITU G.813. The DPLL needs to provide an array of crucial features, including:
* Hitless reference switching--if the reference the DPLL is locked to fails, the DPLL will lock to another available reference without phase disturbances at its output.
* Holdover mode--the DPLL constantly calculates the average frequency of the locked reference. If the reference fails and none of the other references are available, the DPLL goes into holdover mode, where it generates an output clock based on the calculated average value. Holdover stability depends on the resolution of the DPLL averaging algorithm and on the frequency stability of the oscillator used as the DPLL master clock.
* Reference monitoring--the DPLL needs to constantly monitor quality of its input references. If the reference the DPLL is locked to deteriorates--disappears or drifts in frequency--the DPLL raises an alarm (interrupt) and switches to another valid reference.
* Narrow loop bandwidth--the DPLL can be viewed as a phase noise filter. The narrower the loop bandwidth, the better the phase noise attenuation. Some specifications, such as G.813, explicitly specify the loop bandwidth. Others, including GR-253-CORE, specify a narrow loop bandwidth implicitly through the wander transfer requirement. Ideally, the DPLL should have programmable loop bandwidth so the timing card can be easily used for different applications.
* High jitter and wander tolerance--the DPLL should tolerate large phase noise at its input and still maintain synchronization.
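The reference monitoring, reference switching, and holdover behaviour listed above can be illustrated with a small model. This is only a sketch under stated assumptions: the class name `DpllModel`, the 4.6 ppm validity threshold, and the eight-sample averaging window are hypothetical, and the model tracks frequency offsets only, so it says nothing about phase and does not show how a real DPLL keeps a switch hitless.

```python
from collections import deque


class DpllModel:
    """Toy model of reference monitoring, switching, and holdover."""

    def __init__(self, references, max_offset_ppm=4.6, history=8):
        self.references = list(references)    # priority order (hypothetical names)
        self.max_offset_ppm = max_offset_ppm  # assumed validity threshold
        self.history = deque(maxlen=history)  # recent offsets of the locked reference
        self.locked_to = None
        self.mode = "free-run"
        self.output_offset_ppm = 0.0

    def update(self, measured_ppm):
        """Process one monitoring interval.

        measured_ppm maps reference name -> frequency offset in ppm,
        or None when the reference has disappeared.
        """
        valid = [
            ref for ref in self.references
            if measured_ppm.get(ref) is not None
            and abs(measured_ppm[ref]) <= self.max_offset_ppm
        ]
        if self.locked_to not in valid:
            # Locked reference failed (or none chosen yet): take the next valid one.
            self.locked_to = valid[0] if valid else None
            if self.locked_to is not None:
                self.history.clear()

        if self.locked_to is not None:
            self.mode = "locked"
            offset = measured_ppm[self.locked_to]
            self.history.append(offset)
            self.output_offset_ppm = offset
        elif self.history:
            # No valid reference left: replay the stored average frequency.
            self.mode = "holdover"
            self.output_offset_ppm = sum(self.history) / len(self.history)
        return self.mode


dpll = DpllModel(["BITS0", "BITS1"])
dpll.update({"BITS0": 0.01, "BITS1": 0.02})   # locks to BITS0
dpll.update({"BITS0": None, "BITS1": 0.02})   # BITS0 lost, switches to BITS1
dpll.update({"BITS0": None, "BITS1": 0.04})
print(dpll.update({"BITS0": None, "BITS1": None}), dpll.output_offset_ppm)
```

In the last call both references are missing, so the model falls back to the average of the offsets it recorded while locked to BITS1.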
Timing card DPLL references can come externally from a Building Integrated Timing Supply (BITS) or internally from line cards. The BITS is defined as the most accurate clock in an office, and is used as a master clock for all intraoffice equipment. The BITS can be viewed as a standalone timing card, usually with Stratum 2 (0.1 parts per billion) holdover stability. The BITS is timed by two T1 signals and its outputs are distributed to equipment with T1 or Composite Clock (CC) signals. It should be noted that BITS is a North American term, while the rest of the world uses Synchronization Supply Unit (SSU). Where BITS uses T1 for clock reception and distribution, SSU uses E1 links.
All nodes in a public telecommunication network must be synchronized to timing references that are traceable to a Primary Reference Source (PRS). A PRS provides a clock with Stratum 1 accuracy (0.01 parts per billion). A PRS can be generated from an on-site cesium clock, or from cesium clock-controlled radio signals such as Global Positioning System (GPS) and Long Range Navigation System, Version C (LORAN-C). Due to the high cost of cesium clocks, a PRS usually uses GPS, with LORAN-C as a backup if GPS fails. Because it is not economically viable to have a PRS at each network node, only a few (usually two) nodes have their BITS synchronized directly to a PRS.
The other nodes in the network use line timing, where their BITS/SSU is synchronized to one of the extracted line clocks. The clock path sequence is shown in Figure 3. In this case, an additional low-cost wideband DPLL is needed to convert the frequency of the line card extracted clock to the frequency needed by T1/E1/CC Line Interface Units (LIU). LIUs are used for the transmission of the timing references between the timing card and BITS and vice versa. For example, if the extracted line clock originates from an OC-3 line card, its frequency is usually 19.44 MHz, so the wideband DPLL is needed to convert from 19.44 MHz to 1.544 MHz (T1), 2.048 MHz (E1), or 64 kHz (CC).
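The frequency conversion in this example comes down to a rational ratio between the extracted line clock and each interface rate. The sketch below works those ratios out with exact fractions. It assumes the standard T1 rate of 1.544 MHz, and the helper name `conversion_ratio` is hypothetical. It illustrates only the arithmetic, not how a particular wideband DPLL implements the conversion.

```python
from fractions import Fraction

LINE_CLOCK_HZ = 19_440_000          # clock extracted from an OC-3 line card
TARGETS_HZ = {
    "T1": 1_544_000,
    "E1": 2_048_000,
    "CC": 64_000,
}


def conversion_ratio(f_out_hz, f_in_hz=LINE_CLOCK_HZ):
    """Exact ratio f_out / f_in, reduced to lowest terms."""
    return Fraction(f_out_hz, f_in_hz)


for name, f_out in TARGETS_HZ.items():
    ratio = conversion_ratio(f_out)
    print(f"{name}: f_out = f_in * {ratio.numerator} / {ratio.denominator}")
```

None of the ratios reduces to a simple integer division, which is why a PLL rather than a plain divider is needed for this step.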
[FIGURE 3 OMITTED]
Optionally, the timing card can be used to source BITS/SSU clock if an external BITS/SSU source with better holdover accuracy is not available. In this case, the timing card DPLL is synchronized to one of the extracted line clocks. Its output is fed to the backplane and to LIUs via wideband DPLL.
Timing card redundancy
Timing card redundancy is implemented in one of two ways--parallel redundancy or serial redundancy. Parallel redundancy is shown in Figure 4, while serial redundancy (commonly referred to as "master/slave" timing redundancy) is illustrated in Figure 5.
[FIGURE 4-5 OMITTED]
As seen in Figures 4 and 5, DPLLs on the active and redundant cards drive the active and redundant clocks to the corresponding traces on the backplane. Each DPLL usually drives common clock frequencies such as 8 kHz (DS0), 1.544 MHz (DS1), 2.048 MHz (E1) and 19.44 MHz (SONET/SDH).
The active and redundant clocks on the backplane should have the same frequency and phase. Ideally, the phase difference should be equal to zero. In practice, a phase difference in the range of a few nanoseconds is achievable.
The active and redundant clocks are distributed via the backplane to the line cards. As seen in Figures 4 and 5, the line cards each have a DPLL followed by an analog PLL (APLL). The DPLL is used for hitless switching between the active and redundant clocks and to provide clock continuity for a short period, such as when the active clock unexpectedly disappears before the system detects active reference failure and switches the line card DPLL to lock to the redundant reference.
The APLL is used only for jitter reduction and frequency multiplication. It is possible to have hitless reference switching with an APLL. However, good clock continuity is difficult to achieve because oscillators used on APLLs (usually LC-based) have very low holdover stability relative to DPLLs that use crystal oscillators. Typically, a DPLL has short-term holdover accuracy of 0.01 ppm (parts per million) or better, whereas an APLL has holdover accuracy above 100 ppm.
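To see what these holdover figures mean for clock continuity, the sketch below converts a holdover accuracy into the phase error accumulated while the line card runs without a reference. The 1 ms gap is an assumed detection time chosen for illustration, not a figure from any specification, and the function name is hypothetical.

```python
NS_PER_S_PER_PPM = 1_000.0   # 1 ppm of frequency error = 1000 ns of drift per second


def phase_drift_ns(holdover_ppm, gap_s):
    """Worst-case phase drift accumulated over gap_s seconds of holdover."""
    return holdover_ppm * NS_PER_S_PER_PPM * gap_s


GAP_S = 1e-3   # assumed time to detect the failure and switch references

for name, ppm in (("DPLL", 0.01), ("APLL", 100.0)):
    print(f"{name}: {phase_drift_ns(ppm, GAP_S):g} ns over {GAP_S * 1e3:g} ms")
```

Under this assumption the DPLL drifts by a small fraction of a nanosecond, while the APLL figure is four orders of magnitude larger, far above the few-nanosecond alignment expected on the backplane.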
Parallel timing redundancy
In this scheme, as illustrated in Figure 4, DPLLs on both timing cards are locked to either an extracted line clock from one of the line cards or the BITS reference. Both DPLLs should be locked to the same input reference and should have identical loop bandwidth (e.g., 0.1 Hz for Telcordia GR-253-CORE). In this case, if the active card does a reference switch from BITS0 to BITS1, the redundant card should simultaneously do the same. Because the DPLLs on the active and redundant timing cards have the same bandwidth and are fed with the same input reference, the outputs should be closely phase aligned regardless of the jitter/wander on the input reference. However, this is only partially true due to intrinsic wander issues, which are discussed later in the article.
Serial (master/slave) redundancy
A serial redundancy timing scheme is implemented by locking the secondary timing card to the output of the primary timing card, as shown in Figure 5. The loop bandwidth of the DPLL on the active timing card should be set in accordance with requirements (for Telcordia GR-253-CORE it is 0.1 Hz). However, the loop bandwidth of the DPLL on the redundant card should be set as wide as possible--at least 10 times wider than that of the DPLL on the active card. The wider bandwidth allows the DPLL to track clock changes at its input much faster, thus keeping the active and redundant clocks closely aligned at all times.
If it is detected that the clock generated by the active card has failed, the DPLL on the secondary card will go into holdover mode and signal to the board controller. The controller will now promote the secondary card to act as the primary card by selecting the narrowband loop filter on the DPLL and locking the DPLL to the same reference input (if available) that the active card was locked to before it failed. When the failed timing card is replaced, the new card will assume the role of the redundant timing card.
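The promotion sequence described above can be sketched as a small piece of controller logic. Everything here is illustrative: the `TimingCard` fields, the 1 Hz wide-band setting, and the function `promote_on_failure` are assumptions, and a real board controller would act through device registers and interrupts, not Python objects.

```python
from dataclasses import dataclass

NARROW_BW_HZ = 0.1   # active-card bandwidth quoted for GR-253-CORE
WIDE_BW_HZ = 1.0     # assumed redundant-card bandwidth (at least 10x wider)


@dataclass
class TimingCard:
    name: str
    role: str             # "active", "redundant", or "failed"
    bandwidth_hz: float
    locked_to: str        # name of the reference the DPLL follows
    mode: str = "locked"  # "locked", "holdover", or "failed"


def promote_on_failure(active, redundant, reference_available):
    """Promote the redundant card after the active clock has failed."""
    # 1. The redundant DPLL loses its input and enters holdover.
    redundant.mode = "holdover"
    # 2. The controller reconfigures it as the new active card.
    redundant.role = "active"
    redundant.bandwidth_hz = NARROW_BW_HZ
    # 3. If possible, lock to the reference the failed card was using.
    if reference_available:
        redundant.locked_to = active.locked_to
        redundant.mode = "locked"
    active.role = "failed"
    active.mode = "failed"
    return redundant


card_a = TimingCard("A", "active", NARROW_BW_HZ, locked_to="BITS0")
card_b = TimingCard("B", "redundant", WIDE_BW_HZ, locked_to="card A output")
print(promote_on_failure(card_a, card_b, reference_available=True))
```

If the original reference is not available, the promoted card simply stays in holdover with the narrow bandwidth selected.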
In serial timing redundancy, the phase offset between the active and the redundant clocks can be calculated from:
$$D = d_{PLL} + d_{RxBuffer} + d_{Mux} + d_{TxBuffer}$$
where
$d_{RxBuffer}$ is a typical propagation delay of the receive clock buffer on the slave card,
$d_{Mux}$ is a typical propagation delay of the clock multiplexer,
$d_{TxBuffer}$ is a typical propagation delay of the clock driver on the slave card, and
$d_{PLL}$ is a typical phase offset between the input and the output reference after reference alignment is performed.
Some advanced DPLLs intended for timing card design have the ability to advance the output clock relative to the input with a resolution below 1 nanosecond. This feature can be used to minimize delay D.
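As a worked illustration of the delay budget and of the output-advance feature, the sketch below sums the four contributions and then removes as much of the total as a stepped advance allows. All delay values and the 0.25 ns step are hypothetical numbers chosen for the example. They are not taken from any datasheet.

```python
def serial_phase_offset_ns(d_pll, d_rx_buffer, d_mux, d_tx_buffer):
    """Total offset D between active and redundant clocks, in nanoseconds."""
    return d_pll + d_rx_buffer + d_mux + d_tx_buffer


def residual_after_advance_ns(offset_ns, step_ns):
    """Offset left after advancing the output by a whole number of steps."""
    steps = round(offset_ns / step_ns)
    return offset_ns - steps * step_ns


# Hypothetical delays, in nanoseconds.
D = serial_phase_offset_ns(d_pll=0.5, d_rx_buffer=1.2, d_mux=0.8, d_tx_buffer=1.6)
print(f"D = {D:.2f} ns")
print(f"residual = {residual_after_advance_ns(D, step_ns=0.25):.2f} ns")
```

With a stepped advance, the residual offset cannot exceed half a step in magnitude, so a finer resolution translates directly into tighter alignment.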
Comparing redundancy schemes
In practice, designers use serial redundancy more often because it has several important advantages.
If the product is in island mode (not locked to the network reference or to the BITS clock), its timing cards must work in a free-run mode. In this mode, the DPLL output frequency will be based on crystal oscillators used as the DPLL master clock. As a result, the active and redundant clocks in the parallel method will drift relative to each other at a rate proportional to the fractional frequency difference between crystal oscillators on the active and redundant cards. However, in the serial redundancy method the active and redundant clocks will always be aligned because the DPLL on the redundant card locks to the clock generated by the free-running DPLL on the active card.
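The free-run drift in the parallel scheme follows directly from the fractional frequency difference between the two oscillators. The sketch below estimates the drift rate and how long it takes to use up a given phase budget. The oscillator offsets (+0.3 ppm and -0.2 ppm) and the 10 ns budget are assumed values for illustration only.

```python
NS_PER_S_PER_PPM = 1_000.0   # 1 ppm of frequency difference = 1000 ns of drift per second


def drift_rate_ns_per_s(osc_a_ppm, osc_b_ppm):
    """Relative phase drift rate between two free-running clocks."""
    return abs(osc_a_ppm - osc_b_ppm) * NS_PER_S_PER_PPM


def time_to_exceed_s(phase_budget_ns, osc_a_ppm, osc_b_ppm):
    """Time until the two clocks have drifted apart by phase_budget_ns."""
    rate = drift_rate_ns_per_s(osc_a_ppm, osc_b_ppm)
    return float("inf") if rate == 0 else phase_budget_ns / rate


# Assumed oscillator offsets on the active and redundant cards.
print(drift_rate_ns_per_s(0.3, -0.2), "ns/s")
print(time_to_exceed_s(10.0, 0.3, -0.2), "s to drift 10 ns")
```

Even a sub-ppm mismatch exhausts a nanosecond-scale budget in a fraction of a second, whereas in the serial scheme the redundant card tracks the active clock and no such drift accumulates.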
Since DPLLs on the active and redundant timing cards have the same bandwidth in the parallel redundancy method, and because they are fed with the same input reference, one would expect that the outputs would be closely phase-aligned regardless of the jitter/wander on the input reference. However, the active and redundant clocks may drift back and forth relative to one another due to intrinsic wander generated by the DPLL. This intrinsic wander is dependent on the short-term frequency fluctuations of the crystal oscillator and on the bandwidth of the DPLL. When fed with a clean input reference clock, a DPLL can compensate for those short-term fluctuations and provide clean clocks at its output.
However, the DPLL's ability to do so is dependent on its bandwidth. The wider the bandwidth, the better the compensation. Because the DPLLs on the active and redundant cards in the parallel redundancy method have the same narrow bandwidth, they will both have intrinsic wander. Since each card has its own crystal oscillator, the wander generated by the DPLLs will be uncorrelated. Thus, the active and redundant clocks may drift back and forth relative to each other. The maximum phase difference between them can be more than 10 nanoseconds when the DPLL is set to 0.1 Hz loop bandwidth, even when very stable oscillators such as Ovenized Crystal Oscillators (OCXO) are used. This problem is not present in the serial redundancy mode because the DPLL on the redundant card compensates for all frequency fluctuations caused by the crystal oscillator due to its wide loop bandwidth.
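The dependence of intrinsic wander on loop bandwidth can be illustrated with a deliberately simplified simulation. The sketch below assumes a first-order loop locked to a perfectly clean reference, with the local oscillator contributing a random phase step on every update. The loop gains, noise level, and step count are arbitrary illustrative values, and the model is not calibrated to any real DPLL or OCXO.

```python
import random
from math import sqrt


def residual_phase(loop_gain, steps, rng, noise_ns=0.01):
    """Phase error of a first-order loop locked to a clean reference."""
    phase = 0.0
    trace = []
    for _ in range(steps):
        phase += rng.gauss(0.0, noise_ns)   # oscillator's random phase step
        phase -= loop_gain * phase          # loop pulls the phase back toward the reference
        trace.append(phase)
    return trace


def rms(values):
    return sqrt(sum(v * v for v in values) / len(values))


STEPS = 200_000
# Parallel scheme: two narrowband loops, each with its own oscillator.
narrow_a = residual_phase(0.001, STEPS, random.Random(1))
narrow_b = residual_phase(0.001, STEPS, random.Random(2))
parallel_diff = [a - b for a, b in zip(narrow_a, narrow_b)]

# Serial scheme: the redundant card follows the active clock with a wide loop.
serial_diff = residual_phase(0.1, STEPS, random.Random(3))

print(f"parallel rms offset: {rms(parallel_diff):.3f} ns")
print(f"serial rms offset:   {rms(serial_diff):.3f} ns")
```

In this toy model a smaller loop gain stands in for a narrower bandwidth. The narrowband loops correct oscillator noise slowly, so their uncorrelated residuals add, while the wideband loop keeps its residual relative to its input small.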
Yet, the parallel redundancy scheme is easier to implement because it does not require reconfiguration of the DPLL on the redundant card when the active clock/card fails.
Timing card redundancy is implemented in telecommunications products to prevent data loss and increase network reliability. This article presented the typical timing card architecture and two common ways of implementing timing card redundancy. Although slightly more complicated to implement, serial redundancy has several advantages over parallel redundancy.
Alain Blanchard, Phase-Locked Loops, Wiley, 1976
Synchronous Optical Network (SONET) Transport Systems: Common Generic Criteria, GR-253-CORE, Issue 3, 2000
Clocks for the Synchronized Network: Common Generic Criteria, GR-1244-CORE, Issue 2, 2000
Digital Network Synchronization Plan, GR-436-CORE, Issue 1, Revision 1, 1996
Timing characteristics of SDH equipment slave clocks (SEC), ITU-T Recommendation G.813, 1998
Transport Systems Generic Requirements (TSGR): Common Requirements, GR-499-CORE, Issue 2, 1998
About the author: Slobodan Milijevic is a Senior Applications Engineer with Zarlink Semiconductor. He can be reached at email@example.com.