Exploring collective dynamics in communication networks.A communication network, such as the Internet, comprises a complex system where cooperative phenomena may emerge from interactions among various traffic flows generated and forwarded by individual nodes. To identify and understand such phenomena; we model a network as a two-dimensional cellular automaton A state machine that consists of an array of cells, each of which can be in one of a finite number of possible states. The cells are updated synchronously in discrete time steps, according to a local, identical interaction rule. . We suspect such models can promote better understanding of the spatial-temporal evolution of network congestion In data networking and queueing theory, network congestion occurs when a link or node is carrying so much data that its quality of service deteriorates. Typical effects include queueing delay, packet loss or the blocking of new connections. , and other emergent emergent /emer·gent/ (e-mer´jent) 1. coming out from a cavity or other part. 2. pertaining to an emergency. emergent 1. coming out from a cavity or other part. 2. coming on suddenly. phenomena in communication networks. To search the behavior space of the model, we study dynamic patterns arising from interactions among traffic flows routed across shared network nodes (networking) network node - (node) An addressable device attached to a computer network. If the node is a computer it is more often called a "host". , as we employ various configurations of parameters and two different congestion-control algorithms. In this paper, we characterize correlation in congestion The condition of a network when there is not enough bandwidth to support the current traffic load. congestion - When the offered load of a data communication path exceeds the capacity. behavior within the model at different system sizes and time granularities. As expected, we find that long-range dependence (LRD LRD In currencies, this is the abbreviation for the Liberian Dollar. Notes: The currency market, also known as the Foreign Exchange market, is the largest financial market in the world, with a daily average volume of over US $1 trillion. ) appears at some time granularities, and that for a given network size LRD decays as time granularity The degree of modularity of a system. More granularity implies more flexibility in customizing a system, because there are more, smaller increments (granules) from which to choose. increases. As network size increases, we find that long-range dependence exists at larger time scales. To distinguish effects due to network size from effects due to collective phenomena, we compare congestion behavior within networks of selected sizes to congestion behavior within comparably sized sub-areas in a larger network. We find stronger long-range dependence for sub-areas within the larger network. This suggests the importance of modeling networks of sufficiently large In mathematics, the phrase sufficiently large is used in contexts such as:
Key words: cellular automata cellular automata (CA) Simplest model of a spatially distributed process that can be used to simulate various real-world processes. Cellular automata were invented in the 1940s by John von Neumann and Stanislaw Ulam at Los Alamos National Laboratory. ; collective dynamics; complex system; congestion control
Congestion control concerns controlling traffic entry into a telecommunications network, so as to avoid congestive collapse by attempting to avoid ; emergence; communication networks; long-range dependence; modeling and simulation; network traffic. 1. Introduction As the Internet expands in size (from millions to billions of nodes) and in capabilities (from mainly file-transfers and web traffic to an increasing proportion of streaming multimedia traffic), network researchers propose and investigate new protocols and control algorithms to provide assured quality of service and improved network utilization. In most cases, such investigations are motivated and evaluated by small-scale simulations and experiments while ignoring effects from collective behavior The term "collective behavior" was first used by Robert E. Park, and employed definitively by Herbert Blumer, to refer to social processes and events which do not reflect existing social structure (laws, conventions, and institutions), but which emerge in a "spontaneous" way. that could emerge in a large network. Emergent phenomena are often identified when the behavior of an entire system appears more coherent and directed than the behavior of individual parts of the system. Such phenomena arise in the study of complex systems, where many parts interact with each other and where the study of the behavior of individual parts reveals little about system-wide behavior [1]. Emergent phenomena might significantly influence the performance of proposed new protocols and control algorithms. We a im to identify and understand the source and effects of emergent phenomena in large networks. In this paper, we investigate specifically congestion behavior in a network at various sizes and time scales. Our work exists within an active field of investigation where researchers attempt to understand and control the dynamic behavior of existing networks. For example, past work on the performance of congestion-control algorithms for end nodes in the Internet (2) identified important effects on system-wide congestion arising from the retransmission Retransmission might refer to:
A stochastic differential equation (SDE) is a differential equation in which one or more of the terms is a stochastic process, thus resulting in a solution which is itself a stochastic to describe the behavior of flows and queues, with passage to more tractable tractable easy to manage; tolerable. systems of ordinary differential equations ordinary differential equation Equation containing derivatives of a function of a single variable. Its order is the order of the highest derivative it contains (e.g., a first-order differential equation involves only the first derivative of the function). by taking expectations. The coupled ordinary differential equations can be solved numerically. Similar approaches, based on fluid-flow models, are also being investigated through simulation (4). Innovative ideas such as these begin to probe the ability of abstract mathematical techniques to characterize collective behavio r in large networks. As an alternative to these more abstract mathematical approaches, some researchers represent large networks in terms of discrete-event simulations of individual nodes deployed in specific topologies. Such models attempt to capture the detailed behavior in individual nodes to ensure that significant effects from protocols and protocol interactions will not be overlooked by the coarser granularity used to construct more abstract models. The advantages of simulation models include the ability to represent the time-varying behavior of a network and to capture complex behavioral details, which might have significant effects on global network behavior. Further, it is always possible to construct discrete-event models for specific protocol mechanisms, while it might prove impossible to develop tractable mathematical models
CPU time - processor time , much of which may be devoted to modeling behavior irrelevant to the phenomena being inv estigated. Several researchers are working on speeding up discrete-event simulations, through parallelization and other techniques, in order to enable realistic simulation of networks containing from [10.sup.4] to [10.sup.6] nodes (5, 6). We suspect that an interesting space of informative models may exist somewhere between manageable, abstract mathematical models, and detailed, computationally intensive simulations. A growing interest in the study of complex systems has led networking researchers to consider applying a number of mathematical techniques from physics to characterize collective dynamics in large networks (7-11). One such technique applies a cellular automaton (CA) (12, 13) to identify and characterize emergent properties, such as congestion, in networks (14). We suspect such models can promote better understanding of the spatial-temporal evolution of network congestion, and other emergent phenomena. In this paper, we use a two-dimensional CA to investigate congestion behavior in networks of varying size, and over varying time scales. The CA stimulates collective dynamics in a network using abstract mathematical representations of individual nodes. To search the behavior space of our CA, we study dynamic patterns arising from interactions among traffic flows routed across shared network nodes, as we employ various configurations of parameters and two different congestion-control algorithms. We focus on the relationship between network size and time scale. Specifically, we characterize the correlation in congestion behavior within the model at different system sizes and time granularities. Among other things, our results suggest the importance of modeling (or measuring) networks of sufficiently large size when studying the nature of collective dynamics. The paper is presented in five sections. Section 2 outlines our motivation, provides highlights of our methodology, and describes our computational model
2. Motivation, Methodology, and Model We seek to understand what will happen when many network connections are active simultaneously. We suspect that some collective effects may appear within networks with a size beyond some determinable Liable to come to an end upon the happening of a certain contingency. Susceptible of being determined, found out, definitely decided upon, or settled. determinable adj. threshold. Our concern is to understand the global implications of such collective effects, rather than to discern cause and effect relationships introduced by specific control algorithms and associated protocols. Research into emergent phenomena in physical systems shows that collective effects tend to arise only when many local interactions occur over a wide space, such as might be possible in a large network where dynamic behavior can be transmitted through the nodes. The transmission of dynamic behavior can evolve into correlated patterns. Alternatively, large networks can be viewed as dissipative dis·si·pate v. dis·si·pat·ed, dis·si·pat·ing, dis·si·pates v.tr. 1. To drive away; disperse. 2. dynamic systems, where a driving force (injecting packets) determines the strength of local interactions. Increasing the driving force impels the system toward stronger coupling among interactions. As the driving force reaches a certain critical strength, we can expect the interactions to merge into a coherent behavior, which cannot be simply inferred from the individual behavior of components. The resulting coherent behavior should be observed as spatial-temporal dynamic patterns over the whole system. We think that this view of system dynamics System dynamics is an approach to understanding the behaviour of complex systems over time. It deals with internal feedback loops and time delays that affect the behaviour of the entire system. might prove applicable in large communication networks, such as the Internet . Further, it is possible that feedback control mechanisms, as implemented in such networks, might play a pivotal role in dynamically maintaining both coherence and efficiency of system state by preventing resource saturation due to extreme congestion. 2.1 Methodology To investigate these ideas, we propose to model a large network as a cellular automaton (CA), where the behavior of each cell captures some important details related to network protocols. Specifically, our model maintains the individual identity of packets in order to reproduce the "ripple effect ripple effect Epidemiology See Signal event. " (3) seen in real network connections. Our model also simulates feedback-control regimes for each connection in order to respond to variations in congestion over time and space. We provide two alternative forms of feedback control: connection-admission control (CAC See Consumer Advisory Council. ) and feedback transmission-control protocol (TCP (1) (Transmission Control Protocol) The reliable transport protocol within the TCP/IP protocol suite. TCP ensures that all data arrive accurately and 100% intact at the other end. ). On the other hand, we resort to some extreme simplifications, including a regular grid A regular grid is a tessellation of the Euclidean plane by congruent rectangles or a space-filling tessellation of rectilinear parallelepipeds. Grids of this type appear on graph paper and may be used in finite element analysis. topology topology, branch of mathematics, formerly known as analysis situs, that studies patterns of geometric figures involving position and relative position without regard to size. composed of homogeneous links and nodes (see Fig. 1), in order to achieve a sufficiently large model with well-understood parameters. Further, to ensure that no correlation arises from the input traffic itself, we use homogeneous on-off Poisson processes A Poisson process, named after the French mathematician Siméon-Denis Poisson (1781 - 1840), is a stochastic process which is used for modeling random events in time that occur to a large extent independently of one another (the word event to model the behavior of traffic sources. Our CA comprises a discrete, dynamic system composed of a set of cells arranged in a regular, spatial lattice (theory) lattice - A partially ordered set in which all finite subsets have a least upper bound and greatest lower bound. This definition has been standard at least since the 1930s and probably since Dedekind worked on lattice theory in the 19th century; though he may not . The state of a cell at a given time step depends only on its own state and the states of its neighbors at the previous time step. The cells update synchronously and in parallel. Thus, the entire CA state advances in discrete time Discrete time is non-continuous time. Sampling at non-continuous times results in discrete-time samples. For example, a newspaper may report the price of crude oil once every 24 hours. steps. Global behavior results from multiple interactions in the evolution of states in all the single cells. As a result of the discrete time model, we can measure the entire CA state, or any portion of interest, at each time step. In our results, we include at least 2000 data points in each time series that we analyze. For our coarsest time granularity, where we aggregate 1000 time steps, this means that we must run the model for two million time steps. Such measurements produce a time series that can be graphed, and that can also be analyzed using a wide range of suitable statistical techniques. Our main approach to analyze time-series employs power-spectral analysis (24). We envision the system state as a signal generated over time by the system. Power-spectrum graphs describe the frequency dependence in the signal. The power spectrum is the Fourier transform Fourier transform In mathematical analysis, an integral transform useful in solving certain types of partial differential equations. A function's Fourier transform is derived by integrating the product of the function and a kernel function (an exponential function raised to of the autocorrelation Autocorrelation The correlation of a variable with itself over successive time intervals. Sometimes called serial correlation. function of a time series. Of particular interest to us is the appearance of 1/f noise, which provides a clear indication of some sort of collective effect, or cooperative phenomenon, present in the process that generates the signal (22). "1/f noise" is a type of noise whose power spectrum as a function of frequency (f) behaves like: P(f) = 1/[f.sup.a], where a is between 0 and 2. Such noise is often associated with the presence of a complex system, defined as a system that exhibits large variability in its behavior due to strong dynamic interactions among its parts. In contrast to a complex system, an ordered system exhibits regular behavior due to deterministic 1. (probability) deterministic - Describes a system whose time evolution can be predicted exactly. Contrast probabilistic. 2. (algorithm) deterministic - Describes an algorithm in which the correct next step depends only on the current state. interactions among its parts, and a disordered system exhi bits behavior that can be characterized on a statistical basis because its parts interact rarely (22). In fact, the presence of 1/f noise is often cited in the literature as an indication of emergent phenomena. Further, the power spectrum of 1/f noise exhibits ample energy in low frequency components. This suggests the dominant autocorrelation in the signal exists over the long term, which also indicates long-range dependence. 2.2 Modeling a Network With Cellular Automata The computational model used here represents a network as a CA lattice with LXL cells (Fig. 1) in which each cell corresponds to a node with four nearest neighbors See point sampling. (we interconnect boundary nodes, as necessary). Other researchers have proposed similar network models (8, 9, 15, 16). In our model, each node, which can store and forward packets traveling between source-destination pairs, maintains a queue of unlimited length, where arriving packets are stored until they can be processed. Each node can act both as a source and destination for traffic. In effect, each cell in our model can be thought to represent a host that sends and receives traffic and a router that forwards traffic. The queue length at each node represents the state of the cell. Figure 2 provides a schematic A graphical representation of a system. It often refers to electronic circuits on a printed circuit board or in an integrated circuit (chip). See logic gate and HDL. diagram of the nodespecific behavior of each CA cell at each time step. 2.3 Node as a Cell At each time step, each node: (1) evaluates the traffic-generation restrictions and congestion-control status and, if indicated, places a new data packet at the end of its own queue, (2) selects a packet from the front of its queue (if a packet is present), (3) selects the next hop (unless the packet has reached its destination node), and (4) forwards the packet to the end of the queue within the next-hop node. If the packet at the front of the queue has reached its destination, then the node consumes the packet. However, if the consumed packet is an incoming request for which a reply is indicated, then the node will place a reply packet at the end of its queue. Different choices are possible to model the input process, the congestion-control algorithm, and the routing. A discussion of each of these issues follows. 2.3.1 Input Process In this paper, each node models traffic generation by "on/off' periods, which alternate between wake and sleep. When awake, and if the congestion-control state permits, the node adds a data packet to the end of its queue during each time step. At the beginning of each "on" period, a node randomly selects (uniform distribution) a destination node from among all other nodes in the lattice. Each packet generated during the same "on" period has the same destination address. When sleeping, the node generates no new data packets. On/off periods provide a convenient model of user behavior. In the simulation reported here, the wake and sleep period durations for each source are taken to be exponentially ex·po·nen·tial adj. 1. Of or relating to an exponent. 2. Mathematics a. Containing, involving, or expressed as an exponent. b. distributed with parameters [[lambda].sub.on] and [[lambda].sub.off] Thus, the transitions between state "on" and state "off" form a memoryless (i.e., uncorrelated) process. While we do not assert that real users exhibit memoryless behavior, we aim to investigate long-memory behavior that results from collective inte ractions; therefore, we must eliminate correlations in the behavior of individual traffic sources. The input processes of different nodes are taken to be independent from each other. 2.3.2 Congestion-Control Algorithms Our model contains the possibility of three choices for congestion-control algorithm: (1) open-loop, (2) connection-admission control (CAC), and (3) feedback transmission control protocol (TCP). Only one of these algorithms can be used for a given experiment. In the case of the open-loop algorithm, we can change the average durations of state "on" and state "off" to control the network workload. There is no feedback in this case. An open-loop approach does not model reality very well because the goal of a network is to transmit packets between source-destination pairs while attempting to assure some minimally acceptable quality of service. Even for best-effort service A communications service that makes no guarantees regarding the speed with which data will be transmitted to the recipient or that the data will even be delivered entirely. (i.e., where the network provides no guarantees about service quality) [2], a user will not wait indefinitely in the face of long packet-transmission times that occur in a very congested con·gest·ed adj. Affected with or characterized by congestion. congested ENT adjective Referring to a boggy blood-filled tissue. See Nasal congestion. network. So feedback control is important for networks, and we model two different approaches. In one approach, we use a connection-admission control (CAC) algorithm. CAC requires a source to send a probe packet at the beginning of each "on" state. Upon receiving this probe packet, the destination node returns a probe-reply packet to the source. Upon receiving the probe-reply packet, the source node determines the round-trip time round-trip time - (RTT) A measure of the current delay on a network, found by timing a packet bounced off some remote host. This can be done with ping -s. , RTT RTT - Round-Trip Time , and then normalizes RTT with respect to the distance between the source and destination. If the normalized RTT, Nrtt, falls below a threshold, Drtt, then the source sends a data packet at each time step during the "on" state. If the Nrtt exceeds Drtt, then the source sends another probe packet. Upon receiving the probe-reply, the source repeats its RTT assessment. While the CAC algorithm tests congestion state along the source-destination path prior to injecting a data-packet flow, the flows themselves exhibit fixed inter-packet spacing. Inter-packet spacing within a single TCP flow, however, has been observed to exhibit its own distinguishing variability, which appears as structured behavior on a short time scale. Such variability in TCP flows is likely attributable to the feedback-control mechanisms of TCP (17, 18), though this remains a topic of ongoing study. Other studies have demonstrated that TCP results in interesting dynamics at small time scales (19, 20). To account for the significant behavior of feedback control, we included within our model a modified version of TCP. Our TCP model includes some limiting assumptions. Each node contains an unlimited buffer, so no packets will be lost and retransmitted. Instead we only model the effect that congestion losses would have on TCP flow-control mechanisms. We do this by comparing the normalized RTT, Nrtt, for each received acknowledgment acknowledgment, in law, formal declaration or admission by a person who executed an instrument (e.g., a will or a deed) that the instrument is his. The acknowledgment is made before a court, a notary public, or any other authorized person. (ACK (ACKnowledgment code) The communications code sent from a receiving station to a transmitting station to indicate that it is ready to accept data. It is also used to acknowledge the error-free receipt of transmitted data. Contrast with NAK. 1. ) against a threshold, Drtt. We also ensure that the receiver's advertised flow-control window (2) does not constrain con·strain tr.v. con·strained, con·strain·ing, con·strains 1. To compel by physical, moral, or circumstantial force; oblige: felt constrained to object. See Synonyms at force. 2. the sending rate. Our modified TCP model does include a slow-start and congestion-avoidance algorithm, which is described as follows. For every ACK, if the Nrtt exceeds the Drtt, then we set the slow-start threshold to 1/2 the congestion window In TCP, the congestion window determines the number of bytes that can be outstanding at any time. This is a means of stopping the link between two places from getting overloaded with too much traffic. and set the congestion window to one. Otherwise, if the congestion window is below the slow-start threshold, we increment To add a number to another number. Incrementing a counter means adding 1 to its current value. the congestion window by one. Once the congestion window exceeds the slow-start threshold, we increment the congestion window by the inverse of the congestion window. This procedure simulates the effects of the transition in TCP between the congestion phase, the slow-start phase and the congestion-avoidance phase. At each time step, a source node injects a data packet, up to the limits of its congestion window. 2.3.3 Routing As with input processes and congestion-control algorithms, we can choose different routing strategies for our model. For our experiments, the objective of routing is to minimize the delay for each packet by forwarding it along the shortest path between source-destination pairs. To select the proper next-hop along which to forward a packet, the forwarding node computes (using the approach of Fuks and Lawniczak (15)) the distance from each of its four neighboring neigh·bor n. 1. One who lives near or next to another. 2. A person, place, or thing adjacent to or located near another. 3. A fellow human. 4. Used as a form of familiar address. v. nodes to the packet's destination node. Then the packet is forwarded to the neighboring node nearest to the destination. When multiple neighboring nodes prove equidistant e·qui·dis·tant adj. Equally distant. e qui·dis tance n. from the destination, then one of the candidate nodes is selected randomly with uniform probability. After placing the packet at the end of the queue in the selected node, the model increments a throughput counter associated with the corresponding outgoing link. 3. Simulation Results In this section, we study the behavioral properties of our model under two congestion-control algorithms. An important parameter in our algorithms is Drtt, which can be used to control the quality of service in packet-delivery time, and also the network load. We performed some simulations to assess the effect of Drtt, and observed that, for a fixed network size and fixed [[lambda].sub.on] and [[lambda].sub.off], packet delivery time and network congestion increases with Drtt up to some bound. Further, for several fixed values of Drtt, we observed that the distribution of packet-delivery times seem to follow a lognormal distribution Lognormal distribution Pattern of frequency of occurrence in which the logarithm of the variable follows a normal distribution. Lognormal distributions are used to describe returns calculated over periods of a year or more. , as observed elsewhere (16, 21). From these results (not fully described here) we decided to set Drtt at 50 time steps, a value that permitted congestion to build up within the network, and thus enabled us to study the collective behavior of our congestion-control algorithms. Unless otherwise indicated, all simulations discussed here were run with [[lambda].sub.on] = 100, [[lambda] .sub.off] = 500, Drtt = 50, and with the TCP congestion-control algorithm. We begin by considering the effects of time scale on queue sizes in an individual node. Packet-switched networks, which route messages hop-by-hop over multiple intermediate links and routers between sources and destinations, can be viewed as a mesh of single-server queues, where each queue acts as a memory element. For purposes of illustration, Fig. 3 shows the time series of the queue length, Nr, for a typical node of the CA lattice at three time granularities T= 1, 10, and 100, where T defines the interval with which we sample the system state. (The abscissas on all our graphs depict the number of sample intervals, t, so the total number of CA time steps represented in any particular graph is equal to TXt). At the shortest sample interval, T = 1, the queue length changes smoothly because the memory introduces a correlation. As expected, as the sample interval T increases, the memory becomes weaker and the correlative Having a reciprocal relationship in that the existence of one relationship normally implies the existence of the other. Mother and child, and duty and claim, are correlative terms. structure is diminished. Since each queue exists within a network of queues, we expect correlation in queue size to be influenced by memories of neighboring queues. This influence appears as a kind of spatial-temporal information, which is difficult to discern by observing a single queue. Before we consider a more suitable metric to measure spatial-temporal memory, we discuss another measure of interest, the output process of an individual node, as an alternative to queue size. Let [X.sub.out] denote de·note tr.v. de·not·ed, de·not·ing, de·notes 1. To mark; indicate: a frown that denoted increasing impatience. 2. the number of data packets received by a node, during a time interval T, where the node is the data packet's destination. Measures of [X.sub.out] appear to reflect more the influence of spatial-temporal information than is the case for measures of queue size, Nr. Figure 4, which shows a time series of [X.sub.out] for one node at three time granularities T = 40, 100, and 500, supports this observation. Here, while auto-correlation decreases as T increases, the rate of decrease appears more gradual than is the case with queue sizes (Fig. 3). In our view, [X.sub.out] reflects more information about its time interval and about nearby space because it is an aggregate value, accumulated over the sample interval, rather than a snapshot of system state at one time instant. To gain more insight into our observations, we compute the power spectrum [S.sub.out](f) of [X.sub.out]. Figure 5 shows the power spectra for selected combinations of time granularity and system size. In general, each curve shows a flat line at lower frequencies, followed by a negatively sloped line commencing at higher frequencies. The negatively sloped portion of each curve exhibits the appearance of 1/f noise. In our interpretation of these curves, the larger the negatively sloped region of a curve (or the shorter the flat region), the more 1/f-like the curve. Further, the more a curve appears 1/f-like, the greater the long-range dependence in the signal. For example, comparing the two curves in Fig. 5(a), we find that long-range dependence decays as the time granularity increases from T = 80 to T = 400. We also find that for the same time granularity (T = 400), a larger network size shows a greater long-range dependence (compare Fig. 5(a) T = 400 against Fig. 5(c) T= 400). The presence of 1/f noise, which is characterized by correlations extending over a wide range of time scales (long-range dependence), pr ovides a clear indication of some sort of collective effect. Moreover, using our interpretation, the graphs in Fig. 5 show that long-range dependence decays as T increases for the same system size, L, and that long-range dependence holds for the same time granularity as system size increases. This suggests that congestion dissipates more slowly as network size increases, that is, a larger network seems to have a more pronounced correlative structure. We believe such behavior occurs because network traffic experienced at a node consists of transient packet flows transiting across a mixture of short and long distances, and modulated mod·u·late v. mod·u·lat·ed, mod·u·lat·ing, mod·u·lates v.tr. 1. To adjust or adapt to a certain proportion; regulate or temper. 2. by adaptive congestion-control algorithms. To investigate this behavior more directly, we developed a technique to monitor the congestion present in aggregate among all nodes in our network model. Our technique maps the three-dimensional (3-D) structure (L X L X Nr) of the network state onto a two-dimensional (2-D) binary pattern. To achieve such a mapping, we set a threshold parameter Y against which to compare the state (i.e., the queue length) of every node. If the queue length of the node r is less than or equal to Y, then the state value br of the binary network is set equal to zero, otherwise one. In this way, the network state can be mapped from three dimensions onto a 2-D grid, as shown for example in Fig. 6 where black blocks represent congested nodes (for Y = 5), and white blocks depict congestion-free nodes. Using such a 2-D map proves more convenient then a 3-D map when visualizing the evolution of network state. Such maps provide a more readily comprehensible com·pre·hen·si·ble adj. Readily comprehended or understood; intelligible. [Latin compreh view of the spatial correlation of network congestion. As time progresses, the congestion state of a node in a large network depends more on the congestion state of its neighboring area. This idea was first proposed by researchers who modeled the propagation of congestion between neighboring routers based on contact processes with a Cayley tree [10]. Using the 2-D grid, we can determine the number of congested nodes (y) in our model at any time granularity T, and then record y as a time-series representing the system state for any number of sample intervals t. In Fig. 7 we show a time series of congested nodes for a system size L = 16 at three time granularities: T = 10, 100, and 1000. As the figure shows, we find generally that as the time granularity, T, increases, the number of congested nodes, y, changes less smoothly. This suggests that congested nodes exhibit stronger interdependence in·ter·de·pen·dent adj. Mutually dependent: "Today, the mission of one institution can be accomplished only by recognizing that it lives in an interdependent world with conflicts and overlapping interests" at smaller time scales. To explore the relationship among time scale, network size, and congestion, we plot several power spectra in Fig. 8 at various network sizes and time granularities. Figures 8(a)-(c) illustrate that increasing the time granularity for the same network size leads to a reduced appearance of 1/f-like noise. For example, comparing the two curves in Fig. 8(a) (network size L = 8), we find some autocorrelation in the curve T = 50 but the curve for T = 400 appears almost flat, suggesting little autocorrelation in the signal. Comparing this curve for T = 400 against the curve for the same time granularity but at network size L = 32, shown in Fig. 8(c), we find that as the network size increases from L = 8 to L = 32 the long-range dependence increases for a given time granularity. To us, these results suggest that collective behavior in a larger network causes more profound influence on network congestion and on predictability. If true, these factors might prove meaningful for the design of congestion-control and traffic-engineering mechanisms in networks. We suspect that 1/f noise arises from the collective effect of many interacting network flows, independent of specific details associated with network protocols. Furthermore, as indicated by Fig. 8, the collective effect appears to strengthen as system size increases, but to diminish as time granularity increases. At very large time scales, the evidence of collective effect does not appear; instead, the power spectra tend toward whiteness as the sample interval (T) exceeds some size. These observations suggest that some time-granularity threshold might exist within which the network can be viewed as a coherent whole (that is, a time granularity where the network congestion signal exhibits the most pronounced 1/f-like appearance). We can refer to this granularity as the coherent time scale. As shown by Fig. 8, the larger the network, the greater the coherent time scale, because 1/f-like noise appears at larger sample intervals. On the other hand, our results also suggest some difference in the evolution of con gestion between large and small networks. In our spectral analyses, a larger network, such as L = 32 in Fig. 8(c), retains a more 1/f-like appearance at larger time scales than does a smaller network. On the contrary, as plotted in Fig. 8(a), the 1/f-like appearance in the congestion signal appears to decay faster as T increases in the smaller network (L = 8). To us, these results confirm that a congestion-control system or a traffic-management regime has more time available (a larger coherent time scale) to respond to congestion in a larger network, because the congestion diminishes rather more slowly than in smaller networks. Of course, the management of a large network also requires more time. The effectiveness of control methods may fall off gradually once their reaction time exceeds a certain threshold. Discussion regarding the influence of these findings on concrete network management techniques is beyond the content of this paper. Considering the collective dynamics of a network and the relationship between network size and time granularity, we find that for a given network size one must expect more volatility in congestion as the sample interval increases. We have compared our TCP and CAC congestion-control algorithms against cases where the network does not control congestion. These latter experiments, not reported in this paper, modulate To insert a data signal into a carrier wave or direct current. See modulation. congestion by varying the intensity of source traffic. Across all of our experiments, we find that long-range dependence emerges at different time scales, seeming to depend not only on network size but also on traffic intensity and congestion-control algorithm. One particular phenomenon observed in our current experiments might provide some useful insight for researchers attempting to explore behavior in communication networks. Our results suggest that the congestion response of a sub-area in a larger network may have different features compared with a smaller network with the same size as the sub-area. For example, Fig. 9 illustrates a binary network pattern with L = 32 and indicates two sub-areas with sizes l = 8 and 16. Each sub-area appears as an interconnected or interwoven in·ter·weave v. in·ter·wove , in·ter·wo·ven , inter·weav·ing, inter·weaves v.tr. 1. To weave together. 2. To blend together; intermix. v.intr. part of the larger network; thus, these sub-areas play an indispensable role in global emergence, where collective behavior pertains to the system as a whole. Extracting such sub-areas, by isolating them from the original network into a smaller network with system size L = l, ignores various relationships or interdependencies, and may lead to inaccurate analysis regarding some aspects of dynamic behavior in a large network. We demonstrate this effect in Fig. 10. Figure 10(a) compares the power spectrum of the number of congested nodes, y, at a comparable time granularity, T = 200, for a network of size L = 8 against a sub-area of comparable size, l = 8, from within a larger network with size L = 32. Figure 10(b) provides a similar comparison, except that the time granularity is increased to T = 400, and the network size and sub-area size are increased to L = 16 and 1 = 16, respectively. These figures suggest that sub-areas exhibit stronger dependence with respect to congestion than networks of the same size and time scale. For this reason, network researchers interested in the behavior of networks of a certain size L would be well advised to investigate such networks as sub-areas of size l = L within a larger network. Further, these results also suggest that network researchers should strive to investigate the effects of network congestion and cross traffic within topologies exhibiting sufficiently large scale. The global congestion behavior of a network can look qu ite different depending upon whether a set of identical nodes is arranged in a small or large network. This might represent a caution for researchers who rely on detailed, discrete-event simulations, because simulation models can take substantial computing resources and memory as the size of the topology increases. Yet, a sizeable topology appears necessary in order to approximate the effects of congestion within even a restricted sub-area. 4. Discussion We aim to understand collective dynamics in large networks, where cause-effect relationships might not be inferred readily from the behavior of individual nodes. Our current results suggest how such collective dynamics might arise and evolve. We start with a disordered network, where the nodes act in a random way (based on a memoryless input process), and where any propagating influence will be dispersed dis·perse v. dis·persed, dis·pers·ing, dis·pers·es v.tr. 1. a. To drive off or scatter in different directions: The police dispersed the crowd. b. and dissipated dis·si·pat·ed adj. 1. Intemperate in the pursuit of pleasure; dissolute. 2. Wasted or squandered. 3. Irreversibly lost. Used of energy. quickly. Initially, we saw that nodes were only interacting locally. Such locality of interaction follows from the basic continuity of physical processes: for any influence to pass from one region to another it must first pass through all intermediate regions. During the time that the process propagates through the intervening regions, it will be disturbed by all the fluctuations taking place within those regions. As a result, in the network's original disordered state, distant parts of the network do not influence each other: they appear independent. Over time, as feedback control mechanisms adapt to changes in network congestion in all directions, a discernable structure increases gradually and then continues to evolve. Eventually, the discernable structure expands into a global order. These effects depend upon the size of the network, appearing more strongly over greater time scales as network size increases. No approach to simulation can describe the behaviors of the real Internet completely. Given the state-of-the-art, it is currently within reach to develop good temporal models to study performance at a single node or in a small network (i.e., to represent a smaller part of a larger network). We believe we have shown that it would be desirable, and perhaps feasible, to devise models to study the spatial-temporal performance of a large network as a whole. To achieve such an outcome requires that simplifications be made. While our modeling method seems an extreme simplification, especially in terms of the regular topology, the homogeneous links and nodes, and the routing of traffic, we do maintain the individual identity of packets and we believe that our model captures important details, such as feedback congestion control, that might be missed in other network simulations In computer network research, network simulation is a technique where a program models the behavior of a network either by calculating the interaction between the different network entities (hosts/routers, data links, packets, etc) using mathematical formulas, or actually capturing . Our model enables us to explore collective dynamics in reasonably large networks, up to 1024 nodes so far. Our model also drives 2-D and 3- D animations (not described in this paper) that give an intuitive view of network behavior over time and against key parameters. 5. Future Work The results reported in this paper, as well as elsewhere [8, 9, 14-16], encourage us to continue our investigations into the collective dynamics of large networks. We foresee three directions in our immediate future plans. First, we need to incorporate into our network model additional traffic sources and control mechanisms related to providing quality of service. As more and more high-speed access links are added to the Internet, the Internet, the, international computer network linking together thousands of individual networks at military and government agencies, educational institutions, nonprofit organizations, industrial and financial corporations of all sizes, and commercial enterprises nature of traffic can be expected to change, perhaps leading to an increasing quantity of streaming multimedia traffic and large file transfers, as well as increases in traffic from various interactive group games and from instant messaging Exchanging text messages in real time between two or more people logged into a particular instant messaging (IM) service. Instant messaging is more interactive than e-mail because messages are sent immediately, whereas e-mail messages can be queued up in a mail server for seconds or applications. In addition, the pattern of source-destination pairs may evolve as users begin to move toward increased peer-to-peer communications Communications in which both sides have equal responsibility for initiating, maintaining and terminating the session. Contrast with "master-slave communications," in which the host determines which users can initiate which sessions. . In addition to changing traffic patterns among network nodes, evolving Internet use might alter the balance between TCP and non-TCP traffic. Such changes will increase the criticality of deploying mechanisms to provide quality of service. While researchers have provided an understanding of the properties of various quality-of-service mechanisms on a local scale, the effects of such mechanisms have not been studied on a global scale in large networks. Second, we must increase the size of our model in order to improve our ability to identify and understand emergent phenomena, and especially congestion dynamics, at Internet scale. The current work reports a maximum network of 1024 nodes (L X L = 32 X 32 = 1024) and 4096 links (four links per node). Using MATLAB (MATrix LABoratory) A programming language for technical computing from The MathWorks, Natick, MA (www.mathworks.com). Used for a wide variety of scientific and engineering calculations, especially for automatic control and signal processing, MATLAB runs on Windows, Mac and , we were able to model a network of this size; however, to execute the behavior of the network for two million time steps requires 10 days of CPU time on a 750 MHz (MegaHertZ) One million cycles per second. It is used to measure the transmission speed of electronic devices, including channels, buses and the computer's internal clock. A one-megahertz clock (1 MHz) means some number of bits (16, 32, 64, etc. Pentium Ill with 128 MB of memory. We intend to recode Verb 1. recode - put into a different code; rearrange mentally; "People recode and restructure information in order to remember it" rearrange - put into a new order or arrangement; "Please rearrange these files"; "rearrange the furniture in my room" our model using C or C++ in an effort to achieve a network size on the order of 16 000 nodes and 64 000 links. Such an increase in size should enable us to enhance our ability to identify and understand emergent phenomena. To reach our ultimate intended network size, on the order of [10.sup.5] to [10.sup.6] nodes and links, we might require a parallel processing parallel processing, the concurrent or simultaneous execution of two or more parts of a single computer program, at speeds far exceeding those of a conventional computer. system. Such systems appear well suited to cellular automata (23). With a network model of sufficient size, we can undertake a systematic search to identify, understand, and perhaps explain, significant emergent phenomena in large networks. This represents our third avenue for future work. In this case, we must develop theories regarding the most likely emergent phenomena, and then test those theories with a systematic set of experiments. From this work, we might well provide guidance to researchers seeking to measure Internet behavior on a global scale. In particular, we might identify specific phenomena and related measurement data that could be collected by experimenters in order to confirm or refute re·fute tr.v. re·fut·ed, re·fut·ing, re·futes 1. To prove to be false or erroneous; overthrow by argument or proof: refute testimony. 2. the presence of emergent phenomena within the Internet. 6. Conclusions When attempting to analyze network behavior by examining the behavior of constituent components, researchers can develop the misconception mis·con·cep·tion n. A mistaken thought, idea, or notion; a misunderstanding: had many misconceptions about the new tax program. that uncertainty regarding network congestion and resource consumption increases with network size. We argue that the opposite holds. We find that correlation in congestion among network nodes can be expected to persist at larger time granularities as system size increases, and we also find network size and time granularity to be two closely related aspects of the spatial-temporal dynamics of a network. This suggests that as network size increases, the collective behavior of a network might well become more predictable. Such predictability arises from global emergence, where chain reactions move through the whole system. As the system becomes larger, the correlated interactions persist. As we show, such collective behavior cannot be realized from the analysis of individual network components. Although in this paper we do consider two simplified congestion-control algorithms, we must still evaluate the collective dynamics of large networks that include additional mechanisms, such as differentiated services Offerings that can be classified by type, or quality, of service. For example, a differentiated services network could prioritize real time traffic for a higher fee. and routing updates. However, more important challenges revolve around Verb 1. revolve around - center upon; "Her entire attention centered on her children"; "Our day revolved around our work" center, center on, concentrate on, focus on, revolve about issues of scale. We need to scale the network model up to a million nodes, so that we can investigate multi-scale, spatial-temporal dynamics. This suggests a need to employ a parallel programming environment to study multiple-timescale traffic patterns and network performance, and to identify and understand even more interesting phenomena arising only in sufficiently large networks. We believe that simulating networks at an appropriately large size is key for researchers to gain insight regarding behaviors that might emerge within the Internet. [FIGURE 2 OMITTED] [FIGURE 3 OMITTED] [FIGURE 4 OMITTED] [FIGURE 5 OMITTED] [FIGURE 7 OMITTED] [FIGURE 8 OMITTED] [FIGURE 10 OMITTED] Acknowledgments The manuscript benefited greatly from review by Stefan Leigh, Doug Montgomery, Tassos Nakassis, and Ted Vorburger. We also appreciated the advice of Bert Rust, who helped us ensure our spectral plots were properly graphed and presented. We accept sole responsibility for any remaining ambiguity, inconsistency in·con·sis·ten·cy n. pl. in·con·sis·ten·cies 1. The state or quality of being inconsistent. 2. Something inconsistent: many inconsistencies in your proposal. , or other errors. Accepted: January 28, 2002 7. References (1.) Yaneer Bar-Yam Yaneer Bar-Yam (born 1959, Boston, MA) is an American physicist, systems scientist and founding president of the New England Complex Systems Institute. Biography Yaneer Bar-Yam was born in Boston, MA in 1959. He received the B.S. degree in 1978 and the. Ph.D. , The Dynamics of Complex Systems (Studies in Nonlinearity), ISBN ISBN abbr. International Standard Book Number ISBN International Standard Book Number ISBN n abbr (= International Standard Book Number) → ISBN m 0-201-55748-7, August 1997. (2.) V. Paxson, End-to-End Internet Packet Dynamics, IEEE/ACM IEEE/ACM Institute of Electrical and Electronic Engineers/Association for Computing Machinery (joint effort) Transactions on Networking 7 (3), 277-292, 1999. (3.) B. Liu, Y. Guo, J. Kurose, D. Towsley, and W. Gong, Fluid simulation of large scale networks: issues and tradeoffs, in Proceedings of the International Conference on Parallel and Distributed Processing The first term used to describe the distribution of multiple computers throughout an organization in contrast to a centralized system. It started with the first minicomputers. Today, distributed processing is called "distributed computing." See also client/server. Techniques and Applications, June 28-July 1, 1999. Volume IV, Las Vegas Las Vegas (läs vā`gəs), city (1990 pop. 258,295), seat of Clark co., S Nev.; inc. 1911. It is the largest city in Nevada and the center of one of the fastest-growing urban areas in the United States. , NV, pp. 2136-2142. (4.) D. Nicol, M. Goldsby, and M. Johnson, Fluid-based Simulation of Communication Networks using SSF SSF Scalable Simulation Framework SSF Single Stock Futures SSF Service Switching Function SSF Small Form Factor SSF Svenska Simförbundet (Swedish Swimming Association) SSF Space Station Freedom SSF Society of St. , Proceedings of the 1999 European Simulation Symposium, October 1999, Erlangen-Nuremberg, Germany. (5.) J. Cowie, H. Liu, J. Liu, D. Nicol, and A. Ogiclski, Towards Realistic Million-Node Internet Simulations, in Proceedings of the 1999 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'99), June 28-July 1, 1999, Las Vegas, Nevada. (6.) Hung-Ying Tyan and Chao-Ju Hou, JavaSim: A component-based compositional network simulation environment, Western Simulation Multiconference--Communication Networks And Distributed Systems Distributed systems (computers) A distributed system consists of a collection of autonomous computers linked by a computer network and equipped with distributed system software. Modeling And Simulation, January 2001. (7.) I. Csabai, 1/f Noise in Computer Network Traffic, J. Phys., A 27 (12), L417-L421 (1994). (8.) T. Ohira and R. Sawatari, Phase transition in computer network traffic model, Phys. Rev. E 58, 193-195 (1998). (9.) J. Yuan, Y. Ren, F. Liu, and X. Shan, Phase transition and collective correlation behavior in the complex computer network, Acta Physica Sinicia 50 (7), 1221-1225, July 2001. (10.) M. Takayasu, H. Takayasu, and T. Sato, Critical behaviors and 1/f noise in information traffic, Physica A Physica A (ISSN: 0378-4371): Statistical mechanics and its applications is a scientific journal published by Elsevier that publishes research in the field of statistical mechanics and its applications. 233, 824-834 (1996). (11.) K. Fukuda, M. Takayasu, and H. Takayasu, Spatial and temporal behavior of congestion in Internet traffic Internet traffic is the flow of data around the Internet. It includes web traffic, which is the amount of that data that is related to the World Wide Web, along with the traffic from other major uses of the Internet, such as electronic mail and peer-to-peer networks. , Fractals 7, 23-31 (1999). (12.) J. von Neumann Noun 1. von Neumann - United States mathematician who contributed to the development of atom bombs and of stored-program digital computers (1903-1957) John von Neumann, Neumann , Theory of Self Reproducing Automata automata - automaton , University of Illinois Press The University of Illinois Press (UIP), is a major American university press and part of the University of Illinois. Overview According to the UIP's website: (1966). (13.) N. Gershenfeld, Cellular Automata and Lattice Gases, in The Nature of Mathematical Modeling, Cambridge University Press Cambridge University Press (known colloquially as CUP) is a publisher given a Royal Charter by Henry VIII in 1534, and one of the two privileged presses (the other being Oxford University Press). (1999) pp. 102-110. (14.) J. Yuan, Y. Ren, and X. Shan, Self-Organized Criticality In physics, self-organized criticality (SOC) is a property of (classes of) dynamical systems which have a critical point as an attractor. Their macroscopic behaviour thus displays the spatial and/or temporal scale-invariance characteristic of the critical point of a phase in a Computer Network Model, Phys. Rev., E 61(1), 1067-1071 (2000). (15.) H. Fuks and A. T. Lawniezak, Performance of data networks with random links, Math. Computers Simulation 51, 101-117 (1999). (16.) R. V. Sole and S. Valverde, Information transfer and phase transitions in a model of Internet traffic, Physica A 289 (3-4), 595-605 (2001). (17.) A. Feldmann, A. C. Gilbert, P. Huang, and W. Willinger, Dynamics of IP traffic: A study of the role of variability and the impact of control, in Proc. ACM (Association for Computing Machinery, New York, www.acm.org) A membership organization founded in 1947 dedicated to advancing the arts and sciences of information processing. In addition to awards and publications, ACM also maintains special interest groups (SIGs) in the computer field. SIGCOMM SIGCOMM Special Interest Group on Data Communication (ACM) '99, 301-313 (1999). (18.) K. Park, Future directions and open problems in performance evaluation Performance evaluation The assessment of a manager's results, which involves, first, determining whether the money manager added value by outperforming the established benchmark (performance measurement) and, second, determining how the money manager achieved the calculated return and control of self-similar network traffic, in K. Park and W. Willinger, eds., Self-Similar Network Traffic and Performance Evaluation, Wiley-Interscience (2000). (19.) A. Veres, M. Boda, The chaotic nature of TCP congestion control, in IEEE (Institute of Electrical and Electronics Engineers, New York, www.ieee.org) A membership organization that includes engineers, scientists and students in electronics and allied fields. INFOCOM INFOCOM International Conference on Computer Communications (IEEE) INFOCOM Informatics and Communications (CQU) 2000, Tel Aviv Tel Aviv (tĕl əvēv`), city (1994 pop. 355,200), W central Israel, on the Mediterranean Sea. Oficially named Tel Aviv–Jaffa, it is Israel's commercial, financial, communications, and cultural center and the core of its largest , Mar. 2000, pp. 1715-1723. (20.) W. Feng and P. Tinnakornsrisuphap, The adverse impact of the TCP congestion-control mechanism in heterogeneous computing systems, in Proc. of the Int'l Conf. on Parallel Processing (ICPP'00) August 2000. (21.) B.A. Huberman and R. M. Lukose, Social dilemmas A Social dilemma is a paradox arising from social decision situations in which contributions are needed to attain a common goal and where the rational choice of the individual is to "free-ride". and Internet congestion, Science 277, 535-537 (1997). (22.) P. Bak, How Nature Works: the science of self-organized criticality. Copernicus, New York New York, state, United States New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of (1996). (23.) D. Hutchinson et al., Parallel Neighborhood Modeling: Research Summary, in Proceedings of Symposium on Parallel Algorithms In computer science, a parallel algorithm, as opposed to a traditional serial algorithm, is one which can be executed a piece at a time on many different processing devices, and then put back together again at the end to get the correct result. and Applications 96, CAM Press, New York (1996) pp. 204-207; B. P. Zeigler et al., The DEVS DEVS Discrete Event System DEVS Devices DEVS Discrete Event System Simulation Environment for High-Performance Modeling and Simulation, IEEE Computational Sci. Eng. 4 (3), 61-71 (1997). (24.) M. Schroeder, Fractals, Chaos, Power Laws: Minutes from an Infinite Paradise, W H. Freeman and Company, New York (1991). About the authors: Jian Yuan is a guest researcher at NIST (National Institute of Standards & Technology, Washington, DC, www.nist.gov) The standards-defining agency of the U.S. government, formerly the National Bureau of Standards. It is one of three agencies that fall under the Technology Administration (www.technology. Kevin Mills is a senior research scientist in the Advanced Network Technologies Division of the Information Technology Laboratory at NIST. The National Institute of Standards and Technology National Institute of Standards and Technology, governmental agency within the U.S. Dept. of Commerce with the mission of "working with industry to develop and apply technology, measurements, and standards" in the national interest. is an agency of the Technology Administration, U.S. Department of commerce. |
|
||||||||||||||||||

is true for sufficiently large
qui·dis
tance n.
Printer friendly
Cite/link
Email
Feedback
Reader Opinion