Estimating time and size of bioterror attack.In the event of a bioterror attack, rapidly estimating the size and time of attack enables short-run forecasts of the number of persons who will be symptomatic and require medical care. We present a Bayesian approach to this problem for use in real time and illustrate it with data from a simulated anthrax anthrax (ăn`thrăks), acute infectious disease of animals that can be secondarily transmitted to humans. It is caused by a bacterium (Bacillus anthracis attack. The method is simple enough to be implemented in a spreadsheet. ********** In the event of a bioterror attack, once the biologic agent has been determined, rapidly estimating the size and time of attack enables a forecast of the number of persons who will be symptomatic and will require medical attention over the days (and perhaps weeks) after the attack. Such a forecast could play a key role in determining the response effort required, e.g., surge capacity planning Determining the required future configuration of hardware and software for a network, datacenter or Web site. There are numerous capacity planning tools on the market used to monitor and analyze the performance of the current hardware and software. at hospitals, distributing vaccines or antimicrobial agents Antimicrobial agents Chemical compounds biosynthetically or synthetically produced which either destroy or usefully suppress the growth or metabolism of a variety of microscopic or submicroscopic forms of life. to the population, as appropriate (1,2). We refer to early knowledge of the size and time of an attack as situational awareness Situation awareness or situational awareness [1] (SA) is the mental representation and understanding of objects, events, people, system states, interactions, environmental conditions, and other situation-specific factors affecting human performance in . We present a Bayesian approach to the real-time estimation of the size and time of a bioterror attack, from case report data, that is simple enough to implement in a spreadsheet. The model assumes a single-source outbreak caused by a bioterror attack at a particular point in time. Although the model assumes that the infectious agent infectious agent Pathogen, see there is not contagious, the analysis still holds for contagious agents until secondary infections have progressed to symptomatic cases. Thus, our model should prove valuable within the first incubation period incubation period n. 1. See latent period. 2. See incubative stage. Incubation period after an attack has been detected for a contagious agent and for longer time periods in the event of a noncontagious agent. However, in the event of multiple attacks at different points in time or an attack with a rapidly progressing contagious agent, the problem becomes more difficult and similar to the use of back-calculation to recover the incidence of infection over time from symptomatic case reporting (3). The key assumptions in our analysis are that the biologic agent used has been identified and that the probability distribution Probability distribution A function that describes all the values a random variable can take and the probability associated with each. Also called a probability function. probability distribution of the incubation time from infection through symptoms is known. The incubation time distribution for anthrax has been estimated by Brookmeyer and colleagues on the basis of the Swerdlovsk outbreak (4); data describing the incubation distribution for smallpox smallpox, acute, highly contagious disease causing a high fever and successive stages of severe skin eruptions. The disease dates from the time of ancient Egypt or before. are summarized by Fenner et al. (5). Although smallpox is a contagious infection, historically the incubation time from infection through onset of symptoms is 7-17 days (5), a fact that renders our model applicable to smallpox for roughly 2 weeks after an outbreak or 1 week after the first observed cases (which is the shortest time until one would expect to see cases resulting from second-generation infections). Likely ranges for the incubation times of other plausible bioterror agents are available at the Centers for Disease Control and Prevention's bioterror Web site (6), in addition to sources in the literature. If the only information available regarding the incubation time for some agent is a likely range, then one approach to creating a distribution for use with our model is to assume that the range corresponds to a probability coverage interval from a plausible incubation time distribution (such as the lognormal log·nor·mal adj. Mathematics Of, relating to, or being a logarithmic function with a normal distribution. log ) and match the parameters of the distribution accordingly. We assume that the attack is detected through the appearance of infected persons with symptoms, and that as cases are identified, patient interview yields the approximate time at which symptoms appeared, a process which avoids the need for explicit estimates of reporting delay. Corrected as such, case reports provide two types of information. The number of cases observed provides a lower bound on the size of the attack. The specific timing of case reports also conveys information that can be better understood when filtered through the agent-specific incubation time distribution. The mathematical details of our approach are described in the Appendix. We define the time origin as the instant when the first case (and hence the attack) is detected (though the time origin can be reassigned if case investigation indicates that a subsequently reported case had earlier symptoms). At the moment the attack is detected, consistent with Bayesian principles (7), we presume a prior robability distribution (henceforth, prior) governing the size of the attack. For any given attack size n, the time since the attack can be equated to the minimum of n independent incubation times (since the attack is detected by the first symptomatic case). As additional cases accumulate over time, the likelihood of observing cases at specific times is computed with standard methods. Given the data observed, application of Bayes rule enables estimation of the posterior distribution of both the size and time of attack, from which summary statistics such as the mean, standard deviation In statistics, the average amount a number varies from the average number in a series of numbers. (statistics) standard deviation - (SD) A measure of the range of values in a set of numbers. , and probability intervals of the attack size and time are easily estimated. Short-run forecasts of future cases are also easily achieved within this framework. We have developed an Excel spreadsheet (Microsoft Corp., Redmond, WA) for implementing this procedure. As an example, we simulated an anthrax attack that infects 100 persons using the incubation time distribution for anthrax estimated from the Swerdlovsk outbreak (4). We assumed a broad prior that assigns equal likelihoods to attacks of different orders of magnitude from 1 to 10,000 (Appendix). Thus, attacks infecting 1-10, 11-100, 101-1,000, and 1,001-10,000 persons each have the same 25% probability of occurrence. With this prior, absent any data other than the first case observed at time 0, the estimated mean attack size is approximately 1,090. Absent intervention, the 100 victims in this simulated attack would appear as case-patients in accord with Figure 1 (open dots). Using the methods shown in the Appendix, we report estimates of the attack size and the time of the attack based on the cumulative number of eases observed at the end of day 5 of the outbreak (where the time origin corresponds to the occurrence of the first observed case) (Table). At the end of day 1, the estimated mean attack size equals 850 (with a 95% probability interval ranging from 60 to 3,300). However, estimates approach the true value of 100 over time. Similarly, the estimates for the time of the attack improve from 1.1 days before the first case (estimated after day 1 of the outbreak) to 1.8 days before the first case is observed; the true time of attack is 1.8 days before the first case observed in the simulated outbreak (Figure 1). [FIGURE 1 OMITTED] Figures 2 and 3 illustrate the posterior distributions of the initial attack size and time of attack at the end of 5 days (when a total of 23 cases have appeared). For example, Figure 2 suggests that while the expected attack size, given the data, equals 90, initial attacks as small as 50 or as large as 150 are also plausible. Similarly, the time of the attack could have been as recent as half a day before the first case was observed, or as early as 3.5 days before the first case appeared. [FIGURES 2-3 OMITTED] Given estimates of the initial size and time of attack, one can forecast the occurrence of future cases over time, as shown in Figure 1 (solid curve), where the forecast is made on the basis of cases observed through the end of day 5 after the first case was observed. Such a short-range forecast could be helpful in determining the resources required to treat those infected in the attack, although once a widespread response to the attack is mounted (e.g., distribution of antimicrobial agents, in the case of anthrax), the forecasts lose their validity (8). The key assumptions in our model are that the probability distribution of the incubation time from infection through development of symptoms is known and that attack victims can report the times of symptom onset (so we have not explicitly accounted for reporting delay). In an actual bioterror attack, determining the incubation time distribution itself might be necessary. For example, as shown recently by Brookmeyer et al. (9), the incubation time for anthrax is dose-dependent. Thus, exposure to anthrax powders with much greater spore concentrations than evident either in Swerdlovsk or the U.S. postal attacks could lead to shorter incubation distributions. While we are investigating statistical methods for this more general problem (J.T. Wu, unpub. data), having a relatively simple tool is still helpful when the probability distribution of incubation times is presumed known and a single, point-source outbreak is suspected. Our model might also prove helpful for education and training exercises, in addition to use during an actual bioterror attack. E.H.K. was supported in part by Yale University's Center for Interdisciplinary Research on AIDS through Grant MH/DA568286 from the U.S. National Institutes of Mental Health and Drug Abuse. Appendix We seek to estimate the initial attack size and the time of the attack from observed cases of infection in real time. The case report data are the (reporting-delay corrected) times at which cases have been reported. We intend this model to be applied once an attack has been discovered and assume that the agent is noncontagious (or in the case of a contagious agent, that no secondary transmission has occurred) and that any interventions mounted (such as vaccination or the administration of antimicrobial agents) have not yet had any effect on the early case reporting data. We define [t.sub.j] to be the time at which the jth case is observed and define the origin as the time at which the first case is observed (so [t.sub.1] = 0). The unknown time from the attack until the first case is observed is denoted by A > 0 (and thus the actual date of the attack is equal to -A), while the unknown number of persons infected in the attack is denoted by N [greater than or equal to] 1. We treat A and N as random variables and assume that the attack is detected through the reporting of the first case at time [t.sub.1] = 0. At the time the attack is detected, we quantify our beliefs regarding the size of the attack by the prior probability prior probability, n the extent of belief held by a patient and practitioner in the ability of a specific therapeutic approach to produce a positive outcome before treatment begins. distribution p(n) = Pr{N = n}. Let X denote the symptom-free incubation time for the attack agent, with probability density probability density n. Statistics In both senses also called probability distribution. 1. A function whose integral over a given interval gives the probability that the values of a random variable will fall within the interval. f(x) and survivor function S(x) = Pr{X > x} = [[integral].sup.[infinity].sub.u=x] f(u)du. If n persons were actually infected in the attack, then the time from the attack until the first case is observed would equal the minimum of n independent incubation times [X.sub.j] = 1,2,...,n, thus [1] [MATHEMATICAL EXPRESSION A group of characters or symbols representing a quantity or an operation. See arithmetic expression. NOT REPRODUCIBLE IN ASCII ASCII or American Standard Code for Information Interchange, a set of codes used to represent letters, numbers, a few symbols, and control characters. Originally designed for teletype operations, it has found wide application in computers. ] Consequently, the probability that a units of time would pass before the attack would be detected by the first case equals [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] from which the conditional probability conditional probability the probability that event A occurs, given that event B has occurred. Written P(AB). density function of A given an attack of size n follows as [2] f(a\n) = nS[(a).sup.n-1] f(a) for a > 0, n = 1,2,3, ... Equation 2 implies that the joint prior distribution for the size and time of attack when the first case is observed is equal to [3] f(a,n) = p(n)xnS[(a).sup.n-1] f(a) for a > 0, n = 1,2,3, ... Now, suppose that by time [tau] an additional k - 1 cases have been observed at times tj for j = 2,3,...k. Conditional upon an attack of size n having occurred at time--a, the joint probability joint probability n. The probability that two or more specific outcomes will occur in an event. Noun 1. joint probability - the probability of two events occurring together density of the data observed (that is, the likelihood function) is given by [4] [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] where t = [t.sub.2], [t.sub.3], [t.sub.4]..., [t.sub.k]. Equation no. 4 is simply the conditional joint density of the first k - 1 order statistics In statistics, the kth order statistic of a statistical sample is equal to its kth-smallest value. Together with rank statistics, order statistics are among the most fundamental tools in non-parametric statistics and inference. observed from a sample of size n - 1, given that a time units had passed from the attack until the first case was observed at time 0, adjusted for the fact that the period of observation extends to time [tau] (1). Unconditioning the likelihood in equation no. 4 by the prior in equation no. 3 yields the joint density [5] [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] and application of Bayes rule yields the joint posterior distribution of the size and time of attack as [6] [MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] The posterior distributions of A and N are then easily obtained from equation no. 6 by summing (over n) or integrating (over a). To obtain a short-run forecast of future cases, note that conditional upon an attack of size n that occurred a time units before detection, the expected number of cases that will occur by some future time [tau]* equals n[1-S(a+[tau]*)]. Unconditioning over equation no. 6 yields a simple short-run forecast of the number of future cases expected given all of the data observed to date. An even simpler approximation is obtained by substituting the posterior expected values Expected value The weighted average of a probability distribution. Also known as the mean value. of N and A in the expression above for the expected number of future cases; we used this approach in producing the forecast shown in Figure 1. In our examples, we assume that, a priori a priori In epistemology, knowledge that is independent of all particular experiences, as opposed to a posteriori (or empirical) knowledge, which derives from experience. , the logarithm logarithm (lŏg`ərĭthəm) [Gr.,=relation number], number associated with a positive number, being the power to which a third number, called the base, must be raised in order to obtain the given positive number. of the attack size N is uniformly distributed between 0 and the logarithm of 10,000, and we approximate this distribution in a spreadsheet with 500 mass points equally spaced on the natural logarithmic scale Noun 1. logarithmic scale - scale on which actual distances from the origin are proportional to the logarithms of the corresponding scale numbers graduated table, ordered series, scale, scale of measurement - an ordered reference standard; "judging on a scale of 1 . This procedure assigns equal probabilities to four different orders of magnitude, that is, attacks that infect 1-10, 11-100, 101-1,000, or 1,001-10,000 persons are each assigned the same 25% probability of attack. The expected prior attack size associated with this distribution approximately equals 9,999/ln(10,000) = 1,090. Our examples also assume that the incubation time from infection through onset of symptoms is distributed in accord with a lognormal distribution Lognormal distribution Pattern of frequency of occurrence in which the logarithm of the variable follows a normal distribution. Lognormal distributions are used to describe returns calculated over periods of a year or more. with a median of 11 days and a dispersion of 2.04 days. This is the distribution fit to the data from the anthrax outbreak in Swerdlovsk (2). For numerical computations, this distribution is also approximated discretely within the spreadsheet. The joint prior distribution of N and A under the stated assumptions is shown in Appendix Figure 1, available online at http://www.cdc.gov/ncidod/eid/vol10no07/03-0632-appG1.htm. Appendix Figure 2, available online at http://www.cdc.gov/ncidod/eid/vol10no07/03-0632-appG2.htm, displays the joint posterior distribution for N and A after a total of 23 cases have been observed by the end of 5 days after the first case was reported (Figure 1). Results similar to those from Figures 1 to 3 and Appendix Figure 2 are obtained if the attack size itself is assumed to be uniformly distributed from 0 to 10,000 for this example, but we believe the log-uniform prior is more sensible in that the primary a priori ignorance regards the order of magnitude A change in quantity or volume as measured by the decimal point. For example, from tens to hundreds is one order of magnitude. Tens to thousands is two orders of magnitude; tens to millions is three orders of magnitude, etc. of the attack size. References (1.) David HA. Order statistics. New York New York, state, United States New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of : John Wiley John Wiley may refer to:
(2.) Brookmeyer R, Blades N, Hugh-Jones M, Henderson DA. The statistical analysis of truncated truncated adjective Shortened data: application to the Swerdlovsk anthrax outbreak. Biostatistics biostatistics /bio·sta·tis·tics/ (-stah-tis´tiks) biometry. bi·o·sta·tis·tics n. The science of statistics applied to the analysis of biological or medical data. . 2001;2:233-47.
Table. Real-time estimates of size and time of attack, given the
total cases observed in the simulated outbreak (a)
Estimated day of
Days past Estimated attack (before
case no. 1 Total cases attack size case no. 1)
1 5 850 1.1
2 7 120 1.9
3 15 160 1.4
4 18 100 1.8
5 23 90 1.8
(a) See Appendix for details.
References (1.) Kaplan EH, Craft DL, Wein LM. Emergency response to a smallpox attack: the case for mass vaccination. Proc Natl Acad Sci U S A. 2002;99:10935-40. (2.) Wein LM, Craft DL, Kaplan EH. Emergency response to an anthrax attack. Proc Natl Acad Sci U S A. 2003;100:4346-51. (3.) Brookmeyer R, Gail MH. AIDS epidemiology: a quantitative approach. New York: Oxford University Press; 1994. (4.) Brookmeyer R, Blades N, Hugh-Jones M, Henderson DA. The statistical analysis of truncated data: application to the Swerdlovsk anthrax outbreak. Biostatistics. 2001;2:233-47. (5.) Fenner F, Henderson DA, Arita I, Jezek Z, Ladnyi ID. Smallpox and its eradication. Geneva Geneva, canton and city, Switzerland Geneva (jənē`və), Fr. Genève, canton (1990 pop. 373,019), 109 sq mi (282 sq km), SW Switzerland, surrounding the southwest tip of the Lake of Geneva. : World Health Organization; 1988. (6.) Centers for Disease Control and Prevention Centers for Disease Control and Prevention (CDC), agency of the U.S. Public Health Service since 1973, with headquarters in Atlanta; it was established in 1946 as the Communicable Disease Center. . Biological agents/diseases: category A. Atlanta: The Centers; 2003. Available from: http://www.bt.cdc.gov/agent/agentlist-category.asp#a (7.) Box GEP GEP gastroenteropancreatic. , Tiao GC. Bayesian inference Bayesian inference is statistical inference in which evidence or observations are used to update or to newly infer the probability that a hypothesis may be true. The name "Bayesian" comes from the frequent use of Bayes' theorem in the inference process. in statistical analysis. Reading (MA): Addison-Wesley Pub. Co.; 1973. (8.) Brookmeyer R, Blades N. Prevention of inhalational anthrax in the U.S. outbreak. Science. 2002;295:1861. (9.) Brookmeyer R, Johnson E, Bollinger R. Modeling the optimum duration of antibiotic prophylaxis prophylaxis (prō'fĭlăk`sĭs), measures designed to prevent the occurrence of disease or its dissemination. Some examples of prophylaxis are immunization against serious diseases such as smallpox or diphtheria; quarantine to confine in an anthrax outbreak. Proc Natl Acad Sci U S A. 2003;100:10129-32. Address for correspondence: Edward Kaplan, Yale School of Management The Yale School of Management (also known as Yale SOM) is the graduate business school of Yale University and is located on Hillhouse Avenue in New Haven, Connecticut, United States. The School offers M.B.A. and Ph.D. degree programs. , and Department of Epidemiology and Public Health, Yale School of Medicine The primary teaching hospital for the school is Yale-New Haven Hospital. The school is home to the Harvey Cushing/John Hay Whitney Medical Library, one of the largest modern medical libraries, also known for its historical collections. , New Haven New Haven, city (1990 pop. 130,474), New Haven co., S Conn., a port of entry where the Quinnipiac and other small rivers enter Long Island Sound; inc. 1784. Firearms and ammunition, clocks and watches, tools, rubber and paper products, and textiles are among the many , CT 06520-8200, USA; fax: 203-432-9995; email: edward.kaplan@yale.edu Johan Walden * and Edward H. Kaplan Edward H. Kaplan is the William N. and Marie A. Beach Professor of Management Science at the Yale School of Management, Professor of Public Health at the Yale School of Medicine, and Professor of Engineering in the Yale Faculty of Engineering. * ([dagger]) * Yale School of Management, New Haven, Connecticut, USA; and ([dagger]) Yale School of Medicine, New Haven, Connecticut, USA Dr. Walden has a docentship in applied mathematics from Uppsala University Uppsala University (Swedish Uppsala universitet) is a public university in Uppsala, Sweden, 64 kilometres (40 miles) north-northwest of Stockholm.[1] Founded in 1477, it claims to be the oldest university in Scandinavia, outdating the University of Copenhagen and is currently affiliated with the International Center for Finance at the Yale School of Management. His research focuses on mathematical finance Mathematical finance is the branch of applied mathematics concerned with the financial markets. The subject has a close relationship with the discipline of financial economics, which is concerned with much of the underlying theory. , but he is also interested in the mathematical modeling of social systems, including statistical methods for modeling public health policy issues. Dr. Kaplan is the William N. and Marie A. Beach Professor of Management Sciences at the Yale School of Management and professor of public health at the Yale School of Medicine, where he directs the Methodology and Biostatistics Core of Yale's Center for Interdisciplinary Research on AIDS. His interests include the application of operations research operations research Application of scientific methods to management and administration of military, government, commercial, and industrial systems. It began during World War II in Britain when teams of scientists worked with the Royal Air Force to improve radar detection of , statistics, and mathematical modeling to public health policy problems such as HIV HIV (Human Immunodeficiency Virus), either of two closely related retroviruses that invade T-helper lymphocytes and are responsible for AIDS. There are two types of HIV: HIV-1 and HIV-2. HIV-1 is responsible for the vast majority of AIDS in the United States. prevention, and more recently, bioterror preparedness and response. |
|
||||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion