Printer Friendly
The Free Library
14,800,529 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Biologically plausible particulate air pollution mortality concentration-response functions.


In this article I introduce an alternative method for estimating particulate par·tic·u·late
adj.
Of or occurring in the form of fine particles.

n.
A particulate substance.



particulate

composed of separate particles.
 air pollution mortality concentration-response functions. This method constrains the particulate air pollution mortality concentration-response function to be biologically plausible--that is, a nondecreasing function of the particulate air pollution concentration. Using time-series data from Cook County, Illinois Cook County is a county located in the U.S. state of Illinois. As of 2000, the population was 5,376,741, making it the second largest county by population in the United States (after Los Angeles County, California), and accounting for 43. , the proposed method yields more meaningful particulate air pollution mortality concentration--response function estimates with an increase in statistical accuracy. Key words: air pollution, daily mortality, dose-response relationship The Dose-response relationship describes the change in effect on an organism caused by differing levels of exposure (or doses) to a stressor (usually a chemical). This may apply to individuals (eg: a small amount has no observable effect, a large amount is fatal), or to populations , models, particulate matter particulate matter
n. Abbr. PM
Material suspended in the air in the form of minute solid particles or liquid droplets, especially when considered as an atmospheric pollutant.

Noun 1.
.

**********

Numerous community time-series studies on the effect of particulate air pollution on mortality have been conducted (Dominici et al, 2000, 2002a; Ito et al. 1995; Samet et al. 2000; Schwartz and Zanobetti 2000; Smith et al. 1999). These studies typically fit a generalized gen·er·al·ized
adj.
1. Involving an entire organ, as when an epileptic seizure involves all parts of the brain.

2. Not specifically adapted to a particular environment or function; not specialized.

3.
 additive additive

In foods, any of various chemical substances added to produce desirable effects. Additives include such substances as artificial or natural colourings and flavourings; stabilizers, emulsifiers, and thickeners; preservatives and humectants (moisture-retainers); and
 modal Mode-oriented. A modal operation switches from one mode to another. Contrast with non-modal.

1. modal - (Of an interface) Having modes. Modeless interfaces are generally considered to be superior because the user does not have to remember which mode he is in.
2.
 (GAM) (Hastie and Tibshirani 1990) or generalized linear model Not to be confused with general linear model.
In statistics, the generalized linear model (GLM) is a useful generalization of ordinary least squares regression. It relates the random distribution of the measured variable of the experiment (the
 (GLM GLM Global Language Monitor
GLM Global Marine (stock symbol)
GLM Graduated Length Method (ski instruction)
GLM Good Looking Mom (used in pediatric practices)
GLM God Loves Me
) (McCuilagh and Nelder 1989) to time series of daily mortality, particulate air pollution, meteorological me·te·or·ol·o·gy  
n.
The science that deals with the phenomena of the atmosphere, especially weather and weather conditions.



[French météorologie, from Greek
 covariates, and other air pollutants pollutants

see environmental pollution.
. The fitted models are then used to quantify Quantify - A performance analysis tool from Pure Software.  the effect on mortality of particulate air pollution.

Typically, these community particulate air pollution mortality time-series studies assume that the effect of particulate air pollution on the logarithm logarithm (lŏg`ərĭthəm) [Gr.,=relation number], number associated with a positive number, being the power to which a third number, called the base, must be raised in order to obtain the given positive number.  of daily mortality is a linear function of the particulate air pollution concentration. Some recent studies (Daniels et al. 2000; Dominici et al. 2002a; Schwartz and Zanobetti 2000; Schwartz et al. 2001; Smith et al. 2000a, 2000b) investigate the shape of the particulate air pollution concentration-response function by modeling the city-specific concentration--response functions using piecewise linear Piecewise linear may refer to:
  • Piecewise linear function
  • Piecewise linear manifold
 or smooth functions. Schwartz (2000) looks at the possibility of thresholds in the air pollution mortality concentration-response function by limiting his analysis to days with low air pollution concentrations. A potential problem with these studies is that the resulting concentration-response functions may not be biologically plausible. I use the term "biologically plausible" to mean a concentration-response function that is a nondecreasing function of the particulate air pollution concentration. A concentration-response function that is not nondecreasing would indicate that a higher level of particulate air pollution is less harmful to health than a lower level of particulate air pollution.

In this article I introduce a new method for fitting particulate air pollution mortality concentration-response functions. This method constrains the concentration-response function to be biologically plausible. Constraining con·strain  
tr.v. con·strained, con·strain·ing, con·strains
1. To compel by physical, moral, or circumstantial force; oblige: felt constrained to object. See Synonyms at force.

2.
 the concentration-response function to be nondecreasing rules out the possibility of hormesis. However, some recent findings suggest that even low-level exposure to particulate air pollution may pose health risks [Health Effects institute The Health Effects Institute (HEI) is a non-partisan, non-profit corporation specializing in research on the health effects of air pollution. It is headquartered in Charlestown, Massachusetts, USA.  (HEI HEI Higher Education Institution (UK)
HEI Health Effects Institute
HEI Hautes Études Internationales
HEI House Ear Institute
HEI Healthy Eating Index
HEI Hautes Etudes d'Ingénieur
HEI High-Explosive Incendiary
) Perspectives 2002], suggesting that hormesis is unlikely.

A simulation study shows that constraining the concentration-response function to be biologically plausible increases the statistical estimation precision. I use data from Cook County, Illinois, to illustrate the methodology of constrained con·strain  
tr.v. con·strained, con·strain·ing, con·strains
1. To compel by physical, moral, or circumstantial force; oblige: felt constrained to object. See Synonyms at force.

2.
 versus unconstrained concentration-response function estimation.

Data and Methods

Data. The data used to illustrate the methods of this article are concurrent daily time series of mortality, weather, and particulate air pollution from Cook County for 1987-1994. These are the same Cook County data used by Samet et al. (2000).

The mortality time series are nonaccidental daily deaths of individuals [greater than or equal to] 65 years old. The mortality time-series data are available from the National Center for Health Statistics National Center for Health Statistics (NCHS) is part of the Centers for Disease Control and Prevention (CDC), which is part of the United States Department of Health and Human Services.

NCHS is the United States' principal health statistics agency.
 (Hyattsville, MD). The weather time series are the daily 24-hr mean temperature and mean dew point dew point: see dew.  temperature. The temperature and dew point temperature are 24-hr averages of hourly observations taken at O'Hare International Airport O'Hare International Airport is an airport located in Chicago, Illinois, United States, 17 miles (27 km) northwest of the Chicago Loop. It is the largest hub of United Airlines (whose headquarters is in downtown Chicago) and the second-largest hub of American Airlines (after  (Chicago, IL). The weather time-series data are available from the Earth Info (2003) database.

The air pollution time series is based on the ambient Surrounding. For example, ambient temperature and humidity are atmospheric conditions that exist at the moment. See ambient lighting.  24-hr concentration of particulate matter < 10 [mu]m in diameter ([PM.sub.10]). The [PM.sub.10] data are available from the U.S. Environmental Protection Agency Environmental Protection Agency (EPA), independent agency of the U.S. government, with headquarters in Washington, D.C. It was established in 1970 to reduce and control air and water pollution, noise pollution, and radiation and to ensure the safe handling and  (2004) Aerometric Information Retrieval information retrieval

Recovery of information, especially in a database stored in a computer. Two main approaches are matching words in the query against the database index (keyword searching) and traversing the database using hypertext or hypermedia links.
 System (AIRS; now Air Quality System Database). The [PM.sub.10] time series used here is the average of the [PM.sub.10] concentration of the current day and [PM.sub.10] concentration of the previous day. If either the current or previous day's [PM.sub.10] concentration was missing, the average was set equal to the nonmissing value.

The largest 1% of [PM.sub.10] concentrations was removed from the study to avoid the potential for these points to have an undue influence on the estimated concentration-response functions. This leaves 2,838 days of data for Cook County, with a maximum [PM.sub.10] concentration of 89 [mu]g/[m.sup.3].

Methods. In many community time-series studies on the effect of [PM.sub.10] on mortality, an additive Poisson log-linear model log-linear model

a statistical model which models frequency counts in contingency tables by using an analysis of variance approach.
 is fit to the time series of observed mortality. Under this model, the daily mortality counts are modeled as independent Poisson random variables with a time varying mean log([[mu].sub.t]), where

[1] log([[mu].sub.t]) = [mu] + [confounders.sub.t] + [beta][PM.sub.10].

Here, the subscript (1) In word processing and scientific notation, a digit or symbol that appears below the line; for example, H2O, the symbol for water. Contrast with superscript.

(2) In programming, a method for referencing data in a table.
 t refers to the day of the study; [[mu].sub.t] is the mean number of deaths on day t; [mu] is the intercept intercept

in mathematical terms the points at which a curve cuts the two axes of a graph.
 term; [confounders.sub.t] represents other time-varying variables related to daily mortality. Typical confounders include temperature, humidity humidity, moisture content of the atmosphere, a primary element of climate. Humidity measurements include absolute humidity, the mass of water vapor per unit volume of natural air; relative humidity (usually meant when the term humidity , longer-term mortality trends, and seasonality; [PM.sub.t] is the time series of [PM.sub.10] concentrations; and [beta] is the effect of [PM.sub.10] on mortality. It gives the increase in log([[mu].sub.t]) per unit increase in [PM.sub.10].

I allow the effect of [PM.sub.10] on the logarithm of daily mortality to be a piecewise linear function In mathematics, a piecewise linear function

,


where V is a vector space and
 with one or two change points. At the location of the change points, the slope of the piecewise linear concentration-response function is allowed to change. The difference between this study and other studies is that the piecewise linear functions are constrained to ensure the resulting concentration-response function is biologically plausible.

Piecewise linear functions are commonly used to model the [PM.sub.10] mortality concentration-response function. These are useful for investigating the possibility of thresholds and other nonlinearities in the concentration-response function. This could yield important information about the effect of particulate air pollution on mortality and be helpful to regulators. For example, a piecewise linear function with one change point allows for the following forms of the concentration-response function:

* Zero slope before the change point and positive slope after the change point. This would indicate that there is a threshold below which [PM.sub.10] does not affect mortality.

* Positive slope before the change point and zero slope after the change point. This would indicate that there is a saturation point saturation point
n.
1. Chemistry The point at which a substance will receive no more of another substance in solution.

2. The point at which no more can be absorbed or assimilated.
 above which there is no incremental Additional or increased growth, bulk, quantity, number, or value; enlarged.

Incremental cost is additional or increased cost of an item or service apart from its actual cost.
 effect of [PM.sub.10] on mortality.

* Positive slope before the change point and smaller positive slope after the change point. This would indicate that there is a point above which the incrementall effect of [PM.sub.10] on mortality is reduced.

* Positive slope before the change point and larger positive slope after the change point. This would indicate that there is a point above which the incremental effect of [PM.sub.10] on mortality is increased.

A piecewise linear function with two change points allows even more flexibility. For example, it can approximate the logistic lo·gis·tic   also lo·gis·ti·cal
adj.
1. Of or relating to symbolic logic.

2. Of or relating to logistics.



[Medieval Latin logisticus, of calculation
 curve that has been suggested as a plausible shape for the particulate air pollution concentration-response function (Schwartz and Zanobetti 2000).

To allow for more general forms of the [PM.sub.10] concentration-response function, model 1 may be extended:

[2] log([[mu].sub.t]) = [mu] + [confounders.sub.t] + g([PM.sub.t]).

Where g([PM.sub.t]) is some function of [PM.sub.10], two forms of g([PM.sub.t]) will be used in this article:

* Piecewise linear function with one change point:

[MATHEMATICAL EXPRESSION A group of characters or symbols representing a quantity or an operation. See arithmetic expression.  NOT REPRODUCIBLE IN ASCII ASCII or American Standard Code for Information Interchange, a set of codes used to represent letters, numbers, a few symbols, and control characters. Originally designed for teletype operations, it has found wide application in computers. ]

[[beta].sub.1] and [[beta].sub.2] are constrained to be non-negative. These constraints ensure that the piecewise linear function is nondecreasing, [theta Theta

A measure of the rate of decline in the value of an option due to the passage of time. Theta can also be referred to as the time decay on the value of an option. If everything is held constant, then the option will lose value as time moves closer to the maturity of the option.
] is the change point and gives the [PM.sub.10] concentration at which the slope of the piecewise linear concentration-response function is allowed to change. This form of g([PM.sub.t]) is referred to as C1. This model fit without constraints on [[beta].sub.1] and [[beta].sub.2] is referred to as UC1.

* Piecewise linear function with two change points:

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII]

[[beta].sub.] [[beta].sub.2], and [beta].sub.3] are constrained to be non-negative. These constraints ensure that the piecewise linear function is nondecreasing. [[theta].sub.1] and [[theta].sub.2] are the change points and give the [PM.sub.10] concentrations at which the slope of the piecewise linear concentration-response function is allowed to change. This form of g([PM.sub.t]) is referred to as C2. This model fit without constraints on [[beta].sub.1], [[beta].sub.2], and [[beta].sub.3] is referred to as UC2.

Other forms of the concentration-response function such as smooth functions represented by cubic or natural cubic splines could also be considered, Additional constraints would be imposed on the parameters to ensure that the estimated concentration-response function is nondecreasing.

Once the form of g([PM.sub.t]) is chosen, the model parameters are fit using a constrained maximum likelihood. I obtained the constrained maximum likelihood estimates iteratively in two repeated steps. The first step fixes the parameters corresponding to the [PM.sub.10] terms at their current values and estimates the parameters corresponding to the confounders. This step can be performed using the GLM function available in S-Plus (Insightful, Seattle, WA, USA). The GLM function allows the parameters corresponding to the confounders to be estimated while keeping the [PM.sub.10] parameters fixed at their current values. The second step fixes the parameters corresponding to the confounders at their current values and estimates the parameters corresponding to the [PM.sub.10] terms. This step can be performed using constrained optimization software Free and Open Source software
  • ASCEND — mathematical modelling system
  • OpenOpt (license: BSD) — toolbox with connections to lots of solvers, for Python language programmers
  • COIN-OR SYMPHONY — integer programming, Common Public License
. These two steps are iterated to obtain the final parameter estimates. GLM estimation was chosen to avoid the recent convergence criteria This is an article about European politics, Convergence criteria is also a mathematical term regarding series.

Convergence criteria (also known as the Maastricht criteria) are the criteria for European Union member states to enter the third stage of European Economic and
 problems associated with the S-Plus GAM function (Dominici et al. 2002b).

Simulation Study

I use simulations to explore the bias and mean squared error In statistics, the mean squared error or MSE of an estimator is the expected value of the square of the "error." The error is the amount by which the estimator differs from the quantity to be estimated.  properties of the constrained, or biologically plausible, concentration-response functions compared with unconstrained concentration-response functions. Mortality time series are generated using two forms of the [PM.sub.10] concentration-response function:

* Null A character that is all 0 bits. Also written as "NUL," it is the first character in the ASCII and EBCDIC data codes. In hex, it displays and prints as 00; in decimal, it may appear as a single zero in a chart of codes, but displays and prints as a blank space.  concentration-response. For the null concentration-response, [PM.sub.10] has no effect on mortality. In this situation, the log mean of" the simulated Poisson mortality time series is given by log([[mu].sub.t]) = log(83). The average daily mortality in Cook County was 83.

* Dual change-point concentration-response. This is the concentration-response found flora fitting model C2 to the actual Cook County data (see "Application"). In this situation, the log mean of the simulated Poisson mortality time series is given by log([[mu].sub.t]) = log(83) + f([PM.sub.t]), where f([PM.sub.t]) is the estimated C2 function for Cook County.

For simplicity, confounders were not

included in the simulations. Using the observed Cook County [PM.sub.10] time series for both the null and dual change-point concentration response, 400 mortality time series were generated. Models C2 and UC2 were estimated from the simulated mortality time series. This was done by fitting the Poisson log linear models log([[mu].sub.]) = [mu] + g([PM.sub.t],C2) and log([[mu].sub.t]) = [mu] + g([PM.sub.t],UC2) to each simulated mortality time series, where g([PM.sub.t]C2) and g([PM.sub.t]UC2) correspond to models C2 and UC2, respectively.

Figure 1 gives the results of the simulations where models C2 and UC2 were used to estimate the null concentration response. The curve corresponding to the mean estimate indicates that the bias of model C2 is modest. The curves corresponding to the 97.5 and 2.5 percentiles indicate that model C2 has a lower estimation variance than model UC2. The root mean squared error (RMSE RMSE Root Mean Square Error
RMSE Root Mean Squared Error
) was used to compare the statistical accuracy of the two methods. Figure 1B gives the RMSE of the 400 concentration-response function estimates at each [PM.sub.10] concentration. The RMSE for model C2 is always lower than the RMSE for model UC2.

[FIGURE 1 OMITTED]

Figure 2 gives the results of the simulations where models C2 and UC2 were used to estimate the dual change-point concentration response. The curve corresponding to the mean estimate for model C2 showed little bias (the mean estimate for model UC2 is an unbiased estimate of the dual change-point concentration response). The curves corresponding to the 97.5 and 2.5 percentiles indicate that model C2 has a lower estimation variance than model UC2. The RMSE for model C2 was again lower than for model UC2.

[FIGURE 2 OMITTED]

The simulations showed that the bias of the constrained concentration-response functions was not large and that the bias is a small part of the RMSE. The bias of the constrained concentration-response functions is more than compensated for by a decrease in estimation variance. The smaller RMSE for the constrained concentration-response functions indicates that they offer an improvement in statistical accuracy over unconstrained concentration-response functions. The constrained concentration-response functions have the added advantage of giving biologically plausible concentration-response function estimates.

Application

In this section the Cook County data illustrate the methodology of constrained versus unconstrained concentration-response function estimation. Using data from only one city is adequate for this purpose.

A Poisson log-linear model similar to those used in Dominici et al. (2002b) was fit to the data:

[3] log([mu]t) = [mu] + [S.sub.t1](time,8/year) + [S.sub.t2]([temp.sub.0],6) + [S.sub.t3]([temp.sub.1-3],6) + [S.sub.t4]([dew.sub.0],3) + [S.sub.t5]([dew.sub.1-3],3) + [gamma][DOW.sub.t] + g([PM.sub.t]).

Here,

* The subscript t refers to the day of the study

* [[mu].sub.t], is the mean number of deaths on day t

* [[mu].sub.t] is the intercept term

* [S.sub.t1] (time,8/year) is a smooth function of time with 8 degrees of freedom per year

* [S.sub.t2]([temp.sub.0],6) and [S.sub.t3]([temp.sub.1-3],6) are smooth functions with a total of 6 degrees of freedom. [temp.sub.0] is the current day's mean 24-hr temperature, and [temp.sub.1-3] is the average of the previous 3 days' 24-hr mean temperatures. [S.sub.t4]([dew.sub.0],3) and [S.sub.t5]([dew.sub.1-3],3) are similar functions for the 24-hr mean dew point temperature

* [DOW.sub.t] is a set of indicator variables for the day of the week

* [gamma] is a vector of coefficients that contains the mortality adjustments for the day of the week

* g([PM.sub.t]) is the [PM.sub.10] concentration-response function.

The smooth functions, [S.sub.t1](), were represented using natural cubic splines. The use of GLMs and natural cubic splines was one of the methods used to adjust for confounders in the Revised Analyses of the National Morbidity morbidity /mor·bid·i·ty/ (mor-bid´it-e)
1. a diseased condition or state.

2. the incidence or prevalence of a disease or of all diseases in a population.


mor·bid·i·ty
n.
, Mortality, and Air Pollution Study (NMMAPS NMMAPS National Morbidity, Mortality, and Air Pollution Study ), Part II (Health Effects Institute 2003). Another common method to adjust for confounders is the use of GAMs, with smoothing splines (Green and Silverman 1994) used to represent the smooth functions [S.sub.ti](). In light of the revised NMMAPS study, the GAM models are now fit with stricter convergence criteria. GLM estimation with natural cubic splines is essentially a fully parametric version of GAM estimation with smoothing splines. Currently, there is no firm evidence that favors the use of GLM or GAM estimation (Health Effects Institute 2003).

Single change-point models C1 and UC1 were fit to the Cook County data using a sequence of change-point values ([theta]) ranging from 5 to 90 [mu]g/[m.sup.3], using increments of 5 [mu]g/[m.sup.3], As the maximum [PM.sub.10] concentration is 89 [mu]g/[m.sup.3], fitting model C1 and UC1 with a change point of 90 mu]g/[m.sup.3] is identical to fitting a constrained linear or unconstrained linear concentration-response function, respectively. For model C1 the optimal change-point value for Cook County occurs at 60 [mu]g/[m.sup.3]. For model UC1 the optimal change-point value for Cook County occurs at 65 [mu]g/[m.sup.3].

A check of the parameter estimates obtained from the constrained iterative it·er·a·tive  
adj.
1. Characterized by or involving repetition, recurrence, reiteration, or repetitiousness.

2. Grammar Frequentative.

Noun 1.
 estimation procedure is possible using only the S-Plus GLM function. This can be done by creating two new [PM.sub.10] time series PML PML - Parallel ML.

["Synchronous Operations as First-Class Values", J.H. Reppy <jhr@research.att.com>, Proc SIGPLAN 88 Conf Prog Lang Design and Impl, June 1988, pp. 250-259].
 and PMU PMU Project Management Unit
PMU Power Management Unit
Pmu Pasteurella multocida
PMU Pregnant Mare Urine (aka premarin)
PMU Pick Me Up
PMU Purdue Memorial Union (Purdue University) 
. PML is equal to [PM.sub.10] if [PM.sub.10] is < [theta] and [theta] otherwise, and PMU is equal to 0 if [PM.sub.10] is < [theta] and ([PM.sub.10]-[theta]) otherwise. The parameters from the GLM fit corresponding to PML and PMU give the slope of the concentration-response function before and after the change point, respectively. For example, to check the estimate of the constrained iterative estimation procedure for [theta] = 60, the GLM model can be fit with PML being the only [PM.sub.10] term included in the model. Not including PMU in the model corresponds to the estimated concentration-response function having zero slope after the change point. If the iterative estimation procedure is giving appropriate estimates, the [PM.sub.10] parameter estimates from the two methods should be the same. This check was used for each value of the change point [theta]. In each case the [PM.sub.10] parameter estimates from the two methods agreed.

Figure 3 contains a plot of the estimated C1 function for Cook County and a 95% pointwise confidence interval confidence interval,
n a statistical device used to determine the range within which an acceptable datum would fall. Confidence intervals are usually expressed in percentages, typically 95% or 99%.
. The estimated UC1 function is included for comparison. Model C1 says that the effect of [PM.sub.10] on mortality increases until the change point of 60 [mu]g/[m.sup.3], but that there is no incremental effect beyond 60 [mu]g/[m.sup.3]. If an actual concentration-response function were shaped like the estimated UC1 function, the effect of [PM.sub.10] on mortality would decrease for concentrations above 65 [mu]g/[m.sup.3]. This is not biologically plausible.

[FIGURE 3 OMITTED]

Confidence intervals in Figure 3 were produced via parametric bootstrap See boot.

(operating system, compiler) bootstrap - To load and initialise the operating system on a computer. Normally abbreviated to "boot". From the curious expression "to pull oneself up by one's bootstraps", one of the legendary feats of Baron von Munchhausen.
. Using the estimated [[beta].sub.1], [[beta].sub.2], and [theta] values from model C1, 100 new mortality time series were generated according to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 the Poisson model [Equation 31. Model C1 with [theta] fixed at 60 [mu]g/[m.sup.3] was then fit to each of the generated mortality time series. At each [PM.sub.10] concentration, the 95% confidence interval corresponds to the 97.5 and 2.5 percentiles of the 100 estimated C1 concentration-response functions.

Next, the dual change-point models C2 and UC2 were fit to the Cook County data with [[theta].sub.1] fixed at 25 [mu]g/[m.sup.3] and [[theta].sub.2] fixed at 50 [mu]g/[m.sup.3]. The location of these change points is similar to the knot knot

In cording, the interlacement of parts of one or more ropes, cords, or other pliable materials, commonly used to bind objects together. Knots have existed from the time humans first used vines and cordlike fibers to bind stone heads to wood in primitive axes, and were
 location used by Daniels et al. (2000). The use of two change points allows extra flexibility in the C2 and UC2 concentration-response functions compared with C1 and UC1. The check described above was performed on the constrained iterative estimation procedure for model C2 estimation. Once again the [PM.sub.10] parameter estimates from the two methods agreed. Figure 4 contains a plot of the estimated C2 function for Cook County and a 95% pointwise confidence interval. The confidence intervals, as for model C1, were produced via parametric bootstrap. Model C2 says that [PM.sub.10] has no effect on mortality until the first change point of 25 [mu]g/[m.sup.3] (an actual threshold), has an increasing effect for concentrations between 25 and 50 [mu]g/[m.sup.3], and has no incremental effect for concentrations beyond 50 [mu]g/[m.sup.3]. If an actual concentration-response function were shaped like the estimated UC2 function (dotted), it would say that the effect of [PM.sub.10] on mortality is beneficial at some [PM.sub.10] levels and that higher concentrations of [PM.sub.10] can be less harmful than lower concentrations of [PM.sub.10]. This is not biologically plausible.

[FIGURE 3 OMITTED]

The above analyses were repeated using 4 instead of 8 degrees of freedom per year for the smooth function of time. Fewer degrees of freedom for the smooth function of time correspond to less aggressive control of time-related confounders such as seasonality. The results of using 4 degrees of freedom were similar to those obtained using 8 degrees of freedom: The shape of the estimated concentration-response functions were similar.

The shape of the estimated C1 and C2 concentration-response functions could have public health implications. The estimated C1 concentration-response function suggests that a [PM.sub.10] standard would be inefficient unless it were set below the change point of 60 [mu]g/[m.sup.3]. The reason for this is that according to the estimated C1 concentration-response function, [PM.sub.10] does not have an incremental effect on mortality beyond 60 [mu]g/[m.sup.3]. Similarly, the estimated C2 concentration-response function suggests that a [PM.sub.10] standard will reduce [PM.sub.10]-induced mortality only if it reduces [PM.sub.10] concentrations in the range between 25 and 50 [mu]g/[m.sup.3].

The estimated UC1 and UC2 concentration-response functions were both biologically implausible im·plau·si·ble  
adj.
Difficult to believe; not plausible.



im·plausi·bil
. This is not an uncommon occurrence. In many studies that examine the shape of the concentration-response function, the estimated concentration-response functions are decreasing in certain air pollution ranges and/or indicate that air pollution at certain concentrations can be beneficial to health (Daniels et al. 2000; Dominici et al. 2002a; Moolgavkas and Luebeck 1996; Smith et al. 2000b). One might ask whether I am overinterpreting the negative slopes of the unconstrained functions UC1 and UC2: Are the negatives slopes actually statistically different from zero? Although the question of whether the negative slopes are statistically significant is not of particular concern, the reason for using constrained concentration-response functions is to preclude pre·clude  
tr.v. pre·clud·ed, pre·clud·ing, pre·cludes
1. To make impossible, as by action taken in advance; prevent. See Synonyms at prevent.

2.
 concentration-response functions that are believed implausible, irrespective of irrespective of
prep.
Without consideration of; regardless of.

irrespective of
preposition despite 
 statistical significance.

Table 1 contains p-values for change in deviance Conspicuous dissimilarity with, or variation from, customarily acceptable behavior.

Deviance implies a lack of compliance to societal norms, such as by engaging in activities that are frowned upon by society and frequently have legal sanctions as well, for example, the
 tests. The tests compare the fitted concentration-response functions with a null model of no [PM.sub.10] concentration-response function and a linear [PM.sub.10] concentration-response function. The linear concentration-response function fir to the Cook County data has a positive slope, so the constrained and unconstrained versions of the linear concentration-response function are the same. The "Test" column gives the forms of the concentration-response functions being compared. "Null" indicates a model that does not contain a [PM.sub.10] concentration-response function. "Linear" indicates a model that contains a linear [PM.sub.10] concentration-response function. The "p-value constrained" column contains the p-values of the change in deviance test for the constrained forms of the concentration-response functions (C1 and C2). The "p-value unconstrained" column contains the p-values for the unconstrained forms of the concentration-response functions (UC1 and UC2). The p-values would not be small enough to reject the linear concentration-response function at conventional significance levels. Of the two constrained concentration-response functions, model C2 gave the best fit to the data, as measured by the likelihood value. Because models C1 and C2 use the same number of parameters (C1 uses a parameter for estimation of the change-point location), this comparison suggests that model C2 is the preferred constrained concentration-response function.

It is possible that a biologically implausible fitted concentration-response function will show a statistically significant improvement over a linear concentration-response function that is nondecreasing (biologically plausible). For example, if we fit an unconstrained single change-point piecewise linear concentration-response function to Cook County with the change point fixed at 65 [mu]g/[m.sup.3], this would result in the biologically implausible concentration-response function of Figure 3 (dotted line), which shows mortality decreasing markedly for concentrations > 65 [mu]g/[m.sup.3]. The p-value for the test of whether this concentration-response function is an improvement over a linear concentration-response function is 0.03, suggesting that the biologically implausible concentration-response function is an improvement over a biologically plausible linear concentration-response function. However, rejecting the linear concentration-response function in favor of a biologically implausible nonlinear A system in which the output is not a uniform relationship to the input.

nonlinear - (Scientific computation) A property of a system whose output is not proportional to its input.
 concentration-response function raises problems of interpretation that could be avoided by the use of biologically plausible nonlinear concentration-response functions. These potential anomalies illustrate the importance of having sufficiently flexible parameterizations of biologically plausible concentration-response functions.

Discussion and Conclusion

In this article I introduce a method for fitting [PM.sub.10] mortality concentration-response functions that constrains the concentration-response functions to be biologically plausible, i.e., a nondecreasing function of the [PM.sub.10] concentration. Simulations showed an increase in statistical estimation precision that results from constraining the concentration-response function estimates to be biologically plausible. Biases in the constrained concentration-response function are small and are more than compensated for by a decrease in estimation variance. This was the case whether of not [PM.sub.10] was assumed to have an effect on mortality.

The use of constrained concentration-response functions in contrast to unconstrained concentration-response functions was illustrated using data from Cook County. The estimated unconstrained concentration-response functions were biologically implausible. It is difficult to understand and apply unconstrained concentration-response functions because increases in [PM.sub.10] could imply decreases in mortality, or [PM.sub.10] could be beneficial to health. The use of biologically plausible concentration-response functions avoids these problems of interpretation without introducing material bias. However, it is important to keep in mind that a concentration-response function nor biologically plausible may indicate a more serious problem, such as inadequate adjustment for covariates. In such instances further investigation into the form of the model being used may be required.

The constrained concentration-response functions fit to the Cook County data did not provide evidence against a linear concentration-response function, and similar results have been reported in previous studies (Dominici et al. 2002a; Schwartz and Zanobetti 2000). Smith et al. (2000a) did not find statistically significant evidence in favor of nonlinear concentration-response functions. However, they question whether testing a nonlinear versus a linear concentration-response function is the appropriate formulation. Schwartz and Zanobetti (2000) also present an argument as to why the concentration-response relationship may look like a logistic curve. Other studies have shown evidence in favor of nonlinear particulate air pollution concentration-response functions (Daniels et al. 2000; Smith et al. 2000b).

The fact that for the Cook County data the constrained concentration-response functions were not statistically different from a linear concentration-response function should not affect the desirability of using constrained concentration-response functions. Constrained concentration-response functions are more accurate than unconstrained concentration-response functions, have low bias, and are biologically plausible.
Table 1. Comparing the fitted concentration--response
functions using the change in deviance test.
                                p-Value
Test                    Constrained   Unconstrained

Linear versus null          0.08          0.08
C1, UC1 versus null         0.18          0.05
C1, UC1 versus linear       0.40          0.10
C2, UC2 versus null         0.10          0.07
C2, UC2 versus linear       0.21          0.14


I thank P. Switzer for his comments and S. Zeger for the combined Cook County mortality and meteorological data Meteorological facts pertaining to the atmosphere, such as wind, temperature, air density, and other phenomena that affect military operations.  used in this paper. The author was partially supported by a Stanford Graduate Fellowship. The author declares he has no competing financial interests.

REFERENCES

Daniels MJ, Dominici F, Samet JM, Zeger SL. 2000. Estimating particulate matter-mortality dose-response curves dose-response curve A graphic representation of the effects that varous doses of an agent–eg, ionizing radiation or a chemotherapeutic agent, have on a given parameter–eg, cell viability, mutation frequency, DNA damage, tumor growth or metastasis or  and threshold levels Noun 1. threshold level - the intensity level that is just barely perceptible
intensity, intensity level, strength - the amount of energy transmitted (as by acoustic or electromagnetic radiation); "he adjusted the intensity of the sound"; "they measured the
: an analysis of daily time-series for the 20 largest US cities. Am J Epidemiol 152:397-406.

Dominici F, Daniels M, Zeger SL, Samet JM. 2002a. Air pollution and mortality: estimating regional and national dose-response relationships. J Am Stat Assoc 97:105-111,

Dominici F, McDermott A, Zeger SL, Samet JM. 2002b. On the use of generalized additive models In statistics, the generalized additive model (or GAM) is a statistical model developed by Trevor Hastie and Rob Tibshirani blending properties of multiple regression (a special case of general linear model) with additive models.  in time-series studies of air pollution and health, Am J Epidemiol 156:193-203.

Dominici F, Samet JM, Zeger SL. 2000. Combining evidence on air pollution and daily mortality from the 20 largest US cities: a hierarchical modelling strategy. J R Stat Soc Ser A 163:263-302.

Earth Info Environmental Databases. 2003. Available: http://earthinfo.com/databases/databases.htm [accessed 30 January 2004].

Green PJ, Silverman BW. 1994. Nonparametric Regression Nonparametric regression is a form of regression analysis in which the predictor does not take a predetermined form but is constructed according to information derived from the data.  and Generalized Linear Models: A Roughness Penalty Approach. London:Chapman and Hall Chapman and Hall was a British publishing house, founded in the first half of the 19th century by Edward Chapman and William Hall. Upon Hall's death in 1847, Chapman's cousin Frederic Chapman became partner in the company, of which he became sole manager upon the retirement of .

Hastie TJ, Tibshirani RJ. 1990. Generalized Additive Models. London:Chapman and Hall.

Health Effects Institute. 2003. Special Report: Revised Analyses of Time-Series Studies of Air Pollution and Health. Cambridge, MA:Health Effects Institute.

HEI Perspectives. 2002. Understanding the Health Effects of Components of the Particulate Matter Mix: Progress and Next Steps. Cambridge, MA:Health Effects Institute.

Ito K, Kinney PL, Thurston GD. 1995. Variations in PM-10 concentrations within two metropolitan areas and their implications for health effects analyses. Inhal Toxicol 7:235-745.

McCullagh P, Nelder JA. 1989 Generalized Linear Models. 2nd ad. London:Chapman and Hall.

Moolgavkar SH, Luebeek ED. 1996. A critical review of the evidence on particulate air pollution and mortality. Epidemiology epidemiology, field of medicine concerned with the study of epidemics, outbreaks of disease that affect large numbers of people. Epidemiologists, using sophisticated statistical analyses, field investigations, and complex laboratory techniques, investigate the cause  7:420428.

Pickle pickle, general term for fruits or vegetables preserved in vinegar or brine, usually with spices or sugar or both. Vegetables commonly pickled include the beet, cabbage, cauliflower, cucumber, olive, onion, pepper, and tomato.  LW, Mungiole M, Jones DK, White A. Atlas of United States United States, officially United States of America, republic (2005 est. pop. 295,734,000), 3,539,227 sq mi (9,166,598 sq km), North America. The United States is the world's third largest country in population and the fourth largest country in area.  Mortality. Hyattsville, MD:National Center for Health Statistics, 1996.

Samet JM, Dominici F, Curriero FC, Coursac I, Zeger SL. 2000, Fina particulate air pollution and mortality in 20 U.S. cities, 1987-1994. N Engl J Med 343:174-1749.

Schwartz J. 2000. Assessing confounding confounding

when the effects of two, or more, processes on results cannot be separated, the results are said to be confounded, a cause of bias in disease studies.


confounding factor
, effect modification effect modification Epidemiology An interaction among multiple possible cause-and-effect relationships, where the estimate of the effect of one factor on a disease process depends on other factors in the study , and thresholds in the association between ambient particles and daily deaths. Environ en·vi·ron  
tr.v. en·vi·roned, en·vi·ron·ing, en·vi·rons
To encircle; surround. See Synonyms at surround.



[Middle English envirounen, from Old French environner
 Health Perspect 108:563-568.

Schwartz J, Ballester F, Saez M, Perez-Hoyos S, Bellido J, Cambra K, et al. 2001. The concentration-response relation between air pollution and daily deaths. Environ Health Perspect 109:1001 1006.

Schwartz J, Zanobetti A. 2000. Using meta-smoothing to estimate dose-response trends across multiple studies, with application to air pollution and daily death. Epidemiology 11:666-672.

Smith RL, Davis JM, Sacks J, Speekman P, Styer P. 2000a. Regression models for air pollution and daily mortality: analysis of data from Birmingham, Alabama Birmingham (pronounced [ˈbɝmɪŋˌhæm]) is the largest city in the U.S. state of Alabama and is the county seat of Jefferson County. . Environmetrics 11:715-743.

Smith RL, Davis JM, Speckman P. 1999. Human health effects of environmental pollution in the atmosphere. In: Statistics for the Environment 4: Statistical Aspects of Health and the Environment (Barnett V, Stein A, Turkman F, eds). Chichester, UK:John Wiley John Wiley may refer to:
  • John Wiley & Sons, publishing company
  • John C. Wiley, American ambassador
  • John D. Wiley, Chancellor of the University of Wisconsin-Madison
  • John M. Wiley (1846–1912), U.S.
 and Sons, 91-116.

Smith RL, Spitzner D, Kim Y, Fuentes M. 20006. Threshold dependence of mortality effects for fine and coarse particles in Phoenix, Arizona Phoenix /ˈfiːˌnɪks/ (English: Phoenix, Navajo: Hoozdo, lit. "the place is hot", Western Apache: Fiinigis) is the capital and the most populous city of the U.S. . J Air Waste Manag Assoc 50:1367-1379.

U.S. Environmental Protection Agency. 21304. AirData: Access to Air Pollution Data, Available; http://www.epa.gov/air/data/aqsdb.html [accessed 30 January 2004].

Steven Roberts

School of Finance and Applied Statistics, Australian National University Australian National University, located in Canberra and state-sponsored, founded 1946 as Australia's only completely research-oriented university. Originally limited to graduate studies, it expanded in 1960, merging with Canberra University College (est. 1929). , Canberra, Australia

Address correspondence to S. Roberts, School of Finance and Applied Statistics, Australian National University, Crisp Building The Crisp Building is a historic site in Sarasota, Florida. It is located at 1970 Main Street. On April 21, 2000, it was added to the U.S. National Register of Historic Places.  026, Canberra ACT 0200, Australia. Telephone: 61 2 612 54508. Fax: 61 2 612 50087. E-mail: spr@stanfordalumni.org

Received 30 April 2003; accepted 24 November 2003.
COPYRIGHT 2004 National Institute of Environmental Health Sciences
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2004, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Research
Author:Roberts, Steven
Publication:Environmental Health Perspectives
Date:Mar 1, 2004
Words:5068
Previous Article:Reductions in blood lead overestimate reductions in brain lead after repeated succimer regimens in a rodent model of childhood lead...
Next Article:Disparities in cognitive functioning by race/ethnicity in the Baltimore Memory Study.(Research)



Related Articles
The concentration-response relation between air pollution and daily deaths. (Articles).
Assessing the public health benefits of reduced ozone concentrations. (Articles).
Effects of air pollutants on acute stroke mortality. (Articles).
The concentration--response relation between P[M.sub.2.5] and daily deaths. (Articles).
Air pollution and daily mortality in a city with low levels of pollution. (Research).
Fellowships, grants, & awards. (Announcements).
The association of daily diabetes mortality and outdoor air pollution in Shanghai, China.(International Perspectives)
Air pollution is a serious cardiovascular risk.(EH Update)
Using moving total mortality counts to obtain improved estimates for the effect of air pollution on mortality.(Research)
Ozone and daily mortality in Shanghai, China.(Research)

Terms of use | Copyright © 2010 Farlex, Inc. | Feedback | For webmasters | Submit articles