Confounding in air pollution epidemiology: when does two-stage regression identify the problem? (Commentary).A two-stage approach has recently been proposed to assess confounding confounding when the effects of two, or more, processes on results cannot be separated, the results are said to be confounded, a cause of bias in disease studies. confounding factor by copollutans or other variables in time--series epidemiology studies for airborne particulate matter particulate matter n. Abbr. PM Material suspended in the air in the form of minute solid particles or liquid droplets, especially when considered as an atmospheric pollutant. Noun 1. (PM), using independent series from different cities. In the first stage of the proposed method, two regression models are fitted for each city in the analysis. The first relates the health effect to the putative causal variable such as PM without including any copollutant or confounder con·found tr.v. con·found·ed, con·found·ing, con·founds 1. To cause to become confused or perplexed. See Synonyms at puzzle. 2. . The other first-stage model relates a putative confounding variable A confounding variable (also confounding factor, lurking variable, a confound, or confounder) is an extraneous variable in a statistical or research model that should have been experimentally controlled, but was not. to PM. In the second stage of the analysis, the estimated city-specific regression slopes for the health-effect-versus-PM model are regressed against the estimated city-specific regression slopes for the confounder-versus-PM model. Under the proposed method, a nonzero non·ze·ro adj. Not equal to zero. nonzero Not equal to zero. intercept estimate in the second-stage regression would be interpreted as indicating a direct pathway from PM to the health effect, and a nonzero slope estimate would be interpreted as indicating at least partial confounding of PM with the putative confounder. A simple counterexample coun·ter·ex·am·ple n. An example that refutes or disproves a hypothesis, proposition, or theorem. Noun 1. counterexample - refutation by example using an additional copollutant variable shows that inferences based on this method could be misleading. Key word: air pollution epidemiology, confounding, copollutants, model misspecification, multicollinearity, particulate matter epidemiology, two-stage regression. Environ Health Perspect 109:1193-1196 (2001). [Online 7 November 2001] http://ehpnet1.niehs.nih.gov/docs/2001/109p1193-1196marcus/ abstract.html ********** In air pollution epidemiology studies, evaluating modeled associations between individual pollutants pollutants see environmental pollution. and the health outcome of concern is often complicated by multicollinearity among the measured pollutant pol·lut·ant n. Something that pollutes, especially a waste material that contaminates air, soil, or water. concentrations. Statistical models that incorporate these copollutants simultaneously are often unstable, with the estimated regression coefficients Regression coefficient Term yielded by regression analysis that indicates the sensitivity of the dependent variable to a particular independent variable. See: Parameter. regression coefficient possibly changing in both magnitude and direction depending on which copollutants are included. Further, coefficient standard errors will likely be inflated so that the estimated effects may not achieve statistical significance, despite actual relationships that may exist. Conversely, if single-pollutant models are used to evaluate potential associations, the problem of confounding arises: the single pollutant represents not only its own health effect but also the effects of the excluded pollutants with which it is associated. Several authors (1-3) have recently applied two-stage regression techniques in an effort to identify the extent to which a pollutant is directly associated with human health effects, as opposed to acting indirectly through its association with other pollutants (confounding). Noting that relationships between copollutants differ by city, it has been proposed that such differences can facilitate the separation of direct and indirect effects through use of a second-stage meta-regression. When the first- and second-stage models have been correctly specified, the proposed approach may help clarify the role that confounding plays in observed associations between pollutants and health outcomes. In this paper we describe some general conditions under which the method may, however, be vulnerable to the effects of model misspecification. In the applications described by Schwartz and colleagues (1-3), the response variable denoted by Y is most often a community-level health index, such as the number of deaths or number of hospital admissions per day (possibly transformed), rather than an individual-level health index. In reality, Y may have a Poisson or hyper-Poisson distribution, but the examples discussed below are not affected by such distributional properties. The variables W, X, and Z usually represent community-level indices of airborne particles or gaseous gas·e·ous adj. 1. Of, relating to, or existing as a gas. 2. Full of or containing gas; gassy. pollutants averaged over one or more stationary, air monitoring stations. However, the two-stage method used in earlier papers (1-3) could potentially be applied over a much wider range of epidemiology studies, including those with individual-level health effects and exposure data. Mathematical Model
We express the mathematical model for the two-stage approach to evaluating confounding using nonspecific nonspecific /non·spe·cif·ic/ (non?spi-sif´ik) 1. not due to any single known cause. 2. not directed against a particular agent, but rather having a general effect. nonspecific 1. variables to illustrate the generality of the problem. The simplest setting involves copollutant variables denoted by Z and X and a health outcome variable denoted by Y. The variables Z and X are known to be associated with each other, and one or both may be directly associated with Y. Following the approach of Schwartz (3), a model characterizing these relationships may be expressed as: [1] X = [[gamma].sub.0] + ([[gamma].sub.1,city] x Z) + [[epsilon].sub.X] [2] Y = [[beta].sub.0] + ([[beta].sub.1] x Z) + ([[beta].sub.2] x X) + [[epsilon].sub.Y]. Thus, in our simple model, associations are linear and characterized parametrically. We assume that one "instance" of the system embodied in Equations 1 and 2 will be used to represent each city 1, ..., K in a multicity analysis. The parameters [[beta].sub.1] and [[beta].sub.2] characterize the direct associations of Z with Y and X with Y, respectively. The parameter [[gamma].sub.1,city] characterizes the association between Z and X. While [[beta].sub.1] and [[beta].sub.2] are assumed constant across cities, [[gamma].sub.1,city] varies from city to city as the relationship between Z and X varies from city to city. The intercept terms [[gamma].sub.0] and [[beta].sub.0] may or may not depend on the city. The analyses below do not depend on city-specific values for these parameters, and for simplicity, they are assumed to be the same for all cities. Equation 1 is used to represent the association between Z and X for each city, one of many possible ways of expressing such associations but consistent with the approach described by Schwartz (3). In Equation 2 the variable Y is a response or health effect (such as the logarithm logarithm (lŏg`ərĭthəm) [Gr.,=relation number], number associated with a positive number, being the power to which a third number, called the base, must be raised in order to obtain the given positive number. of mortality) that depends directly on Z as well as indirectly on Z, acting through X as a surrogate or proxy. By substituting Equation 1 into Equation 2, we have: [3] [MATHEMATICAL EXPRESSION A group of characters or symbols representing a quantity or an operation. See arithmetic expression. NOT REPRODUCIBLE IN ASCII ASCII or American Standard Code for Information Interchange, a set of codes used to represent letters, numbers, a few symbols, and control characters. Originally designed for teletype operations, it has found wide application in computers. ] The total Z effect is the regression coefficient of Y on Z which from Equation 3 is [4] Z [effect.sub.city] = [[beta].sub.1] + [[beta].sub.2] x [[gamma].sub.1,city]) for each city = 1, ..., K. This total effect reflects both the direct and indirect associations between Z and Y. Equation 3 motivates the two-stage regression approach. Assuming that data on Z, X, and Y are available for city = 1, ..., K, the two-stage regression process involves first estimating the marginal association between Y and Z using a first-stage regression model of Y against Z for each city, ignoring X. Consistent with the usual problems associated with model misspecification detailed in the regression literature (4) we anticipate that Z will also represent the effect of X and that we will recover estimates which in expectation should conform to Verb 1. conform to - satisfy a condition or restriction; "Does this paper meet the requirements for the degree?" fit, meet coordinate - be co-ordinated; "These activities coordinate well" [5] E[Z[effect.sub.city]] = [[beta].sub.1] + ([[beta].sub.2] x [[gamma].sub.1,city]) for city = 1, ..., K. Thus, the expected Z effects are based on parameters reflecting both the direct effect of Z on Y (characterized by the parameter [[beta].sub.1]) and the mediated effect of Z on Y acting through X (characterized by the parameters [[beta].sub.2] and [[gamma].sub.1,city]). It is therefore of interest to further extract estimates of [[beta].sub.1] and [[beta].sub.2], and this is accomplished using a second-stage regression. To carry out the second-stage regression, estimates of [[gamma].sub.1,city] for city = 1, ..., K are needed in addition to the estimated Z effects already obtained during the first stage. The estimates [[gamma].sub.1,city]. are obtained by fitting Equation 1 for each city. The second stage then consists of a regression of the estimated Z effects (i.e., Z[effect.sub.city], for city = 1, ..., K) against the estimated X with Z associations (i.e., [[gamma].sub.1,city], for city = 1, ..., K). Equivalently, the second stage consists of a regression of the estimates of the linear combinations [[beta].sub.1] + ([[beta].sub.2] x [[gamma].sub.1,city]) against estimates of the parameters [[gamma].sub.1,city] in an effort to estimate [[beta].sub.1] and [[beta].sub.2]. The second-stage regression results in the fitted model: [6] Z[effect.sub.city] = [[delta].sub.1] + ([[delta].sub.2] x [[gamma].sub.1,city]). When there is sufficient variation in the parameters [[gamma].sub.1,city] across cities, the claim is that [[delta].sub.1] provides an estimate of [[beta].sub.1] and that [[delta].sub.2] provides an estimate of [[beta].sub.2]. By way of these estimates it should be possible to separate the component of the Z effect due to direct association of Z with Y (by virtue of the estimated intercept [[delta].sub.1] in the second-stage regression) from the component due to indirect association through an intermediate variable (by virtue of the estimated slope [[delta].sub.2] in the second-stage regression). In particular, a near-zero estimate for the second-stage intercept suggests that there is no direct association between Z and Y, whereas a nonzero (generally positive) intercept estimate is consistent with a direct association between Z and Y. Similarly, a near-zero estimate for the second-stage slope suggests absence of any indirect association between Z and Y, whereas a nonzero (generally positive) slope estimate suggests an indirect association between Z and Y (confounding). If the estimated second-stage intercept is positive and the estimated slope is close to zero, then the two-stage approach described by Schwartz and colleagues (1-3) would imply an association of Z with Y primarily through a direct pathway (i.e., statistical associations between Z and Y would not be substantially attributed to confounding). A Class of Counterexamples The situation becomes more complicated with the introduction of another pollutant variable. As before, the variable Y may be regarded as the health outcome. The variables X and Z, and a new variable W, represent pollutants. Suppose now that Z is not directly associated with the outcome Y and is only indirectly associated through X and W. Moreover, both X and W are assumed to have associations with Z that may vary from city to city. A structural equation model analogous to the system of Equation 1 -- Equation 2 that expresses these relationships is [7] X = [[gamma].sub.0] + ([[gamma].sub.1,city] x Z) + [[epsilon].sub.X] [8] W = [[tau].sub.0] + ([[tau].sub.1,city] x Z) + [[epsilon].sub.W] [9] Y = [[beta].sub.0] + ([[beta].sub.2] x X) + ([[beta].sub.3] x W) + [[epsilon].sub.Y]. As before, one instance of the system will be used to represent each city. The parameters [[beta].sub.2] and [[beta].sub.3], which characterize the direct relationships of Y with X and Y with W, are treated as constant across all instances of the system, whereas the parameters [[gamma].sub.1,city] and [[tau].sub.1,city], which characterize the relationships between Z and X and between Z and W, respectively, are allowed to vary across instances (cities) but are to be regarded as constant within any instance of the system. Assuming that Z is the variable to be evaluated for a direct effect, the first stage of the analysis involves a regression of Y against Z. Again, due to model misspecification the effects picked up should be that of Z acting through both X and W. The first-stage estimates of the Z effects should thus be characterized by [10'] E[Z [effect.sub.city]] = ([[beta].sub.2] x [[gamma].sub.1,city]) + ([[beta].sub.3] x [[tau].sub.1,city]) for city = 1, ..., K. Although the variable W is present in the modified system, the second-stage regression uses only one set of estimated copollutant relationships, either X with Z associations or W with Z associations; for consistency, we assume the former. The results of the second-stage regression will thus be affected by the relationship (if any) between [[tau].sub.1,city] and [[gamma].sub.1,city]. For example, suppose that [[tau].sub.1,city] = [[tau].sub.1] for city = 1, ..., K. Then, in the second-stage regression of estimated Z effects against estimated X with Z associations, we should anticipate an intercept approximating [[beta].sub.3] x [[tau].sub.1] and a slope approximating [[beta].sub.2]. If [[beta].sub.3] x [[tau].sub.1] > 0 and [[beta].sub.2] > 0, then we are likely to correctly conclude that Z has an indirect effect on Y (confounding), but incorrectly conclude that Z also has a direct effect on Y. The reason that a zero intercept would not be expected is that the confounding mechanism involves two intermediaries, with Z acting through one of them in varying proportion over cities, and through the other in constant proportion (absent the error terms). More generally, suppose that [11] [[tau].sub.1,city] = [[eta].sub.0] + ([[eta].sub.1] x [[gamma].sub.1,city]) for city = 1, ..., K. By substitution of Equation 11 into Equation 10, it follows that the expected Z effects from the first-stage regression are given by [10] E[Z [effect.sub.city]] = ([[beta].sub.3] x [[eta].sub.0]) + [[beta].sub.2] + ([[beta].sub.3] x [[eta].sub.1])] x [[gamma].sub.1,city]. The second-stage regression can then be expected to yield an intercept that approximates [[beta].sub.3] x [[eta].sub.0] and a slope that approximates [[beta].sub.2] + ([[beta].sub.3] x [[eta].sub.1]]). The two-stage method would likely lead to the incorrect conclusion that both indirect and direct Z effects exist and would also incorrectly estimate the magnitude of the Z-to-X-to-Y pathway. Simulations We developed a program using SAS (1) (SAS Institute Inc., Cary, NC, www.sas.com) A software company that specializes in data warehousing and decision support software based on the SAS System. Founded in 1976, SAS is one of the world's largest privately held software companies. See SAS System. software (SAS Institute SAS Institute Inc., headquartered in Cary, North Carolina, USA, has been a major producer of software since it was founded in 1976 by Anthony Barr, James Goodnight, John Sall and Jane Helwig. Inc., Cary, NC) that simulates data for a system which fits into the class of models described by Equations 7-9. The simulation is carried out for K = 10 cities and N = 100 observations (days) per city under the following parameter settings: [[gamma].sub.0] = 100; [[gamma].sub.1,city] = 0.05, 0.15, ..., 0.95 successively for city = 1, ..., K [[tau].sub.0] = 100; [[tau].sub.1,city] = [[eta].sub.0] + ([[eta].sub.1] x [[gamma].sub.1,city]) = 0.5 + (0.05 x [[gamma].sub.1,city]) for city = 1, ..., K [[beta].sub.0] = 100; [[beta].sub.2] = 0.3; [[beta].sub.3] = 1.0. Positive values were assigned to the intercept terms to minimize generation of negative quantities during simulation; however, the intercepts do not play an essential role in the analysis. Note that the relationship between [[tau].sub.1,city] and [[gamma].sub.1,city] is relatively flat, so that [[tau].sub.1,city] is nearly constant with respect to [[gamma].sub.1,city]. For simplicity, the variable Z is simulated according to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. a normal distribution: Z ~ N(100,[25.sup.2]). Each simulated value of Z is substituted into Equations 7 and 8 to obtain the systematic components of X and W. The values for X and W are completed with the addition of error terms [[epsilon].sub.X] and [[epsilon].sub.W], simulated according to normal distributions: [[epsilon].sub.X] ~ N(0, [20.sup.2]) [[epsilon].sub.W] ~ N(0, [20.sup.2]). The values for X and W are then substituted into Equation 9 to obtain the systematic component of Y. The value for Y is completed by addition of the error term [[epsilon].sub.Y], which is simulated according to a normal distribution: [[epsilon].sub.Y] ~ N(0, [20.sup.2]). The simulation structure for Z, [[epsilon].sub.X], [[epsilon].sub.W], and [[epsilon].sub.Y] does not incorporate any dependencies, so that effectively variables are independent and for simplicity the time-series values are free of serial correlation serial correlation The relationship that one event has to a series of past events. In technical analysis, serial correlation is used to test whether various chart formations are useful in projecting a security's future price movements. . The magnitude of the error terms was selected to introduce enough randomness to make the simulation and estimation process meaningful while retaining substantial collinearity collinearity very high correlation between variables. between Z, X, and W. After the data have been simulated, the first-stage and second-stage regressions are performed with the variable W omitted from the analysis. Referring to Equation 10' we anticipate that the second-stage intercept [[delta].sub.1] should approximate [[beta].sub.3] x [[eta].sub.0] = 0.5 and that the slope estimate [[delta].sub.2] should approximate [[beta].sub.2] + ([[beta].sub.3] x [[eta].sub.1]) = 0.35. The complete simulation was replicated 1,000 times. Over all simulations, the second-stage intercept estimate [[delta].sub.1] had a mean value of 0.498 (SD = 0.073) and the second-stage slope estimate [[delta].sub.2] had a mean value of 0.351 (SD = 0.125). On average, therefore, we recovered parameter estimates that conform to the underlying model structure. Moreover, the reported standard deviations In statistics, the average amount a number varies from the average number in a series of numbers. (statistics) standard deviation - (SD) A measure of the range of values in a set of numbers. indicate that nonpositive estimates of either the second-stage intercept or the second-stage slope would be rare. Figure 1 shows an example of the results obtained in a single simulation run. This particular replication was selected for display because the second-stage regression recovered intercept and slope estimates close to the underlying quantities [[beta].sub.3] x [[eta].sub.0] and [[beta].sub.2] + ([[beta].sub.3] x [[eta].sub.1]). Based on the two-stage analysis, the (incorrect) conclusion would be that Z has a direct pathway to Y and also a comparatively weak indirect pathway to Y. Hence, the complete confounding of Z is largely obscured. [FIGURE 1 OMITTED] Discussion We have shown that two-stage regression approaches cannot necessarily be trusted in scenarios where a third factor, such as another air pollutant, may also play a role. Although the class of counterexamples presented is based on one particular model, many variations are possible. For example, introducing a direct association between Z and Y produces a new model that supports additional counterexamples. Depending on the parameters that characterize the relationships between variables in this revised model, different outcomes would be expected using the two-stage regression approach. In particular, if Z and the omitted variable W vary inversely, the direct effect of Z on Y could be partially or totally obscured. Hence the bias can go in either direction. More complicated examples involving multiple variables that are omitted from the estimation process can also be constructed, but such examples appear to offer little additional insight. Samet et al.'s Figure 33 (1) and Schwartz's Figure 4 (3) are consistent with the hypothesis that the estimated P[M.sub.10] (particluate matter < 10 [micro]m in aerodynamic diameter Drug particles for pulmonary delivery are typically characterized by aerodynamic diameter rather than geometric diameter. The velocity at which the drug settles is proportional to the aerodynamic diameter, da. ) effects for hospital admissions and mortality, respectively, are not strongly confounded with sulfur dioxide sulfur dioxide, chemical compound, SO2, a colorless gas with a pungent, suffocating odor. It is readily soluble in cold water, sparingly soluble in hot water, and soluble in alcohol, acetic acid, and sulfuric acid. and ozone. These figures are also not inconsistent with the hypothesis that the P[M.sub.10] effect is due to another excluded air pollutant through mechanisms similar to those in the class of counterexamples presented above. Without necessarily subscribing to the logic of direct and indirect pathways implied by such models, the variable (P[M.sub.10]) to be tested for confounding using the proposed methodology must nonetheless enter the analysis as the explanatory variable Z in the first-stage analysis. The two-stage models, discussed in this paper, therefore, appropriately describe the type of models employed in the confounding analyses in previous reports (1-3). In practice, multicollinearity is most effectively evaluated on the basis of entire correlation structures (e.g., as opposed to pairwise correlations), and factor analysis may be useful for this purpose. Published results providing adequate details on correlations between numerous pollutants in a multicity study, however, are not readily available. As a surrogate in this discussion, we consider results published by Schwartz (2) that show the correlation matrices for P[M.sub.10], carbon monoxide carbon monoxide, chemical compound, CO, a colorless, odorless, tasteless, extremely poisonous gas that is less dense than air under ordinary conditions. It is very slightly soluble in water and burns in air with a characteristic blue flame, producing carbon dioxide; , temperature, and dew point dew point: see dew. for eight cities, but include no other air pollutants. Factor analyses Verb 1. factor analyse - to perform a factor analysis of correlational data factor analyze analyse, analyze - break down into components or essential features; "analyze today's financial market" of these four variables reveal considerable differences among the cities. The two smallest eigenvalues eigenvalues statistical term meaning latent root. for each city are < 0.025, indicating strong multicollinearity among the four variables. In the four western cities (Colorado Springs, Colorado The City of Colorado Springs is the second most populous city (after Denver) in the state of Colorado and the 48th most populous city in the United States.[4] The city is the county seat of El Paso County. , and Seattle, Spokane, and Tacoma, Washington) and in New Haven New Haven, city (1990 pop. 130,474), New Haven co., S Conn., a port of entry where the Quinnipiac and other small rivers enter Long Island Sound; inc. 1784. Firearms and ammunition, clocks and watches, tools, rubber and paper products, and textiles are among the many , Connecticut, the first principal component explains 88-95% of the variation in these data, but includes both P[M.sub.10] and CO, suggesting that these pollutants have similar sources and that their health effects may be difficult to separate. The second principal component in these cities explains < 13% of the variation, and reflects mainly the difference between P[M.sub.10] and CO variations. By contrast, in three midwestern cities (Chicago, Illinois, and Minneapolis and St. Paul St. Paul as a missionary he fearlessly confronts the “perils of waters, of robbers, in the city, in the wilderness.” [N.T.: II Cor. 11:26] See : Bravery , Minnesota), the first principal component explains only 72-75% of the variation and puts much less weight on P[M.sub.10] variations than on CO variations. The second principal component in the midwestern cities explains 26-27% of the variation, with P[M.sub.10] the dominant variable, suggesting that it may be easier to separate the effects of the two pollutants in those cities than in western cities. These results suggest not only that multicollinearity is a significant problem in air pollution epidemiology but also that the nature of the problem does indeed vary from city to city. In short, the structural characteristics of the hypothetical models underlying the two-stage analyses reported by Schwartz and colleagues (1-3) and the more complicated counterexample we describe in this paper are quite plausible. Factor analysis can also be useful for constructing alternative model inputs. Specifically, by obtaining factors that are often identifiable as "bundles" of closely related pollutants and that by construction are orthogonal At right angles. The term is used to describe electronic signals that appear at 90 degree angles to each other. It is also widely used to describe conditions that are contradictory, or opposite, rather than in parallel or in sync with each other. , the multicollinearity problem can be largely eliminated. Although most recent efforts of this type have focused on factor analyses of concentrations of the elemental components of fine particles Fine particles are an air pollutant mainly produced by cars running on diesel. Other sources are the combustion of fossil fuels in power plants and various industrial processes. on filters (5-8), they have also included gaseous copollutant concentrations as variables in the analyses. Mar et al. (6) used a factor analysis model that included S[O.sub.2], nitrogen dioxide nitrogen dioxide n. A poisonous brown gas, NO2, often found in smog and automobile exhaust fumes and synthesized for use as a nitrating agent, a catalyst, and an oxidizing agent. Noun 1. , and CO along with particle elements for Phoenix, Arizona Phoenix /ˈfiːˌnɪks/ (English: Phoenix, Navajo: Hoozdo, lit. "the place is hot", Western Apache: Fiinigis) is the capital and the most populous city of the U.S. . Tsai et al. (7) used a model that included CO and sulfate sulfate, chemical compound containing the sulfate (SO4) radical. Sulfates are salts or esters of sulfuric acid, H2SO4, formed by replacing one or both of the hydrogens with a metal (e.g., sodium) or a radical (e.g., ammonium or ethyl). concentrations, along with eight metals in respirable respirable /res·pir·a·ble/ (re-spir´ah-b'l) 1. suitable for respiration. 2. small enough to be inhaled. res·pi·ra·ble adj. 1. Fit for breathing, as air. particles for three New Jersey cities. Ozkaynak et al. (8) used a model with the coefficient of haze The coefficient of haze is a measurement of visibility interference in the atmosphere. (an index of black particles) as well as N[O.sub.2] and CO and meteorologic me·te·or·ol·o·gy n. The science that deals with the phenomena of the atmosphere, especially weather and weather conditions. [French météorologie, from Greek variables for Toronto, Ontario, Canada. These models generally identified several important factors, or pollutant bundles, that suggested several significant sources contributed to urban air pollution, including motor vehicle emissions, coal combustion, fuel oil combustion, vegetative vegetative /veg·e·ta·tive/ (vej?e-ta?tiv) 1. of, pertaining to, or characteristic of plants. 2. concerned with growth and nutrition, as opposed to reproduction. 3. burning, resuspended road dust, soil and crustal crust·al adj. Of or relating to a crust, especially that of the earth or the moon. Adj. 1. crustal - of or relating to or characteristic of the crust of the earth or moon material, local sources of S[O.sub.2], regional sulfate sources, nonferrous non·fer·rous adj. 1. Not composed of or containing iron. 2. Of or relating to metals other than iron. nonferrous Adjective 1. metal processing, and sea salt. We suggest only that factor analysis is one useful tool for evaluating multicollinearity and is useful as a preprocessing A preliminary processing of data in order to prepare it for the primary processing or for further analysis. The term can be applied to any first or preparatory processing stage when there are several steps required to prepare data for the user. step in regression modeling; we do not assert that it is the best or the only alternative to the proposed two-stage approach. A comprehensive treatment of the confounding/multicollinearity problem is beyond the scope of this paper, but clearly deserves additional study. In this paper, we have purposely pur·pose·ly adv. With specific purpose. purposely Adverb on purpose USAGE: See at purposeful. Adv. 1. avoided the added complications associated with the "errors in variables" problem (4,9). In particular, if Z is measured with error, we may anticipate that attenuation Loss of signal power in a transmission. Attenuation The reduction in level of a transmitted quantity as a function of a parameter, usually distance. It is applied mainly to acoustic or electromagnetic waves and is expressed as the ratio of power densities. bias will emerge in the first-stage regressions in situations properly characterized by Equations 1 and 2. A completely rigorous treatment would incorporate this aspect of modeling and estimation. However, dealing with such concerns in the present analysis would not appreciably alter the general conclusions and would introduce an unnecessary technical distraction. Conclusion We have given a brief background on the use of two-stage regression as it has been applied to the evaluation of direct and indirect associations of copollutants with human health outcomes. Selected counterexamples show, in simple idealized i·de·al·ize v. i·de·al·ized, i·de·al·iz·ing, i·de·al·iz·es v.tr. 1. To regard as ideal. 2. To make or envision as ideal. v.intr. 1. terms, mechanisms through which such analyses can lead to erroneous interpretations. REFERENCES AND NOTES (1.) Samet JM, Zeger SL, Dominici F, Curriero F, Coursac I, Dockery DW, Schwartz J, Zanobetti A. The National Morbidity, Mortality, and Air Pollution Study Part II: Morbidity, Mortality, and Air Pollution in the United States United States, officially United States of America, republic (2005 est. pop. 295,734,000), 3,539,227 sq mi (9,166,598 sq km), North America. The United States is the world's third largest country in population and the fourth largest country in area. . HEI HEI Higher Education Institution (UK) HEI Health Effects Institute HEI Hautes Études Internationales HEI House Ear Institute HEI Healthy Eating Index HEI Hautes Etudes d'Ingénieur HEI High-Explosive Incendiary Research Report no. 94, Part II. Cambridge, MA:Health Effects Institute The Health Effects Institute (HEI) is a non-partisan, non-profit corporation specializing in research on the health effects of air pollution. It is headquartered in Charlestown, Massachusetts, USA. , 2000. (2.) Schwartz J. Air pollution and hospital admissions for heart disease in eight U.S. counties. Epidemiology 10:17-22 (1999). (3.) Schwartz J. Assessing confounding, effect modification effect modification Epidemiology An interaction among multiple possible cause-and-effect relationships, where the estimate of the effect of one factor on a disease process depends on other factors in the study , and thresholds in the association between ambient particles and daily deaths. Environ Health Perspect 106:563-568 (2000). (4.) Hanushek EA, Jackson JE. Statistical Methods for Social Scientists. New York New York, state, United States New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of :Academic Press, 1977. (5.) Laden F, Neas LM, Dockery DW, Schwartz J. Association of fine particulate matter from different sources with daily mortality in six U.S. cities. Environ Health Perspect 108:941-947 (2000). (6.) Mar TF, Norris GA, Koenig JQ, Larson TV. Associations between air pollution and mortality in Phoenix, 1995-1997. Environ Health Perspect 108:347-353 (2000). (7.) Tsai FC, Apte MG, Daisey JM. An exploratory analysis of the relationship between mortality and the chemical composition of airborne particulate matter. Inhal Toxicol 12(suppl 2):121-135 (2000). (8.) Ozkaynak H, Xue J, Zhou H, Raizenne M. Associations between Daily Mortality and Motor Vehicle Pollution in Toronto, Canada. Boston:Department of Environmental Health, Harvard University Harvard University, mainly at Cambridge, Mass., including Harvard College, the oldest American college. Harvard College Harvard College, originally for men, was founded in 1636 with a grant from the General Court of the Massachusetts Bay Colony. School of Public Health, for Environmental Health Directorate, Health Canada Health Canada (French: Santé Canada) is the department of the government of Canada with responsibility for national public health. Health Canada's goal is to improve Canadian life by improving Canadian longevity, lifestyle and use of public healthcare. , Ottawa, Ontario, Canada, 1996. (9.) Fuller WA. Measurement Error Models. New York:John Wiley John Wiley may refer to:
Address correspondence to A.H. Marcus, Room 358 Catawba Building, 3210 Highway 54, Research Triangle Park Research Triangle Park, research, business, medical, and educational complex situated in central North Carolina. It has an area of 6,900 acres (2,795 hectares) and is 8 × 2 mi (13 × 3 km) in size. Named for the triangle formed by Duke Univ. , NC 27709 USA. Telephone: (919) 541-0636. Fax: (919) 541-1818. E-mail: marcus.allan@epa.gov The views expressed in this paper are those of the authors and do not necessarily reflect the views or policies of the U.S. Environmental Protection Agency Environmental Protection Agency (EPA), independent agency of the U.S. government, with headquarters in Washington, D.C. It was established in 1970 to reduce and control air and water pollution, noise pollution, and radiation and to ensure the safe handling and . Received 25 October 2000: accepted 15 May 2001. |
|
||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion