Lipid adjustment in the analysis of environmental contaminants and human health risks.The literature on exposure to lipophilic lipophilic, adj/n the ability to dissolve or attach to lipids. lipophilic (lipōfil´ik), adj 1. showing a marked attraction to, or solubility in, lipids. 2. agents such as polychlorinated biphenyls polychlorinated biphenyls, (pol´ēklôr´ adipose tissue or fatty tissue Connective tissue consisting mainly of fat cells, specialized to synthesize and contain large globules of fat, within a , which makes it necessary to model serum lipids serum lipid Any major lipid in the circulation–total cholesterol, HDL, LDL, TGs. See Cholesterol, Triglyceride. when assessing health risks of PCBs. Using a simulation study, we evaluated four statistical models (unadjusted, standardized, adjusted, and two-stage) for the analysis of PCB PCB: see polychlorinated biphenyl. PCB in full polychlorinated biphenyl Any of a class of highly stable organic compounds prepared by the reaction of chlorine with biphenyl, a two-ring compound. exposure, serum lipids, and health outcome risk (breast cancer). We applied eight candidate true causal scenarios, depicted by directed acyclic graphs directed acyclic graph - (DAG) A directed graph containing no cycles. This means that if there is a route from node A to node B then there is no way back. , to illustrate the ramifications ramifications npl → Auswirkungen pl of misspecification of underlying assumptions when interpreting results. Statistical models that deviated from underlying causal assumptions generated biased results. Lipid standardization, or the division of serum concentrations serum concentration Therapeutics The amount of a drug or other compound in the circulation, both bound to proteins and unbound, the latter of which generally corresponds to the theraepeutically active fraction by serum lipids, was observed to be highly prone to bias. We conclude that investigators must consider biology, biologic medium (e.g., nonfasting blood samples), laboratory measurement, and other underlying modeling assumptions when devising a statistical plan for assessing health outcomes in relation to environmental exposures. Key words: causal modeling, directed acyclic graphs, organochlorines organochlorines see chlorinated hydrocarbons. organochlorines poisoning cause excitement and irritability, tremor, ataxia, weakness, paralysis, convulsions. , polychlorinated biphenyls, risk estimation, serum lipids. Environ Health Perspect 113:853-857 (2005). doi:10.1289/ehp.7640 available via http://dx.doi.org/[Online 17 March 2005] ********** Persistent lipophilic xenobiotics pose particular methodologic challenges when assessing potential human health risks. The human health effects literature on exposure to lipophilic agents such as organochlorines (OCs) is equivocal EQUIVOCAL. What has a double sense. 2. In the construction of contracts, it is a general rule that when an expression may be taken in two senses, that shall be preferred which gives it effect. Vide Ambiguity; Construction; Interpretation; and Dig. , impairing our ability to quantify risks (Calle et al. 2002; Hunter et al. 1997; Laden et al. 2001a, 2001b). For example, Wolff and colleagues (Wolff 1985; Wolff and Toniolo 1995; Wolff et al. 1993, 2000) found an increased odds ratio for breast cancer for the highest quintile quin·tile n. 1. The astrological aspect of planets distant from each other by 72° or one fifth of the zodiac. 2. Statistics The portion of a frequency distribution containing one fifth of the total sample. of wet-weight dichlorodiphenyl-dichloroethylene (DDE (Dynamic Data Exchange) A message protocol in Windows that allows application programs to request and exchange data between them automatically. DDE - Dynamic Data Exchange ) and polychlorinated biphenyls (PCBs; expressed as nanograms analyte per milliliter milliliter /mil·li·li·ter/ (mL) (-le?ter) one thousandth (10-3) of a liter. mil·li·li·ter n. Abbr. serum) when compared with the lowest quintile, whereas Laden et al. (2001a, 2001b) found no association when concentrations of DDE and PCBs were standardized for serum triglycerides Triglycerides Fatty compounds synthesized from carbohydrates during the process of digestion and stored in the body's adipose (fat) tissues. High levels of triglycerides in the blood are associated with insulin resistance. and cholesterol. No association was reported for PCBs and risk of breast cancer when expressing concentrations either as wet weight or lipid standardization values (Helzlsouer et al. 1999). Varying laboratory practices for expressing PCB concentrations may in part account for the equivocal findings for human health end points. Serum PCB concentrations, as with other lipophilic xenobiotics, are dependent on serum lipid concentrations (Eyster et al. 1983; Guo et al. 1987). Under certain circumstances an equilibrium is reached, and information regarding serum PCB levels and serum lipid levels may be predictive of PCB body burden (Brown and Lawton 1984). If serum lipids indeed act in this manner, higher serum lipid levels should correspond to higher serum PCB concentrations (Calvert et al. 1996). However, serum OC concentrations and lipids are affected postprandially and need to be considered in relation to quantity and timing of food consumption (Phillips et al. 1989). When it is not possible to collect adipose tissue, serum samples are frequently used. However, serum (or plasma) introduces methodologic challenges with regard to lipids when estimating health risks, particularly when nonfasting samples are used (Whitcomb et al. 2005). Collection of fasting samples can hamper the feasibility of epidemiologic research and may adversely impact study participation. Nonfasting samples require further attention to serum lipids (Brown and Lawton 1984; Brown et al. 1994; Eyster et al. 1983). Our limited understanding of the true relation between serum and adipose tissue concentrations of lipophilic xenobiotics in relation to serum lipids and particular health outcomes makes model specifcation difficult (Calvert et al. 1996; Mussalo-Rauhamaa 1991). Investigators typically express measurements on a wet-weight basis or per unit volume of serum or as lipid-standardized values, where the concentration is divided by serum lipids. Lipid standardization may be useful for comparing exposure concentrations across tissue specimens or across study populations by expressing PCB concentrations per gram of fat (Morgan and Roan roan a coat color consisting of a relatively uniform mixture of white and colored hairs, giving a 'silvered' hue; self-describing colors are red-roan, blue-roan, chestnut roan. 1970). Use of lipid weight (PCB per unit of serum lipids) as opposed to wet weight (PCB per unit of serum) has been advocated for the measurement of persistent lipophilic chemicals (Brown and Lawton 1984), especially if one assumes body burden equilibrium. Other approaches reported in the literature include the use of a log-linear model log-linear model a statistical model which models frequency counts in contingency tables by using an analysis of variance approach. with serum lipids included as a separate term in the regression equation Regression equation An equation that describes the average relationship between a dependent variable and a set of explanatory variables. (Moysich et al. 1998). Other investigators have conducted two-stage analyses wherein serum lipids are regressed on serum PCB concentrations with the residuals entered as an individual risk factor (Hunter et al. 1997). The issue of how best to model the relation among serum PCBs, lipids, and health outcomes remains an understudied area critical for the assessment of health effects. Here we demonstrate the impact of model (mis)specification and its effect on the interpretation of study findings. We used directed acyclic graphs (DAGs) to define a causal framework among exposure, lipids, and health outcome and values for parameters as informed by the literature (Hernan et al. 2002; Robins et al. 2000). Using DAGs to supply a causal framework and parameter values informed by the literature, we present the results of a simulation study. These results identify the best statistical model for each circumstance and the bias produced by a mismatch between the DAG and the statistical analysis. Materials and Methods Statistical models and DAGs. Optimal modeling of the statistical relations among serum PCBs, serum lipids, and health outcomes requires positing an underlying causal model that reflects the following considerations: a) biologic plausibility; b) laboratory capability for quantifying compounds and lipids; c) underlying statistical assumptions (e.g., error structure); and d) other relevant study covariates (e.g., known and potential confounders). To focus on bias, we assume perfect laboratory measurement of PCBs and the absence of unmeasured confounding confounding when the effects of two, or more, processes on results cannot be separated, the results are said to be confounded, a cause of bias in disease studies. confounding factor . We depict each scenario via a simple DAG that shows relations but does not dictate a specific statistical model (i.e., mean and error structures). A single-headed arrow represents a causal relation between the ancestor (tail) and the descendant (head). A dashed line represents a noncausal association between two variables, suggesting a shared ancestor that may or may not have been measured; the absence of an arrow signifies no relation. The true causal structure relating PCBs and serum lipids depends on the outcome under study. Investigators typically have insufficient biologic information to specify the correct analytic model, often resulting in analytic strategies based on unverified assumptions. For example, research indicates a possible causal effect of PCBs on serum lipid levels (Hennig et al. 2005; Langer et al. 2003). Additionally, lipid levels have been suggested to affect breast cancer risk (Atalay et al. 2004), but their impact on other health end points has received limited attention. For our purposes in this study, our scenarios, hypothetical "causal truths," are based on the literature and their relation to frequently used statistical models. Statistical models. We investigated four statistical models (unadjusted, standardized, adjusted, and two-stage) for the analysis of hypothesized PCB exposure, serum lipids, and a health outcome (breast cancer), along with eight plausible DAGs for each model to illustrate the choices facing investigators. For illustrative purposes, all models assume that there are no unmeasured confounders. For all models, P = Pr(Y= 1|X, SL), where Y is a dichotomous di·chot·o·mous adj. 1. Divided or dividing into two parts or classifications. 2. Characterized by dichotomy. di·chot dependent variable representing the presence/absence of the disease; X= PCB; and SL = serum lipids. Unadjusted model. The unadjusted model is equivalent to the use of wet-weight values when estimating the effect of an exposure such as PCBs on a health outcome without further consideration of serum lipids. logit (P) = [[alpha].sub.1] + [[beta].sub.1]ln(x) [1] Accordingly, this model is suitable for use when it is reasonable to assume that serum lipids are not a confounder con·found tr.v. con·found·ed, con·found·ing, con·founds 1. To cause to become confused or perplexed. See Synonyms at puzzle. 2. . This assumption holds true regardless of the relation between lipids and the outcome. Inclusion or exclusion of lipids as an adjustor may affect model fit, but it will not impact PCB exposure/response estimates. Four DAGs, shown in Figure 1, are appropriately evaluated by use of the unadjusted statistical model. Figure 1A reflects a scenario that will result in an unbiased risk estimate as serum lipids are assumed to be unrelated to PCB levels. Use of this model for Figure 1B yields optimal estimates, if serum lipids are unrelated to both PCBs and the outcome. An unadjusted model is also appropriate for Figure 1C, where PCBs are assumed to have an indirect effect via serum lipids; adjustment for a variable in the causal pathway may introduce an undesirable bias when estimating direct effects (Greenland 1996, 2003; Greenland and Morgenstern 2001). In Figure 1D, PCBs are assumed to affect both serum lipids and the outcome, creating a spurious association (Robins et al. 2000). Here, only an unadjusted model is appropriate for risk estimation. Because they vary with PCBs, adjustment for serum lipids is tantamount tan·ta·mount adj. Equivalent in effect or value: a request tantamount to a demand. [From obsolete tantamount, an equivalent, from Anglo-Norman to partial adjustment for the exposure itself. Standardized model. The lipid-standardized model is one way to account for the effect of serum lipids on serum PCB levels. This model is used frequently and is conceptually similar to use of the body mass index (BMI BMI body mass index. BMI abbr. body mass index Body mass index (BMI) A measurement that has replaced weight as the preferred determinant of obesity. ; weight in kilograms divided by the squared height in meters) to adjust weight for height in measuring adiposity adiposity /ad·i·pos·i·ty/ (ad?i-pos´i-te) obesity. cerebral adiposity fatness due to cerebral disease, especially of the hypothalamus. adiposity obesity. . logit(P) = [[alpha].sub.2] + [[beta].sub.2]ln([X/[SL.sup.m]]) = [[alpha].sub.2] + [[beta].sub.2][ln(X) - m x ln(SL)] [2] The power, m in Equation 2 is a factor that generalizes the relation of PCBs and serum lipids. Due to measurement error in the quantification of lipids, use of Equation 2 when Figure 1A holds can result in biased estimates. If Figure 1B holds, estimates will be affected by a scaling issue, as the beta coefficient is that for the log of the ratio of PCB to lipids. If the true relations follow Figure 1 (C or D), then use of Equation 2 will adjust, albeit incompletely, for the exposure of interest, as in both Figure 1C and D, PCBs determine the variance of serum lipids. Figure 1C depicts a causal relation between both PCBs and serum lipids with the outcome, and a noncausal association between PCBs and serum lipids resulting from a common ancestor, A. Use of the standardization model will be valid for this situation only if the standardization completely accounts for the association between PCB and serum lipids. Otherwise, use of this model will result in biased estimates. Figure 1F is modeled similarly to Figure 1D in that the relation between PCBs and lipids is due to a common cause, A. In this scenario, the standardized model again suffers from a scale issue. All other models will produce unbiased estimates, but precision of the estimate may vary depending on several factors, including measurement error. The potential error associated with the measurement of serum lipids can exceed that for the analyte itself (Needham and Wang 2002) and is an important source of bias. Figure 1G represents two possible circumstances in which serum PCBs are causally related or correlated with the true exposure/ outcome association. If the relation between serum and adipose adipose /ad·i·pose/ (ad´i-pos) 1. fatty. 2. the fat present in the cells of adipose tissue. ad·i·pose adj. Of, relating to, or composed of animal fat; fatty. concentration levels of PCBs is governed by serum lipid levels, then standardization may allow use of one as a proxy for the other. Adjusted model. In the adjusted model, there is an assumption that PCBs are not standardized for serum lipids, reflecting the absence of an association between lipids and the study outcome. Note that the standardized model is a member of the family of adjusted models. logit(P) = [[alpha].sub.3] + [[beta].sub.3]ln(X) + [[beta].sub.4]ln(SL) [3] When comparing the lipid component in the standardized model [ln(X) - m x ln(SL)] with the lipid term of the adjusted model [[beta].sub.4] ln(SL)], equivalent results are produced in that [[beta].sub.4] is forced to be equal to -m. If m is set equal to 1, PCBs are divided by serum lipids, as is the case with the standardized model. However, the adjusted model is more flexible than the standardized model and, in general, is applicable under the same set of assumptions. For Figure 1A, the adjusted model will produce unbiased estimates without regard for the degree of standardization, while the standardized model is conditional on standardization being sufficient. The adjusted model will yield unbiased estimates for Figure 1A, B, D, and F. For Figure 1C, E and H, the adjusted model will yield biased estimates because the adjustment is performed for a variable in the causal pathway; for Figure 1H this bias is to estimates of the total effect due to its partitioning into direct and indirect. Two-stage model. The two-stage model includes the effects of PCBs and serum lipids on the outcome: ln(SL) = [alpha] + [[beta].sub.5]ln(X) + R logit(P) = [[alpha].sub.4] + [[beta].sub.6]ln(X) + [[beta].sub.7] x (R) [4] Implications of the two-stage model arise from its relation to the adjusted model. Both the intercept and the beta coefficient in the two-stage model are simple functions of the parameters from the adjusted model and the regression of serum lipids on log PCBs. The coefficient for the residual term, R, is precisely that of the adjusted model's lipids term: [[alpha].sub.4] = [[alpha].sub.3] - [[beta].sub.4][alpha] [[beta].sub.6] = [[beta].sub.3] - [[beta].sub.5][[beta].sub.4] [[beta].sub.7] = [[beta].sub.4] Use of the two-stage model for Figure 1A will result in estimates similar to those produced by the adjusted model, because there is no assumption about an association between PCBs and serum lipids. Therefore, the residuals will be equivalent to the lipid term in the model. The two-stage model may also be used to represent Figure 1F, with an important caveat that the risk estimates now have a different interpretation in that they separate the PCB effect from the lipid effect on the outcome. In some circumstances, the two-stage model will generate unbiased risk estimates for Figure 1B, although they will be inefficient. Similarly, the model may yield unbiased risk estimates for Figure 1C although confounding is not addressed. The two-stage model is appropriate when it is important to distinguish direct and indirect effects of PCBs (Figure 1H). In this scenario, the effect of serum lipids is an indirect effect via PCBs; their inclusion introduces bias as is the case for the standardized model where assumptions of causality causality, in philosophy, the relationship between cause and effect. A distinction is often made between a cause that produces something new (e.g., a moth from a caterpillar) and one that produces a change in an existing substance (e.g. may not be clearly delineated de·lin·e·ate tr.v. de·lin·e·at·ed, de·lin·e·at·ing, de·lin·e·ates 1. To draw or trace the outline of; sketch out. 2. To represent pictorially; depict. 3. . Simulations. In addition to showing causality in a statistical model, each DAG can be used to guide model selection. We conducted a simulation study to evaluate the utility of various models for various scenarios depicted by DAGs. We used the causal structures they define, assigned lognormal distributions Lognormal distribution Pattern of frequency of occurrence in which the logarithm of the variable follows a normal distribution. Lognormal distributions are used to describe returns calculated over periods of a year or more. for PCB and serum lipids, and assumed a binomial binomial (bī'nō`mēəl), polynomial expression (see polynomial) containing two terms, for example, x+y. The binomial theorem, or binomial formula, gives the expansion of the nth power of a binomial (x+ outcome variable Y with Pr(Y=1 | PCB, serum lipids). For example, in Figure 1H PCB causes disease Y and affects serum lipid (which in turn also affects disease); these associations motivate the model: ln(SL) = [[alpha].sub.0] + [gamma] [ln(X)] logit(P) = [[alpha].sub.1] + [[beta].sub.1]ln(X) + [[beta].sub.2]{E[ln(SL)|X]} = [[alpha].sub.0] + [[alpha].sub.1] + ([[beta].sub.1] + [[beta].sub.2][gamma])[ln(X)] [5] The log odds [logit(P(X, SL)] equals an intercept ([[alpha].sub.0]), the prevalence among the unexposed, plus the factor, [[beta].sub.1]+[[beta].sub.2][gamma], by which PCB affects the probability of the event. There is no serum lipid term, denoting that there is no linear influence of serum lipid levels. In Figure 1, the assumptive as·sump·tive adj. 1. Characterized by assumption. 2. Taken for granted; assumed. 3. Presumptuous; assuming. as·sump role of serum lipids is variously a) an independent came, b) a dependent cause, c) an independent noncause, d) a dependent noncause, and e) a modifier (programming) modifier - An operation that alters the state of an object. Modifiers often have names that begin with "set" and corresponding selector functions whose names begin with "get". . A represents an unmeasured variable that is an ancestor to both PCB and serum lipids (e.g., fish consumption) that may result in confounding (Hernan 2001, Hernan et al. 2002). Additionally, we assessed the effects of serum lipid measurement error [[epsilon]~N(0, [[sigma].sub.e.sup.2])] with different values of [[sigma].sub.e.sup.2] and the relation between PCB and serum lipids by varying the strength of their linear relation, [alpha], from the linear regression Linear regression A statistical technique for fitting a straight line to a set of data points. , SL = [[alpha].sub.0] + [alpha]X. In these quantitative representations of the DAGs, it is clear that magnitude of effects, error, and bias will be functions of the values chosen for the parameters. We set the independent effect of PCB as a constant ([[beta].sub.lnPCB] = 0.6 in the logistic regression In statistics, logistic regression is a regression model for binomially distributed response/dependent variables. It is useful for modeling the probability of an event occurring as a function of other factors. model), with approximate values taken from the literature (Wolff and Toniolo 1995). In our unpublished data, we observed a significant linear relation between total serum PCBs and serum lipids with a regression coefficient Regression coefficient Term yielded by regression analysis that indicates the sensitivity of the dependent variable to a particular independent variable. See: Parameter. regression coefficient value of approximately 0.3. The values provided for the strength of the linear relation between PCB, and serum lipids represented a very weak association ([alpha] = 0.01) to a strong association ([alpha] = 2.0). Results Table 1 displays the bias and mean square error for estimates that result from the four statistical models given the underlying causal truths for [[sigma].sub.e.sup.2] = 1, and [alpha] = 0.3. For Figure 1A, which represents PCB and SL as independent causes of the outcome, all models except the standardized produce minimally biased estimates. The standardized model results in a biased underestimate of the PCB effect on outcome. When SL is completely extraneous ex·tra·ne·ous adj. 1. Not constituting a vital element or part. 2. Inessential or unrelated to the topic or matter at hand; irrelevant. See Synonyms at irrelevant. 3. , as in Figure 1B, bias occurs similar to the previous situation. Figure 1C depicts the effect of PCB acting strictly through SL and is estimated unbiasedly by the two-stage approach. The unadjusted model produces minimal bias. Adjustment for SL results in a large underestimate of effect, as does standardization, though underestimates resulting from standardization are substantially greater (351%). When SL is affected by PCB but does not directly influence the outcome (Figure 1D), standardization is the only modeling approach with substantial bias, underestimating the true effect by nearly 80%, whereas the other models are within 1% of the true effect. In the confounded case, (Figure 1E), only the adjusted model performed well. Lack of adjustment failed to address the confounding by SL, and standardization was not a sufficient method to account for this confounder. In adjusting for serum lipids via the residuals, the two-stage model misattributes the association between PCB and SL as a causal link and results in biased estimates of the effect of interest--the total effect of PCB on risk. Figure 1F represents a noncausal correlation between PCB and SL and, as for Figure 1A, B, and D, produced biased underestimates using the standardized model. Figure 1G is unique among the DAGs in that it posits that serum levels of PCB are dependent on levels in adipose, which are in turn causally related to the outcome. In this situation, standardization functioned optimally; the adjusted model produced similarly unbiased estimates, while neither the unadjusted nor two-stage model worked well. Figure 1H represents a direct and indirect causal link of PCB with outcome. The relation was modeled well by the unadjusted (which estimates total effect) and the two-stage (which separates total into estimated direct and indirect) approaches. Adjustment resulted in a small amount of bias, and standardization produced the most biased estimates in this scenario. The foregoing results indicate that the standardized and the adjusted models should be compared. With the exception of Figure 1G, the adjusted model produces smaller bias than the standardized model. However, even under conditions ideally suited for the standardized model (Figure 1G: adipose PCB causes both serum PCB per serum lipids and the outcome), the adjusted model yielded a nearly identically unbiased estimate. The two-stage model produced results similar to those of the unadjusted model, though less biased, for Figure 1C, for which serum lipids are in the causal pathway of PCBs and outcome. Measurement error. To address the potential for measurement error accompanying quantification of serum lipids, an error term with mean 0 and variance ([[sigma].sub.e.sup.2] was added to the simulated distribution of serum lipids. Figures 2-4 display bias as a function of this measurement error at 4 values of [alpha] for each of the models (unadjusted, standardized, adjusted, and two-stage). Bias as a function of [[sigma].sub.e.sup.2] followed three distinct patterns among the eight DAGs. Figure 2 displays the pattern for Figure 1A, B, D, and F; with increasing measurement error, bias was stable for the unadjusted, adjusted, and two-stage models, staying close to zero. For the standardized model the relation between bias and [[sigma].sub.e.sup.2] was more complicated; bias increased with measurement error when the relation between PCB and lipids was weak, but at the highest value of [alpha] evaluated, bias decreased with measurement error. The value of [[sigma].sub.e.sup.2] at the inflection point Inflection Point An event that changes the way we think and act. -Andy Grove, Founder of Intel. Notes: For example, the fall of the Berlin Wall was an inflection point in global politics and the commercialization of the Internet was an inflection point in technology. varied from 0.5 for Figure 1F to 3.0 for Figure 1A. [FIGURES 3-4 OMITTED] Figure 3 displays the pattern of bias observed when Figure 1C, E, and H depict the truth. Similar to pattern 1, bias for the standardized model varied in a nonlinear A system in which the output is not a uniform relationship to the input. nonlinear - (Scientific computation) A property of a system whose output is not proportional to its input. manner, increasing for all values of [alpha] but the highest ([alpha] = 2). The adjusted and two-stage models were essentially robust to measurement error; however, both the unadjusted and adjusted did not always produce unbiased estimates of parameters for all underlying DAGs, especially at different levels of [alpha]. A stronger linear relation between PCB and lipids resulted in greater bias in the adjusted model. Bias of estimates produced by the unadjusted model varied slightly with [[sigma].sub.e.sup.2] for Figure 1C and H bias increased slightly with increasing measurement error (from 0 to 0.1 for 8, from 0 to 0.2 for 3). Increasing measurement error in Figure 1E reduced bias as the strength of the noncausal relation between PCBs and serum lipids was altered by the variance in serum lipids. [FIGURE 3 OMITTED] Figure 4 displays bias for the four models under the conditions represented by Figure 1G. Both the standardized and adjusted models produced unbiased estimates robust to measurement error, whereas the unadjusted and two-stage models produced biased estimates that were equally prone to measurement error. Changes in the strength of the linear relation between PCB and lipids did not affect bias for any of the four models in this scenario. [FIGURE 4 OMITTED] Discussion We have described and evaluated four statistical models (unadjusted, standardized, adjusted, and two-stage) commonly used to assess the effects of lipophilic environmental contaminants on human health when relying on blood specimens for quantifying toxicant toxicant /tox·i·cant/ (tok´si-kant) 1. poisonous. 2. poison. tox·i·cant n. 1. A poison or poisonous agent. 2. An intoxicant. adj. concentrations. Our simulations show that each statistical model has minimal bias for at least the causal truth for which it is ideally suited. Although most models performed well under all but one causal scenario, the standardized model produced large biases for most of the evaluated DAGs. The adjusted model produced only a small bias even for the DAG for which standardization is optimal. We evaluated basic causal scenarios; the eight DAGs we considered included only two to four factors. When additional factors impact levels of both PCB and serum lipids as well as health outcome risk, the evaluation will be more complex, and the trade-off between statistical efficiency and robustness will be more important. Although the adjusted model produced consistently unbiased estimates, there are circumstances where adjustment (or stratification) is inappropriate and should be avoided. For example, adjustment for a collider col`lid´er n. 1. (Physics) a A discussion of causality, particularly when regarding estimation of direct and indirect causes, necessitates consideration of counterfactuals. Consistent estimation of a direct or indirect effect require at minimum the absence of unmeasured confounding as well as the assumptions of consistency and the existence of a direct effect (Cole and Hernan 2002; Robins 2003). Estimation of causal effects and their relations to DAGs is intimately tied to the notion of counterfactuals. In reality, when a factor impacts an outcome through both direct and indirect pathways, we cannot observe A type of fire control which indicates that the observer or spotter will be unable to adjust fire, but believes a target exists at the given location and is of sufficient importance to justify firing upon it without adjustment or observation. the direct effect in absence of the indirect effect, and vice versa VICE VERSA. On the contrary; on opposite sides. ; their estimation depends on counterfactual coun·ter·fac·tu·al adj. Running contrary to the facts: "Cold war historiography vividly illustrates how the selection of the counterfactual question to be asked generally anticipates the desired answer" comparisons (Robins 2003). A general counterfactual model has been proposed that permits the estimation of total and direct effects of fixed and time-varying exposures in longitudinal studies longitudinal studies, n.pl the epidemiologic studies that record data from a respresentative sample at repeated intervals over an extended span of time rather than at a single or limited number over a short period. whether randomized ran·dom·ize tr.v. ran·dom·ized, ran·dom·iz·ing, ran·dom·iz·es To make random in arrangement, especially in order to control the variables in an experiment. or observational in design (Robins et al. 2000). However, a more detailed discussion is beyond the scope of this paper. Findings from our simulations demonstrate that statistical models failing to uphold underlying assumptions about causality lead to biased results with implications for the interpretation of effects of exposures on human health end points. We speculate that equivocal findings may arise, at least in part, from the varying laboratory and analytic approaches for specifying serum lipids when using nonfasting blood specimens to estimate risk. Investigators must remember to consider biology, biologic medium, and laboratory methodology when specifying a statistical model and its underlying assumptions appropriate for study. CORRECTION Equation 4 was incorrect in the manuscript originally published online but has been corrected here. The authors declare they have no competing financial interests. Received 6 October 2004; accepted 17 March 2005. REFERENCES Atalay G, Dirix L, Biganzoli L, Beex L, Nooij M, Cameron D, et al. 2004. The effect of exemestane on serum lipid profile in postmenopausal post·men·o·paus·al adj. Of or occurring in the time following menopause. postmenopausal Change of life Gynecology adjective Referring to the time in ♀ when menstrual periods stop for ≥ 1 yr women with metastatic Metastatic The term used to describe a secondary cancer, or one that has spread from one area of the body to another. Mentioned in: Coagulation Disorders metastatic pertaining to or of the nature of a metastasis. breast cancer: a companion study to EORTC EORTC European Organization for Research and Treatment of Cancer Trial 10951, 'Randomized phase II study in first line hormonal treatment for metastatic breast cancer with exemestane or tamoxifen tamoxifen (təmŏk`sĭfĕn'), synthetic hormone used in the treatment of breast cancer. Introduced in 1978, tamoxifen is used to prevent recurrences of cancer in women who have already undergone surgery to remove their tumors. in postmenopausal patients. Ann Oncol 15:211-217. Brown JF Jr, Lawton RW. 1984. Polychlorinated biphenyl polychlorinated biphenyl or PCB, any of a group of organic compounds originally widely used in industrial processes but later found to be dangerous environmental pollutants. (PCB) partitioning between adipose tissue and serum. Bull Environ Contain Toxicol 33:277-280. Brown JF Jr, Lawton RW, Morgan CB. 1994. PCB metabolism, persistence, and health effects after occupational exposure: implications for risk assessment. Chemosphere chemosphere: see atmosphere. 29:2287-2294. Calle EE, Frumkin H, Henley SJ, Savitz DA, Thun MJ. 2002. Organochlorines and breast cancer risk. CA Cancer J Clin 52:301-309. Calvert GM, Willie KK, Sweeney MH, Fingerhut MA, Halperin WE. 1996. Evaluation of serum lipid concentrations among U.S. workers exposed to 2,3,7,8-tetrachlorodibenzo-p-dioxin. Arch Environ Health 51:100-107. Cole SR, Hernan MA. 2002. Fallibility fal·li·ble adj. 1. Capable of making an error: Humans are only fallible. 2. Tending or likely to be erroneous: fallible hypotheses. in estimating direct effects. Int J Epidemiol 31:163-165. Eyster JT, Humphrey HE, Kimbrough RD. 1983. Partitioning of polybrominated biphenyls polybrominated biphenyls see biphenyl. (PBBs)in serum, adipose tissue, breast milk, placenta placenta (pləsĕn`tə) or afterbirth, organ that develops in the uterus during pregnancy. It is a unique characteristic of the higher (or placental) mammals. In humans it is a thick mass, about 7 in. , cord blood cord blood n. Blood present in the umbilical vessels at the time of delivery. , biliary biliary /bil·i·a·ry/ (bil´e-ar?e) pertaining to the bile, to the bile ducts, or to the gallbladder. bil·i·ar·y adj. 1. Of or relating to bile, the bile ducts, or the gallbladder. fluid, and feces feces or excrement or stools Solid bodily waste discharged from the colon through the anus during defecation. Normal feces are 75% water. The rest is about 30% dead bacteria, 30% indigestible food matter, 10–20% cholesterol and other fats, . Arch Environ Health 38:47-53. Greenland S. 1996. Basic methods for sensitivity analysis of biases. Int J Epidemiol 25:1107-1116. Greenland S. 2003. Quantifying biases in causal models: classical confounding vs collider-stratification bias. Epidemiology 14:300-306. Greenland S, Brumback B. 2002. An overview of relations among causal modelling methods. Int J Epidemiol 31:1030-1037. Greenland S, Morgenstern H. 2001. Confounding in health research. Annu Rev Public Health 22:189-212. Guo YL, Emmett EA, Pellizzari ED, Rohde CA. 1987. Influence of serum cholesterol and albumin on partitioning of PCB congeners between human serum and adipose tissue. Toxicol Appl Pharmacol 87:48-56. Helzlsouer KJ, Alberg AJ, Huang HY, Hoffman SC, Strickland PT, Brock brock n. Chiefly British A badger. [Middle English brok, from Old English broc, of Celtic origin.] JW, et al. 1999. Serum concentrations of organochlorine or·gan·o·chlo·rine n. Any of various hydrocarbon pesticides, such as DDT, that contain chlorine. compounds and the subsequent development of breast cancer. Cancer Epidemiol Biomarkers Prev 8:525-532. Hennig B, Reiterer G, Toborek M, Matveev SV, Daugherty A, Smart E, et al. 2005. Dietary fat interacts with PCBs to induce changes in lipid metabolism Lipid metabolism The assimilation of dietary lipids and the synthesis and degradation of lipids; this article is restricted to mammals. The principal dietary fat is triglyceride. in mice deficient in low-density lipoprotein low-density lipoprotein n. Abbr. LDL A lipoprotein that contains relatively high amounts of cholesterol and is associated with an increased risk of atherosclerosis and coronary artery disease. receptor. Environ Health Perspect 113:83-87. Hernan MA. 2001. Expert knowledge, confounding and causal methods [in Spanish). Gac Sanit 15(suppl 4):44-48. Hernan MA, Hernandez-Diaz S, Werler MM, Mitchell AA. 2002. Causal knowledge as a prerequisite for confounding evaluation: an application to birth defects birth defects, abnormalities in physical or mental structure or function that are present at birth. They range from minor to seriously deforming or life-threatening. A major defect of some type occurs in approximately 3% of all births. epidemiology. Am J Epidemiol 155:176-184. Hunter DJ, Hankinson SE, Laden F, Colditz GA, Manson JE, Willett WC, et al. 1997. Plasma organochlorine levels and the risk of breast cancer. N Engl J Med 337:1253-1258. Laden F, Collman G, Iwamoto K, Alberg AJ, Berkowitz GS, Freudenheim JL, et al. 2001a. 1,1-Dichloro-2,2-bis(p-chlorophenyl)ethylene ethylene (ĕth`əlēn') or ethene (ĕth`ēn), H2C=CH2, a gaseous unsaturated hydrocarbon. It is the simplest alkene. and polychlorinated biphenyls and breast cancer: combined analysis of five U.S. studies. J Natl Cancer Inst 93:768-776. Laden F, Hankinson SE, Wolff MS, Colditz GA, Willett WC, Speizer FE, et al. 2001b. Plasma organochlorine levels and the risk of breast cancer: an extended follow-up in the Nurses' Health Study Nurses' Health Study Cardiology A large cohort study that evaluated the effect of exogenous HRT on the risk of cardiovascular disease. See Estrogen replacement therapy, Osteoporosis. . Int J Cancer 91:568-574. Langer P, Kocan A, Tajtakova M, Petrik J, Chovancova J, Drobna B, et al. 2003. Possible effects of polychlorinated biphenyls and organochlorinated pesticides on the thyroid after long-term exposure to heavy environmental pollution. J Occup Environ Med 45:526-532. Morgan DP, Roan CC. 1970. Chlorinated chlorinated /chlo·ri·nat·ed/ (klor´i-nat?ed) treated or charged with chlorine. chlorinated charged with chlorine. chlorinated acids some, e.g. hydrocarbon pesticide residue Pesticide residue refers to the pesticides that may remain on or in food after they are applied to food crops.[1] Regulation of pesticide residue in the US in human tissues. Arch Environ Health 20:452-457. Moysich KB, Ambrosone CB, Vena JE, Shields PG, Mendola P, Kostyniak P, et al. 1998. Environmental organochlorine exposure and postmenopausal breast cancer risk. Cancer Epidemiol Biomarkers Prev 7:181-188. Mussalo-Rauhamaa H. 1991. Partitioning and levels of neutral organochlorine compounds in human serum, blood cells blood cells, n.pl the formed elements of the blood, including red cells (erythrocytes), white cells (leukocytes), and platelets (thrombocytes). blood cells See erythrocyte and leukocyte. Platelets are classed separately. , and adipose and liver tissue. Sci Total Environ 103:159-175. Needham LL, Wang RY. 2002. Analytic considerations for measuring environmental chemicals in breast milk. Environ Health Perspect 110:A317-A324. Phillips DL, Smith AB, Burse burse n. 1. A purse. 2. Ecclesiastical A flat cloth case for carrying the corporal that is used in celebrating the Eucharist. [Late Latin bursa; see bursa.] VW, Steele GK, Needham LL, Hannon WH. 1989. Half-life of polychlorinated biphenyls in occupationally exposed workers. Arch Environ Health 44:351-354. Robins JM 2003. Semantics of causal diagrams and the identification of direct and indirect effects. In: Highly Structured Stochastic By guesswork; by chance; using or containing random values. stochastic - probabilistic Systems (Green P, Hjort NL, Richardson S, eds). New York New York, state, United States New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of : Oxford University Press, 70-81. Robins JM, Hernan MA, Brumback B. 2000. Marginal structural models and causal inference in epidemiology. Epidemiology 11:550-560. Whitcomb BW, Schisterman EF, Buck GM, Weiner JM, Greizerstein H, Kostyniak P. 2005. Relative concentrations of organochlorines in adipose tissue and serum among reproductive age women. Environ Toxicol Pharmacol 19:203-213. Wolff MS. 1985. Occupational exposure to polychlorinated biphenyls (PCBs). Environ Health Perspect 60:133-138. Wolff MS, Toniolo PG. 1995. Environmental organochlorine exposure as a potential etiologic factor in breast cancer. Environ Health Perspect 103(suppl 7):141-145. Wolff MS, Toniolo PG, Lee EW, Rivera M, Dubin N. 1993. Blood levels of organochlorine residues and risk of breast cancer. J Natl Cancer Inst 85:648-652. Wolff MS, Zeleniuch-Jacquotte A, Dubin N, Toniolo P. 2000. Risk of breast cancer and organochlorine exposure. Cancer Epidemiol Biomarkers Prev 9:271-277. Enrique F. Schisterman, (1) Brian W. Whitcomb, (1) Germaine M. Buck Louis, (1) and Thomas A. Louis (2) (1) Division of Epidemiology, Statistics and Prevention Research, National Institute of Child Health and Human Development, National Institutes of Health, Department of Health and Human Services Noun 1. Department of Health and Human Services - the United States federal department that administers all federal programs dealing with health and welfare; created in 1979 Health and Human Services, HHS , Rockville, Maryland Rockville is the county seat of Montgomery County, Maryland, United States. According to the 2006 census update, the city had a total population of 59,114, making it the second largest city in Maryland. , USA; (2) Department of Biostatistics biostatistics /bio·sta·tis·tics/ (-stah-tis´tiks) biometry. bi·o·sta·tis·tics n. The science of statistics applied to the analysis of biological or medical data. , Johns Hopkins Bloomberg School of Public Health The Johns Hopkins Bloomberg School of Public Health is part of Johns Hopkins University in Baltimore, Maryland, U.S. It was the first institution of its kind in the world. Founded in 1916 by William H. Welch and John D. , Johns Hopkins University Johns Hopkins University, mainly at Baltimore, Md. Johns Hopkins in 1867 had a group of his associates incorporated as the trustees of a university and a hospital, endowing each with $3.5 million. Daniel C. , Baltimore, Maryland "Baltimore" redirects here. For the surrounding county, see Baltimore County, Maryland. For other uses, see Baltimore (disambiguation). Baltimore is an independent city located in the state of Maryland in the United States. , USA Address correspondence to E.F. Schisterman, Epidemiology Branch, Division of Epidemiology, Statistics and Prevention Research, National Institute of Child Health and Human Development, 6100 Executive Blvd., Room 7B03, Rockville, MD 20852 USA. Telephone: (301) 435-6893. Fax: (301) 402-2084. E-mail: schistee@mail.nih.gov
Table 1. Percent bias of estimates of effect of
PCBs on outcome for evaluated statistical models.
Percent bias (MSE) (a)
DAG (Figure 1) Unadjusted Standardized
A 1.2 (1.26) -51.3 (10.3)
B -0.8 (1.34) -75.9 (21.1)
C -15.4 (2.78) -351.3 (161.1)
D 0.4 (1.14) -79.8 (23.3)
E 24.0 (3.37) -128.8 (60.3)
F -0.4 (1.29) -85.0 (26.4)
G -86.3 (27.0) -1.0 (1.51)
H -11.2 (1.75) -128.3 (59.7)
Percent bias (MSE) (a)
DAG (Figure 1) Adjusted Two-stage
A 1.8 (1.28) 1.8 (1.28)
B -0.7 (1.35) -0.7 (1.33)
C -99.4 (1.59) 1.1 (2.78)
D 0.8 (1.17) 0.5 (1.14)
E 0.1 (1.39) 27.2 (3.37)
F -0.1 (1.41) -0.3 (1.29)
G -1.0 (1.51) -85.9 (27.0)
H -25.4 (3.65) -8.7 (1.75)
Serum lipid measurement error distributed normally with mean 0,
variance 1; [alpha] (strength of linear relation between log
PCB and log serum lipids) = 0.3; 500 repetitions; n = 1,000.
(a) Mean square error multiplied by 100 for illustration
(shown in parentheses).
|
|
||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion